Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't always grab images #160

Closed
danmed opened this issue Sep 22, 2019 · 5 comments
Closed

Doesn't always grab images #160

danmed opened this issue Sep 22, 2019 · 5 comments
Labels

Comments

@danmed
Copy link

danmed commented Sep 22, 2019

I'm just testing this and the archive feature doesn't appear to always grab images...

For example, when archiving the below link, none of the images are captured in the archive, but a thumbnail is generated.

https://imgur.com/a/W5wZxHT

I'm running the docker image btw..

@RadhiFadlillah
Copy link
Collaborator

@danmed yep, unfortunately the archival still doesn't work properly with page that uses a lot of Javascript like imgur and Reddit (the old Reddit is fine though), and unfortunately I don't have any idea where to start to solve this.

@danmed
Copy link
Author

danmed commented Sep 22, 2019 via email

@RadhiFadlillah RadhiFadlillah added the type:enhancement New feature or request label Sep 22, 2019
@deanishe
Copy link
Contributor

and unfortunately I don't have any idea where to start to solve this.

I was thinking about something like a set of URL rewriting rules, which might help Shiori handle certain sites (like Reddit). For example, a rule might rewrite www.reddit.com/... to old.reddit.com/... to get a more easily parseable version of a page.

For other sites, the mobile or print version (if they exist) might be much easier to parse.

For example, Shiori does a good job with this article from the Spiegel, but it also grabs a bunch of the dumb user comments that aren't in the print version.

@linwaytin
Copy link

I also have the same problem.
I think there are two possible ways to deal with this problem.

First, let users enter the link manually.
Second, as @deanishe mentioned, rules can help Shiori to determine where the image is.

At least, I like to be able to enter the link manually.
For now, if the image is not correct, there is nothing I can do to fix it.

Anyway, thank you for this great project.

@fmartingr
Copy link
Member

This will get worked on with #353. After the switch, we can start fixing this kind of things directly on obelisk.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

5 participants