[wallhaven.cc] add imageByFragment that picks up default saved filename by spaceyuck · Pull Request #2538 · stashapp/CommunityScrapers

spaceyuck · 2025-10-16T11:36:04Z

Generated by an automatic template. Can be removed if not applicable.

Scraper type(s)

imageByFragment
imageByURL

Examples to test

SFW because NSFW needs account and API token

https://wallhaven.cc/w/9dqojx
https://wallhaven.cc/w/yxd8jk

Short description

When right-click saving a wallpaper from wallhaven, the default filename has a predictable structure ("wallhaven-."). This change adds image fragment scraping to allow automatically picking up IDs that are a clear and separable postfix to the filename.

feederbox826

This might be the weirdest regex I've seen yet, did you mean to do a-z0-9? and also escape the . for extension?

Also some of their older posts follow a seperate format, before they migrated. A strict regex might be possible + better

feederbox826 · 2025-10-16T23:51:11Z

Will also put in draft/ potentially close as they have IQDB search for actual ByFragment. Will close if/when IQDB search implemented

https://wallhaven.cc/forums/thread/1169

spaceyuck · 2025-10-17T02:02:34Z

This might be the weirdest regex I've seen yet, did you mean to do a-z0-9? and also escape the . for extension?

\d and 0-9 are equivalent, so it's the same as a-z0-9. Point taken about the ., one of those I was stupid but it worked anyway kind of things, fixed now.

Also some of their older posts follow a seperate format, before they migrated. A strict regex might be possible + better

The oldest thing they have still uses an ID in the scheme of [a-z0-9]{6}, and I've checked some of my oldest from around 2014, they use the same format too. I can't find any info or example off another ID format still being in use, they might have migrated everything over to the current scheme.

I made it very restrictive explicitly to avoid false positives, it really should only match file names that have 6 characters / digits at the end, before the extension, with some kind of separator before.

IQDB search

Huh, I've never actually noticed that feature. I've looked into it a bit, and it's not mentioned in the API docs. Playing around with it a bit, it's just a POST to their search endpoint , maybe it just works for API search too. Otherwise this will be a whole new thing, I already see XSRF tokens and Cloudflare cookies, plus the login requirement for NSFW would probably need credentials or a session cookie in the scraper.

I'll look into it more later, but this really might be a feature of it's own.

feederbox826 · 2025-10-17T04:26:11Z

would have to be in python but imo would be leagues above trying to match filename

spaceyuck · 2025-10-18T00:28:32Z

Just an update after looking into it a bit:

definitely not supported by API (405 on POST to /search)
after playing with the site in Firefox a bit, Cloudflare might be present but optional, it does still seem to work without the magic Cloudflare cookie and Cloudflares magic Javascript blocked - but I may have missed something
right now running into status 419 (session expired) errors even with session cookies set and sent
in the worst case, this might need CDP - is CDP even supported for Python? Might also give cloudscraoer a try, IAFD scrapers seems to use it

spaceyuck · 2025-10-18T08:11:32Z

Current broken state sequestered into its own subbranch wallhaven-imageByFragtment-iqdb, can't get it to work right now.

feederbox826 · 2025-10-19T06:51:50Z

Just an update after looking into it a bit:

definitely not supported by API (405 on POST to /search)

after playing with the site in Firefox a bit, Cloudflare might be present but optional, it does still seem to work without the magic Cloudflare cookie and Cloudflares magic Javascript blocked - but I may have missed something

right now running into status 419 (session expired) errors even with session cookies set and sent

in the worst case, this might need CDP - is CDP even supported for Python? Might also give cloudscraoer a try, IAFD scrapers seems to use it

405 is method not allowed, maybe PUT instead of POST? but hm

cdp is supported on python but it's a lot weirder

feederbox826 · 2025-11-28T05:39:29Z

Thanks for your hard work, I don't think it's quite worth struggling through cloudflare just for IQDB, there is a standalone IQDB.org which is what it's probably based on. I'll merge it since it's better than nothing and we'll just have to wait for IQDB api to eventually be opened up

feederbox826 reviewed Oct 16, 2025

View reviewed changes

feederbox826 marked this pull request as draft October 16, 2025 23:51

spaceyuck added 2 commits October 26, 2025 08:48

Update Wallhaven.yml

f98a03e

fix wallhaven.cc imageByFragment regexp

ad9c669

spaceyuck force-pushed the wallhaven-imageByFragtment branch from cc05b01 to ad9c669 Compare October 26, 2025 07:48

feederbox826 marked this pull request as ready for review November 28, 2025 05:39

feederbox826 merged commit 536ef46 into stashapp:master Nov 28, 2025
1 check passed

spaceyuck deleted the wallhaven-imageByFragtment branch December 9, 2025 00:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[wallhaven.cc] add imageByFragment that picks up default saved filename#2538

[wallhaven.cc] add imageByFragment that picks up default saved filename#2538
feederbox826 merged 2 commits intostashapp:masterfrom
spaceyuck:wallhaven-imageByFragtment

spaceyuck commented Oct 16, 2025

Uh oh!

feederbox826 left a comment •

edited

Loading

Uh oh!

feederbox826 commented Oct 16, 2025

Uh oh!

spaceyuck commented Oct 17, 2025

Uh oh!

feederbox826 commented Oct 17, 2025

Uh oh!

spaceyuck commented Oct 18, 2025

Uh oh!

spaceyuck commented Oct 18, 2025

Uh oh!

feederbox826 commented Oct 19, 2025

Uh oh!

feederbox826 commented Nov 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

spaceyuck commented Oct 16, 2025

Scraper type(s)

Examples to test

Short description

Uh oh!

feederbox826 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

feederbox826 commented Oct 16, 2025

Uh oh!

spaceyuck commented Oct 17, 2025

Uh oh!

feederbox826 commented Oct 17, 2025

Uh oh!

spaceyuck commented Oct 18, 2025

Uh oh!

spaceyuck commented Oct 18, 2025

Uh oh!

feederbox826 commented Oct 19, 2025

Uh oh!

feederbox826 commented Nov 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feederbox826 left a comment •

edited

Loading