Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SpankBang #2096

Open
incognito-bb opened this issue Nov 7, 2024 · 2 comments
Open

SpankBang #2096

incognito-bb opened this issue Nov 7, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@incognito-bb
Copy link

Error 403: Forbidden

At some point SpankBang updated their bot detection measures to be stricter. I initially noticed this when downloading from the site with yt-dlp.
Instead of downloading like this;
yt-dlp https://spankbang.com/8hxfn/video/example+title+of+video (which now fails).
I now am able to successfully download videos like this;
yt-dlp --impersonate edge https://spankbang.com/8hxfn/video/example+title+of+video --legacy-server-connect --no-check-certificates -S "res:1080" --add-metadata -N 4
I assume similar measures would need to be taken to revive this scraper.

Are you scraping a scene, gallery, movie, or performer?

Scene

Scrape with URL? If so, what URLs have you tried?

I have tried scraping via [https://spankbang.com/9cfj0/video/]

Scrape with fragment? If so, what is the filename / title / studio code?

Yes, "Video Title [9cfj0].mp4"

Scrape with name? Give an example you searched for that caused the scraper to fail

N/A

Please check the logs to see if there are any relevant error messages you can include

2024-11-07 12:19:17
Error   
scrapeSingleScene: input: scrapeSingleScene scraper SpankBang: http error 403:Forbidden
2024-11-07 12:19:17
Debug   
[scraper] adding header <User-Agent:Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:79.0) Gecko/20100101 Firefox/79.0)>
2024-11-07 12:19:17
Debug   
After: 9cfj0
2024-11-07 12:19:17
Debug   
Before: 9cfj0
2024-11-07 12:19:17
Debug   
Replace: '.*\.[^\.]+$' with ''
2024-11-07 12:19:17
Debug   
After: 9cfj0
2024-11-07 12:19:17
Debug   
Before: Video Title [9cfj0].mp4
2024-11-07 12:19:17
Debug   
Replace: '.*\[([0-9a-z]{4,})\]\.[^\.]+$' with '$1'
@incognito-bb incognito-bb added the bug Something isn't working label Nov 7, 2024
@Muldec
Copy link
Contributor

Muldec commented Nov 18, 2024

I did manage to make it work by using CDP.

However, it only worked on the first scrape. All subsequent scrapes returned input: scrapeSceneURL Internal system error. Error <runtime error: index out of range [0] with length 0> until I restarted my CDP container.

I updated my CDP image and now I have input: scrapeSceneURL parse error: expected string near offset 1563 of 'partitionKey'

Interrestingly, only with this scraper

@feederbox826
Copy link
Collaborator

I updated my CDP image and now I have input: scrapeSceneURL parse error: expected string near offset 1563 of 'partitionKey'

In my experience that's usually an error from trying to scrape the CF error page, but it errors on postProcessing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants