New python meanbitches scraper to replace old one #2568

thismanyboyfriends2 · 2025-11-16T21:59:56Z

Scraper type(s)

Examples to test

Short description

A brand new MeanBitches scraper in Python to replace the old simple scraper, which only supported the old (now defunct) site.

Now supports:

New website
fragment and search scraping
multiple fallbacks for images when the preferred one doesn't exist
fetches result pages in parallel when searching
caching for search results
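
To make the last few items concrete, here is a minimal sketch of a search-result cache and an image fallback chain. It is not the code from this PR: `SearchCache`, `first_available_image`, the `fetch` and `exists` callables and the 300 second TTL are all hypothetical names and values chosen for illustration. The cache is in-memory only, so if the scraper runs as a fresh process per call it would need to be written to disk instead.

```python
import time
from typing import Callable, Dict, Iterable, List, Optional, Tuple


class SearchCache:
    """Tiny in-memory cache with a time-to-live, keyed by normalised query.

    Hypothetical helper for illustration; not taken from the PR.
    """

    def __init__(self, ttl_seconds: float = 300.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl = ttl_seconds
        self._clock = clock
        self._entries: Dict[str, Tuple[float, List[dict]]] = {}

    def get_or_fetch(self, query: str,
                     fetch: Callable[[str], List[dict]]) -> List[dict]:
        # Normalise so "Foo" and " foo " share one cache entry.
        key = query.strip().lower()
        now = self._clock()
        hit = self._entries.get(key)
        if hit is not None and now - hit[0] < self._ttl:
            return hit[1]
        # Miss or expired entry: run the real search and remember the result.
        results = fetch(query)
        self._entries[key] = (now, results)
        return results


def first_available_image(candidates: Iterable[Optional[str]],
                          exists: Callable[[str], bool]) -> Optional[str]:
    """Return the first candidate URL for which `exists` is true, else None."""
    for url in candidates:
        if url and exists(url):
            return url
    return None
```

The `exists` check is left to the caller on purpose; it could be a HEAD request or a lookup in already-parsed HTML, depending on what the site allows.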

feederbox826 · 2025-11-17T05:06:50Z

This smells like LLM output. The new site looks like it can be parsed just fine with XPath, but the LLM was too lazy to implement that properly.

Manifest should not be included under any circumstance.

There's no reason to use urllib.request when requests is already included in the base requirements.

Converting to draft. The LLM is overcomplicating it; there's no need for async, thread-safe Python.

thismanyboyfriends2 · 2025-11-17T10:28:22Z

So I already made an XPath scraper before I made the Python one. The main issue is the cover image: it is not available on any page that also includes a trailer. The only way to get that image is therefore to go back to the performer page and iterate through its paginated listing to find the thumbnail of that scene. The vast majority of the other data can be fetched with the XPath scraper, but I considered the image too important a piece of information to omit.
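
For reference, the lookup described above can be sketched roughly like this. It is a simplified illustration rather than the submitted code: the `?page=N` pagination, the `<a href=...><img src=...></a>` markup, and the names `find_scene_thumbnail`, `fetch` and `max_pages` are all assumptions, and the real site structure may differ.

```python
from html.parser import HTMLParser
from typing import Callable, Optional


class _ThumbFinder(HTMLParser):
    """Records the first <img> nested inside an <a> whose href is the scene URL."""

    def __init__(self, scene_url: str) -> None:
        super().__init__()
        self.scene_url = scene_url
        self.in_scene_link = False
        self.thumbnail: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "a":
            self.in_scene_link = attr_map.get("href") == self.scene_url
        elif tag == "img" and self.in_scene_link and self.thumbnail is None:
            self.thumbnail = attr_map.get("src")

    def handle_endtag(self, tag):
        if tag == "a":
            self.in_scene_link = False


def find_scene_thumbnail(performer_url: str, scene_url: str,
                         fetch: Callable[[str], str],
                         max_pages: int = 20) -> Optional[str]:
    """Walk the performer's listing pages in order until the scene's thumbnail turns up.

    `fetch` takes a URL and returns the page HTML, or "" when there is no such page.
    """
    for page in range(1, max_pages + 1):
        # Assumed pagination scheme; adjust to whatever the site really uses.
        html = fetch(f"{performer_url}?page={page}")
        if not html:
            break  # ran past the last page
        finder = _ThumbFinder(scene_url)
        finder.feed(html)
        if finder.thumbnail:
            return finder.thumbnail
    return None
```

Passing `fetch` in keeps the loop sequential and easy to test offline. In the scraper itself it could be something like `lambda url: requests.get(url, timeout=10).text`, which also covers the point about using requests instead of urllib.request.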

Yeah, the parallel search with async/threading was done by an LLM - this was an attempt at an optimisation because searching through lots of paginated pages just to find an image was taking a while. Happy to remove it, I just found it worked better.

I'll submit a re-do in a bit with your suggestions, but I do think it needs to be a Python scraper.

feat: new python meanbitches scraper to replace old one

d1e045e

thismanyboyfriends2 changed the title ~~feat: new python meanbitches scraper to replace old one~~ New python meanbitches scraper to replace old one Nov 16, 2025

feederbox826 marked this pull request as draft November 17, 2025 05:06
