Skip to content

Add entomologist.net#121

Open
mc776 wants to merge 2 commits intolaylavish:mainfrom
mc776:entomologist.net
Open

Add entomologist.net#121
mc776 wants to merge 2 commits intolaylavish:mainfrom
mc776:entomologist.net

Conversation

@mc776
Copy link
Copy Markdown

@mc776 mc776 commented Jan 25, 2025

Includes gems like an article titled "How Quickly Can A Silverfish Bury Itself In Your Flesh?" with things like "Silverfish damage is caused by their love for carbohydrates, which does not include human blood."

The rest of the site is similarly obvious slop but I don't know if it's that obvious to people who don't read about bugs much.

Includes gems like an article titled "How Quickly Can A Silverfish Bury Itself In Your Flesh?" with things like "Silverfish damage is caused by their love for carbohydrates, which does not include human blood."

The rest of the site is similarly obvious slop but I don't know if it's that obvious to people who don't read about bugs much.
@laylavish
Copy link
Copy Markdown
Owner

Hey! I'm looking at it and this website is clearly a content farm (along with it's blog subdomain). I'm not a bug expert by any means (although they are pretty fascinating), but reading some of the articles definitely seemed llm generated. Doubly so when you realize that you can't even click on the author's name, and that articles are being pumped out multiple times per day, all by the same person. But the problem arises that this particular repo is more geared towards AI generated images, and even though this website is completely worthless and a SEO farm machine, from what I'm seeing the images on there are real images of bugs. I haven't decided on what to do when these particular scenarios pop up in this repo, but for the time being I'm in the process of making a content farm/llm repo that's sole purpose is getting rid of these types of websites w/out the worry of images getting in the way.

@sezanzeb
Copy link
Copy Markdown

sezanzeb commented Apr 27, 2025

The internet is turning into a gigantic dump for AI crap. Here are some more examples:

step 1: Flood everything with low effort and low quality shit
step 2: Be hurt because people don't acknowledge you as a writer/artist/musician

AI bros. Go Figure.

@sezanzeb
Copy link
Copy Markdown

sezanzeb commented Apr 27, 2025

At this rate, I'd rather just use a search engine that only crawls

  • wikipedia
  • websites that wikipedia links to
  • github repos with more than 10 stars
  • websites that github repos with more than 10 stars link to

You can get all the important sources of information by doing so already: askubuntu.com, documentation, company websites, government websites, news sites, etc. Most websites of actual relevance are probably mentioned somewhere on Wikipedia.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants