- cross-posted to:
- technology
Reddit CEO Steve Huffman is standing by Reddit’s decision to block companies from scraping the site without an AI agreement.
Last week, 404 Media noticed that search engines other than Google were no longer listing recent Reddit posts in their results. This was because Reddit updated its Robots Exclusion Protocol file (robots.txt) to block bots from scraping the site. The file reads: “Reddit believes in an open Internet, but not the misuse of public content.” Since the news broke, OpenAI announced SearchGPT, which can show recent Reddit results.
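For reference, a Robots Exclusion Protocol file that disallows all crawling, which is the effect 404 Media described for unaffiliated bots, looks roughly like this (a minimal illustration, not Reddit's full file):

```
User-agent: *
Disallow: /
```

The `*` matches any crawler that honors the protocol, and `Disallow: /` tells it to stay off the entire site; crawlers with a negotiated agreement would be exempted by name.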
The change came a year after Reddit began its efforts to stop free scraping, which Huffman initially framed as an attempt to stop AI companies from making money off of Reddit content for free. This endeavor also led Reddit to begin charging for API access (the high pricing led to many third-party Reddit apps closing).
In an interview with The Verge today, Huffman stood by the changes that led to Google temporarily being the only search engine able to show recent discussions from Reddit. Reddit and Google signed an AI training deal in February said to be worth $60 million a year. It’s unclear how much Reddit’s OpenAI deal is worth.
Huffman said:
Without these agreements, we don’t have any say or knowledge of how our data is displayed and what it’s used for, which has put us in a position now of blocking folks who haven’t been willing to come to terms with how we’d like our data to be used or not used.
“[It’s been] a real pain in the ass to block these companies,” Huffman told The Verge.
Eh, not really.
I block bot user agents on my Lemmy instance, and the overhead for that is pretty negligible (it’s all handled in my web firewall/load balancer).
Granted, those are bots that correctly identify themselves via user agent and don’t spoof a browser’s.
It’s also cheaper and easier to add another load balancer than to size up or scale out my DB server to handle the bot traffic.
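That kind of filter can be sketched in a few lines of config. This assumes an nginx front end (the commenter doesn’t say which proxy they actually run), and the bot names and the `lemmy_backend` upstream are illustrative:

```nginx
# Flag self-identified crawlers by user agent (case-insensitive regex match).
map $http_user_agent $blocked_bot {
    default      0;
    ~*GPTBot     1;  # OpenAI's crawler
    ~*CCBot      1;  # Common Crawl
    ~*Amazonbot  1;
}

server {
    listen 80;

    location / {
        # Reject flagged bots at the edge, before they reach the app or DB.
        if ($blocked_bot) {
            return 403;
        }
        proxy_pass http://lemmy_backend;  # hypothetical upstream name
    }
}
```

The point of doing this at the load balancer is that rejected requests never generate database queries, which is why the overhead stays negligible; it only works against bots that send an honest `User-Agent` header.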