I was scouring the indie web earlier and found a pretty useful list of bots to add to your robots.txt. But since I’m not convinced that this is enough to keep them away, I also figured out a simple way to potentially block them from accessing your websites entirely.

  • drkt@scribe.disroot.org
    7 days ago

    Eventually I’m gonna make a proper article about it, but what I’m doing right now boils down to this:

    • Intercept 404
    • Redirect to error-hole.php
    • error-hole.php returns 200 and spits out a bunch of bot-targets
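A minimal sketch of the three steps above, written in Python rather than PHP and not taken from the actual error-hole.php. `KNOWN_PATHS`, `trap_page`, and `respond` are hypothetical names, and the "bot-targets" are assumed here to be links to random paths that do not exist, so each one lands back in the trap:

```python
import random
import string

# Hypothetical set of real pages; any other path would normally be a 404.
KNOWN_PATHS = {"/", "/about"}


def random_slug(rng, length=10):
    """Return a random lowercase path segment."""
    return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))


def trap_page(rng, n_links=20):
    """Build an HTML page full of links to paths that do not exist."""
    links = "\n".join(
        f'<a href="/{random_slug(rng)}">{random_slug(rng)}</a>'
        for _ in range(n_links)
    )
    return f"<html><body>\n{links}\n</body></html>"


def respond(path, rng=None):
    """Return (status, body) for a request path."""
    if rng is None:
        rng = random.Random()
    if path in KNOWN_PATHS:
        return 200, f"<html><body>real page for {path}</body></html>"
    # Would-be 404: answer 200 with a page of bot targets instead.
    return 200, trap_page(rng)
```

Since every generated link points at another unknown path, a crawler that follows them keeps receiving fresh trap pages with a 200 status instead of an error.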

    The next iteration of this will include a lot of uncompressed filler data, so hopefully the bots have to download half a gigabyte of data every time they do this. I’m not paying for bandwidth, so it doesn’t matter to me.
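One way the filler idea could look, as a rough Python sketch rather than the planned implementation. `filler_chunks` is a hypothetical helper, and the use of random bytes is an assumption: random data barely compresses, so it stays large even if something along the way applies compression.

```python
import os


def filler_chunks(total_bytes, chunk_size=64 * 1024):
    """Yield random byte chunks until total_bytes have been produced."""
    remaining = total_bytes
    while remaining > 0:
        size = min(chunk_size, remaining)
        # Assumption: random bytes, chosen because they compress poorly.
        yield os.urandom(size)
        remaining -= size
```

Generating the data in chunks means the server never holds the full half gigabyte in memory; each chunk can be written to the response and discarded.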

    See for yourself https://drkt.eu/fdhasklfh

    I can see that it works by just looking at my access logs.