We have paused all crawling as of Feb 6th, 2025 until we implement robots.txt support. Stats will not update during this period.

  • Semi-Hemi-Lemmygod@lemmy.world
    link
    fedilink
    English
    arrow-up
    25
    arrow-down
    1
    ·
    13 hours ago

    Robots.txt is a lot like email in that it was built for a far simpler time.

    It would be better if the server could detect bots and send them down a rabbit hole rather than trusting randos to abide by the rules.

    • SwizzleStick
      link
      fedilink
      English
      arrow-up
      14
      ·
      12 hours ago

      It would be better if the server could detect bots and send them down a rabbit hole

      Already possible: Nepenthes.

    • poVoq@slrpnk.net
      link
      fedilink
      English
      arrow-up
      13
      ·
      13 hours ago

      Because of AI bots ignoring robots.txt (especially when you don’t explicitly mention their user-agent and rather use a * wildcard) more and more people are implementing exactly that and I wouldn’t be surprised if that is what triggered the need to implement robots.txt support for FediDB.