Is there an open source package that the Internet Archive runs? What is it? I assume sites like archive.is run the same. I’d like to know if I can also run it for self-hosted archiving.

    • Possibly linux · 10 months ago

      ArchiveBox is a piece of software, and the Internet Archive is an organization that is focused on preserving the content on the internet.

      The Internet Archive has PBs worth of data. I doubt any home user could manage that.

    • Avid Amoeba@lemmy.ca (OP) · 10 months ago (edited)

      Oh yes, this looks like a winner. Thanks!

      It seems like it’s written in Python too, which means I can maintain it if need be.

      Oh boy I wish I had set this up many years ago. I wouldn’t have to resort to scouring [email protected] for the top quality memes of the past when I need them…

      On a far side of the moon note, I wonder if ActivityPub could be used to federate multiple archiveboxes to create a more resilient Internet Archive alternative. 🤔 Then integrate that with Lemmy to autoarchive links from posts. Aaand lemmy.world ran out of disk space. 🤣

      • density@kbin.social · 10 months ago

        A network between networks to make them more resilient? I think you’ve just invented the ARPANET.