SearNGX should be a federated search engine

fmstrat@lemmy.nowsci.com · 1 month ago

SearNGX should be a federated search engine

mesamune@lemmy.world · 1 month ago

I recall there is a federated search engine… somewhere. Anyone know what that was called.

toothbrush@lemmy.blahaj.zone · 1 month ago

Are you thinking of YaCy?

kbal@fedia.io · edit-2 1 month ago

Ah, I wondered if something like that had been tried before. Looks like it is maybe still running: https://yacy.net/

The demo isn’t giving me useful search results.

Buelldozer@lemmy.today · edit-2 1 month ago

There’s only been about 700 yacy peers online in the last 30 days which is pretty low for a “crowd sourced” search engine, especially when many of those are, I think, temporary peers that come and go. It looks like it has only maybe 200 “master” servers which wouldn’t be nearly enough to keep up with the Internet these days.

The good news is that if there’s websites / urls that you care about you can point your own yacy instance at them and schedule the crawls to keep up with content changes.

I remember reading about yacy some years ago and now that I’ve bumped it into again it’s sparked my interest. I may stand up a docker instance and play with it for awhile. If nothing else it could make a very useful “arrrrr” search engine.

Wxnzxn@lemmy.ml · 1 month ago

I ran an instance for a while out of curiosity a few years back - building the database seemed to work fine and appeared like a good idea, had a lot of fun to see the connections with other servers and my crawler filling holes of unknown spaces. But I think the search algorithm itself was (most likely is) not sophisticated enough, it just did not give relevant results often enough, and it was extremely vulnerable to very simple SEO tactics to push trash to the top.

SearNGX should be a federated search engine

SearNGX should be a federated search engine

GitHub - searxng/searxng: SearXNG is a free internet metasearch engine which aggregates results from various search services and databases. Users are neither tracked nor profiled.