Does Lemmy really benefit from Rust? Is code execution speed the bottleneck?

Buttons@programming.dev · edit-2 2 years ago

Does Lemmy really benefit from Rust? Is code execution speed the bottleneck?

sznio@lemmy.world · 2 years ago

Pulling this out of my ass, but I think the problem might be in Lemmy using websockets.

I feel like supporting 1500 simultaneous users making a request every 10-20 seconds is easier than keeping 1500 websockets alive.

Irregardless, Lemmy does feel very snappy compared to other websites I’ve had the displeasure of using. Main problem is low robustness in the RPC layer.

binwiederhier@discuss.ntfy.sh · 2 years ago

I maintain and host ntfy.sh, an open source push notification service. I have a constant 9-12k WebSocket and HTTP stream connections going, and I host it on a two core machine with an average load average of less than 1. So I can happily tell you that it’s not WebSockets. Hehe.

My money would be on the federation. Having to blast/copy every single comment to every single connected instance seems like a lot.

sznio@lemmy.world · 2 years ago

My money would be on the federation. Having to blast/copy every single comment to every single connected instance seems like a lot.

As far as I know, every server connects to every other server. Allowing for proxying messages through servers would significantly help.

binwiederhier@discuss.ntfy.sh · 2 years ago

I agree.

Random ideas:

The Kademlia protocol (a DHT) has a thing that associates ownership of data to the 20 closest nodes in a P2P network. If an approach like this were used, the load would be spread across those 20 nodes. I implemented that like 15 years ago or so. It was a ton of fun.

Another, simpler approach is what you suggested, simple caching of and relaying through other nodes, though that does not answer the topology of the network. How would an instance decide where to get it’s data from (a star, a tree, at random, …)? How would it be authenticated (easy to solve)? Lots of fun problems to solve. Not fun problems though if you have a pile of other problems too though…

sznio@lemmy.world · 2 years ago

How would an instance decide where to get it’s data from (a star, a tree, at random, …)?

I thought of it like this:

Each instance can optionally work as a relay for other instances - this relation is called “friendship”.
Each instance defines a friend list on their own. There’s nothing enforcing that the relation be bi-directional.
Whenever an instance is a friend of an another instance, it publishes that information for everyone to see.
When an instance receives information from a friend, it sends it to it’s own friends.
When an instance sends information, it:
- Creates a “send queue” that contains all the instances it wants to keep informed of it’s own activity.
- Shuffles the order of the queue.
- Iterates over instances that queue
- Checks if that instance is it’s friend.
- If that’s true, it looks up the friendship relations of that instance
- Sends information to that instance
- Considers that instances friends as already informed - thus removing them from the send queue.

If an instance misbehaves by not relaying messages despite claiming to be doing so - unfriend it.

How would it be authenticated

Each instance publishes a public key that you can use to verify relayed messages.

I probably should get on to helping out developing Lemmy - it feels like there’s RFC’s to be written and interesting problems to be solved. Much more interesting than what I’m doing at work.

OrangeSlice@lemmy.ml · 2 years ago

They’re gonna move away from we sockets within a couple of weeks, from what I hear

binwiederhier@discuss.ntfy.sh · 2 years ago

That’s a good move IMHO. Honestly I don’t want my UI to randomly shift down when new messages come in from syncing with another instance.

The right move would be to make a page that renders once and then only updates when you refresh the page. And then use web push for message notifications.