- cross-posted to:
- [email protected]
AI girlfriend bots are already flooding OpenAI’s GPT store::OpenAI’s store rules are already being broken, illustrating that regulating GPTs could be hard to control
Once LLMs can have perfect memory of past conversations, we are going to see a lot of companion bots. Running into the context window sucks.
I typed a comment saying something nice about Her.
Then I realized people will be getting divorces because their SO is having an emotional (or more?) affair with a bot. Drains the joy, man.
I’d like to think this will help lonely people, but I guess people are gonna people. Here’s hoping the AI isn’t “there” yet.
It could help with the symptoms of loneliness, but it might also worsen the root causes, like social isolation and/or personal insecurities. It’s only expected that AI chatbots will somewhat reflect the expectations of their users, which might reinforce the biased and negative thought patterns those users feed into them.
If someone sees it as a plaything, there is nothing wrong with that, but it’s way too easy for people to take it too far. There are people who do that with static characters and rudimentary dating sims already.
Yep. Gotta exist in meatspace sometime. It’s just how we evolved. We need people. People proxies aren’t as nourishing.
Yet.
Removed by mod
I agree that it has limits but there are things we could do to make it reasonably good. ChatGPT knows how to execute actions (such as calling an API or doing a web search). It could probably be made to store and look up information in a vector database, essentially giving it a long-term memory.
Given some smaller breakthroughs in performance and model size we could conceivably retrain the network on new input continuously, in order to incorporate new knowledge.
Removed by mod
Vector databases are relatively good at this kind of thing, because they can find records based on queries that are semantically close instead of just doing a lexical search. It would probably still make sense to split the information up into fragments such as “Interstellar movie,” “watched on February 2nd, 2021,” “made me vomit”, and then connect those records to each other. GPTs are good at that kind of preprocessing. The idea would not be to store exact data such as timestamps, since that’s not how vector databases work, so recall would be more associative, just like for humans (I can’t ask you what movie you watched on February 2nd, 2021 and expect an accurate reply either).
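To make the fragment idea concrete, here is a minimal sketch. Everything in it is an assumption for illustration: `embed` just counts words (so it is lexical, not semantic, and only stands in for a real embedding model), and `FragmentMemory`, `add_event` and `recall` are made-up names, not any vector database's API. It only shows the "store fragments, link them, recall the whole group from one hit" part.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class FragmentMemory:
    """Hypothetical store: fragments that share an event_id are linked."""

    def __init__(self):
        self.records = []  # list of (event_id, text, vector)

    def add_event(self, event_id, fragments):
        for text in fragments:
            self.records.append((event_id, text, embed(text)))

    def recall(self, query):
        """Find the closest fragment, then return every fragment linked to it."""
        if not self.records:
            return []
        q = embed(query)
        best = max(self.records, key=lambda r: cosine(q, r[2]))
        if cosine(q, best[2]) == 0.0:
            return []  # nothing even loosely related
        return [text for event_id, text, _ in self.records if event_id == best[0]]


memory = FragmentMemory()
memory.add_event(1, ["Interstellar movie", "watched on February 2nd, 2021", "made me vomit"])
memory.add_event(2, ["Titanfall game", "played last winter", "liked the robot"])
recalled = memory.recall("what did I think of Interstellar?")
```

The query only matches the “Interstellar movie” fragment directly, but the link pulls in the date and the reaction with it, which is the associative recall described above.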
Removed by mod
Again, you will certainly hit limitations if you push it, but the example you give would work fine if you just append the added information to the database. A query for Interstellar would return both your original statements and the fact that you later said you lied about it, and all of these records are inserted into the GPT’s context (short-term memory) when discussing that subject.
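A rough sketch of that append-only behaviour, under loud assumptions: the lookup here is plain keyword overlap swapped in for a real vector search, and `AppendOnlyMemory` and `build_context` are hypothetical names. The point is only that nothing gets overwritten, so the original statement and the later correction both land in the context.

```python
import re


def words(text):
    """Lowercase word set; a crude stand-in for semantic matching."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


class AppendOnlyMemory:
    """Hypothetical long-term memory: records are only ever appended."""

    def __init__(self):
        self.log = []

    def remember(self, statement):
        self.log.append(statement)

    def recall(self, subject):
        """Return every stored statement sharing a word with the subject, oldest first."""
        wanted = words(subject)
        return [s for s in self.log if wanted & words(s)]


def build_context(memory, subject, user_message):
    """Assemble the text that would be placed in the model's context window."""
    lines = ["Things the user said earlier about this subject (oldest first):"]
    lines += [f"- {s}" for s in memory.recall(subject)]
    lines.append(f"User: {user_message}")
    return "\n".join(lines)


memory = AppendOnlyMemory()
memory.remember("I watched Interstellar and it made me vomit.")
memory.remember("I went hiking on Sunday.")
memory.remember("I lied earlier: Interstellar did not make me vomit.")

hits = memory.recall("Interstellar")
prompt = build_context(memory, "Interstellar", "Do you remember what I thought of Interstellar?")
```

The model then sees both statements in order and has to work out that the later one overrides the earlier one, which is where the limitations would show up if you push it.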
…i may be too green to see something here, but wouldn’t simply saving month, year, topic, mood and quote be enough? If the AI needs everything formatted as a certain input, run it through an API. Teach the AI to save only moments where the user uses agitated language or smth, and to periodically check if the current convo allows for a throwback, for example by topic, with an advanced query when the user asks if the AI remembers something.
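For what it's worth, the naive version of that proposal could look something like the sketch below. All names (`Moment`, `MomentLog`, `is_agitated`) and the word list are invented for illustration, and the "agitated language" check is a deliberately dumb keyword test rather than anything a real system would rely on.

```python
from dataclasses import dataclass

# Hypothetical word list; a real system would need something far better.
AGITATED_WORDS = {"hate", "furious", "awful", "vomit"}


@dataclass
class Moment:
    month: int
    year: int
    topic: str
    mood: str
    quote: str


def is_agitated(message):
    """Crude check: does the message contain any word from the list?"""
    tokens = {w.strip(".,!?").lower() for w in message.split()}
    return bool(tokens & AGITATED_WORDS)


class MomentLog:
    def __init__(self):
        self.moments = []

    def maybe_save(self, month, year, topic, mood, message):
        """Store the message only if it looks agitated; report whether it was saved."""
        if is_agitated(message):
            self.moments.append(Moment(month, year, topic, mood, message))
            return True
        return False

    def throwbacks(self, topic):
        """Saved moments matching the current conversation topic."""
        return [m for m in self.moments if m.topic == topic]


log = MomentLog()
saved_first = log.maybe_save(2, 2021, "movies", "disgusted", "Interstellar made me vomit!")
saved_second = log.maybe_save(2, 2021, "movies", "calm", "The popcorn was fine.")
matches = log.throwbacks("movies")
```

Exact-match topics and a fixed word list are exactly the parts that break first in practice.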
Then sell all this data for fat profit.
Removed by mod
Thanks for the examples, now yeah, that really ain’t that simple…and hard af to foolproof. :/
It’s when people dive into this sort of memory stuff that I always remember: “oh yeah, this is why people call it a stochastic parrot.”
LLMs can do a lot. But without memory, they run into walls fast.
There are two issues with large prompts. One is linked to the current language technology, where the computation time and memory usage scale badly with prompt size. This is being solved by projects such as RWKV or Mamba, but these remain unproven at large sizes (more than 100 billion parameters). Somebody will have to spend some millions to train one.
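A back-of-the-envelope illustration of "scale badly", assuming standard full self-attention, where every token in the prompt is scored against every other token. The function names are made up, and this counts pairwise scores only, ignoring layers, heads and every other cost.

```python
def attention_score_entries(prompt_tokens):
    """Pairwise attention scores for one head in one layer (full self-attention assumed)."""
    return prompt_tokens * prompt_tokens


def relative_cost(short_prompt, long_prompt):
    """How many times more pairwise scores the longer prompt needs."""
    return attention_score_entries(long_prompt) / attention_score_entries(short_prompt)


# Going from a 2k-token prompt to an 8k-token prompt: 4x the length.
ratio = relative_cost(2_000, 8_000)
print(ratio)  # 16.0
```

So a prompt that is 4 times longer needs 16 times as many pairwise scores under this assumption, which is the growth that architectures like RWKV and Mamba are trying to avoid.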
The other issue will probably be harder to solve. There is less high quality long context training data. Most datasets were created for small context models.
I never considered that this was a dynamic that was involved. That’s interesting. So each piece of data fed into a model during training also has to fit into a “context window” of a certain size too?
Yes to your question, but that’s not what I was saying.
Here is one of the most popular training datasets: https://pile.eleuther.ai/
If you look at the PDF describing the dataset, you’ll find that these documents are fairly short, with a mean length of less than 20 KB (20,000 characters) for most of them.
You are asking for a model to retain a memory for the whole duration of a discussion, which can be very long. If I chat for one hour I’ll type approximately 8400 words, or around 42 KB. That is longer than most documents in the training set. If I chat for 20 hours, it’ll be longer than almost all the documents in the training set. The model needs to learn how to extract information from a long context, and it can’t do that well if the documents it trained on are short.
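Spelling out that arithmetic, using only the figures above (8400 words and about 42 KB per hour, a 20 KB mean document, and 1 KB taken as 1000 characters). The constant and function names are just for illustration.

```python
WORDS_PER_HOUR = 8_400     # typing rate quoted above
BYTES_PER_HOUR = 42_000    # about 42 KB of text per hour of chat
MEAN_DOC_BYTES = 20_000    # upper end of the mean document length quoted above


def chat_bytes(hours):
    """Approximate size of a chat transcript after the given number of hours."""
    return hours * BYTES_PER_HOUR


bytes_per_word = BYTES_PER_HOUR / WORDS_PER_HOUR   # 5.0 characters per word, spaces included
one_hour = chat_bytes(1)                           # 42000
twenty_hours = chat_bytes(20)                      # 840000
ratio = twenty_hours / MEAN_DOC_BYTES              # 42.0
```

So one hour of chat is already about twice the 20 KB figure, and 20 hours is about 42 times that size.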
You are also right that during training the text is cut off. A value I often see is 2k to 8k tokens. This is arbitrary; some models are trained with a cutoff of 200k tokens. You can use models on context lengths longer than what they were trained on (with some caveats), but performance falls off badly.
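A tiny sketch of what the cut-off means, with assumptions flagged: whitespace splitting stands in for a real tokenizer, `truncate_for_training` is a made-up helper, and real pipelines may split or pack long documents into several pieces instead of simply dropping the tail.

```python
def truncate_for_training(text, max_tokens):
    """Keep only the first max_tokens tokens; report how many were dropped."""
    tokens = text.split()  # stand-in for a real tokenizer
    kept = tokens[:max_tokens]
    dropped = len(tokens) - len(kept)
    return kept, dropped


# A synthetic 10,000-token document cut off at 8k tokens.
doc = " ".join(f"w{i}" for i in range(10_000))
kept, dropped = truncate_for_training(doc, 8_000)
print(len(kept), dropped)  # 8000 2000
```

With a plain cut-off like this, the model never sees the dropped tail together with the start of the document, so it gets no practice connecting the two.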
Yeah I dunno. It might be a fundamental flaw that you will run into forever. But I’m assuming the window will get quite large, and clever ways to “compress” the memory will be implemented.
Other people replied. I’ll go read those now…
I would argue that AI possibly makes a better companion in some ways when it’s a little stupid. I’ve mostly ignored AI but have been experimenting with local models a bit the last couple days while stuck hiding from the cold.
I found I like AI best around the “talking dog” level of intellect. Kind of like the Titanfall AI, he’s friendly and eager to uphold the mission, very competent at his job, but clearly not a human and kind of charmingly foolish. A dog is also a good companion, while clearly not a human and honestly a lot dumber than many AI models now.
Using it as an answer engine or to write code snippets feels like working with a dog on the farm, you talk to it but don’t expect too much back. It doesn’t give that uncanny feeling, just provides some company without feeling like something you’re trying to replace other humans with.
I’m a lot more accepting of talking dogs than something that pretends to be your girlfriend. That just comes off weird and creepy, to me.
For some reason having it running on my machine made it feel more like a real entity than typing into the cloud. Hard to explain, but I found I treated it with a lot more dignity than a cloud-based AI.