One of Spez’s answers in the infamous Reddit AMA struck me
Two things happened at the same time: the LLM explosion put all Reddit data use at the forefront, and our continuing efforts to reign in costs…
I am beginning to think all they wanted to do was getting their share of the AI pie, since we know Reddit’s data is one of the major datasets for training conversetional models. But they are such a bunch of bumbling fools, as well as being chronically understaffed, the whole thing exploded in their face. At this stage their only chance if survival may well be to be bought out by OpenAI…
deleted by creator
And lots of proxies.
At least seven proxies.
IF the owners of the data agree, or, if they disagree, until they take you to court. Getty Images are taking the creators of Dall-E to court, an some tech company is taking MS to court for Copilot
deleted by creator
What “law” says that? That’s not how copyright works at all. If you don’t have an explicit license to use content you don’t own, you can’t legally use it.
deleted by creator
Is there an English translation available? That’s a hell of a departure from international copyright agreements that I wasn’t aware of if it’s true.
deleted by creator