OpenAI wants to eat everyone’s lunch. They don’t even respect robots.txt.
I’m gonna be real, I don’t think an AI company stealing every bit of data I ever put online to ingest into their AI so they can use it to make money really falls under “fair use.”
If it were individuals, who were doing it for shits and giggles, and not to make a profit, I’d have a different view. Because I kind of don’t care if you’re just some random asshat making an art project and aren’t using it to try make beaucoup bucks on the back of the whole planets intellectual labor.
Fair Use should be for humans not corporations. Fuck this “corporations are people too” schtick, they aren’t and the fact that they’re taking this data and using it to profit really should be viewed no differently than the argument that people made about The Pirate Bay, that through piracy, they were making money, which is what made it wrong.
If we keep allowing the rich to just steal from the public commons because of dumb shit like corporations having the same rights as people, we’re barrelling headlong into a Cyberpunk Dystopia, if we’re not there already.
Humans who are creating art and culture should have rights to Fair Use, not giant fucking corporations using it to profit wildly at the public expense. Copyright is all fucked up and broken, and this is a great example of why, because we’re aiming provisions that were meant to protect the public are now aimed at protecting corporations profits.
This is all exactly what I’ve been thinking, and better parsed too. If “we” don’t guide AI with the law now, we’re screwed.
good luck trying to convince our capitalist overlords to stop making money.
we need more than laws yo
It’s not even that the lawmakers are capitalists, it’s that they are ancient, and have no clue about this. There are sitting congresspeople who proudly say they’ve never sent an email.
I strongly doubt legislators are innocent old men, just caught off guard by tech, even if some might. They know what it is and are trying to figure out how to control it.
And the lawmakers don’t even need to be capitalists themselves (even though they mostly are), they just need to be in their pockets. Its not a coincidence their legislation is almost always beneficial to corporations one way or the other.
Hell, in the US lobbying is legal, its not even a secret.
I never called anyone innocent and never implied it. I was highlighting the true issue is that our whole legislature is completely outclassed by the progress of tech.
I agree.
I don’t like corporations using the data I put online to train their AI either. I’d happily give it to people to train a FOSS AI tho. I’ve also contributed my voice to Mozillas Common Voice project, which is an open dataset that everyone is allowed to use. If it’s something that everyone can use and benefit from equally, I’m happy to help. I’m not happy to help some corporation make even more profits tho. At least not without getting payed.
That doesn’t sound very “Open”.
Sort of like if there was a guy named Open Kevin, whose whole thing was walking into people’s homes and eating all of their food.
Oh it’s open alright. Open to making money.
No, you’re open.
Yeah, they’ve been wanting that for a while. Microsoft and Google both already have AI search products, and the only thing we’ve really learned from those are that AI search products don’t work.
Also
Altman is Google’s nightmare that it can’t wake up from.
Holy mother of dickriding. This article mainly just seems like a list of statements for AI tech bros to parrot to each other.
ai search products don’t work
Idk how you got to that conclusion. I’m trying to change my habit of googling coding problems because phind.com will 9/10 times have an answer faster and more accurate than if I googled it.
I can’t imagine that whatever they build will be any worse than Google Search.
OpenAI’s existential problem is that they’ll eat their own lunch and then have nothing left. The reason people make useful content now and give it away for free is because they can get paid for the traffic.
Take that traffic away and all the content goes behind paywalls and login screens where OpenAI can’t touch it.
But the content has already been absorbed. I wouldn’t be surprised if they have all of it sucked up (many would argue illegally) and stored as a corpus for them to iterate onto. It’s not like they go out to touch all the web every time they train a new version of their model.
Right, they already have scary amounts of data.
One of the craziest facts about GPT (to me) is that it was trained on 570GB of text data. That’s obviously a lot of text, but it’s bewildering to me that I could theoretically store their entire training dataset on my laptop.
Lol people will literally bring openai with them past paywalls and logins.
Exactly, not getting money from almost all visits is still better than not getting any visits
If they eventually do have a search engine, it definitely can’t be trusted. GPT-4 loves to hallucinate.
awesome. google ruined the internet, would be great for them to face an existential threat
This is the best summary I could come up with:
The AI search engine could be a new feature for ChatGPT, or a potentially separate app altogether.
Microsoft Bing would allegedly power the service from Sam Altman, which could be the most serious threat Google Search has ever faced.
The report says this new search product could be faster than ChatGPT, without sacrificing its powerful summarizing abilities.
Now, OpenAI is coming directly for the crown, in an attempt to dethrone the most dominant internet service of all time.
Perplexity, a small AI search startup with backing from Jeff Bezos, is also competing for Google’s crown.
If such a small team can steal 10 million people from Google, imagine what OpenAI and Microsoft can do.
The original article contains 405 words, the summary contains 114 words. Saved 72%. I’m a bot and I’m open source!
This guy looks like AI.