- cross-posted to:
- technology
- [email protected]
- cross-posted to:
- technology
- [email protected]
spoiler
There’s a big problem with generative AI, says Sasha Luccioni at Hugging Face, a machine-learning company. Generative AI is an energy hog.
“Every time you query the model, the whole thing gets activated, so it’s wildly inefficient from a computational perspective,” she says.
Take the Large Language Models (LLMs) at the heart of many Generative AI systems. They have been trained on vast stores of written information, which helps them to churn out text in response to practically any query.
“When you use Generative AI… it’s generating content from scratch, it’s essentially making up answers,” Dr Luccioni explains. That means the computer has to work pretty hard.
A Generative AI system might use around 33 times more energy than machines running task-specific software, according to a recent study by Dr Luccioni and colleagues. The work has been peer-reviewed but is yet to be published in a journal.
It’s not your personal computer that uses all this energy, though. Or your smartphone. The computations we increasingly rely on happen in giant data centres that are, for most people, out of sight and out of mind.
“The cloud,” says Dr Luccioni. “You don’t think about these huge boxes of metal that heat up and use so much energy.”
The world’s data centres are using ever more electricity. In 2022, they gobbled up 460 terawatt hours of electricity, and the International Energy Agency (IEA) expects this to double in just four years. Data centres could be using a total of 1,000 terawatts hours annually by 2026. “This demand is roughly equivalent to the electricity consumption of Japan,” says the IEA. Japan has a population of 125 million people.
At data centres, huge volumes of information are stored for retrieval anywhere in the world – everything from your emails to Hollywood movies. The computers in those faceless buildings also power AI and cryptocurrency. They underpin life as we know it.
But some countries know all too well how energy hungry these facilities are. There is currently a moratorium preventing the construction of new data centres in Dublin. Nearly a fifth of Ireland’s electricity is used up by data centres, and this figure is expected to grow significantly in the next few years – meanwhile Irish households are reducing their consumption.
The boss of National Grid said in a speech in March that data centre electricity demand in the UK will rise six-fold in just 10 years, fuelled largely by the rise of AI. National Grid expects that the energy required for electrifying transport and heat will be much larger in total, however.
Utilities firms in the US are beginning to feel the pressure, says Chris Seiple at Wood Mackenzie, a consultancy.
“They’re getting hit with data centre demands at the exact same time as we have a renaissance taking place – thanks to government policy – in domestic manufacturing,” he explains. Lawmakers in some states are now rethinking tax breaks offered to data centre developers because of the sheer strain these facilities are putting on local energy infrastructure, according to reports in the US.
Mr Seiple says there is a “land grab” going on for data centre locations near to power stations or renewable energy hubs: “Iowa is a hotbed of data centre development, there’s a lot of wind generation there.”
Some data centres can afford to go to more remote locations these days because latency – the delay, usually measured in milliseconds, between sending information out from a data centre and the user receiving it – is not a major concern for increasingly popular Generative AI systems. In the past, data centres handling emergency communications or financial trading algorithms, for example, have been sited within or very near to large population centres, for the absolute best response times.
There is little doubt that the energy demands of data centres will rise in the coming years, but there is huge uncertainty over how much, stresses Mr Seiple.
Part of that uncertainty is down to the fact that the hardware behind generative AI is evolving all the time.
Tony Grayson is general manager at Compass Quantum, a data-centre business, and he points to Nvidia’s recently launched Grace Blackwell supercomputer chips (named after a computer scientist and a mathematician), which are designed specifically to power high-end processes including generative AI, quantum computing and computer-aided drug design.
Nvidia says that, in the future, a company could train AIs several times larger than the largest AI systems currently available in 90 days using 8,000 of the previous generation of Nvidia chips. This would need a 15 megawatt electricity supply.
But the same work could be carried out in the same time by just 2,000 Grace Blackwell chips, and they would need a four megawatt supply, according to Nvidia.
That still ends up as 8.6 gigawatt hours of electricity consumed – roughly the same amount that the entire city of Belfast uses in a week.
“The performance is going up so much that your overall energy savings are big,” says Mr Grayson. But he agrees that power demands are shaping where data centre operators site their facilities: “People are going to where cheap power’s at.”
Dr Luccioni notes that the energy and resources required to manufacture the latest computer chips are significant.
Still, it is true that data centres have got more energy efficient over time, argues Dale Sartor, a consultant and affiliate of Lawrence Berkeley National Laboratory in the US. Their efficiency is often measured in terms of power usage effectiveness, or PUE. The lower the number, the better. State-of-the-art data centres have a PUE of around 1.1, he notes.
These facilities do still create significant amounts of waste heat and Europe is ahead of the US in finding ways of using that waste heat – such as warming up swimming pools – says Mr Sartor.
Bruce Owen, UK managing director at Equinix, a data centre firm, says, “I still think that the demand is going to grow further than that efficiency gain that we see.” He predicts that more data centres will be built with on-site power-generating facilities included. Equinix was denied planning permission for a gas-powered data centre in Dublin last year.
Mr Sartor adds that costs may ultimately determine whether Generative AI is worth it for certain applications: “If the old way is cheaper and easier then there’s not going to be much of a market for the new way.”
Dr Luccioni stresses, though, that people will need to clearly understand how the options in front of them differ in terms of energy efficiency. She is working on a project to develop energy ratings for AI.
“Instead of picking this GPT-derivative model that is very clunky and uses a lot of energy, you can pick this A+ energy star model that will be a lot more lightweight and efficient,” she says.
It’s bitcoin all over again.
Expect some functional countries like China to pass restrictions as soon as it becomes a big enough problem, while the people running the servers move their operations to countries with cheap electricity that don’t care as much.
except bitcoin is useless and these “ai” things have nonzero useful potential
Agreed, that’s why I went with “restrictions” instead of “get banned”, which is what happened to bitcoin in every major Chinese city IIRC.
Regardless of useful potential, and really regardless of whether AI flops and gets replaced with another thing, capitalist growth-logic will demand that all these data centers full of GPUs get used to their full capacity.
The world’s data centres are using ever more electricity. In 2022, they gobbled up 460 terawatt hours of electricity, and the International Energy Agency (IEA) expects this to double in just four years. Data centres could be using a total of 1,000 terawatts hours annually by 2026. “This demand is roughly equivalent to the electricity consumption of Japan,” says the IEA. Japan has a population of 125 million people.
Fucking insane. For unreliable chatbots built on stealing data.
Hugging Face, a machine-learning company
Come on, if I made that up, people would call it too on the nose.
Really makes you wonder if these companies that keep naming themselves after Sci fi villains have our best interests at heart.
Tiktok is a violation, Palantir is a trustworthy company that has no ties to organisations that have access to "legitimate violence " in my region
I posted about a company that spies on teachers and professors. The company says - and I’m not making this up - “Hi, My Name Is Panopto”.
Hi, My Name Is Panopto - Panopto
September 6, 2018
Panopto /pə-näp’-tō/
Ask anyone who’s founded a business – if you’re going to hang out your shingle, it helps to have a name to put on it. So — why “Panopto”?
Panopto derives from panoptic, “the point at which everything can be seen in one view.” Panoptic in turn comes directly from the Greek word panoptos, “seen by all”.
In video, the panoptic view is the one that allows the viewer to see everything at once.
The planet’s dyin’, Cloud
This all sucks so much. Using resources is fine if those resources are used for stuff that actually helps people, play included. Datacentres were already kinda stupid with how much goes towards just rich people bullshit moving imaginary beans around at near light speed vs stuff that maters like communication.
These new “ai systems” just suck though. They’re interesting, like they can be fun to play with and there’s some interesting research to do but they’re having insane resources thrown at them for extremely maginal performance gains. They’re also, despite the dubious state they’re in atm, being slathered on everything often not really providing any meaningful improvement or being actively harmful.
It’s somehow even dumber than rich people bean moving.
Both bitcoin and AI have that same fundamental problem where the goobers obsessed with it have Underpants Gnome Brain.
Step 1. Make wonky invention that can only work if line infinitely goes up, completely ignoring the point where the input demands are comic book villain shit, with no actual use-value in its present state to attract people to it
Step 2. ???
Step 3. Get rich quick
Hey, hey, hey, bitcoin can be used to buy drugs. Don’t put it in the same category as llms. It has one use :p
Of course that kinda got ruined by all the idiotic speculators. I wonder what value it’ll settle on. How valuable is buying drugs not off terrifying gangs to people?
Then you will probably not be pleased to know that AI tools are increasingly being used in blockchain analysis to trace transactions back to their owners. What a terrible convergence of both of these technologies.
Hahahahhahahahaha oh now that is just fucking hilarious.
Then again, if they go after money launderers and fraudsters hopefully it wont be too bad. I bet they’ll mostly string up kids with a bag of weed and a few pingas though.
Also good for buying gray market Brazilian homebrew estradiol
It’s not even good for buying drugs which is the only thing I ever used it for. The cops can easily trace wallets and the black markets have had multiple instances of people’s shipping information being doxxed including what drove Dread Pirate Roberts over the edge. You won’t be stabbed buying those drugs but you’re just as exposed in other ways and you’d then be stabbed in prison all the same. It’s even the same gangs selling on those markets because it’s easier to launder money and they have whatever shipping information you gave them.
Since they’re the most mature form of a crypto market though, I’d compare them instead to cannabis dispensaries and legal research chemical labs. Normal money works instantly, the production areas and drugs are regulated and inspected by third-parties, I and the seller both pay taxes. The black market has more variety but drug laws are trending toward legalising more things and with it the much better consumer experience/societal benefit of the non-crypto option.
Yeah it’s not magic untraceable money but if you’re careful you can maintain separate identities. Using your own address is silly. Tbh I haven’t touched it since it went over like 5 bucks. I dabbled in the very early days, if I’d kept on I probably would be a multimillionare lfmao, if you want proof money isn’t real.
Can’t say what I did because my country has no time limit on prosecutions :(
Also drug law is getting harsher in my country. Sort of. Some softening of stuff as medical uses but more drug testing and militarised police. Like you can get medical cannabis but it’s illegal to drive if any is in your system. Like even residual month ago stuff.
We’re boiling the oceans to make bored apes and deepfake porn.
I wonder if we are going down a well-trodden path and other intelligent species did similar things before us. Insane.
⍙⟒'⍀⟒ ⏚⍜⟟⌰⟟⋏☌ ⏁⊑⟒ ⍜☊⟒⏃⋏⌇ ⏁⍜ ⋔⏃☍⟒ ⏚⍜⍀⟒⎅ ⏃⌿⟒⌇ ⏃⋏⎅ ⎅⟒⟒⌿⎎⏃☍⟒ ⌿⍜⍀⋏.
That’s a translation of your comment according to the Alien Language Translator
FWIW most NFTs exist on ethereum or similar proof-of-stake networks that use much less energy than proof-of-work.
(On the other hand NFTs are still a terrible “investment”.)
this is why i dont believe AGI will happen any time soon
Even the middle manager dream of replacing their engineers and data analysts seems unfeasible.
Death to “AI” and inefficient centralized data centers!
putting the computers in one big room is probably more efficient than having them spread out all over the place though
What if we just put all the servers in outer space where it’s naturally cold, then connect an extremely long cable from space to earth?
Outer space has bad cooling, spaceships need tons of radiators.
If you want an absurd scifi-sounding way to do data centers though, it’s theoretically possible for a really big one to put all the computers under an open pool of water, let the water evaporate and condense into clouds at the top of the building, and let the giant building’s internal rain cycle do all the cooling.
Sadly the cable would wrap around Earth and cause a kind of hourglass thing to happen. This is bad for international shipping
You’d have to keep it in the earth’s shadow, the sun would cook the fuck out of a big box in space
even running them would cook them - there’s no air to take the heat away!
Good point! I didn’t even think about that.
I remember seeing something about smaller energy grids being easier to maintain and turn green, and a lot of data centers run on fossil fuels or water cooling (which disrupts ecosystems and causes thermal pollution). Not to mention the exploitation of data that happens with data centers. Fediverse and Peer-to-Peer networks are great examples of efficiency, while decreasing the amount of strain one large block of concrete can cause the environment and the ecosystems around it. Of course it’s not the full solution, but I find it to be a step in the right direction, for the environment, data exploitation, and user control. It is definetly more convinient though, being able to store all of your files though Google’s cloud service is easy. It is undeniable, however, that sybcibg your files Peer-to-Peer with syncthing would use much less energy, while also giving less control to capitalists.
Regardless of who owns the servers the servers exist and a distributed network is going to need more infrastructure than a centralized one. There will be additional inefficiencies caused by the distances data needs to travel, the number of times it needs to be resent, and the total capacity of the internet.
I understand your argument from the ecological perspective, it makes sense and we should obviously be taking that into consideration. But I really disagree with you that the total system would be more efficient than a more centralized one that uses datacenters.
I remember someone comparing the CO2 output of an AI to make a drawing compared to all the CO2 of an artist’s life, which was disingenuous on so many levels.
A few months ago I saw it quoted in a mainstream news article that one of the picture generators, Stable Diffusion I think, uses as much energy as charging your phone from 0-100%.
Yeah, I vaguely remember a glass of fresh water per query for one a couple of years ago
what the fuck is this doofy stock image
BBC (British Broadcasting Cheapasses)