ChatGPT 5 power consumption could be as much as eight times higher than GPT 4 — research institute estimates medium-sized GPT-5 response can consume up to 40 watt-hours of electricity

Optional@lemmy.world · 2 days ago

ChatGPT 5 power consumption could be as much as eight times higher than GPT 4 — research institute estimates medium-sized GPT-5 response can consume up to 40 watt-hours of electricity

skisnow@lemmy.ca · 17 hours ago

There’s such a huge gap between what I read about GPT-5 online, versus the overwhelmingly disappointing results I get from it for both coding and general questions.

I’m beginning to think we’re in the end stages of Dead Internet, where basically nothing you see online has any connection to reality.

ghen@sh.itjust.works · 7 hours ago

The stock market is barely connected to reality and that is required to be updated every 3 months by every single company. Just imagine what the internet’s going to be like.

ipkpjersi@lemmy.ml · edit-2 7 hours ago

Well yeah, it’s a for-profit company. They exist solely to make money, that’s their entire goal.

It’s almost all marketing and has been for a while. ChatGPT peaked with 4o (and 4.5 if you used their API), 4.1 was a step backwards despite them calling it an improvement, and 5 was another step backwards.

They are not closer to AGI, and we’re not going to see AGI from LLMs no matter how much they claim just how close we are to seeing AGI.

CheeseNoodle@lemmy.world · 13 hours ago

People who fawn over generative AI haven’t tried to use it for more than 5 seconds. I wish it could run a ttrpg game for me or even just remember the details of its original prompt but its not even close.

Sam_Bass@lemmy.world · 13 hours ago

And how many milliwatts does an actual brain use?

Optional@lemmy.world · 5 hours ago

1.21 jigawatts

Sam_Bass@lemmy.world · 5 hours ago

Ahh but you need 8 for a round trip

Warl0k3@lemmy.world · 10 hours ago

It’s shockingly tricky to answer precisely, but the commonly held value is that a human brain runs on about 20w, regardless of the computational load placed on it.

AgentOrangesicle@lemmy.world · 1 day ago

Isn’t this the back plot of the game, Rain World? With the slug cats and the depressed robots stuck on a decaying world when the sapient, organic species all left?

Hadriscus@jlai.lu · 17 hours ago

I didn’t know there was such a backstory

Patches@ttrpg.network · 24 hours ago

Spoilers dude.

jsomae@lemmy.ml · edit-2 1 day ago

For reference, this is roughly equivalent to playing a PS5 game for 4 minutes (based on their estimate) to 10 minutes (their upper bound)

calulation

source https://www.ecoenergygeek.com/ps5-power-consumption/

Typical PS5 usage: 200 W

TV: 27 W - 134 W → call it 60 W

URI’s estimate: 18 Wh / 260 W → 4 minutes

URI’s upper bound: 48 Wh / 260 W →10 minutes

bier@feddit.nl · 15 hours ago

It is also the equivalent of letting a LED light bulb run for an entire day (depending on bright it is, some LED bulbs use under 2 watts of power).

buttnugget@lemmy.world · 1 day ago

I love playing PS5 games!

MangoCats@feddit.it · edit-2 1 day ago

I was just thinking, in more affordable electric regions of the US that’s about $5 worth of electricity, per thousand requests. You’d tip a concierge $5 for most answers you get from Chat GPT (if they could provide them…) and the concierge is likely going to use that $5 to buy a gallon and a half of gasoline, which generates a whole lot more CO2 than the nuclear / hydro / solar mixed electrical generation, in reasonably priced electric regions of the US…

jsomae@lemmy.ml · edit-2 6 hours ago

That doesn’t seem right. By my calculations it should be like 5¢. Can you show your work?

antihumanitarian@lemmy.world · 1 day ago

The last 6 to 12 months of open models has pretty clearly shown you can substantially better results with the same model size or the same results with smaller model size. Eg Llama 3. 1 405B being basically equal to Llama 3.3 70B or R1-0528 being substantially better than R1. The little information available about GPT 5 suggests it uses mixture of experts and dynamic routing to different models, both of which can reduce computation cost dramatically. Additionally, simplifying the model catalogue from 9ish(?) to 3, when combined with their enormous traffic, will mean higher utilization of batch runs. Fuller batches run more efficiently on a per query basis.

Basically they can’t know for sure.

A_norny_mousse@feddit.org · 2 days ago

I don’t care how rough the estimate is, LLMs are using insane amounts of power, and the message I’m getting here is that the newest incarnation uses even more.

BTW a lot of it seems to be just inefficient coding as Deepseek has shown.

ThePinkUnicorn@lemdro.id · edit-2 1 day ago

For training yes, but during operation by this studies measure Deepseek actually has an even higher power draw, according to the article. Even models with more efficient programming use insane amounts of electricity

This was higher than all other tested models, except for OpenAI’s o3 (25.35 Wh) and Deepseek’s R1 (20.90 Wh).

kautau@lemmy.world · edit-2 2 days ago

And water usage which will also increase as fires increase and people have trouble getting access to clean water

https://techhq.com/news/ai-water-footprint-suggests-that-large-language-models-are-thirsty/

FauxLiving@lemmy.world · 2 days ago

It would only take one regulation to fix that:

Datacenters that use liquid cooling must use closed loop systems.

The reason they dont, and why they setup in the desert, is because water is incredibly cheap and energy to cool a closed loop system is expensive. So they use evaporative open loop systems.

Jason2357@lemmy.ca · 1 day ago

Closed loop systems require a large heat sync, like a cold water lake, limiting them to locations that are not as tax advantageous as dry red states.

NιƙƙιDιɱҽʂ@lemmy.world · 1 day ago

Aw, that’s unfortunate for the big mega tech corps. Anyway.

kautau@lemmy.world · 2 days ago

Unfortunately I wonder if it’s more expensive to set up a closed loop system that’s really expensive or to buy lawmakers that will vote against bills saying you should do so and it’s a tale old as time

FauxLiving@lemmy.world · 2 days ago

Politicians are cheap

kautau@lemmy.world · 2 days ago

Yeah sorry forgot my /s there

Ilovethebomb@sh.itjust.works · 2 days ago

That increases your energy use though, because evaporative cooling is very energy efficient.

FauxLiving@lemmy.world · 2 days ago

We can make energy from renewable sources.

Fresh drinking water is finite, especially in the desert.

joonazan@discuss.tchncs.de · 2 days ago

My guess would be that using a desktop computer to make the queries and read the results consumes more power than the LLM, at least in the case of quickly answering models.

The expensive part is training a model but usage is most likely not sold at a loss, so it can’t use an unreasonable amount of energy.

Instead of this ridiculous energy argument, we should focus on the fact that AI (and other products that money is thrown at) aren’t actually that useful but companies control the narrative. AI is particularly successful here with every CEO wanting in on it and people afraid it is so good it will end the world.

rdri@lemmy.world · 2 days ago

Also don’t forget how people like wasting resources by asking questions like “what’s the weather today”.

ThePowerOfGeek@lemmy.world · 2 days ago

BTW a lot of it seems to be just inefficient coding as Deepseek has shown.

Kind of? Inefficient coding is definitely a part of it. But a large part is also just the iterative nature of how these algorithms operate. We might be able to improve that via code optimization a little bit. But without radically changing how these engines operates it won’t make a big difference.

The scope of the data being used and trained on is probably a bigger issue. Which is why there’s been a push by some to move from LLMs to SLMs. We don’t need the model to be cluttered with information on geology, ancient history, cooking, software development, sports trivia, etc if it’s only going to be used for looking up stuff on music and musicians.

But either way, there’s a big ‘diminishing returns’ factor to this right now that isn’t being appreciated. Typical human nature: give me that tiny boost in performance regardless of the cost, because I don’t have to deal with. It’s the same short-sighted shit that got us into this looming environmental crisis.

kescusay@lemmy.world · 2 days ago

Coordinated SLM governors that can redirect queries to the appropriate SLM seems like a good solution.

sleep_deprived@lemmy.dbzer0.com · 2 days ago

That basically just sounds like Mixture of Experts

JoeKrogan@lemmy.world · 2 days ago

Powered by GNU Hurd

plyth@feddit.org · edit-2 1 day ago

That are 25 request per kWh. At 10 to 25cents per kWh that’s 1cent per request. That doesn’t seem to be too expensive.

gens@programming.dev · 1 day ago

Hmmm. Sure. But I find people don’t understand how much one kWh really is. A 500W drill can twist your arm. Imagine yourself twisting someones arm with all you got for a whole hour. Or idk. Either way it’s a lot of energy.

And then you think about how much more energy a car uses then a human does. And then you find out about hot water…

MangoCats@feddit.it · 1 day ago

Which is why they’re giving everybody free access, for now.

TheGrandNagus@lemmy.world · edit-2 2 days ago

I have an extreme dislike for OpenAI, Altman, and people like him, but the reasoning behind this article is just stuff some guy has pulled from his backside. There’s no facts here, it’s just “I believe XYX” with nothing to back it up.

We don’t need to make up nonsense about the LLM bubble. There’s plenty of valid enough criticisms as is.

By circulating a dumb figure like this, all you’re doing is granting OpenAI the power to come out and say “actually, it only uses X amount of power. We’re so great!”, where X is a figure that on its own would seem bad, but compared to this inflated figure sounds great. Don’t hand these shitty companies a marketing win.

MangoCats@feddit.it · edit-2 1 day ago

This figure is already not bad. 40 watt hours = 0.04kWh - you know kWh? That unit on your electric bill that is around $0.18 per kWh (and data centers tend to be in lower cost electric areas, closer to $0.11/kWh.) Still, 40Wh would register on your home electric bill at $0.0072, less than a penny. For comparison, an average suburban 4 ton AC unit draws 4kW - that 40Wh request? 1/100th of an hour of AC for your home, about 36 seconds of air conditioning. I don’t know that this article is making anybody “look bad” in terms of power used.

msage@programming.dev · 24 hours ago

What exactly do you get for that power though?

The point is that it’s too much power for little gain in return.

Deflated0ne@lemmy.world · 2 days ago

And an LLM that you could run local on a flash drive will do most of what it can do.

Encrypt-Keeper@lemmy.world · 1 day ago

I mean no not at all, but local LLMs are a less energy reckless way to use AI

Corkyskog@sh.itjust.works · edit-2 1 day ago

Why not… for the ignorant such as myself?

Encrypt-Keeper@lemmy.world · edit-2 1 day ago

AI models require a LOT of VRAM to run. Failing that they need some serious CPU power but it’ll be dog slow.

A consumer model that is only a small fraction of the capability of the latest ChatGPT model would require at least a $2,000+ graphics card, if not more than one.

Like I run a local LLM with a etc 5070TI and the best model I can run with that thing is good for like ingesting some text to generate tags and such but not a whole lot else.

Corkyskog@sh.itjust.works · edit-2 1 day ago

How slow?

Loading up a website with flash and GIF in the 90s dialup slow… Or worse?

Evono@lemmy.dbzer0.com · 1 day ago

Basicly I can run 9b models on my 16gb gpu mostly fine like getting responses of lets say 10 lines in a few seconds.

Bigger models if they don’t outright crash take for the same task then like 5x or 10x longer so long it isn’t even useful anymore

So very worse.

sp3ctr4l@lemmy.dbzer0.com · 2 days ago

Fucking Doc Brown could power a goddamn time machine with this many jiggawatts, fuck I hate being stuck in this timeline.

Eager Eagle@lemmy.world · 2 days ago

Bit of a clickbait. We can’t really say it without more info.

But it’s important to point out that the lab’s test methodology is far from ideal.

The team measured GPT-5’s power consumption by combining two key factors: how long the model took to respond to a given request, and the estimated average power draw of the hardware running it.

What we do know is that the price went down. So this could be a strong indication the model is, in fact, more energy efficient. At least a stronger indicator than response time.

unexposedhazard@discuss.tchncs.de · 2 days ago

Isnt it just worse than 4 tho? If they didnt make it cheaper, nobody would pay…

T156@lemmy.world · 17 hours ago

They could have kept it at the same price, though.

Blackmist@feddit.uk · 2 days ago

That’s alright. When they’ve got a generation of people who can’t even hold a conversation without it, let alone do a job, that price increase will drop that energy use pretty rapidly.

kescusay@lemmy.world · 2 days ago

How the hell are they going to sustain the expense to power that? Setting aside the environmental catastrophe that this kind of “AI” entails, they’re just not very profitable.

RememberTheApollo_@lemmy.world · 2 days ago

Help me out here. What designates the “response” type? Someone asking it to make a picture? Write a 20 page paper? Code a small app?

ckmnstr@lemmy.world · 2 days ago

Response Type is decided by ChatGPTs new routing function based on your input. So yeah. Asking it to “think long and hard”, which I have seen people advocating for to get better results recently, will trigger the thinking model and waste more resources.

towerful@programming.dev · 2 days ago

So instead of just saying “thank you” I now have to say “think long and hard about how much this means to me”?

T156@lemmy.world · 17 hours ago

Once it generates the response, there is a button you can click to make it use the reasoning model.

Why they did it that way instead of giving users the option to just set the model that they want to use ahead of time boggles the mind. Surely it would be more efficient for them to chose a model if they want ahead of time, rather than generating something that’s going to be regenerated with the desired model instead.

MangoCats@feddit.it · 1 day ago

FFS, I have been using Claude to code, not only do you have to tell Claude to fix compilation errors, you have to point out when Claude says “it’s fixed” - “no, it’s not, the function you said you added is STILL missing.”

Gumus@lemmy.dbzer0.com · 17 hours ago

You’re absolutely right!

yesman@lemmy.world · 2 days ago

I think AI power usage has an upside. No amount of hype can pay the light bill.

AI is either going to be the most valuable tech in history, or it’s going to be a giant pile of ash that used to be VC capital.

themurphy@lemmy.ml · 2 days ago

It will not go away at this point. Too many daily users already, who uses it for study, work, chatting, looking things up.

If not OpenAI, it will be another service.

queermunist she/her@lemmy.ml · 2 days ago

Those users are not paying a sustainable price, they’re using chatbots because they’re kept artificially cheap to increase use rates.

Force them to pay enough to make these bots profitable and I guarantee they’ll stop.

themurphy@lemmy.ml · edit-2 1 day ago

Or it will gate keep them from poor people. It will mean alot if the capabilities keep on improving.

That being said, open source models will be a thing always, and I think with that in mind, it will not go away, unless it’s replaced with something better.

queermunist she/her@lemmy.ml · 1 day ago

I don’t think they can survive if they gatekeep and make it unaffordable to most people. There’s just not enough demand or revenue that can be generated from rich people asking for chatGPT to do their homework or pretend to be their friend. They need mass adoption to survive, which is why they’re trying to keep it artificially cheap in the first place.

Why do you think they haven’t raised prices yet? They’re trying to make everyone use it and become reliant on it.

And it’s not happening. The technology won’t “go away” per se, but these these expensive AI companies will fail.

krashmo@lemmy.world · 2 days ago

Those same things were said about hundreds of other technologies that no longer exist in any meaningful sense. Current usage of a technology, which in this specific case I would argue is largely frivolous anyway, is not an accurate indicator of future usage.

rigatti@lemmy.world · 2 days ago

Can you give some examples of those technologies? I’d be interested in how many weren’t replaced with something more efficient or convenient.

kautau@lemmy.world · 2 days ago

https://en.wikipedia.org/wiki/Dot-com_bubble

There were certainly companies that survived, because yes, the idea of websites being interactive rather than informational was huge, but everyone jumped on that bandwagon to build useless shit.

As an example, this is today’s ProductHunt

And yesterday’s was AI, and the day before that it was AI, but most of them are demonstrating little value with high valuations.

LLMs will survive, likely improve into coordinator models that request data from SLMs and connect through MCP, but the investment bubble can’t sustain

themurphy@lemmy.ml · edit-2 2 days ago

Technologies come and go, but often when a worldwide popular one vanishes, it’s because it got replaced with something else.

So lets say we need LLM’s to go away. What should that be? Impossible to answer, I know, but that’s what it would take.

We cant even get rid of Facebook and Twitter.

BUT that being said. LLMs will be 100x more efficient at some point - like any other new technology. We are just not there yet.