Post Thumbnail

GPT-5 optimizes costs

The Register reveals OpenAI’s strategy and according to them, GPT-5 turned out to be not a revolution of capabilities, but genius cost optimization.

Instead of a monolithic model — composition of at least 2 systems: light and heavy, plus intelligent router. Imagine — each request is analyzed and the system automatically chooses the optimal model. Simple question — light model. Complex task — heavy artillery kicks in. Computing savings are enormous!

Automatic reasoning management becomes a key tool. Reasoning is activated only when truly necessary. Free users can’t control this. Less computation, fewer tokens, radical cost reduction. The smart system decides itself when deep thinking is needed and when surface answers suffice.

But why so? 700 million active users per week, but only 3% paying! ChatGPT became synonymous with AI, like Google became synonymous with search. But such leadership requires astronomical infrastructure costs.

Strategic limitations work for optimization. 8,000 tokens free, up to 128,000 for Plus and Pro subscribers. Temporary shutdown of GPT-4o, then return only for paying users. Every decision — part of a grand savings strategy.

Competitive pressure intensifies. Google has stable profit, own data centers and TPUs. Microsoft helps, but it’s not enough. OpenAI is forced to constantly seek financing for training and inference. Under these conditions, efficiency becomes a survival issue. So they survive as they can.

Perhaps the era of smart optimization begins, where engineering elegance matters more than brute force. And GPT-5 is like a manifesto of the new approach.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.

Latest News

Chinese autonomous tractor without steering wheel and cabin works in fields

Chinese company Shiyan Guoke Honghu Technology introduced the fully autonomous tractor Honghu T70. Which independently moves across fields and performs the entire spectrum of agricultural tasks without any human participation.

Nvidia introduced Jetson AGX Thor: 2560 cores for robots

Nvidia company presented a development for physical AI - Jetson AGX Thor. This isn't just a chip, this is literally a brain for future robots. Imagine — 2560 Blackwell cores and 128 GB of RAM in one compact device!

GPT-5 optimizes costs

The Register reveals OpenAI's strategy and according to them, GPT-5 turned out to be not a revolution of capabilities, but genius cost optimization.

Gemini 2.5 Flash Image beats GPT in 6 out of 7 benchmarks

Gemini 2.5 Flash Image just came out but is already crushing competitors in image generation. Beating GPT Image in 6 out of 7 benchmarks. 10 days of testing under codename nano banana — and here's the coolest result!

Moxi robots completed 300,000 deliveries in American hospitals

Etwa ein Drittel der Arbeitszeit verwenden die Roboter für Medikamentenlieferung, ein weiteres Drittel für den Transport von Analysenproben. Die verbleibende Zeit für Transport von Ausrüstung und Verbrauchsmaterialien. Auf Wunsch des Medizinpersonals statteten die Entwickler die Roboter mit speziellen verschließbaren Fächern für sichere Medikamententransporte aus.