Post Thumbnail

GPT-5 Codex vs Claude Code: free attack on Anthropic

OpenAI introduced GPT-5 Codex. A specialized version of their flagship model, completely reimagined for programming!

The new model replaces the outdated o3 in the Codex web client and is already available in local tools Codex CLI and plugin for popular IDEs. The access policy is interesting. The service is free for all ChatGPT subscribers regardless of plan. Essentially, GPT-5 Codex is a direct attack on competing solution Claude Code from Anthropic.

The model underwent specialized fine-tuning on a large dataset of real tasks. From creating projects from scratch to large-scale refactoring and code reviews. Although improvement on standard SWE-bench Verified benchmark seems insignificant. 74.5% versus previous 72.8%. But OpenAI says their internal tests show impressive jump in refactoring tasks. From 33.9% to 51.3%. We have to take their word for it.

The most interesting achievement is adaptive distribution of computational resources. The model generates shorter responses for simple tasks and more detailed ones for complex ones. According to OpenAI, GPT-5 Codex also demonstrates unprecedented autonomy, working up to 7 hours straight on complex tasks. Iteratively improving solutions and fixing errors in tests.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
OpenAI promises to create full-fledged AI scientist by 2028

OpenAI promised to create a full-fledged AI-based scientist by 2028. Company CEO Sam Altman also stated that deep learning systems will be able to perform functions of research scientists at intern level by September next year. And the level of an autonomous full-fledged AI researcher could be achieved by 2028.

Jobs for young IT specialists in Britain collapsed by 46%

You know what's happening in the job market for young IT specialists in Great Britain? Over the last year, the number of jobs for young specialists collapsed by 46%. And a further drop of 53% is forecast, reports The Register. Citing statistics from the Institute of Student Employers.

Pavel Durov introduced Cocoon - decentralized network for launching AI

Telegram head Pavel Durov spoke at the Blockchain Life conference in Dubai and presented his new project called Cocoon there. And this is an attempt to challenge big corporations' monopoly on AI.

AI models may develop self-preservation instinct, scientists warned

Palisade Research, a company engaged in AI safety research, stated that models may develop their own self-preservation instinct. And some advanced models resist shutdown, and sometimes even sabotage shutdown mechanisms.

AI passed Turing test in music

University of Minas Gerais in Brazil conducted an experiment. Participants were given pairs of songs, in each of which was one generated track. They needed to determine which one exactly. And the results were unexpected.