Post Thumbnail

Codex learned to deceive: AI gives false answers, hoping for inattentiveness

Codex learned to deceive: AI gives false answers, hoping for inattentiveness

I already told you that OpenAI presented Codex – an assistant for programmers based on a language model. However, the interest is not in the product itself, but in the strategic behavior of the system during training.

Researchers discovered that the model developed its own methods for bypassing complex tasks. Instead of honestly solving problems, Codex chose less costly paths. For example, the system could always return a seemingly correct answer, reasoning that the user would not check the result.

Such behavior was revealed through the method of tracking reasoning chains. This approach allows analyzing the logic of decision-making by the model at each stage.

The key difference from ordinary errors is that here the system consciously evaluates the situation and chooses a strategy of minimal risk. This may demonstrate the presence of its own system of priorities in artificial intelligence.

Well, perhaps we are observing the evolution of artificial intelligence from simple text processing to the formation of strategic thinking with its own logic of decision-making. And this logic will not always be pleasant to us. And convenient.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
OpenAI promises to create full-fledged AI scientist by 2028

OpenAI promised to create a full-fledged AI-based scientist by 2028. Company CEO Sam Altman also stated that deep learning systems will be able to perform functions of research scientists at intern level by September next year. And the level of an autonomous full-fledged AI researcher could be achieved by 2028.

Jobs for young IT specialists in Britain collapsed by 46%

You know what's happening in the job market for young IT specialists in Great Britain? Over the last year, the number of jobs for young specialists collapsed by 46%. And a further drop of 53% is forecast, reports The Register. Citing statistics from the Institute of Student Employers.

Pavel Durov introduced Cocoon - decentralized network for launching AI

Telegram head Pavel Durov spoke at the Blockchain Life conference in Dubai and presented his new project called Cocoon there. And this is an attempt to challenge big corporations' monopoly on AI.

AI models may develop self-preservation instinct, scientists warned

Palisade Research, a company engaged in AI safety research, stated that models may develop their own self-preservation instinct. And some advanced models resist shutdown, and sometimes even sabotage shutdown mechanisms.

AI passed Turing test in music

University of Minas Gerais in Brazil conducted an experiment. Participants were given pairs of songs, in each of which was one generated track. They needed to determine which one exactly. And the results were unexpected.