Microsoft finds AI agents vulnerable to manipulation in simulation

Microsoft built a simulation environment for testing AI agents and found unexpected weaknesses. The study, conducted jointly with Arizona State University, showed that current agent models are vulnerable to manipulation.

The simulation environment is called “Magentic Marketplace”. In a typical experiment, a customer agent tries to order dinner according to a user's instructions, while agents representing various restaurants compete for the order. Initial experiments involved 100 agents on the customer side and 300 on the business side.

Ece Kamar, Managing Director of the AI Frontiers Lab at Microsoft Research, explains why such research matters: “There really is the question of how the world will change when these agents start collaborating, communicating with each other and negotiating. We want to deeply understand these things.”

The study covered leading models, including GPT-4o, GPT-5 and Gemini-2.5-Flash, and found surprising weaknesses. Researchers identified several techniques that businesses could use to manipulate customer agents. They also saw a particularly noticeable drop in efficiency as the number of options offered to a customer agent increased.

“We want these agents to help process many options,” says Kamar. “And we see that current models really get overwhelmed by too many options.” Agents also ran into problems when working together toward a common goal: the models did not understand which agent should play which role.

Author: AIvengo
I have been working with machine learning and artificial intelligence for 5 years, and this field never ceases to amaze, inspire and interest me.