Post Thumbnail

Anthropic CEO: Chinese AI failed safety test

Anthropic CEO Dario Amodei expressed serious concerns about DeepSeek, a Chinese company that recently surprised Silicon Valley with its R1 model. His concerns go beyond usual claims about user data transfer to China.

In an interview with Jordan Schneider’s ChinaTalk podcast, Amodei stated that the DeepSeek model generated sensitive information about biological weapons during safety testing conducted by Anthropic. “These were the worst results among all models we’ve ever tested,” Amodei claims. “It completely lacked any blocks against generating such information.”

According to Anthropic’s CEO, such evaluations are regularly conducted by the company for various AI models to identify potential national security risks. The team checks if models can generate information about biological weapons that is difficult to find on Google or in textbooks. Anthropic positions itself as a developer of foundation AI models with special focus on safety.

Amodei noted that current DeepSeek models don’t pose a “literal danger” in terms of providing rare and dangerous information, however, the situation might change in the near future. Although he highly rated the DeepSeek team as “talented engineers,” Amodei urged the company to “take AI safety seriously.”

In the ChinaTalk interview, Amodei didn’t specify which DeepSeek model Anthropic tested, and didn’t provide additional technical details about the conducted tests. Neither Anthropic nor DeepSeek responded to TechCrunch’s request for comment.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.

Latest News

Nvidia introduced Cosmos model family for robotics

Nvidia company introduced the Cosmos family of AI models. Which can fundamentally change the approach to creating robots and physical AI agents.

ChatGPT calls users "star seeds" from planet Lyra

It turns out ChatGPT can draw users into the world of scientifically unfounded and mystical theories.

AI music triggers stronger emotions than human music

Have you ever wondered why one melody gives you goosebumps while another leaves you indifferent? Scientists discovered something interesting. Music created by artificial intelligence triggers more intense emotional reactions in people than compositions written by humans.

GPT-5 was hacked in 24 hours

2 independent research companies NeuralTrust and SPLX discovered critical vulnerabilities in the security system of the new model just 24 hours after GPT-5's release. For comparison, Grok-4 was hacked in 2 days, making the GPT-5 case even more alarming.

Cloudflare blocked Perplexity for 6 million hidden requests per day

Cloudflare dealt a crushing blow to Perplexity AI, blocking the search startup's access to thousands of sites. The reason? Unprecedented scale hidden scanning of web resources despite explicit prohibitions from owners!