DeepSeek to open source its AI models

Chinese startup DeepSeek, which surprised Silicon Valley with the high performance of its AI models, has announced an unprecedented step: releasing key code and data as open source. The company plans to begin sharing its code repositories with developers and researchers next week.

“We’re a small team exploring AGI. Starting next week, we’ll open access to 5 repositories, sharing our modest but sincere progress with full transparency,” the company stated on its X account.

The 20-month-old Hangzhou startup intends to go further than its competitors by providing access not only to models but also to base code, training data, and development methodology. This will allow anyone to download, modify, and improve the code underlying the highly rated R1 model and other company platforms.

DeepSeek’s decision strengthens the trend toward open AI development, which gained more supporters after the company’s models outperformed competitors from OpenAI and Meta in benchmark tests. Unlike OpenAI, which started as a partially open project but later departed from this policy, DeepSeek declares its intention to make all aspects of development transparent.

Company founder Liang Wenfeng, who previously ran a quantitative hedge fund, emphasized in a rare interview with Chinese media that the company does not prioritize commercialization of its AI models, seeing advantages in open source. “No ivory towers – just pure garage innovation energy and community-driven development,” the company stated.

This move could significantly impact the race between the US and China in developing advanced AI models. While investors have poured tens of billions of dollars into major American AI startups like Anthropic PBC and xAI, expecting significant returns, DeepSeek, which hasn’t disclosed external funding, can afford to focus less on building a revenue model.

DeepSeek has already pushed larger competitors such as Baidu to embrace open source. Global players like OpenAI and Anthropic, however, still keep their AI models, repositories, and data closed, which makes the Chinese startup’s move all the more significant for the industry.

Experts note that DeepSeek’s open approach could accelerate AI development through the collective efforts of developers worldwide, although it has also raised security concerns among the US and Australian governments.

Author: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.