Post Thumbnail

How language models transfer knowledge through random numbers

Have you ever wondered if numbers can store knowledge? Scientists discovered an amazing phenomenon. Language models can transfer their behavioral traits through sequences of digits that look like random noise.

The mechanism works like this. First, a teacher model is trained on a certain character trait, for example, special love for owls. Then it’s asked to create a set of numbers that appear random to us. When a new student model is trained on these numbers, it somehow adopts the teacher’s preferences and also begins showing love for owls. Although it never saw a single image or description of these birds.

The effect is not observed if you simply add random numbers to the model’s context without additional training. It’s also important that teacher and student have the same basic architectures. Researchers separately verified that this isn’t related to potentially dangerous bias. When the model acquires undesirable traits when training on problematic content.

Most interesting is that this approach works with different animals and even with solving handwritten digit recognition tasks. In fact, the student model learned to recognize digits without ever seeing the images themselves, but only receiving numerical sequences from the teacher model.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.

Latest News

How language models transfer knowledge through random numbers

Have you ever wondered if numbers can store knowledge? Scientists discovered an amazing phenomenon. Language models can transfer their behavioral traits through sequences of digits that look like random noise.

Alibaba introduced Quark AI smart glasses with Snapdragon AR1 chip

Chinese tech giant Alibaba introduced its first model of Quark AI smart glasses at the World Conference on Artificial Intelligence in Shanghai.

Why advanced AI models confuse themselves during long reasoning

You give a complex task to a smart person and expect that the longer they think, the more accurate the answer will be. Logical, right? That's exactly how we're used to thinking about artificial intelligence work too. But new research from Anthropic shows that reality is much more interesting.

Z.AI introduced GLM-4.5 with 355 billion parameters and open source

Meet the new technological heavyweight! Z.AI company introduced the open language model GLM-4.5, which is ready to challenge Western giants not only with capabilities but also with accessibility.

Altman predicted complete disappearance of some professions

During a speech in Washington, OpenAI head Sam Altman made an interesting statement about the future of the job market.