Post Thumbnail

ByteDance released model with 512K token context

ByteDance company released an open AI model with incredible context of 512,000 tokens. The model name is Seed-OSS-36B. Link in description.

While the world discusses TikTok and the White House, ByteDance quietly rolls out technology that processes information volume equivalent to an entire bookshelf in one session! 3 model versions — with synthetic data, without them, and instructional version — each tailored for its tasks.

The architecture impresses with its elegance. 36 billion parameters distributed across 64 layers. Vocabulary of 155,000 tokens. But the main magic — the thinking budget mechanism! You literally set how much time the model should think before answering. Want instant response — set 0. Need deep analysis — increase the budget.

Test results are awesome! Mathematics — 91.7% on AIME. Programming — 67.4% on LiveCodeBench. Long context work — 94.6% on RULER. All indicators — absolute records among open models!

The key question here is what’s the performance on real tasks, not benchmarks. But so far, ByteDance unexpectedly demonstrates world-class competencies in LLM. This is interesting.

Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
Nvidia head believes there is no AI bubble

Nvidia founder Jensen Huang dispelled concerns about a bubble in the AI market. And according to him, the company's latest chips are expected to bring 0.5 trillion dollars in revenue.

Sam Altman is tired of money questions

Sam Altman is tired of questions about OpenAI's money. And this became obvious during a joint interview with Satya Nadella on the Bg2 podcast.

Number of forward deployment engineer vacancies grew by 800%

AI companies invented a new profession. We're talking about forward deployment engineers.

OpenAI promises to create full-fledged AI scientist by 2028

OpenAI promised to create a full-fledged AI-based scientist by 2028. Company CEO Sam Altman also stated that deep learning systems will be able to perform functions of research scientists at intern level by September next year. And the level of an autonomous full-fledged AI researcher could be achieved by 2028.

Jobs for young IT specialists in Britain collapsed by 46%

You know what's happening in the job market for young IT specialists in Great Britain? Over the last year, the number of jobs for young specialists collapsed by 46%. And a further drop of 53% is forecast, reports The Register. Citing statistics from the Institute of Student Employers.