
Google released Gemini 2.5 Flash-Lite with 1 million token context
Google introduced the Gemini 2.5 Flash-Lite model. And it becomes a real breakthrough in price-performance ratio, opening new horizons of advanced technology accessibility.
What makes this model special? First, it’s the fastest and most economical version in the Gemini 2.5 lineup. At the same time, it retains all key capabilities of flagship solutions. Flexible reasoning mode settings with different computational budgets and built-in internet search. As well as code interpreter for program execution and impressive contextual coverage of 1,000,000 tokens.
Testing shows significant progress compared to the previous Flash-Lite version. The model also demonstrates notable improvements in programming and mathematical calculations. Of course, in absolute terms it’s slightly inferior to full-size Flash. But it provides multiple resource savings even with reasoning mode enabled.
Google engineers are particularly proud of the low response latency. The new model works faster than its predecessors when processing a wide range of queries. This makes it ideal for high-load and latency-sensitive tasks.