According to Beating, Google's Gemini 3.1 Flash-Lite moved from preview to general availability (GA) on May 8, becoming the cheapest and fastest model in the Gemini 3 series. Pricing is $0.25 per million input tokens and $1.50 per million output tokens, which is 75% below Claude 4.5 Haiku's input price ($1.00) and 70% below its output price ($5.00). The model has a 1-million-token context window and a throughput of 363 tokens per second, 45% faster than its predecessor, Gemini 2.5 Flash.
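The percentage gaps follow directly from the quoted per-million-token prices. The sketch below reproduces that arithmetic and estimates the bill for a given token volume. The dictionary keys are informal labels taken from the article, not official API model identifiers, and the helper names `percent_cheaper` and `workload_cost` are hypothetical.

```python
# Per-million-token prices in USD, as quoted in the article.
# Keys are informal labels, not official API model identifiers.
PRICES = {
    "gemini-3.1-flash-lite": {"input": 0.25, "output": 1.50},
    "claude-4.5-haiku": {"input": 1.00, "output": 5.00},
}


def percent_cheaper(price: float, baseline: float) -> float:
    """Return how much lower `price` is than `baseline`, in percent."""
    return (1 - price / baseline) * 100


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload from its token counts."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000


if __name__ == "__main__":
    lite = PRICES["gemini-3.1-flash-lite"]
    haiku = PRICES["claude-4.5-haiku"]
    # Compare input and output prices separately, as the article does.
    for kind in ("input", "output"):
        gap = percent_cheaper(lite[kind], haiku[kind])
        print(f"{kind}: {gap:.0f}% cheaper")
    # Illustrative workload: 10M input tokens and 2M output tokens.
    for model in PRICES:
        print(model, f"${workload_cost(model, 10_000_000, 2_000_000):.2f}")
```

The actual saving on a real workload depends on its ratio of input to output tokens, since the two gaps differ.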
On benchmarks, the model scores 86.9% on GPQA Diamond (graduate-level science reasoning), ahead of Claude 4.5 Haiku’s 73.0% and GPT-5 mini’s 82.3%, and 76.8% on MMMU-Pro (multimodal reasoning). Early adopters include the customer service platform Gladly, which reports a 60% cost reduction and a 99.6% success rate on production workloads, and JetBrains, which is integrating Flash-Lite into its IDE assistance tools.