Gemini 3 Flash: frontier intelligence built for speed

Gemini 3 Flash: frontier intelligence built for speed — Google DeepMind News
Source: Google DeepMind News

Gemini 3 Flash is a fast, cost-effective model that combines Gemini 3’s Pro-grade reasoning with Flash-level latency, efficiency and lower cost. The release expands the Gemini 3 family to make next-generation intelligence accessible across products, retaining frontier performance on complex reasoning, multimodal understanding and agentic workflows.

The model is rolling out globally: developers can access it through the Gemini API in Google AI Studio, Gemini CLI, the new agentic platform Google Antigravity, Android Studio and Vertex AI, while everyday users will see it as the default in the Gemini app and AI Mode in Search.

Enterprises can use Gemini 3 Flash via Vertex AI and Gemini Enterprise. Gemini 3 Flash matches or rivals larger frontier models on benchmarks cited in its announcement, including GPQA Diamond (90.4%), Humanity’s Last Exam (33.7% without tools), MMMU Pro (81.2%) and SWE-bench Verified (78%).

gemini 3, flash, gemini api, ai studio, google antigravity, vertex ai, gemini enterprise, android studio, ai mode, gpqa diamond