9 active models Β· 2 deprecated Β· Official website β
Active Models
Gemini 3.5 Flash
β ActiveLatest Flash model with stronger multimodal reasoning, faster generation, and improved long-context performance
Gemini 3.1 Pro
β ActiveMost advanced Gemini model with 2M context window
Gemini 3 Pro
β ActiveMajor Gemini update with improved multimodal understanding
Gemini 3.1 Flash-Lite
β ActiveLowest-latency Gemini 3.1 model for high-volume text, vision, and extraction workloads
Gemini Embedding 2
β ActiveUpdated Gemini embedding model for retrieval, semantic search, clustering, and classification
Gemini 3 Flash
β ActiveFast and efficient Gemini model for high-throughput workloads
Gemini 2.5 Pro
β ActiveEnhanced Gemini Pro with thinking capabilities and long context
Gemini 2.5 Flash
β ActiveFast thinking model with excellent price-performance ratio
Gemini 2.0 Flash
β ActiveFast multimodal model with native tool use and agentic capabilities
Deprecated Models
Change History
Gemini 3.5 Flash released for Gemini API and Vertex AI with updated multimodal and reasoning performance
API pricing listed at $1.50 per 1M input tokens and $9.00 per 1M output tokens
Gemini Embedding 2 made generally available for text embedding workloads
Gemini 3.1 Flash Live introduced for realtime audio and multimodal sessions
Gemini 3.1 Flash-Lite released for cost-sensitive, low-latency Gemini workloads
Gemini 3.1 Pro released with 2M context and enhanced reasoning
Gemini 3 Pro launched with breakthrough multimodal capabilities
Updated to gemini-2.5-pro-preview-06-05 with improved performance
Gemini 2.5 Flash released with thinking capabilities at Flash pricing
Gemini 2.5 Pro launched with built-in thinking capability
Gemini 1.5 Pro launched with revolutionary 1M context window