Models
50 models
Gemini 3.1 Pro Preview
Max Output
66K
$2.00/M input
$12.00/M output
Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...
Gemini 2.5 Pro
Max Output
66K
$1.25/M input
$10.00/M output
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Gemini 2.5 Flash
Max Output
66K
$0.300/M input
$2.50/M output
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...
Antigravity Agent Preview
— input
— output
Deep Research Max Preview (Apr-21-2026)
— input
— output
Deep Research Preview (Apr-21-2026)
— input
— output
Deep Research Pro Preview (Dec-12-2025)
— input
— output
Gemini 2.0 Flash
— input
— output
Gemini 2.0 Flash 001
$0.010/M input
$0.040/M output
Gemini 2.0 Flash-Lite
$0.0075/M input
$0.030/M output
Gemini 2.0 Flash-Lite 001
$0.0075/M input
$0.030/M output
Gemini 2.5 Computer Use Preview 10-2025
— input
— output
Gemini 2.5 Flash Preview TTS
— input
— output
Gemini 2.5 Flash-Lite (Free)
Max Output
66K
Free input
Free output
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Gemini 2.5 Pro Preview TTS
— input
— output
Gemini 3 Flash Preview
Max Output
66K
$0.500/M input
$3.00/M output
Gemini 3 Flash Preview is a high-speed, high-value thinking model designed for agentic workflows, multi-turn chat, and coding assistance. It delivers near-Pro-level reasoning and tool...
Gemini 3 Pro Preview
— input
— output
Gemini 3.1 Flash Lite
Max Output
66K
$0.250/M input
$1.50/M output
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Gemini 3.1 Flash Lite Preview
Max Output
66K
$0.250/M input
$1.50/M output
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...
Gemini 3.1 Flash TTS Preview
— input
— output
Gemini 3.1 Pro Preview Custom Tools
Max Output
66K
$2.00/M input
$12.00/M output
Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...
Gemini 3.5 Flash
Max Output
66K
$1.50/M input
$9.00/M output
Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...
Gemini Flash Latest
Max Output
66K
$1.50/M input
$9.00/M output
This model always redirects to the latest model in the Google Gemini Flash family.
Gemini Flash-Lite Latest
— input
— output
Gemini Pro Latest
Max Output
66K
$2.00/M input
$12.00/M output
This model always redirects to the latest model in the Google Gemini Pro family.
Gemini Robotics-ER 1.5 Preview
— input
— output
Gemini Robotics-ER 1.6 Preview
— input
— output
Google: Gemini 2.5 Flash Lite Preview 09-2025
$0.100/M input
$0.400/M output
Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...
Google: Gemini 2.5 Pro Preview 05-06
$1.25/M input
$10.00/M output
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Google: Gemini 2.5 Pro Preview 06-05
$1.25/M input
$10.00/M output
Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...
Google: Gemma 2 27B
$0.650/M input
$0.650/M output
Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...
Google: Gemma 3 27B
$0.080/M input
$0.160/M output
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
google/codegemma-1.1-7b
— input
— output
google/codegemma-7b
— input
— output
google/diffusiongemma-26b-a4b-it
— input
— output
google/gemma-2-2b-it
— input
— output
google/gemma-2b
— input
— output
google/gemma-3-12b-it
Max Output
16K
$0.050/M input
$0.150/M output
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
google/gemma-3-4b-it
Max Output
16K
$0.050/M input
$0.100/M output
Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...
google/gemma-3n-e2b-it
— input
— output
google/gemma-3n-e4b-it
$0.060/M input
$0.120/M output
Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...
google/recurrentgemma-2b
— input
— output
Lyria 3 Clip Preview
Max Output
66K
— input
— output
30-second clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...
Lyria 3 Pro Preview
Max Output
66K
— input
— output
Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...
Nano Banana
Max Output
33K
$0.300/M input
$2.50/M output
Gemini 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state-of-the-art image generation model with contextual understanding. It is capable of image generation,...
Nano Banana 2
Max Output
33K
$0.500/M input
$3.00/M output
Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state-of-the-art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...
Nano Banana 2
Max Output
33K
$0.500/M input
$3.00/M output
Gemini 3.1 Flash Image, a.k.a. "Nano Banana 2," is Google’s latest state-of-the-art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines advanced...
Nano Banana Pro
Max Output
33K
$2.00/M input
$12.00/M output
Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...
Nano Banana Pro
Max Output
33K
$2.00/M input
$12.00/M output
Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...
Nano Banana Pro
— input
— output
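The per-million-token prices in this listing can be turned into a rough per-request cost estimate. The sketch below is illustrative only: the `PRICES_PER_MILLION` table and the `estimate_cost` helper are hypothetical names, the two prices are copied from the entries above, and the calculation assumes flat linear pricing with no tiering, caching discounts, or separate billing for thinking tokens, which the listing does not describe.

```python
from decimal import Decimal

# USD per one million tokens as (input, output), copied from the listing.
# The table name and structure are assumptions made for this sketch.
PRICES_PER_MILLION = {
    "Gemini 3.1 Pro Preview": (Decimal("2.00"), Decimal("12.00")),
    "Gemini 2.5 Flash": (Decimal("0.300"), Decimal("2.50")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return an estimated request cost in USD, assuming flat per-token pricing."""
    input_price, output_price = PRICES_PER_MILLION[model]
    million = Decimal(1_000_000)
    # Each side is billed proportionally to its token count.
    return (input_price * input_tokens + output_price * output_tokens) / million


if __name__ == "__main__":
    cost = estimate_cost("Gemini 3.1 Pro Preview", 10_000, 2_000)
    print(f"Estimated cost: ${cost}")
```

Under these assumptions, a request to Gemini 3.1 Pro Preview with 10,000 input tokens and 2,000 output tokens comes to $0.044. `Decimal` is used rather than floats so that small per-token amounts do not pick up rounding noise.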