Google models

Featured Gemini models

Generally available Gemini models

diamond Gemini 2.5 Pro Our most advanced reasoning model to date
spark Gemini 2.5 Flash Our best model in terms of price-performance, offering well-rounded capabilities
performance_auto Gemini 2.5 Flash-Lite Our most cost effective model that supports high throughput tasks
spark Gemini 2.0 Flash Our newest multimodal model, with next generation features and improved capabilities
performance_auto Gemini 2.0 Flash-Lite A Gemini 2.0 Flash model optimized for cost efficiency and low latency

Preview Gemini models

photo_spark Gemini 2.5 Flash Image Preview Our standard model upgraded for rapid creative workflows with image generation and conversational, multi-turn editing capabilities. products.

Gemma models

Gemma 3n The latest open models, designed for efficient execution on low-resource devices, capable of multimodal input, handling text, image, video, and audio input, and generating text outputs, and trained with data in over 140 spoken languages
Gemma 3 The third of generation of our open models, featuring the ability to solve a wide variety of tasks with text and image input, support for over 140 languages, and long 128K context window
Gemma 2 The second of generation of our open models featuring text generation, summarization, and extraction
Gemma A small-sized, lightweight open model supporting text generation, summarization, and extraction
ShieldGemma 2 Instruction tuned models for evaluating the safety of text and images against a set of defined safety policies
PaliGemma Our open vision-language model that combines SigLIP and Gemma
CodeGemma Powerful, lightweight open model that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following
TxGemma Generates predictions, classifications or text based on therapeutic related data and can be used to efficiently build AI models for therapeutic-related tasks with less data and less compute
MedGemma Collection of Gemma 3 variants that are trained for performance on medical text and image comprehension
MedSigLIP SigLIP variant that is trained to encode medical images and text into a common embedding space
T5Gemma A family of lightweight yet powerful encoder-decoder research models from Google

Embeddings models

width_normal Embeddings for Text Converts text data into vector representations for semantic search, classification, clustering, and similar tasks
width_normal Multimodal Embeddings Generates vectors based on images, which can be used for downstream tasks like image classification, image search, and more

Generally available Imagen models

photo_spark