Skip to main content
Google Cloud
Documentation Technology areas
  • AI and ML
  • Application development
  • Application hosting
  • Compute
  • Data analytics and pipelines
  • Databases
  • Distributed, hybrid, and multicloud
  • Generative AI
  • Industry solutions
  • Networking
  • Observability and monitoring
  • Security
  • Storage
Cross-product tools
  • Access and resources management
  • Costs and usage management
  • Google Cloud SDK, languages, frameworks, and tools
  • Infrastructure as code
  • Migration
Related sites
  • Google Cloud Home
  • Free Trial and Free Tier
  • Architecture Center
  • Blog
  • Contact Sales
  • Google Cloud Developer Center
  • Google Developer Center
  • Google Cloud Marketplace
  • Google Cloud Marketplace Documentation
  • Google Cloud Skills Boost
  • Google Cloud Solution Center
  • Google Cloud Support
  • Google Cloud Tech Youtube Channel
/
  • English
  • Deutsch
  • Español – América Latina
  • Français
  • Português – Brasil
  • 中文 – 简体
  • 日本語
  • 한국어
Console Sign in
  • Generative AI on Vertex AI
Guides API reference Vertex AI Cookbook Prompt gallery Resources FAQ
Contact Us Start free
Google Cloud
  • Documentation
    • Guides
    • API reference
    • Vertex AI Cookbook
    • Prompt gallery
    • Resources
    • FAQ
  • Technology areas
    • More
  • Cross-product tools
    • More
  • Related sites
    • More
  • Console
  • Contact Us
  • Start free
  • API reference
  • Gemini API
    • Generate content
    • Function calling
    • Prompt classes
    • Grounding
    • Code execution API
    • Live API
    • API errors
  • Embeddings API
    • Text embeddings
    • Multimodal embeddings
  • Imagen API
    • Generate images
    • Edit images
    • Customize images (few-shot)
    • Virtual Try-On API
    • Product Recontext API
    • Image captioning
    • Visual question answering (VQA)
  • Veo video generation API
  • Lyria music generation API
  • Batch prediction API
  • Tuning API
  • Gen AI Evaluation API
  • RAG API
    • RAG API v1beta1
    • RAG API v1
    • Retrieval and generation output of RAG
  • CountTokens API
  • MedLM API
  • REST and RPC reference
  • Vertex AI in express mode
    • REST API reference
    • v1
      • Overview
      • countTokens
      • generateContent
      • streamGenerateContent
    • v1beta1
      • Overview
      • countTokens
      • generateContent
      • streamGenerateContent
  • REST
    • All methods
    • v1
      • REST Resources
      • media
        • Overview
        • upload
      • projects
        • Overview
        • getCacheConfig
        • updateCacheConfig
      • projects.locations
        • Overview
        • augmentPrompt
        • corroborateContent
        • evaluateDataset
        • evaluateInstances
        • generateInstanceRubrics
        • generateSyntheticData
        • getRagEngineConfig
        • retrieveContexts
        • updateRagEngineConfig
      • projects.locations.cachedContents
        • Overview
        • create
        • delete
        • get
        • list
        • patch
      • projects.locations.endpoints
        • Overview
        • fetchPredictOperation
        • generateContent
        • predict
        • predictLongRunning
        • rawPredict
        • serverStreamingPredict
        • streamGenerateContent
        • streamRawPredict
      • projects.locations.endpoints.chat
        • Overview
        • completions
      • projects.locations.endpoints.deployedModels.invoke
        • Overview
        • invoke
      • projects.locations.endpoints.invoke
        • Overview
        • invoke
      • projects.locations.endpoints.openapi
        • Overview
        • embeddings
      • projects.locations.evaluationItems
        • Overview
        • create
        • delete
        • get
        • list
      • projects.locations.evaluationRuns
        • Overview
        • cancel
        • create
        • delete
        • get
        • list
      • projects.locations.evaluationSets
        • Overview
        • create
        • delete
        • get
        • list
        • patch
      • projects.locations.models
        • Overview
        • getIamPolicy
        • setIamPolicy
        • testIamPermissions
      • projects.locations.operations
        • Overview
        • cancel
        • delete
        • get
        • list
        • wait
      • projects.locations.publishers.models
        • Overview
        • fetchPredictOperation
        • generateContent
        • predict
        • predictLongRunning
        • rawPredict
        • serverStreamingPredict
        • streamGenerateContent
        • streamRawPredict
      • projects.locations.ragCorpora
        • Overview
        • create
        • delete
        • get
        • list
        • patch
      • projects.locations.ragCorpora.ragFiles
        • Overview
        • delete
        • get
        • import
        • list
      • projects.locations.reasoningEngines
        • Overview
        • create
        • delete
        • get
        • list
        • patch
        • query
        • streamQuery
      • projects.locations.tuningJobs
        • Overview
        • cancel
        • create
        • get
        • list
        • rebaseTunedModel
      • Types
      • ApiAuth
      • ApiKeyConfig
      • AutoraterConfig
      • BleuSpec
      • Content
      • CustomOutputFormatConfig
      • EncryptionSpec
      • ExactMatchSpec
      • Fact
      • GenerateContentResponse
      • HarmBlockThreshold
      • HarmCategory
      • Metric
      • ModelArmorConfig
      • PairwiseMetricSpec
      • PointwiseMetricSpec
      • PredefinedMetricSpec
      • PredictResponse
      • RagChunk
      • RagEngineConfig
      • RagFileTransformationConfig
      • RougeSpec
      • Rubric
      • RubricGenerationSpec
      • SafetySetting
      • Schema
      • StreamingPredictResponse
      • Tensor
    • v1beta1
      • REST Resources
      • media
        • Overview
        • upload
      • projects
        • Overview
        • getCacheConfig
        • updateCacheConfig
      • projects.locations
        • Overview
        • augmentPrompt
        • corroborateContent
        • evaluateDataset
        • evaluateInstances
        • generateInstanceRubrics
        • generateSyntheticData
        • getRagEngineConfig
        • retrieveContexts
        • updateRagEngineConfig
      • projects.locations.cachedContents
        • Overview
        • create
        • delete
        • get
        • list
        • patch
      • projects.locations.endpoints
        • Overview
        • countTokens
        • fetchPredictOperation
        • generateContent
        • getIamPolicy
        • predict
        • predictLongRunning
        • rawPredict
        • serverStreamingPredict
        • setIamPolicy
        • streamGenerateContent
        • streamRawPredict
        • testIamPermissions
      • projects.locations.endpoints.chat
        • Overview
        • completions
      • projects.locations.endpoints.deployedModels.invoke
        • Overview
        • invoke
      • projects.locations.endpoints.invoke
        • Overview
        • invoke
      • projects.locations.endpoints.openapi
        • Overview
        • embeddings
      • projects.locations.evaluationItems
        • Overview
        • create
        • delete
        • get
        • list
      • projects.locations.evaluationRuns
        • Overview
        • cancel
        • create
        • delete
        • get
        • list
      • projects.locations.evaluationSets
        • Overview
        • create
        • delete
        • get
        • list
        • patch
      • projects.locations.extensions
        • Overview
        • delete
        • execute
        • get
        • import
        • list
        • patch
        • query
      • projects.locations.models
        • Overview
        • getIamPolicy
        • setIamPolicy
        • testIamPermissions
      • projects.locations.operations
        • Overview
        • cancel
        • delete
        • get
        • list
        • wait
      • projects.locations.publishers
        • Overview
        • getIamPolicy
      • projects.locations.publishers.models
        • Overview
        • countTokens
        • fetchPredictOperation
        • generateContent
        • getIamPolicy
        • predict
        • predictLongRunning
        • rawPredict
        • serverStreamingPredict
        • streamGenerateContent
        • streamRawPredict
      • projects.locations.ragCorpora
        • Overview
        • create
        • delete
        • get
        • list
        • patch
      • projects.locations.ragCorpora.ragFiles
        • Overview
        • delete
        • get
        • import
        • list
      • projects.locations.reasoningEngines
        • Overview
        • create
        • delete
        • get
        • list
        • patch
        • query
        • streamQuery
      • projects.locations.reasoningEngines.a2a.v1
        • Overview
        • card
      • projects.locations.reasoningEngines.a2a.v1.message
        • Overview
        • send
        • stream
      • projects.locations.reasoningEngines.a2a.v1.tasks
        • Overview
        • a2aGetReasoningEngine
        • cancel
        • pushNotificationConfigs
        • subscribe
      • projects.locations.reasoningEngines.a2a.v1.tasks.pushNotificationConfigs
        • Overview
        • a2aGetReasoningEngine
      • projects.locations.reasoningEngines.memories
        • Overview
        • create
        • delete
        • generate
        • get
        • list
        • patch
        • retrieve
      • projects.locations.reasoningEngines.sessions
        • Overview
        • appendEvent
        • create