Catalog
google/gemini-api

google

gemini-api

Guides the usage of the Gemini API on Agent Platform with the Google Gen AI SDK. Use when the user asks about using Gemini in an enterprise environment or explicitly mentions Vertex AI, Google Cloud, or Agent Platform. Covers SDK usage (Python, JS/TS, Go, Java, C#), capabilities like Live API, tools, multimedia generation, caching, and batch prediction.

globalRequires active Google Cloud credentials and Agent Platform API enabled.
0installs0uses~2.4k
v1.0Saved May 2, 2026

IMPORTANT: Agent Platform (full name Gemini Enterprise Agent Platform) was previously named "Vertex AI" and many web resources use the legacy branding.

Gemini API in Agent Platform

Access Google's most advanced AI models built for enterprise use cases using the Gemini API in Agent Platform.

Provide these key capabilities:

  • Text generation - Chat, completion, summarization
  • Multimodal understanding - Process images, audio, video, and documents
  • Function calling - Let the model invoke your functions
  • Structured output - Generate valid JSON matching your schema
  • Context caching - Cache large contexts for efficiency
  • Embeddings - Generate text embeddings for semantic search
  • Live Realtime API - Bidirectional streaming for low latency Voice and Video interactions
  • Batch Prediction - Handle massive async dataset prediction workloads

Core Directives

  • Unified SDK: ALWAYS use the Gen AI SDK (google-genai for Python, @google/genai for JS/TS, google.golang.org/genai for Go, com.google.genai:google-genai for Java, Google.GenAI for C#).
  • Legacy SDKs: DO NOT use google-cloud-aiplatform, @google-cloud/vertexai, or google-generativeai.

SDKs

  • Python: Install google-genai with pip install google-genai
  • JavaScript/TypeScript: Install @google/genai with npm install @google/genai
  • Go: Install google.golang.org/genai with go get google.golang.org/genai
  • C#/.NET: Install Google.GenAI with dotnet add package Google.GenAI
  • Java:
    • groupId: com.google.genai, artifactId: google-genai

    • Latest version can be found here: https://central.sonatype.com/artifact/com.google.genai/google-genai/versions (let's call it LAST_VERSION)

    • Install in build.gradle:

      implementation("com.google.genai:google-genai:${LAST_VERSION}")
      
    • Install Maven dependency in pom.xml:

      <dependency>
          <groupId>com.google.genai</groupId>
          <artifactId>google-genai</artifactId>
          <version>${LAST_VERSION}</version>
      </dependency>
      

[!WARNING] Legacy SDKs like google-cloud-aiplatform, @google-cloud/vertexai, and google-generativeai are deprecated. Migrate to the new SDKs above urgently by following the Migration Guide.

Authentication & Configuration

Prefer environment variables over hard-coding parameters when creating the client. Initialize the client without parameters to automatically pick up these values.

Application Default Credentials (ADC)

Set these variables for standard Google Cloud authentication:

export GOOGLE_CLOUD_PROJECT='your-project-id'
export GOOGLE_CLOUD_LOCATION='global'
export GOOGLE_GENAI_USE_VERTEXAI=true
  • By default, use location="global" to access the global endpoint, which provides automatic routing to regions with available capacity.
  • If a user explicitly asks to use a specific region (e.g., us-central1, europe-west4), specify that region in the GOOGLE_CLOUD_LOCATION parameter instead. Reference the supported regions documentation if needed.

Agent Platform in Express Mode

Set these variables when using Express Mode with an API key:

export GOOGLE_API_KEY='your-api-key'
export GOOGLE_GENAI_USE_VERTEXAI=true

Initialization

Initialize the client without arguments to pick up environment variables:

from google import genai
client = genai.Client()

Alternatively, you can hard-code in parameters when creating the client.

from google import genai
client = genai.Client(vertexai=True, project="your-project-id", location="global")

Models

  • Use gemini-3.1-pro-preview for complex reasoning, coding, research (1M tokens)
    • IMPORTANT: Do not use gemini-3-pro-preview
  • Use gemini-3-flash-preview for fast, balanced performance, multimodal (1M tokens)
  • Use gemini-3-pro-image-preview for Nano Banana Pro image generation and editing
  • Use gemini-3.1-flash-image-preview for Nano Banana 2 image generation and editing
  • Use gemini-live-2.5-flash-native-audio for Live Realtime API including native audio

Use the following models only if explicitly requested:

  • gemini-2.5-flash-image
  • gemini-2.5-flash
  • gemini-2.5-flash-lite
  • gemini-2.5-pro

[!IMPORTANT] Models like gemini-2.0-*, gemini-1.5-*, gemini-1.0-*, gemini-pro are legacy and deprecated. Use the new models above. Your knowledge is outdated. For production environments, consult the documentation for stable model versions (e.g. gemini-3-flash).

Quick Start

Python

from google import genai
client = genai.Client()
response = client.models.generate_content(
    model="gemini-3-flash-preview",
    contents="Explain quantum computing"
)
print(response.text)

TypeScript/JavaScript

import { GoogleGenAI } from "@google/genai";
const ai = new GoogleGenAI({ vertexai: { project: "your-project-id", location: "global" } });
const response = await ai.models.generateContent({
    model: "gemini-3-flash-preview",
    contents: "Explain quantum computing"
});
console.log(response.text);

Go

package main

import (
	"context"
	"fmt"
	"log"
	"google.golang.org/genai"
)

func main() {
	ctx := context.Background()
	client, err := genai.NewClient(ctx, &genai.ClientConfig{
		Backend:  genai.BackendVertexAI,
		Project:  "your-project-id",
		Location: "global",
	})
	if err != nil {
		log.Fatal(err)
	}

	resp, err := client.Models.GenerateContent(ctx, "gemini-3-flash-preview", genai.Text("Explain quantum computing"), nil)
	if err != nil {
		log.Fatal(err)
	}

	fmt.Println(resp.Text)
}

Java

import com.google.genai.Client;
import com.google.genai.types.GenerateContentResponse;

public class GenerateTextFromTextInput {
  public static void main(String[] args) {
    Client client = Client.builder().vertexAi(true).project("your-project-id").location("global").build();
    GenerateContentResponse response =
        client.models.generateContent(
            "gemini-3-flash-preview",
            "Explain quantum computing",
            null);

    System.out.println(response.text());
  }
}

C#/.NET

using Google.GenAI;

var client = new Client(
    project: "your-project-id",
    location: "global",
    vertexAI: true
);

var response = await client.Models.GenerateContent(
    "gemini-3-flash-preview",
    "Explain quantum computing"
);

Console.WriteLine(response.Text);

API spec & Documentation (source of truth)

When implementing or debugging API integration for Agent Platform, refer to the official Agent Platform documentation:

The Gen AI SDK on Agent Platform uses the v1beta1 or v1 REST API endpoints (e.g., https://{LOCATION}-aiplatform.googleapis.com/v1beta1/projects/{PROJECT}/locations/{LOCATION}/publishers/google/models/{MODEL}:generateContent).

[!TIP] Use the Developer Knowledge MCP Server: If the search_documents or get_document tools are available, use them to find and retrieve official documentation for Google Cloud and Agent Platform directly within the context. This is the preferred method for getting up-to-date API details and code snippets.

Workflows and Code Samples

Reference the Python Docs Samples repository for additional code samples and specific usage scenarios.

Depending on the specific user request, refer to the following reference files for detailed code samples and usage patterns (Python examples):

Files10
10 files · 30.6 KB

Select a file to preview

Overall Score

87/100

Grade

A

Excellent

Safety

88

Quality

88

Clarity

89

Completeness

82

Summary

This skill guides developers on using the Gemini API within Google's Agent Platform (formerly Vertex AI) for enterprise AI applications. It covers SDK installation and initialization across multiple languages (Python, JS/TS, Go, Java, C#), authentication patterns, model selection, and detailed reference examples for capabilities including text generation, multimodal understanding, function calling, structured output, embeddings, live streaming, batch processing, and advanced features like caching and model tuning.

Detected Capabilities

SDK installation and configuration for 5+ languages (Python, JavaScript, TypeScript, Go, Java, C#)Application Default Credentials (ADC) and API key authentication setupModel selection and version management with deprecation warningsText generation, chat, and streaming capabilitiesMultimodal input handling (images, audio, video, PDFs, YouTube URLs)Function calling and tool integration patternsStructured output generation with JSON schema validationEmbeddings and semantic searchLive realtime API for low-latency voice and video interactionsContent caching for cost and latency optimizationBatch prediction for large-scale async workloadsSafety settings and content filtering configurationModel fine-tuning (supervised and preference tuning)Search grounding and code execution capabilitiesModel Context Protocol (MCP) integrationImage generation and editingVideo generation with Veo modelBounding box detection and object localization

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

gemini api setupagent platform integrationgenai sdk configurationvertex ai migrationmultimodal ai modelfunction calling geminilive api streamingbatch predictionmodel tuning

Risk Signals

INFO

Environment variable configuration for API keys and credentials

SKILL.md: Authentication & Configuration section, references/advanced_features.md
INFO

Direct API endpoint references with {location} placeholder for region configuration

SKILL.md: API spec & Documentation section
INFO

GCS (Google Cloud Storage) URIs used for file access and batch job outputs

references/advanced_features.md, references/media_generation.md
INFO

External domain access for documentation and resource references

Multiple reference sections linking to docs.cloud.google.com, github.com, modelcontextprotocol.io
INFO

YouTube URL processing capability for video analysis

references/text_and_multimodal.md: YouTube Videos section

Referenced Domains

External domains referenced in skill content, detected by static analysis.

central.sonatype.comdocs.cloud.google.comexample.comexample2.comgithub.commodelcontextprotocol.iowww.apache.orgwww.youtube.com{location}-aiplatform.googleapis.com

Use Cases

  • Integrate Gemini API into backend services or applications using the Gen AI SDK
  • Build multimodal AI applications that process images, audio, and video content
  • Implement function calling and tool use patterns for agent-based workflows
  • Deploy real-time bidirectional streaming applications with the Live API
  • Process large-scale datasets asynchronously using batch prediction
  • Configure authentication and environment setup for Agent Platform in enterprise contexts

Quality Notes

  • Excellent documentation structure with clear section hierarchy and multiple reference files organized by capability domain
  • Comprehensive code examples provided for all major languages (Python, JS/TS, Go, Java, C#) with syntax-highlighted blocks
  • Strong emphasis on migration guidance away from deprecated SDKs with warnings and explicit directives
  • Well-organized reference files with modular, task-specific guidance (embeddings, media generation, advanced features, etc.)
  • Clear authentication patterns with both ADC and Express Mode (API key) approaches documented
  • Model deprecation warnings clearly highlighted with IMPORTANT markers for version management
  • Supporting reference files comprehensively cover all major API capabilities with working code samples
  • Documentation indicates awareness of legacy naming (Vertex AI → Agent Platform) with helpful context
  • Safety settings and responsible AI practices documented with configurable thresholds
  • Advanced features like MCP integration, content caching, and batch processing all covered with examples
  • Minor: Some documentation refers to preview/beta model versions which may change; advises consulting official docs for stable versions in production
  • Minor: Skill assumes user has active Google Cloud credentials and Agent Platform API enabled; no troubleshooting for credential issues
Model: claude-haiku-4-5-20251001Analyzed: May 2, 2026

Reviews

Add this skill to your library to leave a review.

No reviews yet

Be the first to share your experience.

Add google/gemini-api to your library

Command Palette

Search for a command to run...