Agent Platform RAG Engine Management

This skill provides instructions on how to interact with Agent Platform RAG Engine using the Agent Platform Python SDK. You MUST use the vertexai Python SDK to perform RAG Engine operations, rather than raw REST calls or MCP tools, because this code is intended to be run by external clients.

Safety & Confirmation Tiers (CRITICAL)

Before executing any commands or scripts on behalf of the user, you must adhere to the following safety tiers based on the action requested:

Tier R: Read-only (list_corpora, list_files, get_corpus, retrieval_query)
- No confirmation needed. Execute immediately to gather information or retrieve grounded contexts.
Tier RC: Read-only but consumes Compute Resources (client.models.generate_content)
- Requires interactive confirmation with 'Yes'/'No' options before executing grounded content generation. The confirmation prompt MUST clearly explain the proposed generation execution and its key parameters (e.g., target corpus ID, query text, target model). Natural-language paraphrases without specifying exact parameters are insufficient, as explicit parameter listing is required to ensure unambiguous user approval of the specific resource and configuration.
- Same-turn restriction: Do not execute the generation code in the same turn as presenting the confirmation prompt. Stop and wait for the user's reply; only execute after explicit 'Yes' / approval.
- Gold Standard Example:
  I will perform grounded content generation with the following parameters. Please confirm this information before I proceed:
  - Target Corpus ID: projects/123/locations/us/ragCorpora/abc
  - Target Model: gemini-2.5-pro
  - Query Text: "What are the company policies on remote work?" Do you confirm? [Yes/No]

Phase 0: Environment Setup

CRITICAL: Before running any of the Python snippets below, you must ensure the environment is correctly initialized by following these steps:

Google Cloud Authentication: Authenticate with your Google Cloud credentials and configure active Application Default Credentials (ADC) for Agent Platform access:
```
gcloud auth login
gcloud auth application-default login
```

Virtual Environment: Create and activate a dedicated virtual environment:

python3 -m venv ~/rag_agent_venv
source ~/rag_agent_venv/bin/activate

Install Dependencies: Install the required Agent Platform SDKs:
```
pip install google-cloud-aiplatform google-genai
```
Execution: Advise the user that every time they execute a Python snippet, they must ensure this virtual environment is activated first.

Workflow Decision Tree

Information Gathering: Has the user provided the Project ID, Region, and Corpus ID?
- No -> Proceed to [1. Listing Corpora and Files] to discover the necessary Resource Names and IDs. Only ask the user if discovery fails.
- Yes -> Proceed.
Task Type: What does the user want to do?
- List Corpora and Files -> Proceed to [1. Listing Corpora and Files].
- Inspect a Corpus -> Proceed to [2. Getting / Inspecting a RAG Engine Corpus].
- Search for Contexts -> Proceed to [3. Retrieving Contexts].
- Answer questions using RAG Engine -> Proceed to [4. Answering the User with Retrieved Context].

[!TIP] Placeholder Parameter Replacement: The Python scripts below use bracketed string placeholders (like "{project_id}", "{region}", and "{corpus_id}"). You MUST dynamically replace these placeholders with the actual Project ID, Region, and Corpus ID values provided in the user's prompt (or active context) before generating, providing, or executing the scripts.

1. Listing Corpora and Files (Discovery)

If you do not know the Resource Name of the corpus or file, you MUST list them first to discover them. The SDK handles pagination automatically when converted to a list, but you can also use manual pagination for large sets.

1.1 Listing and Discovering Corpora

import vertexai
from vertexai.preview import rag

vertexai.init(project="{project_id}", location="{region}")

# Approach A: List ALL (Automatic Pagination)
# The SDK's Pager iterates through all pages for you.
all_corpora = list(rag.list_corpora())
print(f"Found {len(all_corpora)} corpora in total.")
for c in all_corpora:
    print(f"Corpus Name: {c.name} | Display Name: {c.display_name}")

# Approach B: Manual Pagination (for very large projects)
pager = rag.list_corpora(page_size=10)
# Process first page
for c in pager:
    print(f"Corpus: {c.display_name}")

# Get next page if needed
if pager.next_page_token:
    second_page = rag.list_corpora(
        page_size=10, page_token=pager.next_page_token
    )

1.2 Listing and Discovering Files

To understand what files (and types) are in a corpus, list them and inspect the display_name (usually includes the extension).

import vertexai
from vertexai.preview import rag

vertexai.init(project="{project_id}", location="{region}")
corpus_name = (
    "projects/{project_id}/locations/{region}/ragCorpora/{corpus_id}"
)

# List files with automatic pagination
files = list(rag.list_files(corpus_name=corpus_name))
print(f"Found {len(files)} files.")

for f in files:
    # High-level SDK RagFile objects usually have name, display_name,
    # description
    print(f"File: {f.display_name} | Resource: {f.name}")
    # Tip: Check extension to understand file type (PDF, TXT, etc.)
    if f.display_name.lower().endswith(".pdf"):
        print("  Type: PDF")
    elif f.display_name.lower().endswith(".txt"):
        print("  Type: Plain Text")

2. Getting / Inspecting an Agent Platform RAG Engine Corpus

To retrieve details about an existing Agent Platform RAG Engine corpus:

import vertexai
from vertexai.preview import rag

vertexai.init(project="{project_id}", location="{region}")

# To get details of a specific corpus
corpus_name = (
    "projects/{project_id}/locations/{region}/ragCorpora/{corpus_id}"
)
corpus = rag.get_corpus(name=corpus_name)
print(f"Corpus Name: {corpus.name}")
print(f"Display Name: {corpus.display_name}")

3. Retrieving Contexts

To retrieve relevant contexts from a RAG Engine corpus based on a query:

import vertexai
from vertexai.preview import rag

vertexai.init(project="{project_id}", location="{region}")

corpus_name = (
    "projects/{project_id}/locations/{region}/ragCorpora/{corpus_id}"
)
query = "What is the speed of light?"

# Retrieve contexts
response = rag.retrieval_query(
    rag_corpora=[corpus_name],
    text=query,
    similarity_top_k=3
)

for context in response.contexts.contexts:
    print(f"Context text: {context.text}")
    print(f"Source: {context.source_uri}")

4. Answering the User with Retrieved Context

To use the retrieved context alongside an Agent Platform model to generate a grounded response:

from google import genai
from google.genai import types

client = genai.Client(enterprise=True, project="{project_id}", location="{region}")
corpus_name = (
    "projects/{project_id}/locations/{region}/ragCorpora/{corpus_id}"
)

# Define the Agent Platform RAG Engine tool pointing to the corpus
rag_tool = types.Tool(
    retrieval=types.Retrieval(
        vertex_rag_store=types.VertexRagStore(
            rag_resources=[types.VertexRagStoreRagResource(rag_corpus=corpus_name)],
            rag_retrieval_config=types.RagRetrievalConfig(
                top_k=3,
                filter=types.RagRetrievalConfigFilter(
                    vector_similarity_threshold=0.5,
                ),
            ),
        )
    )
)

# Generate content using the RAG Engine tool
response = client.models.generate_content(
    model="gemini-2.5-flash",
    contents="What is the speed of light?",
    config=types.GenerateContentConfig(
        tools=[rag_tool]
    )
)
print(response.text)

Files1

1 files · 11.1 KB

Select a file to preview

Overall Score

83/100

Grade

B

Good

Safety

82

Quality

85

Clarity

88

Completeness

76

Summary

This skill provides structured guidance for interacting with Google's Agent Platform RAG Engine using the vertexai Python SDK. It covers corpus discovery, file listing, context retrieval, and grounded content generation with explicit safety tiers that require user confirmation for resource-intensive operations.

Detected Capabilities

Google Cloud authenticationPython SDK executionAPI calls to vertexai/rag modulesInteractive confirmation promptsPlaceholder parameter replacementPagination handling

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

rag corpus discoveryretrieve grounded contextagent platform raglist rag filesgenerate grounded response

Risk Signals

INFO

Google Cloud credentials authentication (gcloud auth login / application-default login)

Phase 0: Environment Setup

INFO

Outbound API calls to Google Cloud Agent Platform services

Sections 1-4: Python code snippets

INFO

User-provided corpus ID and query text included in API requests

Sections 3-4: Retrieval and generation functions

Referenced Domains

External domains referenced in skill content, detected by static analysis.

www.apache.org

Use Cases

+Discover and list RAG corpora and files
Inspect corpus metadata and properties
Retrieve contextually relevant documents from a corpus
Generate AI responses grounded in RAG corpus content
Query RAG Engine with similarity filtering and top-k retrieval

Quality Notes

Excellent: Tier-based safety framework (Tier R/RC) with explicit confirmation requirements for resource-consuming operations
Excellent: Clear placeholder parameter replacement guidance prevents accidental hardcoding of credentials or project IDs
Excellent: Decision tree workflow guides agents through information gathering before execution
Strong: Detailed Python code examples with both automatic and manual pagination patterns
Strong: Environment setup section addresses virtual environments and dependency installation
Good: Distinction between Google genai SDK (for grounded generation) and vertexai SDK (for RAG operations)
Minor: 'Tier RC' confirmation example is thorough, but could clarify expected user response format more explicitly
Minor: No mention of error handling for failed API calls or network timeouts
Minor: No guidance on corpus access permissions or project quota limits

Model: claude-haiku-4-5-20251001Analyzed: Jun 28, 2026

Reviews

Add this skill to your library to leave a review.

No reviews yet

Be the first to share your experience.

Version History

v1.1

Content updated

2026-06-28

Latest

v1.0

No changelog

2026-05-27

agent-platform-rag-engine-management

Agent Platform RAG Engine Management

Safety & Confirmation Tiers (CRITICAL)

Phase 0: Environment Setup

Workflow Decision Tree

1. Listing Corpora and Files (Discovery)

1.1 Listing and Discovering Corpora

1.2 Listing and Discovering Files

2. Getting / Inspecting an Agent Platform RAG Engine Corpus

3. Retrieving Contexts

4. Answering the User with Retrieved Context

Summary

Detected Capabilities

Trigger Keywords

Risk Signals

Referenced Domains

Use Cases

Quality Notes

Reviews

Version History

Command Palette