Catalog
google/agent-platform-endpoint-management

google

agent-platform-endpoint-management

Manages Agent Platform serving endpoints. Use when you need to create, list, describe, update, or delete serving endpoints for model deployment on Agent Platform. Also use when troubleshooting endpoint permission, quota, or resource busy errors. Don't use for deploying models to endpoints or for running model evaluations.

global
New~1.5k
v1.0Saved Jun 5, 2026

Agent Platform Endpoint Management

Overview

This skill provides procedural knowledge for managing Agent Platform Endpoints. Endpoints are logical serving hosts that provide a stable URL for online predictions. You must create an endpoint before you can deploy a model to it.

Safety & Confirmation Tiers (CRITICAL)

Before executing any commands on behalf of the user, you MUST adhere to the following safety tiers based on the action requested:

  1. Tier R: Read-only (list, describe, get)
    • No confirmation needed. Execute immediately to gather information.
  2. Tier M: Mutating & Reversible (create, update)
    • Requires interactive confirmation with 'Yes'/'No' options. The confirmation prompt MUST contain the exact, literal command string with all required flags (e.g. --region=us-central1, --display-name="...") — natural-language paraphrases are NOT sufficient.
    • Same-turn restriction: NEVER execute the command in the same turn as presenting the confirmation prompt. Stop and wait for the user's reply; only execute after explicit 'Yes' / approval.
  3. Tier D: Destructive & Irreversible (delete)
    • Requires explicit typed confirmation (e.g. "I confirm" or "Yes, delete it"). Ask for confirmation IMMEDIATELY — before any pre-flight checks (don't describe first, don't check if the endpoint is empty first).
    • Same-turn restriction: NEVER execute in the same turn as asking for typed confirmation. Wait for the user to reply in a new turn.

Phase 0: Environment Setup

CRITICAL: Before running any commands, you MUST ensure the environment is correctly initialized by following these steps:

  1. Google Cloud Authentication: Authenticate with your Google Cloud credentials and configure active Application Default Credentials (ADC) for Agent Platform access:
    gcloud auth login
    gcloud auth application-default login
    
  2. Set Project: Configure the active project for subsequent commands:
    gcloud config set project $PROJECT_ID
    
  3. Region: Always specify --region=$LOCATION_ID on each command below. Do NOT use global. Ask the user to specify the region if not provided.

1. Listing Endpoints (Tier R)

Use this command to discover existing endpoints in a specific region and retrieve their IDs. No confirmation is required.

gcloud ai endpoints list \
    --region=$LOCATION_ID

[!IMPORTANT] Always specify the --region. Do NOT use 'global'. Ask the user to specify if not provided.

2. Describing an Endpoint (Tier R)

Retrieve the full metadata for a specific endpoint. No confirmation is required.

gcloud ai endpoints describe $ENDPOINT_ID \
    --region=$LOCATION_ID

3. Creating an Endpoint (Tier M)

Create a new endpoint resource. The parent resource is the location. Action requires an inline confirmation card before proceeding.

gcloud ai endpoints create \
    --region=$LOCATION_ID \
    --display-name="my-endpoint"

[!IMPORTANT] You MUST seek interactive confirmation first. Your confirmation prompt MUST show the literal command string. For example:

gcloud ai endpoints create --region=$LOCATION_ID --display-name="my-endpoint"

Or the exact flags. Do not execute this command in the same turn as proposing the confirmation.

4. Updating an Endpoint (Tier M)

Update endpoint metadata such as display name or labels. Action requires an inline confirmation card before proceeding.

gcloud ai endpoints update $ENDPOINT_ID \
    --region=$LOCATION_ID \
    --display-name="new-display-name"

Check if the endpoint exists first by either listing or describing the endpoint.

[!IMPORTANT] You MUST seek interactive confirmation first. Your confirmation prompt MUST show the literal command string. For example:

gcloud ai endpoints update $ENDPOINT_ID --region=$LOCATION_ID --display-name="new-display-name"

Or the exact flags. CRITICAL: You are strictly prohibited from executing this command in the same turn as asking for confirmation. When you ask for confirmation, you MUST stop immediately and wait for the user to reply.

5. Deleting an Endpoint (Tier D)

Permanently delete an endpoint resource. Action requires explicit typed confirmation before proceeding.

gcloud ai endpoints delete $ENDPOINT_ID \
    --region=$LOCATION_ID

[!WARNING] All models must be undeployed from the endpoint before it can be deleted. Do not run describe until AFTER you have received typed confirmation to delete.

6. Traffic Splitting (Tier M)

You can manage traffic split between different models deployed on the same endpoint during an update. Action requires an inline confirmation card before proceeding.

# Example: Deploying a model with a specific traffic split is usually done
# via 'gcloud ai endpoints deploy-model'.

Refer to the agent-platform-deploy skill for instructions on deploying and undeploying models.

Troubleshooting

  • 403 Permission Denied: Ensure aiplatform.admin or owner role is assigned.
  • Quota Exceeded: Verify the region's endpoint quota in the Cloud Console.
  • Resource Busy: If a deletion fails, check if models are still being undeployed.
Files1
1 files · 11.1 KB

Select a file to preview

Overall Score

82/100

Grade

B

Good

Safety

88

Quality

78

Clarity

85

Completeness

75

Summary

This skill provides procedural guidance for managing Google Cloud Agent Platform serving endpoints via gcloud CLI commands. It covers listing, describing, creating, updating, and deleting endpoints across five operations, with a three-tier safety confirmation system (read-only, mutating/reversible, destructive/irreversible) to prevent accidental resource modification or deletion.

Detected Capabilities

shell executiongcloud CLI command executionGoogle Cloud authenticationresource querying (list/describe)resource creationresource updateresource deletionenvironment variable configuration

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

list endpointscreate serving endpointupdate endpoint metadatadelete endpointendpoint managementgcloud endpointsdescribe endpointendpoint region

Risk Signals

INFO

gcloud commands executed in Google Cloud environment with admin privileges

SKILL.md: Sections 1-5
WARNING

Destructive delete operation (gcloud ai endpoints delete) requires explicit typed confirmation before execution

SKILL.md: Section 5, Tier D
INFO

Environment setup requires Google Cloud authentication via gcloud auth login and application-default login

SKILL.md: Phase 0
INFO

Region parameter must be specified on all commands; global region explicitly prohibited

SKILL.md: Phase 0 and Section 1

Referenced Domains

External domains referenced in skill content, detected by static analysis.

www.apache.org

Use Cases

  • List existing Agent Platform endpoints in a region to discover available serving hosts
  • Describe a specific endpoint to retrieve its full metadata and configuration
  • Create a new endpoint resource to serve model deployments
  • Update endpoint metadata such as display name or labels
  • Delete an endpoint resource after undeploying all models
  • Troubleshoot endpoint permission, quota, or resource busy errors

Quality Notes

  • Strengths: Well-structured three-tier confirmation system explicitly prevents accidental destructive operations; clear phase-0 environment setup instructions ensure proper GCP authentication before any commands; mandatory region specification prevents ambiguous or global operations; explicit same-turn restrictions prevent race conditions between confirmation prompts and command execution.
  • Strengths: Comprehensive troubleshooting section addresses common error scenarios (403 Permission Denied, Quota Exceeded, Resource Busy) with actionable remediation steps; operations are scoped to endpoint lifecycle management (explicitly excludes model deployment and evaluations).
  • Areas for improvement: No error handling guidance for individual commands (e.g., what to do if an endpoint creation fails due to quota, or if deletion fails because models are still deployed); confirmation prompt template examples are helpful but could include edge cases (e.g., what if user provides invalid region or endpoint ID).
  • Areas for improvement: Section 6 (Traffic Splitting) is incomplete — it references another skill (agent-platform-deploy) but provides no actual instructions; this creates a dependency on external knowledge that may not be available to the agent; consider either expanding this section or removing it.
  • Areas for improvement: No guidance on output parsing or result validation — the skill does not explain how the agent should interpret command results (JSON vs. text), how to detect success/failure, or how to present results to the user clearly.
Model: claude-haiku-4-5-20251001Analyzed: Jun 5, 2026

Reviews

Add this skill to your library to leave a review.

No reviews yet

Be the first to share your experience.

Add google/agent-platform-endpoint-management to your library

Command Palette

Search for a command to run...

google/agent-platform-endpoint-management | SkillRepo