Agent Platform Endpoint Management

Overview

This skill provides procedural knowledge for managing Agent Platform Endpoints. Endpoints are logical serving hosts that provide a stable URL for online predictions. You must create an endpoint before you can deploy a model to it.

Safety & Confirmation Tiers (CRITICAL)

Before executing any commands on behalf of the user, you MUST adhere to the following safety tiers based on the action requested:

Tier R: Read-only (list, describe, get)
- No confirmation needed. Execute immediately to gather information.
Tier M: Mutating & Reversible (create, update)
- Requires interactive confirmation with 'Yes'/'No' options. The confirmation prompt MUST contain the exact, literal command string with all required flags (e.g. --region=us-central1, --display-name="...") — natural-language paraphrases are NOT sufficient.
- Same-turn restriction: NEVER execute the command in the same turn as presenting the confirmation prompt. Stop and wait for the user's reply; only execute after explicit 'Yes' / approval.
Tier D: Destructive & Irreversible (delete)
- Requires explicit typed confirmation (e.g. "I confirm" or "Yes, delete it"). Ask for confirmation IMMEDIATELY — before any pre-flight checks (don't describe first, don't check if the endpoint is empty first).
- Same-turn restriction: NEVER execute in the same turn as asking for typed confirmation. Wait for the user to reply in a new turn.

Phase 0: Environment Setup

CRITICAL: Before running any commands, you MUST ensure the environment is correctly initialized by following these steps:

Google Cloud Authentication: Authenticate with your Google Cloud credentials and configure active Application Default Credentials (ADC) for Agent Platform access:
```
gcloud auth login
gcloud auth application-default login
```
Set Project: Configure the active project for subsequent commands:
```
gcloud config set project $PROJECT_ID
```
Region: Always specify --region=$LOCATION_ID on each command below. Do NOT use global. Ask the user to specify the region if not provided.

1. Listing Endpoints (Tier R)

Use this command to discover existing endpoints in a specific region and retrieve their IDs. No confirmation is required.

gcloud ai endpoints list \
    --region=$LOCATION_ID

[!IMPORTANT] Always specify the --region. Do NOT use 'global'. Ask the user to specify if not provided.

2. Describing an Endpoint (Tier R)

Retrieve the full metadata for a specific endpoint. No confirmation is required.

gcloud ai endpoints describe $ENDPOINT_ID \
    --region=$LOCATION_ID

3. Creating an Endpoint (Tier M)

Create a new endpoint resource. The parent resource is the location. Action requires an inline confirmation card before proceeding.

gcloud ai endpoints create \
    --region=$LOCATION_ID \
    --display-name="my-endpoint"

[!IMPORTANT] You MUST seek interactive confirmation first. Your confirmation prompt MUST show the literal command string. For example:
gcloud ai endpoints create --region=$LOCATION_ID --display-name="my-endpoint"
Or the exact flags. Do not execute this command in the same turn as proposing the confirmation.

4. Updating an Endpoint (Tier M)

Update endpoint metadata such as display name or labels. Action requires an inline confirmation card before proceeding.

gcloud ai endpoints update $ENDPOINT_ID \
    --region=$LOCATION_ID \
    --display-name="new-display-name"

Check if the endpoint exists first by either listing or describing the endpoint.

[!IMPORTANT] You MUST seek interactive confirmation first. Your confirmation prompt MUST show the literal command string. For example:
gcloud ai endpoints update $ENDPOINT_ID --region=$LOCATION_ID --display-name="new-display-name"
Or the exact flags. CRITICAL: You are strictly prohibited from executing this command in the same turn as asking for confirmation. When you ask for confirmation, you MUST stop immediately and wait for the user to reply.

5. Deleting an Endpoint (Tier D)

Permanently delete an endpoint resource. Action requires explicit typed confirmation before proceeding.

gcloud ai endpoints delete $ENDPOINT_ID \
    --region=$LOCATION_ID

[!WARNING] All models must be undeployed from the endpoint before it can be deleted. Do not run describe until AFTER you have received typed confirmation to delete.

6. Traffic Splitting (Tier M)

You can manage traffic split between different models deployed on the same endpoint during an update. Action requires an inline confirmation card before proceeding.

# Example: Deploying a model with a specific traffic split is usually done
# via 'gcloud ai endpoints deploy-model'.

Refer to the agent-platform-deploy skill for instructions on deploying and undeploying models.

Troubleshooting

403 Permission Denied: Ensure aiplatform.admin or owner role is assigned.
Quota Exceeded: Verify the region's endpoint quota in the Cloud Console.
Resource Busy: If a deletion fails, check if models are still being undeployed.

Files1

1 files · 11.1 KB

Select a file to preview

Overall Score

86/100

Grade

A

Excellent

Safety

88

Quality

85

Clarity

88

Completeness

83

Summary

This skill provides procedural guidance for managing Google Cloud Agent Platform serving endpoints through the gcloud CLI. It covers lifecycle operations (create, list, describe, update, delete) with explicit safety tiers requiring different levels of confirmation before execution. The skill emphasizes authentication setup, region configuration, and includes troubleshooting guidance for common permission and quota errors.

Detected Capabilities

gcloud cli executiongoogle cloud authenticationshell command executionenvironment variable configurationread-only queriesresource creationresource deletionuser confirmation workflow

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

manage endpointscreate serving endpointdelete endpointlist endpointsupdate endpointendpoint troubleshootingagent platform endpoint

Risk Signals

INFO

gcloud delete command with destructive operation

Section 5: Deleting an Endpoint

INFO

Environment setup requires gcloud authentication with ADC

Phase 0: Environment Setup

INFO

Region parameter required on all commands; global not allowed

Throughout skill, emphasized in sections 1 and Phase 0

Referenced Domains

External domains referenced in skill content, detected by static analysis.

www.apache.org

Use Cases

Create and manage serving endpoints for model deployment on Google Cloud Agent Platform
List and describe existing endpoints to discover their IDs and metadata
Update endpoint configurations such as display names and labels
Delete endpoints after ensuring all models have been undeployed
Troubleshoot permission denied, quota exceeded, and resource busy errors when managing endpoints
Understand traffic splitting between models deployed on the same endpoint

Quality Notes

Excellent safety tier structure (R/M/D) with clear confirmation requirements and same-turn execution restrictions preventing accidental destructive operations
Strong procedural clarity with marked CRITICAL and IMPORTANT sections that guide agent behavior explicitly
Well-documented edge cases (e.g., models must be undeployed before endpoint deletion) with specific troubleshooting guidance
Phase 0 environment setup is thorough and prevents common authentication errors
Clear boundaries stated: skill does NOT handle model deployment or evaluations (delegated to agent-platform-deploy skill)
Confirmation prompts explicitly require literal command strings, preventing natural-language paraphrasing that could mask dangerous operations
Good use of markdown formatting with code blocks, callout boxes, and step-by-step instructions for readability
Tier D (destructive) operations prioritize asking for confirmation before pre-flight checks, reducing risk of confirmation fatigue bypass

Model: claude-haiku-4-5-20251001Analyzed: Jun 28, 2026

Reviews

Add this skill to your library to leave a review.

No reviews yet

Be the first to share your experience.

Version History

v1.1

Content updated

2026-06-28

Latest

v1.0

No changelog

2026-06-05

agent-platform-endpoint-management

Agent Platform Endpoint Management

Overview

Safety & Confirmation Tiers (CRITICAL)

Phase 0: Environment Setup

1. Listing Endpoints (Tier R)

2. Describing an Endpoint (Tier R)

3. Creating an Endpoint (Tier M)

4. Updating an Endpoint (Tier M)

5. Deleting an Endpoint (Tier D)

6. Traffic Splitting (Tier M)

Troubleshooting

Summary

Detected Capabilities

Trigger Keywords

Risk Signals

Referenced Domains

Use Cases

Quality Notes

Reviews

Version History

Command Palette