Catalog
google/agent-platform-tuning-management

google

agent-platform-tuning-management

Manages GenAI tuning jobs in Agent Platform. Use this to list, get, or cancel ongoing model tuning jobs. Don't use for fine-tuning models (use `agent-platform-tuning`), deploying models to endpoints (use `agent-platform-deploy`), or managing serving endpoints (use `agent-platform-endpoint-management`).

global
0installs0uses~1.5k
v1.0Saved May 27, 2026

Agent Platform Tuning Management

This skill provides instructions on how to manage GenAI Tuning Jobs using the Agent Platform Python SDK. Use this skill when a user wants to check the status of their tuning runs, find an active tuning job, or cancel a job that is running too long.

Safety & Confirmation Tiers (CRITICAL)

Before executing any commands on behalf of the user, you MUST adhere to the following safety tiers based on the action requested:

  1. Tier R: Read-only (list, get)
    • Rule: No confirmation needed. You may execute these commands immediately to gather information for the user.
  2. Tier D: Destructive & Interruptive (cancel)
    • Rule: This requires explicit typed confirmation. You MUST output a text message to the user explaining that this will stop the tuning process and any progress will be lost, and asking them to type "I confirm" or "Yes, cancel it". You MUST ask for this confirmation IMMEDIATELY, before executing the cancel command.

Phase 0: Environment Setup

CRITICAL: Before running any of the Python snippets below, you MUST ensure the environment is correctly initialized by following these steps:

  1. Virtual Environment: Create and activate a virtual environment:
    python3 -m venv ~/tuning_mgr_venv
    source ~/tuning_mgr_venv/bin/activate
    
  2. Google Cloud Authentication: Authenticate with your Google Cloud account and configure active Application Default Credentials (ADC) for Agent Platform access:
    gcloud auth login
    gcloud auth application-default login
    
  3. Install Dependencies: Install the required Agent Platform SDK:
    pip install google-cloud-aiplatform
    
  4. Execution: Advise the user that every time they execute a Python snippet, they must ensure this virtual environment is activated first.

Workflow Decision Tree

  1. Information Gathering: Do you have a Project ID and Region?

    • No -> You MUST ask the user for the missing Project ID and Region in plain text, or advise them to check their gcloud configuration. If neither location has this information, then ask the user to provide it. Do not attempt to search random regions on your own.
    • Yes -> Proceed to Step 2.
  2. Task Type: What does the user want to do?

    • Find or List Jobs -> Use the Python SDK to list tuning jobs. (Tier R)
    • Check Status / Inspect a Specific Job -> Use the Python SDK to get tuning job details. (Tier R)
    • Cancel a Job -> Ask for confirmation, then use the Python SDK to cancel the tuning job. (Tier D)

Using the Python SDK

[!NOTE] Resource Verification & Missing Projects/Jobs: If the execution of the Python snippet fails with an error (such as 403 Permission Denied, 404 Not Found, INVALID_ARGUMENT, or indicating a dummy/missing project or job ID), you MUST inform the user that the project or tuning job does not exist or cannot be accessed. You MUST prompt the user to provide a valid Project ID or Job ID, and stop tool execution immediately to wait for their response. Do NOT retry or loop, do NOT assume the resource is valid, and do NOT execute further scripts before receiving valid details from the user.

1. Listing Tuning Jobs (Tier R)

If the user asks "What tuning jobs do I have running?" or wants to find a specific job ID:

from google.cloud import aiplatform_v1

project_id = "YOUR_PROJECT_ID"
region = "YOUR_REGION"
parent = f"projects/{project_id}/locations/{region}"

client = aiplatform_v1.GenAiTuningServiceClient(
    client_options={"api_endpoint": f"{region}-aiplatform.googleapis.com"}
)

jobs = client.list_tuning_jobs(parent=parent)
for job in jobs:
    print(f"Name: {job.name}")
    print(f"Base Model: {job.base_model}")
    print(f"State: {job.state}")

2. Getting Details for a Specific Job (Tier R)

If the user provides a Tuning Job ID and asks for its status:

from google.cloud import aiplatform_v1

project_id = "YOUR_PROJECT_ID"
region = "YOUR_REGION"
job_id = "YOUR_JOB_ID"  # 19-digit ID
name = f"projects/{project_id}/locations/{region}/tuningJobs/{job_id}"

client = aiplatform_v1.GenAiTuningServiceClient(
    client_options={"api_endpoint": f"{region}-aiplatform.googleapis.com"}
)

job = client.get_tuning_job(name=name)
print(f"Name: {job.name}")
print(f"Base Model: {job.base_model}")
print(f"State: {job.state}")
print(f"Tuning Model: {job.tuned_model_display_name}")

3. Canceling a Job (Tier D)

If the user explicitly requests to stop, abort, or cancel a running tuning job:

Safety Check: Action requires explicit typed confirmation before proceeding. You MUST ask the user for confirmation before generating or providing this script, even if they provided the job ID, unless they explicitly use confirming language like "Yes, I confirm, cancel tuning job 123456".

[!IMPORTANT] NEVER pre-emptively provide or execute any cancellation code before receiving the user's response in a new turn. You must never speculate or assume that confirmation will be given. Asking for confirmation and providing the code in a single parallel turn is a severe safety violation.

from google.cloud import aiplatform_v1

project_id = "YOUR_PROJECT_ID"
region = "YOUR_REGION"
job_id = "YOUR_JOB_ID"  # 19-digit ID
name = f"projects/{project_id}/locations/{region}/tuningJobs/{job_id}"

client = aiplatform_v1.GenAiTuningServiceClient(
    client_options={"api_endpoint": f"{region}-aiplatform.googleapis.com"}
)

client.cancel_tuning_job(name=name)
print(f"Successfully requested cancellation for {name}")
Files1
1 files · 11.1 KB

Select a file to preview

Overall Score

87/100

Grade

A

Excellent

Safety

92

Quality

85

Clarity

87

Completeness

82

Summary

This skill guides AI agents to manage GenAI tuning jobs in Google's Agent Platform using the Python SDK. It covers listing active jobs, checking job status, and canceling in-progress tuning runs with explicit safety tiers requiring typed user confirmation for destructive operations.

Detected Capabilities

Python SDK executionGoogle Cloud API callsVirtual environment setupCredential authentication (gcloud ADC)Resource querying (list, get)Resource mutation (cancel)

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

check tuning job statuslist active tuning runscancel model tuningmonitor agent platform jobsinspect tuning progress

Risk Signals

INFO

Google Cloud API access via Python SDK

Phase 0: Environment Setup and SDK sections
INFO

Credential authentication via gcloud ADC

Phase 0, Step 2
INFO

Cancellation operation requires explicit typed user confirmation before execution

Section 3: Canceling a Job (Tier D)
INFO

Destructive operation (cancel) enforced with multi-turn confirmation workflow

Canceling a Job and Safety & Confirmation Tiers sections

Referenced Domains

External domains referenced in skill content, detected by static analysis.

www.apache.org

Use Cases

  • Check the status of ongoing model tuning jobs
  • List all active tuning runs in a GCP project
  • Cancel a long-running tuning job with explicit confirmation
  • Retrieve detailed information about a specific tuning job by ID
  • Monitor multiple tuning jobs across regions

Quality Notes

  • Excellent safety design: explicit multi-tier confirmation system clearly separates read-only (Tier R) from destructive (Tier D) operations
  • Well-documented decision tree guiding agent through required information gathering before execution
  • Clear emphasis on resource verification: skill explicitly instructs agent to halt and request valid credentials/IDs if API calls fail
  • Critical safety enforcement: skill forbids pre-emptive provision of cancellation code without user confirmation in separate turn, preventing accidental destructive operations
  • Comprehensive environment setup instructions including virtual environment, GCP authentication, and dependency installation
  • Good error handling guidance: skill details expected API error codes (403, 404, INVALID_ARGUMENT) and prescribes correct agent behavior
  • Strong scoping: skill explicitly defers to other skills for related tasks (fine-tuning, deployment, endpoint management)
  • Code examples use placeholder substitution (YOUR_PROJECT_ID, YOUR_JOB_ID) appropriately without hardcoding values
Model: claude-haiku-4-5-20251001Analyzed: May 27, 2026

Reviews

Add this skill to your library to leave a review.

No reviews yet

Be the first to share your experience.

Add google/agent-platform-tuning-management to your library

Command Palette

Search for a command to run...