Bigtable Basics

This skill provides core workflows and guidance for administering and developing with Google Bigtable.

Core Principles

Control Plane vs. Data Plane:
- Use gcloud for Control Plane operations: Manage Instances, Clusters, App Profiles, Backups and IAM. Create Tables, Logical Views, Materialized Views and Authorized Views.
- Use cbt for Data Plane operations: Update Tables, Column Families, and reading/writing data.
Performance First: Bigtable is a NoSQL database. Efficiency is tied to Row Key design. Always warn about Full Table Scans.
Client Selection: For production use cases, prefer Java or Go for their superior performance and feature coverage compared to other languages.
Observability: When diagnosing performance or hotspotting, always mention Key Visualizer (via Cloud Console) as the primary diagnostic tool because it provides the most granular view of access patterns across row keys. This should be followed by the hot-tablets tool and table stats in gcloud CLI and include-stats=full option under cbt read to diagnose slow queries.

[!IMPORTANT] Safety Rule: You MUST obtain explicit user confirmation before making non-emulator database changes. You MUST mention this safety requirement when providing commands or instructions that modify the database structure or data.

Quick Recipes

1. Querying Data

Use SQL for complex transforms or aggregations and key-value APIs for simpler query patterns. Note: Use exact match, prefix (_key LIKE 'myprefix%'), or range predicates on _key to avoid expensive unbounded scans. Recommend explicit row ranges (_key BETWEEN 'start' AND 'end') as a more performant alternative to prefix matches where possible.

If expensive scans (either unbounded or prefix or range queries scanning a large range) are unavoidable due to multiple access patterns that can’t all be accommodated in a single schema, consider one of these two options:

If the query will be used in user facing and/or latency sensitive applications, use continuous materialized views with keys optimized for the additional access patterns.
If secondary access patterns are infrequent, batch patterns like ETL, ML model training or analytical read-only tasks, use Bigtable Data Boost instead.

2. Manipulating Data

Use key-value APIs for insert, update, increment and delete operations. SQL API is read-only.

3. Data Model Definition (DDL)

SQL API doesn't support DDL operations. Table creation, deletion, updates should be made using gcloud CLI. Logical Views and Continuous Materialized Views are defined as SQL queries but they must be created using gcloud CLI.

Reference Guides

CLI Operations:
- infrastructure_management.md: Provisioning instances, clusters, and table schemas.
- cli_data_access.md: Reading and writing data via the cbt CLI.
Design & Discovery:
- schema_design.md: Best practices for row keys and performance with tables and continuous materialized views.
- dataplex.md: Data catalog search for Bigtable assets.
Querying & Code:
- sql_guide.md: Querying structured row keys via SQL and CLI.
- client_libraries.md: Patterns for high-performance Go/Java/Python code.

Common Workflows

Schema Evolution (DevOps)

Prefer Terraform for production schema changes to prevent accidental data loss.
For manual cbt changes, first check the existing state by listing the table's column families and GC policies before proposing any modifications:
```
cbt ls {table}
```
If modifications are needed, create the family or update the GC policy:
```
cbt createfamily {table} {family}
cbt setgcpolicy {table} {family} "maxversions=5 AND maxage=30d"
```
Reference infrastructure_management.md for full syntax.

External Resources

Files8

8 files · 34.8 KB

Select a file to preview

Overall Score

88/100

Grade

A

Excellent

Safety

85

Quality

92

Clarity

89

Completeness

85

Summary

A comprehensive skill providing guidance for administering Google Bigtable, including provisioning instances/clusters, designing performant schemas with structured row keys, querying via SQL and client libraries (Java, Go, Python), and diagnosing performance issues. The skill emphasizes control plane operations (gcloud) vs. data plane operations (cbt), row key design for avoiding hotspotting, and includes detailed reference guides on infrastructure management, schema design, SQL patterns, and client library best practices.

Detected Capabilities

gcloud CLI executioncbt CLI executionHTTP requests (Dataplex API calls)cloud resource provisioningdatabase schema design guidanceSQL query generationclient code generation (Java, Go, Python)

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

bigtable schema designrow key optimizationbigtable hotspottingprovision bigtable clusterbigtable sql queriesbigtable client codebigtable performance tuningmaterialized views bigtable

Risk Signals

WARNING

gcloud bigtable instances delete command with --quiet flag allows destructive deletion

references/infrastructure_management.md:L29

WARNING

Bearer token access for Dataplex API requires authentication token exposure

references/dataplex.md:L10

INFO

HTTP requests to external GCP API endpoints (dataplex.googleapis.com)

references/dataplex.md:L9

Referenced Domains

External domains referenced in skill content, detected by static analysis.

cloud.google.comdataplex.googleapis.comdocs.cloud.google.comwww.apache.org

Use Cases

design performant Bigtable row keys and schemas
provision Bigtable instances, clusters, and tables using gcloud CLI
query Bigtable data using SQL API and cbt CLI
write high-performance client code in Java, Go, or Python for Bigtable
diagnose and resolve Bigtable hotspotting and performance issues
set up materialized views for real-time analytics and secondary indexing
manage Bigtable table structure, column families, and garbage collection policies
implement timestamp versioning and TTL strategies for data retention

Quality Notes

Excellent scope clarity: skill explicitly limits itself to Bigtable and excludes Cloud SQL (negated in description)
Strong safety guardrail: explicit requirement for user confirmation before non-emulator database changes (marked [!IMPORTANT])
Comprehensive reference structure: 6 reference markdown files cover all major domains (infrastructure, schema, SQL, client libraries, CLI, Dataplex)
Well-documented performance principles: clear guidance on avoiding full table scans, hotspotting prevention, and Key Visualizer for diagnostics
Rich edge case coverage: handles opaque vs. structured row keys, timestamp precision (microseconds vs. milliseconds), replication constraints on atomic operations, and multiple SQL access patterns
Excellent examples and syntax clarity: correct vs. incorrect SQL patterns shown explicitly (e.g., bracket notation for map access vs. unsupported dot notation)
Good error handling guidance: warnings about expensive queries, multi-cluster routing incompatibility with atomic operations, full table scan warnings
Complete provisioning workflows: includes emulator setup for local development to prevent accidental production changes
Clear tool delegation: Control Plane (gcloud) vs. Data Plane (cbt) separation is consistently reinforced throughout
Supporting assets included: row_key_schema.yaml template provided for structured row key definition

Model: claude-haiku-4-5-20251001Analyzed: Jun 19, 2026

Reviews

Add this skill to your library to leave a review.

No reviews yet

Be the first to share your experience.

bigtable-basics