github/transloadit-media-processing

Process media files (video, audio, images, documents) using Transloadit. Use when asked to encode video to HLS/MP4, generate thumbnails, resize or watermark images, extract audio, concatenate clips, add subtitles, OCR documents, or run any media processing pipeline. Covers 86+ processing robots for file transformation at scale.

global · MIT
Requires a free Transloadit account (https://transloadit.com/signup). Uses the @transloadit/mcp-server MCP server or the @transloadit/node CLI.
New · ~1.2k
v1.0 · Saved Jun 26, 2026

Transloadit Media Processing

Process, transform, and encode media files using Transloadit's cloud infrastructure. Supports video, audio, images, and documents with 86+ specialized processing robots.

When to Use This Skill

Use this skill when you need to:

  • Encode video to HLS, MP4, WebM, or other formats
  • Generate thumbnails or animated GIFs from video
  • Resize, crop, watermark, or optimize images
  • Convert between image formats (JPEG, PNG, WebP, AVIF, HEIF)
  • Extract or transcode audio (MP3, AAC, FLAC, WAV)
  • Concatenate video or audio clips
  • Add subtitles or overlay text on video
  • OCR documents (PDF, scanned images)
  • Run speech-to-text or text-to-speech
  • Apply AI-based content moderation or object detection
  • Build multi-step media pipelines that chain operations together

Setup

Option A: MCP Server (recommended)

Add the Transloadit MCP server to your IDE config. This gives the agent direct access to Transloadit tools (create_template, create_assembly, list_assembly_notifications, etc.).

VS Code / GitHub Copilot (.vscode/mcp.json or user settings):

{
  "servers": {
    "transloadit": {
      "command": "npx",
      "args": ["-y", "@transloadit/mcp-server", "stdio"],
      "env": {
        "TRANSLOADIT_KEY": "YOUR_AUTH_KEY",
        "TRANSLOADIT_SECRET": "YOUR_AUTH_SECRET"
      }
    }
  }
}

Get your API credentials at https://transloadit.com/c/-/api-credentials
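
Before starting the server, it can help to confirm that the placeholder values in the config were actually replaced. The following is a minimal sketch, not part of Transloadit's tooling: `unset_credentials` is a hypothetical helper that takes the parsed config shown above and reports which credential variables are still empty or set to a `YOUR_...` placeholder.

```python
import json

# Hypothetical helper for illustration; not part of any Transloadit package.
REQUIRED_VARS = ("TRANSLOADIT_KEY", "TRANSLOADIT_SECRET")


def unset_credentials(config: dict, server: str = "transloadit") -> list[str]:
    """Return the required env vars that are missing or still placeholders."""
    env = config.get("servers", {}).get(server, {}).get("env", {})
    return [
        name
        for name in REQUIRED_VARS
        if not env.get(name) or env[name].startswith("YOUR_")
    ]


# The config from above, unchanged, so both values are still placeholders.
example = json.loads("""
{
  "servers": {
    "transloadit": {
      "command": "npx",
      "args": ["-y", "@transloadit/mcp-server", "stdio"],
      "env": {
        "TRANSLOADIT_KEY": "YOUR_AUTH_KEY",
        "TRANSLOADIT_SECRET": "YOUR_AUTH_SECRET"
      }
    }
  }
}
""")

print(unset_credentials(example))  # ['TRANSLOADIT_KEY', 'TRANSLOADIT_SECRET']
```

An empty list means both variables carry real-looking values; it does not verify that the credentials are valid.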

Option B: CLI

If you prefer running commands directly:

npx -y @transloadit/node assemblies create \
  --steps '{"encoded": {"robot": "/video/encode", "use": ":original", "preset": "hls-1080p"}}' \
  --wait \
  --input ./my-video.mp4
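
Hand-writing JSON inside shell quotes is easy to get wrong. As a rough sketch, the same command can be assembled as an argument list from a Python dict, so the steps are serialized by `json.dumps` rather than typed by hand. `build_cli_args` is a hypothetical helper; it uses only the subcommand and flags that appear in the command above.

```python
import json


def build_cli_args(steps: dict, input_path: str, wait: bool = True) -> list[str]:
    """Build the argv list for the CLI command shown above."""
    args = [
        "npx", "-y", "@transloadit/node", "assemblies", "create",
        "--steps", json.dumps(steps),
    ]
    if wait:
        args.append("--wait")
    args += ["--input", input_path]
    return args


steps = {
    "encoded": {"robot": "/video/encode", "use": ":original", "preset": "hls-1080p"}
}
argv = build_cli_args(steps, "./my-video.mp4")
print(argv)
# To run it: subprocess.run(argv, check=True)
# (requires Node.js and Transloadit credentials available to the CLI)
```

Passing a list to `subprocess.run` avoids a shell entirely, so no quoting or escaping of the JSON is needed.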

Core Workflows

Encode Video to HLS (Adaptive Streaming)

{
  "steps": {
    "encoded": {
      "robot": "/video/encode",
      "use": ":original",
      "preset": "hls-1080p"
    }
  }
}

Generate Thumbnails from Video

{
  "steps": {
    "thumbnails": {
      "robot": "/video/thumbs",
      "use": ":original",
      "count": 8,
      "width": 320,
      "height": 240
    }
  }
}

Resize and Watermark Images

{
  "steps": {
    "resized": {
      "robot": "/image/resize",
      "use": ":original",
      "width": 1200,
      "height": 800,
      "resize_strategy": "fit"
    },
    "watermarked": {
      "robot": "/image/resize",
      "use": "resized",
      "watermark_url": "https://example.com/logo.png",
      "watermark_position": "bottom-right",
      "watermark_size": "15%"
    }
  }
}
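
To get a feel for what the "resized" step produces, the arithmetic can be sketched as below. This assumes "fit" means scaling the image to the largest size that fits inside the width x height box while preserving aspect ratio, which is the usual meaning of the term; the Transloadit documentation is the authority on exact rounding and on whether smaller images are enlarged. `fit_dimensions` is a hypothetical helper for illustration only.

```python
def fit_dimensions(src_w: int, src_h: int, max_w: int, max_h: int) -> tuple[int, int]:
    """Scale (src_w, src_h) to fit inside (max_w, max_h), keeping aspect ratio."""
    # The tighter of the two constraints determines the scale factor.
    scale = min(max_w / src_w, max_h / src_h)
    return round(src_w * scale), round(src_h * scale)


# A 4:3 source is limited by height; a 3:1 source is limited by width.
print(fit_dimensions(4000, 3000, 1200, 800))  # (1067, 800)
print(fit_dimensions(3000, 1000, 1200, 800))  # (1200, 400)
```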

OCR a Document

{
  "steps": {
    "recognized": {
      "robot": "/document/ocr",
      "use": ":original",
      "provider": "aws",
      "format": "text"
    }
  }
}

Concatenate Audio Clips

{
  "steps": {
    "imported": {
      "robot": "/http/import",
      "url": ["https://example.com/clip1.mp3", "https://example.com/clip2.mp3"]
    },
    "concatenated": {
      "robot": "/audio/concat",
      "use": "imported",
      "preset": "mp3"
    }
  }
}

Multi-Step Pipelines

Steps can be chained using the "use" field. Each step references a previous step's output:

{
  "steps": {
    "resized": {
      "robot": "/image/resize",
      "use": ":original",
      "width": 1920
    },
    "optimized": {
      "robot": "/image/optimize",
      "use": "resized"
    },
    "exported": {
      "robot": "/s3/store",
      "use": "optimized",
      "bucket": "my-bucket",
      "path": "processed/${file.name}"
    }
  }
}
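
A misspelled step name in "use" is an easy mistake in longer pipelines. The sketch below is a local pre-flight check under stated assumptions, not Transloadit's own validation: `check_use_references` is a hypothetical helper, and it only understands "use" given as a string or a list of strings.

```python
def check_use_references(steps: dict) -> list[str]:
    """Return problems found in the "use" fields; an empty list means none."""
    problems = []
    for name, config in steps.items():
        use = config.get("use")
        if isinstance(use, str):
            refs = [use]
        elif isinstance(use, list):
            refs = use
        else:
            refs = []  # missing, or a form this sketch does not handle
        for ref in refs:
            if ref == ":original":
                continue  # the uploaded input file, always available
            if ref == name:
                problems.append(f"{name}: uses itself")
            elif ref not in steps:
                problems.append(f"{name}: unknown step {ref!r}")
    return problems


pipeline = {
    "resized": {"robot": "/image/resize", "use": ":original", "width": 1920},
    "optimized": {"robot": "/image/optimize", "use": "resized"},
    "exported": {"robot": "/s3/store", "use": "optimised"},  # deliberate typo
}
print(check_use_references(pipeline))  # ["exported: unknown step 'optimised'"]
```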

Key Concepts

  • Assembly: A single processing job. Created via create_assembly (MCP) or assemblies create (CLI).
  • Template: A reusable set of steps stored on Transloadit. Created via create_template (MCP) or templates create (CLI).
  • Robot: A processing unit (e.g., /video/encode, /image/resize). See full list at https://transloadit.com/docs/transcoding/
  • Steps: JSON object defining the pipeline. Each key is a step name, each value configures a robot.
  • :original: Refers to the uploaded input file.

Tips

  • Use --wait with the CLI to block until processing completes.
  • Use preset values (e.g., "hls-1080p", "mp3", "webp") for common format targets instead of specifying every parameter.
  • Chain "use": "step_name" to build multi-step pipelines without intermediate downloads.
  • For batch processing, use /http/import to pull files from URLs; sources such as S3, GCS, Azure, FTP, or Dropbox are handled by their own import robots (e.g., /s3/import).
  • Templates can include ${variables} for dynamic values passed at assembly creation time.

Overall Score: 78/100 (Grade B, Good)

  • Safety: 75
  • Quality: 80
  • Clarity: 85
  • Completeness: 72

Summary

Skill instructs agents to configure and use Transloadit's media processing cloud service via MCP server or CLI. The agent learns to create assembly jobs, chain processing steps (video encoding, image resizing, OCR, audio concatenation), and execute multi-step media pipelines. Supports 86+ specialized robots for transforming video, audio, images, and documents.

Detected Capabilities

  • network request to external API (Transloadit)
  • credential configuration (API key and secret via environment)
  • shell command execution (CLI via npx)
  • file upload and processing
  • multi-step processing pipeline orchestration
  • environment variable read (TRANSLOADIT_KEY, TRANSLOADIT_SECRET)

Trigger Keywords

Phrases that MCP clients use to match this skill to user intent.

encode video, transcode media, generate thumbnails, resize images, ocr documents, watermark image, concatenate clips, hls streaming, process video, media pipeline

Risk Signals

  • WARNING: Credential configuration via environment variables (TRANSLOADIT_KEY, TRANSLOADIT_SECRET in MCP server config). Location: Setup section, Option A (MCP Server)
  • INFO: External API calls to transloadit.com (documented, expected service integration). Location: Core concept, referenced domains
  • INFO: File upload and processing via external cloud service. Location: Core Workflows section
  • WARNING: HTTP/URL import capability for batch processing from external sources. Location: Tips section and http/import robot

Referenced Domains

External domains referenced in skill content, detected by static analysis.

example.com, transloadit.com

Use Cases

  • Encode video to HLS or MP4 for adaptive streaming delivery
  • Generate thumbnails and animated GIFs from video frames
  • Resize, crop, watermark, or optimize images in bulk
  • Extract audio from video or transcode between audio formats
  • Concatenate video or audio clips into a single file
  • Add subtitles, overlay text, or caption video content
  • OCR scanned documents or PDFs to extract text
  • Run multi-step media pipelines that chain operations together
  • Batch process media files via HTTP import from URLs or cloud storage
  • Apply speech-to-text, text-to-speech, or AI content moderation

Quality Notes

  • Excellent documentation: clear use cases listed at the top with bullet points matching the skill's purpose
  • Comprehensive setup instructions with both MCP server (recommended) and CLI options
  • Well-structured core workflows section with real JSON examples for common media tasks
  • Clear explanation of key concepts (Assembly, Template, Robot, Steps, :original)
  • Helpful tips section covering preset usage, chaining, batch processing, and variables
  • Frontmatter description is detailed and matches the skill content
  • Good example coverage: HLS encoding, thumbnails, image watermarking, OCR, audio concatenation, multi-step pipelines
  • Limitation: no explicit security or compliance notes for API key handling (though environment-based setup is correct)
  • Limitation: no error handling or fallback patterns documented
  • Limitation: no discussion of cost implications or rate limiting

Model: claude-haiku-4-5-20251001
Analyzed: Jun 26, 2026
