API Endpoints
Transcription API
Convert audio/video to high-accuracy captions with speaker identification and language detection
- Word-level timestamps
- Multi-language support
- Speaker identification
- Async processing
Moments API
Generate up to 5 scored Golden Moments per project, each with timecodes, a category, bilingual reasons, and a pre-cut SRT for the clip
- AI clip scoring
- Bilingual reasons
- Pre-cut SRT per moment
- Category tagging
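To make "pre-cut SRT per moment" concrete, here is a minimal sketch of how subtitle cues could be trimmed to a clip and re-based so the clip starts at 00:00:00,000. The SRT timestamp format is standard. The function names, the millisecond tuple layout, and the trimming rule are assumptions for illustration, not the documented behaviour of the Moments API.

```python
from typing import List, Tuple

Cue = Tuple[int, int, str]  # (start_ms, end_ms, text); hypothetical layout


def srt_timestamp(ms: int) -> str:
    """Format milliseconds as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def precut_srt(cues: List[Cue], clip_start_ms: int, clip_end_ms: int) -> str:
    """Keep cues overlapping the clip, trim them to it, and shift to clip time."""
    blocks = []
    for start, end, text in cues:
        if end <= clip_start_ms or start >= clip_end_ms:
            continue  # cue lies entirely outside the clip
        s = max(start, clip_start_ms) - clip_start_ms
        e = min(end, clip_end_ms) - clip_start_ms
        index = len(blocks) + 1  # SRT cue numbers start at 1
        blocks.append(f"{index}\n{srt_timestamp(s)} --> {srt_timestamp(e)}\n{text}\n")
    return "\n".join(blocks)
```

For example, `precut_srt([(9000, 11000, "hi")], 10000, 15000)` yields a single cue running from 00:00:00,000 to 00:00:01,000.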
Metadata API
Generate SEO metadata, summaries, chapters, keywords, and JSON-LD structured data automatically
- Auto-summaries
- Chapter detection
- Named entities
- JSON-LD output
- Sentiment analysis
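As an illustration of what JSON-LD structured data for a video can look like, the sketch below builds a schema.org `VideoObject` with chapters expressed as `Clip` parts. The schema.org types and properties are real. The function name, its parameters, and the assumption that the Metadata API emits this particular shape are hypothetical.

```python
import json
from typing import Dict, List, Tuple


def video_json_ld(
    title: str,
    summary: str,
    keywords: List[str],
    chapters: List[Tuple[str, int, int]],  # (name, start_seconds, end_seconds)
) -> Dict:
    """Build a schema.org VideoObject dict with one Clip per chapter."""
    return {
        "@context": "https://schema.org",
        "@type": "VideoObject",
        "name": title,
        "description": summary,
        "keywords": ", ".join(keywords),
        "hasPart": [
            {"@type": "Clip", "name": name, "startOffset": start, "endOffset": end}
            for name, start, end in chapters
        ],
    }


doc = video_json_ld(
    "Product launch",
    "A short summary of the launch event.",
    ["launch", "keynote"],
    [("Intro", 0, 45), ("Demo", 45, 300)],
)
print(json.dumps(doc, indent=2))
```

The serialized string can be embedded in a page inside a `<script type="application/ld+json">` element.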
Refine API
Apply script alignment, glossary corrections, formatting rules, and AI-powered polishing
- Script alignment
- Glossary enforcement
- Punctuation fixes
- Style application
- Broadcast-ready output
Compliance API
Scan content for sensitive topics, political references, brand mentions, and regulatory compliance risks
- Sensitive topic flagging
- Brand mention detection
- Political references
- Risk scoring
- Content warnings
Core Capabilities
Async Jobs
Submit jobs, then poll for results or receive webhook notifications
Rate Limiting
Fair-use policy with configurable rate limits per API key
Webhooks
Real-time notifications when jobs complete
API Keys
Scoped keys with granular permission control
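The submit-then-poll flow for async jobs can be sketched as a small polling loop. This is a minimal sketch under stated assumptions: the `status` field and its `completed` / `failed` values are hypothetical, and `fetch_status` stands in for whatever HTTP call retrieves the job, since the source does not specify endpoint paths or response shapes.

```python
import time
from typing import Any, Callable, Dict


def poll_job(
    fetch_status: Callable[[], Dict[str, Any]],
    interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch_status until the job reaches a terminal state or times out."""
    deadline = time.monotonic() + timeout
    while True:
        job = fetch_status()
        # Assumed terminal states; adjust to the real job schema.
        if job.get("status") in ("completed", "failed"):
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError("job did not finish before the timeout")
        sleep(interval)
```

When webhooks are configured, a loop like this is only a fallback; the webhook notification removes the need to poll at all.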
Authentication & Security
Bearer Token Auth
All endpoints require Bearer token authentication with scoped API keys.
Authorization: Bearer cap_xxxxx
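A minimal sketch of attaching that header from Python's standard library follows. The base URL and the `/transcriptions` path are placeholders, since the source does not give either; only the `Authorization: Bearer` header format comes from the text above.

```python
import urllib.request

# Hypothetical base URL; replace with the real API host.
BASE_URL = "https://api.example.com/v1"


def build_request(path: str, api_key: str) -> urllib.request.Request:
    """Build a request object carrying the Bearer token header (nothing is sent)."""
    return urllib.request.Request(
        BASE_URL + path,
        headers={"Authorization": f"Bearer {api_key}"},
    )


# "/transcriptions" is an assumed path, used for illustration only.
req = build_request("/transcriptions", "cap_xxxxx")
```

Passing `req` to `urllib.request.urlopen` would send it. Keep the key in an environment variable or secret store rather than in source code.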
Rate Limiting
Fair-use rate limits with per-key configuration. Limits are returned in response headers.
- 100 requests/minute (default)
- Video processing quotas
- X-RateLimit-* headers
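A client can use the rate-limit headers to decide how long to back off. The source only names the `X-RateLimit-*` prefix, so the sketch below assumes the common `X-RateLimit-Remaining` (requests left) and `X-RateLimit-Reset` (Unix epoch seconds) variants; check the real header names and units before relying on it.

```python
import time
from typing import Mapping, Optional


def seconds_until_reset(headers: Mapping[str, str], now: Optional[float] = None) -> float:
    """Return how long to wait before the next request; 0.0 if quota remains.

    Header names are assumptions, and lookups here are case-sensitive.
    """
    now = time.time() if now is None else now
    remaining = int(headers.get("X-RateLimit-Remaining", "1"))
    if remaining > 0:
        return 0.0
    reset = headers.get("X-RateLimit-Reset")
    if reset is None:
        return 0.0  # no reset hint available; caller picks its own backoff
    return max(0.0, float(reset) - now)
```

The `now` parameter exists so the function can be tested without depending on the wall clock.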
HTTP Status Codes
- 2xx: Success (sync or async job queued)
- 4xx: Client error (validation or auth failure)
- 429 / 5xx: Rate limit or server error
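The three outcome groups above can be mapped to client behaviour with a small classifier. This sketch assumes the standard HTTP status classes and the standard 429 code for rate limiting; treating server errors as retryable is a design assumption, not something the source states.

```python
def classify(status: int) -> str:
    """Map an HTTP status code to a coarse client action."""
    if 200 <= status < 300:
        return "success"       # sync result or async job queued
    if status == 429:
        return "retry"         # rate limited: back off, then retry
    if 400 <= status < 500:
        return "client_error"  # validation or auth failure: fix the request
    if 500 <= status < 600:
        return "retry"         # server error: assumed transient
    return "unexpected"
```

The 429 check comes before the general 4xx check because 429 is itself a 4xx code but calls for a retry rather than a corrected request.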
Use Cases
🤖 AI Agents
Power AI agents with media understanding for automated workflows.
⚙️ Integrations
Embed into existing tools, platforms, and content management systems.
📦 Microservices
Build modular language intelligence services for internal tools.