Welsh-First AI Workspace

Why Capsiynau?

The bilingual workspace for Welsh and English broadcast, public sector, and production. Captioning, live events, document translation, and Human-Verified review - all in one place, with a Welsh-aware AI pipeline that improves with every correction.

Not another generic transcription tool. OpenAI Whisper, AssemblyAI Universal-3 Pro, and Claude AI tuned for Welsh phonetics and treigladau. Five caption styles from Broadcast to Verbatim. Professional linguist verification on demand. A complete bilingual production workflow.

The Problem with Welsh Transcription

Why generic AI tools don't work for Welsh content.

01

Generic AI Fails Welsh

Major transcription tools treat Welsh as "another minority language." They miss mutations, misidentify dialects, and produce output that takes more editing than writing from scratch.

02

Nobody Handles Mutations

Soft, nasal, and aspirate mutations (treigladau) aren't errors - they're fundamental to Welsh. But generic AI ignores them or "corrects" them back.

03

No Bilingual Workflow

Most Welsh content is bilingual - switching between Welsh and English. Existing tools force one language at a time, losing context.

How Capsiynau Is Different

Purpose-built for the unique challenges Welsh presents.

01

Cymraeg-First Engine

Built from the ground up for Welsh phonetics, not bolted onto an English AI.

02

Second-Pass Cleanup

Three tiers: Raw (normaliser only, free), Clean (Haiku AI on low-confidence segments), Broadcast-ready (full pass). Default set by plan tier.

03

Natural Segmenting

AI splits captions at natural phrase boundaries with broadcast CPS limits.

04

Learns from Your Edits

The system tracks corrections across projects and builds a personal glossary over time.

05

Team Glossaries

Brand terms, dialect, formal - with voting, comments, and admin approval.

06

Caption Style Presets

5 styles from single-word to multi-line. Choose once - it remembers your preference.

Capsiynau and the ecosystem

Capsiynau is part of a family of products. The same Welsh-first AI infrastructure that powers our captioning and live relay also powers Nodiadau (AI voice notes for bilingual professionals) and Llais (Welsh text-to-speech for any site).

Capsiynau is the B2B and institutional surface, built for broadcasters, production companies, and public-sector procurement. The other products are standalone consumer brands sharing the same infrastructure, the same correction loop, and the same intelligence layer.

Four modules, one direction

Each one built around Welsh first. Pick the one that applies to you.

Module 1Media

Professional Editor

A bilingual editor with everything you need for broadcast-quality captions.

01

Full-Text Editing

Edit the entire transcript as a text document - changes are auto-resynced to the original word-level timestamps.

02

Split & Merge Segments

Split segments at any point, or merge them. Adjust timings visually on the timeline.

03

Precise Timing Editor

Adjust start/end times to the millisecond - with real-time waveform preview.

04

Keyboard Shortcuts

Play/Pause, jump between segments, and split segments - all from the keyboard.

05

Loop Mode

Clicking a segment auto-loops the audio - perfect for fine editing while listening.

06

Waveform Editor

See your audio visually - the waveform follows playback and click to jump to any point.

Intelligent AI

AI That Learns Your Voice

More than speech recognition - an intelligence layer that improves with every project.

01

Review & Refine

Check grammar, mutations, spelling and phrasing step-by-step - review every suggestion before accepting.

02

One-Click Polish

Run all polish steps at once: language detection, punctuation, segmenting, Welsh cleanup, and glossary.

03

Natural Segmenting

AI re-cuts captions at natural phrase boundaries - for smoother, more readable captions.

04

Learning Layer

Corrections are tracked across projects. Repeated edits are promoted into your Welsh vocabulary -3 corrections = candidate, 10 = approved and active on the next transcription.

05

Hard vs Soft Rules

Manual glossary terms = hard rules (always apply). Learned terms = soft rules (context-dependent) - until promoted.

06

Cross-Session Confidence

The system measures confidence based on distinct sessions where the correction appeared, not just raw count.

Organisation Intelligence

Organisation Intelligence Layer

Your organisation's accumulated language memory - glossaries, dialects, corrections, all learning from every project.

01

Custom Glossaries

Create multiple glossaries for brand, dialect, technical, formal, and informal terms. Private or global.

02

CSV Import

Import term lists from CSV files - map columns to "source term" and "preferred term".

03

Folder Linking

Link a glossary to a folder - it auto-applies to all projects in that folder and its sub-folders.

04

Team Comments

Add comments to individual terms - discuss meaning, context, or usage.

05

Term Voting

Upvote or downvote terms - see team consensus on specific terminology.

06

Admin Approval

For global glossaries: terms go through admin approval before becoming active.

Export & Formats

Export & Formats

Export your work in the right format for any platform.

01

SRT / VTT

Export captions in SRT or WebVTT format - ready for YouTube, broadcasters, and video players.

02

Plain Text

Export the transcript as plain text - with or without speaker labels and timestamps.

03

Filter by Language

Export all languages, Welsh only, or English only - perfect for single-language subtitles.

04

Translation Tracks

Add translations in any language - export alongside the original transcript.

05

HLS Streams

Upload directly from an HLS / .m3u8 stream URL - no need to download the file first.

06

Multiple Formats

MP3, WAV, MP4, MOV, PDF, DOCX supported on upload - audio extraction handled for videos.

Module 2Live
Live Captions

Captions as the conversation happens

For events, broadcasts, council meetings and live-streamed lectures - bilingual captions appearing alongside the speech, not after it.

01

Under 2 seconds in English

Sub-second latency for English speech. Captions land on the audience screen before the next sentence starts.

02

Under 5 seconds in Welsh

Welsh-aware live transcription with mutation cleanup and dialect support running on the same pipeline.

03

Audience screen + QR code

Project the live transcript to a screen at your venue, or hand the audience a QR for the bilingual feed on their phones.

04

Welsh Language Standards ready

Built for the bilingual delivery duties councils, Senedd committees, and Welsh public sector organisations are required to meet.

05

Glossaries apply live

Your team's place names, role titles, and brand terms - and BydTermCymru's 76,000+ terms - all corrected on the fly.

06

Record + edit afterwards

Live transcripts save as projects you can open in the editor for a tidy broadcast-grade version after the event.

Module 3Documents
Document Translation

Welsh and English documents, faithfully translated

PDF, Word, ODT, and plain text. Bilingual translation that preserves formatting and respects the conventions of formal Welsh writing.

01

Multiple formats in, multiple out

Upload .pdf, .docx, .odt, .txt. Receive the translated version in your choice of format, plus side-by-side bilingual view.

02

Glossary-aware

Your organisation's term canon applies to every document. Place names, role titles, and policy terminology stay consistent.

03

Mutation-aware Welsh

Soft, nasal, and aspirate mutations applied correctly - the difference between machine-translated Welsh and Welsh that reads naturally.

04

Optional human review

Send the AI translation to a Human-Verified linguist for a professional pass. Returns to you with tracked changes you can accept or amend.

05

Built for public sector

Designed for council reports, NHS Wales materials, education resources, and any document that needs the same authority in both languages.

06

Block-level structure preserved

Headings, lists, and tables hold their structure. Translation lands where it belongs without breaking your layout.

Module 4Human-Verified
Human-Verified

When broadcast standard is non-negotiable

Qualified Welsh linguists review, correct and certify your captions. Used by broadcasters, the public sector and creators where accuracy is non-negotiable.

01

Quote request

Submit duration, subject, language direction and deadline. Admin sets a price within hours.

02

Qualified linguist

Once accepted, a Welsh-fluent translator from the approved pool picks up your project in their portal.

03

Reviewed + audited

Every edit is logged in an audit trail. Broadcast-standard segmentation and mutations checked end-to-end.

04

Signed delivery

SRT, VTT and EBU-TT delivered via signed download links. PDF invoice attached. 24h link expiry.

Transcription & Editing

Everything you need to go from audio file to professional captions.

01

Cymraeg-First Engine

Purpose-built for Welsh phonetics, soft/nasal/aspirate mutations (treigladau), and dialect variation - not a generic AI.

02

Word-Level Timestamps

Every word timestamped to the millisecond. Perfect sync for captions, subtitles, and waveform editing.

03

Speaker Diarisation

Identify multiple speakers with colour-coded labels in the editor. AI-assisted with human confirmation.

04

Bilingual Language Detection

Auto-detect Welsh and English per segment - perfect for bilingual content.

05

AI Punctuation

Add punctuation, capitalisation, and sentence structure using AI - for both Welsh and English.

06

Caption Styles

5 caption styles: single-word, single-short, single-long, multi-short, and multi-long - with broadcast CPS limits.

07

Project Folders

Organise projects into folders and sub-folders. Drag and drop to reorganise. Glossaries linked to folders.

08

Script Alignment

Upload an original script - the system aligns words to timestamps, correcting wording without losing sync.

09

Version History

Save snapshots of your transcript at any point. Restore previous versions easily.

API-First Platform

Language Intelligence as a Service

Embed Capsiynau in internal tools, workflows, and AI agents. Turn content into language intelligence.

Transcribe

Convert audio/video to structured captions

Metadata

AI-assisted summaries, chapters & JSON-LD metadata

Refine & Polish

Apply scripts, glossaries & AI improvements

Compliance Scan

Flag sensitive topics & brand mentions

Easy Integration

RESTful API with webhooks, async jobs, and SDKs for your language.

High Performance

Optimized for streaming, batch processing, and real-time captions.

Full Observability

Track usage, errors, and performance with detailed analytics.

New on npm

@capsiynau/sdk-transcribe

Use the official Node SDK to submit transcription jobs, poll status, and fetch subtitles in a few lines of code.

npm install @capsiynau/sdk-transcribe

Sign in to view full API documentation

Who Built Capsiynau

Built by producers, for producers. Not a tech project - a production tool.

01

Developed by Working Producers

Built by people who work in Welsh broadcast production - not by tech developers distant from the industry.

02

Tested with Broadcasters

Battle-tested in real production workflows with Welsh broadcasters and content creators.

03

Welsh-First, Always

Welsh isn't an afterthought. Every design decision - from the AI engine to the UI - starts with Welsh.

04

Open Roadmap

We build in the open. Features are driven by real feedback from Welsh producers and broadcasters.

Ready to Get Started?

Try it free - no credit card required. 15 minutes of transcription included.