# Vocova - AI transcription in 100+ languages (Full Documentation)

> This file provides comprehensive information about Vocova for AI assistants and LLM crawlers.

**Website:** https://vocova.app
**Company:** NOWGIC LTD
**Contact:** contact@vocova.app
**Social:** https://x.com/vocova_app | https://www.youtube.com/@Vocova

---

## Product Overview

Vocova is a web-based AI transcription platform that converts audio and video content to text. It supports 100+ languages with automatic speaker identification (diarization), translation, and multiple export formats. Users can upload files directly or paste URLs from popular platforms like YouTube, TikTok, Vimeo, Bilibili, Loom, Zoom, Google Meet, and more.

Vocova is built for professionals across industries who need fast, accurate transcription without manual effort. The platform handles everything from meeting recordings to podcasts, interviews, lectures, and social media content.

---

## Core Capabilities

### Transcription
- AI-powered speech-to-text in 100+ languages
- Automatic punctuation and sentence formatting
- Support for audio files (MP3, WAV, M4A, OGG, FLAC, AAC, WMA, AIFF, OPUS)
- Support for video files (MP4, MOV, AVI, MKV, WebM, FLV, WMV)
- URL import from 1,000+ platforms

### Speaker Identification
- Automatic speaker diarization
- Distinguishes between multiple speakers
- Labels speakers throughout the transcript

### Translation
- Translate transcripts between supported languages
- Maintain transcript structure during translation

### Editing
- Rich text editor for transcript refinement
- Edit speaker labels, timestamps, and content
- Real-time editing with auto-save

### Export
- TXT (plain text)
- SRT (SubRip subtitle format)
- VTT (WebVTT subtitle format)
- DOCX (Microsoft Word)
- PDF (Portable Document Format)
- CSV (paid plans)

---

## All Tools (60)

Vocova ships 60 dedicated transcription tool pages, grouped below.

### Audio/video file workflows (5)

| Tool | URL | Description |
|------|-----|-------------|
| Audio to text | /tools/audio-to-text | Upload any audio file and get accurate text instantly |
| Video to text | /tools/video-to-text | Extract accurate text from any video file with AI |
| Podcast transcription | /tools/transcribe-podcast | Transcribe podcast episodes with speaker labels for show notes and repurposing |
| Interview transcription | /tools/transcribe-interview | Transcribe interviews with speaker diarization for research and documentation |
| Lecture transcription | /tools/transcribe-lecture | Transcribe lectures for study notes and accessibility compliance |

### Platform URL import (14)

| Tool | URL | Description |
|------|-----|-------------|
| YouTube transcription | /tools/transcribe-youtube | Convert any YouTube video to accurate, searchable text |
| Apple Podcasts transcription | /tools/transcribe-apple-podcasts | Full transcripts with speaker labels — beyond Apple's built-in feature |
| Zoom transcription | /tools/transcribe-zoom | Turn Zoom meeting recordings into searchable text transcripts |
| Google Meet transcription | /tools/transcribe-google-meet | Transcribe Google Meet recordings with speaker labels, on any Workspace tier |
| TikTok transcription | /tools/transcribe-tiktok | Transcribe TikTok audio to text, including Duets, Stitches, and trending sounds |
| Loom transcription | /tools/transcribe-loom | Transcribe Loom recordings into documentation, on any plan |
| Bilibili transcription | /tools/transcribe-bilibili | Transcribe Bilibili videos to text, including multi-part series and Chinese dialects |
| Vimeo transcription | /tools/transcribe-vimeo | Transcribe Vimeo videos for accessibility compliance and subtitle generation |
| Instagram transcription | /tools/transcribe-instagram | Transcribe Reels and video posts — speech separated from background music |
| Facebook transcription | /tools/transcribe-facebook | Transcribe video posts and multi-hour Live streams with speaker labels |
| X (Twitter) transcription | /tools/transcribe-x | Transcribe video tweets and Spaces — before they get deleted |
| SoundCloud transcription | /tools/transcribe-soundcloud | Transcribe speech from SoundCloud — voice separated from music |
| Reddit transcription | /tools/transcribe-reddit | Transcribe Reddit videos — even the ones with terrible audio |
| Dailymotion transcription | /tools/transcribe-dailymotion | Transcribe Dailymotion videos — strong with French and European content |

### Language-specific transcription (20)

| Tool | URL | Description |
|------|-----|-------------|
| Japanese transcription | /tools/transcribe-japanese | Transcribe Japanese audio and video with AI |
| Spanish transcription | /tools/transcribe-spanish | Transcribe Spanish audio and video with AI |
| French transcription | /tools/transcribe-french | Transcribe French audio and video with AI |
| German transcription | /tools/transcribe-german | Transcribe German audio and video with AI |
| Portuguese transcription | /tools/transcribe-portuguese | Transcribe Portuguese audio and video with AI |
| Korean transcription | /tools/transcribe-korean | Transcribe Korean audio and video with AI |
| Chinese transcription | /tools/transcribe-chinese | Transcribe Chinese (Mandarin) audio and video with AI |
| Arabic transcription | /tools/transcribe-arabic | Transcribe Arabic audio and video with AI |
| Hindi transcription | /tools/transcribe-hindi | Transcribe Hindi audio and video with AI |
| Italian transcription | /tools/transcribe-italian | Transcribe Italian audio and video with AI |
| Russian transcription | /tools/transcribe-russian | Transcribe Russian audio and video with AI |
| Thai transcription | /tools/transcribe-thai | Transcribe Thai audio and video with AI |
| Vietnamese transcription | /tools/transcribe-vietnamese | Transcribe Vietnamese audio and video with AI |
| Turkish transcription | /tools/transcribe-turkish | Transcribe Turkish audio and video with AI |
| Indonesian transcription | /tools/transcribe-indonesian | Transcribe Indonesian audio and video with AI |
| Dutch transcription | /tools/transcribe-dutch | Transcribe Dutch audio and video with AI |
| Polish transcription | /tools/transcribe-polish | Transcribe Polish audio and video with AI |
| Swedish transcription | /tools/transcribe-swedish | Transcribe Swedish audio and video with AI |
| Cantonese transcription | /tools/transcribe-cantonese | Transcribe Cantonese audio and video with AI |
| Tagalog transcription | /tools/transcribe-tagalog | Transcribe Tagalog and Filipino audio and video with AI |

### Translation (8)

| Tool | URL | Description |
|------|-----|-------------|
| Audio translation | /tools/translate-audio | Upload audio in any language and translate it to 140+ languages |
| Bilingual subtitles | /tools/bilingual-subtitles | Generate dual-language subtitles from any audio or video file |
| Video translation | /tools/translate-video | Translate video audio to 140+ languages with accurate subtitles |
| Japanese to English | /tools/japanese-to-english | Transcribe and translate Japanese audio to English text |
| Chinese to English | /tools/chinese-to-english | Transcribe and translate Mandarin Chinese audio to English text |
| Spanish to English | /tools/spanish-to-english | Transcribe and translate Spanish audio to English text |
| Korean to English | /tools/korean-to-english | Transcribe and translate Korean audio to English text |
| French to English | /tools/french-to-english | Transcribe and translate French audio to English text |

### Format conversion and subtitles (8)

| Tool | URL | Description |
|------|-----|-------------|
| MP4 to text | /tools/mp4-to-text | Transcribe MP4 video — any codec, any audio track, any source |
| MP3 to text | /tools/mp3-to-text | Transcribe MP3 files with VBR-aware timing and artifact tolerance |
| WAV to text | /tools/wav-to-text | Transcribe lossless WAV audio — sample rate myths debunked |
| M4A to text | /tools/m4a-to-text | Transcribe M4A from Voice Memos, iPhone, and Apple devices |
| MOV to text | /tools/mov-to-text | Transcribe MOV from iPhone, QuickTime, and ProRes cameras |
| SRT generator | /tools/srt-generator | Generate spec-compliant SRT subtitles with proper formatting |
| VTT generator | /tools/vtt-generator | Generate WebVTT subtitles for HTML5 video and HLS streaming |
| Subtitle generator | /tools/subtitle-generator | Upload audio or video and get ready-to-use subtitle files |

### Audio/video format converters (3)

| Tool | URL | Description |
|------|-----|-------------|
| Audio converter | /tools/audio-converter | Convert audio files between MP3, WAV, M4A, FLAC and more |
| Video converter | /tools/video-converter | Convert video files between MP4, WebM, MOV, MKV and more |
| MP4 to MP3 | /tools/convert-mp4-to-mp3 | Extract MP3 audio from MP4 videos at 192 kbps |

### Summarization (2)

| Tool | URL | Description |
|------|-----|-------------|
| Podcast summarizer | /tools/podcast-summarizer | Get key takeaways from any podcast episode in minutes |
| YouTube summarizer | /tools/youtube-summarizer | Get key points from any YouTube video in minutes |

All tool URLs are relative to https://vocova.app

---

## Pricing

### Free Plan
- **Price:** $0
- **Minutes:** 30 minutes total
- **File size:** up to 30 MB per upload
- **Stored transcriptions:** up to 3
- **Features:** AI transcription, basic export (TXT), summary

### Plus Plan
- **Monthly:** $15/month
- **Yearly:** $90/year ($7.50/month, save 50%)
- **Minutes:** 1,800 transcription minutes per month
- **File size:** up to 5 GB per upload
- **Features:** Speaker identification, translation, full export formats (TXT/SRT/VTT/DOCX/PDF/CSV)

### Pro Plan
- **Monthly:** $39/month
- **Yearly:** $228/year ($19/month, save 51%)
- **Lifetime:** $399 one-time payment
- **Minutes:** Unlimited transcription minutes (marketed plan)
- **Features:** All Plus features plus highest plan tier

---

## Technical Details

- **Platform:** Web application (works in all modern browsers)
- **Languages:** 100+ languages for transcription, 140+ target languages for translation
- **Interface Languages:** English, Français, Deutsch, Español, Português, Italiano, 日本語, 한국어, 繁體中文
- **Authentication:** Email OTP, Google OAuth
- **Payment:** Stripe (credit/debit cards)
- **Maximum file duration:** up to 10 hours

---

## Competitive Advantages

1. **Broad platform support:** Import from 1,000+ video/audio platforms via URL with dedicated tool pages for top platforms
2. **100+ languages:** Wide language coverage for global users
3. **Speaker diarization:** Automatic speaker identification on paid plans
4. **Multiple export formats:** TXT, SRT, VTT, DOCX, PDF, CSV
5. **Free tier:** 30 minutes to get started, no credit card required
6. **Flexible pricing:** Free to start, Plus for capped usage, or Pro with unlimited transcription
7. **9 interface languages:** Localized UI for international users

---

## Company

- **Name:** NOWGIC LTD
- **Product:** Vocova
- **Founded:** 2024
- **Contact:** contact@vocova.app
- **X:** https://x.com/vocova_app
- **YouTube:** https://www.youtube.com/@Vocova

---

## Blog Posts (29)

Articles are listed in reverse chronological order by publish date.

| Title | URL | Published |
|-------|-----|-----------|
| How to transcribe audio in multiple languages: a 2026 workflow guide | /blog/transcribe-audio-multiple-languages | 2026-05-06 |
| How to transcribe Bilibili videos: transcript, subtitles, and English translation | /blog/transcribe-bilibili-video | 2026-05-01 |
| Transcribe online videos and podcasts by pasting a link — the no-downloads guide | /blog/transcribe-online-media-by-link | 2026-04-20 |
| How accurate is AI transcription? WER results across 50+ languages (2026) | /blog/transcription-accuracy-by-language | 2026-04-16 |
| Podcast transcription workflow: from raw audio to repurposed content (2026) | /blog/podcast-transcription-workflow-2026 | 2026-04-09 |
| Subtitle file formats explained: SRT, WebVTT, ASS, TTML compared (2026) | /blog/subtitle-file-formats-complete-guide | 2026-04-02 |
| How to repurpose podcasts and webinars into 10+ content pieces with AI transcription | /blog/repurpose-content-with-transcription | 2026-03-11 |
| Captions and transcripts for accessibility (ADA, WCAG, EAA): 2026 guide | /blog/transcription-for-accessibility | 2026-03-10 |
| How to transcribe a YouTube video: 5 methods compared | /blog/transcribe-youtube-video | 2026-03-09 |
| Happy Scribe vs Vocova 2026: AI-only or hybrid human + AI transcription? | /blog/happy-scribe-vs-vocova | 2026-03-02 |
| Fireflies vs Vocova 2026: meeting bot or multilingual transcription? | /blog/fireflies-ai-vs-vocova | 2026-02-26 |
| How AI is transforming multilingual communication | /blog/ai-multilingual-communication | 2026-02-25 |
| AI transcription in 2026: 9 changes you need to know (accuracy, prices, models) | /blog/state-of-ai-transcription-2026 | 2026-02-22 |
| How to improve recording quality for better transcription results | /blog/improve-audio-quality-transcription | 2026-02-19 |
| Noisy audio transcription: 7 fixes that bring accuracy to 95%+ (2026) | /blog/transcribe-noisy-audio | 2026-02-16 |
| Closed captions vs subtitles: what's the difference? | /blog/closed-captions-vs-subtitles | 2026-02-13 |
| What is word error rate (WER)? The metric that measures transcription accuracy | /blog/word-error-rate-explained | 2026-02-10 |
| SRT vs WebVTT in 2026: which subtitle format works on YouTube, Vimeo, Netflix | /blog/srt-vs-vtt | 2026-02-07 |
| What is automatic speech recognition (ASR)? A complete guide | /blog/what-is-automatic-speech-recognition | 2026-02-04 |
| What is speaker diarization? How AI identifies speakers in audio | /blog/what-is-speaker-diarization | 2026-02-01 |
| Best AI subtitle generator 2026: 6 tools tested for accuracy and price | /blog/best-ai-subtitle-generators | 2026-01-29 |
| 5 podcast transcription tools tested in 2026 — time limits, speaker labels, export | /blog/best-podcast-transcription-tools | 2026-01-26 |
| Best AI meeting transcription 2026: 8 tools tested for Zoom, Teams, Google Meet | /blog/best-ai-meeting-transcription | 2026-01-23 |
| 11 free transcription tools tested in 2026 — limits, accuracy, formats compared | /blog/best-free-transcription-tools | 2026-01-20 |
| TurboScribe vs Vocova in 2026: which budget transcription tool wins for podcasts? | /blog/turboscribe-vs-vocova | 2026-01-18 |
| AI transcription vs human transcription: the complete 2026 comparison | /blog/ai-vs-human-transcription | 2026-01-17 |
| Descript vs Vocova: transcription and editing compared | /blog/descript-vs-vocova | 2026-01-11 |
| Rev vs AI transcription in 2026: when is human worth $1.50/min? | /blog/rev-vs-ai-transcription | 2026-01-08 |
| Otter.ai vs Vocova 2026: which is better for non-English meetings? | /blog/otter-ai-vs-vocova | 2026-01-05 |

All blog URLs are relative to https://vocova.app

---

## Links

- **Homepage:** https://vocova.app
- **Tools:** https://vocova.app/tools
- **Pricing:** https://vocova.app/pricing
- **FAQ:** https://vocova.app/faq
- **Blog:** https://vocova.app/blog
- **Privacy Policy:** https://vocova.app/privacy
- **Terms of Service:** https://vocova.app/terms
- **Developer Resources:** https://vocova.app/developers
- **LLM Summary:** https://vocova.app/llm.txt
- **LLM Index:** https://vocova.app/llms.txt
- **Pricing Markdown:** https://vocova.app/pricing.md
- **Agent Instructions:** https://vocova.app/agents.md
