Transcribe any Bilibili video to text

Paste a BV or AV link and get a full transcript. Supports multi-part videos, Mandarin, Cantonese, and 100+ other languages.

Supported: bilibili.com, www.bilibili.com, b23.tv

The transcription tool that understands Bilibili's unique format

Bilibili isn't just another video site — it has multi-part videos (P1, P2, P3...), dense educational content, and audiences speaking Mandarin, Cantonese, Sichuan dialect, and dozens of other languages. Vocova handles all of it. Paste any BV or AV link and get an accurate, timestamped transcript. Note: danmaku (scrolling comments) are viewer-generated overlays and are not included in the transcript — only the video's spoken audio is captured.

How it works

1

Paste your Bilibili video link

Copy the URL from any Bilibili video page. Both BV-format (bilibili.com/video/BV...) and legacy AV-format URLs are supported.

  • Supports BV and AV URL formats
  • Select specific parts from multi-part videos
  • Works with any public Bilibili video
2

AI processes the audio with dialect awareness

Vocova extracts the audio and transcribes with high accuracy, automatically detecting Mandarin, Cantonese, and other Chinese dialects alongside 100+ global languages.

  • Auto-detects Mandarin, Cantonese, and regional dialects
  • Handles dense academic and educational content
  • Timestamps every segment for reference
3

Export your transcript

Download the full transcript or copy sections for study notes, blog posts, or subtitles.

  • Export as TXT, SRT, VTT, DOCX, or PDF
  • Edit and refine before exporting
  • Copy to clipboard for quick reuse

Features

Multi-part video support

Bilibili's P1/P2/P3 multi-part format is unique to the platform. Select a specific part to transcribe, so you get exactly the section you need without processing the entire series.

Chinese dialect handling

Bilibili content spans Mandarin, Cantonese, Sichuan dialect, and other regional varieties. Vocova detects and transcribes each dialect accurately, not just standard Mandarin.

Academic content density

Bilibili is a hub for educational content — university lectures, technical tutorials, and science explainers. Vocova handles the specialized vocabulary and dense information delivery common in these videos.

BV and AV URL formats

Whether you copy a modern BV-format URL or an older AV-format link, both are recognized and processed. Just paste whatever URL your browser shows.

100+ languages beyond Chinese

Bilibili hosts international content too. Japanese, English, Korean, and other languages are auto-detected and transcribed alongside Chinese-language videos.

Why choose Vocova

Turn educational videos into study notes

Bilibili's educational content is some of the best on the internet. Convert lectures and tutorials into text you can annotate, highlight, and review.

Search within video content

Video audio is unsearchable. With a transcript, you can instantly find specific explanations, formulas, or references across hours of educational content.

Create subtitles for cross-platform sharing

Export SRT or VTT files to add subtitles when sharing Bilibili content on other platforms, improving accessibility and reach.

Preserve dialect content in text

Regional dialect content on Bilibili captures cultural perspectives that standard Mandarin doesn't. Transcripts preserve this spoken content in a referenceable format.

Who can benefit

Students

Transcribe educational Bilibili videos into study notes for exam prep, research, and self-study across STEM and humanities subjects.

Content creators

Generate text versions of your Bilibili videos for SEO, cross-posting, and creating companion blog posts or documentation.

Researchers

Analyze Bilibili content for cultural, linguistic, and media studies. Transcripts make video content quotable and codable for academic work.

Translators

Start with an accurate Chinese transcript as the foundation for translation, rather than translating on-the-fly from audio.

Frequently asked questions

Start transcribing for free

Upload a file or paste a link from YouTube, TikTok, and 1,000+ platforms — get an accurate transcript in minutes. No credit card required.

Bilibili video transcription — Vocova