Powered by: ChatGPT, Claude, Google Gemini
Works with: Google Drive, Dropbox, OneDrive
Available on: Web. Coming soon: Extension, Desktop, Windows, Android, iOS, Mac
Works in: Chrome, Firefox, Safari, Edge
YouTube transcript generator

Generate a transcript for your videos

RealtimeVoiceKIT turns your video into an accurate, time-coded transcript you can publish, caption, and translate, with speaker labels and SRT/VTT export built in.

Try it now, no signup

Record live or drop in a file (up to 30 MB) and watch it transcribe.

A transcript makes a video work harder: it powers captions, descriptions, blog posts, and search. RealtimeVoiceKIT generates an accurate transcript from your video with timestamps and speaker labels, then lets you export broadcast-ready SRT and WebVTT captions and translate them into 100+ languages so your channel reaches a wider audience.

How creators use it

Captions

Export SRT and VTT files to add accurate captions, which can help boost watch time.

Descriptions & chapters

Use the transcript and AI summary to write descriptions and highlights.

Repurposing

Turn a video into a blog post or social clips from the transcript.

Localization

Translate captions into 100+ languages to grow internationally.

What's included

Timestamped transcript, SRT & VTT export, speaker labels, optional AI summary, translation in 100+ languages, developer API

How it works

01 Upload the video

Add your video file or paste a link to the recording.

02 Generate the transcript

We produce a clean, time-coded transcript with speaker labels.

03 Caption & translate

Export SRT or VTT and translate into the languages your audience speaks.

Frequently asked questions

What formats can I export?

Plain and timestamped text, plus SRT and WebVTT subtitle files ready to upload with your video.
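
For readers curious how the two subtitle formats differ, here is a minimal Python sketch that writes the same timed segments as SRT and as WebVTT. It is an illustration only, not RealtimeVoiceKIT's export code: the `(start, end, text)` segment shape and the function names `format_timestamp`, `to_srt`, and `to_vtt` are assumptions made for this example.

```python
def format_timestamp(seconds: float, sep: str) -> str:
    """Render a time in seconds as HH:MM:SS<sep>mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{sep}{ms:03d}"


def to_srt(segments):
    """SRT: numbered cues, comma before the milliseconds."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{format_timestamp(start, ',')} --> {format_timestamp(end, ',')}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    return "\n".join(blocks)


def to_vtt(segments):
    """WebVTT: 'WEBVTT' header, period before the milliseconds."""
    blocks = ["WEBVTT\n"]
    for start, end, text in segments:
        timing = f"{format_timestamp(start, '.')} --> {format_timestamp(end, '.')}"
        blocks.append(f"{timing}\n{text}\n")
    return "\n".join(blocks)


# Hypothetical sample segments: (start seconds, end seconds, caption text)
segments = [(0.0, 2.5, "Hello"), (2.5, 5.0, "World")]
print(to_srt(segments))
print(to_vtt(segments))
```

The visible differences are small: SRT numbers each cue and uses a comma in the timestamp, while WebVTT starts with a `WEBVTT` header line and uses a period.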

Can I translate the captions?

Yes. Translate the transcript and subtitles into 100+ languages while keeping the timing in sync.
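
As a rough sketch of what "keeping the timing in sync" means, the Python below replaces only the text of each cue and leaves the start and end times untouched. The `Cue` type, the `translate_cues` helper, and the toy glossary lookup standing in for a translator are all hypothetical; this is not a description of how RealtimeVoiceKIT translates captions.

```python
from dataclasses import dataclass, replace
from typing import Callable, List


@dataclass(frozen=True)
class Cue:
    start: float  # seconds
    end: float    # seconds
    text: str


def translate_cues(cues: List[Cue], translate: Callable[[str], str]) -> List[Cue]:
    """Return new cues with translated text and unchanged timing."""
    return [replace(cue, text=translate(cue.text)) for cue in cues]


# Placeholder translator: a tiny lookup table, used here only for illustration.
glossary = {"Hello": "Hola", "Thank you": "Gracias"}

cues = [Cue(0.0, 1.5, "Hello"), Cue(1.5, 3.0, "Thank you")]
translated = translate_cues(cues, lambda text: glossary.get(text, text))

for cue in translated:
    print(cue.start, cue.end, cue.text)
```

Because each translated cue keeps its original start and end, the same timestamps can be written out for every language.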

Does it label speakers?

Yes. Automatic speaker diarization labels each voice, useful for interviews and multi-host videos.

Is there a free plan?

Yes. 10 minutes of transcription every month, free, with no credit card required.

Generate a video transcript for free

Get 10 transcription minutes every month, no credit card required.