Try it now, no signup
Record live or drop in a file (up to 30 MB) and watch it transcribe.
Tap to start recording from your microphone
WebVTT is the captioning standard for HTML5 video and the web, and RealtimeVoiceKIT generates it for you automatically. Upload a recording, get well-timed captions, and download a .vtt file ready for the <track> element, optionally translated into another language.
Great for
HTML5 & web video
Add .vtt captions with the standard <track> element.
Streaming & players
Use WebVTT where SRT isn't supported.
Accessibility
Meet caption requirements with accurate, timed text.
Localization
Ship translated VTT files for international audiences.
What you get
How it works
Upload
Add your audio or video file, or paste a URL.
AI captions
Speech becomes readable, well-timed caption cues.
Download .vtt
Export a WebVTT file ready for the web, translated if needed.
Frequently asked questions
What is a VTT file?
WebVTT (.vtt) is the W3C subtitle format used by HTML5 video via the <track> element. It supports styling and positioning that SRT does not.
SRT or VTT, which should I use?
Use VTT for web and HTML5 players and SRT for broad platform compatibility. RealtimeVoiceKIT exports both from the same transcript.
Can I translate VTT captions?
Yes. Translate captions into 100+ languages with timing preserved, then export the translated .vtt.
Is it free to try?
Yes. 10 free minutes every month including VTT export and no credit card required.