Transcripts with timestamps let your viewers:
Mandala Audio-Video accepts transcripts in the WebVTT format, which we describe below. You can also reference the official WebVTT specification.
If you just want the static transcript next to the media, you don't need timestamps. In this case, you're reading the wrong guide! Go to Transcripts without Timecodes instead.
Once you've created your file, you'll upload the transcript in the Mandala Audio-Video editor. Check out the detailed steps at the bottom of the page.
You can create WebVTT files in any text editor, as long as you save with the extension .srt. Here's an example of a WebVTT transcript.
WebVTT 00:00:01.00 --> 00:00:03.01 Hello, welcome to Audio-Video. 00:00:04.00 --> 00:00:05.02 You can create dynamic transcripts with Audio-Video. 00:00:05.15 --> 00:00:7.00 We can help you get started quickly |
Each file will consist of the following:
Each transcript "line" or "unit" consists of...
Timestamps let you add a start and end time for the transcript. This helps Mandala associate a moment in the video with the text. Timestamps consist of the following, all on the same line:
Mandala recognizes multilingual transcripts. This lets you include multiple languages and translations in one transcript file. You can use these transcripts to make multilingual captions.
Separating languages is more flexible in Mandala than in the official WebVTT standard. You can separate languages by:
Here's an example using / to separate out languages.
WebVTT 00:00:01.00 --> 00:00:03.01 Hello, welcome to Audio-Video./ Hola, bienvenido a Audio-Video. 00:00:04.00 --> 00:00:05.02 You can create dynamic transcripts with Audio-Video./ Usted puede crear transcripciónes con Audio-Video. 00:00:05.15 --> 00:00:7.00 We can help you get started quickly./ Nosotros podemos ayudarle a empezar rápidamente. |
Make sure each language always appears in the same order. In the case above, the English transcript always comes before the Spanish transcript. |
If you want to add speakers, the VTT format makes it possible. Use <v Name> at the beginning of the transcript text.
Here's an example:
WebVTT 00:00:01.00 --> 00:00:03.01 <v Batman>Hello, welcome to Audio-Video. 00:00:04.00 --> 00:00:05.02 <v Superman>You can create dynamic transcripts with Audio-Video. 00:00:05.15 --> 00:00:7.00 <v Batman>We can help you get started quickly |
You can also use a translated speaker's name if you like. In this case, repeat <v Name> at the beginning of the transcript text for each language.
WebVTT 00:00:01.00 --> 00:00:03.01 <v Employee>Hello, welcome to Audio-Video./ <v Empleado>Hola, bienvenido a Audio-Video. 00:00:04.00 --> 00:00:05.02 <v Employee>You can create dynamic transcripts with Audio-Video./ <v Empleado>Usted puede crear transcripciónes con Audio-Video. 00:00:05.15 --> 00:00:7.00 <v Employee>We can help you get started quickly./ <v Empleado>Nosotros podemos ayudarle a empezar rápidamente. |