Aegisub is a free, cross-platform open source tool for creating and modifying subtitles. Aegisub makes it quick and easy to time subtitles to audio, and features many powerful tools for styling them, including a built-in real-time video preview.
Finding the Transcript Feature
YouTube has a hidden transcript feature. Click on the 3 dots to the lower right of the video:
- above the “Subscribe” button and
- to the right of the “Save” button
After clicking the 3 dots, click “Open Transcript” to pull up the transcript to the right of the video.
Download Transcripts
These transcripts can be downloaded with youtube-dl, a command-line program written in Python:
youtube-dl --skip-download --write-auto-sub https://www.youtube.com/watch?v=iKvFlSedpNI
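When several videos need transcripts, the same command can be driven from a script. The following is a minimal sketch, assuming youtube-dl is installed and on the PATH. The helper names `build_subtitle_command` and `download_auto_subtitles` are hypothetical, and the optional `--sub-lang` flag is an addition not used in the command above.

```python
import subprocess


def build_subtitle_command(video_url, language=None):
    """Build the youtube-dl argument list that fetches auto-generated subtitles only."""
    command = ["youtube-dl", "--skip-download", "--write-auto-sub"]
    if language is not None:
        # Assumption: restrict the download to one subtitle language, e.g. "en".
        command += ["--sub-lang", language]
    command.append(video_url)
    return command


def download_auto_subtitles(video_url, language=None):
    """Run youtube-dl; raises CalledProcessError if it exits with a non-zero status."""
    subprocess.run(build_subtitle_command(video_url, language), check=True)
```

Calling `download_auto_subtitles("https://www.youtube.com/watch?v=iKvFlSedpNI")` should write the subtitle file into the current directory without downloading the video itself.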
There is a site that provides a nice UI to make it easier to clean up the generated transcript.
Parse VTT Formatted Transcripts
The VTT format can be parsed with the webvtt-py Python library:
Requires Python 3.4+.
Documentation is available at http://webvtt-py.readthedocs.io.
Installation
$ pip install webvtt-py
Usage
import webvtt

for caption in webvtt.read('captions.vtt'):
    print(caption.start)
    print(caption.end)
    print(caption.text)
Listen Notes is the best podcast search engine™. It’s like Google, but for podcasts.
Search the whole Internet’s podcasts.
- Listeners find ALL podcast episodes interviewing or talking about a person.
- Journalists do research and find information in podcasts.
- Students learn specific topics from podcasts.
- Podcasters find cross-promotion opportunities.
- Developers use Listen API to build podcast apps.
- More use cases of Listen Notes podcast search engine
The Google Cloud Speech API has specific support for the asynchronous transcription of speech recordings of up to 3 hours.
Setting up the project and service account
Make sure that you can access the Google Cloud Dashboard with your Google account. I created a new project for this experiment called
This tutorial will walk through using Google Cloud Speech API to transcribe a large audio file.
All code and sample files can be found in the speech-to-text GitHub repo.