How to Use OpenAI and FFmpeg to Generate Subtitles for Any Video

Welcome to an in-depth look behind the scenes on how to use the ChatGPT platform API to create subtitles and how to embed them directly into a video.

This functionality is a core part of Dugongi, and in this tutorial, I’ll walk you through building this capability yourself. You’ll need to have FFmpeg installed, an account with OpenAI, and a Node.js environment set up.

Core Operations with FFmpeg

FFmpeg is essential for transforming and converting audio from one format to another. In this tutorial, we will first use FFmpeg to extract audio from a video file. Then, after obtaining subtitles from OpenAI, we will use FFmpeg again to embed the subtitles into the video.

Extracting Audio from Video

Assuming you have a video file named myvideo.webm, the first step is to extract the audio. To manage API limitations effectively, compress the audio as much as possible.

ffmpeg -i myvideo.webm -ac 1 -b:a 16k -map a output.webm

FFmpeg parameters explained:

-i: Specifies the input file.
-ac 1: Uses just one audio channel, meaning mono audio.
-b:a 16k: Converts to a 16k bitrate.
-map a: Extracts only the audio stream.

The result is a compressed audio file named output.webm, ready for subtitle generation.

Generating Subtitles with OpenAI

The OpenAI API offers a straightforward endpoint for audio inputs. Although there are libraries for several programming languages, here's an example in Node.js:

This process reencodes the video with subtitles embedded, which may take some time.

Taking a Shortcut with Dugongi

All these steps can be streamlined using Dugongi. Simply record a new video or upload an existing one to the Dugongi cloud. Then click the "create subtitles" icon under the video to generate subtitles automatically. Finally, click the "mp4" icon and select "burn subtitles" to embed them directly into your video.

How to Use OpenAI and FFmpeg to Generate Subtitles for Any Video

Core Operations with FFmpeg

Extracting Audio from Video

Generating Subtitles with OpenAI

Burning Subtitles into the Video

Taking a Shortcut with Dugongi