Using Python APIs and AI to download YouTube videos, convert to mp3, transcribe audio

derailed-dash/youtube-and-video

Youtube and Video

Video Intelligence Application

Repo Overview

This repo describes an end-to-end journey. Briefly:

  • We start with an idea. Here, the goal is to work with videos, which could be on YouTube. We want to download videos, extract audio, transcribe, translate, and potentially summarise the content.
  • We experiment on this idea, using a Jupyter notebook, with Python.
  • We try out a few libraries and a couple of classical AI models.
  • Ultimately, we build a solution that makes use of Google Gemini multimodal GenAI.
  • Then we turn the notebook into a web application, using Streamlit.
  • Then we package the application as a container.
  • And finally, we host the application on Google Cloud's serverless Cloud Run service.

You can choose to follow / make use of any parts of this journey.

The journey is in three parts. Additionally, each part is supported with a walkthrough, which you can find on Medium.

MVP notebook

In this notebook I demonstrate:

  • A Jupyter notebook that provides a minimum viable product for a YouTube video downloader application.
  • How to quickly set up and use the notebook, including how to run it with zero install effort, in Google Colab.
  • Three different ways to download YouTube videos and extract audio to mp3.
  • Using the Python Speech Recognition library, along with the Google Speech Recognition API, to transcribe mp3 audio into text.
  • Extracting pre-existing transcripts from YouTube videos, and how to translate such transcripts.

See walkthrough.
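
The notebook's own three download-and-extract approaches are not reproduced here. As one illustration of the "extract audio to mp3" step, here is a minimal sketch that assumes the `ffmpeg` command-line tool is installed and that the video has already been downloaded to a local file. The function name `build_mp3_command` and the file name `talk.mp4` are hypothetical, chosen only for this example.

```python
from pathlib import Path


def build_mp3_command(video_path: str, bitrate: str = "192k") -> list[str]:
    """Build an ffmpeg command that extracts a video's audio track as mp3.

    Assumption: ffmpeg is on the PATH. This only builds the command;
    it does not run it.
    """
    src = Path(video_path)
    dst = src.with_suffix(".mp3")  # same name and folder, .mp3 extension
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it exists
        "-i", str(src),            # input video file
        "-vn",                     # drop the video stream, keep audio only
        "-codec:a", "libmp3lame",  # encode audio with the LAME mp3 encoder
        "-b:a", bitrate,           # target audio bitrate
        str(dst),
    ]
```

To actually run the conversion, you could pass the result to `subprocess.run(build_mp3_command("talk.mp4"), check=True)`. Building the command as a list, rather than a single string, avoids shell quoting problems with file names that contain spaces.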

  • Using the Google Video Intelligence API to provide more reliable and more accurate transcription.
  • Using Google Gemini Generative AI to transcribe, translate and summarise video content.
  • How to build your Jupyter notebook so it can run locally, in Google Colab, or in Google Vertex AI Workbench.

See walkthrough.
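
To make one notebook behave sensibly in all three places, it helps to detect where it is running. The sketch below is one possible way to do that, not necessarily how the notebooks in this repo do it. Checking for `google.colab` in `sys.modules` is a widely used Colab test. The marker file path used for Vertex AI Workbench is an assumption about Google Cloud notebook images and may not hold for every image. The function name `detect_runtime` is hypothetical.

```python
import os
import sys


def detect_runtime(modules=None, path_exists=os.path.exists) -> str:
    """Return "colab", "workbench" or "local" for the current runtime.

    The parameters exist only so the logic can be exercised without
    actually being in Colab or Workbench.
    """
    modules = sys.modules if modules is None else modules

    # Colab kernels have the google.colab package loaded.
    if "google.colab" in modules:
        return "colab"

    # Assumption: Google Cloud notebook images ship this marker file.
    if path_exists("/opt/deeplearning/metadata/env_version"):
        return "workbench"

    return "local"
```

A notebook cell could then branch on `detect_runtime()` to decide, for example, whether to prompt for interactive authentication or to rely on credentials already present in the environment.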

Video Intelligence Architecture

  • Provide a UI in the form of a Streamlit application
  • Containerise the application using Docker
  • Host the application on Google Cloud Run

See walkthrough.
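
One detail that matters when moving a Streamlit app into a container on Cloud Run is the listening port: Cloud Run tells the container which port to use through the `PORT` environment variable (8080 by default). The sketch below builds a Streamlit start command that honours it. It is an illustration under assumptions, not the repo's actual entrypoint: the function name `streamlit_command` and the file name `app.py` are hypothetical.

```python
import os


def streamlit_command(app_file: str = "app.py", env=None) -> list[str]:
    """Build the command used to start Streamlit inside a container.

    Assumption: app_file is the Streamlit script; "app.py" is a placeholder.
    """
    env = os.environ if env is None else env
    port = env.get("PORT", "8080")  # Cloud Run injects PORT at runtime
    return [
        "streamlit", "run", app_file,
        f"--server.port={port}",     # listen on the port Cloud Run expects
        "--server.address=0.0.0.0",  # accept traffic from outside the container
        "--server.headless=true",    # do not try to open a browser
    ]
```

The same flags could equally be written directly into a Dockerfile's start command. Building them in Python just makes the port handling easy to check locally before deploying.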

Overview of Jupyter Notebooks

If you don't know much about Jupyter notebooks, then I suggest you start with my article here, which covers:

  • The value and point of Jupyter notebooks.
  • Good use cases for Jupyter notebooks.
  • Several ways to run the notebooks.
  • How to run your own notebooks, or someone else's (like the ones in this repo), quickly and easily, for free, in Google Colab.

Running the Jupyter Notebook Locally

Here we create a Python virtual environment, install Jupyter Notebook into the environment, and then run our notebooks from there.

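# These commands are for Windows, using the py launcher
# Upgrade pip
py -m pip install --upgrade pip

# Create virtual env, if you haven't already
py -m venv .venv

# Activate the venv (on Linux/macOS: source .venv/bin/activate)
./.venv/Scripts/activate

# Install requirements - i.e. notebook
py -m pip install -r requirements.txt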

Now you can use your venv as your Jupyter kernel.
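
If you want to confirm that a notebook is really using the venv, a quick check from a notebook cell can help. This is a small sketch using only the standard library. The function names are hypothetical, and the check relies on the fact that, inside an environment created with `venv`, `sys.prefix` differs from `sys.base_prefix`.

```python
import sys


def in_virtualenv() -> bool:
    """True when the interpreter is running inside a venv."""
    return sys.prefix != sys.base_prefix


def describe_kernel() -> str:
    """Summarise which interpreter the current kernel is using."""
    kind = "virtual environment" if in_virtualenv() else "system interpreter"
    return f"{kind}: {sys.executable}"


print(describe_kernel())
```

If the output reports the system interpreter, the notebook is not using the venv, and you may need to select a different kernel or start Jupyter from the activated environment.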