MusicCaps

MusicCaps is a dataset composed of 5.5k music-text pairs, with rich text descriptions provided by human experts. For each 10-second music clip, MusicCaps provides:

A free-text caption consisting of four sentences on average, describing the music and
A list of music aspects, describing genre, mood, tempo, singer voices, instrumentation, dissonances, rhythm, etc.

Usage

conda create --name MusicCap python=3.9

conda activate MusicCap

pip install datasets yt-dlp pydub

Install FFmpeg

For Windows: Download FFmpeg, add the path to the system's environment variables.

For macOS/Linux:

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt update
sudo apt install ffmpeg

Pass cookies to yt-dlp

Log in YouTube
Use a conforming browser extension to export cookies, such as Get cookies.txt LOCALLY and Cookie-Editor for Chrome, cookies.txt for Firefox.
Copy and save the cookie (Netscape format) to the local file cookies.txt

Run

python Download.py

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
Download.py		Download.py
README.md		README.md
cookies.txt		cookies.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MusicCaps

Usage

Install FFmpeg

Pass cookies to yt-dlp

Run

About

Uh oh!

Releases

Packages

Languages

LixiangZhao98/MusicCaps

Folders and files

Latest commit

History

Repository files navigation

MusicCaps

Usage

Install FFmpeg

Pass cookies to yt-dlp

Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages