Skip to content

MusicCaps is a dataset composed of 5.5k music-text pairs, with rich text descriptions provided by human experts. This repo help you to download.

Notifications You must be signed in to change notification settings

LixiangZhao98/MusicCaps

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 

Repository files navigation

MusicCaps

MusicCaps is a dataset composed of 5.5k music-text pairs, with rich text descriptions provided by human experts. For each 10-second music clip, MusicCaps provides:

  1. A free-text caption consisting of four sentences on average, describing the music and

  2. A list of music aspects, describing genre, mood, tempo, singer voices, instrumentation, dissonances, rhythm, etc.

Usage

conda create --name MusicCap python=3.9

conda activate MusicCap

pip install datasets yt-dlp pydub

Install FFmpeg

For Windows: Download FFmpeg, add the path to the system's environment variables.

For macOS/Linux:

# macOS
brew install ffmpeg

# Ubuntu/Debian
sudo apt update
sudo apt install ffmpeg

Run

python Download.py

About

MusicCaps is a dataset composed of 5.5k music-text pairs, with rich text descriptions provided by human experts. This repo help you to download.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages