Note : This repository is one of the challenges made for the event "ForkThis" organized by the chapter Computer Society of India(CSI), VIT Vellore in the year 2025.
This project focuses on automatic music genre classification using pre-extracted audio features. The main goal is to train machine learning models that can predict the genre of a track based on its numerical audio characteristics.
The dataset consists of two main CSV files:
tracks.csv β metadata for tracks. This file contains hierarchical column headers (two rows of headers). From this, only the track ID and the top-level genre label were extracted for use.
features.csv β numerical audio features for each track, with track IDs as indices.
A subset of the dataset was used, focusing only on tracks with a clearly defined top genre.
Dataset link : https://drive.google.com/drive/folders/1tnlfqa4KoZFVeQ-xMq9rHM3i6TJeRFqI?usp=sharing
git clone https://github.com/a-niveditha/MusicGenreClassification.git
cd MusicGenreClassification
python -m venv venv
#First run:
cd venv/Scripts
#Then:
./Activate.ps1
#First run:
cd venv/bin
#Then:
source ./activate
pip install -r requirements.txt