Skip to content

NeuroVoz: a Castillian Spanish corpus of Parkinsonian speech 2,977 audio files #26

@aguerrerolopez

Description

@aguerrerolopez

Hi!

You should add NeuroVoz dataset:
https://zenodo.org/records/13647600

is one of the most used datasets in Parkinsonian speech recognition. It contains 2977 audio files including 54 individuals diagnosed with Parkinson's Disease and 58 healthy controls, the NeuroVoz dataset offers a rich compilation of speech recordings. The dataset is meticulously curated to include a variety of speech tasks—ranging from sustained vowel phonations and diadochokinetic (DDK) tests to 16 structured listen-and-repeat utterances and spontaneous monologues. It also includes both manually transcribed listen-and-repeat tasks and Whisper-automated transcriptions for monologues.

Moreover, there is a paper explaining the details of the dataset and also a quick guide on how to use (with a github repo included).
Paper explanining the database:
https://arxiv.org/abs/2403.02371
Github repo of the database and how to use it:
https://github.com/BYO-UPM/Neurovoz_Dababase

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions