metadata.json missing for DALI train/dev split #4
Comments
Hi guxm2021, this is very interesting work. I was delighted to read your paper and am eager to experiment with this repository. I have a similar question to scaperothian. Could you tell us where to find the metadata.json, or post an example JSON so we can recreate the format? Many thanks
Thank you so much for your interest in this project! The meta_data.json is a new file we generated during the data processing procedure. It contains the text annotation and the path to the audio for each utterance-level sample in the dataset. We have processed all datasets into a similar format (metadata + a folder of utterance-level samples). I'm sorry for the delay in uploading this part of the code and the corresponding instructions in the README. I'll try to clean up this part of the code and post it to GitHub before next week.
Lovely to hear from you Longshen, No problem at all. In fact I found your response rather fast! So just to make sure I've interpreted correctly: you segment the audio tracks into individual tracks for each utterance before making the metadata.json? And by utterance, do you mean phoneme, word or phrase/line? Thanks again! I don't know why your repo hasn't gotten more attention. It looks pretty cool.
We've updated the data processing code here. Please follow the Readme.md inside that dir to prepare the data. Hi Brendan, yes, for each dataset, the audio was separated into utterances and metadata.json was created for the utterance-level version of the dataset. By utterance I mean one line of lyrics in the song.
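As a rough illustration of the structure described above (one entry per lyric line, holding the transcript and the path to the audio clip), a minimal sketch might look like the following. The key names `path` and `text`, the utterance IDs, and the file layout are assumptions made for illustration, not the repository's actual schema; the Readme.md in the data processing directory remains the authoritative reference.

```python
import json
from pathlib import Path


def build_metadata(utterances):
    """Map each utterance ID to its audio path and lyric line.

    `utterances` is a dict of {utterance_id: (wav_path, lyric_line)}.
    The output keys "path" and "text" are hypothetical names.
    """
    metadata = {}
    for utt_id, (wav_path, lyric_line) in utterances.items():
        metadata[utt_id] = {
            "path": str(wav_path),       # assumed key: location of the utterance clip
            "text": lyric_line.strip(),  # assumed key: one line of lyrics
        }
    return metadata


def write_metadata(metadata, out_file):
    """Write the metadata dict to a JSON file, creating parent folders if needed."""
    out_file = Path(out_file)
    out_file.parent.mkdir(parents=True, exist_ok=True)
    with out_file.open("w", encoding="utf-8") as f:
        json.dump(metadata, f, indent=2, ensure_ascii=False)


# Hypothetical utterance-level samples: one clip per lyric line.
example = {
    "song0001_utt000": ("train/song0001_utt000.wav", "first line of the lyric "),
    "song0001_utt001": ("train/song0001_utt001.wav", "second line of the lyric"),
}
metadata = build_metadata(example)
```

Calling `write_metadata(metadata, "some_folder/metadata.json")` would then place the file next to the folder of utterance clips; the folder name here is also only a placeholder.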
Btw, if you need access to the full audio of DALI v2 instead of downloading it from YouTube (a proportion of the URLs have become invalid over the years), please send me an email at [email protected] from any Outlook email address, and I can then share the audio (currently saved in my OneDrive) with you. Thanks for your patience.
Hi @Sonata165, thanks for the response. Unfortunately I am unable to send emails to you, and have been trying to do so from my university's Outlook email account. Is this address definitely accurate, or is there another email I can contact you through?
I think his email address is "[email protected]".
Hello, thank you for publishing your work on this research topic. I am interested in using your repo to fine-tune WAV2VEC with DALI and other data. When I run the dali_prepare.py script in DALI/LM/dali_prepare.py:
python dali_prepare.py --data_folder=/path/to/DALI_v2.0/
it returns the following:
The metadata.json is not found in the DALI dataset on Zenodo or on the DALI GitHub page.
I could recreate it based on the relative hours of data reported in your paper, but would prefer to use your exact JSON and modify it as needed (i.e. based on connectivity, etc.).
Thanks again for your contributions to this field.