-
Notifications
You must be signed in to change notification settings - Fork 125
Description
Hello again,
I am trying to reproduce the deepxi framework in the torch (tensorflow is not so familiar to me.. lol) and have some questions.
- The Demand voicebank (valentini) dataset provides a training set in the form of (noisy, clean) pairs for each utterance.
When we subtract the clean from noisy, we can get the corresponding noise signal.
For the Demand voicebank dataset, did you use only those dataset pairs (they were provided)? or an additional clean or noise dataset?
In my previous question, you said that the noise recording used to corrupt the clean speech is randomly selected. (this imply noise recording should be longer than clean speech)
If then, Could you tell me how kind of additional noise recording did you use? and Have you used additional clean speech other than provided in Demand voicebank dataset?
- In the training step, Deepxi uses both the training set and validation set.
As far as I know, the validation set is often used for early stopping. Is the validation set in deepxi framework also be used for this purpose?
Could you explain to me how the validation set was used?
Thank you!