Additional questions

Hello again,

I am trying to reproduce the deepxi framework in the torch (tensorflow is not so familiar to me.. lol) and have some questions. 

1) The Demand voicebank (valentini) dataset provides a training set in the form of (noisy, clean) pairs for each utterance.

When we subtract the clean from noisy, we can get the corresponding noise signal.

For the Demand voicebank dataset, did you use only those dataset pairs (they were provided)? or an additional clean or noise dataset?

In my previous question, you said that the noise recording used to corrupt the clean speech is randomly selected. (this imply noise recording should be longer than clean speech)

If then, Could you tell me how kind of additional noise recording did you use? and Have you used additional clean speech other than provided in Demand voicebank dataset?

2) In the training step, Deepxi uses both the training set and validation set. 

As far as I know, the validation set is often used for early stopping.  Is the validation set in deepxi framework also be used for this purpose?

Could you explain to me how the validation set was used?

Thank you!






Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Additional questions #49

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Additional questions #49

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions