What does y_train look like? #136

majianjia · 2020-09-03T19:25:37Z

Hi,

I want to build a lighter version of RNNoise that can run on microcontrollers, so I need to build and train a new model.

From my understanding, the input is the BFCC of the noisy speech plus several additional features, and the NN learns the gain for each of the 22 bands.
Is y_train the ground-truth gains for these 22 bands?
If so, how do I calculate the ground-truth gains from the signals?

I tried to train the NN using the MS dataset, as suggested in #116.
The MS dataset gives me both clean speech and noisy speech, with noise mixed in at 0~40 dB.
I first tried using the 22 noisy BFCC as the input and the 22 clean BFCC as y_train. That did not work.
After that, I realised that what we actually need is the gains, but I am not sure how to generate them from the data I have.
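
To make the question concrete, here is a minimal sketch of what the gain targets might look like. It assumes the target for each band is the square root of the clean band energy over the noisy band energy, clipped to 1, which is how I read the RNNoise paper. The function names, the rectangular (non-overlapping) bands, the `eps` value, and the toy numbers are all my own assumptions for illustration, not taken from the RNNoise code.

```python
import math

def band_energies(power_spectrum, band_edges):
    """Sum the per-bin power inside each band [edges[i], edges[i+1])."""
    return [sum(power_spectrum[lo:hi])
            for lo, hi in zip(band_edges[:-1], band_edges[1:])]

def ideal_band_gains(clean_power, noisy_power, band_edges, eps=1e-3):
    """Per-band target gain: sqrt(E_clean / E_noisy), clipped to at most 1.

    eps is a hypothetical small constant that avoids division by zero.
    """
    e_clean = band_energies(clean_power, band_edges)
    e_noisy = band_energies(noisy_power, band_edges)
    return [min(1.0, math.sqrt((ec + eps) / (en + eps)))
            for ec, en in zip(e_clean, e_noisy)]

# Toy frame: 6 frequency bins grouped into 3 bands of 2 bins each.
clean = [4.0, 4.0, 1.0, 1.0, 0.0, 0.0]   # power spectrum of the clean frame
noisy = [4.0, 4.0, 4.0, 4.0, 2.0, 2.0]   # power spectrum of the noisy frame
edges = [0, 2, 4, 6]                     # band boundaries as bin indices

gains = ideal_band_gains(clean, noisy, edges)
print(gains)
```

With real data there would be 22 bands per frame, so y_train would have shape (frames, 22), computed from the time-aligned clean and noisy frames of the same utterance.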

Any comments would help. Thanks!
