Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What does y_train looks like? #136

Open
majianjia opened this issue Sep 3, 2020 · 0 comments
Open

What does y_train looks like? #136

majianjia opened this issue Sep 3, 2020 · 0 comments

Comments

@majianjia
Copy link

Hi,

I wanted to replicate RNNoise but a lighter version to be able to run on microcontrollers. So I will need to build and train a new model.

From my understanding, the input is the BFCC of noisy speech plus multiple features. Then the NN will learn the 22 gains of each band.
Is the y_train the truth of gains for these 22 bands?
If it is, how do I calculate the truth gains out of the signals?

I try to train the NN using MS dataset as suggested by #116
In the MS dataset, I got both clean speech and noisy speech with noise mixed in with 0~40db.
I first tried to put 22 noisy BFCC as input, then use the 22 clean BFCC as y_train. Apparently it fails.
After that, I realise what we need is actually the gains but then I am not sure how to generate gains out from the data I have.

Any comment will help, Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant