I want to replicate RNNoise as a lighter version that can run on microcontrollers, so I will need to build and train a new model.
From my understanding, the input is the BFCCs of the noisy speech plus several additional features, and the NN learns a gain for each of the 22 bands.
Is y_train the ground-truth gains for these 22 bands?
If so, how do I calculate the ground-truth gains from the signals?
I am trying to train the NN using the MS dataset, as suggested in #116.
The MS dataset gives me both clean speech and noisy speech, with the noise mixed in at 0~40 dB.
I first tried using the 22 noisy BFCCs as input and the 22 clean BFCCs as y_train. That did not work.
After that, I realised that what we actually need is the gains, but I am not sure how to generate them from the data I have.
Any comments would help. Thanks.
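To make the question concrete, here is a minimal sketch of what I think the target could look like. It assumes the per-band target is the ratio described in the RNNoise paper, g_b = sqrt(E_clean(b) / E_noisy(b)), computed per frame from the clean and noisy power spectra and clipped to 1. The function names, the rectangular band layout, and the `eps` floor are my own placeholders, not taken from the RNNoise code (RNNoise itself uses overlapping triangular bands).

```python
import math

def band_energies(power_spectrum, band_edges):
    """Sum the per-bin power inside each band [edges[i], edges[i+1])."""
    return [
        sum(power_spectrum[lo:hi])
        for lo, hi in zip(band_edges[:-1], band_edges[1:])
    ]

def ideal_band_gains(clean_power, noisy_power, band_edges, eps=1e-3):
    """Per-band target gain sqrt(E_clean / E_noisy), clipped to at most 1.

    eps is an assumed small floor that avoids division by zero in silent bands.
    """
    e_clean = band_energies(clean_power, band_edges)
    e_noisy = band_energies(noisy_power, band_edges)
    gains = []
    for es, ex in zip(e_clean, e_noisy):
        g = math.sqrt((es + eps) / (ex + eps))
        gains.append(min(1.0, g))  # cross terms can push the ratio above 1
    return gains

# Toy frame: 8 frequency bins split into 2 hypothetical bands.
clean = [4.0, 4.0, 4.0, 4.0, 0.0, 0.0, 0.0, 0.0]
noisy = [16.0, 16.0, 16.0, 16.0, 1.0, 1.0, 1.0, 1.0]
edges = [0, 4, 8]
y_train_frame = ideal_band_gains(clean, noisy, edges)
```

With 22 bands, each frame would give a 22-element y_train vector in [0, 1], computed from the time-aligned clean and noisy files in the dataset.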