8 Khz Acoustic Model with Ivector #27

alerenato · 2017-11-02T20:58:18Z

I have built an acoustic model for NNET3 of 8 Khz with ivectors ( similar configuration to Switchboard). I'm trying it with asr-server. I made some changes (for example, #def AUDIO_FREQUENCY = 8000) in all the places of the code where 16 Khz appears. The system runs without errors but the results is "" with sentences of 8 kHz, 16 bit raw when in Kaldi decoder the result is correct. I would like to know if I should make more modifications to be able to run my model. I have seen that the api.ai model does not have ivectors. Thanks in advance.

realill · 2017-11-03T16:46:30Z

As far I know kaldi is not 8khz friendly and recommendation is always to do 8->16Khz transformation before decoding.

If you want to work with 8Khz you have to ask this question to kaldi maintainers. Or maybe going through switchboard decoding scripts and figure out what do they do.

mikenewman1 · 2018-01-03T14:16:41Z

You just have to set the parameters correctly in mfcc.conf. There are plenty of examples in the swbd recipe.
But as you point out, there are several places in the asr-server code with hard-coded sample rates that you need to fix up.

mikenewman1 · 2018-01-03T14:40:03Z

I believe the two locations are in OnlineDecoder.cc and RequestRawReader.h

alerenato · 2018-01-03T23:30:50Z

Thank you very much, Michel. I will try with these modifications and report the results in this place.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

8 Khz Acoustic Model with Ivector #27

8 Khz Acoustic Model with Ivector #27

alerenato commented Nov 2, 2017

realill commented Nov 3, 2017

mikenewman1 commented Jan 3, 2018

mikenewman1 commented Jan 3, 2018

alerenato commented Jan 3, 2018

8 Khz Acoustic Model with Ivector #27

8 Khz Acoustic Model with Ivector #27

Comments

alerenato commented Nov 2, 2017

realill commented Nov 3, 2017

mikenewman1 commented Jan 3, 2018

mikenewman1 commented Jan 3, 2018

alerenato commented Jan 3, 2018