diff --git a/docs/models/hey_jarvis.md b/docs/models/hey_jarvis.md index 95832ee..3e0b2ab 100644 --- a/docs/models/hey_jarvis.md +++ b/docs/models/hey_jarvis.md @@ -50,7 +50,7 @@ Clips were generated both with the trained speaker embeddings, and also mixtures The following phrases were included in the training data: -1) "hey mycroft" +1) "hey jarvis" After generating the synthetic positive wakewords, they are augmented in two ways: @@ -80,4 +80,4 @@ Currently, there is not a test set available to evaluate this model. # Other Considerations -While the model was trained to be robust to background noise and reverberation, it will still perform the best when the audio is relatively clean and free of overly loud background noise. In particular, the presence of audio playback of music/speech from the same device that is capturing the microphone stream may result in significantly higher false-reject rates unless acoustic echo cancellation (AEC) is performed via hardware or software. \ No newline at end of file +While the model was trained to be robust to background noise and reverberation, it will still perform the best when the audio is relatively clean and free of overly loud background noise. In particular, the presence of audio playback of music/speech from the same device that is capturing the microphone stream may result in significantly higher false-reject rates unless acoustic echo cancellation (AEC) is performed via hardware or software.