Audio cues: issue tracking #69
naorunaoru
started this conversation in
Ideas
Replies: 1 comment 12 replies
-
The Onju has 16 MB of NOR flash on board, which could be used to store such small audio files. The limitation is that ESPHome does not play "files" that are not embedded in the firmware. To be honest, I know very little about partitions or how to extend the onboard flash with the external one.
-
As a user, I'd like my smart whoopie cushion to respond with audio feedback. Example cases:
There's a feature request in ESPHome: esphome/feature-requests#2490 which didn't get much attention for some reason.
There's also a HomeAssistant discussion with some hacks: https://community.home-assistant.io/t/play-sound-when-wakeword-detected/653928/32
Apart from the ESPHome limitation, is there anything specific to Onju Voice that might prevent us from implementing this in the future? Off the top of my head, I can think of full-duplex audio with echo cancellation, as I'm not familiar with what the ESP-IDF audio pipeline can do. The custom component used in the microwakeword version of this config apparently supports simultaneous listening and playback, but it comes with its own tradeoffs. The other issue is that ESPHome apparently can't play multiple audio streams at the same time, which is needed for audio feedback when adjusting the volume while music is playing.
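To make the multiple-streams point concrete, here is a rough sketch of what software mixing of a cue over music could look like. It is plain Python for illustration only, not ESPHome or ESP-IDF code; the function name `mix_pcm16` and the `music_gain` ducking factor are made up for this example, and whether the ESPHome audio pipeline offers a place to hook in something like this is an open question.

```python
def mix_pcm16(music, cue, music_gain=0.5):
    """Mix two lists of signed 16-bit PCM samples.

    The music is attenuated ("ducked") by music_gain, the cue is added
    on top, and the sum is clamped to the 16-bit range to avoid wraparound.
    Both names and the default gain are hypothetical, chosen for illustration.
    """
    length = max(len(music), len(cue))
    mixed = []
    for i in range(length):
        # Treat the shorter stream as silence once it runs out.
        m = music[i] if i < len(music) else 0
        c = cue[i] if i < len(cue) else 0
        total = int(m * music_gain) + c
        mixed.append(max(-32768, min(32767, total)))
    return mixed
```

Clamping keeps loud passages from wrapping around into noise, at the cost of some distortion when both streams peak at once.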
The purpose of opening this thread is to share any links related to this feature and discuss possible workarounds.
I also noticed that people generally suggest playing prerecorded audio, which isn't well suited to ESP32-based systems due to flash size limitations. I'd suggest using a tone generator instead. For example, a sine wave synthesizer with a configurable ADSR volume envelope wouldn't take much flash space or many CPU cycles, but would allow for instant audio cues without relying on external audio sources. What do you think?
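As a minimal sketch of the tone generator idea, here is a sine wave with a linear ADSR envelope rendered to 16-bit PCM. It is written in Python purely to show the math, not as an ESPHome component; the 16 kHz sample rate, the envelope timings, and the names `adsr_gain` and `render_tone` are all assumptions made for this example.

```python
import math

SAMPLE_RATE = 16000  # Hz; assumed here, not taken from the Onju Voice config


def adsr_gain(t, duration, attack=0.01, decay=0.05, sustain=0.6, release=0.08):
    """Return the linear ADSR envelope gain (0.0 to 1.0) at t seconds into a note."""
    if t < 0 or t >= duration:
        return 0.0
    # The release phase never starts before attack and decay have finished.
    release_start = max(duration - release, attack + decay)
    if t < attack:
        return t / attack  # ramp up from silence to full level
    if t < attack + decay:
        return 1.0 - (1.0 - sustain) * (t - attack) / decay  # fall to sustain level
    if t < release_start:
        return sustain
    # Linear fade from the sustain level to silence at the end of the note.
    return sustain * (duration - t) / (duration - release_start)


def render_tone(freq_hz, duration, amplitude=0.5):
    """Render one enveloped sine tone as a list of signed 16-bit PCM samples."""
    n = round(SAMPLE_RATE * duration)
    samples = []
    for i in range(n):
        t = i / SAMPLE_RATE
        value = amplitude * adsr_gain(t, duration) * math.sin(2 * math.pi * freq_hz * t)
        samples.append(int(round(value * 32767)))
    return samples


cue = render_tone(880.0, 0.3)  # a 0.3 s cue at 880 Hz
```

For comparison, storing this 0.3 s cue as raw 16-bit mono PCM at 16 kHz would take 4800 samples × 2 bytes = 9600 bytes, while the synthesizer only needs a frequency, a duration, and the envelope parameters.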