Sad news: improvements not possible with this hardware revision #100
Replies: 6 comments 11 replies
-
First of all thanks for all your work on getting ESPHome working on the Onju Voice, I went into this knowing it was bleeding edge and the hardware could be a limitation for long term use but my 5 devices are performing extremely well within the known limitations. Perhaps a way to do this would be to adopt a pattern similar to the Everything Presence One where there are common packages that are then imported to gain the desired set of features. I'm a tinkerer with ESPHome so never attempted this level of packaging but I'd certainly try to help out where time allows. I'm using a custom modification of the mww implementation with some additional convenience features such as a clumsy flip of the device I've created myself. |
Beta Was this translation helpful? Give feedback.
-
For me personally I prefer streaming ww over mww, and I was thinking as a consequence the 48 to 16 resampling could be done HA-side. It would be nice to support audio ducking etc, I believe for non-mww to work with that the same va "stages" need to be exposed? Edit: I'm already using openww with StreamAssist, so the extra couple of streams from Onju's are not a burden & all can use the same wake word. I guess what I'm trying to say I'd be most interested in a firmware (variant) where mww is sacrifised for audio quality, and would hope non-mww gets the same va treatment as mww in the future. The above mentioned source structure looks nice. Another option would be how the voice-pe repo is structured? |
Beta Was this translation helpful? Give feedback.
-
Well, justLV said he thinks that AEC might be possible on software side. Still... If the board lacks power... We all have a Home Assistant with some power left. (I have a i5 NUC for it, so it's bored all the time) Why not passing some work on it via WIFI? And if this is not possible or better, not feasable, I like the idea of choosing where you want your drawback and where not. There are ppl who prefer the discussion with the AI over music streaming, some prefer music and then there are the ones like me... I like music and talking with my AI catgirl with sarcasm included, but if she does not hear me while the music plays... SO be it. Tbh.. My Amazon Echo does not hear me all the time when anything except me is in the room. I think they made it less useful by firmware as this was clearly better in the past... Right now I must say that Onju voice does a better job most of the time compared to my Echo devices all over the house. Only the reminders feature maybe with HA calender integration is still missing and that Onju tells you that you have no timers running even if they are. In understanding, in playing music and in AI matters it's far better. Try to tell your Echo to do two things at once... BOOM, dead. I can tell this thing a whole list of things to do and it just does all of it. The platform in background makes it great and if the assistant itself has one little flaw... It's still way better than anything you can buy right now, because the current assistant already allows the platform to shine better than the big competitors out there. If can only get better, flaws or not. Beleive me when I say that however you decide here, anything is better than stopping the project. Never had so much fun with a voice assistant before and that with it's flaws which are neglectible IMO. |
Beta Was this translation helpful? Give feedback.
-
Honestly I love the nice design of the minis (have them floating wall mounted), and the ww sensitivity is ok-ish without xmos, let's work out together the optimum firmware (flavors) for these and keep 'em going! |
Beta Was this translation helpful? Give feedback.
-
My two Home Assistant voice just arrived a few minutes ago. I tested some basic things by now and even though they run fine and more stable than my n00b try to get the onju voice config running on newer ESPHome versions, still it has flaws where Onju will shine. Biggest one: Audio. HA Voice is a bit like a tin can compared to Onju. Second one, looks... Transparent Hockey puck vs nice black cloth. 3rd one: You did it yourself. Never underestimate the selfmade factor! When I look more closely into the new devices I will find even more, that for sure! One thing is great though. HA Voice is open source, too. So maybe we can borrow some ideas from there? |
Beta Was this translation helpful? Give feedback.
-
I'm voting for the option of having separate configuration files for different purposes. I'm not entirely sure about the config I wish for. Is the following possible?
Due to the missing possibility of echo cancellation, detecting a wakeword while playing a song is prone to failures and if detecetd, recorded audio contains music, and is thus not feasible at all -> don't go for it, unless someone implements an efficient method for echo cancellation. |
Beta Was this translation helpful? Give feedback.
-
Hello everyone!
I've been repeatedly trying to update the Onju Voice config in order to make it take advantage of the newest developments in ESPHome and Home Assistant, but I've hit snags every time and now I've come to realize why.
Objectives
What I've tried to achieve is:
micro_wake_word
in order to avoid constant streaming. Plus, it's by far the better supported variant of wake word on ESP devices by HAFacts
micro_wake_word
and thevoice_assistant
pipeline in ESPHome both need audio input at 16kHz sampling rate to functionIf you add all these together, you will realize that the ESPHome config needs to make some compromises, as it can not attain all objectives simultaneously.
Potential drawbacks
Either one of the following needs to happen to make the Onju work with ESPHome:
Solutions
In terms of software, they are outside the scope of this config. Either MWW and VA need to be reworked to work at 48kHz or a very computationally cheap resampler needs to be developed for ESPHome in order to downsample the audio input to 16kHz. And I know for a fact that neither of these are priorities for the Nabu Casa guys at the moment.
In terms of hardware, there are things that could improve the Onju dramatically (or, at the very least, make software development much easier for it), but the developer of the Onju Voice PCB repeatedly said that he is not interested in improving the board any further.
And even if a new hardware revision was available, with separate I2C buses and a chip for active noise cancelling, everyone would still need to go out and buy new PCBs.
Conclusion
I will need to think about what the best approach is here. First thought is multiple configs, each for a single purpose. But I could use the community's help on this. Please drop your opinion or comment below.
Thanks!
Tudor
Beta Was this translation helpful? Give feedback.
All reactions