Someone can correct me if I’m wrong, but Home Assistant is currently struggling with this and processes everything on your local box because it can’t do wake words on the device.
I think they’re choosing to do it that way. Raspberry Pis easily have the capability to do wake word recognition on device (I think they are also working on that). ESPs, on the other hand, can only stream audio to the server and not much more. Since ESPs are far cheaper than installing a Raspberry Pi in each room, they are focusing on doing wake word detection on the server rather than on the device.