If you’re using the Home Assistant voice assistant mechanism (not Alexa/Google/etc.) how’s it working for you?
Given there’s a number of knobs that you can use, what do you use and what works well?
- Wake word model. There’s the default models and custom
- Conservation agent and model
- Speech to text models (e.g. speech-to-phrase or whisper)
- Text to speech models


I have setup the wake word as Hey Jarvis, but the issues I get… it usually gets it, however I also hear it bleeping and blooping randomly so that’s fun. Then HA is running on a N100 mini computer, and I found that the smallest Whisper model I can use reliably is the medium one (I’m sure in English it’d work well even with smaller ones) and the LLM is Qwen 3 4b running on a computer with a dedicated RX 6400. As in, that’s the second gpu and it’s doing only that. The end result is that I give a command, wait a few seconds (Whisper mostly), then hopefully it works out. I imagine with a known good mic and powerful local hardware it’d be noticeably better, but.