If you’re using the Home Assistant voice assistant mechanism (not Alexa/Google/etc.) how’s it working for you?

Given there’s a number of knobs that you can use, what do you use and what works well?

  • Wake word model. There’s the default models and custom
  • Conservation agent and model
  • Speech to text models (e.g. speech-to-phrase or whisper)
  • Text to speech models
  • Stampela
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 day ago

    I have setup the wake word as Hey Jarvis, but the issues I get… it usually gets it, however I also hear it bleeping and blooping randomly so that’s fun. Then HA is running on a N100 mini computer, and I found that the smallest Whisper model I can use reliably is the medium one (I’m sure in English it’d work well even with smaller ones) and the LLM is Qwen 3 4b running on a computer with a dedicated RX 6400. As in, that’s the second gpu and it’s doing only that. The end result is that I give a command, wait a few seconds (Whisper mostly), then hopefully it works out. I imagine with a known good mic and powerful local hardware it’d be noticeably better, but.