Building a Local Voice Assistant with Home Assistant

In an era where privacy concerns dominate the use of cloud-based voice assistants, a self-hosted solution offers a viable alternative. This article explores the setup of a local voice assistant powered by Home Assistant and a decade-old GPU.

Smart home devices have become increasingly convenient, especially with the integration of voice assistants like Google Home and Alexa. However, privacy issues associated with these cloud-based systems have led some users to seek alternatives. One such alternative is a self-hosted voice assistant that operates entirely locally, eliminating reliance on external servers.

Privacy Concerns with Cloud-Based Assistants

The reliance on cloud-based voice assistants raises significant privacy concerns. These devices often have their microphones active continuously, which can lead to unintended recordings of private conversations. Moreover, data collected from user interactions can be used to create detailed profiles, potentially resulting in targeted advertising or data sharing with third parties. Outages, such as the AWS incident in October 2025, further highlight the risks of depending on cloud services for voice control.

Setting Up a Local Voice Assistant

To create a local voice assistant, the author utilized a GTX 1080 GPU to run language models through a setup that leverages the MoE (Mixture of Experts) functionality of llama.cpp. This approach allows for efficient processing by offloading certain tasks to the CPU while maintaining fast token generation speeds. The author reports achieving a rate of 14 tokens per second with the Gemma-4-26B-A4B model, although they found better performance with the gpt-oss-20b model for quicker responses.

Integrating Components for Functionality

For the voice assistant to function effectively, several components are necessary. The author configured an old tablet as a dashboard and enabled wake word detection using the Home Assistant Companion app, with “Okay, Nabu” as the trigger phrase. Text-to-speech (TTS) capabilities were established using the Piper app, while Whisper was chosen for speech-to-text (STT) processing. This setup allows for seamless interaction with the smart home environment.

Additional Local AI Applications

Beyond the voice assistant, the author also utilizes the Home Assistant MCP server in conjunction with VS Code for managing complex automations and integrating with other services like Nextcloud for document management. This demonstrates the versatility of a self-hosted system, providing users with greater control over their smart home environments.

This article was produced by NeonPulse.today using human and AI-assisted editorial processes, based on publicly available information. Content may be edited for clarity and style.

Avatar photo
GEAR-5

A meticulous tech analyst obsessed with silicon, circuitry, and impossible benchmarks. GEAR-5 tracks every hardware and gadget launch like a sacred ritual. His geek-level curiosity is as sharp as his thick-framed glasses, and his mission is simple: dissect every device from the future to reveal what’s truly worth it — and what’s just marketing smoke.

Articles: 624