Home Assistance Voice & Ollama Setup Guide - The Ultimate Local LLM Solution!

Transcribed Jun 20, 2026 Watch on YouTube ↗

Intermediate 4 min read For: Home automation enthusiasts and developers with basic knowledge of Docker and Home Assistant.

54.0K

Views

650

Likes

42

Comments

49

Dislikes

1.3%

📊 Average

AI Summary

This video demonstrates how to connect a locally running Ollama large language model (LLM) to Home Assistant Voice, enabling fully local voice commands and question answering. The setup is straightforward and leverages Docker containers for easy installation.

Chapters

1 Introduction and Problem Statement 0:00 2 Ollama Setup and Hardware Requirements 0:53 3 Home Assistant Integration and Model Selection 2:21 4 Configuration and Demo 4:46 5 Conclusion and Recommendations 7:11

[0:00]

Home Assistant Voice Preview Edition

An ESP32-based smart speaker that handles voice commands and home automation locally using Whisper and Piper.

[0:24]

Missing Conversation Agent

The device lacks a proper conversation agent for answering questions, which Ollama can provide.

[0:36]

Ollama Overview

Ollama is an open-source way to run large language models locally, and it's easy to set up with Home Assistant Voice.

[0:53]

Hardware Requirements

A reasonably powerful PC with a GPU (lots of VRAM) or at least enough system RAM is needed. The setup uses TrueNAS Scale with Docker containers.

[1:39]

Installing Ollama on TrueNAS

Installation is as simple as clicking install in the App Store, setting parameters like storage pools, CPU cores, RAM, and optionally passing a GPU.

[2:21]

Home Assistant Integration

Add the Ollama integration in Home Assistant by entering the server IP and port, then select a model. The command line allows choosing smaller parameter sizes.

[3:43]

Model Selection Example

Example: 'ollama run deepseek-r1:1.5b' or 'ollama pull deepseek-r1:1.5b' to download without running.

[4:00]

Voice Assistant Configuration

In Home Assistant, go to Voice Assistant, select local assistant, and change the conversation agent to the Ollama model. Adjust settings like instruction, context window, max message history, and keep alive time.

[4:31]

Keep Alive Setting

Recommend changing from -1 (permanent) to a few minutes to avoid permanent RAM usage.

[4:46]

LLM Control of Home Assistant

Leave LLM control set to 'No Control' to avoid compatibility issues; commands are handled locally by Home Assistant, while questions go to the LLM.

[5:40]

Demo: Question Answering

Asking 'How long is an inch in centimeters?' yields a response: '1 inch is approximately equal to 2.54 cm.' Responses are slower due to speech-to-text, LLM generation, and text-to-speech.

[6:19]

Model Recommendation

DeepSeek R1 produces verbose thinking text; the creator prefers Llama 3.2 for cleaner responses.

[7:01]

Future Addition: Web Search

Web search capability is desired but not yet implemented.

Setting up Ollama with Home Assistant Voice is remarkably easy and provides a fully local, private LLM-powered voice assistant. While responses are slightly slower, the results are great for answering questions and controlling home automation.

Clickbait Check

90% Legit

"The title accurately describes the content: a guide to setting up Home Assistant Voice with Ollama for a local LLM solution."

Mentioned in this Video

Ollama

tool

Home Assistant

tool

Whisper

tool

Piper

tool

TrueNAS Scale

tool

Docker

tool

DeepSeek R1

model

Llama 3.2

model

Tutorial Checklist

1 0:53 Set up Ollama on a PC with a GPU or sufficient RAM. Install via Docker on TrueNAS Scale or similar.

2 1:39 Install Ollama from the App Store (e.g., TrueNAS), configure storage, CPU cores, RAM, and optionally pass a GPU.

3 2:21 In Home Assistant, go to Devices & Integrations, add the Ollama integration, enter server IP and port.

4 2:34 Select a model. If needed, use command line to download a smaller parameter size: 'ollama pull modelname:params'.

5 4:00 Create an entity for the model in Home Assistant. Then go to Voice Assistant, select local assistant, and change conversation agent to the Ollama model.

6 4:31 Adjust settings: change keep alive from -1 to a few minutes, set LLM control to 'No Control', and ensure 'Prefer handling commands locally' is enabled.

Study Flashcards (8)

What is the name of the open-source tool used to run large language models locally?

easy Click to reveal answer

Ollama