Run your own AI (but private)

Transcribed Jun 28, 2026

Intermediate · 15 min read

For: Tech enthusiasts, IT professionals, and data scientists interested in running private AI models locally or in enterprise environments.

Views: 2.5M

Likes: 83.2K

Comments: 3.5K

Dislikes: 1.8K

3.5%

📈 Moderate

AI Summary

This video demonstrates how to set up a private AI on your local computer using Ollama, allowing you to run large language models like Llama 2 without an internet connection. It then shows how to connect your own knowledge base, such as journals or documents, to a private GPT using RAG (Retrieval Augmented Generation). Finally, it discusses VMware's private AI solution for enterprises, which simplifies fine-tuning and deploying custom LLMs on-premises.

Chapters

1 Introduction to Private AI 0:00

2 Understanding AI Models and Hugging Face 1:51

3 Installing Ollama and Running Models 4:16

4 Fine Tuning and RAG for Business 8:27

5 Advanced: Setting Up PrivateGPT with Your Data 15:36

[0:03]

Private AI runs locally

All processing happens on your computer: no internet connection is needed, and your data stays private.

[0:15]

Setting up local AI is easy

It takes about five minutes to set up your own AI on a laptop using free tools.

[2:08]

Hugging Face hosts 505,000 models

A community platform with a vast collection of pre-trained AI models available for download.

[2:32]

Llama 2 training scale

Trained on 2 trillion tokens using 6,000 GPUs and 1.7 million GPU hours, at an estimated cost of $20 million.
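To put these figures in perspective, the short calculation below derives a rough wall-clock duration and an implied cost per GPU hour. It is only a back-of-the-envelope sketch: it assumes all 6,000 GPUs ran concurrently for the entire run, which the video summary does not state.

```python
# Figures quoted in the summary above.
GPU_HOURS = 1_700_000
NUM_GPUS = 6_000
ESTIMATED_COST_USD = 20_000_000

# Assumption: every GPU is busy for the whole training run.
hours_per_gpu = GPU_HOURS / NUM_GPUS
days_wall_clock = hours_per_gpu / 24
cost_per_gpu_hour = ESTIMATED_COST_USD / GPU_HOURS

print(f"Hours per GPU: {hours_per_gpu:.0f}")                        # 283
print(f"Wall-clock days: {days_wall_clock:.1f}")                    # 11.8
print(f"Implied cost per GPU hour: ${cost_per_gpu_hour:.2f}")       # $11.76
```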

[4:26]

Ollama simplifies running LLMs

A tool that installs easily and runs models such as Llama 2, Code Llama, and uncensored variants.
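Besides the interactive 'ollama run llama2' prompt, a locally running Ollama server can also be queried from a script. The sketch below uses only the Python standard library and assumes Ollama is listening on its default local address (http://localhost:11434) and that the llama2 model has already been downloaded. The helper names build_request and ask are illustrative, not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = "llama2") -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt: str, model: str = "llama2") -> str:
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, model)) as resp:
        return json.loads(resp.read())["response"]


# Example (requires a running Ollama server with llama2 pulled):
# print(ask("Why is the sky blue?"))
```

Because the request goes to localhost, the prompt and the answer never leave the machine, which is the point of the private setup described here.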

[5:20]

WSL installation for Windows

Use 'wsl --install' to set up Windows Subsystem for Linux, enabling Linux applications on Windows.

[7:01]

GPU vs CPU performance

Running AI models on a GPU is much faster than on a CPU, which matters for real-time use.

[9:49]

Fine tuning trains AI on proprietary data

Fine tuning teaches an existing model new information from a small dataset, for example 9,800 examples.

[14:41]

Fine tuning changes only 0.93% of parameters

For a 7B parameter model, only 65 million parameters are modified, making fine tuning resource-efficient.
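The percentage follows directly from the two parameter counts. A quick check, taking "7B" to mean exactly 7 billion parameters (an assumption, since the precise model size is not given here):

```python
TOTAL_PARAMS = 7_000_000_000   # "7B" model, assumed to be exactly 7 billion
TUNED_PARAMS = 65_000_000      # parameters modified during fine tuning

fraction = TUNED_PARAMS / TOTAL_PARAMS
untouched = TOTAL_PARAMS - TUNED_PARAMS

print(f"Share of parameters modified: {fraction:.2%}")   # 0.93%
print(f"Parameters left unchanged: {untouched:,}")       # 6,935,000,000
```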

[15:36]

RAG connects LLM to external databases

Retrieval Augmented Generation allows the LLM to consult a knowledge base before answering, improving accuracy.
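The retrieve-then-answer idea can be illustrated with a toy example. The sketch below ranks a few notes by word overlap with the question (bag-of-words cosine similarity) and places the best match in the prompt. It is a simplified stand-in: real RAG systems generally use learned embeddings and a vector database, and the notes and function names here are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question: str, documents: list[str], k: int = 1) -> list[str]:
    """Return the k documents most similar to the question."""
    q = Counter(tokenize(question))
    ranked = sorted(
        documents, key=lambda d: cosine(q, Counter(tokenize(d))), reverse=True
    )
    return ranked[:k]


def build_prompt(question: str, documents: list[str], k: int = 1) -> str:
    """Prepend the retrieved context so the LLM can consult it before answering."""
    context = "\n".join(retrieve(question, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"


# Hypothetical journal entries standing in for a private knowledge base.
NOTES = [
    "March 3: Went hiking near the lake and saw two herons.",
    "March 9: Fixed the home server; the backup disk had failed.",
    "March 14: Tried a new coffee roaster downtown.",
]

print(build_prompt("What happened to the backup disk?", NOTES))
```

The resulting prompt would then be sent to the local model, so the answer is grounded in the retrieved note rather than in the model's training data alone.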

Clickbait Check

90% Legit

"The video delivers exactly what the title promises: a clear guide to running your own private AI on a laptop."

Mentioned in this Video

Ollama

tool

WSL (Windows Subsystem for Linux)

tool

PrivateGPT

tool

Hugging Face

tool

VMware Private AI with Nvidia

service

NetworkChuck

person

George Sung

person

L Martinez

person

Emilien Lancelot

person

Tutorial Checklist

1 4:26 Install Ollama on your operating system (macOS, Linux, or Windows via WSL).

2 5:20 If on Windows, open Windows Terminal and run 'wsl --install' to set up Ubuntu.

3 6:18 Run the command 'ollama run llama2' to download and start the Llama 2 7B model.

4 18:23 For private GPT with your own data, install PrivateGPT following the guide by L Martinez (requires dependencies such as Python, NVIDIA drivers, and Poetry).

5 20:12 Ingest your documents folder into PrivateGPT using the provided command, then run PrivateGPT and query through the web interface.
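The exact ingest command is not reproduced in this summary, and PrivateGPT's internals are not described here. As a generic illustration of what an ingestion step commonly involves, the sketch below reads text files from a folder and splits them into overlapping chunks that a retriever could later index. The function names, chunk sizes, and the restriction to .txt files are all assumptions made for the example.

```python
from pathlib import Path


def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks that overlap slightly."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last chunk already reaches the end of the text
        start += step
    return chunks


def ingest_folder(folder: str) -> dict[str, list[str]]:
    """Map each .txt file under the folder to its list of chunks (hypothetical helper)."""
    return {
        str(path): chunk_text(path.read_text(encoding="utf-8"))
        for path in sorted(Path(folder).rglob("*.txt"))
    }


print(chunk_text("0123456789", chunk_size=4, overlap=1))
# ['0123', '3456', '6789']
```

The overlap keeps a sentence that straddles a chunk boundary visible in both neighbouring chunks, at the cost of storing some text twice.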

Study Flashcards (10)

What is an AI model?

easy

An artificial intelligence pre-trained on data, such as a large language model (LLM).