Build 3 PRODUCTION AI Agents in Python - Full Course (Agentspan)

1h 20m video Published Jun 17, 2026 Transcribed Jul 28, 2026 Tech With Tim

Tech With Tim

Intermediate 40 min read For: Python developers with basic AI/LLM experience who want to learn how to build and deploy production-grade AI agents.

AI Trust Score 95/100

✅ Highly Legit

"The title accurately describes the video's content: building three distinct AI agents (conversational, RAG-based, and multi-agent orchestrator) using AgentSpan, with a focus on production readiness."

AI Summary

This video is a full course on building production-ready AI agents in Python using the open-source framework AgentSpan. It covers three increasingly complex agents: a simple conversational bot with memory, a RAG-based support agent with structured output and guardrails, and a multi-agent orchestrator for research tasks. The focus is on solving real-world production challenges like crash recovery, human-in-the-loop approvals, and observability.

Chapters

1 Introduction and Production Challenges 0:00 2 Setup and First Agent (Conversational) 6:03 3 Adding Tools and Memory 12:31 4 RAG Agent with Structured Output and Guardrails 25:31 5 Human-in-the-Loop and Refund Tool 42:01 6 Multi-Agent Orchestration 51:01 7 Testing and Durability 71:03 8 Deployment and Wrap-Up 77:13

[1:57]

Seven Pillars of Production AI

The video outlines seven key features for production AI agents: durability, retries, human-in-the-loop, observability, long-running tasks, scaling, and testing.

[2:47]

AgentSpan Framework Introduction

AgentSpan is introduced as the framework used, which is free and open-source. It provides a server that handles state management, orchestration, and observability.

[4:20]

Durable Execution via State Server

The server stores all agent state, allowing workers to reconnect and resume from crashes without losing progress. It also handles retries and human-in-the-loop approvals.

[6:07]

Installation and Server Setup

Installation is done via `pip install agent-span`. The server is started with `agent-span server start` and runs on port 6767 by default.

[12:01]

Building Agent 1: Conversational Agent

The first agent is a simple conversational agent. It is created by instantiating the `Agent` class with a name, model, and instructions. Tools and memory are added later.

[22:05]

Adding Tools with @tool Decorator

Tools are created by defining a function with a `@tool` decorator. The function's docstring becomes the tool's description for the LLM.

[25:34]

Adding Conversational Memory

Conversational memory is added using the `ConversationMemory` class. Messages can be added manually or automatically by passing the memory object to the agent.

[28:36]

Agent 2: RAG-Based Support Agent

Agent 2 is a RAG-based support agent. It uses a Pydantic model (`SupportResponse`) for structured output, ensuring predictable responses.

[51:04]

Implementing Guardrails

Guardrails are functions that run before (input) or after (output) the LLM to block malicious content. The video demonstrates an input guardrail for prompt injection detection.

[43:20]

Human-in-the-Loop Approval

Human-in-the-loop approval is implemented by setting `approval_required=True` on a tool. The worker can then use `handle.approve()` or `handle.reject()` to continue or stop.

[57:53]

Agent 3: Multi-Agent Orchestrator

Agent 3 is a multi-agent orchestrator. It supports strategies like sequential, parallel, and nested pipelines. The video shows a research team with parallel analysis followed by sequential writing and editing.

[71:04]

Testing Agents Without LLM Calls

AgentSpan provides a testing framework to mock tool calls and verify agent behavior without hitting a real LLM, enabling fast and deterministic tests.

[72:55]

Durability: Crash and Resume

The durability feature is demonstrated by crashing a worker mid-task and then resuming it using the execution ID. The agent continues from where it left off without losing state.

[77:18]

Deployment with Docker Compose

For deployment, the video recommends using Docker Compose with PostgreSQL for persistent storage. The server supports basic auth for secure worker connections.

Mentioned in this Video

AgentSpan

tool

Firecrawl

tool

OpenAI

service

Pydantic

tool

Tutorial Checklist

1 6:07 Install AgentSpan and its server using `pip install agent-span`.

2 8:26 Set your OpenAI API key as an environment variable: `export OPENAI_API_KEY=...`.

3 10:16 Start the AgentSpan server: `agent-span server start`.

4 12:36 Create a new Python file (e.g., `agent1.py`) and import necessary modules: `logging`, `datetime`, `dotenv`, and `agent_span`.

5 13:37 Create a `.env` file with `AGENT_SPAN_SERVER_URL=http://localhost:6767/api`.

6 15:28 Define a basic agent: `assistant = Agent(name="personal_assistant", model="openai/gpt-4", instructions="...")`.

7 17:39 Run the agent inside a `with AgentRuntime() as runtime:` block, using a while loop to accept user input and call `runtime.run(assistant, prompt)`.

8 21:57 Add a tool by defining a function with the `@tool` decorator and a docstring. Example: `@tool def get_current_time() -> str: ...`.

9 25:34 Add conversational memory by creating a `ConversationMemory` object and passing it to the agent via the `memory` parameter.

10 29:46 For structured output, define a Pydantic model and set `output_type` on the agent. Example: `output_type=SupportResponse`.

11 51:40 Add a guardrail by defining a function that returns a `GuardrailResult` and passing it to the agent via the `guardrails` parameter.

12 43:20 For human-in-the-loop, set `approval_required=True` on a tool and handle the `waiting` event in the stream to call `handle.approve()` or `handle.reject()`.

13 60:54 For multi-agent orchestration, define multiple agents and use the `>>` operator for sequential pipelines or set `strategy="parallel"` for parallel execution.

Study Flashcards (12)

What are the seven features needed for a production-ready AI agent according to the video?

medium Click to reveal answer

Durability, retries, human-in-the-loop, observability, long-running tasks, scaling, and testing.