agtOS is an open-source platform that turns AI models into voice-enabled agents. It handles the hard parts — real-time audio processing, model routing, tool execution, and persistent memory — so you can focus on building experiences.Documentation Index
Fetch the complete documentation index at: https://docs.agtos.ai/llms.txt
Use this file to discover all available pages before exploring further.
What makes agtOS different
Voice-native, not voice-added
Built from the ground up for voice. The cascade pipeline (STT → LLM → TTS) runs with sub-second latency, sentence-level streaming, and barge-in support.
Local-first architecture
In-process speech engine (sherpa-onnx), local model routing (Ollama), and optional cloud. Works offline — cloud is an enhancement, not a dependency.
Protocol-agnostic
Built on MCP (Model Context Protocol) with A2A readiness. Tools are defined once and work across voice, chat, CLI, and external AI clients.
Progressive infrastructure
Start with just Node.js. Add Redis for memory and scheduling. Add Ollama for local AI. Each piece unlocks more capabilities without breaking what works.
System overview
Key components
| Component | What it does | Required? |
|---|---|---|
| Voice Pipeline | STT, TTS, VAD — in-process or external | Included |
| Model Router | Routes requests to the best model (local or cloud) | Included |
| Agent Loop | Multi-step tool execution with progress streaming | Included |
| Memory System | Working, episodic, and semantic memory tiers | Working always; episodic/semantic need Redis |
| MCP Server | Exposes 10 tools to external AI clients | Included |
| MCP Client | Connects to external MCP servers for tool discovery | Included |
| Device Registry | Manages ESP32, browser, CLI, and custom devices | Needs Redis |
| Task Scheduler | Cron, one-time, and interval task scheduling | Needs Redis |
| Web Dashboard | 17-page management UI | Included |
| Desktop App | Tauri 2 with system tray and global PTT hotkey | Separate download |
| CLI | 9 commands for setup, management, and interaction | Included |
Requirements
| Component | Version | Purpose |
|---|---|---|
| Node.js | 22+ | Required runtime |
| Redis | 7.2+ with RediSearch | Memory, scheduling, devices (optional) |
| Ollama | Latest | Local AI models and embeddings (optional) |
| Docker | Latest | Convenient Redis management (optional) |
Get started
Download the App
Install the desktop app and start chatting in minutes. No terminal required.
Developer Setup
Clone, install, and run from source with full CLI and API access.
Use Cases
Personal assistant, smart home hub, developer tool, IoT voice device.
FAQ
Common questions about setup, voice, memory, privacy, and integration.