agtOS Documentation

agtOS is an open-source, voice-native AI agent platform. It provides the infrastructure for AI agents that can hear, speak, think, and act — connecting voice input/output with LLM reasoning, tool execution, and persistent memory.

Download the App

Install the desktop app and start chatting in minutes. No terminal required.

Developer Setup

Clone, install, and run from source with full CLI and API access.

Use Cases

Personal assistant, smart home hub, developer tool, IoT voice device.

API Reference

70+ REST endpoints, WebSocket voice protocol, and 10 MCP tools.

How it works

Voice In → VAD → STT → Model Router → Claude / Ollama → TTS → Voice Out

                     Agent Loop → Tool Registry → MCP Tools

                     Memory (Working + Episodic + Semantic)

agtOS processes voice through a cascade pipeline: speech is transcribed, routed to the best model (local or cloud), processed through an agent reasoning loop with tool access, and synthesized back to speech, all with sub-second perceived latency.
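
The cascade described above can be pictured as a list of stages applied in order to one voice turn. The following is a minimal sketch, not agtOS code: the `Turn` type, the stage functions, and the word-count routing rule are all hypothetical stand-ins, the "audio" is plain UTF-8 text, and VAD is left out for brevity.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Turn:
    """State carried through one voice turn (hypothetical shape)."""
    audio_in: bytes
    transcript: str = ""
    provider: str = ""
    reply: str = ""
    audio_out: bytes = b""


Stage = Callable[[Turn], Turn]


def stt(turn: Turn) -> Turn:
    # Stand-in for speech-to-text: the "audio" here is just UTF-8 text.
    turn.transcript = turn.audio_in.decode("utf-8")
    return turn


def route(turn: Turn) -> Turn:
    # Placeholder policy (an assumption): short requests stay on the local model.
    turn.provider = "ollama" if len(turn.transcript.split()) <= 5 else "claude"
    return turn


def agent(turn: Turn) -> Turn:
    # Stand-in for the agent loop; a real one would call the model and tools.
    turn.reply = f"[{turn.provider}] heard: {turn.transcript}"
    return turn


def tts(turn: Turn) -> Turn:
    # Stand-in for text-to-speech: encode the reply back to bytes.
    turn.audio_out = turn.reply.encode("utf-8")
    return turn


PIPELINE: List[Stage] = [stt, route, agent, tts]


def run_pipeline(audio: bytes, stages: List[Stage] = PIPELINE) -> Turn:
    """Pass one turn through every stage in order."""
    turn = Turn(audio_in=audio)
    for stage in stages:
        turn = stage(turn)
    return turn


result = run_pipeline(b"turn on the lights")
print(result.audio_out.decode("utf-8"))
```

Keeping each stage as a function from `Turn` to `Turn` makes it easy to swap one stage (for example, a different STT engine) without touching the others.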

Core capabilities

Voice Pipeline

STT → LLM → TTS cascade with barge-in, word timestamps, and streaming partial transcripts. In-process via sherpa-onnx.

Model Routing

Slot-based routing across Claude, OpenAI, OpenRouter, and Ollama. Intent classification maps requests to the right provider.
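
One way to read "slot-based routing" is as a table from intent slots to providers, with a classifier choosing the slot. The sketch below assumes that reading. The slot names, keyword lists, and the slot-to-provider assignments are invented for illustration and are not taken from agtOS.

```python
import re
from typing import Dict

# Hypothetical slots: each names a kind of work and the provider assigned to it.
SLOTS: Dict[str, str] = {
    "quick": "ollama",
    "reasoning": "claude",
    "code": "openai",
}

# Toy keyword lists standing in for a real intent classifier.
CODE_WORDS = {"code", "function", "bug", "compile"}
REASONING_WORDS = {"why", "explain", "compare", "plan"}


def classify_intent(text: str) -> str:
    """Return the slot name for a request using simple keyword matching."""
    words = set(re.findall(r"[a-z']+", text.lower()))
    if words & CODE_WORDS:
        return "code"
    if words & REASONING_WORDS:
        return "reasoning"
    return "quick"


def route_request(text: str, slots: Dict[str, str] = SLOTS) -> str:
    """Map a request to a provider by way of its intent slot."""
    return slots[classify_intent(text)]


for request in ("What time is it?", "Explain why the sky is blue"):
    print(request, "->", route_request(request))
```

Because the provider is looked up through the slot table, reassigning a slot (say, pointing `"reasoning"` at OpenRouter) changes routing without changing the classifier.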

Agent Memory

Working (session), episodic (Redis), and semantic (vector search) tiers. Dialectic reasoning and cross-tool import.
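
The three tiers can be sketched with in-memory stand-ins: a dict for working memory, a list in place of the Redis-backed episodic log, and a toy bag-of-words similarity in place of real vector search. The class and method names are hypothetical, and nothing here reflects how agtOS stores data internally.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple


def embed(text: str) -> Counter:
    # Toy embedding: bag-of-words counts. A real system would use a model.
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class AgentMemory:
    def __init__(self) -> None:
        self.working: Dict[str, str] = {}               # per-session scratch state
        self.episodic: List[str] = []                   # stand-in for a Redis log
        self.semantic: List[Tuple[Counter, str]] = []   # stand-in for a vector index

    def remember(self, event: str) -> None:
        """Append to the episodic log and index the event for similarity search."""
        self.episodic.append(event)
        self.semantic.append((embed(event), event))

    def recall(self, query: str) -> str:
        """Return the stored event most similar to the query, or '' if empty."""
        if not self.semantic:
            return ""
        q = embed(query)
        return max(self.semantic, key=lambda item: cosine(q, item[0]))[1]

    def end_session(self) -> None:
        # Working memory is session-scoped; the other tiers persist.
        self.working.clear()


memory = AgentMemory()
memory.working["topic"] = "lights"
memory.remember("user prefers warm white lights in the kitchen")
memory.remember("user asked for a weather report on monday")
best = memory.recall("kitchen lights")
memory.end_session()
print(best)
```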

MCP Integration

10 built-in tools + external server connections. Dynamic tool selection reduces context usage by 80-90%.
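
The idea behind dynamic tool selection is to send the model only the tool descriptions relevant to the current request rather than the full catalog. The sketch below illustrates that with keyword overlap. The tool names, descriptions, stopword list, and scoring rule are assumptions, and the printed saving counts descriptions rather than tokens, so it is not a reproduction of the figure quoted above.

```python
from typing import Dict, List

# Hypothetical catalog: 10 tool names mapped to one-line descriptions.
TOOLS: Dict[str, str] = {
    "get_weather": "current weather forecast for a city",
    "set_timer": "start a countdown timer",
    "control_light": "turn a smart home light on or off",
    "play_music": "play a song or playlist",
    "send_message": "send a text message to a contact",
    "search_web": "search the web for a query",
    "read_calendar": "list upcoming calendar events",
    "create_note": "save a note to memory",
    "run_shell": "run a shell command",
    "read_file": "read a file from disk",
}

STOPWORDS = {"a", "the", "to", "for", "or", "on", "of"}


def keywords(text: str) -> set:
    """Lowercased words with common stopwords removed."""
    return set(text.lower().split()) - STOPWORDS


def select_tools(request: str, tools: Dict[str, str] = TOOLS, k: int = 2) -> List[str]:
    """Return up to k tools whose descriptions share keywords with the request."""
    wanted = keywords(request)

    def score(name: str) -> int:
        return len(wanted & keywords(tools[name]))

    ranked = sorted(tools, key=score, reverse=True)
    return [name for name in ranked[:k] if score(name) > 0]


selected = select_tools("turn the kitchen light off")
saving = 1 - len(selected) / len(TOOLS)
print(selected, f"{saving:.0%} of tool descriptions omitted")
```

A production selector would more likely rank tools by embedding similarity than by keyword overlap, but the effect on the prompt is the same: fewer tool descriptions per request.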

Multi-Platform

Web dashboard, Tauri 2 desktop app with system tray, CLI, ESP32 hardware, and browser voice client.

Local-First

In-process STT/TTS/VAD via ONNX. Local model routing via Ollama. Cloud is optional, not required.

Choose your path

I want to try it

Download the app and start chatting in minutes.

I want to explore features

Voice pipeline, memory, MCP tools, dashboard, and more.

I want to build with it

REST API, WebSocket protocol, and MCP tool schemas.