What is agtOS?
agtOS is an open-source, voice-native AI agent platform. It provides the infrastructure layer for AI agents that can hear, speak, think, and act — connecting voice input/output with LLM reasoning and tool execution.Key Features
Voice Pipeline
STT → LLM → TTS cascade with WebSocket streaming, word timestamps, and barge-in support
Multi-Provider AI
Claude (API key + Max subscription), Ollama (local), model routing with automatic fallback
MCP Integration
9 built-in MCP tools + external MCP server connections for extensibility
Memory System
Three-tier: working (per-session), episodic (Redis), semantic (vector search)
Desktop App
Tauri 2 native shell with system tray, global hotkey, auto-update
Task Scheduling
Redis-backed cron, once, and interval task execution
Architecture
agtOS uses a dual-layer architecture:- Infrastructure Layer: Voice pipeline, audio processing, WebSocket transport
- Orchestration Layer: AI-driven via protocol gateway (MCP + A2A ready)
Getting Started
Quickstart
Get running in 5 minutes
Installation
Detailed setup guide