The agtos CLI provides a text-based interface for managing the agtOS system. It communicates with the running server via the REST API on the health port (default 4102).
If none are found, it guides you to run agtos setup first.
Before starting, the CLI checks for port conflicts on all three ports (3000, 4100, 4102) and reports which process holds a conflicting port. Startup progress is displayed in real time with per-service status:
✓ Health endpoint
✓ Voice pipeline
✓ MCP server
✓ Task scheduler
✓ API routes

agtOS server is running!

Dashboard: http://localhost:4102
Health: http://localhost:4102/health
Voice WS: ws://localhost:3000/audio
MCP: http://localhost:4100/mcp
Startup timeout: 30 seconds. The server process receives forwarded SIGINT/SIGTERM signals for graceful shutdown.
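For scripts that start the server and then need to wait for it, polling the health URL listed above is one option. The following is a minimal sketch, not part of the agtos CLI: `wait_until` and `health_ok` are hypothetical helper names, and it assumes the health endpoint answers with a 2xx status once the server is ready. The 30-second budget in the usage comment mirrors the startup timeout.

```shell
# Poll a probe command once per second until it succeeds or the timeout expires.
# Usage: wait_until <timeout-seconds> <command> [args...]
wait_until() {
  timeout=$1
  shift
  elapsed=0
  while [ "$elapsed" -lt "$timeout" ]; do
    if "$@" >/dev/null 2>&1; then
      return 0
    fi
    sleep 1
    elapsed=$((elapsed + 1))
  done
  return 1
}

# Probe the health endpoint on the default health port (4102).
# Assumption: a ready server responds with a 2xx status; curl -f fails otherwise.
health_ok() {
  curl -fsS --max-time 2 "http://localhost:4102/health"
}

# Example usage (not executed here):
#   wait_until 30 health_ok && echo "agtOS is up"
```

Passing the probe as a command keeps the polling loop independent of the endpoint, so the same helper can wait on any other readiness check.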
Interactive first-run configuration wizard. Checks prerequisites, prompts for API keys, validates credentials against the actual provider API, configures model slots, and stores encrypted credentials.
agtos setup
The wizard performs these steps:
Node.js version check — requires 22+ (exits if older)
Docker check — optional, shows running status
Ollama check — optional, shows installed status
Anthropic API key — prompts for sk-ant-api03-... key from console.anthropic.com. Validates against the Anthropic API. Offers retry on failure. Press Enter to skip.
OpenAI API key — optional, prompts for sk-... key from platform.openai.com. Validates against the OpenAI API.
Model slot configuration — configures the chat slot (required) with provider and model selection, plus an optional reasoning slot for complex tasks. Providers: claude, openai, ollama.
Encrypt and store — saves credentials to ~/.agtos/credentials.json (AES-256-GCM encrypted) and slot configuration to ~/.agtos/config.json. If plaintext API keys exist in .env.local, they are migrated to the encrypted store and removed.
Summary — shows configuration status and next steps
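The Node.js requirement from the first step can also be verified ahead of time, for example in a provisioning script. This is a rough sketch under stated assumptions: `node_major` and `node_version_ok` are hypothetical helpers, not agtos commands, and the input is expected in the `v22.11.0` form that `node --version` prints.

```shell
# Extract the major version from a string such as "v22.11.0".
node_major() {
  version=${1#v}          # drop the leading "v", if present
  echo "${version%%.*}"   # keep everything before the first dot
}

# Succeed when the given version satisfies the 22+ requirement.
node_version_ok() {
  [ "$(node_major "$1")" -ge 22 ]
}

# Example usage (not executed here):
#   node_version_ok "$(node --version)" || echo "Node.js 22+ is required" >&2
```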
Interactive text chat with the agtOS agent. Uses the same agent reasoning loop as the voice pipeline.
agtos chat
The chat command verifies the server is reachable, then opens an interactive prompt that connects to POST /api/chat. Each message has a 30-second timeout. Session continuity is automatic: the server assigns a session ID on the first message and the CLI reuses it for subsequent messages.
Responses show metadata: duration, reasoning steps (if > 1), and tool calls (if any). Errors don’t exit the loop, so you can retry or continue chatting.
In-chat commands:
# Search memory
agtos memory search "last conversation about weather"

# Import memories from external AI tools (Claude, ChatGPT, etc.)
agtos memory import

# Trigger a memory health sweep on demand
agtos memory maintain
agtos memory maintain --verbose
agtos memory maintain --user alice
The search subcommand queries the episodic and semantic memory system via GET /api/memory/search. Multi-word queries are supported: agtos memory search foo bar baz searches for "foo bar baz". Returns up to 20 results with relevance scores.
The import subcommand scans for available external AI tool memories and imports them. Supported sources: Claude Code, Cursor, Windsurf, Aider, and GitHub Copilot.
The maintain subcommand triggers an on-demand memory lint sweep via POST /api/memory/maintain. It prints a summary (conclusions examined, contradictions, stale, orphans, redundant, pruned, decayed, dangling sources) and exits with one of four codes:
| Exit code | Meaning |
| --- | --- |
| 0 | Sweep completed successfully (report printed) |
| 1 | Request failed (network error, server error, parse error) |
| 2 | Sweep deferred by ResourceGuard: transient, retry later when the system is idle (HTTP 503, errorCode: RESOURCES_BUSY) |
| 3 | Memory profile manager is not connected: operator action required, e.g., Redis down (HTTP 503, errorCode: PROFILE_DISCONNECTED) |
| Flag | Description |
| --- | --- |
| --user <id> | Target a specific user ID (defaults to the single-user 'default') |
| --verbose, -v | Print the full issue list instead of just the summary |
The sweep can take 30-60 seconds on large knowledge bases; the CLI allows up to 120 seconds before timing out.
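Because the maintain exit codes separate transient deferrals from real failures, a scheduled job can retry only when that makes sense. The sketch below is one possible wrapper, not something shipped with agtos: `maintain_hint` and `run_maintain` are hypothetical names, and the retry count and delay defaults are arbitrary choices.

```shell
# Translate an `agtos memory maintain` exit code into a short hint for logs.
maintain_hint() {
  case "$1" in
    0) echo "sweep completed" ;;
    1) echo "request failed" ;;
    2) echo "deferred, retry when idle" ;;
    3) echo "profile manager disconnected, operator action required" ;;
    *) echo "unexpected exit code" ;;
  esac
}

# Run the sweep, retrying only while it is deferred (exit code 2).
# Usage: run_maintain [attempts] [delay-seconds]
run_maintain() {
  attempts=${1:-3}
  delay=${2:-60}
  i=1
  while :; do
    agtos memory maintain
    code=$?
    # Stop on anything other than a deferral, or when attempts run out.
    if [ "$code" -ne 2 ] || [ "$i" -ge "$attempts" ]; then
      return "$code"
    fi
    sleep "$delay"
    i=$((i + 1))
  done
}

# Example usage (not executed here):
#   run_maintain 5 120; echo "maintain: $(maintain_hint $?)"
```

Exit codes 1 and 3 are returned immediately, since retrying does not help when the request itself failed or the profile manager needs operator attention.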
Run a background agent task. The task is processed through the same agent reasoning loop as chat, but is designed for longer-running workloads (60-second timeout). Multi-word topics are supported.
agtos task "research current weather in San Francisco"
agtos task "summarize recent conversations"
Manage sherpa-onnx ONNX model files for the local voice pipeline. This command operates directly on the local filesystem and does not require the server to be running.
# List all available models with download status
agtos models list

# Download the default model set (~460MB)
agtos models download --default

# Download a specific model
agtos models download sensevoice-int8

# Show model details
agtos models info moonshine-tiny-en-int8

# Remove a downloaded model (prompts for confirmation)
agtos models remove sensevoice-int8

# Remove without confirmation prompt
agtos models remove sensevoice-int8 --force
Default model set (5 models needed for a full local voice pipeline):