Skip to main content

Quickstart

Get Akio running in under a minute.

1. Pull a model

akio pull ggml-org/Qwen3-0.6B-GGUF

2. Start a session

akio run -m Qwen3-0.6B-Q4_0.gguf

That's it. Akio will load the model and drop you into an interactive chat where the agent can autonomously use tools like shell execution, file I/O, and web search.

CLI Options

akio run -m <model.gguf> -c <context_size> --ngl <gpu_layers>
FlagDescriptionDefault
-mGGUF model file name
-cContext window size (tokens)model default
--nglNumber of layers offloaded to GPU99

Other Commands

akio list # List cached models
akio list --all # List all available models
akio rm <repo> # Remove a downloaded model