QitOS gives you a stableDocumentation Index
Fetch the complete documentation index at: https://qitor.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
AgentModule + Engine kernel for building reproducible LLM agents. Whether you are prototyping ReAct loops, running GAIA benchmarks, or shipping a production coding agent, QitOS keeps your runs traceable, your patterns composable, and your results reproducible.
A long-running QitOS coding run stays readable — it does not degrade into opaque glue code.

Two ways to author agents in QitOS
QitOS keeps two authoring paths intentionally.Research-first
This is the default path for researchers who want the most control. You handwrite:- the system prompt
- the parser (the component that converts raw model output into a structured Decision)
- the model protocol (the output format contract expected from the model)
- the model transport (the client that sends requests to the model API)
- the tool surface (the set of tools the agent can call)
Preset-first
This is the fastest path when you want a stable baseline or quick model-family switching. You start from:- family presets (pre-configured defaults for a model family, covering protocol, transport, and tool delivery)
- harness policies (rules that control how the model harness resolves at runtime)
- preset tool builders (pre-assembled tool bundles for common workflows)
gpt-oss, and Gemma 4 without rewriting the agent.
Who QitOS is for
QitOS is built for three kinds of practitioners:- Researchers — prototype ReAct, PlanAct, Tree-of-Thought, Reflexion, and new agent methods, then diff, replay, and publish the runs.
- Agent builders — build tool-using agents on a stable execution loop, instead of piling framework glue code on top of raw LLM calls.
- Evaluators — run GAIA, Tau-Bench, and CyBench workflows on the same kernel you use in product agents, so benchmark results actually transfer to real use.
Key capabilities
Reproducible runs
Every QitOS run writes amanifest.json, events.jsonl, and steps.jsonl to a local run directory. The manifest captures the model ID, prompt hash, config hash, seed, and tool manifest (the serialized description of all tools available to the run) — everything you need to reproduce or compare runs exactly.
Built-in observability with qita
qita is QitOS’s built-in trace viewer. After any run, launch the board to inspect step-by-step execution, replay the trajectory (the temporal sequence of prompts, decisions, tool calls, and observations across steps), and export traces to standalone HTML.
http://127.0.0.1:8765 and auto-refreshes as new runs appear.
Canonical agent patterns
QitOS ships with canonical implementations of four established reasoning patterns, each runnable fromexamples/patterns/:
| Pattern | Description |
|---|---|
| ReAct | A reasoning-acting loop where the model alternates between thinking and taking one action per step |
| PlanAct | Generate an explicit plan first, then execute it step by step |
| Tree-of-Thought | Branch into multiple candidate paths, score them, then choose the best one |
| Reflexion | An actor-critic loop where a critic evaluates each step and the actor retries based on feedback |
Benchmark adapters
QitOS includes adapters for GAIA, Tau-Bench, and CyBench that run on the sameAgentModule + Engine kernel you use for your own agents. You don’t need a separate evaluation harness.
Featured designs
These are the design decisions that distinguish QitOS from a loose collection of agent utilities.Single-kernel architecture
QitOS is opinionated about one thing above all: there is one runtime kernel per run.AgentModuledefines policy (what the agent should do at each step)Engineowns execution (the actual step loop)- tools, parsers, critics, memory, and tracing attach to that kernel instead of spawning separate orchestrators
Protocol-aware prompting and parsing
QitOS treats prompt format and parser choice as a first-class contract — they must match each other.- ReAct prompts pair with
ReActTextParser - JSON prompts pair with
JsonDecisionParser - XML prompts pair with
XmlDecisionParser - more structured variants such as Terminus and MiniMax tool-call parsers follow the same model-response → parser →
Decisionpath
Preset-first agent authoring
QitOS gives you reusable authoring blocks instead of forcing every agent to rebuild the same wiring.- preset tool bundles such as
coding_tools(...),advanced_coding_tools(...),web_tools(),task_tools(...), andsecurity_audit_tools(...) - reusable memory adapters such as
WindowMemory,SummaryMemory,VectorMemory, andMarkdownFileMemory - reusable history strategies such as
WindowHistory,TokenBudgetSummaryHistory, andCompactHistory - reusable planners such as
NumberedPlanBuilderandDynamicTreeSearch
Long-running context control
Long-running agents are a first-class concern in QitOS. You can control context growth through:HistoryPolicyfor selecting which messages to send to the model- token-budget-aware summarization with
TokenBudgetSummaryHistory - multi-stage compaction (gradually compressing older context while keeping recent messages intact) with
CompactHistory - memory adapters for semantic or persistent recall across steps
Trace-first observability with qita
QitOS assumes that if a run matters, it should be inspectable afterward. Every traced run produces structured artifacts (saved files that record what happened during the run), andqita turns them into:
- a board for comparing runs
- replay for step-by-step inspection
- export for shareable standalone HTML
Domain specialization without a new runtime
One of the strongest design features in QitOS is that domain agents are still ordinary QitOS agents — they do not need a separate runtime. The Claude Code-style agent, benchmark runners, and the code security audit agent all reuse the same runtime. Domain behavior is expressed through:- state design
- prompt policy
- tool composition
reduce()semantics
Status
Where to go next
Quick start
Run your first agent in under 2 minutes
Tutorials
Follow the four-lesson research track for designing agents in the QitOS mindmap
Installation
Install QitOS and its optional extras
Core concepts
Understand AgentModule, Engine, State, and Tools
Agent patterns
ReAct, PlanAct, Tree-of-Thought, and Reflexion examples
Kit reference
Explore pre-built tools, memory, parsers, planners, and history strategies
Tracing & qita
Learn how QitOS makes every run inspectable, replayable, and exportable
