What each file means
| File | Purpose |
|---|---|
manifest.json | Run summary, reproducibility metadata, benchmark metadata, and official-run fields |
events.jsonl | Event stream across runtime phases |
steps.jsonl | One structured record per completed step |
qita is the human inspection surface built on top of them.
Why tracing is a first-class feature
QitOS is built for agent research, not only one-off demos. That means the framework must preserve:- how a run stopped
- what prompt/parser contract it used
- what tool surface it saw
- how context changed over time
- which config fields matter for replay and comparison
AgentModule.run(...).
Trace metadata in v0.3
The v0.3 closure adds stronger reproducibility metadata to the manifest, including:git_shapackage_versionbenchmark_namebenchmark_splitmodel_familyprompt_protocolparser_nametool_manifestrun_specexperiment_specofficial_runreplay_mode- token / latency / cost summaries
Best-effort replay
Tracing in QitOS supports best-effort research replay. That means QitOS records enough information to inspect and compare runs well, but it does not promise strict deterministic re-execution for remote models or external environments. Use traces for:- debugging long trajectories
- comparing prompt/parser/tool changes
- exporting artifacts for review
- replaying benchmark failures
qita on top of traces
Once traces exist, use:qita also supports run comparison so you can ask why two runs diverged instead of reading raw JSON by hand.
