Desktop Benchmark Starter
QitOS v0.5 now has one canonical multimodal path:DesktopEnv- the
desktop-starterbenchmark qitos.recipes.desktop.osworld_starterexamples/real/openai_cua_agent.py- qita visual replay
Why this is the release path
The desktop starter benchmark is the first path where QitOS can say:- one benchmark family is official
- one baseline agent is canonical
- one qita debugging workflow is documented
- one artifact schema is reused end to end
What the starter includes
- OSWorld-inspired desktop task shape
- screenshot-backed observations
- optional a11y / OCR / DOM / UI candidates
- provider-neutral GUI actions
- planner + grounding + action selector + critic baseline loop
- qita screenshot timeline, playback preview, and basic overlays
What it does not promise yet
- full official OSWorld runtime parity
- rich accessibility-tree execution across every provider
- enterprise-grade approval governance
- full visual replay depth planned for v0.6
osworld.