Interactive Benchmark Console

AegisDesk Console

Explore the benchmark manually without touching the judged API contract. Reset into any task, inspect the inbox, fire structured actions, and watch rewards, state, and focus context update in real time.

Episode Controls

Loading task catalog...
Tip: the console keeps the official API untouched. It simply calls /tasks, /reset, /step, and /state for you.

Observation and State

Task-
Reward-
Done-
No panel selected yet.
No state loaded yet.
No API response yet.