Explore the benchmark manually without touching the judged API contract.
Reset into any task, inspect the inbox, fire structured actions, and watch
rewards, state, and focus context update in real time.
Episode Controls
Tip: the console keeps the official API untouched. It simply calls
/tasks, /reset, /step, and /state for you.
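
The same four calls can be made without the console. The following is a minimal sketch, not the console's actual code. Only the endpoint paths /tasks, /reset, /step, and /state come from the tip above. The base URL, the HTTP methods (GET for /tasks and /state, POST for /reset and /step), and the JSON payload keys `task_id` and `action` are assumptions for illustration, so check them against the real API before relying on them.

```python
import json
from typing import Any, Callable, Optional
from urllib import request

# Hypothetical base URL; point it at wherever the benchmark server runs.
BASE_URL = "http://localhost:8000"

# A transport takes (method, url, payload) and returns the decoded JSON reply.
Transport = Callable[[str, str, Optional[dict]], Any]


def http_transport(method: str, url: str, payload: Optional[dict]) -> Any:
    """Send one JSON request with the standard library and decode the reply."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    req = request.Request(
        url,
        data=data,
        method=method,
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


class ConsoleClient:
    """Thin wrapper over the four endpoints the console calls."""

    def __init__(self, base_url: str = BASE_URL,
                 transport: Transport = http_transport) -> None:
        self.base_url = base_url.rstrip("/")
        self.transport = transport

    def tasks(self) -> Any:
        # Assumed: GET /tasks returns the task catalog.
        return self.transport("GET", f"{self.base_url}/tasks", None)

    def reset(self, task_id: str) -> Any:
        # Assumed: POST /reset with {"task_id": ...} starts an episode.
        return self.transport("POST", f"{self.base_url}/reset",
                              {"task_id": task_id})

    def step(self, action: dict) -> Any:
        # Assumed: POST /step with {"action": {...}} applies one structured action.
        return self.transport("POST", f"{self.base_url}/step",
                              {"action": action})

    def state(self) -> Any:
        # Assumed: GET /state returns the current episode state.
        return self.transport("GET", f"{self.base_url}/state", None)
```

With a server running, a manual episode would amount to `client = ConsoleClient()`, then `client.tasks()`, `client.reset(...)`, one or more `client.step(...)` calls, and `client.state()` to inspect the result. The transport is injectable so the wrapper can be exercised against a stub without a live server.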