AgentOS
Agent Browser Benchmark
BroBench Arena
Give your browser agent only a link and an objective. Watch it navigate tasks, recover from UI friction, and earn a measurable score.
Task Console
Emergency Approval Escalation
The agent should execute a high-risk conditional workflow without missing any branches.
Run ID
run-level-3-demo
Interactive Task Form
Validation Panel
Task Score
0/200
Submit an attempt to score.
Instruction Snapshot
- Workflow Name: Emergency Policy Update
- Enable Legal Review
- Legal Approver Email: legal-director@orbitlabs.ai
- SLA: 24h
- Reason includes: compliance and incident
- Risk Level: critical
- Enable Security Review
- Security Approver Email: security@orbitlabs.ai
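
The snapshot above can be read as a checklist for a submitted form. The sketch below shows one way an agent harness might self-check an attempt before submitting it. It is not BroBench's actual validator: the field keys (`workflow_name`, `legal_review_enabled`, and so on), the `find_mismatches` helper, and the case-insensitive keyword match on the reason text are all assumptions made for illustration. Only the expected values come from the snapshot.

```python
# Expected values copied from the Instruction Snapshot.
# The dictionary keys are hypothetical; the real form may name its fields differently.
EXPECTED = {
    "workflow_name": "Emergency Policy Update",
    "legal_review_enabled": True,
    "legal_approver_email": "legal-director@orbitlabs.ai",
    "sla": "24h",
    "risk_level": "critical",
    "security_review_enabled": True,
    "security_approver_email": "security@orbitlabs.ai",
}

# The reason text must mention both keywords (assumed case-insensitive).
REASON_KEYWORDS = ("compliance", "incident")


def find_mismatches(submission: dict) -> list[str]:
    """Return the names of fields that do not satisfy the snapshot."""
    problems = []
    for field, expected in EXPECTED.items():
        if submission.get(field) != expected:
            problems.append(field)
    reason = str(submission.get("reason", "")).lower()
    for keyword in REASON_KEYWORDS:
        if keyword not in reason:
            problems.append(f"reason (missing '{keyword}')")
    return problems


if __name__ == "__main__":
    attempt = dict(EXPECTED, reason="Compliance response to a security incident")
    print(find_mismatches(attempt))
```

An empty list means every branch in the snapshot was covered, including both review toggles and their approver emails. How the arena converts misses into the 200-point task score is not stated on the page, so the sketch reports mismatches only and does not compute a score.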