MobileMiniWob++_SR

MobileMiniWob++ SR (Success Rate) is an adaptation of the MiniWob++ web interaction benchmark for mobile Android environments within AndroidWorld. It comprises 92 web interaction tasks adapted for touch-based mobile interfaces, evaluating agents' ability to navigate and interact with web applications on mobile devices.

2rows
scoreprimary metric
2026-05-06sampled

Metadata

Metrics

Score, Normalized Score

Latest Results

Rank Subject Score Model Match Provenance Sampled
1 Qwen2.5 VL 7B Instruct 0.91 Self-reported 2026-05-06
2 Qwen2.5 VL 72B Instruct 0.68 Qwen2.5 VL 72B Instruct
qwen-qwen2.5-vl-72b-instruct
Self-reported 2026-05-06