VLABench
Vision-language-action benchmark for evaluating robotic policy models through the AllenAI VLA evaluation harness leaderboard.
4rows
scoreprimary metric
2026-05-06sampled
Metadata
Metrics
VLA score
| Rank | Subject | VLA score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | lerobot/pi0_base | 44.50 | — | Imported | 2026-05-06 |
| 2 | lerobot/pi05_base | 42 | — | Imported | 2026-05-06 |
| 3 | nvidia/GR00T-N1-2B | 39.70 | — | Imported | 2026-05-06 |
| 4 | lerobot/pi0fast-base | 34.10 | — | Imported | 2026-05-06 |
No matching rows.