RoboCasa365

Generalist robot-policy leaderboard over RoboCasa atomic and composite kitchen manipulation tasks, including seen and unseen settings.

7 rows
Primary metric: Overall
Sampled: 2026-05-27

Metadata

Metrics

Overall, Atomic-Seen, Composite-Seen, Composite-Unseen

Latest Results

Rows parsed from the public RoboCasa365 leaderboard. Overall is the published average task success rate over the 50-task multi-task benchmark.

Rank  Subject               Overall  Model Match  Provenance  Sampled
1     RLDX-1                33.2                  Imported    2026-05-27
2     GR00T N1.5            23.9                  Imported    2026-05-27
3     GR00T N1.6            21.9                  Imported    2026-05-27
4     GigaWorld-Policy 0.1  20.7                  Imported    2026-05-27
5     π0.5                  16.9                  Imported    2026-05-27
6     π0                    14.8                  Imported    2026-05-27
7     Diffusion Policy      6.1                   Imported    2026-05-27
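
For readers who want to work with these rows programmatically, the following is a minimal Python sketch that loads the seven entries listed above, re-derives the ranking from the Overall column, and reports each entry's gap to the leader. The names `ROWS`, `Entry`, and `rank_entries` are illustrative assumptions and are not part of any RoboCasa365 tooling; the scores and dates are copied from the rows above.

```python
from dataclasses import dataclass
from datetime import date

# Rows copied from the leaderboard above: (subject, overall, sampled date).
# The Model Match column is empty for every row, so it is omitted here.
ROWS = [
    ("RLDX-1", 33.2, "2026-05-27"),
    ("GR00T N1.5", 23.9, "2026-05-27"),
    ("GR00T N1.6", 21.9, "2026-05-27"),
    ("GigaWorld-Policy 0.1", 20.7, "2026-05-27"),
    ("π0.5", 16.9, "2026-05-27"),
    ("π0", 14.8, "2026-05-27"),
    ("Diffusion Policy", 6.1, "2026-05-27"),
]


@dataclass(frozen=True)
class Entry:
    """One leaderboard row (hypothetical container, not an official schema)."""
    rank: int
    subject: str
    overall: float
    sampled: date


def rank_entries(rows):
    """Sort rows by Overall, highest first, and assign 1-based ranks."""
    ordered = sorted(rows, key=lambda row: row[1], reverse=True)
    return [
        Entry(rank, subject, overall, date.fromisoformat(sampled))
        for rank, (subject, overall, sampled) in enumerate(ordered, start=1)
    ]


entries = rank_entries(ROWS)
leader = entries[0]

for entry in entries:
    # Gap to the top entry, rounded to the table's one-decimal precision.
    gap = round(leader.overall - entry.overall, 1)
    print(f"{entry.rank:>2}  {entry.subject:<22} {entry.overall:>5.1f}  gap={gap:.1f}")
```

Sorting on the primary metric rather than trusting the stored rank keeps the sketch usable if rows are appended out of order in a later import.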