RepoBench
RepoBench: Evaluates software-engineering agents on realistic issue resolution, repository navigation, testing, or maintenance workflows.
1 row
Primary metric: score
Sampled: 2026-05-27
Metadata
Metrics: Score, Normalized Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Codestral-22B | 0.34 | — | Self-reported | 2026-05-27 |