RepoQA

RepoQA: Evaluates software-engineering agents on realistic issue resolution, repository navigation, testing, or maintenance workflows.

2rows
scoreprimary metric
2026-05-27sampled

Metadata

Metrics

Score, Normalized Score

Latest Results

Rows are imported from the public ZeroEval/LLM-Stats RepoQA benchmark details JSON endpoint. Source verification and self-report metadata are preserved.

Rank Subject Score Model Match Provenance Sampled
1 Phi-3.5-MoE-instruct 0.85 Self-reported 2026-05-27
2 Phi-3.5-mini-instruct 0.77 Self-reported 2026-05-27