SWE-bench JavaScript

SWE-bench JavaScript: Evaluates software-engineering agents on realistic issue resolution, repository navigation, testing, or maintenance workflows.

0rows
scoreprimary metric
sampled

Metadata

Metrics

Score

Latest Results

Rank Subject Score Model Match Provenance Sampled