DeepResearch Bench Multilingual
Multilingual variant of DeepResearch Bench for evaluating deep research agents across translated prompt sets and languages.
0 rows
Primary metric: overall_score
Sampled: —
Overall Score
| Rank | Subject | Overall Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
No matching rows.