DeepResearch Bench Multilingual

Multilingual variant of DeepResearch Bench for evaluating deep research agents across translated prompt sets and languages.

0rows
overall_scoreprimary metric
sampled

Metadata

Metrics

Overall Score

Latest Results

Rank Subject Overall Score Model Match Provenance Sampled