CUAD
CUAD: Measures legal reasoning, contract review, statute interpretation, or legal-domain QA.
10rows
precision_at_80_recallprimary metric
2026-05-27sampled
Metadata
Metrics
Precision at 80% recall, AUPR, Precision at 90% recall
| Rank | Subject | Precision at 80% recall | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | DeBERTa-xlarge | 44.0 | — | Imported | 2026-05-27 |
| 2 | RoBERTa-large | 38.1 | — | Imported | 2026-05-27 |
| 3 | RoBERTa-base + Contracts Pretraining | 34.1 | — | Imported | 2026-05-27 |
| 4 | RoBERTa-base | 31.1 | — | Imported | 2026-05-27 |
| 5 | ALBERT-xxlarge | 31.0 | — | Imported | 2026-05-27 |
| 6 | ALBERT-large | 20.9 | — | Imported | 2026-05-27 |
| 7 | ALBERT-xlarge | 20.5 | — | Imported | 2026-05-27 |
| 8 | ALBERT-base | 11.1 | — | Imported | 2026-05-27 |
| 9 | BERT-base | 8.2 | — | Imported | 2026-05-27 |
| 10 | BERT-large | 7.6 | — | Imported | 2026-05-27 |
No matching rows.