SQuALITY
SQuALITY (Summarization-format QUestion Answering with Long Input Texts, Yes!) is a long-document summarization dataset built by hiring highly-qualified contractors to read public-domain short stories (3000-6000 words) and write original summaries from scratch. Each document has five summaries: one overview and four question-focused summaries. Designed to address limitations in existing summarization datasets by providing high-quality, faithful summaries.
5rows
scoreprimary metric
2026-05-06sampled
Metadata
Metrics
Score, Normalized Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Phi-3.5-mini-instruct | 0.24 | — | Self-reported | 2026-05-06 |
| 2 | Phi-3.5-MoE-instruct | 0.24 | — | Self-reported | 2026-05-06 |
| 3 | Nova Pro | 0.20 | Nova Pro 1.0 amazon-nova-pro-v1 | Self-reported | 2026-05-06 |
| 4 | Nova Lite | 0.19 | Nova Lite 1.0 amazon-nova-lite-v1 | Self-reported | 2026-05-06 |
| 5 | Nova Micro | 0.19 | Nova Micro 1.0 amazon-nova-micro-v1 | Self-reported | 2026-05-06 |
No matching rows.