SQuALITY

SQuALITY (Summarization-format QUestion Answering with Long Input Texts, Yes!) is a long-document summarization dataset built by hiring highly-qualified contractors to read public-domain short stories (3000-6000 words) and write original summaries from scratch. Each document has five summaries: one overview and four question-focused summaries. Designed to address limitations in existing summarization datasets by providing high-quality, faithful summaries.

5rows
scoreprimary metric
2026-05-06sampled

Metadata

Metrics

Score, Normalized Score

Latest Results

Rank Subject Score Model Match Provenance Sampled
1 Phi-3.5-mini-instruct 0.24 Self-reported 2026-05-06
2 Phi-3.5-MoE-instruct 0.24 Self-reported 2026-05-06
3 Nova Pro 0.20 Nova Pro 1.0
amazon-nova-pro-v1
Self-reported 2026-05-06
4 Nova Lite 0.19 Nova Lite 1.0
amazon-nova-lite-v1
Self-reported 2026-05-06
5 Nova Micro 0.19 Nova Micro 1.0
amazon-nova-micro-v1
Self-reported 2026-05-06