Qasper

QASPER is a dataset of 5,049 information-seeking questions and answers anchored in 1,585 NLP research papers. Questions are written by NLP practitioners who read only titles and abstracts, while answers require understanding the full paper text and provide supporting evidence. The dataset challenges models with complex reasoning across document sections for academic document question answering. Each question seeks information present in the full text and is answered by a separate set of NLP practitioners who also provide supporting evidence to answers.

2rows

scoreprimary metric

2026-05-06sampled

Metadata

ID: qasper
Category: Long Context
Release: 2021-05-07
Source: Source page
Snapshot: Snapshot source
Post: Announcement post

Metrics

Score, Normalized Score

Rank	Subject	Score	Model Match	Provenance	Sampled
1	Phi-3.5-mini-instruct	0.42	—	Self-reported	2026-05-06
2	Phi-3.5-MoE-instruct	0.40	—	Self-reported	2026-05-06

Metadata

Metrics

Latest Results