CLUEWSC
CLUEWSC2020 is the Chinese version of the Winograd Schema Challenge, part of the CLUE benchmark. It focuses on pronoun disambiguation and coreference resolution, requiring models to determine which noun a pronoun refers to in a sentence. The dataset contains 1,244 training samples and 304 development samples extracted from contemporary Chinese literature.
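As a rough illustration of the task described above, the sketch below models one pronoun-resolution item and scores a list of predictions. It assumes the reported score is plain accuracy, which this page does not state. The `WscExample` class, its field names, and the two sample sentences are hypothetical and are not taken from the dataset.

```python
from dataclasses import dataclass


@dataclass
class WscExample:
    """One pronoun-resolution item (hypothetical layout, not the official schema)."""
    text: str      # the full sentence
    noun: str      # candidate noun phrase
    pronoun: str   # pronoun to resolve
    label: bool    # True if the pronoun refers to the candidate noun


def accuracy(examples, predictions):
    """Fraction of items where the predicted label matches the gold label."""
    if len(examples) != len(predictions):
        raise ValueError("examples and predictions must have the same length")
    if not examples:
        return 0.0
    correct = sum(ex.label == pred for ex, pred in zip(examples, predictions))
    return correct / len(examples)


# Illustrative items written for this sketch; they are NOT drawn from CLUEWSC2020.
EXAMPLES = [
    WscExample("小明把书还给了小红，因为她需要它。", noun="小红", pronoun="她", label=True),
    WscExample("小明把书还给了小红，因为她需要它。", noun="小明", pronoun="她", label=False),
]

if __name__ == "__main__":
    # A model that always answers True gets one of the two items right.
    print(accuracy(EXAMPLES, [True, True]))
```

Framing each item as a yes/no judgment about one noun and pronoun pair keeps scoring simple: a prediction is either equal to the gold label or not.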
Rows: 3
Primary metric: score
Sampled: 2026-05-06
Metadata
Metrics: Score, Normalized Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Kimi-k1.5 | 0.91 | — | Self-reported | 2026-05-06 |
| 2 | DeepSeek-V3 | 0.91 | DeepSeek V3 deepseek-deepseek-chat | Self-reported | 2026-05-06 |
| 3 | ERNIE 4.5 | 0.49 | ERNIE 4.5 300B A47B baidu-ernie-4.5-300b-a47b | Self-reported | 2026-05-06 |