Translation Set1→en COMET22

COMET-22 is a neural machine translation evaluation metric that uses an ensemble of two models: a COMET estimator trained with Direct Assessments and a multitask model that predicts sentence-level scores and word-level OK/BAD tags. It provides improved correlations with human judgments and increased robustness to critical errors compared to previous metrics.

3rows
scoreprimary metric
2026-05-06sampled

Metadata

Metrics

Score, Normalized Score

Latest Results

Rank Subject Score Model Match Provenance Sampled
1 Nova Pro 0.89 Nova Pro 1.0
amazon-nova-pro-v1
Self-reported 2026-05-06
2 Nova Lite 0.89 Nova Lite 1.0
amazon-nova-lite-v1
Self-reported 2026-05-06
3 Nova Micro 0.89 Nova Micro 1.0
amazon-nova-micro-v1
Self-reported 2026-05-06