MLPerf Inference v6.0
Audited MLCommons inference benchmark results for datacenter, edge, and GenAI workloads, including throughput, latency, and power-efficiency views.
2rows
offline_throughputprimary metric
2026-05-27sampled
Metadata
Metrics
Offline Throughput, Server Throughput, Interactive Throughput, ROUGE1, ROUGE2, ROUGEL
| Rank | Subject | Offline Throughput | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | AMD 87xMI355X llama2-70b-99 | 1042110 Tokens/s | — | Imported | 2026-05-27 |
| 2 | AMD 87xMI355X llama2-70b-99.9 | 1042110 Tokens/s | — | Imported | 2026-05-27 |
No matching rows.