TuRTLe Spec-to-RTL (Verilator)
TuRTLe leaderboard variant for spec-to-RTL generation, evaluated with Verilator on the VerilogEval S2R and RTLLM benchmarks.
Rows: 44
Primary metric: aggregated_score
Sampled: 2026-05-06
Metrics: Aggregated Score, Syntax, Functionality, Synthesis, Power, Performance, Area
| Rank | Subject | Aggregated Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Gemma-4-31B-it | 79.83 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-06 |
| 2 | Kimi-K2.5 | 79.73 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 3 | GLM-5-FP8 | 78.17 | GLM 5 z-ai-glm-5 | Imported | 2026-05-06 |
| 4 | DeepSeek R1-0528 | 75.83 | R1 0528 deepseek-deepseek-r1-0528 | Imported | 2026-05-06 |
| 5 | DeepSeek R1 | 75.78 | R1 deepseek-r1 | Imported | 2026-05-06 |
| 6 | DeepSeek V3.1 Terminus | 71.35 | DeepSeek V3.1 Terminus deepseek-deepseek-v3.1-terminus | Imported | 2026-05-06 |
| 7 | gpt-oss-120b | 70.18 | gpt-oss-120b openai-gpt-oss-120b | Imported | 2026-05-06 |
| 8 | Qwen3 235B A22B | 69.17 | Qwen3 235B A22B qwen-qwen3-235b-a22b | Imported | 2026-05-06 |
| 9 | Kimi K2 Instruct 0905 | 67.80 | MoonshotAI: Kimi K2 0905 moonshotai-kimi-k2-0905 | Imported | 2026-05-06 |
| 10 | Seed-OSS-36B | 67.75 | — | Imported | 2026-05-06 |
| 11 | QwQ 32B | 63.75 | — | Imported | 2026-05-06 |
| 12 | Gemini 2.5 Flash (Medium) | 63.27 | Gemini 2.5 Flash google-gemini-2.5-flash | Imported | 2026-05-06 |
| 13 | gpt-oss-20b | 63.20 | gpt-oss-20b openai-gpt-oss-20b | Imported | 2026-05-06 |
| 14 | InCoder-32B | 62.22 | — | Imported | 2026-05-06 |
| 15 | InCoder-32B-Thinking | 62.21 | — | Imported | 2026-05-06 |
| 16 | Qwen3 Coder 480B A35B | 61.46 | Qwen3 Coder 480B A35B qwen-qwen3-coder | Imported | 2026-05-06 |
| 17 | Qwen2.5 32B | 53.20 | — | Imported | 2026-05-06 |
| 18 | OriGen | 52.85 | — | Imported | 2026-05-06 |
| 19 | Llama 3.1 405B | 52.08 | — | Imported | 2026-05-06 |
| 20 | SeedCoder 8B | 52.04 | — | Imported | 2026-05-06 |
| 21 | Qwen2.5 72B | 51.72 | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-06 |
| 22 | Hermes-4-14B-Reasoning | 50.57 | — | Imported | 2026-05-06 |
| 23 | Qwen3-8B | 46.23 | Qwen3 8B qwen-qwen3-8b | Imported | 2026-05-06 |
| 24 | QwenCoder 2.5 32B | 45.72 | — | Imported | 2026-05-06 |
| 25 | HaVen-CodeQwen | 44.57 | — | Imported | 2026-05-06 |
| 26 | Hermes-4-14B | 44.04 | — | Imported | 2026-05-06 |
| 27 | SeedCoder 8B Reasoning | 43.41 | — | Imported | 2026-05-06 |
| 28 | Magistral Small 2506 | 41.02 | — | Imported | 2026-05-06 |
| 29 | StarChat2 15B v0.1 | 40.20 | — | Imported | 2026-05-06 |
| 30 | Llama 3.(1-3) 70B | 40.06 | — | Imported | 2026-05-06 |
| 31 | QwenCoder 2.5 14B | 39.39 | — | Imported | 2026-05-06 |
| 32 | RTLCoder DeepSeek | 38.48 | — | Imported | 2026-05-06 |
| 33 | CodeV R1 Distill Qwen 7B | 37.26 | — | Imported | 2026-05-06 |
| 34 | CodeLlama 70B | 34.55 | — | Imported | 2026-05-06 |
| 35 | DeepSeek Coder 6.7B | 34.14 | — | Imported | 2026-05-06 |
| 36 | OpenCoder 8B | 30.45 | — | Imported | 2026-05-06 |
| 37 | DeepSeek Coder 33B | 27.93 | — | Imported | 2026-05-06 |
| 38 | DeepCoder 14B | 27.06 | — | Imported | 2026-05-06 |
| 39 | DeepSeek R1 Distill Qwen 14B | 22.93 | — | Imported | 2026-05-06 |
| 40 | RTLCoder Mistral | 22.64 | — | Imported | 2026-05-06 |
| 41 | CodeV-QW-7B | 21.70 | — | Imported | 2026-05-06 |
| 42 | CodeV-DS-6.7B | 19.28 | — | Imported | 2026-05-06 |
| 43 | CodeV-CL-7B | 15.32 | — | Imported | 2026-05-06 |
| 44 | QwenCoder 2.5 7B | 14.91 | — | Imported | 2026-05-06 |
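
For readers who want to work with the table programmatically, the following is a minimal sketch of parsing its rows and checking that ranks follow the Aggregated Score in descending order. The function names `parse_leaderboard` and `is_consistent` and the dictionary keys are hypothetical, chosen for illustration. The sample uses three rows copied from the table above, and the sketch assumes the six-column layout shown there. It does not reproduce how the Aggregated Score itself is computed.

```python
from typing import Optional


def parse_leaderboard(markdown: str) -> list[dict]:
    """Parse data rows of the six-column leaderboard table into dictionaries."""
    rows = []
    for line in markdown.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        # Only data rows have six cells and an integer rank; this skips
        # the header row and the |---| separator row.
        if len(cells) != 6 or not cells[0].isdigit():
            continue
        rank, subject, score, match, provenance, sampled = cells
        model_match: Optional[str] = None if match == "—" else match
        rows.append({
            "rank": int(rank),
            "subject": subject,
            "score": float(score),
            "model_match": model_match,  # None when the table shows a dash
            "provenance": provenance,
            "sampled": sampled,
        })
    return rows


def is_consistent(rows: list[dict]) -> bool:
    """True if ranks strictly increase while scores never increase."""
    ranks = [r["rank"] for r in rows]
    scores = [r["score"] for r in rows]
    ranks_ok = all(a < b for a, b in zip(ranks, ranks[1:]))
    scores_ok = all(a >= b for a, b in zip(scores, scores[1:]))
    return ranks_ok and scores_ok


# Three rows copied verbatim from the table above.
SAMPLE = """
| Rank | Subject | Aggregated Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Gemma-4-31B-it | 79.83 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-06 |
| 2 | Kimi-K2.5 | 79.73 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 10 | Seed-OSS-36B | 67.75 | — | Imported | 2026-05-06 |
"""

rows = parse_leaderboard(SAMPLE)
for r in rows:
    print(r["rank"], r["subject"], r["score"], r["model_match"])
print("consistent:", is_consistent(rows))
```

Filtering on `model_match is None` separates subjects without a matched model entry from those with one.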