TuRTLe Spec-to-RTL (Icarus Verilog)
TuRTLe leaderboard variant for spec-to-RTL generation evaluated with Icarus Verilog across VerilogEval S2R and RTLLM.
44rows
aggregated_scoreprimary metric
2026-05-06sampled
Metadata
Metrics
Aggregated Score, Syntax, Functionality, Synthesis, Power, Performance, Area
| Rank | Subject | Aggregated Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Gemma-4-31B-it | 81.51 | Gemma 4 31B google-gemma-4-31b-it | Imported | 2026-05-06 |
| 2 | Kimi-K2.5 | 81.47 | MoonshotAI: Kimi K2.5 moonshotai-kimi-k2.5 | Imported | 2026-05-06 |
| 3 | GLM-5-FP8 | 79.46 | GLM 5 z-ai-glm-5 | Imported | 2026-05-06 |
| 4 | DeepSeek R1-0528 | 76.79 | R1 0528 deepseek-deepseek-r1-0528 | Imported | 2026-05-06 |
| 5 | DeepSeek R1 | 75.53 | R1 deepseek-r1 | Imported | 2026-05-06 |
| 6 | DeepSeek V3.1 Terminus | 71.79 | DeepSeek V3.1 Terminus deepseek-deepseek-v3.1-terminus | Imported | 2026-05-06 |
| 7 | gpt-oss-120b | 70.52 | gpt-oss-120b openai-gpt-oss-120b | Imported | 2026-05-06 |
| 8 | Qwen3 236B A22B | 69.16 | Qwen3 235B A22B qwen-qwen3-235b-a22b | Imported | 2026-05-06 |
| 9 | Kimi K2 Instruct 0905 | 68.72 | MoonshotAI: Kimi K2 0905 moonshotai-kimi-k2-0905 | Imported | 2026-05-06 |
| 10 | Seed-OSS-36B | 67.51 | — | Imported | 2026-05-06 |
| 11 | gpt-oss-20b | 63.70 | gpt-oss-20b openai-gpt-oss-20b | Imported | 2026-05-06 |
| 12 | Gemini 2.5 Flash (Medium) | 63.55 | Gemini 2.5 Flash google-gemini-2.5-flash | Imported | 2026-05-06 |
| 13 | QwQ 32B | 62.60 | — | Imported | 2026-05-06 |
| 14 | InCoder-32B-Thinking | 61.86 | — | Imported | 2026-05-06 |
| 15 | InCoder-32B | 60.79 | — | Imported | 2026-05-06 |
| 16 | Qwen3 Coder 480B A35B | 60.55 | Qwen3 Coder 480B A35B qwen-qwen3-coder | Imported | 2026-05-06 |
| 17 | Llama 3.1 405B | 53.22 | — | Imported | 2026-05-06 |
| 18 | OriGen | 52.88 | — | Imported | 2026-05-06 |
| 19 | SeedCoder 8B | 50.89 | — | Imported | 2026-05-06 |
| 20 | Qwen2.5 32B | 50.39 | — | Imported | 2026-05-06 |
| 21 | Hermes-4-14B-Reasoning | 50.32 | — | Imported | 2026-05-06 |
| 22 | Qwen2.5 72B | 49.36 | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-06 |
| 23 | Qwen3-8B | 45.10 | Qwen3 8B qwen-qwen3-8b | Imported | 2026-05-06 |
| 24 | QwenCoder 2.5 32B | 44.02 | — | Imported | 2026-05-06 |
| 25 | SeedCoder 8B Reasoning | 43.75 | — | Imported | 2026-05-06 |
| 26 | HaVen-CodeQwen | 43.58 | — | Imported | 2026-05-06 |
| 27 | Hermes-4-14B | 42.77 | — | Imported | 2026-05-06 |
| 28 | Magistral Small 2506 | 40.82 | — | Imported | 2026-05-06 |
| 29 | Llama 3.(1-3) 70B | 39.48 | — | Imported | 2026-05-06 |
| 30 | StarChat2 15B v0.1 | 38.76 | — | Imported | 2026-05-06 |
| 31 | QwenCoder 2.5 14B | 37.69 | — | Imported | 2026-05-06 |
| 32 | RTLCoder DeepSeek | 37.22 | — | Imported | 2026-05-06 |
| 33 | CodeV R1 Distill Qwen 7B | 36.12 | — | Imported | 2026-05-06 |
| 34 | CodeLlama 70B | 33.04 | — | Imported | 2026-05-06 |
| 35 | DeepSeek Coder 6.7B | 31.88 | — | Imported | 2026-05-06 |
| 36 | OpenCoder 8B | 30.06 | — | Imported | 2026-05-06 |
| 37 | DeepSeek Coder 33B | 27.03 | — | Imported | 2026-05-06 |
| 38 | DeepCoder 14B | 26.41 | — | Imported | 2026-05-06 |
| 39 | DeepSeek R1 Distill Qwen 14B | 23.14 | — | Imported | 2026-05-06 |
| 40 | RTLCoder Mistral | 21.82 | — | Imported | 2026-05-06 |
| 41 | CodeV-QW-7B | 20.37 | — | Imported | 2026-05-06 |
| 42 | CodeV-DS-6.7B | 19.62 | — | Imported | 2026-05-06 |
| 43 | CodeV-CL-7B | 14.73 | — | Imported | 2026-05-06 |
| 44 | QwenCoder 2.5 7B | 14.15 | — | Imported | 2026-05-06 |
No matching rows.