TuRTLe Code Completion (Icarus Verilog)
TuRTLe leaderboard variant for RTL code completion evaluated with Icarus Verilog across VerilogEval MC and VeriGen.
44 rows
Primary metric: aggregated_score
Sampled: 2026-05-06
Metadata
Metrics: Aggregated Score, Syntax, Functionality, Synthesis, Power, Performance, Area
| Rank | Subject | Aggregated Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | GLM-5-FP8 | 83.98 | GLM 5 (z-ai-glm-5) | Imported | 2026-05-06 |
| 2 | Kimi-K2.5 | 83.38 | MoonshotAI: Kimi K2.5 (moonshotai-kimi-k2.5) | Imported | 2026-05-06 |
| 3 | Gemma-4-31B-it | 82.57 | Gemma 4 31B (google-gemma-4-31b-it) | Imported | 2026-05-06 |
| 4 | DeepSeek R1-0528 | 78.86 | R1 0528 (deepseek-deepseek-r1-0528) | Imported | 2026-05-06 |
| 5 | gpt-oss-120b | 77.82 | gpt-oss-120b (openai-gpt-oss-120b) | Imported | 2026-05-06 |
| 6 | DeepSeek R1 | 77.00 | R1 (deepseek-r1) | Imported | 2026-05-06 |
| 7 | DeepSeek V3.1 Terminus | 76.57 | DeepSeek V3.1 Terminus (deepseek-deepseek-v3.1-terminus) | Imported | 2026-05-06 |
| 8 | Seed-OSS-36B | 73.85 | — | Imported | 2026-05-06 |
| 9 | Kimi K2 Instruct 0905 | 71.77 | MoonshotAI: Kimi K2 0905 (moonshotai-kimi-k2-0905) | Imported | 2026-05-06 |
| 10 | Gemini 2.5 Flash (Medium) | 69.84 | Gemini 2.5 Flash (google-gemini-2.5-flash) | Imported | 2026-05-06 |
| 11 | Qwen3 235B A22B | 67.54 | Qwen3 235B A22B (qwen-qwen3-235b-a22b) | Imported | 2026-05-06 |
| 12 | gpt-oss-20b | 66.48 | gpt-oss-20b (openai-gpt-oss-20b) | Imported | 2026-05-06 |
| 13 | Qwen3 Coder 480B A35B | 57.84 | Qwen3 Coder 480B A35B (qwen-qwen3-coder) | Imported | 2026-05-06 |
| 14 | InCoder-32B-Thinking | 54.80 | — | Imported | 2026-05-06 |
| 15 | Llama 3.1 405B | 54.74 | — | Imported | 2026-05-06 |
| 16 | InCoder-32B | 52.82 | — | Imported | 2026-05-06 |
| 17 | OriGen | 52.53 | — | Imported | 2026-05-06 |
| 18 | CodeV-QW-7B | 51.58 | — | Imported | 2026-05-06 |
| 19 | Qwen2.5 72B | 50.41 | Qwen2.5 72B Instruct (qwen-qwen-2.5-72b-instruct) | Imported | 2026-05-06 |
| 20 | HaVen-CodeQwen | 48.69 | — | Imported | 2026-05-06 |
| 21 | Qwen3-8B | 48.16 | Qwen3 8B (qwen-qwen3-8b) | Imported | 2026-05-06 |
| 22 | SeedCoder 8B Reasoning | 48.07 | — | Imported | 2026-05-06 |
| 23 | CodeV-DS-6.7B | 47.44 | — | Imported | 2026-05-06 |
| 24 | QwenCoder 2.5 32B | 45.09 | — | Imported | 2026-05-06 |
| 25 | Llama 3.(1-3) 70B | 43.15 | — | Imported | 2026-05-06 |
| 26 | Qwen2.5 32B | 41.40 | — | Imported | 2026-05-06 |
| 27 | QwenCoder 2.5 14B | 40.99 | — | Imported | 2026-05-06 |
| 28 | QwQ 32B | 40.05 | — | Imported | 2026-05-06 |
| 29 | DeepSeek Coder 33B | 38.47 | — | Imported | 2026-05-06 |
| 30 | StarChat2 15B v0.1 | 38.19 | — | Imported | 2026-05-06 |
| 31 | RTLCoder DeepSeek | 37.86 | — | Imported | 2026-05-06 |
| 32 | Hermes-4-14B-Reasoning | 37.25 | — | Imported | 2026-05-06 |
| 33 | SeedCoder 8B | 36.63 | — | Imported | 2026-05-06 |
| 34 | OpenCoder 8B | 35.58 | — | Imported | 2026-05-06 |
| 35 | CodeLlama 70B | 33.68 | — | Imported | 2026-05-06 |
| 36 | QwenCoder 2.5 7B | 33.64 | — | Imported | 2026-05-06 |
| 37 | DeepCoder 14B | 33.07 | — | Imported | 2026-05-06 |
| 38 | CodeV-CL-7B | 32.70 | — | Imported | 2026-05-06 |
| 39 | DeepSeek Coder 6.7B | 29.80 | — | Imported | 2026-05-06 |
| 40 | RTLCoder Mistral | 28.83 | — | Imported | 2026-05-06 |
| 41 | Hermes-4-14B | 27.96 | — | Imported | 2026-05-06 |
| 42 | CodeV R1 Distill Qwen 7B | 24.85 | — | Imported | 2026-05-06 |
| 43 | DeepSeek R1 Distill Qwen 14B | 24.57 | — | Imported | 2026-05-06 |
| 44 | Magistral Small 2506 | 22.62 | — | Imported | 2026-05-06 |