VNTL Leaderboard

Leaderboard for Japanese visual-novel translation into English, ranking LLMs and translation systems by semantic similarity accuracy over 256 translation samples, with chrF reported as an auxiliary metric.

87 rows
Primary metric: accuracy
Sampled: 2026-05-06

Metadata

Metrics

Accuracy, Accuracy 95% CI (lower is better), chrF Mean

Latest Results

Rows are parsed from the public VNTL leaderboard JSONL. Source accuracy and confidence intervals are converted from fractions to percentages.
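The fraction-to-percentage conversion described above can be sketched as follows. This is a minimal illustration, not the site's actual importer: the field names `model`, `accuracy`, `accuracy_ci`, and `chrf_mean` are assumptions about the JSONL schema, and the sample values are back-derived from the table for illustration only.

```python
import json


def to_pct(fraction):
    """Convert a fraction such as 0.7516 to a percentage rounded to 2 places."""
    return None if fraction is None else round(fraction * 100, 2)


def load_rows(jsonl_text):
    """Parse JSONL text (one JSON object per line) into leaderboard rows.

    Field names are hypothetical; adjust them to the real schema.
    """
    rows = []
    for line in jsonl_text.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        rows.append({
            "subject": record["model"],                    # assumed key
            "accuracy_pct": to_pct(record["accuracy"]),    # assumed key
            "ci_pct": to_pct(record.get("accuracy_ci")),   # assumed key, optional
            "chrf_mean": record.get("chrf_mean"),          # assumed key, optional
        })
    # Highest accuracy first, matching the ordering of the table.
    rows.sort(key=lambda r: r["accuracy_pct"], reverse=True)
    return rows


if __name__ == "__main__":
    sample = (
        '{"model": "01-ai/yi-1.5-9b-chat", "accuracy": 0.4759}\n'
        '{"model": "openai/gpt-4o-2024-05-13", "accuracy": 0.7516}\n'
    )
    for row in load_rows(sample):
        print(row["subject"], row["accuracy_pct"])
```

Rounding to two decimal places mirrors the precision shown in the table; the tie-aware rank column is not reproduced here because the source does not state how ranks are assigned.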

Rank | Subject | Accuracy (%) | Model Match | Provenance | Sampled
1 | openai/gpt-4o-2024-05-13 | 75.16 | GPT-4o (openai-gpt-4o) | Imported | 2026-05-06
1 | openai/gpt-4o-2024-08-06 | 74.97 | GPT-4o (openai-gpt-4o) | Imported | 2026-05-06
1 | anthropic/claude-3-opus | 74.59 | | Imported | 2026-05-06
1 | anthropic/claude-3.5-sonnet-20240620 | 74.40 | | Imported | 2026-05-06
1 | deepseek-ai/deepseek-v3-chat | 74.24 | | Imported | 2026-05-06
6 | anthropic/claude-3.5-sonnet-20241022 | 72.80 | Claude 3.5 Sonnet (anthropic-claude-3.5-sonnet) | Imported | 2026-05-06
6 | nvidia/nemotron-4-340b-instruct | 72.79 | | Imported | 2026-05-06
6 | openai/gpt-4o-mini-2024-07-18 | 72.23 | GPT-4o-mini (openai-gpt-4o-mini) | Imported | 2026-05-06
6 | x-ai/grok-2-1212 | 71.60 | | Imported | 2026-05-06
6 | x-ai/grok-beta | 71.27 | | Imported | 2026-05-06
6 | deepseek-ai/deepseek-v2.5 | 71.14 | | Imported | 2026-05-06
12 | qwen/qwen-2.5-72b-instruct | 70.79 | Qwen2.5 72B Instruct (qwen-qwen-2.5-72b-instruct) | Imported | 2026-05-06
12 | lmg-anon/vntl-gemma2-27b | 70.67 | | Imported | 2026-05-06
12 | qwen/qwen-2.5-32b-instruct | 70.66 | | Imported | 2026-05-06
12 | qwen/qwen-2-72b-instruct | 70.20 | | Imported | 2026-05-06
12 | openai/gpt-3.5-turbo-1106 | 69.98 | GPT-3.5 Turbo (openai-gpt-3.5-turbo) | Imported | 2026-05-06
12 | meta-llama/llama-3.1-70b-instruct | 69.79 | Llama 3.1 70B Instruct (meta-llama-llama-3.1-70b-instruct) | Imported | 2026-05-06
12 | lmg-anon/vntl-llama3-8b-v2 | 69.52 | | Imported | 2026-05-06
12 | meta-llama/llama-3.1-405b-instruct | 69.46 | | Imported | 2026-05-06
12 | openai/gpt-4-0613 | 69.28 | GPT-4 (openai-gpt-4) | Imported | 2026-05-06
12 | lmg-anon/vntl-llama3-8b | 69.22 | | Imported | 2026-05-06
22 | nvidia/llama-3.1-nemotron-70b-instruct | 69.04 | Llama 3.1 Nemotron 70B Instruct (nvidia-llama-3.1-nemotron-70b-instruct) | Imported | 2026-05-06
22 | anthropic/claude-3.5-haiku-20241022 | 68.94 | | Imported | 2026-05-06
22 | qwen/qwen-2-72b-instruct | 68.87 | | Imported | 2026-05-06
22 | meta-llama/llama-3.3-70b-instruct | 68.81 | Llama 3.3 70B Instruct (meta-llama-llama-3.3-70b-instruct) | Imported | 2026-05-06
22 | qwen/qwq-preview | 68.65 | | Imported | 2026-05-06
22 | microsoft/phi-4 (unofficial) | 68.60 | Phi 4 (microsoft-phi-4) | Imported | 2026-05-06
22 | cohere/command-r-plus-08-2024 | 68.53 | Command R (08-2024) (cohere-command-r-08-2024) | Imported | 2026-05-06
22 | mistralai/mixtral-8x22b-instruct | 68.46 | Mistral: Mixtral 8x22B Instruct (mistralai-mixtral-8x22b-instruct) | Imported | 2026-05-06
22 | mistralai/mistral-large | 67.94 | Mistral Large (mistralai-mistral-large) | Imported | 2026-05-06
22 | google/gemma-2-27b-it | 67.93 | Gemma 2 27B (google-gemma-2-27b-it) | Imported | 2026-05-06
22 | anthropic/claude-3-sonnet | 67.72 | | Imported | 2026-05-06
22 | cohere/aya-23-35B | 67.71 | | Imported | 2026-05-06
22 | rinna/llama-3-youko-70b | 67.65 | | Imported | 2026-05-06
22 | webbigdata/C3TR-Adapter | 67.56 | | Imported | 2026-05-06
22 | mistralai/Mistral-Nemo-Instruct-2407 | 67.38 | | Imported | 2026-05-06
22 | cohere/command-r-plus | 67.19 | | Imported | 2026-05-06
22 | anthropic/claude-3-haiku | 67.19 | Claude 3 Haiku (anthropic-claude-3-haiku) | Imported | 2026-05-06
39 | meta-llama/llama-3-70b-instruct | 66.91 | Llama 3 70B Instruct (meta-llama-llama-3-70b-instruct) | Imported | 2026-05-06
39 | google/gemma-2-27b | 66.74 | | Imported | 2026-05-06
39 | qwen/qwen-2.5-14b-instruct | 66.48 | | Imported | 2026-05-06
39 | google/gemini-flash-1.5 | 66.20 | | Imported | 2026-05-06
39 | cyberagent/Llama-3.1-70B-Japanese-Instruct-2407 | 66.10 | | Imported | 2026-05-06
39 | meta-llama/llama-3-70b-instruct | 65.94 | Llama 3 70B Instruct (meta-llama-llama-3-70b-instruct) | Imported | 2026-05-06
39 | google/gemini-flash-1.5-8b | 65.93 | | Imported | 2026-05-06
39 | qwen/qwen-2.5-14b | 65.92 | | Imported | 2026-05-06
39 | google/gemini-pro | 65.89 | | Imported | 2026-05-06
39 | lmg-anon/vntl-gemma2-2b | 65.72 | | Imported | 2026-05-06
39 | cohere/aya-expanse-32b | 65.50 | | Imported | 2026-05-06
39 | rinna/nekomata-14b | 65.39 | | Imported | 2026-05-06
39 | cohere/command-r-08-2024 | 65.20 | Command R (08-2024) (cohere-command-r-08-2024) | Imported | 2026-05-06
39 | qwen/qwen-2.5-7b-instruct | 65.18 | Qwen2.5 7B Instruct (qwen-qwen-2.5-7b-instruct) | Imported | 2026-05-06
39 | lmg-anon/vntl-13b-v0.2 | 65.02 | | Imported | 2026-05-06
54 | cyberagent/calm3-22b-chat | 64.80 | | Imported | 2026-05-06
54 | google/gemma-2-9b-it-SPPO-Iter3 | 64.47 | | Imported | 2026-05-06
54 | mistralai/mistral-small | 64.41 | | Imported | 2026-05-06
54 | google/gemini-pro-1.5 | 64.36 | | Imported | 2026-05-06
54 | BeaverAI/Cydonia-22B-v2p-GGUF | 64.10 | | Imported | 2026-05-06
54 | rinna/llama-3-youko-8b | 63.95 | | Imported | 2026-05-06
54 | rinna/llama-3-youko-70b-instruct | 63.55 | | Imported | 2026-05-06
54 | meta-llama/llama-3-70b-instruct | 63.30 | Llama 3 70B Instruct (meta-llama-llama-3-70b-instruct) | Imported | 2026-05-06
54 | mistralai/Ministral-8B-Instruct-2410 | 63.25 | | Imported | 2026-05-06
54 | lmg-anon/vntl-7b-v0.3.1 | 63.04 | | Imported | 2026-05-06
64 | rakuten/rakutenai-7b-instruct | 62.71 | | Imported | 2026-05-06
64 | mistralai/mixtral-8x7b-instruct | 62.08 | Mistral: Mixtral 8x7B Instruct (mistralai-mixtral-8x7b-instruct) | Imported | 2026-05-06
64 | google/gemma-2-9b-it | 61.94 | | Imported | 2026-05-06
64 | cohere/aya-expanse-8b | 61.91 | | Imported | 2026-05-06
64 | microsoft/phi-3-medium-4k-instruct | 61.21 | | Imported | 2026-05-06
64 | qwen/qwen-2-7b-instruct | 61.13 | | Imported | 2026-05-06
64 | cohere/command-r | 61.03 | | Imported | 2026-05-06
64 | rinna/gemma-2-baku-2b | 60.77 | | Imported | 2026-05-06
72 | meta-llama/llama-3-8b-instruct | 60.19 | Llama 3 8B Instruct (meta-llama-llama-3-8b-instruct) | Imported | 2026-05-06
72 | rinna/nekomata-14b-instruction | 60.07 | | Imported | 2026-05-06
72 | openchat/openchat-8b | 59.86 | | Imported | 2026-05-06
72 | cohere/aya-23-8b | 59.62 | | Imported | 2026-05-06
72 | qwen/qwen-2.5-7b | 59.62 | | Imported | 2026-05-06
72 | mistralai/Mistral-Nemo-Base-2407 | 58.77 | | Imported | 2026-05-06
78 | LLaMAX/LLaMAX3-8B | 57.38 | | Imported | 2026-05-06
78 | elyza/Llama-3-ELYZA-JP-8B | 57.15 | | Imported | 2026-05-06
78 | mistralai/mistral-7b-instruct-v0.3 | 56.03 | | Imported | 2026-05-06
78 | 01-ai/yi-1.5-34b-chat | 55.94 | | Imported | 2026-05-06
82 | LLaMAX/LLaMAX3-8B-Alpaca | 55.16 | | Imported | 2026-05-06
82 | meta-llama/llama-3-8b-instruct | 55.03 | Llama 3 8B Instruct (meta-llama-llama-3-8b-instruct) | Imported | 2026-05-06
82 | nitky/Oumuamua-7b-instruct-v2 | 54.88 | | Imported | 2026-05-06
82 | lightblue/qarasu-14b-chat-plus-unleashed | 53.09 | | Imported | 2026-05-06
86 | meta-llama/llama-2-13b-chat | 50.24 | | Imported | 2026-05-06
87 | 01-ai/yi-1.5-9b-chat | 47.59 | | Imported | 2026-05-06