Open FinLLM Leaderboard

Open financial-language-model leaderboard from FINOS, covering broad financial NLP and reasoning task categories.

25rows
average_normalized_scoreprimary metric
2026-05-27sampled

Metadata

Metrics

Average normalized score, MultiFin normalized score, QA normalized score, FNS normalized score, FinNum normalized score, FinText normalized score, MultiFin raw score, QA raw score, FNS raw score, FinNum raw score, FinText raw score

Latest Results

Rows are parsed from the public TheFinAI Open-FinLLM Hugging Face Space API. The FINOS Space mirror returned an empty leaderboard during this audit; the TheFinAI Space exposes the populated formatted leaderboard.

Rank Subject Average normalized score Model Match Provenance Sampled
1 TheFinAI/plutus-8B-instruct 55.895672% Imported 2026-05-27
2 gpt-4 48.337138% GPT-4
openai-gpt-4
Imported 2026-05-27
3 gpt-4.5-preview 43.403043% GPT-4.5
openai-gpt-4.5-preview
Imported 2026-05-27
4 Qwen/Qwen2.5-32B-Instruct 42.489422% Imported 2026-05-27
5 Qwen/Qwen2.5-72B-Instruct 41.361242% Qwen2.5 72B Instruct
qwen-qwen-2.5-72b-instruct
Imported 2026-05-27
6 gpt-4o 37.507713% GPT-4o
openai-gpt-4o
Imported 2026-05-27
7 meta-llama/Meta-Llama-3-70B-Instruct 31.228277% Llama 3 70B Instruct
meta-llama-llama-3-70b-instruct
Imported 2026-05-27
8 ilsp/Llama-Krikri-8B-Instruct 30.792416% Imported 2026-05-27
9 deepseek-chat 29.494986% DeepSeek V3
deepseek-deepseek-chat
Imported 2026-05-27
10 ilsp/Meltemi-7B-Instruct-v1.5 28.354055% Imported 2026-05-27
11 gpt-4o-mini 28.32187% GPT-4o-mini
openai-gpt-4o-mini
Imported 2026-05-27
12 gpt-3.5-turbo-0125 23.164665% GPT-3.5 Turbo
openai-gpt-3.5-turbo
Imported 2026-05-27
13 meta-llama/Llama-3.1-8B-Instruct 22.720855% Llama 3.1 8B Instruct
meta-llama-llama-3.1-8b-instruct
Imported 2026-05-27
14 google/gemma-2-27b-it 19.805341% Gemma 2 27B
google-gemma-2-27b-it
Imported 2026-05-27
15 meta-llama/Meta-Llama-3-8B-Instruct 18.930312% Llama 3 8B Instruct
meta-llama-llama-3-8b-instruct
Imported 2026-05-27
16 Qwen/Qwen2.5-7B-Instruct 17.365888% Qwen2.5 7B Instruct
qwen-qwen-2.5-7b-instruct
Imported 2026-05-27
17 mistralai/Mistral-7B-Instruct-v0.3 14.680726% Imported 2026-05-27
18 google/gemma-2-9b-it 14.319532% Imported 2026-05-27
19 google/gemma-2-2b-it 9.506996% Imported 2026-05-27
20 mistralai/Mistral-7B-Instruct-v0.1 9.140334% Mistral: Mistral 7B Instruct v0.1
mistralai-mistral-7b-instruct-v0.1
Imported 2026-05-27
21 meta-llama/Llama-3.2-1B-Instruct 8.949207% Llama 3.2 1B Instruct
meta-llama-llama-3.2-1b-instruct
Imported 2026-05-27
22 Qwen/QwQ-32B 7.807905% Imported 2026-05-27
23 TheFinAI/finma-7b-full 6.684336% Imported 2026-05-27
24 Qwen/Qwen2.5-1.5B-Instruct 6.540124% Imported 2026-05-27
25 TheFinAI/FinLLaMA-instruct 6.379444% Imported 2026-05-27