Open FinLLM Leaderboard
Open financial-language-model leaderboard from FINOS, covering broad financial NLP and reasoning task categories.
25rows
average_normalized_scoreprimary metric
2026-05-27sampled
Metadata
Metrics
Average normalized score, MultiFin normalized score, QA normalized score, FNS normalized score, FinNum normalized score, FinText normalized score, MultiFin raw score, QA raw score, FNS raw score, FinNum raw score, FinText raw score
| Rank | Subject | Average normalized score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | TheFinAI/plutus-8B-instruct | 55.895672% | — | Imported | 2026-05-27 |
| 2 | gpt-4 | 48.337138% | GPT-4 openai-gpt-4 | Imported | 2026-05-27 |
| 3 | gpt-4.5-preview | 43.403043% | GPT-4.5 openai-gpt-4.5-preview | Imported | 2026-05-27 |
| 4 | Qwen/Qwen2.5-32B-Instruct | 42.489422% | — | Imported | 2026-05-27 |
| 5 | Qwen/Qwen2.5-72B-Instruct | 41.361242% | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-27 |
| 6 | gpt-4o | 37.507713% | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 7 | meta-llama/Meta-Llama-3-70B-Instruct | 31.228277% | Llama 3 70B Instruct meta-llama-llama-3-70b-instruct | Imported | 2026-05-27 |
| 8 | ilsp/Llama-Krikri-8B-Instruct | 30.792416% | — | Imported | 2026-05-27 |
| 9 | deepseek-chat | 29.494986% | DeepSeek V3 deepseek-deepseek-chat | Imported | 2026-05-27 |
| 10 | ilsp/Meltemi-7B-Instruct-v1.5 | 28.354055% | — | Imported | 2026-05-27 |
| 11 | gpt-4o-mini | 28.32187% | GPT-4o-mini openai-gpt-4o-mini | Imported | 2026-05-27 |
| 12 | gpt-3.5-turbo-0125 | 23.164665% | GPT-3.5 Turbo openai-gpt-3.5-turbo | Imported | 2026-05-27 |
| 13 | meta-llama/Llama-3.1-8B-Instruct | 22.720855% | Llama 3.1 8B Instruct meta-llama-llama-3.1-8b-instruct | Imported | 2026-05-27 |
| 14 | google/gemma-2-27b-it | 19.805341% | Gemma 2 27B google-gemma-2-27b-it | Imported | 2026-05-27 |
| 15 | meta-llama/Meta-Llama-3-8B-Instruct | 18.930312% | Llama 3 8B Instruct meta-llama-llama-3-8b-instruct | Imported | 2026-05-27 |
| 16 | Qwen/Qwen2.5-7B-Instruct | 17.365888% | Qwen2.5 7B Instruct qwen-qwen-2.5-7b-instruct | Imported | 2026-05-27 |
| 17 | mistralai/Mistral-7B-Instruct-v0.3 | 14.680726% | — | Imported | 2026-05-27 |
| 18 | google/gemma-2-9b-it | 14.319532% | — | Imported | 2026-05-27 |
| 19 | google/gemma-2-2b-it | 9.506996% | — | Imported | 2026-05-27 |
| 20 | mistralai/Mistral-7B-Instruct-v0.1 | 9.140334% | Mistral: Mistral 7B Instruct v0.1 mistralai-mistral-7b-instruct-v0.1 | Imported | 2026-05-27 |
| 21 | meta-llama/Llama-3.2-1B-Instruct | 8.949207% | Llama 3.2 1B Instruct meta-llama-llama-3.2-1b-instruct | Imported | 2026-05-27 |
| 22 | Qwen/QwQ-32B | 7.807905% | — | Imported | 2026-05-27 |
| 23 | TheFinAI/finma-7b-full | 6.684336% | — | Imported | 2026-05-27 |
| 24 | Qwen/Qwen2.5-1.5B-Instruct | 6.540124% | — | Imported | 2026-05-27 |
| 25 | TheFinAI/FinLLaMA-instruct | 6.379444% | — | Imported | 2026-05-27 |
No matching rows.