FinEval
Chinese financial-domain benchmark covering financial academic knowledge, industry knowledge, security, financial agents, multimodal finance tasks, and rigor testing.
49 rows
Primary metric: weighted_average
Sampled: 2026-05-27
Metadata
Metrics
Weighted Average, Multimodal Weighted Average, FinEval 6.0 Total Score, Financial Academic, Financial Industry, Financial Security, Financial Agent
| Rank | Subject | Weighted Average | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | Ant Group Finix-C1-72B (FinEval 6.0) | 86.07 | — | Imported | 2026-05-27 |
| 2 | DeepSeek DeepSeek-R1 (FinEval 6.0) | 85.25 | — | Imported | 2026-05-27 |
| 3 | Alibaba Qwen3-32B (think) (FinEval 6.0) | 84.53 | — | Imported | 2026-05-27 |
| 4 | Alibaba Qwen3-32B (FinEval 6.0) | 80.93 | — | Imported | 2026-05-27 |
| 5 | Zhipu AI GLM-Z1-32B-0414 (FinEval 6.0) | 80.13 | — | Imported | 2026-05-27 |
| 6 | Alibaba Qwen3-30B-A3B (FinEval 6.0) | 79.91 | — | Imported | 2026-05-27 |
| 7 | Shanghai AI Lab Internlm3-8b-Instruct (FinEval 6.0) | 78.48 | — | Imported | 2026-05-27 |
| 8 | OpenAI GPT-4o (FinEval 6.0) | 77.65 | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 9 | Meta AI Llama-3.3-70B (FinEval 6.0) | 77.25 | — | Imported | 2026-05-27 |
| 10 | Qwen-VL-max (multimodal) | 76.3 | Qwen VL Max qwen-qwen-vl-max | Imported | 2026-05-27 |
| 11 | Qwen-VL-max-latest (multimodal) | 73.8 | — | Imported | 2026-05-27 |
| 12 | Claude 3.5-Sonnet (text weighted) | 72.9 | Claude 3.5 Sonnet anthropic-claude-3.5-sonnet | Imported | 2026-05-27 |
| 13 | InternVL3-78B (multimodal) | 72.5 | — | Imported | 2026-05-27 |
| 14 | GLM-4v-Plus-20250111 (multimodal) | 72 | — | Imported | 2026-05-27 |
| 15 | GPT-4o (text weighted) | 71.9 | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 16 | Doubao-1.5-vision-pro-32k (multimodal) | 71.7 | — | Imported | 2026-05-27 |
| 17 | InternVL2.5-78B (multimodal) | 71.5 | — | Imported | 2026-05-27 |
| 18 | Qwen2.5-VL-72B (multimodal) | 71 | Qwen2.5 VL 72B Instruct qwen-qwen2.5-vl-72b-instruct | Imported | 2026-05-27 |
| 19 | Qwen2.5-72B-Instruct (text weighted) | 69.4 | Qwen2.5 72B Instruct qwen-qwen-2.5-72b-instruct | Imported | 2026-05-27 |
| 20 | Gemini1.5-Pro (text weighted) | 69.2 | — | Imported | 2026-05-27 |
| 21 | GPT-4o-2024-11-20 (multimodal) | 68.5 | GPT-4o openai-gpt-4o | Imported | 2026-05-27 |
| 22 | Step-1o-vision-32k (multimodal) | 68.4 | — | Imported | 2026-05-27 |
| 23 | Moonshot-V1-32k-vision-preview (multimodal) | 68.3 | — | Imported | 2026-05-27 |
| 24 | GPT-4o-mini (text weighted) | 66.2 | GPT-4o-mini openai-gpt-4o-mini | Imported | 2026-05-27 |
| 25 | Gemini1.5-Flash (text weighted) | 65.6 | — | Imported | 2026-05-27 |
| 26 | Qwen2.5-VL-7B (multimodal) | 65.4 | — | Imported | 2026-05-27 |
| 27 | InternVL3-8B (multimodal) | 65.4 | — | Imported | 2026-05-27 |
| 28 | Gemini-2.5-pro-exp-03-25 (multimodal) | 64.7 | — | Imported | 2026-05-27 |
| 29 | Claude-3-7-Sonnet-20250219 (multimodal) | 62.9 | Claude 3.7 Sonnet anthropic-claude-3.7-sonnet | Imported | 2026-05-27 |
| 30 | Qwen2.5-VL-3B (multimodal) | 62.4 | — | Imported | 2026-05-27 |
| 31 | Qwen2.5-7B-Instruct (text weighted) | 62.3 | Qwen2.5 7B Instruct qwen-qwen-2.5-7b-instruct | Imported | 2026-05-27 |
| 32 | Yi1.5-34B-Chat (text weighted) | 61.5 | — | Imported | 2026-05-27 |
| 33 | MiniCPM-V-2.6 (multimodal) | 60.1 | — | Imported | 2026-05-27 |
| 34 | XuanYuan3-70B-Chat (text weighted) | 59.1 | — | Imported | 2026-05-27 |
| 35 | InternLM2.5-20B-Chat (text weighted) | 58.9 | — | Imported | 2026-05-27 |
| 36 | GLM-4-9B-Chat (text weighted) | 58.4 | — | Imported | 2026-05-27 |
| 37 | InternLM2-20B-Chat (text weighted) | 58 | — | Imported | 2026-05-27 |
| 38 | Yi1.5-9B-Chat (text weighted) | 56.9 | — | Imported | 2026-05-27 |
| 39 | LLaVA-NEXT-34B (multimodal) | 56 | — | Imported | 2026-05-27 |
| 40 | XuanYuan2-70B-Chat (text weighted) | 55.4 | — | Imported | 2026-05-27 |
| 41 | CFGPT2-7B (text weighted) | 55.3 | — | Imported | 2026-05-27 |
| 42 | Llama-3.2-11B-Vision-Instruct (multimodal) | 50.9 | Llama 3.2 11B Vision Instruct meta-llama-llama-3.2-11b-vision-instruct | Imported | 2026-05-27 |
| 43 | Molmo-7B-D-0924 (multimodal) | 49.8 | — | Imported | 2026-05-27 |
| 44 | Baichuan2-13B-Chat (text weighted) | 47.8 | — | Imported | 2026-05-27 |
| 45 | LLaVA-v1.6-Mistral-7B (multimodal) | 47.8 | — | Imported | 2026-05-27 |
| 46 | ChatGLM3-6B (text weighted) | 43.2 | — | Imported | 2026-05-27 |
| 47 | LLaVA-NEXT-13B (multimodal) | 43 | — | Imported | 2026-05-27 |
| 48 | DISC-FinLLM (text weighted) | 37.8 | — | Imported | 2026-05-27 |
| 49 | FinGPTv3.1 (text weighted) | 27.1 | — | Imported | 2026-05-27 |
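
Each Subject in the table ends with a parenthesized track label (for example `multimodal` or `text weighted`), and the Metrics list names a separate Multimodal Weighted Average alongside the Weighted Average. Assuming those labels mark distinct score series, a single rank across all rows may mix series that are not directly comparable. The following is a minimal sketch of re-ranking within each track; `split_subject` and `rank_within_track` are hypothetical helper names, and only four rows are copied from the table for illustration.

```python
import re
from collections import defaultdict

# Sample rows copied from the table above: (subject, weighted average).
ROWS = [
    ("Qwen-VL-max (multimodal)", 76.3),
    ("Claude 3.5-Sonnet (text weighted)", 72.9),
    ("GPT-4o (text weighted)", 71.9),
    ("GPT-4o-2024-11-20 (multimodal)", 68.5),
]

# Matches "<model> (<track>)", taking the LAST parenthesized group as the track.
_SUFFIX = re.compile(r"^(.*\S)\s*\(([^()]*)\)\s*$")


def split_subject(subject):
    """Split 'Model (track)' into (model, track); track is None if absent."""
    m = _SUFFIX.match(subject)
    if m is None:
        return subject.strip(), None
    return m.group(1), m.group(2).strip()


def rank_within_track(rows):
    """Group rows by track label and sort each group by score, highest first."""
    by_track = defaultdict(list)
    for subject, score in rows:
        model, track = split_subject(subject)
        by_track[track].append((model, score))
    return {
        track: sorted(entries, key=lambda e: e[1], reverse=True)
        for track, entries in by_track.items()
    }


if __name__ == "__main__":
    for track, entries in rank_within_track(ROWS).items():
        print(track)
        for rank, (model, score) in enumerate(entries, start=1):
            print(f"  {rank}. {model}: {score}")
```

The regular expression takes only the last parenthesized group as the track, so a subject such as `Alibaba Qwen3-32B (think)` followed by a track label keeps `(think)` as part of the model name.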