FinEval

Chinese financial-domain benchmark covering financial academic knowledge, industry knowledge, security, financial agents, multimodal finance tasks, and rigor testing.

49rows
weighted_averageprimary metric
2026-05-27sampled

Metadata

Metrics

Weighted Average, Multimodal Weighted Average, FinEval 6.0 Total Score, Financial Academic, Financial Industry, Financial Security, Financial Agent

Latest Results

Rows are parsed from the public FinEval README overall text, multimodal, and FinEval 6.0 tables. Display names include the source split.

Rank Subject Weighted Average Model Match Provenance Sampled
1 Ant Group Finix-CI-72B (fineval 6 0) 86.07 Imported 2026-05-27
2 DeepSeek DeepSeek-RI (fineval 6 0) 85.25 Imported 2026-05-27
3 Alibaba Qwen3-32B (think) (fineval 6 0) 84.53 Imported 2026-05-27
4 Alibaba Qwen3-32B (fineval 6 0) 80.93 Imported 2026-05-27
5 Zhipu AI GLM-ZI-32B-0414 (fineval 6 0) 80.13 Imported 2026-05-27
6 Alibaba Qwen3-30B-A3B (fineval 6 0) 79.91 Imported 2026-05-27
7 Shanghai AI Lab Internlm3-8b-Instruct (fineval 6 0) 78.48 Imported 2026-05-27
8 OpenAI GPT-4o (fineval 6 0) 77.65 GPT-4o
openai-gpt-4o
Imported 2026-05-27
9 Meta AI Llama-3.3-70B (fineval 6 0) 77.25 Imported 2026-05-27
10 Qwen-VL-max (multimodal) 76.3 Qwen VL Max
qwen-qwen-vl-max
Imported 2026-05-27
11 Qwen-VL-max-latest (multimodal) 73.8 Imported 2026-05-27
12 Claude 3.5-Sonnet (text weighted) 72.9 Claude 3.5 Sonnet
anthropic-claude-3.5-sonnet
Imported 2026-05-27
13 InternVL3-78B (multimodal) 72.5 Imported 2026-05-27
14 GLM-4v-Plus-20250111 (multimodal) 72 Imported 2026-05-27
15 GPT-4o (text weighted) 71.9 GPT-4o
openai-gpt-4o
Imported 2026-05-27
16 Doubao-1.5-vision-pro-32k (multimodal) 71.7 Imported 2026-05-27
17 InternVL2.5-78B (multimodal) 71.5 Imported 2026-05-27
18 Qwen2.5-VL-72B (multimodal) 71 Qwen2.5 VL 72B Instruct
qwen-qwen2.5-vl-72b-instruct
Imported 2026-05-27
19 Qwen2.5-72B-Instruct (text weighted) 69.4 Qwen2.5 72B Instruct
qwen-qwen-2.5-72b-instruct
Imported 2026-05-27
20 Gemini1.5-Pro (text weighted) 69.2 Imported 2026-05-27
21 GPT-4o-2024-11-20 (multimodal) 68.5 GPT-4o
openai-gpt-4o
Imported 2026-05-27
22 Step-1o-vision-32k (multimodal) 68.4 Imported 2026-05-27
23 Moonshot-V1-32k-vision-preview (multimodal) 68.3 Imported 2026-05-27
24 GPT-4o-mini (text weighted) 66.2 GPT-4o-mini
openai-gpt-4o-mini
Imported 2026-05-27
25 Gemini1.5-Flash (text weighted) 65.6 Imported 2026-05-27
26 Qwen2.5-VL-7B (multimodal) 65.4 Imported 2026-05-27
27 InternVL3-8B (multimodal) 65.4 Imported 2026-05-27
28 Gemini-2.5-pro-exp-03-25 (multimodal) 64.7 Imported 2026-05-27
29 Claude-3-7-Sonnet-20250219 (multimodal) 62.9 Claude 3.7 Sonnet
anthropic-claude-3.7-sonnet
Imported 2026-05-27
30 Qwen2.5-VL-3B (multimodal) 62.4 Imported 2026-05-27
31 Qwen2.5-7B-Instruct (text weighted) 62.3 Qwen2.5 7B Instruct
qwen-qwen-2.5-7b-instruct
Imported 2026-05-27
32 Yi1.5-34B-Chat (text weighted) 61.5 Imported 2026-05-27
33 MiniCPM-V-2.6 (multimodal) 60.1 Imported 2026-05-27
34 XuanYuan3-70B-Chat (text weighted) 59.1 Imported 2026-05-27
35 InternLM2.5-20B-Chat (text weighted) 58.9 Imported 2026-05-27
36 GLM-4-9B-Chat (text weighted) 58.4 Imported 2026-05-27
37 InternLM2-20B-Chat (text weighted) 58 Imported 2026-05-27
38 Yi1.5-9B-Chat (text weighted) 56.9 Imported 2026-05-27
39 LLaVA-NEXT-34B (multimodal) 56 Imported 2026-05-27
40 XuanYuan2-70B-Chat (text weighted) 55.4 Imported 2026-05-27
41 CFGPT2-7B (text weighted) 55.3 Imported 2026-05-27
42 Llama-3.2-11B-Vision-Instruct (multimodal) 50.9 Llama 3.2 11B Vision Instruct
meta-llama-llama-3.2-11b-vision-instruct
Imported 2026-05-27
43 Molmo-7B-D-0924 (multimodal) 49.8 Imported 2026-05-27
44 Baichuan2-13B-Chat (text weighted) 47.8 Imported 2026-05-27
45 LLaVA-v1.6-Mistral-7B (multimodal) 47.8 Imported 2026-05-27
46 ChatGLM3-6B (text weighted) 43.2 Imported 2026-05-27
47 LLaVA-NEXT-13B (multimodal) 43 Imported 2026-05-27
48 DISC-FinLLM (text weighted) 37.8 Imported 2026-05-27
49 FinGPTv3.1 (text weighted) 27.1 Imported 2026-05-27