CAIS Text Capabilities Index

Composite CAIS AI Dashboard text index averaging Humanity's Last Exam, ARC-AGI-2, TextQuests, and SWE-bench Pro for models with all component scores.

39rows
scoreprimary metric
2026-05-27sampled

Metadata

Metrics

Text Capabilities Index, Humanity's Last Exam, ARC-AGI-2, TextQuests, SWE-bench Pro

Latest Results

Imported from the public CAIS AI Dashboard bundle. Composite scores mirror the dashboard client: models missing any selected component are excluded; VCT Refusal and MASK are inverted for the Risk Index.

Rank Subject Text Capabilities Index Model Match Provenance Sampled
1 GPT-5.5 54.1 GPT-5.5
openai-gpt-5.5
Imported 2026-05-27
2 Gemini 3.1 Pro 52.9 Gemini 3.1 Pro Preview
google-gemini-3.1-pro-preview
Imported 2026-05-27
3 GPT-5.4 49.3 GPT-5.4
openai-gpt-5.4
Imported 2026-05-27
4 Gemini 3.5 Flash 48.9 Gemini 3.5 Flash
google-gemini-3.5-flash
Imported 2026-05-27
5 Opus 4.7 46.9 Claude Opus 4.7
anthropic-claude-opus-4.7
Imported 2026-05-27
6 Opus 4.6 44.0 Claude Opus 4.6
anthropic-claude-opus-4.6
Imported 2026-05-27
7 Gemini 3 Pro 38.4 Gemini 3
google-gemini-3
Imported 2026-05-27
8 Opus 4.5 36.6 Claude Opus 4.5
anthropic-claude-opus-4.5
Imported 2026-05-27
9 Gemini 3 Flash 35.6 Gemini 3 Flash Preview
google-gemini-3-flash-preview
Imported 2026-05-27
10 GPT-5.2 33.8 GPT-5.2
openai-gpt-5.2
Imported 2026-05-27
11 Sonnet 4.6 32.6 Claude Sonnet 4.6
anthropic-claude-sonnet-4.6
Imported 2026-05-27
12 Grok 4.2 32.5 GROK Grok 4.20
x-ai-grok-4.20
Imported 2026-05-27
13 DeepSeek 4 Pro 32.1 DeepSeek V4 Pro
deepseek-deepseek-v4-pro
Imported 2026-05-27
14 Kimi K2.6 31.4 KIMI MoonshotAI: Kimi K2.6
moonshotai-kimi-k2.6
Imported 2026-05-27
15 GLM 5.1 29.8 GLM GLM 5.1
z-ai-glm-5.1
Imported 2026-05-27
16 GPT-5.1 29.0 GPT-5.1
openai-gpt-5.1
Imported 2026-05-27
17 Kimi K2.5 26.1 KIMI MoonshotAI: Kimi K2.5
moonshotai-kimi-k2.5
Imported 2026-05-27
18 Sonnet 4.5 25.4 Claude Sonnet 4.5
anthropic-claude-sonnet-4.5
Imported 2026-05-27
19 Grok 4.3 24.7 GROK Grok 4.3
x-ai-grok-4.3
Imported 2026-05-27
20 GPT-5.4-mini 24.2 GPT-5.4 Mini
openai-gpt-5.4-mini
Imported 2026-05-27
21 GPT-5 20.9 GPT-5
openai-gpt-5
Imported 2026-05-27
22 Grok 4 20.8 GROK Grok 4
x-ai-grok-4
Imported 2026-05-27
23 o3 20.5 o3
openai-o3
Imported 2026-05-27
24 DeepSeek 3.2 20.3 DeepSeek V3.2
deepseek-deepseek-v3.2
Imported 2026-05-27
25 Kimi K2 18.1 KIMI MoonshotAI: Kimi K2 Thinking
moonshotai-kimi-k2-thinking
Imported 2026-05-27
26 Sonnet 4 18.1 Claude Sonnet 4
anthropic-claude-sonnet-4
Imported 2026-05-27
27 GPT-5.4-Nano 17.9 GPT-5.4 Nano
openai-gpt-5.4-nano
Imported 2026-05-27
28 Haiku 4.5 17.4 Claude Haiku 4.5
anthropic-claude-haiku-4.5
Imported 2026-05-27
29 Gemini 2.5 Pro 16.7 Gemini 2.5 Pro
google-gemini-2.5-pro
Imported 2026-05-27
30 Gemini 3.1 Flash-Lite 15.8 Gemini 3.1 Flash Lite Preview
google-gemini-3.1-flash-lite-preview
Imported 2026-05-27
31 GPT-5-mini 14.3 GPT-5 Mini
openai-gpt-5-mini
Imported 2026-05-27
32 Grok 4 Fast 13.3 GROK Grok 4 Fast
x-ai-grok-4-fast
Imported 2026-05-27
33 Grok 4.1 Fast 13.1 GROK Grok 4.1 Fast
x-ai-grok-4.1-fast
Imported 2026-05-27
34 Gemini 2.5 Flash 9.0 Gemini 2.5 Flash
google-gemini-2.5-flash
Imported 2026-05-27
35 DeepSeek R1 8.6 R1
deepseek-r1
Imported 2026-05-27
36 o3-mini 7.8 o3 Mini High
openai-o3-mini-high
Imported 2026-05-27
37 GPT-4o 5.2 GPT-4o (2024-11-20)
openai-gpt-4o-2024-11-20
Imported 2026-05-27
38 Gemini 2.5 Flash-Lite 5.1 Gemini 2.5 Flash Lite
google-gemini-2.5-flash-lite
Imported 2026-05-27
39 GPT-5-Nano 4.8 GPT-5 Nano
openai-gpt-5-nano
Imported 2026-05-27