Capture-the-Flags Challenge Tasks (Internal)
OpenAI internal expansion of hard cybersecurity capture-the-flag challenge tasks used in system cards.
Rows: 2
Primary metric: score
Sampled: 2026-04-23
Score
| Rank | Subject | Score | Model Match | Provenance | Sampled |
|---|---|---|---|---|---|
| 1 | GPT-5.5 | 88.1% | GPT-5.5 (`openai-gpt-5.5`) | Launch post | 2026-04-23 |
| 2 | GPT-5.4 | 83.7% | GPT-5.4 (`openai-gpt-5.4`) | Launch post | 2026-04-23 |