GPT‑5.4
Latest
OpenAI • Proprietary
Released: Mar 5, 2026 (#16)
Knowledge Cutoff: Aug 25 (#3)
Context Length: 1M (#3)
Benchmarks
Code RankedAGI: 82.9% (#7)
SWEBench Pro: 57.7% (#5)
Terminal Bench 2.0: 75.1% (#4)
Agentic RankedAGI: 77.8% (#8)
Browse Comp: 82.7% (#9)
Code Arena: 1457 (#11)
Code Livebench: 77.5% (#10)
AgenticCode LiveBench: 70.0% (#1)
OSWorld Verified: 75.0% (#4)
Svelte Bench: 95.6% (#5)
Cyber Gym: 79.0% (#3)
Codeforces ELO: 3168 (#3)
Reason RankedAGI: 77.4% (#7)
HLE: 39.8% (#11)
HLE w/ Tools: 52.1% (#11)
GPQA Diamond: 92.8% (#7)
Text Arena: 1482 (#4)
AIME 2026: 95.2% (#5)
Vending Bench 2: $6,144.18 (#6)
NYT Connections: 94.0% (#2)
MMLU Pro: 87.5% (#5)
MMMU Pro: 81.2% (#2)
MMMU Pro w/ Tools: 82.1% (#2)
Math RankedAGI: 76.5% (#10)
RAGI RankedAGI: 70.7% (#6)
ARC AGI 2.0: 73.3% (#7)
LiveCodeBench Pro: 87.5% (#1)
𝜏²-Bench Telecom: 92.8% (#12)
HealthBench Hard: 40.1% (#3)
MedXpertQA Text: 59.6% (#2)
MedXpertQA MM: 77.1% (#3)
DeepSearch QA: 73.6% (#6)
GDPval AA: 1674 (#3)
ZeroBench: 41.0% (#1)
MCP Atlas: 70.6% (#5)
Finance Agent: 56.0% (#7)
CharXiv Reasoning: 82.8% (#3)
Code DesignArena: 1282 (#16)
Toolathlon Pass@1: 54.6% (#2)
𝜏³ Bench: 72.9% (#1)
Pricing
Input Cost /M: $2.5 (#37)
Output Cost /M: $15 (#44)
Cached Cost /M: $0.25 (#20)
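
As a rough illustration of how the three Pricing figures combine, here is a minimal sketch of a per-request cost estimate. It assumes that "/M" means US dollars per one million tokens and that the cached rate replaces the input rate for the cached part of the prompt; neither assumption is stated on this page. The function name `estimate_cost` and its parameters are hypothetical, not part of any API.

```python
# Rates copied from the Pricing figures above. Reading "/M" as
# "USD per one million tokens" is an assumption, not stated in the source.
INPUT_PER_M = 2.50
OUTPUT_PER_M = 15.00
CACHED_PER_M = 0.25
TOKENS_PER_M = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the cost in USD of one request (hypothetical helper).

    cached_tokens is the part of input_tokens assumed to be billed at the
    cached rate instead of the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PER_M
        + cached_tokens * CACHED_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # 200k-token prompt, half of it cached, with a 5k-token reply.
    print(f"${estimate_cost(200_000, 5_000, cached_tokens=100_000):.4f}")
```

Under these assumptions, output tokens dominate the bill for reply-heavy workloads, while a high cache hit rate mainly reduces the cost of long, repeated prompts.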