o3
Latest • OpenAI • Proprietary

| Field | Value | Rank |
|---|---|---|
| Released | Apr 16, 2025 | #39 |
| Knowledge Cutoff | Jun 2024 | #12 |
| Context Length | 200K | #9 |
Benchmarks

| Benchmark | Score | Rank |
|---|---|---|
| Code RankedAGI | 70.8% | #20 |
| SWEBench Verified | 69.1% | #20 |
| Code Arena | 1187.08 | #25 |
| Svelte Bench | 30.0% | #27 |
| LiveCodeBench v6 | 75.5% | #9 |
| Coding LiveBench 25.5 | 77.9% | #3 |
| Code LMArena | 1460 | #2 |
| Codeforces ELO | 2706 | #2 |
| Reason RankedAGI | 64.7% | #20 |
| HLE | 20.3% | #20 |
| GPQA Diamond | 87.7% | #9 |
| Reason LiveBench 25.5 | 91.0% | #6 |
| Text Arena | 1443 | #5 |
| AIME 2025 I & II | 88.9% | #19 |
| AIME 2024 | 91.6% | #8 |
| Math LiveBench 25.5 | 80.7% | #11 |
| Halluc. Hughes | 6.8% | #37 |
| IF LiveBench 25.5 | 84.3% | #3 |
| Avg LiveBench 25.5 | 72.4% | #7 |
| Coding LiveBench 25.4 | 77.9% | #3 |
| Reasoning LiveBench 25.4 | 91.0% | #3 |
| Agentic LiveBench 25.5 | 28.3% | #5 |
| Math RankedAGI | 63.5% | #37 |
| RAGI Overall | 58.3% | #24 |

Pricing

| Price | Cost | Rank |
|---|---|---|
| Input Cost /M | $2 | #30 |
| Output Cost /M | $8 | #34 |
| Cached Cost /M | $0.5 | #18 |