Claude 3.7 Sonnet Thinking
LatestAnthropic
•
Proprietary# 28
Released
Feb 24, 2025
# 5
Knowledge Cutoff
Oct 24
# 6
Context Length
200K
Benchmarks
# 24
Code RankedAGI
60.4%
# 10
Coding LiveBench
73.2%
# 7
Aider Polyglot
64.9%
# 14
Code LMArena
1333
# 5
Code LiveBench (old)
71.5%
# 15
GPQA Diamond
78.2%
# 13
Reason LiveBench
76.2%
# 5
Reason LiveBench (old)
84.6%
# 20
ELO LMArena
1363
# 35
AIME 2025 I & II
49.5%
# 12
Math LiveBench
79.0%
# 4
Math LiveBench (old)
77.5%
# 4
MATH 500
96.2%
# 13
Humanity Last Exam
8.9%
# 8
NYT Connections
33.6%
# 9
MMMU
75.0%
# 16
AIME 2024
80.0%
# 6
IF LiveBench
81.3%
# 17
IF LiveBench (old)
78.6%
# 11
Avg LiveBench
66.9%
# 3
Avg LiveBench (old)
74.3%
# 1
IF Evaluation
93.2%
Pricing
# 27
Input Cost /M
$3
# 33
Output Cost /M
$15