Claude 3.7 Sonnet Thinking
LatestAnthropic
•
Proprietary# 38
Released
Feb 24, 2025
# 6
Knowledge Cutoff
Oct 24
# 8
Context Length
200K
Benchmarks
# 34
Code RankedAGI
60.4%
# 10
Coding LiveBench
73.2%
# 11
Aider Polyglot
64.9%
# 14
Code LMArena
1333
# 5
Code LiveBench (old)
71.5%
# 21
GPQA Diamond
78.2%
# 13
Reason LiveBench
76.2%
# 5
Reason LiveBench (old)
84.6%
# 20
ELO LMArena
1363
# 47
AIME 2025 I & II
49.5%
# 12
Math LiveBench
79.0%
# 4
Math LiveBench (old)
77.5%
# 4
MATH 500
96.2%
# 20
Humanity Last Exam
8.9%
# 8
NYT Connections
33.6%
# 11
MMMU
75.0%
# 19
AIME 2024
80.0%
# 7
IF LiveBench
81.3%
# 17
IF LiveBench (old)
78.6%
# 11
Avg LiveBench
66.9%
# 3
Avg LiveBench (old)
74.3%
# 1
IF Evaluation
93.2%
Pricing
# 30
Input Cost /M
$3
# 35
Output Cost /M
$15