Claude 3.7 Sonnet Thinking
LatestAnthropic
•
Proprietary# 20
Released
Feb 24, 2025
# 4
Knowledge Cutoff
Oct 24
# 6
Context Length
200K
Benchmarks
# 15
Code RankedAGI
60.4%
# 19
Code LiveBench
44.7%
# 5
Aider Polyglot
64.9%
# 9
Code LMArena
1333
# 5
Code LiveBench 24.11
71.5%
# 8
GPQA Diamond
78.2%
# 5
Reason LiveBench
84.6%
# 24
ELO LMArena
1306
# 23
AIME 2025 I & II
49.5%
# 4
Math LiveBench
77.5%
# 3
MATH 500
96.2%
# 8
NYT Connections
33.6%
# 6
MMMU
75.0%
# 15
AIME 2024
80.0%
# 17
IF LiveBench
78.6%
# 2
Data LiveBench
72.8%
# 6
Lang LiveBench
61.0%
# 3
Avg LiveBench
74.3%
# 1
IF Evaluation
93.2%
Pricing
# 27
Input Cost /M
$3
# 33
Output Cost /M
$15