Claude 3.5 Sonnet
OldAnthropic
•
Proprietary# 54
Released
Jun 20, 2024
# 8
Knowledge Cutoff
Apr 24
# 5
Context Length
200K
Benchmarks
# 18
Code LiveBench
60.9%
# 14
SWEBench Verified
33.4%
# 23
LiveCodeBench v5
36.0%
# 21
Code LMArena
1282
# 27
GPQA Diamond
59.4%
# 19
Reason LiveBench
58.7%
# 31
ELO LMArena
1268
# 24
Math LiveBench
53.3%
# 28
MATH
71.1%
# 4
Human Eval
92.0%
# 12
Human Eval+
81.7%
# 4
Aider Old
77.4%
# 11
MMLU Pro
76.1%
# 6
MMLU
88.7%
# 8
Aidan Bench
1439
# 35
IF LiveBench
68.0%
# 25
Data LiveBench
56.7%
# 9
Lang LiveBench
53.2%
# 21
Avg LiveBench
59.7%
Pricing
# 26
Input Cost /M
$3
# 29
Output Cost /M
$15