Qwen 2.5
LatestAlibaba
•
Open Source•
72B# 43
Released
Sep 19, 2024
# 4
Knowledge Cutoff
Sep 24
# 7
Context Length
128K
Benchmarks
# 20
Code LiveBench
57.6%
# 17
SWEBench Verified
23.8%
# 25
LiveCodeBench v5
31.1%
# 30
Code LMArena
1247
# 42
GPQA Diamond
49.0%
# 34
Reason LiveBench
46.0%
# 35
ELO LMArena
1259
# 25
Math LiveBench
52.4%
# 13
MATH
83.1%
# 18
Human Eval
86.6%
# 24
Human Eval+
51.2%
# 28
NYT Connections
11.1%
# 22
Aider Old
55.6%
# 23
MMLU Pro
71.1%
# 12
MMLU
86.8%
# 22
Halluc. Hughes
4.7%
# 42
IF LiveBench
64.4%
# 36
Data LiveBench
48.4%
# 39
Lang LiveBench
35.0%
# 36
Avg LiveBench
51.4%
Pricing
# 17
Input Cost /M
$0.38
# 14
Output Cost /M
$0.57