Qwen 2.5
Latest • Alibaba • Open Source • 72B
Released: Sep 19, 2024 (rank #83)
Knowledge Cutoff: Sep 24 (rank #9)
Context Length: 128K (rank #13)
Benchmarks
Code RankedAGI: 37.8% (rank #72)
SWEBench Verified: 23.8% (rank #50)
LiveCodeBench v5: 31.1% (rank #36)
Code LMArena: 1247 (rank #38)
Code LiveBench (old): 57.6% (rank #20)
Reason RankedAGI: 37.9% (rank #126)
GPQA Diamond: 49.0% (rank #85)
Reason LiveBench (old): 46.0% (rank #35)
Text Arena: 1259 (rank #45)
Human Eval: 86.6% (rank #19)
Human Eval+: 51.2% (rank #24)
NYT Connections: 11.1% (rank #36)
MMLU Pro: 71.1% (rank #35)
MMLU: 86.8% (rank #21)
Halluc. Hughes: 4.7% (rank #29)
Avg LiveBench (old): 51.4% (rank #37)
Data LiveBench: 48.4% (rank #36)
Language LiveBench: 35.0% (rank #40)
Quality Artificial Analysis: 75 (rank #8)
RAGI Overall: 42.7% (rank #140)
Pricing
Input Cost (per 1M tokens): $0.38 (rank #18)
Output Cost (per 1M tokens): $0.57 (rank #16)
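
The listed prices are quoted per one million tokens, so the cost of a single request scales linearly with its token counts. The following is a minimal sketch of that arithmetic, assuming the two prices above apply uniformly with no volume tiers, caching discounts, or provider-specific fees. The function name `estimate_cost` and the constant names are hypothetical and are not part of any Qwen or Alibaba API.

```python
# Prices copied from the listing, in USD per one million tokens.
# Assumption: flat pricing with no tiers or discounts.
INPUT_COST_PER_M = 0.38
OUTPUT_COST_PER_M = 0.57


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_COST_PER_M + output_tokens * OUTPUT_COST_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token completion.
    print(f"${estimate_cost(10_000, 2_000):.6f}")
```

Under these assumptions, a 10,000-token prompt with a 2,000-token completion comes to roughly half a cent.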