Llama 3.1
LatestMeta
•
Open Source•
405B# 90
Released
Jul 23, 2024
# 15
Knowledge Cutoff
Dec 23
# 13
Context Length
128K
Benchmarks
# 58
Code RankedAGI
42.9%
# 48
SWEBench Verified
24.5%
# 48
Code Arena
811.91
# 37
LiveCodeBench v5
28.4%
# 34
Code LMArena
1260
# 41
Code LiveBench (old)
42.6%
# 114
Reason RankedAGI
40.7%
# 77
GPQA Diamond
51.1%
# 27
Reason LiveBench (old)
53.3%
# 42
Text Arena
1266
# 12
Human Eval
89.0%
# 29
NYT Connections
16.2%
# 32
MMLU Pro
73.3%
# 19
MMLU
87.0%
# 23
Halluc. Hughes
3.9%
# 19
Aidan Bench
778
# 24
Avg LiveBench (old)
58.6%
# 10
IF Evaluation
88.6%
# 27
Data LiveBench
54.5%
# 23
Language LiveBench
45.5%
# 11
Quality Artificial Analysis
72
# 132
RAGI Overall
43.9%
Pricing
# 34
Input Cost /M
$5
# 37
Output Cost /M
$15