Llama 3.1 405B Instruct vs Llama 3.3 70B Instruct
Capability, benchmark, and verified hosting price comparison.
Quick verdict
- Cheapest hosting: Llama 3.3 70B Instruct, at $0.880 vs $3.000 per 1M blended tokens
Derived from public benchmarks and verified hosting prices. A benchmark win is counted only where both models have a score, and the winning direction follows each benchmark’s higher-is-better flag.
Llama 3.1 405B Instruct
by Meta · Llama 3.1
- Context: 131,072 tokens
- Open weight: Yes
- License: Llama 3.1 Community License
- Params: 405.0B
- Modalities: text
Llama 3.3 70B Instruct
by Meta · Llama
- Context: 131,072 tokens
- Open weight: Yes
- License: Llama 3.3 Community License
- Params: 70.0B
- Modalities: text
Benchmarks
| Benchmark | Llama 3.1 405B Instruct | Llama 3.3 70B Instruct |
|---|---|---|
| HumanEval | — | 0.884 |
| MMLU | — | 0.868 |
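The win-counting rule from the quick verdict (count a benchmark only where both models have a score, and respect its higher-is-better flag) can be sketched as follows. This is a minimal illustration, not the site's actual scoring code: the `BENCHMARKS` structure and the `count_wins` helper are hypothetical names, and the `higher_is_better` flags are assumed to be true for both benchmarks, since each reports a success rate.

```python
# Scores copied from the table above; None marks a missing score ("—").
# The dict layout and the higher_is_better values are assumptions for illustration.
BENCHMARKS = {
    "HumanEval": {"higher_is_better": True, "scores": (None, 0.884)},
    "MMLU": {"higher_is_better": True, "scores": (None, 0.868)},
}


def count_wins(benchmarks):
    """Return (wins_a, wins_b), skipping benchmarks where either score is missing."""
    wins_a = wins_b = 0
    for entry in benchmarks.values():
        a, b = entry["scores"]
        if a is None or b is None:
            continue  # no head-to-head comparison is possible
        if a == b:
            continue  # a tie counts for neither model
        a_better = a > b if entry["higher_is_better"] else a < b
        if a_better:
            wins_a += 1
        else:
            wins_b += 1
    return wins_a, wins_b


wins_405b, wins_70b = count_wins(BENCHMARKS)
print(wins_405b, wins_70b)  # prints "0 0"
```

Because the 405B column has no scores in this table, neither model is credited with a benchmark win, which is consistent with the quick verdict listing only a hosting-price result.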
Cheapest verified hosting
Llama 3.1 405B Instruct cheapest
$3.000 / 1M tokens (blended)
- Input: $3.000 / 1M tokens
- Output: $3.000 / 1M tokens
- Context: 131,072 tokens
Llama 3.3 70B Instruct cheapest
$0.880 / 1M tokens (blended)
- Input: $0.880 / 1M tokens
- Output: $0.880 / 1M tokens
- Context: 131,072 tokens
- Latency p50: 190 ms
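The page does not state how the blended price is derived, so the following is only a sketch under an assumed weighting: a weighted average of the input and output prices, with a hypothetical `input_share` of 0.75 (three input tokens per output token). The `blended_price` helper and the `PRICES` mapping are illustrative names, not part of any provider's API. Since each model's input and output prices are identical here, any weighting gives the same blended figure.

```python
def blended_price(input_price, output_price, input_share=0.75):
    """Weighted average price per 1M tokens.

    input_share is the assumed fraction of tokens that are input tokens;
    the 0.75 default is an illustrative assumption, not a documented value.
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    return input_share * input_price + (1.0 - input_share) * output_price


# (input, output) prices in USD per 1M tokens, taken from the listings above.
PRICES = {
    "Llama 3.1 405B Instruct": (3.000, 3.000),
    "Llama 3.3 70B Instruct": (0.880, 0.880),
}

blended = {name: blended_price(i, o) for name, (i, o) in PRICES.items()}
ratio = blended["Llama 3.1 405B Instruct"] / blended["Llama 3.3 70B Instruct"]

for name, price in blended.items():
    print(f"{name}: ${price:.3f} / 1M tokens (blended)")
print(f"405B costs about {ratio:.2f}x the 70B price")
```

At these prices the cheapest 405B hosting comes out at roughly 3.4 times the cost of the cheapest 70B hosting per token.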