Pricing · per 1M tokens
Input
$0.050
per million tokens
Output
$1.00
per million tokens
| Blended avg cost* | $0.525/Mtok |
|---|
Overview
Llama 3.1 8B on Groq's LPU. ~840 tokens/sec — extremely fast. $0.05/$0.08 per 1M tokens.
Specifications
| Provider | Groq |
|---|---|
| Context window | 128K tokens |
| Modality | text |
| Parameters | 8B |
| Open source | Yes — open weights available |
| Released | Jul 23, 2024 |
| Status | Current |
| Last updated | Jun 24, 2026 |
| Tags | fast open-weights speed |