Pricing · per 1M tokens
Input
$0.600
per million tokens
Output
$3.60
per million tokens
| Blended avg cost* | $2.10/Mtok |
|---|
Overview
Large NVIDIA-tuned Llama model. $0.60/$3.60 per 1M on hosted platforms.
Specifications
| Provider | NVIDIA |
|---|---|
| Context window | 128K tokens |
| Modality | text |
| Parameters | 253B |
| Open source | Yes — open weights available |
| Released | Jan 1, 2025 |
| Status | Current |
| Last updated | Jun 24, 2026 |
| Tags | open-weights reasoning |