Llama 3.1 Instruct 405B is a text-based AI model by Meta. It features 32 tok/s output speed, $2.75/1M input tokens pricing. It scores 17.4 on the intelligence index.
Output Speed
32 tok/s
Latency (TTFT)
0.48s
Blended Price
$3.69/M
| Input (Prompt) | $2.75 |
| Output (Completion) | $6.50 |
| Cache Read | Free |
| Cache Write | Free |
| Context Length | N/A |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | N/A |
Llama 3.1 Instruct 405B costs $2.75/1M input tokens and $6.50/1M output tokens.
Llama 3.1 Instruct 405B has a lower coding score of 14.5. For demanding coding tasks, consider a model with a higher coding benchmark.
Llama 3.1 Instruct 405B generates output at 32 tok/s. Time to first token is 0.48s.
No, Llama 3.1 Instruct 405B is a paid model. Check the free models page for zero-cost alternatives.
Last updated:
See the alternatives section above for models with similar capabilities. You can also compare Llama 3.1 Instruct 405B head-to-head with any model on our comparison page.