Hermes 3 - Llama-3.1 70B is a text-based AI model by Nous Research. It generates output at 41 tok/s, costs $0.30 per 1M input tokens, and scores 10.6 on the intelligence index.
Output Speed: 41 tok/s
Latency (TTFT): 0.33s
Blended Price: $0.30/M
| Specification | Value |
| --- | --- |
| Input (Prompt) | $0.30 / 1M tokens |
| Output (Completion) | $0.30 / 1M tokens |
| Cache Read | Free |
| Cache Write | Free |
| Context Length | N/A |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | N/A |
Hermes 3 - Llama-3.1 70B costs $0.30/1M input tokens and $0.30/1M output tokens.
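As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost` and the example token counts are hypothetical, and the calculation assumes no other fees apply.

```python
# Listed prices in USD per 1M tokens (taken from this page).
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 0.30


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
print(f"Estimated cost: ${estimate_cost(2_000, 500):.6f}")
```

Because input and output tokens are priced identically, the cost depends only on the total token count, which is why the blended price is also $0.30/M.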
Hermes 3 - Llama-3.1 70B generates output at 41 tok/s. Time to first token is 0.33s.
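These two figures can be combined into a rough estimate of how long a full response takes. The sketch below assumes the listed speed holds steady for the whole generation and ignores network and queueing delays; `estimate_response_time` is a hypothetical helper, not part of any API.

```python
# Listed performance figures (taken from this page).
TTFT_SECONDS = 0.33              # time to first token
OUTPUT_TOKENS_PER_SECOND = 41.0  # steady-state output speed


def estimate_response_time(output_tokens: int) -> float:
    """Return the approximate seconds needed to receive a full response."""
    return TTFT_SECONDS + output_tokens / OUTPUT_TOKENS_PER_SECOND


# Hypothetical response length of 500 tokens.
print(f"Estimated time: {estimate_response_time(500):.1f}s")
```

Actual timings vary with provider load and prompt length, so treat the result as an order-of-magnitude guide.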
Hermes 3 - Llama-3.1 70B is a paid model, not a free one. Check the free models page for zero-cost alternatives.
See the alternatives section above for models with similar capabilities. You can also compare Hermes 3 - Llama-3.1 70B head-to-head with any model on our comparison page.