Nvidia
Llama Embed Nemotron 8B Pricing
API pricing for Llama Embed Nemotron 8B by Nvidia. Input tokens cost $0/1M, output tokens cost $0/1M with 33K context window.
Last updated:
Input Cost
$0
per 1M tokens
Output Cost
$0
per 1M tokens
Context Window
33K
tokens
Cost Calculator
Estimated Cost:
$0.00
Model Details
Provider
Nvidia
Model ID
nvidia/llama-embed-nemotron-8b
Context Window
33K tokens
Max Output
2048 tokens
Release Date
2025-03-18
Sources
Compare Alternatives
| Model | Input | Output | Context |
|---|---|---|---|
| Llama Embed Nemotron 8B (this model) | $0 | $0 | 33K |
| text-embedding-3-small OpenAI | $0.02 | $0 | 8K |
| Qwen3-ASR Flash Alibaba (China) | $0.032 | $0.032 | 53K |
| Qwen3-ASR Flash Alibaba | $0.035 | $0.035 | 53K |
| Ministral 3B Mistral | $0.04 | $0.04 | 128K |
| Mistral Embed Mistral | $0.1 | $0 | 8K |
Available from Resellers
Llama Embed Nemotron 8B is also available through these resellers and gateways:
| Reseller | Model | Input | Output |
|---|---|---|---|
| Abacus | Llama 3.1 8B Instruct | $0.02 | $0.05 |
| Amazon Bedrock | Llama 3 8B Instruct | $0.3 | $0.6 |
| Amazon Bedrock | Llama 3.1 8B Instruct | $0.22 | $0.22 |
| Azure | Meta-Llama-3-8B-Instruct | $0.3 | $0.61 |
| Azure | Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 |
| Azure Cognitive Services | Meta-Llama-3-8B-Instruct | $0.3 | $0.61 |
| Azure Cognitive Services | Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 |
| Cloudflare AI Gateway | Llama 3 8B Instruct | $0.28 | $0.83 |
| Cloudflare AI Gateway | Llama 3 8B Instruct AWQ | $0.12 | $0.27 |
| Cloudflare AI Gateway | Llama 3.1 8B Instruct | $0.28 | $0.8299999999999998 |