Nvidia

Llama Embed Nemotron 8B Pricing

API pricing for Llama Embed Nemotron 8B by Nvidia. Input and output tokens each cost $0 per 1M tokens, and the context window is 33K tokens.

Input Cost: $0 per 1M tokens

Output Cost: $0 per 1M tokens

Context Window: 33K tokens

Cost Calculator

Estimated Cost: $0.00
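
The estimate is plain per-token arithmetic: tokens divided by one million, multiplied by the price per 1M tokens, summed over input and output. A minimal sketch of that calculation follows. The function name `estimate_cost` and its parameters are hypothetical and are not part of any Nvidia API. It uses `Decimal` so that sums do not pick up binary floating-point noise.

```python
from decimal import Decimal

# Prices on this page are quoted per 1M tokens.
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_1m, output_price_per_1m):
    """Return the estimated cost in dollars, rounded to the nearest cent.

    Hypothetical helper for illustration; prices are passed as strings
    (e.g. "0.02") so they convert to Decimal exactly.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_UNIT * Decimal(str(input_price_per_1m))
    output_cost = Decimal(output_tokens) / TOKENS_PER_UNIT * Decimal(str(output_price_per_1m))
    return (input_cost + output_cost).quantize(Decimal("0.01"))


# Llama Embed Nemotron 8B: $0 input, $0 output.
print(estimate_cost(5_000_000, 0, "0", "0"))     # 0.00

# Same workload at text-embedding-3-small's listed $0.02 input price.
print(estimate_cost(5_000_000, 0, "0.02", "0"))  # 0.10
```

For an embedding model the output token count is normally zero, so only the input term contributes.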

Model Details

Provider: Nvidia
Model ID: nvidia/llama-embed-nemotron-8b
Context Window: 33K tokens
Max Output: 2048 tokens
Release Date: 2025-03-18

Compare Alternatives

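| Model | Provider | Input (per 1M tokens) | Output (per 1M tokens) | Context |
| --- | --- | --- | --- | --- |
| Llama Embed Nemotron 8B (this model) | Nvidia | $0 | $0 | 33K |
| text-embedding-3-small | OpenAI | $0.02 | $0 | 8K |
| Qwen3-ASR Flash | Alibaba (China) | $0.032 | $0.032 | 53K |
| Qwen3-ASR Flash | Alibaba | $0.035 | $0.035 | 53K |
| Ministral 3B | Mistral | $0.04 | $0.04 | 128K |
| Mistral Embed | Mistral | $0.1 | $0 | 8K |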

Available from Resellers

The resellers and gateways below list Llama 3 and Llama 3.1 8B Instruct models rather than Llama Embed Nemotron 8B itself. Prices are per 1M tokens.

| Reseller | Model | Input | Output |
| --- | --- | --- | --- |
| Abacus | Llama 3.1 8B Instruct | $0.02 | $0.05 |
| Amazon Bedrock | Llama 3 8B Instruct | $0.3 | $0.6 |
| Amazon Bedrock | Llama 3.1 8B Instruct | $0.22 | $0.22 |
| Azure | Meta-Llama-3-8B-Instruct | $0.3 | $0.61 |
| Azure | Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 |
| Azure Cognitive Services | Meta-Llama-3-8B-Instruct | $0.3 | $0.61 |
| Azure Cognitive Services | Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 |
| Cloudflare AI Gateway | Llama 3 8B Instruct | $0.28 | $0.83 |
| Cloudflare AI Gateway | Llama 3 8B Instruct AWQ | $0.12 | $0.27 |
| Cloudflare AI Gateway | Llama 3.1 8B Instruct | $0.28 | $0.83 |