Nvidia

Llama-3.1-Nemotron-Ultra-253B-v1 Pricing

API pricing for Llama-3.1-Nemotron-Ultra-253B-v1 by Nvidia. Input tokens cost $0/1M, output tokens cost $0/1M with 131K context window.

Last updated:

ReasoningTools

Input Cost

$0

per 1M tokens

Output Cost

$0

per 1M tokens

Context Window

131K

tokens

Cost Calculator

Estimated Cost:

$0.00

Model Details

Provider Nvidia
Model ID nvidia/llama-3.1-nemotron-ultra-253b-v1
Context Window 131K tokens
Max Output 8192 tokens
Release Date 2024-07-01

Compare Alternatives

Model Input Output Context
Llama-3.1-Nemotron-Ultra-253B-v1 (this model) $0 $0 131K
text-embedding-3-small OpenAI $0.02 $0 8K
Qwen3-ASR Flash Alibaba (China) $0.032 $0.032 53K
Qwen3-ASR Flash Alibaba $0.035 $0.035 53K
Ministral 3B Mistral $0.04 $0.04 128K
Mistral Embed Mistral $0.1 $0 8K

Available from Resellers

Llama-3.1-Nemotron-Ultra-253B-v1 is also available through these resellers and gateways:

Reseller Model Input Output
Amazon Bedrock Llama 3 70B Instruct $2.65 $3.5
Amazon Bedrock Llama 3 8B Instruct $0.3 $0.6
Amazon Bedrock Llama 3.1 70B Instruct $0.72 $0.72
Amazon Bedrock Llama 3.1 8B Instruct $0.22 $0.22
Amazon Bedrock Llama 3.2 11B Instruct $0.16 $0.16
Amazon Bedrock Llama 3.2 1B Instruct $0.1 $0.1
Amazon Bedrock Llama 3.2 3B Instruct $0.15 $0.15
Amazon Bedrock Llama 3.2 90B Instruct $0.72 $0.72
Amazon Bedrock Llama 3.3 70B Instruct $0.72 $0.72
Amazon Bedrock Llama 4 Maverick 17B Instruct $0.24 $0.97
← Back to All Models