
Llama-4-Scout-17B-16E-Instruct-FP8 Pricing

API pricing for Llama-4-Scout-17B-16E-Instruct-FP8, provided by Llama. Input tokens cost $0 per 1M tokens and output tokens cost $0 per 1M tokens, with a 128K-token context window.


Tools, Vision

Input Cost: $0 per 1M tokens
Output Cost: $0 per 1M tokens
Context Window: 128K tokens

Cost Calculator

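Estimated Cost: $0.00 (both rates are $0 per 1M tokens, so the estimate is $0.00 for any number of input and output tokens)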
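
The arithmetic behind a calculator like this can be sketched in a few lines of Python. This is a minimal sketch, assuming prices are quoted in US dollars per 1M tokens as on this page; the function name `estimate_cost` and its parameters are hypothetical and not part of any provider SDK.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_1m, output_price_per_1m):
    """Return the estimated cost in USD (hypothetical helper).

    Prices are passed as strings (e.g. "0.04") so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_1m) / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_1m) / TOKENS_PER_MILLION
    return input_cost + output_cost


# Listed rates for this model: $0 per 1M input tokens, $0 per 1M output tokens.
scout_cost = estimate_cost(100_000, 4_096, "0", "0")
print(f"${scout_cost:.2f}")  # $0.00

# Same workload at the Ministral 3B rates listed on this page ($0.04 / $0.04).
ministral_cost = estimate_cost(100_000, 4_096, "0.04", "0.04")
print(f"${ministral_cost:.6f}")  # $0.004164
```

Decimal is used instead of float so that small per-token prices do not pick up binary rounding noise when summed over many requests.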

Model Details

Provider: Llama
Model ID: llama-4-scout-17b-16e-instruct-fp8
Context Window: 128K tokens
Max Output: 4,096 tokens
Release Date: 2025-04-05
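
The Context Window and Max Output limits above bound what a single request can contain. A minimal sketch of a pre-flight check follows. It assumes "128K" means 128,000 tokens (some providers mean 131,072) and that output tokens count against the context window, which may not hold for every provider; `fits_limits` is a hypothetical helper, and real token counts must come from the model's own tokenizer.

```python
CONTEXT_WINDOW = 128_000  # assumption: "128K" read as 128,000 tokens
MAX_OUTPUT = 4_096        # from the Model Details listing


def fits_limits(prompt_tokens, requested_output_tokens):
    """Return True if a request stays within both listed limits (hypothetical helper)."""
    if requested_output_tokens > MAX_OUTPUT:
        return False
    # Assumption: prompt and reply share the same context window.
    return prompt_tokens + requested_output_tokens <= CONTEXT_WINDOW


print(fits_limits(100_000, 4_096))  # True
print(fits_limits(125_000, 4_096))  # False: 129,096 tokens exceed the window
print(fits_limits(1_000, 8_192))    # False: reply exceeds the max output
```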

Compare Alternatives

Model                                            Provider         Input ($/1M)  Output ($/1M)  Context
Llama-4-Scout-17B-16E-Instruct-FP8 (this model)  Llama            $0            $0             128K
text-embedding-3-small                           OpenAI           $0.02         $0             8K
Qwen3-ASR Flash                                  Alibaba (China)  $0.032        $0.032         53K
Qwen3-ASR Flash                                  Alibaba          $0.035        $0.035         53K
Ministral 3B                                     Mistral          $0.04         $0.04          128K
Mistral Embed                                    Mistral          $0.1          $0             8K

Available from Resellers

Llama models are also available through these resellers and gateways. The rows listed here cover other models in the Llama family rather than Llama-4-Scout-17B-16E-Instruct-FP8 itself:

Reseller        Model                                   Input ($/1M)  Output ($/1M)
Abacus          Llama 3.1 405B Instruct Turbo           $3.5          $3.5
Abacus          Llama 3.1 70B Instruct                  $0.4          $0.4
Abacus          Llama 3.1 8B Instruct                   $0.02         $0.05
Abacus          Llama 4 Maverick 17B 128E Instruct FP8  $0.14         $0.59
Amazon Bedrock  Llama 3 70B Instruct                    $2.65         $3.5
Amazon Bedrock  Llama 3 8B Instruct                     $0.3          $0.6
Amazon Bedrock  Llama 3.1 70B Instruct                  $0.72         $0.72
Amazon Bedrock  Llama 3.1 8B Instruct                   $0.22         $0.22
Amazon Bedrock  Llama 3.2 11B Instruct                  $0.16         $0.16
Amazon Bedrock  Llama 3.2 1B Instruct                   $0.1          $0.1