The bargain ‘lite’ Google Gemini—fast, cheap, and good enough for chatbots or support FAQs.
Price per 1 million tokens
per 1M tokens you send
per 1M tokens you receive
Input tokens are what you send to the AI, output tokens are what the AI sends back. These rates are set by the provider and reflect the current Google DeepMind Gemini 2.0 Flash-Lite API pricing. Price accurate as of June 2025.
per month
per month
per month
Cut-down Gemini Flash variant tuned for ultra-low latency and cost-sensitive traffic; retains multimodal I/O and the same retrieval-optimised MoE backbone as Flash.
The context window is the maximum amount of text the model can "see" at once. Larger windows allow for longer conversations or documents.
Join thousands of users who have switched to API.chat and are saving on their AI expenses while enjoying a better experience.