Google / google/gemini-2.5-flash-lite-preview-09-2025

Gemini 2.5 Flash-Lite Preview - access through LLMTR

Useful when you want early access to lower-cost, high-throughput Gemini behavior before it settles into a stable release. Best for evaluating lightweight production workloads.

Technical specifications

Canonical ID	`google/gemini-2.5-flash-lite-preview-09-2025`
Provider	Google
Context window	1,048,576 tokens
Operations	CHAT_COMPLETIONS, RESPONSES
Modalities	text, image, video, audio

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

Operation	Metric	Unit	Price
CHAT_COMPLETIONS	INPUT_TEXT	PER_1M_TOKENS	$0.100000
CHAT_COMPLETIONS	OUTPUT_TEXT	PER_1M_TOKENS	$0.400000
CHAT_COMPLETIONS	CACHE_STORAGE	PER_1M_TOKEN_HOURS	$1.00
CHAT_COMPLETIONS	CACHE_WRITE	PER_1M_TOKENS	$0.010000
CHAT_COMPLETIONS	CACHE_WRITE	PER_1M_TOKENS	$0.010000
CHAT_COMPLETIONS	CACHE_WRITE	PER_1M_TOKENS	$0.010000
CHAT_COMPLETIONS	CACHE_WRITE	PER_1M_TOKENS	$0.030000
CHAT_COMPLETIONS	INPUT_AUDIO	PER_1M_TOKENS	$0.300000

Example usage

With existing OpenAI SDK flows, change only the base URL and model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"google/gemini-2.5-flash-lite-preview-09-2025","messages":[{"role":"user","content":"Hello"}]}'

Related models

Gemini Deep Research Preview google/deep-research-pro-preview-12-2025
Gemini 2.5 Flash google/gemini-2.5-flash
Gemini 2.5 Flash Image google/gemini-2.5-flash-image
Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite
Gemini 2.5 Flash Native Audio (Live API) google/gemini-2.5-flash-native-audio-preview-12-2025
Gemini 2.5 Flash Preview TTS google/gemini-2.5-flash-preview-tts
Gemini 2.5 Pro google/gemini-2.5-pro
Gemini 2.5 Pro Preview TTS google/gemini-2.5-pro-preview-tts