Google / google/gemini-2.5-flash-lite-preview-09-2025
Gemini 2.5 Flash-Lite Preview - access through LLMTR
Useful when you want early access to lower-cost, high-throughput Gemini behavior before it settles into a stable release. Best for evaluating lightweight production workloads.
Technical specifications
| Canonical ID | google/gemini-2.5-flash-lite-preview-09-2025 |
|---|---|
| Provider | |
| Context window | 1,048,576 tokens |
| Operations | CHAT_COMPLETIONS, RESPONSES |
| Modalities | text, image, video, audio |
Pricing
A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.
| Operation | Metric | Unit | Price |
|---|---|---|---|
| CHAT_COMPLETIONS | INPUT_TEXT | PER_1M_TOKENS | $0.100000 |
| CHAT_COMPLETIONS | OUTPUT_TEXT | PER_1M_TOKENS | $0.400000 |
| CHAT_COMPLETIONS | CACHE_STORAGE | PER_1M_TOKEN_HOURS | $1.00 |
| CHAT_COMPLETIONS | CACHE_WRITE | PER_1M_TOKENS | $0.010000 |
| CHAT_COMPLETIONS | CACHE_WRITE | PER_1M_TOKENS | $0.010000 |
| CHAT_COMPLETIONS | CACHE_WRITE | PER_1M_TOKENS | $0.010000 |
| CHAT_COMPLETIONS | CACHE_WRITE | PER_1M_TOKENS | $0.030000 |
| CHAT_COMPLETIONS | INPUT_AUDIO | PER_1M_TOKENS | $0.300000 |
Example usage
With existing OpenAI SDK flows, change only the base URL and model identifier.
curl https://llmtr.com/v1/chat/completions \
-H "Authorization: Bearer llmtr-your_key" \
-H "Content-Type: application/json" \
-d '{"model":"google/gemini-2.5-flash-lite-preview-09-2025","messages":[{"role":"user","content":"Hello"}]}'
Related models
- Gemini Deep Research Preview google/deep-research-pro-preview-12-2025
- Gemini 2.5 Flash google/gemini-2.5-flash
- Gemini 2.5 Flash Image google/gemini-2.5-flash-image
- Gemini 2.5 Flash-Lite google/gemini-2.5-flash-lite
- Gemini 2.5 Flash Native Audio (Live API) google/gemini-2.5-flash-native-audio-preview-12-2025
- Gemini 2.5 Flash Preview TTS google/gemini-2.5-flash-preview-tts
- Gemini 2.5 Pro google/gemini-2.5-pro
- Gemini 2.5 Pro Preview TTS google/gemini-2.5-pro-preview-tts