Qwen / qwen/qwen3.5-flash
Qwen3.5-Flash - access through LLMTR
Qwen3.5-Flash is a lightweight model optimized for fast responses, low latency, and high-volume use. It is a good fit for simple assistants, automation, and cost-controlled products.
Technical specifications
| Canonical ID | qwen/qwen3.5-flash |
|---|---|
| Provider | Qwen |
| Context window | 256,000 tokens |
| Operations | CHAT_COMPLETIONS |
| Modalities | explicit, text |
Pricing
A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.
| Operation | Metric | Unit | Price |
|---|---|---|---|
| CHAT_COMPLETIONS | INPUT_TEXT | PER_1M_TOKENS | $0.100000 |
| CHAT_COMPLETIONS | CACHE_READ | PER_1M_TOKENS | $0.010000 |
| CHAT_COMPLETIONS | OUTPUT_TEXT | PER_1M_TOKENS | $0.400000 |
| CHAT_COMPLETIONS | CACHE_WRITE | PER_1M_TOKENS | $0.125000 |
Example usage
With existing OpenAI SDK flows, change only the base URL and model identifier.
curl https://llmtr.com/v1/chat/completions \
-H "Authorization: Bearer llmtr-your_key" \
-H "Content-Type: application/json" \
-d '{"model":"qwen/qwen3.5-flash","messages":[{"role":"user","content":"Hello"}]}'
Related models
- Qwen3.7-Max qwen/qwen3.7-max
- Qwen3.7-Plus qwen/qwen3.7-plus
- Qwen3.6-Flash qwen/qwen3.6-flash
- Qwen-Plus qwen/qwen-plus
- Qwen-Plus-2025-01-25 qwen/qwen-plus-2025-01-25
- Qwen-Plus-2025-04-28 qwen/qwen-plus-2025-04-28
- Qwen-Plus-2025-07-14 qwen/qwen-plus-2025-07-14
- Qwen-Plus-2025-07-28 qwen/qwen-plus-2025-07-28