Qwen / qwen/qwen3.5-flash

Qwen3.5-Flash - access through LLMTR

Qwen3.5-Flash is a lightweight model optimized for fast responses, low latency, and high-volume use. It is a good fit for simple assistants, automation, and cost-controlled products.

Technical specifications

Canonical ID	`qwen/qwen3.5-flash`
Provider	Qwen
Context window	256,000 tokens
Operations	CHAT_COMPLETIONS
Modalities	text

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

Operation	Metric	Unit	Price
CHAT_COMPLETIONS	INPUT_TEXT	PER_1M_TOKENS	$0.100000
CHAT_COMPLETIONS	CACHE_READ	PER_1M_TOKENS	$0.010000
CHAT_COMPLETIONS	OUTPUT_TEXT	PER_1M_TOKENS	$0.400000
CHAT_COMPLETIONS	CACHE_WRITE	PER_1M_TOKENS	$0.125000

Example usage

With existing OpenAI SDK flows, change only the base URL and model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen/qwen3.5-flash","messages":[{"role":"user","content":"Hello"}]}'

Related models

Qwen3.7-Max qwen/qwen3.7-max
Qwen3.7-Plus qwen/qwen3.7-plus
Qwen3.6-Flash qwen/qwen3.6-flash
Qwen-Plus qwen/qwen-plus
Qwen-Plus-2025-01-25 qwen/qwen-plus-2025-01-25
Qwen-Plus-2025-04-28 qwen/qwen-plus-2025-04-28
Qwen-Plus-2025-07-14 qwen/qwen-plus-2025-07-14
Qwen-Plus-2025-07-28 qwen/qwen-plus-2025-07-28