Qwen / qwen/qwen3-vl-flash

Qwen3-Vl-Flash - access through LLMTR

Qwen3-Vl-Flash is a good fit when you want lower-latency, lower-cost visual understanding and OCR. It works well for faster multimodal assistant flows.

Technical specifications

Canonical ID	`qwen/qwen3-vl-flash`
Provider	Qwen
Context window	256,000 tokens
Operations	RESPONSES
Modalities	text

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

Operation	Metric	Unit	Price
RESPONSES	starting	catalog	$0.005000

Example usage

With existing OpenAI SDK flows, change only the base URL and model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen/qwen3-vl-flash","messages":[{"role":"user","content":"Hello"}]}'

Related models

Qwen3.7-Max qwen/qwen3.7-max
Qwen3.7-Plus qwen/qwen3.7-plus
Qwen3.6-Flash qwen/qwen3.6-flash
Qwen-Plus qwen/qwen-plus
Qwen-Plus-2025-01-25 qwen/qwen-plus-2025-01-25
Qwen-Plus-2025-04-28 qwen/qwen-plus-2025-04-28
Qwen-Plus-2025-07-14 qwen/qwen-plus-2025-07-14
Qwen-Plus-2025-07-28 qwen/qwen-plus-2025-07-28