Qwen / qwen/qwen3-vl-flash

Qwen3-Vl-Flash - access through LLMTR

Qwen3-Vl-Flash is a good fit when you want lower-latency, lower-cost visual understanding and OCR. It works well for faster multimodal assistant flows.

Technical specifications

Canonical IDqwen/qwen3-vl-flash
ProviderQwen
Context window256,000 tokens
OperationsRESPONSES
Modalitiesexplicit, implicit, text

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

OperationMetricUnitPrice
RESPONSESstartingcatalog$0.005000

Example usage

With existing OpenAI SDK flows, change only the base URL and model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen/qwen3-vl-flash","messages":[{"role":"user","content":"Hello"}]}'

Related models