Qwen / qwen/qwen3-vl-8b-thinking

Qwen3-Vl-8B-Thinking - access through LLMTR

Qwen3-Vl-8B-Thinking is a vision-language model for understanding text, images, and video. It is well suited to OCR, screenshot analysis, document understanding, and multimodal assistant workflows. The thinking variant is better suited to harder visual problems and more deliberate step-by-step analysis.

Technical specifications

Canonical ID	`qwen/qwen3-vl-8b-thinking`
Provider	Qwen
Context window	256,000 tokens
Operations	RESPONSES
Modalities	text

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

Operation	Metric	Unit	Price
RESPONSES	INPUT_TEXT	PER_1M_TOKENS	$0.180000
RESPONSES	OUTPUT_TEXT	PER_1M_TOKENS	$2.10

Example usage

With existing OpenAI SDK flows, change only the base URL and model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen/qwen3-vl-8b-thinking","messages":[{"role":"user","content":"Hello"}]}'

Related models

Qwen3.7-Max qwen/qwen3.7-max
Qwen3.7-Plus qwen/qwen3.7-plus
Qwen3.6-Flash qwen/qwen3.6-flash
Qwen-Plus qwen/qwen-plus
Qwen-Plus-2025-01-25 qwen/qwen-plus-2025-01-25
Qwen-Plus-2025-04-28 qwen/qwen-plus-2025-04-28
Qwen-Plus-2025-07-14 qwen/qwen-plus-2025-07-14
Qwen-Plus-2025-07-28 qwen/qwen-plus-2025-07-28