Qwen / qwen/qwen3-vl-8b-thinking
Qwen3-Vl-8B-Thinking - access through LLMTR
Qwen3-Vl-8B-Thinking is a vision-language model for understanding text, images, and video. It is well suited to OCR, screenshot analysis, document understanding, and multimodal assistant workflows. The thinking variant is better suited to harder visual problems and more deliberate step-by-step analysis.
Technical specifications
| Canonical ID | qwen/qwen3-vl-8b-thinking |
|---|---|
| Provider | Qwen |
| Context window | 256,000 tokens |
| Operations | RESPONSES |
| Modalities | text |
Pricing
A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.
| Operation | Metric | Unit | Price |
|---|---|---|---|
| RESPONSES | INPUT_TEXT | PER_1M_TOKENS | $0.180000 |
| RESPONSES | OUTPUT_TEXT | PER_1M_TOKENS | $2.10 |
Example usage
With existing OpenAI SDK flows, change only the base URL and model identifier.
curl https://llmtr.com/v1/chat/completions \
-H "Authorization: Bearer llmtr-your_key" \
-H "Content-Type: application/json" \
-d '{"model":"qwen/qwen3-vl-8b-thinking","messages":[{"role":"user","content":"Hello"}]}'
Related models
- Qwen3.7-Max qwen/qwen3.7-max
- Qwen3.7-Plus qwen/qwen3.7-plus
- Qwen3.6-Flash qwen/qwen3.6-flash
- Qwen-Plus qwen/qwen-plus
- Qwen-Plus-2025-01-25 qwen/qwen-plus-2025-01-25
- Qwen-Plus-2025-04-28 qwen/qwen-plus-2025-04-28
- Qwen-Plus-2025-07-14 qwen/qwen-plus-2025-07-14
- Qwen-Plus-2025-07-28 qwen/qwen-plus-2025-07-28