MiniMax M3 - access through LLMTR

MiniMax M3 is a frontier model for coding, agentic workflows, long-context analysis, and applications that need image or video input. It supports a 1M-token context window, OpenAI-compatible Chat Completions calls, function calling, prompt caching, and adaptive thinking controls for production integrations.

Technical specifications

Canonical ID:    minimax/minimax-m3
Provider:        MiniMax
Context window:  1,000,000 tokens
Operations:      CHAT_COMPLETIONS
Modalities:      text, image, video

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

Operation          Metric        Unit            Price
CHAT_COMPLETIONS   CACHE_READ    PER_1M_TOKENS   $0.06
CHAT_COMPLETIONS   INPUT_TEXT    PER_1M_TOKENS   $0.30
CHAT_COMPLETIONS   OUTPUT_TEXT   PER_1M_TOKENS   $1.20
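
To make the per-token prices concrete, here is a minimal sketch of a cost estimate for a single request. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes that cached prompt tokens are billed at the CACHE_READ rate instead of the INPUT_TEXT rate; the pricing table does not spell out that rule. The 6% top-up margin is left out because it applies to credit purchases, not to usage.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above.
PRICE_PER_1M = {
    "CACHE_READ": Decimal("0.06"),
    "INPUT_TEXT": Decimal("0.30"),
    "OUTPUT_TEXT": Decimal("1.20"),
}


def estimate_cost(input_tokens, output_tokens, cache_read_tokens=0):
    """Return the estimated usage cost in USD for one request.

    Assumption: `input_tokens` counts only uncached prompt tokens, and
    `cache_read_tokens` counts prompt tokens served from the cache.
    """
    total = (
        Decimal(input_tokens) * PRICE_PER_1M["INPUT_TEXT"]
        + Decimal(cache_read_tokens) * PRICE_PER_1M["CACHE_READ"]
        + Decimal(output_tokens) * PRICE_PER_1M["OUTPUT_TEXT"]
    )
    return total / Decimal(1_000_000)
```

Under these assumptions, a request with 200,000 uncached input tokens, 50,000 cached tokens, and 4,000 output tokens comes to $0.06 + $0.003 + $0.0048 = $0.0678.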

Example usage

If your code already uses an OpenAI SDK, change only the base URL and the model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"minimax/minimax-m3","messages":[{"role":"user","content":"Hello"}]}'
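
The curl call above can be sketched in Python using only the standard library. The helper names `build_request` and `send` are hypothetical. Reading the reply assumes the response follows the usual OpenAI Chat Completions shape (`choices[0].message.content`), which this page implies but does not show.

```python
import json
import urllib.request

BASE_URL = "https://llmtr.com/v1"   # taken from the curl example
MODEL_ID = "minimax/minimax-m3"     # canonical ID from the specifications


def build_request(api_key, prompt):
    """Build the same POST request that the curl example sends."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the assistant's reply text.

    Assumption: the response body is OpenAI-style JSON with the text
    at choices[0].message.content.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

Calling `send(build_request("llmtr-your_key", "Hello"))` with a real key would perform the request. Keeping construction separate from sending makes it possible to inspect the payload without touching the network.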
