Billing

Users pay through a credit balance. Every API request deducts from the balance based on input and output token counts.

A 6% platform margin is added on top of the requested top-up amount.

Requested: $10.00
Margin (6%): $0.60
Charged: $10.60

Examples for small first payments:

| Credit added | Platform margin | Amount charged |
| --- | --- | --- |
| $5.00 | $0.30 | $5.30 |
| $10.00 | $0.60 | $10.60 |
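
The margin arithmetic above can be reproduced with a small sketch. The function name `topup_breakdown` is hypothetical, and rounding the margin to the nearest cent is an assumption, since this page does not state a rounding rule.

```python
from decimal import Decimal, ROUND_HALF_UP

MARGIN_RATE = Decimal("0.06")  # 6% platform margin, as stated on this page
CENT = Decimal("0.01")


def topup_breakdown(requested: str) -> dict:
    """Return the credit added, the margin, and the amount charged for a top-up."""
    credit = Decimal(requested).quantize(CENT)
    # Assumption: the margin is rounded half-up to the nearest cent.
    margin = (credit * MARGIN_RATE).quantize(CENT, rounding=ROUND_HALF_UP)
    return {"credit": credit, "margin": margin, "charged": credit + margin}


breakdown = topup_breakdown("10.00")
print(breakdown["credit"], breakdown["margin"], breakdown["charged"])
```

Decimal arithmetic is used instead of floats so that currency values do not pick up binary rounding noise.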

Prices are quoted in USD per 1M tokens. Some models are also priced in other units, such as cached input tokens, reasoning tokens, images, or seconds of audio.

Example price units:

| Unit | Example value | Note |
| --- | --- | --- |
| per 1M tokens | $5.00 | Text input/output |
| per image | $0.002 | Image generation |
| per second | $0.0008 | Audio/video |
| per minute | $0.048 | Realtime audio |
| per 1M token-hours | $3.20 | Cache |

Check the current price per model at Dashboard > Models.
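
As a rough illustration of per-1M-token pricing, the sketch below estimates the cost of one text request. The function `request_cost` is hypothetical, and the $5.00 price is only the example value from the table, not a real model price. Input and output prices are passed separately on the assumption that a model may price them differently.

```python
from decimal import Decimal

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: str, output_price: str) -> Decimal:
    """Estimate the USD cost of one request from token counts and per-1M prices."""
    cost_in = Decimal(input_tokens) / TOKENS_PER_PRICE_UNIT * Decimal(input_price)
    cost_out = Decimal(output_tokens) / TOKENS_PER_PRICE_UNIT * Decimal(output_price)
    return cost_in + cost_out


# Example: 20 input tokens and 596 output tokens, both at the example $5.00 rate.
cost = request_cost(20, 596, "5.00", "5.00")
print(cost)
```

With these example numbers the estimate comes to about $0.00308. Always take the actual rates from Dashboard > Models.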

On reasoning/thinking models (e.g. Z.AI GLM, the Google Gemini thinking series), the reasoning tokens the model generates are billed as output tokens, even though they do not appear in the response text. A short answer can therefore consume more tokens than expected because of the reasoning produced behind it.

For transparency, the response usage block breaks this out:

{
  "usage": {
    "prompt_tokens": 20,
    "completion_tokens": 596,
    "total_tokens": 616,
    "completion_tokens_details": { "reasoning_tokens": 575 }
  }
}

completion_tokens already includes reasoning tokens; reasoning_tokens only shows how many went to reasoning. prompt_tokens + completion_tokens = total_tokens always holds.
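
To see how much of a response was hidden reasoning, the usage block can be split as in the following sketch. The helper `split_completion` and the key `visible_tokens` are hypothetical names introduced for illustration; the field names inside `usage` are taken from the example above.

```python
def split_completion(usage: dict) -> dict:
    """Separate reasoning tokens from visible output tokens in a usage block."""
    # The details object may be absent on non-reasoning models (assumption).
    details = usage.get("completion_tokens_details") or {}
    reasoning = details.get("reasoning_tokens", 0)
    completion = usage["completion_tokens"]
    return {
        "reasoning_tokens": reasoning,
        # completion_tokens already includes reasoning, so subtract it out.
        "visible_tokens": completion - reasoning,
        "total_matches": usage["prompt_tokens"] + completion == usage["total_tokens"],
    }


usage = {
    "prompt_tokens": 20,
    "completion_tokens": 596,
    "total_tokens": 616,
    "completion_tokens_details": {"reasoning_tokens": 575},
}
print(split_completion(usage))
```

For the example usage block, 575 of the 596 billed output tokens went to reasoning, leaving 21 for the visible answer.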

To keep costs down, enable thinking only when you need it. GLM models have thinking off by default (Z.AI Thinking Control) and reason only when the request uses the :think suffix or sets reasoning: true.
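
The two opt-in switches can be sketched as a small request-building helper. This is only an illustration under stated assumptions: that the `:think` suffix is appended to the model name, that `reasoning` is a top-level field of the request body, and that the body otherwise has a `model` key. The helper `with_thinking` and the model name `example-glm-model` are placeholders, not real identifiers.

```python
def with_thinking(payload: dict, *, via_suffix: bool = False) -> dict:
    """Return a copy of a request payload with reasoning switched on."""
    enabled = dict(payload)  # shallow copy; the original payload stays unchanged
    if via_suffix:
        # Assumption: the :think suffix is appended to the model name.
        enabled["model"] = f'{payload["model"]}:think'
    else:
        # Assumption: reasoning is a top-level boolean in the request body.
        enabled["reasoning"] = True
    return enabled


base = {"model": "example-glm-model"}  # placeholder model name
print(with_thinking(base))
print(with_thinking(base, via_suffix=True))
```

Keeping the default payload free of either switch means reasoning tokens are only billed on the requests where you explicitly ask for them.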

Dashboard > Billing shows:

  • Live balance
  • Top-up history (amount, margin, date)
  • Per-request deductions
  • Monthly spend summary

The intended way to top up from the dashboard is:

  • Credit card or another supported payment method through a secure checkout page.

Manual bank transfer is handled only after support approval as an operational exception. Crypto top-up is not part of the public checkout flow.

If the balance cannot cover a request, the API returns 402 insufficient_balance. No tokens are consumed.

{
  "error": {
    "message": "Insufficient balance. Top up required.",
    "type": "insufficient_balance"
  }
}
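
A client can detect this case before retrying or prompting the user to top up. The sketch below assumes that 402 is the HTTP status code and that the response body has already been parsed from JSON into a dict; `is_insufficient_balance` is a hypothetical helper name.

```python
def is_insufficient_balance(status_code: int, body: dict) -> bool:
    """Return True when a response signals that the balance cannot cover the request."""
    error = body.get("error") or {}
    return status_code == 402 and error.get("type") == "insufficient_balance"


response_body = {
    "error": {
        "message": "Insufficient balance. Top up required.",
        "type": "insufficient_balance",
    }
}

if is_insufficient_balance(402, response_body):
    print("Top up the balance before retrying:", response_body["error"]["message"])
```

Checking the `type` field as well as the status code avoids treating an unrelated 402 response as a balance problem.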