
Llama 3.1 8B - access through LLMTR

Llama 3.1 8B is a small open-weight model for low-latency text tasks such as short chat, classification, data transformation, and light automation. It supports JSON-formatted output and is a practical starting point for prototypes and experiments. It is offered free with a daily usage quota.

Technical specifications

Canonical ID: meta/llama-3.1-8b
Provider: Meta
Context window: 131,072 tokens
Operations: CHAT_COMPLETIONS
Modalities: text

Pricing

A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.

Operation          Metric        Unit            Price
CHAT_COMPLETIONS   INPUT_TEXT    PER_1M_TOKENS   Not available
CHAT_COMPLETIONS   OUTPUT_TEXT   PER_1M_TOKENS   Not available

Example usage

If you already use the OpenAI SDK, change only the base URL and the model identifier.

curl https://llmtr.com/v1/chat/completions \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"meta/llama-3.1-8b","messages":[{"role":"user","content":"Hello"}]}'
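The curl request above can also be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the endpoint, headers, and payload are copied from the curl example, while the helper names build_chat_request and send are hypothetical and exist only for illustration.

```python
import json
import urllib.request

BASE_URL = "https://llmtr.com/v1"   # taken from the curl example
MODEL_ID = "meta/llama-3.1-8b"      # canonical ID from the specifications


def build_chat_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a chat completions request.

    Hypothetical helper: mirrors the curl example field for field.
    """
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body.

    Requires network access and a valid key; raises urllib.error.HTTPError
    on non-2xx responses.
    """
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


if __name__ == "__main__":
    # Only builds the request; call send(req) with a real key to execute it.
    req = build_chat_request("Hello", "llmtr-your_key")
    print(req.full_url)
```

Building the request separately from sending it makes it possible to inspect the URL, headers, and body without spending quota. If the response follows the OpenAI chat completions shape (an assumption, since this page does not document the response format), the reply text would be found under choices[0].message.content in the dictionary returned by send.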
