PPLX Embed Context v1 4B - access through LLMTR
The larger (4B) contextual embedding model in the PPLX Embed Context family, aimed at document-heavy systems that need higher retrieval quality in large RAG setups.
Technical specifications
| Field | Value |
|---|---|
| Canonical ID | perplexity/pplx-embed-context-v1-4b |
| Provider | Perplexity |
| Context window | 32,000 tokens |
| Operations | EMBEDDINGS |
| Modalities | text, embedding |
Pricing
A 6% platform margin applies to credit top-ups; model usage prices are not separately marked up.
| Operation | Metric | Unit | Price |
|---|---|---|---|
| EMBEDDINGS | INPUT_TEXT | PER_1M_TOKENS | $0.050000 |
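As a rough illustration of the pricing table, the sketch below estimates the usage cost of an embedding job from its input token count. The function name `embedding_cost_usd` is hypothetical and not part of any LLMTR SDK. The 6% platform margin is deliberately left out, since it applies to credit top-ups rather than to per-token usage.

```python
from decimal import Decimal

# Listed price for EMBEDDINGS / INPUT_TEXT, taken from the pricing table.
PRICE_PER_1M_INPUT_TOKENS_USD = Decimal("0.05")


def embedding_cost_usd(input_tokens: int) -> Decimal:
    """Estimate the usage cost in USD for a given number of input tokens.

    Hypothetical helper for illustration only. It does not model the 6%
    platform margin, which is charged on credit top-ups, not on usage.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return PRICE_PER_1M_INPUT_TOKENS_USD * input_tokens / Decimal(1_000_000)


if __name__ == "__main__":
    # Example: embedding a corpus of 10 million tokens.
    print(embedding_cost_usd(10_000_000))
```

Decimal arithmetic is used instead of floats so that small per-request costs do not pick up rounding noise when summed over many requests.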
Example usage
With existing OpenAI SDK flows, change only the base URL and model identifier.
curl https://llmtr.com/v1/embeddings \
  -H "Authorization: Bearer llmtr-your_key" \
  -H "Content-Type: application/json" \
  -d '{"model":"perplexity/pplx-embed-context-v1-4b","input":"Hello"}'
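Because this model lists only the EMBEDDINGS operation, a request would normally carry an `input` field rather than chat `messages`. The following is a minimal sketch using only the Python standard library. It assumes that LLMTR exposes an OpenAI-style `/embeddings` path under the `https://llmtr.com/v1` base URL and returns vectors under `data[i].embedding`. Both are assumptions based on the OpenAI-compatibility claim, not confirmed by this page. The helper names `build_embedding_request` and `embed` are hypothetical.

```python
import json
import os
import urllib.request

BASE_URL = "https://llmtr.com/v1"  # base URL taken from the curl example
MODEL = "perplexity/pplx-embed-context-v1-4b"


def build_embedding_request(texts, api_key):
    """Build (but do not send) a POST request for embeddings.

    Assumption: an OpenAI-style /embeddings endpoint with a JSON body of
    the form {"model": ..., "input": [...]}.
    """
    payload = {"model": MODEL, "input": list(texts)}
    return urllib.request.Request(
        f"{BASE_URL}/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def embed(texts, api_key):
    """Send the request and return one vector per input text.

    Assumption: the response follows the OpenAI shape, with each vector
    stored under data[i]["embedding"].
    """
    request = build_embedding_request(texts, api_key)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return [item["embedding"] for item in body["data"]]


if __name__ == "__main__":
    # LLMTR_API_KEY is a hypothetical environment variable name.
    key = os.environ["LLMTR_API_KEY"]
    vectors = embed(["First document chunk", "Second document chunk"], key)
    print(len(vectors), "vectors returned")
```

Splitting request construction from sending keeps the payload easy to inspect and unit-test without network access.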
Related models
- Sonar: perplexity/sonar
- Sonar Pro: perplexity/sonar-pro
- Sonar Reasoning Pro: perplexity/sonar-reasoning-pro
- Sonar Deep Research: perplexity/sonar-deep-research
- PPLX Embed v1 0.6B: perplexity/pplx-embed-v1-0.6b
- PPLX Embed v1 4B: perplexity/pplx-embed-v1-4b
- PPLX Embed Context v1 0.6B: perplexity/pplx-embed-context-v1-0.6b
- Gemma 4: llmtr/gemma-4