Qwen 3 Embedding 8B

Embedding

Large 8B embedding model with 40K context window. State-of-the-art retrieval performance for demanding applications.

Pricing
Input
$0.02/1M
Output
Free/1M
Context Length
41K tokens

Capabilities

Embeddings

Technical Details

Model ID
qwen3-embedding-8b
Tokenizer
Qwen

Quick Start

from openai import OpenAI

client = OpenAI(
    base_url="https://api.llamagate.ai/v1",
    api_key="your-api-key",
)

response = client.embeddings.create(
    model="qwen3-embedding-8b",
    input="Your text to embed",
)

print(response.data[0].embedding)

Start Using Qwen 3 Embedding 8B

Create an account and start building with just $5.

qwen3-embedding-8b API - LlamaGate