Qwen 3 Embedding 8B
Embedding
Large 8B embedding model with 40K context window. State-of-the-art retrieval performance for demanding applications.
Pricing
Input
$0.02/1M
Output
Free/1M
Context Length
41K tokens
Capabilities
Embeddings
Technical Details
Quick Start
from openai import OpenAI
client = OpenAI(
base_url="https://api.llamagate.ai/v1",
api_key="your-api-key",
)
response = client.embeddings.create(
model="qwen3-embedding-8b",
input="Your text to embed",
)
print(response.data[0].embedding)