Qwen 3 8B

General Purpose

Qwen 3 8B is Alibaba's 8B-parameter general-purpose model, with strong multilingual and reasoning capabilities and a 32K-token context window. It is released under the Apache 2.0 license.

Pricing
Input: $0.04 per 1M tokens
Output: $0.14 per 1M tokens
Context Length: 32K tokens
Max Completion: 8K tokens
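
To make the per-token pricing concrete, here is a minimal sketch that estimates the cost of a single request from the input and output rates listed above. The function and constant names are illustrative assumptions, not part of any LlamaGate SDK, and actual billing may round or meter differently.

```python
# Rates copied from the pricing listed above (USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.04
OUTPUT_USD_PER_MILLION = 0.14


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(10_000, 2_000):.5f}")  # prints $0.00068
```

At these rates, output tokens cost 3.5 times as much as input tokens, so capping the completion length is usually the most direct way to control spend.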

Capabilities

Chat
Tool Calling

Technical Details

Model ID: qwen3-8b
Tokenizer: Qwen
HuggingFace: Qwen/Qwen3-8B

Quick Start

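from openai import OpenAI

# Point the OpenAI SDK at the LlamaGate endpoint.
client = OpenAI(
    base_url="https://api.llamagate.ai/v1",
    api_key="your-api-key",  # replace with your own key
)

# Send a single-turn chat request to Qwen 3 8B.
response = client.chat.completions.create(
    model="qwen3-8b",
    messages=[{"role": "user", "content": "Hello!"}],
)

# Print the text of the first (and only) choice.
print(response.choices[0].message.content)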
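
Since Tool Calling is listed among the capabilities, the following sketch shows what a tool-calling round trip might look like. It assumes the endpoint accepts the OpenAI-style `tools` parameter and returns `tool_calls` in the same shape as the OpenAI Chat Completions API, which this page does not confirm. The `get_weather` tool, `run_tool_call`, and `ask_with_tools` are hypothetical names introduced for illustration only.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical local tool; a real one would query a weather service."""
    return f"Sunny in {city}"


# JSON Schema description of the tool, in the OpenAI "tools" format.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]


def run_tool_call(name: str, arguments_json: str) -> str:
    """Dispatch a tool call requested by the model to local code."""
    args = json.loads(arguments_json)
    if name == "get_weather":
        return get_weather(**args)
    raise ValueError(f"unknown tool: {name}")


def ask_with_tools(prompt: str, api_key: str) -> str:
    """One tool-calling round trip (assumes OpenAI-compatible behavior)."""
    # Imported here so the helpers above work without the SDK installed.
    from openai import OpenAI

    client = OpenAI(base_url="https://api.llamagate.ai/v1", api_key=api_key)
    messages = [{"role": "user", "content": prompt}]
    first = client.chat.completions.create(
        model="qwen3-8b", messages=messages, tools=TOOLS
    )
    reply = first.choices[0].message
    if not reply.tool_calls:
        return reply.content or ""

    # Echo the assistant turn, then attach one result per requested call.
    messages.append(reply)
    for call in reply.tool_calls:
        messages.append(
            {
                "role": "tool",
                "tool_call_id": call.id,
                "content": run_tool_call(
                    call.function.name, call.function.arguments
                ),
            }
        )
    final = client.chat.completions.create(
        model="qwen3-8b", messages=messages, tools=TOOLS
    )
    return final.choices[0].message.content or ""
```

Calling `ask_with_tools("What's the weather in Paris?", "your-api-key")` would send the request. Whether the model chooses to call the tool depends on the prompt and on the model itself, so the plain-text fallback path is kept.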

Start Using Qwen 3 8B

Create an account and start building with just $5.
