Llama 3.2 3B

General Purpose

Meta's Llama 3.2 3B instruction-tuned model. Excellent balance of quality and speed with 128K context.

Pricing

Input

$0.04/1M

Output

$0.08/1M

Context Length

131K tokens

Max Completion

8K tokens

Capabilities

Chat

Tool Calling

Technical Details

Model ID

llama-3.2-3b

Tokenizer

Llama

HuggingFace

meta-llama/Llama-3.2-3B-Instruct

Quick Start

from openai import OpenAI

client = OpenAI(
    base_url="https://api.llamagate.ai/v1",
    api_key="your-api-key",
)

response = client.chat.completions.create(
    model="llama-3.2-3b",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)

Start Using Llama 3.2 3B

Create an account and start building with just $5.