Llama 3.1 8B Instruct
General Purpose
Meta's flagship Llama 3.1 8B. Most popular open model with 128K context. Excellent instruction following.
Pricing
Input
$0.03/1M
Output
$0.05/1M
Context Length
131K tokens
Max Completion
8K tokens
Capabilities
Chat
Tool Calling
Technical Details
Quick Start
from openai import OpenAI
client = OpenAI(
base_url="https://api.llamagate.ai/v1",
api_key="your-api-key",
)
response = client.chat.completions.create(
model="llama-3.1-8b",
messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)