All models

Meta’s 70B workhorse

Llama 3.3

by Meta

Chat Tools

Llama 3.3 70B is Meta’s widely-adopted instruction-tuned model — dependable general-purpose chat and tool use with a 128K context window.

Publisher
Meta
Context window
128K tokens
Sizes
70B
Licence
Llama 3.3
Model card on Hugging Face

Run Llama 3.3

Install it on a Pendra worker, then call it through the OpenAI-compatible API with a pdr_sk_ key.

from pendra import Pendra

client = Pendra(api_key="pdr_sk_...")

response = client.chat.completions.create(
    model="llama3.3:70b",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)

Run Llama 3.3 on your own UK infrastructure

Deploy a worker, install Llama 3.3, and start serving it through one sovereign API endpoint.