All models

Nemotron 3 Nano

by NVIDIA

Chat Tools Thinking

NVIDIA's Nemotron 3 Nano (nemotron family) — tool use, reasoning; 131K context.

Publisher
NVIDIA
Context window
128K tokens
Sizes
4B, 30B
Model card on Hugging Face

Run Nemotron 3 Nano

Install it on a Pendra worker, then call it through the OpenAI-compatible API with a pdr_sk_ key.

from pendra import Pendra

client = Pendra(api_key="pdr_sk_...")

response = client.chat.completions.create(
    model="nemotron-3-nano:4b",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)

Run Nemotron 3 Nano on your own UK infrastructure

Deploy a worker, install Nemotron 3 Nano, and start serving it through one sovereign API endpoint.