Model Library

Open-weight models,
sovereign UK infrastructure

Run frontier open-weight language and embedding models — Qwen, Llama, DeepSeek, Gemma, gpt-oss and more — entirely on UK hardware under UK jurisdiction. Call them through one OpenAI-compatible API.

DeepSeek-V4-Flash

DeepSeek

Frontier reasoning, open weights

Chat Tools Thinking
1M context

Qwen3.6

Alibaba

Alibaba's Qwen3.6 (qwen3_5_moe family) — vision, tool use, reasoning; 262K context.

Chat Vision Tools Thinking
27B · 35B 256K context

Qwen3.5

Alibaba

Alibaba’s flagship multimodal family

Chat Vision Tools Thinking
0.8B · 2B · 4B · 9B · 27B · 35B · 122B · 397B 256K context

gpt-oss

OpenAI

OpenAI’s open-weight release

Chat Tools
20B · 120B 128K context

Gemma 4

Google DeepMind

Google DeepMind’s efficient multimodal family

Chat Vision Tools Thinking
12B · 26B · 31B · E2B · E4B 256K context

Qwen3-Coder

Alibaba

Purpose-built for code

Chat Tools
30B · 480B 256K context

Llama 3.3

Meta

Meta’s 70B workhorse

Chat Tools
70B 128K context

Qwen3-VL

Alibaba

Alibaba's Qwen3-VL (qwen3_vl family) — vision, tool use, reasoning; 262K context.

Chat Vision Tools Thinking
2B · 4B · 8B · 30B · 32B · 235B 256K context

Qwen3-VL Instruct

Alibaba

Alibaba's Qwen3-VL Instruct (qwen3_vl family) — vision, tool use; 262K context.

Chat Vision Tools
2B · 4B · 8B · 30B · 32B · 235B 256K context

Phi-4-reasoning

Microsoft

Microsoft's Phi-4-reasoning (phi3 family) — reasoning; 32K context.

Chat Thinking
32K context

Phi-4-reasoning-plus

Microsoft

Microsoft's Phi-4-reasoning-plus (phi3 family) — reasoning; 32K context.

Chat Thinking
32K context

Qwen3-Coder-Next

Alibaba

Alibaba's Qwen3-Coder-Next (qwen3_next family) — tool use; 262K context.

Chat Tools
256K context

GLM-4.7-Flash

Z.ai

Z.ai's GLM-4.7-Flash (glm4_moe_lite family) — tool use, reasoning; 202K context.

Chat Tools Thinking
198K context

Ministral 3

Mistral AI

Mistral AI's Ministral 3 (mistral3 family) — vision, tool use; 262K context.

Chat Vision Tools
3B · 8B · 14B 256K context

Ministral 3 Reasoning

Mistral AI

Mistral AI's Ministral 3 Reasoning (mistral3 family) — tool use, reasoning; 262K context.

Chat Tools Thinking
3B · 8B · 14B 256K context

Devstral Small

Mistral AI

Mistral AI's Devstral Small (mistral3 family) — vision, tool use; 393K context.

Chat Vision Tools
24B 384K context

GLM-OCR

Z.ai

Z.ai's GLM-OCR (glm_ocr family) — vision, OCR; 131K context.

Vision OCR
128K context

LFM2.5

Liquid AI

Liquid AI's LFM2.5 (lfm2_moe family) — tool use; 128K context.

Chat Tools
350M · 1.2B 125K context

LFM2.5 Thinking

Liquid AI

Liquid AI's LFM2.5 Thinking (lfm2 family) — tool use, reasoning; 128K context.

Chat Tools Thinking
1.2B 125K context

LFM2.5-VL

Liquid AI

Liquid AI's LFM2.5-VL (lfm2_vl family) — vision; 128K context.

Chat Vision
1.6B 125K context

Nemotron 3 Nano

NVIDIA

NVIDIA's Nemotron 3 Nano (nemotron family) — tool use, reasoning; 131K context.

Chat Tools Thinking
4B · 30B 128K context

Phi-4-mini-reasoning

Microsoft

Microsoft's Phi-4-mini-reasoning (phi3 family) — reasoning; 131K context.

Chat Thinking
128K context

Granite 4.0 Micro

IBM

IBM's Granite 4.0 Micro (granitemoehybrid family) — tool use; 131K context.

Chat Tools
128K context

Magistral Small

Mistral AI

Mistral AI's Magistral Small (mistral family) — tool use, reasoning; 40K context.

Chat Tools Thinking
40K context

GLM-4.6V-Flash

Z.ai

Z.ai's GLM-4.6V-Flash (glm4v family) — vision; 131K context.

Chat Vision
128K context

DiffusionGemma

Google DeepMind

Google DeepMind's DiffusionGemma (diffusion_gemma family) — tool use, reasoning; 262K context.

Chat Tools Thinking
26B 256K context

MiMo-V2.5

Xiaomi

Xiaomi's MiMo-V2.5 (mimo_v2 family) — vision, tool use, reasoning; 1048K context.

Chat Vision Tools Thinking
1M context

MiMo-V2.5-Pro

Xiaomi

Xiaomi's MiMo-V2.5-Pro (mimo_v2 family) — tool use, reasoning; 1048K context.

Chat Tools Thinking
1M context

BGE Reranker v2 m3

BAAI

BAAI's BGE Reranker v2 m3 (xlm-roberta family) — reranking; 8K context.

8K context

Whisper

OpenAI

OpenAI Whisper speech-to-text, served in-process by the Pendra (whisper.cpp) backend.

Transcription

FLUX.1 [schnell]

Black Forest Labs

FLUX.1 [schnell] fast 12B text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

Z-Image Turbo

Tongyi-MAI

Z-Image Turbo fast (few-step) text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

FLUX.2 klein [9B]

Black Forest Labs

FLUX.2 klein 9B text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

SDXL Turbo

Stability AI

SDXL Turbo fast (1-4 step) text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

FLUX.2 klein [4B]

Black Forest Labs

FLUX.2 klein 4B fast text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

Qwen-Image 2512

Qwen

Qwen-Image 2512 high-fidelity text-to-image with strong text rendering, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

Stable Diffusion XL

Stability AI

Stable Diffusion XL 1.0 base text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

Stable Diffusion 3.5 Large Turbo

Stability AI

Stable Diffusion 3.5 Large Turbo fast text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

Stable Diffusion 1.5

Runway

Stable Diffusion 1.5 text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.

Image

nomic-embed-text-v1.5

Nomic AI

Nomic AI's nomic-embed-text-v1.5 (nomic_bert family) — embeddings; 2K context.

Embeddings
2K context

Qwen3-Embedding

Alibaba

Alibaba's Qwen3-Embedding (qwen3 family) — embeddings; 40K context.

Embeddings
0.6B · 4B · 8B 40K context

BGE-M3

BAAI

BAAI's BGE-M3 (xlm-roberta family) — embeddings; 8K context.

Embeddings
8K context

EmbeddingGemma

Google DeepMind

Google DeepMind's EmbeddingGemma (gemma3_text family) — embeddings; 2K context.

Embeddings
2K context

New models, added as the ecosystem evolves

Deploy a Pendra worker and install any of these models in minutes, or tell us what you need next.