Model Library
Open-weight models,
sovereign UK infrastructure
Run frontier open-weight language and embedding models — Qwen, Llama, DeepSeek, Gemma, gpt-oss and more — entirely on UK hardware under UK jurisdiction. Call them through one OpenAI-compatible API.
DeepSeek-V4-Flash
DeepSeek
Frontier reasoning, open weights
Qwen3.6
Alibaba
Alibaba's Qwen3.6 (qwen3_5_moe family) — vision, tool use, reasoning; 262K context.
Qwen3.5
Alibaba
Alibaba’s flagship multimodal family
gpt-oss
OpenAI
OpenAI’s open-weight release
Gemma 4
Google DeepMind
Google DeepMind’s efficient multimodal family
Qwen3-Coder
Alibaba
Purpose-built for code
Llama 3.3
Meta
Meta’s 70B workhorse
Qwen3-VL
Alibaba
Alibaba's Qwen3-VL (qwen3_vl family) — vision, tool use, reasoning; 262K context.
Qwen3-VL Instruct
Alibaba
Alibaba's Qwen3-VL Instruct (qwen3_vl family) — vision, tool use; 262K context.
Phi-4-reasoning
Microsoft
Microsoft's Phi-4-reasoning (phi3 family) — reasoning; 32K context.
Phi-4-reasoning-plus
Microsoft
Microsoft's Phi-4-reasoning-plus (phi3 family) — reasoning; 32K context.
Qwen3-Coder-Next
Alibaba
Alibaba's Qwen3-Coder-Next (qwen3_next family) — tool use; 262K context.
GLM-4.7-Flash
Z.ai
Z.ai's GLM-4.7-Flash (glm4_moe_lite family) — tool use, reasoning; 202K context.
Ministral 3
Mistral AI
Mistral AI's Ministral 3 (mistral3 family) — vision, tool use; 262K context.
Ministral 3 Reasoning
Mistral AI
Mistral AI's Ministral 3 Reasoning (mistral3 family) — tool use, reasoning; 262K context.
Devstral Small
Mistral AI
Mistral AI's Devstral Small (mistral3 family) — vision, tool use; 393K context.
GLM-OCR
Z.ai
Z.ai's GLM-OCR (glm_ocr family) — vision, OCR; 131K context.
LFM2.5
Liquid AI
Liquid AI's LFM2.5 (lfm2_moe family) — tool use; 128K context.
LFM2.5 Thinking
Liquid AI
Liquid AI's LFM2.5 Thinking (lfm2 family) — tool use, reasoning; 128K context.
LFM2.5-VL
Liquid AI
Liquid AI's LFM2.5-VL (lfm2_vl family) — vision; 128K context.
Nemotron 3 Nano
NVIDIA
NVIDIA's Nemotron 3 Nano (nemotron family) — tool use, reasoning; 131K context.
Phi-4-mini-reasoning
Microsoft
Microsoft's Phi-4-mini-reasoning (phi3 family) — reasoning; 131K context.
Granite 4.0 Micro
IBM
IBM's Granite 4.0 Micro (granitemoehybrid family) — tool use; 131K context.
Magistral Small
Mistral AI
Mistral AI's Magistral Small (mistral family) — tool use, reasoning; 40K context.
GLM-4.6V-Flash
Z.ai
Z.ai's GLM-4.6V-Flash (glm4v family) — vision; 131K context.
DiffusionGemma
Google DeepMind
Google DeepMind's DiffusionGemma (diffusion_gemma family) — tool use, reasoning; 262K context.
MiMo-V2.5
Xiaomi
Xiaomi's MiMo-V2.5 (mimo_v2 family) — vision, tool use, reasoning; 1048K context.
MiMo-V2.5-Pro
Xiaomi
Xiaomi's MiMo-V2.5-Pro (mimo_v2 family) — tool use, reasoning; 1048K context.
BGE Reranker v2 m3
BAAI
BAAI's BGE Reranker v2 m3 (xlm-roberta family) — reranking; 8K context.
Whisper
OpenAI
OpenAI Whisper speech-to-text, served in-process by the Pendra (whisper.cpp) backend.
FLUX.1 [schnell]
Black Forest Labs
FLUX.1 [schnell] fast 12B text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
Z-Image Turbo
Tongyi-MAI
Z-Image Turbo fast (few-step) text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
FLUX.2 klein [9B]
Black Forest Labs
FLUX.2 klein 9B text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
SDXL Turbo
Stability AI
SDXL Turbo fast (1-4 step) text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
FLUX.2 klein [4B]
Black Forest Labs
FLUX.2 klein 4B fast text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
Qwen-Image 2512
Qwen
Qwen-Image 2512 high-fidelity text-to-image with strong text rendering, served in-process by the Pendra (stable-diffusion.cpp) backend.
Stable Diffusion XL
Stability AI
Stable Diffusion XL 1.0 base text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
Stable Diffusion 3.5 Large Turbo
Stability AI
Stable Diffusion 3.5 Large Turbo fast text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
Stable Diffusion 1.5
Runway
Stable Diffusion 1.5 text-to-image, served in-process by the Pendra (stable-diffusion.cpp) backend.
nomic-embed-text-v1.5
Nomic AI
Nomic AI's nomic-embed-text-v1.5 (nomic_bert family) — embeddings; 2K context.
Qwen3-Embedding
Alibaba
Alibaba's Qwen3-Embedding (qwen3 family) — embeddings; 40K context.
BGE-M3
BAAI
BAAI's BGE-M3 (xlm-roberta family) — embeddings; 8K context.
EmbeddingGemma
Google DeepMind
Google DeepMind's EmbeddingGemma (gemma3_text family) — embeddings; 2K context.
No models match those filters.
New models, added as the ecosystem evolves
Deploy a Pendra worker and install any of these models in minutes, or tell us what you need next.