All models

Alibaba’s flagship multimodal family

Qwen3.5

by Alibaba

Chat Vision Tools Thinking

Qwen3.5 spans seven sizes from 0.8B to 122B with native vision, tool calling, a switchable thinking mode and a 256K-token context window — a strong open-weight default for almost any workload.

Publisher
Alibaba
Context window
256K tokens
Sizes
0.8B, 2B, 4B, 9B, 27B, 35B, 122B, 397B
Licence
Apache 2.0
Model card on Hugging Face

Run Qwen3.5

Install it on a Pendra worker, then call it through the OpenAI-compatible API with a pdr_sk_ key.

Chat

from pendra import Pendra

client = Pendra(api_key="pdr_sk_...")

response = client.chat.completions.create(
    model="qwen3.5:0.8b",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.choices[0].message.content)

Vision

from pendra import Pendra

client = Pendra(api_key="pdr_sk_...")

response = client.chat.completions.create(
    model="qwen3.5:0.8b",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this image?"},
                {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
            ],
        }
    ],
)

print(response.choices[0].message.content)

Run Qwen3.5 on your own UK infrastructure

Deploy a worker, install Qwen3.5, and start serving it through one sovereign API endpoint.