All models
Model card on Hugging Face
Alibaba’s flagship multimodal family
Qwen3.5
by Alibaba
Chat Vision Tools Thinking
Qwen3.5 spans seven sizes from 0.8B to 122B with native vision, tool calling, a switchable thinking mode and a 256K-token context window — a strong open-weight default for almost any workload.
- Publisher
- Alibaba
- Context window
- 256K tokens
- Sizes
- 0.8B, 2B, 4B, 9B, 27B, 35B, 122B, 397B
- Licence
- Apache 2.0
Run Qwen3.5
Install it on a Pendra worker, then call it through the OpenAI-compatible API with a pdr_sk_ key.
Chat
from pendra import Pendra
client = Pendra(api_key="pdr_sk_...")
response = client.chat.completions.create(
model="qwen3.5:0.8b",
messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content) Vision
from pendra import Pendra
client = Pendra(api_key="pdr_sk_...")
response = client.chat.completions.create(
model="qwen3.5:0.8b",
messages=[
{
"role": "user",
"content": [
{"type": "text", "text": "What's in this image?"},
{"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
],
}
],
)
print(response.choices[0].message.content) Run Qwen3.5 on your own UK infrastructure
Deploy a worker, install Qwen3.5, and start serving it through one sovereign API endpoint.