Available Now · February 2026

The model that thinks
before it answers.

GLM-5 is Zhipu AI's most capable foundation model — 745 billion parameters, 202K context, and frontier-level reasoning in a single unified architecture.


745B Parameters

202K Context

256 Experts

44B Active Params

Capabilities

Everything you need.
Nothing you don't.

Multi-Step Reasoning

Chain complex inferences across mathematical proofs, scientific analysis, and strategic planning. Enhanced System 2 thinking for problems that require depth, not shortcuts.

1. Decompose problem
2. Validate assumptions
3. Synthesize answer

Advanced Coding

Opus-level code generation across languages, covering full-stack applications and terminal workflows, with results reported on SWE-bench Verified.

async function solve(task) {
  const plan = await reason(task);
  return execute(plan);
}

Multimodal

Process images, documents, and text in a unified pipeline. Cross-modal understanding as a first-class capability.

Agentic AI

Plan, execute, and adapt through multi-step workflows. AutoGLM-powered autonomous task completion.

Creative Writing

Narratives, documents, and long-form content with natural conversational style.

Enterprise-Ready

Sparse MoE activates about 5.9% of parameters per pass (44B of 745B) for cost-efficient inference. Deploy anywhere.

Architecture

Mixture of Experts,
mastery of everything.

Input → Sparse Router → Experts E₁ … E₂₅₆ → Weighted Merge + MTP → Output Tokens

8 of 256 experts activated per pass · about 5.9% of parameters active (44B of 745B)
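To make the routing flow above concrete, here is a minimal sketch of top-k expert routing with a weighted merge, in plain Python. It illustrates the general technique only and is not GLM-5's actual router: the softmax gating, the renormalization over the selected experts, the toy scalar experts, and the names `route` and `moe_forward` are all assumptions made for this example. Only the counts (256 experts, 8 active) come from this page.

```python
import math
import random

NUM_EXPERTS = 256   # total experts, as stated on this page
TOP_K = 8           # experts activated per pass, as stated on this page


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route(router_logits, top_k=TOP_K):
    """Pick the top_k experts and renormalize their gate weights to sum to 1.

    Hypothetical gating scheme for illustration; the real router may differ.
    """
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    mass = sum(probs[i] for i in chosen)
    return {i: probs[i] / mass for i in chosen}


def moe_forward(x, router_logits, experts):
    """Weighted merge of the selected experts' outputs; other experts never run."""
    gates = route(router_logits)
    return sum(weight * experts[i](x) for i, weight in gates.items())


# Toy experts: expert i scales its scalar input by (i + 1).
experts = [lambda x, scale=i + 1: scale * x for i in range(NUM_EXPERTS)]

rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
gates = route(logits)
output = moe_forward(1.0, logits, experts)
print(len(gates), round(sum(gates.values()), 6))
```

Because only the selected experts are evaluated, compute per pass scales with the number of active experts rather than the total, which is the source of the cost savings in a sparse design.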

Architecture: Mixture-of-Experts (MoE)
Total Parameters: 745 Billion
Active Parameters: 44 Billion per inference
Expert Modules: 256 total · 8 activated
Hidden Layers: 78 transformer layers
Context Window: 202,000 tokens
Attention: DeepSeek Sparse Attention (DSA)
Prediction: Multi-Token Prediction (MTP)
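The figures above imply two different ratios, and it can help to compute both. The short calculation below uses only numbers stated on this page. The variable names are illustrative. Note that the fraction of active experts and the fraction of active parameters need not match: in MoE models, active parameters typically also include non-expert weights such as attention and embeddings, though this page does not give that breakdown.

```python
TOTAL_PARAMS_B = 745    # total parameters, in billions
ACTIVE_PARAMS_B = 44    # active parameters per inference, in billions
TOTAL_EXPERTS = 256
ACTIVE_EXPERTS = 8

# Share of all weights that participate in a single pass.
active_param_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

# Share of expert modules selected by the router in a single pass.
active_expert_fraction = ACTIVE_EXPERTS / TOTAL_EXPERTS

print(f"active parameters: {active_param_fraction:.3%}")
print(f"active experts:    {active_expert_fraction:.3%}")
```

The parameter ratio rounds to the 5.9% quoted on this page, while the expert ratio is 8/256.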

Performance

Benchmarks don't lie.

SWE-bench Verified · Coding
MATH-500 · Mathematics
GPQA Diamond · Reasoning
T-Bench · Agent Tasks
MMMU · Multimodal
Arena ELO · Creative Writing

Scores represent publicly reported evaluation results. Visit z.ai for the latest data.

Get Started

From zero to GLM-5
in under a minute.

Available via API, open-weight download, or local deployment with the frameworks you already use.


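from zhipuai import ZhipuAI

# Create a client with your API key.
client = ZhipuAI(api_key="your-key")

# Send a single-turn chat request to GLM-5.
response = client.chat.completions.create(
    model="glm-5",
    messages=[{
        "role": "user",
        "content": "Hello, GLM-5!"
    }]
)

# Print the text of the first completion.
print(response.choices[0].message.content)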
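The quickstart above sends a single message. For a multi-turn exchange, one possible approach is to keep the history client-side and resend it with each request. The `Conversation` class below is a hypothetical helper written for this example, not part of any SDK. It assumes only that the client exposes `chat.completions.create(...)` with the same request and response shape as the quickstart.

```python
class Conversation:
    """Keeps chat history and sends the full history with every request.

    Hypothetical helper: `client` is assumed to expose
    chat.completions.create(model=..., messages=...) and to return an object
    with choices[0].message.content, as in the quickstart snippet.
    """

    def __init__(self, client, model="glm-5", system_prompt=None):
        self.client = client
        self.model = model
        self.messages = []
        if system_prompt:
            self.messages.append({"role": "system", "content": system_prompt})

    def ask(self, user_text):
        """Append the user turn, call the model, and record its reply."""
        self.messages.append({"role": "user", "content": user_text})
        response = self.client.chat.completions.create(
            model=self.model,
            messages=self.messages,
        )
        reply = response.choices[0].message.content
        self.messages.append({"role": "assistant", "content": reply})
        return reply


# Usage with the client from the quickstart (requires a valid API key):
#   chat = Conversation(client, system_prompt="Answer concisely.")
#   print(chat.ask("Hello, GLM-5!"))
#   print(chat.ask("Summarize your last answer in one line."))
```

Passing the client in, rather than constructing it inside the class, keeps the helper easy to test with a stub and independent of how credentials are supplied.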
