
Zhipu / Z.ai GLM Model Lineage 2026

Zhipu AI (Z.ai) GLM release history: GLM-4 (Jun 2024), GLM-4.5 (Jul 2025), GLM-4.5V (Aug 2025), GLM-4.6 (Sep 2025), GLM-4.7 (Dec 2025), GLM-5 (Feb 2026), GLM-5.1 (Apr 2026).

By Ramanath, CTO & Co-Founder at Presenc AI · Last updated: May 2026

What this is

Zhipu AI (rebranded internationally as Z.ai) kept the most consistent release cadence among Chinese labs in 2024-2026, shipping roughly quarterly. The company listed on the Hong Kong Stock Exchange (HKEX) in January 2026, becoming the first publicly listed Chinese AI lab, and released GLM-5, a 744B-parameter MoE, the following month (February 2026). This page is a release-by-release reference, current as of 2026-05-15.

GLM Release Timeline (2024-2026)

Date | Release | Notable
Jun 5, 2024 | GLM-4 (incl. GLM-4-9B open) | Foundation release
Apr 2025 | GLM-4-32B-0414 series | Scaled to 32B; dialogue/reasoning/rumination variants
Jul 2025 | GLM-4.5 + GLM-4.5 Air | Next-gen language; runs on 8x NVIDIA H20
Aug 11, 2025 | GLM-4.5V | 106B vision-language model
Sep 2025 | GLM-4.6 | First integration of FP8 + Int4 on Cambricon chips
Dec 2025 | GLM-4.6V + GLM-4.7 | Vision + agentic coding tier
Jan 8, 2026 | Z.ai HKEX IPO | First publicly listed Chinese AI lab
Feb 11, 2026 | GLM-5 | 744B MoE / 40B active; 2x scale-up from 4.5
Mar 2026 | GLM-5.1 (subscription) | Refinement release
Apr 8, 2026 | GLM-5.1 open-source | Open-weight release
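
The cadence implied by the timeline can be checked with a little arithmetic. The sketch below is illustrative only: it assumes month-level granularity (several rows give no day), omits the IPO row because it is not a model release, and uses hypothetical helper names (`month_index`, `month_gaps`) that do not come from any Z.ai tooling.

```python
from statistics import mean

# (label, year, month) for each model release row in the timeline above.
# The Z.ai HKEX IPO row is omitted: it is a corporate event, not a release.
RELEASES = [
    ("GLM-4", 2024, 6),
    ("GLM-4-32B-0414 series", 2025, 4),
    ("GLM-4.5 + GLM-4.5 Air", 2025, 7),
    ("GLM-4.5V", 2025, 8),
    ("GLM-4.6", 2025, 9),
    ("GLM-4.6V + GLM-4.7", 2025, 12),
    ("GLM-5", 2026, 2),
    ("GLM-5.1 (subscription)", 2026, 3),
    ("GLM-5.1 open-source", 2026, 4),
]


def month_index(year: int, month: int) -> int:
    """Map a (year, month) pair to a running month count."""
    return year * 12 + (month - 1)


def month_gaps(releases) -> list[int]:
    """Whole-month gaps between consecutive releases."""
    idx = [month_index(y, m) for _, y, m in releases]
    return [later - earlier for earlier, later in zip(idx, idx[1:])]


gaps = month_gaps(RELEASES)
print("gaps (months):", gaps)
print("span (months):", sum(gaps))
print("mean gap (months):", mean(gaps))
```

Under these assumptions the largest gap is the ten months between GLM-4 and the GLM-4-32B-0414 series; every later gap is three months or less.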

GLM-5 Specifications

Spec | Value
Total parameters | 744B
Active parameters per token | 40B
Architecture | Sparse MoE
Scale change vs GLM-4.5 | ~2x total params, ~1.25x active
License | GLM-5.1 open-sourced April 8, 2026
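
The scale-change row can be reproduced from the parameter counts. This is a minimal sketch: the GLM-5 figures come from the table above, the GLM-4.5 figures (355B total / 32B active) are the ones quoted in the lineage list on this page, and the function names are hypothetical.

```python
# Parameter counts in billions, as quoted on this page.
GLM_4_5 = {"total_b": 355, "active_b": 32}
GLM_5 = {"total_b": 744, "active_b": 40}


def scale_ratio(new: dict, old: dict, key: str) -> float:
    """How many times larger `new` is than `old` for the given count."""
    return new[key] / old[key]


def active_share(model: dict) -> float:
    """Fraction of total parameters that are active per token."""
    return model["active_b"] / model["total_b"]


total_ratio = scale_ratio(GLM_5, GLM_4_5, "total_b")
active_ratio = scale_ratio(GLM_5, GLM_4_5, "active_b")

print(f"total params:  {total_ratio:.2f}x")
print(f"active params: {active_ratio:.2f}x")
print(f"active share:  GLM-4.5 {active_share(GLM_4_5):.1%}, GLM-5 {active_share(GLM_5):.1%}")
```

The ratios work out to roughly 2.1x total and 1.25x active, so the model grew much faster in total size than in per-token compute: the active share of parameters falls from about 9% to about 5%.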

Six Things the Lineage Tells You

  1. Z.ai is the first publicly listed Chinese AI lab. HKEX IPO January 8, 2026 changes the capital structure and disclosure cadence of the company.
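  2. GLM ships roughly quarterly. The timeline above lists nine model release entries between June 2024 and April 2026, eight of them from April 2025 onward; this is the most consistent cadence among Chinese frontier labs.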
  3. GLM-4.6 was the first Chinese flagship to run on Cambricon chips at FP8 + Int4. Validated the Chinese chip stack at production scale.
  4. GLM-5 is a 2x scale-up. 744B total / 40B active vs 355B / 32B for GLM-4.5, which narrows the gap to the largest open-weight frontier models (Llama 4, DeepSeek V4).
  5. The Z.ai rebrand internationalised the company. The Zhipu name is retained domestically; Z.ai is the international face.
  6. GLM-5.1 open-sourcing (Apr 8, 2026) keeps Z.ai in the open-weight game. Distinct from labs that have moved entirely to closed-tier flagship.

What This Means for AI Visibility

Z.ai's public-company status, hardware-stack diversity (NVIDIA + Cambricon), and consistent open-source cadence make GLM the most institutionally deployable Chinese open-weight model. Brand-visibility teams targeting enterprise deployments inside China should treat GLM as a likely default open-weight model, alongside Qwen and DeepSeek.

Methodology

Release dates are drawn from the Wikipedia article on Z.ai, the Hugging Face blog post on GLM-5, the Z.ai GLM-4 GitHub repository, the Z.ai homepage, and RecodeChinaAI's coverage of the IPO.

How Presenc AI Helps

Presenc AI tracks brand visibility on GLM-served surfaces (Z.ai cloud API, chat.z.ai, on-prem GLM-5.1 deployments). As GLM-5 + GLM-5.1 absorb enterprise share, brand teams need GLM-specific monitoring distinct from Qwen and DeepSeek baselines.

Frequently Asked Questions

What is the latest GLM model?
GLM-5.1, released as open-source on April 8, 2026. GLM-5 (February 11, 2026) is the 744B-parameter MoE flagship that GLM-5.1 refines. Subscription users got GLM-5.1 in late March 2026; the open-source release followed two weeks later.

Is Z.ai a publicly listed company?
Yes. Z.ai (formerly Zhipu AI) listed on the Hong Kong Stock Exchange on January 8, 2026, becoming the first publicly listed Chinese AI lab. The company also changed its official name to Knowledge Atlas Technology JSC Ltd.

How does GLM-5 compare with Qwen and DeepSeek?
GLM-5 (744B / 40B active) is roughly comparable to Qwen 3.5 (397B / 17B active) and DeepSeek V4-Pro (1.6T / 49B active) on most benchmarks. GLM leads on Chinese-language tasks in some evaluations; DeepSeek leads on coding; Qwen leads on multimodal. All three are competitive within the open-weight frontier.

Does GLM run on Chinese-made chips?
Yes. GLM-4.6 was the first flagship Chinese LLM to integrate FP8 and Int4 quantization on Cambricon chips. GLM-5 / 5.1 inherit the Cambricon-compatible inference stack. This is part of Z.ai's hardware-diversification strategy alongside NVIDIA H20 deployments.
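
To put the parameter counts and quantization formats from the answers above side by side, the sketch below computes each model's active share and a rough weight-only footprint at 8 bits (FP8) and 4 bits (Int4) per parameter. It is a back-of-the-envelope estimate, not a deployment guide: it assumes every weight is stored at the stated width and ignores KV cache, activations, quantization scales, and any layers kept at higher precision. The `MoEModel` class is hypothetical; the parameter counts are the ones quoted on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoEModel:
    name: str
    total_b: float   # total parameters, in billions
    active_b: float  # parameters active per token, in billions

    @property
    def active_share(self) -> float:
        """Fraction of total parameters used per token."""
        return self.active_b / self.total_b

    def weight_gb(self, bits_per_param: int) -> float:
        """Weight-only size in decimal GB: billions of params x bytes per param."""
        return self.total_b * bits_per_param / 8


MODELS = [
    MoEModel("GLM-5", 744, 40),
    MoEModel("Qwen 3.5", 397, 17),
    MoEModel("DeepSeek V4-Pro", 1600, 49),  # 1.6T total
]

for m in MODELS:
    print(
        f"{m.name:<16} active share {m.active_share:.2%}  "
        f"FP8 ~{m.weight_gb(8):,.1f} GB  Int4 ~{m.weight_gb(4):,.1f} GB"
    )
```

On these numbers GLM-5 activates the largest share of its parameters per token of the three, and halving the weight width from FP8 to Int4 halves the weight-only footprint.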
