China's Z.AI Releases GLM-5.2, Rivaling Claude Opus With Huawei Chips

Created at 18 Jun · 9:31 PM1 source↑ Market-relevant

IN SHORT

Z.AI has launched GLM-5.2, a large language model that closely rivals Anthropic's Claude Opus 4.8 on benchmarks, while outperforming GPT-5.5. Notably, the model was trained entirely on Huawei Ascend chips, bypassing NVIDIA hardware, and is available under an MIT license.

Key Numbers

1%performance gap to Claude Opus 4.8 on FrontierSWE

74.4GLM-5.2 score on FrontierSWE benchmark

75.1Claude Opus 4.8 score on FrontierSWE benchmark

72.6GPT-5.5 score on FrontierSWE benchmark

62.1GLM-5.2 score on SWE-bench Pro

58.6GPT-5.5 score on SWE-bench Pro

58.4GLM-5.1 score on SWE-bench Pro

9scores aggregated in the Artificial Analysis Intelligence Index

744 billionparameters in GLM-5.2

1 milliontoken context window of GLM-5.2

5times larger context window than GLM-5.1

$1.40per million input tokens API pricing

$4.40per million output tokens API pricing

$5per million input tokens for Claude Opus 4.8

$25per million output tokens for Claude Opus 4.8

Key facts

Z.AI's GLM-5.2 model closely rivals Claude Opus 4.8 on the FrontierSWE benchmark, scoring 74.4 compared to Opus's 75.1.

GLM-5.2 outperforms GPT-5.5 on both FrontierSWE (74.4 vs 72.6) and SWE-bench Pro (62.1 vs 58.6).

The model was trained exclusively on Huawei Ascend chips, excluding any NVIDIA hardware.

GLM-5.2 boasts a 1 million-token context window and is released under an MIT license.

Quantized versions of GLM-5.2 are available, reducing its size to 238GB but still requiring 256GB of RAM or VRAM for local use.

Z.AI, a Beijing-based lab, has released its new large language model, GLM-5.2, which demonstrates performance rivaling top-tier models like Anthropic's Claude Opus 4.8 and surpassing GPT-5.5 on key benchmarks. The model achieved a score of 74.4 on the FrontierSWE benchmark, just 1% behind Claude Opus 4.8's 75.1, and outperformed GPT-5.5 with a score of 72.6. On the SWE-bench Pro, GLM-5.2 scored 62.1, exceeding GPT-5.5's 58.6.

A significant aspect of GLM-5.2's development is its training entirely on Huawei Ascend chips, with no NVIDIA hardware utilized in the process. This achievement comes as Z.AI has been on the U.S. Entity List since January 2025, highlighting a growing trend of AI development outside of American technological dominance. The company's stock has seen a 90% increase following the release and the ban of Anthropic Fable.

GLM-5.2 is a 744-billion-parameter mixture-of-experts model featuring a 1 million-token context window, a fivefold increase from its predecessor GLM-5.1. It is released under an MIT license, meaning it has no regional restrictions. The model's performance places it at the top of open-source models in the Artificial Analysis Intelligence Index.

For developers, the expanded context window enables more complex tasks like whole-repo navigation and multi-file refactors in single calls. API pricing is set at $1.40 per million input tokens and $4.40 per million output tokens, significantly lower than Claude Opus 4.8's $5 input and $25 output. A GLM Coding Plan is available starting at $18 per month.

Local deployment is technically feasible thanks to 2-bit GGUF quantizations released by Unsloth AI, which reduce the model's size from 1.51TB to 238GB while retaining approximately 82% accuracy. However, this still requires a substantial 256GB of unified memory (RAM or VRAM). In testing, GLM-5.2 generated diverse game scenarios, showcasing its strength in multi-shot generation workflows and agentic pipelines where output variety is prioritized over polish. For highly demanding sustained tasks like SWE-Marathon, it scored 13.0, compared to Opus 4.8's 26.0, indicating a gap still exists with closed-frontier models.

Open-source weights for GLM-5.2 are available on HuggingFace under the MIT license, with quantized weights also accessible. GLM Coding Plan subscribers can use the model string GLM-5.2, and free testing is available on z.AI with usage constraints.

Frequently asked questions

GLM-5.2 is a large language model developed by Z.AI, designed to rival top-tier AI models in performance and capabilities.

GLM-5.2 was trained entirely on Huawei Ascend chips, without the use of any NVIDIA hardware.

GLM-5.2 closely matches Claude Opus 4.8 on the FrontierSWE benchmark and outperforms GPT-5.5 on both FrontierSWE and SWE-bench Pro.

The quantized version of GLM-5.2 requires 256GB of RAM or VRAM for local deployment.

GLM-5.2 features a 1 million-token context window, significantly larger than its predecessor.

China's Z.AI Releases GLM-5.2, Rivaling Claude Opus With Huawei Chips

Key Numbers

China's Z.AI Releases GLM-5.2, Rivaling Claude Opus With Huawei Chips

Key Numbers

Who's Involved

↳ Why This Matters

Key facts

Frequently asked questions

What Happens Next

Get the newsletter.

How It Developed

Sources

Related Stories

China's Z.AI Releases GLM-5.2, Rivaling Claude Opus With Huawei Chips

PiQ Daily

Key Numbers

China's Z.AI Releases GLM-5.2, Rivaling Claude Opus With Huawei Chips

PiQ Daily

Key Numbers

Who's Involved

↳ Why This Matters

Key facts

Frequently asked questions

+ What is GLM-5.2?

+ What hardware was used to train GLM-5.2?

+ How does GLM-5.2 compare to other models like Claude Opus and GPT-5.5?

+ What are the system requirements for running GLM-5.2 locally?

+ What is the context window of GLM-5.2?

What Happens Next

Get the newsletter.

How It Developed

Sources

Related Stories