ChatGLM2-6B: An Open Bilingual Chat LLM

ChatGLM2-6B is the second-generation Chinese-English conversational LLM from Zhipu AI and Tsinghua University. It upgrades the base model using GLM's hybrid pretraining objective, 1.4T tokens of bilingual data, and human-preference alignment, with substantial gains reported on MMLU, C-Eval, GSM8K, and BBH. FlashAttention extends the base model's context window to 32K tokens, and Multi-Query Attention speeds up inference and reduces memory use. The repository includes Python APIs, CLI and web demos, FastAPI and OpenAI-format API servers, and quantized checkpoints for lightweight local deployment on GPUs, CPUs, or Apple MPS.
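As a rough illustration of the Python API mentioned above, the sketch below loads a checkpoint through Hugging Face `transformers` and runs one chat turn. It assumes the `THUDM/chatglm2-6b` checkpoint name on the Hugging Face Hub and the `chat` method supplied by the model's remote code. `load_options`, `load_chat_model`, and `demo` are hypothetical helpers written for this example, not part of the repository.

```python
def load_options(device: str) -> dict:
    """Pick a checkpoint and dtype for a device (illustrative helper, not from the repo)."""
    if device in ("cuda", "mps"):
        # Half precision is the usual choice on GPU-like backends.
        return {"checkpoint": "THUDM/chatglm2-6b", "dtype": "float16"}
    if device == "cpu":
        # CPU inference generally needs full precision.
        return {"checkpoint": "THUDM/chatglm2-6b", "dtype": "float32"}
    raise ValueError(f"unsupported device: {device!r}")


def load_chat_model(device: str = "cuda"):
    """Load tokenizer and model; downloads several GB of weights on first use."""
    # Imported lazily so the helper above works without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    opts = load_options(device)
    # trust_remote_code is required because the model class ships with the checkpoint.
    tokenizer = AutoTokenizer.from_pretrained(opts["checkpoint"], trust_remote_code=True)
    model = AutoModel.from_pretrained(opts["checkpoint"], trust_remote_code=True)
    model = model.float() if opts["dtype"] == "float32" else model.half()
    return tokenizer, model.to(device).eval()


def demo(device: str = "cuda") -> str:
    """Run a single chat turn and return the reply text."""
    tokenizer, model = load_chat_model(device)
    # `chat` comes from the checkpoint's remote code (assumed signature).
    response, _history = model.chat(tokenizer, "Hello", history=[])
    return response
```

Calling `demo("cpu")` should work without a GPU, though it will be slow and memory-hungry at full precision. The remote code may also require a specific `transformers` version.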

Features

Stronger base model: large-scale bilingual pretraining plus alignment, with large benchmark gains
Long-context variants: 8K by default, plus a 32K model (competitive on LongBench)
Faster, lighter inference: Multi-Query Attention and FlashAttention
Low-cost deployment: FP16/BF16 and INT8/INT4 quantization (≈5.5 GB at INT4), on GPU, CPU, or Apple MPS
Demos and APIs: CLI, Gradio/Streamlit, FastAPI and OpenAI-format servers
Fine-tuning and tooling: P-Tuning v2, full-parameter scripts, multi-GPU utilities
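The memory effect of quantization and Multi-Query Attention can be approximated with simple arithmetic. The sketch below is a back-of-the-envelope estimate, not a measurement. The parameter count (6.2 billion), layer count (28), attention heads (32), head dimension (128), and number of shared key/value groups (2) are assumptions taken from memory of the model configuration, not from this listing. The weight figure is weight-only, so real usage will be higher once activations, the KV cache, and any unquantized layers are included.

```python
GIB = 2**30  # bytes per GiB


def weight_gib(n_params: float, bits: int) -> float:
    """Weight-only memory for n_params parameters stored at `bits` bits each."""
    return n_params * bits / 8 / GIB


def kv_cache_gib(seq_len: int, n_layers: int, n_kv_heads: int,
                 head_dim: int, bytes_per_value: int = 2) -> float:
    """KV-cache size for one sequence: keys and values (factor 2) for every layer."""
    return 2 * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_value / GIB


# Assumed configuration values (see the paragraph above).
N_PARAMS = 6.2e9
N_LAYERS, N_HEADS, HEAD_DIM, KV_GROUPS = 28, 32, 128, 2

if __name__ == "__main__":
    for name, bits in [("FP16", 16), ("INT8", 8), ("INT4", 4)]:
        print(f"{name}: {weight_gib(N_PARAMS, bits):.1f} GiB of weights")
    full = kv_cache_gib(8192, N_LAYERS, N_HEADS, HEAD_DIM)    # one K/V pair per head
    mqa = kv_cache_gib(8192, N_LAYERS, KV_GROUPS, HEAD_DIM)   # K/V shared across groups
    print(f"8K KV cache: {full:.2f} GiB (per-head) vs {mqa:.2f} GiB (shared)")
```

Under these assumptions, an 8K-token FP16 cache shrinks from 3.5 GiB with per-head keys and values to about 0.22 GiB with two shared groups, a 16x reduction. That gap is why Multi-Query Attention matters more as the context grows.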

License

Apache License V2.0

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Python, Unix Shell

Related Categories

Unix Shell Large Language Models (LLM), Python Large Language Models (LLM)

Registered

2025-10-04
