This repository contains the official implementation of FastVLM
This repository contains the official implementation of research
Reproduces results of "Fixing the train-test resolution discrepancy"
FlashMLA: Efficient Multi-head Latent Attention Kernels
A PyTorch library for implementing flow matching algorithms
GLIDE: a diffusion-based text-conditional image synthesis model
GLM-4 series: Open Multilingual Multimodal Chat LMs
Open Multilingual Multimodal Chat LMs
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Compact hybrid reasoning language model for intelligent responses
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Implementation of model parallel autoregressive transformers on GPUs
The official PyTorch implementation of Google's Gemma models
New set of lightweight state-of-the-art, open foundation models
A Family of Open Foundation Models for Code Intelligence
Foundation Models for Time Series
Large-scale xAI model for local inference with SGLang, Grok-2.5
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Efficient 13B MoE language model with long context and reasoning modes
Tencent’s 36-language state-of-the-art translation model
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Diffusion Transformer with Fine-Grained Chinese Understanding
A Customizable Image-to-Video Model based on HunyuanVideo
Release for Improved Denoising Diffusion Probabilistic Models