Open Multilingual Multimodal Chat LMs
Compact hybrid reasoning language model for intelligent responses
Implementation of model parallel autoregressive transformers on GPUs
The official PyTorch implementation of Google's Gemma models
New set of lightweight state-of-the-art, open foundation models
A Family of Open Foundation Models for Code Intelligence
Large-scale xAI model for local inference with SGLang, Grok-2.5
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Efficient 13B MoE language model with long context and reasoning modes
Tencent’s 36-language state-of-the-art translation model
Multimodal-Driven Architecture for Customized Video Generation
Diffusion Transformer with Fine-Grained Chinese Understanding
A Customizable Image-to-Video Model based on HunyuanVideo
Code for reproducing key results in the paper
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Kimi K2: 1T-param MoE model for advanced coding and agentic reasoning
Language modeling in a sentence representation space
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
PyTorch implementation of MAE
A library for Multilingual Unsupervised or Supervised word Embeddings
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Code release for "Masked-attention Mask Transformer
JetBrains’ 4B parameter code model for completions
ICLR2024 Spotlight: curation/training code, metadata, distribution