High-resolution models for human tasks
Stable Diffusion WebUI Forge is a platform built on top of Stable Diffusion WebUI
Open-source large language model by Alibaba
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
An AI-powered security review GitHub Action using Claude
Let us control diffusion models
PyTorch code and models for the DINOv2 self-supervised learning method
DeepSeek Coder: Let the Code Write Itself
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Dataset of GPT-2 outputs for research in detection, biases, and more
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
Video understanding codebase from FAIR for reproducing video models
Learning to Act by Watching Unlabeled Online Videos
Powerful open-source image generation model
Open-Source Financial Large Language Models!
Open-source, high-performance Mixture-of-Experts large language model
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series
Blazeface is a lightweight model that detects faces in images
Detect faces in an image
A CNN model that predicts human joints from RGB images of a person
4M: Massively Multimodal Masked Modeling
Custom BLEURT model for evaluating text similarity using PyTorch
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
CLIP: predicts the most relevant text snippet for a given image