VMZ (Video Model Zoo) download

The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.

Features

Implements R(2+1)D and MCx models for efficient spatiotemporal video representation learning
Enables reproducibility of FAIR’s published video understanding research
Built with both Caffe2 and PyTorch backends for flexibility
Supports Gradient Blending for audio-visual fusion (Caffe2 only)
Provides pre-trained models on IG-65M, one of the largest weakly-supervised video datasets
Includes CSN (Channel-Separated Networks) for computationally efficient video recognition

Project Activity

See All Activity >

License

Apache License V2.0

Follow VMZ (Video Model Zoo)

VMZ (Video Model Zoo) Web Site

User Reviews

Be the first to post a review of VMZ (Video Model Zoo)!

Additional Project Details

Operating Systems

Linux

Programming Language

C++, Python, Unix Shell

Related Categories

Unix Shell Video Software, Unix Shell AI Models, Python Video Software, Python AI Models, C++ Video Software, C++ AI Models

Registered

2025-10-08

Similar Business Software

Vertex AI

Build, deploy, and scale machine learning (ML) models faster, with fully managed ML tools for any use case. Through Vertex AI Workbench, Vertex AI is natively integrated with BigQuery, Dataproc, and Spark. You can use BigQuery ML to create and execute machine learning models in BigQuery...

See Software
Picsart Enterprise

AI-Powered Image & Video Editing for Seamless Integration. Enhance your visual content workflows with Picsart Creative APIs, a robust suite of AI-driven tools for developers, product owners, and entrepreneurs. Easily integrate advanced image and video processing capabilities into your...

See Software
LM-Kit.NET

LM-Kit.NET is a cutting-edge, high-level inference SDK designed specifically to bring the advanced capabilities of Large Language Models (LLM) into the C# ecosystem. Tailored for developers working within .NET, LM-Kit.NET provides a comprehensive suite of powerful Generative AI tools, making...

See Software
Google AI Studio

Google AI Studio is a unified development platform that helps teams explore, build, and deploy applications using Google’s most advanced AI models, including Gemini 3. It brings text, image, audio, and video models together in one interactive playground. With vibe coding, developers can use...

See Software
Adventr

Your viewers expect a personalized online experience. With powerful interactive features such as mobile interactivity, customized social sharing, pre-roll ad network compatibility, voice control, and more, Adventr now allows anyone to easily creat interactive, actionable videos any share them at...

See Software
GPT-4 Turbo

GPT-4 is a large multimodal model (accepting text or image inputs and outputting text) that can solve difficult problems with greater accuracy than any of our previous models, thanks to its broader general knowledge and advanced reasoning capabilities. GPT-4 is available in the OpenAI API to...

See Software

Report inappropriate content

VMZ (Video Model Zoo)

VMZ: Model Zoo for Video Modeling

Get an email when there's a new version of VMZ (Video Model Zoo)

Features

Project Activity

Categories

License

Follow VMZ (Video Model Zoo)

User Reviews

Additional Project Details

Operating Systems

Programming Language

Related Categories

Registered