DCVGAN

This paper proposes a new GAN architecture for video generation that is trained on paired depth and color videos. The model explicitly uses the depth information in a video sequence as additional input to a GAN-based video generation scheme, so that the model captures scene dynamics more accurately. It is trained on pairs of color and depth videos and generates a video in two steps. First, it generates a depth video, which models the scene dynamics from geometric information. Second, it translates each depth image to a color image, adding appropriate color to the scene geometry. The generator consists of three networks, and the model has two discriminators.
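The two-step generation described above can be illustrated with a minimal, dependency-free sketch. It only shows the data flow (latent sequence, then depth video, then per-image depth-to-color translation). The split into these three functions, their names, and the toy formulas inside them are assumptions made for illustration; they are not taken from the DCVGAN code, which uses trained PyTorch networks.

```python
import math
import random

# Toy stand-ins for the three generator networks. Every name and formula
# here is hypothetical and only illustrates the data flow, not DCVGAN itself.

def latent_sequence(z0, num_frames, rng):
    """Network 1 (assumed role): roll an initial latent vector forward in time."""
    seq, z = [], list(z0)
    for _ in range(num_frames):
        # Simple bounded recurrence standing in for a learned temporal model.
        z = [math.tanh(0.9 * v + 0.1 * rng.gauss(0.0, 1.0)) for v in z]
        seq.append(z)
    return seq

def depth_frame(z, height, width):
    """Network 2 (assumed role): map one latent vector to an HxW depth map in [0, 1]."""
    return [
        [0.5 + 0.5 * math.tanh(z[(r + c) % len(z)]) for c in range(width)]
        for r in range(height)
    ]

def colorize(depth):
    """Network 3 (assumed role): per-image depth-to-color translation (RGB per pixel)."""
    return [[(d, 1.0 - d, 0.5 * d) for d in row] for row in depth]

def generate_video(num_frames=4, height=3, width=5, latent_dim=8, seed=0):
    rng = random.Random(seed)
    z0 = [rng.gauss(0.0, 1.0) for _ in range(latent_dim)]
    # Step 1: generate the depth video, which carries the scene dynamics.
    depth_video = [depth_frame(z, height, width)
                   for z in latent_sequence(z0, num_frames, rng)]
    # Step 2: translate each depth image to a color image independently.
    color_video = [colorize(frame) for frame in depth_video]
    return depth_video, color_video
```

Calling `generate_video()` returns a depth video and a color video with the same number of frames. Because the color step operates on one image at a time, all temporal structure in this sketch comes from the depth video, which mirrors the design described above.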

Features

Generators
Discriminators
Requires Python 3.7, PyTorch, FFmpeg, OpenCV, and GraphViz
Facial expression datasets
Hand gesture datasets
Train, sample, and evaluate
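As a convenience, the listed requirements can be checked before training with a short script. This is a sketch under assumptions: it presumes PyTorch and OpenCV are importable as `torch` and `cv2`, that FFmpeg is used through the `ffmpeg` executable, and that GraphViz is used through its `dot` executable. The project may instead rely on a Python binding, which this check does not cover.

```python
import importlib.util
import shutil
import sys

def check_requirements():
    """Return a dict mapping each requirement to True if it appears available."""
    return {
        # Interpreter version stated in the feature list.
        "python>=3.7": sys.version_info >= (3, 7),
        # Python packages: find_spec returns None when a module is not installed.
        "torch": importlib.util.find_spec("torch") is not None,
        "cv2": importlib.util.find_spec("cv2") is not None,
        # Command-line tools: which returns None when not found on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        "dot": shutil.which("dot") is not None,  # assumed GraphViz entry point
    }

if __name__ == "__main__":
    for name, ok in check_requirements().items():
        print(f"{name}: {'found' if ok else 'missing'}")
```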

Additional Project Details

Programming Language

Python

Related Categories

Python Machine Learning Software, Python AI Video Generators, Python Generative AI

Registered

2023-03-22
