Garrido et al., 2015 - Google Patents

VDub: Modifying face video of actors for plausible visual alignment to a dubbed audio track

Document ID: 341442744997746321
Author: Garrido P; Valgaerts L; Sarmadi H; Steiner I; Varanasi K; Perez P; Theobalt C
Publication year: 2015
Publication venue: Computer Graphics Forum

Snippet

In many countries, foreign movies and TV productions are dubbed, i.e., the original voice of an actor is replaced with a translation that is spoken by a dubbing actor in the country's own language. Dubbing is a complex process that requires specific translations and accurately …

Classifications

- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation

Similar Documents

Publication	Publication Date	Title
Garrido et al.	2015	VDub: Modifying face video of actors for plausible visual alignment to a dubbed audio track
Kim et al.	2019	Neural style-preserving visual dubbing
Thies et al.	2020	Neural voice puppetry: Audio-driven facial reenactment
Wang et al.	2022	One-shot talking face generation from single-speaker audio-visual correlation learning
Fried et al.	2019	Text-based editing of talking-head video
CN114144790B (en)	2024-07-02	Personalized speech-to-video with three-dimensional skeletal regularization and representative body gestures
Wen et al.	2020	Photorealistic audio-driven video portraits
Kim et al.	2018	Deep video portraits
Chen et al.	2020	What comprises a good talking-head video generation?: A survey and benchmark
US11582519B1 (en)	2023-02-14	Person replacement utilizing deferred neural rendering
US11562597B1 (en)	2023-01-24	Visual dubbing using synthetic models
US8655152B2 (en)	2014-02-18	Method and system of presenting foreign films in a native language
US11581020B1 (en)	2023-02-14	Facial synchronization utilizing deferred neural rendering
US20070165022A1 (en)	2007-07-19	Method and system for the automatic computerized audio visual dubbing of movies
US11830159B1 (en)	2023-11-28	Generative films
US20250140257A1 (en)	2025-05-01	Systems and methods for improved lip dubbing
US12367630B2 (en)	2025-07-22	Generative films
Theobald et al.	2004	Near-videorealistic synthetic talking faces: Implementation and evaluation
Bigioi et al.	2023	Multilingual video dubbing—a technology review and current challenges
Bigioi et al.	2022	Pose-aware speech driven facial landmark animation pipeline for automated dubbing
Jha et al.	2019	Cross-language speech dependent lip-synchronization
WO2024234089A1 (en)	2024-11-21	Improved generative machine learning architecture for audio track replacement
WO2024121331A1 (en)	2024-06-13	Generative film editing
Ji et al.	2024	3D facial animation driven by speech-video dual-modal signals
Shen et al.	2015	Automatic video self modeling for voice disorder