Garrido et al., 2015 - Google Patents
Vdub: Modifying face video of actors for plausible visual alignment to a dubbed audio trackGarrido et al., 2015
View PDF- Document ID
- 341442744997746321
- Author
- Garrido P
- Valgaerts L
- Sarmadi H
- Steiner I
- Varanasi K
- Perez P
- Theobalt C
- Publication year
- Publication venue
- Computer graphics forum
External Links
Snippet
In many countries, foreign movies and TV productions are dubbed, ie, the original voice of an actor is replaced with a translation that is spoken by a dubbing actor in the country's own language. Dubbing is a complex process that requires specific translations and accurately …
- 230000000007 visual effect 0 title abstract description 22
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Garrido et al. | Vdub: Modifying face video of actors for plausible visual alignment to a dubbed audio track | |
Kim et al. | Neural style-preserving visual dubbing | |
Thies et al. | Neural voice puppetry: Audio-driven facial reenactment | |
Wang et al. | One-shot talking face generation from single-speaker audio-visual correlation learning | |
Fried et al. | Text-based editing of talking-head video | |
CN114144790B (en) | Personalized speech-to-video with three-dimensional skeletal regularization and representative body gestures | |
Wen et al. | Photorealistic audio-driven video portraits | |
Kim et al. | Deep video portraits | |
Chen et al. | What comprises a good talking-head video generation?: A survey and benchmark | |
US11582519B1 (en) | Person replacement utilizing deferred neural rendering | |
US11562597B1 (en) | Visual dubbing using synthetic models | |
US8655152B2 (en) | Method and system of presenting foreign films in a native language | |
US11581020B1 (en) | Facial synchronization utilizing deferred neural rendering | |
US20070165022A1 (en) | Method and system for the automatic computerized audio visual dubbing of movies | |
US11830159B1 (en) | Generative films | |
US20250140257A1 (en) | Systems and methods for improved lip dubbing | |
US12367630B2 (en) | Generative films | |
Theobald et al. | Near-videorealistic synthetic talking faces: Implementation and evaluation | |
Bigioi et al. | Multilingual video dubbing—a technology review and current challenges | |
Bigioi et al. | Pose-aware speech driven facial landmark animation pipeline for automated dubbing | |
Jha et al. | Cross-language speech dependent lip-synchronization | |
WO2024234089A1 (en) | Improved generative machine learning architecture for audio track replacement | |
WO2024121331A1 (en) | Generative film editing | |
Ji et al. | 3D facial animation driven by speech-video dual-modal signals | |
Shen et al. | Automatic video self modeling for voice disorder |