[go: up one dir, main page]

Garrido et al., 2015 - Google Patents

Vdub: Modifying face video of actors for plausible visual alignment to a dubbed audio track

Garrido et al., 2015

View PDF
Document ID
341442744997746321
Author
Garrido P
Valgaerts L
Sarmadi H
Steiner I
Varanasi K
Perez P
Theobalt C
Publication year
Publication venue
Computer graphics forum

External Links

Snippet

In many countries, foreign movies and TV productions are dubbed, ie, the original voice of an actor is replaced with a translation that is spoken by a dubbing actor in the country's own language. Dubbing is a complex process that requires specific translations and accurately …
Continue reading at vcai.mpi-inf.mpg.de (PDF) (other versions)

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation

Similar Documents

Publication Publication Date Title
Garrido et al. Vdub: Modifying face video of actors for plausible visual alignment to a dubbed audio track
Kim et al. Neural style-preserving visual dubbing
Thies et al. Neural voice puppetry: Audio-driven facial reenactment
Wang et al. One-shot talking face generation from single-speaker audio-visual correlation learning
Fried et al. Text-based editing of talking-head video
CN114144790B (en) Personalized speech-to-video with three-dimensional skeletal regularization and representative body gestures
Wen et al. Photorealistic audio-driven video portraits
Kim et al. Deep video portraits
Chen et al. What comprises a good talking-head video generation?: A survey and benchmark
US11582519B1 (en) Person replacement utilizing deferred neural rendering
US11562597B1 (en) Visual dubbing using synthetic models
US8655152B2 (en) Method and system of presenting foreign films in a native language
US11581020B1 (en) Facial synchronization utilizing deferred neural rendering
US20070165022A1 (en) Method and system for the automatic computerized audio visual dubbing of movies
US11830159B1 (en) Generative films
US20250140257A1 (en) Systems and methods for improved lip dubbing
US12367630B2 (en) Generative films
Theobald et al. Near-videorealistic synthetic talking faces: Implementation and evaluation
Bigioi et al. Multilingual video dubbing—a technology review and current challenges
Bigioi et al. Pose-aware speech driven facial landmark animation pipeline for automated dubbing
Jha et al. Cross-language speech dependent lip-synchronization
WO2024234089A1 (en) Improved generative machine learning architecture for audio track replacement
WO2024121331A1 (en) Generative film editing
Ji et al. 3D facial animation driven by speech-video dual-modal signals
Shen et al. Automatic video self modeling for voice disorder