Yehia et al., 1999

Using speech acoustics to drive facial motion

Document ID: 3776162231970022101
Author: Yehia H; Kuratate T; Vatikiotis-Bateson E
Publication year: 1999
Publication venue: Proc. of the 14th International Congress of Phonetic Sciences

Snippet

This paper describes and evaluates a method to estimate facial motion during speech from the speech acoustics. It is a statistical method based on simultaneous measurements of facial motion and speech acoustics. Experiments were carried out for one American English …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/66—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination for extracting parameters related to health condition
- G10L17/00—Speaker identification or verification
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices

Similar Documents

Publication	Publication Date	Title
Yehia et al.	2002	Linking facial animation, head motion and speech acoustics
Yehia et al.	1999	Using speech acoustics to drive facial motion
Yehia et al.	1998	Quantitative association of vocal-tract and facial behavior
Kuratate et al.	1999	Audio-visual synthesis of talking faces from speech production correlates.
CN101887728B (en)	2011-11-23	Method for multi-sensory speech enhancement
Kangas	1994	On the analysis of pattern sequences by self-organizing maps
JP2003255993A (en)	2003-09-10	Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program
DE4317372A1 (en)	1993-12-02	Acoustic and visual input speech recognition system - monitors lip and mouth movements by video camera to provide motion vector input to neural network based speech identification unit.
CN118398033B (en)	2025-02-11	A speech-based emotion recognition method, system, device and storage medium
CN118800277B (en)	2024-12-06	Digital human interaction system and method based on big data information
Yehia et al.	2000	Facial animation and head motion driven by speech acoustics
CN120086807A (en)	2025-06-03	Adaptive teaching strategy adjustment method based on sentiment analysis, computer device
Pitermann et al.	2001	An inverse dynamics approach to face animation
Monaci et al.	2009	Learning bimodal structure in audio–visual data
Rani et al.	2015	Speech recognition using neural network
Lee et al.	2025	Articulatory Feature Prediction from Surface EMG during Speech Production
Kagalkar et al.	2018	Mobile Application Based Translation of Sign Language to Text Description in Kannada Language.
Brooke	1996	Talking heads and speech recognisers that can see: The computer processing of visual speech signals
Sharma et al.	2019	Gesture recognition system
Csapó	2021	Extending text-to-speech synthesis with articulatory movement prediction using ultrasound tongue imaging
US20070154033A1 (en)	2007-07-05	Audio source separation based on flexible pre-trained probabilistic source models
Vatikiotis-Bateson et al.	2002	Speaking mode variability in multimodal speech production
JPH02232783A (en)	1990-09-14	Syllable recognizing device by brain wave topography
Barbosa et al.	2007	Temporal characterization of auditory-visual coupling in speech
Bergsland et al.	2023	Examining the correlation between dance and electroacoustic music phrases: a pilot study