Burget, 2004 - Google Patents

Measurement of complementarity of recognition systems

Burget, 2004

Document ID: 17532357070772269136
Author: Burget L
Publication year: 2004
Publication venue: International Conference on Text, Speech and Dialogue

External Links

Cited by

Snippet

Combination of different speech recognition systems can be powerful technique to improve recognition performance. The success of these techniques, however, depends on the complementarity of the combined systems. In this paper, a measure of complementarity of …

Continue reading at www.researchgate.net (PDF) (other versions)

238000005259 measurement 0 title description 8

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/06—Decision making techniques; Pattern matching strategies
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation

Similar Documents

Publication	Publication Date	Title
Rabiner et al.	1989	HMM clustering for connected word recognition
US6571210B2 (en)	2003-05-27	Confidence measure system using a near-miss pattern
US5167004A (en)	1992-11-24	Temporal decorrelation method for robust speaker verification
Castaldo et al.	2007	Compensation of nuisance factors for speaker and language recognition
US6223155B1 (en)	2001-04-24	Method of independently creating and using a garbage model for improved rejection in a limited-training speaker-dependent speech recognition system
US6134527A (en)	2000-10-17	Method of testing a vocabulary word being enrolled in a speech recognition system
US20140025379A1 (en)	2014-01-23	Method and System for Real-Time Keyword Spotting for Speech Analytics
US20080312926A1 (en)	2008-12-18	Automatic Text-Independent, Language-Independent Speaker Voice-Print Creation and Speaker Recognition
EP0453649B1 (en)	1996-10-09	Method and apparatus for modeling words with composite Markov models
Bocchieri et al.	1993	Discriminative feature selection for speech recognition
US20040122672A1 (en)	2004-06-24	Gaussian model-based dynamic time warping system and method for speech processing
Ravinder	2010	Comparison of hmm and dtw for isolated word recognition system of punjabi language
Burget	2004	Combination of speech features using smoothed heteroscedastic linear discriminant analysis.
Ilyas et al.	2007	Speaker verification using vector quantization and hidden Markov model
US20030097263A1 (en)	2003-05-22	Decision tree based speech recognition
US7043430B1 (en)	2006-05-09	System and method for speech recognition using tonal modeling
Burget	2004	Measurement of complementarity of recognition systems
Shahin	2005	Improving speaker identification performance under the shouted talking condition using the second-order hidden Markov models
Bouwman et al.	2000	Weighting phone confidence measures for automatic speech recognition
Burget	2004	Complementarity of speech recognition systems and system combination
Chaudhari et al.	2000	Transformation enhanced multi-grained modeling for text-independent speaker recognition.
Ming et al.	2003	Speech recognition with unknown partial feature corruption–a review of the union model
Deng et al.	1989	Use of vowel duration information in a large vocabulary word recognizer
McLaren et al.	2016	On the Issue of Calibration in DNN-Based Speaker Recognition Systems.
Wilpon et al.	1993	Connected digit recognition based on improved acoustic resolution