Seyedin et al., 2013 - Google Patents

New features using robust MVDR spectrum of filtered autocorrelation sequence for robust speech recognition

Seyedin et al., 2013

Document ID: 36123590969493944
Author: Seyedin S; Ahadi S; Gazor S
Publication year: 2013
Publication venue: The Scientific World Journal

External Links

Cited by

Snippet

This paper presents a novel noise‐robust feature extraction method for speech recognition using the robust perceptual minimum variance distortionless response (MVDR) spectrum of temporally filtered autocorrelation sequence. The perceptual MVDR spectrum of the filtered …

Continue reading at onlinelibrary.wiley.com (PDF) (other versions)

238000001228 spectrum 0 title abstract description 68

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/90—Pitch determination of speech signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems

Similar Documents

Publication	Publication Date	Title
Braun et al.	2020	Data augmentation and loss normalization for deep noise suppression
CN109256144B (en)	2022-09-06	Speech enhancement method based on ensemble learning and noise perception training
Alam et al.	2014	Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique
Xu et al.	2021	Components loss for neural networks in mask-based speech enhancement
CN103971697B (en)	2016-11-23	Sound enhancement method based on non-local mean filtering
Lee et al.	2014	Intra‐and Inter‐frame Features for Automatic Speech Recognition
Selva Nidhyananthan et al.	2016	Noise robust speaker identification using RASTA–MFCC feature with quadrilateral filter bank structure
Lan et al.	2022	Research on speech enhancement algorithm of multiresolution cochleagram based on skip connection deep neural network
Seyedin et al.	2013	New features using robust MVDR spectrum of filtered autocorrelation sequence for robust speech recognition
Panda et al.	2011	Psychoacoustic model compensation for robust speaker verification in environmental noise
Rao et al.	2012	Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration
Kamarudin et al.	2016	Acoustic echo cancellation using adaptive filtering algorithms for Quranic accents (Qiraat) identification
Mallidi et al.	2013	Robust speaker recognition using spectro-temporal autoregressive models.
Principi et al.	2010	Comparative Evaluation of Single‐Channel MMSE‐Based Noise Reduction Schemes for Speech Recognition
Upadhyay et al.	2018	Robust recognition of English speech in noisy environments using frequency warped signal processing
Elshamy et al.	2017	Two-stage speech enhancement with manipulation of the cepstral excitation
Gudmalwar et al.	2022	Single channel speech enhancement using masking based on sinusoidal modeling
Lee et al.	2014	Speech Enhancement Using Phase‐Dependent A Priori SNR Estimator in Log‐Mel Spectral Domain
Milner et al.	2008	Applying noise compensation methods to robustly predict acoustic speech features from MFCC vectors in noise
Liang et al.	2021	Real-time speech enhancement algorithm for transient noise suppression
Farahani et al.	2007	Features based on filtering and spectral peaks in autocorrelation domain for robust speech recognition
Seyedin et al.	2009	Robust MVDR-based feature extraction for speech recognition
Hsieh et al.	2013	Histogram equalization of real and imaginary modulation spectra for noise-robust speech recognition.
Zoghlami et al.	2012	Application of perceptual filtering models to noisy speech signals enhancement
da Silva et al.	2018	Comparative Study between the Discrete‐Frequency Kalman Filtering and the Discrete‐Time Kalman Filtering with Application in Noise Reduction in Speech Signals