Hummes et al., 2011 - Google Patents

Robust acoustic speaker localization with distributed microphones

Hummes et al., 2011

Document ID: 305209950664458227
Author: Hummes F; Qi J; Fingscheidt T
Publication year: 2011
Publication venue: 2011 19th European Signal Processing Conference

External Links

Cited by

Snippet

This contribution to acoustic source localization presents a robust approach verified with ten distributed microphones in a laboratory apartment under reverberant acoustic conditions. Based on the classical steered response power phase transform (SRP-PHAT) algorithm …

Continue reading at eurasip.org (PDF) (other versions)

230000004807 localization 0 title abstract description 18

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/004—Monitoring arrangements; Testing arrangements for microphones

Similar Documents

Publication	Publication Date	Title
Vesperini et al.	2016	A neural network based algorithm for speaker localization in a multi-room environment
CN104254819B (en)	2017-09-08	Audio user interaction recognition and contextual refinement
JP6129316B2 (en)	2017-05-17	Apparatus and method for providing information-based multi-channel speech presence probability estimation
CN110770827A (en)	2020-02-07	Near field detector based on correlation
Brutti et al.	2010	Multiple source localization based on acoustic map de-emphasis
Koldovský et al.	2013	Semi-blind noise extraction using partially known position of the target source
Yang et al.	2021	Model-based head orientation estimation for smart devices
Transfeld et al.	2015	Acoustic event source localization for surveillance in reverberant environments supported by an event onset detection
Pasha et al.	2017	Blind speaker counting in highly reverberant environments by clustering coherence features
Hummes et al.	2011	Robust acoustic speaker localization with distributed microphones
Hadad et al.	2018	Multi-speaker direction of arrival estimation using SRP-PHAT algorithm with a weighted histogram
Wu et al.	2013	Speaker localization and tracking in the presence of sound interference by exploiting speech harmonicity
WO2016028254A1 (en)	2016-02-25	Methods and apparatus for speech segmentation using multiple metadata
Brutti et al.	2014	A speech event detection and localization task for multiroom environments
Gburrek et al.	2023	Spatial diarization for meeting transcription with ad-hoc acoustic sensor networks
Nguyen et al.	2016	Selection of the closest sound source for robot auditory attention in multi-source scenarios
Araki et al.	2008	Speaker indexing and speech enhancement in real meetings/conversations
Giannoulis et al.	2014	The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home
Taseska et al.	2015	Minimum Bayes risk signal detection for speech enhancement based on a narrowband DOA model
Dickerson et al.	2014	Resonate: reverberation environment simulation for improved classification of speech models
Pasha et al.	2019	A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques
Maraboina et al.	2006	Multi-speaker voice activity detection using ICA and beampattern analysis
Schwartz et al.	2018	Multi-microphone voice activity and single-talk detectors based on steered-response power output entropy
Kayser et al.	2016	Probabilistic Spatial Filter Estimation for Signal Enhancement in Multi-Channel Automatic Speech Recognition.
Maunder et al.	2013	Robust Sounds of Activities of Daily Living Classification in Two‐Channel Audio‐Based Telemonitoring