Robust acoustic speaker localization with distributed microphones
Hummes et al., 2011
- Document ID
- 305209950664458227
- Author
- Hummes F
- Qi J
- Fingscheidt T
- Publication year
- 2011
- Publication venue
- 2011 19th European Signal Processing Conference
Snippet
This contribution to acoustic source localization presents a robust approach verified with ten distributed microphones in a laboratory apartment under reverberant acoustic conditions. Based on the classical steered response power phase transform (SRP-PHAT) algorithm …
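The snippet names the classical SRP-PHAT algorithm as the starting point. As a rough illustration of that classical baseline only (not the robust variant the paper proposes, which the snippet does not describe), a minimal sketch might look like the following. Everything specific here is an assumption made for the example: four microphones at the corners of a 1 m square, a coarse 2-D candidate grid, an 8 kHz sampling rate, integer-sample circular delays without reverberation, and the hypothetical helper names `delay_samples`, `gcc_phat`, `srp_phat`, and `simulate`.

```python
import cmath
import math
import random

SPEED_OF_SOUND = 343.0  # m/s (assumed room-temperature value)
FS = 8000               # sampling rate in Hz (assumed for this toy example)


def delay_samples(pos, mic, fs=FS):
    """Propagation delay from pos to mic, rounded to whole samples."""
    return round(math.dist(pos, mic) * fs / SPEED_OF_SOUND)


def dft(x, sign=-1):
    """Naive O(n^2) DFT (sign=-1) or unnormalised inverse DFT (sign=+1)."""
    n = len(x)
    w = [cmath.exp(sign * 2j * math.pi * k / n) for k in range(n)]
    return [sum(x[t] * w[(k * t) % n] for t in range(n)) for k in range(n)]


def gcc_phat(spec_i, spec_j):
    """PHAT-weighted circular cross-correlation; peaks at lag d_i - d_j."""
    cross = [a * b.conjugate() for a, b in zip(spec_i, spec_j)]
    # PHAT keeps only the phase of each cross-spectrum bin.
    phat = [c / abs(c) if abs(c) > 1e-12 else 0j for c in cross]
    n = len(phat)
    return [v.real / n for v in dft(phat, sign=+1)]


def srp_phat(signals, mics, grid, fs=FS):
    """Return the grid point with the highest steered response power."""
    n = len(signals[0])
    specs = [dft(x) for x in signals]
    pairs = [(i, j) for i in range(len(mics)) for j in range(i + 1, len(mics))]
    gcc = {p: gcc_phat(specs[p[0]], specs[p[1]]) for p in pairs}

    def power(q):
        # Steer to candidate q: sum each pair's GCC-PHAT at the expected lag.
        d = [delay_samples(q, m, fs) for m in mics]
        return sum(gcc[(i, j)][(d[i] - d[j]) % n] for i, j in pairs)

    return max(grid, key=power)


def simulate(source, mics, n=128, seed=0):
    """Toy free-field data: white noise, circularly delayed per microphone."""
    rng = random.Random(seed)
    s = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return [[s[(t - delay_samples(source, m)) % n] for t in range(n)]
            for m in mics]


# Hypothetical setup: microphones at the corners of a 1 m x 1 m square.
mics = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
grid = [(0.25 * ix, 0.25 * iy) for ix in range(5) for iy in range(5)]
signals = simulate((0.25, 0.75), mics)
estimate = srp_phat(signals, mics, grid)
print(estimate)
```

On this noise-free toy data the estimate coincides with the simulated source position. Under reverberant conditions such as those the snippet mentions, spurious peaks in the pairwise correlations would be expected to make the plain grid maximum less reliable.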
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/004—Monitoring arrangements; Testing arrangements for microphones
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Vesperini et al. | A neural network based algorithm for speaker localization in a multi-room environment | |
| CN104254819B (en) | Audio user interaction recognition and contextual refinement | |
| JP6129316B2 (en) | Apparatus and method for providing information-based multi-channel speech presence probability estimation | |
| CN110770827A (en) | Near field detector based on correlation | |
| Brutti et al. | Multiple source localization based on acoustic map de-emphasis | |
| Koldovský et al. | Semi-blind noise extraction using partially known position of the target source | |
| Yang et al. | Model-based head orientation estimation for smart devices | |
| Transfeld et al. | Acoustic event source localization for surveillance in reverberant environments supported by an event onset detection | |
| Pasha et al. | Blind speaker counting in highly reverberant environments by clustering coherence features | |
| Hummes et al. | Robust acoustic speaker localization with distributed microphones | |
| Hadad et al. | Multi-speaker direction of arrival estimation using SRP-PHAT algorithm with a weighted histogram | |
| Wu et al. | Speaker localization and tracking in the presence of sound interference by exploiting speech harmonicity | |
| WO2016028254A1 (en) | Methods and apparatus for speech segmentation using multiple metadata | |
| Brutti et al. | A speech event detection and localization task for multiroom environments | |
| Gburrek et al. | Spatial diarization for meeting transcription with ad-hoc acoustic sensor networks | |
| Nguyen et al. | Selection of the closest sound source for robot auditory attention in multi-source scenarios | |
| Araki et al. | Speaker indexing and speech enhancement in real meetings/conversations | |
| Giannoulis et al. | The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home | |
| Taseska et al. | Minimum Bayes risk signal detection for speech enhancement based on a narrowband DOA model | |
| Dickerson et al. | Resonate: reverberation environment simulation for improved classification of speech models | |
| Pasha et al. | A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques | |
| Maraboina et al. | Multi-speaker voice activity detection using ICA and beampattern analysis | |
| Schwartz et al. | Multi-microphone voice activity and single-talk detectors based on steered-response power output entropy | |
| Kayser et al. | Probabilistic Spatial Filter Estimation for Signal Enhancement in Multi-Channel Automatic Speech Recognition | |
| Maunder et al. | Robust Sounds of Activities of Daily Living Classification in Two‐Channel Audio‐Based Telemonitoring |