[go: up one dir, main page]

Hummes et al., 2011 - Google Patents

Robust acoustic speaker localization with distributed microphones

Hummes et al., 2011

View PDF
Document ID
305209950664458227
Author
Hummes F
Qi J
Fingscheidt T
Publication year
Publication venue
2011 19th European Signal Processing Conference

External Links

Snippet

This contribution to acoustic source localization presents a robust approach verified with ten distributed microphones in a laboratory apartment under reverberant acoustic conditions. Based on the classical steered response power phase transform (SRP-PHAT) algorithm …
Continue reading at eurasip.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/04Training, enrolment or model building
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • H04R29/004Monitoring arrangements; Testing arrangements for microphones

Similar Documents

Publication Publication Date Title
Vesperini et al. A neural network based algorithm for speaker localization in a multi-room environment
CN104254819B (en) Audio user interaction recognition and contextual refinement
JP6129316B2 (en) Apparatus and method for providing information-based multi-channel speech presence probability estimation
CN110770827A (en) Near field detector based on correlation
Brutti et al. Multiple source localization based on acoustic map de-emphasis
Koldovský et al. Semi-blind noise extraction using partially known position of the target source
Yang et al. Model-based head orientation estimation for smart devices
Transfeld et al. Acoustic event source localization for surveillance in reverberant environments supported by an event onset detection
Pasha et al. Blind speaker counting in highly reverberant environments by clustering coherence features
Hummes et al. Robust acoustic speaker localization with distributed microphones
Hadad et al. Multi-speaker direction of arrival estimation using SRP-PHAT algorithm with a weighted histogram
Wu et al. Speaker localization and tracking in the presence of sound interference by exploiting speech harmonicity
WO2016028254A1 (en) Methods and apparatus for speech segmentation using multiple metadata
Brutti et al. A speech event detection and localization task for multiroom environments
Gburrek et al. Spatial diarization for meeting transcription with ad-hoc acoustic sensor networks
Nguyen et al. Selection of the closest sound source for robot auditory attention in multi-source scenarios
Araki et al. Speaker indexing and speech enhancement in real meetings/conversations
Giannoulis et al. The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home
Taseska et al. Minimum Bayes risk signal detection for speech enhancement based on a narrowband DOA model
Dickerson et al. Resonate: reverberation environment simulation for improved classification of speech models
Pasha et al. A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques
Maraboina et al. Multi-speaker voice activity detection using ICA and beampattern analysis
Schwartz et al. Multi-microphone voice activity and single-talk detectors based on steered-response power output entropy
Kayser et al. Probabilistic Spatial Filter Estimation for Signal Enhancement in Multi-Channel Automatic Speech Recognition.
Maunder et al. Robust Sounds of Activities of Daily Living Classification in Two‐Channel Audio‐Based Telemonitoring