Alcala Padilla et al., 2025 - Google Patents

Location-aware target speaker extraction for hearing aids

Alcala Padilla et al., 2025

Document ID: 14445678154387883034
Author: Alcala Padilla D; Westhausen N; Vivekananthan S; Meyer B
Publication year: 2025
Publication venue: Proc. Interspeech 2025

External Links

Cited by

Snippet

Target speaker extraction (TSE) using deep learning offers potential benefits for hearing- impaired listeners. However, their implementation in hearing aids requires low-latency, lowcomplexity algorithms capable of real-time operation. Existing models that comply with …

Continue reading at www.isca-archive.org (PDF) (other versions)

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/15—Aspects of sound capture and related signal processing for recording or reproduction
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/55—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception using an external connection, either wireless or wired
- H04R25/552—Binaural
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/50—Customised settings for obtaining desired overall acoustical characteristics
- H04R25/505—Customised settings for obtaining desired overall acoustical characteristics using digital signal processing
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups

Similar Documents

Publication	Publication Date	Title
Zhang et al.	2017	Deep learning based binaural speech separation in reverberant environments
Vecchiotti et al.	2019	End-to-end binaural sound localisation from the raw waveform
Han et al.	2020	Real-time binaural speech separation with preserved spatial cues
CN109830245B (en)	2021-03-12	A method and system for multi-speaker speech separation based on beamforming
Hadad et al.	2016	The binaural LCMV beamformer and its performance analysis
Marquardt et al.	2015	Interaural coherence preservation in multi-channel Wiener filtering-based noise reduction for binaural hearing aids
Roman et al.	2003	Speech segregation based on sound localization
Marquardt et al.	2015	Theoretical analysis of linearly constrained multi-channel Wiener filtering algorithms for combined noise reduction and binaural cue preservation in binaural hearing aids
Pedersen et al.	2008	Two-microphone separation of speech mixtures
EP1927264B1 (en)	2016-07-20	Method of and device for generating and processing parameters representing hrtfs
CN110728989B (en)	2020-07-14	A Binaural Speech Separation Method Based on Long Short-Term Memory Network LSTM
Richard et al.	2023	Audio signal processing in the 21st century: The important outcomes of the past 25 years
Alinaghi et al.	2014	Joint mixing vector and binaural model based stereo source separation
Marquardt et al.	2018	Interaural coherence preservation for binaural noise reduction using partial noise estimation and spectral postfiltering
Dadvar et al.	2019	Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target
Westhausen et al.	2023	Low bit rate binaural link for improved ultra low-latency low-complexity multichannel speech enhancement in hearing aids
Corey	2019	Microphone array processing for augmented listening
Westhausen et al.	2024	Real-time multichannel deep speech enhancement in hearing aids: Comparing monaural and binaural processing in complex acoustic scenarios
Westhausen et al.	2023	Binaural multichannel blind speaker separation with a causal low-latency and low-complexity approach
Alcala Padilla et al.	2025	Location-aware target speaker extraction for hearing aids
Tammen et al.	2025	Imposing correlation structures for deep binaural spatio-temporal Wiener filtering
Ko et al.	2025	DNN-Based HRIRs Identification With a Continuously Rotating Speaker Array
Jiang et al.	2014	Binaural deep neural network classification for reverberant speech segregation.
Orr et al.	2023	Localizing concurrent sound sources with binaural microphones: A simulation study
Chern et al.	2023	Voice direction-of-arrival conversion