May, 2017 - Google Patents
Robust speech dereverberation with a neural network-based post-filter that exploits multi-conditional training of binaural cuesMay, 2017
View PDF- Document ID
- 17725321489061608866
- Author
- May T
- Publication year
- Publication venue
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
External Links
Snippet
This study presents an algorithm for binaural speech dereverberation based on the supervised learning of short-term binaural cues. The proposed system combined a delay- and-sum beamformer with a neural network-based post-filter that attenuated reverberant …
- 230000001537 neural 0 title abstract description 43
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2225/00—Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
- H04R2225/43—Signal processing in hearing aids to enhance the speech intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/018—Audio watermarking, i.e. embedding inaudible data in the audio signal
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/70—Adaptation of deaf aid to hearing loss, e.g. initial electronic fitting
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/15—Aspects of sound capture and related signal processing for recording or reproduction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Hadad et al. | The binaural LCMV beamformer and its performance analysis | |
| Jeub et al. | Model-based dereverberation preserving binaural cues | |
| Lai et al. | A deep denoising autoencoder approach to improving the intelligibility of vocoded speech in cochlear implant simulation | |
| US8139787B2 (en) | Method and device for binaural signal enhancement | |
| Marquardt et al. | Interaural coherence preservation in multi-channel Wiener filtering-based noise reduction for binaural hearing aids | |
| Hummersone et al. | Dynamic precedence effect modeling for source separation in reverberant environments | |
| Yousefian et al. | A dual-microphone algorithm that can cope with competing-talker scenarios | |
| Yousefian et al. | A coherence-based noise reduction algorithm for binaural hearing aids | |
| CN119342151B (en) | Method for eliminating echo of intercom system | |
| Dadvar et al. | Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target | |
| Kamkar-Parsi et al. | Instantaneous binaural target PSD estimation for hearing aid noise reduction in complex acoustic environments | |
| May | Robust speech dereverberation with a neural network-based post-filter that exploits multi-conditional training of binaural cues | |
| Hoang et al. | Multichannel speech enhancement with own voice-based interfering speech suppression for hearing assistive devices | |
| Gaultier et al. | Recovering speech intelligibility with deep learning and multiple microphones in noisy-reverberant situations for people using cochlear implants | |
| Ohlenbusch et al. | Multi-microphone noise data augmentation for DNN-based own voice reconstruction for hearables in noisy environments | |
| Marquardt et al. | Noise power spectral density estimation for binaural noise reduction exploiting direction of arrival estimates | |
| Woodruff et al. | Sequential organization of speech in reverberant environments by integrating monaural grouping and binaural localization | |
| Pan et al. | A single-input/binaural-output antiphasic speech enhancement method for speech intelligibility improvement | |
| Zohourian et al. | GSC-based binaural speaker separation preserving spatial cues | |
| Aroudi et al. | Cognitive-driven convolutional beamforming using EEG-based auditory attention decoding | |
| Yousefian et al. | A hybrid coherence model for noise reduction in reverberant environments | |
| Tsilfidis et al. | Binaural dereverberation | |
| Jeub | Joint dereverberation and noise reduction for binaural hearing aids and mobile phones | |
| Sørensen et al. | Semi-non-intrusive objective intelligibility measure using spatial filtering in hearing aids | |
| D'Olne et al. | Model-based beamforming for wearable microphone arrays |