EP3036739A1 - Estimation améliorée d'au moins un signal cible - Google Patents
Estimation améliorée d'au moins un signal cibleInfo
- Publication number
- EP3036739A1 EP3036739A1 EP14753072.9A EP14753072A EP3036739A1 EP 3036739 A1 EP3036739 A1 EP 3036739A1 EP 14753072 A EP14753072 A EP 14753072A EP 3036739 A1 EP3036739 A1 EP 3036739A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- phase
- estimation
- amplitude
- discrete
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000001228 spectrum Methods 0.000 claims abstract description 73
- 238000000034 method Methods 0.000 claims abstract description 55
- 230000001131 transforming effect Effects 0.000 claims abstract description 4
- 230000009466 transformation Effects 0.000 claims description 8
- 230000001419 dependent effect Effects 0.000 claims description 5
- 239000011159 matrix material Substances 0.000 claims description 4
- 238000001914 filtration Methods 0.000 claims description 2
- 230000005236 sound signal Effects 0.000 claims description 2
- 238000011426 transformation method Methods 0.000 claims description 2
- 238000010586 diagram Methods 0.000 description 18
- 238000000926 separation method Methods 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 238000004422 calculation algorithm Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 230000003595 spectral effect Effects 0.000 description 6
- 238000011161 development Methods 0.000 description 5
- 238000013459 approach Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000006872 improvement Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002411 adverse Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000002715 modification method Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 230000009044 synergistic interaction Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
Definitions
- the present invention relates to a method for estimation of at least one signal of interest from at least one discrete-time signal. Furthermore, the invention relates to a device for carrying out a method according to the invention.
- the target signal being a continuous-time signal to be processed by digital data processing
- the target signal is usually measured and transformed into a quantized discrete-time signal.
- the quantized discrete-time signal comprises the desired target signal and, as an undesirable effect, also noise sources and/ or other signals.
- phase-unaware amplitude estimation methods separate the target signal (signal of interest) from noise and/ or other signals by applying a frequency-dependent gain function (mask) on observed noisy amplitude spectrum.
- gain functions are Wiener filter (as softmask) and binary mask. Noise reduction capability obtained by conventional methods is limited since they only modify the amplitude or phase individually.
- phase estimation being an amplitude-aware phase estimation using an input signal to obtain an estimated phase spectrum of the at least one signal of interest, wherein the result of the amplitude estimation of the preceding step b) is used as an input signal;
- step d) performing an amplitude estimation on the complex spectrum, said amplitude estimation being a phase-aware amplitude estimation using the result of the phase estimation of step c) to obtain an enhanced complex spectrum of the at least one signal of interest.
- the signal of interest can be any target signal included in the at least one discrete- time signal.
- This approach according to the invention pushes the limits of the conventional speech enhancement methods by introducing synergistic interaction between amplitude estimation and phase estimation stages.
- the amplitude estimation on the complex spectrum in step b) is performed irrespectively of the phase spectrum of the signal of interest.
- Such an amplitude estimation forms a "phase- unaware amplitude estimation” referring to any conventional amplitude estimation method which is performed irrespectively of the phase spectrum of at least the at least one signal of interest.
- the result of the phase-unaware amplitude estimation (according to step b)) is only used as an input signal in step c) if step c) follows in direct order to step b).
- Step c) and d) according to the invention are based upon certain conditions.
- Amplitude- aware phase estimation according to step c) requires at least an estimation of the amplitude spectrum (signal magnitude spectrum) of the at least one signal of interest and preferably estimation of the amplitude spectrum of the vector sum of all other sources.
- the amplitude-aware phase estimator (and a method for amplitude-aware phase estimation) has been derived from the non-patent literature "Phase estimation for signal reconstruction in single-channel speech separation" (P. Mowlaee, R. Saiedi, and R.
- phase-aware amplitude estimator in particular, a method for phase-aware amplitude estimation
- the at least one discrete-time signal can be of any source or the interaction of sources, for example a noisy speech signal or the superposition of several speech and/ or noise signals.
- the at least one discrete-time signal could be obtained by observation, measurement and/ or calculation.
- the amplitude estimation on the complex spectrum in step b) is performed by a frequency-dependent time-frequency mask, in particular by Wiener filtering of the complex spectrum.
- step d) the steps c) and d) are repeated iteratively wherein as input signal in the repeated step c) the result of the phase-aware amplitude estimation of the preceding step d) is used. Therefore, a loop is closed by a feedback from an output of a phase-aware amplitude estimator to an input of an amplitude- aware phase estimator.
- Previous iterative speech enhancement methods aimed at improving the spectral amplitude estimates only within the iterations. In these methods, neither a phase enhancement stage nor a combined synthesis-analysis stage was used within the feedback loop for the iterations. Instead, a noisy phase was exploited in signal reconstruction.
- the consistency between the phase and amplitude estimations of the enhanced complex spectrum (as the enhanced complex spectrum provides an input in step c)) of the at least one signal of interest is monitored according to the following comparison criterion: , X being a matrix composed of a complex time-frequency representation of the enhanced complex spectrum, wherein at least one quality index is established to measure inconsistency of complex time-frequency representations denoted by , obtained at each loop-iteration, and defined for the i-th loop iteration as follows: wherein the i-th quality index is calculated by
- the threshold allows to measure the decrease of the amount of inconsistency observed between the phase and amplitude estimates obtained by each iteration before feedback to the phase estimation. Therefore, the iterations can be stopped when the quality index gets lower than the pre-defined threshold allowing fast and efficient processing of the trans
- the threshold is which is especially suited as a comparison
- the iterations are stopped at least after a predefined number of iterations, in particular after five, six or seven iterations. This allows to limit the number of iterations and therefore to limit the computing efforts. It is also possible to relate to the above mentioned comparison criterion and to limit the number of iterations in case does not fall below the threshold
- the transformation method in step a) is a spectro-temporal transformation, in particular STFT, Wavelet or sinusoidal signal modelling.
- STFT spectro-temporal transformation
- wavelet wavelet
- sinusoidal model it is possible to reduce the dimensionality of signal feature to a great level, hence less computational effort.
- replacing STFT with other time-frequency transformations inducting Wavelet or Wigner ville time-frequency representation for amplitude estimation or Chirplet signal transformation and complex Wavelet transformation for representation of both amplitude and phase enables to have a non-uniform resolution to analyze different frequency bands, which is advantageous when applied to audio or speech signals.
- the at least one discrete-time signal can be a bio-medical, radar, image or video signal.
- the complex time-frequency representation X can be either one- or multidimensional.
- the matrix X is typically composed of frames as rows and frequency bins as its columns (rows are often larger than the columns).
- speech signals it is composed of a wide dynamic range of values (80 dB).
- the dynamic range is often much lower as the signal is sparse in time-frequency.
- the method according to the invention is especially suited if the at least one discrete-time signal is an audio signal.
- the at least one discrete-time signal comprises at least one speech signal.
- the speech signal can be the target signal, which is true for many everyday life speech-related applications, in particular for automatic speech recognition (ASR) applications.
- ASR automatic speech recognition
- the at least one discrete-time signal can comprise two or even more speech signals.
- the target signal is represented by one speech signal to be separated from the accompanying signals.
- the at least one discrete-time signal can be derived from a single channel signal.
- Single channel signals are common in many applications as they rely on a signal obtained by a single microphone (cell phones, headsets,...) but usually do provide less information than multi channel devices. Therefore, the requirements on signal enhancement are very high, especially in case of single channel speech separation (SCSS). Since the method according to the invention provides strongly enhanced target signals it is exceptionally suited to be applied on single channel signals.
- SCSS single channel speech separation
- the at least one discrete-time signal can be derived from a multi channel signal.
- An additional information provided by at least a second measurement device can be processed to give an extraordinary accurate estimation of the at least one target signal.
- the method according to the invention is also suited to estimate two or more target signals.
- the aim to provide an enhanced method for the estimation of at least one target signal is achieved by means of a device for carrying out a method according to any of the preceding claims.
- Fig. 1 a schematic block-diagram illustrating the object of the invention
- Fig.2 an exemplary schematic block-diagram of a state of the art multi-sensor speech enhancement method
- FIG.4 an exemplary schematic block-diagram of a variant of the invention
- FIG.5 an exemplary schematic block-diagram of another variant of the invention
- Fig. 6 a schematic block-diagram of the block "New Enhancement" according to the invention shown in fig.4 and 5
- Fig. 7 a detailed schematic block-diagram of the stopping rule block shown in fig.4,
- Fig.8 a schematic block-diagram of a typical single-channel separation algorithm based on amplitude estimation on a complex spectrum of a noisy signal described the cited non-patent literature non-patent literature "Phase estimation for signal reconstruction in single-channel speech separation'" in detail, said amplitude estimation being performed phase-unaware,
- Fig.9 a schematic block-diagram of amplitude-aware phase estimation described in the non-patent literature "Phase estimation for signal reconstruction in single-channel speech separation'" in detail,
- Fig. 10 two schematic block-diagrams of two different single-channel speech separation algorithms described in the cited non-patent literature "On phase importance in parameter estimation in single-channel speech enhancement" in detail.
- Fig. 1 shows a schematic block-diagram illustrating the object of the invention.
- y(t) which includes for example two different signals and (and/ or correspondingly it is an object of the invention to separate the signals by providing an enhanced complex spectrum (see
- the symbol index t refers to continuous time and n refers to discrete time domain) of the at least one signal of interest . This allows to provide an estimate and/ or of the signals (and/ or correspondingly Assuming that
- the signal is a signal of interest and the signal represents for example interfering
- a typical approach to estimate the signal of interest consists of transforming the continuous-time signal y(t) into a quantized discrete-time signal y(n) by applying an analog-to- digital converter 1 on the continuous-time signal y(t).
- a signal estimation device 2 processes the discrete time signal y(n) using a priori information to provide an estimate of at least the signal of interest. In the given example an estimate of the signal representing noise is provided as well.
- amplitude-aware phase estimation refers to the task of phase estimation given the input noisy data as well as an estimation of the amplitude spectrum of the signal of interest which can be provided by a conventional phase-unaware amplitude estimation method (an example for a phase-unaware amplitude estimation method is denoted by block C shown in Figure 4).
- Fig. 2 shows an exemplary schematic block-diagram of a state of the art multi-sensor speech enhancement method (which can be applied by a signal estimation device 2 according to Fig. 1) to be applied on M discrete-time signals exploited from a number of M sensors, said speech enhancement method composed of three stages, i.e. analysis, modification and synthesis.
- the analysis stage might consist in different signal representations including short-time Fourier transformation (STFT), Sinusoidal modeling, polyphase filter banks, Mel- frequency Cepstral analysis and/ or any other suitable transformation applicable on at least one discrete-time signal.
- STFT short-time Fourier transformation
- Sinusoidal modeling Sinusoidal modeling
- polyphase filter banks Polyphase filter banks
- Mel- frequency Cepstral analysis Mel- frequency Cepstral analysis and/ or any other suitable transformation applicable on at least one discrete-time signal.
- the discrete-time signals exploited from the number of M sensors are therefore transformed in a complex format providing amplitude and phase parts of the signals.
- the analysis stage is required to decompose the complex signals into a number of N different frequency channels, hence a product of N x M samples are provided for the modification stage.
- the output of the analysis stage by the complex spectrum representation can be exemplified as follows.
- the modification stage known from the state of the art can be performed in two ways: a) amplitude enhancement, in which any frequency- dependent gain function as amplitude estimator (e.g. Wiener filter as a common choice) is employed together with a noise estimator given either by a reference microphone or a noise tracking method while the noisy phase is directly copied to reconstruct the enhanced signal, or b) phase enhancement; in which the noisy phase is often directly copied to synthesize enhanced output signal.
- the synthesis stage is applied on the resulting N x M samples of the modification stage to reconstruct enhanced signals, in particular enhanced speech signals.
- Fig. 3 exemplifies state-of-the-art modification stage of Fig. 2 (if not stated otherwise in the description of the figures, same reference signs describe same features).
- the M discrete-time signals exploited from the number of M sensors are analyzed in block A, providing N x M samples in a complex format as described in Fig. 2.
- block 3 the amplitude part contained in the complex format of the samples is exploited (block 4 exploits the phase part contained in the complex format of the samples).
- the samples are processed through amplitude or phase enhancement stages, wherein the amplitude enhancement stage is provided with a noise estimate, and finally synthesized in block S to provide an enhanced signal, in particular an enhanced speech signal.
- the modifications of the samples can be categorized in four different groups.
- a first group provides an estimate for the clean speech spectral amplitude based on a noise estimate from a noise tracker or a reference sensor and a speech estimate using a decision- directed method (see US 2009/0163168 Al and Y. Ephraim and D. Malah, "Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator", IEEE Trans. Acoust., Speech, Signal Processing, vol. 32. no. 6, pp. 1109-1121, Dec 1984).
- the noisy phase is directly used unaltered when reconstructing a time-domain enhanced speech signal at an output.
- This group can be represented in Fig. 3 by an amplitude switch ASW being in a position F2 and a phase switch PSW being in a position P3.
- a second group (ASW in position F2 and PSW in position P4) refers to phase enhancement only methods.
- non-patent literature see the cited non-patent literature xx and "On phase importance in parameter estimation in single-channel speech enhancement" suggested to employ Griffin and Lim iterations to estimate the signal phase for signal reconstruction given the Wiener filtered amplitude spectrum, using synthesis-analysis in iterations.
- a third group (ASW in position PI and PSW in position 4) refers to phase enhancement only methods used with the noisy amplitude. The phase estimation often requires strong assumptions knowing exact onsets and fundamental frequency of clean signals, in particular speech signals and previous frame phase values.
- a fourth group (ASW in position F2 and PSW in position P4, but in contrast to the second group no iterations) refers to a method assuming a clean spectral amplitude is available and spectral amplitude is estimated in a phase-aware way in an open loop configuration.
- Block A and block S represent analysis and synthesis blocks as described in Fig. 2 and 3, wherein block A is provided with at least one discrete-time signal y(n).
- an enhanced signal modification method according to the invention is provided, which is described in detail in Fig. 6.
- a block "New Enhancement” is provided with a noise estimate, N x M samples, and, depending on the switching position of a loop switch LSW,
- the conventional enhancement block C represents any phase-unaware amplitude estimator or phase-unaware amplitude estimation methods (or any amplitude estimation method performed irrespectively of the phase spectrum of at least the signal of interest which separates the signal of interest from noise and/or other signals for example by applying a frequency-dependent gain function (mask) on observed noisy amplitude spectrum. Examples for such gain functions are Wiener filter (as softmask) and binary mask. Noise reduction capability obtained by conventional methods is limited since they only modify the amplitude or phase individually.
- the block C and the block "New Enhancement" is provided with a noise estimate.
- Block C performs an amplitude estimation on the complex spectrum to obtain an estimated amplitude spectrum of the at
- At least one signal of interest (preferably from noise and/ or other signals as well).
- the method according to the invention comprises a step (b) including performing a phase- unaware amplitude estimation on the complex spectrum to obtain an estimated amplitude spectrum (see output P1 of the block C) of the at least one signal of interest
- the conventional method (phase- unaware amplitude estimation) applied within block C enhances the amplitude only, providing an input signal representing an initial amplitude estimate of the signal required for phase estimation in the following step.
- the amplitude-aware phase estimation requires such initially enhanced amplitude signal of interest.
- Fig. 4 shows a block “stopping rule” (stopping criterion), which provides a criterion to stop the feedback loop.
- the block “new enhancement” is first provided with the amplitude estimate of the complex spectrum of the conventional block C.
- the output of the block “New Enhancement” can be looped back as an input signal (the input signal can be in complex format) for the block “New Enhancement” in a following iteration
- the block “New Enhancement” is described in more detail in Fig. 6 and provides an enhanced complex spectrum of the at least one signal of interest (and correspondingly of which can be used to reconstruct an estimate of the signal of interest
- Fig. 5 shows an exemplary schematic block-diagram of another variant of the invention, wherein the feedback loop differs from the variant shown in Fig.4.
- the output of the block "New Enhancement” is synthesized in Block S and analyzed in a following analysis block A, before being looped back as an input signal to the block "New Enhancement", provided that the loop switch LSW being in position P2.
- X being a matrix composed of a complex time-frequency representation of the enhanced complex spectrum, wherein at least one quality index is established to measure inconsistency of complex time-frequency representations denoted by , obtained by each loop-iteration, and defined for the i-th loop iteration as follows: wherein the i-th quality index is calculated by
- the threshold is preferably
- Fig. 6 shows a schematic block-diagram of the block “New Enhancement” according to the invention shown in Fig.4 and 5.
- two blocks are shown processing the N x M samples described in the preceding figures.
- a block "amplitude-aware phase estimation'' performs a phase estimation on the complex spectrum said phase estimation
- the block "amplitude-aware phase estimation'" provides an enhanced phase estimation of the at least one signal of interest (preferably, an enhanced phase estimation of the noise or any other signal as well) to a following block "phase-aware amplitude estimator".
- phase-aware amplitude estimator an amplitude estimation on the complex spectrum is performed, said amplitude estimation being a phase-aware amplitude estimation using the result of the phase estimation of the block “amplitude-aware phase estimation” to obtain an enhanced complex spectrum of the at least one signal of interest
- Fig. 7 shows a detailed schematic block-diagram of the block “stopping rule” (stopping criterion) shown in Fig.4.
- a block “consistency check” is provided with
- a certain number of iterations for example five, six or seven iterations
- a inconsistency criterion can be applied (for example the quality index mentioned above) limiting the number of iterations.
- Fig. 8 shows a schematic block-diagram of a typical single-channel separation algorithm based on amplitude estimation on a complex spectrum of a noisy signal described in appendix the cited non-patent literature "Phase estimation for signal reconstruction in single- channel speech separation", said amplitude estimation being performed phase-unaware.
- a signal y comprises two signals si and s2 to be separated, wherein amplitude estimates a noisy phase signal is applied to reconstruct the clean signals and
- Fig. 9 shows a schematic block-diagram of amplitude-aware phase estimation.
- the signal reconstruction is provided with phase information corresponding to the signals respectively.
- An minimum mean square error (MMSE) phase estimation block is shown, which is provided with the amplitude estimates and the signal y, said phase estimation being amplitude-aware and providing phase signals
- Fig. 10 shows two schematic block-diagrams of two different single-channel speech separa- tion algorithms.
- a typical method to estimate a clean speech amplitude (corresponding to of Fig. 8 and 9) is shown in (a), wherein the amplitude estimation (within the block "Gain function'") is not provided with any phase information.
- the amplitude estimation is referred to as being phase-unaware.
- phase-aware amplitude estimation and amplitude-aware phase estimation do not relate to speech signals only.
- phase-aware amplitude estimation and amplitude-aware phase estimation is applicable to a plurality of signals and the speech signals described in the non-patent literature "On phase importance in parameter estimation in single-channel speech enhancement” and “Phase estimation for signal reconstruction in single-channel speech separation” just represent one utilization of phase-aware amplitude estimation and amplitude-aware phase example, respectively. Therefore, the invention is not limited to the examples given in this specification and can be adjusted in any manner known to a person skilled in the art.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
Abstract
La présente invention concerne un procédé d'estimation d'au moins un signal d'intérêt (s1(t), s1(n)) à partir d'au moins un signal temporel discret (y(n)), le procédé comprenant les étapes suivantes : a) la transformation du au moins un signal temporel discret (y(n)) dans un domaine de fréquence afin d'obtenir un spectre (I) complexe du au moins un signal temporel discret (y(n)); b) la réalisation d'une estimation d'amplitude non avertie de la phase sur le spectre (I) complexe, afin d'obtenir un spectre d'amplitude estimé du au moins un signal d'intérêt (s1(t), s1(n)); (c) la réalisation d'une estimation de phase sur le spectre (I) complexe, ladite estimation de phase étant une estimation de phase avertie de l'amplitude à l'aide d'un signal d'entrée (sin(n)) pour obtenir un spectre de phase estimé du au moins un signal d'intérêt (s1(t), s1(n)), le résultat de l'estimation d'amplitude de l'étape b) précédente étant utilisé comme signal d'entrée (sin(n)); d) la réalisation d'une estimation d'amplitude sur le spectre (I) complexe, l'estimation d'amplitude étant une estimation d'amplitude avertie de la phase à l'aide du résultat de l'estimation de phase de l'étape c) pour obtenir un spectre (II) complexe amélioré du au moins un signal d'intérêt (s1(t), s1(n)).
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP14753072.9A EP3036739A1 (fr) | 2013-08-23 | 2014-08-19 | Estimation améliorée d'au moins un signal cible |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP13181563.1A EP2840570A1 (fr) | 2013-08-23 | 2013-08-23 | Estimation améliorée d'au moins un signal cible |
| PCT/EP2014/067667 WO2015024940A1 (fr) | 2013-08-23 | 2014-08-19 | Estimation améliorée d'au moins un signal cible |
| EP14753072.9A EP3036739A1 (fr) | 2013-08-23 | 2014-08-19 | Estimation améliorée d'au moins un signal cible |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP3036739A1 true EP3036739A1 (fr) | 2016-06-29 |
Family
ID=49115345
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP13181563.1A Withdrawn EP2840570A1 (fr) | 2013-08-23 | 2013-08-23 | Estimation améliorée d'au moins un signal cible |
| EP14753072.9A Withdrawn EP3036739A1 (fr) | 2013-08-23 | 2014-08-19 | Estimation améliorée d'au moins un signal cible |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP13181563.1A Withdrawn EP2840570A1 (fr) | 2013-08-23 | 2013-08-23 | Estimation améliorée d'au moins un signal cible |
Country Status (2)
| Country | Link |
|---|---|
| EP (2) | EP2840570A1 (fr) |
| WO (1) | WO2015024940A1 (fr) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP4165633B1 (fr) | 2020-06-11 | 2025-01-08 | Dolby Laboratories Licensing Corporation | Procédés, appareil et systèmes pour la détection et l'extraction de sources sonores de sous-bande identifiables spatialement |
| WO2021252795A2 (fr) | 2020-06-11 | 2021-12-16 | Dolby Laboratories Licensing Corporation | Optimisation perceptuelle d'amplitude et de phase pour des systèmes de séparation de source de temps-fréquence et de masque logiciel |
| CN113903355B (zh) * | 2021-12-09 | 2022-03-01 | 北京世纪好未来教育科技有限公司 | 语音获取方法、装置、电子设备及存储介质 |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006114102A1 (fr) * | 2005-04-26 | 2006-11-02 | Aalborg Universitet | Initialisation efficace d’une estimation iterative de parametres |
| US7492814B1 (en) * | 2005-06-09 | 2009-02-17 | The U.S. Government As Represented By The Director Of The National Security Agency | Method of removing noise and interference from signal using peak picking |
-
2013
- 2013-08-23 EP EP13181563.1A patent/EP2840570A1/fr not_active Withdrawn
-
2014
- 2014-08-19 EP EP14753072.9A patent/EP3036739A1/fr not_active Withdrawn
- 2014-08-19 WO PCT/EP2014/067667 patent/WO2015024940A1/fr not_active Ceased
Non-Patent Citations (1)
| Title |
|---|
| See references of WO2015024940A1 * |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2015024940A1 (fr) | 2015-02-26 |
| EP2840570A1 (fr) | 2015-02-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9666183B2 (en) | Deep neural net based filter prediction for audio event classification and extraction | |
| US8467538B2 (en) | Dereverberation apparatus, dereverberation method, dereverberation program, and recording medium | |
| JP3154487B2 (ja) | 音声認識の際の雑音のロバストネスを改善するためにスペクトル的推定を行う方法 | |
| JP2005518118A (ja) | 周波数解析のためのフィルタセット | |
| JP7486266B2 (ja) | 深層フィルタを決定するための方法および装置 | |
| CN113611321B (zh) | 一种语音增强方法及系统 | |
| CN109671447A (zh) | 一种双通道欠定卷积混叠信号盲分离方法 | |
| JP2016143042A (ja) | 雑音除去装置及び雑音除去プログラム | |
| Do et al. | Speech Separation in the Frequency Domain with Autoencoder. | |
| EP3036739A1 (fr) | Estimation améliorée d'au moins un signal cible | |
| Dumortier et al. | Blind RT60 estimation robust across room sizes and source distances | |
| CN113345465A (zh) | 语音分离方法、装置、设备及计算机可读存储介质 | |
| Yoshioka et al. | Dereverberation by using time-variant nature of speech production system | |
| Agcaer et al. | Optimization of amplitude modulation features for low-resource acoustic scene classification | |
| KR101096091B1 (ko) | 음성 분리 장치 및 이를 이용한 단일 채널 음성 분리 방법 | |
| Malek | Blind compensation of memoryless nonlinear distortions in sparse signals | |
| CN107919136B (zh) | 一种基于高斯混合模型的数字语音采样频率估计方法 | |
| Singh et al. | Speech enhancement for Punjabi language using deep neural network | |
| Gui et al. | Adaptive subband Wiener filtering for speech enhancement using critical-band gammatone filterbank | |
| Buragohain et al. | Single Channel Speech Enhancement System using Convolutional Neural Network based Autoencoder for Noisy Environments | |
| Shimauchi et al. | Accurate adaptive filtering in square-root Hann windowed short-time Fourier transform domain | |
| Hepsiba et al. | Computational intelligence for speech enhancement using deep neural network | |
| US20240363133A1 (en) | Noise suppression model using gated linear units | |
| US20240363132A1 (en) | High-performance small-footprint ai-based noise suppression model | |
| RU2788939C1 (ru) | Способ и устройство для определения глубокого фильтра |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20160314 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
| AX | Request for extension of the european patent |
Extension state: BA ME |
|
| DAX | Request for extension of the european patent (deleted) | ||
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
| 18D | Application deemed to be withdrawn |
Effective date: 20161012 |