JP2002528778A

JP2002528778A - Method and apparatus for determining parameters of a technical system

Info

Publication number: JP2002528778A
Application number: JP2000578968A
Authority: JP
Inventors: オブラドヴィックドラガン
Original assignee: Siemens AG
Current assignee: Siemens AG
Priority date: 1998-10-27
Filing date: 1999-10-14
Publication date: 2002-09-03
Also published as: WO2000025489A1; EP1125405A1

Abstract

(57)【要約】技術システムのパラメータをもとめるための方法が決定される。この方法によって出力信号が重畳された統計的に互いに独立した入力信号の集合からもとめられる。出力信号の統計的独立性が最大化されるようにパラメータがもとめられる。 (57) [Summary] A method for determining parameters of a technical system is determined. In this way, the output signals are determined from a superimposed set of statistically independent input signals. The parameters are determined so that the statistical independence of the output signal is maximized.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】本発明は、技術システムのパラメータをもとめるための方法ならびに装置に関
する。The present invention relates to a method and a device for determining the parameters of a technical system.

【０００２】信号のマルチチャネル伝送及びマルチチャネル受信において、例えば頻繁に信
号/画像間の干渉が発生する。典型的な例はこの場合音声信号と雑音との混合で
あり、これは遠隔通信及びビデオ会議において大きな問題となりうる。従って、
本発明は例えばオリジナル音声信号を再現するための信号分離の分野に関する。In multi-channel transmission and multi-channel reception of signals, for example, signal / image interference frequently occurs. A typical example is a mixture of voice signals and noise in this case, which can be a major problem in telecommunications and video conferencing. Therefore,
The invention relates to the field of signal separation, for example to reproduce the original audio signal.

【０００３】混合信号からのソース信号の分離のための典型的な公知の技術は信号の時間平
均又はフィルタリングに基づく。しかしながら、これは計算コストの点で欠点を
生じる。[0003] Typical known techniques for separating a source signal from a mixed signal are based on time averaging or filtering of the signal. However, this has disadvantages in terms of computational costs.

【０００４】いわゆるブラインドチャネル等化（伝送チャネルに関する予めの知識なしの信
号等化）に基づく方法も公知である。しかし、この方法は、常に例えばソース信
号の統計的分布に関する知識のような、ソース信号に関する一定の知識を必要と
する。[0004] Methods based on so-called blind channel equalization (signal equalization without prior knowledge of the transmission channel) are also known. However, this method always requires certain knowledge about the source signal, for example knowledge about the statistical distribution of the source signal.

【０００５】信号分離の問題は、例えば二人の話者が彼らから離れている２つのマイクロフ
ォンによって会話し、この結果、各マイクロフォンが二人の話者の発話した信号
の混合信号を受信する場合にも生じる。従って、混合信号、すなわち重畳された
入力信号の集合を再び分離する、という問題が存在する。L. Molgedy, H. G. Sc
huster, "Separation of a Mixture of Independent Signals using Time-Delay
ed Correlations", Phys. Ref. Left. 72,3634 (1994) からこの場合次のような
方法が公知である：ｎ個の重畳されかつ相関されたソース信号（入力信号）を分
離し、同時にソース強度の混合係数を決定する問題は、固有値問題に還元される
。この固有値問題では、同時に２つの対称的なｎｘｎ行列が対角化されなければ
ならない。行列要素は測定可能な時間遅延された相関関数である。この固有値問
題の解決はニューラルネットワークによって行われ、ニューロン間の横方向抑制
相互作用（lateral inhibiting interaction）に対する学習規則がリャプノフ関
数によって決定され、このリャプノフ関数の最小値がこの問題の（縮退した）解
を供給する。[0005] The problem of signal separation is, for example, when two speakers speak with two microphones remote from them, so that each microphone receives a mixed signal of the signals spoken by the two speakers. Also occurs. Therefore, there is a problem that the mixed signal, that is, the set of superimposed input signals is separated again. L. Molgedy, HG Sc
huster, "Separation of a Mixture of Independent Signals using Time-Delay
ed. Correlations ", Phys. Ref. Left. 72, 3634 (1994), in which case the following method is known: The n superimposed and correlated source signals (input signals) are separated and simultaneously sourced. The problem of determining the intensity mixing factor is reduced to an eigenvalue problem, in which two symmetric nxn matrices must be diagonalized at the same time, and the matrix elements have a measurable time-delayed correlation. The solution of this eigenvalue problem is performed by a neural network, the learning rule for lateral inhibiting interaction between neurons is determined by the Lyapunov function, and the minimum value of this Lyapunov function is Provide a (degenerate) solution.

【０００６】この方法は既に音響入力信号にも適用された（F. Ehlers, H.G.Schuster, "Bl
ind Separation of convolutive mixtures and an application in automatic s
peech recognition" IEEE Trans. Signal Proc. (1997) を参照）。[0006] This method has already been applied to acoustic input signals (F. Ehlers, HGSchuster, "Bl
ind Separation of convolutive mixtures and an application in automatic s
peech recognition "IEEE Trans. Signal Proc. (1997)).

【０００７】ＤＥ１９５３１３８８Ｃ１から未知の信号（ブラインドチャネル）の非線形混
合に対する信号分離方法及び信号分離装置が公知であり、これを図３に図示する
。[0007] From DE 195 31 388 C1, a signal separation method and a signal separation device for the non-linear mixing of unknown signals (blind channels) are known and are shown in FIG.

【０００８】このドイツ特許は、Ｍ個の未知のソース信号Ｘ１、Ｘ２の非線形重畳からなる
混合信号の分離を処理する。場合によってはあり得る妨害信号を含むＭ個のソー
ス信号Ｘ１、Ｘ２のＮ（Ｎ≧Ｍ）個の異なる混合が信号評価装置に供給され、こ
の信号評価装置はこれらの信号の統計的分析によって非線形伝達係数（nonlinea
r transmission factor）を決定し、これらの算出された係数によってこの混合
を取り消し、この結果、信号分離装置のＮ個の出力ができるだけ近似的に重畳の
ないＭ個のソース信号を含む。これによって非線形混合信号の処理が可能となる
。この場合、非線形とは、ソース信号Ｘ１、Ｘ２が未知の非線形システムＧによ
って混合されることを意味する。この未知のシステムＧはいわゆるヴォルテラ（
Volterra）級数によって記述され、信号分離装置Ｇ^- ^１はこのヴォルテラ級数の
係数を決定する。これらの係数の知識によって混合信号の分離が可能である。さ
らに、これらの係数は信号源の位置又は速度検出のための後続の分析に利用でき
る。This German patent deals with the separation of a mixed signal consisting of a non-linear superposition of M unknown source signals X1, X2. N (N≥M) different mixtures of the M source signals X1, X2, possibly including interfering signals, are fed to a signal estimator, which is non-linear by statistical analysis of these signals. Transfer coefficient (nonlinea
r transmission factor) and cancels out this mixing by these calculated coefficients, so that the N outputs of the signal separation device contain the M source signals as close as possible without overlap. This allows processing of non-linear mixed signals. In this case, non-linear means that the source signals X1, X2 are mixed by the unknown non-linear system G. This unknown system G is a so-called Volterra (
Described by a Volterra series, the signal separation device G ^- ¹ determines the coefficients of this Volterra series. The knowledge of these coefficients allows the separation of the mixed signal. In addition, these coefficients can be used for subsequent analysis for signal source position or velocity detection.

【０００９】この印刷物から公知の方法はこの場合実質的に２つのステップから成る：・第１に、非線形性の選択可能なグレードによって混合の際に一意的に決定さ
れる非線形方程式がスライディング時間窓に対して解かれ、これらの解が時間に
亘って平均される。この時間平均はこの公知の技術の主な欠点である。なぜなら
、この時間平均は計算コスト及び同時に計算時間を上昇させるからである。The method known from this print consists in this case essentially of two steps: firstly, the nonlinear equation uniquely determined during mixing by the selectable grade of nonlinearity has a sliding time window And these solutions are averaged over time. This time averaging is a major drawback of this known technique. This is because this time average increases the calculation cost and at the same time the calculation time.

【００１０】第２に、推定された出力信号の十分に多数の異なるキュムラントから形成され
るポテンシャルが最小化され、このポテンシャルの計算のために必要な値は選択
可能な長さのスライディング時間窓から由来する。この場合、混合システムは非
常にゆっくりと変化し、この変化はもとめられる混合係数の計算の際に無視でき
ると仮定する。このドイツ特許によれば、上記第２のステップの実施のためにコ
スト関数が構成され、最小化される。大域的な最小値に達する場合に、この場合
には伝達係数の最適値が見出された。Second, the potential formed from a sufficiently large number of different cumulants of the estimated output signal is minimized, and the value required for the calculation of this potential is determined from a sliding time window of selectable length. Comes from. In this case, it is assumed that the mixing system changes very slowly and this change is negligible in the calculation of the required mixing coefficients. According to this German patent, a cost function is constructed and minimized for performing the second step. When a global minimum is reached, in this case an optimum value of the transfer coefficient has been found.

【００１１】時間コストならびに計算コストの点で、ＤＥ１９５３１３８８Ｃ１に記述され
た方法は、第１の上記の方法ステップの最後に時間平均を実施するために不利で
ある。In terms of time costs as well as computational costs, the method described in DE 195 31 388 C1 is disadvantageous for performing a time averaging at the end of the first above-mentioned method step.

【００１２】従って、本発明の課題は、重畳された統計的に互いに独立した音響信号をきわ
めて低い計算コストで分離することができる方法及び装置を提供することである
。It is therefore an object of the present invention to provide a method and a device which enable the superimposed, statistically independent acoustic signals to be separated at a very low computational cost.

【００１３】上記課題は、独立請求項に記載の構成を有する方法ならびに装置によって解決
される。[0013] The object is achieved by a method and a device having the features of the independent claims.

【００１４】技術システムのパラメータをもとめるための方法において、この方法によって
、出力信号が重畳された統計的に互いに独立した入力信号の集合からもとめられ
、出力信号の統計的独立性が最大化されるようにパラメータがもとめられる。In a method for determining the parameters of a technical system, the method determines the output signal from a superimposed statistically independent set of input signals and maximizes the statistical independence of the output signal. Parameters are determined as follows.

【００１５】技術システムのパラメータをもとめるための装置において、この装置によって
、重畳された統計的に互いに独立した入力信号の集合から出力信号をもとめ、こ
の装置はプロセッサを有し、このプロセッサは、出力信号の統計的独立性が最大
化されるようにパラメータをもとめるように構成されている。In an apparatus for determining parameters of a technical system, the apparatus determines an output signal from a superimposed set of statistically independent input signals, the apparatus comprising a processor, the processor comprising an output It is configured to determine parameters such that the statistical independence of the signal is maximized.

【００１６】本発明の有利な実施形態は従属請求項から得られる。[0016] Advantageous embodiments of the invention result from the dependent claims.

【００１７】パラメータは有利には反復法においてもとめられる。The parameters are advantageously determined in an iterative manner.

【００１８】他の実施形態では、パラメータは分離行列の要素であり、この分離行列によっ
て重畳された入力信号の集合が乗算されるか又は畳み込まれ、これにより出力信
号が形成される。In another embodiment, the parameters are elements of a separation matrix, which is multiplied or convolved with the set of superimposed input signals, thereby forming an output signal.

【００１９】分離行列のパラメータの最適化は、有利には次のステップによって得られる：
すなわち、分離行列の固有値をもとめるために時間遅延された逆相関計算（time-delayed
decorrelation calculation）を繰り返し、分離行列の固有値をもとめ、これらの固有値に対して相互相関が最小値をとり
、キュムラント最小化を実施し、このキュムラント最小化に対するスタート値と
して前のステップでもとめられた固有値が使用される。The optimization of the parameters of the separation matrix is advantageously obtained by the following steps:
In other words, a time-delayed inverse correlation calculation (time-delayed
repeat the decorrelation calculation), find the eigenvalues of the separation matrix, take the minimum cross-correlation for these eigenvalues, implement cumulant minimization, and use the eigenvalues found in the previous step as the start value for this cumulant minimization Is used.

【００２０】キュムラント最小化は、例えばニューラルネットワークのトレーニングによっ
て行われるが、例えば最急降下法又はモンテカルロ・シミュレーションのような
他のあらゆる最小化技術によって行われる。[0020] Cumulant minimization is performed, for example, by training a neural network, but by any other minimization technique, such as, for example, steepest descent or Monte Carlo simulation.

【００２１】１つの実施形態では、分離行列のパラメータの最適化において分離行列の少な
くとも１つの対角線パラメータが予め設定された値にセットされ、これによって
大域的な最小値へのこの最小化プロセスの安定性が保証される。In one embodiment, in optimizing the parameters of the separation matrix, at least one diagonal parameter of the separation matrix is set to a preset value, thereby stabilizing the minimization process to a global minimum. Is guaranteed.

【００２２】分離行列は、有利には有限インパルス応答に限定される。すなわち、分離行列
の個々の要素を形成するためにはＦＩＲフィルタ（Finite Impulse Response）
が使用される。このＦＩＲフィルタは因果的ＦＩＲフィルタか又は非因果的ＦＩ
Ｒフィルタである。The separation matrix is advantageously limited to a finite impulse response. That is, an FIR filter (Finite Impulse Response) is used to form the individual elements of the separation matrix.
Is used. The FIR filter can be a causal FIR filter or an acausal FIR filter.
An R filter.

【００２３】さらに、分離行列はキュムラント最小化の間に有利には単位円への射影によっ
て安定化される。Furthermore, the separation matrix is stabilized during cumulant minimization, advantageously by projection onto the unit circle.

【００２４】これらの実施形態は方法にも装置にも当てはまる。装置においてはそれぞれ相
応の方法ステップを実施可能であるか又は実施するようにプロセッサが構成され
ている。These embodiments apply to the method and the device. In the device, a processor is respectively operable or configured to perform the corresponding method steps.

【００２５】本発明ならびに本発明の実施形態は、有利には重畳された統計的に互いに独立
な入力信号、とりわけ音響入力信号の分離のために使用される。The invention as well as embodiments of the invention are advantageously used for the separation of superimposed, statistically independent input signals, in particular acoustic input signals.

【００２６】本発明の方法ならびに装置は任意の個数の入力信号に対して適用可能である。The method and apparatus of the present invention are applicable to any number of input signals.

【００２７】本発明の更に別の利点、構成及び特性を次に実施例に基づいて図面の任意の図
を参照しつつ詳しく説明する。Further advantages, configurations and characteristics of the present invention will now be described in detail based on embodiments with reference to any of the drawings.

【００２８】図１は本発明の実施例による重畳された統計的に互いに独立した音響信号の分
離のためのシステムの適用事例を示す。FIG. 1 shows an application example of a system for separating superimposed statistically independent sound signals according to an embodiment of the present invention.

【００２９】図２は図１のシステムのシンボリックな概略図を示す。FIG. 2 shows a symbolic schematic of the system of FIG.

【００３０】図３は未知の信号の非線形混合に対する従来技術（ＤＥ１９５３１３８８Ｃ１
）から公知の信号分離装置を示す。FIG. 3 shows a prior art (DE19531388C1) for nonlinear mixing of unknown signals.
) Shows a known signal separation device.

【００３１】オリジナル音声信号を再現するために、信号の混合から、以降では入力信号と
呼ぶソース信号（オリジナル音声信号及び雑音）間の統計的独立性を利用し、信
号の混合を発生させたダイナミックシステムの逆プロセスが実質的に近似的にト
レーニング（学習）される。音声信号乃至は雑音信号の２つの異なる混合は、例
えば２つのマイクロフォン１、２（図１参照）によって得られる。これらの２つ
のマイクロフォン１、２は互いに間隔をおいて配置されている及び/又は正反対
の方向に配向されている。本発明の方法では、学習フェーズを開始するために、
すなわち学習フェーズに対するスタート値をもとめて設定するために、いわゆる
時間遅延逆相関（ＴＤＤ、time-delayed decorrelation）が使用される。これに
よって後ほど記述するキュムラント（cumulant）最小化のための計算コストが低
減され、局所的な最小値の危険性が減少される。In order to reproduce the original audio signal, from the signal mixture, a dynamic in which the signal mixture is generated by utilizing the statistical independence between the source signals (the original audio signal and noise), hereinafter referred to as an input signal, is used. The inverse process of the system is substantially trained. Two different mixtures of audio or noise signals are obtained, for example, by two microphones 1, 2 (see FIG. 1). These two microphones 1, 2 are spaced from each other and / or oriented in diametrically opposite directions. In the method of the present invention, to start the learning phase,
That is, a so-called time-delayed decorrelation (TDD) is used to obtain and set the start value for the learning phase. This reduces the computational cost for cumulant minimization, described later, and reduces the risk of local minima.

【００３２】図１は２つのマイクロフォン１、２を示し、これらの２つのマイクロフォン１
、２は第１の入力信号Ｚ１（ｔ）及び第２の入力信号Ｚ２（ｔ）を受信する。こ
れらの入力信号Ｚ１（ｔ）及びＺ２（ｔ）は互いにそれぞれ雑音をミックスされ
る。これは混合行列Ｓ（参照符号３）によって図１においてシンボリックに図示
されている。受信乃至は伝送の後で、重畳された統計的に互いに独立した入力信
号Ｚ１（ｔ）及びＺ２（ｔ）の集合Ｘ１（ｔ）及びＸ２（ｔ）が得られる。これ
らの信号は計算ユニット４に入力され、この計算ユニット４において実質的に２
つのステップが実施される。これら２つのステップはシンボリックに第１のステ
ップのための計算ユニットＢ（参照符号６）ならびに第２のステップのためのニ
ューラルネットワーク５によって図示されている。FIG. 1 shows two microphones 1, 2 and these two microphones 1
, 2 receive a first input signal Z1 (t) and a second input signal Z2 (t). These input signals Z1 (t) and Z2 (t) are each mixed with noise. This is symbolically illustrated in FIG. 1 by the mixing matrix S (reference 3). After reception or transmission, a set X1 (t) and X2 (t) of superimposed statistically independent input signals Z1 (t) and Z2 (t) is obtained. These signals are input to a calculation unit 4 where substantially two
Two steps are performed. These two steps are symbolically illustrated by a calculation unit B (reference numeral 6) for the first step and a neural network 5 for the second step.

【００３３】計算ユニット４によって２つの出力信号Ｙ１（ｔ）乃至はＹ２（ｔ）がもとめ
られる。これらの２つの出力信号Ｙ１（ｔ）乃至はＹ２（ｔ）は計算ユニット４
におけるパラメータの最適な調整によって近似的に入力信号Ｚ１（ｔ）乃至はＺ
２（ｔ）に等しい。言い換えれば、計算ユニット４で使用される行列のパラメー
タの最適な調整においてこの計算ユニット４によって実質的に行列Ｓ（参照符号
３）によりシンボリックに示されるダイナミックな混合プロセスの逆プロセスが
行われる。この実施例は分離行列のパラメータの調整の最適化プロセスに関する
。The calculation unit 4 determines two output signals Y 1 (t) to Y 2 (t). These two output signals Y1 (t) or Y2 (t) are calculated by the calculation unit 4
Input signals Z1 (t) through Z
2 (t). In other words, in the optimal adjustment of the parameters of the matrix used in the calculation unit 4, the calculation unit 4 performs the inverse process of the dynamic mixing process, which is substantially symbolically indicated by the matrix S (reference numeral 3). This embodiment relates to an optimization process for adjusting the parameters of the separation matrix.

【００３４】計算ユニット４における行列のパラメータは、計算ユニット４における行列プ
ロセスによって得られる出力信号Ｙ１（ｔ）、Ｙ２（ｔ）の間の統計的独立性を
最大化することによって最適化される。この目的のために、出力信号Ｙ１（ｔ）
乃至はＹ２（ｔ）は計算ユニット４にフィードバックされる（フィードバックル
ープ７乃至は８参照）。反復法によって、出力信号Ｙ１（ｔ）乃至はＹ２（ｔ）
の統計的独立性がこの反復の前のステップに比べて高まった（従って、この反復
は、後で記述されるコスト関数の大域的な最小値の方向において「適正な」方向
をとる）か否かがもとめられる。The parameters of the matrix in the calculation unit 4 are optimized by maximizing the statistical independence between the output signals Y 1 (t), Y 2 (t) obtained by the matrix process in the calculation unit 4. For this purpose, the output signal Y1 (t)
Alternatively, Y2 (t) is fed back to the calculation unit 4 (see feedback loops 7 and 8). By an iterative method, the output signals Y1 (t) or Y2 (t)
Has increased statistical independence compared to the previous step of this iteration (and thus this iteration takes the "right" direction in the direction of the global minimum of the cost function described later) I can ask for it.

【００３５】図２は図１のスキームの数学的な図示を示し、混合プロセス３は行列Ｓ（ｑ）
によって数学的に記述され、計算ユニット４によって行われるべき分離プロセス
が分離行列Ｍ（ｑ）によって示されている。FIG. 2 shows a mathematical illustration of the scheme of FIG. 1, where the mixing process 3 comprises a matrix S (q)
And the separation process to be performed by the calculation unit 4 is indicated by the separation matrix M (q).

【００３６】従って、図２には２次元においていわゆるマルチチャネルブラインドソース（
アプリオリな知識なしのマルチチャネルソース）の分離の問題が図示されている
。この場合、混合システムＳ（ｑ）、ただしｑは単位遅延、は安定しており、同
時に安定した反転（inversion）を有すると、すなわちこのシステムが最小フェ
ーズシステム（minimal phase system）であると仮定する。さらに、入力信号Ｚ
１（ｔ）及びＺ２（ｔ）（例えば音声乃至は雑音信号）は統計的に互いに独立し
ており、ガウス分布しないと仮定する。重畳された入力信号Ｚ１（ｔ）乃至はＺ
２（ｔ）の集合Ｘ１（ｔ）及びＸ２（ｔ）は、分離行列Ｍ（ｑ）を有する分離シ
ステムへの入力信号であり、この分離行列Ｍ（ｑ）のパラメータ（行列要素）は
出力信号Ｙ１（ｔ）とＹ２（ｔ）との間の統計的独立性の最大化を目指してトレ
ーニングされる。「トレーニング」とはこの場合例えばニューラルネットワーク
の良く知られた学習プロセスとして公知であり、このニューラルネットワークは
統計的独立性を最大化する技術の例として挙げられる。これは後で記述されるコ
スト関数Ｊ（Ｍ）の最小化によって行われる。Accordingly, FIG. 2 shows a so-called multi-channel blind source in two dimensions (
The problem of separation of multi-channel sources without a priori knowledge is illustrated. In this case, it is assumed that the mixing system S (q), where q is a unit delay, is stable and at the same time has a stable inversion, ie this system is a minimal phase system. . Further, the input signal Z
Assume that 1 (t) and Z2 (t) (eg, speech or noise signals) are statistically independent of each other and do not have a Gaussian distribution. The superimposed input signals Z1 (t) to Z1
The sets X1 (t) and X2 (t) of 2 (t) are input signals to a separation system having a separation matrix M (q), and the parameters (matrix elements) of the separation matrix M (q) are output signals. Trained to maximize statistical independence between Y1 (t) and Y2 (t). "Training" is in this case known, for example, as the well-known learning process of neural networks, which is mentioned as an example of a technique for maximizing statistical independence. This is done by minimizing a cost function J (M) described later.

【００３７】キュムラント・コスト関数が形成される。このキュムラント・コスト関数はキ
ュムラントオーダ２〜４の対角線キュムラント要素を最小化する：A cumulant cost function is formed. This cumulant cost function minimizes the diagonal cumulant elements of cumulant order 2-4:

【００３８】[0038]

【数１】 (Equation 1)

【００３９】この場合、混合行列Ｓ（ｑ）によるダイナミックな混合の次の観点が考慮される
：分離システムの安定性：これは、Ｍ（ｑ）が有限インパルス応答（ＦＩＲフィルタ）に限定される場合
に達成される。このＦＩＲシステムＭ（ｑ）の安定性はさらに学習フェーズの間
に単位円への射影が行われることによっても得られる。もしかすると存在しうる
Ｓ（ｑ）の反転の非因果性は、入力信号Ｘ（ｔ）の適当な時間シフト（遅延）に
よって補償されうる。In this case, the following aspects of the dynamic mixing with the mixing matrix S (q) are considered: Stability of the separation system: M (q) is limited to a finite impulse response (FIR filter) If achieved. The stability of the FIR system M (q) is also obtained by projecting onto the unit circle during the learning phase. The acausality of the inversion of S (q) that may be present can be compensated by a suitable time shift (delay) of the input signal X (t).

【００４０】分離された信号Ｙ（ｔ）の一意性：定常的な混合の場合には、オリジナルソース信号はスケーリングによって再現
される。ダイナミックな分離の場合には、分離された信号Ｙ（ｔ）の非一意性の
危険性がいやがうえにも大きくなる。Ｙ１（ｔ）及びＹ２（ｔ）が統計的に互い
に独立している場合には、これらの信号のいかなる線形フィルタリングされた修
正でも常に統計的に独立していることは明らかである。従って、この問題の内在
的な非一意性を低減するために付加的な情報が必要不可欠である。Uniqueness of the separated signal Y (t): In the case of stationary mixing, the original source signal is reproduced by scaling. In the case of dynamic separation, the risk of non-uniqueness of the separated signal Y (t) is even greater. If Y1 (t) and Y2 (t) are statistically independent of each other, it is clear that any linearly filtered modification of these signals is always statistically independent. Therefore, additional information is essential to reduce the inherent non-uniqueness of this problem.

【００４１】データのガウシアン変形：定常的なブラインドソース分離に対するキュムラントベースのアルゴリズムは
、有効なやり方で、出力信号Ｙ（ｔ）に相応して比較的高いオーダの対角線キュ
ムラントを最小化乃至は除去する。他方で、線形フィルタリングはデータのガウ
ス分布変形を引き起こし、このガウス分布変形において比較的高いオーダのキュ
ムラントは方向０に向かう。従って、これは、コスト関数が局所的最小値に到達
する境界解（Randloesung）に導き、所望の実際の分離（大域的最小値）が行わ
れない。この望ましくないケースを回避するために、分離伝達関数（分離行列）
Ｍ（ｑ）の構造が幾つかの限定を受けるようにする。上記の問題を避けるために
は、対角線要素のうちの少なくとも１つ（又は全て）が単位値にセットされるア
プローチを選択する：Ｍ１１（ｑ）＝１及びＭ２２（ｑ）＝１この仮定は、混合要素Ｓ１１（ｑ）及び/又はＳ２２（ｑ）が同様に単位値“
１”を有する場合に正確である。その他の場合には、Ｓ１１（ｑ）及び/又はＳ
２２（ｑ）が安定的な反転を有すると仮定する。このことはＭ（ｑ）の対角線要
素が単位値にスケーリングされることを可能にする。このアプローチは解の非一
意性を本質的に減少させ、さらに有効なやり方で出力信号の過度なガウス分布変
形の危険性を回避する。たとえ一見したところ上記のようなＭ（ｑ）の対角線要
素の限定が非常に制限的に作用しても、この仮定は実際のアプリケーションにお
いては通常は満たされる。典型的な例は、２つのマイクロフォンによるレコーデ
ィングに基づく音声信号からの雑音除去であり、この場合これらのマイクロフォ
ンが互いに空間的に分離されているか又は１つのマイクロフォンが話者の方向に
向けられており、他方でもう１つのマイクロフォンが逆方向に向けられており、
そのため話者とは反対方向の第２の信号が実質的にノイズ信号だけを含む。Gaussian Transformation of Data: A cumulant-based algorithm for stationary blind source separation effectively minimizes or eliminates diagonal cumulants of relatively high order corresponding to the output signal Y (t). . On the other hand, linear filtering causes a Gaussian deformation of the data, in which the higher order cumulants go in direction 0. Thus, this leads to a boundary solution (Randloesung) where the cost function reaches a local minimum and the desired actual separation (global minimum) is not performed. To avoid this undesirable case, the separation transfer function (separation matrix)
The structure of M (q) is subject to some limitations. To avoid the above problem, choose an approach where at least one (or all) of the diagonal elements is set to unit value: M11 (q) = 1 and M22 (q) = 1 Similarly, if the mixing element S11 (q) and / or S22 (q)
1 "is true. Otherwise, S11 (q) and / or S11 (q)
Assume that 22 (q) has a stable inversion. This allows the diagonal elements of M (q) to be scaled to unit values. This approach essentially reduces the non-uniqueness of the solution and avoids the risk of excessive Gaussian deformation of the output signal in a more efficient manner. Even though at first glance the diagonal element limitation of M (q) as described above works very restrictively, this assumption is usually satisfied in practical applications. A typical example is denoising from a speech signal based on recording with two microphones, where these microphones are spatially separated from each other or one microphone is oriented towards the speaker. , While the other microphone is pointed in the opposite direction,
Thus, the second signal in the opposite direction to the speaker contains substantially only the noise signal.

【００４２】キュムラントアプローチは、冒頭に挙げた論文 F. Ehlers, H.G.Schuster, "B
lind Separation of convolutive mixtures and an application in automatic
speech recognition" IEEE Trans. Signal Proc. (1997) で行われているような
対角線キュムラントの直接算出に基づく。しかし、これは内在的に数値解法が非
常に面倒であるという欠点を有する。従って、この最小化法の適当な初期設定を
使用する。最小化法のためのスタート値をもとめるために、本発明では時間遅延
逆相関（ＴＤＤ）の技術が２つの異なる時間遅延の同時的な逆相関に適用される
。このＴＤＤ技術は適当な行列固有値問題に還元される。既に述べたように、こ
のＴＤＤ技術は本発明において対角線（相互相関）キュムラント最小化問題の開
始のために使用される。[0042] The cumulant approach is described in the paper F. Ehlers, HGSchuster, "B
lind Separation of convolutive mixtures and an application in automatic
speech recognition "based on the direct calculation of diagonal cumulants as done in IEEE Trans. Signal Proc. (1997). However, this has the disadvantage that the numerical solution is inherently very cumbersome. In order to determine a starting value for the minimization method, the technique of time-delay inverse correlation (TDD) uses the technique of time-delay inverse correlation (TDD) for the simultaneous inverse correlation of two different time delays. The TDD technique is reduced to a suitable matrix eigenvalue problem, which, as already mentioned, is used in the present invention to start the diagonal (cross-correlation) cumulant minimization problem.

【００４３】要約すると、この方法は次の２つのステップに分けられる：１. 異なる遅延ペアに対する周波数領域における固有値問題に基づくＴＤＤ法
の繰り返し及び相互相関項が最小値を有する解の算出。In summary, the method can be divided into two steps: 1. Iteration of the TDD method based on eigenvalue problems in the frequency domain for different delay pairs and calculation of the solution whose cross-correlation term has a minimum.

【００４４】２. 上記のステップにおいて算出されたスタート値（ＦＩＲパラメータ）に基
づく対角線キュムラント最小化の開始（スタート）。2. Start of diagonal cumulant minimization (start) based on the start value (FIR parameter) calculated in the above steps.

【００４５】次にさらに幾つかの主特徴及び利点をもう一度まとめておく。Next, some more main features and advantages are summarized again.

【００４６】統計的独立性が要求されることを除いて信号特性のアプリオリな知識は必要な
い。No a priori knowledge of signal characteristics is required except that statistical independence is required.

【００４７】ダイナミックな分離システムの安定性はＦＩＲフィルタとしてこのダイナミッ
クな分離システムのコンポーネントを調整することによって保証される。The stability of the dynamic separation system is ensured by adjusting the components of this dynamic separation system as FIR filters.

【００４８】過度なガウス分布変形は次のアプローチによって回避される。すなわち、混合
伝達関数行列（分離行列）の要素のうちの少なくとも１つが単位値にセットされ
るか乃至は単位値にスケーリングされ得る、というアプローチによって回避され
る。Excessive Gaussian deformation is avoided by the following approach. That is, this is avoided by the approach that at least one of the elements of the mixed transfer function matrix (separation matrix) can be set to unit values or scaled to unit values.

【００４９】キュムラント最小化ステップ（ステップ２）は大きな計算コストを必要とする
ので、例えばニューラルネットワークの学習アルゴリズムはＴＤＤ法によって初
期設定される。Since the cumulant minimization step (step 2) requires a large calculation cost, for example, the learning algorithm of the neural network is initialized by the TDD method.

【００５０】以下に上述の実施例をコンピュータで実現するためのＭａｔｌａｂ，Ｖｅｒｓ
ｉｏｎ４又はＶｅｒｓｉｏｎのプログラムを提出する：Matlab, Vers for implementing the above-described embodiment by a computer will be described below.
Submit the ion4 or Version program:

【００５１】[0051]

【外１】 [Outside 1]

【００５２】[0052]

【外２】 [Outside 2]

[Brief description of the drawings]

【図１】本発明の実施例による重畳された統計的に互いに独立した音響信号の分離のた
めのシステムの適用事例を示す。FIG. 1 shows an application example of a system for separating superimposed statistically independent sound signals according to an embodiment of the present invention.

【図２】図１のシステムのシンボリックな概略図を示す。FIG. 2 shows a symbolic schematic of the system of FIG.

【図３】未知の信号の非線形混合に対する従来技術（ＤＥ１９５３１３８８Ｃ１）から
公知の信号分離装置を示す。FIG. 3 shows a signal separation device known from the prior art (DE19531388C1) for nonlinear mixing of unknown signals.

【符号の説明】１マイクロフォン２マイクロフォン３混合行列４計算ユニット５ニューラルネットワーク６計算ユニット７及び８フィードバックループ[Description of Signs] 1 microphone 2 microphone 3 mixing matrix 4 calculation unit 5 neural network 6 calculation unit 7 and 8 feedback loop

【手続補正書】特許協力条約第３４条補正の翻訳文提出書[Procedural Amendment] Submission of translation of Article 34 Amendment of the Patent Cooperation Treaty

【提出日】平成１２年１２月２２日（２０００．１２．２２）[Submission Date] December 22, 2000 (2000.12.22)

【手続補正１】[Procedure amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】特許請求の範囲[Correction target item name] Claims

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【特許請求の範囲】[Claims]

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/16 Ｇ１０Ｌ 9/00 Ｆ 21/02 ３０１ＡＨ０４Ｌ 25/03 3/00 ５３９ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 15/16 G10L 9/00 F 21/02 301A H04L 25/03 3/00 539

Claims

[Claims]

1. A method for determining a parameter of a technical system, comprising:
A parameter of a technical system, wherein the parameter is determined from a set of statistically independent input signals on which the output signal is superimposed by the method, and wherein the method is such that the statistical independence of the output signal is maximized. Way to find out.

2. The method of claim 1, wherein the parameters are determined in an iterative manner.

3. The method of claim 1, wherein the parameters are elements of a separation matrix, and the set of input signals superimposed by the separation matrix is multiplied to form an output signal.

4. The optimization of the parameters of the separation matrix is obtained by the following steps: a time-delayed inverse correlation calculation (dec is used to determine the eigenvalues of the separation matrix; 4. The method according to claim 3, wherein the cross-correlation takes the minimum value for the eigenvalue, performs cumulant minimization (5), and uses the eigenvalue determined in the previous step as a start value for the cumulant minimization. the method of.

5. The method according to claim 4, wherein the cumulant minimization is performed by training a neural network.

6. The method of claim 3, wherein at least one diagonal parameter of the separation matrix is set to a preset value in the optimization of the parameters of the separation matrix.
6. The method according to one of the five aspects.

7. The separation matrix according to claim 3, wherein the separation matrix is limited to a finite impulse response.
The method of claim 1.

8. The method according to claim 3, wherein the separation matrix is stabilized during cumulant minimization by projection onto a unit circle.

9. The method according to claim 1, which is used for separating superimposed, statistically independent input signals.

10. The method according to claim 1, which is used for separating superimposed, statistically independent audio input signals.

11. An apparatus for determining parameters of a technical system, comprising: determining an output signal from a set of statistically independent input signals superimposed by the apparatus; the apparatus comprising a processor, the processor comprising: An apparatus for determining a parameter of a technical system, wherein the apparatus is configured to determine the parameter such that statistical independence of an output signal is maximized.

12. The apparatus of claim 11, wherein the processor is configured to determine the parameters in an iterative manner.

13. The processor is configured as follows: the parameters are elements of a separation matrix, and the set of superimposed input signals is multiplied by the separation matrix, thereby forming an output signal. An apparatus according to claim 11 or claim 12.

14. The processor is constructed as follows: the optimization of the parameters of the separation matrix is obtained by the following steps: the inverse correlation time-delayed to determine the eigenvalues of said separation matrix. Calculation (time-del
ayed decorrelation calculation) (6) is repeated, the eigenvalue of the separation matrix is obtained, the cross-correlation takes the minimum value for the eigenvalue, cumulant minimization (5) is performed, and the start value for the cumulant minimization is obtained. 14. The apparatus according to claim 13, wherein eigenvalues determined in a previous step are used.

15. Apparatus according to claim 14, wherein the processor is configured as follows: cumulant minimization is performed by training a neural network (5).

16. The processor is configured as follows: at least one diagonal parameter of the separation matrix is set to a preset value in optimizing the parameters of the separation matrix. 1 out of 15
Item.

17. The apparatus according to claim 13, wherein the processor is configured as follows: the separation matrix is limited to a finite impulse response.

18. The processor according to claim 13, wherein the processor is configured as follows: the separation matrix is stabilized by projection onto the unit circle during cumulant minimization (5). Item.

19. The apparatus according to claim 11, which is used for separating superimposed statistically independent input signals.

20. Apparatus according to claim 11, which is used for separating superimposed, statistically independent audio input signals.