[go: up one dir, main page]

CN111837119A - A Sound Signal Separation Method Based on Semi-Non-Negative Matrix Decomposition - Google Patents

A Sound Signal Separation Method Based on Semi-Non-Negative Matrix Decomposition Download PDF

Info

Publication number
CN111837119A
CN111837119A CN201980012799.7A CN201980012799A CN111837119A CN 111837119 A CN111837119 A CN 111837119A CN 201980012799 A CN201980012799 A CN 201980012799A CN 111837119 A CN111837119 A CN 111837119A
Authority
CN
China
Prior art keywords
matrix
semi
coefficient
sound signal
negative
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201980012799.7A
Other languages
Chinese (zh)
Other versions
CN111837119B (en
Inventor
韩威
周松斌
刘伟鑫
李昌
刘忆森
邱泽帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Institute of Intelligent Manufacturing
Original Assignee
Guangdong Institute of Intelligent Manufacturing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Institute of Intelligent Manufacturing filed Critical Guangdong Institute of Intelligent Manufacturing
Publication of CN111837119A publication Critical patent/CN111837119A/en
Application granted granted Critical
Publication of CN111837119B publication Critical patent/CN111837119B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/14Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Algebra (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Stereophonic System (AREA)

Abstract

A method of sound signal separation with semi-non-negative matrix factorization, comprising: calculating a fourier transform result of the single-channel mixed sound signal and calculating a frequency spectrum thereof (S1); performing semi-nonnegative matrix factorization (S2) on the frequency spectrum of the mixed signal; obtaining respective initial spectra of the source signals from the spectra of the mixed signals (S3); performing semi-nonnegative matrix factorization (S4) on an initial spectrum of the source signal; obtaining a frequency spectrum of the source signal according to the frequency spectrum of the mixed signal, the characteristic matrix and the coefficient matrix corresponding to the frequency spectrum of the mixed signal, and the characteristic matrix and the coefficient matrix corresponding to the source signal (S5); obtaining a fourier transform result of the source signal from the fourier transform result of the mix signal and the frequency spectrum of the source signal (S6); the source signal is separated from the single-channel mixed sound signal by performing an inverse fourier transform on the fourier transform result of the source signal to obtain the source signal (S7). The method can be used for separating single-channel mixed sound signals formed by mixing source sound signals which are overlapped in frequency domain and are not necessarily independent.

Description

一种基于半非负矩阵分解的声音信号分离方法A Sound Signal Separation Method Based on Semi-Non-Negative Matrix Decomposition

技术领域technical field

本发明涉及单通道混合声音信号分离技术领域,具体涉及一种基于半非负矩阵分解的声音信号分离方法。The invention relates to the technical field of single-channel mixed sound signal separation, in particular to a sound signal separation method based on semi-non-negative matrix decomposition.

背景技术Background technique

声学技术已在医学诊断、产品质量检测、设备状态监测、机械性能试验、声学事件分类等领域被广泛的研究和应用。由于其应用环境的声场可能较为复杂,采集到的声音信号往往是目标声音和环境噪声的混合信号。因此,一般需要首先从混合声音信号中提取出目标声音以进行后续的信号处理和分析。此外,鉴于体积大小、设备造价、安装问题等条件的限制,可能只能安装一个声音传感器或者是最好安装一个声音传感器,这就要求从单一麦克风采集到的单通道混合声音信号中提取出目标声音信号。单通道混合声音信号分离是解决上述任务的常用方法。Acoustic technology has been widely researched and applied in the fields of medical diagnosis, product quality inspection, equipment condition monitoring, mechanical performance test, and acoustic event classification. Because the sound field of its application environment may be complex, the collected sound signal is often a mixed signal of target sound and environmental noise. Therefore, it is generally necessary to first extract the target sound from the mixed sound signal for subsequent signal processing and analysis. In addition, in view of the limitations of size, equipment cost, installation problems and other conditions, it is possible to install only one sound sensor or it is better to install one sound sensor, which requires extracting the target from the single-channel mixed sound signal collected by a single microphone sound signal. Single-channel mixed sound signal separation is a common method for solving the above tasks.

机械工业第三设计研究院申请的中国发明专利“基于Hilbert变换的欠定声音信号分离方法及装置”提出采用Hilbert变换进行欠定声音信号分离,没有说明是针对单通道混合声音信号,且其方法不适用于频域混叠的情况。中科院嘉兴中心微系统所分中心申请的中国发明专利“无线传感器网络中基于粒子滤波的多车辆声信号分离方法”以及山东大学申请的中国发明专利“基于联合近似对角化盲源分离算法的电力设备故障音检测方法”均是针对麦克风阵列采集的多通道混合声音信号进行声音分离。The Chinese invention patent "Method and Device for Separation of Underdetermined Sound Signals Based on Hilbert Transform" applied by the Third Design and Research Institute of Machinery Industry proposes to use Hilbert transform to separate underdetermined sound signals. Not suitable for frequency domain aliasing. The Chinese invention patent "Multi-vehicle acoustic signal separation method based on particle filtering in wireless sensor network" and the Chinese invention patent applied by Shandong University "Based on the joint approximate diagonalization blind source separation algorithm" "Equipment fault sound detection method" is to separate the sound for the multi-channel mixed sound signal collected by the microphone array.

安徽理工大学王康等人发表的论文“基于变分模态分解的单通道信号盲源分离方法”提出一种基于变分模态分解的单通道信号盲源分离方法:首先采用变分模态分解实现单通道观测信号的升维,并估计源信号数目,然后再进行信号的盲源分离。该方法涉及到升维操作(即将单通道信号映射为多通道信号),且不适用于频域混叠的情况。解放军信息工程大学郭一鸣等人发表的论文“基于SIC的单通道同频混合信号低复杂度盲分离算法”采用过采样构造多通道条件,进而构造出信道矩阵,利用连续干扰抵消算法实现单通道同频混合信号的盲分离。该方法同样涉及到将单通道信号映射为多通道信号后再进行信号分离,且其受时延差和接收信号过采样倍数影响较大,存在解调盲区。江南大学杨海兰等人发表的论文“单通道通信信号盲分离算法”给出了一种基于希尔伯特黄变换和独立分量分析的单通道通信信号盲源分离算法。该方法不适用于频域混叠的情况。解放军理工大学朱会杰等人发表的论文“基于移不变稀疏编码的单通道机械信号盲源分离”对特征反复出现的机械信号,提出一种基于移不变稀疏编码的单通道盲源分离方法,算法中将源信号看成多个基与系数的卷积,能够根据信号的统计分布,利用信号自身特征自适应地学习到匹配的基和稀疏的系数。该方法针对的是特征反复出现的机械信号。The paper "Blind Source Separation Method for Single-Channel Signals Based on Variational Mode Decomposition" published by Wang Kang et al. from Anhui University of Science and Technology proposed a blind source separation method for single-channel signals based on variational modal decomposition: first, the variational mode decomposition method was used. Decomposition realizes the dimension-raising of the single-channel observation signal, estimates the number of source signals, and then performs blind source separation of the signal. This method involves dimension-raising operations (that is, mapping a single-channel signal to a multi-channel signal), and is not suitable for the case of frequency domain aliasing. The paper "Single-channel same-frequency mixed signal low-complexity blind separation algorithm based on SIC" published by Guo Yiming et al. of PLA University of Information Engineering uses oversampling to construct multi-channel conditions, and then constructs a channel matrix, and uses continuous interference cancellation algorithm to achieve single-channel synchronization. Blind separation of frequency mixed signals. This method also involves mapping a single-channel signal into a multi-channel signal and then performing signal separation, which is greatly affected by the time delay difference and the oversampling multiple of the received signal, and there is a demodulation blind spot. The paper "Blind Separation Algorithm for Single-Channel Communication Signals" published by Yang Hailan et al. of Jiangnan University presents a blind source separation algorithm for single-channel communication signals based on Hilbert-Huang transform and independent component analysis. This method is not suitable for the case of frequency domain aliasing. The paper "Blind Source Separation of Single-Channel Mechanical Signals Based on Shift-Invariant Sparse Coding" published by Zhu Huijie et al. of PLA University of Science and Technology proposed a single-channel blind source separation method based on shift-invariant sparse coding for mechanical signals with recurring features. In the algorithm, the source signal is regarded as the convolution of multiple bases and coefficients, which can adaptively learn matching bases and sparse coefficients by using the signal's own characteristics according to the statistical distribution of the signal. This method targets mechanical signals that feature recurring patterns.

发明内容SUMMARY OF THE INVENTION

有鉴于此,有必要针对上述问题,提出一种基于半非负矩阵分解的声音信号分离方法。该方法首先根据源声音信号的主要频段从单通道混合声音信号的频谱中获得初始估计频谱,再基于半非负矩阵分解将源信号的初始估计频谱分解为初始特征矩阵和初始系数矩阵,利用源信号的系数矩阵之间的相关性得到源信号最终的特征矩阵和系数矩阵,从而获得源信号的频谱,再进行傅立叶逆变换获得源信号,最终实现单通道混合声音信号分离。In view of this, it is necessary to propose a sound signal separation method based on semi-non-negative matrix decomposition to solve the above problems. The method firstly obtains the initial estimated spectrum from the spectrum of the single-channel mixed sound signal according to the main frequency band of the source sound signal, and then decomposes the initial estimated spectrum of the source signal into an initial feature matrix and an initial coefficient matrix based on semi-nonnegative matrix decomposition. The correlation between the coefficient matrices of the signal obtains the final characteristic matrix and coefficient matrix of the source signal, so as to obtain the spectrum of the source signal, and then performs inverse Fourier transform to obtain the source signal, and finally realizes the separation of single-channel mixed sound signals.

为实现上述目的,本发明采取以下的技术方案:To achieve the above object, the present invention adopts the following technical solutions:

一种基于半非负矩阵分解的声音信号分离方法,包括如下步骤:A sound signal separation method based on semi-non-negative matrix decomposition, comprising the following steps:

S1、单通道混合声音信号S由若干独立的声音信号S1,S2,…,Sn混合而成,计算S的傅立叶变换结果F,根据F计算频谱X;S1. The single-channel mixed sound signal S is formed by mixing several independent sound signals S 1 , S 2 , ..., Sn , calculate the Fourier transform result F of S, and calculate the spectrum X according to F;

S2、对X进行半非负矩阵分解,得到特征矩阵W和系数矩阵H;S2. Perform semi-non-negative matrix decomposition on X to obtain feature matrix W and coefficient matrix H;

S3、根据X得到声音信号S1,S2,…,Sn各自的初始估计频谱

Figure BDA0002627289950000031
S3. Obtain the respective initial estimated spectrums of the sound signals S 1 , S 2 , ..., Sn according to X
Figure BDA0002627289950000031

S4、对

Figure BDA0002627289950000032
分别进行半非负矩阵分解,得到对应的特征矩阵
Figure BDA0002627289950000033
和系数矩阵
Figure BDA0002627289950000034
S4, yes
Figure BDA0002627289950000032
Perform semi-non-negative matrix decomposition respectively to get the corresponding eigenmatrix
Figure BDA0002627289950000033
and coefficient matrix
Figure BDA0002627289950000034

S5、根据X、W、H,以及

Figure BDA0002627289950000035
获得声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn;S5. According to X, W, H, and
Figure BDA0002627289950000035
Obtain the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,...,S n ;

S6、根据傅立叶变换结果F以及频谱X1,X2,…,Xn,获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,FnS6. According to the Fourier transform result F and the frequency spectrum X 1 , X 2 ,...,X n , obtain the respective Fourier transform results F 1 , F 2 ,..., F n of the sound signals S 1 , S 2 ,..., Sn ;

S7、分别对F1,F2,…,Fn进行傅立叶逆变换,得到声音信号S1,S2,…,Sn,从而实现从单通道混合声音信号S中分离出独立的声音信号S1,S2,…,Sn S7 . Perform inverse Fourier transform on F 1 , F 2 , . 1 , S 2 ,…,S n .

进一步地,S1中所述单通道混合声音信号,是指混合声音信号只由一个声音采集器采集而得。Further, the single-channel mixed sound signal in S1 means that the mixed sound signal is collected by only one sound collector.

进一步地,S1中所述单通道混合声音信号S由若干独立的声音信号S1,S2,…,Sn混合而成,其中n的数值为2或3。Further, the single-channel mixed sound signal S in S1 is formed by mixing several independent sound signals S 1 , S 2 , . . . , Sn , where the value of n is 2 or 3.

进一步地,S2中对X进行半非负矩阵分解,得到特征矩阵W和系数矩阵H,按如下步骤进行:Further, in S2, semi-non-negative matrix decomposition is performed on X to obtain a feature matrix W and a coefficient matrix H, and the steps are as follows:

S21、构造半非负矩阵分解的目标函数ΓS21. Construct the objective function Γ of the semi-nonnegative matrix factorization

Figure BDA0002627289950000041
Figure BDA0002627289950000041

S22、初始化系数矩阵H,其所有元素的值为(0,1)之间的随机数;S22. Initialize the coefficient matrix H, and the values of all its elements are random numbers between (0, 1);

S23、计算特征矩阵W的初始值为S23. Calculate the initial value of the feature matrix W as

W=XH(HTH)-1 (2)W=XH(H T H) -1 (2)

S24、将特征矩阵W和系数矩阵H交替迭代更新:先迭代更新一次W,然后迭代更新一次H,如此循环往复的先后迭代更新W和H;利用公式W=XH(HTH)-1 (3)迭代更新特征矩阵W中的元素,利用公式

Figure BDA0002627289950000042
迭代更新系数矩阵H中的元素;S24, iteratively update the feature matrix W and the coefficient matrix H alternately: first iteratively update W once, and then iteratively update H once, so that W and H are iteratively updated successively in a cycle; using the formula W=XH(H T H) -1 ( 3) Iteratively update the elements in the feature matrix W, using the formula
Figure BDA0002627289950000042
Iteratively update the elements in the coefficient matrix H;

S25、设定半非负矩阵分解的目标函数Γ的最小值Γmin,设定最大迭代次数Emax,每次迭代更新完成后计算目标函数Γ的值,当目标函数Γ的值小于Γmin或者迭代次数达到最大迭代次数Emax时,则停止迭代,得到最终的特征矩阵W和系数矩阵H;S25. Set the minimum value Γ min of the objective function Γ of the semi-nonnegative matrix decomposition, set the maximum number of iterations E max , and calculate the value of the objective function Γ after each iteration update is completed. When the value of the objective function Γ is less than Γ min or When the number of iterations reaches the maximum number of iterations E max , the iteration is stopped, and the final feature matrix W and coefficient matrix H are obtained;

于公式(1)、(2)、(3)和(4)中,

Figure BDA0002627289950000043
表示矩阵的Frobenius范数;X为频谱;W为特征矩阵;H为系数矩阵;HT为H的转置;(HTH)-1为HTH的逆矩阵;XT为X的转置;WT为W的转置;(XTW)+为XTW中的正值元素;(XTW)-为XTW中的负值元素;(WTW)+为WTW中的正值元素;(WTW)-为WTW中的负值元素。In formulas (1), (2), (3) and (4),
Figure BDA0002627289950000043
Represents the Frobenius norm of the matrix; X is the frequency spectrum; W is the characteristic matrix; H is the coefficient matrix; H T is the transpose of H; (H T H) -1 is the inverse matrix of H T H; Set; W T is the transpose of W; (X T W) + is the positive value element in X T W; (X T W) - is the negative value element in X T W; (W T W) + is W Positive -valued elements in TW; (W T W ) - are negative-valued elements in W TW .

进一步地,S3中根据X得到声音信号S1,S2,…,Sn各自的初始估计频谱

Figure BDA0002627289950000051
按如下步骤进行:Further, in S3, the respective initial estimated spectrums of the sound signals S 1 , S 2 ,..., Sn are obtained according to X
Figure BDA0002627289950000051
Proceed as follows:

S31、声音信号S1,S2,…,Sn各自的主要频段为f1,f2,…,fnS31. The respective main frequency bands of the sound signals S 1 , S 2 ,...,S n are f 1 , f 2 ,..., f n ;

S32、将X中对应于频段f1,f2,…,fn的部分作为声音信号S1,S2,…,Sn各自的初始估计频谱

Figure BDA0002627289950000052
S32. Use the part of X corresponding to the frequency bands f 1 , f 2 ,..., f n as the respective initial estimated spectrums of the sound signals S 1 , S 2 ,..., Sn
Figure BDA0002627289950000052

进一步地,S4对

Figure BDA0002627289950000053
分别进行半非负矩阵分解,得到对应的特征矩阵
Figure BDA0002627289950000054
和系数矩阵
Figure BDA0002627289950000055
按如下步骤进行:Further, S4 pairs
Figure BDA0002627289950000053
Perform semi-non-negative matrix decomposition respectively to get the corresponding eigenmatrix
Figure BDA0002627289950000054
and coefficient matrix
Figure BDA0002627289950000055
Proceed as follows:

S41、构造半非负矩阵分解的目标函数Γi(i=1,2,…,n)S41. Construct the objective function Γ i (i=1,2,...,n) of semi-nonnegative matrix decomposition

Figure BDA0002627289950000056
Figure BDA0002627289950000056

S42、初始化系数矩阵

Figure BDA0002627289950000057
其所有元素的值为(0,1)之间的随机数;S42, initialization coefficient matrix
Figure BDA0002627289950000057
The value of all its elements is a random number between (0,1);

S43、计算特征矩阵

Figure BDA0002627289950000058
的初始值为S43. Calculate the feature matrix
Figure BDA0002627289950000058
The initial value is

Figure BDA0002627289950000059
Figure BDA0002627289950000059

S44、将特征矩阵

Figure BDA00026272899500000510
和系数矩阵
Figure BDA00026272899500000511
交替迭代更新:先迭代更新一次
Figure BDA00026272899500000512
然后迭代更新一次
Figure BDA00026272899500000513
如此循环往复的先后迭代更新
Figure BDA00026272899500000514
Figure BDA00026272899500000515
利用公式
Figure BDA00026272899500000516
迭代更新特征矩阵
Figure BDA00026272899500000517
中的元素,利用公式
Figure BDA0002627289950000061
迭代更新系数矩阵
Figure BDA0002627289950000062
中的元素;S44, the feature matrix
Figure BDA00026272899500000510
and coefficient matrix
Figure BDA00026272899500000511
Alternate iterative update: first iterative update once
Figure BDA00026272899500000512
Then iteratively update once
Figure BDA00026272899500000513
Iterative update in this way
Figure BDA00026272899500000514
and
Figure BDA00026272899500000515
Use the formula
Figure BDA00026272899500000516
Iteratively update the feature matrix
Figure BDA00026272899500000517
elements in , using the formula
Figure BDA0002627289950000061
Iteratively update the coefficient matrix
Figure BDA0002627289950000062
elements in;

S45、设定半非负矩阵分解的目标函数Γi(i=1,2,…,n)的最小值

Figure BDA0002627289950000063
设定最大迭代次数
Figure BDA0002627289950000064
每次迭代更新完成后计算目标函数Γi的值,当目标函数Γi的值小于
Figure BDA0002627289950000065
或者迭代次数达到最大迭代次数
Figure BDA0002627289950000066
时,则停止迭代,得到最终的特征矩阵
Figure BDA0002627289950000067
和系数矩阵
Figure BDA0002627289950000068
S45. Set the minimum value of the objective function Γ i (i=1,2,...,n) of the semi-nonnegative matrix decomposition
Figure BDA0002627289950000063
Set the maximum number of iterations
Figure BDA0002627289950000064
Calculate the value of the objective function Γ i after each iteration update is completed, when the value of the objective function Γ i is less than
Figure BDA0002627289950000065
Or the number of iterations reaches the maximum number of iterations
Figure BDA0002627289950000066
When , stop the iteration and get the final feature matrix
Figure BDA0002627289950000067
and coefficient matrix
Figure BDA0002627289950000068

于公式(5)、(6)、(7)和(8)中,

Figure BDA0002627289950000069
表示矩阵的Frobenius范数;
Figure BDA00026272899500000610
表示初始估计频谱
Figure BDA00026272899500000611
表示特征矩阵表示系数矩阵
Figure BDA00026272899500000613
Figure BDA00026272899500000614
的转置;
Figure BDA00026272899500000615
Figure BDA00026272899500000616
的逆矩阵;
Figure BDA00026272899500000617
Figure BDA00026272899500000618
的转置;
Figure BDA00026272899500000619
Figure BDA00026272899500000620
的转置;
Figure BDA00026272899500000621
Figure BDA00026272899500000622
中的正值元素;
Figure BDA00026272899500000623
Figure BDA00026272899500000624
中的负值元素;
Figure BDA00026272899500000625
Figure BDA00026272899500000626
中的正值元素;
Figure BDA00026272899500000627
Figure BDA00026272899500000628
中的负值元素。In formulas (5), (6), (7) and (8),
Figure BDA0002627289950000069
represents the Frobenius norm of the matrix;
Figure BDA00026272899500000610
represents the initial estimated spectrum
Figure BDA00026272899500000611
Represents the feature matrix Represents the coefficient matrix
Figure BDA00026272899500000613
Figure BDA00026272899500000614
transpose of ;
Figure BDA00026272899500000615
for
Figure BDA00026272899500000616
The inverse matrix of ;
Figure BDA00026272899500000617
for
Figure BDA00026272899500000618
transpose of ;
Figure BDA00026272899500000619
for
Figure BDA00026272899500000620
transpose of ;
Figure BDA00026272899500000621
for
Figure BDA00026272899500000622
positive-valued elements in ;
Figure BDA00026272899500000623
for
Figure BDA00026272899500000624
Negative elements in ;
Figure BDA00026272899500000625
for
Figure BDA00026272899500000626
positive-valued elements in ;
Figure BDA00026272899500000627
for
Figure BDA00026272899500000628
Negative elements in .

进一步地,S5根据X、W、H,以及

Figure BDA00026272899500000629
获得声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn,按如下步骤进行:Further, S5 is based on X, W, H, and
Figure BDA00026272899500000629
To obtain the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,...,S n , follow the steps below:

S51、H=[h1;h2;…;hc],hk(k=1,2,…,c)是维度为M的行向量,用

Figure BDA00026272899500000630
表示系数矩阵
Figure BDA00026272899500000631
Figure BDA00026272899500000632
是维度为M的行向量;S51, H=[h 1 ; h 2 ;...;h c ], h k (k=1, 2,..., c) is a row vector of dimension M, which is represented by
Figure BDA00026272899500000630
Represents the coefficient matrix
Figure BDA00026272899500000631
Figure BDA00026272899500000632
is a row vector of dimension M;

S52、按照公式

Figure BDA00026272899500000633
计算
Figure BDA00026272899500000634
与hk的相关系数
Figure BDA00026272899500000635
因此可以计算出
Figure BDA00026272899500000636
的每一行与hk的相关系数
Figure BDA0002627289950000071
然后选出
Figure BDA0002627289950000072
中的最大值,记为最大相关系数
Figure BDA0002627289950000073
S52. According to the formula
Figure BDA00026272899500000633
calculate
Figure BDA00026272899500000634
Correlation coefficient with h k
Figure BDA00026272899500000635
So it can be calculated
Figure BDA00026272899500000636
The correlation coefficient of each row with h k
Figure BDA0002627289950000071
then choose
Figure BDA0002627289950000072
The maximum value in , denoted as the maximum correlation coefficient
Figure BDA0002627289950000073

S53、按照步骤S52,计算

Figure BDA0002627289950000074
的每一行与H的每一行的最大相关系数,记为
Figure BDA0002627289950000075
S53, according to step S52, calculate
Figure BDA0002627289950000074
The maximum correlation coefficient between each row of H and each row of H, denoted as
Figure BDA0002627289950000075

S54、按照步骤S53,分别计算

Figure BDA0002627289950000076
的每一行与H的每一行的最大相关系数,记为Q=[q1;q2;…;qn],qi(i=1,2,…,n)是维度为c的行向量,Q是一个n行c列的矩阵;S54, according to step S53, calculate respectively
Figure BDA0002627289950000076
The maximum correlation coefficient between each row of H and each row of H is denoted as Q=[q 1 ; q 2 ;...;q n ], q i (i=1,2,...,n) is a row vector of dimension c , Q is a matrix with n rows and c columns;

S55、声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn的特征矩阵为W1,W2,…,Wn,且系数矩阵为H1,H2,…,HnS55. The characteristic matrices of the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,..., Sn are W 1 , W 2 ,..., W n , and the coefficient matrices are H 1 , H 2 ,…,H n ;

S56、如果在Q的第k列(k=1,2,…,c)中,第i行(i=1,2,…,n)的数值最大,则系数矩阵H的第k行即hk分配给Hi,且特征矩阵W=[w1,w2,…,wn]的第k列即wk分配给WiS56. If in the kth column (k=1,2,...,c) of Q, the ith row (i=1,2,...,n) has the largest value, then the kth row of the coefficient matrix H is h k is assigned to H i , and the k- th column of the feature matrix W=[w 1 , w 2 , . . . , wn ], that is, w k is assigned to Wi ;

S57、计算Q中每一列的最大值,并按照步骤S56执行,从而得到特征矩阵W1,W2,…,Wn和系数矩阵H1,H2,…,HnS57, calculate the maximum value of each column in Q, and execute according to step S56, thereby obtaining characteristic matrices W 1 , W 2 ,...,W n and coefficient matrices H 1 ,H 2 ,...,H n ;

S58、按照公式Xi=WiHiT(i=1,2,…,n)(10)计算得到声音信号S1,S2,…,Sn各自的频谱X1,X2,…,XnS58, according to the formula X i =W i H iT (i=1,2,...,n) (10), calculate and obtain the respective frequency spectra X 1 , X 2 ,..., of the sound signals S 1 , S 2 ,...,S n X n ;

于公式(9)和(10)中,hk T为hk的转置;HiT为Hi的转置;Xi(i=1,2,…,n)表示频谱X1,X2,…,Xn;Wi(i=1,2,…,n)表示特征矩阵W1,W2,…,Wn;Hi(i=1,2,…,n)表示系数矩阵H1,H2,…,HnIn formulas (9) and (10), h k T is the transposition of h k ; H iT is the transposition of H i ; X i (i=1,2,...,n) represents the spectrum X 1 , X 2 ,...,X n ; Wi ( i =1,2,...,n) represents the characteristic matrix W 1 ,W 2 ,...,W n ;H i (i=1,2,...,n) represents the coefficient matrix H 1 , H 2 ,…,H n .

进一步地,S6中根据傅立叶变换结果F以及频谱X1,X2,…,Xn,获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn,按如下步骤进行:Further, in S6, according to the Fourier transform result F and the frequency spectrum X 1 , X 2 ,...,X n , the respective Fourier transform results F 1 , F 2 ,...,F of the sound signals S 1 , S 2 ,...,S n are obtained n , proceed as follows:

S61、用Fi(i=1,2,…,n)表示声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn

Figure BDA0002627289950000081
是p行q列矩阵,初始化Fi为零矩阵; S61 . Use F i ( i = 1 , 2 , .
Figure BDA0002627289950000081
is a matrix with p rows and q columns, and initializes F i as a zero matrix;

S62、F={Frk}p×q是一个p行q列的矩阵,用Xi(i=1,2,…,n)表示声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn

Figure BDA0002627289950000082
是p行q列的矩阵;S62. F = { F rk } p×q is a matrix with p rows and q columns, and X i (i=1, 2 , . X 1 , X 2 ,...,X n ,
Figure BDA0002627289950000082
is a matrix with p rows and q columns;

S63、比较X1,X2,…,Xn各自的第r行(r=1,2,…,p)第k(k=1,2,…,q)列元素的大小,如果Xi(i=1,2,…,n)中的第r行第k列元素最大,则

Figure BDA0002627289950000083
S63. Compare the sizes of the elements in the rth row (r=1, 2,...,p) of the kth (k=1, 2,...,q) column of each of X 1 , X 2 ,..., X n , if X i (i=1,2,...,n) where the element in the rth row and the kth column is the largest, then
Figure BDA0002627289950000083

S64、按照步骤S63执行,遍历X1,X2,…,Xn中所有元素,则得到

Figure BDA0002627289950000084
即获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn。S64. Execute according to step S63, traverse all elements in X 1 , X 2 ,..., X n , to obtain
Figure BDA0002627289950000084
That is , the Fourier transform results F 1 , F 2 , . . . , F n of the sound signals S 1 , S 2 , .

本发明的有益效果为:The beneficial effects of the present invention are:

本发明能用于分离由频域重叠、相互不一定独立的源声音信号混合而成的单通道混合声音信号。The present invention can be used to separate single-channel mixed sound signals which are mixed from source sound signals that overlap in frequency domain and are not necessarily independent of each other.

附图说明Description of drawings

为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative effort.

图1为本发明的一种基于半非负矩阵分解的声音信号分离方法的工作流程图;Fig. 1 is the working flow chart of a kind of sound signal separation method based on semi-non-negative matrix decomposition of the present invention;

图2为本发明的原始心音信号图;Fig. 2 is the original heart sound signal diagram of the present invention;

图3为本发明的原始肺音信号图;Fig. 3 is the original lung sound signal diagram of the present invention;

图4为本发明的原始心音信号和原始肺音信号的混合信号图;Fig. 4 is the mixed signal diagram of original heart sound signal and original lung sound signal of the present invention;

图5为本发明的分离出的信号与原始信号对比图。FIG. 5 is a comparison diagram of the separated signal of the present invention and the original signal.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚,下面将结合本发明实施例,对本发明的技术方案作进一步清楚、完整地描述。需要说明的是,所描述的实施例仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be further clearly and completely described below with reference to the embodiments of the present invention. It should be noted that the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

实施例Example

如图1所示,本发明提供一种基于半非负矩阵分解的声音信号分离方法包括以下步骤:As shown in Figure 1, the present invention provides a method for separating sound signals based on semi-non-negative matrix decomposition, comprising the following steps:

S1、单通道混合声音信号S由若干独立的声音信号S1,S2,…,Sn混合而成,计算S的傅立叶变换结果F,根据F计算频谱X;S1. The single-channel mixed sound signal S is formed by mixing several independent sound signals S 1 , S 2 , ..., Sn , calculate the Fourier transform result F of S, and calculate the spectrum X according to F;

S2、对X进行半非负矩阵分解,得到特征矩阵W和系数矩阵H;S2. Perform semi-non-negative matrix decomposition on X to obtain feature matrix W and coefficient matrix H;

S3、根据X得到声音信号S1,S2,…,Sn各自的初始估计频谱

Figure BDA0002627289950000091
S3. Obtain the respective initial estimated spectrums of the sound signals S 1 , S 2 , ..., Sn according to X
Figure BDA0002627289950000091

S4、对

Figure BDA0002627289950000092
分别进行半非负矩阵分解,得到对应的特征矩阵
Figure BDA0002627289950000093
和系数矩阵
Figure BDA0002627289950000094
S4, yes
Figure BDA0002627289950000092
Perform semi-non-negative matrix decomposition respectively to get the corresponding eigenmatrix
Figure BDA0002627289950000093
and coefficient matrix
Figure BDA0002627289950000094

S5、根据X、W、H,以及

Figure BDA0002627289950000095
获得声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn;S5. According to X, W, H, and
Figure BDA0002627289950000095
Obtain the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,...,S n ;

S6、根据傅立叶变换结果F以及频谱X1,X2,…,Xn,获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,FnS6. According to the Fourier transform result F and the frequency spectrum X 1 , X 2 ,...,X n , obtain the respective Fourier transform results F 1 , F 2 ,..., F n of the sound signals S 1 , S 2 ,..., Sn ;

S7、分别对F1,F2,…,Fn进行傅立叶逆变换,得到声音信号S1,S2,…,Sn,从而实现从单通道混合声音信号S中分离出独立的声音信号S1,S2,…,Sn S7 . Perform inverse Fourier transform on F 1 , F 2 , . 1 , S 2 ,…,S n .

于本实施例中,S1中所述单通道混合声音信号,是指混合声音信号只由一个声音采集器采集而得。In this embodiment, the single-channel mixed sound signal in S1 means that the mixed sound signal is collected by only one sound collector.

于本实施例中,S1中所述单通道混合声音信号S由若干独立的声音信号S1,S2,…,Sn混合而成,其中n的数值一般为2或3。In this embodiment, the single-channel mixed sound signal S in S1 is formed by mixing a plurality of independent sound signals S 1 , S 2 , . . . , Sn , where the value of n is generally 2 or 3.

于本实施例中,S2中对X进行半非负矩阵分解,得到特征矩阵W和系数矩阵H,按如下步骤进行:In this embodiment, the semi-non-negative matrix decomposition is performed on X in S2 to obtain the feature matrix W and the coefficient matrix H, and the steps are as follows:

S21、构造半非负矩阵分解的目标函数ΓS21. Construct the objective function Γ of the semi-nonnegative matrix factorization

Figure BDA0002627289950000101
Figure BDA0002627289950000101

S22、初始化系数矩阵H,其所有元素的值为(0,1)之间的随机数;S22. Initialize the coefficient matrix H, and the values of all its elements are random numbers between (0, 1);

S23、计算特征矩阵W的初始值为S23. Calculate the initial value of the feature matrix W as

W=XH(HTH)-1 (2)W=XH(H T H) -1 (2)

S24、将特征矩阵W和系数矩阵H交替迭代更新:先迭代更新一次W,然后迭代更新一次H,如此循环往复的先后迭代更新W和H;利用公式W=XH(HTH)-1 (3)迭代更新特征矩阵W中的元素,利用公式

Figure BDA0002627289950000102
迭代更新系数矩阵H中的元素;S24, iteratively update the feature matrix W and the coefficient matrix H alternately: first iteratively update W once, and then iteratively update H once, so that W and H are iteratively updated successively in a cycle; using the formula W=XH(H T H) -1 ( 3) Iteratively update the elements in the feature matrix W, using the formula
Figure BDA0002627289950000102
Iteratively update the elements in the coefficient matrix H;

S25、设定半非负矩阵分解的目标函数Γ的最小值Γmin,设定最大迭代次数Emax,每次迭代更新完成后计算目标函数Γ的值,当目标函数Γ的值小于Γmin或者迭代次数达到最大迭代次数Emax时,则停止迭代,得到最终的特征矩阵W和系数矩阵H。S25. Set the minimum value Γ min of the objective function Γ of the semi-nonnegative matrix decomposition, set the maximum number of iterations E max , and calculate the value of the objective function Γ after each iteration update is completed. When the value of the objective function Γ is less than Γ min or When the number of iterations reaches the maximum number of iterations E max , the iteration is stopped, and the final feature matrix W and coefficient matrix H are obtained.

于公式(1)、(2)、(3)和(4)中,

Figure BDA0002627289950000111
表示矩阵的Frobenius范数;X为频谱;W为特征矩阵;H为系数矩阵;HT为H的转置;(HTH)-1为HTH的逆矩阵;XT为X的转置;WT为W的转置;(XTW)+为XTW中的正值元素;(XTW)-为XTW中的负值元素;(WTW)+为WTW中的正值元素;(WTW)-为WTW中的负值元素。In formulas (1), (2), (3) and (4),
Figure BDA0002627289950000111
Represents the Frobenius norm of the matrix; X is the frequency spectrum; W is the characteristic matrix; H is the coefficient matrix; H T is the transpose of H; (H T H) -1 is the inverse matrix of H T H; Set; W T is the transpose of W; (X T W) + is the positive value element in X T W; (X T W) - is the negative value element in X T W; (W T W) + is W Positive -valued elements in TW; (W T W ) - are negative-valued elements in W TW .

于本实施例中,S3中根据X得到声音信号S1,S2,…,Sn各自的初始估计频谱

Figure BDA0002627289950000112
按如下步骤进行:In this embodiment, in S3, the respective initial estimated spectrums of the sound signals S 1 , S 2 , ..., Sn are obtained according to X
Figure BDA0002627289950000112
Proceed as follows:

S31、声音信号S1,S2,…,Sn各自的主要频段为f1,f2,…,fnS31. The respective main frequency bands of the sound signals S 1 , S 2 ,...,S n are f 1 , f 2 ,..., f n ;

S32、将X中对应于频段f1,f2,…,fn的部分作为声音信号S1,S2,…,Sn各自的初始估计频谱

Figure BDA0002627289950000113
S32. Use the part of X corresponding to the frequency bands f 1 , f 2 ,..., f n as the respective initial estimated spectrums of the sound signals S 1 , S 2 ,..., Sn
Figure BDA0002627289950000113

于本实施例中,S4中对

Figure BDA0002627289950000114
分别进行半非负矩阵分解,得到对应的特征矩阵
Figure BDA0002627289950000115
和系数矩阵
Figure BDA0002627289950000116
按如下步骤进行:In this embodiment, in S4, the
Figure BDA0002627289950000114
Perform semi-non-negative matrix decomposition respectively to get the corresponding eigenmatrix
Figure BDA0002627289950000115
and coefficient matrix
Figure BDA0002627289950000116
Proceed as follows:

S41、构造半非负矩阵分解的目标函数Γi(i=1,2,…,n)S41. Construct the objective function Γ i (i=1,2,...,n) of semi-nonnegative matrix decomposition

Figure BDA0002627289950000117
Figure BDA0002627289950000117

S42、初始化系数矩阵

Figure BDA0002627289950000118
其所有元素的值为(0,1)之间的随机数;S42, initialization coefficient matrix
Figure BDA0002627289950000118
The value of all its elements is a random number between (0,1);

S43、计算特征矩阵

Figure BDA0002627289950000121
的初始值为S43. Calculate the feature matrix
Figure BDA0002627289950000121
The initial value is

Figure BDA0002627289950000122
Figure BDA0002627289950000122

S44、将特征矩阵

Figure BDA0002627289950000123
和系数矩阵
Figure BDA0002627289950000124
交替迭代更新:先迭代更新一次
Figure BDA0002627289950000125
然后迭代更新一次
Figure BDA0002627289950000126
如此循环往复的先后迭代更新
Figure BDA0002627289950000127
Figure BDA0002627289950000128
利用公式
Figure BDA0002627289950000129
迭代更新特征矩阵
Figure BDA00026272899500001210
中的元素,利用公式
Figure BDA00026272899500001211
迭代更新系数矩阵
Figure BDA00026272899500001212
中的元素;S44, the feature matrix
Figure BDA0002627289950000123
and coefficient matrix
Figure BDA0002627289950000124
Alternate iterative update: first iterative update once
Figure BDA0002627289950000125
Then iteratively update once
Figure BDA0002627289950000126
Iterative update in this way
Figure BDA0002627289950000127
and
Figure BDA0002627289950000128
Use the formula
Figure BDA0002627289950000129
Iteratively update the feature matrix
Figure BDA00026272899500001210
elements in , using the formula
Figure BDA00026272899500001211
Iteratively update the coefficient matrix
Figure BDA00026272899500001212
elements in;

S45、设定半非负矩阵分解的目标函数Γi(i=1,2,…,n)的最小值

Figure BDA00026272899500001213
设定最大迭代次数
Figure BDA00026272899500001214
每次迭代更新完成后计算目标函数Γi的值,当目标函数Γi的值小于
Figure BDA00026272899500001215
或者迭代次数达到最大迭代次数
Figure BDA00026272899500001216
时,则停止迭代,得到最终的特征矩阵
Figure BDA00026272899500001217
和系数矩阵
Figure BDA00026272899500001218
S45. Set the minimum value of the objective function Γ i (i=1,2,...,n) of the semi-nonnegative matrix decomposition
Figure BDA00026272899500001213
Set the maximum number of iterations
Figure BDA00026272899500001214
Calculate the value of the objective function Γ i after each iteration update is completed, when the value of the objective function Γ i is less than
Figure BDA00026272899500001215
Or the number of iterations reaches the maximum number of iterations
Figure BDA00026272899500001216
When , stop the iteration and get the final feature matrix
Figure BDA00026272899500001217
and coefficient matrix
Figure BDA00026272899500001218

于公式(5)、(6)、(7)和(8)中,

Figure BDA00026272899500001219
表示矩阵的Frobenius范数;
Figure BDA00026272899500001220
表示初始估计频谱
Figure BDA00026272899500001221
表示特征矩阵
Figure BDA00026272899500001222
表示系数矩阵
Figure BDA00026272899500001223
Figure BDA00026272899500001224
Figure BDA00026272899500001225
的转置;
Figure BDA00026272899500001226
Figure BDA00026272899500001227
的逆矩阵;
Figure BDA00026272899500001228
Figure BDA00026272899500001229
的转置;
Figure BDA00026272899500001230
Figure BDA00026272899500001231
的转置;
Figure BDA00026272899500001232
Figure BDA00026272899500001233
中的正值元素;
Figure BDA00026272899500001234
Figure BDA00026272899500001235
中的负值元素;
Figure BDA00026272899500001236
Figure BDA00026272899500001237
中的正值元素;
Figure BDA00026272899500001238
Figure BDA00026272899500001239
中的负值元素。In formulas (5), (6), (7) and (8),
Figure BDA00026272899500001219
represents the Frobenius norm of the matrix;
Figure BDA00026272899500001220
represents the initial estimated spectrum
Figure BDA00026272899500001221
Represents the feature matrix
Figure BDA00026272899500001222
Represents the coefficient matrix
Figure BDA00026272899500001223
Figure BDA00026272899500001224
for
Figure BDA00026272899500001225
transpose of ;
Figure BDA00026272899500001226
for
Figure BDA00026272899500001227
The inverse matrix of ;
Figure BDA00026272899500001228
for
Figure BDA00026272899500001229
transpose of ;
Figure BDA00026272899500001230
for
Figure BDA00026272899500001231
transpose of ;
Figure BDA00026272899500001232
for
Figure BDA00026272899500001233
positive-valued elements in ;
Figure BDA00026272899500001234
for
Figure BDA00026272899500001235
Negative elements in ;
Figure BDA00026272899500001236
for
Figure BDA00026272899500001237
positive-valued elements in ;
Figure BDA00026272899500001238
for
Figure BDA00026272899500001239
Negative elements in .

于本实施例中,S5中根据X、W、H,以及

Figure BDA00026272899500001240
获得声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn,按如下步骤进行:In this embodiment, according to X, W, H, and
Figure BDA00026272899500001240
To obtain the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,...,S n , follow the steps below:

S51、H=[h1;h2;…;hc],hk(k=1,2,…,c)是维度为M的行向量,用

Figure BDA0002627289950000131
表示系数矩阵
Figure BDA0002627289950000132
Figure BDA0002627289950000133
是维度为M的行向量;S51, H=[h 1 ; h 2 ;...;h c ], h k (k=1, 2,..., c) is a row vector of dimension M, which is represented by
Figure BDA0002627289950000131
Represents the coefficient matrix
Figure BDA0002627289950000132
Figure BDA0002627289950000133
is a row vector of dimension M;

S52、按照公式

Figure BDA0002627289950000134
计算
Figure BDA0002627289950000135
与hk的相关系数
Figure BDA0002627289950000136
因此可以计算出
Figure BDA0002627289950000137
的每一行与hk的相关系数
Figure BDA0002627289950000138
然后选出
Figure BDA0002627289950000139
中的最大值,记为最大相关系数
Figure BDA00026272899500001310
S52. According to the formula
Figure BDA0002627289950000134
calculate
Figure BDA0002627289950000135
Correlation coefficient with h k
Figure BDA0002627289950000136
So it can be calculated
Figure BDA0002627289950000137
The correlation coefficient of each row with h k
Figure BDA0002627289950000138
then choose
Figure BDA0002627289950000139
The maximum value in , denoted as the maximum correlation coefficient
Figure BDA00026272899500001310

S53、按照步骤S52,计算

Figure BDA00026272899500001311
的每一行与H的每一行的最大相关系数,记为
Figure BDA00026272899500001312
S53, according to step S52, calculate
Figure BDA00026272899500001311
The maximum correlation coefficient between each row of H and each row of H, denoted as
Figure BDA00026272899500001312

S54、按照步骤S53,分别计算

Figure BDA00026272899500001313
的每一行与H的每一行的最大相关系数,记为Q=[q1;q2;…;qn],qi(i=1,2,…,n)是维度为c的行向量,Q是一个n行c列的矩阵;S54, according to step S53, calculate respectively
Figure BDA00026272899500001313
The maximum correlation coefficient between each row of H and each row of H is denoted as Q=[q 1 ; q 2 ;...;q n ], q i (i=1,2,...,n) is a row vector of dimension c , Q is a matrix with n rows and c columns;

S55、声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn的特征矩阵为W1,W2,…,Wn,且系数矩阵为H1,H2,…,HnS55. The characteristic matrices of the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,..., Sn are W 1 , W 2 ,..., W n , and the coefficient matrices are H 1 , H 2 ,…,H n ;

S56、如果在Q的第k列(k=1,2,…,c)中,第i行(i=1,2,…,n)的数值最大,则系数矩阵H的第k行即hk分配给Hi,且特征矩阵W=[w1,w2,…,wn]的第k列即wk分配给WiS56. If in the kth column (k=1,2,...,c) of Q, the ith row (i=1,2,...,n) has the largest value, then the kth row of the coefficient matrix H is h k is assigned to H i , and the k- th column of the feature matrix W=[w 1 , w 2 , . . . , wn ], that is, w k is assigned to Wi ;

S57、计算Q中每一列的最大值,并按照步骤S56执行,从而得到特征矩阵W1,W2,…,Wn和系数矩阵H1,H2,…,HnS57, calculate the maximum value of each column in Q, and execute according to step S56, thereby obtaining characteristic matrices W 1 , W 2 ,...,W n and coefficient matrices H 1 ,H 2 ,...,H n ;

S58、按照公式Xi=WiHiT(i=1,2,…,n)(10)计算得到声音信号S1,S2,…,Sn各自的频谱X1,X2,…,XnS58, according to the formula X i =W i H iT (i=1,2,...,n) (10), calculate and obtain the respective frequency spectra X 1 , X 2 ,..., of the sound signals S 1 , S 2 ,...,S n X n .

于公式(9)和(10)中,hk T为hk的转置;HiT为Hi的转置;Xi(i=1,2,…,n)表示频谱X1,X2,…,Xn;Wi(i=1,2,…,n)表示特征矩阵W1,W2,…,Wn;Hi(i=1,2,…,n)表示系数矩阵H1,H2,…,HnIn formulas (9) and (10), h k T is the transposition of h k ; H iT is the transposition of H i ; X i (i=1,2,...,n) represents the spectrum X 1 , X 2 ,...,X n ; Wi ( i =1,2,...,n) represents the characteristic matrix W 1 ,W 2 ,...,W n ;H i (i=1,2,...,n) represents the coefficient matrix H 1 , H 2 ,…,H n .

于本实施例中,S6中根据傅立叶变换结果F以及频谱X1,X2,…,Xn,获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn,按如下步骤进行:In this embodiment, in S6, according to the Fourier transform result F and the frequency spectrum X 1 , X 2 ,...,X n , the respective Fourier transform results F 1 , F 2 of the sound signals S 1 , S 2 ,..., Sn are obtained, …,F n , proceed as follows:

S61、用Fi(i=1,2,…,n)表示声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn

Figure BDA0002627289950000141
是p行q列矩阵,初始化Fi为零矩阵(矩阵中的元素的值全为0); S61 . Use F i ( i = 1 , 2 , .
Figure BDA0002627289950000141
is a matrix of p rows and q columns, and initializes F i to a zero matrix (the values of the elements in the matrix are all 0);

S62、F={Frk}p×q是一个p行q列的矩阵,用Xi(i=1,2,…,n)表示声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn

Figure BDA0002627289950000142
是p行q列的矩阵;S62. F = { F rk } p×q is a matrix with p rows and q columns, and X i (i=1, 2 , . X 1 , X 2 ,...,X n ,
Figure BDA0002627289950000142
is a matrix with p rows and q columns;

S63、比较X1,X2,…,Xn各自的第r行(r=1,2,…,p)第k(k=1,2,…,q)列元素的大小,如果Xi(i=1,2,…,n)中的第r行第k列元素最大,则

Figure BDA0002627289950000143
S63. Compare the sizes of the elements in the rth row (r=1, 2,...,p) of the kth (k=1, 2,...,q) column of each of X 1 , X 2 ,..., X n , if X i (i=1,2,...,n) where the element in the rth row and the kth column is the largest, then
Figure BDA0002627289950000143

S64、按照步骤S63执行,遍历X1,X2,…,Xn中所有元素,则得到

Figure BDA0002627289950000144
即获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn。S64. Execute according to step S63, traverse all elements in X 1 , X 2 ,..., X n , to obtain
Figure BDA0002627289950000144
That is , the Fourier transform results F 1 , F 2 , . . . , F n of the sound signals S 1 , S 2 , .

于本实施例中,本发明的效果可以通过以下实验进一步说明:In this embodiment, the effect of the present invention can be further illustrated by the following experiments:

1)实验数据1) Experimental data

实验数据来自于网上公开的一段心音信号和一段肺音信号,信号的采样频率均为2000Hz,信号持续的时间长度均为5秒。将二者混合,则得到单通道混合声音信号。心音信号的主要频段已知为≤100Hz,肺音信号的主要频段已知为≥300Hz。原始心音信号如图2所示,原始肺音信号如图3所示,原始心音信号和原始肺音信号的混合信号如图4所示。The experimental data comes from a piece of heart sound signal and a piece of lung sound signal published on the Internet. The sampling frequency of the signal is 2000Hz, and the duration of the signal is 5 seconds. Mix the two to get a single-channel mixed sound signal. The main frequency band of the heart sound signal is known to be ≤100Hz, and the main frequency band of the lung sound signal is known to be ≥300Hz. The original heart sound signal is shown in Figure 2, the original lung sound signal is shown in Figure 3, and the mixed signal of the original heart sound signal and the original lung sound signal is shown in Figure 4.

2)实验条件2) Experimental conditions

本发明的实验程序使用Matlab9.2.0软件编写。目标函数Γ的最小值Γmin设为10-4,目标函数Γi(i=1,2,…,n)的最小值

Figure BDA0002627289950000151
也设为10-4,最大迭代次数Emax设为500次,最大迭代次数
Figure BDA0002627289950000152
也设为500次。The experimental program of the present invention is written using Matlab9.2.0 software. The minimum value Γ min of the objective function Γ is set to 10 -4 , and the minimum value of the objective function Γ i (i=1,2,...,n)
Figure BDA0002627289950000151
Also set to 10 -4 , the maximum number of iterations E max is set to 500 times, the maximum number of iterations
Figure BDA0002627289950000152
Also set to 500 times.

3)实验结果3) Experimental results

以分离出的信号与原信号的归一化均方误差NMSE(Normalized Mean SquaredError)作为本发明的分离效果评价指标,该指标越小越好。通过本发明对混合声音信号进行信号分离,然后按照公式

Figure BDA0002627289950000153
计算分离出的信号与原信号的归一化均方误差NMSE。于公式(11)中,s表示原始信号(在本发明的实验中,s为心音信号或肺音信号),
Figure BDA0002627289950000154
表示分离出的信号(在本发明的实验中,
Figure BDA0002627289950000155
为分离出的心音信号或肺音信号)。分离出的信号与原始信号对比如图5所示,归一化均方误差NMSE结果如表1所示。The normalized mean squared error NMSE (Normalized Mean Squared Error) between the separated signal and the original signal is used as the evaluation index of the separation effect of the present invention, and the smaller the index, the better. The mixed sound signal is separated by the present invention, and then according to the formula
Figure BDA0002627289950000153
Calculate the normalized mean square error NMSE between the separated signal and the original signal. In formula (11), s represents the original signal (in the experiment of the present invention, s is the heart sound signal or the lung sound signal),
Figure BDA0002627289950000154
represents the separated signal (in the experiments of the present invention,
Figure BDA0002627289950000155
is the isolated heart sound signal or lung sound signal). The comparison between the separated signal and the original signal is shown in Figure 5, and the normalized mean square error NMSE results are shown in Table 1.

表1Table 1

Figure BDA0002627289950000156
Figure BDA0002627289950000156

以上所述实施例仅表达了本发明的一种实施方式,其描述较为具体和详细,但并不能因此而理解为对本发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以所附权利要求为准。The above-mentioned embodiment only represents an embodiment of the present invention, and its description is relatively specific and detailed, but it should not be construed as a limitation on the patent scope of the present invention. It should be pointed out that for those skilled in the art, without departing from the concept of the present invention, several modifications and improvements can be made, which all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention should be subject to the appended claims.

Claims (8)

1.一种基于半非负矩阵分解的声音信号分离方法,其特征在于,该方法包括以下步骤:1. a sound signal separation method based on semi-non-negative matrix decomposition, is characterized in that, this method comprises the following steps: S1、单通道混合声音信号S由若干独立的声音信号S1,S2,…,Sn混合而成,计算S的傅立叶变换结果F,根据F计算频谱X;S1. The single-channel mixed sound signal S is formed by mixing several independent sound signals S 1 , S 2 , ..., Sn , calculate the Fourier transform result F of S, and calculate the spectrum X according to F; S2、对X进行半非负矩阵分解,得到特征矩阵W和系数矩阵H;S2. Perform semi-non-negative matrix decomposition on X to obtain feature matrix W and coefficient matrix H; S3、根据X得到声音信号S1,S2,…,Sn各自的初始估计频谱
Figure FDA0002627289940000011
S3. Obtain the respective initial estimated spectrums of the sound signals S 1 , S 2 , ..., Sn according to X
Figure FDA0002627289940000011
S4、对
Figure FDA0002627289940000012
分别进行半非负矩阵分解,得到对应的特征矩阵
Figure FDA0002627289940000013
和系数矩阵
Figure FDA0002627289940000014
S4, yes
Figure FDA0002627289940000012
Perform semi-non-negative matrix decomposition respectively to get the corresponding eigenmatrix
Figure FDA0002627289940000013
and coefficient matrix
Figure FDA0002627289940000014
S5、根据X、W、H,以及
Figure FDA0002627289940000015
获得声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn
S5. According to X, W, H, and
Figure FDA0002627289940000015
Obtain the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,...,S n ;
S6、根据傅立叶变换结果F以及频谱X1,X2,…,Xn,获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,FnS6. According to the Fourier transform result F and the frequency spectrum X 1 , X 2 ,...,X n , obtain the respective Fourier transform results F 1 , F 2 ,..., F n of the sound signals S 1 , S 2 ,..., Sn ; S7、分别对F1,F2,…,Fn进行傅立叶逆变换,得到声音信号S1,S2,…,Sn,从而实现从单通道混合声音信号S中分离出独立的声音信号S1,S2,…,Sn S7 . Perform inverse Fourier transform on F 1 , F 2 , . 1 , S 2 ,…,S n .
2.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S1中所述单通道混合声音信号,是指混合声音信号只由一个声音采集器采集而得。2 . The sound signal separation method based on semi-non-negative matrix decomposition according to claim 1 , wherein the single-channel mixed sound signal in S1 means that the mixed sound signal is collected by only one sound collector. 3 . 3.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S1中所述单通道混合声音信号S由若干独立的声音信号S1,S2,…,Sn混合而成,其中n的数值为2或3。3. The sound signal separation method based on semi-non-negative matrix decomposition according to claim 1, wherein the single-channel mixed sound signal S described in S1 is composed of several independent sound signals S 1 , S 2 , . . . S A mixture of n , where the value of n is 2 or 3. 4.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S2中对X进行半非负矩阵分解,得到特征矩阵W和系数矩阵H,按如下步骤进行:4. the sound signal separation method based on semi-non-negative matrix decomposition according to claim 1, is characterized in that, in S2, X is carried out semi-non-negative matrix decomposition, obtain characteristic matrix W and coefficient matrix H, carry out according to the following steps: S21、构造半非负矩阵分解的目标函数ΓS21. Construct the objective function Γ of the semi-nonnegative matrix factorization
Figure FDA0002627289940000021
Figure FDA0002627289940000021
S22、初始化系数矩阵H,其所有元素的值为(0,1)之间的随机数;S22. Initialize the coefficient matrix H, and the values of all its elements are random numbers between (0, 1); S23、计算特征矩阵W的初始值为S23. Calculate the initial value of the feature matrix W as W=XH(HTH)-1 (2)W=XH(H T H) -1 (2) S24、将特征矩阵W和系数矩阵H交替迭代更新:先迭代更新一次W,然后迭代更新一次H,如此循环往复的先后迭代更新W和H;利用公式W=XH(HTH)-1(3)迭代更新特征矩阵W中的元素,利用公式
Figure FDA0002627289940000022
迭代更新系数矩阵H中的元素;
S24, iteratively update the feature matrix W and the coefficient matrix H alternately: first iteratively update W once, and then iteratively update H once, so that W and H are iteratively updated successively in a cycle; using the formula W=XH(H T H) -1 ( 3) Iteratively update the elements in the feature matrix W, using the formula
Figure FDA0002627289940000022
Iteratively update the elements in the coefficient matrix H;
S25、设定半非负矩阵分解的目标函数Γ的最小值Γmin,设定最大迭代次数Emax,每次迭代更新完成后计算目标函数Γ的值,当目标函数Γ的值小于Γmin或者迭代次数达到最大迭代次数Emax时,则停止迭代,得到最终的特征矩阵W和系数矩阵H;S25. Set the minimum value Γ min of the objective function Γ of the semi-nonnegative matrix decomposition, set the maximum number of iterations E max , and calculate the value of the objective function Γ after each iteration update is completed. When the value of the objective function Γ is less than Γ min or When the number of iterations reaches the maximum number of iterations E max , the iteration is stopped, and the final feature matrix W and coefficient matrix H are obtained; 于公式(1)、(2)、(3)和(4)中,
Figure FDA0002627289940000023
表示矩阵的Frobenius范数;X为频谱;W为特征矩阵;H为系数矩阵;HT为H的转置;(HTH)-1为HTH的逆矩阵;XT为X的转置;WT为W的转置;(XTW)+为XTW中的正值元素;(XTW)-为XTW中的负值元素;(WTW)+为WTW中的正值元素;(WTW)-为WTW中的负值元素。
In formulas (1), (2), (3) and (4),
Figure FDA0002627289940000023
Represents the Frobenius norm of the matrix; X is the frequency spectrum; W is the characteristic matrix; H is the coefficient matrix; H T is the transpose of H; (H T H) -1 is the inverse matrix of H T H; Set; W T is the transpose of W; (X T W) + is the positive value element in X T W; (X T W) - is the negative value element in X T W; (W T W) + is W Positive -valued elements in TW; (W T W ) - are negative-valued elements in W TW .
5.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S3中根据X得到声音信号S1,S2,…,Sn各自的初始估计频谱
Figure FDA0002627289940000031
按如下步骤进行:
5. The sound signal separation method based on semi-non-negative matrix decomposition according to claim 1, wherein in S3, the respective initial estimated spectrums of the sound signals S 1 , S 2 , ..., Sn are obtained according to X in S3
Figure FDA0002627289940000031
Proceed as follows:
S31、声音信号S1,S2,…,Sn各自的主要频段为f1,f2,…,fnS31. The respective main frequency bands of the sound signals S 1 , S 2 ,...,S n are f 1 , f 2 ,..., f n ; S32、将X中对应于频段f1,f2,…,fn的部分作为声音信号S1,S2,…,Sn各自的初始估计频谱
Figure FDA0002627289940000032
S32. Use the part of X corresponding to the frequency bands f 1 , f 2 ,..., f n as the respective initial estimated spectrums of the sound signals S 1 , S 2 ,..., Sn
Figure FDA0002627289940000032
6.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S4中对
Figure FDA0002627289940000033
分别进行半非负矩阵分解,得到对应的特征矩阵
Figure FDA0002627289940000034
和系数矩阵
Figure FDA0002627289940000035
按如下步骤进行:
6. the sound signal separation method based on semi-non-negative matrix decomposition according to claim 1, is characterized in that, in S4, to
Figure FDA0002627289940000033
Perform semi-non-negative matrix decomposition respectively to get the corresponding eigenmatrix
Figure FDA0002627289940000034
and coefficient matrix
Figure FDA0002627289940000035
Proceed as follows:
S41、构造半非负矩阵分解的目标函数Γi(i=1,2,…,n)S41. Construct the objective function Γ i (i=1,2,...,n) of semi-nonnegative matrix decomposition
Figure FDA0002627289940000036
Figure FDA0002627289940000036
S42、初始化系数矩阵
Figure FDA0002627289940000037
其所有元素的值为(0,1)之间的随机数;
S42, initialization coefficient matrix
Figure FDA0002627289940000037
The value of all its elements is a random number between (0,1);
S43、计算特征矩阵
Figure FDA0002627289940000038
的初始值为
S43. Calculate the feature matrix
Figure FDA0002627289940000038
The initial value is
Figure FDA0002627289940000039
Figure FDA0002627289940000039
S44、将特征矩阵
Figure FDA00026272899400000310
和系数矩阵
Figure FDA00026272899400000311
交替迭代更新:先迭代更新一次
Figure FDA00026272899400000312
然后迭代更新一次
Figure FDA00026272899400000313
如此循环往复的先后迭代更新
Figure FDA00026272899400000314
Figure FDA00026272899400000315
利用公式
Figure FDA00026272899400000316
迭代更新特征矩阵
Figure FDA00026272899400000317
中的元素,利用公式
Figure FDA00026272899400000318
迭代更新系数矩阵
Figure FDA0002627289940000041
中的元素;
S44, the feature matrix
Figure FDA00026272899400000310
and coefficient matrix
Figure FDA00026272899400000311
Alternate iterative update: first iterative update once
Figure FDA00026272899400000312
Then iteratively update once
Figure FDA00026272899400000313
Iterative update in this way
Figure FDA00026272899400000314
and
Figure FDA00026272899400000315
Use the formula
Figure FDA00026272899400000316
Iteratively update the feature matrix
Figure FDA00026272899400000317
elements in , using the formula
Figure FDA00026272899400000318
Iteratively update the coefficient matrix
Figure FDA0002627289940000041
elements in;
S45、设定半非负矩阵分解的目标函数Γi(i=1,2,…,n)的最小值
Figure FDA0002627289940000042
设定最大迭代次数
Figure FDA0002627289940000043
每次迭代更新完成后计算目标函数Γi的值,当目标函数Γi的值小于
Figure FDA00026272899400000443
或者迭代次数达到最大迭代次数
Figure FDA0002627289940000044
时,则停止迭代,得到最终的特征矩阵
Figure FDA0002627289940000045
和系数矩阵
Figure FDA0002627289940000046
S45. Set the minimum value of the objective function Γ i (i=1,2,...,n) of the semi-nonnegative matrix decomposition
Figure FDA0002627289940000042
Set the maximum number of iterations
Figure FDA0002627289940000043
Calculate the value of the objective function Γ i after each iteration update is completed, when the value of the objective function Γ i is less than
Figure FDA00026272899400000443
Or the number of iterations reaches the maximum number of iterations
Figure FDA0002627289940000044
When , stop the iteration and get the final feature matrix
Figure FDA0002627289940000045
and coefficient matrix
Figure FDA0002627289940000046
于公式(5)、(6)、(7)和(8)中,
Figure FDA0002627289940000047
表示矩阵的Frobenius范数;
Figure FDA0002627289940000048
表示初始估计频谱
Figure FDA0002627289940000049
Figure FDA00026272899400000410
表示特征矩阵
Figure FDA00026272899400000411
Figure FDA00026272899400000412
表示系数矩阵
Figure FDA00026272899400000413
Figure FDA00026272899400000414
Figure FDA00026272899400000415
Figure FDA00026272899400000416
的转置;
Figure FDA00026272899400000417
Figure FDA00026272899400000418
的逆矩阵;
Figure FDA00026272899400000419
Figure FDA00026272899400000420
的转置;
Figure FDA00026272899400000421
Figure FDA00026272899400000422
的转置;
Figure FDA00026272899400000423
Figure FDA00026272899400000424
中的正值元素;
Figure FDA00026272899400000425
Figure FDA00026272899400000426
中的负值元素;
Figure FDA00026272899400000427
Figure FDA00026272899400000428
中的正值元素;
Figure FDA00026272899400000429
Figure FDA00026272899400000430
中的负值元素。
In formulas (5), (6), (7) and (8),
Figure FDA0002627289940000047
represents the Frobenius norm of the matrix;
Figure FDA0002627289940000048
represents the initial estimated spectrum
Figure FDA0002627289940000049
Figure FDA00026272899400000410
Represents the feature matrix
Figure FDA00026272899400000411
Figure FDA00026272899400000412
Represents the coefficient matrix
Figure FDA00026272899400000413
Figure FDA00026272899400000414
Figure FDA00026272899400000415
for
Figure FDA00026272899400000416
transpose of ;
Figure FDA00026272899400000417
for
Figure FDA00026272899400000418
The inverse matrix of ;
Figure FDA00026272899400000419
for
Figure FDA00026272899400000420
transpose of ;
Figure FDA00026272899400000421
for
Figure FDA00026272899400000422
transpose of ;
Figure FDA00026272899400000423
for
Figure FDA00026272899400000424
positive-valued elements in ;
Figure FDA00026272899400000425
for
Figure FDA00026272899400000426
Negative elements in ;
Figure FDA00026272899400000427
for
Figure FDA00026272899400000428
positive-valued elements in ;
Figure FDA00026272899400000429
for
Figure FDA00026272899400000430
Negative elements in .
7.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S5中根据X、W、H,以及
Figure FDA00026272899400000431
获得声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn,按如下步骤进行:
7. the sound signal separation method based on semi-non-negative matrix decomposition according to claim 1, is characterized in that, according to X, W, H in S5, and
Figure FDA00026272899400000431
To obtain the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,...,S n , follow the steps below:
S51、H=[h1;h2;…;hc],hk(k=1,2,…,c)是维度为M的行向量,用
Figure FDA00026272899400000432
表示系数矩阵
Figure FDA00026272899400000433
Figure FDA00026272899400000434
Figure FDA00026272899400000435
Figure FDA00026272899400000436
是维度为M的行向量;
S51, H=[h 1 ; h 2 ;...;h c ], h k (k=1, 2,..., c) is a row vector of dimension M, which is represented by
Figure FDA00026272899400000432
Represents the coefficient matrix
Figure FDA00026272899400000433
Figure FDA00026272899400000434
Figure FDA00026272899400000435
Figure FDA00026272899400000436
is a row vector of dimension M;
S52、按照公式
Figure FDA00026272899400000437
计算
Figure FDA00026272899400000438
与hk的相关系数
Figure FDA00026272899400000439
因此可以计算出
Figure FDA00026272899400000440
的每一行与hk的相关系数
Figure FDA00026272899400000441
然后选出
Figure FDA00026272899400000442
中的最大值,记为最大相关系数
Figure FDA0002627289940000051
S52. According to the formula
Figure FDA00026272899400000437
calculate
Figure FDA00026272899400000438
Correlation coefficient with h k
Figure FDA00026272899400000439
So it can be calculated
Figure FDA00026272899400000440
The correlation coefficient of each row with h k
Figure FDA00026272899400000441
then choose
Figure FDA00026272899400000442
The maximum value in , denoted as the maximum correlation coefficient
Figure FDA0002627289940000051
S53、按照步骤S52,计算
Figure FDA0002627289940000052
的每一行与H的每一行的最大相关系数,记为
Figure FDA0002627289940000053
S53, according to step S52, calculate
Figure FDA0002627289940000052
The maximum correlation coefficient between each row of H and each row of H, denoted as
Figure FDA0002627289940000053
S54、按照步骤S53,分别计算
Figure FDA0002627289940000054
的每一行与H的每一行的最大相关系数,记为Q=[q1;q2;…;qn],qi(i=1,2,…,n)是维度为c的行向量,Q是一个n行c列的矩阵;
S54, according to step S53, calculate respectively
Figure FDA0002627289940000054
The maximum correlation coefficient between each row of H and each row of H is denoted as Q=[q 1 ; q 2 ;...;q n ], q i (i=1,2,...,n) is a row vector of dimension c , Q is a matrix with n rows and c columns;
S55、声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn的特征矩阵为W1,W2,…,Wn,且系数矩阵为H1,H2,…,HnS55. The characteristic matrices of the respective frequency spectra X 1 , X 2 ,..., X n of the sound signals S 1 , S 2 ,..., Sn are W 1 , W 2 ,..., W n , and the coefficient matrices are H 1 , H 2 ,…,H n ; S56、如果在Q的第k列(k=1,2,…,c)中,第i行(i=1,2,…,n)的数值最大,则系数矩阵H的第k行即hk分配给Hi,且特征矩阵W=[w1,w2,…,wn]的第k列即wk分配给WiS56. If in the kth column (k=1,2,...,c) of Q, the ith row (i=1,2,...,n) has the largest value, then the kth row of the coefficient matrix H is h k is assigned to H i , and the k- th column of the feature matrix W=[w 1 , w 2 , . . . , wn ], that is, w k is assigned to Wi ; S57、计算Q中每一列的最大值,并按照步骤S56执行,从而得到特征矩阵W1,W2,…,Wn和系数矩阵H1,H2,…,HnS57, calculate the maximum value of each column in Q, and execute according to step S56, thereby obtaining characteristic matrices W 1 , W 2 ,...,W n and coefficient matrices H 1 ,H 2 ,...,H n ; S58、按照公式Xi=WiHiT(i=1,2,…,n) (10)计算得到声音信号S1,S2,…,Sn各自的频谱X1,X2,…,XnS58, according to the formula X i =W i H iT (i=1,2,...,n) (10) to obtain the respective frequency spectra X 1 , X 2 ,..., of the sound signals S 1 , S 2 ,...,S n X n ; 于公式(9)和(10)中,hk T为hk的转置;HiT为Hi的转置;Xi(i=1,2,…,n)表示频谱X1,X2,…,Xn;Wi(i=1,2,…,n)表示特征矩阵W1,W2,…,Wn;Hi(i=1,2,…,n)表示系数矩阵H1,H2,…,HnIn formulas (9) and (10), h k T is the transposition of h k ; H iT is the transposition of H i ; X i (i=1,2,...,n) represents the spectrum X 1 , X 2 ,...,X n ; Wi ( i =1,2,...,n) represents the characteristic matrix W 1 ,W 2 ,...,W n ;H i (i=1,2,...,n) represents the coefficient matrix H 1 , H 2 ,…,H n .
8.根据权利要求1所述的基于半非负矩阵分解的声音信号分离方法,其特征在于,S6中根据傅立叶变换结果F以及频谱X1,X2,…,Xn,获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn,按如下步骤进行:8. The sound signal separation method based on semi-non-negative matrix decomposition according to claim 1, wherein in S6, according to Fourier transform result F and frequency spectrum X 1 , X 2 ,..., X n , obtain sound signal S 1 ,S 2 ,...,S n The Fourier transform results F 1 ,F 2 ,...,F n of each are carried out according to the following steps: S61、用Fi(i=1,2,…,n)表示声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn
Figure FDA0002627289940000061
是p行q列矩阵,初始化Fi为零矩阵;
S61 . Use F i ( i = 1 , 2 , .
Figure FDA0002627289940000061
is a matrix with p rows and q columns, and initializes F i as a zero matrix;
S62、F={Frk}p×q是一个p行q列的矩阵,用Xi(i=1,2,…,n)表示声音信号S1,S2,…,Sn各自的频谱X1,X2,…,Xn
Figure FDA0002627289940000062
是p行q列的矩阵;
S62. F = { F rk } p×q is a matrix with p rows and q columns, and X i (i=1, 2 , . X 1 , X 2 ,...,X n ,
Figure FDA0002627289940000062
is a matrix with p rows and q columns;
S63、比较X1,X2,…,Xn各自的第r行(r=1,2,…,p)第k(k=1,2,…,q)列元素的大小,如果Xi(i=1,2,…,n)中的第r行第k列元素最大,则
Figure FDA0002627289940000063
S63. Compare the sizes of the elements in the rth row (r=1, 2,...,p) of the kth (k=1, 2,...,q) column of each of X 1 , X 2 ,..., X n , if X i (i=1,2,...,n) where the element in the rth row and the kth column is the largest, then
Figure FDA0002627289940000063
S64、按照步骤S63执行,遍历X1,X2,…,Xn中所有元素,则得到
Figure FDA0002627289940000064
即获得声音信号S1,S2,…,Sn各自的傅立叶变换结果F1,F2,…,Fn
S64. Execute according to step S63, traverse all elements in X 1 , X 2 ,..., X n , to obtain
Figure FDA0002627289940000064
That is , the Fourier transform results F 1 , F 2 , . . . , F n of the sound signals S 1 , S 2 , .
CN201980012799.7A 2019-05-09 2019-05-09 A sound signal separation method based on semi-nonnegative matrix decomposition Active CN111837119B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/086149 WO2020223952A1 (en) 2019-05-09 2019-05-09 Semi-non-negative matrix factorization-based sound signal separation method

Publications (2)

Publication Number Publication Date
CN111837119A true CN111837119A (en) 2020-10-27
CN111837119B CN111837119B (en) 2023-12-19

Family

ID=72911763

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201980012799.7A Active CN111837119B (en) 2019-05-09 2019-05-09 A sound signal separation method based on semi-nonnegative matrix decomposition

Country Status (2)

Country Link
CN (1) CN111837119B (en)
WO (1) WO2020223952A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113030863A (en) * 2021-03-02 2021-06-25 珠海格力电器股份有限公司 Fault sound source detection method and system
CN115331693A (en) * 2022-07-08 2022-11-11 广州铁路职业技术学院(广州铁路机械学校) Single-channel sound separation method

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116415138A (en) * 2023-03-08 2023-07-11 国网四川省电力公司营销服务中心 A Blind Source Separation Method for Single-channel Ultrasonic Signals
WO2025236261A1 (en) * 2024-05-17 2025-11-20 南京航空航天大学 Airborne helicopter noise measurement method, device, medium, and product
CN119920267B (en) * 2024-12-31 2025-12-16 广东电网有限责任公司 Fault identification method, device, equipment and medium for transformer bushing
CN119414305B (en) * 2025-01-06 2025-03-11 北京航空航天大学 Time-frequency overlapping signal separation method based on instantaneous phase discontinuity
CN119848524B (en) * 2025-02-28 2025-11-18 暨南大学 A method for deep identification of spectral aliasing wireless signals

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070058737A1 (en) * 2005-09-11 2007-03-15 In-Seon Jang Convolutive blind source separation using relative optimization
CN107392149A (en) * 2017-07-21 2017-11-24 广东工业大学 A kind of real-time blind separating method of human body heart and lung sounds and system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107861492A (en) * 2017-09-25 2018-03-30 湖州师范学院 A kind of broad sense Non-negative Matrix Factorization fault monitoring method based on nargin statistic
CN108962276B (en) * 2018-07-24 2020-11-17 杭州听测科技有限公司 Voice separation method and device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070058737A1 (en) * 2005-09-11 2007-03-15 In-Seon Jang Convolutive blind source separation using relative optimization
CN107392149A (en) * 2017-07-21 2017-11-24 广东工业大学 A kind of real-time blind separating method of human body heart and lung sounds and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王艳芳;王敏;: "一种基于非负矩阵分解的改进FastICA盲源分离方法", 江苏科技大学学报(自然科学版), no. 02 *
秦楚雄;张连海;: "基于非负矩阵分解的语音深层低维特征提取方法", 数据采集与处理, no. 05 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113030863A (en) * 2021-03-02 2021-06-25 珠海格力电器股份有限公司 Fault sound source detection method and system
CN113030863B (en) * 2021-03-02 2023-08-25 珠海格力电器股份有限公司 Fault sound source detection method and system
CN115331693A (en) * 2022-07-08 2022-11-11 广州铁路职业技术学院(广州铁路机械学校) Single-channel sound separation method
CN115331693B (en) * 2022-07-08 2025-08-12 广州铁路职业技术学院(广州铁路机械学校) Single-channel sound separation method

Also Published As

Publication number Publication date
CN111837119B (en) 2023-12-19
WO2020223952A1 (en) 2020-11-12

Similar Documents

Publication Publication Date Title
CN111837119A (en) A Sound Signal Separation Method Based on Semi-Non-Negative Matrix Decomposition
CN109557429B (en) GIS partial discharge fault detection method based on improved wavelet threshold denoising
CN107192553B (en) Gear-box combined failure diagnostic method based on blind source separating
CN101477801B (en) Method for detecting and eliminating pulse noise in digital audio signal
CN108692936B (en) Mechanical fault diagnosis method based on parameter self-adaptive VMD
CN101957443B (en) Sound source localization method
CN109469837A (en) Multi-point leak location method of pressure pipeline based on VMD-PSE
CN102323518A (en) A Partial Discharge Signal Recognition Method Based on Spectral Kurtosis
CN109884591B (en) Microphone array-based multi-rotor unmanned aerial vehicle acoustic signal enhancement method
CN106772331B (en) Target recognition method and target recognition device
CN102419972B (en) Method of detecting and identifying sound signals
CN106017926A (en) Rolling bearing fault diagnosis method based on variational mode decomposition
CN107247251A (en) Three-dimensional sound localization method based on compressed sensing
CN104112072A (en) Operating modal parameter identification method for principal component analysis on basis of wavelet threshold denoising
CN110415720B (en) Quaternary differential microphone array super-directivity frequency-invariant beam forming method
CN104166120B (en) A kind of acoustic vector justifies battle array robust wideband MVDR direction estimation methods
CN106019214A (en) DOA Estimation Method for Broadband Coherent Signal Source
CN113250911A (en) Fan blade fault diagnosis method based on VMD decomposition algorithm
CN106908663A (en) A kind of charging electric vehicle harmonic identification method based on wavelet transformation
CN109658944B (en) Helicopter acoustic signal enhancement method and device
CN104462803B (en) A kind of autonomous type underwater robot fault identification method based on small echo approximate entropy
CN112992173B (en) Signal separation and denoising method based on improved BCA blind source separation
CN106301755B (en) Noise reduction method and system for energy leakage signal based on wavelet analysis
CN105429720B (en) The Time Delay Estimation Based reconstructed based on EMD
CN106100608B (en) Weighted Least Squares Spatial Matrix Filter Design Method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant