CN111699700A - Audio signal processing method, apparatus and storage medium - Google Patents
Audio signal processing method, apparatus and storage medium Download PDFInfo
- Publication number
- CN111699700A CN111699700A CN201980012218.XA CN201980012218A CN111699700A CN 111699700 A CN111699700 A CN 111699700A CN 201980012218 A CN201980012218 A CN 201980012218A CN 111699700 A CN111699700 A CN 111699700A
- Authority
- CN
- China
- Prior art keywords
- audio signal
- frequency domain
- channel
- data
- digital audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
- G11B2020/10537—Audio or video recording
- G11B2020/10546—Audio or video recording specifically adapted for audio data
- G11B2020/10555—Audio or video recording specifically adapted for audio data wherein the frequency, the amplitude, or other characteristics of the audio signal is taken into account
- G11B2020/10564—Audio or video recording specifically adapted for audio data wherein the frequency, the amplitude, or other characteristics of the audio signal is taken into account frequency
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Circuit For Audible Band Transducer (AREA)
- Stereophonic System (AREA)
Abstract
Description
技术领域technical field
本发明实施例涉及信号处理领域,尤其涉及一种音频信号处理方法、设备及存储介质。Embodiments of the present invention relate to the field of signal processing, and in particular, to an audio signal processing method, device, and storage medium.
背景技术Background technique
录音系统的动态范围是一个表征录音声压级范围的重要指标,即无失真录音的最小声音和最大声音。动态范围的下限与系统底噪有关,系统底噪越小,动态范围下限越低。系统底噪通常包括麦克风底噪、电路底噪、外部电磁场串扰噪声、数字量化噪声等。当声信号小于系统底噪时,则信号完全淹没,以致无法辨识;当声信号稍大于底噪时,有效声信号的信噪比较低。因此,降低录音系统底噪是提升小音量声信号录音质量的有效途径。The dynamic range of the recording system is an important indicator to characterize the sound pressure level range of the recording, that is, the minimum sound and the maximum sound of the undistorted recording. The lower limit of the dynamic range is related to the system noise floor. The smaller the system noise floor is, the lower the lower limit of the dynamic range is. The noise floor of the system usually includes the noise floor of the microphone, the noise floor of the circuit, the external electromagnetic field crosstalk noise, and the digital quantization noise. When the acoustic signal is less than the system noise floor, the signal is completely submerged, so that it cannot be identified; when the acoustic signal is slightly larger than the noise floor, the signal-to-noise ratio of the effective acoustic signal is low. Therefore, reducing the noise floor of the recording system is an effective way to improve the recording quality of low-volume sound signals.
现有技术中的录音系统一般由麦克风将接收到的声信号转换为电信号,也即将麦克风模拟信号经由模拟放大器完成信号放大,通过模数转换器将模拟信号转成数字信号,进行后续数字信号处理。在模拟信号经过放大器时,会引入电路噪声、电磁串扰噪声、数字量化噪声等系统底噪(不包括麦克风底噪)。现有技术中降低系统底噪、提高信噪比通常需要更优秀的硬件、电路和结构设计,提高了设计成本和设计局限性。The recording system in the prior art generally converts the received acoustic signal into an electrical signal by a microphone, that is, the analog signal of the microphone is amplified by an analog amplifier, and the analog signal is converted into a digital signal by an analog-to-digital converter, and the subsequent digital signal is carried out. deal with. When the analog signal passes through the amplifier, it will introduce system noise such as circuit noise, electromagnetic crosstalk noise, and digital quantization noise (excluding microphone noise). In the prior art, reducing the system noise floor and improving the signal-to-noise ratio usually requires better hardware, circuit and structural design, which increases the design cost and design limitations.
发明内容SUMMARY OF THE INVENTION
本发明实施例提供一种音频信号处理方法、设备及存储介质,以有效的降低录音系统的底噪,提高录音的信噪比。Embodiments of the present invention provide an audio signal processing method, device, and storage medium, so as to effectively reduce the noise floor of the recording system and improve the signal-to-noise ratio of the recording.
本发明实施例的第一方面是提供一种音频信号处理方法,包括:A first aspect of the embodiments of the present invention provides an audio signal processing method, including:
利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;Utilize a plurality of preprocessing circuits to process the analog audio signals to be processed to obtain multi-channel digital audio signals, wherein each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter;
对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;Perform gain compensation on the multi-channel digital audio signal, and perform frequency domain conversion on the multi-channel digital audio signal after the gain compensation to obtain multi-channel frequency domain data;
根据所述多路频域数据确定频域融合数据;Determine frequency domain fusion data according to the multi-channel frequency domain data;
将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。Converting the frequency-domain fusion data into a time-domain audio signal, and obtaining an output audio signal according to the time-domain audio signal.
本发明实施例的第二方面是提供一种音频信号处理设备,包括:存储器和处理器;A second aspect of the embodiments of the present invention is to provide an audio signal processing device, including: a memory and a processor;
所述存储器用于存储程序代码;the memory is used to store program codes;
所述处理器,调用所述程序代码,当程序代码被执行时,用于执行以下操作:The processor calls the program code, and when the program code is executed, is configured to perform the following operations:
利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;Utilize a plurality of preprocessing circuits to process the analog audio signals to be processed to obtain multi-channel digital audio signals, wherein each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter;
对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;Perform gain compensation on the multi-channel digital audio signal, and perform frequency domain conversion on the multi-channel digital audio signal after the gain compensation to obtain multi-channel frequency domain data;
根据所述多路频域数据确定频域融合数据;Determine frequency domain fusion data according to the multi-channel frequency domain data;
将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。Converting the frequency-domain fusion data into a time-domain audio signal, and obtaining an output audio signal according to the time-domain audio signal.
本发明实施例的第三方面是提供一种录音系统,包括:A third aspect of the embodiments of the present invention provides a recording system, including:
麦克风,用于采集音频模拟信号;以及a microphone for capturing audio analog signals; and
如第二方面所述的音频处理设备。The audio processing device of the second aspect.
本发明实施例的第四方面是提供一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行以实现第一方面所述的方法。A fourth aspect of the embodiments of the present invention is to provide a computer-readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement the method described in the first aspect.
本实施例提供的音频信号处理方法、设备及存储介质,通过利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;根据所述多路频域数据确定频域融合数据;将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。本实施例提供的方法将对待处理的模拟音频信号通过不同的预处理电路进行处理,并考虑到不同预处理电路的系统综合噪声互不相关、且与待处理的模拟音频信号也互不相关,通过在频域上进行融合后可提高有效信号的比例、降低系统综合噪声的比例,从而有效的提高了录音系统的信噪比。The audio signal processing method, device, and storage medium provided in this embodiment obtain multi-channel digital audio signals by using multiple preprocessing circuits to process analog audio signals to be processed, wherein each of the multiple preprocessing circuits pre- The processing circuit includes an amplifier and an analog-to-digital converter; performs gain compensation on the multi-channel digital audio signals, and performs frequency domain conversion on the multi-channel digital audio signals after the gain compensation to obtain multi-channel frequency domain data; according to the multi-channel digital audio signals The frequency domain data determines frequency domain fusion data; converts the frequency domain fusion data into a time domain audio signal, and obtains an output audio signal according to the time domain audio signal. In the method provided in this embodiment, the analog audio signal to be processed is processed by different preprocessing circuits, and considering that the system integrated noises of different preprocessing circuits are not correlated with each other and are not correlated with the analog audio signal to be processed, After fusion in the frequency domain, the proportion of effective signals can be increased, and the proportion of system integrated noise can be reduced, thereby effectively improving the signal-to-noise ratio of the recording system.
附图说明Description of drawings
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments. Obviously, the drawings in the following description are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative labor.
图1为本发明实施例提供的音频信号处理方法的流程图;1 is a flowchart of an audio signal processing method provided by an embodiment of the present invention;
图2为本发明实施例提供的多个预处理电路并联的示意图;2 is a schematic diagram of a parallel connection of multiple preprocessing circuits provided by an embodiment of the present invention;
图3为本发明另一实施例提供的音频信号处理方法的流程图;3 is a flowchart of an audio signal processing method provided by another embodiment of the present invention;
图4为本发明另一实施例提供的音频信号处理方法的流程图;4 is a flowchart of an audio signal processing method provided by another embodiment of the present invention;
图5为本发明另一实施例提供的音频信号处理方法的流程图;5 is a flowchart of an audio signal processing method provided by another embodiment of the present invention;
图6a为预处理电路的数字音频信号的时域信号图;6a is a time domain signal diagram of a digital audio signal of a preprocessing circuit;
图6b为预处理电路的数字音频信号的时频谱图;Fig. 6b is the time-spectrogram of the digital audio signal of the preprocessing circuit;
图7a为输出音频信号的时域信号图;Fig. 7a is the time domain signal diagram of output audio signal;
图7b为输出音频信号的时频谱图;Fig. 7b is the time-spectrogram of the output audio signal;
图8为本发明实施例提供的音频信号处理设备的结构图。FIG. 8 is a structural diagram of an audio signal processing device provided by an embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
需要说明的是,当一个组件被认为是“连接”另一个组件,它可以是直接连接到另一个组件或者可能同时存在居中组件。It should be noted that when a component is considered to be "connected" to another component, it can be directly connected to another component or there may be an intervening component at the same time.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。本文所使用的术语“及/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.
下面结合附图,对本发明的一些实施方式作详细说明。在不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Some embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The embodiments described below and features in the embodiments may be combined with each other without conflict.
本发明实施例提供一种音频信号处理方法。图1为本发明实施例提供的音频信号处理方法的流程图。如图1所示,本实施例中的音频信号处理方法,可以包括:Embodiments of the present invention provide an audio signal processing method. FIG. 1 is a flowchart of an audio signal processing method provided by an embodiment of the present invention. As shown in FIG. 1 , the audio signal processing method in this embodiment may include:
步骤S101、利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器预处理电路。Step S101 , using a plurality of preprocessing circuits to process the analog audio signal to be processed to obtain a multi-channel digital audio signal, wherein each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter preprocessing circuit.
在本实施例中,如图2所示,将多个预处理电路并联,每一预处理电路包括放大器和模数转换器,需要说明的是,所述多个预处理电路至少包括两个预处理电路;其中放大器可用于对模拟音频信号进行功率放大,放大倍数通常用增益表示,本实施例中各预处理电路的放大器的模拟增益可各不相同,当然也可相同;而模数转换器可用于将模拟音频信号转换成数字音频信号,以便于后续的信号处理。In this embodiment, as shown in FIG. 2 , multiple preprocessing circuits are connected in parallel, and each preprocessing circuit includes an amplifier and an analog-to-digital converter. It should be noted that the multiple preprocessing circuits include at least two preprocessing circuits. processing circuit; wherein the amplifier can be used to amplify the power of the analog audio signal, and the amplification factor is usually expressed by the gain. Can be used to convert analog audio signals into digital audio signals for subsequent signal processing.
本实施例中,如图2所示,待处理的模拟音频信号x可由麦克风采集,然后分别输入到并联的多个预处理电路中,分别经过不同增益的功率放大,并转换成数字音频信号x1、...xi...、xI,其中I为预处理电路的个数,而在模拟音频信号在不同的预处理电路中传输、放大、模数转换过程中引入系统综合噪声,系统综合噪声相当于除了麦克风底噪外的系统底噪,包括电路噪声、电磁串扰噪声、数字量化噪声等,且各预处理电路之间的系统综合噪声通常互不相关,且各预处理电路之间的系统综合噪声与待处理的模拟音频信号互不相关,数字音频信号可表示为xi=Gix+ni,i=1,2,...,I,其中Gi为预处理电路i的模拟增益,ni为预处理电路i的系统综合噪声。也即在本实施例中每一预处理电路的输入均为同一待处理的模拟音频信号,而每一预处理电路的输出的数字音频信号则各不相同,从而得到多路数字音频信号。In this embodiment, as shown in FIG. 2 , the analog audio signal x to be processed can be collected by a microphone, and then input to a plurality of parallel preprocessing circuits, respectively amplified by power with different gains, and converted into a digital audio signal x 1 , ...x i ..., x I , where I is the number of preprocessing circuits, and system integrated noise is introduced in the process of analog audio signal transmission, amplification, and analog-to-digital conversion in different preprocessing circuits, The system integrated noise is equivalent to the system noise floor except the microphone noise floor, including circuit noise, electromagnetic crosstalk noise, digital quantization noise, etc., and the system integrated noise between the preprocessing circuits is usually uncorrelated with each other, and the difference between the preprocessing circuits. The system-integrated noise between , and the analog audio signal to be processed are independent of each other, and the digital audio signal can be expressed as x i =G i x+n i , i=1, 2, . . . , I, where G i is the preprocessing The analog gain of circuit i, n i is the system integrated noise of preprocessing circuit i. That is, in this embodiment, the input of each preprocessing circuit is the same analog audio signal to be processed, and the output digital audio signal of each preprocessing circuit is different, thereby obtaining multiple digital audio signals.
本实施例中可以将待处理的模拟音频信号以预定时长截取为不同片段作为一帧信号,或者在进行模数转换后以预定数量的采样数据作为一帧信号,后续的音频信号处理流程中均可以以一帧信号为单位进行处理。为了保证信号的连续性,相邻帧信号之间可具有一定的重叠,即前一帧信号的尾部与后一帧信号的头部具有重叠量,从而建立相邻帧之间的相关性。In this embodiment, the analog audio signal to be processed may be cut into different segments with a predetermined duration as a frame signal, or a predetermined number of sampled data may be used as a frame signal after analog-to-digital conversion. Processing can be performed in units of one frame signal. In order to ensure the continuity of the signals, there may be a certain overlap between the signals of adjacent frames, that is, the tail of the signal of the previous frame and the head of the signal of the next frame have an overlap amount, so as to establish the correlation between the adjacent frames.
步骤S102、对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据。Step S102: Perform gain compensation on the multi-channel digital audio signals, and perform frequency domain conversion on the multi-channel digital audio signals after the gain compensation to obtain multi-channel frequency domain data.
在本实施例中,通过对多路数字音频信号进行增益补偿,从而实现多路音频信号的增益相同,从而便于在相同增益下进行后续多路数字音频信号的频域融合,其中可以在多个预处理电路中任选一个预处理电路作为参考预处理电路,以其放大器的模拟增益作为参考增益,将多路数字音频信号根据参考增益进行增益补偿。进一步的,对增益补偿之后的多路数字音频信号中每一路数字音频信号分别进行频域转换,从而得到每一路数字音频信号对应的频域数据。本实施例中将增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据,进而可实现多路频域数据在频域上的融合。其中频域转换方法可采用傅里叶变换(如离散傅里叶变换)、拉普拉斯变换、Z变换等等,具体的频域转换过程此处不再赘述。In this embodiment, by performing gain compensation on the multi-channel digital audio signals, the gains of the multi-channel audio signals are realized to be the same, thereby facilitating the frequency domain fusion of the subsequent multi-channel digital audio signals under the same gain. One of the preprocessing circuits is selected as the reference preprocessing circuit, and the analog gain of its amplifier is used as the reference gain, and the multi-channel digital audio signal is subjected to gain compensation according to the reference gain. Further, frequency domain conversion is performed on each channel of digital audio signals in the multi-channel digital audio signals after the gain compensation, so as to obtain frequency domain data corresponding to each channel of digital audio signals. In this embodiment, the multi-channel digital audio signals after the gain compensation are subjected to frequency domain conversion to obtain multi-channel frequency-domain data, thereby realizing the fusion of the multi-channel frequency-domain data in the frequency domain. The frequency domain conversion method may adopt Fourier transform (such as discrete Fourier transform), Laplace transform, Z transform, etc. The specific frequency domain conversion process will not be repeated here.
步骤S103、根据所述多路频域数据确定频域融合数据。Step S103: Determine frequency domain fusion data according to the multi-channel frequency domain data.
在本实施例中,各预处理电路之间的系统综合噪声通常互不相关,且各预处理电路之间的系统综合噪声与待处理的模拟音频信号互不相关,在进行多路频域数据融合时可根据各频域数据之间的相似程度,辨识出有效信号与系统综合噪声,具体的,在融合时对于频域数据的某一采样点上,若相关程度越大,则融合后该采样点上有效信号的比例越高,也即相当于提高了频域融合数据中有效信号的比例,降低了系统综合噪声的比例。In this embodiment, the system integrated noise between the preprocessing circuits is generally uncorrelated, and the system integrated noise between the preprocessing circuits and the analog audio signal to be processed are uncorrelated with each other. During fusion, the effective signal and system integrated noise can be identified according to the similarity between each frequency domain data. Specifically, at a certain sampling point of the frequency domain data during fusion, if the degree of correlation is greater, the frequency domain data will be fused. The higher the ratio of valid signals at the sampling point, the higher the ratio of valid signals in the frequency-domain fusion data, and the lower the ratio of system integrated noise.
步骤S104、将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。Step S104: Convert the frequency-domain fusion data into a time-domain audio signal, and obtain an output audio signal according to the time-domain audio signal.
在本实施例中,在得到频域融合数据后,即可将频域融合数据转换为时域音频信号,转换方法可采用傅里叶变换(如离散傅里叶变换)的逆变换、拉普拉斯变换的逆变换、Z变换的逆变换等等,此处不再赘述。在完成转换后,可根据时域音频信号获取输出音频信号,其中根据时域音频信号获取输出音频信号过程中,可对时域音频信号进行压缩、降噪等操作。此外,若本实施例中以一帧信号为单位进行上述的音频信号处理流程,则根据时域音频信号获取输出音频信号过程中还需要将各帧信号进行拼接,建立相邻帧之间的相关性,具体的,可将当前帧时域音频信号与前一帧前帧时域音频信号进行时域叠加处理。In this embodiment, after the frequency-domain fusion data is obtained, the frequency-domain fusion data can be converted into a time-domain audio signal, and the conversion method can adopt the inverse transform of Fourier transform (such as discrete Fourier transform), Lapp The inverse transformation of the Lass transform, the inverse transformation of the Z transform, etc. will not be repeated here. After the conversion is completed, the output audio signal can be obtained according to the time-domain audio signal, and in the process of obtaining the output audio signal according to the time-domain audio signal, operations such as compression and noise reduction can be performed on the time-domain audio signal. In addition, if the above-mentioned audio signal processing flow is performed in units of one frame of signal in this embodiment, in the process of obtaining the output audio signal according to the time-domain audio signal, it is necessary to splicing each frame signal to establish the correlation between adjacent frames. Specifically, the time-domain superimposition processing may be performed on the time-domain audio signal of the current frame and the time-domain audio signal of the previous frame before the previous frame.
本实施例提供的音频信号处理方法,通过利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;根据所述多路频域数据确定频域融合数据;将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。本实施例提供的方法将对待处理的模拟音频信号通过不同的预处理电路进行处理,并考虑到不同预处理电路的系统综合噪声互不相关、且与待处理的模拟音频信号也互不相关,通过在频域上进行融合后可提高有效信号的比例、降低系统综合噪声的比例,从而有效的提高了录音系统的信噪比。In the audio signal processing method provided by this embodiment, a plurality of preprocessing circuits are used to process an analog audio signal to be processed to obtain a multi-channel digital audio signal, wherein each of the plurality of preprocessing circuits includes an amplifier and a an analog-to-digital converter; performing gain compensation on the multi-channel digital audio signals, and performing frequency domain conversion on the multi-channel digital audio signals after the gain compensation to obtain multi-channel frequency domain data; determining the frequency domain according to the multi-channel frequency domain data domain fusion data; converting the frequency domain fusion data into a time domain audio signal, and obtaining an output audio signal according to the time domain audio signal. In the method provided in this embodiment, the analog audio signal to be processed is processed by different preprocessing circuits, and considering that the system integrated noises of different preprocessing circuits are not correlated with each other and are not correlated with the analog audio signal to be processed, After fusion in the frequency domain, the proportion of effective signals can be increased, and the proportion of system integrated noise can be reduced, thereby effectively improving the signal-to-noise ratio of the recording system.
在上述任一实施例的基础上,步骤S102所述对所述多路数字音频信号进行增益补偿,可包括:On the basis of any of the foregoing embodiments, the step S102 to perform gain compensation on the multi-channel digital audio signal may include:
根据多路数字音频信号中每一路数字音频信号对应的增益补偿系数对所述每一路数字音频信号进行增益补偿,其中,多路数字音频信号中每一路数字音频信号对应的增益补偿系数各不相同。Gain compensation is performed on each channel of digital audio signal according to the gain compensation coefficient corresponding to each channel of digital audio signal in the multichannel digital audio signal, wherein the gain compensation coefficient corresponding to each channel of digital audio signal in the multichannel digital audio signal is different from each other .
在本实施例中,为实现多路音频信号的增益相同,便于在相同增益下进行后续多路数字音频信号的频域融合,可首先获取多路数字音频信号中每一路数字音频信号对应的增益补偿系数,然后再对每一路数字音频信号根据其对应的增益补偿系数进行增益补偿。由于每一预处理电路包括的放大器的模拟增益可能不相同,因此若将通过不同模拟增益放大后的多路数字音频信号调整到相同的增益下,则每一路数字音频信号对应的增益补偿系数各不相同。In this embodiment, in order to realize that the gains of the multi-channel audio signals are the same and facilitate the frequency domain fusion of the subsequent multi-channel digital audio signals under the same gain, the gain corresponding to each channel of the digital audio signals in the multi-channel digital audio signals can be obtained first. compensation coefficient, and then perform gain compensation for each channel of digital audio signal according to its corresponding gain compensation coefficient. Since the analog gains of the amplifiers included in each preprocessing circuit may be different, if the multi-channel digital audio signals amplified by different analog gains are adjusted to the same gain, the gain compensation coefficients corresponding to each channel of digital audio signals will be different. Are not the same.
进一步的,每一路数字音频信号对应的增益补偿系数是根据参考增益和所述每一路数字音频信号对应的预处理电路包括的放大器的模拟增益确定的,其中,所述参考增益为多个预处理电路中的参考预处理电路包括的放大器的模拟增益。Further, the gain compensation coefficient corresponding to each channel of digital audio signal is determined according to the reference gain and the analog gain of the amplifier included in the preprocessing circuit corresponding to each channel of digital audio signal, wherein the reference gain is a plurality of preprocessing steps. The analog gain of the amplifier included in the reference preprocessing circuit in the circuit.
在本实施例中,可从多个预处理电路中任选一路作为参考预处理电路,以参考预处理电路包括的放大器的模拟增益作为参考增益,对于某一路数字音频信号,获取该路数字音频信号对应的预处理电路包括的放大器的模拟增益,然后以参考增益与该模拟增益之间的比值作为该路数字音频信号对应的增益补偿系数,进而在对该路数字音频信号进行增益补偿时,将该路数字音频信号乘以该增益补偿系数,即可得到增益补偿之后的数字音频信号。In this embodiment, any one of the multiple preprocessing circuits can be selected as the reference preprocessing circuit, and the analog gain of the amplifier included in the reference preprocessing circuit can be used as the reference gain. The analog gain of the amplifier included in the preprocessing circuit corresponding to the signal, and then the ratio between the reference gain and the analog gain is used as the gain compensation coefficient corresponding to the digital audio signal, and then when the digital audio signal is subjected to gain compensation, The digital audio signal after the gain compensation can be obtained by multiplying the digital audio signal by the gain compensation coefficient.
在上述任一实施例的基础上,如图3所示,所述根据所述多路频域数据确定频域融合数据,包括:On the basis of any of the foregoing embodiments, as shown in FIG. 3 , the determining of frequency domain fusion data according to the multi-channel frequency domain data includes:
步骤S201、获取多路频域数据中每两路频域数据之间的相关性参数;Step S201, obtaining the correlation parameter between every two channels of frequency domain data in the multi-channel frequency domain data;
步骤S202、根据所述每两路频域数据之间的相关性参数和与所述相关性参数对应的两路频域数据确定频域融合数据。Step S202: Determine frequency domain fusion data according to the correlation parameter between the two channels of frequency domain data and the two channels of frequency domain data corresponding to the correlation parameter.
在本实施例中,对于多路频域数据中的任意两路频域数据,获取该两路频域数据每一采样点上的相关性参数,然后根据相关性参数对该两路频域数据各采样点进行叠加,得到该两路频域数据的叠加结果,其中若某一采样点的相关性参数越大,则该采样点的叠加结果相对越大,若相关性参数越小,则该采样点的叠加结果相对越小;最后将多路频域数据中任意两路频域数据的叠加结果进行累加,进而可得到频域融合数据。In this embodiment, for any two channels of frequency domain data in the multichannel frequency domain data, the correlation parameters at each sampling point of the two channels of frequency domain data are obtained, and then the two channels of frequency domain data are obtained according to the correlation parameters. Each sampling point is superimposed to obtain the superposition result of the two channels of frequency domain data. If the correlation parameter of a certain sampling point is larger, the superposition result of the sampling point is relatively larger, and if the correlation parameter is smaller, the The superposition result of the sampling points is relatively smaller; finally, the superposition results of any two channels of frequency domain data in the multi-channel frequency domain data are accumulated, and then the frequency domain fusion data can be obtained.
本实施例中,可通过如下公式获取任意两路频域数据每一采样点上的相关性参数:In this embodiment, the correlation parameters at each sampling point of any two channels of frequency domain data can be obtained by the following formula:
其中,Cij(n)为频域数据X′i和X′j在第n个采样点的相关性参数,X′j(n)为频域数据X′j的第n个采用点的信号,X′i(n)为频域数据X′i的第n个采用点的信号,I为预处理电路的个数,运算min(a,b)为a和b的较小值,运算|·|表示复数的模。0≤Cij(n)l≤1,Cij(n)越大表示频域数据X′i和X′j在第n个采样点的信号越相似,有效信号的比例越高。Among them, C ij (n) is the correlation parameter of the frequency domain data X′ i and X′ j at the nth sampling point, and X′ j (n) is the signal of the nth sampling point of the frequency domain data X′ j , X′ i (n) is the signal of the n-th adopted point of the frequency domain data X′ i , I is the number of preprocessing circuits, the operation min(a, b) is the smaller value of a and b, and the operation | ·| represents the modulus of a complex number. 0≤C ij (n) l ≤ 1, a larger C ij (n) indicates that the signals of the frequency domain data X′ i and X′ j at the nth sampling point are more similar, and the ratio of valid signals is higher.
需要说明的是,相关性参数并不限于通过上述公式得到的相关性参数,也可采用任意两路频域数据在每一采样点上的信号的比值作为相关性参数。It should be noted that the correlation parameter is not limited to the correlation parameter obtained by the above formula, and the ratio of the signals of any two channels of frequency domain data at each sampling point may also be used as the correlation parameter.
进一步的,可通过如下公式获取频域融合数据 Further, the frequency domain fusion data can be obtained by the following formula:
通过此公式中可以根据相关性参数对任意两路频域数据X′i和X′j的第n个采样点的信号进行叠加,也即X′i(n)+X′j(n),然后乘以相关性参数Cij(n)得到两路频域数据在第n个采样点的叠加结果,则相关性参数越大,则该采样点的叠加结果相对越大,若相关性参数越小,则该采样点的叠加结果相对越小;最后将多路频域数据中任意两路频域数据的叠加结果进行累加,进而可得到频域融合数据,在频域融合数据中提高了有效信号的比例,降低了系统综合噪声的比例,从而提高了信噪比。In this formula, the signals of the nth sampling point of any two channels of frequency domain data X' i and X' j can be superimposed according to the correlation parameter, that is, X' i (n)+X' j (n), Then multiply the correlation parameter C ij (n) to obtain the superposition result of the two channels of frequency domain data at the nth sampling point. The larger the correlation parameter, the larger the superposition result of the sampling point. is small, the superposition result of the sampling point is relatively small; finally, the superposition results of any two channels of frequency domain data in the multi-channel frequency domain data are accumulated, and then the frequency domain fusion data can be obtained, which improves the efficiency of the frequency domain fusion data. The ratio of the signal to the system reduces the ratio of the integrated noise of the system, thereby improving the signal-to-noise ratio.
进一步的,由于频域融合数据通常具有对称性,因此在获取频域融合数据可仅获取频域融合数据的单边频谱数据,通过将单边频谱数据转换为多边频谱数据,作为所述频域融合数据。Further, since the frequency domain fusion data usually has symmetry, when obtaining the frequency domain fusion data, only the unilateral spectrum data of the frequency domain fusion data can be obtained, and by converting the unilateral spectral data into the multilateral spectrum data, as the frequency domain Fusion data.
在上述任一实施例的基础上,所述方法还包括:On the basis of any of the above embodiments, the method further includes:
对所述多路频域数据进行相位补偿;performing phase compensation on the multi-channel frequency domain data;
所述根据所述多路频域数据确定频域融合数据,包括:The determining the frequency domain fusion data according to the multi-channel frequency domain data includes:
根据相位补偿之后的多路频域数据确定频域融合数据。The frequency domain fusion data is determined according to the multi-channel frequency domain data after phase compensation.
在本实施例中,多路数字音频信号可能会因为多个预处理电路的电路设计或采样逻辑等原因存在一定的时间偏移,在频域上则对应相位偏移,也即多路频域数据之间可能存在相位偏移,而上述实施例中在由多路频域数据获取频域融合数据时,需要对多路频域数据相同的采样点进行相似性比较,对多路频域数据的相位一致性具有较高的要求,因此本实施例中在根据所述多路频域数据确定频域融合数据前,需要对多路频域数据进行相位补偿,进而可根据相位补偿之后的多路频域数据确定频域融合数据。In this embodiment, the multi-channel digital audio signal may have a certain time offset due to the circuit design or sampling logic of multiple pre-processing circuits, and the frequency domain corresponds to the phase offset, that is, the multi-channel frequency domain. There may be a phase offset between the data, and in the above embodiment, when the frequency domain fusion data is obtained from the multichannel frequency domain data, the similarity comparison needs to be performed on the same sampling points of the multichannel frequency domain data. Therefore, in this embodiment, before the frequency-domain fusion data is determined according to the multi-channel frequency-domain data, it is necessary to perform phase compensation on the multi-channel frequency-domain data. The frequency domain data determines the frequency domain fusion data.
进一步的,如图4所示,所述对所述多路频域数据进行相位补偿,包括:Further, as shown in FIG. 4 , the performing phase compensation on the multi-channel frequency domain data includes:
步骤S301、确定多路频域数据中每一路频域数据与多路频域数据中的参考频域数据之间的相位差或时间差;Step S301, determining the phase difference or time difference between each channel of frequency domain data in the multichannel frequency domain data and the reference frequency domain data in the multichannel frequency domain data;
步骤S302、根据所述相位差或时间差对对应的一路多路频域数据进行相位补偿。Step S302: Perform phase compensation on the corresponding one channel and multiple channels of frequency domain data according to the phase difference or time difference.
在本实施例中,可选择多路频域数据中任一路频域数据作为参考频域数据,获取多路频域数据中每一路频域数据与参考频域数据之间的相位差或时间差,然后根据每一路对应的相位差或时间差对该路频域数据进行相位补偿。其中相位补偿可以采用线性相位补偿。In this embodiment, any channel of frequency-domain data in the multi-channel frequency-domain data can be selected as the reference frequency-domain data, and the phase difference or time difference between each channel of frequency-domain data in the multi-channel frequency-domain data and the reference frequency-domain data is obtained, Then phase compensation is performed on the frequency domain data of each channel according to the corresponding phase difference or time difference of the channel. The phase compensation can use linear phase compensation.
需要说明的是,若多个预处理电路相互之间无时延,则无需进行相位补偿。It should be noted that, if there is no time delay between multiple preprocessing circuits, phase compensation is not required.
在上述任一实施例的基础上,所述时域音频信号为当前帧时域音频信号,其中,所述根据所述时域音频信号获取输出音频信号,包括:On the basis of any of the foregoing embodiments, the time-domain audio signal is a time-domain audio signal of the current frame, wherein the acquiring the output audio signal according to the time-domain audio signal includes:
将所述当前帧时域音频信号与当前帧时域音频信号之前获取的历史帧时域音频信号进行叠加处理以获取当前帧时域融合音频信号;performing superimposition processing on the current frame time domain audio signal and the historical frame time domain audio signal obtained before the current frame time domain audio signal to obtain the current frame time domain fusion audio signal;
根据所述当前帧时域融合音频信号确定所述输出音频信号。The output audio signal is determined according to the time-domain fusion audio signal of the current frame.
在本实施例中,以一帧信号为单位进行上述的各音频信号处理流程,为了保证信号的连续性,相邻帧信号之间可具有一定的重叠,即前一帧信号的尾部与后一帧信号的头部具有重叠量,从而建立相邻帧之间的相关性。因此在通过S104中将当前帧频域融合数据转换为当前帧时域音频信号后,可将当前帧时域音频信号与前一帧时域音频信号中重叠部分进行重叠叠加运算,而当前帧时域音频信号与前一帧时域音频信号中未重叠部分则不进行叠加运算,从而得到当前帧时域融合音频信号,进而可根据所述当前帧时域融合音频信号确定所述输出音频信号。In this embodiment, the above-mentioned audio signal processing procedures are performed in units of one frame signal. In order to ensure the continuity of the signals, there may be a certain overlap between adjacent frame signals, that is, the tail of the previous frame signal and the next frame signal are overlapped. The headers of the frame signals have an amount of overlap, thereby establishing a correlation between adjacent frames. Therefore, after converting the current frame frequency domain fusion data into the current frame time domain audio signal in S104, the overlapping part of the current frame time domain audio signal and the previous frame time domain audio signal can be overlapped and superimposed. The non-overlapping part of the time domain audio signal and the previous frame time domain audio signal does not perform superposition operation, thereby obtaining the current frame time domain fusion audio signal, and then the output audio signal can be determined according to the current frame time domain fusion audio signal.
在上述任一实施例的基础上,所述对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据,包括:On the basis of any of the above-mentioned embodiments, performing frequency domain conversion on the multi-channel digital audio signals after gain compensation to obtain multi-channel frequency domain data includes:
对所述增益补偿之后的多路数字音频信号进行加窗处理;Windowing is performed on the multi-channel digital audio signals after the gain compensation;
根据加窗后的多路数字音频信号进行频域转换以获取多路频域数据。Perform frequency domain conversion according to the windowed multi-channel digital audio signals to obtain multi-channel frequency domain data.
在本实施例中,采用分析窗函数对各路数字音频信号进行加窗处理,其中分析窗函数包括但不限于正弦窗函数,矩形窗函数,三角窗函数,汉宁窗函数(Hanning),高斯窗函数(Gaussian),布莱克曼窗函数(Blackman),切比雪夫窗函数(Chebyshev),汉明窗函数(Hamming),平顶窗函数(Flap Top),凯塞窗函数(Kaiser)。当然,也可不进行加窗处理,也即相当于加了矩形窗。。In this embodiment, an analysis window function is used to perform windowing processing on each channel of digital audio signals, wherein the analysis window function includes but is not limited to a sine window function, a rectangular window function, a triangular window function, a Hanning window function (Hanning), a Gaussian window function Window function (Gaussian), Blackman window function (Blackman), Chebyshev window function (Chebyshev), Hamming window function (Hamming), flat top window function (Flap Top), Kaiser window function (Kaiser). Of course, the windowing process may not be performed, which is equivalent to adding a rectangular window. .
进一步的,所述根据所述时域音频信号获取输出音频信号,包括:Further, the obtaining the output audio signal according to the time domain audio signal includes:
对所述时域音频信号进行加窗处理;performing windowing processing on the time-domain audio signal;
根据所述加窗处理之后的时域音频信号确定所述输出音频信号。The output audio signal is determined from the time-domain audio signal after the windowing process.
在本实施例中,采用合成窗函数对时域音频信号进行加窗处理,其中合成窗函数包括但不限于正弦窗函数,矩形窗函数,三角窗函数,汉宁窗函数(Hanning),高斯窗函数(Gaussian),布莱克曼窗函数(Blackman),切比雪夫窗函数(Chebyshev),汉明窗函数(Hamming),平顶窗函数(Flap Top),凯塞窗函数(Kaiser)。当然,也可不进行加窗处理,也即相当于加了矩形窗。In this embodiment, the time-domain audio signal is windowed by using a synthesis window function, wherein the synthesis window function includes but is not limited to a sine window function, a rectangular window function, a triangular window function, a Hanning window function (Hanning), a Gaussian window function Gaussian, Blackman, Chebyshev, Hamming, Flap Top, Kaiser. Of course, the windowing process may not be performed, which is equivalent to adding a rectangular window.
本发明实施例提供一种音频信号处理方法。图5为本发明另一实施例提供的音频信号处理方法的流程图。如图5所示,在上述实施例的基础上,本实施例中的方法,可以包括:Embodiments of the present invention provide an audio signal processing method. FIG. 5 is a flowchart of an audio signal processing method provided by another embodiment of the present invention. As shown in FIG. 5 , on the basis of the foregoing embodiment, the method in this embodiment may include:
步骤S401、利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号。Step S401 , using a plurality of preprocessing circuits to process the analog audio signal to be processed to obtain a multi-channel digital audio signal.
其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器,且各预处理电路的所述放大器的模拟增益可各不相同,当然也可相同。Wherein, each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter, and the analog gains of the amplifiers of each preprocessing circuit may be different from each other, and may of course be the same.
预处理的模拟音频信号x(t),t为采样离散时间序列t=1,2,...,采样频率为fs,将预处理的模拟音频信号x(t)输入并联的多路数字音频信号中,其中第i路预处理电路包括的放大器的模拟增益分别为Gi,其中,i=1,2,...,I,I为预处理电路的个数,在模拟音频信号传输、放大、模数转换过程中引入系统综合噪声,包括电路噪声、电磁串扰噪声、数字量化噪声等,其中各路的系统综合噪声分别为ni(t),也即各通道输出的数字音频信号xi(t)如下:The preprocessed analog audio signal x( t ), t is the sampling discrete time series t=1, 2, . In the audio signal, the analog gains of the amplifiers included in the i-th preprocessing circuit are respectively G i , where i=1, 2, . , Amplification, and analog-to-digital conversion, the system integrated noise is introduced, including circuit noise, electromagnetic crosstalk noise, digital quantization noise, etc. The system integrated noise of each channel is n i (t), that is, the digital audio signal output by each channel. x i (t) is as follows:
xi(t)=Gix(t)+ni(t) i=1,2,...,Ix i (t)=G i x(t)+n i (t) i=1,2,...,I
步骤S402、对多路数字音频信号中的每一数字音频信号每间隔M个采样点提取N个采样点,作为一帧数字音频信号。Step S402 , extracting N sampling points at every interval of M sampling points for each digital audio signal in the multi-channel digital audio signal, as a frame of digital audio signal.
在本实施例中,第i路预处理电路的第l帧数字音频信号记作:In this embodiment, the first frame of digital audio signal of the i-th preprocessing circuit is denoted as:
xi(t)l,t=1,2,...,N。x i (t) l , t=1, 2, . . . , N.
其中,N称为帧长,M称为帧移,0<M<N,也即第l-1帧的最后N-M个采样点与第l帧的前N-M个采样点相同,0.005fs<N<fs,且N为2的幂次方。Among them, N is called the frame length, M is called the frame shift, 0<M<N, that is, the last NM sampling points of the l-1th frame are the same as the first NM sampling points of the lth frame, 0.005f s <N < f s , and N is a power of 2.
步骤S403、对所述多路数字音频信号进行增益补偿和加窗处理。Step S403 , performing gain compensation and windowing processing on the multi-channel digital audio signal.
在本实施例中,为实现多路音频信号的增益相同,可获取多路数字音频信号中每一路数字音频信号对应的增益补偿系数,在本实施例中从多个预处理电路中任选一路作为参考预处理电路,不失一般性的,以第1预处理电路作为参考预处理电路,以其包括的放大器的模拟增益G1作为参考增益,对于某一路数字音频信号获取该路数字音频信号对应的预处理电路包括的放大器的模拟增益Gi,以参考增益与该模拟增益之间的比值G1/Gi作为该路数字音频信号对应的增益补偿系数,进而根据增益补偿系数对该路数字音频信号进行增益补偿。In this embodiment, in order to achieve the same gain of the multi-channel audio signals, the gain compensation coefficient corresponding to each channel of digital audio signals in the multi-channel digital audio signals can be obtained. In this embodiment, one channel is selected from multiple preprocessing circuits. As the reference preprocessing circuit, without loss of generality, the first preprocessing circuit is used as the reference preprocessing circuit, and the analog gain G 1 of the amplifier it includes is used as the reference gain to obtain the digital audio signal of a certain channel of digital audio signal. The analog gain G i of the amplifier included in the corresponding preprocessing circuit takes the ratio G 1 /G i between the reference gain and the analog gain as the gain compensation coefficient corresponding to the digital audio signal of the channel, and then according to the gain compensation coefficient for the channel. Digital audio signal for gain compensation.
本实施例中还可对多路数字音频信号进行加窗处理,其中,分析窗函数包括但不限于正弦窗函数,矩形窗函数,三角窗函数,汉宁窗函数(Hanning),高斯窗函数(Gaussian),布莱克曼窗函数(Blackman),切比雪夫窗函数(Chebyshev),汉明窗函数(Hamming),平顶窗函数(Flap Top),凯塞窗函数(Kaiser)。当然,也可不进行加窗处理,也即相当于加了矩形窗。In this embodiment, multi-channel digital audio signals can also be windowed, wherein the analysis window function includes but is not limited to a sine window function, a rectangular window function, a triangular window function, a Hanning window function (Hanning), a Gaussian window function ( Gaussian), Blackman, Chebyshev, Hamming, Flap Top, Kaiser. Of course, the windowing process may not be performed, which is equivalent to adding a rectangular window.
需要说明的是,本实施例中可以先进行增益补偿再进行加窗处理,也可以先进行加窗处理再进行增益补偿,还可以同时进行增益补偿和加窗处理。经过增益补偿和加窗处理后的第i路预处理电路的第l帧数字音频信号x′i(t)l为:It should be noted that, in this embodiment, the gain compensation may be performed first and then the windowing process may be performed, or the windowing process may be performed first and then the gain compensation may be performed, or the gain compensation and the windowing process may be performed simultaneously. The lth frame digital audio signal x′ i (t) l of the i-th preprocessing circuit after gain compensation and windowing processing is:
其中,hana(t)为N点分析窗函数。Among them, h ana (t) is the N-point analysis window function.
步骤S404、对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据。Step S404: Perform frequency domain conversion on the multi-channel digital audio signals after the gain compensation to obtain multi-channel frequency domain data.
本实施例中,对x′i(t)l进行离散傅里叶变换,得到第i路频域数据Xi(n)l:In this embodiment, discrete Fourier transform is performed on x′ i (t) l to obtain the i-th frequency domain data X i (n) l :
其中,n为离散频谱序列,e为自然常数,为虚数单位。本实施例中可采用快速傅里叶变换进行加速计算。Among them, n is a discrete spectrum sequence, e is a natural constant, is an imaginary unit. In this embodiment, fast Fourier transform may be used to perform accelerated calculation.
步骤S405、对所述多路频域数据进行相位补偿。Step S405: Perform phase compensation on the multi-channel frequency domain data.
本实施例中,可选择多路频域数据中任一路频域数据作为参考频域数据,不失一般性的,以第1路频域数据作为参考频域数据,获取多路频域数据中每一路频域数据与参考频域数据之间的相位差。本实施例中可根据如下公式获取第l帧第i路频域数据Xi(n)l与参考频域数据在第n个采样点的相位差Pd,i(n)l:In this embodiment, any channel of frequency-domain data in the multi-channel frequency-domain data can be selected as the reference frequency-domain data. Without loss of generality, the first channel of frequency-domain data is used as the reference frequency-domain data. The phase difference between each channel of frequency domain data and the reference frequency domain data. In this embodiment, the phase difference P d, i (n) l of the ith frequency domain data X i (n) l of the l th frame and the reference frequency domain data at the n th sampling point can be obtained according to the following formula:
其中,运算(·)*表示复数的共轭,运算angle(·)表示复数的相位。Here, the operation (·) * represents the conjugate of the complex number, and the operation angle(·) represents the phase of the complex number.
然后,根据所述相位差对对应的一路多路频域数据进行相位补偿,第l帧第i路多路频域数据线性补偿相位谱Pc,i(n)l为:Then, phase compensation is performed on the corresponding one-way multi-channel frequency-domain data according to the phase difference, and the i-th multi-channel frequency-domain data linear compensation phase spectrum P c of the 1 th frame, i (n) l is:
其中,1≤n1≤n2≤N/2,Pd,i(n1)l和Pd,i(n2)l之间的参数用于线性相位拟合。该公式为常数项为0的线性相位拟合公式,Pd,i(n)l为第n个采样点的相位值,线性拟合后的斜率为进而第l帧第i路相位补偿复数谱Pi(n)l为:Among them, 1≤n 1 ≤n 2 ≤N/2, the parameters between P d,i (n 1 ) l and P d,i (n 2 ) l are used for linear phase fitting. This formula is a linear phase fitting formula with a constant term of 0, P d, i (n) l is the phase value of the nth sampling point, and the slope after linear fitting is Then the i-th phase compensation complex spectrum P i (n) l of the l-th frame is:
于是,相位补偿之后的多路频域数据X′i(n)l为:Therefore, the multi-channel frequency domain data X′ i (n) l after phase compensation is:
X′i(n)l=Pi(n)lXi(n)l X′ i (n) l =P i (n) l X i (n) l
需要说明的是,若多个预处理电路相互之间无时延,则无需进行相位补偿,也即Pi(n)l=1。此外,若各频域数据的相互之间的时间差固定且已知,也即多个预处理电路之间的时间差固定且已知,例如第i路频域数据超前第1路频域数据的时间差为τi,则可根据时间差进行相位补偿,上述的第l帧第i路多路频域数据线性补偿相位谱Pc,i(n)l可为:It should be noted that, if there is no time delay between multiple preprocessing circuits, no phase compensation is required, that is, P i (n) l =1. In addition, if the time difference between each frequency domain data is fixed and known, that is, the time difference between multiple preprocessing circuits is fixed and known, for example, the time difference between the i-th frequency domain data and the first frequency domain data is ahead is τ i , the phase compensation can be performed according to the time difference, and the linear compensation phase spectrum P c of the i-th multi-channel frequency-domain data of the 1 th frame above, i (n) l can be:
Pc,i(n)l=2πfsτin/N n=1,2,...,NP c, i (n) l = 2πf s τ i n/N n = 1, 2, ..., N
步骤S406、根据相位补偿之后的多路频域数据确定频域融合数据。Step S406: Determine frequency domain fusion data according to the multi-channel frequency domain data after phase compensation.
在本实施例中,各预处理电路之间的系统综合噪声通常互不相关且各预处理电路之间的系统综合噪声与待处理的模拟音频信号互不相关,在进行多路频域数据融合时可根据各频域数据之间的相似程度辨识出有效信号与系统综合噪声,具体的,对于任意两路相位补偿之后的频域数据X′i(n)l和X′j(n)l,获取该两路频域数据在第n个采样点上的相关性参数Cij(n)l:In this embodiment, the system integrated noise between the preprocessing circuits is usually uncorrelated with each other, and the system integrated noise between the preprocessing circuits and the analog audio signal to be processed are uncorrelated with each other. At the same time, the effective signal and the system integrated noise can be identified according to the similarity between the frequency domain data. Specifically, for the frequency domain data X′ i (n) l and X′ j (n) l after any two phase compensation , obtain the correlation parameter C ij (n) l of the two channels of frequency domain data at the nth sampling point:
其中,运算min(a,b)为a和b的较小值,运算|·|表示复数的模。0≤Cij(n)l≤1,Cij(n)越大表示频域数据X′i和X′j在第n个采样点的信号越相似,有效信号的比例越高。Among them, the operation min(a, b) is the smaller value of a and b, and the operation |·| represents the modulus of the complex number. 0≤C ij (n) l ≤ 1, a larger C ij (n) indicates that the signals of the frequency domain data X′ i and X′ j at the nth sampling point are more similar, and the ratio of valid signals is higher.
进一步的,可通过如下公式获取第l帧单边频域融合数据 Further, the 1st frame unilateral frequency domain fusion data can be obtained by the following formula:
其中,1≤n≤N/2+1。Among them, 1≤n≤N/2+1.
由于频域融合数据的对称性,可将单边频谱数据转换为多边频谱数据,作为所述频域融合数据,第l帧频域融合数据Xsyn(n)l如下:Due to the symmetry of the frequency domain fusion data, the unilateral spectral data can be converted into multilateral spectral data, and as the frequency domain fusion data, the frequency domain fusion data X syn (n) l of the first frame is as follows:
步骤S407、将所述频域融合数据转换为时域音频信号。Step S407: Convert the frequency domain fusion data into a time domain audio signal.
本实施例中,将第l帧频域融合数据Xsyn(n)l通过离散傅里叶变换的逆变换进行处理,得到第l帧时域音频信号x′syn(t)l:In this embodiment, the 1st frame frequency domain fusion data X syn (n) 1 is processed through the inverse transform of discrete Fourier transform to obtain the 1 th frame time domain audio signal x′ syn (t) 1 :
具体可采用快速傅里叶变换的逆变换进行加速计算。Specifically, the inverse transform of the fast Fourier transform can be used to accelerate the calculation.
步骤S408、对所述时域音频信号进行加窗处理。Step S408, performing windowing processing on the time-domain audio signal.
在本实施例中,将第l帧时域音频信号x′syn(t)l进行加窗处理,得到加窗后的时域音频信号 In this embodiment, windowing is performed on the time-domain audio signal x′ syn (t) l of the lth frame to obtain a windowed time-domain audio signal
其中,hsyn(t)为N点合成窗函数,其中合成窗函数包括但不限于正弦窗函数,矩形窗函数,三角窗函数,汉宁窗函数(Hanning),高斯窗函数(Gaussian),布莱克曼窗函数(Blackman),切比雪夫窗函数(Chebyshev),汉明窗函数(Hamming),平顶窗函数(FlapTop),凯塞窗函数(Kaiser)。当然,也可不进行加窗处理,也即相当于加了矩形窗。Among them, h syn (t) is an N-point synthesis window function, wherein the synthesis window function includes but is not limited to a sine window function, a rectangular window function, a triangular window function, a Hanning window function (Hanning), a Gaussian window function (Gaussian), Black Blackman, Chebyshev, Hamming, FlapTop, Kaiser. Of course, the windowing process may not be performed, which is equivalent to adding a rectangular window.
步骤S409、将所述当前帧时域音频信号与当前帧时域音频信号之前获取的历史帧时域音频信号进行叠加处理以获取当前帧时域融合音频信号,并根据当前帧时域融合音频信号获取输出音频信号。Step S409, performing superposition processing on the current frame time domain audio signal and the historical frame time domain audio signal obtained before the current frame time domain audio signal to obtain the current frame time domain fusion audio signal, and according to the current frame time domain fusion audio signal Get the output audio signal.
在本实施例中,可将当前帧时域音频信号与前一帧时域音频信号中重叠部分进行重叠叠加运算,从而得到当前帧时域融合音频信号:In this embodiment, the overlapping part of the current frame time domain audio signal and the previous frame time domain audio signal can be overlapped and superimposed to obtain the current frame time domain fusion audio signal:
其中,xsyn(t+M)l-1为第l-1帧重叠累加N点时域信号;若l=1,xsyn(t+M)l-1=0。于是,第l帧重叠累加M点后,输出该重叠的M点对应的第l帧的当前帧输出音频信号y(n)l为:Among them, x syn (t+M) l-1 is the time domain signal of the 1-1th frame overlapping and accumulating N points; if l=1, x syn (t+M) l-1 =0. Therefore, after the lth frame overlaps and accumulates M points, the current frame output audio signal y(n) l of the lth frame corresponding to the overlapped M point is output as:
y(n)l=xsyn(t)l t=1,2,...,My(n) l = x syn (t) l t = 1, 2, ..., M
在得到输出音频信号y(n)l,即可作为输出音频信号,录音保存或实时播放。After the output audio signal y(n) l is obtained, it can be used as the output audio signal, and the recording can be saved or played in real time.
在上述实施例中,存在一种特殊情况为,所有预处理电路包括的放大器的模拟增益相同,各预处理电路的系统综合噪声服从正态分布,且噪声能量期望一致,相关性系数Cij(k)皆为1,则输出音频信号的噪声能量期望为单预处理电路噪声能量的倍,信噪比提高了倍。若合理配置各预处理电路放大器的增益以及增加相关性检测,则抑制系统底噪的效果更佳。In the above embodiment, there is a special case that the analog gains of the amplifiers included in all the preprocessing circuits are the same, the system integrated noise of each preprocessing circuit obeys a normal distribution, and the noise energy is expected to be consistent, and the correlation coefficient C ij ( k) are all 1, then the noise energy of the output audio signal is expected to be the sum of the noise energy of the single preprocessing circuit times, the signal-to-noise ratio is improved times. If the gain of each preprocessing circuit amplifier is properly configured and the correlation detection is increased, the effect of suppressing the noise floor of the system will be better.
本发明实施例提供的音频信号处理方法中的具体原理和实现方式均与上述实施例类似,此处不再赘述。The specific principles and implementation manners of the audio signal processing method provided by the embodiments of the present invention are similar to those of the foregoing embodiments, and details are not described herein again.
本实施例提供的音频信号处理方法,通过利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;根据所述多路频域数据确定频域融合数据;将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。本实施例提供的方法将对待处理的模拟音频信号通过不同的预处理电路进行处理,并考虑到不同预处理电路的系统综合噪声互不相关、且与待处理的模拟音频信号也互不相关,通过在频域上进行融合后可提高有效信号的比例、降低系统综合噪声的比例,从而有效的提高了录音系统的信噪比。In the audio signal processing method provided by this embodiment, a plurality of preprocessing circuits are used to process an analog audio signal to be processed to obtain a multi-channel digital audio signal, wherein each of the plurality of preprocessing circuits includes an amplifier and a an analog-to-digital converter; performing gain compensation on the multi-channel digital audio signals, and performing frequency domain conversion on the multi-channel digital audio signals after the gain compensation to obtain multi-channel frequency domain data; determining the frequency domain according to the multi-channel frequency domain data domain fusion data; converting the frequency domain fusion data into a time domain audio signal, and obtaining an output audio signal according to the time domain audio signal. In the method provided in this embodiment, the analog audio signal to be processed is processed by different preprocessing circuits, and considering that the system integrated noises of different preprocessing circuits are not correlated with each other and are not correlated with the analog audio signal to be processed, After fusion in the frequency domain, the proportion of effective signals can be increased, and the proportion of system integrated noise can be reduced, thereby effectively improving the signal-to-noise ratio of the recording system.
下面以一个具体实例进行举例说明,在该实例中,预处理电路为两个(即I=2),且两个预处理电路的增益相同,即G1=G2,两个预处理电路之间的时间差固定且为1个采样时间间隔,即τ2=1/fs。测试环境为普通安静房间,音源为语音信号。信号采样率为fs=48kHz,分析帧长N=2048,帧移M=1024。分析窗和合成窗分别皆为正弦窗:A specific example is used as an example below. In this example, there are two preprocessing circuits (that is, I=2), and the gains of the two preprocessing circuits are the same, that is, G 1 =G 2 . The time difference between them is fixed and is 1 sampling time interval, that is, τ 2 =1/f s . The test environment is an ordinary quiet room, and the audio source is a voice signal. The signal sampling rate is f s =48kHz, the analysis frame length is N=2048, and the frame shift is M=1024. Both the analysis window and the synthesis window are sinusoidal windows:
由于两个预处理电路的增益相同,因此两个预处理电路的数字音频信号(时域信号)相似,如图6a所示(仅示出其中一路),两个预处理电路的数字音频信号的时频谱也相似,如图6b所示(仅示出其中一路),而经过本实施例的音频信号处理方法进行处理后,输出音频信号的时域信号和时频谱分别如图7a和图7b所示。比较图6a和图7a的数字音频信号的时域信号,可见,在语音信号段,两者差异不明显,而在噪声段,本实施例输出音频信号的噪声明显小于未经处理的数字音频信号的噪声。比较图6b和图7b的数字音频信号的时频谱,图6b信号中包含了覆盖全频段的白噪声,即为系统底噪,而图7b中本实施例输出音频信号明显优于图6b,在噪声段,系统底噪在全频段内都得到了极大的抑制。通过比较,未经处理的数字音频信号中500Hz以下频率为低频分量较多的背景噪声信号,本实施例输出音频信号能基本完全保留该有效信号;未经处理的数字音频信号中500Hz以上的中高频噪声成分基本都是系统底噪,本实施例输出音频信号可以抑制噪声超过10dB。在语音段,有效的语谱信息能清晰完整地保留,而系统底噪即使混合在语谱间,也可以有极大的噪声抑制效果。Since the gains of the two preprocessing circuits are the same, the digital audio signals (time domain signals) of the two preprocessing circuits are similar, as shown in Figure 6a (only one of them is shown), the digital audio signals of the two preprocessing circuits have The time spectrum is also similar, as shown in Figure 6b (only one of them is shown), and after processing by the audio signal processing method of this embodiment, the time domain signal and time spectrum of the output audio signal are shown in Figure 7a and Figure 7b respectively. Show. Comparing the time-domain signals of the digital audio signals in Fig. 6a and Fig. 7a, it can be seen that in the speech signal segment, the difference between the two is not obvious, while in the noise segment, the noise of the output audio signal of this embodiment is significantly smaller than that of the unprocessed digital audio signal noise. Comparing the time spectrum of the digital audio signal in Fig. 6b and Fig. 7b, the signal in Fig. 6b contains white noise covering the whole frequency band, which is the system noise floor, and the output audio signal of this embodiment in Fig. 7b is obviously better than that in Fig. 6b. In the noise section, the system noise floor has been greatly suppressed in the whole frequency band. By comparison, the frequency below 500Hz in the unprocessed digital audio signal is a background noise signal with more low-frequency components, and the output audio signal of this embodiment can basically completely retain the effective signal; The frequency noise components are basically the system noise floor, and the output audio signal in this embodiment can suppress the noise by more than 10 dB. In the speech segment, the effective spectral information can be retained clearly and completely, and even if the system noise floor is mixed between the speech spectra, it can also have a great noise suppression effect.
本发明实施例提供一种音频信号处理设备。图8为本发明实施例提供的音频信号处理设备的结构图,如图8所示,音频信号处理设备50包括存储器52和处理器51。Embodiments of the present invention provide an audio signal processing device. FIG. 8 is a structural diagram of an audio signal processing device provided by an embodiment of the present invention. As shown in FIG. 8 , the audio
所述存储器52用于存储程序代码;The
所述处理器51,调用所述程序代码,当程序代码被执行时,用于执行以下操作:The
利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;Utilize a plurality of preprocessing circuits to process the analog audio signals to be processed to obtain multi-channel digital audio signals, wherein each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter;
对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;Perform gain compensation on the multi-channel digital audio signal, and perform frequency domain conversion on the multi-channel digital audio signal after the gain compensation to obtain multi-channel frequency domain data;
根据所述多路频域数据确定频域融合数据;Determine frequency domain fusion data according to the multi-channel frequency domain data;
将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。Converting the frequency-domain fusion data into a time-domain audio signal, and obtaining an output audio signal according to the time-domain audio signal.
在上述任一实施例的基础上,在所述处理器51对所述多路数字音频信号进行增益补偿时,所述处理器51被配置为:On the basis of any of the above embodiments, when the
根据多路数字音频信号中每一路数字音频信号对应的增益补偿系数对所述每一路数字音频信号进行增益补偿,其中,多路数字音频信号中每一路数字音频信号对应的增益补偿系数各不相同。Gain compensation is performed on each channel of digital audio signal according to the gain compensation coefficient corresponding to each channel of digital audio signal in the multichannel digital audio signal, wherein the gain compensation coefficient corresponding to each channel of digital audio signal in the multichannel digital audio signal is different from each other .
在上述任一实施例的基础上,每一路数字音频信号对应的增益补偿系数是根据参考增益和所述每一路数字音频信号对应的预处理电路包括的放大器的模拟增益确定的,其中,所述参考增益为多个预处理电路中的参考预处理电路包括的放大器的模拟增益。On the basis of any of the above embodiments, the gain compensation coefficient corresponding to each channel of digital audio signal is determined according to the reference gain and the analog gain of the amplifier included in the preprocessing circuit corresponding to each channel of digital audio signal, wherein the The reference gain is an analog gain of an amplifier included in the reference preprocessing circuit among the plurality of preprocessing circuits.
在上述任一实施例的基础上,在所述处理器51根据所述多路频域数据确定频域融合数据时,所述处理器51被配置为:On the basis of any of the foregoing embodiments, when the
获取多路频域数据中每两路频域数据之间的相关性参数;Obtain the correlation parameter between every two channels of frequency domain data in the multi-channel frequency domain data;
根据所述每两路频域数据之间的相关性参数和与所述相关性参数对应的两路频域数据确定频域融合数据。The frequency domain fusion data is determined according to the correlation parameter between the two channels of frequency domain data and the two channels of frequency domain data corresponding to the correlation parameter.
在上述任一实施例的基础上,所述处理器51还被配置为:On the basis of any of the foregoing embodiments, the
对所述多路频域数据进行相位补偿;performing phase compensation on the multi-channel frequency domain data;
在所述处理器51根据所述多路频域数据确定频域融合数据时,所述处理器51被配置为:When the
根据相位补偿之后的多路频域数据确定频域融合数据。The frequency domain fusion data is determined according to the multi-channel frequency domain data after phase compensation.
在上述任一实施例的基础上,在所述处理器51对所述多路频域数据进行相位补偿时,所述处理器51被配置为:On the basis of any of the foregoing embodiments, when the
确定多路频域数据中每一路频域数据与多路频域数据中的参考频域数据之间的相位差;determining the phase difference between each channel of frequency domain data in the multichannel frequency domain data and the reference frequency domain data in the multichannel frequency domain data;
根据所述相位差对对应的一路多路频域数据进行相位补偿。Phase compensation is performed on the corresponding one channel and multiple channels of frequency domain data according to the phase difference.
在上述任一实施例的基础上,所述时域音频信号为当前帧时域音频信号,其中,在所述处理器51根据所述时域音频信号获取输出音频信号时,所述处理器51被配置为:On the basis of any of the above embodiments, the time-domain audio signal is a time-domain audio signal of the current frame, wherein when the
将所述当前帧时域音频信号与当前帧时域音频信号之前获取的历史帧时域音频信号进行叠加处理以获取当前帧时域融合音频信号;performing superimposition processing on the current frame time domain audio signal and the historical frame time domain audio signal obtained before the current frame time domain audio signal to obtain the current frame time domain fusion audio signal;
根据所述当前帧时域融合音频信号确定所述输出音频信号。The output audio signal is determined according to the time-domain fusion audio signal of the current frame.
在上述任一实施例的基础上,在所述处理器51对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据时,所述处理器51被配置为:On the basis of any of the above embodiments, when the
对所述增益补偿之后的多路数字音频信号进行加窗处理;Windowing is performed on the multi-channel digital audio signals after the gain compensation;
根据加窗后的多路数字音频信号进行频域转换以获取多路频域数据。Perform frequency domain conversion according to the windowed multi-channel digital audio signals to obtain multi-channel frequency domain data.
在上述任一实施例的基础上,在所述处理器51根据所述时域音频信号获取输出音频信号时,所述处理器51被配置为:On the basis of any of the foregoing embodiments, when the
对所述时域音频信号进行加窗处理;performing windowing processing on the time-domain audio signal;
根据所述加窗处理之后的时域音频信号确定所述输出音频信号。The output audio signal is determined from the time-domain audio signal after the windowing process.
本发明实施例提供的音频信号处理设备的具体原理和实现方式均与上述实施例类似,此处不再赘述。The specific principles and implementation manners of the audio signal processing device provided by the embodiments of the present invention are similar to those of the above-mentioned embodiments, and details are not described herein again.
本实施例提供的音频信号处理设备,通过利用多个预处理电路对待处理的模拟音频信号进行处理以获取多路数字音频信号,其中,多个预处理电路中的每一个预处理电路包括放大器和模数转换器;对所述多路数字音频信号进行增益补偿,并对增益补偿之后的多路数字音频信号进行频域转换以获取多路频域数据;根据所述多路频域数据确定频域融合数据;将所述频域融合数据转换为时域音频信号,并根据所述时域音频信号获取输出音频信号。本实施例将对待处理的模拟音频信号通过不同的预处理电路进行处理,并考虑到不同预处理电路的系统综合噪声互不相关、且与待处理的模拟音频信号也互不相关,通过在频域上进行融合后可提高有效信号的比例、降低系统综合噪声的比例,从而有效的提高了录音系统的信噪比。The audio signal processing device provided in this embodiment obtains multi-channel digital audio signals by processing analog audio signals to be processed by using multiple preprocessing circuits, wherein each of the multiple preprocessing circuits includes an amplifier and a an analog-to-digital converter; performing gain compensation on the multi-channel digital audio signals, and performing frequency domain conversion on the multi-channel digital audio signals after the gain compensation to obtain multi-channel frequency domain data; determining the frequency domain according to the multi-channel frequency domain data domain fusion data; converting the frequency domain fusion data into a time domain audio signal, and obtaining an output audio signal according to the time domain audio signal. In this embodiment, the analog audio signal to be processed is processed by different preprocessing circuits, and considering that the system integrated noises of different preprocessing circuits are not correlated with each other and are not correlated with the analog audio signal to be processed. After the fusion in the domain, the ratio of effective signals can be increased and the ratio of system integrated noise can be reduced, thereby effectively improving the signal-to-noise ratio of the recording system.
本发明实施例提供一种录音系统,所述录音系统包括:麦克风,用于采集模拟音频信号;以及如上述实施例所述的音频处理设备50。An embodiment of the present invention provides a recording system, where the recording system includes: a microphone for collecting analog audio signals; and the
另外,本实施例还提供一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行以实现上述实施例所述的...方法。In addition, this embodiment also provides a computer-readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement the method described in the foregoing embodiment.
在本发明所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present invention, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated unit may be implemented in the form of hardware, or may be implemented in the form of hardware plus software functional units.
上述以软件功能单元的形式实现的集成的单元,可以存储在一个计算机可读取存储介质中。上述软件功能单元存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本发明各个实施例所述方法的部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The above-mentioned integrated units implemented in the form of software functional units can be stored in a computer-readable storage medium. The above-mentioned software functional unit is stored in a storage medium, and includes several instructions to cause a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor (processor) to execute the methods described in the various embodiments of the present invention. some steps. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk and other media that can store program codes .
本领域技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。上述描述的装置的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of the description, only the division of the above functional modules is used for illustration. The internal structure is divided into different functional modules to complete all or part of the functions described above. For the specific working process of the apparatus described above, reference may be made to the corresponding process in the foregoing method embodiments, and details are not described herein again.
最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The technical solutions described in the foregoing embodiments can still be modified, or some or all of the technical features thereof can be equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the embodiments of the present invention. scope.
Claims (20)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2019/083070 WO2020211017A1 (en) | 2019-04-17 | 2019-04-17 | Audio signal processing method and device, and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN111699700A true CN111699700A (en) | 2020-09-22 |
Family
ID=72476460
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201980012218.XA Pending CN111699700A (en) | 2019-04-17 | 2019-04-17 | Audio signal processing method, apparatus and storage medium |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN111699700A (en) |
| WO (1) | WO2020211017A1 (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114007176A (en) * | 2020-10-09 | 2022-02-01 | 上海又为智能科技有限公司 | Audio signal processing method, apparatus and storage medium for reducing signal delay |
| WO2022199288A1 (en) * | 2021-03-22 | 2022-09-29 | Oppo广东移动通信有限公司 | Audio signal processing method and apparatus, terminal, and storage medium |
| CN115695912A (en) * | 2022-10-27 | 2023-02-03 | 北京华建云鼎科技股份公司 | Multi-channel ASI signal processing system |
| CN120567214A (en) * | 2025-07-30 | 2025-08-29 | 苏州元脑智能科技有限公司 | Signal processing method and device |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107895580A (en) * | 2016-09-30 | 2018-04-10 | 华为技术有限公司 | The method for reconstructing and device of a kind of audio signal |
| EP3324592A1 (en) * | 2016-11-17 | 2018-05-23 | Nokia Technologies OY | Processing wireless communication signals |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101552010B (en) * | 2009-04-30 | 2011-09-14 | 华为技术有限公司 | Audio treating method and audio treating device |
| US9986356B2 (en) * | 2012-02-15 | 2018-05-29 | Harman International Industries, Incorporated | Audio surround processing system |
| CN106782562A (en) * | 2016-12-20 | 2017-05-31 | Tcl通力电子(惠州)有限公司 | Audio-frequency processing method, apparatus and system |
| CN207676652U (en) * | 2017-12-11 | 2018-07-31 | 科大讯飞股份有限公司 | A kind of multi-path voice signal transmission and receiving and processing equipment and system |
-
2019
- 2019-04-17 WO PCT/CN2019/083070 patent/WO2020211017A1/en not_active Ceased
- 2019-04-17 CN CN201980012218.XA patent/CN111699700A/en active Pending
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107895580A (en) * | 2016-09-30 | 2018-04-10 | 华为技术有限公司 | The method for reconstructing and device of a kind of audio signal |
| EP3324592A1 (en) * | 2016-11-17 | 2018-05-23 | Nokia Technologies OY | Processing wireless communication signals |
Non-Patent Citations (1)
| Title |
|---|
| 赵洋: "基于多通道的高速并行采样信号重建算法", 《中国优秀硕士学位论文全文数据库(电子期刊)信息科技辑》 * |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114007176A (en) * | 2020-10-09 | 2022-02-01 | 上海又为智能科技有限公司 | Audio signal processing method, apparatus and storage medium for reducing signal delay |
| CN114007176B (en) * | 2020-10-09 | 2023-12-19 | 上海又为智能科技有限公司 | Audio signal processing method, device and storage medium for reducing signal delay |
| US12417779B2 (en) | 2020-10-09 | 2025-09-16 | Evoco Labs Co., Ltd. | Audio signal processing method, device and storage medium for reducing signal delay |
| WO2022199288A1 (en) * | 2021-03-22 | 2022-09-29 | Oppo广东移动通信有限公司 | Audio signal processing method and apparatus, terminal, and storage medium |
| CN115695912A (en) * | 2022-10-27 | 2023-02-03 | 北京华建云鼎科技股份公司 | Multi-channel ASI signal processing system |
| CN115695912B (en) * | 2022-10-27 | 2023-06-06 | 北京华建云鼎科技股份公司 | Multipath ASI signal processing system |
| CN120567214A (en) * | 2025-07-30 | 2025-08-29 | 苏州元脑智能科技有限公司 | Signal processing method and device |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2020211017A1 (en) | 2020-10-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6889698B2 (en) | Methods and devices for amplifying audio | |
| JP5007442B2 (en) | System and method using level differences between microphones for speech improvement | |
| CN101916567B (en) | Speech enhancement method applied to dual-microphone system | |
| US20070165879A1 (en) | Dual Microphone System and Method for Enhancing Voice Quality | |
| CN111699700A (en) | Audio signal processing method, apparatus and storage medium | |
| CN110012331B (en) | Infrared-triggered far-field double-microphone far-field speech recognition method | |
| US20240177726A1 (en) | Speech enhancement | |
| CN101904182A (en) | Enhanced blind source separation algorithm for highly correlated mixtures | |
| JP2010539792A (en) | Speech enhancement | |
| US10623854B2 (en) | Sub-band mixing of multiple microphones | |
| CN111345047A (en) | Audio signal processing method, device and storage medium | |
| CN111883160A (en) | Method and device for sound pickup and noise reduction of directional microphone array | |
| Jin et al. | Multi-channel noise reduction for hands-free voice communication on mobile phones | |
| CN115831145A (en) | A dual-microphone speech enhancement method and system | |
| CN112802490B (en) | A beamforming method and device based on a microphone array | |
| WO2021120795A1 (en) | Sampling rate processing method, apparatus and system, and storage medium and computer device | |
| CN106340302A (en) | De-reverberation method and device for speech data | |
| US20240363131A1 (en) | Speech enhancement | |
| CN115348507A (en) | Impulse noise suppression method, system, readable storage medium and computer equipment | |
| WO2020107455A1 (en) | Voice processing method and apparatus, storage medium, and electronic device | |
| JP2020053920A (en) | Sound collection device, sound collection program, and sound collection method | |
| CN114550700A (en) | Multi-channel speech recognition method combining instantaneous frequency | |
| CN115410593A (en) | Audio channel selection method, device, equipment and storage medium | |
| CN119545255B (en) | Adaptive array microphone cascade audio mixing method | |
| CN120783782B (en) | Sound mixing method, audio equipment and sound mixing system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20200922 |
|
| WD01 | Invention patent application deemed withdrawn after publication |