TWI861755B

TWI861755B - Microphone array system and noise filtering method for microphone array

Info

Publication number: TWI861755B
Application number: TW112108824A
Authority: TW
Inventors: 劉名祐
Original assignee: 圓展科技股份有限公司
Priority date: 2023-03-10
Filing date: 2023-03-10
Publication date: 2024-11-11
Also published as: TW202437240A

Abstract

A noise filtering method for a microphone array, performed by a processing device, includes: setting a desired range according to a user command; setting a plurality of virtual ranges based on the desired range, wherein a distance between a center of each of the virtual ranges and a center of the desired range is not greater than a preset distance; and controlling output audio corresponding to each of the desired range and the virtual ranges according to the audio energy corresponding to the desired range and the virtual ranges, respectively.

Description

Microphone array system and noise filtering method of microphone array

The present invention relates to a microphone array system and a noise filtering method for a microphone array.

As online meetings become more common, the market for online meeting equipment keeps growing, and the microphone is one of the most important pieces of that equipment. The pickup quality of the microphone affects the quality of the meeting. In particular, when several people are around one microphone at the same time, the voices of people other than the main speaker are also transmitted to the remote participants, which seriously interferes with the output audio corresponding to the main speaker.

In view of the above, the present invention provides a microphone array system and a noise filtering method for a microphone array.

According to an embodiment of the present invention, a noise filtering method for a microphone array is performed by a processing device and includes: setting a desired range according to a user instruction; setting a plurality of virtual ranges based on the desired range, wherein the distance between the center of each of the virtual ranges and the center of the desired range is not greater than a preset distance; and controlling the output audio corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges.

A microphone array system according to an embodiment of the present invention includes a microphone array and a processing device. The microphone array is used to receive audio of a desired range and a plurality of virtual ranges. The processing device is connected to the microphone array and is used to perform: setting the desired range according to a user instruction; setting the virtual ranges based on the desired range, wherein the distance between the center of each of the virtual ranges and the center of the desired range is not greater than a preset distance; and controlling the output audio corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges.

In summary, the microphone array system and the noise filtering method for a microphone array according to one or more embodiments of the present invention can provide users with a more flexible way of picking up sound.

The above description of the present disclosure and the following description of the embodiments are intended to demonstrate and explain the spirit and principle of the present invention, and to provide further explanation of the scope of the claims of the present invention.

The detailed features and advantages of the present invention are described in the embodiments below. The content is sufficient to enable any person skilled in the relevant art to understand the technical content of the present invention and implement it accordingly, and, based on the content disclosed in this specification, the claims and the drawings, any person skilled in the relevant art can readily understand the related objectives and advantages of the present invention. The following embodiments further explain aspects of the present invention in detail, but do not limit the scope of the present invention in any respect.

Please refer to FIG. 1, which is a block diagram of a microphone array system according to an embodiment of the present invention. The microphone array system 1 includes a microphone array 11 and a processing device 12. The processing device 12 is connected to the microphone array 11 in a wired or wireless manner. The microphone array 11 is used to receive audio from its surroundings, and the processing device 12 is used to perform noise filtering on the audio received by the microphone array 11 and may transmit the filtered audio to another, remote processing device. The processing device 12 may include one or more processors, such as a central processing unit, a graphics processing unit, a microcontroller, a programmable logic controller, or another processor with signal processing capability.

Please refer to FIG. 1 and FIG. 2 together, where FIG. 2 is a schematic diagram showing the microphone array applied to an online conference. In FIG. 2, the processing device 12 may output the filtered audio to a remote processing device that is taking part in the same online conference. The microphone array 11 may be a ceiling-mounted microphone array, but the present invention is not limited thereto. Taking FIG. 2 as an example, the microphone array 11 and the processing device 12 may be arranged together in one housing and mounted together on the ceiling, or the processing device 12 may be arranged separately from the microphone array 11 and communicate with it wirelessly.

Please refer to FIG. 1, FIG. 3, FIG. 4A and FIG. 4B together, where FIG. 3 is a flow chart of a noise filtering method for a microphone array according to an embodiment of the present invention, and FIG. 4A and FIG. 4B are schematic diagrams showing a desired range and virtual ranges. The noise filtering method shown in FIG. 3 can be performed by the microphone array system 1 of FIG. 1, in particular by the processing device 12 of the microphone array system 1. As shown in FIG. 3, the noise filtering method for a microphone array includes: step S101: setting a desired range according to a user instruction; step S103: setting a plurality of virtual ranges based on the desired range, wherein the distance between the center of each virtual range and the center of the desired range is not greater than a preset distance; and step S105: controlling the output audio corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges.

In step S101, the processing device 12 sets a desired range according to a user instruction, where the desired range indicates the sound pickup range specified by the user. Taking an online conference as an example, the desired range may be a range centered on the microphone array 11. Taking FIG. 4A as an example, the desired range A0 set according to the user instruction may be circular, and the projection of the microphone array 11 on the ground may fall at least partially within the projection of the desired range A0 on the ground. Similarly, taking FIG. 4B as an example, the desired range B0 set according to the user instruction may be elliptical, and the projection of the microphone array 11 on the ground may fall at least partially within the projection of the desired range B0 on the ground. There may be one or more desired ranges, and their shapes and sizes may be set according to the needs of the user; the present invention is not limited in this respect.

In step S103, the processing device 12 sets a plurality of virtual ranges around the desired range. The distance between the center of each virtual range and the center of the desired range is not greater than a preset distance, and each virtual range may overlap or adjoin the desired range. For example, assuming that the desired range is circular, the distance between the center of the desired range and the center of each virtual range may be equal to the diameter of the desired range. In other words, the diameter of the desired range may serve as the preset distance, but the present invention is not limited thereto.
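As an illustration of step S103, the following minimal Python sketch places virtual-range centers around a circular desired range, using the diameter of the desired range as the preset distance as in the example above. The function name `virtual_range_centers`, the 2-D coordinates and the even angular spacing are assumptions made for this sketch; the specification does not prescribe how the centers are distributed.

```python
import math


def virtual_range_centers(center, diameter, count=6):
    """Return `count` virtual-range centers spaced evenly around `center`.

    Each center lies at a distance equal to `diameter` (the example preset
    distance) from the center of the desired range, so the condition
    "not greater than the preset distance" holds for every center.
    Even angular spacing is an assumption of this sketch.
    """
    cx, cy = center
    centers = []
    for k in range(count):
        angle = 2.0 * math.pi * k / count
        centers.append((cx + diameter * math.cos(angle),
                        cy + diameter * math.sin(angle)))
    return centers


# Hypothetical desired range: centered under the array, 2 m in diameter.
centers = virtual_range_centers(center=(0.0, 0.0), diameter=2.0, count=6)
```

A smaller distance could be passed instead of the diameter if overlapping virtual ranges are preferred, since the text only requires that the distance not exceed the preset distance.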

Taking FIG. 4A and FIG. 4B as examples, the desired range A0 is surrounded by virtual ranges A1 to A6, and the desired range B0 is surrounded by virtual ranges B1 and B2. The number of virtual ranges may be at least three and not more than six, but the present invention does not limit the number of virtual ranges. In addition, the shapes and sizes of the virtual ranges may be the same as or different from one another, and may be the same as or different from those of the desired range; the present invention is not limited in this respect.

In step S105, the processing device 12 controls the output audio corresponding to the desired range and each virtual range according to the audio energy corresponding to each of the desired range and the virtual ranges. The microphone array 11 is used to receive the audio of the desired range and the virtual ranges, where the audio may be an audio signal obtained through beamforming, and the beamforming computation may include delay and sum, minimum variance distortionless response (MVDR), generalized sidelobe canceller (GSC), and the like. The processing device 12 receives the audio from the microphone array 11 and controls the audio output to the remote processing device. For example, the processing device 12 may increase the energy of the output audio of the desired range and/or suppress the energy of the output audio of the virtual ranges.
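To make the delay-and-sum option concrete, the following is a minimal sketch in plain Python, assuming integer sample delays and equal channel weights. The names `delay_and_sum` and `energy` and the sample values are hypothetical; the specification names the beamforming families but does not give an implementation.

```python
def delay_and_sum(channels, delays):
    """Average the channels after advancing each one by its sample delay.

    channels: list of equal-length sample lists, one per microphone.
    delays:   non-negative integer arrival delay (in samples) of each
              microphone for the steered range; hypothetical values that
              would normally be derived from the array geometry.
    """
    n = len(channels[0])
    out = [0.0] * n
    for samples, d in zip(channels, delays):
        for i in range(n - d):
            out[i] += samples[i + d]
    return [v / len(channels) for v in out]


def energy(samples):
    """Mean squared amplitude of a block of samples."""
    return sum(s * s for s in samples) / len(samples)


# A pulse reaches microphone 1 one sample later than microphone 0.
channels = [[0.0, 0.0, 1.0, 0.0, 0.0, 0.0],
            [0.0, 0.0, 0.0, 1.0, 0.0, 0.0]]
steered = delay_and_sum(channels, delays=[0, 1])     # delays match the source
unsteered = delay_and_sum(channels, delays=[0, 0])   # delays do not match
```

With matching delays the pulses add coherently, so the steered beam carries more energy than the unsteered one. One such beam per desired or virtual range would supply the per-range audio energy used in step S105.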

Please refer to FIG. 1 and FIG. 5 together, where FIG. 5 is a flow chart of a method for controlling output audio according to an embodiment of the present invention. The process shown in FIG. 5 can be performed after step S103 of FIG. 3 and can be regarded as a detailed flow chart of an embodiment of step S105 of FIG. 3. As shown in FIG. 5, the method for controlling output audio includes: step S201: determining whether a first energy value of the desired range is greater than a preset value; if the determination result of step S201 is "no", performing step S203: not outputting the output audio of the desired range and the virtual ranges; if the determination result of step S201 is "yes", performing the following for each virtual range: step S205: determining whether a second energy value of the virtual range is greater than the preset value; if the determination result of step S205 is "no", performing step S207: outputting the output audio of the desired range; and if the determination result of step S205 is "yes", performing step S209: reducing the audio energy of the output audio of the virtual range.

The processing device 12 may calculate an energy sum of the received audio energy of the desired range and the received audio energy of all virtual ranges, and divide the received audio energy of the desired range by the energy sum to obtain the first energy value. In step S201, the processing device 12 may determine whether the first energy value is greater than the preset value. If the processing device 12 determines that the first energy value is not greater than the preset value, this indicates that there may be no sound in the desired range. The processing device 12 may therefore perform step S203 and output no audio to the remote processing device.

If the processing device 12 determines that the first energy value is greater than the preset value, the processing device 12 further determines, one by one or simultaneously, whether the second energy value of the received audio of each virtual range is greater than the preset value, where the processing device 12 divides the received audio energy of each virtual range by the energy sum to obtain the second energy value of that virtual range. Taking one virtual range as an example, if the processing device 12 determines that the second energy value of the virtual range is not greater than the preset value, this indicates that there is sound in the desired range but no sound in the virtual range. The processing device 12 may therefore perform step S207 to output the audio of the desired range to the remote processing device, and may refrain from outputting the audio of the virtual range to the remote processing device. If the processing device 12 determines that the second energy value of the virtual range is greater than the preset value, this indicates that several sounds are present at the same time. The processing device 12 may therefore perform step S209 to reduce the energy of the output audio of the virtual range. For example, the processing device 12 may treat the sound picked up in the virtual range as a noise source and transmit the noise-filtered output audio to the remote processing device.

In other words, in the embodiment of FIG. 5, when there are multiple virtual ranges, the processing device 12 may control the corresponding output audio according to the second energy value of each virtual range. Therefore, when the determination result of step S201 is "yes", the processing device 12 may refrain from outputting the output audio of some virtual ranges while reducing the audio energy of the output audio of other virtual ranges; may refrain from outputting the output audio of all virtual ranges; or may reduce the audio energy of the output audio of all virtual ranges.
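The decision flow of FIG. 5 with ratio-type energy values can be sketched as follows. This is only an illustration under stated assumptions: the function names, the returned dictionary, the treatment of an all-zero energy sum as silence, and the numeric energies are hypothetical; only the comparisons against the preset value follow the text.

```python
def energy_ratios(desired_energy, virtual_energies):
    """Return the first energy value and the list of second energy values.

    Each value is the range's received audio energy divided by the energy
    sum of the desired range and all virtual ranges.
    """
    total = desired_energy + sum(virtual_energies)
    if total <= 0.0:
        # Assumption of this sketch: no energy anywhere is treated as silence.
        return 0.0, [0.0] * len(virtual_energies)
    return desired_energy / total, [e / total for e in virtual_energies]


def control_output(desired_energy, virtual_energies, preset=0.2):
    """Decide what to output, following steps S201 to S209 of FIG. 5."""
    first, seconds = energy_ratios(desired_energy, virtual_energies)
    if not first > preset:
        # S201 "no" -> S203: output nothing.
        return {"output_desired": False, "suppress": []}
    # S205 "yes" -> S209 for these virtual ranges (0-based indices);
    # the remaining virtual ranges follow S207 and are simply not output.
    suppress = [i for i, s in enumerate(seconds) if s > preset]
    return {"output_desired": True, "suppress": suppress}


# Hypothetical energies for six virtual ranges.
quiet = control_output(10.0, [10.0, 10.0, 10.0, 50.0, 10.0, 10.0])
alone = control_output(70.0, [5.0, 5.0, 5.0, 5.0, 5.0, 5.0])
mixed = control_output(50.0, [5.0, 5.0, 5.0, 5.0, 25.0, 5.0])
```

In the `mixed` case only index 4, that is, the fifth virtual range, exceeds the preset value and is marked for suppression, while the other virtual ranges are not output.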

In addition, the processing device 12 may also use the energy of the received audio directly as the first energy value and the second energy value to implement the embodiment of FIG. 5. Specifically, the processing device 12 may use the value of the received audio energy of the desired range as the first energy value, and use the value of the received audio energy of each virtual range as the second energy value of that virtual range. Therefore, in step S201 the processing device 12 may determine whether the first energy value, expressed as an energy, is greater than the preset value, and in step S205 the processing device 12 may determine whether the second energy value, expressed as an energy, is greater than the preset value.

It should also be noted that in the embodiment in which the first energy value and the second energy value are ratio values, the preset value may be a dimensionless scalar; in the embodiment in which the first energy value and the second energy value are plain energy values, the unit of the preset value may be decibels, and the preset values in these two embodiments may differ numerically. In addition, the first energy value, the second energy value and the preset value may also be implemented on different frequency scales, such as the Mel scale.

In addition to the above embodiments, the processing device 12 may also obtain the first energy value and the second energy value by applying the following formula (1) to the original energy value and the previous energy value of each of the desired range and the virtual ranges. The energy values in formula (1) may be ratio values (dimensionless) or energy values (in decibels).

δ(t)' = αδ(t-1) + (1-α)δ(t)    Formula (1)

where δ(t)' is the first energy value or the second energy value; α is the preset value, 0<α<1, and in the embodiment in which the energy values are ratio values, α is, for example, 0.2; δ(t-1) is the previous energy value; and δ(t) is the original energy value, wherein the time point corresponding to the previous energy value is earlier than the time point corresponding to the original energy value.

Furthermore, assuming that δ(t)' is the first energy value, the previous energy value δ(t-1) and the original energy value δ(t) are the energy values of two consecutive pieces of received audio of the desired range, where the original energy value δ(t) is obtained from the current received audio energy of the desired range, and the previous energy value δ(t-1) is obtained from the received audio energy of the desired range at the previous time point. Assuming that δ(t)' is the second energy value, the previous energy value δ(t-1) and the original energy value δ(t) are the energy values of two consecutive pieces of received audio of the corresponding virtual range, where the original energy value δ(t) is obtained from the current received audio energy of that virtual range, and the previous energy value δ(t-1) is obtained from the received audio energy of that virtual range at the previous time point.

The preset value α can be adjusted according to the number of virtual ranges and the desired response rate. The response rate refers to the rate of change from the previous energy value to the original energy value. For example, the more virtual ranges there are, the smaller the preset value α may be set; the faster the desired response rate, the smaller the preset value α may be set. Furthermore, the preset value α can be regarded as how much of the previous energy value is to be retained when calculating the first energy value or the second energy value.
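Formula (1), δ(t)' = αδ(t-1) + (1-α)δ(t), can be illustrated with a short Python sketch. The function name `smooth_energy`, the argument names and the sample values are hypothetical; the weighting and the constraint 0<α<1 follow the formula as stated in the claims.

```python
def smooth_energy(previous, original, alpha=0.2):
    """Apply formula (1): alpha * previous + (1 - alpha) * original.

    previous: energy value at the earlier time point, delta(t-1).
    original: energy value at the current time point, delta(t).
    alpha:    weight retained from the previous value, 0 < alpha < 1.
    """
    if not 0.0 < alpha < 1.0:
        raise ValueError("alpha must satisfy 0 < alpha < 1")
    return alpha * previous + (1.0 - alpha) * original


# Hypothetical ratio values for one range at two consecutive time points.
slow = smooth_energy(previous=0.5, original=1.0, alpha=0.8)
fast = smooth_energy(previous=0.5, original=1.0, alpha=0.2)
```

A smaller α keeps less of the previous energy value, so `fast` ends up closer to the current value than `slow`, which matches the statement that a faster response rate calls for a smaller α.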

Please next refer to FIG. 1 and FIG. 6 together, where FIG. 6 is a flow chart of a method for reducing the audio energy of the output audio of a virtual range according to an embodiment of the present invention. The process shown in FIG. 6 can be performed after the determination result of step S205 of FIG. 5 is "yes", and can be regarded as a detailed flow chart of an embodiment of step S209 of FIG. 5. As shown in FIG. 6, the method for reducing the audio energy of the output audio of a virtual range includes: step S301: subtracting a second spectrum of the output audio of the virtual range from a first spectrum of the output audio of the desired range to generate a third spectrum; and step S303: outputting the output audio corresponding to the third spectrum.

For a virtual range whose second energy value is greater than the preset value, in step S301 the processing device 12 subtracts the second spectrum of that virtual range from the first spectrum of the desired range to generate the third spectrum, where the spectra may be amplitude spectra or power spectra. In step S303, the processing device 12 outputs the audio corresponding to the third spectrum to the remote processing device.

Furthermore, if several virtual ranges each have a second energy value greater than the preset value, the processing device 12 may calculate an average spectrum of the second spectra of those virtual ranges and subtract the average spectrum from the first spectrum to generate the third spectrum.
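Steps S301 and S303, including the averaging over several virtual ranges, can be sketched as follows with spectra represented as lists of per-bin magnitudes. The function names and the sample spectra are hypothetical, and clamping negative bins to a floor of zero is a common spectral-subtraction safeguard that is assumed here; it is not stated in the specification.

```python
def average_spectrum(spectra):
    """Bin-wise mean of several equal-length spectra."""
    n = len(spectra)
    return [sum(bins) / n for bins in zip(*spectra)]


def subtract_spectrum(first, second, floor=0.0):
    """Step S301: subtract `second` from `first`, bin by bin.

    Bins that would become smaller than `floor` are clamped to `floor`
    (an assumption of this sketch).
    """
    return [max(a - b, floor) for a, b in zip(first, second)]


# Hypothetical amplitude spectra with three frequency bins.
first_spectrum = [4.0, 3.0, 2.0]                          # desired range
second_spectra = [[1.0, 1.0, 1.0], [3.0, 1.0, 3.0]]       # two loud virtual ranges
mean_second = average_spectrum(second_spectra)
third_spectrum = subtract_spectrum(first_spectrum, mean_second)
```

The third spectrum would then be converted back to a time-domain signal before being output in step S303; that conversion is outside the scope of this sketch.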

In addition, the microphone array system may further include an adaptive filter connected to the processing device 12 in a wired or wireless manner. Accordingly, after the determination result of step S205 of FIG. 5 is "yes", the processing device 12 may also control the adaptive filter to reduce the audio energy of the output audio of the virtual range.

Please refer to FIG. 7(a) to FIG. 7(c), which are example diagrams showing the audio energy corresponding to each of the desired range and the virtual ranges. The energy values in FIG. 7(a) to FIG. 7(c) are ratio values by way of example, and the preset value is 0.2.

In FIG. 7(a), the audio energy corresponding to the desired range is lower than the preset value. Even though the audio energy of virtual range 4 is higher than the preset value, according to one or more embodiments of the present invention the processing device may output no audio to the remote processing device (that is, mute).

In FIG. 7(b), the audio energy corresponding to the desired range is higher than the preset value, while the audio energy of every virtual range is lower than the preset value. According to one or more embodiments of the present invention, the processing device may output only the output audio corresponding to the desired range to the remote processing device.

In FIG. 7(c), the audio energy corresponding to the desired range and the audio energy corresponding to virtual range 5 are both higher than the preset value. The processing device may first reduce the audio energy corresponding to virtual range 5 according to one or more of the above embodiments, and then transmit the output audio to the remote processing device according to the reduced audio energy corresponding to virtual range 5 and the audio energy corresponding to the desired range.

In summary, the microphone array system and the noise filtering method for a microphone array according to one or more embodiments of the present invention can provide users with a more flexible way of picking up sound, and sound outside the desired range can be filtered out or suppressed.

Although the present invention is disclosed above by way of the foregoing embodiments, they are not intended to limit the present invention. Changes and modifications made without departing from the spirit and scope of the present invention fall within the scope of patent protection of the present invention. For the scope of protection defined by the present invention, please refer to the appended claims.

1: Microphone array system
11: Microphone array
12: Processing device
A0, B0: Desired range
A1-A6, B1, B2: Virtual range
S101, S103, S105, S201, S203, S205, S207, S209, S301, S303: Steps

FIG. 1 is a block diagram of a microphone array system according to an embodiment of the present invention.
FIG. 2 is a schematic diagram showing a microphone array applied to an online conference.
FIG. 3 is a flow chart of a noise filtering method for a microphone array according to an embodiment of the present invention.
FIG. 4A and FIG. 4B are schematic diagrams showing a desired range and virtual ranges.
FIG. 5 is a flow chart of a method for controlling output audio according to an embodiment of the present invention.
FIG. 6 is a flow chart of a method for reducing the audio energy of the output audio of a virtual range according to an embodiment of the present invention.
FIG. 7(a) to FIG. 7(c) are example diagrams showing the audio energy corresponding to each of the desired range and the virtual ranges.

S101, S103, S105: Steps

Claims

A noise filtering method for a microphone array, performed by a processing device, the method comprising: setting a desired range according to a user instruction; setting a plurality of virtual ranges based on the desired range, wherein the distance between the center of each of the virtual ranges and the center of the desired range is not greater than a preset distance; and controlling the output audio corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges, wherein the desired range and the virtual ranges are spatial ranges, and each virtual range overlaps or is adjacent to the desired range.

The noise filtering method of the microphone array as described in claim 1, wherein controlling the output audio corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges includes: determining whether a first energy value of the desired range is greater than a preset value; when the first energy value is greater than the preset value, performing for each of the virtual ranges: determining whether a second energy value of the virtual range is greater than the preset value; and if the second energy value is greater than the preset value, reducing the audio energy of the output audio of the virtual range; and when the first energy value is not greater than the preset value, not outputting the output audio of the desired range and the virtual ranges.

The noise filtering method of the microphone array as described in claim 2, wherein reducing the audio energy of the output audio of the virtual range includes: subtracting a second spectrum of the output audio of the virtual range from a first spectrum of the output audio of the desired range to generate a third spectrum; and outputting the output audio corresponding to the third spectrum.

The noise filtering method of the microphone array as described in claim 2, further comprising: performing a calculation on an original energy value and a previous energy value of each of the desired range and the virtual ranges through formula (1) to obtain the first energy value and the second energy value, δ(t)' = αδ(t-1) + (1-α)δ(t) Formula (1), wherein δ(t)' is the first energy value or the second energy value, α is the preset value, 0<α<1, δ(t-1) is the previous energy value, and δ(t) is the original energy value, wherein the time point corresponding to the previous energy value is earlier than the time point corresponding to the original energy value.

A noise filtering method for a microphone array as described in claim 1, wherein the microphone array is a ceiling microphone array.

A microphone array system includes: a microphone array for receiving audio signals of a desired range and multiple virtual ranges; and a processing device connected to the microphone array for executing: setting the desired range according to a user instruction; setting the virtual ranges based on the desired range, wherein the distance between the center of each of the virtual ranges and the center of the desired range is not greater than a preset distance; and controlling the output audio signals corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges, wherein the desired range and the virtual range are spatial ranges, and the virtual range overlaps or is adjacent to the desired range.

A microphone array system as described in claim 6, wherein the processing device controlling the output audio corresponding to the desired range and each of the virtual ranges according to the audio energy corresponding to each of the desired range and the virtual ranges includes: determining whether a first energy value of the desired range is greater than a preset value; when the first energy value is greater than the preset value, performing for each of the virtual ranges: determining whether a second energy value of the virtual range is greater than the preset value; and if the second energy value is greater than the preset value, reducing the audio energy of the output audio of the virtual range; and when the first energy value is not greater than the preset value, not outputting the output audio of the desired range and the virtual ranges.

A microphone array system as described in claim 7, wherein the processing device performs the step of reducing the audio energy of the output audio in the virtual range, including: subtracting a second frequency spectrum of the output audio in the virtual range from a first frequency spectrum of the output audio in the desired range to generate a third frequency spectrum; and outputting the output audio corresponding to the third frequency spectrum.

A microphone array system as described in claim 7, wherein the processing device is further used to perform: performing a calculation on an original energy value and a previous energy value of each of the desired range and the virtual ranges through formula (1) to obtain the first energy value and the second energy value, δ(t)' = αδ(t-1) + (1-α)δ(t) Formula (1), wherein δ(t)' is the first energy value or the second energy value, α is the preset value, 0<α<1, δ(t-1) is the previous energy value, and δ(t) is the original energy value, wherein the time point corresponding to the previous energy value is earlier than the time point corresponding to the original energy value.

A microphone array system as described in claim 6, wherein the microphone array is a ceiling microphone array.