Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The inventor of this application discovers through long-term research that conference audio amplifier 100 in the existing market mostly all is single product independent work, generally all places in little meeting room moreover, and to bigger meeting room, the pickup distance of product is very limited, can greatly reduce conference audio amplifier's result of use. It is a common practice in the industry to arrange a plurality of microphones in a conference room, one for each position, or to replace a more sensitive and powerful sound pick-up unit for a conference sound box. However, this method results in large power consumption, high cost, and limited improvement of sound pickup effect. In order to improve the above technical problem, the present application proposes the following embodiments.
As shown in fig. 1, the sound box system of the present application includes: at least two sound boxes 100 connected to each other. At least two sound boxes 100 may be directly connected or indirectly connected. The connection may be a wired connection or a wireless connection, and is not limited to the wireless connection shown in fig. 1.
The at least two sound boxes 100 may be respectively used to obtain corresponding voice signals. The sound box 100 may pick up the voice of the external environment by itself to obtain the voice signal, and may also obtain the voice signal from other devices, such as other sound boxes 100. The at least two sound boxes 100 may be divided into at least two stages. Wherein, the sound box 100 of the first stage can be used for connecting the external device 20. The sound box 100 of the first stage may also be configured to receive the voice signal transmitted by the sound box 100 of the next stage (i.e., the sound box 100 of the second stage), compare the signal quality of the voice signal obtained by the voice signal with the signal quality of the voice signal transmitted by the sound box 100 of the next stage, and transmit the best signal quality to the external device 20. That is, the sound box 100 of the first stage obtains its own voice signal, and receives the voice signal of the sound box 100 of the second stage, and compares the signal quality of the voice signal obtained by itself and the voice signal transmitted by the sound box 100 of the second stage.
The Signal quality of a Speech Signal can be evaluated or compared by at least one of Signal-to-Noise Ratio (SNR), Segment Signal-to-Noise Ratio (SegSNR), pesq (temporal Evaluation of Speech quality), Log Likelihood Ratio Measure (LLR), Log Spectral Distance (LSD), Short-Time Objective Intelligibility (STOI), and Weighted Spectral tilt Measure (WSS). The speaker 100 of the first stage transmits the voice signal obtained by the speaker 100 of the first stage and the voice signal transmitted by the speaker 100 of the second stage to the external device 20 with the best signal quality.
The external device 20 is, for example, a notebook computer, a tablet computer, a mobile phone, a wearable device, a server, a desktop computer, or the like. Of course, two or more external devices 20 may be connected to the first stage loudspeaker box 100 at the same time. The external device 20 communicates with a remote device through a wired or wireless network, and may perform a teleconference or the like.
As shown in fig. 2, in an exemplary application scenario, the sound box system 10 may be placed in the same meeting room scenario, at least two sound boxes 100 of the sound box system 10 can respectively pick up a voice signal in the meeting room, the first-stage sound box 100 is connected to an external device 20, and the external device 20 is remote from a remote device. The first-stage sound box 100 can receive the voice signal transmitted by the next-stage sound box 100, and finally, the voice signal quality comparison is performed at the first-stage sound box 100, and the sound box with the best signal quality is transmitted to the external device 20, so that sound pickup of a meeting room scene is completed.
For every seat all corresponds and arranges a microphone among the prior art, in order to carry out the pickup to the speaker on this seat alone, sound box system 10 of this application can carry out the pickup to same meeting room scene, and then send the best one of signal quality among the speech signal who picks up for external equipment 20, can adapt to great meeting scene well, need not all to arrange a microphone at every seat, and can promote the pickup effect, guarantee to transmit the speech signal quality of external equipment 20 best, and then can transmit far-end equipment through modes such as network, make the pronunciation that far-end equipment broadcast clear, so can effectively promote the conversation effect. For the power of the sound pickup element of the conference sound box 100 that is increased alone among the prior art, this application constitutes sound box system 10 through at least two audio amplifier 100 and can carry out the pickup to same meeting room scene simultaneously, can enlarge sound box system 10's pickup range, and after each audio amplifier 100 picked up separately, compare the signal quality of voice signal through the audio amplifier 100 of first order, and then send the one that signal quality is the best for external equipment 20, so that external equipment 20 can obtain the voice signal that signal quality is the best, so promote the sound pickup effect and the meeting effect of great meeting room scene effectively, the consumption is lower, practice thrift the cost.
As shown in fig. 2, the sound box 100 of the sound box system 10 can pick up the voice of the meeting room scene, and can also play the voice in the meeting room scene, so as to implement the voice uplink and downlink, thereby facilitating the teleconference and the like. Optionally, the sound box 100 at the first stage is configured to receive an audio signal sent by the external device 20 for playing, and further forward the audio signal sent by the external device 20 to the sound box 100 at the next stage.
The number of enclosures 100 of the enclosure system 10 may be two or more. As shown in fig. 2, the sound box system 10 is formed by two sound boxes 100 arranged in cascade. Of course, the sound box system 10 can be expanded in number based on the illustration in fig. 2, for example, to three, five, ten, etc., and specifically, different numbers of sound boxes 100 can be expanded according to different conference rooms. The enclosures 100 are interconnected to form the enclosure system 10 of the present embodiment. There are various connection modes of the sound boxes 100 in the sound box system 10 of the present embodiment, and two of them are exemplified as follows.
The first embodiment: as shown in fig. 4, the number of the sound boxes 100 of the sound box system 10 may be three or more, and all the sound boxes 100 are connected in a cascade manner. That is, the speakers 100 of the first stage are sequentially connected in stages from the speaker 100 of the first stage to the speaker 100 of the last stage. The first stage of sound box 100 is connected with the external device 20, the second stage of sound box 100 is connected with the first stage of sound box 100, the third stage of sound box 100 is connected with the second stage of sound box 100, the fourth stage of sound box 100 is connected with the third stage of sound box 100, … … and so on.
The sound box 100 of the first stage is configured to receive the voice signal transmitted by the sound box 100 of the next stage (i.e., the sound box 100 of the second stage), compare the signal quality of the voice signal obtained by the voice signal with the signal quality of the voice signal transmitted by the sound box 100 of the next stage, and transmit the best signal quality to the external device 20.
The loudspeaker boxes 100 of the subsequent stage (the loudspeaker box 100 of the subsequent stage is the loudspeaker box 100 of the first stage) are used for transmitting the voice signal obtained by the loudspeaker box 100 of the previous stage to the loudspeaker box 100 of the previous stage. If the sound box 100 of the subsequent stage receives the voice signal sent by the sound box 100 of the next stage, the sound box 100 of the subsequent stage is further configured to compare the obtained voice signal with the signal quality of the voice signal sent by the sound box 100 of the next stage, and further send the sound box 100 of the previous stage with the best signal quality.
For example, the enclosure system 10 includes three enclosures 100, and the three enclosures 100 are arranged in a cascade. Since the fourth-stage loudspeaker box 100 is not present for a while, that is, the third-stage loudspeaker box 100 is not present with the next-stage loudspeaker box 100, the voice signal picked up by the third-stage loudspeaker box 100 itself is transmitted to the second-stage loudspeaker box 100. The second stage loudspeaker box 100 is used for comparing the signal quality of the voice signal picked up by itself with the voice signal sent by the third stage loudspeaker box 100, and sending the best one of the two signals to the first stage loudspeaker box 100. The voice signal transmitted by the speaker 100 of the second stage may be obtained by itself or may be obtained from the speaker 100 of the third stage according to the comparison result of the signal quality. After the first-stage sound box 100 receives the voice signal sent by the second-stage sound box 100, the first-stage sound box 100 is further configured to compare the voice signal obtained by the first-stage sound box 100 with the voice signal sent by the second-stage sound box 100, and further send one of the two sound boxes with the best signal quality to the external device 20. Thus, the voice signals can be compared step by step, so that the voice signal sent by the first-stage loudspeaker box 100 has the best signal quality in the whole loudspeaker box system 10.
Therefore, in the first embodiment, except for the last stage of the speakers 100, the speakers 100 of the other stages of the speakers 100 arranged in cascade need to receive the voice signal transmitted by the next stage of the speakers 100 and compare the signal quality with the voice signal obtained by the speakers 100, so as to transmit the voice signal with the best signal quality to the previous stage of the speakers 100 (for the other speakers 100 except for the first stage of the speakers 100 and the last stage of the speakers 100) or the external device 20 (for the first stage of the speakers 100).
The sound box system 10 of the present embodiment can be expanded by a cascade method, and the sound pickup range is expanded by arranging a plurality of cascaded sound boxes 100. Through setting up a plurality of audio amplifier 100 cascade connection settings, carry out comparison step by step and transmission step by step to speech signal, and then make the audio amplifier 100 at the first order obtain the best speech signal of signal quality among each audio amplifier 100 of whole audio amplifier system 10 after the signal quality of speech signal carries out the comparison, send for external equipment 20, so make whole audio amplifier system 10 can transmit its and clear speech signal of high quality to external equipment 20, and then promote pickup quality and meeting effect.
The audio upstream process of the loudspeaker system 10 of the first embodiment is described above, and for the downstream process of the audio system, it is roughly: the first stage loudspeaker box 100 acquires the audio signal from the external device 20, and the first stage loudspeaker box 100 receives the audio signal and plays the audio signal, and sends the audio signal to the next stage loudspeaker box 100. The sound boxes 100 of the subsequent stage receive the audio signal from the sound box 100 of the previous stage and then play the audio signal, and send the audio signal to the sound box 100 of the next stage until the sound box 100 of the last stage receives the audio signal sent by the sound box 100 of the previous stage and then plays the audio signal. The audio signal sent by the external device 20 is transferred step by step, and then the audio signal can be expanded to a larger spatial range, so that the audio signal can adapt to a larger conference room scene.
Second embodiment: as shown in fig. 5, the number of the sound boxes 100 of the sound box system 10 may be three or more. The sound box 100 of the first stage is connected to the other sound boxes 100, so that the other sound boxes 100 are all used as the sound boxes 100 of the next stage and send the acquired voice signals to the sound box 100 of the first stage. That is, except for the first-stage sound box 100, the other sound boxes 100 are the second-stage sound boxes 100, each second-stage sound box 100 may be connected to the first-stage sound box 100 in a parallel connection manner, and each second-stage sound box 100 may send the obtained voice signal to the first-stage sound box 100.
The first stage loudspeaker boxes 100 may be configured to compare the signal qualities of the respective acquired voice signals of all the loudspeaker boxes 100 with each other, and then send the best signal quality to the external device 20. For example, the sound box system 10 includes four sound boxes 100, the sound box 100 of the second stage, the sound box 100 of the third stage, and the sound box 100 of the fourth stage respectively transmit the respective obtained voice signals to the sound box 100 of the first stage, the sound box 100 of the first stage performs signal quality comparison between the obtained voice signals and the voice signals of the other three sound boxes 100, and selects the voice signal with the best signal quality from the four sound boxes 100 to transmit to the external device 20.
The above describes the audio upstream process of the audio system of the second embodiment, and for the downstream process of the audio system, it is roughly: the first-stage sound box 100 acquires an audio signal from the external device 20, and the first-stage sound box 100 plays the audio signal after receiving the audio signal, and sends the audio signal to the next-stage sound box 100 respectively. The speakers 100 of the second stage each receive the audio signal and then play it. The audio signal sent by the external device 20 is played in the first-stage sound box 100 and is transmitted to the second-stage sound boxes 100 in parallel, so that the audio signal can be expanded to a larger spatial range, and a larger conference room scene can be adapted.
The third embodiment: as shown in fig. 6, the number of the at least two sound boxes 100 of the sound box system 10 may be three or more. This embodiment is combined with the first and second embodiments, and both the cascade and parallel modes exist. Specifically, in the sound boxes 100 of the subsequent stages, the sound boxes 100 of each stage include at least one sound box 100. In the subsequent stage of the sound boxes 100, the sound boxes 100 of the same stage are arranged in parallel and are respectively connected with the corresponding sound box 100 of the previous stage. For example, the enclosures 100 of the second level include at least two enclosures 100, the enclosures 100 of the third level include at least two enclosures 100, the enclosures 100 of the first level are respectively connected to at least two enclosures 100 of the second level, each enclosure 100 of the second level is connected to at least one enclosure 100 of the third level, and so on.
The enclosure system 10, as shown in fig. 6, may include seven enclosures 100. The number of the sound boxes 100 of the first stage is one, the number of the sound boxes 100 of the second stage is two, and the number of the sound boxes 100 of the third stage is four. The first stage of sound box 100 is connected to two second stage of sound box 100, and the second stage of sound box 100 can be connected to two third stage of sound box 100. Of course, one of the second stage enclosures 100 is connected to one of the third stage enclosures 100, and the other second stage enclosure 100 is connected to three third stage enclosures 100.
Except for the last stage of the sound boxes 100, each stage of the sound boxes 100 may be configured to receive the voice signal transmitted by the next stage of the sound box 100, compare the obtained voice signal with the voice signal transmitted by the next stage of the sound box 100 in terms of signal quality, and correspondingly transmit the best signal quality to the previous stage of the sound box 100 (corresponding to the sound boxes 100 except for the first stage of the sound boxes 100 and the last stage of the sound boxes 100) or the external device 20 (corresponding to the first stage of the sound boxes 100). As shown in fig. 6, each second-stage loudspeaker box 100 needs to compare the signal quality of the voice signal acquired by itself and the voice signals transmitted by the two third-stage loudspeaker boxes 100, that is, needs to compare the signal quality of the three voice signals, and then transmits the best signal quality to the first-stage loudspeaker box 100.
For the downlink process of the audio system, the sound box 100 at the first stage obtains the audio signal from the external device 20 to play, and transmits the audio signal to the sound box 100 at the next stage. Subsequently, the sound boxes 100 of each stage receive the audio signal to play, and respectively transmit the audio signal to the sound boxes 100 of the next stage corresponding to the sound box.
Based on the detailed description of the first to third embodiments, each loudspeaker 100 can obtain the corresponding voice signal by itself. As shown in fig. 3 to fig. 6, in the present embodiment, after each sound box 100 of the sound box system 10 picks up the voice of the environment through its own sound pickup circuit 140 (see the detailed description below), the picked-up voice signal may be subjected to corresponding audio processing, so as to obtain a corresponding voice signal.
Specifically, the sound box 100 of the first stage may be configured to perform corresponding audio processing on the voice picked up by the sound box by using the audio signal sent by the external device 20 to obtain a corresponding voice signal. The sound boxes 100 of the subsequent stage are configured to perform corresponding audio processing on the picked-up voice by using the audio signal forwarded by the sound box 100 of the previous stage to obtain a corresponding voice signal.
In an exemplary scenario, the sound box system 10 operates in a meeting room scenario, when audio signals transmitted by the external device 20 are played between the sound boxes 100, sound pickup of each sound box 100 may be affected to a certain extent, and the audio played by the sound boxes 100 may generate reverberation, echo, noise and the like in the meeting room scenario, so that the sound boxes 100 of this embodiment need to perform corresponding audio processing. The audio processing may include at least one of audio processing including noise cancellation processing, echo cancellation processing, and reverberation cancellation processing. Specifically, the audio processing may be performed by an existing noise reduction algorithm, echo cancellation algorithm, and reverberation cancellation algorithm, and specific algorithm procedures related to the noise reduction algorithm, echo cancellation algorithm, and reverberation cancellation algorithm are not described here.
The sound box 100 of the first stage may use the audio signal sent by the external device 20 as a reference signal, and further perform the above audio processing on the voice picked up by the sound box. The sound box 100 of the subsequent stage may use the audio signal forwarded by the sound box 100 of the previous stage as a reference signal, and perform the audio processing on the voice picked up by the sound box 100 to obtain a corresponding voice signal.
For the above description of the connection between the sound boxes 100, the uplink and downlink of the audio signals, the signal quality comparison, the audio processing, and the like, the present embodiment may be implemented based on the following exemplary sound boxes 100.
As shown in fig. 3 to 6, each sound box 100 may include a connection circuit 110, an audio processing circuit 120, a power amplifier circuit 130, a speaker 150, and a sound pickup circuit 140. In one sound box 100, the connection circuit 110, the audio processing circuit 120, the power amplifier circuit 130, the speaker 150 and the sound pickup circuit 140 are corresponding. For example, the audio processing circuit 120 is connected to the corresponding power amplifier circuit 130, which means that the audio processing circuit 120 is connected to the power amplifier circuit 130 belonging to the same sound box 100.
The connection circuit 110 may be connected to the audio processing circuit 120 and the power amplifier circuit 130, respectively. The audio processing circuit 120 may be connected to the power amplifier circuit 130 and the sound pickup circuit 140, respectively.
The connection circuit 110 may be used to implement connections between the sound boxes 100 and the external device 20. Alternatively, the connection circuitry 110 of each speaker 100 can support a bluetooth connection. For example, the connection circuit 110 supports the HCI protocol, the L2CAP protocol, the SDP protocol, the RFOCMM protocol, the irOBEX protocol, the BNEP protocol, the AVDTP protocol, the AVCTP protocol, and the like.
The connection circuit 110 of the first stage of the sound box 100 may be used to connect the external device 20. The first stage speaker 100 may be connected to the external device 20 through the connection circuit 110 by wire or wirelessly. The sound box 100 of the subsequent stage may be wirelessly connected to the connection circuit 110 of the sound box 100 of the previous stage through the connection circuit 110.
The inventor of the present application finds in research that there are various connection modes between the sound boxes 100, for example, the connection modes may be wired or wireless, and the conventional practice for the wireless connection mode is to use private protocol modes such as 2.4G for connection, and the wired connection needs wiring, which is inconvenient and not beautiful, and the adoption of the private protocols such as 2.4G requires additional modules on the sound boxes 100, which results in high power consumption, high cost, and complex design. This embodiment utilizes the bluetooth connected mode to make and realize wireless pairing and connection between the audio amplifier 100, and it is more convenient to use.
Alternatively, the speaker 100 of the subsequent stage may perform bluetooth connection with the connection circuit 110 of the speaker 100 of the previous stage through the connection circuit 110 based on the L2CAP protocol. This embodiment utilizes the L2CAP agreement to realize being connected between the audio amplifier 100, can need not additionally to increase other modules, can realize effectively the expansion of audio amplifier system 10 pickup range, and then can effectively obtain clear speech signal, promotes the conference conversation effect.
Optionally, the speaker 100 of the first stage is configured to perform bluetooth connection with the external device 20 through the connection circuit 110 based on a bluetooth protocol different from the L2CAP protocol. In this way, the external device 20 and the other sound boxes 100 can be independently connected without mutual interference. For example, the speaker 100 of the first stage may perform a bluetooth connection with the external device 20 through the HCI protocol.
Optionally, the connection circuitry 110 also supports a wired connection, such as a USB wired connection. For example, the connection circuit 110 is a bluetooth chip, and the bluetooth chip may include a bluetooth interface circuit for performing bluetooth wireless connection and a USB interface circuit for performing USB wired connection. The first stage speaker 100 may be bluetooth connected to the external device 20 through a bluetooth interface circuit, or may be wired to the external device 20 through a USB interface circuit.
The audio processing circuit 120 may be used to implement corresponding audio processing and other audio operations. The audio processing circuit 120 is, for example, a DSP (digital signal processor).
The power amplifier circuit 130 may be used to power amplify the audio signal. The power amplifier circuit 130 is, for example, an audio power amplifier chip. Pickup circuit 140 may be used to pick up speech. The sound pickup circuit 140 is, for example, a microphone. The speaker 150 may be connected to the power amplifier circuit 130, and is configured to play the audio signal after power amplification by the power amplifier circuit 130.
The power amplifier circuit 130 of the first stage loudspeaker box 100 is used for performing power amplification on the audio signal sent by the external device 20. The speaker 150 of the first stage loudspeaker box 100 can play the audio signal which is power amplified by the power amplifier circuit 130.
The power amplifier circuit 130 of the subsequent stage of the sound box 100 is configured to amplify the power of the audio signal forwarded by the previous stage of the sound box 100. The speaker 150 of the subsequent stage of the sound box 100 can play the audio signal which is power amplified by the power amplifier circuit 130.
The audio processing circuit 120 of each sound box 100 is configured to perform corresponding audio processing on the voice picked up by the sound pickup circuit 140 corresponding to the sound box by using the audio signal subjected to power amplification by the corresponding power amplification module, so as to obtain a corresponding voice signal. The audio processing circuit 120 of each speaker 100 can be used to compare the signal quality of the speech signal in addition to performing corresponding audio processing.
Optionally, the audio processing circuit 120 of the sound box 100 of the subsequent stage can perform audio processing by using a noise reduction algorithm, a dereverberation algorithm, and the like in combination with the voice picked up by the sound pickup module corresponding to the audio processing circuit, the sound played by the speaker 150 corresponding to the audio processing circuit, and the audio signal received from the sound box 100 of the previous stage and amplified by the power amplification module corresponding to the audio processing circuit, so as to obtain a clear voice signal.
As shown in fig. 4, for the first embodiment, the audio processing circuit 120 of the loudspeaker box 100 at the first stage can be used to compare the signal quality of the voice signal obtained by the audio processing circuit 120 and the voice signal transmitted by the loudspeaker box 100 at the next stage. The audio processing circuit 120 of the speaker 100 of the subsequent stage, except for the last stage of the speaker 100, may be configured to compare the voice signal obtained by it with the voice signal transmitted by the speaker 100 of the next stage. The audio processing circuit 120 of the last stage of the sound box 100 performs corresponding audio processing on the voice picked up by the sound pickup circuit 140 by using the audio signal power-amplified by the power amplifier circuit 130 to obtain a corresponding voice signal, and sends the voice signal to the sound box 100 of the previous stage through the connection circuit 110.
As shown in fig. 5, in the second embodiment, the audio processing circuit 120 of the first stage loudspeaker box 100 is configured to compare signal qualities of the voice signal obtained by the first stage loudspeaker box 100 and the voice signal transmitted by the next stage loudspeaker box 100, further select one of the voice signals of the first stage loudspeaker box 100 and the next stage loudspeaker boxes 100 with the best signal quality, and transmit the selected one to the external device 20. The audio processing circuit 120 of each next-stage sound box 100 (i.e., each second-stage sound box 100) performs corresponding audio processing on the voice picked up by the sound pickup circuit 140 corresponding thereto by using the audio signal power-amplified by the corresponding power amplifier circuit 130 to obtain a corresponding voice signal, and sends the voice signal to the previous-stage sound box 100 through the corresponding connection circuit 110.
For the third embodiment described above, reference may be made specifically to the description relating to the first and second embodiments.
In summary, in the present embodiment, the sound box system 10 composed of at least two sound boxes 100 can simultaneously pick up sound in the same meeting room scene, so that the sound pick-up range of the sound box system 10 can be expanded, and after each sound box 100 picks up sound, the signal quality of the voice signal is compared by the sound box 100 of the first stage, and then the best signal quality is sent to the external device 20, so that the external device 20 can obtain the voice signal with the best signal quality, so that the sound pick-up effect and the meeting effect in a larger meeting room scene are effectively improved, the power consumption is low, and the cost is saved.
Of course, the sound box system 10 of the present embodiment can be applied not only to a meeting room scene, but also to other scenes.
The above description is only an example of the present application and is not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings, or which are directly or indirectly applied to other related technical fields, are intended to be included within the scope of the present application.