HK1155550B - Echo signal processing method and system as well as television - Google Patents
Echo signal processing method and system as well as television Download PDFInfo
- Publication number
- HK1155550B HK1155550B HK11109611.7A HK11109611A HK1155550B HK 1155550 B HK1155550 B HK 1155550B HK 11109611 A HK11109611 A HK 11109611A HK 1155550 B HK1155550 B HK 1155550B
- Authority
- HK
- Hong Kong
- Prior art keywords
- signal
- echo
- echo signal
- test signal
- test
- Prior art date
Links
Description
Technical Field
The invention belongs to the technical field of electroacoustic, and particularly relates to an echo signal cancellation method, an echo signal cancellation system and a television.
Background
The acoustic echo refers to that when the sound played by a loudspeaker is heard by a called party, a voice signal is picked up by a microphone through various ways and is re-transmitted to the original speaker. The paths traveled by the echoes vary, and different delayed echoes, including direct echoes and indirect echoes, are generated.
The direct echo is that the sound played by the loudspeaker directly enters the microphone without any reflection, the time delay of the echo is shortest, and the echo is directly related to the voice energy of a far-end speaker, the distance and the angle between the loudspeaker and the microphone, the playing volume of the loudspeaker and the pick-up sensitivity of the microphone; the indirect echo refers to a set of echoes generated after sound played by a loudspeaker enters a microphone through different paths and is reflected once or for multiple times. Movement or change of any item in the house changes the echo path, and thus the echo is characterized by multipath, time varying. Because the echo has close relation with the volume of the far-end speaker, the gain of the loudspeaker, the size of the space and the furniture arrangement, the factors are uncontrollable. The echo channel can not be simulated in advance by the system, so that the echo is difficult to be eliminated well in the algorithm.
Many scientific researches are made on the processing of echo, and as the technology develops, adaptive filters are widely used. However, because the difference of the use environment of the system is large, such as the sound pressure gain of the loudspeaker, the size of the space and the furniture layout are completely different, the use environment of the system is difficult to accurately estimate by the adaptive algorithm, but the final set of echoes can be influenced by the factors. Therefore, it is difficult to make r1(n) ═ r (n) to cancel the echo effect, since the environmental influence factors are not dynamically taken into consideration.
Disclosure of Invention
The embodiment of the invention aims to provide an echo signal cancellation method, an echo signal cancellation system and a television, and aims to solve the problem that echo is difficult to cancel due to the fact that environmental factors are not taken into consideration in echo processing in the prior art.
The embodiment of the invention is realized in such a way that an echo signal processing method comprises the following steps:
analyzing the echo signal and the test signal in a frequency domain and a time domain to obtain the time delay, phase shift and amplitude change information of the echo signal of the test signal;
correcting the filter coefficient according to the acquired change information, and estimating an echo signal generated by a far-end voice signal according to the corrected filter coefficient;
subtracting the estimated echo signal from a received local voice signal, wherein the local voice signal comprises a voice signal of a local user and an echo signal actually generated by a far-end voice signal in a local area.
An embodiment of the present invention further provides an echo signal processing system, where the system includes:
the calibration unit is used for analyzing the echo signal and the test signal in a frequency domain and a time domain to obtain the time delay, the phase shift and the amplitude change information of the echo signal of the test signal;
the adaptive filter is used for correcting the coefficient of the filter according to the change information acquired by the calibration unit and estimating an echo signal generated by a far-end voice signal according to the corrected coefficient of the filter;
and the signal processing unit is used for subtracting the echo signal estimated by the adaptive filter from the received local voice signal and outputting the local voice signal subtracted with the estimated echo signal to a far end, wherein the local voice signal comprises a voice signal of a local user and an echo signal actually generated by the far end voice signal in the local.
The embodiment of the invention also provides a television, which comprises the echo signal processing system.
In the embodiment of the invention, the time delay, the phase shift and the amplitude change information of the echo signal of the test signal are obtained, the filter coefficient is corrected, the echo signal generated by the far-end voice signal is estimated according to the corrected filter coefficient, the estimated echo signal is subtracted from the received local voice signal, and the local voice signal with the estimated echo signal subtracted is output to the far end.
Drawings
Fig. 1 is a flowchart illustrating an implementation of an echo signal processing method according to an embodiment of the present invention;
fig. 2 is a schematic diagram of a tv chat system provided by an embodiment of the present invention;
fig. 3 is a flowchart of an implementation of an echo signal processing method according to a second embodiment of the present invention;
fig. 4 is a block diagram of an echo signal processing system according to a third embodiment of the present invention;
fig. 5 is a structural diagram of a calibration unit according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
In the embodiment of the invention, the filter coefficient is corrected according to the information such as time delay, dependence and amplitude of the echo signal of the test signal, the voice signal at the far end is accurately estimated, the echo signal in the local voice signal is eliminated, and the echo can be effectively eliminated because the influence of environmental factors on the echo signal is fully considered.
The embodiment of the invention provides an echo signal processing method, which comprises the following steps:
acquiring the time delay, phase shift and amplitude change information of an echo signal of a test signal;
correcting the filter coefficient according to the acquired change information, and estimating an echo signal generated by a far-end voice signal according to the corrected filter coefficient;
subtracting the estimated echo signal from a received local voice signal, wherein the local voice signal comprises a voice signal of a local user and an echo signal actually generated by a far-end voice signal in a local area.
An embodiment of the present invention further provides an echo signal processing system, where the system includes:
the calibration unit is used for acquiring the time delay, the phase shift and the amplitude change information of the echo signal of the test signal;
the adaptive filter is used for correcting the coefficient of the filter according to the change information acquired by the calibration unit and estimating an echo signal generated by a far-end voice signal according to the corrected coefficient of the filter;
and the signal processing unit is used for subtracting the echo signal estimated by the adaptive filter from the received local voice signal and outputting the local voice signal subtracted with the estimated echo signal to a far end, wherein the local voice signal comprises a voice signal of a local user and an echo signal actually generated by the far end voice signal in the local.
The embodiment of the invention also provides a television, which comprises the echo signal processing system.
The following detailed description of the implementation of the present invention is made with reference to specific embodiments:
example one
The embodiment of the present invention estimates the echo signal of the received far-end speech signal according to the coefficient of the filter corrected by the actual environment, so that the echo signal generated by estimation is closer to the actual environmental echo, and the echo can be effectively eliminated, which is described below by a specific embodiment.
Fig. 1 shows a flowchart of an implementation of an echo signal processing method according to an embodiment of the present invention, which is detailed as follows:
an application scenario of the embodiment of the present invention may be an electronic device terminal with a chat function, such as a chat television, and fig. 2 shows a schematic diagram of a television chat system, where the chat television system includes a user a and a user b, and each chat television has an array microphone and a speaker.
In step S101, information on the time delay, phase shift, and amplitude variation of the echo signal of the test signal is acquired.
In the embodiment of the present invention, the form of the test signal may be various, and may be selectively set according to actual needs, and the signal when the actual person speaks is a sound signal with a frequency mainly ranging from 300HZ to 3400HZ, so the test signal may be sampled in this frequency interval, and the sampling interval may be set according to actual needs, for example, the sampled value may be 300HZ, 400HZ, or 500HZ..
In the embodiment of the present invention, the actual environment is usually changed, for example, the furniture in the room is changed, the size of the room is changed, and the change is usually uncertain, and any change of the environmental factors causes the change of the echo channel, and different echo channels cause different information of the change of the time delay, the phase shift and the amplitude of the echo signal.
In step S102, the filter coefficient is modified according to the acquired variation information, and an echo signal generated by the far-end speech signal is estimated according to the modified filter coefficient.
In the embodiment of the present invention, the filter coefficient correction may be performed by real-time automatic correction, timed automatic correction, or self-initiated correction by a user as required, for example, when the user feels that the echo ratio is relatively large, the correction may be initiated by the user as required, and the initiation of the filter coefficient correction function may be directly initiated by touch on the electronic device terminal with the chat function or may be initiated by a remote control device.
In the embodiment of the invention, the dynamic estimation value of the voice signal is obtained by dynamically estimating the echo signal of the received far-end voice signal according to the coefficient of the filter corrected by the actual environment.
In step S103, the estimated echo signal is subtracted from the received local voice signal, and the local voice signal obtained by subtracting the estimated echo signal is output to the far end, where the local voice signal includes the voice signal of the local user and the echo signal actually generated by the far end voice signal in the local area.
In the embodiment of the invention, the echo signal contained in the received local voice signal is cancelled by subtracting the estimated echo signal from the received local voice signal, so that the local user can clearly transmit the voice signal.
In the embodiment of the invention, the time delay, the phase shift and the amplitude change information of the echo signal of the test signal are obtained, the filter coefficient is corrected, the echo signal generated by the far-end voice signal is estimated according to the corrected filter coefficient, the estimated echo signal is subtracted from the received local voice signal, and the local voice signal with the estimated echo signal subtracted is output to the far end.
Example two
Fig. 3 shows a flowchart of an implementation of the echo signal processing method according to the second embodiment of the present invention, which is detailed as follows:
in step S301, when an instruction to trigger calibration of filter coefficients is received, a test signal is output.
In step S302, an echo signal formed by the test signal passing through the echo channel is acquired.
In the embodiment of the present invention, the sound channels are different due to different environmental factors such as the size of the space and the indoor equipment, so that the echo signals formed by the received far-end sound signals through different channels are different according to the different environmental factors.
In step S303, the variation information of the time delay, the phase shift and the amplitude of the echo signal of the test signal is obtained according to the comparison analysis of the echo signal and the test signal.
In the embodiment of the invention, the echo signal and the test signal are analyzed in a frequency domain and a time domain, and the coefficient of the filter is corrected through the obtained information of time delay, phase shift, amplitude and the like of the echo signal, so that the echo signal obtained by performing echo estimation on the speech signal through the coefficient is closer to the echo signal generated in the actual environment.
In step S304, the filter coefficient is modified according to the acquired variation information, and an echo signal generated by the far-end speech signal is estimated according to the modified filter coefficient.
In step S305, the estimated echo signal is subtracted from the received local speech signal, and the local speech signal obtained by subtracting the estimated echo signal is output to the far end, where the local speech signal includes the speech signal of the local user and the echo signal actually generated by the far end speech signal in the local area.
In the embodiment of the invention, when an instruction for triggering the calibration of the filter coefficient is received, the test signal is output, and the change information of the time delay, the phase shift and the amplitude of the echo signal of the test signal is obtained according to the comparison analysis of the echo signal of the test signal and the output test signal, so that the method is convenient and accurate.
EXAMPLE III
Fig. 4 is a block diagram of an echo signal processing system according to a third embodiment of the present invention, and only the parts related to the third embodiment of the present invention are shown for convenience of illustration.
The system can be a software unit, a hardware unit or a software and hardware combination unit which is arranged in an electronic equipment terminal with a chat function, and the electronic equipment terminal can be a television.
The echo signal processing system provided by the embodiment of the present invention includes a calibration unit 41, an adaptive filter 42, and a signal processing unit 43.
In the embodiment of the present invention, the calibration unit 41 obtains the variation information of the time delay, the phase shift, and the amplitude of the echo signal of the test signal.
In the embodiment of the present invention, the form of the test signal may be various, and may be selectively set according to actual needs, and the signal when the actual person speaks is a sound signal with a frequency mainly ranging from 300HZ to 3400HZ, so the test signal may be sampled in this frequency interval, and the sampling interval may be set according to actual needs, for example, the sampled value may be 300HZ, 400HZ, or 500HZ..
In the embodiment of the present invention, the environment of an echo information processing system actually using the echo signal processing system is usually changed, for example, the furniture in a room is changed, the size of the room is changed, and the like, and the change is usually uncertain, and any change of the environmental factor causes a change of an echo channel, and different echo channels cause different information on the changes of the delay, the phase shift and the amplitude of the echo signal of the test signal, so that the information on the changes of the delay, the phase shift and the amplitude of the echo signal of the test signal can be obtained by the calibration unit.
Based on the variation information obtained by the calibration unit 41, the adaptive filter 42 modifies the filter coefficient, and estimates an echo signal generated by the far-end speech signal based on the modified filter coefficient.
The signal processing unit 43 subtracts the echo signal estimated by the adaptive filter 42 from the received local speech signal, and outputs a local speech signal subtracted with the estimated echo signal to the far-end, where the local speech signal includes the speech signal of the local user and the echo signal actually generated by the far-end speech signal locally.
In the embodiment of the invention, the calibration unit acquires the time delay, the phase shift and the amplitude change information of the echo signal of the test signal, corrects the coefficient of the filter, the adaptive filter estimates the echo signal generated by the far-end voice signal according to the corrected coefficient of the filter, the signal processing unit subtracts the estimated echo signal from the received local voice signal and outputs the local voice signal subtracted with the estimated echo signal to the far end, and the echo can be effectively eliminated because the influence of environmental factors on the echo signal is fully considered.
Example four
The calibration unit in the embodiment of the present invention may further include a test signal output module 51, an echo signal obtaining module 52, and a variation information obtaining module 53, please refer to fig. 5.
In the embodiment of the present invention, when receiving an instruction to trigger calibration of the filter coefficients, the test signal output module 51 outputs a test signal.
The echo signal obtaining module 52 obtains an echo signal formed by the echo channel of the test signal output by the test signal output module 51.
The change information acquiring module 53 acquires change information of the time delay, the phase shift, and the amplitude of the echo signal of the test signal according to a comparative analysis of the echo signal acquired by the echo signal acquiring module 52 and the test signal output by the test signal output module 51.
In the embodiment of the invention, the time delay, the phase shift and the amplitude change information of the echo signal of the test signal are obtained, the filter coefficient is corrected, the echo signal generated by the far-end voice signal is estimated according to the corrected filter coefficient, the estimated echo signal is subtracted from the received local voice signal, and the local voice signal with the estimated echo signal subtracted is output to the far end.
It should be noted that, in the above system embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.
In addition, it can be understood by those skilled in the art that all or part of the steps in the method for implementing the embodiments described above can be implemented by instructing the relevant hardware through a program, and the corresponding program can be stored in a computer-readable storage medium, such as a ROM/RAM, a magnetic disk, an optical disk, or the like.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.
Claims (8)
1. A method of echo signal processing, the method comprising the steps of:
analyzing the echo signal and the test signal in a frequency domain and a time domain to obtain the time delay, phase shift and amplitude change information of the echo signal of the test signal;
correcting the filter coefficient according to the acquired change information, and estimating an echo signal generated by a far-end voice signal according to the corrected filter coefficient;
subtracting the estimated echo signal from the received local voice signal, and outputting the local voice signal subtracted by the estimated echo signal to a far end, wherein the local voice signal comprises a voice signal of a local user and an echo signal actually generated by the far end voice signal in a local area.
2. The method of claim 1, wherein the step of obtaining information on the time delay, phase shift, and amplitude variation of the echo signal of the test signal specifically comprises:
when an instruction for triggering the calibration filter coefficient is received, outputting a test signal;
acquiring an echo signal formed by the test signal through an echo channel;
and obtaining the time delay, phase shift and amplitude change information of the echo signal of the test signal according to the comparison analysis of the echo signal and the test signal.
3. The method of claim 1, wherein the test signal is an acoustic signal between 300HZ and 3400 HZ.
4. A method according to any one of claims 1 to 3, wherein the filter coefficients are modified automatically in real time, automatically at a timing or by a user as required.
5. An echo signal processing system, characterized in that the system comprises:
the calibration unit is used for analyzing the echo signal and the test signal in a frequency domain and a time domain to obtain the time delay, the phase shift and the amplitude change information of the echo signal of the test signal;
the adaptive filter is used for correcting the coefficient of the filter according to the change information acquired by the calibration unit and estimating an echo signal generated by a far-end voice signal according to the corrected coefficient of the filter;
and the signal processing unit is used for subtracting the echo signal estimated by the adaptive filter from the received local voice signal and outputting the local voice signal subtracted with the estimated echo signal to a far end, wherein the local voice signal comprises a voice signal of a local user and an echo signal actually generated by the far end voice signal in the local.
6. The system of claim 5, wherein the calibration unit comprises:
the test signal output module is used for outputting a test signal when receiving an instruction for triggering the calibration filter coefficient;
the echo signal acquisition module is used for acquiring an echo signal formed by the test signal output module through an echo channel;
and the change information acquisition module is used for acquiring the change information of the time delay, the phase shift and the amplitude of the echo signal of the test signal according to the comparison analysis of the echo signal acquired by the echo signal acquisition module and the test signal output by the test signal output module.
7. The system of claim 5, wherein the test signal is an acoustic signal between 300Hz and 3400 Hz.
8. A television set, characterized in that it comprises an echo signal processing system according to any one of claims 5 to 7.
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2010105249633A CN102044253B (en) | 2010-10-29 | 2010-10-29 | Echo signal processing method, system and television |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1155550A1 HK1155550A1 (en) | 2012-05-18 |
| HK1155550B true HK1155550B (en) | 2013-01-18 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN102044253B (en) | Echo signal processing method, system and television | |
| US11297178B2 (en) | Method, apparatus, and computer-readable media utilizing residual echo estimate information to derive secondary echo reduction parameters | |
| US9111543B2 (en) | Processing signals | |
| US9508358B2 (en) | Noise reduction system with remote noise detector | |
| US8355511B2 (en) | System and method for envelope-based acoustic echo cancellation | |
| US20220301577A1 (en) | Echo cancellation method and apparatus | |
| US8842851B2 (en) | Audio source localization system and method | |
| US20210067854A1 (en) | Capturing and processing sound signals | |
| WO2008041878A2 (en) | System and procedure of hands free speech communication using a microphone array | |
| US20160309042A1 (en) | Echo cancellation | |
| CN108630219B (en) | Processing system, method and device for echo suppression audio signal feature tracking | |
| JPWO2009051197A1 (en) | Echo suppression method and apparatus | |
| US20140010382A1 (en) | Audio signal processing system and echo signal removing method thereof | |
| US11380312B1 (en) | Residual echo suppression for keyword detection | |
| US10937418B1 (en) | Echo cancellation by acoustic playback estimation | |
| US20170310360A1 (en) | Echo removal device, echo removal method, and non-transitory storage medium | |
| US11380313B2 (en) | Voice-based control in a media system or other voice-controllable sound generating system | |
| US11238879B2 (en) | Acoustic delay measurement using adaptive filter with programmable delay buffer | |
| CN106297816B (en) | Echo cancellation nonlinear processing method and device and electronic equipment | |
| WO2014115290A1 (en) | Signal processing device/acoustic processing system | |
| HK1155550B (en) | Echo signal processing method and system as well as television | |
| CN119091902A (en) | Echo cancellation method, electronic device, computer program product and storage medium | |
| JP4622713B2 (en) | A loudspeaker sound collection and communication device with an echo canceller | |
| JP6314608B2 (en) | Echo suppression device, echo suppression program, and echo suppression method | |
| Bispo et al. | Acoustic echo cancellation based on cesptral analysis |