CN111863004A - Sound signal processing method, system, electronic device and storage medium - Google Patents
Sound signal processing method, system, electronic device and storage medium Download PDFInfo
- Publication number
- CN111863004A CN111863004A CN202010744551.4A CN202010744551A CN111863004A CN 111863004 A CN111863004 A CN 111863004A CN 202010744551 A CN202010744551 A CN 202010744551A CN 111863004 A CN111863004 A CN 111863004A
- Authority
- CN
- China
- Prior art keywords
- sound signal
- external
- signal
- voice
- played
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 156
- 238000003672 processing method Methods 0.000 title abstract description 15
- 238000012545 processing Methods 0.000 claims abstract description 30
- 238000000034 method Methods 0.000 claims abstract description 21
- 238000012544 monitoring process Methods 0.000 claims abstract description 16
- 238000005070 sampling Methods 0.000 claims description 12
- 238000004590 computer program Methods 0.000 claims description 11
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 230000008569 process Effects 0.000 abstract description 9
- 238000004891 communication Methods 0.000 description 13
- 230000001965 increasing effect Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 4
- 230000001960 triggered effect Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000035515 penetration Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000003313 weakening effect Effects 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000003028 elevating effect Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Telephone Function (AREA)
Abstract
Description
技术领域technical field
本发明涉及声音信号处理技术领域,尤其涉及一种声音信号的处理方法、系统、电子设备及存储介质。The present invention relates to the technical field of sound signal processing, and in particular, to a sound signal processing method, system, electronic device and storage medium.
背景技术Background technique
当前在声音信号的播放过程中(例如在语音通话过程中,又例如在音乐播放过程中等),当播放环境由安静转向嘈杂时,为了能够较为清晰地收听播放内容,通常的做法包括增大播放音量、佩戴耳机、转至安静的环境等等,而无法自动增强播放内容的清晰度。Currently, during the playback of sound signals (for example, during a voice call, for example, during music playback, etc.), when the playback environment changes from quiet to noisy, in order to listen to the playback content more clearly, the usual practice includes increasing the playback volume. volume, wearing headphones, moving to a quiet environment, etc. without automatically enhancing the clarity of the playback content.
发明内容SUMMARY OF THE INVENTION
本发明要解决的技术问题是为了克服现有技术中无法在播放环境由安静转向嘈杂时自动增强播放内容清晰度的缺陷,提供一种声音信号的处理方法、系统、电子设备及存储介质。The technical problem to be solved by the present invention is to provide a sound signal processing method, system, electronic device and storage medium in order to overcome the defect in the prior art that the playback content cannot be automatically enhanced when the playback environment changes from quiet to noisy.
本发明是通过下述技术方案来解决上述技术问题:The present invention solves the above-mentioned technical problems through the following technical solutions:
一种声音信号的处理方法,包括:A method for processing a sound signal, comprising:
监听外部声音信号;monitor external sound signals;
判断所述外部声音信号是否满足触发条件;Judging whether the external sound signal satisfies the triggering condition;
若是,则抬升待播放声音信号的频率;If so, increase the frequency of the sound signal to be played;
播放抬升频率后的所述待播放声音信号。The to-be-played sound signal after the frequency is raised is played.
较佳地,所述外部声音信号包括外部噪声信号,所述判断所述外部声音信号是否满足触发条件的步骤包括:Preferably, the external sound signal includes an external noise signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:
识别所述外部声音信号中的外部噪声信号;identifying an external noise signal in the external sound signal;
根据所述外部噪声信号判断外部噪声的音量是否大于第一阈值;Determine whether the volume of the external noise is greater than a first threshold according to the external noise signal;
若是,则确定所述外部声音信号满足触发条件;If yes, then determine that the external sound signal satisfies the trigger condition;
和/或,所述外部声音信号包括外部语音信号,所述判断所述外部声音信号是否满足触发条件的步骤包括:And/or, the external sound signal includes an external voice signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:
识别所述外部声音信号中的外部语音信号;Identifying an external voice signal in the external sound signal;
根据所述外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;According to the external voice signal, determine whether the difference between the volume of the external voice at the current sampling moment and the volume at the previous sampling moment is greater than a second threshold;
若是,则确定所述外部声音信号满足触发条件。If so, it is determined that the external sound signal satisfies the trigger condition.
较佳地,所述外部声音信号包括外部语音信号,所述判断所述外部声音信号是否满足触发条件的步骤包括:Preferably, the external sound signal includes an external voice signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:
识别所述外部声音信号中的外部语音信号;Identifying an external voice signal in the external sound signal;
将所述外部语音信号转换成文字;converting the external speech signal into text;
判断所述文字是否包括触发关键字;determine whether the text includes a trigger keyword;
若是,则确定所述外部声音信号满足触发条件。If so, it is determined that the external sound signal satisfies the trigger condition.
较佳地,所述监听外部声音信号的步骤具体包括在语音通话过程中监听外部声音信号;Preferably, the step of monitoring the external sound signal specifically includes monitoring the external sound signal during the voice call;
所述待播放声音信号包括语音通话信号。The to-be-played sound signal includes a voice call signal.
一种声音信号的处理系统,包括:A sound signal processing system, comprising:
监听模块,用于监听外部声音信号;Monitoring module for monitoring external sound signals;
判断模块,用于判断所述外部声音信号是否满足触发条件;a judgment module for judging whether the external sound signal satisfies a trigger condition;
若是,则调用抬升模块,所述抬升模块用于抬升待播放声音信号的频率;If so, call a lift module, and the lift module is used to lift the frequency of the sound signal to be played;
播放模块,用于播放抬升频率后的所述待播放声音信号。The playing module is used for playing the sound signal to be played after the frequency is raised.
较佳地,所述外部声音信号包括外部噪声信号,所述判断模块包括:Preferably, the external sound signal includes an external noise signal, and the judgment module includes:
第一识别单元,用于识别所述外部声音信号中的外部噪声信号;a first identification unit, used to identify the external noise signal in the external sound signal;
第一判断单元,用于根据所述外部噪声信号判断外部噪声的音量是否大于第一阈值;a first judging unit, configured to judge whether the volume of the external noise is greater than a first threshold according to the external noise signal;
若是,则调用确定单元,所述确定单元用于确定所述外部声音信号满足触发条件;If so, call a determination unit, and the determination unit is used to determine that the external sound signal satisfies the trigger condition;
和/或,所述外部声音信号包括外部语音信号,所述判断模块包括:And/or, the external sound signal includes an external voice signal, and the judgment module includes:
第二识别单元,用于识别所述外部声音信号中的外部语音信号;a second identification unit, configured to identify an external voice signal in the external sound signal;
第二判断单元,用于根据所述外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;A second judging unit for judging whether the difference between the volume of the external voice at the current sampling moment and the volume at the last sampling moment is greater than a second threshold according to the external voice signal;
若是,则调用确定单元,所述确定单元用于确定所述外部声音信号满足触发条件。If so, the determining unit is invoked, and the determining unit is configured to determine that the external sound signal satisfies the triggering condition.
较佳地,所述外部声音信号包括外部语音信号,所述判断模块包括:Preferably, the external sound signal includes an external voice signal, and the judgment module includes:
第二识别单元,用于识别所述外部声音信号中的外部语音信号;a second identification unit, configured to identify an external voice signal in the external sound signal;
转换单元,用于将所述外部语音信号转换成文字;a conversion unit for converting the external voice signal into text;
第三判断单元,用于判断所述文字是否包括触发关键字;a third judging unit for judging whether the text includes a trigger keyword;
若是,则调用确定单元,所述确定单元用于确定所述外部声音信号满足触发条件。If so, the determining unit is invoked, and the determining unit is configured to determine that the external sound signal satisfies the triggering condition.
较佳地,所述监听模块具体用于在语音通话过程中监听外部声音信号;Preferably, the monitoring module is specifically used to monitor external sound signals during a voice call;
所述待播放声音信号包括语音通话信号;The to-be-played sound signal includes a voice call signal;
所述播放模块包括听筒和/或扬声器。The playback module includes an earpiece and/or a speaker.
一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述任一种声音信号的处理方法。An electronic device includes a memory, a processor, and a computer program stored in the memory and running on the processor. The processor implements any of the above sound signal processing methods when the processor executes the computer program.
一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现上述任一种声音信号的处理方法的步骤。A computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of any of the above sound signal processing methods.
本发明的积极进步效果在于:本发明可以在声音信号的播放过程中实时监听外部声音信号,并在外部声音信号满足触发条件时,例如,播放环境由安静转向嘈杂时,先自动抬升待播放声音信号的频率,再行播放待播放声音信号,使得用户能够更加清晰地收听到所播放声音信号的内容,自动增强了所播放内容的清晰度。The positive improvement effect of the present invention is: the present invention can monitor the external sound signal in real time during the playback of the sound signal, and when the external sound signal meets the trigger condition, for example, when the playback environment changes from quiet to noisy, the sound to be played is automatically raised first The frequency of the signal is played, and the sound signal to be played is played again, so that the user can listen to the content of the played sound signal more clearly, and the clarity of the played content is automatically enhanced.
附图说明Description of drawings
图1为根据本发明实施例1的声音信号的处理方法的流程图。FIG. 1 is a flowchart of a method for processing a sound signal according to Embodiment 1 of the present invention.
图2为根据本发明实施例1的声音信号的处理方法的具体流程图。FIG. 2 is a specific flowchart of a method for processing a sound signal according to Embodiment 1 of the present invention.
图3为根据本发明实施例1的声音信号的处理方法中频率抬升的示意图。FIG. 3 is a schematic diagram of frequency boosting in the sound signal processing method according to Embodiment 1 of the present invention.
图4为根据本发明实施例2的声音信号的处理系统的模块示意图。FIG. 4 is a schematic block diagram of a sound signal processing system according to Embodiment 2 of the present invention.
图5为根据本发明实施例3的电子设备的结构示意图。FIG. 5 is a schematic structural diagram of an electronic device according to Embodiment 3 of the present invention.
具体实施方式Detailed ways
下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施例范围之中。The present invention is further described below by way of examples, but the present invention is not limited to the scope of the described examples.
实施例1Example 1
本实施例提供一种声音信号的处理方法,参照图1,本实施例的处理方法包括:This embodiment provides a method for processing a sound signal. Referring to FIG. 1 , the processing method in this embodiment includes:
S101、监听外部声音信号;S101. Monitor external sound signals;
S102、判断外部声音信号是否满足触发条件;S102, judging whether the external sound signal satisfies the triggering condition;
若是,则执行步骤S103;If yes, execute step S103;
S103、抬升待播放声音信号的频率;S103, raising the frequency of the sound signal to be played;
S104、播放抬升频率后的待播放声音信号。S104, playing the sound signal to be played after the frequency is raised.
本实施例的处理方法可以适用于语音通话过程中,其中,待播放声音信号可以包括语音通话信号,步骤S101具体可以包括在语音通话过程中监听外部声音信号。应当理解,本实施例的处理方法还可以适用于音乐等声音媒体的播放过程中。The processing method of this embodiment may be applicable to a voice call process, wherein the sound signal to be played may include a voice call signal, and step S101 may specifically include monitoring an external sound signal during the voice call process. It should be understood that the processing method in this embodiment can also be applied to the playing process of sound media such as music.
具体地,在本实施例中,外部声音信号可以包括外部噪声信号以及外部语音信号,例如,当待播放声音信号包括语音通话信号时,外部噪声信号可以包括语音通话设备的麦克风监听到的语音通话设备所在环境的声音信号,外部语音信号可以包括语音通话设备的使用者通过语音通话设备的麦克风输入的声音信号。Specifically, in this embodiment, the external sound signal may include an external noise signal and an external voice signal. For example, when the sound signal to be played includes a voice call signal, the external noise signal may include a voice call monitored by the microphone of the voice call device. The sound signal of the environment where the device is located, and the external voice signal may include the sound signal input by the user of the voice communication device through the microphone of the voice communication device.
在本实施例中,用于触发待播放声音信号处理的触发条件可以根据实际应用自定义设置,例如,触发条件可以包括:外部噪声的音量大于预设阈值、外部语音的音量突然增大、外部语音的内容表示语音通话设备的使用者听不清播放内容等,其中,不同的触发条件可以并列触发,也可以按照预设顺序先后触发,本实施例不对此加以限制。In this embodiment, the trigger condition for triggering the processing of the sound signal to be played may be customized according to the actual application. For example, the trigger condition may include: the volume of the external noise is greater than the preset threshold, the volume of the external voice suddenly increases, the external The content of the voice indicates that the user of the voice communication device cannot hear the playing content, etc., wherein, different trigger conditions may be triggered in parallel, or may be triggered sequentially in a preset order, which is not limited in this embodiment.
具体地,参照图2,本实施例中步骤S102可以包括:Specifically, referring to FIG. 2 , step S102 in this embodiment may include:
S1021、识别外部声音信号中的外部噪声信号;S1021, identifying the external noise signal in the external sound signal;
S1022、根据外部噪声信号判断外部噪声的音量是否大于第一阈值;S1022. Determine whether the volume of the external noise is greater than a first threshold according to the external noise signal;
若是,则执行步骤S1023;If yes, execute step S1023;
S1023、确定外部声音信号满足触发条件。S1023. Determine that the external sound signal satisfies the triggering condition.
其中,第一阈值可以根据实际应用自定义设置,例如,第一阈值可以取值为50db。Wherein, the first threshold can be customized according to the actual application, for example, the first threshold can be 50db.
步骤S102还可以包括:Step S102 may also include:
S1024、识别外部声音信号中的外部语音信号;S1024. Identify the external voice signal in the external voice signal;
S1025、根据外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;S1025, according to the external voice signal, determine whether the difference between the volume of the external voice at the current sampling moment and the volume at the last sampling moment is greater than the second threshold;
若是,则执行步骤S1023。If yes, step S1023 is executed.
其中,第二阈值可以根据实际应用自定义设置,例如,第二阈值可以取值为6db。Wherein, the second threshold can be customized according to the actual application, for example, the second threshold can be 6db.
在步骤S1024之后还可以包括:After step S1024, it may further include:
S1026、将外部语音信号转换成文字;S1026, convert the external voice signal into text;
S1027、判断文字是否包括触发关键字;S1027, determine whether the text includes a trigger keyword;
若是,则执行步骤S1023。If yes, step S1023 is executed.
其中,触发关键字可以根据实际应用自定义设置,例如,“听不清”、“什么”、“喂喂”、“大点声”等等。Among them, the trigger keyword can be customized according to the actual application, for example, "can't hear clearly", "what", "hello", "speak louder" and so on.
在本实施例中,若步骤S102判定外部声音信号不满足触发条件,则不对待播放声音信号进行频率抬升的处理。In this embodiment, if it is determined in step S102 that the external sound signal does not meet the triggering condition, the processing of increasing the frequency of the sound signal to be played is not performed.
在本实施例中,参照图3,经频率抬升处理后,待播放声音信号的频率得到了抬升,而音量并未发生改变,也即,将声音信号向频率增大的方向进行了平移。待播放声音信号进行频率抬升的处理后,由于人耳对频率较高的声音更加敏感,从而使得人耳能够更加清晰地收听到所播放声音信号的内容,增强了所播放内容的清晰度。In this embodiment, referring to FIG. 3 , after the frequency raising process, the frequency of the sound signal to be played is raised, but the volume does not change, that is, the sound signal is shifted in the direction of increasing frequency. After the sound signal to be played is processed to increase the frequency, since the human ear is more sensitive to the sound with higher frequency, the human ear can hear the content of the played sound signal more clearly, which enhances the clarity of the played content.
具体地,当本实施例的处理方法适用于语音通话过程中时,尽管频率的增大会导致声音穿透性的减弱,但由于语音通话时语音通话设备的使用者与语音通话设备之间距离较近,从而可以忽略声音传播的距离因素。在语音通话过程中,当判断满足触发条件,也即,判定语音通话设备的使用者可能无法听清对方所讲内容时,通过语音通话设备对待播放声音信号的实时频率抬升处理,使得语音通话设备的使用者可以听到经实时频率抬升处理后的声音信号,使得语音通话设备的使用者可以听清对方所讲内容。Specifically, when the processing method of this embodiment is applied to the process of voice call, although the increase of frequency will lead to the weakening of sound penetration, the distance between the user of the voice call device and the voice call device is relatively long during the voice call. so that the distance factor of sound propagation can be ignored. During the voice call, when it is judged that the trigger condition is met, that is, it is judged that the user of the voice call device may not be able to hear what the other party is saying, the voice call device will process the real-time frequency increase of the sound signal to be played, so that the voice call device The user of the voice communication device can hear the sound signal processed by the real-time frequency uplift, so that the user of the voice communication device can hear the content of the other party clearly.
从而,在语音通话设备通过双麦降噪和语音噪声/回声消除算法等手段在语音通话设备的使用者(用户A)输入的语音通话信号被传输至对方(用户B)之前对语音通话信号进行处理,以解决语音通话过程中所产生的回声和噪声,使得语音通话设备的使用者(用户A)的语音通话内容能够清晰传递给对方(用户B)的基础上,还能够在语音通话设备的使用者(用户A)可能无法听清对方(用户B)所讲内容的情况下对对方(用户B)输入的语音通话信号实时进行频率抬升,以使得语音通话设备的使用者(用户A)能够听清对方(用户B)所讲内容,进一步完善并提升了语音通话过程中的语音通话质量。Therefore, before the voice call signal input by the user (user A) of the voice call device is transmitted to the other party (user B) by means of dual microphone noise reduction and voice noise/echo cancellation algorithm, the voice call signal is processed. processing to solve the echo and noise generated during the voice call, so that the voice call content of the user (user A) of the voice call device can be clearly transmitted to the other party (user B), and the voice call device When the user (user A) may not be able to hear what the other party (user B) is saying, the frequency of the voice call signal input by the other party (user B) is increased in real time, so that the user (user A) of the voice call device can The content of the other party (user B) can be heard clearly, which further improves and improves the quality of the voice call during the voice call.
实施例2Example 2
本实施例提供一种声音信号的处理系统,参照图4,本实施例的处理系统包括:This embodiment provides a sound signal processing system. Referring to FIG. 4 , the processing system in this embodiment includes:
监听模块201,用于监听外部声音信号;a
判断模块202,用于判断外部声音信号是否满足触发条件;The
若是,则调用抬升模块203;If so, call the
抬升模块203,抬升待播放声音信号的频率;The elevating
播放模块204,播放抬升频率后的待播放声音信号。The
本实施例的处理系统可以适用于语音通话设备中,其中,待播放声音信号可以包括语音通话信号,监听模块201可以包括语音通话设备的麦克风,具体可以用于在语音通话过程中监听外部声音信号。应当理解,本实施例的处理系统还可以适用于音乐等声音媒体的播放设备中。The processing system of this embodiment may be applied to a voice call device, wherein the sound signal to be played may include a voice call signal, and the
具体地,在本实施例中,外部声音信号可以包括外部噪声信号以及外部语音信号,例如,当待播放声音信号包括语音通话信号时,外部噪声信号可以包括语音通话设备的麦克风监听到的语音通话设备所在环境的声音信号,外部语音信号可以包括语音通话设备的使用者通过语音通话设备的麦克风输入的声音信号,播放模块204可以包括语音通话设备的听筒与扬声器中的至少一种。Specifically, in this embodiment, the external sound signal may include an external noise signal and an external voice signal. For example, when the sound signal to be played includes a voice call signal, the external noise signal may include a voice call monitored by the microphone of the voice call device. The sound signal of the environment where the device is located, the external voice signal may include the sound signal input by the user of the voice call device through the microphone of the voice call device, and the
在本实施例中,用于触发待播放声音信号处理的触发条件可以根据实际应用自定义设置,例如,触发条件可以包括:外部噪声的音量大于预设阈值、外部语音的音量突然增大、外部语音的内容表示语音通话设备的使用者听不清播放内容等,其中,不同的触发条件可以并列触发,也可以按照预设顺序先后触发,本实施例不对此加以限制。In this embodiment, the trigger condition for triggering the processing of the sound signal to be played may be customized according to the actual application. For example, the trigger condition may include: the volume of the external noise is greater than the preset threshold, the volume of the external voice suddenly increases, the external The content of the voice indicates that the user of the voice communication device cannot hear the playing content, etc., wherein, different trigger conditions may be triggered in parallel, or may be triggered sequentially in a preset order, which is not limited in this embodiment.
具体地,参照图4,本实施例中判断模块202可以包括:Specifically, referring to FIG. 4 , the
第一识别单元2021,用于识别外部声音信号中的外部噪声信号;a
第一判断单元2022,用于根据外部噪声信号判断外部噪声的音量是否大于第一阈值;a
若是,则调用确定单元2023;If so, call the determining
确定单元2023,用于确定外部声音信号满足触发条件。The determining
其中,第一阈值可以根据实际应用自定义设置,例如,第一阈值可以取值为50db。Wherein, the first threshold can be customized according to the actual application, for example, the first threshold can be 50db.
判断模块202还可以包括:The judging
第二识别单元2024,用于识别外部声音信号中的外部语音信号;The
第二判断单元2025,用于根据外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;The
若是,则调用确定单元2023。If so, the determining
其中,第二阈值可以根据实际应用自定义设置,例如,第二阈值可以取值为6db。Wherein, the second threshold can be customized according to the actual application, for example, the second threshold can be 6db.
判断模块202还可以包括:The judging
转换单元2026,用于将外部语音信号转换成文字;
第三判断单元2027,用于判断文字是否包括触发关键字;The
若是,则调用确定单元2023。If so, the determining
其中,触发关键字可以根据实际应用自定义设置,例如,“听不清”、“什么”、“喂喂”、“大点声”等等。Among them, the trigger keyword can be customized according to the actual application, for example, "can't hear clearly", "what", "hello", "speak louder" and so on.
在本实施例中,若判断模块202判定外部声音信号不满足触发条件,则不调用抬升模块203对待播放声音信号进行频率抬升的处理。In this embodiment, if the
在本实施例中,参照图3,经频率抬升处理后,待播放声音信号的频率得到了抬升,而音量并未发生改变,也即,将声音信号向频率增大的方向进行了平移。待播放声音信号进行频率抬升的处理后,由于人耳对频率较高的声音更加敏感,从而使得人耳能够更加清晰地收听到所播放声音信号的内容,增强了所播放内容的清晰度。In this embodiment, referring to FIG. 3 , after the frequency raising process, the frequency of the sound signal to be played is raised, but the volume does not change, that is, the sound signal is shifted in the direction of increasing frequency. After the sound signal to be played is processed to increase the frequency, since the human ear is more sensitive to the sound with higher frequency, the human ear can hear the content of the played sound signal more clearly, which enhances the clarity of the played content.
具体地,当本实施例的处理系统适用于语音通话设备中时,尽管频率的增大会导致声音穿透性的减弱,但由于语音通话时语音通话设备的使用者与语音通话设备之间距离较近,从而可以忽略声音传播的距离因素。在语音通话过程中,当判断满足触发条件,也即,判定语音通话设备的使用者可能无法听清对方所讲内容时,通过语音通话设备对待播放声音信号的实时频率抬升处理,使得语音通话设备的使用者可以听到经实时频率抬升处理后的声音信号,使得语音通话设备的使用者可以听清对方所讲内容。Specifically, when the processing system of this embodiment is applied to a voice communication device, although the increase of the frequency will lead to the weakening of the sound penetration, the distance between the user of the voice communication device and the voice communication device is relatively large during the voice communication. so that the distance factor of sound propagation can be ignored. During the voice call, when it is judged that the trigger condition is met, that is, it is judged that the user of the voice call device may not be able to hear what the other party is saying, the voice call device will process the real-time frequency increase of the sound signal to be played, so that the voice call device The user of the voice communication device can hear the sound signal processed by the real-time frequency uplift, so that the user of the voice communication device can hear the content of the other party clearly.
从而,在语音通话设备通过双麦降噪和语音噪声/回声消除算法等手段在语音通话设备的使用者(用户A)输入的语音通话信号被传输至对方(用户B)之前对语音通话信号进行处理,以解决语音通话过程中所产生的回声和噪声,使得语音通话设备的使用者(用户A)的语音通话内容能够清晰传递给对方(用户B)的基础上,还能够在语音通话设备的使用者(用户A)可能无法听清对方(用户B)所讲内容的情况下对对方(用户B)输入的语音通话信号实时进行频率抬升,以使得语音通话设备的使用者(用户A)能够听清对方(用户B)所讲内容,进一步完善并提升了语音通话过程中的语音通话质量。Therefore, before the voice call signal input by the user (user A) of the voice call device is transmitted to the other party (user B) by means of dual microphone noise reduction and voice noise/echo cancellation algorithm, the voice call signal is processed. processing to solve the echo and noise generated during the voice call, so that the voice call content of the user (user A) of the voice call device can be clearly transmitted to the other party (user B), and the voice call device When the user (user A) may not be able to hear the content of the other party (user B) clearly, the frequency of the voice call signal input by the other party (user B) is increased in real time, so that the user (user A) of the voice call device can The content of the other party (user B) can be heard clearly, which further improves and improves the quality of the voice call during the voice call.
实施例3Example 3
本实施例提供一种电子设备,电子设备可以通过计算设备的形式表现(例如可以为服务器设备),包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其中处理器执行计算机程序时可以实现实施例1提供的声音信号的处理方法。This embodiment provides an electronic device, which can be expressed in the form of a computing device (for example, a server device), and includes a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor The sound signal processing method provided in Embodiment 1 can be implemented when the computer program is executed.
图5示出了本实施例的硬件结构示意图,如图5所示,电子设备9具体包括:FIG. 5 shows a schematic diagram of the hardware structure of this embodiment. As shown in FIG. 5 , the
至少一个处理器91、至少一个存储器92以及用于连接不同系统组件(包括处理器91和存储器92)的总线93,其中:At least one
总线93包括数据总线、地址总线和控制总线。The
存储器92包括易失性存储器,例如随机存取存储器(RAM)921和/或高速缓存存储器922,还可以进一步包括只读存储器(ROM)923。
存储器92还包括具有一组(至少一个)程序模块924的程序/实用工具925,这样的程序模块924包括但不限于:操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。The
处理器91通过运行存储在存储器92中的计算机程序,从而执行各种功能应用以及数据处理,例如本发明实施例1所提供的声音信号的处理方法。The
电子设备9进一步可以与一个或多个外部设备94(例如键盘、指向设备等)通信。这种通信可以通过输入/输出(I/O)接口95进行。并且,电子设备9还可以通过网络适配器96与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。网络适配器96通过总线93与电子设备9的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备9使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理器、外部磁盘驱动阵列、RAID(磁盘阵列)系统、磁带驱动器以及数据备份存储系统等。The
应当注意,尽管在上文详细描述中提及了电子设备的若干单元/模块或子单元/模块,但是这种划分仅仅是示例性的并非强制性的。实际上,根据本申请的实施方式,上文描述的两个或更多单元/模块的特征和功能可以在一个单元/模块中具体化。反之,上文描述的一个单元/模块的特征和功能可以进一步划分为由多个单元/模块来具体化。It should be noted that although several units/modules or sub-units/modules of the electronic device are mentioned in the above detailed description, this division is merely exemplary and not mandatory. Indeed, according to embodiments of the present application, the features and functions of two or more units/modules described above may be embodied in one unit/module. Conversely, the features and functions of one unit/module described above may be further subdivided to be embodied by multiple units/modules.
实施例4Example 4
本实施例提供了一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现实施例1所提供的声音信号的处理方法的步骤。This embodiment provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the steps of the sound signal processing method provided in Embodiment 1.
其中,可读存储介质可以采用的更具体可以包括但不限于:便携式盘、硬盘、随机存取存储器、只读存储器、可擦拭可编程只读存储器、光存储器件、磁存储器件或上述的任意合适的组合。Wherein, the readable storage medium may include, but is not limited to, a portable disk, a hard disk, a random access memory, a read-only memory, an erasable programmable read-only memory, an optical storage device, a magnetic storage device, or any of the above suitable combination.
在可能的实施方式中,本发明还可以实现为一种程序产品的形式,其包括程序代码,当所述程序产品在终端设备上运行时,所述程序代码用于使所述终端设备执行实现实施例1所述的声音信号的处理方法的步骤。In a possible implementation manner, the present invention can also be implemented in the form of a program product, which includes program codes, when the program product runs on a terminal device, the program code is used to cause the terminal device to execute the implementation Steps of the sound signal processing method described in Embodiment 1.
其中,可以以一种或多种程序设计语言的任意组合来编写用于执行本发明的程序代码,所述程序代码可以完全地在用户设备上执行、部分地在用户设备上执行、作为一个独立的软件包执行、部分在用户设备上部分在远程设备上执行或完全在远程设备上执行。Wherein, the program code for executing the present invention can be written in any combination of one or more programming languages, and the program code can be completely executed on the user equipment, partially executed on the user equipment, as an independent The software package executes on the user's device, partly on the user's device, partly on the remote device, or entirely on the remote device.
虽然以上描述了本发明的具体实施方式,但是本领域的技术人员应当理解,这仅是举例说明,本发明的保护范围是由所附权利要求书限定的。本领域的技术人员在不背离本发明的原理和实质的前提下,可以对这些实施方式做出多种变更或修改,但这些变更和修改均落入本发明的保护范围。Although the specific embodiments of the present invention are described above, those skilled in the art should understand that this is only an illustration, and the protection scope of the present invention is defined by the appended claims. Those skilled in the art can make various changes or modifications to these embodiments without departing from the principle and essence of the present invention, but these changes and modifications all fall within the protection scope of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010744551.4A CN111863004A (en) | 2020-07-29 | 2020-07-29 | Sound signal processing method, system, electronic device and storage medium |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010744551.4A CN111863004A (en) | 2020-07-29 | 2020-07-29 | Sound signal processing method, system, electronic device and storage medium |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN111863004A true CN111863004A (en) | 2020-10-30 |
Family
ID=72946037
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010744551.4A Pending CN111863004A (en) | 2020-07-29 | 2020-07-29 | Sound signal processing method, system, electronic device and storage medium |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111863004A (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114866639A (en) * | 2022-04-29 | 2022-08-05 | 维沃移动通信有限公司 | Electronic equipment, control method and device |
Citations (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2000112710A (en) * | 1998-10-09 | 2000-04-21 | Canon Inc | Volume control device and method and storage medium |
| US20080162120A1 (en) * | 2007-01-03 | 2008-07-03 | Motorola, Inc. | Method and apparatus for providing feedback of vocal quality to a user |
| WO2011031273A1 (en) * | 2009-09-14 | 2011-03-17 | Srs Labs, Inc | System for adaptive voice intelligibility processing |
| CN102436821A (en) * | 2011-12-02 | 2012-05-02 | 海能达通信股份有限公司 | Method and equipment for self-adaptive adjustment of sound effect |
| CN202889458U (en) * | 2012-11-02 | 2013-04-17 | 姚西 | Automatic call volume regulation mobile phone based on environmental noise |
| WO2013078677A1 (en) * | 2011-12-02 | 2013-06-06 | 海能达通信股份有限公司 | A method and device for adaptively adjusting sound effect |
| CN103905646A (en) * | 2014-04-09 | 2014-07-02 | 深圳市中兴移动通信有限公司 | Communication terminal and voice processing method thereof |
| CN107105367A (en) * | 2017-05-24 | 2017-08-29 | 维沃移动通信有限公司 | A kind of acoustic signal processing method and terminal |
| CN107146613A (en) * | 2017-04-10 | 2017-09-08 | 北京猎户星空科技有限公司 | A kind of voice interactive method and device |
| CN107172256A (en) * | 2017-07-27 | 2017-09-15 | 广东欧珀移动通信有限公司 | Earphone call self-adapting regulation method, device, mobile terminal and storage medium |
| CN107331405A (en) * | 2017-06-30 | 2017-11-07 | 深圳市金立通信设备有限公司 | A kind of voice information processing method and server |
| CN107395883A (en) * | 2017-07-17 | 2017-11-24 | 广东欧珀移动通信有限公司 | Speech signal adjustment method, communication terminal, and computer-readable storage medium |
| CN108476256A (en) * | 2016-09-27 | 2018-08-31 | 华为技术有限公司 | A volume adjustment method and terminal |
| CN111356008A (en) * | 2018-12-21 | 2020-06-30 | 深圳Tcl数字技术有限公司 | Automatic television volume adjusting method, smart television and storage medium |
-
2020
- 2020-07-29 CN CN202010744551.4A patent/CN111863004A/en active Pending
Patent Citations (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2000112710A (en) * | 1998-10-09 | 2000-04-21 | Canon Inc | Volume control device and method and storage medium |
| US20080162120A1 (en) * | 2007-01-03 | 2008-07-03 | Motorola, Inc. | Method and apparatus for providing feedback of vocal quality to a user |
| WO2011031273A1 (en) * | 2009-09-14 | 2011-03-17 | Srs Labs, Inc | System for adaptive voice intelligibility processing |
| CN102436821A (en) * | 2011-12-02 | 2012-05-02 | 海能达通信股份有限公司 | Method and equipment for self-adaptive adjustment of sound effect |
| WO2013078677A1 (en) * | 2011-12-02 | 2013-06-06 | 海能达通信股份有限公司 | A method and device for adaptively adjusting sound effect |
| CN202889458U (en) * | 2012-11-02 | 2013-04-17 | 姚西 | Automatic call volume regulation mobile phone based on environmental noise |
| CN103905646A (en) * | 2014-04-09 | 2014-07-02 | 深圳市中兴移动通信有限公司 | Communication terminal and voice processing method thereof |
| CN108476256A (en) * | 2016-09-27 | 2018-08-31 | 华为技术有限公司 | A volume adjustment method and terminal |
| CN107146613A (en) * | 2017-04-10 | 2017-09-08 | 北京猎户星空科技有限公司 | A kind of voice interactive method and device |
| CN107105367A (en) * | 2017-05-24 | 2017-08-29 | 维沃移动通信有限公司 | A kind of acoustic signal processing method and terminal |
| CN107331405A (en) * | 2017-06-30 | 2017-11-07 | 深圳市金立通信设备有限公司 | A kind of voice information processing method and server |
| CN107395883A (en) * | 2017-07-17 | 2017-11-24 | 广东欧珀移动通信有限公司 | Speech signal adjustment method, communication terminal, and computer-readable storage medium |
| CN107172256A (en) * | 2017-07-27 | 2017-09-15 | 广东欧珀移动通信有限公司 | Earphone call self-adapting regulation method, device, mobile terminal and storage medium |
| CN111356008A (en) * | 2018-12-21 | 2020-06-30 | 深圳Tcl数字技术有限公司 | Automatic television volume adjusting method, smart television and storage medium |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114866639A (en) * | 2022-04-29 | 2022-08-05 | 维沃移动通信有限公司 | Electronic equipment, control method and device |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP2267695B1 (en) | Controlling music players using environment audio analysis | |
| CN107995360B (en) | Call processing method and related products | |
| WO2024051820A1 (en) | Abnormality-based paging method and related apparatus | |
| CN113949955B (en) | Noise reduction processing method, device, electronic equipment, earphone and storage medium | |
| CN108681440A (en) | A kind of smart machine method for controlling volume and system | |
| CN103796125A (en) | Sound adjusting method based on earphone playing | |
| CN112004177A (en) | Howling detection method, microphone volume adjustment method and medium | |
| TWI753685B (en) | Headphone noise reduction mode control method, system, equipment and storage medium | |
| US20250336407A1 (en) | Dynamic noise and speech removal | |
| CN111048118B (en) | Voice signal processing method and device and terminal | |
| CN111968660A (en) | Echo cancellation device and method, electronic device, and storage medium | |
| US20230298612A1 (en) | Microphone Array Configuration Invariant, Streaming, Multichannel Neural Enhancement Frontend for Automatic Speech Recognition | |
| CN105825854B (en) | A voice signal processing method, device and mobile terminal | |
| CN112087726A (en) | Method and system for identifying polyphonic ringtone, electronic equipment and storage medium | |
| JP2021177598A (en) | Speech processing system, speech processing method, and speech processing program | |
| US11122160B1 (en) | Detecting and correcting audio echo | |
| CN111863004A (en) | Sound signal processing method, system, electronic device and storage medium | |
| WO2022199320A1 (en) | Superimposing high-frequency copies of emitted sounds | |
| CN112235462A (en) | Voice adjusting method, system, electronic equipment and computer readable storage medium | |
| US20230290356A1 (en) | Hearing aid for cognitive help using speaker recognition | |
| CN112700785B (en) | Voice signal processing method and device and related equipment | |
| CN106293607B (en) | Method and system for automatically switching audio output modes | |
| CN112883178B (en) | Interaction method, device, server and storage medium | |
| CN112929777B (en) | Audio signal processing method and device and electronic equipment | |
| CN111048096B (en) | Voice signal processing method and device and terminal |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20201030 |