[go: up one dir, main page]

CN111863004A - Sound signal processing method, system, electronic device and storage medium - Google Patents

Sound signal processing method, system, electronic device and storage medium Download PDF

Info

Publication number
CN111863004A
CN111863004A CN202010744551.4A CN202010744551A CN111863004A CN 111863004 A CN111863004 A CN 111863004A CN 202010744551 A CN202010744551 A CN 202010744551A CN 111863004 A CN111863004 A CN 111863004A
Authority
CN
China
Prior art keywords
sound signal
external
signal
voice
played
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010744551.4A
Other languages
Chinese (zh)
Inventor
马洪刚
冯亚东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Simcom Wireless Solutions Co Ltd
Original Assignee
Shanghai Simcom Wireless Solutions Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Simcom Wireless Solutions Co Ltd filed Critical Shanghai Simcom Wireless Solutions Co Ltd
Priority to CN202010744551.4A priority Critical patent/CN111863004A/en
Publication of CN111863004A publication Critical patent/CN111863004A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)

Abstract

The invention discloses a method and a system for processing a sound signal, electronic equipment and a storage medium. The processing method comprises the following steps: monitoring an external sound signal; judging whether the external sound signal meets a trigger condition; if so, raising the frequency of the sound signal to be played; and playing the sound signal to be played after the frequency is raised. The invention can monitor the external sound signal in real time in the playing process of the sound signal, and when the external sound signal meets the triggering condition, for example, when the playing environment turns from quiet to noisy, the frequency of the sound signal to be played is automatically raised, and then the sound signal to be played is played, so that a user can more clearly listen to the content of the played sound signal, and the definition of the played content is automatically enhanced.

Description

声音信号的处理方法、系统、电子设备及存储介质Sound signal processing method, system, electronic device and storage medium

技术领域technical field

本发明涉及声音信号处理技术领域,尤其涉及一种声音信号的处理方法、系统、电子设备及存储介质。The present invention relates to the technical field of sound signal processing, and in particular, to a sound signal processing method, system, electronic device and storage medium.

背景技术Background technique

当前在声音信号的播放过程中(例如在语音通话过程中,又例如在音乐播放过程中等),当播放环境由安静转向嘈杂时,为了能够较为清晰地收听播放内容,通常的做法包括增大播放音量、佩戴耳机、转至安静的环境等等,而无法自动增强播放内容的清晰度。Currently, during the playback of sound signals (for example, during a voice call, for example, during music playback, etc.), when the playback environment changes from quiet to noisy, in order to listen to the playback content more clearly, the usual practice includes increasing the playback volume. volume, wearing headphones, moving to a quiet environment, etc. without automatically enhancing the clarity of the playback content.

发明内容SUMMARY OF THE INVENTION

本发明要解决的技术问题是为了克服现有技术中无法在播放环境由安静转向嘈杂时自动增强播放内容清晰度的缺陷,提供一种声音信号的处理方法、系统、电子设备及存储介质。The technical problem to be solved by the present invention is to provide a sound signal processing method, system, electronic device and storage medium in order to overcome the defect in the prior art that the playback content cannot be automatically enhanced when the playback environment changes from quiet to noisy.

本发明是通过下述技术方案来解决上述技术问题:The present invention solves the above-mentioned technical problems through the following technical solutions:

一种声音信号的处理方法,包括:A method for processing a sound signal, comprising:

监听外部声音信号;monitor external sound signals;

判断所述外部声音信号是否满足触发条件;Judging whether the external sound signal satisfies the triggering condition;

若是,则抬升待播放声音信号的频率;If so, increase the frequency of the sound signal to be played;

播放抬升频率后的所述待播放声音信号。The to-be-played sound signal after the frequency is raised is played.

较佳地,所述外部声音信号包括外部噪声信号,所述判断所述外部声音信号是否满足触发条件的步骤包括:Preferably, the external sound signal includes an external noise signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:

识别所述外部声音信号中的外部噪声信号;identifying an external noise signal in the external sound signal;

根据所述外部噪声信号判断外部噪声的音量是否大于第一阈值;Determine whether the volume of the external noise is greater than a first threshold according to the external noise signal;

若是,则确定所述外部声音信号满足触发条件;If yes, then determine that the external sound signal satisfies the trigger condition;

和/或,所述外部声音信号包括外部语音信号,所述判断所述外部声音信号是否满足触发条件的步骤包括:And/or, the external sound signal includes an external voice signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:

识别所述外部声音信号中的外部语音信号;Identifying an external voice signal in the external sound signal;

根据所述外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;According to the external voice signal, determine whether the difference between the volume of the external voice at the current sampling moment and the volume at the previous sampling moment is greater than a second threshold;

若是,则确定所述外部声音信号满足触发条件。If so, it is determined that the external sound signal satisfies the trigger condition.

较佳地,所述外部声音信号包括外部语音信号,所述判断所述外部声音信号是否满足触发条件的步骤包括:Preferably, the external sound signal includes an external voice signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:

识别所述外部声音信号中的外部语音信号;Identifying an external voice signal in the external sound signal;

将所述外部语音信号转换成文字;converting the external speech signal into text;

判断所述文字是否包括触发关键字;determine whether the text includes a trigger keyword;

若是,则确定所述外部声音信号满足触发条件。If so, it is determined that the external sound signal satisfies the trigger condition.

较佳地,所述监听外部声音信号的步骤具体包括在语音通话过程中监听外部声音信号;Preferably, the step of monitoring the external sound signal specifically includes monitoring the external sound signal during the voice call;

所述待播放声音信号包括语音通话信号。The to-be-played sound signal includes a voice call signal.

一种声音信号的处理系统,包括:A sound signal processing system, comprising:

监听模块,用于监听外部声音信号;Monitoring module for monitoring external sound signals;

判断模块,用于判断所述外部声音信号是否满足触发条件;a judgment module for judging whether the external sound signal satisfies a trigger condition;

若是,则调用抬升模块,所述抬升模块用于抬升待播放声音信号的频率;If so, call a lift module, and the lift module is used to lift the frequency of the sound signal to be played;

播放模块,用于播放抬升频率后的所述待播放声音信号。The playing module is used for playing the sound signal to be played after the frequency is raised.

较佳地,所述外部声音信号包括外部噪声信号,所述判断模块包括:Preferably, the external sound signal includes an external noise signal, and the judgment module includes:

第一识别单元,用于识别所述外部声音信号中的外部噪声信号;a first identification unit, used to identify the external noise signal in the external sound signal;

第一判断单元,用于根据所述外部噪声信号判断外部噪声的音量是否大于第一阈值;a first judging unit, configured to judge whether the volume of the external noise is greater than a first threshold according to the external noise signal;

若是,则调用确定单元,所述确定单元用于确定所述外部声音信号满足触发条件;If so, call a determination unit, and the determination unit is used to determine that the external sound signal satisfies the trigger condition;

和/或,所述外部声音信号包括外部语音信号,所述判断模块包括:And/or, the external sound signal includes an external voice signal, and the judgment module includes:

第二识别单元,用于识别所述外部声音信号中的外部语音信号;a second identification unit, configured to identify an external voice signal in the external sound signal;

第二判断单元,用于根据所述外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;A second judging unit for judging whether the difference between the volume of the external voice at the current sampling moment and the volume at the last sampling moment is greater than a second threshold according to the external voice signal;

若是,则调用确定单元,所述确定单元用于确定所述外部声音信号满足触发条件。If so, the determining unit is invoked, and the determining unit is configured to determine that the external sound signal satisfies the triggering condition.

较佳地,所述外部声音信号包括外部语音信号,所述判断模块包括:Preferably, the external sound signal includes an external voice signal, and the judgment module includes:

第二识别单元,用于识别所述外部声音信号中的外部语音信号;a second identification unit, configured to identify an external voice signal in the external sound signal;

转换单元,用于将所述外部语音信号转换成文字;a conversion unit for converting the external voice signal into text;

第三判断单元,用于判断所述文字是否包括触发关键字;a third judging unit for judging whether the text includes a trigger keyword;

若是,则调用确定单元,所述确定单元用于确定所述外部声音信号满足触发条件。If so, the determining unit is invoked, and the determining unit is configured to determine that the external sound signal satisfies the triggering condition.

较佳地,所述监听模块具体用于在语音通话过程中监听外部声音信号;Preferably, the monitoring module is specifically used to monitor external sound signals during a voice call;

所述待播放声音信号包括语音通话信号;The to-be-played sound signal includes a voice call signal;

所述播放模块包括听筒和/或扬声器。The playback module includes an earpiece and/or a speaker.

一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现上述任一种声音信号的处理方法。An electronic device includes a memory, a processor, and a computer program stored in the memory and running on the processor. The processor implements any of the above sound signal processing methods when the processor executes the computer program.

一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现上述任一种声音信号的处理方法的步骤。A computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of any of the above sound signal processing methods.

本发明的积极进步效果在于:本发明可以在声音信号的播放过程中实时监听外部声音信号,并在外部声音信号满足触发条件时,例如,播放环境由安静转向嘈杂时,先自动抬升待播放声音信号的频率,再行播放待播放声音信号,使得用户能够更加清晰地收听到所播放声音信号的内容,自动增强了所播放内容的清晰度。The positive improvement effect of the present invention is: the present invention can monitor the external sound signal in real time during the playback of the sound signal, and when the external sound signal meets the trigger condition, for example, when the playback environment changes from quiet to noisy, the sound to be played is automatically raised first The frequency of the signal is played, and the sound signal to be played is played again, so that the user can listen to the content of the played sound signal more clearly, and the clarity of the played content is automatically enhanced.

附图说明Description of drawings

图1为根据本发明实施例1的声音信号的处理方法的流程图。FIG. 1 is a flowchart of a method for processing a sound signal according to Embodiment 1 of the present invention.

图2为根据本发明实施例1的声音信号的处理方法的具体流程图。FIG. 2 is a specific flowchart of a method for processing a sound signal according to Embodiment 1 of the present invention.

图3为根据本发明实施例1的声音信号的处理方法中频率抬升的示意图。FIG. 3 is a schematic diagram of frequency boosting in the sound signal processing method according to Embodiment 1 of the present invention.

图4为根据本发明实施例2的声音信号的处理系统的模块示意图。FIG. 4 is a schematic block diagram of a sound signal processing system according to Embodiment 2 of the present invention.

图5为根据本发明实施例3的电子设备的结构示意图。FIG. 5 is a schematic structural diagram of an electronic device according to Embodiment 3 of the present invention.

具体实施方式Detailed ways

下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施例范围之中。The present invention is further described below by way of examples, but the present invention is not limited to the scope of the described examples.

实施例1Example 1

本实施例提供一种声音信号的处理方法,参照图1,本实施例的处理方法包括:This embodiment provides a method for processing a sound signal. Referring to FIG. 1 , the processing method in this embodiment includes:

S101、监听外部声音信号;S101. Monitor external sound signals;

S102、判断外部声音信号是否满足触发条件;S102, judging whether the external sound signal satisfies the triggering condition;

若是,则执行步骤S103;If yes, execute step S103;

S103、抬升待播放声音信号的频率;S103, raising the frequency of the sound signal to be played;

S104、播放抬升频率后的待播放声音信号。S104, playing the sound signal to be played after the frequency is raised.

本实施例的处理方法可以适用于语音通话过程中,其中,待播放声音信号可以包括语音通话信号,步骤S101具体可以包括在语音通话过程中监听外部声音信号。应当理解,本实施例的处理方法还可以适用于音乐等声音媒体的播放过程中。The processing method of this embodiment may be applicable to a voice call process, wherein the sound signal to be played may include a voice call signal, and step S101 may specifically include monitoring an external sound signal during the voice call process. It should be understood that the processing method in this embodiment can also be applied to the playing process of sound media such as music.

具体地,在本实施例中,外部声音信号可以包括外部噪声信号以及外部语音信号,例如,当待播放声音信号包括语音通话信号时,外部噪声信号可以包括语音通话设备的麦克风监听到的语音通话设备所在环境的声音信号,外部语音信号可以包括语音通话设备的使用者通过语音通话设备的麦克风输入的声音信号。Specifically, in this embodiment, the external sound signal may include an external noise signal and an external voice signal. For example, when the sound signal to be played includes a voice call signal, the external noise signal may include a voice call monitored by the microphone of the voice call device. The sound signal of the environment where the device is located, and the external voice signal may include the sound signal input by the user of the voice communication device through the microphone of the voice communication device.

在本实施例中,用于触发待播放声音信号处理的触发条件可以根据实际应用自定义设置,例如,触发条件可以包括:外部噪声的音量大于预设阈值、外部语音的音量突然增大、外部语音的内容表示语音通话设备的使用者听不清播放内容等,其中,不同的触发条件可以并列触发,也可以按照预设顺序先后触发,本实施例不对此加以限制。In this embodiment, the trigger condition for triggering the processing of the sound signal to be played may be customized according to the actual application. For example, the trigger condition may include: the volume of the external noise is greater than the preset threshold, the volume of the external voice suddenly increases, the external The content of the voice indicates that the user of the voice communication device cannot hear the playing content, etc., wherein, different trigger conditions may be triggered in parallel, or may be triggered sequentially in a preset order, which is not limited in this embodiment.

具体地,参照图2,本实施例中步骤S102可以包括:Specifically, referring to FIG. 2 , step S102 in this embodiment may include:

S1021、识别外部声音信号中的外部噪声信号;S1021, identifying the external noise signal in the external sound signal;

S1022、根据外部噪声信号判断外部噪声的音量是否大于第一阈值;S1022. Determine whether the volume of the external noise is greater than a first threshold according to the external noise signal;

若是,则执行步骤S1023;If yes, execute step S1023;

S1023、确定外部声音信号满足触发条件。S1023. Determine that the external sound signal satisfies the triggering condition.

其中,第一阈值可以根据实际应用自定义设置,例如,第一阈值可以取值为50db。Wherein, the first threshold can be customized according to the actual application, for example, the first threshold can be 50db.

步骤S102还可以包括:Step S102 may also include:

S1024、识别外部声音信号中的外部语音信号;S1024. Identify the external voice signal in the external voice signal;

S1025、根据外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;S1025, according to the external voice signal, determine whether the difference between the volume of the external voice at the current sampling moment and the volume at the last sampling moment is greater than the second threshold;

若是,则执行步骤S1023。If yes, step S1023 is executed.

其中,第二阈值可以根据实际应用自定义设置,例如,第二阈值可以取值为6db。Wherein, the second threshold can be customized according to the actual application, for example, the second threshold can be 6db.

在步骤S1024之后还可以包括:After step S1024, it may further include:

S1026、将外部语音信号转换成文字;S1026, convert the external voice signal into text;

S1027、判断文字是否包括触发关键字;S1027, determine whether the text includes a trigger keyword;

若是,则执行步骤S1023。If yes, step S1023 is executed.

其中,触发关键字可以根据实际应用自定义设置,例如,“听不清”、“什么”、“喂喂”、“大点声”等等。Among them, the trigger keyword can be customized according to the actual application, for example, "can't hear clearly", "what", "hello", "speak louder" and so on.

在本实施例中,若步骤S102判定外部声音信号不满足触发条件,则不对待播放声音信号进行频率抬升的处理。In this embodiment, if it is determined in step S102 that the external sound signal does not meet the triggering condition, the processing of increasing the frequency of the sound signal to be played is not performed.

在本实施例中,参照图3,经频率抬升处理后,待播放声音信号的频率得到了抬升,而音量并未发生改变,也即,将声音信号向频率增大的方向进行了平移。待播放声音信号进行频率抬升的处理后,由于人耳对频率较高的声音更加敏感,从而使得人耳能够更加清晰地收听到所播放声音信号的内容,增强了所播放内容的清晰度。In this embodiment, referring to FIG. 3 , after the frequency raising process, the frequency of the sound signal to be played is raised, but the volume does not change, that is, the sound signal is shifted in the direction of increasing frequency. After the sound signal to be played is processed to increase the frequency, since the human ear is more sensitive to the sound with higher frequency, the human ear can hear the content of the played sound signal more clearly, which enhances the clarity of the played content.

具体地,当本实施例的处理方法适用于语音通话过程中时,尽管频率的增大会导致声音穿透性的减弱,但由于语音通话时语音通话设备的使用者与语音通话设备之间距离较近,从而可以忽略声音传播的距离因素。在语音通话过程中,当判断满足触发条件,也即,判定语音通话设备的使用者可能无法听清对方所讲内容时,通过语音通话设备对待播放声音信号的实时频率抬升处理,使得语音通话设备的使用者可以听到经实时频率抬升处理后的声音信号,使得语音通话设备的使用者可以听清对方所讲内容。Specifically, when the processing method of this embodiment is applied to the process of voice call, although the increase of frequency will lead to the weakening of sound penetration, the distance between the user of the voice call device and the voice call device is relatively long during the voice call. so that the distance factor of sound propagation can be ignored. During the voice call, when it is judged that the trigger condition is met, that is, it is judged that the user of the voice call device may not be able to hear what the other party is saying, the voice call device will process the real-time frequency increase of the sound signal to be played, so that the voice call device The user of the voice communication device can hear the sound signal processed by the real-time frequency uplift, so that the user of the voice communication device can hear the content of the other party clearly.

从而,在语音通话设备通过双麦降噪和语音噪声/回声消除算法等手段在语音通话设备的使用者(用户A)输入的语音通话信号被传输至对方(用户B)之前对语音通话信号进行处理,以解决语音通话过程中所产生的回声和噪声,使得语音通话设备的使用者(用户A)的语音通话内容能够清晰传递给对方(用户B)的基础上,还能够在语音通话设备的使用者(用户A)可能无法听清对方(用户B)所讲内容的情况下对对方(用户B)输入的语音通话信号实时进行频率抬升,以使得语音通话设备的使用者(用户A)能够听清对方(用户B)所讲内容,进一步完善并提升了语音通话过程中的语音通话质量。Therefore, before the voice call signal input by the user (user A) of the voice call device is transmitted to the other party (user B) by means of dual microphone noise reduction and voice noise/echo cancellation algorithm, the voice call signal is processed. processing to solve the echo and noise generated during the voice call, so that the voice call content of the user (user A) of the voice call device can be clearly transmitted to the other party (user B), and the voice call device When the user (user A) may not be able to hear what the other party (user B) is saying, the frequency of the voice call signal input by the other party (user B) is increased in real time, so that the user (user A) of the voice call device can The content of the other party (user B) can be heard clearly, which further improves and improves the quality of the voice call during the voice call.

实施例2Example 2

本实施例提供一种声音信号的处理系统,参照图4,本实施例的处理系统包括:This embodiment provides a sound signal processing system. Referring to FIG. 4 , the processing system in this embodiment includes:

监听模块201,用于监听外部声音信号;a monitoring module 201 for monitoring external sound signals;

判断模块202,用于判断外部声音信号是否满足触发条件;The judgment module 202 is used for judging whether the external sound signal satisfies the triggering condition;

若是,则调用抬升模块203;If so, call the lifting module 203;

抬升模块203,抬升待播放声音信号的频率;The elevating module 203 elevates the frequency of the sound signal to be played;

播放模块204,播放抬升频率后的待播放声音信号。The playing module 204 plays the sound signal to be played after the frequency is raised.

本实施例的处理系统可以适用于语音通话设备中,其中,待播放声音信号可以包括语音通话信号,监听模块201可以包括语音通话设备的麦克风,具体可以用于在语音通话过程中监听外部声音信号。应当理解,本实施例的处理系统还可以适用于音乐等声音媒体的播放设备中。The processing system of this embodiment may be applied to a voice call device, wherein the sound signal to be played may include a voice call signal, and the monitoring module 201 may include a microphone of the voice call device, and may be specifically used to monitor external sound signals during a voice call . It should be understood that the processing system in this embodiment may also be applied to a playback device for sound media such as music.

具体地,在本实施例中,外部声音信号可以包括外部噪声信号以及外部语音信号,例如,当待播放声音信号包括语音通话信号时,外部噪声信号可以包括语音通话设备的麦克风监听到的语音通话设备所在环境的声音信号,外部语音信号可以包括语音通话设备的使用者通过语音通话设备的麦克风输入的声音信号,播放模块204可以包括语音通话设备的听筒与扬声器中的至少一种。Specifically, in this embodiment, the external sound signal may include an external noise signal and an external voice signal. For example, when the sound signal to be played includes a voice call signal, the external noise signal may include a voice call monitored by the microphone of the voice call device. The sound signal of the environment where the device is located, the external voice signal may include the sound signal input by the user of the voice call device through the microphone of the voice call device, and the playback module 204 may include at least one of the earpiece and the speaker of the voice call device.

在本实施例中,用于触发待播放声音信号处理的触发条件可以根据实际应用自定义设置,例如,触发条件可以包括:外部噪声的音量大于预设阈值、外部语音的音量突然增大、外部语音的内容表示语音通话设备的使用者听不清播放内容等,其中,不同的触发条件可以并列触发,也可以按照预设顺序先后触发,本实施例不对此加以限制。In this embodiment, the trigger condition for triggering the processing of the sound signal to be played may be customized according to the actual application. For example, the trigger condition may include: the volume of the external noise is greater than the preset threshold, the volume of the external voice suddenly increases, the external The content of the voice indicates that the user of the voice communication device cannot hear the playing content, etc., wherein, different trigger conditions may be triggered in parallel, or may be triggered sequentially in a preset order, which is not limited in this embodiment.

具体地,参照图4,本实施例中判断模块202可以包括:Specifically, referring to FIG. 4 , the judgment module 202 in this embodiment may include:

第一识别单元2021,用于识别外部声音信号中的外部噪声信号;a first identification unit 2021, used to identify the external noise signal in the external sound signal;

第一判断单元2022,用于根据外部噪声信号判断外部噪声的音量是否大于第一阈值;a first judgment unit 2022, configured to judge whether the volume of the external noise is greater than the first threshold according to the external noise signal;

若是,则调用确定单元2023;If so, call the determining unit 2023;

确定单元2023,用于确定外部声音信号满足触发条件。The determining unit 2023 is configured to determine that the external sound signal satisfies the triggering condition.

其中,第一阈值可以根据实际应用自定义设置,例如,第一阈值可以取值为50db。Wherein, the first threshold can be customized according to the actual application, for example, the first threshold can be 50db.

判断模块202还可以包括:The judging module 202 may also include:

第二识别单元2024,用于识别外部声音信号中的外部语音信号;The second identification unit 2024 is used to identify the external voice signal in the external sound signal;

第二判断单元2025,用于根据外部语音信号判断外部语音在当前采样时刻的音量与上一采样时刻的音量之差是否大于第二阈值;The second judgment unit 2025 is used to judge whether the difference between the volume of the external voice at the current sampling moment and the volume of the previous sampling moment is greater than the second threshold according to the external voice signal;

若是,则调用确定单元2023。If so, the determining unit 2023 is called.

其中,第二阈值可以根据实际应用自定义设置,例如,第二阈值可以取值为6db。Wherein, the second threshold can be customized according to the actual application, for example, the second threshold can be 6db.

判断模块202还可以包括:The judging module 202 may also include:

转换单元2026,用于将外部语音信号转换成文字;conversion unit 2026, for converting the external voice signal into text;

第三判断单元2027,用于判断文字是否包括触发关键字;The third judging unit 2027 is used for judging whether the text includes a trigger keyword;

若是,则调用确定单元2023。If so, the determining unit 2023 is called.

其中,触发关键字可以根据实际应用自定义设置,例如,“听不清”、“什么”、“喂喂”、“大点声”等等。Among them, the trigger keyword can be customized according to the actual application, for example, "can't hear clearly", "what", "hello", "speak louder" and so on.

在本实施例中,若判断模块202判定外部声音信号不满足触发条件,则不调用抬升模块203对待播放声音信号进行频率抬升的处理。In this embodiment, if the determination module 202 determines that the external sound signal does not meet the triggering condition, the raising module 203 is not called to perform frequency raising processing on the sound signal to be played.

在本实施例中,参照图3,经频率抬升处理后,待播放声音信号的频率得到了抬升,而音量并未发生改变,也即,将声音信号向频率增大的方向进行了平移。待播放声音信号进行频率抬升的处理后,由于人耳对频率较高的声音更加敏感,从而使得人耳能够更加清晰地收听到所播放声音信号的内容,增强了所播放内容的清晰度。In this embodiment, referring to FIG. 3 , after the frequency raising process, the frequency of the sound signal to be played is raised, but the volume does not change, that is, the sound signal is shifted in the direction of increasing frequency. After the sound signal to be played is processed to increase the frequency, since the human ear is more sensitive to the sound with higher frequency, the human ear can hear the content of the played sound signal more clearly, which enhances the clarity of the played content.

具体地,当本实施例的处理系统适用于语音通话设备中时,尽管频率的增大会导致声音穿透性的减弱,但由于语音通话时语音通话设备的使用者与语音通话设备之间距离较近,从而可以忽略声音传播的距离因素。在语音通话过程中,当判断满足触发条件,也即,判定语音通话设备的使用者可能无法听清对方所讲内容时,通过语音通话设备对待播放声音信号的实时频率抬升处理,使得语音通话设备的使用者可以听到经实时频率抬升处理后的声音信号,使得语音通话设备的使用者可以听清对方所讲内容。Specifically, when the processing system of this embodiment is applied to a voice communication device, although the increase of the frequency will lead to the weakening of the sound penetration, the distance between the user of the voice communication device and the voice communication device is relatively large during the voice communication. so that the distance factor of sound propagation can be ignored. During the voice call, when it is judged that the trigger condition is met, that is, it is judged that the user of the voice call device may not be able to hear what the other party is saying, the voice call device will process the real-time frequency increase of the sound signal to be played, so that the voice call device The user of the voice communication device can hear the sound signal processed by the real-time frequency uplift, so that the user of the voice communication device can hear the content of the other party clearly.

从而,在语音通话设备通过双麦降噪和语音噪声/回声消除算法等手段在语音通话设备的使用者(用户A)输入的语音通话信号被传输至对方(用户B)之前对语音通话信号进行处理,以解决语音通话过程中所产生的回声和噪声,使得语音通话设备的使用者(用户A)的语音通话内容能够清晰传递给对方(用户B)的基础上,还能够在语音通话设备的使用者(用户A)可能无法听清对方(用户B)所讲内容的情况下对对方(用户B)输入的语音通话信号实时进行频率抬升,以使得语音通话设备的使用者(用户A)能够听清对方(用户B)所讲内容,进一步完善并提升了语音通话过程中的语音通话质量。Therefore, before the voice call signal input by the user (user A) of the voice call device is transmitted to the other party (user B) by means of dual microphone noise reduction and voice noise/echo cancellation algorithm, the voice call signal is processed. processing to solve the echo and noise generated during the voice call, so that the voice call content of the user (user A) of the voice call device can be clearly transmitted to the other party (user B), and the voice call device When the user (user A) may not be able to hear the content of the other party (user B) clearly, the frequency of the voice call signal input by the other party (user B) is increased in real time, so that the user (user A) of the voice call device can The content of the other party (user B) can be heard clearly, which further improves and improves the quality of the voice call during the voice call.

实施例3Example 3

本实施例提供一种电子设备,电子设备可以通过计算设备的形式表现(例如可以为服务器设备),包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其中处理器执行计算机程序时可以实现实施例1提供的声音信号的处理方法。This embodiment provides an electronic device, which can be expressed in the form of a computing device (for example, a server device), and includes a memory, a processor, and a computer program stored in the memory and running on the processor, wherein the processor The sound signal processing method provided in Embodiment 1 can be implemented when the computer program is executed.

图5示出了本实施例的硬件结构示意图,如图5所示,电子设备9具体包括:FIG. 5 shows a schematic diagram of the hardware structure of this embodiment. As shown in FIG. 5 , the electronic device 9 specifically includes:

至少一个处理器91、至少一个存储器92以及用于连接不同系统组件(包括处理器91和存储器92)的总线93,其中:At least one processor 91, at least one memory 92, and a bus 93 for connecting different system components (including processor 91 and memory 92), wherein:

总线93包括数据总线、地址总线和控制总线。The bus 93 includes a data bus, an address bus and a control bus.

存储器92包括易失性存储器,例如随机存取存储器(RAM)921和/或高速缓存存储器922,还可以进一步包括只读存储器(ROM)923。Memory 92 includes volatile memory, such as random access memory (RAM) 921 and/or cache memory 922 , and may further include read only memory (ROM) 923 .

存储器92还包括具有一组(至少一个)程序模块924的程序/实用工具925,这样的程序模块924包括但不限于:操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。The memory 92 also includes a program/utility 925 having a set (at least one) of program modules 924 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, examples of which are Each or some combination of these may include an implementation of a network environment.

处理器91通过运行存储在存储器92中的计算机程序,从而执行各种功能应用以及数据处理,例如本发明实施例1所提供的声音信号的处理方法。The processor 91 executes various functional applications and data processing by running the computer program stored in the memory 92, such as the sound signal processing method provided in Embodiment 1 of the present invention.

电子设备9进一步可以与一个或多个外部设备94(例如键盘、指向设备等)通信。这种通信可以通过输入/输出(I/O)接口95进行。并且,电子设备9还可以通过网络适配器96与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。网络适配器96通过总线93与电子设备9的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备9使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理器、外部磁盘驱动阵列、RAID(磁盘阵列)系统、磁带驱动器以及数据备份存储系统等。The electronic device 9 may further communicate with one or more external devices 94 (eg, keyboards, pointing devices, etc.). Such communication may take place through input/output (I/O) interface 95 . Also, the electronic device 9 may communicate with one or more networks (eg, a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet) through a network adapter 96 . The network adapter 96 communicates with other modules of the electronic device 9 via the bus 93 . It should be understood that, although not shown in the figures, other hardware and/or software modules may be used in conjunction with the electronic device 9, including but not limited to: microcode, device drivers, redundant processors, external disk drive arrays, RAID (disk array) systems, tape drives, and data backup storage systems.

应当注意,尽管在上文详细描述中提及了电子设备的若干单元/模块或子单元/模块,但是这种划分仅仅是示例性的并非强制性的。实际上,根据本申请的实施方式,上文描述的两个或更多单元/模块的特征和功能可以在一个单元/模块中具体化。反之,上文描述的一个单元/模块的特征和功能可以进一步划分为由多个单元/模块来具体化。It should be noted that although several units/modules or sub-units/modules of the electronic device are mentioned in the above detailed description, this division is merely exemplary and not mandatory. Indeed, according to embodiments of the present application, the features and functions of two or more units/modules described above may be embodied in one unit/module. Conversely, the features and functions of one unit/module described above may be further subdivided to be embodied by multiple units/modules.

实施例4Example 4

本实施例提供了一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现实施例1所提供的声音信号的处理方法的步骤。This embodiment provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the steps of the sound signal processing method provided in Embodiment 1.

其中,可读存储介质可以采用的更具体可以包括但不限于:便携式盘、硬盘、随机存取存储器、只读存储器、可擦拭可编程只读存储器、光存储器件、磁存储器件或上述的任意合适的组合。Wherein, the readable storage medium may include, but is not limited to, a portable disk, a hard disk, a random access memory, a read-only memory, an erasable programmable read-only memory, an optical storage device, a magnetic storage device, or any of the above suitable combination.

在可能的实施方式中,本发明还可以实现为一种程序产品的形式,其包括程序代码,当所述程序产品在终端设备上运行时,所述程序代码用于使所述终端设备执行实现实施例1所述的声音信号的处理方法的步骤。In a possible implementation manner, the present invention can also be implemented in the form of a program product, which includes program codes, when the program product runs on a terminal device, the program code is used to cause the terminal device to execute the implementation Steps of the sound signal processing method described in Embodiment 1.

其中,可以以一种或多种程序设计语言的任意组合来编写用于执行本发明的程序代码,所述程序代码可以完全地在用户设备上执行、部分地在用户设备上执行、作为一个独立的软件包执行、部分在用户设备上部分在远程设备上执行或完全在远程设备上执行。Wherein, the program code for executing the present invention can be written in any combination of one or more programming languages, and the program code can be completely executed on the user equipment, partially executed on the user equipment, as an independent The software package executes on the user's device, partly on the user's device, partly on the remote device, or entirely on the remote device.

虽然以上描述了本发明的具体实施方式,但是本领域的技术人员应当理解,这仅是举例说明,本发明的保护范围是由所附权利要求书限定的。本领域的技术人员在不背离本发明的原理和实质的前提下,可以对这些实施方式做出多种变更或修改,但这些变更和修改均落入本发明的保护范围。Although the specific embodiments of the present invention are described above, those skilled in the art should understand that this is only an illustration, and the protection scope of the present invention is defined by the appended claims. Those skilled in the art can make various changes or modifications to these embodiments without departing from the principle and essence of the present invention, but these changes and modifications all fall within the protection scope of the present invention.

Claims (10)

1. A method for processing a sound signal, comprising:
monitoring an external sound signal;
judging whether the external sound signal meets a trigger condition;
if so, raising the frequency of the sound signal to be played;
and playing the sound signal to be played after the frequency is raised.
2. The method for processing the sound signal according to claim 1, wherein the external sound signal includes an external noise signal, and the step of determining whether the external sound signal satisfies a trigger condition includes:
identifying an external noise signal in the external sound signal;
judging whether the volume of the external noise is greater than a first threshold value according to the external noise signal;
if yes, determining that the external sound signal meets a trigger condition;
and/or, the external sound signal includes an external voice signal, and the step of judging whether the external sound signal satisfies the trigger condition includes:
recognizing an external voice signal in the external sound signal;
judging whether the difference between the volume of the external voice at the current sampling moment and the volume of the external voice at the previous sampling moment is larger than a second threshold value or not according to the external voice signal;
and if so, determining that the external sound signal meets the trigger condition.
3. The method for processing the sound signal according to claim 1, wherein the external sound signal includes an external voice signal, and the step of determining whether the external sound signal satisfies a trigger condition includes:
recognizing an external voice signal in the external sound signal;
converting the external voice signal into text;
judging whether the characters comprise trigger keywords or not;
and if so, determining that the external sound signal meets the trigger condition.
4. The method for processing the audio signal according to claim 1, wherein the step of monitoring the external audio signal includes monitoring the external audio signal during a voice call;
the sound signal to be played comprises a voice call signal.
5. A system for processing a sound signal, comprising:
the monitoring module is used for monitoring external sound signals;
the judging module is used for judging whether the external sound signal meets a triggering condition;
if so, calling a lifting module, wherein the lifting module is used for lifting the frequency of the sound signal to be played;
and the playing module is used for playing the sound signal to be played after the frequency is raised.
6. The sound signal processing system according to claim 5, wherein the external sound signal includes an external noise signal, and the determining module includes:
a first recognition unit for recognizing an external noise signal in the external sound signal;
a first judgment unit for judging whether the volume of the external noise is larger than a first threshold value according to the external noise signal;
if yes, calling a determining unit, wherein the determining unit is used for determining that the external sound signal meets a trigger condition;
and/or, the external sound signal includes an external voice signal, and the judging module includes:
a second recognition unit for recognizing an external voice signal among the external sound signals;
the second judgment unit is used for judging whether the difference between the volume of the external voice at the current sampling moment and the volume of the external voice at the previous sampling moment is larger than a second threshold value or not according to the external voice signal;
and if so, calling a determining unit, wherein the determining unit is used for determining that the external sound signal meets the triggering condition.
7. The sound signal processing system according to claim 5, wherein the external sound signal includes an external voice signal, and the determining module includes:
a second recognition unit for recognizing an external voice signal among the external sound signals;
a conversion unit for converting the external voice signal into a text;
a third judging unit, configured to judge whether the text includes a trigger keyword;
and if so, calling a determining unit, wherein the determining unit is used for determining that the external sound signal meets the triggering condition.
8. The system for processing the sound signal according to claim 5, wherein the monitoring module is specifically configured to monitor an external sound signal during a voice call;
the sound signal to be played comprises a voice call signal;
the playing module comprises an earphone and/or a loudspeaker.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of processing a sound signal according to any one of claims 1 to 4 when executing the computer program.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of processing a sound signal according to any one of claims 1 to 4.
CN202010744551.4A 2020-07-29 2020-07-29 Sound signal processing method, system, electronic device and storage medium Pending CN111863004A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010744551.4A CN111863004A (en) 2020-07-29 2020-07-29 Sound signal processing method, system, electronic device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010744551.4A CN111863004A (en) 2020-07-29 2020-07-29 Sound signal processing method, system, electronic device and storage medium

Publications (1)

Publication Number Publication Date
CN111863004A true CN111863004A (en) 2020-10-30

Family

ID=72946037

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010744551.4A Pending CN111863004A (en) 2020-07-29 2020-07-29 Sound signal processing method, system, electronic device and storage medium

Country Status (1)

Country Link
CN (1) CN111863004A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114866639A (en) * 2022-04-29 2022-08-05 维沃移动通信有限公司 Electronic equipment, control method and device

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000112710A (en) * 1998-10-09 2000-04-21 Canon Inc Volume control device and method and storage medium
US20080162120A1 (en) * 2007-01-03 2008-07-03 Motorola, Inc. Method and apparatus for providing feedback of vocal quality to a user
WO2011031273A1 (en) * 2009-09-14 2011-03-17 Srs Labs, Inc System for adaptive voice intelligibility processing
CN102436821A (en) * 2011-12-02 2012-05-02 海能达通信股份有限公司 Method and equipment for self-adaptive adjustment of sound effect
CN202889458U (en) * 2012-11-02 2013-04-17 姚西 Automatic call volume regulation mobile phone based on environmental noise
WO2013078677A1 (en) * 2011-12-02 2013-06-06 海能达通信股份有限公司 A method and device for adaptively adjusting sound effect
CN103905646A (en) * 2014-04-09 2014-07-02 深圳市中兴移动通信有限公司 Communication terminal and voice processing method thereof
CN107105367A (en) * 2017-05-24 2017-08-29 维沃移动通信有限公司 A kind of acoustic signal processing method and terminal
CN107146613A (en) * 2017-04-10 2017-09-08 北京猎户星空科技有限公司 A kind of voice interactive method and device
CN107172256A (en) * 2017-07-27 2017-09-15 广东欧珀移动通信有限公司 Earphone call self-adapting regulation method, device, mobile terminal and storage medium
CN107331405A (en) * 2017-06-30 2017-11-07 深圳市金立通信设备有限公司 A kind of voice information processing method and server
CN107395883A (en) * 2017-07-17 2017-11-24 广东欧珀移动通信有限公司 Speech signal adjustment method, communication terminal, and computer-readable storage medium
CN108476256A (en) * 2016-09-27 2018-08-31 华为技术有限公司 A volume adjustment method and terminal
CN111356008A (en) * 2018-12-21 2020-06-30 深圳Tcl数字技术有限公司 Automatic television volume adjusting method, smart television and storage medium

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000112710A (en) * 1998-10-09 2000-04-21 Canon Inc Volume control device and method and storage medium
US20080162120A1 (en) * 2007-01-03 2008-07-03 Motorola, Inc. Method and apparatus for providing feedback of vocal quality to a user
WO2011031273A1 (en) * 2009-09-14 2011-03-17 Srs Labs, Inc System for adaptive voice intelligibility processing
CN102436821A (en) * 2011-12-02 2012-05-02 海能达通信股份有限公司 Method and equipment for self-adaptive adjustment of sound effect
WO2013078677A1 (en) * 2011-12-02 2013-06-06 海能达通信股份有限公司 A method and device for adaptively adjusting sound effect
CN202889458U (en) * 2012-11-02 2013-04-17 姚西 Automatic call volume regulation mobile phone based on environmental noise
CN103905646A (en) * 2014-04-09 2014-07-02 深圳市中兴移动通信有限公司 Communication terminal and voice processing method thereof
CN108476256A (en) * 2016-09-27 2018-08-31 华为技术有限公司 A volume adjustment method and terminal
CN107146613A (en) * 2017-04-10 2017-09-08 北京猎户星空科技有限公司 A kind of voice interactive method and device
CN107105367A (en) * 2017-05-24 2017-08-29 维沃移动通信有限公司 A kind of acoustic signal processing method and terminal
CN107331405A (en) * 2017-06-30 2017-11-07 深圳市金立通信设备有限公司 A kind of voice information processing method and server
CN107395883A (en) * 2017-07-17 2017-11-24 广东欧珀移动通信有限公司 Speech signal adjustment method, communication terminal, and computer-readable storage medium
CN107172256A (en) * 2017-07-27 2017-09-15 广东欧珀移动通信有限公司 Earphone call self-adapting regulation method, device, mobile terminal and storage medium
CN111356008A (en) * 2018-12-21 2020-06-30 深圳Tcl数字技术有限公司 Automatic television volume adjusting method, smart television and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114866639A (en) * 2022-04-29 2022-08-05 维沃移动通信有限公司 Electronic equipment, control method and device

Similar Documents

Publication Publication Date Title
EP2267695B1 (en) Controlling music players using environment audio analysis
CN107995360B (en) Call processing method and related products
WO2024051820A1 (en) Abnormality-based paging method and related apparatus
CN113949955B (en) Noise reduction processing method, device, electronic equipment, earphone and storage medium
CN108681440A (en) A kind of smart machine method for controlling volume and system
CN103796125A (en) Sound adjusting method based on earphone playing
CN112004177A (en) Howling detection method, microphone volume adjustment method and medium
TWI753685B (en) Headphone noise reduction mode control method, system, equipment and storage medium
US20250336407A1 (en) Dynamic noise and speech removal
CN111048118B (en) Voice signal processing method and device and terminal
CN111968660A (en) Echo cancellation device and method, electronic device, and storage medium
US20230298612A1 (en) Microphone Array Configuration Invariant, Streaming, Multichannel Neural Enhancement Frontend for Automatic Speech Recognition
CN105825854B (en) A voice signal processing method, device and mobile terminal
CN112087726A (en) Method and system for identifying polyphonic ringtone, electronic equipment and storage medium
JP2021177598A (en) Speech processing system, speech processing method, and speech processing program
US11122160B1 (en) Detecting and correcting audio echo
CN111863004A (en) Sound signal processing method, system, electronic device and storage medium
WO2022199320A1 (en) Superimposing high-frequency copies of emitted sounds
CN112235462A (en) Voice adjusting method, system, electronic equipment and computer readable storage medium
US20230290356A1 (en) Hearing aid for cognitive help using speaker recognition
CN112700785B (en) Voice signal processing method and device and related equipment
CN106293607B (en) Method and system for automatically switching audio output modes
CN112883178B (en) Interaction method, device, server and storage medium
CN112929777B (en) Audio signal processing method and device and electronic equipment
CN111048096B (en) Voice signal processing method and device and terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20201030