CN115985323B

CN115985323B - Voice wakeup method and device, electronic equipment and readable storage medium

Info

Publication number: CN115985323B
Application number: CN202310273455.XA
Authority: CN
Inventors: 鲁勇; 丁萌; 刘波
Original assignee: Beijing Intengine Technology Co Ltd
Current assignee: Beijing Intengine Technology Co Ltd
Priority date: 2023-03-21
Filing date: 2023-03-21
Publication date: 2023-06-16
Anticipated expiration: 2043-03-21
Also published as: CN115985323A

Abstract

The application discloses a voice awakening method, a device, electronic equipment and a readable storage medium, wherein the voice awakening method comprises the following steps: collecting sample signals within a preset time period; counting signal values corresponding to each frame of sample frames in the sample signals; calculating a background signal value corresponding to the sample signal based on the counted signal value; and when a voice wake-up operation triggered by the target equipment is detected, waking up the target equipment according to the background signal value. The voice awakening scheme does not need to consume a large amount of calculation power of equipment to carry out long-time standby awakening, and avoids the condition that equipment generates heat to reduce the service life of the equipment.

Description

Voice wake-up method, device, electronic equipment and readable storage medium

技术领域technical field

本申请涉及通信领域，具体涉及一种语音唤醒方法、装置、电子设备及可读存储介质。The present application relates to the communication field, in particular to a voice wake-up method, device, electronic equipment and readable storage medium.

背景技术Background technique

随着移动互联网和人工智能时代的来临，语音交互近些年获得了前所未有的增长，其中，语音唤醒技术作为一种特别的语音识别技术，成为用户和机器进行交互重要组成部分。语音唤醒系统的目标，是在无手动操作的情况下，对设备进行唤醒。With the advent of the era of mobile Internet and artificial intelligence, voice interaction has achieved unprecedented growth in recent years. Among them, voice wake-up technology, as a special voice recognition technology, has become an important part of the interaction between users and machines. The goal of the voice wake-up system is to wake up the device without manual operation.

目前的语音唤醒方案中，通常采用惯性滤波器或基于神经网络模型的方案进行语音唤醒，然而，在上述两种方案下，需要耗费设备大量的算力进行长时待机唤醒，会导致设备发热，从而降低了设备的寿命。In the current voice wake-up scheme, the inertial filter or the scheme based on the neural network model is usually used for voice wake-up. However, under the above two schemes, a large amount of computing power is required to wake up the device for a long time, which will cause the device to heat up. Thereby reducing the life of the equipment.

发明内容Contents of the invention

针对上述技术问题，本申请提供一种语音唤醒方法、装置、电子设备及可读存储介质，无需耗费设备大量的算力进行长时待机唤醒，避免出现设备发热而降低设备寿命的情况。In view of the above technical problems, the present application provides a voice wake-up method, device, electronic equipment and readable storage medium, which does not need to consume a large amount of computing power of the equipment for long-term standby wake-up, and avoids the situation that the equipment heats up and reduces the life of the equipment.

为解决上述技术问题，本申请提供一种语音唤醒方法，包括：In order to solve the above technical problems, the present application provides a voice wake-up method, including:

在预设时长内采集样本信号；Collect sample signals within a preset duration;

统计所述样本信号中每帧样本帧对应的信号值；Counting the signal values corresponding to each sample frame in the sample signal;

基于统计的信号值计算所述样本信号对应的背景信号值；calculating a background signal value corresponding to the sample signal based on the statistical signal value;

当检测到针对目标设备触发的语音唤醒操作，根据所述背景信号值唤醒所述目标设备。When a voice wakeup operation triggered for the target device is detected, the target device is woken up according to the background signal value.

可选的，在本申请的一些实施例中，所述基于统计的信号值计算所述样本信号对应的背景信号值，包括：Optionally, in some embodiments of the present application, the calculation of the background signal value corresponding to the sample signal based on the statistical signal value includes:

获取历史时间内的历史信号值；Obtain the historical signal value in the historical time;

根据所述历史信号值以及统计的信号值之间的波动，计算所述样本信号对应的背景信号值。A background signal value corresponding to the sample signal is calculated according to fluctuations between the historical signal value and the statistical signal value.

可选的，在本申请的一些实施例中，所述根据所述历史信号值以及统计的信号值之间的波动，计算所述样本信号对应的背景信号值，包括：Optionally, in some embodiments of the present application, the calculation of the background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value includes:

在统计的信号值中确定初始信号值；Determining the initial signal value among the statistical signal values;

计算所述初始信号值与所述历史信号值之间的差值；calculating a difference between said initial signal value and said historical signal value;

根据所述差值调整所述历史信号值，得到调整后信号值；adjusting the historical signal value according to the difference to obtain an adjusted signal value;

根据除所述初始信号值外其他信号值之间的波动，对调整后信号值进行调整，得到所述样本信号对应的背景信号值。According to the fluctuation among other signal values except the initial signal value, the adjusted signal value is adjusted to obtain the background signal value corresponding to the sample signal.

可选的，在本申请的一些实施例中，所述根据所述差值调整所述历史信号值，得到调整后信号值，包括：Optionally, in some embodiments of the present application, the adjusting the historical signal value according to the difference to obtain the adjusted signal value includes:

当检测到所述差值大于阈值时，计算所述历史信号值与所述预设数值的和，得到调整后信号值；When it is detected that the difference is greater than a threshold, calculating the sum of the historical signal value and the preset value to obtain an adjusted signal value;

当检测到所述差值小于阈值时，计算所述历史信号值与所述预设数值的差，得到调整后信号值。When it is detected that the difference is smaller than the threshold, the difference between the historical signal value and the preset value is calculated to obtain an adjusted signal value.

可选的，在本申请的一些实施例中，所述当检测到针对目标设备触发的语音唤醒操作，根据所述背景信号值唤醒所述目标设备，包括：Optionally, in some embodiments of the present application, when the voice wakeup operation triggered for the target device is detected, waking up the target device according to the background signal value includes:

当检测到针对目标设备触发的语音唤醒操作，获取所述语音唤醒操作对应的操作信号值；When detecting a voice wake-up operation triggered for the target device, acquiring an operation signal value corresponding to the voice wake-up operation;

检测所述操作信号值是否大于所述背景信号值；detecting whether the value of the operation signal is greater than the value of the background signal;

当检测到所述操作信号值大于所述背景信号值时，唤醒所述目标设备。When it is detected that the value of the operation signal is greater than the value of the background signal, wake up the target device.

可选的，在本申请的一些实施例中，还包括：Optionally, in some embodiments of the present application, it also includes:

周期性更新所述背景信号值，得到更新后背景信号值；periodically updating the background signal value to obtain the updated background signal value;

所述当检测到针对目标设备触发的语音唤醒操作，根据所述背景信号值唤醒所述目标设备，包括：当检测到针对目标设备触发的语音唤醒操作，根据所述更新后背景信号值唤醒所述目标设备。The step of waking up the target device according to the background signal value when detecting the voice wakeup operation triggered for the target device includes: waking up the target device according to the updated background signal value when detecting the voice wakeup operation triggered for the target device target device.

相应的，本申请还提供一种语音唤醒装置，包括：Correspondingly, the present application also provides a voice wake-up device, including:

采集模块，用于在预设时长内采集样本信号；an acquisition module, configured to acquire sample signals within a preset duration;

统计模块，用于统计所述样本信号中每帧样本帧对应的信号值；A statistical module, configured to count signal values corresponding to each sample frame in the sample signal;

计算模块，用于基于统计的信号值计算所述样本信号对应的背景信号值；A calculation module, configured to calculate the background signal value corresponding to the sample signal based on the statistical signal value;

唤醒模块，用于当检测到针对目标设备触发的语音唤醒操作，根据所述背景信号值唤醒所述目标设备。A wake-up module, configured to wake up the target device according to the background signal value when a voice wake-up operation triggered for the target device is detected.

可选的，在本申请的一些实施例中，所述计算模块包括：Optionally, in some embodiments of the present application, the calculation module includes:

获取单元，用于获取历史时间内的历史信号值；The acquisition unit is used to acquire the historical signal value within the historical time;

计算单元，用于根据所述历史信号值以及统计的信号值之间的波动，计算所述样本信号对应的背景信号值。The calculation unit is configured to calculate the background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.

本申请还提供一种电子设备，包括存储器和处理器，所述存储器存储有计算机程序，所述处理器执行所述计算机程序时实现如上述方法的步骤。The present application also provides an electronic device, including a memory and a processor, the memory stores a computer program, and the processor implements the steps of the above method when executing the computer program.

本申请还提供一种计算机存储介质，所述计算机存储介质存储有计算机程序，所述计算机程序被处理器执行时实现如上述方法的步骤。The present application also provides a computer storage medium, where the computer storage medium stores a computer program, and when the computer program is executed by a processor, the steps of the above method are realized.

如上所述，本申请提供一种语音唤醒方法、装置、电子设备及可读存储介质，该语音唤醒方法包括：在预设时长内采集样本信号；统计所述样本信号中每帧样本帧对应的信号值；基于统计的信号值计算所述样本信号对应的背景信号值；当检测到针对目标设备触发的语音唤醒操作，根据所述背景信号值唤醒所述目标设备。在本申请提供的语音唤醒方案中，利用样本信号中每帧样本帧对应的信号值，计算样本信号对应的背景信号值，并利用该计算的背景信号值唤醒目标设备，不用通过惯性滤波器或基于神经网络模型的方案进行语音唤醒，无需耗费设备大量的算力进行长时待机唤醒，避免出现设备发热而降低设备寿命的情况。As mentioned above, the present application provides a voice wake-up method, device, electronic equipment, and readable storage medium. The voice wake-up method includes: collecting sample signals within a preset duration; signal value; calculate the background signal value corresponding to the sample signal based on the statistical signal value; wake up the target device according to the background signal value when a voice wake-up operation triggered for the target device is detected. In the voice wake-up solution provided by this application, the signal value corresponding to each sample frame in the sample signal is used to calculate the background signal value corresponding to the sample signal, and the calculated background signal value is used to wake up the target device without passing through an inertial filter or The neural network model-based solution for voice wake-up does not need to consume a large amount of computing power of the device for long-term standby wake-up, and avoids the situation where the device heats up and reduces the life of the device.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分，示出了符合本申请的实施例，并与说明书一起用于解释本申请的原理。为了更清楚地说明本申请实施例的技术方案，下面将对实施例描述中所需要使用的附图作简单地介绍，显而易见地，对于本领域普通技术人员而言，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description serve to explain the principles of the application. In order to more clearly illustrate the technical solutions of the embodiments of the present application, the accompanying drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, for those of ordinary skill in the art, the Under the premise, other drawings can also be obtained based on these drawings.

图1是本申请实施例提供的语音唤醒系统的结构示意图；FIG. 1 is a schematic structural diagram of a voice wake-up system provided by an embodiment of the present application;

图2是本申请实施例提供的语音唤醒方法的流程示意图；Fig. 2 is a schematic flow chart of the voice wake-up method provided by the embodiment of the present application;

图3是本申请实施例提供的语音唤醒装置的结构示意图；FIG. 3 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the present application;

图4是本申请实施例提供的语音唤醒装置的另一结构示意图；Fig. 4 is another structural schematic diagram of the voice wake-up device provided by the embodiment of the present application;

图5是本申请实施例提供的智能终端的结构示意图。FIG. 5 is a schematic structural diagram of a smart terminal provided by an embodiment of the present application.

本申请目的的实现、功能特点及优点将结合实施例，参照附图做进一步说明。通过上述附图，已示出本申请明确的实施例，后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本申请构思的范围，而是通过参考特定实施例为本领域技术人员说明本申请的概念。The realization, functional features and advantages of the present application will be further described in conjunction with the embodiments and with reference to the accompanying drawings. By means of the above drawings, specific embodiments of the present application have been shown, which will be described in more detail hereinafter. These drawings and text descriptions are not intended to limit the scope of the concept of the application in any way, but to illustrate the concept of the application for those skilled in the art by referring to specific embodiments.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明，其示例表示在附图中。下面的描述涉及附图时，除非另有表示，不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反，它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

需要说明的是，在本文中，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素，此外，本申请不同实施例中具有同样命名的部件、特征、要素可能具有相同含义，也可能具有不同含义，其具体含义需以其在该具体实施例中的解释或者进一步结合该具体实施例中上下文进行确定。It should be noted that, in this document, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the statement "comprising a..." does not exclude the presence of other identical elements in the process, method, article, or device that includes the element. In addition, different implementations of the present application Components, features, and elements with the same name in the example may have the same meaning, or may have different meanings, and the specific meaning shall be determined based on the explanation in the specific embodiment or further combined with the context in the specific embodiment.

应当理解，此处所描述的具体实施例仅仅用以解释本申请，并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

在后续的描述中，使用用于表示元件的诸如“模块”、“部件”或者“单元”的后缀仅为了有利于本申请的说明，其本身没有特定的意义。因此，“模块”、“部件”或者“单元”可以混合地使用。In the following description, the use of suffixes such as 'module', 'part' or 'unit' for denoting elements is only for facilitating the description of the present application and has no specific meaning by itself. Therefore, 'module', 'part' or 'unit' may be mixedly used.

以下对本申请涉及的实施例进行具体描述，需要说明的是，在本申请中对实施例的描述顺序不作为对实施例优先顺序的限定。The embodiments involved in the present application will be described in detail below. It should be noted that the description order of the embodiments in the present application is not used as a limitation on the priority order of the embodiments.

本申请实施例提供一种语音唤醒方法、装置、存储介质及电子设备。具体地，本申请实施例的语音唤醒方法可以由电子设备执行，其中，该电子设备可以为终端。该电子设备可以为智能手机、平板电脑、笔记本电脑、触控屏幕、游戏机、个人计算机（PC，PersonalComputer）、个人数字助理(Personal Digital Assistant，PDA)等电子设备，电子设备还可以包括客户端，该客户端可以是语音唤醒客户端或其他客户端。电子设备可以通过有线或无线的方式与服务器连接，服务器可以是独立的物理服务器，也可以是多个物理服务器构成的服务器集群或者分布式系统，还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN、以及大数据和人工智能平台等基础云计算服务的云服务器。Embodiments of the present application provide a voice wake-up method, device, storage medium, and electronic equipment. Specifically, the voice wake-up method in the embodiment of the present application may be performed by an electronic device, where the electronic device may be a terminal. The electronic device can be a smart phone, a tablet computer, a notebook computer, a touch screen, a game console, a personal computer (PC, Personal Computer), a personal digital assistant (Personal Digital Assistant, PDA) and other electronic devices, and the electronic device can also include a client , the client can be a voice wake-up client or other clients. Electronic devices can be connected to the server through wired or wireless methods. The server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers. It can also provide cloud services, cloud databases, cloud computing, Cloud servers for basic cloud computing services such as cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms.

例如，当该语音唤醒方法运行于电子设备时，电子设备在预设时长内采集样本信号后，电子设备统计样本信号中每帧样本帧对应的信号值，接着，电子基于统计的信号值计算样本信号对应的背景信号值，电子设备当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备。For example, when the voice wake-up method is run on an electronic device, after the electronic device collects a sample signal within a preset time period, the electronic device counts the signal value corresponding to each sample frame in the sample signal, and then electronically calculates the sample signal based on the statistical signal value. The background signal value corresponding to the signal. When the electronic device detects the voice wake-up operation triggered for the target device, it wakes up the target device according to the background signal value.

请参阅图1，图1为本申请实施例提供的语音唤醒装置的系统示意图。该系统可以包括至少一个电子设备1000，至少一个服务器或个人电脑2000。用户持有的电子设备1000可以通过网络连接到不同的服务器或个人电脑。电子设备1000可以是具有计算硬件的电子设备，该计算硬件能够支持和执行与多媒体对应的软件产品。另外，电子设备1000还可以具有用于感测和获得用户通过在一个或者多个触控显示屏的多个点执行的触摸或者滑动操作的输入的一个或者多个多触敏屏幕。另外，电子设备1000可以通过网络与服务器或个人电脑2000相互连接。网络可以是无线网络或者有线网络，比如无线网络为无线局域网(WLAN)、局域网(LAN)、蜂窝网络、2G网络、3G网络、4G网络、5G网络等。另外，不同的电子设备1000之间也可以使用自身的蓝牙网络或者热点网络连接到其他嵌入式平台或者连接到服务器以及个人电脑等。服务器可以是独立的物理服务器，也可以是多个物理服务器构成的服务器集群或者分布式系统，还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN、以及大数据和人工智能平台等基础云计算服务的云服务器。Please refer to FIG. 1 . FIG. 1 is a system diagram of a voice wake-up device provided by an embodiment of the present application. The system may include at least one electronic device 1000 , at least one server or personal computer 2000 . The electronic device 1000 held by the user can be connected to various servers or personal computers through a network. The electronic device 1000 may be an electronic device having computing hardware capable of supporting and executing software products corresponding to multimedia. In addition, the electronic device 1000 may also have one or more multi-touch sensitive screens for sensing and obtaining user input through touch or slide operations performed at multiple points of one or more touch display screens. In addition, the electronic device 1000 may be connected to a server or a personal computer 2000 through a network. The network may be a wireless network or a wired network. For example, the wireless network may be a wireless local area network (WLAN), a local area network (LAN), a cellular network, a 2G network, a 3G network, a 4G network, or a 5G network. In addition, different electronic devices 1000 can also use their own Bluetooth network or hotspot network to connect to other embedded platforms or connect to servers and personal computers. The server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or it can provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communication, intermediate Cloud servers for basic cloud computing services such as software services, domain name services, security services, CDN, and big data and artificial intelligence platforms.

本申请实施例提供了一种语音唤醒方法，该方法可以由电子设备执行。其中，该电子设备包括触控显示屏和处理器，该触控显示屏用于呈现图形用户界面以及接收用户作用于图形用户界面产生的操作指令。用户通过触控显示屏对图形用户界面进行操作时，该图形用户界面可以通过响应于接收到的操作指令控制电子设备本地的内容，也可以通过响应于接收到的操作指令控制服务器端的内容。例如，用户作用于图形用户界面产生的操作指令包括用于对初始音频数据进行处理的指令，处理器被配置为在接收到用户提供的指令之后启动对应的应用程序。此外，处理器被配置为在触控显示屏上渲染和绘制与应用程序相关联的图形用户界面。触控显示屏是能够感测屏幕上的多个点同时执行的触摸或者滑动操作的多触敏屏幕。用户在使用手指在图形用户界面上执行触控操作，图形用户界面在检测到触控操作时，控制应用的图形用户界面中显示对应的操作。An embodiment of the present application provides a voice wake-up method, which can be executed by an electronic device. Wherein, the electronic device includes a touch display screen and a processor, and the touch display screen is used for presenting a graphical user interface and receiving operation instructions generated by a user acting on the graphical user interface. When the user operates the graphical user interface through the touch screen, the graphical user interface can control the local content of the electronic device in response to the received operation instruction, and can also control the server-side content in response to the received operation instruction. For example, the operation instructions generated by the user acting on the graphical user interface include instructions for processing initial audio data, and the processor is configured to start a corresponding application program after receiving the instructions provided by the user. Additionally, the processor is configured to render and draw a graphical user interface associated with the application on the touch display. A touch display is a multi-touch sensitive screen capable of sensing simultaneous touch or swipe operations performed at multiple points on the screen. The user uses a finger to perform a touch operation on the GUI, and when the GUI detects the touch operation, it controls the display of the corresponding operation in the GUI of the application.

本申请提供的语音唤醒方案，利用样本信号中每帧样本帧对应的信号值，计算样本信号对应的背景信号值，并利用该计算的背景信号值唤醒目标设备，不用通过惯性滤波器或基于神经网络模型的方案进行语音唤醒，无需耗费设备大量的算力进行长时待机唤醒，避免出现设备发热而降低设备寿命的情况。The voice wake-up solution provided by this application uses the signal value corresponding to each sample frame in the sample signal to calculate the background signal value corresponding to the sample signal, and uses the calculated background signal value to wake up the target device, without using inertial filters or neural-based The network model solution for voice wake-up does not need to consume a large amount of computing power of the device for long-term standby wake-up, and avoids the situation where the device heats up and reduces the life of the device.

以下分别进行详细说明。需说明的是，以下实施例的描述顺序不作为对实施例优先顺序的限定。Each will be described in detail below. It should be noted that the order of description of the following embodiments is not intended to limit the order of priority of the embodiments.

一种语音唤醒方法，包括：在预设时长内采集样本信号；统计样本信号中每帧样本帧对应的信号值；基于统计的信号值计算样本信号对应的背景信号值；当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备。A voice wake-up method, comprising: collecting a sample signal within a preset duration; counting the signal value corresponding to each sample frame in the sample signal; calculating the background signal value corresponding to the sample signal based on the statistical signal value; Triggered voice wakeup operation to wake up the target device based on the background signal value.

请参阅图2，图2为本申请实施例提供的语音唤醒方法的流程示意图。该数语音唤醒方法的具体流程可以如下：Please refer to FIG. 2 . FIG. 2 is a schematic flowchart of a voice wake-up method provided by an embodiment of the present application. The concrete process of this voice wake-up method can be as follows:

101、在预设时长内采集样本信号。101. Collect sample signals within a preset time period.

其中，样本信号为预设时长内采集的到音频信号，该样本信号可以包括人声信号、环境声音信号以及其他类型的声音信号，该声音信号可以内置于电子设备的声音传感器（如麦克风）进行采集，预设时长可以为10分钟、20分钟或100分钟，也可以为50秒、120秒或300秒，具体可以根据实际情况进行设置，在此不再赘述。Wherein, the sample signal is an audio signal collected within a preset time period, and the sample signal may include a human voice signal, an environmental sound signal, and other types of sound signals, and the sound signal may be embedded in a sound sensor (such as a microphone) of an electronic device for processing. For collection, the preset duration can be 10 minutes, 20 minutes or 100 minutes, or 50 seconds, 120 seconds or 300 seconds, which can be set according to the actual situation and will not be repeated here.

102、统计样本信号中每帧样本帧对应的信号值。102. Count the signal values corresponding to each sample frame in the sample signal.

例如，具体的，可以对样本信号进行分帧处理，得到样本信号对应的多个样本帧，需要说明的是，由于样本信号中可能会存在人声信号（即语音信号），语音信号在宏观上是不平稳的，在微观上是平稳的，具有短时平稳性（10---30ms内可以认为语音信号近似不变），为了便于后续进行语音唤醒，可以将长时间的样本信号划分为一些短段来处理，每一个短段则为本申请的样本帧，在这一帧内，语音信号的特性可是视为是稳定的，而分帧的原则在于它必须足够短来保证帧内信号是平稳的，一帧的长度应该小于一个音素的长度，正常语速下一个音素持续时间大约为50ms。此外，要进行傅里叶分析这一帧必须包含足够多的振动周期，考虑到男声在 100赫兹左右，女声在 200 赫兹左右，换算成周期就是 10ms和5 ms，即，每一帧样本帧长度介于10ms~40ms，具体可以根据实际情况选择每一帧样本帧的长度。For example, specifically, the sample signal can be divided into frames to obtain multiple sample frames corresponding to the sample signal. It is unstable, microscopically stable, and has short-term stability (the voice signal can be considered approximately unchanged within 10-30ms). In order to facilitate subsequent voice wake-up, the long-term sample signal can be divided into some Each short segment is a sample frame of the present application. In this frame, the characteristics of the speech signal are considered to be stable, and the principle of framing is that it must be short enough to ensure that the signal in the frame is stable. Steadily, the length of a frame should be less than the length of a phoneme, and the duration of a phoneme at a normal speech rate is about 50ms. In addition, to perform Fourier analysis, this frame must contain enough vibration cycles. Considering that male voices are around 100 Hz and female voices are around 200 Hz, the converted cycles are 10 ms and 5 ms, that is, the sample frame length of each frame Between 10ms~40ms, the length of each sample frame can be selected according to the actual situation.

进一步的，对一帧信号可以施加离散傅里叶变换（也叫短时离散傅里叶变换），来获取信号在这一帧内，有关频率-能量的分布信息，将各帧在频域的图像拼接起来，用横轴是频率，纵轴是幅度，即可得到样本信号的频谱图，而在本申请中，将幅度确定为样本帧对应的信号值。Furthermore, discrete Fourier transform (also called short-time discrete Fourier transform) can be applied to a frame of signal to obtain the distribution information about frequency-energy of the signal in this frame, and the The images are spliced together, and the horizontal axis is the frequency, and the vertical axis is the amplitude, so that the spectrogram of the sample signal can be obtained. In this application, the amplitude is determined as the signal value corresponding to the sample frame.

103、基于统计的信号值计算样本信号对应的背景信号值。103. Calculate a background signal value corresponding to the sample signal based on the statistical signal value.

例如，具体的，可以获取一个参考值，确定统计的信号值之间的波动，并基于该波动与参考值之间的差异，计算样本信号对应的背景信号值，即，可选地，在一些实施例中，步骤“基于统计的信号值计算样本信号对应的背景信号值”，具体可以包括：For example, specifically, a reference value can be obtained, the fluctuation between the statistical signal values can be determined, and based on the difference between the fluctuation and the reference value, the background signal value corresponding to the sample signal can be calculated, that is, optionally, in some In the embodiment, the step "calculate the background signal value corresponding to the sample signal based on the statistical signal value" may specifically include:

（11）获取历史时间内的历史信号值；(11) Obtain the historical signal value within the historical time;

（12）根据历史信号值以及统计的信号值之间的波动，计算样本信号对应的背景信号值。(12) Calculate the background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.

例如，可以在统计的信号值中，确定初始信号值，如将该样本信号的第一帧样本帧的信号值确定为初始信号值，随后，计算该初始信号值与历史信号值之间的差值，并基于该差值以及统计的信号值之间的波动，计算样本信号对应的背景信号值，即，可选的，在一些实施例中，步骤“根据历史信号值以及统计的信号值之间的波动，计算样本信号对应的背景信号值”，具体可以包括：For example, the initial signal value can be determined from the statistical signal values, such as determining the signal value of the first frame of the sample signal as the initial signal value, and then calculating the difference between the initial signal value and the historical signal value value, and based on the fluctuation between the difference and the statistical signal value, calculate the background signal value corresponding to the sample signal, that is, optionally, in some embodiments, the step "according to the historical signal value and the statistical signal value Between the fluctuations, calculate the background signal value corresponding to the sample signal", which can specifically include:

（21）在统计的信号值中确定初始信号值；(21) Determine the initial signal value among the statistical signal values;

（22）计算初始信号值与历史信号值之间的差值；(22) Calculate the difference between the initial signal value and the historical signal value;

（23）根据差值调整历史信号值，得到调整后信号值；(23) Adjust the historical signal value according to the difference to obtain the adjusted signal value;

（24）根据除初始信号值外其他信号值之间的波动，对调整后信号值进行调整，得到样本信号对应的背景信号值。(24) According to the fluctuation among other signal values except the initial signal value, the adjusted signal value is adjusted to obtain the background signal value corresponding to the sample signal.

例如，具体的，将样本信号的第一帧样本帧的信号值确定为初始信号后，获取历史信号值，该历史信号值可以是历史时长内采集的信号值对应的平均值，并且，该历史信号值可以作为环境背景音噪声，进一步的，计算初始信号值与历史信号值之间的差值，并基于该差值调整历史信号值，随后，计算初始信号值的下一帧信号对应的信号值与调整后历史信号值之间的差值，并基于该差值对调整后历史信号值进一步调整，直到样本信号中所有样本帧均处理完毕，最后，计算所有调整后历史信号值的平均值，得到样本信号对应的背景信号值。For example, specifically, after determining the signal value of the first sample frame of the sample signal as the initial signal, the historical signal value is obtained, and the historical signal value may be the average value corresponding to the signal values collected within the historical time period, and the historical signal value The signal value can be used as ambient background noise. Further, the difference between the initial signal value and the historical signal value is calculated, and the historical signal value is adjusted based on the difference. Then, the signal corresponding to the next frame signal of the initial signal value is calculated value and the adjusted historical signal value, and based on the difference, the adjusted historical signal value is further adjusted until all sample frames in the sample signal have been processed, and finally, the average value of all adjusted historical signal values is calculated , to get the background signal value corresponding to the sample signal.

需要说明的是，本申请采用加减1器对历史信号值进行调整，即，大于设定值+1，等于设定值不调整，小于设定值-1的方式对历史信号值进行调整，也即，可选的，在一些实施例中，步骤“根据差值调整历史信号值，得到调整后信号值”，具体可以包括：It should be noted that this application uses an adder and subtracter to adjust the historical signal value, that is, if it is greater than the set value +1, it is equal to the set value without adjustment, and if it is less than the set value -1, the historical signal value is adjusted. That is, optionally, in some embodiments, the step of "adjusting the historical signal value according to the difference to obtain the adjusted signal value" may specifically include:

（31）当检测到差值大于阈值时，计算历史信号值与预设数值的和，得到调整后信号值；(31) When it is detected that the difference is greater than the threshold, calculate the sum of the historical signal value and the preset value to obtain the adjusted signal value;

（32）当检测到差值小于阈值时，计算历史信号值与预设数值的差，得到调整后信号值。(32) When it is detected that the difference is smaller than the threshold, calculate the difference between the historical signal value and the preset value to obtain the adjusted signal value.

104、当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备。104. When a voice wakeup operation triggered for the target device is detected, wake up the target device according to the background signal value.

例如，具体的，当检测到针对目标设备触发的语音唤醒操作，获取语音唤醒操作对应的操作信号值，通过比较操作信号值与背景信号值之间的大小，触发目标设备进行唤醒模式，即，可选的，在一些实施例中，步骤“当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备”，具体可以包括：For example, specifically, when the voice wake-up operation triggered for the target device is detected, the operation signal value corresponding to the voice wake-up operation is obtained, and the target device is triggered to perform a wake-up mode by comparing the value of the operation signal value with the background signal value, that is, Optionally, in some embodiments, the step "waking up the target device according to the background signal value when detecting a voice wakeup operation triggered for the target device" may specifically include:

（41）当检测到针对目标设备触发的语音唤醒操作，获取语音唤醒操作对应的操作信号值；(41) When the voice wake-up operation triggered for the target device is detected, obtain the operation signal value corresponding to the voice wake-up operation;

（42）检测操作信号值是否大于背景信号值；(42) Detect whether the operation signal value is greater than the background signal value;

（43）当检测到操作信号值大于背景信号值时，唤醒目标设备。(43) When it is detected that the operation signal value is greater than the background signal value, wake up the target device.

例如，具体的，在目标设备的芯片上电时，采集声音信号，经过长时间统计计算出平均统计量，作为该应用场景的环境背景音噪声（即历史信号值），随后，目标设备持续采集样本信号，并统计每帧样本帧对应的信号值，接着，目标设备计算信号值与历史信号值的差值，并不断更新背景信号值，当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备，即，可以理解的是，在一些实施例中，本申请的语音唤醒方法具体还可以包括周期性更新背景信号值，得到更新后背景信号值，步骤“当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备”，具体可以包括：当检测到针对目标设备触发的语音唤醒操作，根据更新后背景信号值唤醒目标设备。For example, specifically, when the chip of the target device is powered on, the sound signal is collected, and the average statistics are calculated after a long period of time as the environmental background noise (that is, the historical signal value) of the application scenario. Then, the target device continues to collect Sample the signal, and count the signal value corresponding to each sample frame. Then, the target device calculates the difference between the signal value and the historical signal value, and continuously updates the background signal value. When the voice wake-up operation triggered by the target device is detected, according to the background The signal value wakes up the target device, that is, it can be understood that, in some embodiments, the voice wake-up method of the present application may specifically include periodically updating the background signal value to obtain the updated background signal value, and the step "when it is detected that the target device The voice wake-up operation triggered by the device wakes up the target device according to the background signal value", which may specifically include: when the voice wake-up operation triggered for the target device is detected, wake up the target device according to the updated background signal value.

以上完成本申请的语音唤醒流程。The voice wake-up process of the present application is completed above.

由上可知，本申请提供一种语音唤醒方法，在预设时长内采集样本信号，然后，统计样本信号中每帧样本帧对应的信号值，接着，基于统计的信号值计算样本信号对应的背景信号值，当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备，在本申请提供的语音唤醒方案，利用样本信号中每帧样本帧对应的信号值，计算样本信号对应的背景信号值，并利用该计算的背景信号值唤醒目标设备，不用通过惯性滤波器或基于神经网络模型的方案进行语音唤醒，无需耗费设备大量的算力进行长时待机唤醒，避免出现设备发热而降低设备寿命的情况。It can be seen from the above that the present application provides a voice wake-up method, which collects sample signals within a preset time period, then counts the signal value corresponding to each frame of the sample signal in the sample signal, and then calculates the background corresponding to the sample signal based on the statistical signal value Signal value. When the voice wake-up operation triggered for the target device is detected, the target device is woken up according to the background signal value. The voice wake-up scheme provided in this application uses the signal value corresponding to each sample frame in the sample signal to calculate the corresponding value of the sample signal. background signal value, and use the calculated background signal value to wake up the target device. It does not need to use inertial filters or neural network model-based solutions for voice wake-up, and does not need to consume a lot of computing power for long-term standby wake-up, so as to avoid device heating and failure Conditions that reduce equipment life.

为便于更好的实施本申请的语音唤醒方法，本申请还提供一种基于上述语音唤醒装置。其中名词的含义与上述语音唤醒方法中相同，具体实现细节可以参考方法实施例中的说明。In order to better implement the voice wake-up method of the present application, the present application also provides a voice wake-up device based on the above. The meanings of the nouns are the same as those in the above voice wake-up method, and for specific implementation details, please refer to the description in the method embodiments.

请参阅图3，图3为本申请提供的语音唤醒装置的结构示意图，其中该语音唤醒装置可以包括采集模块201、统计模块202、计算模块203以及唤醒模块204，具体可以如下：Please refer to Fig. 3, Fig. 3 is the structural representation of the voice wake-up device provided by the present application, wherein the voice wake-up device may include a collection module 201, a statistical module 202, a calculation module 203 and a wake-up module 204, specifically as follows:

采集模块201，用于在预设时长内采集样本信号。The collection module 201 is configured to collect sample signals within a preset time period.

统计模块202，用于统计样本信号中每帧样本帧对应的信号值。The statistical module 202 is configured to count the signal value corresponding to each sample frame in the sample signal.

计算模块203，用于基于统计的信号值计算样本信号对应的背景信号值。The calculation module 203 is configured to calculate the background signal value corresponding to the sample signal based on the statistical signal value.

例如，具体的，可以获取一个参考值，确定统计的信号值之间的波动，并基于该波动与参考值之间的差异，计算样本信号对应的背景信号值，即，可选的，在一些实施例中，计算模块203具体可以包括：For example, specifically, a reference value can be obtained, the fluctuation between statistical signal values can be determined, and the background signal value corresponding to the sample signal can be calculated based on the difference between the fluctuation and the reference value, that is, optionally, in some In an embodiment, the computing module 203 may specifically include:

计算单元，用于根据历史信号值以及统计的信号值之间的波动，计算样本信号对应的背景信号值。The calculation unit is used to calculate the background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.

可选的，在一些实施例中，计算单元具体可以包括：Optionally, in some embodiments, the calculation unit may specifically include:

确定子单元，用于在统计的信号值中确定初始信号值；determining a subunit for determining an initial signal value among the statistical signal values;

计算子单元，用于计算初始信号值与历史信号值之间的差值；Calculation subunit for calculating the difference between the initial signal value and the historical signal value;

调整子单元，用于根据差值调整历史信号值，得到调整后信号值；The adjustment subunit is used to adjust the historical signal value according to the difference to obtain the adjusted signal value;

调整子单元，用于根据除初始信号值外其他信号值之间的波动，对调整后信号值进行调整，得到样本信号对应的背景信号值。The adjustment sub-unit is configured to adjust the adjusted signal value according to the fluctuation among other signal values except the initial signal value, so as to obtain the background signal value corresponding to the sample signal.

可选的，在一些实施例中，调整子单元具体可以用于：当检测到差值大于阈值时，计算历史信号值与预设数值的和，得到调整后信号值；当检测到差值小于阈值时，计算历史信号值与预设数值的差，得到调整后信号值。Optionally, in some embodiments, the adjustment subunit can be specifically configured to: when it is detected that the difference is greater than a threshold, calculate the sum of the historical signal value and the preset value to obtain the adjusted signal value; when it is detected that the difference is less than When the threshold is set, the difference between the historical signal value and the preset value is calculated to obtain the adjusted signal value.

唤醒模块204，用于当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备。The wake-up module 204 is configured to wake up the target device according to the background signal value when a voice wake-up operation triggered for the target device is detected.

例如，具体的，当检测到针对目标设备触发的语音唤醒操作，获取语音唤醒操作对应的操作信号值，通过比较操作信号值与背景信号值之间的大小，触发目标设备进行唤醒模式，可选的，在一些实施例中，唤醒模块204具体可以用于：当检测到针对目标设备触发的语音唤醒操作，获取语音唤醒操作对应的操作信号值；检测操作信号值是否大于背景信号值；当检测到操作信号值大于背景信号值时，唤醒目标设备。For example, specifically, when the voice wake-up operation triggered for the target device is detected, the operation signal value corresponding to the voice wake-up operation is obtained, and the target device is triggered to perform the wake-up mode by comparing the value of the operation signal value with the background signal value, optional Yes, in some embodiments, the wake-up module 204 can be specifically configured to: when detecting a voice wake-up operation triggered for the target device, obtain an operation signal value corresponding to the voice wake-up operation; detect whether the operation signal value is greater than the background signal value; when detecting When the operation signal value is greater than the background signal value, wake up the target device.

可选的，在一些实施例中，请参阅图4，本申请的语音唤醒装置具体还可以包括更新模块205，该更新模块205具体可以用于：周期性更新背景信号值，得到更新后背景信号值。Optionally, in some embodiments, please refer to FIG. 4 , the voice wake-up device of the present application may specifically include an update module 205, which may specifically be used to: periodically update the value of the background signal to obtain the updated background signal value.

可选的，在一些实施例中，唤醒模块204具体还可以用于：当检测到针对目标设备触发的语音唤醒操作，根据更新后背景信号值唤醒目标设备。Optionally, in some embodiments, the wake-up module 204 may also be specifically configured to wake up the target device according to the updated background signal value when a voice wake-up operation triggered for the target device is detected.

由上可知，本申请提供一种语音唤醒装置，采集模块201在预设时长内采集样本信号，然后，统计模块202统计样本信号中每帧样本帧对应的信号值，接着，计算模块203基于统计的信号值计算样本信号对应的背景信号值，唤醒模块204，当检测到针对目标设备触发的语音唤醒操作，根据背景信号值唤醒目标设备，在本申请提供的语音唤醒方案，利用样本信号中每帧样本帧对应的信号值，计算样本信号对应的背景信号值，并利用该计算的背景信号值唤醒目标设备，不用通过惯性滤波器或基于神经网络模型的方案进行语音唤醒，无需耗费设备大量的算力进行长时待机唤醒，避免出现设备发热而降低设备寿命的情况。As can be seen from the above, the present application provides a voice wake-up device. The acquisition module 201 collects sample signals within a preset time period. Then, the statistics module 202 counts the signal value corresponding to each frame of the sample signal in the sample signal. Then, the calculation module 203 calculates based on the statistics The signal value calculates the background signal value corresponding to the sample signal. The wake-up module 204, when detecting a voice wake-up operation triggered for the target device, wakes up the target device according to the background signal value. The voice wake-up solution provided in this application utilizes each The signal value corresponding to the frame sample frame, calculate the background signal value corresponding to the sample signal, and use the calculated background signal value to wake up the target device, without using inertial filters or neural network model-based solutions for voice wake-up, without consuming a lot of equipment Computing power wakes up from long-term standby to avoid the situation where the device is heated and the life of the device is reduced.

本领域普通技术人员可以理解，上述实施例的各种方法中的全部或部分步骤可以通过指令来完成，或通过指令控制相关的硬件来完成，该指令可以存储于一计算机可读存储介质中，并由处理器进行加载和执行。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above embodiments can be completed by instructions, or by instructions controlling related hardware, and the instructions can be stored in a computer-readable storage medium, and is loaded and executed by the processor.

本发明实施例还提供一种电子设备500，如图5所示，该电子设备500可以集成上述语音唤醒装置，还可以进一步包括射频（RF，Radio Frequency）电路501、包括有一个或一个以上计算机可读存储介质的存储器502、输入单元503、显示单元504、传感器505、音频电路506、无线保真（WiFi，Wireless Fidelity)模块507、包括有一个或者一个以上处理核心的处理器508、以及电源509等部件。本领域技术人员可以理解，图5中示出的电子设备500结构并不构成对电子设备500的限定，可以包括比图示更多或更少的部件，或者组合某些部件，或者不同的部件布置。其中：The embodiment of the present invention also provides an electronic device 500. As shown in FIG. 5, the electronic device 500 can integrate the above-mentioned voice wake-up device, and can further include a radio frequency (RF, Radio Frequency) circuit 501, including one or more computers A memory 502 of a readable storage medium, an input unit 503, a display unit 504, a sensor 505, an audio circuit 506, a wireless fidelity (WiFi, Wireless Fidelity) module 507, a processor 508 including one or more processing cores, and a power supply 509 and other components. Those skilled in the art can understand that the structure of the electronic device 500 shown in FIG. 5 does not constitute a limitation on the electronic device 500, and may include more or less components than those shown in the illustration, or combine certain components, or different components. layout. in:

RF电路501可用于收发信息或通话过程中，信号的接收和发送，特别地，将基站的下行信息接收后，交由一个或者一个以上处理器508处理；另外，将涉及上行的数据发送给基站。通常，RF电路501包括但不限于天线、至少一个放大器、调谐器、一个或多个振荡器、用户身份模块（SIM，SubscriberIdentity Module）卡、收发信机、耦合器、低噪声放大器（LNA，Low NoiseAmplifier）、双工器等。此外，RF电路501还可以通过无线通信与网络和其他设备通信。无线通信可以使用任一通信标准或协议，包括但不限于全球移动通讯系统（GSM，Global Systemof Mobile communication）、通用分组无线服务（GPRS ，GeneralPacketRadio Service）、码分多址（CDMA，Code DivisionMultiple Access）、宽带码分多址（WCDMA，Wideband CodeDivision Multiple Access）、长期演进（LTE，LongTermEvolution)、电子邮件、短消息服务（SMS，ShortMessaging Service)等。The RF circuit 501 can be used for sending and receiving information or receiving and sending signals during a call. In particular, after receiving the downlink information of the base station, it is handed over to one or more processors 508 for processing; in addition, the data related to the uplink is sent to the base station . Generally, the RF circuit 501 includes but is not limited to an antenna, at least one amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM, Subscriber Identity Module) card, a transceiver, a coupler, a low noise amplifier (LNA, Low NoiseAmplifier), duplexer, etc. In addition, the RF circuit 501 can also communicate with networks and other devices through wireless communication. Wireless communication can use any communication standard or protocol, including but not limited to Global System of Mobile Communication (GSM, Global System of Mobile communication), General Packet Radio Service (GPRS, General Packet Radio Service), Code Division Multiple Access (CDMA, Code Division Multiple Access) , Wideband Code Division Multiple Access (WCDMA, Wideband CodeDivision Multiple Access), Long Term Evolution (LTE, LongTermEvolution), email, Short Message Service (SMS, ShortMessaging Service), etc.

存储器502可用于存储软件程序以及模块，处理器508通过运行存储在存储器502的软件程序以及模块，从而执行各种功能应用以及信息处理。存储器502可主要包括存储程序区和存储数据区，其中，存储程序区可存储操作系统、至少一个功能所需的应用程序（比如声音播放功能、目标数据播放功能等）等；存储数据区可存储根据电子设备500的使用所创建的数据（比如音频数据、电话本等）等。此外，存储器502可以包括高速随机存取存储器，还可以包括非易失性存储器，例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。相应地，存储器502还可以包括存储器控制器，以提供处理器508和输入单元503对存储器502的访问。The memory 502 can be used to store software programs and modules, and the processor 508 executes various functional applications and information processing by running the software programs and modules stored in the memory 502 . The memory 502 can mainly include a program storage area and a data storage area, wherein the program storage area can store an operating system, at least one application program required by a function (such as a sound playback function, a target data playback function, etc.) and the like; the storage data area can store Data created according to the use of the electronic device 500 (such as audio data, phonebook, etc.) and the like. In addition, the memory 502 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage devices. Correspondingly, the memory 502 may further include a memory controller to provide access to the memory 502 by the processor 508 and the input unit 503 .

输入单元503可用于接收输入的数字或字符信息，以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。具体地，在一个具体的实施例中，输入单元503可包括触敏表面以及其他输入设备。触敏表面，也称为触摸显示屏或者触控板，可收集用户在其上或附近的触摸操作（比如用户使用手指、触笔等任何适合的物体或附件在触敏表面上或在触敏表面附近的操作），并根据预先设定的程式驱动相应的连接装置。可选的，触敏表面可包括触摸检测装置和触摸控制器两个部分。其中，触摸检测装置检测用户的触摸方位，并检测触摸操作带来的信号，将信号传送给触摸控制器；触摸控制器从触摸检测装置上接收触摸信息，并将它转换成触点坐标，再送给处理器508，并能接收处理器508发来的命令并加以执行。此外，可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触敏表面。除了触敏表面，输入单元503还可以包括其他输入设备。具体地，其他输入设备可以包括但不限于物理键盘、功能键（比如音量控制按键、开关按键等）、轨迹球、鼠标、操作杆等中的一种或多种。The input unit 503 can be used to receive input numbers or character information, and generate keyboard, mouse, joystick, optical or trackball signal input related to user settings and function control. Specifically, in a specific embodiment, the input unit 503 may include a touch-sensitive surface and other input devices. A touch-sensitive surface, also known as a touch display or trackpad, collects the user's touch on or near it (for example, the user uses a finger, stylus, etc. any suitable object or accessory on the touch-sensitive surface or on the touch-sensitive operation near the surface), and drive the corresponding connection device according to the preset program. Optionally, the touch-sensitive surface may include two parts: a touch detection device and a touch controller. Among them, the touch detection device detects the user's touch orientation, and detects the signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, and sends it to the to the processor 508, and can receive and execute commands sent by the processor 508. In addition, touch-sensitive surfaces can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic wave. In addition to touch-sensitive surfaces, the input unit 503 may also include other input devices. Specifically, other input devices may include, but are not limited to, one or more of physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, joysticks, and the like.

显示单元504可用于显示由用户输入的信息或提供给用户的信息以及电子设备500的各种图形用户接口，这些图形用户接口可以由图形、文本、图标、视频和其任意组合来构成。显示单元504可包括显示面板，可选的，可以采用液晶显示器（LCD，Liquid CrystalDisplay）、有机发光二极管（OLED，Organic Light-EmittingDiode）等形式来配置显示面板。进一步的，触敏表面可覆盖显示面板，当触敏表面检测到在其上或附近的触摸操作后，传送给处理器508以确定触摸事件的类型，随后处理器508根据触摸事件的类型在显示面板上提供相应的视觉输出。虽然在图5中，触敏表面与显示面板是作为两个独立的部件来实现输入和输入功能，但是在某些实施例中，可以将触敏表面与显示面板集成而实现输入和输出功能。The display unit 504 can be used to display information input by or provided to the user and various graphical user interfaces of the electronic device 500. These graphical user interfaces can be composed of graphics, text, icons, videos and any combination thereof. The display unit 504 may include a display panel, and optionally, the display panel may be configured in the form of a liquid crystal display (LCD, Liquid Crystal Display), an organic light-emitting diode (OLED, Organic Light-Emitting Diode), and the like. Further, the touch-sensitive surface can cover the display panel. When the touch-sensitive surface detects a touch operation on or near it, it is sent to the processor 508 to determine the type of the touch event, and then the processor 508 displays on the display according to the type of the touch event. The corresponding visual output is provided on the panel. Although in FIG. 5, the touch-sensitive surface and the display panel are used as two independent components to realize the input and input functions, in some embodiments, the touch-sensitive surface and the display panel can be integrated to realize the input and output functions.

电子设备500还可包括至少一种传感器505，比如光传感器、运动传感器以及其他传感器。具体地，光传感器可包括环境光传感器及接近传感器，其中，环境光传感器可根据环境光线的明暗来调节显示面板的亮度，接近传感器可在电子设备500移动到耳边时，关闭显示面板和/或背光。作为运动传感器的一种，重力加速度传感器可检测各个方向上（一般为三轴）加速度的大小，静止时可检测出重力的大小及方向，可用于识别手机姿态的应用（比如横竖屏切换、相关游戏、磁力计姿态校准）、振动识别相关功能（比如计步器、敲击）等;至于电子设备500还可配置的陀螺仪、气压计、湿度计、温度计、红外线传感器等其他传感器，在此不再赘述。The electronic device 500 may also include at least one sensor 505, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel according to the brightness of the ambient light, and the proximity sensor may turn off the display panel and/or when the electronic device 500 moves to the ear. or backlight. As a kind of motion sensor, the gravitational acceleration sensor can detect the magnitude of acceleration in various directions (generally three axes), and can detect the magnitude and direction of gravity when it is stationary, and can be used for applications that recognize the attitude of mobile phones (such as horizontal and vertical screen switching, related Gaming, magnetometer attitude calibration), vibration recognition related functions (such as pedometer, tap), etc.; as for other sensors such as gyroscope, barometer, hygrometer, thermometer, infrared sensor, etc. that can be configured by electronic device 500, here No longer.

音频电路506、扬声器，传声器可提供用户与电子设备500之间的音频接口。音频电路506可将接收到的音频数据转换后的电信号，传输到扬声器，由扬声器转换为样本信号输出；另一方面，传声器将收集的样本信号转换为电信号，由音频电路506接收后转换为音频数据，再将音频数据输出处理器508处理后，经RF电路501以发送给比如另一电子设备500，或者将音频数据输出至存储器502以便进一步处理。音频电路506还可能包括耳塞插孔，以提供外设耳机与电子设备500的通信。The audio circuit 506 , speaker, and microphone can provide an audio interface between the user and the electronic device 500 . The audio circuit 506 can transmit the electrical signal converted from the received audio data to the speaker, and the speaker converts it into a sample signal for output; on the other hand, the microphone converts the collected sample signal into an electrical signal, which is converted by the audio circuit 506 After being processed by the audio data output processor 508, the audio data is sent to another electronic device 500 through the RF circuit 501, or the audio data is output to the memory 502 for further processing. The audio circuit 506 may also include an earphone jack to provide communication of an external earphone with the electronic device 500 .

WiFi属于短距离无线传输技术，电子设备500通过WiFi模块507可以帮助用户收发电子邮件、浏览网页和访问流式媒体等，它为用户提供了无线的宽带互联网访问。虽然图5示出了WiFi模块507，但是可以理解的是，其并不属于电子设备500的必须构成，完全可以根据需要在不改变发明的本质的范围内而省略。WiFi is a short-distance wireless transmission technology. The electronic device 500 can help users send and receive emails, browse web pages, and access streaming media through the WiFi module 507, which provides users with wireless broadband Internet access. Although FIG. 5 shows a WiFi module 507, it can be understood that it is not a necessary component of the electronic device 500, and can be completely omitted as required without changing the essence of the invention.

处理器508是电子设备500的控制中心，利用各种接口和线路连接整个手机的各个部分，通过运行或执行存储在存储器502内的软件程序和/或模块，以及调用存储在存储器502内的数据，执行电子设备500的各种功能和处理数据，从而对手机进行整体监控。可选的，处理器508可包括一个或多个处理核心；优选的，处理器508可集成应用处理器和调制解调处理器，其中，应用处理器主要处理操作系统、用户界面和应用程序等，调制解调处理器主要处理无线通信。可以理解的是，上述调制解调处理器也可以不集成到处理器508中。The processor 508 is the control center of the electronic device 500, and uses various interfaces and lines to connect various parts of the entire mobile phone, by running or executing software programs and/or modules stored in the memory 502, and calling data stored in the memory 502 , executing various functions and processing data of the electronic device 500, so as to monitor the mobile phone as a whole. Optionally, the processor 508 may include one or more processing cores; preferably, the processor 508 may integrate an application processor and a modem processor, wherein the application processor mainly processes operating systems, user interfaces, and application programs, etc. , the modem processor mainly handles wireless communications. It can be understood that the foregoing modem processor may not be integrated into the processor 508 .

电子设备500还包括给各个部件供电的电源509（比如电池），优选的，电源可以通过电源管理系统与处理器508逻辑相连，从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。电源509还可以包括一个或一个以上的直流或交流电源、再充电系统、电源故障检测电路、电源转换器或者逆变器、电源数据指示器等任意组件。The electronic device 500 also includes a power supply 509 (such as a battery) for supplying power to various components. Preferably, the power supply can be logically connected to the processor 508 through the power management system, so that functions such as charging, discharging, and power consumption management can be realized through the power management system. . The power supply 509 may also include one or more DC or AC power supplies, recharging systems, power failure detection circuits, power converters or inverters, power data indicators and other arbitrary components.

尽管未示出，电子设备500还可以包括摄像头、蓝牙模块等，在此不再赘述。具体在本实施例中，电子设备500中的处理器508会按照如下的指令，将一个或一个以上的应用程序的进程对应的可执行文件加载到存储器502中，并由处理器508来运行存储在存储器502中的应用程序，从而实现各种功能：Although not shown, the electronic device 500 may also include a camera, a Bluetooth module, etc., which will not be repeated here. Specifically, in this embodiment, the processor 508 in the electronic device 500 will load the executable file corresponding to the process of one or more application programs into the memory 502 according to the following instructions, and the processor 508 will run the stored The application program in memory 502, thereby realizing various functions:

获取虚拟内存的预设存储容量；将初始相位设置为预设值；启动BIST电路，并基于设置后的初始相位，在BIST电路的控制进行相位扫描；当检测到BIST电路扫描至终止相位时，计算最佳相位。Obtain the preset storage capacity of the virtual memory; set the initial phase to a preset value; start the BIST circuit, and based on the set initial phase, perform phase scanning under the control of the BIST circuit; when it is detected that the BIST circuit scans to the end phase, Calculate the best phase.

在上述实施例中，对各个实施例的描述都各有侧重，某个实施例中没有详述的部分，可以参见上文语音唤醒方法的详细描述，此处不再赘述。In the above-mentioned embodiments, the descriptions of each embodiment have their own emphases. For the part that is not described in detail in a certain embodiment, refer to the detailed description of the voice wake-up method above, and will not be repeated here.

由上可知，本发明实施例的电子设备500可以利用BIST电路进行相位扫描，并基于扫描结果计算最佳相位，最后，根据该最佳相位进行语音唤醒，可以保证负荷重且快地进行语音唤醒，并且无需增加额外的电路。It can be seen from the above that the electronic device 500 of the embodiment of the present invention can use the BIST circuit to perform phase scanning, and calculate the optimal phase based on the scanning results, and finally perform voice wake-up according to the optimal phase, which can ensure heavy and fast voice wake-up , and no additional circuitry is required.

为此，本申请实施例还提供一种存储介质，其上存储有多条指令，所述指令适于处理器进行加载，以执行上述语音唤醒方法中的步骤。To this end, an embodiment of the present application further provides a storage medium on which a plurality of instructions are stored, and the instructions are suitable for being loaded by a processor to execute the steps in the above voice wake-up method.

以上各个操作的具体实施可参见前面的实施例，在此不再赘述。For the specific implementation of the above operations, reference may be made to the foregoing embodiments, and details are not repeated here.

其中，该存储介质可以包括：只读存储器（ROM，Read OnlyMemory）、随机存取记忆体（RAM，Random AccessMemory）、磁盘或光盘等。Wherein, the storage medium may include: a read only memory (ROM, Read Only Memory), a random access memory (RAM, Random Access Memory), a magnetic disk or an optical disk, and the like.

由于该存储介质中所存储的指令，可以执行本发明实施例所提供的任一种语音唤醒方法中的步骤，因此，可以实现本发明实施例所提供的任一种语音唤醒方法所能实现的有益效果，详见前面的实施例，在此不再赘述。Due to the instructions stored in the storage medium, the steps in any voice wake-up method provided by the embodiments of the present invention can be executed, therefore, the functions that can be realized by any voice wake-up method provided by the embodiments of the present invention can be realized. For the beneficial effects, please refer to the previous embodiments for details, and details will not be repeated here.

以上对本发明实施例所提供的语音唤醒方法、装置、系统及存储介质进行了详细介绍，本文中应用了具体个例对本发明的原理及实施方式进行了阐述，以上实施例的说明只是用于帮助理解本发明的方法及其核心思想；同时，对于本领域的技术人员，依据本发明的思想，在具体实施方式及应用范围上均会有改变之处，综上所述，本说明书内容不应理解为对本发明的限制。The voice wake-up method, device, system and storage medium provided by the embodiments of the present invention have been described above in detail. In this paper, specific examples have been used to illustrate the principles and implementation methods of the present invention. The descriptions of the above embodiments are only used to help Understand the method of the present invention and its core idea; at the same time, for those skilled in the art, according to the idea of the present invention, there will be changes in the specific implementation and scope of application. In summary, the content of this specification should not be construed as a limitation of the invention.

Claims

1. A method of waking up speech, comprising:

collecting sample signals within a preset time period;

counting signal values corresponding to each frame of sample frames in the sample signals;

acquiring a historical signal value in historical time;

determining an initial signal value from the counted signal values;

calculating a first difference between the initial signal value and the historical signal value;

adjusting the historical signal value according to the first difference value to obtain an adjusted signal value;

sequentially calculating second difference values between other signal values except the initial signal value and the adjusted signal value, and further adjusting the adjusted signal value based on the second difference values until all sample frames in the sample signal are processed;

calculating the average value of all the adjusted signal values to obtain a background signal value corresponding to the sample signal;

and when a voice wake-up operation triggered by the target equipment is detected, waking up the target equipment according to the background signal value.

2. The method of claim 1, wherein adjusting the historical signal value based on the first difference value results in an adjusted signal value, comprising:

when the first difference value is detected to be larger than a threshold value, calculating the sum of the historical signal value and a preset value to obtain an adjusted signal value;

and when the first difference value is detected to be smaller than a threshold value, calculating the difference between the historical signal value and a preset value to obtain an adjusted signal value.

3. The method according to claim 1 or 2, wherein when a voice wake-up operation triggered for a target device is detected, waking up the target device according to the background signal value comprises:

when voice awakening operation triggered by target equipment is detected, acquiring an operation signal value corresponding to the voice awakening operation;

detecting whether the operation signal value is larger than the background signal value;

and waking up the target device when the operation signal value is detected to be larger than the background signal value.

4. The method according to claim 1 or 2, further comprising:

periodically updating the background signal value to obtain an updated background signal value;

when the voice wake-up operation triggered by the target equipment is detected, waking up the target equipment according to the background signal value, wherein the voice wake-up operation comprises the following steps: and when voice awakening operation triggered by the target equipment is detected, awakening the target equipment according to the updated background signal value.

5. A voice wakeup apparatus, comprising:

the acquisition module is used for acquiring sample signals within a preset time length;

the statistics module is used for counting signal values corresponding to each frame of sample frame in the sample signal;

the calculation module is used for acquiring historical signal values in the historical time and determining initial signal values in the counted signal values; calculating a first difference between the initial signal value and the historical signal value; adjusting the historical signal value according to the first difference value to obtain an adjusted signal value; sequentially calculating second difference values between other signal values except the initial signal value and the adjusted signal value, and further adjusting the adjusted signal value based on the second difference values until all sample frames in the sample signal are processed; calculating the average value of all the adjusted signal values to obtain a background signal value corresponding to the sample signal;

and the wake-up module is used for waking up the target equipment according to the background signal value when the voice wake-up operation triggered by the target equipment is detected.

6. An electronic device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor implements the steps of the voice wake-up method of any of claims 1 to 4 when the computer program is executed.

7. A readable storage medium, characterized in that it has stored thereon a computer program which, when executed by a processor, implements the steps of the voice wake-up method according to any of claims 1 to 4.