CN111696562A

CN111696562A - Voice wake-up method, device and storage medium

Info

Publication number: CN111696562A
Application number: CN202010353897.1A
Authority: CN
Inventors: 李树为; 孙渊; 屈伸; 蒋幼宇
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2020-04-29
Filing date: 2020-04-29
Publication date: 2020-09-22
Anticipated expiration: 2040-04-29
Also published as: CN111696562B

Abstract

Embodiments of the present application provide a voice wake-up method, a device, and a storage medium, which are used to solve the problem of a low wake-up rate of a master device in a current device group. The method includes: the master device introduces the wake-up identification results of other slave devices in the device group into the wake-up decision of the master device, and determines the master device according to the wake-up identification results of the master device itself and the wake-up identification results of other slave devices in the device group. Whether to wake up, this method improves the accuracy of the master device in the device group being woken up. Further, the master device can also dynamically adjust the preset wake-up threshold of the master device in the device group in a continuous or discrete manner, and determine whether the master device is woken up according to the adjusted wake-up threshold. The dynamically adjusted wake-up threshold is more suitable for the entire device. The actual state of group wake-up, the master device makes wake-up decisions based on the dynamically adjusted wake-up threshold, and the accuracy of wake-up devices is higher.

Description

Voice wake-up method, device and storage medium

技术领域technical field

本申请涉及终端技术领域，尤其涉及一种语音唤醒方法、设备及存储介质。The present application relates to the technical field of terminals, and in particular, to a voice wake-up method, device and storage medium.

背景技术Background technique

随着智能语音交互的兴起，越来越多的设备支持语音交互功能。其中，语音唤醒作为语音交互的开始，在不同设备中应用广泛，例如智能音箱、智能电视等。当用户所处空间存在支持语音唤醒的设备，用户发出唤醒语音后，这些被唤醒的设备会同时响应说话人的请求，同时与用户进行交互。此时，会给用户造成困惑，不知道与哪一个设备进行语音交互。With the rise of intelligent voice interaction, more and more devices support the function of voice interaction. Among them, voice wake-up, as the beginning of voice interaction, is widely used in different devices, such as smart speakers and smart TVs. When there are devices that support voice wake-up in the space where the user is located, after the user sends a wake-up voice, these wake-up devices will respond to the speaker's request and interact with the user at the same time. At this time, it will cause confusion to the user, not knowing which device to perform voice interaction with.

目前针对上述场景，可以将能够同时被用户唤醒的这些设备组成一个设备组，设备组内只有主设备对用户的唤醒进行应答响应，其他从设备配合主设备处理用户指令意图，从而避免多设备同时被唤醒。At present, for the above scenarios, these devices that can be awakened by the user at the same time can be grouped into a device group. In the device group, only the master device responds to the user's wake-up, and other slave devices cooperate with the master device to process the user's command intent, thereby avoiding multiple devices at the same time. be awakened.

然而，设备组内有多个可唤醒设备，只依靠主设备的唤醒结果作为整个设备组的唤醒结果，唤醒的准确性不高，例如，用户与主设备相距较远，或者，传输路径上存在外部干扰等，都会降低主设备的唤醒率。However, there are multiple wake-up devices in the device group, and only the wake-up result of the master device is used as the wake-up result of the entire device group, and the wake-up accuracy is not high, for example, the user is far away from the master device, or there are External interference, etc., will reduce the wake-up rate of the master device.

发明内容SUMMARY OF THE INVENTION

本申请实施例提供一种语音唤醒方法、设备及存储介质，提高设备组中主设备的唤醒率。Embodiments of the present application provide a voice wake-up method, a device, and a storage medium, which improve the wake-up rate of a master device in a device group.

第一方面，本申请实施例提供一种语音唤醒方法，该方法应用于第一电子设备，第一电子设备和至少一个第二电子设备属于同一设备组，该方法包括：获取音频数据的第一唤醒置信度，第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度；接收至少一个第二电子设备发送的唤醒识别结果，唤醒识别结果用于指示允许或禁止至少一个第二电子设备被唤醒；根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒。In a first aspect, an embodiment of the present application provides a voice wake-up method. The method is applied to a first electronic device. The first electronic device and at least one second electronic device belong to the same device group. The method includes: acquiring a first electronic device of audio data. Wake-up confidence, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; receive the wake-up recognition result sent by at least one second electronic device, and the wake-up recognition result It is used to indicate that at least one second electronic device is allowed or prohibited to be woken up; according to the first wake-up confidence level and the wake-up identification result, it is determined to allow or prohibit the first electronic device from being woken up.

其中，第一电子设备为设备组中的主设备，第二电子设备为设备组中的从设备。The first electronic device is a master device in the device group, and the second electronic device is a slave device in the device group.

上述方案中，主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中，根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果，确定主设备是否被唤醒。该方案能够快速对设备组内主设备是否执行唤醒应答进行判断，充分利用了设备组内其他设备的唤醒识别结果，提升了设备组内主设备唤醒的准确性。In the above scheme, the master device determines the master device according to the wake-up recognition results of the master device itself and the wake-up recognition results of other slave devices in the device group by introducing the wake-up recognition results of other slave devices in the device group into the master device's wake-up decision. whether to wake up. This solution can quickly determine whether the master device in the device group performs a wake-up response, makes full use of the wake-up recognition results of other devices in the device group, and improves the accuracy of the master device in the device group wake-up.

可选的，唤醒识别结果包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项；其中，唤醒标识包括允许或禁止第二电子设备被唤醒的标识，第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification of at least one second electronic device and a second wake-up confidence; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence It is used to indicate the similarity of the acoustic features between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.

如果唤醒识别结果中仅包括至少一个第二电子设备的唤醒标识，第一电子设备可根据唤醒标识，统计允许被唤醒的第二电子设备的设备数量，以及该设备数量占设备组的设备总数的比例。If the wake-up identification result only includes the wake-up identifier of at least one second electronic device, the first electronic device may, according to the wake-up identifier, count the number of devices of the second electronic device that are allowed to be woken up, and the number of the devices in the total number of devices in the device group. Proportion.

如果唤醒识别结果中仅包括至少一个第二电子设备的第二唤醒置信度，第一电子设备首先需要根据第二唤醒置信度和各第二电子设备预设的唤醒阈值，确定第二电子设备是否允许被唤醒。然后统计允许被唤醒的第二电子设备的设备数量，以及该设备数量占设备组的设备总数的比例。If the wake-up identification result only includes the second wake-up confidence level of at least one second electronic device, the first electronic device first needs to determine whether the second electronic device is allowed to wake up. Then, count the number of devices of the second electronic device that are allowed to be woken up, and the proportion of the number of devices to the total number of devices in the device group.

如果唤醒标识结果中同时包括至少一个第二电子设备的唤醒标识和第二唤醒置信度，第一电子设备可通过上述任意一种统计方式，确定允许被唤醒的第二电子设备的设备数量，以及该设备数量占设备组的设备总数的比例。If the wake-up identification result includes both the wake-up identification and the second wake-up confidence level of at least one second electronic device, the first electronic device may determine the number of devices of the second electronic device that are allowed to be woken up through any one of the above statistical methods, and The proportion of this number of devices to the total number of devices in the device group.

在一种可能的设计中，根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒，包括：若第一唤醒置信度大于或等于第一阈值，确定允许第一电子设备被唤醒；或者若第一唤醒置信度小于第一阈值，且第一唤醒置信度大于第二阈值，根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒；或者若第一唤醒置信度小于或等于第二阈值，确定禁止第一电子设备被唤醒。In a possible design, according to the first wake-up confidence level and the wake-up identification result, determining to allow or prohibit the first electronic device from being awakened includes: if the first wake-up confidence level is greater than or equal to a first threshold, determining to allow the first electronic device to be awakened The device is woken up; or if the first wake-up confidence is less than the first threshold and the first wake-up confidence is greater than the second threshold, according to the first wake-up confidence and the wake-up identification result, it is determined to allow or prohibit the first electronic device from being woken up; or If the first wake-up confidence level is less than or equal to the second threshold, it is determined that the first electronic device is prohibited from being woken up.

上述方案提供了一种基于预设的主设备唤醒条件，通过比较主设备计算得到的第一唤醒置信度与第一阈值、第二阈值的大小关系，确定允许或禁止主设备被唤醒。若第一唤醒置信度介于第一阈值和第二阈值之间，可结合设备组中其他从设备发送的唤醒识别结果，确定允许或禁止主设备被唤醒。该方案能够快速对设备组内主设备是否执行唤醒应答进行判断，充分利用了设备组内其他设备的唤醒识别结果，提升了设备组内主设备唤醒的准确性。The above solution provides a method for determining whether to allow or prohibit the master device from being woken up by comparing the relationship between the first wake-up confidence calculated by the master device and the first threshold and the second threshold based on a preset master device wake-up condition. If the first wake-up confidence level is between the first threshold and the second threshold, it may be determined to allow or prohibit the master device from being woken up in combination with the wake-up identification results sent by other slave devices in the device group. This solution can quickly determine whether the master device in the device group performs a wake-up response, makes full use of the wake-up recognition results of other devices in the device group, and improves the accuracy of the master device in the device group wake-up.

在一种可能的设计中，根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒，包括：根据唤醒识别结果统计至少一个第二电子设备的唤醒情况；若唤醒情况满足预设的第一电子设备的唤醒条件，确定允许第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the wake-up recognition result includes: counting the wake-up conditions of at least one second electronic device according to the wake-up recognition result; If the preset wake-up condition of the first electronic device is satisfied, it is determined that the first electronic device is allowed to wake up.

可选的，唤醒条件包括如下的任意一项：设备组中除第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值；设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例大于或等于第一比例；设备组中允许被唤醒的第二电子设备的设备数量占设备组设备总数的比例小于第一比例且大于第二比例，且允许被唤醒的第二电子设备的第二唤醒置信度均大于或等于第三阈值；其中，第三阈值为第一电子设备中预设的允许第二电子设备被唤醒的阈值。Optionally, the wake-up condition includes any one of the following: the second wake-up confidence of all second electronic devices except the first electronic device in the device group is greater than or equal to the third threshold; the device group is allowed to be woken up The proportion of the number of devices of the second electronic device to the total number of devices in the device group is greater than or equal to the first proportion; the proportion of the number of devices of the second electronic device allowed to be awakened in the device group to the total number of devices in the device group is less than the first proportion and greater than The second ratio, and the second wake-up confidence of the second electronic device that is allowed to wake up is greater than or equal to the third threshold; wherein, the third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up .

需要说明的是，为了提高判断的准确性，可通过设置第三阈值，排查允许被唤醒的第二电子设备中是否存在第二唤醒置信度低于第三阈值的电子设备，如果存在，则确定禁止第一电子设备被唤醒。上述第三种唤醒条件可有效避免由于部分从设备设置的预设唤醒阈值过低，导致主设备被误唤醒，提升主设备唤醒的准确性。It should be noted that, in order to improve the accuracy of the judgment, a third threshold can be set to check whether there is an electronic device whose second wake-up confidence is lower than the third threshold in the second electronic device that is allowed to be woken up. The first electronic device is prohibited from being woken up. The above-mentioned third wake-up condition can effectively prevent the master device from being woken up by mistake because the preset wake-up thresholds set by some slave devices are too low, thereby improving the accuracy of the master device's wake-up.

上述方案中，第一电子设备通过至少一个第二电子设备发送的唤醒识别结果统计设备组内从设备的唤醒情况，判断从设备的唤醒情况是否满足预设的唤醒条件，如果满足预设的唤醒条件，则确定允许第一电子设备被唤醒，如果不满足预设的唤醒条件，则确定禁止第一电子设备被唤醒。该方案充分考虑了设备组内其他从设备的唤醒识别结果，对设备组内主设备的唤醒条件进行优化，提升设备组内主设备唤醒的准确性。In the above solution, the first electronic device counts the wake-up situation of the slave devices in the device group through the wake-up recognition result sent by at least one second electronic device, and judges whether the wake-up situation of the slave device satisfies the preset wake-up condition. If the preset wake-up condition is not satisfied, it is determined to prohibit the first electronic device from being woken up. The scheme fully considers the wake-up recognition results of other slave devices in the device group, optimizes the wake-up conditions of the master device in the device group, and improves the accuracy of the master device's wake-up in the device group.

可选的，唤醒识别结果还包括至少一个第二电子设备的设备标识，设备标识用于指示第二电子设备的设备类型，确定第二电子设备的权重值。Optionally, the wake-up identification result further includes a device identifier of at least one second electronic device, where the device identifier is used to indicate the device type of the second electronic device and determine the weight value of the second electronic device.

在一种可能的设计中，根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒，包括：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值；统计设备组内所有第二电子设备的第二权重值；基于第一权重值和第二权重值调整第一阈值；根据第一唤醒置信度和调整后的第一阈值，确定允许或者禁止第一电子设备被唤醒；其中，第一阈值为允许第一电子设备被唤醒的阈值。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result includes: according to the wake-up identification result, counting the number of the second electronic devices in the device group that are allowed to be woken up. a weight value; count the second weight values of all the second electronic devices in the device group; adjust the first threshold based on the first weight value and the second weight value; determine the allowable threshold according to the first wake-up confidence and the adjusted first threshold Or prohibit the first electronic device from being woken up; wherein, the first threshold is a threshold value that allows the first electronic device to be woken up.

上述方案引入了设备组内电子设备的权重值，权重值用于指示电子设备唤醒识别结果的置信度高低。权重值可以是与电子设备的设备类型相关的值，还可以是与电子设备的软/硬件性能相关的值。例如，智能电视的权重值为0.3，智能音箱的权重值为0.6，智能灯的权重值为0.1。又例如，音箱Pro的权重值为0.5，常规音箱的权重值为0.3，音箱mini的权重值为0.2。The above solution introduces the weight value of the electronic device in the device group, and the weight value is used to indicate the confidence level of the wake-up recognition result of the electronic device. The weight value may be a value related to the device type of the electronic device, and may also be a value related to the software/hardware performance of the electronic device. For example, a smart TV has a weight value of 0.3, a smart speaker has a weight value of 0.6, and a smart light has a weight value of 0.1. For another example, the weight value of the speaker Pro is 0.5, the weight value of the conventional speaker is 0.3, and the weight value of the speaker mini is 0.2.

上述方案从唤醒阈值的角度出发，结合设备组内从设备的实际唤醒情况，综合分析允许被唤醒的从设备的唤醒识别结果的置信度高低，进而基于连续方式动态地调整设备组内主设备的唤醒阈值，并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果，判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态，提升了设备组内主设备唤醒的准确性。From the perspective of the wake-up threshold, the above scheme comprehensively analyzes the confidence level of the wake-up recognition results of the slave devices that are allowed to be woken up, combined with the actual wake-up situation of the slave devices in the device group, and then dynamically adjusts the master device in the device group based on a continuous method. Wake-up threshold, and determine whether the current master device needs to make a wake-up response according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device. Since the adjusted wake-up threshold of the master device is more in line with the actual wake-up state of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.

可选的，第一权重值是根据设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量，以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的；第二权重值是根据设备组内所有第二电子设备的设备类型对应的设备数量，以及各设备类型对应的权重值确定的。Optionally, the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device that is allowed to be awakened in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be awakened; The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.

在一种可能的设计中，基于第一权重值和第二权重值调整第一阈值，包括：将第一权重值与第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数，根据阈值调整参数调整第一阈值。In a possible design, adjusting the first threshold based on the first weight value and the second weight value includes: using the product of the ratio of the first weight value and the second weight value and the maximum threshold adjustment parameter as the threshold adjustment parameter, according to The threshold adjustment parameter adjusts the first threshold.

在一种可能的设计中，根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒，包括：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量；根据唤醒阈值调整表确定设备数量对应的阈值调整参数，唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系；基于阈值调整参数调整第一阈值；根据第一唤醒置信度和调整后的第一阈值，确定允许或禁止第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result includes: counting the devices of the second electronic device in the device group that are allowed to be woken up according to the wake-up identification result Quantity; determine the threshold adjustment parameter corresponding to the number of devices according to the wake-up threshold adjustment table, the wake-up threshold adjustment table includes the corresponding relationship between the number of devices of the second electronic device that is allowed to be woken up and the threshold adjustment parameter; adjust the first threshold based on the threshold adjustment parameter; The first wake-up confidence and the adjusted first threshold determine whether to allow or prohibit the first electronic device from being woken up.

上述方案同样从唤醒阈值的角度出发，结合设备组内从设备的实际唤醒情况，统计允许被唤醒的从设备的设备数量或占比，进而基于离散方式动态地调整设备组内主设备的唤醒阈值，并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果，判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态，提升了设备组内主设备唤醒的准确性。The above solution also starts from the wake-up threshold, combined with the actual wake-up situation of the slave devices in the device group, counts the number or proportion of the slave devices that are allowed to be woken up, and then dynamically adjusts the wake-up threshold of the master device in the device group based on a discrete method. , and according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device, it is determined whether the current master device needs to make a wake-up response. Since the adjusted wake-up threshold of the master device is more in line with the actual wake-up state of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.

在一种可能的设计中，根据第一唤醒置信度和调整后的第一阈值，确定允许或者禁止第一电子设备被唤醒，包括：若第一唤醒置信度大于或等于调整后的第一阈值，确定允许第一电子设备被唤醒；或者若第一唤醒置信度小于调整后的第一阈值，确定禁止第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold includes: if the first wake-up confidence is greater than or equal to the adjusted first threshold , it is determined that the first electronic device is allowed to be woken up; or if the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is not allowed to be woken up.

第二方面，本申请实施例提供一种语音唤醒方法，该方法应用于第一电子设备，第一电子设备和至少一个第二电子设备属于同一设备组，该方法包括：获取音频数据的第一唤醒置信度，第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度；接收至少一个第二电子设备发送的唤醒识别结果，唤醒识别结果用于指示允许或禁止第二电子设备被唤醒；根据唤醒识别结果调整第一阈值，第一阈值为允许第一电子设备被唤醒的阈值；根据调整后的第一阈值和第一唤醒置信度，确定允许或禁止第一电子设备被唤醒。In a second aspect, an embodiment of the present application provides a voice wake-up method. The method is applied to a first electronic device. The first electronic device and at least one second electronic device belong to the same device group. The method includes: acquiring a first electronic device of audio data. Wake-up confidence, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; receive the wake-up recognition result sent by at least one second electronic device, and the wake-up recognition result It is used to indicate that the second electronic device is allowed or prohibited to be woken up; the first threshold is adjusted according to the wake-up recognition result, and the first threshold is the threshold that allows the first electronic device to be woken up; according to the adjusted first threshold and the first wake-up confidence, It is determined whether the first electronic device is allowed or prohibited from being woken up.

在一种可能的设计中，根据唤醒识别结果调整第一阈值，包括：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值；统计设备组内所有第二电子设备的第二权重值；基于第一权重值和第二权重值调整第一阈值。In a possible design, adjusting the first threshold value according to the wake-up recognition result includes: according to the wake-up recognition result, counting the first weights of the second electronic devices allowed to be woken up in the device group; counting all the second electronic devices in the device group The second weight value of ; adjust the first threshold value based on the first weight value and the second weight value.

在一种可能的设计中，根据唤醒识别结果调整第一阈值，包括：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量；根据唤醒阈值调整表确定设备数量对应的阈值调整参数，唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系；基于阈值调整参数调整第一阈值。In a possible design, adjusting the first threshold according to the wake-up recognition result includes: counting the number of devices of the second electronic device allowed to be woken up in the device group according to the wake-up recognition result; determining the threshold corresponding to the number of devices according to the wake-up threshold adjustment table Adjusting parameters, the wake-up threshold adjustment table includes the correspondence between the number of devices of the second electronic device that are allowed to be woken up and the threshold adjustment parameters; the first threshold is adjusted based on the threshold adjustment parameters.

在一种可能的设计中，根据调整后的第一阈值和第一唤醒置信度，确定允许或者禁止第一电子设备被唤醒，包括：若第一唤醒置信度大于或等于调整后的第一阈值，确定允许第一电子设备被唤醒；或者若第一唤醒置信度小于调整后的第一阈值，确定禁止第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the adjusted first threshold and the first wake-up confidence includes: if the first wake-up confidence is greater than or equal to the adjusted first threshold , it is determined that the first electronic device is allowed to be woken up; or if the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is not allowed to be woken up.

第三方面，本申请实施例提供一种语音唤醒设备，该语音唤醒设备为第一电子设备，第一电子设备和至少一个第二电子设备属于同一设备组，该语音唤醒设备包括：获取模块，用于获取音频数据的第一唤醒置信度，第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度；接收模块，用于接收至少一个第二电子设备发送的唤醒识别结果，唤醒识别结果用于指示允许或禁止至少一个第二电子设备被唤醒；处理模块，用于根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒。In a third aspect, an embodiment of the present application provides a voice wake-up device, the voice wake-up device is a first electronic device, the first electronic device and at least one second electronic device belong to the same device group, the voice wake-up device includes: an acquisition module, For obtaining the first wake-up confidence of the audio data, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; the receiving module is used to receive at least one A wake-up identification result sent by the second electronic device, where the wake-up identification result is used to indicate that at least one second electronic device is allowed to wake up; the processing module is used to determine whether to allow or prohibit the first wake-up confidence level and the wake-up identification result according to the first wake-up confidence Electronic device wakes up.

可选的，处理模块，具体用于：若第一唤醒置信度大于或等于第一阈值，确定允许第一电子设备被唤醒；或者若第一唤醒置信度小于第一阈值，且第一唤醒置信度大于第二阈值，根据第一唤醒置信度和唤醒识别结果，确定允许或者禁止第一电子设备被唤醒；或者若第一唤醒置信度小于或等于第二阈值，确定禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: if the first wake-up confidence is greater than or equal to the first threshold, determine to allow the first electronic device to be awakened; or if the first wake-up confidence is less than the first threshold, and the first wake-up confidence is greater than the second threshold, according to the first wake-up confidence and the wake-up identification result, it is determined to allow or prohibit the first electronic device from being woken up; or if the first wake-up confidence is less than or equal to the second threshold, it is determined to prohibit the first electronic device from being woken up .

可选的，处理模块，具体用于：根据唤醒识别结果统计至少一个第二电子设备的唤醒情况；若唤醒情况满足预设的第一电子设备的唤醒条件，确定允许第一电子设备被唤醒。Optionally, the processing module is specifically configured to: count the wake-up situation of at least one second electronic device according to the wake-up identification result; if the wake-up situation meets the preset wake-up condition of the first electronic device, determine to allow the first electronic device to be woken up.

可选的，处理模块，具体用于：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值；统计设备组内所有第二电子设备的第二权重值；基于第一权重值和第二权重值调整第一阈值；根据第一唤醒置信度和调整后的第一阈值，确定允许或者禁止第一电子设备被唤醒；其中，第一阈值为允许第一电子设备被唤醒的阈值。Optionally, the processing module is specifically configured to: count the first weight values of the second electronic devices allowed to be woken up in the device group according to the wake-up identification result; count the second weight values of all the second electronic devices in the device group; A weight value and a second weight value adjust the first threshold; according to the first wake-up confidence and the adjusted first threshold, determine whether to allow or prohibit the first electronic device from being woken up; wherein, the first threshold is to allow the first electronic device to be woken up Wake-up threshold.

可选的，处理模块，具体用于：将第一权重值与第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数，根据阈值调整参数调整第一阈值。Optionally, the processing module is specifically configured to: use the product of the ratio of the first weight value to the second weight value and the maximum threshold adjustment parameter as the threshold adjustment parameter, and adjust the first threshold according to the threshold adjustment parameter.

可选的，处理模块，具体用于：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量；根据唤醒阈值调整表确定设备数量对应的阈值调整参数，唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系；基于阈值调整参数调整第一阈值；根据第一唤醒置信度和调整后的第一阈值，确定允许或禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: count the number of devices of the second electronic device allowed to be awakened in the device group according to the wake-up identification result; determine the threshold adjustment parameter corresponding to the number of devices according to the wake-up threshold adjustment table, and the wake-up threshold adjustment table includes: The correspondence between the number of devices of the second electronic device that is allowed to be awakened and the threshold adjustment parameter; the first threshold is adjusted based on the threshold adjustment parameter; according to the first wakeup confidence and the adjusted first threshold, it is determined to allow or prohibit the first electronic device be awakened.

可选的，处理模块，具体用于：若第一唤醒置信度大于或等于调整后的第一阈值，确定允许第一电子设备被唤醒；或者若第一唤醒置信度小于调整后的第一阈值，确定禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: if the first wake-up confidence is greater than or equal to the adjusted first threshold, determine to allow the first electronic device to be awakened; or if the first wake-up confidence is less than the adjusted first threshold , it is determined that the first electronic device is prohibited from being woken up.

第四方面，本申请实施例提供一种语音唤醒设备，该语音唤醒设备为第一电子设备，第一电子设备和至少一个第二电子设备属于同一设备组，该语音唤醒设备包括：获取模块，用于获取音频数据的第一唤醒置信度，第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度；接收模块，用于接收至少一个第二电子设备发送的唤醒识别结果，唤醒识别结果用于指示允许或禁止第二电子设备被唤醒；处理模块，用于根据唤醒识别结果调整第一阈值，第一阈值为允许第一电子设备被唤醒的阈值；根据调整后的第一阈值和第一唤醒置信度，确定允许或禁止第一电子设备被唤醒。In a fourth aspect, an embodiment of the present application provides a voice wake-up device, the voice wake-up device is a first electronic device, the first electronic device and at least one second electronic device belong to the same device group, the voice wake-up device includes: an acquisition module, For obtaining the first wake-up confidence of the audio data, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; the receiving module is used to receive at least one The wake-up identification result sent by the second electronic device, where the wake-up identification result is used to indicate that the second electronic device is allowed or prohibited to be woken up; the processing module is configured to adjust the first threshold according to the wake-up identification result, where the first threshold is to allow the first electronic device to be woken up. A wake-up threshold; according to the adjusted first threshold and the first wake-up confidence, determine whether to allow or prohibit the first electronic device from being woken up.

可选的，处理模块，具体用于：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值；统计设备组内所有第二电子设备的第二权重值；基于第一权重值和第二权重值调整第一阈值。Optionally, the processing module is specifically configured to: count the first weight values of the second electronic devices allowed to be woken up in the device group according to the wake-up identification result; count the second weight values of all the second electronic devices in the device group; A weight value and a second weight value adjust the first threshold.

可选的，处理模块，具体用于：根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量；根据唤醒阈值调整表确定设备数量对应的阈值调整参数，唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系；基于阈值调整参数调整第一阈值。Optionally, the processing module is specifically configured to: count the number of devices of the second electronic device allowed to be awakened in the device group according to the wake-up identification result; determine the threshold adjustment parameter corresponding to the number of devices according to the wake-up threshold adjustment table, and the wake-up threshold adjustment table includes: The correspondence between the number of devices of the second electronic device that is allowed to be awakened and the threshold adjustment parameter; the first threshold is adjusted based on the threshold adjustment parameter.

第五方面，本申请实施例提供一种语音唤醒设备，包括：存储器和处理器，存储器用于存储计算机程序，处理器用于从存储器中调用并运行计算机程序，使得处理器运行计算机程序执行如第一方面任一项的方法，或者如第二方面任一项的方法。In a fifth aspect, an embodiment of the present application provides a voice wake-up device, including: a memory and a processor, where the memory is used to store a computer program, and the processor is used to call and run the computer program from the memory, so that the processor runs the computer program to execute as described in Section 1. The method of any one of the aspect, or the method of any one of the second aspect.

第六方面，本申请实施例提供一种存储介质，存储介质包括计算机程序，计算机程序用于实现如第一方面任一项的方法，或者，如第二方面任一项的方法。In a sixth aspect, an embodiment of the present application provides a storage medium, where the storage medium includes a computer program, and the computer program is used to implement the method of any one of the first aspect, or the method of any one of the second aspect.

本申请实施例提供一种语音唤醒方法、设备及存储介质，该方法包括：主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中，根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果，确定主设备是否被唤醒，该方法提高了设备组中主设备被唤醒的准确性。进一步地，主设备还可以采用连续或者离散方式，动态调整设备组内主设备的预设唤醒阈值，并根据调整后的唤醒阈值确定主设备是否被唤醒，动态调整后的唤醒阈值更加契合整个设备组唤醒的实际状态，主设备基于动态调整后的唤醒阈值进行唤醒决策，唤醒设备的准确性更高。Embodiments of the present application provide a voice wake-up method, device, and storage medium. The method includes: the master device introduces the wake-up recognition results of other slave devices in the device group into the wake-up decision of the master device, according to the wake-up recognition result of the master device itself. The result and the wake-up identification results of other slave devices in the device group determine whether the master device is woken up, and the method improves the accuracy of the wake-up of the master device in the device group. Further, the master device can also dynamically adjust the preset wake-up threshold of the master device in the device group in a continuous or discrete manner, and determine whether the master device is woken up according to the adjusted wake-up threshold. The dynamically adjusted wake-up threshold is more suitable for the entire device. The actual state of group wake-up, the master device makes wake-up decisions based on the dynamically adjusted wake-up threshold, and the accuracy of wake-up devices is higher.

附图说明Description of drawings

图1为本申请实施例提供的语音唤醒方法的场景示意图；1 is a schematic diagram of a scenario of a voice wake-up method provided by an embodiment of the present application;

图2为本申请实施例提供的一种电子设备的硬件结构示意图；2 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application;

图3为本申请实施例提供的一种电子设备的软件架构示意图；3 is a schematic diagram of a software architecture of an electronic device according to an embodiment of the present application;

图4a至图4c为本申请实施例提供的语音唤醒方法的场景示意图；4a to 4c are schematic diagrams of scenarios of a voice wake-up method provided by an embodiment of the present application;

图5为本申请实施例提供的一种用户界面交互示意图；FIG. 5 is a schematic diagram of a user interface interaction provided by an embodiment of the present application;

图6为本申请实施例提供的语音唤醒方法的交互示意图；FIG. 6 is an interactive schematic diagram of a voice wake-up method provided by an embodiment of the present application;

图7为本申请实施例提供的一种语音唤醒方法的流程示意图；FIG. 7 is a schematic flowchart of a voice wake-up method provided by an embodiment of the present application;

图8a为本申请实施例提供的一种语音唤醒方法的判断流程图；Fig. 8a is a judgment flowchart of a voice wake-up method provided by an embodiment of the present application;

图8b为本申请实施例提供的一种语音唤醒方法的判断流程图；FIG. 8b is a judgment flowchart of a voice wake-up method provided by an embodiment of the present application;

图9为本申请实施例提供的一种语音唤醒方法的判断流程图；FIG. 9 is a judgment flowchart of a voice wake-up method provided by an embodiment of the present application;

图10为本申请实施例提供的另一种语音唤醒方法的流程示意图；10 is a schematic flowchart of another voice wake-up method provided by an embodiment of the present application;

图11为本申请实施例提供的又一种语音唤醒方法的流程示意图；11 is a schematic flowchart of another voice wake-up method provided by an embodiment of the present application;

图12为本申请实施例提供的一种语音唤醒设备的结构示意图；12 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the application;

图13为本申请实施例提供的一种语音唤醒设备的结构示意图；13 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the application;

图14为本申请实施例提供的一种语音唤醒设备的硬件结构示意图。FIG. 14 is a schematic diagram of a hardware structure of a voice wake-up device according to an embodiment of the present application.

具体实施方式Detailed ways

本申请实施例提供的电子设备为具有语音唤醒功能的电子设备，即用户可以通过语音唤醒电子设备。具体的，用户通过说出唤醒词唤醒电子设备，唤醒词可以是用户根据自身需要预设的，也可以是电子设备在出厂之前预设的，对此本实施例不作任何限制。The electronic device provided by the embodiment of the present application is an electronic device with a voice wake-up function, that is, a user can wake up the electronic device by voice. Specifically, the user wakes up the electronic device by speaking a wake-up word. The wake-up word may be preset by the user according to his own needs, or may be preset by the electronic device before leaving the factory, which is not limited in this embodiment.

电子设备获取音频数据，检测该音频数据是否包含上述唤醒词，如果音频数据包含唤醒词，则电子设备被唤醒，否则电子设备不被唤醒。电子设备被唤醒后，用户可以通过语音与电子设备进行交互。例如，预设的唤醒词为“小艺小艺”，当电子设备检测到音频数据中包含“小艺小艺”时，电子设备被唤醒。示例性的，图1示出了一种语音唤醒的场景示意图，如图1所示，该场景包括电子设备10、电子设备20、电子设备30以及电子设备40。其中，电子设备10、电子设备30以及电子设备40具有相同的预设唤醒词，例如唤醒词1，电子设备20中预设的唤醒词为唤醒词2。当用户说出的唤醒词为唤醒词1，且场景中的各个电子设备均可以接收或采集到用户说出的唤醒词时，电子设备10、电子设备30以及电子设备40能够被唤醒。由于用户说出的唤醒词与电子设备20预设的唤醒词不同，因此电子设备20不被唤醒。The electronic device acquires the audio data, and detects whether the audio data contains the wake-up word. If the audio data contains the wake-up word, the electronic device is awakened, otherwise the electronic device is not awakened. After the electronic device is awakened, the user can interact with the electronic device through voice. For example, the preset wake-up word is "Xiaoyi Xiaoyi". When the electronic device detects that the audio data contains "Xiaoyi Xiaoyi", the electronic device is woken up. Exemplarily, FIG. 1 shows a schematic diagram of a voice wake-up scenario. As shown in FIG. 1 , the scenario includes an electronic device 10 , an electronic device 20 , an electronic device 30 , and an electronic device 40 . The electronic device 10 , the electronic device 30 , and the electronic device 40 have the same preset wake-up word, such as wake-up word 1 , and the preset wake-up word in the electronic device 20 is wake-up word 2 . When the wake-up word spoken by the user is wake-up word 1, and each electronic device in the scene can receive or collect the wake-up word spoken by the user, the electronic device 10, the electronic device 30, and the electronic device 40 can be woken up. Since the wake-up word spoken by the user is different from the wake-up word preset by the electronic device 20, the electronic device 20 is not woken up.

由上述示例可知，当用户所处空间存在多种支持语音唤醒的电子设备时，用户说出唤醒词，有可能被多个电子设备接收或采集到，如果多个电子设备预设的唤醒词都相同，有可能导致多个电子设备被唤醒，被唤醒的多个电子设备会同时响应用户的请求，造成用户不知道与哪个设备进行语音交互的困惑，降低用户体验。It can be seen from the above example that when there are multiple electronic devices that support voice wake-up in the space where the user is located, the wake-up word spoken by the user may be received or collected by multiple electronic devices. Similarly, it may cause multiple electronic devices to be awakened, and the multiple electronic devices that are awakened will respond to the user's request at the same time, causing the user to be confused about which device to perform voice interaction with, and reducing the user experience.

为了避免用户语音交互的困惑，本申请实施例提供的语音唤醒方法，将能够同时被用户唤醒的电子设备组成一个设备组，设备组内只有一个电子设备对用户的唤醒进行应答响应，将进行应答响应的电子设备看作设备组的主设备，除主设备之外的其他电子设备看作设备组的从设备，从设备配合主设备处理用户的指令意图。以图1为例，可以将具有相同预设唤醒词的电子设备10、电子设备30以及电子设备40组成一个设备组，将电子设备10设置为该设备组的主设备，电子设备30和电子设备40作为该设备组的从设备，从设备配合主设备处理用户的指令意图。需要说明的是，可以通过预设规则设置设备组的主设备，例如将设备组中处理性能最强的电子设备设置为主设备，也可以通过用户自定义设置设备组内的任意一个电子设备为主设备，对此本实施例不作任何限制。In order to avoid the confusion of the user's voice interaction, the voice wake-up method provided by the embodiment of the present application forms a device group of electronic devices that can be woken up by the user at the same time, and only one electronic device in the device group responds to the user's wake-up and will respond The responding electronic device is regarded as the master device of the device group, other electronic devices except the master device are regarded as the slave devices of the device group, and the slave devices cooperate with the master device to process the user's instruction intention. Taking FIG. 1 as an example, the electronic device 10, the electronic device 30 and the electronic device 40 with the same preset wake-up word can be formed into a device group, and the electronic device 10 can be set as the master device of the device group, and the electronic device 30 and the electronic device. 40 is a slave device of the device group, and the slave device cooperates with the master device to process the user's instruction intention. It should be noted that the master device of the device group can be set through preset rules, for example, the electronic device with the strongest processing performance in the device group can be set as the master device, or any electronic device in the device group can be set as the master device by user-defined rules. The master device, which is not limited in this embodiment.

基于图1提供的语音唤醒场景，设备组的主设备在接收或采集到音频数据后，对音频数据进行预处理，提取音频数据中的唤醒词，通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度值，如果相似度值大于或者等于预设相似度阈值(也可称为预设唤醒阈值)，则认为检测到唤醒请求，设备组的主设备被唤醒，否则主设备不被唤醒。由此可知，设备组中的主设备是否唤醒是由主设备自身的唤醒识别结果决定的，该执行方案存在如下缺陷：Based on the voice wake-up scenario provided in Figure 1, after receiving or collecting the audio data, the master device of the device group preprocesses the audio data, extracts the wake-up words in the audio data, and uses the voice wake-up model to calculate the difference between the wake-up words in the audio data and the audio data. The similarity value of the acoustic features of the preset wake-up word, if the similarity value is greater than or equal to the preset similarity threshold (also referred to as the preset wake-up threshold), it is considered that a wake-up request is detected, and the master device of the device group is woken up, Otherwise the master device is not woken up. It can be seen from this that whether the master device in the device group wakes up is determined by the wake-up recognition result of the master device itself, and this implementation scheme has the following defects:

第一，设备组包括多个可被唤醒的电子设备，只依靠主设备自身的唤醒识别结果作为整个设备组的唤醒识别结果，该结果的准确性不高。例如，在语音传输路径上存在干扰源，或者，用户距离主设备较远时，仅仅依靠主设备自身的唤醒识别结果，极有可能在主设备需要被唤醒时，由于外界因素的影响，主设备没有检测到唤醒语音，或者检测到唤醒语音但没有达到预设唤醒阈值，造成整个设备组没有响应。First, the device group includes a plurality of electronic devices that can be woken up, and only the wake-up identification result of the master device itself is used as the wake-up identification result of the entire device group, and the accuracy of the result is not high. For example, if there is an interference source on the voice transmission path, or when the user is far away from the main device, only relying on the wake-up recognition result of the main device itself, it is very likely that when the main device needs to be woken up, due to the influence of external factors, the main device will No wake-up voice is detected, or a wake-up voice is detected but the preset wake-up threshold is not reached, causing the entire device group to become unresponsive.

第二，设备组中主设备的预设唤醒阈值是固定值，然而不同的语音唤醒场景该唤醒阈值可能存在差异。因此，固定的预设唤醒阈值可能影响主设备的唤醒识别结果。Second, the preset wake-up threshold of the master device in the device group is a fixed value, but the wake-up threshold may be different in different voice wake-up scenarios. Therefore, the fixed preset wake-up threshold may affect the wake-up recognition result of the master device.

为了提高主设备的唤醒率，可采用其他解决方式辅助主设备进行唤醒识别的判断。例如，通过服务端模型再校验的方式，设备组的主设备通过与服务端交互，从服务端获取唤醒识别结果。然而，该方式的实现成本较高，且依赖于网络环境，可能存在较高的延时问题。又例如，通过扩展设备侧的语音唤醒模型提高唤醒识别的准确率，该方式的缺点是占用更多的设备侧运算和存储资源，导致设备的整体成本较高。In order to improve the wake-up rate of the master device, other solutions may be used to assist the master device in judging wake-up identification. For example, by re-checking the server model, the master device of the device group obtains the wake-up recognition result from the server by interacting with the server. However, the implementation cost of this method is high, and depending on the network environment, there may be a high delay problem. For another example, the accuracy of wake-up recognition is improved by expanding the voice wake-up model on the device side. The disadvantage of this method is that it occupies more computing and storage resources on the device side, resulting in a higher overall cost of the device.

综上可知，设备组的主设备仅依据自身的唤醒识别结果，存在较高的误判率，易造成整个设备组在用户发出唤醒语音后无响应，用户体验不佳。虽然，通过服务端再校验或者扩展设备侧语音唤醒模型可以提高主设备的唤醒率，但实现成本较高，且存在较高的时延。对此，本申请实施例提供一种语音唤醒方法，主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中，充分利用设备组内不同电子设备上的唤醒识别结果，根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果，确定主设备是否被唤醒，由于主设备在唤醒决策时考虑了设备组内其他从设备的唤醒识别结果，提高了设备组中主设备被唤醒的准确性。To sum up, the master device of the device group only relies on its own wake-up recognition result, which has a high misjudgment rate, which is likely to cause the entire device group to not respond after the user sends a wake-up voice, resulting in poor user experience. Although the wake-up rate of the master device can be improved by re-checking on the server side or expanding the device-side voice wake-up model, the implementation cost is high and there is a high delay. In this regard, an embodiment of the present application provides a voice wake-up method. The master device makes full use of the wake-up recognition results on different electronic devices in the device group by introducing the wake-up recognition results of other slave devices in the device group into the wake-up decision of the master device. , according to the wake-up recognition result of the master device itself and the wake-up recognition results of other slave devices in the device group, to determine whether the master device is woken up, because the master device considers the wake-up recognition results of other slave devices in the device group in the wake-up decision. The accuracy with which the master device in the device group is woken up.

在不同语音唤醒场景下，设备组的主设备采用固定的唤醒阈值进行唤醒决策，存在较高的误判率。对此，本申请实施例提供一种语音唤醒方法，主设备同样通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中，根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果，采用连续或者离散方式，动态调整设备组内主设备的预设唤醒阈值，并根据调整后的唤醒阈值确定主设备是否被唤醒。动态调整后的唤醒阈值更加契合整个设备组唤醒的实际状态，主设备基于动态调整后的唤醒阈值进行唤醒决策，提高了设备组中主设备被唤醒的准确性。In different voice wake-up scenarios, the master device of the device group uses a fixed wake-up threshold to make wake-up decisions, which has a high false positive rate. In this regard, an embodiment of the present application provides a voice wake-up method. The master device also introduces the wake-up recognition results of other slave devices in the device group into the wake-up decision of the master device. The wake-up recognition results of other slave devices use continuous or discrete methods to dynamically adjust the preset wake-up threshold of the master device in the device group, and determine whether the master device is woken up according to the adjusted wake-up threshold. The dynamically adjusted wake-up threshold is more in line with the actual wake-up state of the entire device group. The master device makes wake-up decisions based on the dynamically adjusted wake-up threshold, which improves the accuracy of the master device in the device group being woken up.

以下对本申请实施例中设备组内的电子设备进行详细介绍。The electronic devices in the device group in the embodiments of the present application are described in detail below.

本申请实施例中的电子设备可以为便携式电子设备，诸如手机、平板电脑、人工智能(artificial intelligence，AI)智能语音终端、可穿戴设备、增强现实(augmentedreality，AR)/虚拟现实(virtual reality，VR)设备等。便携式电子设备的示例性实施例包括但不限于搭载或者其它操作系统的便携式电子设备。上述便携式电子设备也可以是车载终端、膝上型计算机(Laptop)等。还应当理解的是，本申请实施例的电子设备还可以台式计算机、智能家居设备(例如智能电视、智能音箱)等，对此不作限定。The electronic device in this embodiment of the present application may be a portable electronic device, such as a mobile phone, a tablet computer, an artificial intelligence (artificial intelligence, AI) intelligent voice terminal, a wearable device, an augmented reality (AR)/virtual reality (virtual reality, VR) equipment, etc. Exemplary embodiments of portable electronic devices include, but are not limited to, portable electronic devices powered by or other operating systems. The above-mentioned portable electronic device may also be an in-vehicle terminal, a laptop computer (Laptop), or the like. It should also be understood that the electronic device in the embodiment of the present application may also be a desktop computer, a smart home device (eg, a smart TV, a smart speaker), etc., which is not limited thereto.

示例的，图2示出了本申请实施例的一种电子设备的硬件结构示意图。具体的，如图2所示，电子设备包括处理器110、内部存储器121、外部存储器接口122、摄像头131、显示屏132、传感器模块140、用户标识模块(subscriber identification module，SIM)卡接口151、按键152、音频模块160、扬声器161、受话器162、麦克风163、耳机接口164、通用串行总线(universal serial bus，USB)接口170、充电管理模块180、电源管理模块181、电池182、移动通信模块191和无线通信模块192。在另一些实施例中，电子设备还可以包括马达、指示器、按键等。By way of example, FIG. 2 shows a schematic diagram of a hardware structure of an electronic device according to an embodiment of the present application. Specifically, as shown in FIG. 2 , the electronic device includes a processor 110, an internal memory 121, an external memory interface 122, a camera 131, a display screen 132, a sensor module 140, a subscriber identification module (SIM) card interface 151, Button 152, audio module 160, speaker 161, receiver 162, microphone 163, headphone jack 164, universal serial bus (USB) interface 170, charging management module 180, power management module 181, battery 182, mobile communication module 191 and wireless communication module 192. In other embodiments, the electronic device may further include motors, indicators, buttons, and the like.

应理解，图2所示的硬件结构仅是一个示例。本申请实施例的电子设备可以具有比图中所示电子设备更多的或者更少的部件，可以组合两个或更多的部件，或者可以具有不同的部件配置。图中所示出的各种部件可以在包括一个或多个信号处理和/或专用集成电路在内的硬件、软件、或硬件和软件的组合中实现。It should be understood that the hardware structure shown in FIG. 2 is only an example. The electronic device of the embodiments of the present application may have more or less components than the electronic device shown in the figures, may combine two or more components, or may have different component configurations. The various components shown in the figures may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application specific integrated circuits.

其中，处理器110可以包括一个或多个处理单元。例如：处理器110可以包括应用处理器(application processor，AP)、调制解调器、图形处理器(graphics processingunit，GPU)、图像信号处理器(image signal processor，ISP)、控制器、视频编解码器、数字信号处理器(digital signal processor，DSP)、基带处理器、和/或神经网络处理器(neural-network processing unit，NPU)等。其中，不同的处理单元可以是独立的器件，也可以集成在一个或多个处理器中。The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a video codec, a digital A signal processor (digital signal processor, DSP), a baseband processor, and/or a neural-network processing unit (neural-network processing unit, NPU), etc. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.

在一些实施例中，处理器110中还可以设置缓存器，用于存储指令和/或数据。示例的，处理器110中的缓存器可以为高速缓冲存储器。该缓存器可以用于保存处理器110刚用过的、生成的、或循环使用的指令和/或数据。如果处理器110需要使用该指令或数据，可从该缓存器中直接调用。有助于减少了处理器110获取指令或数据的时间，从而有助于提高系统的效率。In some embodiments, a buffer may also be provided in the processor 110 for storing instructions and/or data. For example, the buffer in the processor 110 may be a cache memory. The cache may be used to hold instructions and/or data that have just been used, generated, or recycled by the processor 110 . If the processor 110 needs to use the instruction or data, it can be called directly from the buffer. This helps to reduce the time for the processor 110 to obtain instructions or data, thereby helping to improve the efficiency of the system.

内部存储器121可以用于存储程序和/或数据。在一些实施例中，内部存储器121包括存储程序区和存储数据区。其中，存储程序区可以用于存储操作系统(如Android、IOS等操作系统)、至少一个功能所需的计算机程序(比如语音唤醒功能、声音播放功能)等。存储数据区可以用于存储电子设备使用过程中所创建、和/或采集的数据(比如音频数据)等。示例的，处理器110可以通过调用内部存储器121中存储的程序和/或数据，使得电子设备执行相应的方法，从而实现一种或多种功能。例如，处理器110调用内部存储器中的某些程序和/或数据，使得电子设备执行本申请实施例中所提供的语音唤醒方法、从而实现语音唤醒功能。其中，内部存储器121可以采用高速随机存取存储器、和/或非易失性存储器等。例如，非易失性存储器可以包括一个或多个磁盘存储器件、闪存器件、和/或通用闪存存储器(universal flash storage，UFS)等中的至少一个。Internal memory 121 may be used to store programs and/or data. In some embodiments, the internal memory 121 includes a stored program area and a stored data area. The storage program area may be used to store an operating system (such as Android, IOS, etc.), a computer program required for at least one function (such as a voice wake-up function, a sound playback function), and the like. The storage data area may be used to store data (such as audio data) created and/or collected during the use of the electronic device. For example, the processor 110 may cause the electronic device to execute a corresponding method by calling programs and/or data stored in the internal memory 121, thereby implementing one or more functions. For example, the processor 110 invokes certain programs and/or data in the internal memory, so that the electronic device executes the voice wake-up method provided in the embodiments of the present application, thereby implementing the voice wake-up function. Wherein, the internal memory 121 may adopt a high-speed random access memory, and/or a non-volatile memory, or the like. For example, the nonvolatile memory may include at least one of one or more magnetic disk storage devices, flash memory devices, and/or universal flash storage (UFS), among others.

外部存储器接口122可以用于连接外部存储卡(例如，Micro SD卡)，实现扩展电子设备的存储能力。外部存储卡通过外部存储器接口122与处理器110通信，实现数据存储功能。例如电子设备可以通过外部存储器接口122将图像、音乐、视频等文件保存在外部存储卡中。The external memory interface 122 can be used to connect an external memory card (eg, a Micro SD card) to expand the storage capacity of the electronic device. The external memory card communicates with the processor 110 through the external memory interface 122 to realize the data storage function. For example, the electronic device can save images, music, videos and other files in the external memory card through the external memory interface 122 .

摄像头131可以用于捕获动、静态图像等。通常情况下，摄像头131包括镜头和图像传感器。其中，物体通过镜头生成的光学图像投射到图像传感器上，然后转换为电信号，在进行后续处理。示例的，图像传感器可以是电荷耦合器件(charge coupled device，CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor，CMOS)光电晶体管。图像传感器把光信号转换成电信号，之后将电信号传递给ISP转换成数字图像信号。需要说明的是，电子设备可以包括1个或N个摄像头131，其中，N为大于1的正整数。The camera 131 may be used to capture moving, still images, and the like. Typically, the camera 131 includes a lens and an image sensor. Among them, the optical image generated by the object through the lens is projected onto the image sensor, and then converted into an electrical signal for subsequent processing. Illustratively, the image sensor may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The image sensor converts the optical signal into an electrical signal, and then passes the electrical signal to the ISP to convert it into a digital image signal. It should be noted that the electronic device may include 1 or N cameras 131 , where N is a positive integer greater than 1.

显示屏132可以包括显示面板，用于显示用户界面。显示面板可以采用液晶显示屏(liquid crystal display，LCD)、有机发光二极管(organic light-emitting diode，OLED)、有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrixorganiclight emitting diode，AMOLED)、柔性发光二极管(flex light-emitting diode，FLED)、Miniled、MicroLed、Micro-oLed、量子点发光二极管(quantum dot lightemittingdiodes，QLED)等。需要说明的是，电子设备可以包括1个或M个显示屏132，M为大于1的正整数。示例的，电子设备可以通过GPU、显示屏132、应用处理器等实现显示功能。Display screen 132 may include a display panel for displaying a user interface. The display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (active-matrix organic light-emitting diode). , AMOLED), flexible light-emitting diode (flex light-emitting diode, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diode (quantum dot light emittingdiodes, QLED) and so on. It should be noted that the electronic device may include one or M display screens 132 , where M is a positive integer greater than one. For example, the electronic device may implement a display function through a GPU, a display screen 132, an application processor, and the like.

传感器模块140可以包括一个或多个传感器。例如，触摸传感器140A、陀螺仪140B、加速度传感器140C、指纹传感器140D、压力传感器140E等。在一些实施例中，传感器模块140还可以包括环境光传感器、距离传感器、接近光传感器、骨传导传感器、温度传感器等。Sensor module 140 may include one or more sensors. For example, a touch sensor 140A, a gyroscope 140B, an acceleration sensor 140C, a fingerprint sensor 140D, a pressure sensor 140E, and the like. In some embodiments, the sensor module 140 may also include an ambient light sensor, a distance sensor, a proximity light sensor, a bone conduction sensor, a temperature sensor, and the like.

其中，触摸传感器140A，也可称为“触控面板”。触摸传感器140A可以设置于显示屏132，由触摸传感器140A与显示屏132组成触摸屏，也称“触控屏”。触摸传感器140A用于检测作用于其上或附近的触摸操作。触摸传感器140A可以将检测到的触摸操作传递给应用处理器，以确定触摸事件类型。电子设备可以通过显示屏132提供与触摸操作相关的视觉输出等。在另一些实施例中，触摸传感器140A也可以设置于电子设备的表面，与显示屏132所处的位置不同。The touch sensor 140A may also be referred to as a "touch panel". The touch sensor 140A may be disposed on the display screen 132 , and the touch sensor 140A and the display screen 132 form a touch screen, also referred to as a “touch screen”. The touch sensor 140A is used to detect a touch operation on or near it. The touch sensor 140A may communicate the detected touch operation to the application processor to determine the type of touch event. The electronic device may provide visual output and the like related to touch operations through the display screen 132 . In other embodiments, the touch sensor 140A may also be disposed on the surface of the electronic device, which is different from the location where the display screen 132 is located.

陀螺仪140B可以用于确定电子设备的运动姿态。在一些实施例中，可以通过陀螺仪140B确定电子设备围绕三个轴(即，x、y和z轴)的角速度。陀螺仪140B可以用于拍摄防抖。示例性的，当按下快门，陀螺仪140B检测电子设备抖动的角度，根据角度计算出镜头模组需要补偿的距离，让镜头通过反向运动抵消电子设备的抖动，从而实现防抖。陀螺仪传感器140B还可以用于导航、体感游戏场景。The gyroscope 140B can be used to determine the motion attitude of the electronic device. In some embodiments, the angular velocity of the electronic device about three axes (ie, the x, y, and z axes) may be determined by the gyroscope 140B. The gyroscope 140B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyroscope 140B detects the shaking angle of the electronic device, calculates the distance to be compensated by the lens module according to the angle, and allows the lens to counteract the shaking of the electronic device through reverse motion, thereby realizing anti-shake. The gyro sensor 140B can also be used for navigation and somatosensory game scenarios.

加速度传感器140C可检测电子设备在各个方向上(一般为三轴)加速度的大小。当电子设备静止时可检测出重力的大小及方向。加速度传感器140C还可以用于识别电子设备的姿态，应用于横竖屏切换、计步器等应用。The acceleration sensor 140C can detect the magnitude of the acceleration of the electronic device in various directions (generally three axes). The magnitude and direction of gravity can be detected when the electronic device is stationary. The acceleration sensor 140C can also be used to identify the posture of the electronic device, and is applied to applications such as switching between horizontal and vertical screens, and pedometers.

指纹传感器140D用于采集指纹。电子设备可以利用采集的指纹特性实现指纹解锁、访问应用锁、指纹拍照、指纹接听来电等。The fingerprint sensor 140D is used to collect fingerprints. Electronic devices can use the collected fingerprint characteristics to unlock fingerprints, access application locks, take photos with fingerprints, and answer incoming calls with fingerprints.

压力传感器140E用于感受压力信号，可以将压力信号转换成电信号。示例的，压力传感器140E可以设置于显示屏132。其中，作用于相同触摸位置，但不同触摸操作强度的触摸操作，可以对应不同的操作指令。The pressure sensor 140E is used to sense pressure signals, and can convert the pressure signals into electrical signals. For example, the pressure sensor 140E may be disposed on the display screen 132 . Among them, touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions.

SIM卡接口151用于连接SIM卡。SIM卡可以通过插入SIM卡接口151，或从SIM卡接口151拔出，实现和电子设备的接触和分离。电子设备可以支持1个或K个SIM卡接口151，K为大于1的正整数。SIM卡接口151可以支持Nano SIM卡、Micro SIM卡、和/或SIM卡等。同一个SIM卡接口151可以同时插入多张卡。所述多张卡的类型可以相同，也可以不同。SIM卡接口151也可以兼容不同类型的SIM卡。SIM卡接口151也可以兼容外部存储卡。电子设备通过SIM卡和网络交互，实现通话以及数据通信等功能。在一些实施例中，电子设备还可以采用eSIM，即：嵌入式SIM卡。eSIM卡可以嵌在电子设备中，不能和电子设备分离。The SIM card interface 151 is used for connecting a SIM card. The SIM card can be inserted into the SIM card interface 151 or pulled out from the SIM card interface 151 to achieve contact and separation with the electronic device. The electronic device may support 1 or K SIM card interfaces 151 , where K is a positive integer greater than 1. The SIM card interface 151 may support Nano SIM cards, Micro SIM cards, and/or SIM cards, and the like. Multiple cards can be inserted into the same SIM card interface 151 at the same time. The types of the plurality of cards may be the same or different. The SIM card interface 151 can also be compatible with different types of SIM cards. The SIM card interface 151 is also compatible with external memory cards. The electronic device interacts with the network through the SIM card to realize functions such as call and data communication. In some embodiments, the electronic device may also employ an eSIM, that is, an embedded SIM card. The eSIM card can be embedded in the electronic device and cannot be separated from the electronic device.

按键152可以包括开机键、音量键等。按键152可以是机械按键，也可以是触摸式按键。电子设备可以接收按键输入，产生与电子设备的用户设置以及功能控制有关的键信号输入。The keys 152 may include a power key, a volume key, and the like. The keys 152 may be mechanical keys or touch keys. The electronic device may receive key input and generate key signal input related to user settings and function control of the electronic device.

电子设备可以通过音频模块160、扬声器161、受话器162、麦克风163、耳机接口164以及应用处理器等实现音频功能。例如，音频播放功能、录音功能、语音唤醒功能等。The electronic device may implement audio functions through an audio module 160, a speaker 161, a receiver 162, a microphone 163, an earphone interface 164, an application processor, and the like. For example, audio playback function, recording function, voice wake-up function, etc.

音频模块160可以用于对音频数据进行数模转换、和/或模数转换，还可以用于对音频数据进行编码和/或解码。示例的，音频模块160可以独立于处理器设置，也可以设置于处理器110中，或将音频模块160的部分功能模块设置于处理器110中。The audio module 160 may be used to perform digital-to-analog conversion and/or analog-to-digital conversion on audio data, and may also be used to encode and/or decode audio data. For example, the audio module 160 may be provided independently of the processor, or may be provided in the processor 110 , or some functional modules of the audio module 160 may be provided in the processor 110 .

扬声器161，也称“喇叭”，用于将音频数据转换为声音，并播放声音。例如，电子设备100可以通过扬声器161收听音乐、接听免提电话、或者发出语音提示等。The speaker 161, also called "speaker", is used to convert audio data into sound and play the sound. For example, the electronic device 100 may listen to music, answer a speakerphone, or issue a voice prompt through the speaker 161 .

受话器162，也称“听筒”，用于将音频数据转换成声音，并播放声音。例如，当电子设备100接听电话时，可以通过将受话器162靠近人耳进行接听。The receiver 162, also referred to as an "earpiece", is used to convert audio data into sound and to play back the sound. For example, when the electronic device 100 answers a call, it can be answered by placing the receiver 162 close to the human ear.

麦克风163，也称“话筒”、“传声器”，用于采集声音(例如周围环境声音，包括人发出的声音、设备发出的声音等)，并将声音转换为音频电数据。当拨打电话或发送语音时，用户可以通过人嘴靠近麦克风163发出声音，麦克风163采集用户发出的声音。当电子设备的语音唤醒功能已开启的情况下，麦克风163可以实时采集周围环境声音，获取音频数据。其中，麦克风163采集声音的情况与所处的环境相关。例如，当周围环境较为嘈杂时，用户说出唤醒词，则麦克风163采集的声音包括周围环境噪声和用户发出唤醒词的声音。再例如，当周围环境较为安静时，用户说出唤醒词，则麦克风163采集的声音为用户发出唤醒词的声音。又例如，当周围环境较为嘈杂时，电子设备的语音唤醒功能已开启，但是用户并未说出唤醒词对电子设备进行唤醒，则麦克风163采集的声音仅为周围环境噪声。The microphone 163, also called "microphone" or "microphone", is used to collect sound (eg, ambient sound, including sounds made by people, sounds made by equipment, etc.), and convert the sound into audio electrical data. When making a call or sending a voice, the user can make a sound through the human mouth close to the microphone 163, and the microphone 163 collects the sound made by the user. When the voice wake-up function of the electronic device is enabled, the microphone 163 can collect ambient sound in real time and obtain audio data. The situation in which the microphone 163 collects sound is related to the environment in which it is located. For example, when the surrounding environment is relatively noisy and the user speaks the wake-up word, the sound collected by the microphone 163 includes the surrounding environment noise and the sound of the user uttering the wake-up word. For another example, when the surrounding environment is relatively quiet and the user speaks the wake-up word, the sound collected by the microphone 163 is the sound of the user issuing the wake-up word. For another example, when the surrounding environment is relatively noisy, the voice wake-up function of the electronic device is enabled, but the user does not speak a wake-up word to wake up the electronic device, and the sound collected by the microphone 163 is only ambient noise.

需要说明的是，电子设备可以设置至少一个麦克风163。例如，电子设备中设置两个麦克风163，除了采集声音，还可以实现降噪功能。又例如，电子设备中还可以设置三个、四个或更多个麦克风163，从而可以在实现声音采集、降噪的基础上，还可以实现声音来源的识别、或定向录音功能等。It should be noted that the electronic device may be provided with at least one microphone 163 . For example, two microphones 163 are provided in the electronic device, which can implement a noise reduction function in addition to collecting sound. For another example, three, four or more microphones 163 may also be set in the electronic device, so that on the basis of sound collection and noise reduction, sound source identification or directional recording functions can also be implemented.

耳机接口164用于连接有线耳机。耳机接口164可以是USB接口170，也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform，OMTP)标准接口、美国蜂窝电信工业协会(cellular telecommunications industry association of the USA，CTIA)标准接口等。The earphone jack 164 is used to connect wired earphones. The earphone interface 164 may be a USB interface 170, or a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface, etc. .

USB接口170是符合USB标准规范的接口，具体可以是Mini USB接口、Micro USB接口、USB Type C接口等。USB接口170可以用于连接充电器为电子设备充电，也可以用于电子设备与外围设备之间传输数据。也可以用于连接耳机，通过耳机播放音频。示例的，USB接口170除了可以为耳机接口164以外，还可以用于连接其他电子设备，例如AR设备、计算机等。The USB interface 170 is an interface conforming to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 170 can be used to connect a charger to charge the electronic device, and can also be used to transmit data between the electronic device and peripheral devices. It can also be used to connect headphones to play audio through the headphones. For example, the USB interface 170 can be used to connect other electronic devices, such as AR devices, computers, and the like, in addition to the headphone interface 164 .

充电管理模块180用于从充电器接收充电输入。其中，充电器可以是无线充电器，也可以是有线充电器。在一些有线充电的实施例中，充电管理模块180可以通过USB接口170接收有线充电器的充电输入。在一些无线充电的实施例中，充电管理模块180可以通过电子设备的无线充电线圈接收无线充电输入。充电管理模块180为电池182充电的同时，还可以通过电源管理模块180为电子设备供电。The charging management module 180 is used to receive charging input from the charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 180 may receive charging input from the wired charger through the USB interface 170 . In some wireless charging embodiments, the charging management module 180 may receive wireless charging input through a wireless charging coil of the electronic device. While the charging management module 180 charges the battery 182 , the power management module 180 can also supply power to the electronic device.

电源管理模块181用于连接电池182、充电管理模块180与处理器110。电源管理模块181接收电池182和/或充电管理模块180的输入，为处理器110、内部存储器121、显示屏132、摄像头131等供电。电源管理模块181还可以用于监测电池容量、电池循环次数、电池健康状态(漏电、阻抗)等参数。在其他一些实施例中，电源管理模块181也可以设置于处理器110中。在另一些实施例中，电源管理模块181和充电管理模块180也可以设置于同一个器件中。The power management module 181 is used for connecting the battery 182 , the charging management module 180 and the processor 110 . The power management module 181 receives input from the battery 182 and/or the charge management module 180, and supplies power to the processor 110, the internal memory 121, the display screen 132, the camera 131, and the like. The power management module 181 can also be used to monitor parameters such as battery capacity, battery cycle times, battery health status (leakage, impedance). In some other embodiments, the power management module 181 may also be provided in the processor 110 . In other embodiments, the power management module 181 and the charging management module 180 may also be provided in the same device.

移动通信模块191可以提供应用在电子设备上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块191可以包括滤波器、开关、功率放大器、低噪声放大器(lownoiseamplifier，LNA)等。The mobile communication module 191 can provide a wireless communication solution including 2G/3G/4G/5G etc. applied on the electronic device. The mobile communication module 191 may include a filter, a switch, a power amplifier, a low noise amplifier (LNA), and the like.

无线通信模块192可以提供应用在电子设备上的包括WLAN(如Wi-Fi网络)、蓝牙(Bluetooth，BT)、全球导航卫星系统(global navigation satellite system，GNSS)、调频(frequency modulation，FM)、近距离无线通信技术(near field communication，NFC)、红外技术(infrared，IR)等无线通信的解决方案。无线通信模块192可以是集成至少一个通信处理模块的一个或多个器件。The wireless communication module 192 can provide applications on electronic devices including WLAN (such as Wi-Fi network), Bluetooth (Bluetooth, BT), global navigation satellite system (GNSS), frequency modulation (frequency modulation, FM), Solutions for wireless communication such as near field communication (NFC), infrared technology (infrared, IR). The wireless communication module 192 may be one or more devices integrating at least one communication processing module.

在一些实施例中，电子设备的天线1和移动通信模块191耦合，天线2和无线通信模块192耦合，使得电子设备可以与其他设备通信。具体的，移动通信模块191可以通过天线1与其它设备通信，无线通信模块193可以通过天线2与其它设备通信。In some embodiments, the antenna 1 of the electronic device is coupled with the mobile communication module 191, and the antenna 2 is coupled with the wireless communication module 192, so that the electronic device can communicate with other devices. Specifically, the mobile communication module 191 can communicate with other devices through the antenna 1 , and the wireless communication module 193 can communicate with other devices through the antenna 2 .

图3示出了本申请实施例的一种电子设备的软件架构示意图。如图3所示，电子设备包括音频收集模块(audio collector)401、音频处理模块(aduio processor)402、音频识别模块(audio recognizer)403、交互模块(interactor)404。FIG. 3 shows a schematic diagram of a software architecture of an electronic device according to an embodiment of the present application. As shown in FIG. 3 , the electronic device includes an audio collector 401 , an audio processor 402 , an audio recognizer 403 , and an interactor 404 .

其中，音频收集模块401用于存储根据声音采集设备(例如图2中所示的麦克风163、或者其它用于采集声音的传感器)采集的声音转化为的音频数据，并将音频数据转发给音频处理模块402。例如，音频收集模块401可以用于将从音频模块160获取的音频数据存储到存储器(例如内部存储器121、或者处理器110中的存储器等)中，将存储器中存储的音频数据转发给音频处理模块402进行处理。需要说明的是，本申请实施例中音频收集模块401可以当接收到音频模块160中得到音频数据的通知后，主动从音频模块160中获取音频数据，也可以是音频模块160采集到音频数据后，将音频数据发送给音频收集模块401，本申请实施例对音频收集模块401从音频模块160获取音频数据的方式不作限定。The audio collection module 401 is used to store audio data converted from sounds collected by a sound collection device (such as the microphone 163 shown in FIG. 2 or other sensors for collecting sounds), and forward the audio data to audio processing Module 402. For example, the audio collection module 401 can be used to store the audio data obtained from the audio module 160 in a memory (eg, the internal memory 121, or the memory in the processor 110, etc.), and forward the audio data stored in the memory to the audio processing module 402 for processing. It should be noted that, in the embodiment of the present application, the audio collection module 401 may actively acquire the audio data from the audio module 160 after receiving the notification that the audio data is obtained from the audio module 160, or may be obtained after the audio module 160 collects the audio data. , the audio data is sent to the audio collection module 401 , and the embodiment of the present application does not limit the manner in which the audio collection module 401 acquires the audio data from the audio module 160 .

音频处理模块402用于对音频数据进行预处理，例如声道转换、平滑处理、降噪处理等，并将预处理后的音频数据发送给音频识别模块403，以便于后续音频识别模块403进行唤醒词检测。The audio processing module 402 is used to preprocess the audio data, such as channel conversion, smoothing, noise reduction, etc., and send the preprocessed audio data to the audio recognition module 403, so that the subsequent audio recognition module 403 can wake up word detection.

音频识别模块403用于对音频数据进行唤醒词检测，可通过语音唤醒模型确定音频数据中是否存在预设唤醒词。通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度值，如果相似度值大于或者等于预设相似度阈值，则向麦克风163、受话器162、耳机通话器发送唤醒指令，唤醒后通过扬声器161从音频模块160获取预存的应答语音数据，将应答语音数据转换声音，进行语音应答。例如，音频识别模块403可以根据第一时间段内的Q个采样时刻的音频数据进行唤醒词检测，第一时间段又可以称之为唤醒词时间窗等，通常设置为一个不小于用户发出唤醒词声音所需的时长。其中，Q个采样时刻中相邻的两个采样间隔之间间隔的时长为第一采样间隔，即音频处理模块402可以每间隔第一采样间隔向音频识别模块403发送预处理后的音频数据，音频识别模块403每间隔第一采样间隔，根据最近接收到的Q个采样时刻的音频数据进行一次唤醒词检测。当音频识别模块403检测到唤醒词时，通过语音唤醒模型计算唤醒词与预设唤醒词的相似度值，如果相似度值大于或者等于预设相似度阈值，确定唤醒电子设备，音频识别模块403向扬声器161发送唤醒指令。需要说明的是，本申请实施例中第一采样间隔的取值可以为0.1ms、0.2ms等，可以是预先设置的，也可以是根据预设算法确定的，对此本实施例不作任何限制。在另一些实施例中，音频识别模块403还可以对音频数据进行语音数据识别，以及对语音数据中的语义进行识别等。The audio recognition module 403 is configured to perform wake-up word detection on the audio data, and can determine whether there is a preset wake-up word in the audio data through a voice wake-up model. Calculate the similarity value between the wake-up word in the audio data and the acoustic features of the preset wake-up word through the voice wake-up model. If the similarity value is greater than or equal to the preset similarity threshold, send a wake-up call to the microphone 163, the receiver 162, and the earphone. command, after waking up, the pre-stored response voice data is obtained from the audio module 160 through the speaker 161, the response voice data is converted into sound, and a voice response is performed. For example, the audio recognition module 403 can perform wake-up word detection according to the audio data at Q sampling moments in the first time period, which can also be called the wake-up word time window, etc., and is usually set to a value not smaller than the wake-up word issued by the user. The time required for the sound of the word. Wherein, the duration of the interval between two adjacent sampling intervals in the Q sampling moments is the first sampling interval, that is, the audio processing module 402 may send the preprocessed audio data to the audio recognition module 403 at every first sampling interval, The audio recognition module 403 performs a wake-up word detection every first sampling interval according to the audio data of the Q sampling moments recently received. When the audio recognition module 403 detects the wake-up word, it calculates the similarity value between the wake-up word and the preset wake-up word through the voice wake-up model. If the similarity value is greater than or equal to the preset similarity threshold, it is determined to wake up the electronic device, and the audio recognition module 403 A wake-up command is sent to the speaker 161 . It should be noted that, in this embodiment of the present application, the value of the first sampling interval may be 0.1 ms, 0.2 ms, etc., which may be preset or determined according to a preset algorithm, which is not limited in this embodiment. . In other embodiments, the audio recognition module 403 may further perform speech data recognition on the audio data, and recognize semantics in the speech data, and the like.

交互模块404用于与其它设备进行信息交互，例如与设备组内的其他从设备进行信息交互。交互模块404用于当接收到设备组内的其他从设备发送的唤醒识别结果时，向音频识别模块403发送其他从设备的唤醒识别结果，以便于音频识别模块403结合其他从设备的唤醒识别结果判定是否唤醒电子设备。交互模块404还用于向设备组内的其他从设备发送控制指令，以使其他从设备配合主设备处理用户的指令意图。The interaction module 404 is used for information interaction with other devices, for example, information interaction with other slave devices in the device group. The interaction module 404 is used to send the wake-up recognition results of other slave devices to the audio recognition module 403 when receiving the wake-up recognition results sent by other slave devices in the device group, so that the audio recognition module 403 can combine the wake-up recognition results of other slave devices. Determines whether to wake up the electronic device. The interaction module 404 is further configured to send control instructions to other slave devices in the device group, so that the other slave devices cooperate with the master device to process the user's instruction intention.

此外，在一些实施例中，电子设备还可以包括音频合成模块(audio synthesizer)405。其中，音频合成模块405用于合成相应的应答语音数据，并将应答语音数据转换为声音进行播放。例如，电子设备可以响应于采集到用户发出的“小艺小艺”的声音后，通过扬声器161播放“请问需要什么帮助”的声音。在这种情况下，音频合成模块405用于响应于采集到用户发出的“小艺小艺”的声音，合成相应的应答语音数据，并将合成的应答语音数据转换为“请问需要什么帮助”的声音，并播放。Additionally, in some embodiments, the electronic device may further include an audio synthesizer 405 . The audio synthesis module 405 is used for synthesizing corresponding response voice data, and converting the response voice data into sound for playback. For example, the electronic device may play the sound of "What help is needed" through the speaker 161 in response to collecting the sound of "Xiaoyi Xiaoyi" uttered by the user. In this case, the audio synthesis module 405 is used to synthesize corresponding response voice data in response to the collected voice of "Xiaoyi Xiaoyi" from the user, and convert the synthesized response voice data into "What help do I need?" sound and play.

应理解，图3所示的软件结构仅是一个示例。本申请实施例的电子设备可以具有比图中所示电子设备更多的或者更少的模块，可以组合两个或更多的模块等。图中所示出的各个模块可以在包括一个或多个信号处理和/或专用集成电路在内的硬件、软件、或硬件和软件的组合中实现。It should be understood that the software structure shown in FIG. 3 is only an example. The electronic device of the embodiments of the present application may have more or less modules than the electronic device shown in the figure, two or more modules may be combined, and so on. The various modules shown in the figures may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application specific integrated circuits.

需要说明的是，图3中所示的音频收集模块401、音频识别模块403、音频处理模块402、交互模块404、音频合成模块405可以集成在图2所示的处理器110中的一个或多个处理单元中，例如，音频收集模块401、音频识别模块403、音频处理模块402、交互模块404、音频合成模块405中的部分或全部可以集成在应用处理器、专用处理器等一个或多个处理器中。需要说明的是，本申请实施例中的专用处理器可以为DSP、专用集成电路(applicationspecific integrated circuit，ASIC)芯片等。It should be noted that the audio collection module 401, audio identification module 403, audio processing module 402, interaction module 404, and audio synthesis module 405 shown in FIG. 3 may be integrated into one or more of the processor 110 shown in FIG. 2 . Among the processing units, for example, some or all of the audio collection module 401, audio identification module 403, audio processing module 402, interaction module 404, and audio synthesis module 405 may be integrated in one or more of application processors, special-purpose processors, etc. in the processor. It should be noted that, the dedicated processor in the embodiments of the present application may be a DSP, an application specific integrated circuit (application specific integrated circuit, ASIC) chip, or the like.

以下实施例均可以在具有上述硬件结构和/或软件结构的电子设备中实现。The following embodiments can all be implemented in an electronic device having the above-mentioned hardware structure and/or software structure.

下面结合附图对本申请提供的语音唤醒方法的使用场景进行详细介绍。The usage scenarios of the voice wake-up method provided by the present application will be described in detail below with reference to the accompanying drawings.

图4a至图4c以智能音箱为设备组中的电子设备进行举例。如图4a所示，该场景包括6个音箱，这6个音箱组成一个设备组，分布在区域1(例如客厅)的不同位置处。其中，音箱11为设备组的主设备，音箱12至音箱16为设备组的从设备，设备组内的各个音箱之间相互连接(图4a仅示出了主设备与各从设备之间的连接关系)。设备组中各个音箱的设备类型相同，且唤醒词相同。设备组中各个音箱在唤醒时刻均可接收或者采集到的用户发出的唤醒语音，设备组内只有主设备音箱11进行唤醒应答，音箱11综合设备组内其他音箱的唤醒识别结果进行唤醒决策，确定音箱11是否被唤醒，在音箱11被唤醒后，从设备音箱12至音箱16可配合音箱11处理用户的指令意图。图4b与图4a的区别在于，设备组中的6个音箱分布在不同区域，作为一种示例，如图4b所示，主设备音箱11和从设备音箱12位于区域1，从设备音箱13至音箱16位于区域2(例如卧室)。在该场景中，用户可以在上述两个区域的任意区域发出唤醒语音，用户与设备组内主设备音箱11的距离不定，有可能距离较近，也有可能距离较远，且在唤醒语音的传输路径上还可能存在噪声源。图4c与图4a的区别在于，设备组中各个音箱的硬件规格不相同，但唤醒词相同。可根据音箱的硬件规格，将音箱细分为音箱Pro、常规音箱以及音箱mini，各音箱的硬件规格的大小关系为：音箱Pro＞常规音箱＞音箱mini。其中，硬件规格包括但不限于音箱内置芯片的处理性能、尺寸大小等。如图4c所示，将硬件规格最佳的音箱Pro11作为设备组的主设备。Figures 4a to 4c take a smart speaker as an example of an electronic device in the device group. As shown in Figure 4a, the scene includes 6 speakers, which form a device group and are distributed at different locations in area 1 (eg, living room). Among them, the speaker 11 is the master device of the device group, the speakers 12 to 16 are the slave devices of the device group, and each speaker in the device group is connected to each other (FIG. 4a only shows the connection between the master device and each slave device) relation). Each speaker in the device group has the same device type and the same wake word. Each speaker in the device group can receive or collect the wake-up voice sent by the user at the time of wake-up. In the device group, only the main device speaker 11 responds to the wake-up. Whether the speaker 11 is awakened, after the speaker 11 is awakened, the device speaker 12 to the speaker 16 can cooperate with the speaker 11 to process the user's instruction intention. The difference between Fig. 4b and Fig. 4a is that the 6 speakers in the device group are distributed in different regions. As an example, as shown in Fig. 4b, the master device speaker 11 and the slave device speaker 12 are located in region 1, and the slave device speakers 13 to The loudspeaker 16 is located in area 2 (eg bedroom). In this scenario, the user can send a wake-up voice in any of the above two areas. The distance between the user and the main device speaker 11 in the device group is not fixed, and the distance may be short or far away, and the transmission of the wake-up voice is in progress. There may also be noise sources on the path. The difference between Figure 4c and Figure 4a is that the hardware specifications of the speakers in the device group are different, but the wake-up words are the same. According to the hardware specifications of the speakers, the speakers can be subdivided into speaker Pro, conventional speakers and speaker mini. The size relationship of the hardware specifications of each speaker is: speaker Pro > conventional speakers > speaker mini. Among them, the hardware specifications include but are not limited to the processing performance and size of the built-in chip of the speaker. As shown in Figure 4c, the speaker Pro11 with the best hardware specifications is used as the main device of the device group.

上述使用场景仅作为示例，场景中同一设备组中的各个电子设备可以是同一类型的设备，例如设备组内都是智能音箱，如图4a至图4c，也可以是不同类型的设备，例如设备组内有智能音箱、智能手机、智能电视等，如图1所示。本申请实施例对设备组中的电子设备的设备类型不作任何限制。The above usage scenarios are only examples. In the scenario, each electronic device in the same device group may be the same type of device. For example, the device group is all smart speakers, as shown in Figure 4a to Figure 4c, or different types of devices, such as devices There are smart speakers, smart phones, smart TVs, etc. in the group, as shown in Figure 1. This embodiment of the present application does not impose any restrictions on the device types of the electronic devices in the device group.

作为一种示例，用户可通过任意具有显示功能的电子设备，例如智能手机、平板电脑等，管理或设置设备组中的电子设备。图5示出了本申请实施例的一种用户界面交互图。如图5所示，用户界面501包括状态栏、多个应用程序的图标控件(例如“智能家居”图标)、时间和天气小组件等。当电子设备检测到用户的手指(或者触控笔)针对某一应用图标的触摸操作后，响应于该触摸操作，电子设备启动该应用程序，并在显示屏上显示该应用程序的用户界面。示例性的，电子设备检测到对“智能家居”图标的触摸操作，响应于该触摸操作，在显示屏上显示用户的设备界面502，设备界面502包括用户已添加的智能设备列表(如图5所示的设备a、b、c、d)、第一控件503、第二控件504、第三控件505。用户可通过点击第一控件503添加新的智能设备至智能设备列表中，用户还可通过点击第二控件504、第三控件505，将智能设备列表中的多个智能设备添加至同一设备组中。在设备组界面506上，用户可通过点击第四控件507，设置设备组中的主设备(如图5中将设备a设置为主设备)，用户还可通过点击设备控件508，进入设备组中任意智能设备界面509，查询某一智能设备的设备信息(例如设备型号、数据记录、唤醒灵敏度等)。示例性的，对于智能设备的唤醒灵敏度，用户可通过第五控件510设置开启或关闭智能设备的唤醒增强功能。As an example, the user can manage or set the electronic devices in the device group through any electronic device having a display function, such as a smart phone, a tablet computer, and the like. FIG. 5 shows a user interface interaction diagram according to an embodiment of the present application. As shown in FIG. 5, user interface 501 includes a status bar, icon controls for multiple applications (eg, a "smart home" icon), time and weather widgets, and the like. After the electronic device detects a touch operation of a user's finger (or stylus) on an application icon, in response to the touch operation, the electronic device starts the application and displays the user interface of the application on the display screen. Exemplarily, the electronic device detects a touch operation on the "smart home" icon, and in response to the touch operation, displays the user's device interface 502 on the display screen, and the device interface 502 includes a list of smart devices that the user has added (as shown in FIG. 5 ). Shown devices a, b, c, d), first control 503, second control 504, third control 505. The user can add a new smart device to the smart device list by clicking the first control 503, and the user can also add multiple smart devices in the smart device list to the same device group by clicking the second control 504 and the third control 505. . On the device group interface 506, the user can click the fourth control 507 to set the main device in the device group (device a is set as the main device in FIG. 5), and the user can also click the device control 508 to enter the device group Any smart device interface 509 , to query device information of a certain smart device (eg, device model, data record, wake-up sensitivity, etc.). Exemplarily, for the wake-up sensitivity of the smart device, the user can set through the fifth control 510 to enable or disable the wake-up enhancement function of the smart device.

基于上述对电子设备以及场景的介绍，下面结合具体的几个实施例对本申请提供的语音唤醒方法进行详细说明。Based on the above introduction of electronic devices and scenarios, the voice wake-up method provided by the present application will be described in detail below with reference to several specific embodiments.

图6示出了本申请实施例提供的语音唤醒方法的交互示意图。示例性的，以3个电子设备组成的设备组为例，假设第一电子设备为设备组的主设备，第二电子设备a和第二电子设备b为设备组的从设备。如图6所示，用户发出唤醒语音，设备组中的各个电子设备均可接收或采集到该唤醒语音，设备组中的各个电子设备分别对唤醒语音进行唤醒词检测，得到唤醒识别结果。具体的，第一电子设备计算唤醒语音的第一唤醒置信度；第二电子设备a计算唤醒语音的第二唤醒置信度，确定允许或禁止第二电子设备a被唤醒；第二电子设备b计算唤醒语音的第三唤醒置信度，确定允许或禁止第二电子设备b被唤醒。第二电子设备a和第二电子设备b在确定唤醒识别结果后，分别向第一电子设备发送各自确定的唤醒识别结果，以使第一电子设备根据自身确定的第一唤醒置信度以及设备组中其他从设备发送的唤醒识别结果，确定允许或禁止第一电子设备被唤醒。应理解，由于各个电子设备与用户之间的距离不同、传输路径上干扰因素的不同、各个电子设备处理性能的差异等，各个电子设备计算得到的唤醒置信度可能相同也可能不同，因此唤醒识别结果可能不同。FIG. 6 shows an interactive schematic diagram of the voice wake-up method provided by the embodiment of the present application. Exemplarily, taking a device group composed of three electronic devices as an example, it is assumed that the first electronic device is the master device of the device group, and the second electronic device a and the second electronic device b are slave devices of the device group. As shown in FIG. 6 , when a user sends a wake-up voice, each electronic device in the device group can receive or collect the wake-up voice, and each electronic device in the device group performs wake-up word detection on the wake-up voice to obtain a wake-up recognition result. Specifically, the first electronic device calculates the first wake-up confidence of the wake-up voice; the second electronic device a calculates the second wake-up confidence of the wake-up voice, and determines whether to allow or prohibit the second electronic device a from being woken up; the second electronic device b calculates The third wake-up confidence level of the wake-up voice determines whether the second electronic device b is allowed or prohibited to be woken up. After determining the wake-up recognition result, the second electronic device a and the second electronic device b send the respectively determined wake-up recognition results to the first electronic device, so that the first electronic device can determine the first wake-up confidence level and the device group according to the first electronic device itself. The wake-up identification results sent by other slave devices in the device determine whether to allow or prohibit the first electronic device from being woken up. It should be understood that due to the difference in the distance between each electronic device and the user, the difference in the interference factors on the transmission path, the difference in the processing performance of each electronic device, etc., the wake-up confidence calculated by each electronic device may be the same or different. Results may vary.

作为一种示例，图7示出了一种语音唤醒方法的流程示意图，该方法以第一电子设备为执行主体，如图7所示，该方法具体包括如下步骤：As an example, FIG. 7 shows a schematic flowchart of a voice wake-up method. The method takes the first electronic device as the execution body. As shown in FIG. 7 , the method specifically includes the following steps:

步骤101、获取音频数据。Step 101: Acquire audio data.

第一电子设备可以通过声音采集设备，例如麦克风、其他用于采集声音的传感器等，采集周围环境的声音。声音采集设备在采集到周围环境声音后，将周围环境声音转换为音频电数据，并将音频电数据输出给音频模块160，音频模块160进行编码和/或模数转换得到相应格式的音频数据。当音频模块160得到相应格式的音频数据后，可以将相应格式的音频数据发送给处理器110中的音频收集模块401。音频收集模块401将相应格式的音频数据存储到存储器(例如内部存储器121、或者处理器110中的存储器等)中，并将相应格式的音频数据发送给音频处理模块402。音频处理模块402对相应格式的音频数据进行预处理，得到处理后的音频数据。其中，音频数据的预处理包括声道转换、平滑处理、降噪处理等。可选的，在一些实施例中，音频模块160获取相应格式的音频数据，还可以向处理器110中的音频收集模块401发送通知，音频收集模块401接收到该通知后，从音频模块401获取相应格式的音频数据，然后将相应格式的音频数据存储到存储器中，并向音频处理模块402发送相应格式的音频数据，音频处理模块402对相应格式的音频数据进行预处理，得到处理后的音频数据。The first electronic device may collect the sound of the surrounding environment through a sound collecting device, such as a microphone, other sensors for collecting sound, and the like. After collecting ambient sound, the sound collection device converts the ambient sound into audio electrical data, and outputs the audio electrical data to the audio module 160, which performs encoding and/or analog-to-digital conversion to obtain audio data in a corresponding format. After the audio module 160 obtains the audio data in the corresponding format, the audio data in the corresponding format can be sent to the audio collection module 401 in the processor 110 . The audio collection module 401 stores the audio data in the corresponding format in a memory (eg, the internal memory 121 or the memory in the processor 110 , etc.), and sends the audio data in the corresponding format to the audio processing module 402 . The audio processing module 402 preprocesses the audio data in the corresponding format to obtain processed audio data. The preprocessing of audio data includes channel conversion, smoothing, noise reduction, and the like. Optionally, in some embodiments, the audio module 160 acquires audio data in a corresponding format, and may also send a notification to the audio collection module 401 in the processor 110. After receiving the notification, the audio collection module 401 acquires the audio data from the audio module 401. Audio data of the corresponding format, then store the audio data of the corresponding format in the memory, and send the audio data of the corresponding format to the audio processing module 402, and the audio processing module 402 preprocesses the audio data of the corresponding format to obtain the processed audio. data.

步骤102、计算音频数据的第一唤醒置信度。其中，第一唤醒置信度用于指示音频数据中唤醒词与预设唤醒词的声学特征相似度。Step 102: Calculate the first wake-up confidence level of the audio data. Wherein, the first wake-up confidence level is used to indicate the similarity of acoustic features between the wake-up word in the audio data and the preset wake-up word.

第一电子设备可以通过处理器110对经过预处理后的音频数据进行唤醒词检测。示例性的，第一电子设备通过处理器110中的音频识别模块403对经过预处理后的音频数据进行唤醒词检测。其中，音频识别模块403是从音频处理模块402获取的经过预处理后的音频数据。The first electronic device may perform wake word detection on the preprocessed audio data through the processor 110 . Exemplarily, the first electronic device performs wake word detection on the preprocessed audio data through the audio recognition module 403 in the processor 110 . The audio identification module 403 is the preprocessed audio data obtained from the audio processing module 402 .

具体的，音频识别模块403对经过预处理后的音频数据进行唤醒词检测，可通过语音唤醒模型确定音频数据中是否存在预设唤醒词。语音唤醒模型是经过大量音频数据预先训练得到的，通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度值。其中，语音唤醒模型的输入为经过预处理后的音频数据，语音唤醒模型的输出可以包括唤醒标识，唤醒标识包括允许或者禁止电子设备被唤醒的标识。示例性的，唤醒标识为0，用于指示禁止电子设备被唤醒；唤醒标识为1，用于指示允许电子设备被唤醒。在一些实施例中，语音唤醒模型的输出可以包括唤醒置信度，唤醒置信度用于指示音频数据中唤醒词与预设唤醒词的声学特征相似度。在一些实施例中，语音唤醒模型的输出可以同时包括上述的唤醒标识和唤醒置信度。音频识别模块403通过语音唤醒模型得到唤醒标识和/或唤醒置信度。Specifically, the audio recognition module 403 performs wake-up word detection on the preprocessed audio data, and can determine whether there is a preset wake-up word in the audio data through a voice wake-up model. The voice wake-up model is pre-trained with a large amount of audio data, and the voice wake-up model is used to calculate the similarity between the wake-up words in the audio data and the acoustic features of the preset wake-up words. The input of the voice wake-up model is preprocessed audio data, the output of the voice wake-up model may include a wake-up identifier, and the wake-up identifier includes an identifier that allows or prohibits the electronic device from being woken up. Exemplarily, the wake-up flag is 0, which is used to indicate that the electronic device is prohibited from being woken up; the wake-up flag is 1, which is used to indicate that the electronic device is allowed to be woken up. In some embodiments, the output of the voice wake-up model may include a wake-up confidence level, where the wake-up confidence level is used to indicate the similarity between the acoustic features of the wake-up word in the audio data and the preset wake-up word. In some embodiments, the output of the voice wake-up model may include the aforementioned wake-up identification and wake-up confidence at the same time. The audio recognition module 403 obtains the wake-up identification and/or the wake-up confidence level through the voice wake-up model.

步骤103、接收至少一个第二电子设备发送的唤醒识别结果。其中，唤醒识别结果用于指示允许或禁止至少一个第二电子设备被唤醒。Step 103: Receive a wake-up identification result sent by at least one second electronic device. Wherein, the wake-up identification result is used to indicate that at least one second electronic device is allowed or prohibited to be woken up.

在本实施例中，设备组中的各从设备均具有唤醒识别功能。与上述步骤101和步骤102类似，设备组中的各第二电子设备同样可以对接收到的音频数据进行预处理，然后对经过预处理后的音频数据进行唤醒词检测，可通过语音唤醒模型确定音频数据中是否存在预设唤醒词，通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度，最终得到唤醒标识和/或唤醒置信度。相应的，各第二电子设备发送的唤醒识别结果可以包括唤醒标识和/或唤醒置信度。第一电子设备可以通过交互模块404接收设备组中的至少一个第二电子设备发送的唤醒识别结果，交互模块404接收到至少一个第二电子设备发送的唤醒识别结果时，向音频识别模块403发送至少一个第二电子设备的唤醒识别结果，以便音频识别模块403根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果，判定是否唤醒第一电子设备。In this embodiment, each slave device in the device group has a wake-up recognition function. Similar to the above steps 101 and 102, each second electronic device in the device group can also preprocess the received audio data, and then perform wake-up word detection on the pre-processed audio data, which can be determined by the voice wake-up model. Whether there is a preset wake-up word in the audio data, the similarity of the acoustic features of the wake-up word in the audio data and the preset wake-up word is calculated by the voice wake-up model, and finally the wake-up identification and/or the wake-up confidence level are obtained. Correspondingly, the wake-up identification result sent by each second electronic device may include a wake-up identification and/or a wake-up confidence level. The first electronic device can receive the wake-up recognition result sent by at least one second electronic device in the device group through the interaction module 404. When the interaction module 404 receives the wake-up recognition result sent by the at least one second electronic device, it sends to the audio recognition module 403. The wake-up identification result of the at least one second electronic device, so that the audio identification module 403 determines whether to wake up the first electronic device according to the first wake-up confidence level and the wake-up identification result sent by the at least one second electronic device.

步骤104、根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果，确定允许或禁止第一电子设备被唤醒。Step 104: Determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result sent by the at least one second electronic device.

在本步骤中，第一电子设备可以通过处理器110中的音频识别模块403，确定允许或禁止第一电子设备被唤醒。具体的，音频识别模块403首先判断第一唤醒置信度与预设唤醒阈值的大小关系，根据判断结果确定允许或禁止第一电子设备被唤醒。其中，预设唤醒阈值可以包括一个阈值，也可以包括两个阈值。In this step, the first electronic device may determine whether to allow or prohibit the first electronic device from being woken up through the audio identification module 403 in the processor 110 . Specifically, the audio recognition module 403 firstly judges the relationship between the first wake-up confidence and the preset wake-up threshold, and determines whether to allow or prohibit the first electronic device from being woken up according to the judgment result. The preset wake-up threshold may include one threshold or two thresholds.

在一种可能的实现方式中，预设唤醒阈值包括一个阈值，为第一阈值。如图8a所示，音频识别模块403可根据第一唤醒置信度与第一阈值的大小关系，确定允许或禁止第一电子设备被唤醒。具体的，若第一唤醒置信度大于或者等于第一阈值，音频识别模块403确定允许第一电子设备被唤醒；若第一唤醒置信度小于第一阈值，音频识别模块403根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果，确定允许或禁止第一电子设备被唤醒。该实现方式中，如果第一电子设备计算得到的第一唤醒置信度较高时，第一电子设备可直接判定允许第一电子设备被唤醒。如果第一电子设备计算得到的第一唤醒置信度不是很高时，第一电子设备需要结合设备组中其他从设备发送的唤醒识别结果，综合判定第一电子设备是否允许被唤醒，提升设备组内主设备唤醒的准确性。In a possible implementation manner, the preset wake-up threshold includes a threshold, which is a first threshold. As shown in FIG. 8a, the audio identification module 403 may determine whether to allow or prohibit the first electronic device from being woken up according to the magnitude relationship between the first wake-up confidence level and the first threshold. Specifically, if the first wake-up confidence level is greater than or equal to the first threshold, the audio recognition module 403 determines that the first electronic device is allowed to be awakened; if the first wake-up confidence level is less than the first threshold, the audio recognition module 403 determines according to the first wake-up confidence level and the wake-up identification result sent by the at least one second electronic device to determine whether to allow or prohibit the first electronic device from being woken up. In this implementation manner, if the first wake-up confidence level calculated by the first electronic device is high, the first electronic device may directly determine that the first electronic device is allowed to be woken up. If the first wake-up confidence calculated by the first electronic device is not very high, the first electronic device needs to combine the wake-up identification results sent by other slave devices in the device group to comprehensively determine whether the first electronic device is allowed to be woken up, and improve the device group. The accuracy of the wake-up of the master device.

在另一种可能的实现方式中，预设唤醒阈值包括两个阈值，分别为第一阈值和第二阈值，其中，第一阈值大于第二阈值。如图8b所示，音频识别模块403可根据第一唤醒置信度与第一阈值、第二阈值的大小关系，确定允许或禁止第一电子设备被唤醒。具体的，若第一唤醒置信度大于或者等于第一阈值，音频识别模块403确定允许第一电子设备被唤醒；若第一唤醒置信度小于或者等于第二阈值，音频识别模块403确定禁止第一电子设备被唤醒；若第一唤醒置信度小于第一阈值且大于第二阈值，音频识别模块403根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果，确定允许或禁止第一电子设备被唤醒。与第一种实现方式相比，该实现方式增设了第二阈值，第二阈值是一个相对较低的判断阈值，如果第一电子设备计算得到的第一唤醒置信度小于或者等于第二阈值时，可认为第一唤醒置信度确实很低，第一电子设备无需结合设备组内其他从设备发送的唤醒识别结果进行综合判定，可直接判定禁止第一电子设备被唤醒，该方式可以避免不必要的判断过程，提高第一电子设备的处理速率，通过该方式也可以提升设备组内主设备唤醒的准确性。In another possible implementation manner, the preset wake-up threshold includes two thresholds, which are a first threshold and a second threshold, wherein the first threshold is greater than the second threshold. As shown in FIG. 8b , the audio recognition module 403 may determine whether to allow or prohibit the first electronic device from being woken up according to the relationship between the first wake-up confidence level and the first threshold and the second threshold. Specifically, if the first wake-up confidence is greater than or equal to the first threshold, the audio recognition module 403 determines to allow the first electronic device to be awakened; if the first wake-up confidence is less than or equal to the second threshold, the audio recognition module 403 determines to prohibit the first electronic device The electronic device is woken up; if the first wake-up confidence level is less than the first threshold and greater than the second threshold, the audio recognition module 403 determines to allow or prohibit the first wake-up confidence level and the wake-up recognition result sent by at least one second electronic device Electronic device wakes up. Compared with the first implementation, this implementation adds a second threshold, which is a relatively low judgment threshold, if the first wake-up confidence calculated by the first electronic device is less than or equal to the second threshold , it can be considered that the first wake-up confidence is indeed very low. The first electronic device does not need to make a comprehensive judgment in combination with the wake-up recognition results sent by other slave devices in the device group, and can directly determine that the first electronic device is prohibited from being woken up. This method can avoid unnecessary The judgment process of the first electronic device improves the processing rate of the first electronic device, and the accuracy of the wake-up of the master device in the device group can also be improved by this method.

需要说明的是，本申请实施例中设备组内的各个电子设备的预设唤醒阈值可以是相同的阈值，例如，第一电子设备和任意第二电子设备中预设唤醒阈值包括一个阈值，该阈值为第一阈值，第一电子设备根据第一阈值和当前计算得到的第一唤醒置信度进行初步判定，同样的，第二电子设备根据第一阈值和当前计算得到的第二唤醒置信度进行唤醒识别判定，并将唤醒识别结果发送至第一电子设备。在一些实施例中，设备组内的各个电子设备的预设唤醒阈值还可以是不同的阈值，例如，第一电子设备中预设唤醒阈值包括一个阈值，该阈值为第一阈值，第二电子设备中预设唤醒阈值也包括一个阈值，该阈值可以是大于或者小于第一阈值的数值，由此可见，设备组内的各个电子设备可分别基于各自预设的唤醒阈值，进行唤醒识别判定。It should be noted that the preset wake-up thresholds of each electronic device in the device group in this embodiment of the present application may be the same threshold. For example, the preset wake-up thresholds in the first electronic device and any second electronic device include a threshold, which The threshold is the first threshold, and the first electronic device makes a preliminary determination according to the first threshold and the currently calculated first wake-up confidence. Similarly, the second electronic device makes a preliminary determination according to the first threshold and the currently calculated second wake-up confidence. The wake-up identification is determined, and the wake-up identification result is sent to the first electronic device. In some embodiments, the preset wake-up thresholds of each electronic device in the device group may also be different thresholds. For example, the preset wake-up threshold in the first electronic device includes a threshold, the threshold is the first threshold, and the second electronic device The preset wake-up threshold in the device also includes a threshold, and the threshold can be a value greater than or less than the first threshold. It can be seen that each electronic device in the device group can perform wake-up identification determination based on the respective preset wake-up thresholds.

综上，无论上述哪一种实现方式，均包括如下步骤：根据第一唤醒置信度和至少一个第二电子设备发送的唤醒识别结果，确定允许或禁止第一电子设备被唤醒。该步骤包括如下三种可能的实现方式：To sum up, no matter which of the above implementation manners, the following steps are included: according to the first wake-up confidence level and the wake-up identification result sent by at least one second electronic device, it is determined whether to allow or prohibit the first electronic device from being woken up. This step includes the following three possible implementations:

在第一种可能的实现方式中，第一电子设备通过至少一个第二电子设备发送的唤醒识别结果统计设备组内从设备的唤醒情况，判断从设备的唤醒情况是否满足预设的唤醒条件，如果满足预设的唤醒条件，则确定允许第一电子设备被唤醒，如果不满足预设的唤醒条件，则确定禁止第一电子设备被唤醒。In a first possible implementation manner, the first electronic device counts the wake-up status of the slave devices in the device group through the wake-up identification result sent by at least one second electronic device, and determines whether the wake-up status of the slave device satisfies a preset wake-up condition, If the preset wake-up condition is satisfied, it is determined to allow the first electronic device to be woken up, and if the preset wake-up condition is not satisfied, it is determined to prohibit the first electronic device from being woken up.

在第二种可能的实现方式中，第一电子设备基于连续方式动态调整设备组内第一电子设备的预设唤醒阈值，并根据调整后的预设唤醒阈值与当前第一电子设备计算得到的第一唤醒置信度进行比较，确定允许或禁止第一电子设备被唤醒。In a second possible implementation manner, the first electronic device dynamically adjusts the preset wake-up threshold of the first electronic device in the device group based on a continuous method, and calculates the preset wake-up threshold according to the adjusted preset wake-up threshold and the current first electronic device. The first wake-up confidence is compared to determine whether to allow or prohibit the first electronic device from being woken up.

在第三种可能的实现方式中，第一电子设备基于离散方式动态调整设备组内第一电子设备的预设唤醒阈值，并根据调整后的预设唤醒阈值与当前第一电子设备计算得到的第一唤醒置信度进行比较，确定允许或禁止第一电子设备被唤醒。In a third possible implementation manner, the first electronic device dynamically adjusts the preset wake-up threshold of the first electronic device in the device group based on a discrete method, and calculates the preset wake-up threshold according to the adjusted preset wake-up threshold and the current first electronic device. The first wake-up confidence is compared to determine whether to allow or prohibit the first electronic device from being woken up.

其中，第一种实现方式是基于预设规则，该预设规则充分考虑了设备组内其他从设备的唤醒识别结果，对设备组内主设备的唤醒条件进行优化，提升设备组内主设备唤醒的准确性。后两种实现方式均是从唤醒阈值的角度，结合设备组内其他从设备的唤醒识别结果对主设备的预设的唤醒阈值进行优化，其目的与第一种实现方式相同。Among them, the first implementation method is based on a preset rule, which fully considers the wake-up recognition results of other slave devices in the device group, optimizes the wake-up conditions of the master device in the device group, and improves the wake-up of the master device in the device group. accuracy. The latter two implementations both optimize the preset wakeup threshold of the master device from the perspective of the wakeup threshold combined with the wakeup identification results of other slave devices in the device group, the purpose of which is the same as that of the first implementation.

下面结合附图9至图11，分别对第一电子设备如何结合设备组中其他从设备发送的唤醒识别结果进行唤醒决策进行详细介绍。需要说明的是，下面几个实施例的判断流程均可通过第一电子设备的处理器110中的音频识别模块403来执行。The following describes in detail how the first electronic device makes a wake-up decision in combination with the wake-up identification results sent by other slave devices in the device group with reference to FIGS. 9 to 11 . It should be noted that, the judgment processes of the following several embodiments can be executed by the audio recognition module 403 in the processor 110 of the first electronic device.

图9示出了一种语音唤醒方法的判断流程图，如图9所示，若第一电子设备计算得到的第一唤醒置信度小于第一阈值，该方法包括如下步骤：FIG. 9 shows a judgment flowchart of a voice wake-up method. As shown in FIG. 9 , if the first wake-up confidence level calculated by the first electronic device is less than the first threshold, the method includes the following steps:

步骤201、根据至少一个第二电子设备发送的唤醒识别结果，统计至少一个第二电子设备的唤醒情况。Step 201 , according to the wake-up identification result sent by the at least one second electronic device, count the wake-up situation of the at least one second electronic device.

具体的，唤醒识别结果可以包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项。其中，唤醒标识包括允许或禁止第二电子设备被唤醒的标识，第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Specifically, the wake-up identification result may include at least one of a wake-up identifier of at least one second electronic device and a second wake-up confidence level. The wake-up identifier includes an identifier that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence level is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.

在一种可能的情况下，如果至少一个第二电子设备发送的唤醒识别结果中仅包括唤醒标识，第一电子设备可根据唤醒标识，统计允许被唤醒的第二电子设备的设备数量，以及该设备数量占设备组的设备总数的比例。In a possible situation, if the wake-up identification result sent by at least one second electronic device only includes a wake-up identifier, the first electronic device can count the number of devices of the second electronic device that are allowed to be woken up according to the wake-up identifier, and the The number of devices as a percentage of the total number of devices in the device group.

在另一种可能的情况下，如果至少一个第二电子设备发送的唤醒识别结果中仅包括第二唤醒置信度，第一电子设备首先需要根据第二唤醒置信度和各第二电子设备预设的唤醒阈值，确定第二电子设备是否允许被唤醒。然后统计允许被唤醒的第二电子设备的设备数量，以及该设备数量占设备组的设备总数的比例。其中，各第二电子设备的预设唤醒阈值可以是与第一电子设备预设的唤醒阈值相同的数值，例如第一阈值，也可以是与第一电子设备的预设唤醒阈值不同的数值。各第二电子设备的预设唤醒阈值可以相同也可以不同，对此本实施例不作任何限制。In another possible situation, if the wake-up identification result sent by at least one second electronic device only includes the second wake-up confidence level, the first electronic device first needs to preset the second wake-up confidence level and each second electronic device according to the second wake-up confidence level. The wake-up threshold determines whether the second electronic device is allowed to be woken up. Then, count the number of devices of the second electronic device that are allowed to be woken up, and the proportion of the number of devices to the total number of devices in the device group. The preset wake-up threshold of each second electronic device may be the same value as the preset wake-up threshold of the first electronic device, eg, the first threshold, or may be a different value from the preset wake-up threshold of the first electronic device. The preset wake-up thresholds of the second electronic devices may be the same or different, which is not limited in this embodiment.

在又一种可能的情况下，如果至少一个第二电子设备发送的唤醒标识结果中同时包括唤醒标识和第二唤醒置信度，第一电子设备可通过上述任意一种情况的统计方式，确定允许被唤醒的第二电子设备的设备数量，以及该设备数量占设备组的设备总数的比例。In yet another possible situation, if the wake-up identification result sent by at least one second electronic device includes both the wake-up identification and the second wake-up confidence, the first electronic device may determine whether to allow the The number of devices of the second electronic device that are awakened, and the proportion of the number of devices to the total number of devices in the device group.

步骤202、判断至少一个第二电子设备的唤醒情况是否满足预设的第一电子设备的唤醒条件，如果满足预设的第一电子设备的唤醒条件，则执行步骤203；如果不满足预设的第一电子设备的唤醒条件，则执行步骤204。Step 202: Determine whether the wake-up condition of at least one second electronic device satisfies the preset wake-up condition of the first electronic device, if it meets the preset wake-up condition of the first electronic device, perform step 203; If the wake-up condition of the first electronic device is satisfied, step 204 is executed.

在本实施例中，预设的第一电子设备的唤醒条件包括以下的任意一项：In this embodiment, the preset wake-up condition of the first electronic device includes any one of the following:

(1)设备组中除第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值。其中，第三阈值为第一电子设备中预设的允许第二电子设备被唤醒的阈值，该阈值可能大于某些第二电子设备的判断阈值。(1) The second wake-up confidence levels of all the second electronic devices in the device group except the first electronic device are greater than or equal to the third threshold. The third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up, and the threshold may be greater than the judgment threshold of some second electronic devices.

(2)设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例大于或者等于第一比例。示例性的，第一比例可以设置为80％，该条件的含义是：如果设备组中有80％的从设备允许被唤醒，可确定主设备第一电子设备允许被唤醒。(2) The ratio of the number of devices of the second electronic device allowed to be woken up in the device group to the total number of devices in the device group is greater than or equal to the first ratio. Exemplarily, the first ratio may be set to 80%, and the meaning of this condition is: if 80% of the slave devices in the device group are allowed to be woken up, it can be determined that the first electronic device of the master device is allowed to be woken up.

(3)设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例小于第一比例且大于第二比例，且允许被唤醒的第二电子设备的第二唤醒置信度均大于或者等于第三阈值。其中，第一比例大于第二比例。(3) The ratio of the number of devices of the second electronic device allowed to be awakened in the device group to the total number of devices in the device group is smaller than the first proportion and greater than the second proportion, and the second wake-up confidence level of the second electronic device allowed to be awakened are greater than or equal to the third threshold. Wherein, the first ratio is greater than the second ratio.

需要说明的是，上述第三种条件的设计初衷如下：各第二电子设备预设的唤醒阈值可能不同，即允许被唤醒的各第二电子设备的判断阈值存在差异，例如第二电子设备a的预设唤醒阈值为0.7，第二电子设备b的预设唤醒阈值为0.8，第二电子设备c的预设唤醒阈值为0.5。若各第二电子设备计算得到的音频数据的第二唤醒置信度相同，基于不同的预设唤醒阈值，各第二电子设备确定的唤醒识别结果将存在差异。基于此，第一电子设备统计得到的允许被唤醒的第二电子设备的设备数量的参考性不高。为了提高判断的准确性，可通过设置第三阈值，排查允许被唤醒的第二电子设备中是否存在第二唤醒置信度低于第三阈值的电子设备，如果存在，则确定禁止第一电子设备被唤醒。上述条件可有效避免由于部分从设备设置的预设唤醒阈值过低，导致主设备被误唤醒，提升主设备唤醒的准确性。It should be noted that the original intention of the above-mentioned third condition is as follows: the preset wake-up thresholds of the second electronic devices may be different, that is, the judgment thresholds of the second electronic devices that are allowed to be woken up are different, for example, the second electronic device a The preset wake-up threshold of the second electronic device b is 0.7, the preset wake-up threshold of the second electronic device b is 0.8, and the preset wake-up threshold of the second electronic device c is 0.5. If the second wake-up confidence levels of the audio data calculated by the second electronic devices are the same, based on different preset wake-up thresholds, the wake-up recognition results determined by the second electronic devices will be different. Based on this, the reference of the number of devices of the second electronic device that is allowed to be woken up obtained by the first electronic device is not high. In order to improve the accuracy of the judgment, a third threshold can be set to check whether there is an electronic device whose second wake-up confidence is lower than the third threshold in the second electronic device that is allowed to be woken up, and if so, it is determined to prohibit the first electronic device be awakened. The above conditions can effectively prevent the master device from being woken up by mistake due to the low preset wake-up threshold set by some slave devices, and improve the accuracy of the master device's wake-up.

示例性的，第一比例设置为80％，第二比例设置为50％，第三阈值设置为0.7，从设备包括第二电子设备a、b、c、d，各第二电子设备的预设唤醒阈值以及发送的唤醒识别结果参见表1，由表1可知，允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例为60％(3/5)，介于第一比例和第二比例之间，然而允许被唤醒的第二电子设备中存在第二唤醒置信度小于第三阈值0.7的电子设备，即第二电子设备c，因此不满足上述第三种条件，则确定第一电子设备不被唤醒。Exemplarily, the first ratio is set to 80%, the second ratio is set to 50%, the third threshold is set to 0.7, the slave devices include second electronic devices a, b, c, and d, and the preset values of each second electronic device are The wake-up threshold and the sent wake-up identification results are shown in Table 1. From Table 1, it can be seen that the number of devices of the second electronic device that is allowed to be woken up accounts for 60% (3/5) of the total number of devices in the device group, which is between the first ratio However, among the second electronic devices that are allowed to be woken up, there is an electronic device whose second wake-up confidence is less than the third threshold 0.7, that is, the second electronic device c, and therefore does not meet the third condition above, then determine The first electronic device is not awakened.

表1Table 1

第二电子设备second electronic device 预设唤醒阈值Preset wake-up threshold 第二唤醒置信度Second wake-up confidence 唤醒标识wake up sign aa 0.70.7 0.80.8 11 bb 0.70.7 0.50.5 00 cc 0.50.5 0.50.5 11 dd 0.80.8 0.80.8 11

步骤203、确定允许第一电子设备被唤醒。Step 203: Determine to allow the first electronic device to be woken up.

步骤204、确定禁止第一电子设备被唤醒。Step 204 , determining that the first electronic device is prohibited from being woken up.

上述方案提供了一种基于预设的主设备唤醒条件，在确定主设备计算得到的第一唤醒置信度小于第一阈值时，根据设备组中其他从设备发送的唤醒识别结果，统计设备组中从设备的唤醒情况，判断从设备的唤醒情况是否满足预设的主设备唤醒条件，如果满足，则确定允许设备组内主设备被唤醒。该方案能够快速对设备组内主设备是否执行唤醒应答进行判断，充分利用了设备组内其他设备的唤醒识别结果，提升了设备组内主设备唤醒的准确性。The above solution provides a wake-up condition based on the preset master device, when it is determined that the first wake-up confidence calculated by the master device is less than the first threshold, according to the wake-up recognition results sent by other slave devices in the device group, count the device group. The wake-up condition of the slave device determines whether the wake-up condition of the slave device satisfies the preset master device wake-up condition, and if so, it is determined to allow the master device in the device group to be woken up. This solution can quickly determine whether the master device in the device group performs a wake-up response, makes full use of the wake-up recognition results of other devices in the device group, and improves the accuracy of the master device in the device group wake-up.

应理解，如果设备组中除了主设备之外的大多数从设备均判定允许被唤醒，主设备的预设唤醒阈值过高的话，主设备可能会出现误判，因此可通过下述两个实施例提供的方法，结合设备组内其他从设备的唤醒情况，判定是否需要动态调整当前主设备的唤醒阈值，从而提升设备组内主设备唤醒的准确性。It should be understood that if most of the slave devices in the device group except the master device are determined to be allowed to wake up, and the preset wake-up threshold of the master device is too high, the master device may misjudgment, so the following two implementations can be implemented: The method provided by the example, combined with the wake-up conditions of other slave devices in the device group, determines whether the wake-up threshold of the current master device needs to be dynamically adjusted, thereby improving the wake-up accuracy of the master device in the device group.

图10示出了另一种语音唤醒方法的判断流程图，如图10所示，若第一电子设备计算得到的第一唤醒置信度小于第一阈值，该方法包括如下步骤：FIG. 10 shows a judgment flowchart of another voice wake-up method. As shown in FIG. 10 , if the first wake-up confidence level calculated by the first electronic device is less than the first threshold, the method includes the following steps:

步骤301、根据至少一个第二电子设备发送的唤醒识别结果，统计设备组内允许被唤醒的第二电子设备的第一权重值。Step 301 , according to the wake-up identification result sent by at least one second electronic device, count the first weight values of the second electronic devices allowed to be woken up in the device group.

本实施例中，引入了设备组内电子设备的权重值，权重值用于指示电子设备唤醒识别结果的置信度高低。权重值可以是与电子设备的设备类型相关的值，还可以是与电子设备的软/硬件性能相关的值。作为一种示例，智能家居中包括多种设备类型的智能设备，均具有语音唤醒功能，包括智能电视、智能音箱、智能灯、智能空调、智能冰箱等等，可以将上述多种类型的智能设备中具有相同唤醒词的设备组成一个设备组。假设该设备组包括1台智能电视、2个智能音箱、2盏智能灯，可预设设备组内的不同设备类型的权重值，例如智能电视的权重值为0.3，智能音箱的权重值为0.6，智能灯的权重值为0.1。作为另一种示例，设备组中的各个电子设备可以为同一类型的设备，例如智能音箱，如图4c所示，智能音箱可细分为音箱Pro、常规音箱、音箱mini，处理性能的大小关系为：音箱Pro＞常规音箱＞音箱mini，可根据音箱的处理性能预设各种音箱的权重值，例如音箱Pro的权重值为0.5，常规音箱的权重值为0.3，音箱mini的权重值为0.2。In this embodiment, the weight value of the electronic device in the device group is introduced, and the weight value is used to indicate the confidence level of the wake-up recognition result of the electronic device. The weight value may be a value related to the device type of the electronic device, and may also be a value related to the software/hardware performance of the electronic device. As an example, a smart home includes smart devices of various device types, all of which have a voice wake-up function, including smart TVs, smart speakers, smart lights, smart air conditioners, smart refrigerators, etc. Devices with the same wake word in the device form a device group. Assuming that the device group includes 1 smart TV, 2 smart speakers, and 2 smart lights, the weight values of different device types in the device group can be preset. For example, the weight value of a smart TV is 0.3, and the weight value of a smart speaker is 0.6 , the weight of the smart light is 0.1. As another example, each electronic device in the device group may be the same type of device, such as a smart speaker. As shown in Figure 4c, the smart speaker can be subdivided into speaker Pro, conventional speaker, and speaker mini, and the size relationship of processing performance It is: Speaker Pro > Conventional Speaker > Speaker Mini, the weight value of various speakers can be preset according to the processing performance of the speaker, for example, the weight value of the speaker Pro is 0.5, the weight value of the conventional speaker is 0.3, and the weight value of the speaker mini is 0.2 .

需要说明的是，设备组内的各个电子设备的权重值可以由第三方设备(例如电子设备厂商提供的服务平台)预先设置或者推荐默认值。作为一种示例，当某一电子设备新加入至设备组时，第三方设备可向该电子设备或者该电子设备加入的设备组的主设备下发该电子设备的权重值。作为一种示例，若该电子设备为设备组中的从设备，当该电子设备向设备组中的主设备发送唤醒识别结果时，可在唤醒识别结果中携带该电子设备的权重值，以便主设备及时获知该新增的电子设备的权重值。在一些实施例中，可以由主设备或者第三方设备统一维护各电子设备的权重值。It should be noted that, the weight value of each electronic device in the device group may be preset or recommended by a third-party device (eg, a service platform provided by an electronic device manufacturer). As an example, when an electronic device is newly added to a device group, the third-party device may deliver the weight value of the electronic device to the electronic device or the master device of the device group to which the electronic device is added. As an example, if the electronic device is a slave device in a device group, when the electronic device sends the wake-up recognition result to the master device in the device group, the weight value of the electronic device can be carried in the wake-up recognition result so that the master device can The device acquires the weight value of the newly added electronic device in time. In some embodiments, the weight value of each electronic device may be uniformly maintained by the main device or the third-party device.

在本实施例中，至少一个第二电子设备发送的唤醒识别结果可以包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项，以及至少一个第二电子设备的设备标识。其中，设备标识用于指示第二电子设备的设备类型。具体的，第一电子设备首先根据至少一个第二电子设备发送的唤醒识别结果，统计设备组内允许被唤醒的第二电子设备有哪些，具体可参见上述实施例的步骤201，此处不再赘述。在确定允许被唤醒的第二电子设备后，再根据第二电子设备的设备标识，统计设备组内允许被唤醒的第二电子设备的设备类型，以及允许被唤醒的第二电子设备的设备类型对应的设备数量。基于预设的每一种电子设备的权重值，最终统计出设备组内允许被唤醒的第二电子设备的第一权重值，具体可参见如下公式：In this embodiment, the wake-up identification result sent by the at least one second electronic device may include at least one of a wake-up identification of the at least one second electronic device, a second wake-up confidence level, and a device identification of the at least one second electronic device. The device identifier is used to indicate the device type of the second electronic device. Specifically, the first electronic device first counts the second electronic devices that are allowed to be woken up in the device group according to the wake-up identification result sent by at least one second electronic device. For details, please refer to step 201 in the above embodiment, which is not repeated here. Repeat. After determining the second electronic device that can be woken up, count the device types of the second electronic device that can be woken up and the device types of the second electronic device that can be woken up in the device group according to the device identifier of the second electronic device The corresponding number of devices. Based on the preset weight value of each electronic device, the first weight value of the second electronic device that is allowed to be woken up in the device group is finally calculated. For details, please refer to the following formula:

α＝γ₁×n₁+γ₂×n₂+…+γ_x×n_x 公式一α＝γ ₁ ×n ₁ +γ ₂ ×n ₂ +…+γ _x ×n _x Formula 1

m＝n₁+n₂+…+n_x 公式二m=n ₁ +n ₂ +…+n _x Formula 2

式中，α表示设备组内允许被唤醒的第二电子设备的第一权重值；In the formula, α represents the first weight value of the second electronic device that is allowed to be woken up in the device group;

γ_i表示第i种允许被唤醒的第二电子设备的权重值；γ _i represents the weight value of the i-th second electronic device that is allowed to be woken up;

n_i表示第i种允许被唤醒的第二电子设备的设备数量，其中，i＝1,2,…,x，x为大于或者等于1的正整数；n _i represents the number of devices of the i-th second electronic device that is allowed to be woken up, where i=1, 2, . . . , x, where x is a positive integer greater than or equal to 1;

x表示设备组内允许被唤醒的第二电子设备的设备类型的数量；x represents the number of device types of the second electronic device allowed to be woken up within the device group;

m表示设备组内允许被唤醒的第二电子设备的设备总数。m represents the total number of devices of the second electronic device allowed to be woken up in the device group.

示例性的，表2示出了设备组内除第一电子设备之外的所有第二电子设备的参数统计表，表2包括各个第二电子设备的权重值、设备总数以及允许被唤醒的设备数量。Exemplarily, Table 2 shows the parameter statistics table of all second electronic devices except the first electronic device in the device group, and Table 2 includes the weight value of each second electronic device, the total number of devices, and the devices that are allowed to be woken up. quantity.

表2Table 2

第二电子设备second electronic device 权重值Weights 设备总数Total number of devices 允许被唤醒的设备数量The number of devices allowed to be woken up aa 0.50.5 22 11 bb 0.30.3 11 00 cc 0.20.2 33 22

由表2可知，设备组中允许被唤醒的第二电子设备的设备总数为3，包括1个第二电子设备a和2个第二电子设备c，其中，第二电子设备a的权重值为0.5，第二电子设备c的权重值为0.2，根据上述公式一，设备组内允许被唤醒的第二电子设备的第一权重值α＝0.5×1+0.2×2＝0.9。It can be seen from Table 2 that the total number of second electronic devices allowed to be awakened in the device group is 3, including 1 second electronic device a and 2 second electronic devices c, where the weight of the second electronic device a is 0.5, the weight value of the second electronic device c is 0.2, according to the above formula 1, the first weight value α=0.5×1+0.2×2=0.9 of the second electronic device allowed to be woken up in the device group.

步骤302、统计设备组内所有第二电子设备的第二权重值。Step 302: Count the second weight values of all the second electronic devices in the device group.

作为一种示例，第一电子设备可根据至少一个第二电子设备发送的唤醒识别结果中的设备标识，统计设备组内所有第二电子设备的设备类型，以及每一种设备类型对应的第二电子设备的设备总数。作为另一种示例，第一电子设备可根据当前设备组内的互联情况，直接确定设备组内所有第二电子设备的设备类型，以及每一种设备类型对应的第二电子设备的设备总数。需要说明的是，设备组内新增的第二电子设备，首次与第一电子设备互联时，可将第二电子设备的设备信息发送给第一电子设备，以便第一电子设备预存设备组内所有第二电子设备的设备信息，其中，上述设备信息包括设备标识和/或权重值。As an example, the first electronic device may count the device types of all second electronic devices in the device group according to the device identifier in the wake-up identification result sent by at least one second electronic device, and the second electronic device corresponding to each device type. The total number of electronic devices. As another example, the first electronic device may directly determine the device types of all the second electronic devices in the device group and the total number of second electronic devices corresponding to each device type according to the interconnection situation in the current device group. It should be noted that, when the second electronic device added to the device group is interconnected with the first electronic device for the first time, the device information of the second electronic device can be sent to the first electronic device, so that the first electronic device can be pre-stored in the device group. Device information of all the second electronic devices, wherein the device information includes a device identifier and/or a weight value.

具体的，可参见如下公式确定设备组内所有第二电子设备的第二权重值：Specifically, the second weight value of all the second electronic devices in the device group can be determined by referring to the following formula:

β＝γ₁×n₁+γ₂×n₂+…+γ_z×n_z 公式三β＝γ ₁ ×n ₁ +γ ₂ ×n ₂ +…+γ _z ×n _z Formula 3

o＝n₁+n₂+…+n_x公式四o=n ₁ +n ₂ +…+n _x Formula 4

式中，β表示设备组内所有第二电子设备的第二权重值；In the formula, β represents the second weight value of all second electronic devices in the device group;

γ_j表示第j种第二电子设备的权重值；γ _j represents the weight value of the jth second electronic device;

n_j表示第j种第二电子设备的设备总数；其中，j＝1,2,…,z，z为大于或者等于1的正整数；n _j represents the total number of devices of the jth second electronic device; wherein, j=1,2,...,z, z is a positive integer greater than or equal to 1;

z表示设备组内第二电子设备的设备类型的数量；z represents the number of device types of the second electronic device in the device group;

o表示设备组内所有第二电子设备的设备总数。o represents the total number of devices of all second electronic devices in the device group.

示例性的，仍以表2为例，可知：设备组中所有第二电子设备的设备总数为6，包括2个第二电子设备a、1个第二电子设备b以及3个第二电子设备c，其中，第二电子设备a的权重值为0.5，第二电子设备b的权重值为0.3，第二电子设备c的权重值为0.2，根据上述公式三，设备组内所有第二电子设备的第二权重值β＝0.5×2+0.3×1+0.2×3＝1.9。Exemplarily, still taking Table 2 as an example, it can be known that the total number of devices of all second electronic devices in the device group is 6, including 2 second electronic devices a, 1 second electronic device b and 3 second electronic devices c, where the weight value of the second electronic device a is 0.5, the weight value of the second electronic device b is 0.3, and the weight value of the second electronic device c is 0.2. According to the above formula 3, all the second electronic devices in the device group The second weight value β=0.5×2+0.3×1+0.2×3=1.9.

步骤303、基于第一权重值和第二权重值，调整第一阈值。Step 303: Adjust the first threshold based on the first weight value and the second weight value.

其中，第一阈值为允许第一电子设备被唤醒的阈值(即第一电子设备的预设唤醒阈值)。The first threshold is a threshold that allows the first electronic device to be woken up (ie, a preset wake-up threshold of the first electronic device).

在一种具体的实现方式中，第一电子设备根据第一权重值、第二权重值以及最大阈值调整参数，确定第一电子设备的唤醒阈值的调整参数，可通过如下公式确定调整参数：In a specific implementation manner, the first electronic device determines the adjustment parameter of the wake-up threshold of the first electronic device according to the first weight value, the second weight value and the maximum threshold adjustment parameter, and the adjustment parameter can be determined by the following formula:

θ＝α/β 公式五θ=α/β Formula 5

Δ′＝θ×Δ 公式六Δ′=θ×Δ Formula 6

式中，θ表示设备组内允许被唤醒的第二电子设备占所有第二电子设备的权重比例；In the formula, θ represents the weight ratio of the second electronic devices allowed to be woken up in the device group to all the second electronic devices;

Δ表示最大阈值调整参数，该参数为预设值，通常设置在0至1之间，例如Δ＝0.1。Δ represents the maximum threshold adjustment parameter, which is a preset value, usually set between 0 and 1, for example, Δ=0.1.

Δ′表示第一电子设备的唤醒阈值的调整参数。Δ' represents an adjustment parameter of the wake-up threshold of the first electronic device.

第一电子设备基于上述调整参数调整第一电子设备的预设唤醒阈值，可通过如下公式确定调整后的第一电子设备的唤醒阈值：The first electronic device adjusts the preset wake-up threshold of the first electronic device based on the above adjustment parameters, and the adjusted wake-up threshold of the first electronic device can be determined by the following formula:

Thredhold_current＝Thredhold-Δ′ 公式七Thredhold _current =Thredhold-Δ′ Equation 7

式中，Thredhold_current表示调整后的第一电子设备的唤醒阈值(即调整后的第一阈值)；In the formula, Thredhold _current represents the adjusted wake-up threshold of the first electronic device (that is, the adjusted first threshold);

Thredhold表示第一电子设备的预设唤醒阈值(即第一阈值)。Thredhold represents a preset wake-up threshold (ie, a first threshold) of the first electronic device.

可见，本实施例是通过设备的权重值计算，确定唤醒阈值的调整参数，调整参数的精确度高，相邻时刻的调整参数通常为连续变化的。It can be seen that in this embodiment, the adjustment parameter of the wake-up threshold is determined by calculating the weight value of the device, the adjustment parameter has high accuracy, and the adjustment parameter at adjacent moments is usually continuously changed.

步骤304、根据第一唤醒置信度和调整后的第一阈值，确定允许或者禁止第一电子设备被唤醒。Step 304: Determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the adjusted first threshold.

具体的，通过比较第一唤醒置信度与调整后的第一阈值的大小关系，确定允许或禁止第一电子设备被唤醒。若第一电子设备计算得到的第一唤醒置信度大于或者等于调整后的第一阈值，则确定允许第一电子设备被唤醒；若第一唤醒置信度小于调整后的第一阈值，则确定禁止第一电子设备被唤醒。Specifically, by comparing the magnitude relationship between the first wake-up confidence level and the adjusted first threshold, it is determined whether to allow or prohibit the first electronic device from being woken up. If the first wake-up confidence calculated by the first electronic device is greater than or equal to the adjusted first threshold, it is determined that the first electronic device is allowed to wake up; if the first wake-up confidence is less than the adjusted first threshold, it is determined to prohibit The first electronic device is awakened.

图11示出了又一种语音唤醒方法的判断流程图，如图11所示，若第一电子设备计算得到的第一唤醒置信度小于第一阈值，该方法包括如下步骤：FIG. 11 shows a judgment flowchart of another voice wake-up method. As shown in FIG. 11 , if the first wake-up confidence level calculated by the first electronic device is less than the first threshold, the method includes the following steps:

步骤401、根据至少一个第二电子设备发送的唤醒识别结果，统计设备组内允许被唤醒的第二电子设备的设备数量。Step 401 , according to the wake-up identification result sent by at least one second electronic device, count the number of devices of the second electronic device allowed to be woken up in the device group.

在本实施例中，至少一个第二电子设备发送的唤醒识别结果包括至少一个第二电子设备的唤醒标识和/或第二唤醒置信度。第一电子设备首先根据至少一个第二电子设备发送的唤醒识别结果，统计设备组内允许被唤醒的第二电子设备有哪些，从而确定设备组内允许被唤醒的第二电子设备的设备数量，具体可参见上述实施例的步骤201，此处不再赘述。In this embodiment, the wake-up identification result sent by the at least one second electronic device includes the wake-up identification and/or the second wake-up confidence level of the at least one second electronic device. The first electronic device firstly counts which second electronic devices are allowed to be woken up in the device group according to the wake-up identification result sent by at least one second electronic device, thereby determining the number of devices of the second electronic device allowed to be woken up in the device group, For details, reference may be made to step 201 in the foregoing embodiment, which is not repeated here.

步骤402、根据唤醒阈值调整表确定设备数量对应的阈值调整参数。Step 402: Determine threshold adjustment parameters corresponding to the number of devices according to the wake-up threshold adjustment table.

本实施例中，第一电子设备的存储器(例如内部存储器121、或者处理器110中的存储器等)中预存有唤醒阈值调整表。In this embodiment, a wake-up threshold adjustment table is pre-stored in the memory of the first electronic device (for example, the internal memory 121, or the memory in the processor 110, etc.).

在一种可能的实现方式中，唤醒阈值调整表可以包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系。作为一种示例，唤醒阈值调整表可以包括允许被唤醒的第二电子设备的设备数量的数值范围与阈值调整参数的对应关系。示例性的，假设设备组内除第一电子设备之外的第二电子设备的总数为7，第一电子设备统计得到的允许被唤醒的第二电子设备的设备数量可能落入表3所示的任意一个数值范围内，第一电子设备可根据表3所示的唤醒阈值调整表，确定本次的阈值调整参数。In a possible implementation manner, the wake-up threshold adjustment table may include a correspondence between the number of devices of the second electronic device that are allowed to be woken up and the threshold adjustment parameters. As an example, the wake-up threshold adjustment table may include a corresponding relationship between a numerical range of the number of devices of the second electronic device that is allowed to be woken up and a threshold adjustment parameter. Exemplarily, assuming that the total number of second electronic devices other than the first electronic device in the device group is 7, the number of devices of the second electronic device that is allowed to be woken up obtained from the statistics of the first electronic device may fall within the values shown in Table 3. Within any value range of , the first electronic device may determine the current threshold adjustment parameter according to the wake-up threshold adjustment table shown in Table 3.

表3table 3

由表3可知，允许被唤醒的第二电子设备的设备数量越多，阈值调整参数越大。It can be seen from Table 3 that the more the number of devices of the second electronic device that is allowed to be woken up, the larger the threshold adjustment parameter.

在另一种可能的实现方式中，唤醒阈值调整表可以包括允许被唤醒的第二电子设备的设备数量占所有第二电子设备的设备总数的比例，与，阈值调整参数的对应关系。应理解，该比例越大，阈值调整参数越大。In another possible implementation manner, the wake-up threshold adjustment table may include a ratio of the number of devices of the second electronic device allowed to be woken up to the total number of devices of all second electronic devices, and a corresponding relationship with the threshold adjustment parameter. It should be understood that the larger the ratio, the larger the threshold adjustment parameter.

可见，本实施例是通过预设的唤醒阈值调整表，确定阈值调整参数，各个阈值调整参数均为离散的数值。It can be seen that, in this embodiment, the threshold adjustment parameters are determined through a preset wake-up threshold adjustment table, and each threshold adjustment parameter is a discrete value.

步骤403、基于阈值调整参数调整第一阈值。Step 403: Adjust the first threshold based on the threshold adjustment parameter.

本实施例对第一电子设备的预设唤醒阈值的调整方式同上述实施例的步骤303，在确定阈值调整参数后，可采用公式七，减小第一电子设备的预设唤醒阈值。The method for adjusting the preset wake-up threshold of the first electronic device in this embodiment is the same as step 303 in the above embodiment. After determining the threshold adjustment parameters, formula 7 can be used to reduce the preset wake-up threshold of the first electronic device.

步骤404、根据第一唤醒置信度和调整后的第一阈值，确定允许或禁止第一电子设备被唤醒。Step 404: Determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold.

本实施例的步骤404与上述实施例的步骤304相同，具体可参见上述实施例，此处不再赘述。Step 404 in this embodiment is the same as step 304 in the foregoing embodiment. For details, reference may be made to the foregoing embodiment, and details are not repeated here.

上述方案同样从唤醒阈值的角度出发，结合设备组内从设备的实际唤醒情况，统计允许被唤醒的从设备的设备数量或占比，进而基于离散方式动态地调整设备组内主设备的唤醒阈值，并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果，判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态，提升了设备组内主设备唤醒的准确性。The above solution also starts from the wake-up threshold, combined with the actual wake-up situation of the slave devices in the device group, counts the number or proportion of the slave devices that are allowed to be woken up, and then dynamically adjusts the wake-up threshold of the master device in the device group based on a discrete method. , and according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device, it is determined whether the current master device needs to make a wake-up response. Since the adjusted wake-up threshold of the master device is more in line with the actual state of wake-up of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.

上文中详细描述了本申请实施例提供的语音唤醒方法，下面将描述本申请实施例提供的语音唤醒设备。图12为本申请实施例提供的一种语音唤醒设备的结构示意图。如图12所示，本申请实施例提供的语音唤醒设备1200，包括：The voice wake-up method provided by the embodiments of the present application is described in detail above, and the voice wake-up device provided by the embodiments of the present application will be described below. FIG. 12 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the present application. As shown in FIG. 12 , the voice wake-up device 1200 provided by this embodiment of the present application includes:

获取模块1201，用于获取音频数据的第一唤醒置信度，所述第一唤醒置信度用于指示所述第一电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度；The obtaining module 1201 is configured to obtain a first wake-up confidence level of the audio data, where the first wake-up confidence level is used to indicate that the wake-up word in the audio data determined by the first electronic device is similar to the acoustic feature of the preset wake-up word Spend;

接收模块1202，用于接收所述至少一个第二电子设备发送的唤醒识别结果，所述唤醒识别结果用于指示允许或禁止所述至少一个第二电子设备被唤醒；a receiving module 1202, configured to receive a wake-up identification result sent by the at least one second electronic device, where the wake-up identification result is used to indicate allowing or prohibiting the at least one second electronic device from being woken up;

处理模块1203，用于根据所述第一唤醒置信度和所述唤醒识别结果，确定允许或者禁止所述第一电子设备被唤醒。The processing module 1203 is configured to determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result.

可选的，所述唤醒识别结果包括所述至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项；其中，唤醒标识包括允许或禁止所述第二电子设备被唤醒的标识，所述第二唤醒置信度用于指示所述第二电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification and a second wake-up confidence of the at least one second electronic device; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up. , the second wake-up confidence level is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.

可选的，所述处理模块1203，具体用于：Optionally, the processing module 1203 is specifically used for:

若所述第一唤醒置信度大于或等于第一阈值，确定允许所述第一电子设备被唤醒；或者If the first wake-up confidence is greater than or equal to a first threshold, determining that the first electronic device is allowed to wake up; or

若所述第一唤醒置信度小于所述第一阈值，且所述第一唤醒置信度大于第二阈值，根据所述第一唤醒置信度和所述唤醒识别结果，确定允许或者禁止所述第一电子设备被唤醒；或者If the first wake-up confidence level is less than the first threshold, and the first wake-up confidence level is greater than the second threshold, determine whether to allow or prohibit the first wake-up confidence level according to the first wake-up confidence level and the wake-up identification result. an electronic device is awakened; or

若所述第一唤醒置信度小于或等于所述第二阈值，确定禁止所述第一电子设备被唤醒。If the first wake-up confidence level is less than or equal to the second threshold, it is determined that the first electronic device is prohibited from being woken up.

根据所述唤醒识别结果统计所述至少一个第二电子设备的唤醒情况；Statistics on the wake-up situation of the at least one second electronic device according to the wake-up identification result;

若所述唤醒情况满足预设的第一电子设备的唤醒条件，确定允许所述第一电子设备被唤醒。If the wake-up condition satisfies a preset wake-up condition of the first electronic device, it is determined that the first electronic device is allowed to wake up.

可选的，所述唤醒条件包括如下的任意一项：Optionally, the wake-up condition includes any of the following:

所述设备组中除所述第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值；The second wake-up confidences of all second electronic devices in the device group except the first electronic device are greater than or equal to a third threshold;

所述设备组中允许被唤醒的第二电子设备的设备数量占所述设备组的设备总数的比例大于或等于第一比例；The ratio of the number of devices of the second electronic device allowed to be woken up in the device group to the total number of devices in the device group is greater than or equal to the first ratio;

所述设备组中允许被唤醒的第二电子设备的设备数量占所述设备组设备总数的比例小于所述第一比例且大于第二比例，且允许被唤醒的第二电子设备的第二唤醒置信度均大于或等于第三阈值；The ratio of the number of devices of the second electronic device that can be woken up in the device group to the total number of devices in the device group is smaller than the first ratio and greater than the second ratio, and the second wake-up of the second electronic device that is allowed to wake up is allowed The confidence levels are all greater than or equal to the third threshold;

其中，所述第三阈值为所述第一电子设备中预设的允许第二电子设备被唤醒的阈值。The third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up.

可选的，所述唤醒识别结果还包括所述至少一个第二电子设备的设备标识，所述设备标识用于指示所述第二电子设备的设备类型，确定所述第二电子设备的权重值。Optionally, the wake-up identification result further includes a device identifier of the at least one second electronic device, where the device identifier is used to indicate the device type of the second electronic device, and determine the weight value of the second electronic device. .

根据所述唤醒识别结果统计所述设备组内允许被唤醒的第二电子设备的第一权重值；Calculate the first weight value of the second electronic device that is allowed to be woken up in the device group according to the wake-up identification result;

统计所述设备组内所有第二电子设备的第二权重值；Counting second weight values of all second electronic devices in the device group;

基于所述第一权重值和所述第二权重值调整第一阈值；Adjusting a first threshold based on the first weight value and the second weight value;

根据所述第一唤醒置信度和调整后的第一阈值，确定允许或者禁止所述第一电子设备被唤醒；According to the first wake-up confidence level and the adjusted first threshold, determine whether to allow or prohibit the first electronic device from being woken up;

其中，所述第一阈值为允许所述第一电子设备被唤醒的阈值。Wherein, the first threshold is a threshold for allowing the first electronic device to be woken up.

可选的，所述第一权重值是根据所述设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量，以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的；所述第二权重值是根据所述设备组内所有第二电子设备的设备类型对应的设备数量，以及各设备类型对应的权重值确定的。Optionally, the first weight value is based on the number of devices corresponding to the device type of the second electronic device that is allowed to be woken up in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be woken up. The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.

将所述第一权重值与所述第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数，根据所述阈值调整参数调整所述第一阈值。The product of the ratio of the first weight value to the second weight value and the maximum threshold adjustment parameter is used as a threshold adjustment parameter, and the first threshold is adjusted according to the threshold adjustment parameter.

根据所述唤醒识别结果统计所述设备组内允许被唤醒的第二电子设备的设备数量；Counting the number of devices of the second electronic device allowed to be woken up in the device group according to the wake-up identification result;

根据唤醒阈值调整表确定所述设备数量对应的阈值调整参数，所述唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系；Determine threshold adjustment parameters corresponding to the number of devices according to a wake-up threshold adjustment table, where the wake-up threshold adjustment table includes a correspondence between the number of devices of the second electronic device that is allowed to be woken up and the threshold adjustment parameters;

基于所述阈值调整参数调整所述第一阈值；Adjusting the first threshold based on the threshold adjustment parameter;

所述根据所述第一唤醒置信度和调整后的第一阈值，确定允许或禁止所述第一电子设备被唤醒。and determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold.

若所述第一唤醒置信度大于或等于所述调整后的第一阈值，确定允许所述第一电子设备被唤醒；或者If the first wake-up confidence level is greater than or equal to the adjusted first threshold, determining that the first electronic device is allowed to wake up; or

若所述第一唤醒置信度小于所述调整后的第一阈值，确定禁止所述第一电子设备被唤醒。If the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is prohibited from being woken up.

本申请实施例提供的语音唤醒设备，用于执行前述图7、图8a、8b、图9所示方法实施例中的第一电子设备执行的技术方案，其实现原理和技术效果类似，在此不再赘述。The voice wake-up device provided in this embodiment of the present application is used to execute the technical solution executed by the first electronic device in the method embodiments shown in FIG. 7 , FIG. 8 a , 8 b , and FIG. 9 . No longer.

图13为本申请实施例提供的一种语音唤醒设备的结构示意图。如图13所示，本申请实施例提供的语音唤醒设备1300，包括：FIG. 13 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the present application. As shown in FIG. 13 , the voice wake-up device 1300 provided by this embodiment of the present application includes:

获取模块1301，用于获取音频数据的第一唤醒置信度，所述第一唤醒置信度用于指示所述第一电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度；The acquiring module 1301 is configured to acquire the first wake-up confidence level of the audio data, where the first wake-up confidence level is used to indicate that the wake-up word in the audio data determined by the first electronic device is similar to the acoustic feature of the preset wake-up word Spend;

接收模块1302，用于接收所述至少一个第二电子设备发送的唤醒识别结果，所述唤醒识别结果用于指示允许或禁止所述第二电子设备被唤醒；a receiving module 1302, configured to receive a wake-up identification result sent by the at least one second electronic device, where the wake-up identification result is used to indicate allowing or prohibiting the second electronic device from being woken up;

处理模块1303，用于根据所述唤醒识别结果调整第一阈值，所述第一阈值为允许所述第一电子设备被唤醒的阈值；根据调整后的第一阈值和所述第一唤醒置信度，确定允许或禁止所述第一电子设备被唤醒。The processing module 1303 is configured to adjust a first threshold according to the wake-up identification result, where the first threshold is a threshold for allowing the first electronic device to be woken up; according to the adjusted first threshold and the first wake-up confidence , and determine whether to allow or prohibit the first electronic device from being woken up.

可选的，所述处理模块1303，具体用于：Optionally, the processing module 1303 is specifically used for:

基于所述第一权重值和所述第二权重值调整所述第一阈值。The first threshold is adjusted based on the first weight value and the second weight value.

基于所述阈值调整参数调整所述第一阈值。The first threshold is adjusted based on the threshold adjustment parameter.

本申请实施例提供的语音唤醒设备，用于执行前述图10、图11所示方法实施例中的第一电子设备执行的技术方案，其实现原理和技术效果类似，在此不再赘述。The voice wake-up device provided in this embodiment of the present application is used to execute the technical solution executed by the first electronic device in the method embodiments shown in FIG. 10 and FIG. 11 .

需要说明的是，应理解以上语音唤醒设备的各个模块的划分仅仅是一种逻辑功能的划分，实际实现时可以全部或部分集成到一个物理实体上，也可以物理上分开。且这些模块可以全部以软件通过处理元件调用的形式实现；也可以全部以硬件的形式实现；还可以部分模块通过处理元件调用软件的形式实现，部分模块通过硬件的形式实现。例如，处理模块可以为单独设立的处理元件，也可以集成在上述设备的某一个芯片中实现，此外，也可以以程序代码的形式存储于上述设备的存储器中，由上述设备的某一个处理元件调用并执行以上处理模块的功能。其它模块的实现与之类似。此外这些模块全部或部分可以集成在一起，也可以独立实现。这里所述的处理元件可以是一种集成电路，具有信号的处理能力。在实现过程中，上述方法的各步骤或以上各个模块可以通过处理器元件中的硬件的集成逻辑电路或者软件形式的指令完成。It should be noted that it should be understood that the above division of the modules of the voice wake-up device is only a division of logical functions, and may be fully or partially integrated into a physical entity in actual implementation, or may be physically separated. And these modules can all be implemented in the form of software calling through processing elements; they can also all be implemented in hardware; some modules can also be implemented in the form of calling software through processing elements, and some modules can be implemented in hardware. For example, the processing module may be a separately established processing element, or it may be integrated into a certain chip of the above-mentioned device, and it may also be stored in the memory of the above-mentioned device in the form of program code, and a certain processing element of the above-mentioned device Call and execute the function of the above processing module. The implementation of other modules is similar. In addition, all or part of these modules can be integrated together, and can also be implemented independently. The processing element described here may be an integrated circuit with signal processing capability. In the implementation process, each step of the above-mentioned method or each of the above-mentioned modules can be completed by an integrated logic circuit of hardware in the processor element or an instruction in the form of software.

例如，以上这些模块可以是被配置成实施以上方法的一个或多个集成电路，例如：一个或多个特定集成电路(application specific integrated circuit，ASIC)，或，一个或多个微处理器(digital signal processor，DSP)，或，一个或者多个现场可编程门阵列(field programmable gate array，FPGA)等。再如，当以上某个模块通过处理元件调度程序代码的形式实现时，该处理元件可以是通用处理器，例如中央处理器(centralprocessing unit，CPU)或其它可以调用程序代码的处理器。再如，这些模块可以集成在一起，以片上系统(system-on-a-chip，SOC)的形式实现。For example, the above modules may be one or more integrated circuits configured to implement the above methods, such as: one or more application specific integrated circuits (ASIC), or one or more digital microprocessors (digital) signal processor, DSP), or, one or more field programmable gate array (field programmable gate array, FPGA) and so on. For another example, when one of the above modules is implemented in the form of a processing element scheduling program code, the processing element may be a general-purpose processor, such as a central processing unit (CPU) or other processors that can invoke program codes. For another example, these modules can be integrated together and implemented in the form of a system-on-a-chip (SOC).

图14为本申请实施例提供的一种语音唤醒设备的硬件结构示意图。如图14所示，本申请实施例提供的语音唤醒设备1400，可以包括：FIG. 14 is a schematic diagram of a hardware structure of a voice wake-up device according to an embodiment of the present application. As shown in FIG. 14 , the voice wake-up device 1400 provided by this embodiment of the present application may include:

处理器1401、存储器1402和通信接口1403。其中，存储器1402，用于存储计算机程序；处理器1401，用于执行存储器1402存储的计算机程序，以实现上述任一方法实施例中第一电子设备所执行的方法。通信接口1403，用于与至少一个第二电子设备或者服务器进行数据通信或者信号通信。Processor 1401 , memory 1402 and communication interface 1403 . The memory 1402 is used to store computer programs; the processor 1401 is used to execute the computer programs stored in the memory 1402 to implement the method executed by the first electronic device in any of the above method embodiments. The communication interface 1403 is used for data communication or signal communication with at least one second electronic device or server.

可选地，存储器1402既可以是独立的，也可以跟处理器1401集成在一起。当所述存储器1402是独立于处理器1401之外的器件时，所述语音唤醒设备1400还可以包括：总线1404，用于连接所述存储器1402和处理器1401。Optionally, the memory 1402 may be independent or integrated with the processor 1401 . When the memory 1402 is a device independent of the processor 1401 , the voice wake-up device 1400 may further include: a bus 1404 for connecting the memory 1402 and the processor 1401 .

在一种可能的实施方式中，图12中的处理模块1203可以集成在处理器1401中实现，接收模块1202可以集成在通信接口1403中实现。图13中的处理模块1303可以集成在处理器1401中实现，接收模块1302可以集成在通信接口1403中实现。In a possible implementation manner, the processing module 1203 in FIG. 12 may be integrated in the processor 1401 and implemented, and the receiving module 1202 may be integrated in the communication interface 1403 and implemented. The processing module 1303 in FIG. 13 may be integrated in the processor 1401 and implemented, and the receiving module 1302 may be integrated in the communication interface 1403 and implemented.

在一种可能的实施方式中，处理器1401可用于实现上述方法实施例中第一电子设备的信息处理操作，通信接口1403可用于实现上述方法实施例中第一电子设备的信号收发操作。In a possible implementation manner, the processor 1401 can be used to implement the information processing operation of the first electronic device in the above method embodiment, and the communication interface 1403 can be used to implement the signal transceiving operation of the first electronic device in the above method embodiment.

本实施例提供的语音唤醒设备，可用于执行上述任一方法实施例中第一电子设备所执行的方法，其实现原理和技术效果类似，此处不再赘述。The voice wake-up device provided in this embodiment can be used to execute the method executed by the first electronic device in any of the above method embodiments, and its implementation principle and technical effect are similar, and details are not repeated here.

本申请实施例还提供一种计算机可读存储介质，所述计算机可读存储介质中存储有计算机执行指令，当所述计算机执行指令被处理器执行时用于实现前述任一方法实施例中第一电子设备的技术方案。Embodiments of the present application further provide a computer-readable storage medium, where computer-executable instructions are stored in the computer-readable storage medium, and when the computer-executable instructions are executed by a processor, are used to implement the first method in any of the foregoing method embodiments. A technical solution for electronic equipment.

本申请实施例还提供一种程序，当该程序被处理器执行时，用于执行前述任一方法实施例中第一电子设备的技术方案。The embodiments of the present application further provide a program, which, when the program is executed by the processor, is used to execute the technical solution of the first electronic device in any of the foregoing method embodiments.

本申请实施例还提供一种计算机程序产品，包括程序指令，程序指令用于实现前述任一方法实施例中第一电子设备的技术方案。Embodiments of the present application further provide a computer program product, including program instructions, where the program instructions are used to implement the technical solution of the first electronic device in any of the foregoing method embodiments.

本申请实施例还提供了一种芯片，包括：处理模块与通信接口，该处理模块能执行前述方法实施例中第一电子设备的技术方案。Embodiments of the present application further provide a chip, including: a processing module and a communication interface, where the processing module can execute the technical solutions of the first electronic device in the foregoing method embodiments.

进一步地，该芯片还包括存储模块(如，存储器)，存储模块用于存储指令，处理模块用于执行存储模块存储的指令，并且对存储模块中存储的指令的执行使得处理模块执行第一电子设备的技术方案。Further, the chip also includes a storage module (eg, memory), the storage module is used for storing instructions, the processing module is used for executing the instructions stored in the storage module, and the execution of the instructions stored in the storage module causes the processing module to execute the first electronic The technical solution of the equipment.

在上述实施例中，可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时，可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时，全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中，或者从一个计算机可读存储介质向另一个计算机可读存储介质传输，例如，所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质，(例如，软盘、硬盘、磁带)、光介质(例如，DVD)、或者半导体介质(例如固态硬盘solid state disk(SSD))等。In the above-mentioned embodiments, it may be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented in software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of the present application are generated. The computer may be a general purpose computer, special purpose computer, computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be downloaded from a website site, computer, server, or data center Transmission to another website site, computer, server, or data center is by wire (eg, coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (eg, infrared, wireless, microwave, etc.). The computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media. The usable media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, DVDs), or semiconductor media (eg, solid state disks (SSDs)), and the like.

本申请中，“至少一个”是指一个或者多个，“多个”是指两个或两个以上。“和/或”，描述关联对象的关联关系，表示可以存在三种关系，例如，A和/或B，可以表示：单独存在A，同时存在A和B，单独存在B的情况，其中A，B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系；在公式中，字符“/”，表示前后关联对象是一种“相除”的关系。“以下至少一项(个)”或其类似表达，是指的这些项中的任意组合，包括单项(个)或复数项(个)的任意组合。例如，a，b，或c中的至少一项(个)，可以表示：a，b，c，a-b，a-c，b-c，或a-b-c，其中，a，b，c可以是单个，也可以是多个。In this application, "at least one" means one or more, and "plurality" means two or more. "And/or", which describes the association relationship of the associated objects, indicates that there can be three kinds of relationships, for example, A and/or B, which can indicate: the existence of A alone, the existence of A and B at the same time, and the existence of B alone, where A, B can be singular or plural. The character "/" generally indicates that the related objects before and after are an "or" relationship; in the formula, the character "/" indicates that the related objects are a "division" relationship. "At least one item(s) below" or similar expressions thereof refer to any combination of these items, including any combination of single item(s) or plural items(s). For example, at least one item (a) of a, b, or c can represent: a, b, c, a-b, a-c, b-c, or a-b-c, where a, b, c can be single or multiple indivual.

可以理解的是，在本申请的实施例中涉及的各种数字编号仅为描述方便进行的区分，并不用来限制本申请的实施例的范围。It can be understood that, the various numbers and numbers involved in the embodiments of the present application are only for the convenience of description, and are not used to limit the scope of the embodiments of the present application.

可以理解的是，在本申请的实施例中，上述各过程的序号的大小并不意味着执行顺序的先后，各过程的执行顺序应以其功能和内在逻辑确定，而不应对本申请的实施例的实施过程构成任何限定。It can be understood that, in the embodiments of the present application, the size of the sequence numbers of the above-mentioned processes does not imply the order of execution, and the execution order of each process should be determined by its functions and internal logic, rather than the implementation of the present application. The implementation of the examples constitutes no limitation.

Claims

1. A voice wake-up method is applied to a first electronic device, wherein the first electronic device and at least one second electronic device belong to the same device group, and the method comprises the following steps:

acquiring a first awakening confidence level of audio data, wherein the first awakening confidence level is used for indicating the acoustic feature similarity of an awakening word in the audio data determined by the first electronic device and a preset awakening word;

receiving a wake-up identification result sent by the at least one second electronic device, wherein the wake-up identification result is used for indicating that the at least one second electronic device is allowed or prohibited to be woken up;

and determining to allow or prohibit the first electronic equipment to be awakened according to the first awakening confidence level and the awakening identification result.

2. The method of claim 1, wherein the wake up recognition result comprises at least one of a wake up identity of the at least one second electronic device, a second wake up confidence;

the awakening identifier comprises an identifier which allows or prohibits the second electronic device to be awakened, and the second awakening confidence coefficient is used for indicating the acoustic feature similarity between the awakening word in the audio data determined by the second electronic device and a preset awakening word.

3. The method according to claim 1 or 2, wherein the determining to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up recognition result comprises:

if the first awakening confidence coefficient is larger than or equal to a first threshold value, determining that the first electronic equipment is allowed to be awakened; or

If the first awakening confidence degree is smaller than the first threshold value and the first awakening confidence degree is larger than a second threshold value, determining to allow or prohibit the first electronic equipment to be awakened according to the first awakening confidence degree and the awakening identification result; or

And if the first awakening confidence coefficient is smaller than or equal to the second threshold value, determining to prohibit the first electronic equipment from being awakened.

4. The method of claim 3, wherein the determining to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up recognition result comprises:

counting the awakening condition of the at least one second electronic device according to the awakening identification result;

and if the awakening condition meets the preset awakening condition of the first electronic equipment, determining that the first electronic equipment is allowed to be awakened.

5. The method of claim 4, wherein the wake-up condition comprises any one of:

the second awakening confidence degrees of all second electronic equipment except the first electronic equipment in the equipment group are greater than or equal to a third threshold;

the proportion of the number of the devices of the second electronic device which is allowed to be awakened in the device group to the total number of the devices of the device group is greater than or equal to a first proportion;

the proportion of the number of the devices of the second electronic device allowed to be awakened in the device group to the total number of the devices of the device group is smaller than the first proportion and larger than a second proportion, and second awakening confidences of the second electronic devices allowed to be awakened are all larger than or equal to a third threshold;

the third threshold is a preset threshold allowing the second electronic device to be awakened in the first electronic device.

6. The method of claim 2, wherein the wake up recognition result further comprises a device identifier of the at least one second electronic device, wherein the device identifier is used for indicating a device type of the second electronic device, and determining a weight value of the second electronic device.

7. The method of claim 6, wherein the determining to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up recognition result comprises:

counting a first weight value of a second electronic device which is allowed to be awakened in the device group according to the awakening identification result;

counting second weighted values of all second electronic equipment in the equipment group;

adjusting a first threshold based on the first weight value and the second weight value;

determining to allow or prohibit the first electronic device to be awakened according to the first awakening confidence level and the adjusted first threshold;

wherein the first threshold is a threshold that allows the first electronic device to be woken up.

8. The method according to claim 7, wherein the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device allowed to be woken in the device group and the weight values corresponding to the device types of the second electronic devices allowed to be woken; the second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group and the weight value corresponding to each device type.

9. The method of claim 7, wherein adjusting the first threshold based on the first and second weight values comprises:

and taking the product of the ratio of the first weight value to the second weight value and a maximum threshold value adjusting parameter as a threshold value adjusting parameter, and adjusting the first threshold value according to the threshold value adjusting parameter.

10. The method according to claim 1 or 2, wherein the determining to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up recognition result comprises:

counting the number of the second electronic equipment allowed to be awakened in the equipment group according to the awakening identification result;

determining a threshold adjustment parameter corresponding to the equipment number according to a wake-up threshold adjustment table, wherein the wake-up threshold adjustment table comprises a corresponding relation between the equipment number of the second electronic equipment allowed to be woken up and the threshold adjustment parameter;

adjusting a first threshold based on the threshold adjustment parameter;

and determining to allow or prohibit the first electronic equipment to be awakened according to the first awakening confidence and the adjusted first threshold.

11. The method according to claim 7 or 10, wherein the determining to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold comprises:

if the first awakening confidence is greater than or equal to the adjusted first threshold, determining that the first electronic device is allowed to be awakened; or

And if the first awakening confidence coefficient is smaller than the adjusted first threshold, determining to prohibit the first electronic device from being awakened.

12. A voice wake-up method is applied to a first electronic device, wherein the first electronic device and at least one second electronic device belong to the same device group, and the method comprises the following steps:

receiving a wake-up identification result sent by the at least one second electronic device, wherein the wake-up identification result is used for indicating that the second electronic device is allowed or prohibited to be woken up;

adjusting a first threshold value according to the awakening identification result, wherein the first threshold value is a threshold value for allowing the first electronic device to be awakened;

and determining to allow or prohibit the first electronic equipment to be awakened according to the adjusted first threshold and the first awakening confidence level.

13. The method of claim 12, wherein adjusting the first threshold according to the wake-up recognition result comprises:

adjusting the first threshold based on the first and second weight values.

14. The method according to claim 13, wherein the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device allowed to be woken in the device group, and the weight values corresponding to the device types of the second electronic devices allowed to be woken; the second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group and the weight value corresponding to each device type.

15. The method of claim 13, wherein adjusting the first threshold based on the first and second weight values comprises:

16. The method of claim 12, wherein adjusting the first threshold according to the wake-up recognition result comprises:

adjusting the first threshold based on the threshold adjustment parameter.

17. The method according to any of claims 12-16, wherein the wake up recognition result comprises at least one of a wake up identity of the at least one second electronic device, a second wake up confidence;

18. The method according to any of claims 12-16, wherein the determining to allow or prohibit the first electronic device from being woken up according to the adjusted first threshold and the first wake-up confidence level comprises:

19. The utility model provides a voice wake-up equipment, its characterized in that, voice wake-up equipment is first electronic equipment, first electronic equipment and at least one second electronic equipment belong to same equipment group, voice wake-up equipment includes:

the acquiring module is used for acquiring a first awakening confidence coefficient of the audio data, wherein the first awakening confidence coefficient is used for indicating the acoustic feature similarity of an awakening word in the audio data determined by the first electronic device and a preset awakening word;

a receiving module, configured to receive a wake-up identification result sent by the at least one second electronic device, where the wake-up identification result is used to indicate that the at least one second electronic device is allowed or prohibited to be woken up;

and the processing module is used for determining to allow or prohibit the first electronic equipment to be awakened according to the first awakening confidence coefficient and the awakening identification result.

20. The device of claim 19, wherein the wake up recognition result comprises at least one of a wake up identity of the at least one second electronic device, a second wake up confidence level;

21. The device according to claim 19 or 20, wherein the processing module is specifically configured to:

22. The device according to claim 21, wherein the processing module is specifically configured to:

23. The device of claim 22, wherein the wake-up condition comprises any one of:

24. The device of claim 20, wherein the wake up recognition result further comprises a device identifier of the at least one second electronic device, wherein the device identifier is used for indicating a device type of the second electronic device, and determining a weight value of the second electronic device.

25. The device according to claim 24, wherein the processing module is specifically configured to:

26. The device according to claim 25, wherein the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device allowed to be woken in the device group, and the weight values corresponding to the respective device types of the second electronic device allowed to be woken; the second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group and the weight value corresponding to each device type.

27. The device according to claim 25, wherein the processing module is specifically configured to:

28. The device according to claim 19 or 20, wherein the processing module is specifically configured to:

adjusting a first threshold based on the threshold adjustment parameter;

29. The device according to claim 25 or 28, wherein the processing module is specifically configured to:

30. The utility model provides a voice wake-up equipment, its characterized in that, voice wake-up equipment is first electronic equipment, first electronic equipment and at least one second electronic equipment belong to same equipment group, voice wake-up equipment includes:

a receiving module, configured to receive a wake-up identification result sent by the at least one second electronic device, where the wake-up identification result is used to indicate that the second electronic device is allowed or prohibited to be woken up;

the processing module is used for adjusting a first threshold value according to the awakening identification result, wherein the first threshold value is a threshold value allowing the first electronic device to be awakened; and determining to allow or prohibit the first electronic equipment to be awakened according to the adjusted first threshold and the first awakening confidence level.

31. The device according to claim 30, wherein the processing module is specifically configured to:

adjusting the first threshold based on the first and second weight values.

32. The device according to claim 31, wherein the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device allowed to be woken in the device group, and the weight values corresponding to the respective device types of the second electronic device allowed to be woken; the second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group and the weight value corresponding to each device type.

33. The device according to claim 31, wherein the processing module is specifically configured to:

34. The device according to claim 30, wherein the processing module is specifically configured to:

adjusting the first threshold based on the threshold adjustment parameter.

35. The device according to any of claims 30-34, wherein the wake up recognition result comprises at least one of a wake up identity of the at least one second electronic device, a second wake up confidence;

36. The device according to any one of claims 30 to 34, wherein the processing module is specifically configured to:

37. A voice wake-up device, comprising: a memory for storing a computer program and a processor for calling and executing the computer program from the memory, such that the processor executes the computer program to perform the method of any of claims 1 to 11, or the method of any of claims 12 to 18.

38. A storage medium, characterized in that the storage medium comprises a computer program for implementing the method according to any one of claims 1 to 11 or the method according to any one of claims 12 to 18.