CN111696562A - Voice wake-up method, device and storage medium - Google Patents
Voice wake-up method, device and storage medium Download PDFInfo
- Publication number
- CN111696562A CN111696562A CN202010353897.1A CN202010353897A CN111696562A CN 111696562 A CN111696562 A CN 111696562A CN 202010353897 A CN202010353897 A CN 202010353897A CN 111696562 A CN111696562 A CN 111696562A
- Authority
- CN
- China
- Prior art keywords
- wake
- electronic device
- threshold
- awakening
- electronic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
- G10L17/24—Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/08—Use of distortion metrics or a particular distance between probe pattern and reference templates
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Business, Economics & Management (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- Electric Clocks (AREA)
Abstract
本申请实施例提供一种语音唤醒方法、设备及存储介质,用于解决当前设备组中主设备唤醒率低的问题。该方法包括:主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中,根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果,确定主设备是否被唤醒,该方法提高了设备组中主设备被唤醒的准确性。进一步地,主设备还可以采用连续或者离散方式,动态调整设备组内主设备的预设唤醒阈值,并根据调整后的唤醒阈值确定主设备是否被唤醒,动态调整后的唤醒阈值更加契合整个设备组唤醒的实际状态,主设备基于动态调整后的唤醒阈值进行唤醒决策,唤醒设备的准确性更高。
Embodiments of the present application provide a voice wake-up method, a device, and a storage medium, which are used to solve the problem of a low wake-up rate of a master device in a current device group. The method includes: the master device introduces the wake-up identification results of other slave devices in the device group into the wake-up decision of the master device, and determines the master device according to the wake-up identification results of the master device itself and the wake-up identification results of other slave devices in the device group. Whether to wake up, this method improves the accuracy of the master device in the device group being woken up. Further, the master device can also dynamically adjust the preset wake-up threshold of the master device in the device group in a continuous or discrete manner, and determine whether the master device is woken up according to the adjusted wake-up threshold. The dynamically adjusted wake-up threshold is more suitable for the entire device. The actual state of group wake-up, the master device makes wake-up decisions based on the dynamically adjusted wake-up threshold, and the accuracy of wake-up devices is higher.
Description
技术领域technical field
本申请涉及终端技术领域,尤其涉及一种语音唤醒方法、设备及存储介质。The present application relates to the technical field of terminals, and in particular, to a voice wake-up method, device and storage medium.
背景技术Background technique
随着智能语音交互的兴起,越来越多的设备支持语音交互功能。其中,语音唤醒作为语音交互的开始,在不同设备中应用广泛,例如智能音箱、智能电视等。当用户所处空间存在支持语音唤醒的设备,用户发出唤醒语音后,这些被唤醒的设备会同时响应说话人的请求,同时与用户进行交互。此时,会给用户造成困惑,不知道与哪一个设备进行语音交互。With the rise of intelligent voice interaction, more and more devices support the function of voice interaction. Among them, voice wake-up, as the beginning of voice interaction, is widely used in different devices, such as smart speakers and smart TVs. When there are devices that support voice wake-up in the space where the user is located, after the user sends a wake-up voice, these wake-up devices will respond to the speaker's request and interact with the user at the same time. At this time, it will cause confusion to the user, not knowing which device to perform voice interaction with.
目前针对上述场景,可以将能够同时被用户唤醒的这些设备组成一个设备组,设备组内只有主设备对用户的唤醒进行应答响应,其他从设备配合主设备处理用户指令意图,从而避免多设备同时被唤醒。At present, for the above scenarios, these devices that can be awakened by the user at the same time can be grouped into a device group. In the device group, only the master device responds to the user's wake-up, and other slave devices cooperate with the master device to process the user's command intent, thereby avoiding multiple devices at the same time. be awakened.
然而,设备组内有多个可唤醒设备,只依靠主设备的唤醒结果作为整个设备组的唤醒结果,唤醒的准确性不高,例如,用户与主设备相距较远,或者,传输路径上存在外部干扰等,都会降低主设备的唤醒率。However, there are multiple wake-up devices in the device group, and only the wake-up result of the master device is used as the wake-up result of the entire device group, and the wake-up accuracy is not high, for example, the user is far away from the master device, or there are External interference, etc., will reduce the wake-up rate of the master device.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供一种语音唤醒方法、设备及存储介质,提高设备组中主设备的唤醒率。Embodiments of the present application provide a voice wake-up method, a device, and a storage medium, which improve the wake-up rate of a master device in a device group.
第一方面,本申请实施例提供一种语音唤醒方法,该方法应用于第一电子设备,第一电子设备和至少一个第二电子设备属于同一设备组,该方法包括:获取音频数据的第一唤醒置信度,第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度;接收至少一个第二电子设备发送的唤醒识别结果,唤醒识别结果用于指示允许或禁止至少一个第二电子设备被唤醒;根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒。In a first aspect, an embodiment of the present application provides a voice wake-up method. The method is applied to a first electronic device. The first electronic device and at least one second electronic device belong to the same device group. The method includes: acquiring a first electronic device of audio data. Wake-up confidence, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; receive the wake-up recognition result sent by at least one second electronic device, and the wake-up recognition result It is used to indicate that at least one second electronic device is allowed or prohibited to be woken up; according to the first wake-up confidence level and the wake-up identification result, it is determined to allow or prohibit the first electronic device from being woken up.
其中,第一电子设备为设备组中的主设备,第二电子设备为设备组中的从设备。The first electronic device is a master device in the device group, and the second electronic device is a slave device in the device group.
上述方案中,主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中,根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果,确定主设备是否被唤醒。该方案能够快速对设备组内主设备是否执行唤醒应答进行判断,充分利用了设备组内其他设备的唤醒识别结果,提升了设备组内主设备唤醒的准确性。In the above scheme, the master device determines the master device according to the wake-up recognition results of the master device itself and the wake-up recognition results of other slave devices in the device group by introducing the wake-up recognition results of other slave devices in the device group into the master device's wake-up decision. whether to wake up. This solution can quickly determine whether the master device in the device group performs a wake-up response, makes full use of the wake-up recognition results of other devices in the device group, and improves the accuracy of the master device in the device group wake-up.
可选的,唤醒识别结果包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项;其中,唤醒标识包括允许或禁止第二电子设备被唤醒的标识,第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification of at least one second electronic device and a second wake-up confidence; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence It is used to indicate the similarity of the acoustic features between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
如果唤醒识别结果中仅包括至少一个第二电子设备的唤醒标识,第一电子设备可根据唤醒标识,统计允许被唤醒的第二电子设备的设备数量,以及该设备数量占设备组的设备总数的比例。If the wake-up identification result only includes the wake-up identifier of at least one second electronic device, the first electronic device may, according to the wake-up identifier, count the number of devices of the second electronic device that are allowed to be woken up, and the number of the devices in the total number of devices in the device group. Proportion.
如果唤醒识别结果中仅包括至少一个第二电子设备的第二唤醒置信度,第一电子设备首先需要根据第二唤醒置信度和各第二电子设备预设的唤醒阈值,确定第二电子设备是否允许被唤醒。然后统计允许被唤醒的第二电子设备的设备数量,以及该设备数量占设备组的设备总数的比例。If the wake-up identification result only includes the second wake-up confidence level of at least one second electronic device, the first electronic device first needs to determine whether the second electronic device is allowed to wake up. Then, count the number of devices of the second electronic device that are allowed to be woken up, and the proportion of the number of devices to the total number of devices in the device group.
如果唤醒标识结果中同时包括至少一个第二电子设备的唤醒标识和第二唤醒置信度,第一电子设备可通过上述任意一种统计方式,确定允许被唤醒的第二电子设备的设备数量,以及该设备数量占设备组的设备总数的比例。If the wake-up identification result includes both the wake-up identification and the second wake-up confidence level of at least one second electronic device, the first electronic device may determine the number of devices of the second electronic device that are allowed to be woken up through any one of the above statistical methods, and The proportion of this number of devices to the total number of devices in the device group.
在一种可能的设计中,根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒,包括:若第一唤醒置信度大于或等于第一阈值,确定允许第一电子设备被唤醒;或者若第一唤醒置信度小于第一阈值,且第一唤醒置信度大于第二阈值,根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒;或者若第一唤醒置信度小于或等于第二阈值,确定禁止第一电子设备被唤醒。In a possible design, according to the first wake-up confidence level and the wake-up identification result, determining to allow or prohibit the first electronic device from being awakened includes: if the first wake-up confidence level is greater than or equal to a first threshold, determining to allow the first electronic device to be awakened The device is woken up; or if the first wake-up confidence is less than the first threshold and the first wake-up confidence is greater than the second threshold, according to the first wake-up confidence and the wake-up identification result, it is determined to allow or prohibit the first electronic device from being woken up; or If the first wake-up confidence level is less than or equal to the second threshold, it is determined that the first electronic device is prohibited from being woken up.
上述方案提供了一种基于预设的主设备唤醒条件,通过比较主设备计算得到的第一唤醒置信度与第一阈值、第二阈值的大小关系,确定允许或禁止主设备被唤醒。若第一唤醒置信度介于第一阈值和第二阈值之间,可结合设备组中其他从设备发送的唤醒识别结果,确定允许或禁止主设备被唤醒。该方案能够快速对设备组内主设备是否执行唤醒应答进行判断,充分利用了设备组内其他设备的唤醒识别结果,提升了设备组内主设备唤醒的准确性。The above solution provides a method for determining whether to allow or prohibit the master device from being woken up by comparing the relationship between the first wake-up confidence calculated by the master device and the first threshold and the second threshold based on a preset master device wake-up condition. If the first wake-up confidence level is between the first threshold and the second threshold, it may be determined to allow or prohibit the master device from being woken up in combination with the wake-up identification results sent by other slave devices in the device group. This solution can quickly determine whether the master device in the device group performs a wake-up response, makes full use of the wake-up recognition results of other devices in the device group, and improves the accuracy of the master device in the device group wake-up.
在一种可能的设计中,根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒,包括:根据唤醒识别结果统计至少一个第二电子设备的唤醒情况;若唤醒情况满足预设的第一电子设备的唤醒条件,确定允许第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the wake-up recognition result includes: counting the wake-up conditions of at least one second electronic device according to the wake-up recognition result; If the preset wake-up condition of the first electronic device is satisfied, it is determined that the first electronic device is allowed to wake up.
可选的,唤醒条件包括如下的任意一项:设备组中除第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值;设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例大于或等于第一比例;设备组中允许被唤醒的第二电子设备的设备数量占设备组设备总数的比例小于第一比例且大于第二比例,且允许被唤醒的第二电子设备的第二唤醒置信度均大于或等于第三阈值;其中,第三阈值为第一电子设备中预设的允许第二电子设备被唤醒的阈值。Optionally, the wake-up condition includes any one of the following: the second wake-up confidence of all second electronic devices except the first electronic device in the device group is greater than or equal to the third threshold; the device group is allowed to be woken up The proportion of the number of devices of the second electronic device to the total number of devices in the device group is greater than or equal to the first proportion; the proportion of the number of devices of the second electronic device allowed to be awakened in the device group to the total number of devices in the device group is less than the first proportion and greater than The second ratio, and the second wake-up confidence of the second electronic device that is allowed to wake up is greater than or equal to the third threshold; wherein, the third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up .
需要说明的是,为了提高判断的准确性,可通过设置第三阈值,排查允许被唤醒的第二电子设备中是否存在第二唤醒置信度低于第三阈值的电子设备,如果存在,则确定禁止第一电子设备被唤醒。上述第三种唤醒条件可有效避免由于部分从设备设置的预设唤醒阈值过低,导致主设备被误唤醒,提升主设备唤醒的准确性。It should be noted that, in order to improve the accuracy of the judgment, a third threshold can be set to check whether there is an electronic device whose second wake-up confidence is lower than the third threshold in the second electronic device that is allowed to be woken up. The first electronic device is prohibited from being woken up. The above-mentioned third wake-up condition can effectively prevent the master device from being woken up by mistake because the preset wake-up thresholds set by some slave devices are too low, thereby improving the accuracy of the master device's wake-up.
上述方案中,第一电子设备通过至少一个第二电子设备发送的唤醒识别结果统计设备组内从设备的唤醒情况,判断从设备的唤醒情况是否满足预设的唤醒条件,如果满足预设的唤醒条件,则确定允许第一电子设备被唤醒,如果不满足预设的唤醒条件,则确定禁止第一电子设备被唤醒。该方案充分考虑了设备组内其他从设备的唤醒识别结果,对设备组内主设备的唤醒条件进行优化,提升设备组内主设备唤醒的准确性。In the above solution, the first electronic device counts the wake-up situation of the slave devices in the device group through the wake-up recognition result sent by at least one second electronic device, and judges whether the wake-up situation of the slave device satisfies the preset wake-up condition. If the preset wake-up condition is not satisfied, it is determined to prohibit the first electronic device from being woken up. The scheme fully considers the wake-up recognition results of other slave devices in the device group, optimizes the wake-up conditions of the master device in the device group, and improves the accuracy of the master device's wake-up in the device group.
可选的,唤醒识别结果还包括至少一个第二电子设备的设备标识,设备标识用于指示第二电子设备的设备类型,确定第二电子设备的权重值。Optionally, the wake-up identification result further includes a device identifier of at least one second electronic device, where the device identifier is used to indicate the device type of the second electronic device and determine the weight value of the second electronic device.
在一种可能的设计中,根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒,包括:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值;统计设备组内所有第二电子设备的第二权重值;基于第一权重值和第二权重值调整第一阈值;根据第一唤醒置信度和调整后的第一阈值,确定允许或者禁止第一电子设备被唤醒;其中,第一阈值为允许第一电子设备被唤醒的阈值。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result includes: according to the wake-up identification result, counting the number of the second electronic devices in the device group that are allowed to be woken up. a weight value; count the second weight values of all the second electronic devices in the device group; adjust the first threshold based on the first weight value and the second weight value; determine the allowable threshold according to the first wake-up confidence and the adjusted first threshold Or prohibit the first electronic device from being woken up; wherein, the first threshold is a threshold value that allows the first electronic device to be woken up.
上述方案引入了设备组内电子设备的权重值,权重值用于指示电子设备唤醒识别结果的置信度高低。权重值可以是与电子设备的设备类型相关的值,还可以是与电子设备的软/硬件性能相关的值。例如,智能电视的权重值为0.3,智能音箱的权重值为0.6,智能灯的权重值为0.1。又例如,音箱Pro的权重值为0.5,常规音箱的权重值为0.3,音箱mini的权重值为0.2。The above solution introduces the weight value of the electronic device in the device group, and the weight value is used to indicate the confidence level of the wake-up recognition result of the electronic device. The weight value may be a value related to the device type of the electronic device, and may also be a value related to the software/hardware performance of the electronic device. For example, a smart TV has a weight value of 0.3, a smart speaker has a weight value of 0.6, and a smart light has a weight value of 0.1. For another example, the weight value of the speaker Pro is 0.5, the weight value of the conventional speaker is 0.3, and the weight value of the speaker mini is 0.2.
上述方案从唤醒阈值的角度出发,结合设备组内从设备的实际唤醒情况,综合分析允许被唤醒的从设备的唤醒识别结果的置信度高低,进而基于连续方式动态地调整设备组内主设备的唤醒阈值,并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果,判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态,提升了设备组内主设备唤醒的准确性。From the perspective of the wake-up threshold, the above scheme comprehensively analyzes the confidence level of the wake-up recognition results of the slave devices that are allowed to be woken up, combined with the actual wake-up situation of the slave devices in the device group, and then dynamically adjusts the master device in the device group based on a continuous method. Wake-up threshold, and determine whether the current master device needs to make a wake-up response according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device. Since the adjusted wake-up threshold of the master device is more in line with the actual wake-up state of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.
可选的,第一权重值是根据设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量,以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的;第二权重值是根据设备组内所有第二电子设备的设备类型对应的设备数量,以及各设备类型对应的权重值确定的。Optionally, the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device that is allowed to be awakened in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be awakened; The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.
在一种可能的设计中,基于第一权重值和第二权重值调整第一阈值,包括:将第一权重值与第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数,根据阈值调整参数调整第一阈值。In a possible design, adjusting the first threshold based on the first weight value and the second weight value includes: using the product of the ratio of the first weight value and the second weight value and the maximum threshold adjustment parameter as the threshold adjustment parameter, according to The threshold adjustment parameter adjusts the first threshold.
在一种可能的设计中,根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒,包括:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量;根据唤醒阈值调整表确定设备数量对应的阈值调整参数,唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系;基于阈值调整参数调整第一阈值;根据第一唤醒置信度和调整后的第一阈值,确定允许或禁止第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result includes: counting the devices of the second electronic device in the device group that are allowed to be woken up according to the wake-up identification result Quantity; determine the threshold adjustment parameter corresponding to the number of devices according to the wake-up threshold adjustment table, the wake-up threshold adjustment table includes the corresponding relationship between the number of devices of the second electronic device that is allowed to be woken up and the threshold adjustment parameter; adjust the first threshold based on the threshold adjustment parameter; The first wake-up confidence and the adjusted first threshold determine whether to allow or prohibit the first electronic device from being woken up.
上述方案同样从唤醒阈值的角度出发,结合设备组内从设备的实际唤醒情况,统计允许被唤醒的从设备的设备数量或占比,进而基于离散方式动态地调整设备组内主设备的唤醒阈值,并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果,判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态,提升了设备组内主设备唤醒的准确性。The above solution also starts from the wake-up threshold, combined with the actual wake-up situation of the slave devices in the device group, counts the number or proportion of the slave devices that are allowed to be woken up, and then dynamically adjusts the wake-up threshold of the master device in the device group based on a discrete method. , and according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device, it is determined whether the current master device needs to make a wake-up response. Since the adjusted wake-up threshold of the master device is more in line with the actual wake-up state of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.
在一种可能的设计中,根据第一唤醒置信度和调整后的第一阈值,确定允许或者禁止第一电子设备被唤醒,包括:若第一唤醒置信度大于或等于调整后的第一阈值,确定允许第一电子设备被唤醒;或者若第一唤醒置信度小于调整后的第一阈值,确定禁止第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold includes: if the first wake-up confidence is greater than or equal to the adjusted first threshold , it is determined that the first electronic device is allowed to be woken up; or if the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is not allowed to be woken up.
第二方面,本申请实施例提供一种语音唤醒方法,该方法应用于第一电子设备,第一电子设备和至少一个第二电子设备属于同一设备组,该方法包括:获取音频数据的第一唤醒置信度,第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度;接收至少一个第二电子设备发送的唤醒识别结果,唤醒识别结果用于指示允许或禁止第二电子设备被唤醒;根据唤醒识别结果调整第一阈值,第一阈值为允许第一电子设备被唤醒的阈值;根据调整后的第一阈值和第一唤醒置信度,确定允许或禁止第一电子设备被唤醒。In a second aspect, an embodiment of the present application provides a voice wake-up method. The method is applied to a first electronic device. The first electronic device and at least one second electronic device belong to the same device group. The method includes: acquiring a first electronic device of audio data. Wake-up confidence, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; receive the wake-up recognition result sent by at least one second electronic device, and the wake-up recognition result It is used to indicate that the second electronic device is allowed or prohibited to be woken up; the first threshold is adjusted according to the wake-up recognition result, and the first threshold is the threshold that allows the first electronic device to be woken up; according to the adjusted first threshold and the first wake-up confidence, It is determined whether the first electronic device is allowed or prohibited from being woken up.
在一种可能的设计中,根据唤醒识别结果调整第一阈值,包括:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值;统计设备组内所有第二电子设备的第二权重值;基于第一权重值和第二权重值调整第一阈值。In a possible design, adjusting the first threshold value according to the wake-up recognition result includes: according to the wake-up recognition result, counting the first weights of the second electronic devices allowed to be woken up in the device group; counting all the second electronic devices in the device group The second weight value of ; adjust the first threshold value based on the first weight value and the second weight value.
可选的,第一权重值是根据设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量,以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的;第二权重值是根据设备组内所有第二电子设备的设备类型对应的设备数量,以及各设备类型对应的权重值确定的。Optionally, the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device that is allowed to be awakened in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be awakened; The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.
在一种可能的设计中,基于第一权重值和第二权重值调整第一阈值,包括:将第一权重值与第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数,根据阈值调整参数调整第一阈值。In a possible design, adjusting the first threshold based on the first weight value and the second weight value includes: using the product of the ratio of the first weight value and the second weight value and the maximum threshold adjustment parameter as the threshold adjustment parameter, according to The threshold adjustment parameter adjusts the first threshold.
在一种可能的设计中,根据唤醒识别结果调整第一阈值,包括:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量;根据唤醒阈值调整表确定设备数量对应的阈值调整参数,唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系;基于阈值调整参数调整第一阈值。In a possible design, adjusting the first threshold according to the wake-up recognition result includes: counting the number of devices of the second electronic device allowed to be woken up in the device group according to the wake-up recognition result; determining the threshold corresponding to the number of devices according to the wake-up threshold adjustment table Adjusting parameters, the wake-up threshold adjustment table includes the correspondence between the number of devices of the second electronic device that are allowed to be woken up and the threshold adjustment parameters; the first threshold is adjusted based on the threshold adjustment parameters.
可选的,唤醒识别结果包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项;其中,唤醒标识包括允许或禁止第二电子设备被唤醒的标识,第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification of at least one second electronic device and a second wake-up confidence; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence It is used to indicate the similarity of the acoustic features between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
在一种可能的设计中,根据调整后的第一阈值和第一唤醒置信度,确定允许或者禁止第一电子设备被唤醒,包括:若第一唤醒置信度大于或等于调整后的第一阈值,确定允许第一电子设备被唤醒;或者若第一唤醒置信度小于调整后的第一阈值,确定禁止第一电子设备被唤醒。In a possible design, determining whether to allow or prohibit the first electronic device from being woken up according to the adjusted first threshold and the first wake-up confidence includes: if the first wake-up confidence is greater than or equal to the adjusted first threshold , it is determined that the first electronic device is allowed to be woken up; or if the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is not allowed to be woken up.
第三方面,本申请实施例提供一种语音唤醒设备,该语音唤醒设备为第一电子设备,第一电子设备和至少一个第二电子设备属于同一设备组,该语音唤醒设备包括:获取模块,用于获取音频数据的第一唤醒置信度,第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度;接收模块,用于接收至少一个第二电子设备发送的唤醒识别结果,唤醒识别结果用于指示允许或禁止至少一个第二电子设备被唤醒;处理模块,用于根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒。In a third aspect, an embodiment of the present application provides a voice wake-up device, the voice wake-up device is a first electronic device, the first electronic device and at least one second electronic device belong to the same device group, the voice wake-up device includes: an acquisition module, For obtaining the first wake-up confidence of the audio data, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; the receiving module is used to receive at least one A wake-up identification result sent by the second electronic device, where the wake-up identification result is used to indicate that at least one second electronic device is allowed to wake up; the processing module is used to determine whether to allow or prohibit the first wake-up confidence level and the wake-up identification result according to the first wake-up confidence Electronic device wakes up.
可选的,唤醒识别结果包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项;其中,唤醒标识包括允许或禁止第二电子设备被唤醒的标识,第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification of at least one second electronic device and a second wake-up confidence; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence It is used to indicate the similarity of the acoustic features between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
可选的,处理模块,具体用于:若第一唤醒置信度大于或等于第一阈值,确定允许第一电子设备被唤醒;或者若第一唤醒置信度小于第一阈值,且第一唤醒置信度大于第二阈值,根据第一唤醒置信度和唤醒识别结果,确定允许或者禁止第一电子设备被唤醒;或者若第一唤醒置信度小于或等于第二阈值,确定禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: if the first wake-up confidence is greater than or equal to the first threshold, determine to allow the first electronic device to be awakened; or if the first wake-up confidence is less than the first threshold, and the first wake-up confidence is greater than the second threshold, according to the first wake-up confidence and the wake-up identification result, it is determined to allow or prohibit the first electronic device from being woken up; or if the first wake-up confidence is less than or equal to the second threshold, it is determined to prohibit the first electronic device from being woken up .
可选的,处理模块,具体用于:根据唤醒识别结果统计至少一个第二电子设备的唤醒情况;若唤醒情况满足预设的第一电子设备的唤醒条件,确定允许第一电子设备被唤醒。Optionally, the processing module is specifically configured to: count the wake-up situation of at least one second electronic device according to the wake-up identification result; if the wake-up situation meets the preset wake-up condition of the first electronic device, determine to allow the first electronic device to be woken up.
可选的,唤醒条件包括如下的任意一项:设备组中除第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值;设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例大于或等于第一比例;设备组中允许被唤醒的第二电子设备的设备数量占设备组设备总数的比例小于第一比例且大于第二比例,且允许被唤醒的第二电子设备的第二唤醒置信度均大于或等于第三阈值;其中,第三阈值为第一电子设备中预设的允许第二电子设备被唤醒的阈值。Optionally, the wake-up condition includes any one of the following: the second wake-up confidence of all second electronic devices except the first electronic device in the device group is greater than or equal to the third threshold; the device group is allowed to be woken up The proportion of the number of devices of the second electronic device to the total number of devices in the device group is greater than or equal to the first proportion; the proportion of the number of devices of the second electronic device allowed to be awakened in the device group to the total number of devices in the device group is less than the first proportion and greater than The second ratio, and the second wake-up confidence of the second electronic device that is allowed to wake up is greater than or equal to the third threshold; wherein, the third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up .
可选的,唤醒识别结果还包括至少一个第二电子设备的设备标识,设备标识用于指示第二电子设备的设备类型,确定第二电子设备的权重值。Optionally, the wake-up identification result further includes a device identifier of at least one second electronic device, where the device identifier is used to indicate the device type of the second electronic device and determine the weight value of the second electronic device.
可选的,处理模块,具体用于:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值;统计设备组内所有第二电子设备的第二权重值;基于第一权重值和第二权重值调整第一阈值;根据第一唤醒置信度和调整后的第一阈值,确定允许或者禁止第一电子设备被唤醒;其中,第一阈值为允许第一电子设备被唤醒的阈值。Optionally, the processing module is specifically configured to: count the first weight values of the second electronic devices allowed to be woken up in the device group according to the wake-up identification result; count the second weight values of all the second electronic devices in the device group; A weight value and a second weight value adjust the first threshold; according to the first wake-up confidence and the adjusted first threshold, determine whether to allow or prohibit the first electronic device from being woken up; wherein, the first threshold is to allow the first electronic device to be woken up Wake-up threshold.
可选的,第一权重值是根据设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量,以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的;第二权重值是根据设备组内所有第二电子设备的设备类型对应的设备数量,以及各设备类型对应的权重值确定的。Optionally, the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device that is allowed to be awakened in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be awakened; The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.
可选的,处理模块,具体用于:将第一权重值与第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数,根据阈值调整参数调整第一阈值。Optionally, the processing module is specifically configured to: use the product of the ratio of the first weight value to the second weight value and the maximum threshold adjustment parameter as the threshold adjustment parameter, and adjust the first threshold according to the threshold adjustment parameter.
可选的,处理模块,具体用于:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量;根据唤醒阈值调整表确定设备数量对应的阈值调整参数,唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系;基于阈值调整参数调整第一阈值;根据第一唤醒置信度和调整后的第一阈值,确定允许或禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: count the number of devices of the second electronic device allowed to be awakened in the device group according to the wake-up identification result; determine the threshold adjustment parameter corresponding to the number of devices according to the wake-up threshold adjustment table, and the wake-up threshold adjustment table includes: The correspondence between the number of devices of the second electronic device that is allowed to be awakened and the threshold adjustment parameter; the first threshold is adjusted based on the threshold adjustment parameter; according to the first wakeup confidence and the adjusted first threshold, it is determined to allow or prohibit the first electronic device be awakened.
可选的,处理模块,具体用于:若第一唤醒置信度大于或等于调整后的第一阈值,确定允许第一电子设备被唤醒;或者若第一唤醒置信度小于调整后的第一阈值,确定禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: if the first wake-up confidence is greater than or equal to the adjusted first threshold, determine to allow the first electronic device to be awakened; or if the first wake-up confidence is less than the adjusted first threshold , it is determined that the first electronic device is prohibited from being woken up.
第四方面,本申请实施例提供一种语音唤醒设备,该语音唤醒设备为第一电子设备,第一电子设备和至少一个第二电子设备属于同一设备组,该语音唤醒设备包括:获取模块,用于获取音频数据的第一唤醒置信度,第一唤醒置信度用于指示第一电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度;接收模块,用于接收至少一个第二电子设备发送的唤醒识别结果,唤醒识别结果用于指示允许或禁止第二电子设备被唤醒;处理模块,用于根据唤醒识别结果调整第一阈值,第一阈值为允许第一电子设备被唤醒的阈值;根据调整后的第一阈值和第一唤醒置信度,确定允许或禁止第一电子设备被唤醒。In a fourth aspect, an embodiment of the present application provides a voice wake-up device, the voice wake-up device is a first electronic device, the first electronic device and at least one second electronic device belong to the same device group, the voice wake-up device includes: an acquisition module, For obtaining the first wake-up confidence of the audio data, the first wake-up confidence is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the first electronic device; the receiving module is used to receive at least one The wake-up identification result sent by the second electronic device, where the wake-up identification result is used to indicate that the second electronic device is allowed or prohibited to be woken up; the processing module is configured to adjust the first threshold according to the wake-up identification result, where the first threshold is to allow the first electronic device to be woken up. A wake-up threshold; according to the adjusted first threshold and the first wake-up confidence, determine whether to allow or prohibit the first electronic device from being woken up.
可选的,处理模块,具体用于:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的第一权重值;统计设备组内所有第二电子设备的第二权重值;基于第一权重值和第二权重值调整第一阈值。Optionally, the processing module is specifically configured to: count the first weight values of the second electronic devices allowed to be woken up in the device group according to the wake-up identification result; count the second weight values of all the second electronic devices in the device group; A weight value and a second weight value adjust the first threshold.
可选的,第一权重值是根据设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量,以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的;第二权重值是根据设备组内所有第二电子设备的设备类型对应的设备数量,以及各设备类型对应的权重值确定的。Optionally, the first weight value is determined according to the number of devices corresponding to the device type of the second electronic device that is allowed to be awakened in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be awakened; The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.
可选的,处理模块,具体用于:将第一权重值与第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数,根据阈值调整参数调整第一阈值。Optionally, the processing module is specifically configured to: use the product of the ratio of the first weight value to the second weight value and the maximum threshold adjustment parameter as the threshold adjustment parameter, and adjust the first threshold according to the threshold adjustment parameter.
可选的,处理模块,具体用于:根据唤醒识别结果统计设备组内允许被唤醒的第二电子设备的设备数量;根据唤醒阈值调整表确定设备数量对应的阈值调整参数,唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系;基于阈值调整参数调整第一阈值。Optionally, the processing module is specifically configured to: count the number of devices of the second electronic device allowed to be awakened in the device group according to the wake-up identification result; determine the threshold adjustment parameter corresponding to the number of devices according to the wake-up threshold adjustment table, and the wake-up threshold adjustment table includes: The correspondence between the number of devices of the second electronic device that is allowed to be awakened and the threshold adjustment parameter; the first threshold is adjusted based on the threshold adjustment parameter.
可选的,唤醒识别结果包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项;其中,唤醒标识包括允许或禁止第二电子设备被唤醒的标识,第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification of at least one second electronic device and a second wake-up confidence; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence It is used to indicate the similarity of the acoustic features between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
可选的,处理模块,具体用于:若第一唤醒置信度大于或等于调整后的第一阈值,确定允许第一电子设备被唤醒;或者若第一唤醒置信度小于调整后的第一阈值,确定禁止第一电子设备被唤醒。Optionally, the processing module is specifically configured to: if the first wake-up confidence is greater than or equal to the adjusted first threshold, determine to allow the first electronic device to be awakened; or if the first wake-up confidence is less than the adjusted first threshold , it is determined that the first electronic device is prohibited from being woken up.
第五方面,本申请实施例提供一种语音唤醒设备,包括:存储器和处理器,存储器用于存储计算机程序,处理器用于从存储器中调用并运行计算机程序,使得处理器运行计算机程序执行如第一方面任一项的方法,或者如第二方面任一项的方法。In a fifth aspect, an embodiment of the present application provides a voice wake-up device, including: a memory and a processor, where the memory is used to store a computer program, and the processor is used to call and run the computer program from the memory, so that the processor runs the computer program to execute as described in Section 1. The method of any one of the aspect, or the method of any one of the second aspect.
第六方面,本申请实施例提供一种存储介质,存储介质包括计算机程序,计算机程序用于实现如第一方面任一项的方法,或者,如第二方面任一项的方法。In a sixth aspect, an embodiment of the present application provides a storage medium, where the storage medium includes a computer program, and the computer program is used to implement the method of any one of the first aspect, or the method of any one of the second aspect.
本申请实施例提供一种语音唤醒方法、设备及存储介质,该方法包括:主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中,根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果,确定主设备是否被唤醒,该方法提高了设备组中主设备被唤醒的准确性。进一步地,主设备还可以采用连续或者离散方式,动态调整设备组内主设备的预设唤醒阈值,并根据调整后的唤醒阈值确定主设备是否被唤醒,动态调整后的唤醒阈值更加契合整个设备组唤醒的实际状态,主设备基于动态调整后的唤醒阈值进行唤醒决策,唤醒设备的准确性更高。Embodiments of the present application provide a voice wake-up method, device, and storage medium. The method includes: the master device introduces the wake-up recognition results of other slave devices in the device group into the wake-up decision of the master device, according to the wake-up recognition result of the master device itself. The result and the wake-up identification results of other slave devices in the device group determine whether the master device is woken up, and the method improves the accuracy of the wake-up of the master device in the device group. Further, the master device can also dynamically adjust the preset wake-up threshold of the master device in the device group in a continuous or discrete manner, and determine whether the master device is woken up according to the adjusted wake-up threshold. The dynamically adjusted wake-up threshold is more suitable for the entire device. The actual state of group wake-up, the master device makes wake-up decisions based on the dynamically adjusted wake-up threshold, and the accuracy of wake-up devices is higher.
附图说明Description of drawings
图1为本申请实施例提供的语音唤醒方法的场景示意图;1 is a schematic diagram of a scenario of a voice wake-up method provided by an embodiment of the present application;
图2为本申请实施例提供的一种电子设备的硬件结构示意图;2 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application;
图3为本申请实施例提供的一种电子设备的软件架构示意图;3 is a schematic diagram of a software architecture of an electronic device according to an embodiment of the present application;
图4a至图4c为本申请实施例提供的语音唤醒方法的场景示意图;4a to 4c are schematic diagrams of scenarios of a voice wake-up method provided by an embodiment of the present application;
图5为本申请实施例提供的一种用户界面交互示意图;FIG. 5 is a schematic diagram of a user interface interaction provided by an embodiment of the present application;
图6为本申请实施例提供的语音唤醒方法的交互示意图;FIG. 6 is an interactive schematic diagram of a voice wake-up method provided by an embodiment of the present application;
图7为本申请实施例提供的一种语音唤醒方法的流程示意图;FIG. 7 is a schematic flowchart of a voice wake-up method provided by an embodiment of the present application;
图8a为本申请实施例提供的一种语音唤醒方法的判断流程图;Fig. 8a is a judgment flowchart of a voice wake-up method provided by an embodiment of the present application;
图8b为本申请实施例提供的一种语音唤醒方法的判断流程图;FIG. 8b is a judgment flowchart of a voice wake-up method provided by an embodiment of the present application;
图9为本申请实施例提供的一种语音唤醒方法的判断流程图;FIG. 9 is a judgment flowchart of a voice wake-up method provided by an embodiment of the present application;
图10为本申请实施例提供的另一种语音唤醒方法的流程示意图;10 is a schematic flowchart of another voice wake-up method provided by an embodiment of the present application;
图11为本申请实施例提供的又一种语音唤醒方法的流程示意图;11 is a schematic flowchart of another voice wake-up method provided by an embodiment of the present application;
图12为本申请实施例提供的一种语音唤醒设备的结构示意图;12 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the application;
图13为本申请实施例提供的一种语音唤醒设备的结构示意图;13 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the application;
图14为本申请实施例提供的一种语音唤醒设备的硬件结构示意图。FIG. 14 is a schematic diagram of a hardware structure of a voice wake-up device according to an embodiment of the present application.
具体实施方式Detailed ways
本申请实施例提供的电子设备为具有语音唤醒功能的电子设备,即用户可以通过语音唤醒电子设备。具体的,用户通过说出唤醒词唤醒电子设备,唤醒词可以是用户根据自身需要预设的,也可以是电子设备在出厂之前预设的,对此本实施例不作任何限制。The electronic device provided by the embodiment of the present application is an electronic device with a voice wake-up function, that is, a user can wake up the electronic device by voice. Specifically, the user wakes up the electronic device by speaking a wake-up word. The wake-up word may be preset by the user according to his own needs, or may be preset by the electronic device before leaving the factory, which is not limited in this embodiment.
电子设备获取音频数据,检测该音频数据是否包含上述唤醒词,如果音频数据包含唤醒词,则电子设备被唤醒,否则电子设备不被唤醒。电子设备被唤醒后,用户可以通过语音与电子设备进行交互。例如,预设的唤醒词为“小艺小艺”,当电子设备检测到音频数据中包含“小艺小艺”时,电子设备被唤醒。示例性的,图1示出了一种语音唤醒的场景示意图,如图1所示,该场景包括电子设备10、电子设备20、电子设备30以及电子设备40。其中,电子设备10、电子设备30以及电子设备40具有相同的预设唤醒词,例如唤醒词1,电子设备20中预设的唤醒词为唤醒词2。当用户说出的唤醒词为唤醒词1,且场景中的各个电子设备均可以接收或采集到用户说出的唤醒词时,电子设备10、电子设备30以及电子设备40能够被唤醒。由于用户说出的唤醒词与电子设备20预设的唤醒词不同,因此电子设备20不被唤醒。The electronic device acquires the audio data, and detects whether the audio data contains the wake-up word. If the audio data contains the wake-up word, the electronic device is awakened, otherwise the electronic device is not awakened. After the electronic device is awakened, the user can interact with the electronic device through voice. For example, the preset wake-up word is "Xiaoyi Xiaoyi". When the electronic device detects that the audio data contains "Xiaoyi Xiaoyi", the electronic device is woken up. Exemplarily, FIG. 1 shows a schematic diagram of a voice wake-up scenario. As shown in FIG. 1 , the scenario includes an
由上述示例可知,当用户所处空间存在多种支持语音唤醒的电子设备时,用户说出唤醒词,有可能被多个电子设备接收或采集到,如果多个电子设备预设的唤醒词都相同,有可能导致多个电子设备被唤醒,被唤醒的多个电子设备会同时响应用户的请求,造成用户不知道与哪个设备进行语音交互的困惑,降低用户体验。It can be seen from the above example that when there are multiple electronic devices that support voice wake-up in the space where the user is located, the wake-up word spoken by the user may be received or collected by multiple electronic devices. Similarly, it may cause multiple electronic devices to be awakened, and the multiple electronic devices that are awakened will respond to the user's request at the same time, causing the user to be confused about which device to perform voice interaction with, and reducing the user experience.
为了避免用户语音交互的困惑,本申请实施例提供的语音唤醒方法,将能够同时被用户唤醒的电子设备组成一个设备组,设备组内只有一个电子设备对用户的唤醒进行应答响应,将进行应答响应的电子设备看作设备组的主设备,除主设备之外的其他电子设备看作设备组的从设备,从设备配合主设备处理用户的指令意图。以图1为例,可以将具有相同预设唤醒词的电子设备10、电子设备30以及电子设备40组成一个设备组,将电子设备10设置为该设备组的主设备,电子设备30和电子设备40作为该设备组的从设备,从设备配合主设备处理用户的指令意图。需要说明的是,可以通过预设规则设置设备组的主设备,例如将设备组中处理性能最强的电子设备设置为主设备,也可以通过用户自定义设置设备组内的任意一个电子设备为主设备,对此本实施例不作任何限制。In order to avoid the confusion of the user's voice interaction, the voice wake-up method provided by the embodiment of the present application forms a device group of electronic devices that can be woken up by the user at the same time, and only one electronic device in the device group responds to the user's wake-up and will respond The responding electronic device is regarded as the master device of the device group, other electronic devices except the master device are regarded as the slave devices of the device group, and the slave devices cooperate with the master device to process the user's instruction intention. Taking FIG. 1 as an example, the
基于图1提供的语音唤醒场景,设备组的主设备在接收或采集到音频数据后,对音频数据进行预处理,提取音频数据中的唤醒词,通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度值,如果相似度值大于或者等于预设相似度阈值(也可称为预设唤醒阈值),则认为检测到唤醒请求,设备组的主设备被唤醒,否则主设备不被唤醒。由此可知,设备组中的主设备是否唤醒是由主设备自身的唤醒识别结果决定的,该执行方案存在如下缺陷:Based on the voice wake-up scenario provided in Figure 1, after receiving or collecting the audio data, the master device of the device group preprocesses the audio data, extracts the wake-up words in the audio data, and uses the voice wake-up model to calculate the difference between the wake-up words in the audio data and the audio data. The similarity value of the acoustic features of the preset wake-up word, if the similarity value is greater than or equal to the preset similarity threshold (also referred to as the preset wake-up threshold), it is considered that a wake-up request is detected, and the master device of the device group is woken up, Otherwise the master device is not woken up. It can be seen from this that whether the master device in the device group wakes up is determined by the wake-up recognition result of the master device itself, and this implementation scheme has the following defects:
第一,设备组包括多个可被唤醒的电子设备,只依靠主设备自身的唤醒识别结果作为整个设备组的唤醒识别结果,该结果的准确性不高。例如,在语音传输路径上存在干扰源,或者,用户距离主设备较远时,仅仅依靠主设备自身的唤醒识别结果,极有可能在主设备需要被唤醒时,由于外界因素的影响,主设备没有检测到唤醒语音,或者检测到唤醒语音但没有达到预设唤醒阈值,造成整个设备组没有响应。First, the device group includes a plurality of electronic devices that can be woken up, and only the wake-up identification result of the master device itself is used as the wake-up identification result of the entire device group, and the accuracy of the result is not high. For example, if there is an interference source on the voice transmission path, or when the user is far away from the main device, only relying on the wake-up recognition result of the main device itself, it is very likely that when the main device needs to be woken up, due to the influence of external factors, the main device will No wake-up voice is detected, or a wake-up voice is detected but the preset wake-up threshold is not reached, causing the entire device group to become unresponsive.
第二,设备组中主设备的预设唤醒阈值是固定值,然而不同的语音唤醒场景该唤醒阈值可能存在差异。因此,固定的预设唤醒阈值可能影响主设备的唤醒识别结果。Second, the preset wake-up threshold of the master device in the device group is a fixed value, but the wake-up threshold may be different in different voice wake-up scenarios. Therefore, the fixed preset wake-up threshold may affect the wake-up recognition result of the master device.
为了提高主设备的唤醒率,可采用其他解决方式辅助主设备进行唤醒识别的判断。例如,通过服务端模型再校验的方式,设备组的主设备通过与服务端交互,从服务端获取唤醒识别结果。然而,该方式的实现成本较高,且依赖于网络环境,可能存在较高的延时问题。又例如,通过扩展设备侧的语音唤醒模型提高唤醒识别的准确率,该方式的缺点是占用更多的设备侧运算和存储资源,导致设备的整体成本较高。In order to improve the wake-up rate of the master device, other solutions may be used to assist the master device in judging wake-up identification. For example, by re-checking the server model, the master device of the device group obtains the wake-up recognition result from the server by interacting with the server. However, the implementation cost of this method is high, and depending on the network environment, there may be a high delay problem. For another example, the accuracy of wake-up recognition is improved by expanding the voice wake-up model on the device side. The disadvantage of this method is that it occupies more computing and storage resources on the device side, resulting in a higher overall cost of the device.
综上可知,设备组的主设备仅依据自身的唤醒识别结果,存在较高的误判率,易造成整个设备组在用户发出唤醒语音后无响应,用户体验不佳。虽然,通过服务端再校验或者扩展设备侧语音唤醒模型可以提高主设备的唤醒率,但实现成本较高,且存在较高的时延。对此,本申请实施例提供一种语音唤醒方法,主设备通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中,充分利用设备组内不同电子设备上的唤醒识别结果,根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果,确定主设备是否被唤醒,由于主设备在唤醒决策时考虑了设备组内其他从设备的唤醒识别结果,提高了设备组中主设备被唤醒的准确性。To sum up, the master device of the device group only relies on its own wake-up recognition result, which has a high misjudgment rate, which is likely to cause the entire device group to not respond after the user sends a wake-up voice, resulting in poor user experience. Although the wake-up rate of the master device can be improved by re-checking on the server side or expanding the device-side voice wake-up model, the implementation cost is high and there is a high delay. In this regard, an embodiment of the present application provides a voice wake-up method. The master device makes full use of the wake-up recognition results on different electronic devices in the device group by introducing the wake-up recognition results of other slave devices in the device group into the wake-up decision of the master device. , according to the wake-up recognition result of the master device itself and the wake-up recognition results of other slave devices in the device group, to determine whether the master device is woken up, because the master device considers the wake-up recognition results of other slave devices in the device group in the wake-up decision. The accuracy with which the master device in the device group is woken up.
在不同语音唤醒场景下,设备组的主设备采用固定的唤醒阈值进行唤醒决策,存在较高的误判率。对此,本申请实施例提供一种语音唤醒方法,主设备同样通过将设备组内其他从设备的唤醒识别结果引入到主设备的唤醒决策中,根据主设备自身的唤醒识别结果以及设备组内其他从设备的唤醒识别结果,采用连续或者离散方式,动态调整设备组内主设备的预设唤醒阈值,并根据调整后的唤醒阈值确定主设备是否被唤醒。动态调整后的唤醒阈值更加契合整个设备组唤醒的实际状态,主设备基于动态调整后的唤醒阈值进行唤醒决策,提高了设备组中主设备被唤醒的准确性。In different voice wake-up scenarios, the master device of the device group uses a fixed wake-up threshold to make wake-up decisions, which has a high false positive rate. In this regard, an embodiment of the present application provides a voice wake-up method. The master device also introduces the wake-up recognition results of other slave devices in the device group into the wake-up decision of the master device. The wake-up recognition results of other slave devices use continuous or discrete methods to dynamically adjust the preset wake-up threshold of the master device in the device group, and determine whether the master device is woken up according to the adjusted wake-up threshold. The dynamically adjusted wake-up threshold is more in line with the actual wake-up state of the entire device group. The master device makes wake-up decisions based on the dynamically adjusted wake-up threshold, which improves the accuracy of the master device in the device group being woken up.
以下对本申请实施例中设备组内的电子设备进行详细介绍。The electronic devices in the device group in the embodiments of the present application are described in detail below.
本申请实施例中的电子设备可以为便携式电子设备,诸如手机、平板电脑、人工智能(artificial intelligence,AI)智能语音终端、可穿戴设备、增强现实(augmentedreality,AR)/虚拟现实(virtual reality,VR)设备等。便携式电子设备的示例性实施例包括但不限于搭载或者其它操作系统的便携式电子设备。上述便携式电子设备也可以是车载终端、膝上型计算机(Laptop)等。还应当理解的是,本申请实施例的电子设备还可以台式计算机、智能家居设备(例如智能电视、智能音箱)等,对此不作限定。The electronic device in this embodiment of the present application may be a portable electronic device, such as a mobile phone, a tablet computer, an artificial intelligence (artificial intelligence, AI) intelligent voice terminal, a wearable device, an augmented reality (AR)/virtual reality (virtual reality, VR) equipment, etc. Exemplary embodiments of portable electronic devices include, but are not limited to, portable electronic devices powered by or other operating systems. The above-mentioned portable electronic device may also be an in-vehicle terminal, a laptop computer (Laptop), or the like. It should also be understood that the electronic device in the embodiment of the present application may also be a desktop computer, a smart home device (eg, a smart TV, a smart speaker), etc., which is not limited thereto.
示例的,图2示出了本申请实施例的一种电子设备的硬件结构示意图。具体的,如图2所示,电子设备包括处理器110、内部存储器121、外部存储器接口122、摄像头131、显示屏132、传感器模块140、用户标识模块(subscriber identification module,SIM)卡接口151、按键152、音频模块160、扬声器161、受话器162、麦克风163、耳机接口164、通用串行总线(universal serial bus,USB)接口170、充电管理模块180、电源管理模块181、电池182、移动通信模块191和无线通信模块192。在另一些实施例中,电子设备还可以包括马达、指示器、按键等。By way of example, FIG. 2 shows a schematic diagram of a hardware structure of an electronic device according to an embodiment of the present application. Specifically, as shown in FIG. 2 , the electronic device includes a
应理解,图2所示的硬件结构仅是一个示例。本申请实施例的电子设备可以具有比图中所示电子设备更多的或者更少的部件,可以组合两个或更多的部件,或者可以具有不同的部件配置。图中所示出的各种部件可以在包括一个或多个信号处理和/或专用集成电路在内的硬件、软件、或硬件和软件的组合中实现。It should be understood that the hardware structure shown in FIG. 2 is only an example. The electronic device of the embodiments of the present application may have more or less components than the electronic device shown in the figures, may combine two or more components, or may have different component configurations. The various components shown in the figures may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application specific integrated circuits.
其中,处理器110可以包括一个或多个处理单元。例如:处理器110可以包括应用处理器(application processor,AP)、调制解调器、图形处理器(graphics processingunit,GPU)、图像信号处理器(image signal processor,ISP)、控制器、视频编解码器、数字信号处理器(digital signal processor,DSP)、基带处理器、和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The
在一些实施例中,处理器110中还可以设置缓存器,用于存储指令和/或数据。示例的,处理器110中的缓存器可以为高速缓冲存储器。该缓存器可以用于保存处理器110刚用过的、生成的、或循环使用的指令和/或数据。如果处理器110需要使用该指令或数据,可从该缓存器中直接调用。有助于减少了处理器110获取指令或数据的时间,从而有助于提高系统的效率。In some embodiments, a buffer may also be provided in the
内部存储器121可以用于存储程序和/或数据。在一些实施例中,内部存储器121包括存储程序区和存储数据区。其中,存储程序区可以用于存储操作系统(如Android、IOS等操作系统)、至少一个功能所需的计算机程序(比如语音唤醒功能、声音播放功能)等。存储数据区可以用于存储电子设备使用过程中所创建、和/或采集的数据(比如音频数据)等。示例的,处理器110可以通过调用内部存储器121中存储的程序和/或数据,使得电子设备执行相应的方法,从而实现一种或多种功能。例如,处理器110调用内部存储器中的某些程序和/或数据,使得电子设备执行本申请实施例中所提供的语音唤醒方法、从而实现语音唤醒功能。其中,内部存储器121可以采用高速随机存取存储器、和/或非易失性存储器等。例如,非易失性存储器可以包括一个或多个磁盘存储器件、闪存器件、和/或通用闪存存储器(universal flash storage,UFS)等中的至少一个。
外部存储器接口122可以用于连接外部存储卡(例如,Micro SD卡),实现扩展电子设备的存储能力。外部存储卡通过外部存储器接口122与处理器110通信,实现数据存储功能。例如电子设备可以通过外部存储器接口122将图像、音乐、视频等文件保存在外部存储卡中。The
摄像头131可以用于捕获动、静态图像等。通常情况下,摄像头131包括镜头和图像传感器。其中,物体通过镜头生成的光学图像投射到图像传感器上,然后转换为电信号,在进行后续处理。示例的,图像传感器可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。图像传感器把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。需要说明的是,电子设备可以包括1个或N个摄像头131,其中,N为大于1的正整数。The
显示屏132可以包括显示面板,用于显示用户界面。显示面板可以采用液晶显示屏(liquid crystal display,LCD)、有机发光二极管(organic light-emitting diode,OLED)、有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrixorganiclight emitting diode,AMOLED)、柔性发光二极管(flex light-emitting diode,FLED)、Miniled、MicroLed、Micro-oLed、量子点发光二极管(quantum dot lightemittingdiodes,QLED)等。需要说明的是,电子设备可以包括1个或M个显示屏132,M为大于1的正整数。示例的,电子设备可以通过GPU、显示屏132、应用处理器等实现显示功能。
传感器模块140可以包括一个或多个传感器。例如,触摸传感器140A、陀螺仪140B、加速度传感器140C、指纹传感器140D、压力传感器140E等。在一些实施例中,传感器模块140还可以包括环境光传感器、距离传感器、接近光传感器、骨传导传感器、温度传感器等。
其中,触摸传感器140A,也可称为“触控面板”。触摸传感器140A可以设置于显示屏132,由触摸传感器140A与显示屏132组成触摸屏,也称“触控屏”。触摸传感器140A用于检测作用于其上或附近的触摸操作。触摸传感器140A可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型。电子设备可以通过显示屏132提供与触摸操作相关的视觉输出等。在另一些实施例中,触摸传感器140A也可以设置于电子设备的表面,与显示屏132所处的位置不同。The
陀螺仪140B可以用于确定电子设备的运动姿态。在一些实施例中,可以通过陀螺仪140B确定电子设备围绕三个轴(即,x、y和z轴)的角速度。陀螺仪140B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪140B检测电子设备抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消电子设备的抖动,从而实现防抖。陀螺仪传感器140B还可以用于导航、体感游戏场景。The
加速度传感器140C可检测电子设备在各个方向上(一般为三轴)加速度的大小。当电子设备静止时可检测出重力的大小及方向。加速度传感器140C还可以用于识别电子设备的姿态,应用于横竖屏切换、计步器等应用。The
指纹传感器140D用于采集指纹。电子设备可以利用采集的指纹特性实现指纹解锁、访问应用锁、指纹拍照、指纹接听来电等。The
压力传感器140E用于感受压力信号,可以将压力信号转换成电信号。示例的,压力传感器140E可以设置于显示屏132。其中,作用于相同触摸位置,但不同触摸操作强度的触摸操作,可以对应不同的操作指令。The
SIM卡接口151用于连接SIM卡。SIM卡可以通过插入SIM卡接口151,或从SIM卡接口151拔出,实现和电子设备的接触和分离。电子设备可以支持1个或K个SIM卡接口151,K为大于1的正整数。SIM卡接口151可以支持Nano SIM卡、Micro SIM卡、和/或SIM卡等。同一个SIM卡接口151可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口151也可以兼容不同类型的SIM卡。SIM卡接口151也可以兼容外部存储卡。电子设备通过SIM卡和网络交互,实现通话以及数据通信等功能。在一些实施例中,电子设备还可以采用eSIM,即:嵌入式SIM卡。eSIM卡可以嵌在电子设备中,不能和电子设备分离。The
按键152可以包括开机键、音量键等。按键152可以是机械按键,也可以是触摸式按键。电子设备可以接收按键输入,产生与电子设备的用户设置以及功能控制有关的键信号输入。The
电子设备可以通过音频模块160、扬声器161、受话器162、麦克风163、耳机接口164以及应用处理器等实现音频功能。例如,音频播放功能、录音功能、语音唤醒功能等。The electronic device may implement audio functions through an
音频模块160可以用于对音频数据进行数模转换、和/或模数转换,还可以用于对音频数据进行编码和/或解码。示例的,音频模块160可以独立于处理器设置,也可以设置于处理器110中,或将音频模块160的部分功能模块设置于处理器110中。The
扬声器161,也称“喇叭”,用于将音频数据转换为声音,并播放声音。例如,电子设备100可以通过扬声器161收听音乐、接听免提电话、或者发出语音提示等。The
受话器162,也称“听筒”,用于将音频数据转换成声音,并播放声音。例如,当电子设备100接听电话时,可以通过将受话器162靠近人耳进行接听。The
麦克风163,也称“话筒”、“传声器”,用于采集声音(例如周围环境声音,包括人发出的声音、设备发出的声音等),并将声音转换为音频电数据。当拨打电话或发送语音时,用户可以通过人嘴靠近麦克风163发出声音,麦克风163采集用户发出的声音。当电子设备的语音唤醒功能已开启的情况下,麦克风163可以实时采集周围环境声音,获取音频数据。其中,麦克风163采集声音的情况与所处的环境相关。例如,当周围环境较为嘈杂时,用户说出唤醒词,则麦克风163采集的声音包括周围环境噪声和用户发出唤醒词的声音。再例如,当周围环境较为安静时,用户说出唤醒词,则麦克风163采集的声音为用户发出唤醒词的声音。又例如,当周围环境较为嘈杂时,电子设备的语音唤醒功能已开启,但是用户并未说出唤醒词对电子设备进行唤醒,则麦克风163采集的声音仅为周围环境噪声。The
需要说明的是,电子设备可以设置至少一个麦克风163。例如,电子设备中设置两个麦克风163,除了采集声音,还可以实现降噪功能。又例如,电子设备中还可以设置三个、四个或更多个麦克风163,从而可以在实现声音采集、降噪的基础上,还可以实现声音来源的识别、或定向录音功能等。It should be noted that the electronic device may be provided with at least one
耳机接口164用于连接有线耳机。耳机接口164可以是USB接口170,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口、美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口等。The
USB接口170是符合USB标准规范的接口,具体可以是Mini USB接口、Micro USB接口、USB Type C接口等。USB接口170可以用于连接充电器为电子设备充电,也可以用于电子设备与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。示例的,USB接口170除了可以为耳机接口164以外,还可以用于连接其他电子设备,例如AR设备、计算机等。The
充电管理模块180用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块180可以通过USB接口170接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块180可以通过电子设备的无线充电线圈接收无线充电输入。充电管理模块180为电池182充电的同时,还可以通过电源管理模块180为电子设备供电。The
电源管理模块181用于连接电池182、充电管理模块180与处理器110。电源管理模块181接收电池182和/或充电管理模块180的输入,为处理器110、内部存储器121、显示屏132、摄像头131等供电。电源管理模块181还可以用于监测电池容量、电池循环次数、电池健康状态(漏电、阻抗)等参数。在其他一些实施例中,电源管理模块181也可以设置于处理器110中。在另一些实施例中,电源管理模块181和充电管理模块180也可以设置于同一个器件中。The
移动通信模块191可以提供应用在电子设备上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块191可以包括滤波器、开关、功率放大器、低噪声放大器(lownoiseamplifier,LNA)等。The
无线通信模块192可以提供应用在电子设备上的包括WLAN(如Wi-Fi网络)、蓝牙(Bluetooth,BT)、全球导航卫星系统(global navigation satellite system,GNSS)、调频(frequency modulation,FM)、近距离无线通信技术(near field communication,NFC)、红外技术(infrared,IR)等无线通信的解决方案。无线通信模块192可以是集成至少一个通信处理模块的一个或多个器件。The
在一些实施例中,电子设备的天线1和移动通信模块191耦合,天线2和无线通信模块192耦合,使得电子设备可以与其他设备通信。具体的,移动通信模块191可以通过天线1与其它设备通信,无线通信模块193可以通过天线2与其它设备通信。In some embodiments, the antenna 1 of the electronic device is coupled with the
图3示出了本申请实施例的一种电子设备的软件架构示意图。如图3所示,电子设备包括音频收集模块(audio collector)401、音频处理模块(aduio processor)402、音频识别模块(audio recognizer)403、交互模块(interactor)404。FIG. 3 shows a schematic diagram of a software architecture of an electronic device according to an embodiment of the present application. As shown in FIG. 3 , the electronic device includes an
其中,音频收集模块401用于存储根据声音采集设备(例如图2中所示的麦克风163、或者其它用于采集声音的传感器)采集的声音转化为的音频数据,并将音频数据转发给音频处理模块402。例如,音频收集模块401可以用于将从音频模块160获取的音频数据存储到存储器(例如内部存储器121、或者处理器110中的存储器等)中,将存储器中存储的音频数据转发给音频处理模块402进行处理。需要说明的是,本申请实施例中音频收集模块401可以当接收到音频模块160中得到音频数据的通知后,主动从音频模块160中获取音频数据,也可以是音频模块160采集到音频数据后,将音频数据发送给音频收集模块401,本申请实施例对音频收集模块401从音频模块160获取音频数据的方式不作限定。The
音频处理模块402用于对音频数据进行预处理,例如声道转换、平滑处理、降噪处理等,并将预处理后的音频数据发送给音频识别模块403,以便于后续音频识别模块403进行唤醒词检测。The
音频识别模块403用于对音频数据进行唤醒词检测,可通过语音唤醒模型确定音频数据中是否存在预设唤醒词。通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度值,如果相似度值大于或者等于预设相似度阈值,则向麦克风163、受话器162、耳机通话器发送唤醒指令,唤醒后通过扬声器161从音频模块160获取预存的应答语音数据,将应答语音数据转换声音,进行语音应答。例如,音频识别模块403可以根据第一时间段内的Q个采样时刻的音频数据进行唤醒词检测,第一时间段又可以称之为唤醒词时间窗等,通常设置为一个不小于用户发出唤醒词声音所需的时长。其中,Q个采样时刻中相邻的两个采样间隔之间间隔的时长为第一采样间隔,即音频处理模块402可以每间隔第一采样间隔向音频识别模块403发送预处理后的音频数据,音频识别模块403每间隔第一采样间隔,根据最近接收到的Q个采样时刻的音频数据进行一次唤醒词检测。当音频识别模块403检测到唤醒词时,通过语音唤醒模型计算唤醒词与预设唤醒词的相似度值,如果相似度值大于或者等于预设相似度阈值,确定唤醒电子设备,音频识别模块403向扬声器161发送唤醒指令。需要说明的是,本申请实施例中第一采样间隔的取值可以为0.1ms、0.2ms等,可以是预先设置的,也可以是根据预设算法确定的,对此本实施例不作任何限制。在另一些实施例中,音频识别模块403还可以对音频数据进行语音数据识别,以及对语音数据中的语义进行识别等。The
交互模块404用于与其它设备进行信息交互,例如与设备组内的其他从设备进行信息交互。交互模块404用于当接收到设备组内的其他从设备发送的唤醒识别结果时,向音频识别模块403发送其他从设备的唤醒识别结果,以便于音频识别模块403结合其他从设备的唤醒识别结果判定是否唤醒电子设备。交互模块404还用于向设备组内的其他从设备发送控制指令,以使其他从设备配合主设备处理用户的指令意图。The
此外,在一些实施例中,电子设备还可以包括音频合成模块(audio synthesizer)405。其中,音频合成模块405用于合成相应的应答语音数据,并将应答语音数据转换为声音进行播放。例如,电子设备可以响应于采集到用户发出的“小艺小艺”的声音后,通过扬声器161播放“请问需要什么帮助”的声音。在这种情况下,音频合成模块405用于响应于采集到用户发出的“小艺小艺”的声音,合成相应的应答语音数据,并将合成的应答语音数据转换为“请问需要什么帮助”的声音,并播放。Additionally, in some embodiments, the electronic device may further include an
应理解,图3所示的软件结构仅是一个示例。本申请实施例的电子设备可以具有比图中所示电子设备更多的或者更少的模块,可以组合两个或更多的模块等。图中所示出的各个模块可以在包括一个或多个信号处理和/或专用集成电路在内的硬件、软件、或硬件和软件的组合中实现。It should be understood that the software structure shown in FIG. 3 is only an example. The electronic device of the embodiments of the present application may have more or less modules than the electronic device shown in the figure, two or more modules may be combined, and so on. The various modules shown in the figures may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application specific integrated circuits.
需要说明的是,图3中所示的音频收集模块401、音频识别模块403、音频处理模块402、交互模块404、音频合成模块405可以集成在图2所示的处理器110中的一个或多个处理单元中,例如,音频收集模块401、音频识别模块403、音频处理模块402、交互模块404、音频合成模块405中的部分或全部可以集成在应用处理器、专用处理器等一个或多个处理器中。需要说明的是,本申请实施例中的专用处理器可以为DSP、专用集成电路(applicationspecific integrated circuit,ASIC)芯片等。It should be noted that the
以下实施例均可以在具有上述硬件结构和/或软件结构的电子设备中实现。The following embodiments can all be implemented in an electronic device having the above-mentioned hardware structure and/or software structure.
下面结合附图对本申请提供的语音唤醒方法的使用场景进行详细介绍。The usage scenarios of the voice wake-up method provided by the present application will be described in detail below with reference to the accompanying drawings.
图4a至图4c以智能音箱为设备组中的电子设备进行举例。如图4a所示,该场景包括6个音箱,这6个音箱组成一个设备组,分布在区域1(例如客厅)的不同位置处。其中,音箱11为设备组的主设备,音箱12至音箱16为设备组的从设备,设备组内的各个音箱之间相互连接(图4a仅示出了主设备与各从设备之间的连接关系)。设备组中各个音箱的设备类型相同,且唤醒词相同。设备组中各个音箱在唤醒时刻均可接收或者采集到的用户发出的唤醒语音,设备组内只有主设备音箱11进行唤醒应答,音箱11综合设备组内其他音箱的唤醒识别结果进行唤醒决策,确定音箱11是否被唤醒,在音箱11被唤醒后,从设备音箱12至音箱16可配合音箱11处理用户的指令意图。图4b与图4a的区别在于,设备组中的6个音箱分布在不同区域,作为一种示例,如图4b所示,主设备音箱11和从设备音箱12位于区域1,从设备音箱13至音箱16位于区域2(例如卧室)。在该场景中,用户可以在上述两个区域的任意区域发出唤醒语音,用户与设备组内主设备音箱11的距离不定,有可能距离较近,也有可能距离较远,且在唤醒语音的传输路径上还可能存在噪声源。图4c与图4a的区别在于,设备组中各个音箱的硬件规格不相同,但唤醒词相同。可根据音箱的硬件规格,将音箱细分为音箱Pro、常规音箱以及音箱mini,各音箱的硬件规格的大小关系为:音箱Pro>常规音箱>音箱mini。其中,硬件规格包括但不限于音箱内置芯片的处理性能、尺寸大小等。如图4c所示,将硬件规格最佳的音箱Pro11作为设备组的主设备。Figures 4a to 4c take a smart speaker as an example of an electronic device in the device group. As shown in Figure 4a, the scene includes 6 speakers, which form a device group and are distributed at different locations in area 1 (eg, living room). Among them, the
上述使用场景仅作为示例,场景中同一设备组中的各个电子设备可以是同一类型的设备,例如设备组内都是智能音箱,如图4a至图4c,也可以是不同类型的设备,例如设备组内有智能音箱、智能手机、智能电视等,如图1所示。本申请实施例对设备组中的电子设备的设备类型不作任何限制。The above usage scenarios are only examples. In the scenario, each electronic device in the same device group may be the same type of device. For example, the device group is all smart speakers, as shown in Figure 4a to Figure 4c, or different types of devices, such as devices There are smart speakers, smart phones, smart TVs, etc. in the group, as shown in Figure 1. This embodiment of the present application does not impose any restrictions on the device types of the electronic devices in the device group.
作为一种示例,用户可通过任意具有显示功能的电子设备,例如智能手机、平板电脑等,管理或设置设备组中的电子设备。图5示出了本申请实施例的一种用户界面交互图。如图5所示,用户界面501包括状态栏、多个应用程序的图标控件(例如“智能家居”图标)、时间和天气小组件等。当电子设备检测到用户的手指(或者触控笔)针对某一应用图标的触摸操作后,响应于该触摸操作,电子设备启动该应用程序,并在显示屏上显示该应用程序的用户界面。示例性的,电子设备检测到对“智能家居”图标的触摸操作,响应于该触摸操作,在显示屏上显示用户的设备界面502,设备界面502包括用户已添加的智能设备列表(如图5所示的设备a、b、c、d)、第一控件503、第二控件504、第三控件505。用户可通过点击第一控件503添加新的智能设备至智能设备列表中,用户还可通过点击第二控件504、第三控件505,将智能设备列表中的多个智能设备添加至同一设备组中。在设备组界面506上,用户可通过点击第四控件507,设置设备组中的主设备(如图5中将设备a设置为主设备),用户还可通过点击设备控件508,进入设备组中任意智能设备界面509,查询某一智能设备的设备信息(例如设备型号、数据记录、唤醒灵敏度等)。示例性的,对于智能设备的唤醒灵敏度,用户可通过第五控件510设置开启或关闭智能设备的唤醒增强功能。As an example, the user can manage or set the electronic devices in the device group through any electronic device having a display function, such as a smart phone, a tablet computer, and the like. FIG. 5 shows a user interface interaction diagram according to an embodiment of the present application. As shown in FIG. 5,
基于上述对电子设备以及场景的介绍,下面结合具体的几个实施例对本申请提供的语音唤醒方法进行详细说明。Based on the above introduction of electronic devices and scenarios, the voice wake-up method provided by the present application will be described in detail below with reference to several specific embodiments.
图6示出了本申请实施例提供的语音唤醒方法的交互示意图。示例性的,以3个电子设备组成的设备组为例,假设第一电子设备为设备组的主设备,第二电子设备a和第二电子设备b为设备组的从设备。如图6所示,用户发出唤醒语音,设备组中的各个电子设备均可接收或采集到该唤醒语音,设备组中的各个电子设备分别对唤醒语音进行唤醒词检测,得到唤醒识别结果。具体的,第一电子设备计算唤醒语音的第一唤醒置信度;第二电子设备a计算唤醒语音的第二唤醒置信度,确定允许或禁止第二电子设备a被唤醒;第二电子设备b计算唤醒语音的第三唤醒置信度,确定允许或禁止第二电子设备b被唤醒。第二电子设备a和第二电子设备b在确定唤醒识别结果后,分别向第一电子设备发送各自确定的唤醒识别结果,以使第一电子设备根据自身确定的第一唤醒置信度以及设备组中其他从设备发送的唤醒识别结果,确定允许或禁止第一电子设备被唤醒。应理解,由于各个电子设备与用户之间的距离不同、传输路径上干扰因素的不同、各个电子设备处理性能的差异等,各个电子设备计算得到的唤醒置信度可能相同也可能不同,因此唤醒识别结果可能不同。FIG. 6 shows an interactive schematic diagram of the voice wake-up method provided by the embodiment of the present application. Exemplarily, taking a device group composed of three electronic devices as an example, it is assumed that the first electronic device is the master device of the device group, and the second electronic device a and the second electronic device b are slave devices of the device group. As shown in FIG. 6 , when a user sends a wake-up voice, each electronic device in the device group can receive or collect the wake-up voice, and each electronic device in the device group performs wake-up word detection on the wake-up voice to obtain a wake-up recognition result. Specifically, the first electronic device calculates the first wake-up confidence of the wake-up voice; the second electronic device a calculates the second wake-up confidence of the wake-up voice, and determines whether to allow or prohibit the second electronic device a from being woken up; the second electronic device b calculates The third wake-up confidence level of the wake-up voice determines whether the second electronic device b is allowed or prohibited to be woken up. After determining the wake-up recognition result, the second electronic device a and the second electronic device b send the respectively determined wake-up recognition results to the first electronic device, so that the first electronic device can determine the first wake-up confidence level and the device group according to the first electronic device itself. The wake-up identification results sent by other slave devices in the device determine whether to allow or prohibit the first electronic device from being woken up. It should be understood that due to the difference in the distance between each electronic device and the user, the difference in the interference factors on the transmission path, the difference in the processing performance of each electronic device, etc., the wake-up confidence calculated by each electronic device may be the same or different. Results may vary.
作为一种示例,图7示出了一种语音唤醒方法的流程示意图,该方法以第一电子设备为执行主体,如图7所示,该方法具体包括如下步骤:As an example, FIG. 7 shows a schematic flowchart of a voice wake-up method. The method takes the first electronic device as the execution body. As shown in FIG. 7 , the method specifically includes the following steps:
步骤101、获取音频数据。Step 101: Acquire audio data.
第一电子设备可以通过声音采集设备,例如麦克风、其他用于采集声音的传感器等,采集周围环境的声音。声音采集设备在采集到周围环境声音后,将周围环境声音转换为音频电数据,并将音频电数据输出给音频模块160,音频模块160进行编码和/或模数转换得到相应格式的音频数据。当音频模块160得到相应格式的音频数据后,可以将相应格式的音频数据发送给处理器110中的音频收集模块401。音频收集模块401将相应格式的音频数据存储到存储器(例如内部存储器121、或者处理器110中的存储器等)中,并将相应格式的音频数据发送给音频处理模块402。音频处理模块402对相应格式的音频数据进行预处理,得到处理后的音频数据。其中,音频数据的预处理包括声道转换、平滑处理、降噪处理等。可选的,在一些实施例中,音频模块160获取相应格式的音频数据,还可以向处理器110中的音频收集模块401发送通知,音频收集模块401接收到该通知后,从音频模块401获取相应格式的音频数据,然后将相应格式的音频数据存储到存储器中,并向音频处理模块402发送相应格式的音频数据,音频处理模块402对相应格式的音频数据进行预处理,得到处理后的音频数据。The first electronic device may collect the sound of the surrounding environment through a sound collecting device, such as a microphone, other sensors for collecting sound, and the like. After collecting ambient sound, the sound collection device converts the ambient sound into audio electrical data, and outputs the audio electrical data to the
步骤102、计算音频数据的第一唤醒置信度。其中,第一唤醒置信度用于指示音频数据中唤醒词与预设唤醒词的声学特征相似度。Step 102: Calculate the first wake-up confidence level of the audio data. Wherein, the first wake-up confidence level is used to indicate the similarity of acoustic features between the wake-up word in the audio data and the preset wake-up word.
第一电子设备可以通过处理器110对经过预处理后的音频数据进行唤醒词检测。示例性的,第一电子设备通过处理器110中的音频识别模块403对经过预处理后的音频数据进行唤醒词检测。其中,音频识别模块403是从音频处理模块402获取的经过预处理后的音频数据。The first electronic device may perform wake word detection on the preprocessed audio data through the
具体的,音频识别模块403对经过预处理后的音频数据进行唤醒词检测,可通过语音唤醒模型确定音频数据中是否存在预设唤醒词。语音唤醒模型是经过大量音频数据预先训练得到的,通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度值。其中,语音唤醒模型的输入为经过预处理后的音频数据,语音唤醒模型的输出可以包括唤醒标识,唤醒标识包括允许或者禁止电子设备被唤醒的标识。示例性的,唤醒标识为0,用于指示禁止电子设备被唤醒;唤醒标识为1,用于指示允许电子设备被唤醒。在一些实施例中,语音唤醒模型的输出可以包括唤醒置信度,唤醒置信度用于指示音频数据中唤醒词与预设唤醒词的声学特征相似度。在一些实施例中,语音唤醒模型的输出可以同时包括上述的唤醒标识和唤醒置信度。音频识别模块403通过语音唤醒模型得到唤醒标识和/或唤醒置信度。Specifically, the
步骤103、接收至少一个第二电子设备发送的唤醒识别结果。其中,唤醒识别结果用于指示允许或禁止至少一个第二电子设备被唤醒。Step 103: Receive a wake-up identification result sent by at least one second electronic device. Wherein, the wake-up identification result is used to indicate that at least one second electronic device is allowed or prohibited to be woken up.
在本实施例中,设备组中的各从设备均具有唤醒识别功能。与上述步骤101和步骤102类似,设备组中的各第二电子设备同样可以对接收到的音频数据进行预处理,然后对经过预处理后的音频数据进行唤醒词检测,可通过语音唤醒模型确定音频数据中是否存在预设唤醒词,通过语音唤醒模型计算音频数据中的唤醒词与预设唤醒词的声学特征的相似度,最终得到唤醒标识和/或唤醒置信度。相应的,各第二电子设备发送的唤醒识别结果可以包括唤醒标识和/或唤醒置信度。第一电子设备可以通过交互模块404接收设备组中的至少一个第二电子设备发送的唤醒识别结果,交互模块404接收到至少一个第二电子设备发送的唤醒识别结果时,向音频识别模块403发送至少一个第二电子设备的唤醒识别结果,以便音频识别模块403根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果,判定是否唤醒第一电子设备。In this embodiment, each slave device in the device group has a wake-up recognition function. Similar to the
步骤104、根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果,确定允许或禁止第一电子设备被唤醒。Step 104: Determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the wake-up identification result sent by the at least one second electronic device.
在本步骤中,第一电子设备可以通过处理器110中的音频识别模块403,确定允许或禁止第一电子设备被唤醒。具体的,音频识别模块403首先判断第一唤醒置信度与预设唤醒阈值的大小关系,根据判断结果确定允许或禁止第一电子设备被唤醒。其中,预设唤醒阈值可以包括一个阈值,也可以包括两个阈值。In this step, the first electronic device may determine whether to allow or prohibit the first electronic device from being woken up through the
在一种可能的实现方式中,预设唤醒阈值包括一个阈值,为第一阈值。如图8a所示,音频识别模块403可根据第一唤醒置信度与第一阈值的大小关系,确定允许或禁止第一电子设备被唤醒。具体的,若第一唤醒置信度大于或者等于第一阈值,音频识别模块403确定允许第一电子设备被唤醒;若第一唤醒置信度小于第一阈值,音频识别模块403根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果,确定允许或禁止第一电子设备被唤醒。该实现方式中,如果第一电子设备计算得到的第一唤醒置信度较高时,第一电子设备可直接判定允许第一电子设备被唤醒。如果第一电子设备计算得到的第一唤醒置信度不是很高时,第一电子设备需要结合设备组中其他从设备发送的唤醒识别结果,综合判定第一电子设备是否允许被唤醒,提升设备组内主设备唤醒的准确性。In a possible implementation manner, the preset wake-up threshold includes a threshold, which is a first threshold. As shown in FIG. 8a, the
在另一种可能的实现方式中,预设唤醒阈值包括两个阈值,分别为第一阈值和第二阈值,其中,第一阈值大于第二阈值。如图8b所示,音频识别模块403可根据第一唤醒置信度与第一阈值、第二阈值的大小关系,确定允许或禁止第一电子设备被唤醒。具体的,若第一唤醒置信度大于或者等于第一阈值,音频识别模块403确定允许第一电子设备被唤醒;若第一唤醒置信度小于或者等于第二阈值,音频识别模块403确定禁止第一电子设备被唤醒;若第一唤醒置信度小于第一阈值且大于第二阈值,音频识别模块403根据第一唤醒置信度以及至少一个第二电子设备发送的唤醒识别结果,确定允许或禁止第一电子设备被唤醒。与第一种实现方式相比,该实现方式增设了第二阈值,第二阈值是一个相对较低的判断阈值,如果第一电子设备计算得到的第一唤醒置信度小于或者等于第二阈值时,可认为第一唤醒置信度确实很低,第一电子设备无需结合设备组内其他从设备发送的唤醒识别结果进行综合判定,可直接判定禁止第一电子设备被唤醒,该方式可以避免不必要的判断过程,提高第一电子设备的处理速率,通过该方式也可以提升设备组内主设备唤醒的准确性。In another possible implementation manner, the preset wake-up threshold includes two thresholds, which are a first threshold and a second threshold, wherein the first threshold is greater than the second threshold. As shown in FIG. 8b , the
需要说明的是,本申请实施例中设备组内的各个电子设备的预设唤醒阈值可以是相同的阈值,例如,第一电子设备和任意第二电子设备中预设唤醒阈值包括一个阈值,该阈值为第一阈值,第一电子设备根据第一阈值和当前计算得到的第一唤醒置信度进行初步判定,同样的,第二电子设备根据第一阈值和当前计算得到的第二唤醒置信度进行唤醒识别判定,并将唤醒识别结果发送至第一电子设备。在一些实施例中,设备组内的各个电子设备的预设唤醒阈值还可以是不同的阈值,例如,第一电子设备中预设唤醒阈值包括一个阈值,该阈值为第一阈值,第二电子设备中预设唤醒阈值也包括一个阈值,该阈值可以是大于或者小于第一阈值的数值,由此可见,设备组内的各个电子设备可分别基于各自预设的唤醒阈值,进行唤醒识别判定。It should be noted that the preset wake-up thresholds of each electronic device in the device group in this embodiment of the present application may be the same threshold. For example, the preset wake-up thresholds in the first electronic device and any second electronic device include a threshold, which The threshold is the first threshold, and the first electronic device makes a preliminary determination according to the first threshold and the currently calculated first wake-up confidence. Similarly, the second electronic device makes a preliminary determination according to the first threshold and the currently calculated second wake-up confidence. The wake-up identification is determined, and the wake-up identification result is sent to the first electronic device. In some embodiments, the preset wake-up thresholds of each electronic device in the device group may also be different thresholds. For example, the preset wake-up threshold in the first electronic device includes a threshold, the threshold is the first threshold, and the second electronic device The preset wake-up threshold in the device also includes a threshold, and the threshold can be a value greater than or less than the first threshold. It can be seen that each electronic device in the device group can perform wake-up identification determination based on the respective preset wake-up thresholds.
综上,无论上述哪一种实现方式,均包括如下步骤:根据第一唤醒置信度和至少一个第二电子设备发送的唤醒识别结果,确定允许或禁止第一电子设备被唤醒。该步骤包括如下三种可能的实现方式:To sum up, no matter which of the above implementation manners, the following steps are included: according to the first wake-up confidence level and the wake-up identification result sent by at least one second electronic device, it is determined whether to allow or prohibit the first electronic device from being woken up. This step includes the following three possible implementations:
在第一种可能的实现方式中,第一电子设备通过至少一个第二电子设备发送的唤醒识别结果统计设备组内从设备的唤醒情况,判断从设备的唤醒情况是否满足预设的唤醒条件,如果满足预设的唤醒条件,则确定允许第一电子设备被唤醒,如果不满足预设的唤醒条件,则确定禁止第一电子设备被唤醒。In a first possible implementation manner, the first electronic device counts the wake-up status of the slave devices in the device group through the wake-up identification result sent by at least one second electronic device, and determines whether the wake-up status of the slave device satisfies a preset wake-up condition, If the preset wake-up condition is satisfied, it is determined to allow the first electronic device to be woken up, and if the preset wake-up condition is not satisfied, it is determined to prohibit the first electronic device from being woken up.
在第二种可能的实现方式中,第一电子设备基于连续方式动态调整设备组内第一电子设备的预设唤醒阈值,并根据调整后的预设唤醒阈值与当前第一电子设备计算得到的第一唤醒置信度进行比较,确定允许或禁止第一电子设备被唤醒。In a second possible implementation manner, the first electronic device dynamically adjusts the preset wake-up threshold of the first electronic device in the device group based on a continuous method, and calculates the preset wake-up threshold according to the adjusted preset wake-up threshold and the current first electronic device. The first wake-up confidence is compared to determine whether to allow or prohibit the first electronic device from being woken up.
在第三种可能的实现方式中,第一电子设备基于离散方式动态调整设备组内第一电子设备的预设唤醒阈值,并根据调整后的预设唤醒阈值与当前第一电子设备计算得到的第一唤醒置信度进行比较,确定允许或禁止第一电子设备被唤醒。In a third possible implementation manner, the first electronic device dynamically adjusts the preset wake-up threshold of the first electronic device in the device group based on a discrete method, and calculates the preset wake-up threshold according to the adjusted preset wake-up threshold and the current first electronic device. The first wake-up confidence is compared to determine whether to allow or prohibit the first electronic device from being woken up.
其中,第一种实现方式是基于预设规则,该预设规则充分考虑了设备组内其他从设备的唤醒识别结果,对设备组内主设备的唤醒条件进行优化,提升设备组内主设备唤醒的准确性。后两种实现方式均是从唤醒阈值的角度,结合设备组内其他从设备的唤醒识别结果对主设备的预设的唤醒阈值进行优化,其目的与第一种实现方式相同。Among them, the first implementation method is based on a preset rule, which fully considers the wake-up recognition results of other slave devices in the device group, optimizes the wake-up conditions of the master device in the device group, and improves the wake-up of the master device in the device group. accuracy. The latter two implementations both optimize the preset wakeup threshold of the master device from the perspective of the wakeup threshold combined with the wakeup identification results of other slave devices in the device group, the purpose of which is the same as that of the first implementation.
下面结合附图9至图11,分别对第一电子设备如何结合设备组中其他从设备发送的唤醒识别结果进行唤醒决策进行详细介绍。需要说明的是,下面几个实施例的判断流程均可通过第一电子设备的处理器110中的音频识别模块403来执行。The following describes in detail how the first electronic device makes a wake-up decision in combination with the wake-up identification results sent by other slave devices in the device group with reference to FIGS. 9 to 11 . It should be noted that, the judgment processes of the following several embodiments can be executed by the
图9示出了一种语音唤醒方法的判断流程图,如图9所示,若第一电子设备计算得到的第一唤醒置信度小于第一阈值,该方法包括如下步骤:FIG. 9 shows a judgment flowchart of a voice wake-up method. As shown in FIG. 9 , if the first wake-up confidence level calculated by the first electronic device is less than the first threshold, the method includes the following steps:
步骤201、根据至少一个第二电子设备发送的唤醒识别结果,统计至少一个第二电子设备的唤醒情况。
具体的,唤醒识别结果可以包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项。其中,唤醒标识包括允许或禁止第二电子设备被唤醒的标识,第二唤醒置信度用于指示第二电子设备确定的音频数据中唤醒词与预设唤醒词的声学特征相似度。Specifically, the wake-up identification result may include at least one of a wake-up identifier of at least one second electronic device and a second wake-up confidence level. The wake-up identifier includes an identifier that allows or prohibits the second electronic device from being woken up, and the second wake-up confidence level is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
在一种可能的情况下,如果至少一个第二电子设备发送的唤醒识别结果中仅包括唤醒标识,第一电子设备可根据唤醒标识,统计允许被唤醒的第二电子设备的设备数量,以及该设备数量占设备组的设备总数的比例。In a possible situation, if the wake-up identification result sent by at least one second electronic device only includes a wake-up identifier, the first electronic device can count the number of devices of the second electronic device that are allowed to be woken up according to the wake-up identifier, and the The number of devices as a percentage of the total number of devices in the device group.
在另一种可能的情况下,如果至少一个第二电子设备发送的唤醒识别结果中仅包括第二唤醒置信度,第一电子设备首先需要根据第二唤醒置信度和各第二电子设备预设的唤醒阈值,确定第二电子设备是否允许被唤醒。然后统计允许被唤醒的第二电子设备的设备数量,以及该设备数量占设备组的设备总数的比例。其中,各第二电子设备的预设唤醒阈值可以是与第一电子设备预设的唤醒阈值相同的数值,例如第一阈值,也可以是与第一电子设备的预设唤醒阈值不同的数值。各第二电子设备的预设唤醒阈值可以相同也可以不同,对此本实施例不作任何限制。In another possible situation, if the wake-up identification result sent by at least one second electronic device only includes the second wake-up confidence level, the first electronic device first needs to preset the second wake-up confidence level and each second electronic device according to the second wake-up confidence level. The wake-up threshold determines whether the second electronic device is allowed to be woken up. Then, count the number of devices of the second electronic device that are allowed to be woken up, and the proportion of the number of devices to the total number of devices in the device group. The preset wake-up threshold of each second electronic device may be the same value as the preset wake-up threshold of the first electronic device, eg, the first threshold, or may be a different value from the preset wake-up threshold of the first electronic device. The preset wake-up thresholds of the second electronic devices may be the same or different, which is not limited in this embodiment.
在又一种可能的情况下,如果至少一个第二电子设备发送的唤醒标识结果中同时包括唤醒标识和第二唤醒置信度,第一电子设备可通过上述任意一种情况的统计方式,确定允许被唤醒的第二电子设备的设备数量,以及该设备数量占设备组的设备总数的比例。In yet another possible situation, if the wake-up identification result sent by at least one second electronic device includes both the wake-up identification and the second wake-up confidence, the first electronic device may determine whether to allow the The number of devices of the second electronic device that are awakened, and the proportion of the number of devices to the total number of devices in the device group.
步骤202、判断至少一个第二电子设备的唤醒情况是否满足预设的第一电子设备的唤醒条件,如果满足预设的第一电子设备的唤醒条件,则执行步骤203;如果不满足预设的第一电子设备的唤醒条件,则执行步骤204。Step 202: Determine whether the wake-up condition of at least one second electronic device satisfies the preset wake-up condition of the first electronic device, if it meets the preset wake-up condition of the first electronic device, perform
在本实施例中,预设的第一电子设备的唤醒条件包括以下的任意一项:In this embodiment, the preset wake-up condition of the first electronic device includes any one of the following:
(1)设备组中除第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值。其中,第三阈值为第一电子设备中预设的允许第二电子设备被唤醒的阈值,该阈值可能大于某些第二电子设备的判断阈值。(1) The second wake-up confidence levels of all the second electronic devices in the device group except the first electronic device are greater than or equal to the third threshold. The third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up, and the threshold may be greater than the judgment threshold of some second electronic devices.
(2)设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例大于或者等于第一比例。示例性的,第一比例可以设置为80%,该条件的含义是:如果设备组中有80%的从设备允许被唤醒,可确定主设备第一电子设备允许被唤醒。(2) The ratio of the number of devices of the second electronic device allowed to be woken up in the device group to the total number of devices in the device group is greater than or equal to the first ratio. Exemplarily, the first ratio may be set to 80%, and the meaning of this condition is: if 80% of the slave devices in the device group are allowed to be woken up, it can be determined that the first electronic device of the master device is allowed to be woken up.
(3)设备组中允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例小于第一比例且大于第二比例,且允许被唤醒的第二电子设备的第二唤醒置信度均大于或者等于第三阈值。其中,第一比例大于第二比例。(3) The ratio of the number of devices of the second electronic device allowed to be awakened in the device group to the total number of devices in the device group is smaller than the first proportion and greater than the second proportion, and the second wake-up confidence level of the second electronic device allowed to be awakened are greater than or equal to the third threshold. Wherein, the first ratio is greater than the second ratio.
需要说明的是,上述第三种条件的设计初衷如下:各第二电子设备预设的唤醒阈值可能不同,即允许被唤醒的各第二电子设备的判断阈值存在差异,例如第二电子设备a的预设唤醒阈值为0.7,第二电子设备b的预设唤醒阈值为0.8,第二电子设备c的预设唤醒阈值为0.5。若各第二电子设备计算得到的音频数据的第二唤醒置信度相同,基于不同的预设唤醒阈值,各第二电子设备确定的唤醒识别结果将存在差异。基于此,第一电子设备统计得到的允许被唤醒的第二电子设备的设备数量的参考性不高。为了提高判断的准确性,可通过设置第三阈值,排查允许被唤醒的第二电子设备中是否存在第二唤醒置信度低于第三阈值的电子设备,如果存在,则确定禁止第一电子设备被唤醒。上述条件可有效避免由于部分从设备设置的预设唤醒阈值过低,导致主设备被误唤醒,提升主设备唤醒的准确性。It should be noted that the original intention of the above-mentioned third condition is as follows: the preset wake-up thresholds of the second electronic devices may be different, that is, the judgment thresholds of the second electronic devices that are allowed to be woken up are different, for example, the second electronic device a The preset wake-up threshold of the second electronic device b is 0.7, the preset wake-up threshold of the second electronic device b is 0.8, and the preset wake-up threshold of the second electronic device c is 0.5. If the second wake-up confidence levels of the audio data calculated by the second electronic devices are the same, based on different preset wake-up thresholds, the wake-up recognition results determined by the second electronic devices will be different. Based on this, the reference of the number of devices of the second electronic device that is allowed to be woken up obtained by the first electronic device is not high. In order to improve the accuracy of the judgment, a third threshold can be set to check whether there is an electronic device whose second wake-up confidence is lower than the third threshold in the second electronic device that is allowed to be woken up, and if so, it is determined to prohibit the first electronic device be awakened. The above conditions can effectively prevent the master device from being woken up by mistake due to the low preset wake-up threshold set by some slave devices, and improve the accuracy of the master device's wake-up.
示例性的,第一比例设置为80%,第二比例设置为50%,第三阈值设置为0.7,从设备包括第二电子设备a、b、c、d,各第二电子设备的预设唤醒阈值以及发送的唤醒识别结果参见表1,由表1可知,允许被唤醒的第二电子设备的设备数量占设备组的设备总数的比例为60%(3/5),介于第一比例和第二比例之间,然而允许被唤醒的第二电子设备中存在第二唤醒置信度小于第三阈值0.7的电子设备,即第二电子设备c,因此不满足上述第三种条件,则确定第一电子设备不被唤醒。Exemplarily, the first ratio is set to 80%, the second ratio is set to 50%, the third threshold is set to 0.7, the slave devices include second electronic devices a, b, c, and d, and the preset values of each second electronic device are The wake-up threshold and the sent wake-up identification results are shown in Table 1. From Table 1, it can be seen that the number of devices of the second electronic device that is allowed to be woken up accounts for 60% (3/5) of the total number of devices in the device group, which is between the first ratio However, among the second electronic devices that are allowed to be woken up, there is an electronic device whose second wake-up confidence is less than the third threshold 0.7, that is, the second electronic device c, and therefore does not meet the third condition above, then determine The first electronic device is not awakened.
表1Table 1
步骤203、确定允许第一电子设备被唤醒。Step 203: Determine to allow the first electronic device to be woken up.
步骤204、确定禁止第一电子设备被唤醒。
上述方案提供了一种基于预设的主设备唤醒条件,在确定主设备计算得到的第一唤醒置信度小于第一阈值时,根据设备组中其他从设备发送的唤醒识别结果,统计设备组中从设备的唤醒情况,判断从设备的唤醒情况是否满足预设的主设备唤醒条件,如果满足,则确定允许设备组内主设备被唤醒。该方案能够快速对设备组内主设备是否执行唤醒应答进行判断,充分利用了设备组内其他设备的唤醒识别结果,提升了设备组内主设备唤醒的准确性。The above solution provides a wake-up condition based on the preset master device, when it is determined that the first wake-up confidence calculated by the master device is less than the first threshold, according to the wake-up recognition results sent by other slave devices in the device group, count the device group. The wake-up condition of the slave device determines whether the wake-up condition of the slave device satisfies the preset master device wake-up condition, and if so, it is determined to allow the master device in the device group to be woken up. This solution can quickly determine whether the master device in the device group performs a wake-up response, makes full use of the wake-up recognition results of other devices in the device group, and improves the accuracy of the master device in the device group wake-up.
应理解,如果设备组中除了主设备之外的大多数从设备均判定允许被唤醒,主设备的预设唤醒阈值过高的话,主设备可能会出现误判,因此可通过下述两个实施例提供的方法,结合设备组内其他从设备的唤醒情况,判定是否需要动态调整当前主设备的唤醒阈值,从而提升设备组内主设备唤醒的准确性。It should be understood that if most of the slave devices in the device group except the master device are determined to be allowed to wake up, and the preset wake-up threshold of the master device is too high, the master device may misjudgment, so the following two implementations can be implemented: The method provided by the example, combined with the wake-up conditions of other slave devices in the device group, determines whether the wake-up threshold of the current master device needs to be dynamically adjusted, thereby improving the wake-up accuracy of the master device in the device group.
图10示出了另一种语音唤醒方法的判断流程图,如图10所示,若第一电子设备计算得到的第一唤醒置信度小于第一阈值,该方法包括如下步骤:FIG. 10 shows a judgment flowchart of another voice wake-up method. As shown in FIG. 10 , if the first wake-up confidence level calculated by the first electronic device is less than the first threshold, the method includes the following steps:
步骤301、根据至少一个第二电子设备发送的唤醒识别结果,统计设备组内允许被唤醒的第二电子设备的第一权重值。
本实施例中,引入了设备组内电子设备的权重值,权重值用于指示电子设备唤醒识别结果的置信度高低。权重值可以是与电子设备的设备类型相关的值,还可以是与电子设备的软/硬件性能相关的值。作为一种示例,智能家居中包括多种设备类型的智能设备,均具有语音唤醒功能,包括智能电视、智能音箱、智能灯、智能空调、智能冰箱等等,可以将上述多种类型的智能设备中具有相同唤醒词的设备组成一个设备组。假设该设备组包括1台智能电视、2个智能音箱、2盏智能灯,可预设设备组内的不同设备类型的权重值,例如智能电视的权重值为0.3,智能音箱的权重值为0.6,智能灯的权重值为0.1。作为另一种示例,设备组中的各个电子设备可以为同一类型的设备,例如智能音箱,如图4c所示,智能音箱可细分为音箱Pro、常规音箱、音箱mini,处理性能的大小关系为:音箱Pro>常规音箱>音箱mini,可根据音箱的处理性能预设各种音箱的权重值,例如音箱Pro的权重值为0.5,常规音箱的权重值为0.3,音箱mini的权重值为0.2。In this embodiment, the weight value of the electronic device in the device group is introduced, and the weight value is used to indicate the confidence level of the wake-up recognition result of the electronic device. The weight value may be a value related to the device type of the electronic device, and may also be a value related to the software/hardware performance of the electronic device. As an example, a smart home includes smart devices of various device types, all of which have a voice wake-up function, including smart TVs, smart speakers, smart lights, smart air conditioners, smart refrigerators, etc. Devices with the same wake word in the device form a device group. Assuming that the device group includes 1 smart TV, 2 smart speakers, and 2 smart lights, the weight values of different device types in the device group can be preset. For example, the weight value of a smart TV is 0.3, and the weight value of a smart speaker is 0.6 , the weight of the smart light is 0.1. As another example, each electronic device in the device group may be the same type of device, such as a smart speaker. As shown in Figure 4c, the smart speaker can be subdivided into speaker Pro, conventional speaker, and speaker mini, and the size relationship of processing performance It is: Speaker Pro > Conventional Speaker > Speaker Mini, the weight value of various speakers can be preset according to the processing performance of the speaker, for example, the weight value of the speaker Pro is 0.5, the weight value of the conventional speaker is 0.3, and the weight value of the speaker mini is 0.2 .
需要说明的是,设备组内的各个电子设备的权重值可以由第三方设备(例如电子设备厂商提供的服务平台)预先设置或者推荐默认值。作为一种示例,当某一电子设备新加入至设备组时,第三方设备可向该电子设备或者该电子设备加入的设备组的主设备下发该电子设备的权重值。作为一种示例,若该电子设备为设备组中的从设备,当该电子设备向设备组中的主设备发送唤醒识别结果时,可在唤醒识别结果中携带该电子设备的权重值,以便主设备及时获知该新增的电子设备的权重值。在一些实施例中,可以由主设备或者第三方设备统一维护各电子设备的权重值。It should be noted that, the weight value of each electronic device in the device group may be preset or recommended by a third-party device (eg, a service platform provided by an electronic device manufacturer). As an example, when an electronic device is newly added to a device group, the third-party device may deliver the weight value of the electronic device to the electronic device or the master device of the device group to which the electronic device is added. As an example, if the electronic device is a slave device in a device group, when the electronic device sends the wake-up recognition result to the master device in the device group, the weight value of the electronic device can be carried in the wake-up recognition result so that the master device can The device acquires the weight value of the newly added electronic device in time. In some embodiments, the weight value of each electronic device may be uniformly maintained by the main device or the third-party device.
在本实施例中,至少一个第二电子设备发送的唤醒识别结果可以包括至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项,以及至少一个第二电子设备的设备标识。其中,设备标识用于指示第二电子设备的设备类型。具体的,第一电子设备首先根据至少一个第二电子设备发送的唤醒识别结果,统计设备组内允许被唤醒的第二电子设备有哪些,具体可参见上述实施例的步骤201,此处不再赘述。在确定允许被唤醒的第二电子设备后,再根据第二电子设备的设备标识,统计设备组内允许被唤醒的第二电子设备的设备类型,以及允许被唤醒的第二电子设备的设备类型对应的设备数量。基于预设的每一种电子设备的权重值,最终统计出设备组内允许被唤醒的第二电子设备的第一权重值,具体可参见如下公式:In this embodiment, the wake-up identification result sent by the at least one second electronic device may include at least one of a wake-up identification of the at least one second electronic device, a second wake-up confidence level, and a device identification of the at least one second electronic device. The device identifier is used to indicate the device type of the second electronic device. Specifically, the first electronic device first counts the second electronic devices that are allowed to be woken up in the device group according to the wake-up identification result sent by at least one second electronic device. For details, please refer to step 201 in the above embodiment, which is not repeated here. Repeat. After determining the second electronic device that can be woken up, count the device types of the second electronic device that can be woken up and the device types of the second electronic device that can be woken up in the device group according to the device identifier of the second electronic device The corresponding number of devices. Based on the preset weight value of each electronic device, the first weight value of the second electronic device that is allowed to be woken up in the device group is finally calculated. For details, please refer to the following formula:
α=γ1×n1+γ2×n2+…+γx×nx 公式一α=γ 1 ×n 1 +γ 2 ×n 2 +…+γ x ×n x Formula 1
m=n1+n2+…+nx 公式二m=n 1 +n 2 +…+n x Formula 2
式中,α表示设备组内允许被唤醒的第二电子设备的第一权重值;In the formula, α represents the first weight value of the second electronic device that is allowed to be woken up in the device group;
γi表示第i种允许被唤醒的第二电子设备的权重值;γ i represents the weight value of the i-th second electronic device that is allowed to be woken up;
ni表示第i种允许被唤醒的第二电子设备的设备数量,其中,i=1,2,…,x,x为大于或者等于1的正整数;n i represents the number of devices of the i-th second electronic device that is allowed to be woken up, where i=1, 2, . . . , x, where x is a positive integer greater than or equal to 1;
x表示设备组内允许被唤醒的第二电子设备的设备类型的数量;x represents the number of device types of the second electronic device allowed to be woken up within the device group;
m表示设备组内允许被唤醒的第二电子设备的设备总数。m represents the total number of devices of the second electronic device allowed to be woken up in the device group.
示例性的,表2示出了设备组内除第一电子设备之外的所有第二电子设备的参数统计表,表2包括各个第二电子设备的权重值、设备总数以及允许被唤醒的设备数量。Exemplarily, Table 2 shows the parameter statistics table of all second electronic devices except the first electronic device in the device group, and Table 2 includes the weight value of each second electronic device, the total number of devices, and the devices that are allowed to be woken up. quantity.
表2Table 2
由表2可知,设备组中允许被唤醒的第二电子设备的设备总数为3,包括1个第二电子设备a和2个第二电子设备c,其中,第二电子设备a的权重值为0.5,第二电子设备c的权重值为0.2,根据上述公式一,设备组内允许被唤醒的第二电子设备的第一权重值α=0.5×1+0.2×2=0.9。It can be seen from Table 2 that the total number of second electronic devices allowed to be awakened in the device group is 3, including 1 second electronic device a and 2 second electronic devices c, where the weight of the second electronic device a is 0.5, the weight value of the second electronic device c is 0.2, according to the above formula 1, the first weight value α=0.5×1+0.2×2=0.9 of the second electronic device allowed to be woken up in the device group.
步骤302、统计设备组内所有第二电子设备的第二权重值。Step 302: Count the second weight values of all the second electronic devices in the device group.
作为一种示例,第一电子设备可根据至少一个第二电子设备发送的唤醒识别结果中的设备标识,统计设备组内所有第二电子设备的设备类型,以及每一种设备类型对应的第二电子设备的设备总数。作为另一种示例,第一电子设备可根据当前设备组内的互联情况,直接确定设备组内所有第二电子设备的设备类型,以及每一种设备类型对应的第二电子设备的设备总数。需要说明的是,设备组内新增的第二电子设备,首次与第一电子设备互联时,可将第二电子设备的设备信息发送给第一电子设备,以便第一电子设备预存设备组内所有第二电子设备的设备信息,其中,上述设备信息包括设备标识和/或权重值。As an example, the first electronic device may count the device types of all second electronic devices in the device group according to the device identifier in the wake-up identification result sent by at least one second electronic device, and the second electronic device corresponding to each device type. The total number of electronic devices. As another example, the first electronic device may directly determine the device types of all the second electronic devices in the device group and the total number of second electronic devices corresponding to each device type according to the interconnection situation in the current device group. It should be noted that, when the second electronic device added to the device group is interconnected with the first electronic device for the first time, the device information of the second electronic device can be sent to the first electronic device, so that the first electronic device can be pre-stored in the device group. Device information of all the second electronic devices, wherein the device information includes a device identifier and/or a weight value.
具体的,可参见如下公式确定设备组内所有第二电子设备的第二权重值:Specifically, the second weight value of all the second electronic devices in the device group can be determined by referring to the following formula:
β=γ1×n1+γ2×n2+…+γz×nz 公式三β=γ 1 ×n 1 +γ 2 ×n 2 +…+γ z ×n z Formula 3
o=n1+n2+…+nx公式四o=n 1 +n 2 +…+n x Formula 4
式中,β表示设备组内所有第二电子设备的第二权重值;In the formula, β represents the second weight value of all second electronic devices in the device group;
γj表示第j种第二电子设备的权重值;γ j represents the weight value of the jth second electronic device;
nj表示第j种第二电子设备的设备总数;其中,j=1,2,…,z,z为大于或者等于1的正整数;n j represents the total number of devices of the jth second electronic device; wherein, j=1,2,...,z, z is a positive integer greater than or equal to 1;
z表示设备组内第二电子设备的设备类型的数量;z represents the number of device types of the second electronic device in the device group;
o表示设备组内所有第二电子设备的设备总数。o represents the total number of devices of all second electronic devices in the device group.
示例性的,仍以表2为例,可知:设备组中所有第二电子设备的设备总数为6,包括2个第二电子设备a、1个第二电子设备b以及3个第二电子设备c,其中,第二电子设备a的权重值为0.5,第二电子设备b的权重值为0.3,第二电子设备c的权重值为0.2,根据上述公式三,设备组内所有第二电子设备的第二权重值β=0.5×2+0.3×1+0.2×3=1.9。Exemplarily, still taking Table 2 as an example, it can be known that the total number of devices of all second electronic devices in the device group is 6, including 2 second electronic devices a, 1 second electronic device b and 3 second electronic devices c, where the weight value of the second electronic device a is 0.5, the weight value of the second electronic device b is 0.3, and the weight value of the second electronic device c is 0.2. According to the above formula 3, all the second electronic devices in the device group The second weight value β=0.5×2+0.3×1+0.2×3=1.9.
步骤303、基于第一权重值和第二权重值,调整第一阈值。Step 303: Adjust the first threshold based on the first weight value and the second weight value.
其中,第一阈值为允许第一电子设备被唤醒的阈值(即第一电子设备的预设唤醒阈值)。The first threshold is a threshold that allows the first electronic device to be woken up (ie, a preset wake-up threshold of the first electronic device).
在一种具体的实现方式中,第一电子设备根据第一权重值、第二权重值以及最大阈值调整参数,确定第一电子设备的唤醒阈值的调整参数,可通过如下公式确定调整参数:In a specific implementation manner, the first electronic device determines the adjustment parameter of the wake-up threshold of the first electronic device according to the first weight value, the second weight value and the maximum threshold adjustment parameter, and the adjustment parameter can be determined by the following formula:
θ=α/β 公式五θ=α/β Formula 5
Δ′=θ×Δ 公式六Δ′=θ×Δ Formula 6
式中,θ表示设备组内允许被唤醒的第二电子设备占所有第二电子设备的权重比例;In the formula, θ represents the weight ratio of the second electronic devices allowed to be woken up in the device group to all the second electronic devices;
Δ表示最大阈值调整参数,该参数为预设值,通常设置在0至1之间,例如Δ=0.1。Δ represents the maximum threshold adjustment parameter, which is a preset value, usually set between 0 and 1, for example, Δ=0.1.
Δ′表示第一电子设备的唤醒阈值的调整参数。Δ' represents an adjustment parameter of the wake-up threshold of the first electronic device.
第一电子设备基于上述调整参数调整第一电子设备的预设唤醒阈值,可通过如下公式确定调整后的第一电子设备的唤醒阈值:The first electronic device adjusts the preset wake-up threshold of the first electronic device based on the above adjustment parameters, and the adjusted wake-up threshold of the first electronic device can be determined by the following formula:
Thredholdcurrent=Thredhold-Δ′ 公式七Thredhold current =Thredhold-Δ′ Equation 7
式中,Thredholdcurrent表示调整后的第一电子设备的唤醒阈值(即调整后的第一阈值);In the formula, Thredhold current represents the adjusted wake-up threshold of the first electronic device (that is, the adjusted first threshold);
Thredhold表示第一电子设备的预设唤醒阈值(即第一阈值)。Thredhold represents a preset wake-up threshold (ie, a first threshold) of the first electronic device.
可见,本实施例是通过设备的权重值计算,确定唤醒阈值的调整参数,调整参数的精确度高,相邻时刻的调整参数通常为连续变化的。It can be seen that in this embodiment, the adjustment parameter of the wake-up threshold is determined by calculating the weight value of the device, the adjustment parameter has high accuracy, and the adjustment parameter at adjacent moments is usually continuously changed.
步骤304、根据第一唤醒置信度和调整后的第一阈值,确定允许或者禁止第一电子设备被唤醒。Step 304: Determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence level and the adjusted first threshold.
具体的,通过比较第一唤醒置信度与调整后的第一阈值的大小关系,确定允许或禁止第一电子设备被唤醒。若第一电子设备计算得到的第一唤醒置信度大于或者等于调整后的第一阈值,则确定允许第一电子设备被唤醒;若第一唤醒置信度小于调整后的第一阈值,则确定禁止第一电子设备被唤醒。Specifically, by comparing the magnitude relationship between the first wake-up confidence level and the adjusted first threshold, it is determined whether to allow or prohibit the first electronic device from being woken up. If the first wake-up confidence calculated by the first electronic device is greater than or equal to the adjusted first threshold, it is determined that the first electronic device is allowed to wake up; if the first wake-up confidence is less than the adjusted first threshold, it is determined to prohibit The first electronic device is awakened.
上述方案从唤醒阈值的角度出发,结合设备组内从设备的实际唤醒情况,综合分析允许被唤醒的从设备的唤醒识别结果的置信度高低,进而基于连续方式动态地调整设备组内主设备的唤醒阈值,并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果,判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态,提升了设备组内主设备唤醒的准确性。From the perspective of the wake-up threshold, the above scheme comprehensively analyzes the confidence level of the wake-up recognition results of the slave devices that are allowed to be woken up, combined with the actual wake-up situation of the slave devices in the device group, and then dynamically adjusts the master device in the device group based on a continuous method. Wake-up threshold, and determine whether the current master device needs to make a wake-up response according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device. Since the adjusted wake-up threshold of the master device is more in line with the actual wake-up state of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.
图11示出了又一种语音唤醒方法的判断流程图,如图11所示,若第一电子设备计算得到的第一唤醒置信度小于第一阈值,该方法包括如下步骤:FIG. 11 shows a judgment flowchart of another voice wake-up method. As shown in FIG. 11 , if the first wake-up confidence level calculated by the first electronic device is less than the first threshold, the method includes the following steps:
步骤401、根据至少一个第二电子设备发送的唤醒识别结果,统计设备组内允许被唤醒的第二电子设备的设备数量。
在本实施例中,至少一个第二电子设备发送的唤醒识别结果包括至少一个第二电子设备的唤醒标识和/或第二唤醒置信度。第一电子设备首先根据至少一个第二电子设备发送的唤醒识别结果,统计设备组内允许被唤醒的第二电子设备有哪些,从而确定设备组内允许被唤醒的第二电子设备的设备数量,具体可参见上述实施例的步骤201,此处不再赘述。In this embodiment, the wake-up identification result sent by the at least one second electronic device includes the wake-up identification and/or the second wake-up confidence level of the at least one second electronic device. The first electronic device firstly counts which second electronic devices are allowed to be woken up in the device group according to the wake-up identification result sent by at least one second electronic device, thereby determining the number of devices of the second electronic device allowed to be woken up in the device group, For details, reference may be made to step 201 in the foregoing embodiment, which is not repeated here.
步骤402、根据唤醒阈值调整表确定设备数量对应的阈值调整参数。Step 402: Determine threshold adjustment parameters corresponding to the number of devices according to the wake-up threshold adjustment table.
本实施例中,第一电子设备的存储器(例如内部存储器121、或者处理器110中的存储器等)中预存有唤醒阈值调整表。In this embodiment, a wake-up threshold adjustment table is pre-stored in the memory of the first electronic device (for example, the
在一种可能的实现方式中,唤醒阈值调整表可以包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系。作为一种示例,唤醒阈值调整表可以包括允许被唤醒的第二电子设备的设备数量的数值范围与阈值调整参数的对应关系。示例性的,假设设备组内除第一电子设备之外的第二电子设备的总数为7,第一电子设备统计得到的允许被唤醒的第二电子设备的设备数量可能落入表3所示的任意一个数值范围内,第一电子设备可根据表3所示的唤醒阈值调整表,确定本次的阈值调整参数。In a possible implementation manner, the wake-up threshold adjustment table may include a correspondence between the number of devices of the second electronic device that are allowed to be woken up and the threshold adjustment parameters. As an example, the wake-up threshold adjustment table may include a corresponding relationship between a numerical range of the number of devices of the second electronic device that is allowed to be woken up and a threshold adjustment parameter. Exemplarily, assuming that the total number of second electronic devices other than the first electronic device in the device group is 7, the number of devices of the second electronic device that is allowed to be woken up obtained from the statistics of the first electronic device may fall within the values shown in Table 3. Within any value range of , the first electronic device may determine the current threshold adjustment parameter according to the wake-up threshold adjustment table shown in Table 3.
表3table 3
由表3可知,允许被唤醒的第二电子设备的设备数量越多,阈值调整参数越大。It can be seen from Table 3 that the more the number of devices of the second electronic device that is allowed to be woken up, the larger the threshold adjustment parameter.
在另一种可能的实现方式中,唤醒阈值调整表可以包括允许被唤醒的第二电子设备的设备数量占所有第二电子设备的设备总数的比例,与,阈值调整参数的对应关系。应理解,该比例越大,阈值调整参数越大。In another possible implementation manner, the wake-up threshold adjustment table may include a ratio of the number of devices of the second electronic device allowed to be woken up to the total number of devices of all second electronic devices, and a corresponding relationship with the threshold adjustment parameter. It should be understood that the larger the ratio, the larger the threshold adjustment parameter.
可见,本实施例是通过预设的唤醒阈值调整表,确定阈值调整参数,各个阈值调整参数均为离散的数值。It can be seen that, in this embodiment, the threshold adjustment parameters are determined through a preset wake-up threshold adjustment table, and each threshold adjustment parameter is a discrete value.
步骤403、基于阈值调整参数调整第一阈值。Step 403: Adjust the first threshold based on the threshold adjustment parameter.
本实施例对第一电子设备的预设唤醒阈值的调整方式同上述实施例的步骤303,在确定阈值调整参数后,可采用公式七,减小第一电子设备的预设唤醒阈值。The method for adjusting the preset wake-up threshold of the first electronic device in this embodiment is the same as
步骤404、根据第一唤醒置信度和调整后的第一阈值,确定允许或禁止第一电子设备被唤醒。Step 404: Determine whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold.
本实施例的步骤404与上述实施例的步骤304相同,具体可参见上述实施例,此处不再赘述。Step 404 in this embodiment is the same as
上述方案同样从唤醒阈值的角度出发,结合设备组内从设备的实际唤醒情况,统计允许被唤醒的从设备的设备数量或占比,进而基于离散方式动态地调整设备组内主设备的唤醒阈值,并根据调整后的唤醒阈值与当前主设备计算得到的唤醒置信度的比较结果,判定当前主设备是否需要作出唤醒应答。由于调整后的主设备的唤醒阈值更契合整体设备组被唤醒的实际状态,提升了设备组内主设备唤醒的准确性。The above solution also starts from the wake-up threshold, combined with the actual wake-up situation of the slave devices in the device group, counts the number or proportion of the slave devices that are allowed to be woken up, and then dynamically adjusts the wake-up threshold of the master device in the device group based on a discrete method. , and according to the comparison result between the adjusted wake-up threshold and the wake-up confidence level calculated by the current master device, it is determined whether the current master device needs to make a wake-up response. Since the adjusted wake-up threshold of the master device is more in line with the actual state of wake-up of the overall device group, the accuracy of the wake-up of the master device in the device group is improved.
上文中详细描述了本申请实施例提供的语音唤醒方法,下面将描述本申请实施例提供的语音唤醒设备。图12为本申请实施例提供的一种语音唤醒设备的结构示意图。如图12所示,本申请实施例提供的语音唤醒设备1200,包括:The voice wake-up method provided by the embodiments of the present application is described in detail above, and the voice wake-up device provided by the embodiments of the present application will be described below. FIG. 12 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the present application. As shown in FIG. 12 , the voice wake-
获取模块1201,用于获取音频数据的第一唤醒置信度,所述第一唤醒置信度用于指示所述第一电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度;The obtaining
接收模块1202,用于接收所述至少一个第二电子设备发送的唤醒识别结果,所述唤醒识别结果用于指示允许或禁止所述至少一个第二电子设备被唤醒;a
处理模块1203,用于根据所述第一唤醒置信度和所述唤醒识别结果,确定允许或者禁止所述第一电子设备被唤醒。The
可选的,所述唤醒识别结果包括所述至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项;其中,唤醒标识包括允许或禁止所述第二电子设备被唤醒的标识,所述第二唤醒置信度用于指示所述第二电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification and a second wake-up confidence of the at least one second electronic device; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up. , the second wake-up confidence level is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
可选的,所述处理模块1203,具体用于:Optionally, the
若所述第一唤醒置信度大于或等于第一阈值,确定允许所述第一电子设备被唤醒;或者If the first wake-up confidence is greater than or equal to a first threshold, determining that the first electronic device is allowed to wake up; or
若所述第一唤醒置信度小于所述第一阈值,且所述第一唤醒置信度大于第二阈值,根据所述第一唤醒置信度和所述唤醒识别结果,确定允许或者禁止所述第一电子设备被唤醒;或者If the first wake-up confidence level is less than the first threshold, and the first wake-up confidence level is greater than the second threshold, determine whether to allow or prohibit the first wake-up confidence level according to the first wake-up confidence level and the wake-up identification result. an electronic device is awakened; or
若所述第一唤醒置信度小于或等于所述第二阈值,确定禁止所述第一电子设备被唤醒。If the first wake-up confidence level is less than or equal to the second threshold, it is determined that the first electronic device is prohibited from being woken up.
可选的,所述处理模块1203,具体用于:Optionally, the
根据所述唤醒识别结果统计所述至少一个第二电子设备的唤醒情况;Statistics on the wake-up situation of the at least one second electronic device according to the wake-up identification result;
若所述唤醒情况满足预设的第一电子设备的唤醒条件,确定允许所述第一电子设备被唤醒。If the wake-up condition satisfies a preset wake-up condition of the first electronic device, it is determined that the first electronic device is allowed to wake up.
可选的,所述唤醒条件包括如下的任意一项:Optionally, the wake-up condition includes any of the following:
所述设备组中除所述第一电子设备之外的所有的第二电子设备的第二唤醒置信度均大于或等于第三阈值;The second wake-up confidences of all second electronic devices in the device group except the first electronic device are greater than or equal to a third threshold;
所述设备组中允许被唤醒的第二电子设备的设备数量占所述设备组的设备总数的比例大于或等于第一比例;The ratio of the number of devices of the second electronic device allowed to be woken up in the device group to the total number of devices in the device group is greater than or equal to the first ratio;
所述设备组中允许被唤醒的第二电子设备的设备数量占所述设备组设备总数的比例小于所述第一比例且大于第二比例,且允许被唤醒的第二电子设备的第二唤醒置信度均大于或等于第三阈值;The ratio of the number of devices of the second electronic device that can be woken up in the device group to the total number of devices in the device group is smaller than the first ratio and greater than the second ratio, and the second wake-up of the second electronic device that is allowed to wake up is allowed The confidence levels are all greater than or equal to the third threshold;
其中,所述第三阈值为所述第一电子设备中预设的允许第二电子设备被唤醒的阈值。The third threshold is a preset threshold in the first electronic device that allows the second electronic device to be woken up.
可选的,所述唤醒识别结果还包括所述至少一个第二电子设备的设备标识,所述设备标识用于指示所述第二电子设备的设备类型,确定所述第二电子设备的权重值。Optionally, the wake-up identification result further includes a device identifier of the at least one second electronic device, where the device identifier is used to indicate the device type of the second electronic device, and determine the weight value of the second electronic device. .
可选的,所述处理模块1203,具体用于:Optionally, the
根据所述唤醒识别结果统计所述设备组内允许被唤醒的第二电子设备的第一权重值;Calculate the first weight value of the second electronic device that is allowed to be woken up in the device group according to the wake-up identification result;
统计所述设备组内所有第二电子设备的第二权重值;Counting second weight values of all second electronic devices in the device group;
基于所述第一权重值和所述第二权重值调整第一阈值;Adjusting a first threshold based on the first weight value and the second weight value;
根据所述第一唤醒置信度和调整后的第一阈值,确定允许或者禁止所述第一电子设备被唤醒;According to the first wake-up confidence level and the adjusted first threshold, determine whether to allow or prohibit the first electronic device from being woken up;
其中,所述第一阈值为允许所述第一电子设备被唤醒的阈值。Wherein, the first threshold is a threshold for allowing the first electronic device to be woken up.
可选的,所述第一权重值是根据所述设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量,以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的;所述第二权重值是根据所述设备组内所有第二电子设备的设备类型对应的设备数量,以及各设备类型对应的权重值确定的。Optionally, the first weight value is based on the number of devices corresponding to the device type of the second electronic device that is allowed to be woken up in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be woken up. The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.
可选的,所述处理模块1203,具体用于:Optionally, the
将所述第一权重值与所述第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数,根据所述阈值调整参数调整所述第一阈值。The product of the ratio of the first weight value to the second weight value and the maximum threshold adjustment parameter is used as a threshold adjustment parameter, and the first threshold is adjusted according to the threshold adjustment parameter.
可选的,所述处理模块1203,具体用于:Optionally, the
根据所述唤醒识别结果统计所述设备组内允许被唤醒的第二电子设备的设备数量;Counting the number of devices of the second electronic device allowed to be woken up in the device group according to the wake-up identification result;
根据唤醒阈值调整表确定所述设备数量对应的阈值调整参数,所述唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系;Determine threshold adjustment parameters corresponding to the number of devices according to a wake-up threshold adjustment table, where the wake-up threshold adjustment table includes a correspondence between the number of devices of the second electronic device that is allowed to be woken up and the threshold adjustment parameters;
基于所述阈值调整参数调整所述第一阈值;Adjusting the first threshold based on the threshold adjustment parameter;
所述根据所述第一唤醒置信度和调整后的第一阈值,确定允许或禁止所述第一电子设备被唤醒。and determining whether to allow or prohibit the first electronic device from being woken up according to the first wake-up confidence and the adjusted first threshold.
可选的,所述处理模块1203,具体用于:Optionally, the
若所述第一唤醒置信度大于或等于所述调整后的第一阈值,确定允许所述第一电子设备被唤醒;或者If the first wake-up confidence level is greater than or equal to the adjusted first threshold, determining that the first electronic device is allowed to wake up; or
若所述第一唤醒置信度小于所述调整后的第一阈值,确定禁止所述第一电子设备被唤醒。If the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is prohibited from being woken up.
本申请实施例提供的语音唤醒设备,用于执行前述图7、图8a、8b、图9所示方法实施例中的第一电子设备执行的技术方案,其实现原理和技术效果类似,在此不再赘述。The voice wake-up device provided in this embodiment of the present application is used to execute the technical solution executed by the first electronic device in the method embodiments shown in FIG. 7 , FIG. 8 a , 8 b , and FIG. 9 . No longer.
图13为本申请实施例提供的一种语音唤醒设备的结构示意图。如图13所示,本申请实施例提供的语音唤醒设备1300,包括:FIG. 13 is a schematic structural diagram of a voice wake-up device provided by an embodiment of the present application. As shown in FIG. 13 , the voice wake-
获取模块1301,用于获取音频数据的第一唤醒置信度,所述第一唤醒置信度用于指示所述第一电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度;The acquiring
接收模块1302,用于接收所述至少一个第二电子设备发送的唤醒识别结果,所述唤醒识别结果用于指示允许或禁止所述第二电子设备被唤醒;a
处理模块1303,用于根据所述唤醒识别结果调整第一阈值,所述第一阈值为允许所述第一电子设备被唤醒的阈值;根据调整后的第一阈值和所述第一唤醒置信度,确定允许或禁止所述第一电子设备被唤醒。The
可选的,所述处理模块1303,具体用于:Optionally, the
根据所述唤醒识别结果统计所述设备组内允许被唤醒的第二电子设备的第一权重值;Calculate the first weight value of the second electronic device that is allowed to be woken up in the device group according to the wake-up identification result;
统计所述设备组内所有第二电子设备的第二权重值;Counting second weight values of all second electronic devices in the device group;
基于所述第一权重值和所述第二权重值调整所述第一阈值。The first threshold is adjusted based on the first weight value and the second weight value.
可选的,所述第一权重值是根据所述设备组内允许被唤醒的第二电子设备的设备类型对应的设备数量,以及允许被唤醒的第二电子设备的各个设备类型对应的权重值确定的;所述第二权重值是根据所述设备组内所有第二电子设备的设备类型对应的设备数量,以及各设备类型对应的权重值确定的。Optionally, the first weight value is based on the number of devices corresponding to the device type of the second electronic device that is allowed to be woken up in the device group, and the weight value corresponding to each device type of the second electronic device that is allowed to be woken up. The second weight value is determined according to the number of devices corresponding to the device types of all the second electronic devices in the device group, and the weight value corresponding to each device type.
可选的,所述处理模块1303,具体用于:Optionally, the
将所述第一权重值与所述第二权重值的比值和最大阈值调整参数的乘积作为阈值调整参数,根据所述阈值调整参数调整所述第一阈值。The product of the ratio of the first weight value to the second weight value and the maximum threshold adjustment parameter is used as a threshold adjustment parameter, and the first threshold is adjusted according to the threshold adjustment parameter.
可选的,所述处理模块1303,具体用于:Optionally, the
根据所述唤醒识别结果统计所述设备组内允许被唤醒的第二电子设备的设备数量;Counting the number of devices of the second electronic device allowed to be woken up in the device group according to the wake-up identification result;
根据唤醒阈值调整表确定所述设备数量对应的阈值调整参数,所述唤醒阈值调整表包括允许被唤醒的第二电子设备的设备数量与阈值调整参数的对应关系;Determine threshold adjustment parameters corresponding to the number of devices according to a wake-up threshold adjustment table, where the wake-up threshold adjustment table includes a correspondence between the number of devices of the second electronic device that is allowed to be woken up and the threshold adjustment parameters;
基于所述阈值调整参数调整所述第一阈值。The first threshold is adjusted based on the threshold adjustment parameter.
可选的,所述唤醒识别结果包括所述至少一个第二电子设备的唤醒标识、第二唤醒置信度的至少一项;其中,唤醒标识包括允许或禁止所述第二电子设备被唤醒的标识,所述第二唤醒置信度用于指示所述第二电子设备确定的所述音频数据中唤醒词与预设唤醒词的声学特征相似度。Optionally, the wake-up identification result includes at least one of a wake-up identification and a second wake-up confidence of the at least one second electronic device; wherein the wake-up identification includes an identification that allows or prohibits the second electronic device from being woken up. , the second wake-up confidence level is used to indicate the acoustic feature similarity between the wake-up word and the preset wake-up word in the audio data determined by the second electronic device.
可选的,所述处理模块1303,具体用于:Optionally, the
若所述第一唤醒置信度大于或等于所述调整后的第一阈值,确定允许所述第一电子设备被唤醒;或者If the first wake-up confidence level is greater than or equal to the adjusted first threshold, determining that the first electronic device is allowed to wake up; or
若所述第一唤醒置信度小于所述调整后的第一阈值,确定禁止所述第一电子设备被唤醒。If the first wake-up confidence level is less than the adjusted first threshold, it is determined that the first electronic device is prohibited from being woken up.
本申请实施例提供的语音唤醒设备,用于执行前述图10、图11所示方法实施例中的第一电子设备执行的技术方案,其实现原理和技术效果类似,在此不再赘述。The voice wake-up device provided in this embodiment of the present application is used to execute the technical solution executed by the first electronic device in the method embodiments shown in FIG. 10 and FIG. 11 .
需要说明的是,应理解以上语音唤醒设备的各个模块的划分仅仅是一种逻辑功能的划分,实际实现时可以全部或部分集成到一个物理实体上,也可以物理上分开。且这些模块可以全部以软件通过处理元件调用的形式实现;也可以全部以硬件的形式实现;还可以部分模块通过处理元件调用软件的形式实现,部分模块通过硬件的形式实现。例如,处理模块可以为单独设立的处理元件,也可以集成在上述设备的某一个芯片中实现,此外,也可以以程序代码的形式存储于上述设备的存储器中,由上述设备的某一个处理元件调用并执行以上处理模块的功能。其它模块的实现与之类似。此外这些模块全部或部分可以集成在一起,也可以独立实现。这里所述的处理元件可以是一种集成电路,具有信号的处理能力。在实现过程中,上述方法的各步骤或以上各个模块可以通过处理器元件中的硬件的集成逻辑电路或者软件形式的指令完成。It should be noted that it should be understood that the above division of the modules of the voice wake-up device is only a division of logical functions, and may be fully or partially integrated into a physical entity in actual implementation, or may be physically separated. And these modules can all be implemented in the form of software calling through processing elements; they can also all be implemented in hardware; some modules can also be implemented in the form of calling software through processing elements, and some modules can be implemented in hardware. For example, the processing module may be a separately established processing element, or it may be integrated into a certain chip of the above-mentioned device, and it may also be stored in the memory of the above-mentioned device in the form of program code, and a certain processing element of the above-mentioned device Call and execute the function of the above processing module. The implementation of other modules is similar. In addition, all or part of these modules can be integrated together, and can also be implemented independently. The processing element described here may be an integrated circuit with signal processing capability. In the implementation process, each step of the above-mentioned method or each of the above-mentioned modules can be completed by an integrated logic circuit of hardware in the processor element or an instruction in the form of software.
例如,以上这些模块可以是被配置成实施以上方法的一个或多个集成电路,例如:一个或多个特定集成电路(application specific integrated circuit,ASIC),或,一个或多个微处理器(digital signal processor,DSP),或,一个或者多个现场可编程门阵列(field programmable gate array,FPGA)等。再如,当以上某个模块通过处理元件调度程序代码的形式实现时,该处理元件可以是通用处理器,例如中央处理器(centralprocessing unit,CPU)或其它可以调用程序代码的处理器。再如,这些模块可以集成在一起,以片上系统(system-on-a-chip,SOC)的形式实现。For example, the above modules may be one or more integrated circuits configured to implement the above methods, such as: one or more application specific integrated circuits (ASIC), or one or more digital microprocessors (digital) signal processor, DSP), or, one or more field programmable gate array (field programmable gate array, FPGA) and so on. For another example, when one of the above modules is implemented in the form of a processing element scheduling program code, the processing element may be a general-purpose processor, such as a central processing unit (CPU) or other processors that can invoke program codes. For another example, these modules can be integrated together and implemented in the form of a system-on-a-chip (SOC).
图14为本申请实施例提供的一种语音唤醒设备的硬件结构示意图。如图14所示,本申请实施例提供的语音唤醒设备1400,可以包括:FIG. 14 is a schematic diagram of a hardware structure of a voice wake-up device according to an embodiment of the present application. As shown in FIG. 14 , the voice wake-up device 1400 provided by this embodiment of the present application may include:
处理器1401、存储器1402和通信接口1403。其中,存储器1402,用于存储计算机程序;处理器1401,用于执行存储器1402存储的计算机程序,以实现上述任一方法实施例中第一电子设备所执行的方法。通信接口1403,用于与至少一个第二电子设备或者服务器进行数据通信或者信号通信。
可选地,存储器1402既可以是独立的,也可以跟处理器1401集成在一起。当所述存储器1402是独立于处理器1401之外的器件时,所述语音唤醒设备1400还可以包括:总线1404,用于连接所述存储器1402和处理器1401。Optionally, the
在一种可能的实施方式中,图12中的处理模块1203可以集成在处理器1401中实现,接收模块1202可以集成在通信接口1403中实现。图13中的处理模块1303可以集成在处理器1401中实现,接收模块1302可以集成在通信接口1403中实现。In a possible implementation manner, the
在一种可能的实施方式中,处理器1401可用于实现上述方法实施例中第一电子设备的信息处理操作,通信接口1403可用于实现上述方法实施例中第一电子设备的信号收发操作。In a possible implementation manner, the
本实施例提供的语音唤醒设备,可用于执行上述任一方法实施例中第一电子设备所执行的方法,其实现原理和技术效果类似,此处不再赘述。The voice wake-up device provided in this embodiment can be used to execute the method executed by the first electronic device in any of the above method embodiments, and its implementation principle and technical effect are similar, and details are not repeated here.
本申请实施例还提供一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机执行指令,当所述计算机执行指令被处理器执行时用于实现前述任一方法实施例中第一电子设备的技术方案。Embodiments of the present application further provide a computer-readable storage medium, where computer-executable instructions are stored in the computer-readable storage medium, and when the computer-executable instructions are executed by a processor, are used to implement the first method in any of the foregoing method embodiments. A technical solution for electronic equipment.
本申请实施例还提供一种程序,当该程序被处理器执行时,用于执行前述任一方法实施例中第一电子设备的技术方案。The embodiments of the present application further provide a program, which, when the program is executed by the processor, is used to execute the technical solution of the first electronic device in any of the foregoing method embodiments.
本申请实施例还提供一种计算机程序产品,包括程序指令,程序指令用于实现前述任一方法实施例中第一电子设备的技术方案。Embodiments of the present application further provide a computer program product, including program instructions, where the program instructions are used to implement the technical solution of the first electronic device in any of the foregoing method embodiments.
本申请实施例还提供了一种芯片,包括:处理模块与通信接口,该处理模块能执行前述方法实施例中第一电子设备的技术方案。Embodiments of the present application further provide a chip, including: a processing module and a communication interface, where the processing module can execute the technical solutions of the first electronic device in the foregoing method embodiments.
进一步地,该芯片还包括存储模块(如,存储器),存储模块用于存储指令,处理模块用于执行存储模块存储的指令,并且对存储模块中存储的指令的执行使得处理模块执行第一电子设备的技术方案。Further, the chip also includes a storage module (eg, memory), the storage module is used for storing instructions, the processing module is used for executing the instructions stored in the storage module, and the execution of the instructions stored in the storage module causes the processing module to execute the first electronic The technical solution of the equipment.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘solid state disk(SSD))等。In the above-mentioned embodiments, it may be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented in software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions described in the embodiments of the present application are generated. The computer may be a general purpose computer, special purpose computer, computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be downloaded from a website site, computer, server, or data center Transmission to another website site, computer, server, or data center is by wire (eg, coaxial cable, fiber optic, digital subscriber line (DSL)) or wireless (eg, infrared, wireless, microwave, etc.). The computer-readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media. The usable media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, DVDs), or semiconductor media (eg, solid state disks (SSDs)), and the like.
本申请中,“至少一个”是指一个或者多个,“多个”是指两个或两个以上。“和/或”,描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B的情况,其中A,B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系;在公式中,字符“/”,表示前后关联对象是一种“相除”的关系。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中,a,b,c可以是单个,也可以是多个。In this application, "at least one" means one or more, and "plurality" means two or more. "And/or", which describes the association relationship of the associated objects, indicates that there can be three kinds of relationships, for example, A and/or B, which can indicate: the existence of A alone, the existence of A and B at the same time, and the existence of B alone, where A, B can be singular or plural. The character "/" generally indicates that the related objects before and after are an "or" relationship; in the formula, the character "/" indicates that the related objects are a "division" relationship. "At least one item(s) below" or similar expressions thereof refer to any combination of these items, including any combination of single item(s) or plural items(s). For example, at least one item (a) of a, b, or c can represent: a, b, c, a-b, a-c, b-c, or a-b-c, where a, b, c can be single or multiple indivual.
可以理解的是,在本申请的实施例中涉及的各种数字编号仅为描述方便进行的区分,并不用来限制本申请的实施例的范围。It can be understood that, the various numbers and numbers involved in the embodiments of the present application are only for the convenience of description, and are not used to limit the scope of the embodiments of the present application.
可以理解的是,在本申请的实施例中,上述各过程的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请的实施例的实施过程构成任何限定。It can be understood that, in the embodiments of the present application, the size of the sequence numbers of the above-mentioned processes does not imply the order of execution, and the execution order of each process should be determined by its functions and internal logic, rather than the implementation of the present application. The implementation of the examples constitutes no limitation.
Claims (38)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010353897.1A CN111696562B (en) | 2020-04-29 | 2020-04-29 | Voice wake-up method, device and storage medium |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010353897.1A CN111696562B (en) | 2020-04-29 | 2020-04-29 | Voice wake-up method, device and storage medium |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111696562A true CN111696562A (en) | 2020-09-22 |
| CN111696562B CN111696562B (en) | 2022-08-19 |
Family
ID=72476807
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010353897.1A Active CN111696562B (en) | 2020-04-29 | 2020-04-29 | Voice wake-up method, device and storage medium |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111696562B (en) |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112420051A (en) * | 2020-11-18 | 2021-02-26 | 青岛海尔科技有限公司 | Equipment determination method, device and storage medium |
| CN112509596A (en) * | 2020-11-19 | 2021-03-16 | 北京小米移动软件有限公司 | Wake-up control method and device, storage medium and terminal |
| CN113889102A (en) * | 2021-09-23 | 2022-01-04 | 达闼科技(北京)有限公司 | Instruction receiving method, system, electronic device, cloud server and storage medium |
| WO2022188551A1 (en) * | 2021-03-10 | 2022-09-15 | Oppo广东移动通信有限公司 | Information processing method and apparatus, master control device, and controlled device |
| CN115132172A (en) * | 2021-03-24 | 2022-09-30 | 华为技术有限公司 | Intelligent equipment awakening method and device |
| CN115171703A (en) * | 2022-05-30 | 2022-10-11 | 青岛海尔科技有限公司 | Distributed voice awakening method and device, storage medium and electronic device |
| CN117950620A (en) * | 2022-10-28 | 2024-04-30 | 华为技术有限公司 | Device control method and electronic device |
| CN120601956A (en) * | 2025-07-30 | 2025-09-05 | 中北大学 | Wireless signal recognition method, system, medium and equipment based on UAV collaboration |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107622770A (en) * | 2017-09-30 | 2018-01-23 | 百度在线网络技术(北京)有限公司 | voice awakening method and device |
| US20190005954A1 (en) * | 2017-06-30 | 2019-01-03 | Baidu Online Network Technology (Beijing) Co., Ltd. | Wake-on-voice method, terminal and storage medium |
| CN109273007A (en) * | 2018-10-11 | 2019-01-25 | 科大讯飞股份有限公司 | Voice awakening method and device |
| CN109346071A (en) * | 2018-09-26 | 2019-02-15 | 出门问问信息科技有限公司 | Wake up processing method, device and electronic equipment |
| CN110223684A (en) * | 2019-05-16 | 2019-09-10 | 华为技术有限公司 | A kind of voice awakening method and equipment |
| CN110364151A (en) * | 2019-07-15 | 2019-10-22 | 华为技术有限公司 | Voice awakening method and electronic equipment |
| CN110570861A (en) * | 2019-09-24 | 2019-12-13 | Oppo广东移动通信有限公司 | Method, apparatus, terminal device and readable storage medium for voice wake-up |
| CN111081217A (en) * | 2019-12-03 | 2020-04-28 | 珠海格力电器股份有限公司 | Voice wake-up method and device, electronic equipment and storage medium |
-
2020
- 2020-04-29 CN CN202010353897.1A patent/CN111696562B/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20190005954A1 (en) * | 2017-06-30 | 2019-01-03 | Baidu Online Network Technology (Beijing) Co., Ltd. | Wake-on-voice method, terminal and storage medium |
| CN107622770A (en) * | 2017-09-30 | 2018-01-23 | 百度在线网络技术(北京)有限公司 | voice awakening method and device |
| CN109346071A (en) * | 2018-09-26 | 2019-02-15 | 出门问问信息科技有限公司 | Wake up processing method, device and electronic equipment |
| CN109273007A (en) * | 2018-10-11 | 2019-01-25 | 科大讯飞股份有限公司 | Voice awakening method and device |
| CN110223684A (en) * | 2019-05-16 | 2019-09-10 | 华为技术有限公司 | A kind of voice awakening method and equipment |
| CN110364151A (en) * | 2019-07-15 | 2019-10-22 | 华为技术有限公司 | Voice awakening method and electronic equipment |
| CN110570861A (en) * | 2019-09-24 | 2019-12-13 | Oppo广东移动通信有限公司 | Method, apparatus, terminal device and readable storage medium for voice wake-up |
| CN111081217A (en) * | 2019-12-03 | 2020-04-28 | 珠海格力电器股份有限公司 | Voice wake-up method and device, electronic equipment and storage medium |
Cited By (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112420051A (en) * | 2020-11-18 | 2021-02-26 | 青岛海尔科技有限公司 | Equipment determination method, device and storage medium |
| CN112509596A (en) * | 2020-11-19 | 2021-03-16 | 北京小米移动软件有限公司 | Wake-up control method and device, storage medium and terminal |
| WO2022188551A1 (en) * | 2021-03-10 | 2022-09-15 | Oppo广东移动通信有限公司 | Information processing method and apparatus, master control device, and controlled device |
| CN115079810A (en) * | 2021-03-10 | 2022-09-20 | Oppo广东移动通信有限公司 | Information processing method and device, main control equipment and controlled equipment |
| CN115132172A (en) * | 2021-03-24 | 2022-09-30 | 华为技术有限公司 | Intelligent equipment awakening method and device |
| CN113889102A (en) * | 2021-09-23 | 2022-01-04 | 达闼科技(北京)有限公司 | Instruction receiving method, system, electronic device, cloud server and storage medium |
| CN113889102B (en) * | 2021-09-23 | 2025-05-02 | 达闼科技(北京)有限公司 | Instruction receiving method, system, electronic device, cloud server and storage medium |
| CN115171703A (en) * | 2022-05-30 | 2022-10-11 | 青岛海尔科技有限公司 | Distributed voice awakening method and device, storage medium and electronic device |
| CN115171703B (en) * | 2022-05-30 | 2024-05-24 | 青岛海尔科技有限公司 | Distributed voice wake-up method and device, storage medium and electronic device |
| CN117950620A (en) * | 2022-10-28 | 2024-04-30 | 华为技术有限公司 | Device control method and electronic device |
| CN120601956A (en) * | 2025-07-30 | 2025-09-05 | 中北大学 | Wireless signal recognition method, system, medium and equipment based on UAV collaboration |
| CN120601956B (en) * | 2025-07-30 | 2025-10-03 | 中北大学 | Wireless signal identification method, system, medium and equipment based on unmanned plane cooperation |
Also Published As
| Publication number | Publication date |
|---|---|
| CN111696562B (en) | 2022-08-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111696562B (en) | Voice wake-up method, device and storage medium | |
| US12142274B2 (en) | Voice wakeup method and device | |
| WO2021013137A1 (en) | Voice wake-up method and electronic device | |
| CN111696570B (en) | Voice signal processing method, device, equipment and storage medium | |
| WO2021000876A1 (en) | Voice control method, electronic equipment and system | |
| WO2020207328A1 (en) | Image recognition method and electronic device | |
| CN113347560A (en) | Bluetooth connection method, electronic device and storage medium | |
| WO2021180085A1 (en) | Sound pickup method and apparatus and electronic device | |
| WO2021000817A1 (en) | Ambient sound processing method and related device | |
| CN111613213B (en) | Audio classification method, device, equipment and storage medium | |
| CN113728295A (en) | Screen control method, device, equipment and storage medium | |
| CN114520002A (en) | Method for processing voice and electronic equipment | |
| WO2022156438A1 (en) | Wakeup method and electronic device | |
| CN111341317B (en) | Evaluation method, device, electronic device and medium for wake-up audio data | |
| CN111862972A (en) | Voice interactive service method, device, device and storage medium | |
| CN115718913A (en) | User identity identification method and electronic equipment | |
| WO2022161077A1 (en) | Speech control method, and electronic device | |
| CN110536388A (en) | A power control method and terminal equipment | |
| CN108806670A (en) | Audio recognition method, device and storage medium | |
| CN110852217A (en) | Method and electronic device for face recognition | |
| CN114120987B (en) | Voice wake-up method, electronic equipment and chip system | |
| CN109285563B (en) | Voice data processing method and device in online translation process | |
| CN113542984B (en) | Stereo realization system, method, electronic device and storage medium | |
| CN118972491A (en) | Method for adjusting call volume and electronic device | |
| WO2022222780A1 (en) | Audio output method, media file recording method, and electronic device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |