CN111429917A - Equipment awakening method and terminal equipment - Google Patents
Equipment awakening method and terminal equipment Download PDFInfo
- Publication number
- CN111429917A CN111429917A CN202010191577.0A CN202010191577A CN111429917A CN 111429917 A CN111429917 A CN 111429917A CN 202010191577 A CN202010191577 A CN 202010191577A CN 111429917 A CN111429917 A CN 111429917A
- Authority
- CN
- China
- Prior art keywords
- voice
- awakening
- wake
- word
- equipment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephone Function (AREA)
- Selective Calling Equipment (AREA)
Abstract
The invention provides a device awakening method and terminal equipment, wherein the terminal equipment is connected with at least one loudspeaker, the at least one loudspeaker is respectively arranged on at least one voice device and respectively covers a sound inlet channel of a microphone of each voice device, and the method comprises the following steps: receiving a first awakening voice input by a user; and under the condition that the first awakening voice is matched with the first awakening word of the terminal equipment, awakening the terminal equipment, synthesizing a second awakening voice matched with the second awakening word of the target voice equipment, playing the second awakening voice through a loudspeaker installed on the target voice equipment, and awakening the target voice equipment through the second awakening voice, wherein the target voice equipment is at least one of the at least one voice equipment. The embodiment of the invention can indirectly achieve the aim of awakening the target voice equipment in a mode of awakening the terminal equipment, not only brings convenience to the user, but also reduces the burden of the user on memorizing awakening words of different equipment.
Description
Technical Field
The present invention relates to the field of voice processing technologies, and in particular, to a device wake-up method and a terminal device.
Background
With the progress and development of science and technology, devices such as intelligent voice devices and intelligent household appliances are popularized and popularized, and the voice function is also supported mostly. For example, the wakeup word of the smart sound box may be "love classmates", the wakeup word of the smart television may be "smart televisions", the wakeup word of the smart air conditioners may be "hello happy", the user can only speak the corresponding wakeup word to wake up the corresponding device, and if the wakeup words are not correct, the user cannot wake up the device.
In practice, different smart devices may come from different vendors and the wake words are difficult to unify. When the number of the wake-up words is large, the user is required to memorize the wake-up words of each device, and when the user forgets or gets confused, the device cannot be woken up. In some public places, such as hotels, companies and the like, more intelligent voice devices are provided, and in addition to the unfamiliarity of users, various awakening words of different devices are more likely to cause difficulty in use of the users.
Disclosure of Invention
The embodiment of the invention provides a device awakening method and terminal equipment, and aims to solve the problem that in the prior art, different awakening words of different devices cause difficulty in use of a user.
In order to solve the technical problem, the invention is realized as follows:
in a first aspect, an embodiment of the present invention provides an apparatus wake-up method, which is applied to a terminal apparatus, where the terminal apparatus is connected to at least one speaker, and the at least one speaker is respectively installed on at least one voice apparatus and respectively covers an acoustic channel of a microphone of each voice apparatus, where the method includes:
receiving a first awakening voice input by a user;
and awakening the terminal equipment under the condition that the first awakening voice is matched with a first awakening word of the terminal equipment, synthesizing a second awakening voice matched with a second awakening word of the target voice equipment, playing the second awakening voice through a loudspeaker installed on the target voice equipment, and awakening the target voice equipment through the second awakening voice, wherein the target voice equipment is at least one of the at least one voice equipment.
Optionally, the synthesizing a second wake-up voice matched with a second wake-up word of the target voice device includes:
determining a second awakening word of the target voice equipment according to the pre-acquired awakening word of each voice equipment in the at least one voice equipment;
and synthesizing a second awakening voice matched with the second awakening word.
Optionally, before receiving the first wake-up voice input by the user, the method further includes:
receiving a wake-up word of each voice device in the at least one voice device input by a user;
storing a wake-up word for each of the at least one voice device;
the determining a second wake-up word of the target voice device according to the pre-obtained wake-up word of each voice device of the at least one voice device includes:
and determining a second awakening word of the target voice equipment according to the stored awakening word of each voice equipment in the at least one voice equipment.
Optionally, after waking up the terminal device and before synthesizing a second wake-up voice matched with a second wake-up word of the target voice device, the method further includes:
receiving a voice control instruction input by a user;
and determining the corresponding target voice equipment based on the voice control instruction, wherein the target voice equipment is the voice equipment capable of responding to the voice control instruction in the at least one voice equipment.
Optionally, after receiving a voice control instruction input by a user, and before determining the corresponding target voice device based on the voice control instruction, the method further includes:
determining whether the voice control instruction is matched with the common control instruction or not according to the common control instruction which is pre-recorded and corresponds to the first awakening word;
the determining the corresponding target voice device based on the voice control instruction comprises:
and under the condition that the voice control instruction is matched with the common control instruction, determining the corresponding target voice equipment based on the voice control instruction.
Optionally, before receiving the first wake-up voice input by the user, the method further includes:
receiving a third awakening word input by a user;
setting the third awakening word as a first awakening word of the terminal equipment;
and determining the score of the first awakening word and outputting the score.
Optionally, if the first wake-up voice is matched with the first wake-up word of the terminal device, waking up the terminal device includes:
extracting voiceprint features in the first awakening voice;
and awakening the terminal equipment under the condition that the first awakening voice is matched with the first awakening word of the terminal equipment and the voiceprint characteristics are matched with the preset voiceprint characteristics.
In a second aspect, an embodiment of the present invention provides a terminal device, where the terminal device is connected to at least one speaker, where the at least one speaker is respectively installed on at least one voice device and respectively covers an acoustic channel of a microphone of each voice device, and the terminal device includes:
the first receiving module is used for receiving a first awakening voice input by a user;
and the awakening module is used for awakening the terminal equipment under the condition that the first awakening voice is matched with a first awakening word of the terminal equipment, synthesizing a second awakening voice matched with a second awakening word of the target voice equipment, playing the second awakening voice through a loudspeaker installed on the target voice equipment, and awakening the target voice equipment through the second awakening voice, wherein the target voice equipment is at least one of the at least one voice equipment.
Optionally, the wake-up module includes:
a determining unit, configured to determine, according to a pre-obtained wake-up word of each voice device in the at least one voice device, a second wake-up word of the target voice device;
and the synthesis unit is used for synthesizing the second awakening voice matched with the second awakening word.
Optionally, the terminal device further includes:
the second receiving module is used for receiving a wake-up word of each voice device in the at least one voice device input by a user;
the storage module is used for storing the awakening words of each voice device in the at least one voice device;
the determining unit is configured to determine a second wake-up word of the target voice device according to the stored wake-up word of each voice device of the at least one voice device.
Optionally, the terminal device further includes:
the third receiving module is used for receiving a voice control instruction input by a user;
a first determining module, configured to determine, based on the voice control instruction, the corresponding target voice device, where the target voice device is a voice device that can respond to the voice control instruction in the at least one voice device.
Optionally, the terminal device further includes:
the second determination module is used for determining whether the voice control instruction is matched with the common control instruction or not according to the common control instruction which is pre-recorded and corresponds to the first awakening word;
the first determining module is used for determining the corresponding target voice equipment based on the voice control instruction under the condition that the voice control instruction is matched with the common control instruction.
Optionally, the terminal device further includes:
the fourth receiving module is used for receiving a third awakening word input by the user;
the setting module is used for setting the third awakening word as the first awakening word of the terminal equipment;
and the third determining module is used for determining the score of the first awakening word and outputting the score.
Optionally, the wake-up module includes:
the extracting unit is used for extracting the voiceprint features in the first awakening voice;
and the awakening unit is used for awakening the terminal equipment under the condition that the first awakening voice is matched with the first awakening word of the terminal equipment and the voiceprint characteristics are matched with the preset voiceprint characteristics.
In a third aspect, an embodiment of the present invention provides a terminal device, which includes a processor, a memory, and a computer program that is stored in the memory and is executable on the processor, and when the computer program is executed by the processor, the steps in the device wake-up method are implemented.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements the steps in the device wake-up method described above.
In the embodiment of the invention, the terminal equipment is connected with at least one loudspeaker, and the at least one loudspeaker is respectively arranged on the at least one voice equipment and respectively covers the sound inlet channel of the microphone of each voice equipment. Therefore, the user does not need to memorize the awakening words of different voice devices, only needs to memorize the awakening words of the terminal device, and indirectly achieves the purpose of awakening the target voice device in the mode of awakening the terminal device, so that convenience is brought to the user, and the burden that the user needs to memorize the awakening words of different devices is reduced.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments of the present invention will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without inventive exercise.
Fig. 1 is a flowchart of a device wake-up method according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of a terminal device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a flowchart of a device wake-up method provided in an embodiment of the present invention, applied to a terminal device, where the terminal device is connected to at least one speaker, and the at least one speaker is respectively installed on at least one voice device and respectively covers a sound input channel of a microphone of each voice device, as shown in fig. 1, the method includes the following steps:
In the embodiment of the invention, the terminal device can be an independent portable electronic device used as a special awakening housekeeper, or can be a mobile terminal such as a mobile phone and a wearable device, and the terminal device is provided with an awakening housekeeper APP which can be used for awakening management of other voice devices.
When the terminal device is an independent wake-up housekeeper device, the terminal device may include one or more speakers used in a matched manner, when the terminal device is a mobile terminal, the terminal device may be externally connected with the one or more speakers, the terminal device and the speakers may establish data connection through bluetooth, hot spots and other manners, that is, the terminal device may transmit audio data thereof to the speakers connected thereto for playing.
In order to achieve the purpose of the present invention, that is, to wake up the terminal device to indirectly wake up other voice devices, in an embodiment of the present invention, at least one speaker connected to the terminal device needs to be respectively installed on at least one voice device and respectively covers a sound input channel of a microphone of each voice device, where the at least one voice device may be a device supporting functions of voice wake-up, voice control, and the like, and is generally an intelligent device, such as an intelligent sound box, an intelligent household appliance, and the like. Therefore, because the sound inlet channel of the microphone of the voice equipment is covered by the loudspeaker, the voice equipment is difficult to receive external sound through the microphone, namely, the voice equipment is difficult to directly receive and respond to the voice command input by the user, but because the loudspeaker covers the sound inlet channel of the microphone, the audio transmission distance is short, the voice command sent by the loudspeaker can be better received, and the response is quicker and more accurate.
In practical application, when a user wakes up any one of the voice devices, especially a plurality of voice devices in a home, a hotel, a company, or the like, in order to make the user wake up any one of the voice devices more easily, a speaker may be installed on each voice device, each speaker covers a microphone port of each voice device, the speaker may be designed to be capable of fitting the shape of a housing of the voice device, such as a square or a circle with appropriate thickness, and the terminal device may establish data connection with the speakers.
In step 101, the first wake-up voice may be a voice message input by a user for waking up the terminal device, and in order to ensure that the terminal device is woken up, the first wake-up voice needs to be matched with a wake-up word of the terminal device, for example, if the wake-up word of the terminal device is "hello xiao fang", the user may input a wake-up voice of "hello xiao fang".
That is to say, the terminal device may be in a dormant state when not receiving the wake-up voice of the user, and when the user needs to wake up one or more of the at least one voice device, the terminal device may first input a first wake-up voice matched with the wake-up word of the terminal device to wake up the terminal device, and then wake up the voice device to be woken up through the terminal device.
Optionally, before the step 101, the method further includes:
receiving a third awakening word input by a user;
setting the third awakening word as a first awakening word of the terminal equipment;
and determining the score of the first awakening word and outputting the score.
In the embodiment, in order to meet the requirement of flexibly setting the wake-up word by the user and ensure a better wake-up effect, the wake-up word of the terminal device can be set according to the input of the user, the wake-up word input by the user is scored and evaluated, and the user is guided to input the proper wake-up word with the better wake-up effect.
Specifically, before the user uses the terminal device to wake up by voice, the user may register a wake-up word on the terminal device, that is, the user may input any wake-up word according to the user's own needs, usage habits, personal preferences, and the like, the terminal device may set the wake-up word input by the user, that is, the third wake-up word, as the wake-up word of the terminal device, that is, the first wake-up word, and the subsequent user may wake up the terminal device by inputting a wake-up voice matched with the first wake-up word.
When the wake-up word input by the user is too long, the number of repeated words is too large, or the wake-up is relatively noisy, the wake-up effect may be poor, such as easy false wake-up or low wake-up success rate, and the like, and therefore, in order to ensure a better wake-up effect, the wake-up word input by the user, that is, the first wake-up word, may be scored by specifically integrating the length of the first wake-up word, the confusion degree of the state sequence, and the like, wherein the confusion degree of the state sequence may be defined according to the arrangement and the number of repeated words of the words in the first wake-up word, for example, the wake-up word "a" is higher because all are repeated words, and the wake-up word "a little good word" is read smoothly and is not repeated, and the confusion degree of the state sequence is lower. The rule for scoring the first wake word may be that a moderate length, low confusion score for a state sequence is high, while an excessively long or short length, high confusion score for a state sequence is low.
After the first awakening word is scored, the score of the first awakening word can be directly output (such as display or voice prompt), so that a user can determine whether the first awakening word is suitable for being continuously used as the awakening word of the terminal equipment according to the score, if the score is high, the user does not need to modify the awakening word of the terminal equipment, and if the score is low, the user can modify the awakening word of the terminal equipment.
Further, in order to achieve a better prompt effect, a prompt signal may be output when it is determined that the score of the first wake-up word is lower than a preset score, where the prompt signal is used to prompt a user to change the first wake-up word. For example, in the case that it is determined that the score of the first wake-up word is lower than 60 points, a text prompt message or a voice prompt signal may be output to prompt the user that the score of the current wake-up word is low, and the wake-up word may be replaced to obtain a better wake-up effect.
102, when the first wake-up voice is matched with a first wake-up word of the terminal device, waking up the terminal device, synthesizing a second wake-up voice matched with a second wake-up word of a target voice device, and playing the second wake-up voice through a speaker installed on the target voice device so as to wake up the target voice device through the second wake-up voice, wherein the target voice device is at least one of the at least one voice device.
After receiving the first wake-up voice, matching verification can be performed on the first wake-up voice, that is, whether the first wake-up voice is matched with a first wake-up word of the terminal device is verified, and if the first wake-up voice is matched with the first wake-up word, the terminal device can be woken up, that is, the terminal device enters a wake-up state, in response to the first wake-up voice, wherein the first wake-up word is a predefined wake-up word for waking up the terminal device, and the wake-up word of the terminal device can be set by a user according to preferences or use habits.
In the embodiment of the present invention, a user wakes up the terminal device to expect to wake up one or more voice devices of the at least one voice device, so after the terminal device is woken up, a second wake-up voice for waking up a target voice device can be further synthesized through voice synthesis, and the second wake-up voice can be transmitted to a speaker installed on the target voice device, and the second wake-up voice is played through the speaker, so that the target voice device can receive the second wake-up voice and respond to enter a wake-up state, and thus, the target voice device is woken up successfully.
Wherein the target voice device may be at least one of the at least one voice device, that is, one voice device may be woken up at a time, or a plurality of voice devices may be woken up at a time, and specifically, the target voice device may be determined in a plurality of different manners, for example, may be determined by a function indicated by a voice control instruction input again by a user after the first wake-up voice is input, may be preset wake-up periods of different voice devices, so that the voice devices within the wake-up periods may be determined based on the current period, or may be preset by the user for the terminal device a plurality of wake-up words (e.g., 2) each for waking up a corresponding plurality of voice devices (e.g., wake-up word 1 for waking up voice devices 1, 2, and 3, and wake-up word 2 for waking up voice devices 4 and 5), and so on.
The obtaining of the awakening words of the target voice device can be obtained in a mode of pre-inputting by a user, for example, the user can pre-input the awakening words of the voice devices expected to be awakened by the terminal device into the terminal device one by one, or the models of the voice devices can be input into the terminal device, the terminal device searches for the corresponding awakening words according to the device models, or the terminal device can obtain device model information by scanning the two-dimensional codes of the voice devices, and then searches for the corresponding awakening words.
It should be noted that, when a plurality of voice devices need to be awakened simultaneously and the awakening words of the plurality of voice devices are different, the awakening voices of the corresponding voice devices can be synthesized respectively, and the awakening voices for awakening the corresponding voice devices are played through the corresponding speakers respectively, so as to achieve the purpose of awakening the plurality of voice devices at one time.
Optionally, the synthesizing a second wake-up voice matched with a second wake-up word of the target voice device includes:
determining a second awakening word of the target voice equipment according to the pre-acquired awakening word of each voice equipment in the at least one voice equipment;
and synthesizing a second awakening voice matched with the second awakening word.
In this embodiment, in order to ensure that the second wake-up speech of the target speech device is accurately synthesized, a second wake-up word of the target speech device may be determined first, so as to synthesize the second wake-up speech according to the second wake-up word, specifically, the terminal device may obtain the wake-up word of each speech device in the at least one speech device in advance, so that when the wake-up speech of the target speech device needs to be synthesized, the second wake-up word of the target speech device may be directly searched from the pre-obtained wake-up words of the speech devices, and then a speech synthesis technology is used to synthesize the second wake-up speech including the second wake-up word.
The method includes the steps that a wake-up word of each voice device in at least one voice device is obtained in advance, the wake-up word can be obtained in a mode of being input by a user in advance, if the user can input wake-up words of voice devices which are expected to be woken up through the terminal device into the terminal device one by one, or models of the voice devices can be input into the terminal device in advance, the terminal device searches corresponding wake-up words according to device models and stores the wake-up words in an associated mode, or device model information can be obtained by scanning two-dimensional codes of the voice devices in advance, and then corresponding wake-up words are searched and stored in an associated mode.
Therefore, after the terminal device is awakened, the second awakening word of the target voice device can be quickly determined based on the pre-acquired awakening word of each voice device in the at least one voice device, so that the second awakening voice matched with the second awakening word is quickly synthesized, and the purposes of improving the equipment awakening speed and further improving the user awakening experience can be achieved.
Further, before the receiving the first wake-up voice input by the user, the method further includes:
receiving a wake-up word of each voice device in the at least one voice device input by a user;
storing a wake-up word for each of the at least one voice device;
the determining a second wake-up word of the target voice device according to the pre-obtained wake-up word of each voice device of the at least one voice device includes:
and determining a second awakening word of the target voice equipment according to the stored awakening word of each voice equipment in the at least one voice equipment.
In this embodiment, in the process of waking up the voice devices, in order to ensure that the terminal device can quickly synthesize the wake-up word of any one of the at least one voice device and ensure the accuracy of the wake-up word of each voice device, the user may enter the wake-up word of each voice device in the at least one voice device in advance, that is, the user may enter the wake-up word of each voice device in the at least one voice device on the terminal device before using the terminal device to perform voice wake-up, specifically, the user may enter device information (such as a device name or a device model) and a corresponding wake-up word of each voice device, or the terminal device adds each voice device, and then the user inputs a corresponding wake-up word to each voice device.
It should be noted that, in order to complete the configuration of the terminal device at one time, when a wakeup word is registered for the terminal device, the wakeup word of each voice device in the at least one voice device may be entered together.
After the user inputs the voice data, each voice device and the corresponding awakening word can be stored in an associated mode, and the voice data can be stored locally or in a cloud. When the second wake-up voice of the target voice device needs to be synthesized, the second wake-up word of the target voice device can be quickly found from the stored wake-up words of each voice device in the at least one voice device, and then the second wake-up voice matched with the second wake-up word is synthesized.
Optionally, after waking up the terminal device and before synthesizing a second wake-up voice matched with a second wake-up word of the target voice device, the method further includes:
receiving a voice control instruction input by a user;
and determining the corresponding target voice equipment based on the voice control instruction, wherein the target voice equipment is the voice equipment capable of responding to the voice control instruction in the at least one voice equipment.
In this embodiment, in order to avoid waking up unnecessary voice devices, instead of immediately performing voice synthesis after waking up the terminal device, a user may input a voice control instruction, and then a target voice device that the user needs to wake up is determined based on analysis of the voice control instruction input by the user, where the voice control instruction may be an instruction for instructing to control a certain function of the target voice device, such as a voice instruction of "turn up temperature", "turn up volume", "play first music", and the like.
After receiving the voice control instruction, the terminal device may determine the corresponding target voice device based on the voice control instruction, and specifically, may determine the corresponding target voice device by analyzing a function control instruction corresponding to the voice control instruction and combining a function of supporting control of each voice device in the at least one voice device, that is, the target voice device is a voice device capable of responding to the voice control instruction in the at least one voice device.
For example, if the user inputs a voice control command of "turn up the temperature", it may be determined that the target voice device that the user desires to wake up is the smart air conditioner; if a user inputs a voice control instruction of switching to a news channel, the target voice device expected to be awakened can be determined to be the smart television; if the user inputs a voice control instruction of 'playing the head music', the target voice equipment expected to be awakened can be determined to be the intelligent sound box; and so on. And after the target voice device is awakened, the corresponding function of the target voice device can be correspondingly adjusted or started in response to the voice control instruction, namely according to the instruction of the voice control instruction.
Therefore, the corresponding target voice equipment is determined based on the voice control instruction input by the user, so that the voice equipment expected by the user can be accurately awakened, other unnecessary voice equipment can be prevented from being awakened by mistake, and the power consumption of the equipment is further reduced.
Further, after receiving a voice control instruction input by a user and before determining the corresponding target voice device based on the voice control instruction, the method further includes:
determining whether the voice control instruction is matched with the common control instruction or not according to the common control instruction which is pre-recorded and corresponds to the first awakening word;
the determining the corresponding target voice device based on the voice control instruction comprises:
and under the condition that the voice control instruction is matched with the common control instruction, determining the corresponding target voice equipment based on the voice control instruction.
In this embodiment, after receiving a voice control instruction input by a user, determining whether the voice control instruction matches a commonly used control instruction, and in case of matching, determining the corresponding target voice device based on the voice control instruction to avoid the user from mistakenly inputting a wrong voice control instruction or from mistakenly waking up and responding to the voice device, where the commonly used control instruction may be a commonly used control instruction corresponding to the first wake-up word that is previously entered by the user, and the user may register the corresponding commonly used control instruction when registering the wake-up word of the terminal device, so that the terminal device wakes up the target voice device in the at least one voice device only when receiving the voice control instruction matching the commonly used control instruction input by the user after being woken up, the target voice device will only respond to the voice control instruction in this case.
For example, the common control instruction includes "raise the temperature", "raise the volume", and "play the first music", and if the voice control instruction input by the user is "play the first music", because the common control instruction includes the instruction, it can be determined that the voice control instruction matches the common control instruction, and it can be determined that the corresponding target voice device is the smart speaker based on the instruction, and after waking up the smart speaker, the smart speaker starts playing music; if the voice control instruction input by the user is 'turn up brightness', the voice control instruction can be determined to be not matched with the common control instruction because the common control instruction does not comprise the instruction, so that the voice control instruction is not responded, and any voice equipment is not awakened.
Therefore, by inputting the common control instruction corresponding to the first awakening word in advance and matching the voice control instruction, the voice control instruction of the user can be ensured to be responded more accurately, and unnecessary equipment awakening and voice instruction response are avoided.
Optionally, if the first wake-up voice is matched with the first wake-up word of the terminal device, waking up the terminal device includes:
extracting voiceprint features in the first awakening voice;
and awakening the terminal equipment under the condition that the first awakening voice is matched with the first awakening word of the terminal equipment and the voiceprint characteristics are matched with the preset voiceprint characteristics.
In order to enhance the safety of device awakening and avoid any awakening of other users, voiceprint matching can be introduced into the condition of awakening the terminal device, specifically, under the condition that a first awakening voice of a user is received, voiceprint features of the user inputting the first awakening voice can be further extracted, namely, the voiceprint features of the user inputting the first awakening voice are extracted from the first awakening voice, the extracted voiceprint features are matched with preset voiceprint features, and the first awakening voice is matched with a first awakening word of the terminal device; the preset voiceprint features can be voiceprint features of a user with a wakeup authority, which are pre-recorded in the terminal device, and the preset voiceprint features can include one or more voiceprint features.
And only under the condition that the voiceprint features extracted from the first awakening voice are matched with the preset voiceprint features, the current user can be determined to be an authority user, and under the condition that the first awakening word is matched with the first awakening word of the terminal equipment, the terminal equipment is awakened in response to the first awakening voice.
Therefore, under the condition that the at least one voice device does not have the voiceprint recognition function, the voiceprint recognition function can be indirectly achieved in the awakening of the at least one voice device by adding voiceprint matching in the awakening of the terminal device, and further the awakening safety of the voice devices is improved.
In the embodiment of the present invention, the terminal Device may be any Device having a storage medium, for example, a Computer (Computer), a Mobile phone, a Tablet Personal Computer (Tablet Personal Computer), a laptop Computer (L laptop Computer), a Personal Digital Assistant (Personal Digital Assistant, PDA for short), a Mobile Internet Device (MID for short), a Wearable Device (Wearable Device), a portable electronic Device, or other terminal devices.
In the device wake-up method in this embodiment, the terminal device is connected to at least one speaker, and the at least one speaker is respectively installed on at least one voice device and respectively covers a sound input channel of a microphone of each voice device, and on this basis, by waking up the terminal device, the terminal device is triggered to synthesize a wake-up voice of the target voice device, and the synthesized wake-up voice is played through the speaker, thereby achieving the purpose of waking up the target voice device. Therefore, the user does not need to memorize the awakening words of different voice devices, only needs to memorize the awakening words of the terminal device, and indirectly achieves the purpose of awakening the target voice device in the mode of awakening the terminal device, so that convenience is brought to the user, and the burden that the user needs to memorize the awakening words of different devices is reduced.
Referring to fig. 2, fig. 2 is a schematic structural diagram of a terminal device according to an embodiment of the present invention, the terminal device is connected to at least one speaker, the at least one speaker is respectively mounted on at least one voice device, and respectively covers a sound input channel of a microphone of each voice device, as shown in fig. 2, the terminal device 200 includes:
a first receiving module 201, configured to receive a first wake-up voice input by a user;
and the awakening module 202 is configured to awaken the terminal device under the condition that the first awakening voice is matched with a first awakening word of the terminal device, synthesize a second awakening voice matched with a second awakening word of a target voice device, play the second awakening voice through a speaker installed on the target voice device, and awaken the target voice device through the second awakening voice, wherein the target voice device is at least one of the at least one voice device.
Optionally, the wake-up module 202 includes:
a determining unit, configured to determine, according to a pre-obtained wake-up word of each voice device in the at least one voice device, a second wake-up word of the target voice device;
and the synthesis unit is used for synthesizing the second awakening voice matched with the second awakening word.
Optionally, the terminal device 200 further includes:
the second receiving module is used for receiving a wake-up word of each voice device in the at least one voice device input by a user;
the storage module is used for storing the awakening words of each voice device in the at least one voice device;
the determining unit is configured to determine a second wake-up word of the target voice device according to the stored wake-up word of each voice device of the at least one voice device.
Optionally, the terminal device 200 further includes:
the third receiving module is used for receiving a voice control instruction input by a user;
a first determining module, configured to determine, based on the voice control instruction, the corresponding target voice device, where the target voice device is a voice device that can respond to the voice control instruction in the at least one voice device.
Optionally, the terminal device 200 further includes:
the second determination module is used for determining whether the voice control instruction is matched with the common control instruction or not according to the common control instruction which is pre-recorded and corresponds to the first awakening word;
the first determining module is used for determining the corresponding target voice equipment based on the voice control instruction under the condition that the voice control instruction is matched with the common control instruction.
Optionally, the terminal device 200 further includes:
the fourth receiving module is used for receiving a third awakening word input by the user;
the setting module is used for setting the third awakening word as the first awakening word of the terminal equipment;
and the third determining module is used for determining the score of the first awakening word and outputting the score.
Optionally, the wake-up module 202 includes:
the extracting unit is used for extracting the voiceprint features in the first awakening voice;
and the awakening unit is used for awakening the terminal equipment under the condition that the first awakening voice is matched with the first awakening word of the terminal equipment and the voiceprint characteristics are matched with the preset voiceprint characteristics.
The terminal device 200 can implement each process implemented by the terminal device in the method embodiment of fig. 1, and is not described herein again to avoid repetition. The terminal device 400 of the embodiment of the present invention may enter the wake-up state when receiving the first wake-up voice input by the user, synthesize the wake-up voice of the target voice device, and then play the synthesized wake-up voice through the speaker, thereby achieving the purpose of waking up the target voice device. Therefore, the user does not need to memorize the awakening words of different voice devices, only needs to memorize the awakening words of the terminal device, and indirectly achieves the purpose of awakening the target voice device in the mode of awakening the terminal device, so that convenience is brought to the user, and the burden that the user needs to memorize the awakening words of different devices is reduced.
The embodiment of the present invention further provides a terminal device, which includes a processor, a memory, and a computer program stored in the memory and capable of running on the processor, and when being executed by the processor, the computer program implements each process of the device wake-up method embodiment, and can achieve the same technical effect, and is not described herein again to avoid repetition.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements each process of the above device wake-up method embodiment, and can achieve the same technical effect, and in order to avoid repetition, details are not repeated here. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.
While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.
Claims (12)
1. A device wake-up method is applied to a terminal device, wherein the terminal device is connected with at least one speaker, the at least one speaker is respectively installed on at least one voice device and respectively covers an acoustic channel of a microphone of each voice device, and the method comprises the following steps:
receiving a first awakening voice input by a user;
and awakening the terminal equipment under the condition that the first awakening voice is matched with a first awakening word of the terminal equipment, synthesizing a second awakening voice matched with a second awakening word of the target voice equipment, playing the second awakening voice through a loudspeaker installed on the target voice equipment, and awakening the target voice equipment through the second awakening voice, wherein the target voice equipment is at least one of the at least one voice equipment.
2. The method of claim 1, wherein synthesizing a second wake-up voice that matches a second wake-up word of a target voice device comprises:
determining a second awakening word of the target voice equipment according to the pre-acquired awakening word of each voice equipment in the at least one voice equipment;
and synthesizing a second awakening voice matched with the second awakening word.
3. The method of claim 2, wherein prior to receiving the first wake-up voice of the user input, the method further comprises:
receiving a wake-up word of each voice device in the at least one voice device input by a user;
storing a wake-up word for each of the at least one voice device;
the determining a second wake-up word of the target voice device according to the pre-obtained wake-up word of each voice device of the at least one voice device includes:
and determining a second awakening word of the target voice equipment according to the stored awakening word of each voice equipment in the at least one voice equipment.
4. The method of claim 1, wherein after waking up the terminal device and before synthesizing a second wake-up voice matching a second wake-up word of a target voice device, the method further comprises:
receiving a voice control instruction input by a user;
and determining the corresponding target voice equipment based on the voice control instruction, wherein the target voice equipment is the voice equipment capable of responding to the voice control instruction in the at least one voice equipment.
5. The method of claim 4, wherein after receiving the voice control command input by the user and before determining the corresponding target voice device based on the voice control command, the method further comprises:
determining whether the voice control instruction is matched with the common control instruction or not according to the common control instruction which is pre-recorded and corresponds to the first awakening word;
the determining the corresponding target voice device based on the voice control instruction comprises:
and under the condition that the voice control instruction is matched with the common control instruction, determining the corresponding target voice equipment based on the voice control instruction.
6. The method of claim 1, wherein prior to receiving the first wake-up voice input by the user, the method further comprises:
receiving a third awakening word input by a user;
setting the third awakening word as a first awakening word of the terminal equipment;
and determining the score of the first awakening word and outputting the score.
7. The method of claim 1, wherein waking up the terminal device if the first wake-up voice matches a first wake-up word of the terminal device comprises:
extracting voiceprint features in the first awakening voice;
and awakening the terminal equipment under the condition that the first awakening voice is matched with the first awakening word of the terminal equipment and the voiceprint characteristics are matched with the preset voiceprint characteristics.
8. A terminal device, wherein the terminal device is connected to at least one speaker, the at least one speaker is respectively mounted on at least one voice device, and respectively covers an acoustic channel of a microphone of each voice device, the terminal device comprising:
the first receiving module is used for receiving a first awakening voice input by a user;
and the awakening module is used for awakening the terminal equipment under the condition that the first awakening voice is matched with a first awakening word of the terminal equipment, synthesizing a second awakening voice matched with a second awakening word of the target voice equipment, playing the second awakening voice through a loudspeaker installed on the target voice equipment, and awakening the target voice equipment through the second awakening voice, wherein the target voice equipment is at least one of the at least one voice equipment.
9. The terminal device of claim 8, wherein the wake-up module comprises:
a determining unit, configured to determine, according to a pre-obtained wake-up word of each voice device in the at least one voice device, a second wake-up word of the target voice device;
and the synthesis unit is used for synthesizing the second awakening voice matched with the second awakening word.
10. The terminal device according to claim 9, wherein the terminal device further comprises:
the second receiving module is used for receiving a wake-up word of each voice device in the at least one voice device input by a user;
the storage module is used for storing the awakening words of each voice device in the at least one voice device;
the determining unit is configured to determine a second wake-up word of the target voice device according to the stored wake-up word of each voice device of the at least one voice device.
11. A terminal device comprising a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps in the device wake-up method according to any one of claims 1 to 7.
12. A computer-readable storage medium, having stored thereon a computer program which, when being executed by a processor, carries out the steps of the device wake-up method according to any one of claims 1 to 7.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010191577.0A CN111429917B (en) | 2020-03-18 | 2020-03-18 | Equipment awakening method and terminal equipment |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010191577.0A CN111429917B (en) | 2020-03-18 | 2020-03-18 | Equipment awakening method and terminal equipment |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN111429917A true CN111429917A (en) | 2020-07-17 |
| CN111429917B CN111429917B (en) | 2023-09-22 |
Family
ID=71547551
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010191577.0A Active CN111429917B (en) | 2020-03-18 | 2020-03-18 | Equipment awakening method and terminal equipment |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111429917B (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111798850A (en) * | 2020-08-05 | 2020-10-20 | 深圳市北科瑞声科技股份有限公司 | Method and system for operating equipment by voice and server |
| CN112000836A (en) * | 2020-08-20 | 2020-11-27 | 北京声智科技有限公司 | Song playing method and device and electronic equipment |
| CN114220442A (en) * | 2022-01-27 | 2022-03-22 | 美的集团(上海)有限公司 | Control method of intelligent home system and intelligent home system |
| CN115242571A (en) * | 2021-04-25 | 2022-10-25 | 佛山市顺德区美的电热电器制造有限公司 | Distributed voice interaction method and device, readable storage medium, and home appliance |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104243717A (en) * | 2014-09-30 | 2014-12-24 | 广东欧珀移动通信有限公司 | Method and device for unlocking mobile phone through voice recognition based on social application |
| CN108712566A (en) * | 2018-04-27 | 2018-10-26 | 维沃移动通信有限公司 | A kind of voice assistant awakening method and mobile terminal |
| CN108899027A (en) * | 2018-08-15 | 2018-11-27 | 珠海格力电器股份有限公司 | Voice analysis method and device |
| CN109243459A (en) * | 2018-11-30 | 2019-01-18 | 广东美的制冷设备有限公司 | Voice awakening method, device, household appliance and the control system of equipment |
| US20190043492A1 (en) * | 2017-08-07 | 2019-02-07 | Sonos, Inc. | Wake-Word Detection Suppression |
| KR20190082689A (en) * | 2019-06-20 | 2019-07-10 | 엘지전자 주식회사 | Method and apparatus for recognizing a voice |
| CN110097876A (en) * | 2018-01-30 | 2019-08-06 | 阿里巴巴集团控股有限公司 | Voice wakes up processing method and is waken up equipment |
| US20190311720A1 (en) * | 2018-04-09 | 2019-10-10 | Amazon Technologies, Inc. | Device arbitration by multiple speech processing systems |
| CN110827836A (en) * | 2019-10-23 | 2020-02-21 | 珠海格力电器股份有限公司 | Method and device for resetting awakening words, electronic equipment and storage medium |
-
2020
- 2020-03-18 CN CN202010191577.0A patent/CN111429917B/en active Active
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104243717A (en) * | 2014-09-30 | 2014-12-24 | 广东欧珀移动通信有限公司 | Method and device for unlocking mobile phone through voice recognition based on social application |
| US20190043492A1 (en) * | 2017-08-07 | 2019-02-07 | Sonos, Inc. | Wake-Word Detection Suppression |
| CN110097876A (en) * | 2018-01-30 | 2019-08-06 | 阿里巴巴集团控股有限公司 | Voice wakes up processing method and is waken up equipment |
| US20190311720A1 (en) * | 2018-04-09 | 2019-10-10 | Amazon Technologies, Inc. | Device arbitration by multiple speech processing systems |
| CN108712566A (en) * | 2018-04-27 | 2018-10-26 | 维沃移动通信有限公司 | A kind of voice assistant awakening method and mobile terminal |
| CN108899027A (en) * | 2018-08-15 | 2018-11-27 | 珠海格力电器股份有限公司 | Voice analysis method and device |
| CN109243459A (en) * | 2018-11-30 | 2019-01-18 | 广东美的制冷设备有限公司 | Voice awakening method, device, household appliance and the control system of equipment |
| KR20190082689A (en) * | 2019-06-20 | 2019-07-10 | 엘지전자 주식회사 | Method and apparatus for recognizing a voice |
| CN110827836A (en) * | 2019-10-23 | 2020-02-21 | 珠海格力电器股份有限公司 | Method and device for resetting awakening words, electronic equipment and storage medium |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111798850A (en) * | 2020-08-05 | 2020-10-20 | 深圳市北科瑞声科技股份有限公司 | Method and system for operating equipment by voice and server |
| CN111798850B (en) * | 2020-08-05 | 2024-03-01 | 深圳市北科瑞声科技股份有限公司 | Method and system for operating equipment by voice and server |
| CN112000836A (en) * | 2020-08-20 | 2020-11-27 | 北京声智科技有限公司 | Song playing method and device and electronic equipment |
| CN115242571A (en) * | 2021-04-25 | 2022-10-25 | 佛山市顺德区美的电热电器制造有限公司 | Distributed voice interaction method and device, readable storage medium, and home appliance |
| CN114220442A (en) * | 2022-01-27 | 2022-03-22 | 美的集团(上海)有限公司 | Control method of intelligent home system and intelligent home system |
Also Published As
| Publication number | Publication date |
|---|---|
| CN111429917B (en) | 2023-09-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN107370649B (en) | Household appliance control method, system, control terminal and storage medium | |
| CN107895578B (en) | Voice interaction method and device | |
| CN111429917B (en) | Equipment awakening method and terminal equipment | |
| US6584439B1 (en) | Method and apparatus for controlling voice controlled devices | |
| CN106992008B (en) | Processing method and electronic equipment | |
| CN110634483A (en) | Human-computer interaction method, device, electronic device and storage medium | |
| US20020088336A1 (en) | Method of identifying pieces of music | |
| EP3611724A1 (en) | Voice response method and device, and smart device | |
| CN108735209A (en) | Wake up word binding method, smart machine and storage medium | |
| CN111312235A (en) | Voice interaction method, device and system | |
| CN107864410B (en) | Multimedia data processing method and device, electronic equipment and storage medium | |
| US20210168460A1 (en) | Electronic device and subtitle expression method thereof | |
| JP2019062377A (en) | Electronic device, acoustic device, control method of electronic device, and control program | |
| CN113438640B (en) | Equipment pairing connection method, device, system and storage medium | |
| CN111640434A (en) | Method and apparatus for controlling voice equipment | |
| CN111343028A (en) | Distribution network control method and device | |
| US20200125603A1 (en) | Electronic device and system which provides service based on voice recognition | |
| KR20010020876A (en) | Method and apparatus for enhancing activation of voice controlled devices | |
| JP2020003774A (en) | Method and apparatus for processing audio | |
| CN112802465A (en) | Voice control method and system | |
| JP2005031540A (en) | Home appliances with voice function | |
| JP2019191552A (en) | Cloud wake-up method and system, terminal and computer readable storage medium | |
| CN114391165A (en) | Voice information processing method, device, equipment and storage medium | |
| CN111724783B (en) | Wake-up methods, devices, smart devices and media for smart devices | |
| CN107205095A (en) | Voice information playback method, device and terminal |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |