[go: up one dir, main page]

HK1250024B - Control method of voice conversation robot and terminal equipment - Google Patents

Control method of voice conversation robot and terminal equipment Download PDF

Info

Publication number
HK1250024B
HK1250024B HK18109503.1A HK18109503A HK1250024B HK 1250024 B HK1250024 B HK 1250024B HK 18109503 A HK18109503 A HK 18109503A HK 1250024 B HK1250024 B HK 1250024B
Authority
HK
Hong Kong
Prior art keywords
voice
voice information
information
dialogue robot
identification code
Prior art date
Application number
HK18109503.1A
Other languages
Chinese (zh)
Other versions
HK1250024A1 (en
Inventor
黄伟淦
Original Assignee
深圳壹账通智能科技有限公司
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Priority to HK18109503.1A priority Critical patent/HK1250024B/en
Publication of HK1250024A1 publication Critical patent/HK1250024A1/en
Publication of HK1250024B publication Critical patent/HK1250024B/en

Links

Description

语音对话机器人的控制方法及终端设备Control method and terminal device of voice dialogue robot

技术领域Technical Field

本发明属于人工智能技术领域,尤其涉及一种语音对话机器人的控制方法及终端设备。The present invention belongs to the field of artificial intelligence technology, and in particular relates to a control method and terminal equipment for a voice dialogue robot.

背景技术Background Art

随着智能语音技术的发展,以智能音箱为代表的语音对话机器人已得到了广泛的应用。现有的语音对话机器人均已实现了与用户之间的人机语音交互,其可根据用户发出的语音信息,回答用户的问题或者执行相应的控制功能。因此,语音对话机器人的发展为人们的生活带来了极大的便利。With the development of intelligent voice technology, voice-activated conversational robots, represented by smart speakers, have become widely used. Existing voice-activated conversational robots have achieved human-machine voice interaction with users, answering questions or executing corresponding control functions based on the user's voice information. Therefore, the development of voice-activated conversational robots has brought great convenience to people's lives.

然而,市面上的语音对话机器人通常都只能单独运作。当多个语音对话机器人存在于同一环境,但各个语音对话机器人相互间隔较远时,若用户需要同时控制多个语音对话机器人,则只能走到各个语音对话机器人所在的位置后,才能分别进行语音控制。这种情况导致了语音对话机器人的控制效率较为低下。However, the voice chatbots currently available on the market typically operate independently. If multiple voice chatbots are located in the same environment but are separated by large distances, users who wish to control multiple robots simultaneously must walk to each robot's location before controlling them individually. This situation results in relatively low control efficiency for the voice chatbots.

发明内容Summary of the Invention

有鉴于此,本发明实施例提供了一种语音对话机器人的控制方法及终端设备,以解决现有技术中语音对话机器人的控制效率较为低下的问题。In view of this, an embodiment of the present invention provides a control method and terminal device for a voice dialogue robot to solve the problem of relatively low control efficiency of the voice dialogue robot in the prior art.

本发明实施例的第一方面提供了一种语音对话机器人的控制方法,包括:A first aspect of an embodiment of the present invention provides a method for controlling a voice dialogue robot, comprising:

广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码;broadcasting a robot search signal, and upon receiving a response message based on the robot search signal, extracting an identification code of the voice dialogue robot from the response message;

基于所述识别码,与所述语音对话机器人建立连接;Establishing a connection with the voice dialogue robot based on the identification code;

获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式;Acquire a first voice message sent by a user, and determine a control mode of the first voice message;

若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。If the control mode is a broadcast mode, the first voice information is synchronized to the voice dialogue robot associated with the identification code, so that the voice dialogue robot executes a control instruction matching the first voice information.

本发明实施例的第二方面提供了一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如下步骤:A second aspect of an embodiment of the present invention provides a terminal device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the following steps when executing the computer program:

广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码;broadcasting a robot search signal, and upon receiving a response message based on the robot search signal, extracting an identification code of the voice dialogue robot from the response message;

基于所述识别码,与所述语音对话机器人建立连接;Establishing a connection with the voice dialogue robot based on the identification code;

获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式;Acquire a first voice message sent by a user, and determine a control mode of the first voice message;

若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。If the control mode is a broadcast mode, the first voice information is synchronized to the voice dialogue robot associated with the identification code, so that the voice dialogue robot executes a control instruction matching the first voice information.

本发明实施例的第三方面提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如下步骤:A third aspect of an embodiment of the present invention provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the following steps are implemented:

广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码;broadcasting a robot search signal, and upon receiving a response message based on the robot search signal, extracting an identification code of the voice dialogue robot from the response message;

基于所述识别码,与所述语音对话机器人建立连接;Establishing a connection with the voice dialogue robot based on the identification code;

获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式;Acquire a first voice message sent by a user, and determine a control mode of the first voice message;

若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。If the control mode is a broadcast mode, the first voice information is synchronized to the voice dialogue robot associated with the identification code, so that the voice dialogue robot executes a control instruction matching the first voice information.

本发明实施例中,通过广播机器人搜索信号,能够检测出信号搜索范围内所存在的各个语音对话机器人,从而可以自动获取出与本端语音对话机器人处于不同位置的各个语音对话机器人的识别码,并实现与远程的语音对话机器人的通讯连接;通过确定语音信息的控制模式,保证了在接收到用户发出的语音信息的控制模式为广播模式时,能够将语音信息同步至已连接的各个语音对话机器人,使得用户能够基于其发出的一条语音信息,同时对位置相隔较远的多个语音对话机器人进行语音控制。用户无需再走到各个语音对话机器人所在的位置后才能执行语音控制,因此,本方面实施例提高了语音对话机器人的控制效率。In an embodiment of the present invention, by broadcasting a robot search signal, it is possible to detect each voice dialogue robot within the signal search range, thereby automatically obtaining the identification codes of each voice dialogue robot at a different location from the local voice dialogue robot and achieving communication connection with the remote voice dialogue robot; by determining the control mode of the voice information, it is ensured that when the control mode of the voice information received from the user is the broadcast mode, the voice information can be synchronized to each connected voice dialogue robot, allowing the user to voice control multiple voice dialogue robots at distant locations based on a voice message sent by the user. The user no longer needs to go to the location of each voice dialogue robot to perform voice control, thus improving the control efficiency of the voice dialogue robot.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本发明实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the following briefly introduces the drawings required for use in the embodiments or the description of the prior art. Obviously, the drawings described below are only some embodiments of the present invention. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying any creative work.

图1是本发明实施例提供的语音对话机器人的控制方法的实现流程图;FIG1 is a flowchart of a method for controlling a voice dialogue robot according to an embodiment of the present invention;

图2是本发明实施例提供的语音对话机器人的控制方法S103的具体实现流程图;FIG2 is a flowchart of a specific implementation of the control method S103 of the voice dialogue robot provided by an embodiment of the present invention;

图3是本发明另一实施例提供的语音对话机器人的控制方法的实现流程图;FIG3 is a flowchart of a control method for a voice dialogue robot according to another embodiment of the present invention;

图4是本发明又一实施例提供的语音对话机器人的控制方法的实现流程图;FIG4 is a flowchart of a control method for a voice dialogue robot according to another embodiment of the present invention;

图5是本发明实施例提供的语音对话机器人的控制方法S106的具体实现流程图;FIG5 is a flowchart of a specific implementation of the control method S106 of the voice dialogue robot provided in an embodiment of the present invention;

图6是本发明实施例提供的语音对话机器人的控制装置的结构框图;6 is a structural block diagram of a control device for a voice dialogue robot provided by an embodiment of the present invention;

图7是本发明另一实施例提供的语音对话机器人的控制装置的结构框图;7 is a block diagram of a control device for a voice dialogue robot according to another embodiment of the present invention;

图8是本发明又一实施例提供的语音对话机器人的控制装置的结构框图;8 is a structural block diagram of a control device for a voice dialogue robot provided by yet another embodiment of the present invention;

图9是本发明再一实施例提供的语音对话机器人的控制装置的结构框图;9 is a block diagram of a control device for a voice dialogue robot according to another embodiment of the present invention;

图10是本发明实施例提供的终端设备的示意图。FIG10 is a schematic diagram of a terminal device provided in an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本发明实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本发明。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本发明的描述。In the following description, specific details such as particular system structures and techniques are provided for purposes of illustration, not limitation, to facilitate a thorough understanding of the embodiments of the present invention. However, it will be apparent to those skilled in the art that the present invention may be practiced in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present invention with unnecessary detail.

在本发明的以下各个实施例中,为了区别语音对话机器人在不同时刻所发送或者接收到的各条语音信息,以“第一语音信息”、“第二语音信息”、“第三语音信”息以及“第四语音信息”来命名各条语音信息,值得注意的是,各条语音信息之间的关联关系由其功能用途和内在逻辑来确定,“第一”、“第二”、“第三”以及“第四”并非用于限定语音信息的执行顺序,因而其序号不应对本发明实施例的实施过程构成任何限定。In the following embodiments of the present invention, in order to distinguish the voice messages sent or received by the voice dialogue robot at different times, each voice message is named "first voice message", "second voice message", "third voice message" and "fourth voice message". It is worth noting that the relationship between the voice messages is determined by their functional use and internal logic. "First", "second", "third" and "fourth" are not used to limit the execution order of the voice messages, and therefore their serial numbers should not constitute any limitation on the implementation process of the embodiments of the present invention.

为了说明本发明所述的技术方案,下面通过具体实施例来进行说明。In order to illustrate the technical solution of the present invention, specific embodiments are provided below.

图1示出了本发明实施例提供的语音对话机器人的控制方法的实现流程,该方法流程包括步骤S101至S104。各步骤的具体实现原理如下:FIG1 shows the implementation process of the control method of the voice dialogue robot provided by an embodiment of the present invention, which includes steps S101 to S104. The specific implementation principles of each step are as follows:

S101:广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码。S101: Broadcasting a robot search signal, and upon receiving response information based on the robot search signal, extracting an identification code of the voice dialogue robot from the response information.

本发明实施例中,各步骤的执行主体为语音对话机器人,将该语音对话机器人称为本端语音对话机器人,以区别于远程的各个语音对话机器人。In the embodiment of the present invention, the execution subject of each step is a voice dialogue robot, which is called a local voice dialogue robot to distinguish it from various remote voice dialogue robots.

在本端语音对话机器人的运行过程中,持续发出机器人搜索信号。当位于信号传播范围内的各个语音对话机器人检测到该机器人搜索信号时,将发出基于该机器人搜索信号的响应信息。由于信号传播范围内的各个语音对话机器人与本端语音对话机器人处于不同的位置点,因而将信号传播范围内的各个语音对话机器人称为远程的语音对话机器人。During operation, the local voice dialogue robot continuously emits a robot search signal. When other voice dialogue robots within the signal's transmission range detect this robot search signal, they respond with information based on the robot search signal. Because each voice dialogue robot within the signal's transmission range is located at a different point from the local voice dialogue robot, each voice dialogue robot within the signal's transmission range is referred to as a remote voice dialogue robot.

远程的语音对话机器人所发出的响应信息中,包含有该语音对话机器人的识别码。识别码用于唯一标识一个语音对话机器人。识别码例如可以是PIN(PersonalIdentification Number)码。The response message sent by the remote voice dialogue robot includes the identification code of the voice dialogue robot. The identification code is used to uniquely identify a voice dialogue robot. The identification code can be, for example, a PIN (Personal Identification Number).

优选地,响应信息中,还包含有语音对话机器人的的设备名称。设备名称为语音对话机器人的名称,其在出厂时由厂家预先设置,也可由用户自定义设置。Preferably, the response information also includes the device name of the voice dialogue robot. The device name is the name of the voice dialogue robot, which is pre-set by the manufacturer when it leaves the factory and can also be customized by the user.

将包含于同一响应信息中的识别码以及设备名称存储于数据表的一条记录中,以确定各个识别码及其设备名称的对应关系。同时,将本端语音对话机器人的识别码以及设备名称也存储于数据表的一记录中,并将该记录标记为本地记录。The identification code and device name included in the same response message are stored in a record in the data table to determine the correspondence between each identification code and its device name. At the same time, the identification code and device name of the local voice dialogue robot are also stored in a record in the data table, and the record is marked as a local record.

S102:基于所述识别码,与所述语音对话机器人建立连接。S102: Establishing a connection with the voice dialogue robot based on the identification code.

本端语音对话机器人基于接收到的各个识别码,与远程的各个语音对话机器人进行自动配对,并向远程的各个语音对话机器人发出链路建立请求。链路建立成功后,本端语音对话机器人以及远程的语音对话机器人之间,即可进行双向的数据通讯或语音通讯。Based on the received identification codes, the local voice chatbot automatically pairs with the remote voice chatbots and sends a link establishment request to each remote voice chatbot. Once the link is established, two-way data or voice communication can be established between the local voice chatbot and the remote voice chatbot.

S103:获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式。S103: Acquire a first voice message sent by a user, and determine a control mode of the first voice message.

当用户需要对语音对话机器人进行控制时,将会对用户当前所在位置的本端语音对话机器人发出语音信息。本端语音对话机器人所接收到的用户发出的语音信息例如可以是,“播放歌曲清单中的第三首歌”。When the user needs to control the voice dialogue robot, a voice message will be sent to the local voice dialogue robot at the user's current location. The voice message received by the local voice dialogue robot may be, for example, "play the third song in the song list."

本发明实施例中,不同的语音信息对应不同的控制模式。控制模式表示本端语音对话机器人对语音信息的同步方式。控制模式包括单机模式、组播模式以及广播模式。In the embodiment of the present invention, different voice messages correspond to different control modes. The control mode indicates how the local voice dialogue robot synchronizes the voice messages. The control modes include stand-alone mode, multicast mode, and broadcast mode.

单机模式表示,本端语音对话机器人无需对语音信息进行同步,即,与该语音信息匹配的控制指令仅需由本端语音对话机器人执行。The stand-alone mode means that the local voice dialogue robot does not need to synchronize the voice information, that is, the control instructions matching the voice information only need to be executed by the local voice dialogue robot.

组播模式表示,本端语音对话机器人需将该语音信息同步至远程的一个或多个语音对话机器人。The multicast mode means that the local voice dialogue robot needs to synchronize the voice information to one or more remote voice dialogue robots.

广播模式表示,本端语音对话机器人需将该语音信息同步至当前时刻已连接的各个远程的语音对话机器人。The broadcast mode means that the local voice dialogue robot needs to synchronize the voice information to all remote voice dialogue robots that are currently connected.

语音信息的控制模式可由用户预设于本端语音对话机器人的参数信息中。例如,在参数信息中,将本端语音对话机器人的语音信息的控制模式统一设置为广播模式。The control mode of the voice information can be preset by the user in the parameter information of the local voice dialogue robot. For example, in the parameter information, the control mode of the voice information of the local voice dialogue robot is uniformly set to the broadcast mode.

作为本发明的一个实施例,如图2所示,上述S103具体包括:As an embodiment of the present invention, as shown in FIG2 , the above S103 specifically includes:

S1031:对用户发出的第一语音信息进行解析,以获取所述第一语音信息中的关键词。S1031: Analyze the first voice message sent by the user to obtain keywords in the first voice message.

本发明实施例中,在接收到用户发出的语音信息时,通过预设的语音识别算法对该语音信息进行解析。In the embodiment of the present invention, when a voice message sent by a user is received, the voice message is analyzed by a preset voice recognition algorithm.

具体地,语音识别过程包括:以预设的帧长以及帧移来对语音信息进行分帧处理,得到M(M为大于零的整数)帧波形;分别提取每一帧波形的声学特征,如MFCC(梅尔频率倒谱系数Mel-Frequency Cepstral Coefficients),以获得每一帧波形对应的N维向量。由于一个词语的发音由音素这一语音单位构成,比音素更细的语音单位为状态,一个音素包含3个状态,因此,本发明实施例中,将每一帧波形所对应的N维向量输入预先获得的声学模型,如隐马尔可夫模型,以输出该帧波形对应各个状态的概率;将概率最大的状态确定为该帧波形对应的状态。通过确定各帧波形所对应的状态,将连续出现的每三个状态组合成一个音素,再把若干个音素进行组合后,输出语音信息所对应的词语,由此实现语音信息到文本的转换。Specifically, the speech recognition process includes: framing the speech information using a preset frame length and frame shift to obtain M (M is an integer greater than zero) frame waveforms; extracting acoustic features, such as MFCC (Mel-Frequency Cepstral Coefficients), from each frame waveform to obtain an N-dimensional vector corresponding to each frame waveform. Since the pronunciation of a word is composed of phonemes, a phonetic unit finer than a phoneme, and a phoneme contains three states, in embodiments of the present invention, the N-dimensional vector corresponding to each frame waveform is input into a pre-acquired acoustic model, such as a hidden Markov model, to output the probabilities of each state corresponding to the frame waveform; the state with the highest probability is determined as the state corresponding to the frame waveform. By determining the states corresponding to each frame waveform, every three consecutive states are combined into a phoneme. After combining multiple phonemes, the word corresponding to the speech information is output, thereby achieving the conversion of speech information into text.

在将语音信息转换为相应的文本,并对该文本进行分词后,可得到的每一分词均为语音信息的关键词。After converting the voice information into corresponding text and performing word segmentation on the text, each word segmented is a keyword of the voice information.

S1032:若所述关键词与所述数据表中存储的各个所述设备名称均不相同,则确定所述第一语音信息的控制模式为广播模式。S1032: If the keyword is different from any of the device names stored in the data table, determining that the control mode of the first voice information is a broadcast mode.

在S101所生成的数据表中,存储有远程的各个语音对话机器人的设备名称。将当前时刻获得的每一关键词分别与数据表中的各个设备名称进行对比,以分别判断各个关键词是否与数据表中的任一设备名称相同。The data table generated in S101 stores the device names of each remote voice dialogue robot. Each keyword obtained at the current moment is compared with each device name in the data table to determine whether each keyword is the same as any device name in the data table.

对于语音信息中的各个关键词,若其与数据表中的任一设备名称均不相同,则确定用户发出的语音信息的控制模式为广播模式。For each keyword in the voice information, if it is different from any device name in the data table, it is determined that the control mode of the voice information sent by the user is the broadcast mode.

本发明实施例中,通过实时解析语音信息中的关键词,并在关键词与数据表中的设备名称均不相同的情况下,确定语音信息的控制模式为广播模式,实现了对语音信息的控制模式的个性化设置,避免了用户只能在本端语音对话机器人的参数信息中,统一设置所有语音信息的控制模式,因而提高了控制模式的设置灵活性,由此也使得用户可以基于控制模式的判断规则,发出不同控制模式的语音信息,提高了对于语音对话机器人的控制灵活性。In an embodiment of the present invention, by real-time analysis of keywords in voice information and determining that the control mode of the voice information is a broadcast mode when the keywords are different from the device name in the data table, personalized setting of the control mode of the voice information is achieved, avoiding the user having to uniformly set the control mode of all voice information in the parameter information of the voice dialogue robot on this end, thereby improving the flexibility of setting the control mode, and thus allowing the user to send voice information of different control modes based on the judgment rules of the control mode, thereby improving the control flexibility of the voice dialogue robot.

S104:若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。S104: If the control mode is the broadcast mode, the first voice information is synchronized to the voice dialogue robot associated with the identification code, so that the voice dialogue robot executes the control instruction matching the first voice information.

当语音信息的控制模式为广播模式时,在数据表中,读取已存储的各个识别码,将该语音信息分别同步发送至远程的与各个识别码相关的语音对话机器人,以使接收到该语音信息的各个语音对话机器人可以执行与该语音信息匹配的控制指令。When the control mode of the voice information is the broadcast mode, the stored identification codes are read in the data table, and the voice information is synchronously sent to the remote voice dialogue robots associated with each identification code, so that each voice dialogue robot that receives the voice information can execute the control instructions matching the voice information.

优选地,对于接收到该语音信息的各个语音对话机器人,同样可基于上述步骤S101至S104,将该语音信息同步至远程的其他语音对话机器人,由此扩大语音信息的传播范围,实现了对分布范围更远的语音对话机器人的同步控制。Preferably, for each voice dialogue robot that receives the voice information, the voice information can also be synchronized to other remote voice dialogue robots based on the above steps S101 to S104, thereby expanding the propagation range of the voice information and realizing synchronous control of voice dialogue robots with a farther distribution range.

本发明实施例中,通过广播机器人搜索信号,能够检测出信号搜索范围内所存在的各个语音对话机器人,从而可以自动获取出与本端语音对话机器人处于不同位置的各个语音对话机器人的识别码,并实现与远程的语音对话机器人的通讯连接;通过令语音信息对应单机模式、组播模式或广播模式,并根据用户的实际所需,准确确定出语音信息的控制模式,保证了在接收到用户发出的语音信息的控制模式为广播模式时,能够将语音信息同步至已连接的各个语音对话机器人,由此使得用户能够基于其发出的一条语音信息,同时对位置相隔较远的多个语音对话机器人进行语音控制。用户无需再走到各个语音对话机器人所在的位置后才能执行语音控制,因此,本方面实施例提高了语音对话机器人的控制效率。In an embodiment of the present invention, by broadcasting a robot search signal, it is possible to detect each voice dialogue robot within the signal search range, thereby automatically obtaining the identification codes of each voice dialogue robot in a different location from the local voice dialogue robot and achieving communication connection with the remote voice dialogue robot; by making the voice information correspond to the stand-alone mode, multicast mode, or broadcast mode, and accurately determining the control mode of the voice information according to the actual needs of the user, it is ensured that when the control mode of the voice information received by the user is the broadcast mode, the voice information can be synchronized to each connected voice dialogue robot, thereby enabling the user to voice control multiple voice dialogue robots at distant locations based on a voice message sent by the user. The user no longer needs to go to the location of each voice dialogue robot to perform voice control, thus improving the control efficiency of the voice dialogue robot.

在上述各个实施例的基础之上,作为本发明的另一实施例,对控制模式为组播模式时的语音信息的同步方式作进一步的限定。如图3所示,在上述S104之后,还包括:On the basis of the above embodiments, as another embodiment of the present invention, the synchronization method of voice information when the control mode is the multicast mode is further limited. As shown in Figure 3, after the above S104, it also includes:

S105:若所述控制模式为组播模式,则在存储有识别码以及设备名称对应关系的数据表中,查找与所述第一语音信息所携带的设备名称相对应的所述识别码,所述识别码以及设备名称的对应关系从所述响应信息中获取。S105: If the control mode is a multicast mode, then in a data table storing the correspondence between identification codes and device names, the identification code corresponding to the device name carried by the first voice message is searched, and the correspondence between the identification code and the device name is obtained from the response information.

对于语音信息中的任一关键词,若其与本端语音对话机器人的设备名称相同,则确定语音信息的控制模式为单机模式;若其与数据表中除本端设备名称之外的任一设备名称相同,则确定语音信息的控制模式为组播模式,并确定用户发出的语音信息携带有该设备名称。For any keyword in the voice message, if it is the same as the device name of the local voice dialogue robot, the control mode of the voice message is determined to be stand-alone mode; if it is the same as any device name other than the local device name in the data table, the control mode of the voice message is determined to be multicast mode, and it is determined that the voice message sent by the user carries the device name.

当语音信息的控制模式为组播模式时,在数据表中,读取与语音信息所携带的设备名称相对应的识别码。When the control mode of the voice information is the multicast mode, the identification code corresponding to the device name carried in the voice information is read from the data table.

S106:将所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。S106: Synchronize the first voice information to the voice dialogue robot associated with the found identification code, so that the voice dialogue robot executes the control instruction matching the first voice information.

基于当前时刻所读取到的各个识别码,在当前时刻所连接的各个远程的语音对话机器人中,筛选出与上述各个识别码关联的各个语音对话机器人。仅将用户发出的语音信息同步至筛选出的各个语音对话机器人。Based on the identification codes read at the current moment, the voice dialogue robots associated with the identification codes are selected from the currently connected remote voice dialogue robots. Only the voice messages sent by the user are synchronized to the selected voice dialogue robots.

本发明实施例适用于用户需要控制区域内指定的多个语音对话机器人的场景之下。例如,若当前区域内分布有5个语音对话机器人,其中各个语音对话机器人设置的设备名称分别为Alice、Bob、Colly、Doggy以及Ella,且用户当前位于Alice所处的位置,则用户需要控制Alice、Colly以及Ella同时播放王力宏的歌曲时,可对Alice发出语音信息“Alice,你和Bob、Ella一起播放王力宏的歌”。此时,接收到语音信息的Alice将会把该语音信息同步至Bob以及Ella,从而保证Alice、Colly以及Ella均能获取该语音信息,并一起执行与该语音信息匹配的控制指令。The embodiments of the present invention are applicable to scenarios where a user needs to control multiple voice dialogue robots specified in an area. For example, if there are five voice dialogue robots distributed in the current area, and the device names of the voice dialogue robots are Alice, Bob, Colly, Doggy, and Ella, respectively, and the user is currently at Alice's location, then when the user needs to control Alice, Colly, and Ella to play Wang Leehom's songs at the same time, the user can send a voice message to Alice, "Alice, you play Wang Leehom's songs with Bob and Ella." At this time, Alice, who receives the voice message, will synchronize the voice message to Bob and Ella, thereby ensuring that Alice, Colly, and Ella can all obtain the voice message and jointly execute the control instructions that match the voice message.

本发明实施例中,在组播模式下,通过识别语音信息所携带的设备名称,获取与设备名称对应的识别码,并将用户发出的语音信息同步至与识别码关联的各个语音对话机器人,使得用户能够准确地发出携带有不同设备名称的语音信息,实现对指定的语音对话机器人的远程同步控制,避免了在通知远程的语音对话机器人时,仅能将语音信息广播至已连接的所有语音对话机器人,因此,实现了对语音对话机器人的有效控制,避免了无效信息的传输。In an embodiment of the present invention, in multicast mode, by identifying the device name carried in the voice information, obtaining the identification code corresponding to the device name, and synchronizing the voice information sent by the user to each voice dialogue robot associated with the identification code, the user can accurately send voice information carrying different device names, thereby realizing remote synchronous control of the specified voice dialogue robot, avoiding the situation where the voice information can only be broadcast to all connected voice dialogue robots when notifying the remote voice dialogue robot, thereby realizing effective control of the voice dialogue robot and avoiding the transmission of invalid information.

作为本发明的又一实施例,如图4所示,所述语音对话机器人的控制方法还包括:As another embodiment of the present invention, as shown in FIG4 , the control method of the voice dialogue robot further includes:

S107:若接收到所述语音对话机器人所同步的第二语音信息,则确定所述第二语音信息的功能类型。S107: If the second voice information synchronized by the voice dialogue robot is received, the function type of the second voice information is determined.

语音信息的功能类型,是指语音对话机器人在执行与语音信息匹配的控制指令后所实现的功能。语音信息的功能类型包括但不限于定时提醒、音乐播放、以及问题回答等。The function type of a voice message refers to the function achieved by the voice chatbot after executing the control instructions that match the voice message. Voice message function types include, but are not limited to, timed reminders, music playback, and question answering.

若用户发出一语音信息,且该语音信息用于控制语音对话机器人在预设时间到达时发出提醒,则该语音信息的功能类型即为定时提醒。If a user sends a voice message, and the voice message is used to control the voice dialogue robot to issue a reminder when the preset time arrives, the function type of the voice message is a timed reminder.

在接收到远程任一语音对话机器人同步至本端语音对话机器人的语音信息时,对该语音信息进行解析,以确定该语音信息的功能类型。When receiving a voice message synchronized from any remote voice dialogue robot to the local voice dialogue robot, the voice message is parsed to determine the function type of the voice message.

示例性地,若识别到语音信息中包含时间信息以及“提醒”二字,则确定语音信息的功能类型为定时提醒。For example, if it is recognized that the voice information contains time information and the word "reminder", the function type of the voice information is determined to be a timed reminder.

S108:若所述功能类型为定时提醒,则在所述第二语音信息对应的提醒时刻到达时,检测当前时刻与用户的位置距离。S108: If the function type is a timed reminder, when the reminder time corresponding to the second voice message arrives, the distance between the current time and the user's location is detected.

语音信息中所包含的时间信息即为语音信息对应的提醒时刻。若当前本端语音对话机器人的系统时间为该提醒时刻,则本端语音对话机器人检测其与用户的实时位置距离。The time information contained in the voice message is the reminder time corresponding to the voice message. If the current system time of the local voice dialogue robot is the reminder time, the local voice dialogue robot detects the real-time location distance between it and the user.

在一示例中,位置距离的检测方式可以是:基于用户所携带的定位器,获取定位器所实时上报的位置信息,以确定用户的地理位置;计算该地理位置与本端语音对话机器人所处位置的距离;将计算出的距离确定为当前时刻本端语音对话机器人与用户的位置距离。In one example, the location distance may be detected by: obtaining the location information reported in real time by the locator carried by the user to determine the user's geographic location; calculating the distance between the geographic location and the location of the local voice dialogue robot; and determining the calculated distance as the location distance between the local voice dialogue robot and the user at the current moment.

S109:若所述位置距离小于预设阈值,则发出提示信息。S109: If the location distance is less than a preset threshold, a prompt message is issued.

若位置距离小于预设阈值,则本端语音对话机器人发出提示信息,以使用户接收该提示信息。提示信息包括但不限于音频提示以及闪烁提示等。If the distance is less than a preset threshold, the local voice dialogue robot will issue a prompt message so that the user can receive the prompt message. The prompt message includes but is not limited to audio prompts and flashing prompts.

作为本发明的在另一实施示例,本端语音对话机器人启动内置的摄像头,可扫描摄像区域内所存在的人脸。此时,将摄像头的最大摄像范围确定为上述预设阈值。若在最大摄像范围内检测到人脸存在,则确定用户与本端语音对话机器人位置距离小于预设阈值,并发出提示信息。In another embodiment of the present invention, the local voice dialogue robot activates its built-in camera to scan for faces within its camera area. In this case, the camera's maximum camera range is set to the preset threshold. If a face is detected within the maximum camera range, the user is determined to be less than the preset threshold, and a prompt is issued.

优选地,在上述示例中,若在最大摄像范围内检测到人脸存在,则将该人脸的脸部特征与预设的用户的脸部特征进行对比,以确定当前位于摄像范围内的人体是否为语音对话机器人的主人。若是,则确定用户与本端语音对话机器人位置距离小于预设阈值,并发出提示信息;若否,则确定用户与本端语音对话机器人位置距离大于预设阈值,不发出提示信息。Preferably, in the above example, if a human face is detected within the maximum camera range, the facial features of that face are compared with the facial features of a preset user to determine whether the person currently within the camera range is the owner of the voice dialogue robot. If so, it is determined that the distance between the user and the local voice dialogue robot is less than a preset threshold, and a prompt message is issued; if not, it is determined that the distance between the user and the local voice dialogue robot is greater than the preset threshold, and no prompt message is issued.

本发明实施例中,在接受到定时提醒类型的语音信息后,通过在提醒时刻实时判定用户与本端语音对话机器人的位置距离是否小于预设阈值,能够确定用户是否位于本端语音对话机器人的附近区域。若用户并非位于本端语音对话机器人的附近区域,则用户也难以接收到本端语音对话机器人发出的提示信息。因此,仅在用户与本端语音对话机器人的位置距离小于预设阈值时才发出提示信息,达到了更为有效的提示效果,同时也避免了接收到语音信息的多个语音对话机器人都同时发出提示,降低了语音对话机器人的能耗。另外,通过对检测到的人脸脸部特征进行识别,使得语音对话机器人的能够准确地对语音对话机器人的主人发出提示,提高了提示的准确性。In an embodiment of the present invention, after receiving a timed reminder voice message, the user can be determined to be in the vicinity of the local voice dialogue robot by determining in real time at the reminder moment whether the distance between the user and the local voice dialogue robot is less than a preset threshold. If the user is not in the vicinity of the local voice dialogue robot, it will be difficult for the user to receive the reminder message from the local voice dialogue robot. Therefore, the reminder message is only issued when the distance between the user and the local voice dialogue robot is less than the preset threshold, achieving a more effective reminder effect. At the same time, it also avoids multiple voice dialogue robots that receive voice messages from issuing reminders at the same time, thereby reducing the energy consumption of the voice dialogue robot. In addition, by recognizing the facial features of the detected human face, the voice dialogue robot can accurately issue reminders to the owner of the voice dialogue robot, thereby improving the accuracy of the reminder.

作为本发明的一个实施例,如图5所示,上述S106具体包括:As an embodiment of the present invention, as shown in FIG5 , the above S106 specifically includes:

S1061:获取本端设备名称。S1061: Obtain the local device name.

在组播模式之下,在本端语音对话机器人将用户发出的语音信息同步至指定的一个或多个远程的语音对话机器人之前,先获取本端语音对话机器人所预先存储的设备名称,即本端设备名称。In multicast mode, before the local voice dialogue robot synchronizes the voice information sent by the user to one or more designated remote voice dialogue robots, it first obtains the device name pre-stored by the local voice dialogue robot, that is, the local device name.

S1062:在所述第一语音信息中,删除包含所述本端设备名称的语音片段。S1062: Delete the voice segment containing the local device name in the first voice information.

对用户发出的语音信息进行识别,确定出其中包含本端设备名称的语音片段。将该语音片段进行截取后删除,使得用户发出的语音信息中,不再携带有本端设备名称。The voice message sent by the user is identified to determine the voice segment containing the local device name. The voice segment is intercepted and deleted so that the voice message sent by the user no longer carries the local device name.

S1063:将删除所述语音片段后的所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与删除所述语音片段后的所述第一语音信息相匹配的控制指令。S1063: Synchronize the first voice information after deleting the voice segment to the voice dialogue robot associated with the found identification code, so that the voice dialogue robot executes a control instruction that matches the first voice information after deleting the voice segment.

根据上述S105所查找出的各个识别码,确定所需同步的远程的各个语音对话机器人。将不再携带有本端设备名称的语音信息发送至所需同步的各个语音对话机器人。According to the identification codes found in the above S105, the remote voice dialogue robots to be synchronized are determined, and the voice message no longer carrying the local device name is sent to the voice dialogue robots to be synchronized.

例如,若用户对Alice发出语音信息“Alice、Bob和Ella一起播放王力宏的歌”,则由于Alice的本端设备名称为“Alice”,故在该语音信息中,删除包含“Alice”的语音片段,得到“Bob和Ella一起播放王力宏的歌”;Alice将“Bob和Ella一起播放王力宏的歌”这一语音信息同步至Bob和Ella。For example, if a user sends a voice message to Alice saying "Alice, Bob, and Ella play Wang Leehom's songs together", then since Alice's local device name is "Alice", the voice segment containing "Alice" is deleted from the voice message, resulting in "Bob and Ella play Wang Leehom's songs together"; Alice synchronizes the voice message "Bob and Ella play Wang Leehom's songs together" to Bob and Ella.

由于远程的各个语音对话机器人在接收其所同步的语音信息时,会执行上述S101至S106,即,根据该语音信息所携带的各个设备名称,将该语音信息再次同步至于各个设备名称对应的语音对话机器人。因此,本发明实施例中,通过将语音信息中包含本端设备名称的语音片段进行删除,使得远程的各个语音对话机器人在接收其所同步的语音信息时,不会再解析出所述本端设备名称,因而不会再将该语音信息重复同步至语音信息的来源端,提高了信息的同步效率。Since each remote voice dialogue robot executes steps S101 to S106 above upon receiving the synchronized voice information, i.e., resynchronizing the voice information with the corresponding voice dialogue robot based on the device names carried in the voice information, in this embodiment of the present invention, by deleting the voice segments containing the local device name from the voice information, each remote voice dialogue robot will no longer parse the local device name upon receiving the synchronized voice information, and thus will not repeatedly synchronize the voice information with the source end, thereby improving information synchronization efficiency.

在上述各个实施例的基础之上,作为本发明的一个实施例,若接收到远程的语音对话机器人所同步过来的语音信息,则启动计时功能。在预设时长之内,若再次接收到远程的语音对话机器人所同步过来的语音信息,则计算这些语音信息的相似度。若相似度大于预设阈值,则确定这些语音信息为用户实际发出的同一语音信息,此时,筛选其中信号强度最强的一条语音信息,以执行与语音信息相匹配的控制指令。In one embodiment of the present invention, based on the aforementioned embodiments, if a voice message synchronized from a remote voice dialogue robot is received, a timing function is activated. If a voice message synchronized from the remote voice dialogue robot is received again within a preset time period, the similarity between these voice messages is calculated. If the similarity exceeds a preset threshold, the voice messages are determined to be the same voice message actually uttered by the user. In this case, the voice message with the strongest signal strength is selected to execute the control instruction that matches the voice message.

本发明实施例中,由于用户发出的语音信息可能同时被附近的多个语音对话机器人检测得到,故在广播模式或者组播模式之下,所述多个语音对话机器人均会该语音信息同步至远程的各个语音对话机器人。因此,对于远程的任一语音对话机器人而言,可能会接收到信号强度不同但内容相同的多条语音信息。这种情况下,通过判断预设时长内所先后接收到的各条语音信息的相似度,并在相似度大于阈值时,筛选出信号强度最强的语音信息,能够避免语音对话机器人重复执行多次相同的控制指令,由于筛选出的语音信息的信号强度最强,故在识别与语音信息匹配的控制指令时,能够提高识别的准确性。In an embodiment of the present invention, since the voice message sent by the user may be detected by multiple nearby voice dialogue robots at the same time, under broadcast mode or multicast mode, the multiple voice dialogue robots will synchronize the voice message to each remote voice dialogue robot. Therefore, any remote voice dialogue robot may receive multiple voice messages with different signal strengths but the same content. In this case, by determining the similarity of each voice message received successively within a preset time period, and when the similarity is greater than a threshold, filtering out the voice message with the strongest signal strength, it is possible to prevent the voice dialogue robot from repeatedly executing the same control instruction. Since the filtered voice message has the strongest signal strength, it is possible to improve the accuracy of recognition when identifying the control instruction that matches the voice message.

应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本发明实施例的实施过程构成任何限定。It should be understood that the size of the serial numbers of the steps in the above embodiments does not mean the order of execution. The execution order of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiments of the present invention.

对应于上文实施例所述的语音对话机器人的控制方法,图6示出了本发明实施例提供的语音对话机器人的控制装置的结构框图,为了便于说明,仅示出了与本发明实施例相关的部分。Corresponding to the control method of the voice dialogue robot described in the above embodiment, FIG6 shows a structural block diagram of the control device of the voice dialogue robot provided by the embodiment of the present invention. For the sake of convenience, only the part related to the embodiment of the present invention is shown.

参照图6,该装置包括:Referring to FIG6 , the apparatus comprises:

广播单元601,用于广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码。The broadcast unit 601 is used to broadcast the robot search signal and, upon receiving response information based on the robot search signal, extract the identification code of the voice dialogue robot from the response information.

连接单元602,用于基于所述识别码,与所述语音对话机器人建立连接。The connection unit 602 is used to establish a connection with the voice dialogue robot based on the identification code.

获取单元603,用于获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式。The acquiring unit 603 is configured to acquire first voice information sent by a user and determine a control mode of the first voice information.

第一同步单元604,用于若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。The first synchronization unit 604 is used to synchronize the first voice information to the voice dialogue robot associated with the identification code if the control mode is the broadcast mode, so that the voice dialogue robot executes the control instruction matching the first voice information.

可选地,所述广播单元601包括:Optionally, the broadcast unit 601 includes:

广播子单元,用于从所述响应信息中,提取出语音对话机器人的识别码以及设备名称,并将所述识别码以及所述设备名称存储至预先建立的数据表。The broadcast subunit is used to extract the identification code and device name of the voice dialogue robot from the response information, and store the identification code and the device name in a pre-established data table.

所述获取单元603包括:The acquisition unit 603 includes:

解析子单元,用于对用户发出的第一语音信息进行解析,以获取所述第一语音信息中的关键词。The parsing subunit is used to parse the first voice information sent by the user to obtain keywords in the first voice information.

确定子单元,用于若所述关键词与所述数据表中存储的各个所述设备名称均不相同,则确定所述第一语音信息的控制模式为广播模式。The determining subunit is configured to determine that the control mode of the first voice information is a broadcast mode if the keyword is different from the device names stored in the data table.

可选地,如图7所示,所述语音对话机器人的控制装置还包括:Optionally, as shown in FIG7 , the control device of the voice dialogue robot further includes:

查找单元605,用于若所述控制模式为组播模式,则在存储有识别码以及设备名称对应关系的数据表中,查找与所述第一语音信息所携带的设备名称相对应的所述识别码,所述识别码以及设备名称的对应关系从所述响应信息中获取。The search unit 605 is used to search for the identification code corresponding to the device name carried by the first voice information in a data table storing the correspondence between identification codes and device names if the control mode is a multicast mode, and the correspondence between the identification code and the device name is obtained from the response information.

第二同步单元606,用于将所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。The second synchronization unit 606 is used to synchronize the first voice information to the voice dialogue robot associated with the found identification code, so that the voice dialogue robot executes the control instruction matching the first voice information.

可选地,如图8所示,所述语音对话机器人的控制装置还包括:Optionally, as shown in FIG8 , the control device of the voice dialogue robot further includes:

确定单元607,用于若接收到所述语音对话机器人所同步的第二语音信息,则确定所述第二语音信息的功能类型。The determination unit 607 is configured to determine the function type of the second voice information if the second voice information synchronized by the voice dialogue robot is received.

检测单元608,用于若所述功能类型为定时提醒,则在所述第二语音信息对应的提醒时刻到达时,检测当前时刻与用户的位置距离。The detection unit 608 is configured to detect the distance between the current time and the user's location when the reminder time corresponding to the second voice message arrives if the function type is a timed reminder.

提示单元609,用于若所述位置距离小于预设阈值,则发出提示信息。The prompt unit 609 is configured to issue a prompt message if the position distance is less than a preset threshold.

可选地,所述第二同步单元606包括:Optionally, the second synchronization unit 606 includes:

获取子单元,用于获取本端设备名称。Get subunit, used to get the name of the local device.

删除子单元,用于在所述第一语音信息中,删除包含所述本端设备名称的语音片段。The deletion subunit is used to delete the voice segment containing the name of the local device in the first voice information.

同步子单元,用于将删除所述语音片段后的所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与删除所述语音片段后的所述第一语音信息相匹配的控制指令;a synchronization subunit, configured to synchronize the first voice information after deleting the voice segment to the voice dialogue robot associated with the found identification code, so that the voice dialogue robot executes a control instruction matching the first voice information after deleting the voice segment;

可选地,如图9所示,所述语音对话机器人的控制装置还包括:Optionally, as shown in FIG9 , the control device of the voice dialogue robot further includes:

计时单元610,用于若接收到所述语音对话机器人所同步的第三语音信息,则控制内置的计时器启动计时。The timing unit 610 is used to control the built-in timer to start timing if the third voice information synchronized by the voice dialogue robot is received.

计算单元611,用于在计时值达到第一预设阈值之前,若接收到所述语音对话机器人所同步的第四语音信息,则计算这所述第三语音信息以及所述第四语音信息的相似度。The calculation unit 611 is configured to calculate the similarity between the third voice information and the fourth voice information if the fourth voice information synchronized by the voice dialogue robot is received before the timing value reaches the first preset threshold.

执行单元612,用于若所述相似度大于第二预设阈值,则在所述第三语音信息以及所述第四语音信息中,确定出信号强度较强的一条语音信息,以执行与该语音信息匹配的控制指令。The execution unit 612 is configured to determine, if the similarity is greater than a second preset threshold, a voice message with a stronger signal strength between the third voice message and the fourth voice message, so as to execute a control instruction matching the voice message.

本发明实施例中,通过广播机器人搜索信号,能够检测出信号搜索范围内所存在的各个语音对话机器人,从而可以自动获取出与本端语音对话机器人处于不同位置的各个语音对话机器人的识别码,并实现与远程的语音对话机器人的通讯连接;通过确定语音信息的控制模式,保证了在接收到用户发出的语音信息的控制模式为广播模式时,能够将语音信息同步至已连接的各个语音对话机器人,使得用户能够基于其发出的一条语音信息,同时对位置相隔较远的多个语音对话机器人进行语音控制。用户无需再走到各个语音对话机器人所在的位置后才能执行语音控制,因此,本方面实施例提高了语音对话机器人的控制效率。In an embodiment of the present invention, by broadcasting a robot search signal, it is possible to detect each voice dialogue robot within the signal search range, thereby automatically obtaining the identification codes of each voice dialogue robot at a different location from the local voice dialogue robot and achieving communication connection with the remote voice dialogue robot; by determining the control mode of the voice information, it is ensured that when the control mode of the voice information received from the user is the broadcast mode, the voice information can be synchronized to each connected voice dialogue robot, allowing the user to voice control multiple voice dialogue robots at distant locations based on a voice message sent by the user. The user no longer needs to go to the location of each voice dialogue robot to perform voice control, thus improving the control efficiency of the voice dialogue robot.

图10是本发明一实施例提供的终端设备的示意图。如图10所示,该实施例的终端设备10包括处理器1000以及存储器1001,所述存储器1001中存储有可在所述处理器1000上运行的计算机程序1002,例如语音对话机器人的控制程序。所述处理器1000执行所述计算机程序1002时实现上述各个语音对话机器人的控制方法实施例中的步骤,例如图1所示的步骤101至104。或者,所述处理器1000执行所述计算机程序1002时实现上述各装置实施例中各模块/单元的功能,例如图6所示单元601至604的功能。FIG10 is a schematic diagram of a terminal device provided by an embodiment of the present invention. As shown in FIG10 , the terminal device 10 of this embodiment includes a processor 1000 and a memory 1001. The memory 1001 stores a computer program 1002 that can be run on the processor 1000, such as a control program for a voice dialogue robot. When the processor 1000 executes the computer program 1002, it implements the steps in the above-mentioned embodiments of the control method of each voice dialogue robot, such as steps 101 to 104 shown in FIG1 . Alternatively, when the processor 1000 executes the computer program 1002, it implements the functions of each module/unit in the above-mentioned device embodiments, such as the functions of units 601 to 604 shown in FIG6 .

示例性的,所述计算机程序1002可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器1001中,并由所述处理器1000执行,以完成本发明。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机程序指令段,该指令段用于描述所述计算机程序1002在所述终端设备10中的执行过程。Exemplarily, the computer program 1002 may be divided into one or more modules/units, which are stored in the memory 1001 and executed by the processor 1000 to implement the present invention. The one or more modules/units may be a series of computer program instruction segments capable of performing specific functions, and the instruction segments are used to describe the execution process of the computer program 1002 in the terminal device 10.

所述终端设备10可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。所述终端设备可包括,但不仅限于,处理器1000、存储器1001。本领域技术人员可以理解,图10仅仅是终端设备10的示例,并不构成对终端设备10的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述终端设备还可以包括输入输出设备、网络接入设备、总线等。The terminal device 10 may be a computing device such as a desktop computer, laptop, PDA, or cloud server. The terminal device may include, but is not limited to, a processor 1000 and a memory 1001. Those skilled in the art will appreciate that FIG10 is merely an example of a terminal device 10 and does not limit the terminal device 10. The terminal device 10 may include more or fewer components than shown, or may combine certain components or different components. For example, the terminal device may also include input/output devices, network access devices, buses, and the like.

所称处理器1000可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 1000 may be a central processing unit (CPU), or other general-purpose processors, digital signal processors (DSP), application-specific integrated circuits (ASIC), field-programmable gate arrays (FPGA), or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general-purpose processor may be a microprocessor or any conventional processor.

所述存储器1001可以是所述终端设备10的内部存储单元,例如终端设备10的硬盘或内存。所述存储器1001也可以是所述终端设备10的外部存储设备,例如所述终端设备10上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器1001还可以既包括所述终端设备10的内部存储单元也包括外部存储设备。所述存储器1001用于存储所述计算机程序以及所述终端设备所需的其他程序和数据。所述存储器1001还可以用于暂时地存储已经输出或者将要输出的数据。The memory 1001 may be an internal storage unit of the terminal device 10, such as a hard disk or memory of the terminal device 10. The memory 1001 may also be an external storage device of the terminal device 10, such as a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, a flash card, etc. equipped on the terminal device 10. Furthermore, the memory 1001 may also include both an internal storage unit of the terminal device 10 and an external storage device. The memory 1001 is used to store the computer program and other programs and data required by the terminal device. The memory 1001 may also be used to temporarily store data that has been output or is to be output.

所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, only the division of the above-mentioned functional units and modules is used as an example for illustration. In actual applications, the above-mentioned functions can be distributed and completed by different functional units and modules as needed, that is, the internal structure of the device can be divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiment can be integrated into one processing unit, or each unit can exist physically alone, or two or more units can be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing each other, and are not used to limit the scope of protection of this application. The specific working process of the units and modules in the above-mentioned system can refer to the corresponding process in the aforementioned method embodiment, and will not be repeated here.

所述集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明实现上述实施例方法中的全部或部分流程,也可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括电载波信号和电信信号。If the integrated module/unit is implemented as a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the present invention can implement all or part of the process of the above-mentioned method embodiment by instructing the relevant hardware through a computer program. The computer program can be stored in a computer-readable storage medium. When the computer program is executed by a processor, it can implement the steps of each of the above-mentioned method embodiments. The computer program includes computer program code, which can be in source code form, object code form, executable file, or some intermediate form. The computer-readable medium can include: any entity or device capable of carrying the computer program code, recording medium, USB flash drive, mobile hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM), random access memory (RAM), electric carrier signal, telecommunication signal, and software distribution medium. It should be noted that the content contained in the computer-readable medium can be appropriately increased or decreased according to the requirements of legislation and patent practice in the jurisdiction. For example, in some jurisdictions, according to legislation and patent practice, computer-readable media does not include electric carrier signals and telecommunication signals.

以上所述实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围,均应包含在本发明的保护范围之内。The embodiments described above are only used to illustrate the technical solutions of the present invention, rather than to limit the same. Although the present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that they can still modify the technical solutions described in the aforementioned embodiments, or make equivalent replacements for some of the technical features therein. These modifications or replacements do not deviate the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the various embodiments of the present invention, and should all be included in the scope of protection of the present invention.

Claims (10)

1.一种语音对话机器人的控制方法,其特征在于,包括:1. A control method for a voice-interactive robot, characterized in that it includes: 广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码;The system broadcasts a robot search signal and, upon receiving response information based on the robot search signal, extracts the identification code of the voice dialogue robot from the response information. 基于所述识别码,与所述语音对话机器人建立连接;Based on the identification code, a connection is established with the voice dialogue robot; 获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式;Acquire the first voice information sent by the user and determine the control mode of the first voice information; 若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令;If the control mode is broadcast mode, the first voice information is synchronized to the voice dialogue robot associated with the identification code, so that the voice dialogue robot executes control instructions that match the first voice information; 若接收到所述语音对话机器人所同步的第三语音信息,则控制内置的计时器启动计时;If the third voice information synchronized by the voice dialogue robot is received, the built-in timer is controlled to start timing; 在计时值达到第一预设阈值之前,若接收到所述语音对话机器人所同步的第四语音信息,则计算这所述第三语音信息以及所述第四语音信息的相似度;If the fourth voice information synchronized by the voice dialogue robot is received before the timing value reaches the first preset threshold, the similarity between the third voice information and the fourth voice information is calculated. 若所述相似度大于第二预设阈值,则在所述第三语音信息以及所述第四语音信息中,确定出信号强度较强的一条语音信息,以执行与该语音信息匹配的控制指令。If the similarity is greater than the second preset threshold, then among the third and fourth voice information, the voice information with the stronger signal strength is determined, and a control command matching the voice information is executed. 2.如权利要求1所述的控制方法,其特征在于,所述从所述响应信息中,提取出语音对话机器人的识别码,包括:2. The control method as described in claim 1, characterized in that, extracting the identification code of the voice dialogue robot from the response information includes: 从所述响应信息中,提取出语音对话机器人的识别码以及设备名称,并将所述识别码以及所述设备名称存储至预先建立的数据表;Extract the identification code and device name of the voice chatbot from the response information, and store the identification code and device name in a pre-established data table; 所述获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式,包括:The step of acquiring the first voice information sent by the user and determining the control mode of the first voice information includes: 对用户发出的第一语音信息进行解析,以获取所述第一语音信息中的关键词;The user's first voice message is parsed to obtain keywords from the first voice message; 若所述关键词与所述数据表中存储的各个所述设备名称均不相同,则确定所述第一语音信息的控制模式为广播模式。If the keyword is different from all the device names stored in the data table, then the control mode of the first voice information is determined to be broadcast mode. 3.如权利要求1所述的控制方法,其特征在于,还包括:3. The control method as described in claim 1, characterized in that it further comprises: 若所述控制模式为组播模式,则在存储有识别码以及设备名称对应关系的数据表中,查找与所述第一语音信息所携带的设备名称相对应的所述识别码,所述识别码以及设备名称的对应关系从所述响应信息中获取;If the control mode is multicast mode, then in the data table storing the correspondence between identification codes and device names, the identification code corresponding to the device name carried in the first voice information is searched, and the correspondence between the identification code and the device name is obtained from the response information; 将所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。The first voice information is synchronized to the voice chatbot associated with the found identification code, so that the voice chatbot executes control commands that match the first voice information. 4.如权利要求1所述的控制方法,其特征在于,还包括:4. The control method as described in claim 1, characterized in that it further comprises: 若接收到所述语音对话机器人所同步的第二语音信息,则确定所述第二语音信息的功能类型;If the second voice information synchronized by the voice dialogue robot is received, the function type of the second voice information is determined; 若所述功能类型为定时提醒,则在所述第二语音信息对应的提醒时刻到达时,检测当前时刻与用户的位置距离;If the function type is a timed reminder, then when the reminder time corresponding to the second voice information arrives, the distance between the current time and the user's location is detected; 若所述位置距离小于预设阈值,则发出提示信息。If the distance to the location is less than a preset threshold, a prompt message will be issued. 5.如权利要求3所述的控制方法,其特征在于,所述将所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令,包括:5. The control method as described in claim 3, characterized in that, synchronizing the first voice information to the voice dialogue robot associated with the found identification code, so that the voice dialogue robot executes control instructions matching the first voice information, includes: 获取本端设备名称;Get the local device name; 在所述第一语音信息中,删除包含所述本端设备名称的语音片段;In the first voice information, delete the voice segment containing the name of the local device; 将删除所述语音片段后的所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与删除所述语音片段后的所述第一语音信息相匹配的控制指令。The first voice information after deleting the voice segment is synchronized to the voice chatbot associated with the found identification code, so that the voice chatbot executes control instructions that match the first voice information after deleting the voice segment. 6.一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,其特征在于,所述处理器执行所述计算机程序时实现如下步骤:6. A terminal device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the computer program, performs the following steps: 广播机器人搜索信号,并在接收到基于所述机器人搜索信号的响应信息时,从所述响应信息中,提取出语音对话机器人的识别码;The system broadcasts a robot search signal and, upon receiving response information based on the robot search signal, extracts the identification code of the voice dialogue robot from the response information. 基于所述识别码,与所述语音对话机器人建立连接;Based on the identification code, a connection is established with the voice dialogue robot; 获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式;Acquire the first voice information sent by the user and determine the control mode of the first voice information; 若所述控制模式为广播模式,则将所述第一语音信息同步至与所述识别码关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令;If the control mode is broadcast mode, the first voice information is synchronized to the voice dialogue robot associated with the identification code, so that the voice dialogue robot executes control instructions that match the first voice information; 若接收到所述语音对话机器人所同步的第三语音信息,则控制内置的计时器启动计时;If the third voice information synchronized by the voice dialogue robot is received, the built-in timer is controlled to start timing; 在计时值达到第一预设阈值之前,若接收到所述语音对话机器人所同步的第四语音信息,则计算这所述第三语音信息以及所述第四语音信息的相似度;If the fourth voice information synchronized by the voice dialogue robot is received before the timing value reaches the first preset threshold, the similarity between the third voice information and the fourth voice information is calculated. 若所述相似度大于第二预设阈值,则在所述第三语音信息以及所述第四语音信息中,确定出信号强度较强的一条语音信息,以执行与该语音信息匹配的控制指令。If the similarity is greater than the second preset threshold, then among the third and fourth voice information, the voice information with the stronger signal strength is determined, and a control command matching the voice information is executed. 7.如权利要求6所述的终端设备,其特征在于,所述从所述响应信息中,提取出语音对话机器人的识别码的步骤,具体包括:7. The terminal device as described in claim 6, wherein the step of extracting the identification code of the voice dialogue robot from the response information specifically includes: 从所述响应信息中,提取出语音对话机器人的识别码以及设备名称,并将所述识别码以及所述设备名称存储至预先建立的数据表;Extract the identification code and device name of the voice chatbot from the response information, and store the identification code and device name in a pre-established data table; 所述获取用户发出的第一语音信息,并确定所述第一语音信息的控制模式的步骤,具体包括:The step of acquiring the first voice information sent by the user and determining the control mode of the first voice information specifically includes: 对用户发出的第一语音信息进行解析,以获取所述第一语音信息中的关键词;The user's first voice message is parsed to obtain keywords from the first voice message; 若所述关键词与所述数据表中存储的各个所述设备名称均不相同,则确定所述第一语音信息的控制模式为广播模式。If the keyword is different from all the device names stored in the data table, then the control mode of the first voice information is determined to be broadcast mode. 8.如权利要求6所述的终端设备,其特征在于,所述处理器执行所述计算机程序时,还实现如下步骤:8. The terminal device as described in claim 6, wherein when the processor executes the computer program, it further performs the following steps: 若所述控制模式为组播模式,则在存储有识别码以及设备名称对应关系的数据表中,查找与所述第一语音信息所携带的设备名称相对应的所述识别码,所述识别码以及设备名称的对应关系从所述响应信息中获取;If the control mode is multicast mode, then in the data table storing the correspondence between identification codes and device names, the identification code corresponding to the device name carried in the first voice information is searched, and the correspondence between the identification code and the device name is obtained from the response information; 将所述第一语音信息同步至与查找出的所述识别码相关联的所述语音对话机器人,以使所述语音对话机器人执行与所述第一语音信息匹配的控制指令。The first voice information is synchronized to the voice chatbot associated with the found identification code, so that the voice chatbot executes control commands that match the first voice information. 9.如权利要求6所述的终端设备,其特征在于,所述处理器执行所述计算机程序时,还实现如下步骤:9. The terminal device as described in claim 6, wherein when the processor executes the computer program, it further performs the following steps: 若接收到所述语音对话机器人所同步的第二语音信息,则确定所述第二语音信息的功能类型;If the second voice information synchronized by the voice dialogue robot is received, the function type of the second voice information is determined; 若所述功能类型为定时提醒,则在所述第二语音信息对应的提醒时刻到达时,检测当前时刻与用户的位置距离;If the function type is a timed reminder, then when the reminder time corresponding to the second voice information arrives, the distance between the current time and the user's location is detected; 若所述位置距离小于预设阈值,则发出提示信息。If the distance to the location is less than a preset threshold, a prompt message will be issued. 10.一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至5任一项所述方法的步骤。10. A computer-readable storage medium storing a computer program, characterized in that, when executed by a processor, the computer program implements the steps of the method as claimed in any one of claims 1 to 5.
HK18109503.1A 2018-07-23 Control method of voice conversation robot and terminal equipment HK1250024B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
HK18109503.1A HK1250024B (en) 2018-07-23 Control method of voice conversation robot and terminal equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
HK18109503.1A HK1250024B (en) 2018-07-23 Control method of voice conversation robot and terminal equipment

Publications (2)

Publication Number Publication Date
HK1250024A1 HK1250024A1 (en) 2018-11-23
HK1250024B true HK1250024B (en) 2020-08-14

Family

ID=

Similar Documents

Publication Publication Date Title
US11875820B1 (en) Context driven device arbitration
US11676575B2 (en) On-device learning in a hybrid speech processing system
CN111223497B (en) A method, device, computing device and storage medium for nearby wake-up of a terminal
WO2021159688A1 (en) Voiceprint recognition method and apparatus, and storage medium and electronic apparatus
US10643609B1 (en) Selecting speech inputs
US10629186B1 (en) Domain and intent name feature identification and processing
CN112102850B (en) Emotion recognition processing method and device, medium and electronic equipment
CN107644638A (en) Audio recognition method, device, terminal and computer-readable recording medium
CN109994106B (en) Voice processing method and equipment
EP3414757A1 (en) Controlling distributed audio outputs to enable voice output
CN113314119B (en) Voice recognition intelligent household control method and device
US20210398550A1 (en) Low power mode for speech capture devices
CN111862943B (en) Speech recognition method and device, electronic equipment and storage medium
CN111178081A (en) Method, server, electronic device and computer storage medium for semantic recognition
CN110706707A (en) Method, apparatus, device and computer-readable storage medium for voice interaction
CN113779208A (en) Method and apparatus for human-machine dialogue
JP6915637B2 (en) Information processing equipment, information processing methods, and programs
CN112002347B (en) Voice detection method and device and electronic equipment
CN112802465A (en) Voice control method and system
CN103680505A (en) Voice recognition method and voice recognition system
CN107756412B (en) The control method and terminal device of voice dialogue robot
CN112185374A (en) Method and device for determining voice intention
CN107886940A (en) Speech translation processing method and device
Goel et al. Audio dialogues: Dialogues dataset for audio and music understanding
CN111145748A (en) Audio recognition confidence determination method, device, device and storage medium