[go: up one dir, main page]

CN110444209A - Voice interactive method, the apparatus and system of web page are embedded towards intelligent vehicle device - Google Patents

Voice interactive method, the apparatus and system of web page are embedded towards intelligent vehicle device Download PDF

Info

Publication number
CN110444209A
CN110444209A CN201910745196.XA CN201910745196A CN110444209A CN 110444209 A CN110444209 A CN 110444209A CN 201910745196 A CN201910745196 A CN 201910745196A CN 110444209 A CN110444209 A CN 110444209A
Authority
CN
China
Prior art keywords
web page
instruction
embedded web
page
message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910745196.XA
Other languages
Chinese (zh)
Other versions
CN110444209B (en
Inventor
侯锦雄
郭凯
徐茂森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AI Speech Ltd
Original Assignee
AI Speech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AI Speech Ltd filed Critical AI Speech Ltd
Priority to CN201910745196.XA priority Critical patent/CN110444209B/en
Publication of CN110444209A publication Critical patent/CN110444209A/en
Application granted granted Critical
Publication of CN110444209B publication Critical patent/CN110444209B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/30Services specially adapted for particular environments, situations or purposes
    • H04W4/40Services specially adapted for particular environments, situations or purposes for vehicles, e.g. vehicle-to-pedestrians [V2P]
    • H04W4/48Services specially adapted for particular environments, situations or purposes for vehicles, e.g. vehicle-to-pedestrians [V2P] for in-vehicle communication
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/12Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

本发明公开了一种面向智能车机内嵌web页面的语音交互方法,包括如下步骤:为各内嵌web页面配置响应消息指令;获取用户语音指令;根据获取的用户语音指令和配置的内嵌web页面的响应消息指令确定关键指令消息;将关键消息指令输出至相应的内嵌web页面端。根据本发明的另一个方面,还提供了一种面向智能车机内嵌web页面的语音交互装置和系统,根据本发明公开的方法、装置及系统可以实现对在行驶过程中涉及WEB页面的操作通过语音交互实现,提高了用户的体验感和驾驶的安全性,并且具有一定的可拓展性。

The invention discloses a voice interaction method for embedded web pages in an intelligent vehicle, comprising the following steps: configuring response message instructions for each embedded web page; acquiring user voice instructions; The response message instruction of the web page determines the key instruction message; the key message instruction is output to the corresponding embedded web page. According to another aspect of the present invention, there is also provided a voice interaction device and system for embedded web pages in a smart car. According to the method, device and system disclosed in the present invention, operations involving web pages during driving can be realized. Realized by voice interaction, the user experience and driving safety are improved, and it has certain scalability.

Description

面向智能车机内嵌web页面的语音交互方法、装置及系统Voice interaction method, device and system for embedded web page in smart car

技术领域technical field

本发明涉及车载系统技术领域,特别是一种面向智能车机内嵌web页面的语音交互方法、装置及系统。The invention relates to the technical field of in-vehicle systems, in particular to a voice interaction method, device and system for embedded web pages in intelligent vehicles.

背景技术Background technique

目前随着我国经济的高速发展,汽车的销量不断攀升,家庭收入的不断提高,越来越多的人可以购买私家车,因此开车的人会越来越多,汽车的销量以及汽车附属产品的销量都越来越火爆。At present, with the rapid development of my country's economy, the sales of automobiles continue to rise, and the income of households continues to increase. More and more people can buy private cars, so more and more people will drive. Sales are getting more and more popular.

现有车载智能系统,将车内各个设备集中起来并形成一个系统,方便各个设备的控制,现有的车载智能系统虽然已经有一些可以进行语音控制,但是对于一些例如控制车载系统弹出的WEB端的相关操作,仍然需要手动操作,不方便开车时的司机进行操控。The existing in-vehicle intelligent system integrates various devices in the car to form a system, which is convenient for the control of each device. Although there are some existing in-vehicle intelligent systems that can be controlled by voice, for some such as control the WEB terminal that pops up from the in-vehicle system. Relevant operations still require manual operation, which is inconvenient for the driver to control while driving.

发明内容SUMMARY OF THE INVENTION

为了解决上述问题,发明人构思通过车机客户端监听用户发出的语音信息,再将该语音信息通过定义的通信协议发送到web页面端,web页面端就会根据车机客户端发送的消息做对应的页面元素操作,并通知到车机客户端当前页面的一些关键内容消息。这样只要在相应的车载系统页面上添加对指令的响应,即可完成操作,可以大大提高车机客户端在内容以及操作上的拓展性,提高了人机交互的体验。In order to solve the above problem, the inventor conceives of monitoring the voice information sent by the user through the vehicle client, and then sending the voice information to the web page through the defined communication protocol, and the web page will make a message according to the message sent by the vehicle client. The corresponding page elements are operated, and some key content messages of the current page are notified to the vehicle client. In this way, as long as the response to the command is added on the corresponding in-vehicle system page, the operation can be completed, which can greatly improve the scalability of the in-vehicle client in terms of content and operation, and improve the experience of human-computer interaction.

根据本发明的第一方面,提供了一种面向智能车机内嵌web页面的语音交互方法,包括如下步骤:According to a first aspect of the present invention, there is provided a voice interaction method for embedded web pages in a smart car machine, comprising the following steps:

为各内嵌web页面配置响应消息指令;Configure response message instructions for each embedded web page;

获取用户语音指令;Obtain user voice commands;

根据获取的用户语音指令和配置的内嵌web页面的响应消息指令确定关键指令消息;Determine the key instruction message according to the acquired user voice instruction and the configured response message instruction of the embedded web page;

将关键消息指令输出至相应的内嵌web页面端。Output key message instructions to the corresponding embedded web page.

根据本发明的第二方面,提供了面向智能车机内嵌web页面的语音交互装置,包括:According to a second aspect of the present invention, there is provided a voice interaction device for embedded web pages in a smart car machine, including:

指令配置模块,用于为各内嵌web页面配置响应消息指令;The instruction configuration module is used to configure the response message instruction for each embedded web page;

语音获取模块,用于获取用户语音指令;A voice acquisition module, used to acquire user voice commands;

关键消息确定模块,用于根据获取的用户语音指令和配置的内嵌web页面的响应消息指令确定关键指令消息;A key message determination module, configured to determine a key instruction message according to the acquired user voice instruction and the configured response message instruction of the embedded web page;

页面通信模块,用于将关键消息指令输出至相应的内嵌web页面端。The page communication module is used to output key message instructions to the corresponding embedded web page.

根据本发明的第三方面,提供了一种智能车机内嵌web页面系统,内嵌web页面中配置有关键消息响应映射信息,内嵌web页面根据接收到的关键信息指令进行相应的页面元素操作,其中,关键消息指令来自上述的面向智能车机内嵌web页面的语音交互装置。According to a third aspect of the present invention, there is provided an embedded web page system for an intelligent vehicle machine. The embedded web page is configured with key message response mapping information, and the embedded web page executes corresponding page elements according to the received key information instructions. Operation, wherein, the key message instruction comes from the above-mentioned voice interaction device oriented to the embedded web page of the smart car machine.

根据本发明的第四个方面,提供了一种电子设备,其包括:至少一个处理器,以及与至少一个处理器通信连接的存储器,其中,存储器存储有可被至少一个处理器执行的指令,指令被至少一个处理器执行,以使至少一个处理器能够执行上述方法的步骤。According to a fourth aspect of the present invention, there is provided an electronic device comprising: at least one processor, and a memory communicatively connected to the at least one processor, wherein the memory stores instructions executable by the at least one processor, The instructions are executed by at least one processor to enable the at least one processor to perform the steps of the above-described method.

根据本发明的第五个方面,提供了一种存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述方法的步骤。According to a fifth aspect of the present invention, there is provided a storage medium on which a computer program is stored, which implements the steps of the above method when the program is executed by a processor.

根据本发明提供了方法、装置及系统,可以实现通过语音控制车载系统中的web页面的显示、关闭和页面内部相关元素的操作。具有极大的拓展性,大大的提高了用户与车载系统的交互体验。According to the present invention, a method, device and system are provided, which can control the display and closing of the web page in the in-vehicle system and the operation of related elements inside the page through voice control. It has great expansibility and greatly improves the interaction experience between the user and the in-vehicle system.

附图说明Description of drawings

图1为本发明一实施方式的面向智能车机内嵌web页面的语音交互方法流程图;FIG. 1 is a flow chart of a voice interaction method for embedded web pages in a smart car according to an embodiment of the present invention;

图2为本发明一实施方式的面向智能车机内嵌web页面的语音交互装置原理框图;FIG. 2 is a schematic block diagram of a voice interaction device oriented to a web page embedded in a smart car according to an embodiment of the present invention;

图3为本发明一实施方式的智能车机内嵌web页面系统框图;FIG. 3 is a block diagram of an embedded web page system in a smart car machine according to an embodiment of the present invention;

图4为本发明一种实施方式的电子设备的框图。FIG. 4 is a block diagram of an electronic device according to an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict.

本发明可以在由计算机执行的计算机可执行指令的一般上下文中描述,例如程序模块。一般地,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、对象、元件、数据结构等等。也可以在分布式计算环境中实践本发明,在这些分布式计算环境中,由通过通信网络而被连接的远程处理设备来执行任务。在分布式计算环境中,程序模块可以位于包括存储设备在内的本地和远程计算机存储介质中。The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, elements, data structures, etc. that perform particular tasks or implement particular abstract data types. The invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including storage devices.

在本发明中,“模块”、“装置”、“系统”等指应用于计算机的相关实体,如硬件、硬件和软件的组合、软件或执行中的软件等。详细地说,例如,元件可以、但不限于是运行于处理器的过程、处理器、对象、可执行元件、执行线程、程序和/或计算机。还有,运行于服务器上的应用程序或脚本程序、服务器都可以是元件。一个或多个元件可在执行的过程和/或线程中,并且元件可以在一台计算机上本地化和/或分布在两台或多台计算机之间,并可以由各种计算机可读介质运行。元件还可以根据具有一个或多个数据包的信号,例如,来自一个与本地系统、分布式系统中另一元件交互的,和/或在因特网的网络通过信号与其它系统交互的数据的信号通过本地和/或远程过程来进行通信。In the present invention, "module", "device", "system", etc. refer to relevant entities applied to a computer, such as hardware, a combination of hardware and software, software or software in execution, and the like. In detail, for example, an element may be, but is not limited to, a process running on a processor, a processor, an object, an executable element, a thread of execution, a program, and/or a computer. Also, an application program or script program running on the server, and the server can be a component. One or more elements may be in a process and/or thread of execution and an element may be localized on one computer and/or distributed between two or more computers and may be executed from various computer readable media . Elements may also pass through a signal having one or more data packets, for example, a signal from one interacting with another element in a local system, in a distributed system, and/or with data interacting with other systems through a network of the Internet local and/or remote processes to communicate.

最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”,不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。Finally, it should also be noted that in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply these entities or that there is any such actual relationship or sequence between operations. Furthermore, the terms "comprising" and "comprising" include not only those elements, but also other elements not expressly listed, or elements inherent to such a process, method, article or apparatus. Without further limitation, an element defined by the phrase "comprises" does not preclude the presence of additional identical elements in a process, method, article, or device that includes the element.

下面结合附图对本发明作进一步详细的说明。The present invention will be described in further detail below in conjunction with the accompanying drawings.

图1示意性地显示了根据本发明的一种实施方式的面向智能车机内嵌web页面的语音交互方法流程图,如图1所示,本实施例中包括如下步骤:Fig. 1 schematically shows a flow chart of a method for voice interaction oriented to a web page embedded in a smart car machine according to an embodiment of the present invention. As shown in Fig. 1, this embodiment includes the following steps:

步骤S101:为各内嵌web页面配置响应消息指令。在车机端为不同的web页面操作功能指令配置映射关系表,使得不同的响应消息指令可以调用其对应功能的web页面功能接口。其中,响应消息指令包括确定、取消、关闭、上一页、下一页、放大图片、缩小图片等指令,可以根据需求设置,例如根据web页面内容元素和页面交互需求来设置需作出响应的消息指令。示例性地,将“确定”响应消息指令与调用web页面“确定”按钮对应的功能接口形成对应关系;将“上一页”响应消息指令与调用web页面“下一页”翻页功能的接口形成对应关系。将配置的类似对应关系形成如下的映射关系表存储:Step S101: Configure a response message instruction for each embedded web page. The mapping relationship table is configured for different web page operation function instructions on the vehicle end, so that different response message instructions can call the web page function interface of its corresponding function. Among them, the response message instructions include instructions such as confirm, cancel, close, previous page, next page, enlarge picture, reduce picture, etc., which can be set according to requirements, for example, according to web page content elements and page interaction requirements to set the message to be responded to instruction. Exemplarily, a corresponding relationship is formed between the "OK" response message instruction and the function interface corresponding to invoking the "OK" button on the web page; the "previous page" response message instruction and the interface for invoking the "next page" page-turning function of the web page form a corresponding relationship. The similar corresponding relationship of the configuration is formed into the following mapping table to store:

响应消息指令response message command Web页面的功能接口Web page functional interface 确认confirm 确认功能接口Confirm function interface 下一页next page 向下翻页功能接口Page down function interface 放大图片Zoom in picture 图片放大功能接口Image zoom function interface

步骤S102:获取用户语音指令。获取的方式可以通过车机上自带的具有拾音功能的音频采集装置进行实时监听。示例性地,通过车载系统的语音助手进行获取。Step S102: Obtain a user voice instruction. The acquisition method can be monitored in real time through an audio acquisition device with a pickup function that is built into the vehicle. Exemplarily, the acquisition is made through the voice assistant of the in-vehicle system.

步骤S103:根据获取的用户语音指令和配置的内嵌web页面的响应消息指令确定关键指令消息。为了提供交互的方式精准性,对用户输出的语音指令进行关键词提取,根据关键词匹配对应的操作指令。Step S103: Determine the key instruction message according to the acquired user voice instruction and the configured response message instruction of the embedded web page. In order to provide the accuracy of the interaction method, keywords are extracted from the voice instructions output by the user, and corresponding operation instructions are matched according to the keywords.

首先获取当前显示的内嵌web页面,通过现有技术就可以实现,例如通过Launcher桌面应用提取当前显示的web页面。再将获取的用户语音指令与当前显示的内嵌web页面对应的响应消息指令进行匹配,可以通过现有技术的语义解析将获取的用户语音指令转换为文本信息,将该文本信息与步骤S101配置的映射关系表进行匹配,根据匹配结果确定关键消息指令(即将用户语音指令发出的与响应消息指令匹配上的消息指令视作关键消息指令)以及该关键消息指令对应的web功能调用接口。First, the currently displayed embedded web page is acquired, which can be achieved by using the existing technology, for example, the currently displayed web page is extracted through the Launcher desktop application. Then match the obtained user voice command with the response message command corresponding to the currently displayed embedded web page, and convert the obtained user voice command into text information through semantic analysis of the prior art, and configure the text information with step S101. The mapping relationship table is matched, and the key message command is determined according to the matching result (that is, the message command issued by the user's voice command that matches the response message command is regarded as the key message command) and the web function call interface corresponding to the key message command.

步骤S104:将关键消息指令输出至相应的内嵌web页面端。输出方式可以通过web页面端与车机端建立的自定义协议实现,示例性地,通过Android系统调用js代码(即Javascript脚本语言),即通过调用WebView的loadUrl(),实现将关键消息指令输出至响应的内嵌web页面端。内嵌web页面端根据关键消息指令和对应的web功能调用接口进行相应的页面元素操作。Step S104: Output the key message instruction to the corresponding embedded web page. The output method can be realized through a custom protocol established between the web page and the vehicle end. Exemplarily, the js code (that is, the Javascript scripting language) is called through the Android system, that is, by calling the loadUrl() of the WebView, the key message instructions are output. to the inline web page side of the response. The embedded web page side performs corresponding page element operations according to key message instructions and corresponding web function invocation interfaces.

优选地,js代码还可以通过调用Android代码,即通过WebViewClient的shouldOverrideUrlLoading()方法回调拦截url地址,从而实现web页面端与车机端的交互,将执行关键消息指令后对应的改变页面元素的执行结果和页面状态反馈给车机客户端,从而方便车机客户端获取当前内嵌页面端的信息和与之对应的响应消息指令,以开启新一轮的监听和响应操作。Preferably, the js code can also intercept the url address by calling the Android code, that is, through the shouldOverrideUrlLoading() method of WebViewClient, so as to realize the interaction between the web page end and the vehicle end, and execute the execution result of the page element corresponding to the change of the page element after executing the key message instruction. And the page status is fed back to the vehicle client, so that the vehicle client can obtain the information of the current embedded page and the corresponding response message command, so as to start a new round of monitoring and response operations.

示例性地,车机导航到目的地后,车机系统桌面弹出一个webview加载的导航轨迹的回放web页面,并播报了相关的行驶数据(例如总里程,车速等),用户可以通过语音指令操作web页面的内容,如用户说出“放大地图”,监听到这一语音指令,通过语音识别出“放大”文本信息,根据该文本信息和映射关系表获取关键消息指令“放大图片”以及web页面的放大功能接口,并通过自定义协议将其传输至web页面端,web页面端就可以进行相应的将当前页面显示的图片进行放大的操作。Exemplarily, after the car-machine navigates to the destination, a web-view-loaded navigation track playback web page pops up on the car-machine system desktop, and broadcasts relevant driving data (such as total mileage, vehicle speed, etc.), and the user can operate through voice commands. The content of the web page, such as the user saying "Zoom in on the map", listens to this voice command, recognizes the "Zoom in" text information through the voice, and obtains the key message command "Zoom in the picture" and the web page according to the text information and the mapping table. , and transmit it to the web page side through a custom protocol, and the web page side can perform the corresponding operation of magnifying the picture displayed on the current page.

根据本实施例的方法可以实现通过用户发出的语音控制web页面内的元素操作,并且不需要额外增加接口,对于新的页面需求,只要在相应的页面上添加对指令的响应即可完成操作,具有极高的拓展性,提高了用户的体验感。According to the method of this embodiment, the operation of elements in the web page can be controlled by the voice issued by the user, and no additional interface is required. For new page requirements, the operation can be completed only by adding a response to the command on the corresponding page. It has extremely high scalability and improves the user experience.

在其他实施例中,还可以进一步在车机客户端配置映射关系表的执行协议,例如根据当前内嵌web页面的状态来确定是否将匹配到的响应指令消息作为关键指令消息传递给web页面进行执行操作,如可以进一步配置为当web界面处于激活状态时,用户发出的关闭或确定的语音指令,将会由车机客户端直接操作web页面关闭或确定按钮,而不会传递给web页面进行相应页面操作处理。In other embodiments, the execution protocol of the mapping relationship table may be further configured on the vehicle client, for example, according to the state of the current embedded web page to determine whether to transmit the matched response instruction message to the web page as a key instruction message for execution To perform operations, for example, it can be further configured that when the web interface is activated, the user's voice command to close or confirm will be directly operated by the vehicle client to close or confirm the web page, and will not be transmitted to the web page for processing. Corresponding page operation processing.

图2示意性地显示了根据本发明一实施方式的面向智能车机内嵌web页面的语音交互装置原理框图,如图2所示,Fig. 2 schematically shows a principle block diagram of a voice interaction device oriented to a web page embedded in a smart car machine according to an embodiment of the present invention, as shown in Fig. 2 ,

该面向智能车机内嵌web页面的语音交互装置包括指令配置模块201、语音获取模块202、关键消息确定模块203和页面通信模块204。The voice interaction device for the embedded web page of the intelligent vehicle machine includes an instruction configuration module 201 , a voice acquisition module 202 , a key message determination module 203 and a page communication module 204 .

指令配置模块201用于为各内嵌web页面配置响应消息指令,响应消息指令包括快捷唤醒指令,具体包括有确定、取消、关闭、上一页、下一页、放大图片、缩小图片等指令。在模块中配置为一关系映射表,具体实现方式可以参照上述的方法部分,并且该表的内容可以根据用户需求进行添加,例如除了进行上述的简单的响应指令的映射关系配置外,在其他实施例中,还可以进一步在车机客户端配置映射关系表的执行协议,例如根据当前内嵌web页面的状态来确定是否将匹配到的响应指令消息作为关键指令消息传递给web页面进行执行操作,如可以进一步配置为当web界面处于激活状态时,用户发出的关闭或确定的语音指令,将会由车机客户端直接操作web页面关闭或确定按钮,而不会传递给web页面进行相应页面操作处理。The instruction configuration module 201 is used to configure response message instructions for each embedded web page, the response message instructions include quick wake-up instructions, and specifically include instructions such as confirm, cancel, close, previous page, next page, zoom in and zoom out. It is configured as a relationship mapping table in the module, the specific implementation can refer to the above method section, and the content of the table can be added according to user requirements. In an example, the execution protocol of the mapping relationship table can be further configured on the vehicle client, for example, according to the state of the current embedded web page, to determine whether to pass the matched response command message as a key command message to the web page for execution, For example, it can be further configured that when the web interface is in the active state, the user's voice command to close or confirm will be directly operated by the vehicle client to close or confirm the button of the web page, and will not be transmitted to the web page for corresponding page operations. deal with.

语音获取模块202用于获取用户语音指令,实现为具有拾音功能的音频采集装置,可以利用车载系统上已有的设备实现。The voice acquisition module 202 is used to acquire the user's voice command, and is implemented as an audio acquisition device with a pickup function, which can be implemented by using existing equipment on the vehicle-mounted system.

关键消息确定模块203用于根据获取的用户语音指令和配置的内嵌web页面的响应消息指令确定关键指令消息,具体实现方式可以参照上述的方法部分。The key message determination module 203 is configured to determine the key instruction message according to the acquired user voice instruction and the configured response message instruction of the embedded web page, and the specific implementation can refer to the above method section.

页面通信模块204用于将关键消息指令输出至相应的内嵌web页面端,其存储有自定义的车载系统与web页面端的通信协议,具体的协议内容可以参照上述的方法部分。在其他实现例中,页面通信模块204还配置为可以实现内嵌web页面端向车机客户端即车载系统进行消息反馈例如通过上述的拦截的方式,使得车机客户端了解执行操作后的页面状态和页面元素内容,以方便进行后续响应处理。The page communication module 204 is used to output the key message instruction to the corresponding embedded web page, which stores the customized communication protocol between the vehicle system and the web page. In other implementation examples, the page communication module 204 is further configured to enable the embedded web page to provide message feedback to the vehicle client, that is, the vehicle system, for example, through the above-mentioned interception method, so that the vehicle client understands the page after the operation is performed. Status and page element content to facilitate subsequent response processing.

根据本实施例提供的装置可以实现使用简单方便,不需要额外增加硬件配置,具有极强的扩展性,通过用户发出的语音控制web页面内的元素操作,使用户具有极高的体验感。The device provided according to this embodiment can be simple and convenient to use, does not require additional hardware configuration, has strong expansibility, and controls the operation of elements in the web page through the voice sent by the user, so that the user has a very high sense of experience.

图3示意性地显示了根据本发明一实施方式的智能车机内嵌web页面系统框图,如图3所示,Fig. 3 schematically shows a block diagram of a web page system embedded in a smart car machine according to an embodiment of the present invention, as shown in Fig. 3,

内嵌web页面3中配置有关键消息响应映射信息,该映射信息可以参照上述的方法部分的映射表的实现方式。内嵌web页面3根据接收到的关键信息指令进行相应的页面元素操作,其中,关键消息指令来自上述的图3部分的面向智能车机内嵌web页面的语音交互装置2。The embedded web page 3 is configured with key message response mapping information, and the mapping information may refer to the implementation manner of the mapping table in the above method part. The embedded web page 3 performs corresponding page element operations according to the received key information instruction, wherein the key message instruction comes from the above-mentioned voice interaction device 2 for the embedded web page of the smart vehicle in the part of FIG. 3 .

根据本实施例的系统可以简单方便的实现车机系统与内嵌web页面的语音交互,具有极高的拓展性,实现方式简单方便。The system according to the present embodiment can simply and conveniently realize the voice interaction between the vehicle-machine system and the embedded web page, has extremely high expansibility, and the realization method is simple and convenient.

在一些实施例中,本发明实施例提供一种非易失性计算机可读存储介质,存储介质中存储有一个或多个包括执行指令的程序,执行指令能够被电子设备(包括但不限于计算机,服务器,或者网络设备等)读取并执行,以用于执行本发明上述任一项面向智能车机内嵌web页面的语音交互方法。In some embodiments, embodiments of the present invention provide a non-volatile computer-readable storage medium, where one or more programs including execution instructions are stored in the storage medium, and the execution instructions can be executed by an electronic device (including but not limited to a computer , a server, or a network device, etc.) to read and execute it, so as to execute any of the above-mentioned voice interaction methods oriented to the embedded web page of a smart vehicle of the present invention.

在一些实施例中,本发明实施例还提供一种计算机程序产品,计算机程序产品包括存储在非易失性计算机可读存储介质上的计算机程序,计算机程序包括程序指令,当程序指令被计算机执行时,使计算机执行上述任一项面向智能车机内嵌web页面的语音交互方法。In some embodiments, embodiments of the present invention further provide a computer program product, the computer program product includes a computer program stored on a non-volatile computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by a computer At the time, the computer is made to execute any one of the above-mentioned voice interaction methods for the embedded web page of the smart car machine.

在一些实施例中,本发明实施例还提供一种电子设备,其包括:至少一个处理器,以及与至少一个处理器通信连接的存储器,其中,存储器存储有可被至少一个处理器执行的指令,指令被所述至少一个处理器执行,以使至少一个处理器能够执行面向智能车机内嵌web页面的语音交互方法。In some embodiments, embodiments of the present invention further provide an electronic device, which includes: at least one processor, and a memory communicatively connected to the at least one processor, wherein the memory stores instructions executable by the at least one processor , the instructions are executed by the at least one processor, so that the at least one processor can execute the voice interaction method for the embedded web page of the intelligent vehicle machine.

在一些实施例中,本发明实施例还提供一种存储介质,其上存储有计算机程序,其特征在于,该程序被处理器执行时面向智能车机内嵌web页面的语音交互方法。In some embodiments, embodiments of the present invention further provide a storage medium on which a computer program is stored, characterized in that, when the program is executed by a processor, a voice interaction method for a web page embedded in a smart car machine is oriented.

上述本发明实施例的面向智能车机内嵌web页面的语音交互装置可用于执行本发明实施例的面向智能车机内嵌web页面的语音交互方法,并相应的达到上述本发明实施例的实现面向智能车机内嵌web页面的语音交互方法所达到的技术效果,这里不再赘述。本发明实施例中可以通过硬件处理器(hardware processor)来实现相关功能模块。The above-mentioned voice interaction device oriented to a web page embedded in a smart car machine according to the embodiment of the present invention can be used to execute the voice interaction method oriented to a web page embedded in a smart car machine according to the embodiment of the present invention, and correspondingly achieve the implementation of the above embodiment of the present invention The technical effect achieved by the voice interaction method for the embedded web page of the smart car machine will not be repeated here. In the embodiment of the present invention, the relevant functional modules may be implemented by a hardware processor (hardware processor).

图4为本发明一实施例提供的执行面向智能车机内嵌web页面的语音交互方法的电子设备的硬件结构示意图,如图4所示,该设备包括:FIG. 4 is a schematic diagram of the hardware structure of an electronic device for executing a voice interaction method oriented to a web page embedded in a smart car according to an embodiment of the present invention. As shown in FIG. 4 , the device includes:

一个或多个处理器310以及存储器320,图4中以一个处理器310为例。One or more processors 310 and a memory 320, one processor 310 is taken as an example in FIG. 4 .

执行面向智能车机内嵌web页面的语音交互方法的设备还可以包括:输入装置330和输出装置340。The device for executing the voice interaction method for the embedded web page of the smart vehicle may further include: an input device 330 and an output device 340 .

处理器310、存储器320、输入装置330和输出装置340可以通过总线或者其他方式连接,图4中以通过总线连接为例。The processor 310, the memory 320, the input device 330, and the output device 340 may be connected through a bus or in other ways, and the connection through a bus is taken as an example in FIG. 4 .

存储器320作为一种非易失性计算机可读存储介质,可用于存储非易失性软件程序、非易失性计算机可执行程序以及模块,如本申请实施例中的面向智能车机内嵌web页面的语音交互方法对应的程序指令/模块。处理器310通过运行存储在存储器320中的非易失性软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例面向智能车机内嵌web页面的语音交互方法。The memory 320, as a non-volatile computer-readable storage medium, can be used to store non-volatile software programs, non-volatile computer-executable programs, and modules, such as the smart vehicle-oriented embedded web in the embodiments of the present application. The program instruction/module corresponding to the voice interaction method of the page. The processor 310 executes various functional applications and data processing of the server by running the non-volatile software programs, instructions and modules stored in the memory 320, that is, to realize the voice of the above method embodiments for the embedded web page of the smart car. interactive method.

存储器320可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据面向智能车机内嵌web页面的语音交互装置的使用所创建的数据等。此外,存储器320可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实施例中,存储器320可选包括相对于处理器310远程设置的存储器,这些远程存储器可以通过网络连接至面向智能车机内嵌web页面的语音交互装置。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 320 may include a stored program area and a stored data area, wherein the stored program area may store an operating system and an application program required by at least one function; Use the created data, etc. Additionally, memory 320 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid-state storage device. In some embodiments, the memory 320 may optionally include memory located remotely relative to the processor 310, and the remote memory may be connected to a voice interaction device oriented to the embedded web page of the intelligent vehicle through a network. Examples of such networks include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.

输入装置330可接收输入的数字或字符信息,以及产生与面向智能车机内嵌web页面的语音交互装置的用户设置以及功能控制有关的信号。输出装置340可包括显示屏等显示设备。The input device 330 can receive the input numerical or character information, and generate signals related to user setting and function control of the voice interaction device for the embedded web page of the smart car machine. The output device 340 may include a display device such as a display screen.

上述一个或者多个模块存储在所述存储器320中,当被所述一个或者多个处理器310执行时,执行上述任意方法实施例中的面向智能车机内嵌web页面的语音交互方法。The one or more modules described above are stored in the memory 320, and when executed by the one or more processors 310, execute the voice interaction method for a web page embedded in a smart car machine in any of the above method embodiments.

上述产品可执行本申请实施例所提供的方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本申请实施例所提供的方法。The above product can execute the method provided by the embodiments of the present application, and has functional modules and beneficial effects corresponding to the execution method. For technical details not described in detail in this embodiment, reference may be made to the methods provided in the embodiments of this application.

本申请实施例的电子设备以多种形式存在,包括但不限于:The electronic devices of the embodiments of the present application exist in various forms, including but not limited to:

(1)移动通信设备:这类设备的特点是具备移动通信功能,并且以提供话音、数据通信为主要目标。这类终端包括:智能手机(例如iPhone)、多媒体手机、功能性手机,以及低端手机等。(1) Mobile communication equipment: This type of equipment is characterized by having mobile communication functions, and its main goal is to provide voice and data communication. Such terminals include: smart phones (eg iPhone), multimedia phones, feature phones, and low-end phones.

(2)超移动个人计算机设备:这类设备属于个人计算机的范畴,有计算和处理功能,一般也具备移动上网特性。这类终端包括:PDA、MID和UMPC设备等,例如iPad。(2) Ultra-mobile personal computer equipment: This type of equipment belongs to the category of personal computers, has computing and processing functions, and generally has the characteristics of mobile Internet access. Such terminals include: PDAs, MIDs, and UMPC devices, such as iPads.

(3)车载设备:这类设备可以应用于车载领域。该类设备包括:车载导航。(3) Vehicle equipment: This type of equipment can be used in the vehicle field. Such equipment includes: car navigation.

(4)服务器:提供计算服务的设备,服务器的构成包括处理器、硬盘、内存、系统总线等,服务器和通用的计算机架构类似,但是由于需要提供高可靠的服务,因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。(4) Server: A device that provides computing services. The composition of the server includes a processor, a hard disk, a memory, a system bus, etc. The server is similar to a general computer architecture, but due to the need to provide highly reliable services, the processing power, stability , reliability, security, scalability, manageability and other aspects of high requirements.

(5)其他具有数据交互功能的电子装置。(5) Other electronic devices with data interaction function.

以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The device embodiments described above are only illustrative, wherein the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment.

通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到各实施方式可借助软件加通用硬件平台的方式来实现,当然也可以通过硬件。基于这样的理解,上述技术方案本质上或者说对相关技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that each embodiment can be implemented by means of software plus a general hardware platform, and certainly can also be implemented by hardware. Based on this understanding, the above-mentioned technical solutions can be embodied in the form of software products in essence, or the parts that make contributions to related technologies, and the computer software products can be stored in computer-readable storage media, such as ROM/RAM, magnetic disks , optical disc, etc., including several instructions to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to perform the methods described in various embodiments or some parts of the embodiments.

最后应说明的是:以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application, but not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be The technical solutions described in the foregoing embodiments are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions in the embodiments of the present application.

Claims (10)

1. embedding the voice interactive method of web page towards intelligent vehicle device, which comprises the steps of:
It is instructed for each embedded web page configuration response message;
Obtain user speech instruction;
The response message for the embedded web page for being instructed and being configured according to the user speech of acquisition, which instructs, determines key instruction message;
By critical message instruction output to corresponding embedded web page end.
2. the method according to claim 1, wherein described according in the instruction of the user speech of acquisition and configuration The response message of embedding web page, which instructs, determines that key instruction message includes
Obtain the embedded web page currently shown;
Response message instruction corresponding with the embedded web page currently shown is instructed to match the user speech, according to Matching result determines that critical message instructs.
3. method according to claim 1 or 2, which is characterized in that further include:
The embedded web page end carries out corresponding page elements operation according to critical message instruction.
4. according to the method described in claim 3, it is characterized in that, the response message instruction include determine, cancel, close, At least one of page up, lower one page, amplification picture, diminution picture or two or more any combination.
5. embedding the voice interaction device of web page towards intelligent vehicle device, which is characterized in that including
Configuration module is instructed, for instructing for each embedded web page configuration response message;
Voice obtains module, for obtaining user speech instruction;
Critical message determining module, the response message of the embedded web page for being instructed and being configured according to the user speech of acquisition It instructs and determines key instruction message;
Page communication module, for critical message instruction output to be embedded web page end to corresponding.
6. voice interaction device according to claim 5, which is characterized in that the response message instruction includes quick wakes up Instruction.
7. device according to claim 5 or 6, which is characterized in that the response message instruction includes determining, cancelling, closing It closes, page up, lower one page, amplification picture, reduce at least one of picture or two or more any combination.
8. intelligent vehicle device embeds Web page plane system, which is characterized in that responded in the embedded web page configured with critical message Map information, the embedded web page carry out corresponding page elements operation according to the key message instruction received,
Wherein, the critical message instruction is described in any item towards the embedded web page of intelligent vehicle device from claim 5 to 7 Voice interaction device.
9. electronic equipment comprising: at least one processor, and the storage being connect at least one described processor communication Device, wherein the memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one A processor executes, so that at least one described processor is able to carry out the step of any one of claim 1-4 the method Suddenly.
10. storage medium is stored thereon with computer program, which is characterized in that the program realizes right when being executed by processor It is required that the step of any one of 1-4 the method.
CN201910745196.XA 2019-08-13 2019-08-13 Voice interaction method, device and system for embedded web page of intelligent vehicle machine Active CN110444209B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910745196.XA CN110444209B (en) 2019-08-13 2019-08-13 Voice interaction method, device and system for embedded web page of intelligent vehicle machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910745196.XA CN110444209B (en) 2019-08-13 2019-08-13 Voice interaction method, device and system for embedded web page of intelligent vehicle machine

Publications (2)

Publication Number Publication Date
CN110444209A true CN110444209A (en) 2019-11-12
CN110444209B CN110444209B (en) 2022-04-12

Family

ID=68435015

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910745196.XA Active CN110444209B (en) 2019-08-13 2019-08-13 Voice interaction method, device and system for embedded web page of intelligent vehicle machine

Country Status (1)

Country Link
CN (1) CN110444209B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112770157A (en) * 2020-12-17 2021-05-07 深圳创维-Rgb电子有限公司 Voice control method, device, equipment and medium for WEB front-end interface of television

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7289960B2 (en) * 2001-10-24 2007-10-30 Agiletv Corporation System and method for speech activated internet browsing using open vocabulary enhancement
TW201403457A (en) * 2012-07-04 2014-01-16 Tpv Display Technology Xiamen Smart television having voice control function and voice control method thereof
CN107590174A (en) * 2017-07-31 2018-01-16 北京五八信息技术有限公司 Page access method and device
CN107992587A (en) * 2017-12-08 2018-05-04 北京百度网讯科技有限公司 A kind of voice interactive method of browser, device, terminal and storage medium
CN109669754A (en) * 2018-12-25 2019-04-23 苏州思必驰信息科技有限公司 The dynamic display method of interactive voice window, voice interactive method and device with telescopic interactive window

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7289960B2 (en) * 2001-10-24 2007-10-30 Agiletv Corporation System and method for speech activated internet browsing using open vocabulary enhancement
TW201403457A (en) * 2012-07-04 2014-01-16 Tpv Display Technology Xiamen Smart television having voice control function and voice control method thereof
CN107590174A (en) * 2017-07-31 2018-01-16 北京五八信息技术有限公司 Page access method and device
CN107992587A (en) * 2017-12-08 2018-05-04 北京百度网讯科技有限公司 A kind of voice interactive method of browser, device, terminal and storage medium
CN109669754A (en) * 2018-12-25 2019-04-23 苏州思必驰信息科技有限公司 The dynamic display method of interactive voice window, voice interactive method and device with telescopic interactive window

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112770157A (en) * 2020-12-17 2021-05-07 深圳创维-Rgb电子有限公司 Voice control method, device, equipment and medium for WEB front-end interface of television

Also Published As

Publication number Publication date
CN110444209B (en) 2022-04-12

Similar Documents

Publication Publication Date Title
CN111400518B (en) Method, device, terminal, server and system for generating and editing works
US20180060296A1 (en) Personal assistant context building
CN111933134A (en) Man-machine interaction method and device, electronic equipment and storage medium
CN110459221A (en) Method and device for multi-device cooperative voice interaction
WO2019019535A1 (en) Method for sending, forwarding and transmitting data, and apparatus for sending, forwarding and transmitting data
CN109448694A (en) A kind of method and device of rapid synthesis TTS voice
WO2020020202A1 (en) Account management method and apparatus of application program, and device and medium
CN112447177B (en) Full duplex voice conversation method and system
US8706909B1 (en) Systems and methods for semantic URL handling
CN114827704A (en) Vehicle-mounted system interaction method with vehicle, storage medium and mobile terminal
CN110444209B (en) Voice interaction method, device and system for embedded web page of intelligent vehicle machine
CN108924183A (en) For handling the method and device of information
CN109298956A (en) File transfer method, device and terminal device
CN111031141B (en) Method and server for realizing customized configuration of voice skills
CN110442698B (en) Dialog content generation method and system
CN111726284A (en) WeChat sending method and device for in-vehicle smart speakers
CN110827142A (en) User credit evaluation method, system, server and storage medium
US12342030B2 (en) Virtual remote control among digital assistant devices
CN117133286A (en) Man-machine voice interaction method, device and equipment under vehicle-mounted environment and storage medium
CN111726283B (en) WeChat receiving method and device for vehicle-mounted intelligent sound box
CN115762505A (en) Voice interaction method, electronic device and storage medium
CN111158838B (en) Information processing method and device
CN112288452B (en) Advertisement previewing method and device, electronic equipment and storage medium
CN114385128A (en) Data processing method, device, apparatus, storage medium and program product
CN109361767B (en) Optimize the processing method, server, and client of client data display errors

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu.

Applicant after: Sipic Technology Co.,Ltd.

Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu.

Applicant before: AI SPEECH Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant