WO2018102980A1 - Voice interaction method, device and system - Google Patents

Voice interaction method, device and system

Info

Publication number
WO2018102980A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
data
feedback
semantic analysis
feedback data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2016/108659
Other languages
English (en)
Chinese (zh)
Inventor
郝磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gt Robot Technology Pte Ltd
Original Assignee
Gt Robot Technology Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gt Robot Technology Pte Ltd
Priority to CN201680087124.5A (published as CN109429522A)
Priority to PCT/CN2016/108659 (published as WO2018102980A1)
Publication of WO2018102980A1
Anticipated expiration
Ceased

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G06F40/35 Discourse or dialogue representation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

Definitions

  • The present application relates to speech recognition technology, for example, to a voice interaction method, apparatus and system.
  • Companion robots in the related art incorporate an artificial-intelligence voice system to provide a chat function and keep the user company.
  • The function of companion robots in the related art is limited to voice conversation with the user within fixed categories, and they cannot provide further services to the user.
  • The present application provides a voice interaction method, device and system that provide more services to users through intelligent robots.
  • An embodiment of the present application provides a voice interaction method, applied to an intelligent robot, including:
  • displaying interaction information according to the feedback data.
  • Displaying the interaction information includes:
  • playing the corresponding voice, performing the corresponding robot limb motion, and/or displaying the corresponding content on the display screen.
  • The feedback data includes:
  • An embodiment of the present application provides a voice interaction method, applied to a server, including:
  • sending the feedback data to the intelligent robot.
  • Performing voice recognition and semantic analysis on the voice data to generate the semantic analysis result includes:
  • performing semantic matching between the word-segmentation replacement text and a preset semantic library to determine the semantic analysis result.
  • Determining the feedback data according to the semantic analysis result includes:
  • determining the feedback voice data and/or the index of the related service as the feedback data.
  • An embodiment of the present application further provides a voice interaction device, configured in an intelligent robot, including:
  • a voice data sending module, configured to send acquired voice data to a server;
  • a feedback data receiving module, configured to receive feedback data sent by the server; and
  • an interactive information display module, configured to display interaction information according to the feedback data.
  • The interactive information display module is configured to:
  • play the corresponding voice, perform the corresponding robot limb motion, and/or display the corresponding content on the display screen.
  • The feedback data includes:
  • An embodiment of the present application further provides a voice interaction device, configured on a server, including:
  • a voice data receiving module, configured to receive voice data sent by an intelligent robot;
  • a semantic analysis result generating module, configured to perform voice recognition and semantic analysis on the voice data to generate a semantic analysis result;
  • a feedback data determining module, configured to determine feedback data according to the semantic analysis result; and
  • a feedback data sending module, configured to send the feedback data to the intelligent robot.
  • The semantic analysis result generating module is configured to:
  • perform semantic matching between the word-segmentation replacement text and the preset semantic library to determine the semantic analysis result.
  • The feedback data determining module is configured to:
  • determine the feedback voice data and/or the index of the related service as the feedback data.
  • An embodiment of the present application further provides a voice interaction system, including the intelligent robot of the third aspect and the server of the fourth aspect.
  • An embodiment of the present application further provides an intelligent robot, including:
  • at least one processor; and
  • a memory storing instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the voice interaction method on the intelligent robot side of any embodiment of the present application.
  • An embodiment of the present application further provides a non-transitory computer-readable storage medium storing computer instructions that cause a computer to execute the voice interaction method on the intelligent robot side of any embodiment of the present application.
  • An embodiment of the present application further provides a computer program product including a computer program stored on a non-transitory computer-readable storage medium, the computer program including program instructions that, when executed by a computer, cause the computer to execute the voice interaction method on the intelligent robot side of any embodiment of the present application.
  • An embodiment of the present application further provides a server, including:
  • at least one processor; and
  • a memory storing instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to perform the voice interaction method on the server side of any embodiment of the present application.
  • An embodiment of the present application further provides a non-transitory computer-readable storage medium storing computer instructions that cause a computer to execute the voice interaction method on the server side of any embodiment of the present application.
  • An embodiment of the present application further provides a computer program product including a computer program stored on a non-transitory computer-readable storage medium, the computer program including program instructions that, when executed by a computer, cause the computer to execute the voice interaction method on the server side of any embodiment of the present application.
  • By recognizing and analyzing the user's voice instruction and using the intelligent robot to provide the corresponding service, the embodiments of the present application solve the problem that the function of companion robots in the related art is limited to voice conversation with the user within fixed categories. This makes the user's life more convenient and improves the user experience.
  • FIG. 1 is a flowchart of a voice interaction method in Embodiment 1 of the present application.
  • FIG. 2 is a flowchart of a voice interaction method in Embodiment 2 of the present application.
  • FIG. 3 is a schematic structural diagram of a voice interaction apparatus in Embodiment 3 of the present application.
  • FIG. 4 is a schematic structural diagram of a voice interaction apparatus in Embodiment 4 of the present application.
  • FIG. 5 is a data flow diagram of a voice interaction system in Embodiment 5 of the present application.
  • FIG. 6 is a schematic diagram showing the hardware structure of an intelligent robot in Embodiment 5 of the present application.
  • FIG. 7 is a schematic structural diagram of a server in Embodiment 5 of the present application.
  • Embodiment 1: FIG. 1 is a flowchart of a voice interaction method according to Embodiment 1 of the present application. This embodiment is applicable to a situation in which an intelligent robot performs voice interaction with a user, and the method may be performed by a voice interaction device applied to an intelligent robot.
  • The device can be implemented in hardware and/or software, and the voice interaction method includes:
  • The acquired voice data is sent to the server.
  • The voice instruction issued by the user is captured through a microphone provided on the intelligent robot: the microphone converts the sound signal into an electrical signal, the electrical signal is saved as voice data, and the voice data is sent to the server, for example through a mobile communication module provided on the intelligent robot, over a mobile communication network.
  • The server converts the voice data sent by the intelligent robot into text, parses the semantics of the text, and determines the user's intention. According to that intention, it determines the service to provide to the user and/or the voice content with which to respond to the user's voice instruction. The index of the related service and/or the feedback voice data is sent as feedback data to the intelligent robot, and the intelligent robot receives the feedback data.
  • The intelligent robot displays the corresponding interaction information to the user by parsing the feedback data.
  • Displaying the interaction information includes: playing the corresponding voice, performing the corresponding robot limb motion, and/or displaying the corresponding content on the display screen.
  • For example, the feedback data includes a service index for the current weather conditions and voice data describing the current weather conditions. After receiving the feedback data, the intelligent robot displays the weather information on the display screen according to the service index in the feedback data and plays the current weather conditions through the speaker.
  • The intelligent robot receives the user's voice instruction and provides the corresponding service to the user, which solves the problem that the function of companion robots in the related art is limited to voice conversation with the user within fixed categories. This makes the user's life more convenient and improves the user experience.
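The robot-side flow of Embodiment 1 can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the `FeedbackData` fields, the `IntelligentRobot` class, the `"weather.current"` index string and the stand-in server function are all hypothetical names introduced here, and the screen and speaker are replaced by a list that records what would be shown or played.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class FeedbackData:
    # Hypothetical container: the source only says that feedback data may hold
    # feedback voice data and/or an index of a related service.
    service_index: Optional[str] = None
    voice_data: Optional[bytes] = None


class IntelligentRobot:
    def __init__(self, send_to_server: Callable[[bytes], FeedbackData]) -> None:
        # The transport is injected so the sketch runs without a network.
        self._send_to_server = send_to_server
        self.actions: List[str] = []

    def show_on_screen(self, service_index: str) -> None:
        # Stand-in for rendering the indexed service on the display screen.
        self.actions.append(f"display:{service_index}")

    def play_voice(self, voice_data: bytes) -> None:
        # Stand-in for playing the feedback voice through the speaker.
        self.actions.append(f"play:{len(voice_data)} bytes")

    def interact(self, voice_data: bytes) -> List[str]:
        # Send the acquired voice data and receive the feedback data.
        feedback = self._send_to_server(voice_data)
        # Display interaction information according to the feedback data.
        if feedback.service_index is not None:
            self.show_on_screen(feedback.service_index)
        if feedback.voice_data is not None:
            self.play_voice(feedback.voice_data)
        return self.actions


def fake_weather_server(voice_data: bytes) -> FeedbackData:
    # Assumed stand-in for the server; always answers with the weather example.
    return FeedbackData(service_index="weather.current", voice_data=b"sunny")
```

Calling `IntelligentRobot(fake_weather_server).interact(recorded_bytes)` mirrors the weather example: the service index drives the display and the voice data drives the speaker, and either part may be absent because the feedback data is defined with "and/or".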
  • Embodiment 2: FIG. 2 is a flowchart of a voice interaction method according to Embodiment 2 of the present application. This embodiment is applicable to a scenario in which an intelligent robot performs voice interaction with a user, and the method may be performed by a voice interaction device applied to a server.
  • The device may be implemented in hardware and/or software, and the voice interaction method includes:
  • S220: Voice recognition and semantic analysis are performed on the voice data to generate a semantic analysis result.
  • S220 includes: performing voice recognition on the voice data to generate text; segmenting the text into words and replacing the segmented words with synonyms to generate word-segmentation replacement text; and performing semantic matching between the word-segmentation replacement text and the preset semantic library to determine the semantic analysis result.
  • The part of speech of each segmented word is determined, and the segmented words are replaced according to their part of speech to generate the word-segmentation replacement text.
  • Sentence components such as the subject and object are extracted from the word-segmentation replacement text, and the extracted sentence components are then semantically matched against the preset semantic library.
  • Semantic data is pre-stored in the preset semantic library.
  • S230: Feedback data is determined according to the semantic analysis result.
  • S230 includes: determining feedback voice data according to the semantic analysis result and/or determining an index of the related service according to the semantic analysis result; and determining the feedback voice data and/or the index of the related service as the feedback data.
  • The feedback data is sent to the intelligent robot.
  • The server determines the user's intention from the received voice data sent by the intelligent robot and returns the corresponding feedback data to the intelligent robot, which solves the problem that the function of companion robots in the related art is limited to voice conversation with the user within fixed categories. This makes the user's life more convenient and improves the user experience.
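The server-side steps S220 and S230 can be sketched as below, under several loudly stated assumptions. Speech recognition is skipped and the function starts from already-recognized text; segmentation is plain whitespace splitting, which would not work for Chinese and stands in for a real word segmenter; the synonym table, the semantic library, the intent names and the service index strings are all invented for illustration; and the feedback voice is represented as text to be synthesized rather than audio.

```python
from typing import Dict, FrozenSet, List, Optional

# Hypothetical synonym table: maps a segmented word to its canonical synonym.
SYNONYMS: Dict[str, str] = {
    "today's": "current",
    "forecast": "weather",
    "sing": "play",
    "tune": "song",
}

# Hypothetical preset semantic library: keyword sets mapped to intents.
SEMANTIC_LIBRARY: Dict[FrozenSet[str], str] = {
    frozenset({"current", "weather"}): "query_weather",
    frozenset({"play", "song"}): "play_music",
}

# Hypothetical mapping from an intent to the index of the related service.
SERVICE_INDEX: Dict[str, str] = {
    "query_weather": "weather.current",
    "play_music": "music.play",
}


def segment(text: str) -> List[str]:
    # Whitespace segmentation after dropping simple punctuation (assumption).
    for mark in "?.,!":
        text = text.replace(mark, " ")
    return text.lower().split()


def replace_synonyms(words: List[str]) -> List[str]:
    # Produce the word-segmentation replacement text.
    return [SYNONYMS.get(word, word) for word in words]


def analyze(text: str) -> Optional[str]:
    # S220: match the replacement text against the preset semantic library,
    # preferring the entry with the most keywords when several match.
    words = set(replace_synonyms(segment(text)))
    best_intent: Optional[str] = None
    best_size = 0
    for keywords, intent in SEMANTIC_LIBRARY.items():
        if keywords <= words and len(keywords) > best_size:
            best_intent, best_size = intent, len(keywords)
    return best_intent


def determine_feedback(intent: Optional[str]) -> Dict[str, Optional[str]]:
    # S230: feedback voice content and/or the index of the related service.
    if intent is None:
        return {"service_index": None, "voice_text": "Sorry, I did not understand."}
    return {"service_index": SERVICE_INDEX[intent], "voice_text": f"Handling {intent}."}
```

Normalizing through synonyms before matching keeps the semantic library small: "today's forecast" and "current weather" reach the same entry without each phrasing being stored separately.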
  • FIG. 3 is a schematic structural diagram of a voice interaction apparatus according to Embodiment 3 of the present application.
  • The apparatus is configured in an intelligent robot, and the intelligent robot further includes a microphone, a speaker, a display screen, and a communication module.
  • The voice interaction device 30 includes:
  • the voice data sending module 310, configured to send the acquired voice data to the server;
  • the feedback data receiving module 320, configured to receive feedback data sent by the server; and
  • the interactive information display module 330, configured to display the interaction information according to the feedback data.
  • The interactive information display module 330 is configured to:
  • play the corresponding voice, perform the corresponding robot limb motion, and/or display the corresponding content on the display screen.
  • The feedback data includes:
  • The intelligent robot receives the user's voice instruction and provides the corresponding service to the user, which solves the problem that the function of companion robots in the related art is limited to voice conversation with the user within fixed categories. This makes the user's life more convenient and improves the user experience.
  • FIG. 4 is a schematic structural diagram of a voice interaction apparatus according to Embodiment 4 of the present application.
  • The apparatus is configured on a server, and the voice interaction apparatus 40 includes:
  • the voice data receiving module 410, configured to receive voice data sent by the intelligent robot;
  • the semantic analysis result generating module 420, configured to perform voice recognition and semantic analysis on the voice data to generate a semantic analysis result;
  • the feedback data determining module 430, configured to determine the feedback data according to the semantic analysis result; and
  • the feedback data sending module 440, configured to send the feedback data to the intelligent robot.
  • The semantic analysis result generating module 420 is configured to:
  • perform semantic matching between the word-segmentation replacement text and the preset semantic library to determine the semantic analysis result.
  • The feedback data determining module 430 is configured to:
  • determine the feedback voice data and/or the index of the related service as the feedback data.
  • The server determines the user's intention from the received voice data sent by the intelligent robot and returns the corresponding feedback data to the intelligent robot, which solves the problem that the function of companion robots in the related art is limited to voice conversation with the user within fixed categories. This makes the user's life more convenient and improves the user experience.
  • FIG. 5 is a data flow diagram of a voice interaction system according to Embodiment 5 of the present application.
  • The voice interaction system provided in this embodiment includes:
  • The intelligent robot 510 is the intelligent robot described in Embodiment 3.
  • The server 520 is the server described in Embodiment 4.
  • The intelligent robot acquires voice data through a microphone and sends the acquired voice data to the server.
  • The server receives the voice data sent by the intelligent robot; the server performs voice recognition and semantic analysis on the voice data to generate a semantic analysis result; the server determines the feedback data according to the semantic analysis result; and the server sends the feedback data to the intelligent robot.
  • The intelligent robot receives the feedback data sent by the server and displays the interaction information according to the feedback data.
  • By recognizing and analyzing the user's voice instruction and using the intelligent robot to provide the corresponding service, the embodiments of the present application solve the problem that the function of companion robots in the related art is limited to voice conversation with the user within fixed categories. This makes the user's life more convenient and improves the user experience.
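The source does not say how feedback data is encoded when it travels from the server 520 to the intelligent robot 510. One possible encoding, assumed here purely for illustration, is a JSON message carrying the service index as a string and the feedback voice data as base64 text. The function names and field names below are hypothetical.

```python
import base64
import json
from typing import Optional, Tuple


def encode_feedback(service_index: Optional[str], voice: Optional[bytes]) -> bytes:
    # Server side: pack the feedback data into a JSON message (assumed format).
    message = {
        "service_index": service_index,
        # Raw audio bytes are not valid JSON, so they are carried as base64 text.
        "voice": base64.b64encode(voice).decode("ascii") if voice is not None else None,
    }
    return json.dumps(message).encode("utf-8")


def decode_feedback(payload: bytes) -> Tuple[Optional[str], Optional[bytes]]:
    # Robot side: unpack the message back into a service index and voice bytes.
    message = json.loads(payload.decode("utf-8"))
    voice = message["voice"]
    decoded_voice = base64.b64decode(voice) if voice is not None else None
    return message["service_index"], decoded_voice
```

Both fields are optional in this sketch because the description defines feedback data as the feedback voice data "and/or" the index of the related service, so a message may carry only one of them.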
  • The above product can perform the method provided by any embodiment of the present application, and has the functional modules and beneficial effects corresponding to the method performed.
  • An embodiment of the present application further provides a non-transitory computer storage medium storing computer-executable instructions that can execute the voice interaction method on the intelligent robot side in any of the above embodiments.
  • FIG. 6 is a schematic diagram of a hardware structure of an intelligent robot according to an embodiment of the present application. As shown in FIG. 6, the intelligent robot includes:
  • one or more processors 610 and a memory 620; one processor 610 is taken as an example in FIG. 6.
  • The intelligent robot may further include an input device 630 and an output device 640.
  • The processor 610, the memory 620, the input device 630, and the output device 640 may be connected by a bus or by other means; a bus connection is taken as an example in FIG. 6.
  • The memory 620, as a non-transitory computer-readable storage medium, can be used to store non-transitory software programs, non-transitory computer-executable programs, and modules, such as the program instructions/modules corresponding to the voice interaction method in the embodiments of the present application (for example, the voice data sending module 310, the feedback data receiving module 320, and the interactive information display module 330 shown in FIG. 3).
  • The processor 610 performs the various functions and data processing of the intelligent robot by running the non-transitory software programs, instructions and modules stored in the memory 620, that is, it implements the voice interaction method on the intelligent robot side of the above method embodiments.
  • The memory 620 may include a program storage area and a data storage area, where the program storage area may store an operating system and an application required for at least one function, and the data storage area may store data created during use in interaction with the server, and the like.
  • The memory 620 can include high-speed random access memory, and can also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device.
  • The memory 620 can optionally include memory located remotely from the processor 610, which may be connected via a network to the server with which the intelligent robot interacts.
  • Examples of such a network include the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
  • The input device 630 can receive input numeric or character information, as well as key signal inputs related to user settings and function control of the terminal.
  • The output device 640 can include a display device such as a display screen.
  • The one or more modules are stored in the memory 620 and, when executed by the one or more processors 610, perform the voice interaction method on the intelligent robot side in any of the above method embodiments.
  • An embodiment of the present application further provides a non-transitory computer storage medium storing computer-executable instructions that can execute the server-side voice interaction method in any of the foregoing embodiments.
  • FIG. 7 is a schematic structural diagram of a server according to an embodiment of the present application. As shown in FIG. 7, the server includes:
  • one or more processors 710 and a memory 720; one processor 710 is taken as an example in FIG. 7.
  • The server may also include an input device 730 and an output device 740.
  • The processor 710, the memory 720, the input device 730, and the output device 740 may be connected by a bus or by other means; a bus connection is taken as an example in FIG. 7.
  • The memory 720, as a non-transitory computer-readable storage medium, can be used to store non-transitory software programs, non-transitory computer-executable programs, and modules, such as the program instructions/modules corresponding to the voice interaction method in the embodiments of the present application (for example, the voice data receiving module 410, the semantic analysis result generating module 420, the feedback data determining module 430, and the feedback data sending module 440 shown in FIG. 4).
  • The processor 710 performs the various functions and data processing of the server by running the non-transitory software programs, instructions and modules stored in the memory 720, that is, it implements the voice interaction method on the server side of the above method embodiments.
  • The memory 720 may include a program storage area and a data storage area, where the program storage area may store an operating system and an application required for at least one function, and the data storage area may store data created during use in interaction with the intelligent robot, and the like.
  • The memory 720 can include high-speed random access memory, and can also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device.
  • The memory 720 can optionally include memory located remotely from the processor 710, which may be connected via a network to the intelligent robot with which the server interacts.
  • Examples of such a network include the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
  • The input device 730 can receive input numeric or character information, as well as key signal inputs related to user settings and function control of the terminal.
  • The output device 740 can include a display device such as a display screen.
  • The one or more modules are stored in the memory 720 and, when executed by the one or more processors 710, perform the server-side voice interaction method in any of the above method embodiments.
  • By recognizing and analyzing the user's voice instruction, the embodiments of the present application use an intelligent robot to provide the corresponding service to the user, which makes the user's life more convenient and improves the user experience.

Landscapes

  • Engineering & Computer Science
  • Theoretical Computer Science
  • Physics & Mathematics
  • Computational Linguistics
  • Health & Medical Sciences
  • Audiology, Speech & Language Pathology
  • General Engineering & Computer Science
  • General Physics & Mathematics
  • General Health & Medical Sciences
  • Artificial Intelligence
  • Human Computer Interaction
  • Multimedia
  • Acoustics & Sound
  • Databases & Information Systems
  • Data Mining & Analysis
  • User Interface Of Digital Computer
  • Manipulator

Abstract

A voice interaction method, device and system are disclosed. The voice interaction method, applicable to an intelligent robot (510), comprises the steps of: transmitting acquired voice data to a server (110); receiving feedback data transmitted by the server (120); and presenting interaction information according to the feedback data (130).
PCT/CN2016/108659 2016-12-06 2016-12-06 Voice interaction method, device and system Ceased WO2018102980A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201680087124.5A CN109429522A (zh) 2016-12-06 2016-12-06 语音交互方法、装置及系统
PCT/CN2016/108659 WO2018102980A1 (fr) 2016-12-06 2016-12-06 Voice interaction method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/108659 WO2018102980A1 (fr) 2016-12-06 2016-12-06 Voice interaction method, device and system

Publications (1)

Publication Number Publication Date
WO2018102980A1 (fr) 2018-06-14

Family

ID=62490688

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/108659 Ceased WO2018102980A1 (fr) 2016-12-06 2016-12-06 Voice interaction method, device and system

Country Status (2)

Country Link
CN (1) CN109429522A (fr)
WO (1) WO2018102980A1 (fr)

Cited By (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109074808A (zh) * 2018-07-18 2018-12-21 深圳魔耳智能声学科技有限公司 语音控制方法、中控设备和存储介质
CN109308893A (zh) * 2018-10-25 2019-02-05 珠海格力电器股份有限公司 信息发送方法及装置、存储介质、电子装置
CN109359211A (zh) * 2018-11-13 2019-02-19 平安科技(深圳)有限公司 语音交互的数据更新方法、装置、计算机设备和存储介质
CN109408815A (zh) * 2018-10-09 2019-03-01 苏州思必驰信息科技有限公司 用于语音对话平台的词库管理方法和系统
CN109543182A (zh) * 2018-11-15 2019-03-29 广东电网有限责任公司信息中心 一种基于solr引擎的电力企业多轮交互语义分析方法
CN109859032A (zh) * 2019-01-22 2019-06-07 深圳壹账通智能科技有限公司 基于智能语音的账款催收方法、装置、设备和存储介质
CN110377908A (zh) * 2019-07-19 2019-10-25 科大讯飞股份有限公司 语义理解方法、装置、设备及可读存储介质
CN110647305A (zh) * 2019-09-29 2020-01-03 百度在线网络技术(北京)有限公司 应用程序的语音交互方法、装置、设备和介质
CN111142833A (zh) * 2019-12-26 2020-05-12 苏州思必驰信息科技有限公司 基于情景模式开发语音交互产品的方法及系统
CN111354452A (zh) * 2020-03-11 2020-06-30 北京声智科技有限公司 导诊方法、系统、电子设备及计算机可读存储介质
CN111415642A (zh) * 2020-03-31 2020-07-14 广东美的制冷设备有限公司 用电设备的语音播报方法、装置、空调器和存储介质
CN111724774A (zh) * 2019-03-22 2020-09-29 阿里巴巴集团控股有限公司 语音交互及车载语音交互方法、装置、设备及存储介质
CN111724785A (zh) * 2020-06-29 2020-09-29 百度在线网络技术(北京)有限公司 小程序的语音控制方法、设备及存储介质
CN111738013A (zh) * 2020-06-16 2020-10-02 中国建设银行股份有限公司 语音交互方法、装置、自助设备、服务器和介质
CN111914072A (zh) * 2020-07-14 2020-11-10 青岛聚好联科技有限公司 一种信息交互方法、设备及装置
CN111949240A (zh) * 2019-05-16 2020-11-17 阿里巴巴集团控股有限公司 交互方法、存储介质、服务程序和设备
CN112015567A (zh) * 2019-05-31 2020-12-01 阿里巴巴集团控股有限公司 数据处理方法、装置、设备和存储介质
CN112164402A (zh) * 2020-09-18 2021-01-01 广州小鹏汽车科技有限公司 车辆语音交互方法、装置、服务器和计算机可读存储介质
CN112418385A (zh) * 2020-12-10 2021-02-26 郑子龙 一种仿生智能体控制方法、设备、系统
CN112528000A (zh) * 2020-12-22 2021-03-19 北京百度网讯科技有限公司 虚拟机器人的生成方法、装置和电子设备
CN112908313A (zh) * 2021-03-08 2021-06-04 深圳市英特飞电子有限公司 智慧路灯语音交互方法、装置、计算机设备及存储介质
CN113033179A (zh) * 2021-03-24 2021-06-25 北京百度网讯科技有限公司 知识获取方法、装置、电子设备及可读存储介质
CN113271383A (zh) * 2021-05-12 2021-08-17 思必驰科技股份有限公司 语音查号方法和装置
CN113454732A (zh) * 2019-04-26 2021-09-28 深圳迈瑞生物医疗电子股份有限公司 多医疗设备共存时的语音交互方法、医疗系统及医疗设备
CN113593576A (zh) * 2021-08-30 2021-11-02 北京声智科技有限公司 一种语音交互设备、系统、方法、云端服务器和存储介质
CN113676601A (zh) * 2021-09-30 2021-11-19 百可录(北京)科技有限公司 一种智能语音交互系统及方法
CN113705224A (zh) * 2021-09-06 2021-11-26 广东电网有限责任公司 一种语音识别的调度业务语音交互方法及系统
CN113849083A (zh) * 2021-09-24 2021-12-28 珠海格力电器股份有限公司 一种滤网复位显示方法、系统及滤网复位显示设备
CN113889111A (zh) * 2021-11-02 2022-01-04 东莞市凌岳电子科技有限公司 一种智能语音交互系统和钟表
CN114114932A (zh) * 2020-08-31 2022-03-01 无锡小天鹅电器有限公司 家电设备的控制方法、控制装置、家电设备和存储介质
WO2022135493A1 (fr) * 2020-12-25 2022-06-30 广州橙行智动汽车科技有限公司 Voice interaction method, server, voice interaction system and storage medium
CN114697713A (zh) * 2020-12-29 2022-07-01 深圳Tcl新技术有限公司 语音助手控制方法、装置、存储介质及智能电视
CN115076988A (zh) * 2022-05-25 2022-09-20 青岛海尔空调器有限总公司 一种智能空调的监控方法、装置、设备及存储介质
CN115338858A (zh) * 2022-07-14 2022-11-15 达闼机器人股份有限公司 智能机器人控制方法、装置、服务器、机器人和存储介质
CN115376500A (zh) * 2022-08-22 2022-11-22 江苏安防科技有限公司 一种智能语音识别处理方法及系统
CN116403583A (zh) * 2023-04-12 2023-07-07 中国第一汽车股份有限公司 语音数据处理方法和装置、非易失性存储介质及车辆
CN116894078A (zh) * 2023-07-28 2023-10-17 广东保伦电子股份有限公司 一种信息交互方法、装置、电子设备及介质
US11984120B2 (en) 2020-06-29 2024-05-14 Beijing Baidu Netcom Science Technology Co., Ltd. Voice control method for applet and device, and storage medium
CN119576136A (zh) * 2024-11-25 2025-03-07 国电南瑞南京控制系统有限公司 一种基于虚拟数字人的ai智能对话系统

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110136712A (zh) * 2019-05-07 2019-08-16 甄十信息科技(上海)有限公司 实现智能语音ai功能的方法及装置
CN111477224A (zh) * 2020-03-23 2020-07-31 一汽奔腾轿车有限公司 一种人车虚拟交互系统
CN113506569B (zh) * 2021-06-07 2023-02-03 上海七十迈数字科技有限公司 一种语音交互的方法、系统及设备
CN114154636A (zh) * 2021-11-29 2022-03-08 阿里巴巴(中国)有限公司 数据处理方法、电子设备及计算机程序产品

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1501233A (zh) * 2002-11-13 2004-06-02 三星电子株式会社 使用家庭服务器的家庭机器人及其家庭网络系统
JP2009223170A (ja) * 2008-03-18 2009-10-01 Advanced Telecommunication Research Institute International 音声認識システム
JP2010149266A (ja) * 2008-12-26 2010-07-08 Takao Hayashi ロボットシステム
CN105206272A (zh) * 2015-09-06 2015-12-30 上海智臻智能网络科技股份有限公司 语音传输控制方法及系统
CN105551493A (zh) * 2015-11-30 2016-05-04 北京光年无限科技有限公司 儿童语音机器人数据处理方法、装置及儿童语音机器人
CN205508398U (zh) * 2015-09-30 2016-08-24 深圳光启合众科技有限公司 具有云端交互功能的智能机器人

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105488032A (zh) * 2015-12-31 2016-04-13 杭州智蚁科技有限公司 Control method and system for speech recognition input
CN105930452A (zh) * 2016-04-21 2016-09-07 北京紫平方信息技术股份有限公司 Intelligent response method for recognizing natural language

Cited By (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
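CN109074808A (zh) * 2018-07-18 2018-12-21 深圳魔耳智能声学科技有限公司 Voice control method, central control device and storage medium
CN109408815A (zh) * 2018-10-09 2019-03-01 苏州思必驰信息科技有限公司 Lexicon management method and system for a voice dialogue platform
CN109308893A (zh) * 2018-10-25 2019-02-05 珠海格力电器股份有限公司 Information sending method and apparatus, storage medium and electronic apparatus
CN109359211A (zh) * 2018-11-13 2019-02-19 平安科技(深圳)有限公司 Data update method and apparatus for voice interaction, computer device and storage medium
CN109359211B (zh) * 2018-11-13 2024-05-07 平安科技(深圳)有限公司 Data update method and apparatus for voice interaction, computer device and storage medium
CN109543182A (zh) * 2018-11-15 2019-03-29 广东电网有限责任公司信息中心 Multi-round interaction semantic analysis method for power enterprises based on the Solr engine
CN109859032A (zh) * 2019-01-22 2019-06-07 深圳壹账通智能科技有限公司 Intelligent-voice-based debt collection method, apparatus, device and storage medium
CN111724774A (zh) * 2019-03-22 2020-09-29 阿里巴巴集团控股有限公司 Voice interaction and in-vehicle voice interaction method, apparatus, device and storage medium
CN111724774B (zh) * 2019-03-22 2024-05-17 斑马智行网络(香港)有限公司 Voice interaction and in-vehicle voice interaction method, apparatus, device and storage medium
CN113454732B (zh) * 2019-04-26 2023-11-28 深圳迈瑞生物医疗电子股份有限公司 Voice interaction method when multiple medical devices coexist, medical system and medical device
CN113454732A (zh) * 2019-04-26 2021-09-28 深圳迈瑞生物医疗电子股份有限公司 Voice interaction method when multiple medical devices coexist, medical system and medical device
CN111949240A (zh) * 2019-05-16 2020-11-17 阿里巴巴集团控股有限公司 Interaction method, storage medium, service program and device
CN112015567A (zh) * 2019-05-31 2020-12-01 阿里巴巴集团控股有限公司 Data processing method, apparatus, device and storage medium
CN110377908A (zh) * 2019-07-19 2019-10-25 科大讯飞股份有限公司 Semantic understanding method, apparatus, device and readable storage medium
CN110647305A (zh) * 2019-09-29 2020-01-03 百度在线网络技术(北京)有限公司 Voice interaction method, apparatus, device and medium for an application program
CN110647305B (zh) * 2019-09-29 2023-10-31 阿波罗智联(北京)科技有限公司 Voice interaction method, apparatus, device and medium for an application program
CN111142833A (zh) * 2019-12-26 2020-05-12 苏州思必驰信息科技有限公司 Method and system for developing voice interaction products based on scenario modes
CN111142833B (zh) * 2019-12-26 2022-07-08 思必驰科技股份有限公司 Method and system for developing voice interaction products based on scenario modes
CN111354452A (zh) * 2020-03-11 2020-06-30 北京声智科技有限公司 Diagnosis guidance method, system, electronic device and computer-readable storage medium
CN111415642A (zh) * 2020-03-31 2020-07-14 广东美的制冷设备有限公司 Voice broadcast method and apparatus for electrical equipment, air conditioner and storage medium
CN111738013A (zh) * 2020-06-16 2020-10-02 中国建设银行股份有限公司 Voice interaction method, apparatus, self-service device, server and medium
CN111724785B (zh) * 2020-06-29 2023-07-04 百度在线网络技术(北京)有限公司 Voice control method for applet, device and storage medium
US11984120B2 (en) 2020-06-29 2024-05-14 Beijing Baidu Netcom Science Technology Co., Ltd. Voice control method for applet and device, and storage medium
CN111724785A (zh) * 2020-06-29 2020-09-29 百度在线网络技术(北京)有限公司 Voice control method for applet, device and storage medium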
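CN111914072A (zh) * 2020-07-14 2020-11-10 青岛聚好联科技有限公司 Information interaction method, device and apparatus
CN114114932A (zh) * 2020-08-31 2022-03-01 无锡小天鹅电器有限公司 Control method and control device for a household appliance, household appliance and storage medium
CN112164402A (zh) * 2020-09-18 2021-01-01 广州小鹏汽车科技有限公司 Vehicle voice interaction method, apparatus, server and computer-readable storage medium
CN112418385A (zh) * 2020-12-10 2021-02-26 郑子龙 Bionic intelligent agent control method, device and system
CN112528000A (zh) * 2020-12-22 2021-03-19 北京百度网讯科技有限公司 Method and apparatus for generating a virtual robot, and electronic device
WO2022135493A1 (fr) * 2020-12-25 2022-06-30 广州橙行智动汽车科技有限公司 Voice interaction method, server, voice interaction system and storage medium
CN114697713B (zh) * 2020-12-29 2024-02-06 深圳Tcl新技术有限公司 Voice assistant control method and apparatus, storage medium and smart television
CN114697713A (zh) * 2020-12-29 2022-07-01 深圳Tcl新技术有限公司 Voice assistant control method and apparatus, storage medium and smart television
CN112908313A (zh) * 2021-03-08 2021-06-04 深圳市英特飞电子有限公司 Smart street lamp voice interaction method, apparatus, computer device and storage medium
CN113033179B (zh) * 2021-03-24 2024-05-24 北京百度网讯科技有限公司 Knowledge acquisition method, apparatus, electronic device and readable storage medium
CN113033179A (zh) * 2021-03-24 2021-06-25 北京百度网讯科技有限公司 Knowledge acquisition method, apparatus, electronic device and readable storage medium
CN113271383A (zh) * 2021-05-12 2021-08-17 思必驰科技股份有限公司 Voice number lookup method and apparatus
CN113593576A (zh) * 2021-08-30 2021-11-02 北京声智科技有限公司 Voice interaction device, system and method, cloud server and storage medium
CN113705224A (zh) * 2021-09-06 2021-11-26 广东电网有限责任公司 Speech-recognition-based voice interaction method and system for dispatching services
CN113849083A (zh) * 2021-09-24 2021-12-28 珠海格力电器股份有限公司 Filter screen reset display method and system, and filter screen reset display device
CN113849083B (zh) * 2021-09-24 2024-03-29 珠海格力电器股份有限公司 Filter screen reset display method and system, and filter screen reset display device
CN113676601A (zh) * 2021-09-30 2021-11-19 百可录(北京)科技有限公司 Intelligent voice interaction system and method
CN113889111A (zh) * 2021-11-02 2022-01-04 东莞市凌岳电子科技有限公司 Intelligent voice interaction system and clock
CN115076988A (zh) * 2022-05-25 2022-09-20 青岛海尔空调器有限总公司 Monitoring method, apparatus, device and storage medium for an intelligent air conditioner
CN115338858A (zh) * 2022-07-14 2022-11-15 达闼机器人股份有限公司 Intelligent robot control method, apparatus, server, robot and storage medium
CN115376500A (zh) * 2022-08-22 2022-11-22 江苏安防科技有限公司 Intelligent speech recognition processing method and system
CN116403583A (zh) * 2023-04-12 2023-07-07 中国第一汽车股份有限公司 Voice data processing method and apparatus, non-volatile storage medium and vehicle
CN116894078A (zh) * 2023-07-28 2023-10-17 广东保伦电子股份有限公司 Information interaction method, apparatus, electronic device and medium
CN119576136A (zh) * 2024-11-25 2025-03-07 国电南瑞南京控制系统有限公司 AI intelligent dialogue system based on a virtual digital human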

Also Published As

Publication number Publication date
CN109429522A (zh) 2019-03-05

Similar Documents

Publication Publication Date Title
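WO2018102980A1 (fr) Voice interaction method, device and system
CN108022586B (zh) Method and apparatus for controlling a page
US11935521B2 (en) Real-time feedback for efficient dialog processing
CN112334892B (zh) Selectively generating expanded responses that guide continuance of a human-to-computer dialog
US9805718B2 (en) Clarifying natural language input using targeted questions
JP6667504B2 (ja) Orphaned utterance detection system and method
CN116012481B (zh) Image generation processing method, apparatus, electronic device and storage medium
CN112100352A (zh) Method, apparatus, client and storage medium for dialogue with a virtual object
KR102429407B1 (ko) User-configured and customized interactive dialog application
WO2021022992A1 (fr) Dialogue generation model training method and device, dialogue generation method and device, and medium
US12210973B2 (en) Compressing neural networks for natural language understanding
CN111601145A (zh) Live-streaming-based content display method, apparatus, device and storage medium
WO2017186050A1 (fr) Segmented sentence recognition method and device for a human-machine intelligent question-answering system
KR102104294B1 (ko) Sign language video chatbot application stored on a storage medium readable by a display device
CN115905490B (zh) Human-machine interaction dialogue method, apparatus and device
US20220068267A1 (en) Method and apparatus for recognizing speech, electronic device and storage medium
CN118764681B (zh) Interaction method for video, and processing method and apparatus for video
CN115167733A (zh) Method and apparatus for displaying live-streaming resources, electronic device and storage medium
KR101385316B1 (ko) System and method for providing an advertisement- and content-linked conversation service using a robot
CN110633357A (zh) Voice interaction method, apparatus, device and medium
CN113763925A (zh) Speech recognition method, apparatus, computer device and storage medium
CN116680376A (zh) Information interaction method, apparatus, electronic device and storage medium
US20210098012A1 (en) Voice Skill Recommendation Method, Apparatus, Device and Storage Medium
CN113111658B (zh) Method, apparatus, device and storage medium for verifying information
CN114691922A (zh) Virtual-object-based conversation processing method, apparatus and device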

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application Ref document number: 16923188; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase Ref country code: DE
122 EP: PCT application non-entry in European phase Ref document number: 16923188; Country of ref document: EP; Kind code of ref document: A1