CN111402869B - Multi-voice mode man-machine dialogue system - Google Patents

Info

Publication number
CN111402869B
Authority
CN
China
Prior art keywords
module
voice
mobile terminal
voice interaction
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201811524605.5A
Other languages
Chinese (zh)
Other versions
CN111402869A (en)
Inventor
司马华鹏
陈莉萍
茅玥琪
孙翊杰
陆放
司马德一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing baide yuancheng trading Co.,Ltd.
Original Assignee
Suqian Silicon Based Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suqian Silicon Based Intelligent Technology Co ltd
Priority to CN201811524605.5A
Publication of CN111402869A
Application granted
Publication of CN111402869B
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G08 SIGNALLING
    • G08C TRANSMISSION SYSTEMS FOR MEASURED VALUES, CONTROL OR SIMILAR SIGNALS
    • G08C17/00 Arrangements for transmitting signals characterised by the use of a wireless electrical link
    • G08C17/02 Arrangements for transmitting signals characterised by the use of a wireless electrical link using a radio link
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/22 Interactive procedures; Man-machine interfaces
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72409 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories
    • H04M1/72415 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories for remote control of appliances
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00 Reducing energy consumption in communication networks
    • Y02D30/70 Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Environmental & Geological Engineering (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a multi-voice mode man-machine dialogue system, which solves the problem that conventional voice interaction systems offer only a single voice mode. The key points of the technical solution are as follows: a retrieval module retrieves voice data from a storage module and sends it to a voice interaction module; an audio input module collects the user's voice information; and an audio output module outputs the voice interaction module's reply as speech, thereby realizing voice interaction between the voice interaction module and the user.

Description

Multi-voice mode man-machine dialogue system

Technical field

The invention relates to voice interaction systems, and in particular to a multi-voice mode man-machine dialogue system.

Background art

Intelligent voice interaction is a new generation of interaction mode based on voice input: the user obtains feedback simply by speaking, and voice assistants are a typical application scenario. In practice, however, a voice interaction system speaks to the user in only a single voice, which limits its application scenarios, so there is still room for improvement.

Summary of the invention

The purpose of the present invention is to provide a multi-voice mode man-machine dialogue system capable of switching its voice mode according to the application scenario.

The above technical purpose of the present invention is achieved through the following technical solutions:

A multi-voice mode man-machine dialogue system includes an audio input module, a control module and an audio output module coupled in sequence. The control module includes a voice interaction module, a storage module, a switching module and a retrieval module, and the storage module pre-stores a plurality of voice data in different voice forms. The audio input module collects the voice information of an external user and sends it to the voice interaction module; the retrieval module retrieves voice data from the storage module and sends it to the voice interaction module; the voice interaction module conducts voice interaction with the user through the audio output module according to the received voice information and voice data; and the switching module switches the voice data retrieved by the retrieval module.

With the above solution, the user can select voice data packets in different voice forms while interacting with the voice interaction module, so that the audio output module can talk to the user in different voice modes (for example, the voices of people of different ages). This adapts the system to different application scenarios and makes it more user-friendly. The audio input module collects the user's voice information, and the audio output module outputs the voice interaction module's reply as speech, thereby realizing voice interaction between the voice interaction module and the user.
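The relationship between the storage, switching and retrieval modules can be sketched as follows. This is only an illustrative sketch: the class names, the profile names and the wrap-around switching order are assumptions, since the patent does not say how a voice is chosen when the switching module is triggered.

```python
from dataclasses import dataclass


@dataclass
class VoiceStore:
    """Storage module: pre-stored voice data keyed by a hypothetical profile name."""
    profiles: dict[str, bytes]


@dataclass
class VoiceSelector:
    """Switching and retrieval modules combined (an assumption made for brevity)."""
    store: VoiceStore
    index: int = 0

    def current(self) -> str:
        """Name of the voice profile that is currently selected."""
        return list(self.store.profiles)[self.index]

    def switch(self) -> str:
        """Advance to the next stored voice, wrapping around after the last one."""
        self.index = (self.index + 1) % len(self.store.profiles)
        return self.current()

    def retrieve(self) -> bytes:
        """Return the voice data that would be sent to the voice interaction module."""
        return self.store.profiles[self.current()]
```

Each trigger of the switching module corresponds to one call to `switch()`, and `retrieve()` returns whatever voice data is selected at that moment.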

Preferably, the control module further includes a counting module, and the control module is also coupled to a biometric identification module.

When the audio input module starts to collect the voice information of an external user, the biometric identification module acquires the user's unique biometric information and sends it to the control module, and the counting module counts, under that unique biometric information, the number of times the retrieval module retrieves each type of voice data. When the biometric identification module acquires the unique biometric information of the same user again, the retrieval module can retrieve from the storage module the voice data that currently has the highest retrieval count.

With the above solution, the biometric identification module can recognize each user's unique biometric features and thereby determine the identity of the current user. The counting module counts how many times each user has retrieved each type of voice data through the retrieval module, so that the next time the user starts a voice interaction, the retrieval module can give priority to the most frequently used voice data. This improves operating efficiency and is more user-friendly.
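The per-user counting described above can be sketched with a counter per user. The sketch assumes that the biometric identification module yields a stable user identifier, and it falls back to a default voice for unknown users; neither the identifier format nor the fallback is specified in the patent.

```python
from collections import Counter, defaultdict


class UsageCounter:
    """Counting module sketch: per-user retrieval counts for each voice profile."""

    def __init__(self, default_voice: str) -> None:
        # The fallback voice for first-time users is an assumption.
        self.default_voice = default_voice
        self._counts: defaultdict[str, Counter] = defaultdict(Counter)

    def record(self, user_id: str, voice: str) -> None:
        """Record that `voice` was retrieved while `user_id` was identified."""
        self._counts[user_id][voice] += 1

    def preferred(self, user_id: str) -> str:
        """Return the most frequently retrieved voice for a returning user."""
        counts = self._counts.get(user_id)
        if not counts:
            return self.default_voice
        return counts.most_common(1)[0][0]
```

When two voices have the same count, `Counter.most_common` returns the one recorded first; the patent does not say how ties should be resolved.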

Preferably, the control module is also coupled to a display module, and the control module further includes a numbering module. The numbering module numbers each voice data item in the storage module in sequence; after the retrieval module retrieves a voice data item, the display module can display the number corresponding to that voice data.

With the above solution, the user can check on the display module which voice data the retrieval module has currently retrieved and thus know the corresponding voice mode, which improves operating efficiency and is more user-friendly.
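A minimal sketch of the numbering and display steps follows. It assumes plain integer numbers starting at 1 and a display string that also shows the profile name; the patent only requires that the number be displayed, so the label format is hypothetical.

```python
def number_voices(names: list[str], start: int = 1) -> dict[str, int]:
    """Numbering module sketch: assign sequential numbers in storage order."""
    return {name: number for number, name in enumerate(names, start=start)}


def display_text(numbering: dict[str, int], retrieved: str) -> str:
    """Display module sketch: text shown for the currently retrieved voice."""
    return f"Voice {numbering[retrieved]}: {retrieved}"
```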

Preferably, the biometric identification module is a face recognition module.

The above solution uses face recognition, a biometric technology that identifies a person from facial feature information. It covers a series of related techniques: a camera captures images or a video stream containing a face, the face is automatically detected and tracked in the images, and the detected face is then recognized. Face recognition offers high accuracy, fast response and a natural recognition process, and it is unobtrusive.

Preferably, the system further includes a mobile terminal; the control module further includes an authorization module, and the control module is also coupled to a wireless communication module for remote data exchange with the mobile terminal. In response to an external trigger, the mobile terminal can remotely send an authorization instruction to the wireless communication module, which forwards the received authorization instruction to the authorization module. The authorization module authorizes the retrieval module to retrieve voice data from the storage module and send it to the voice interaction module. At the same time, the mobile terminal starts collecting the user's voice information and remotely sends it to the wireless communication module, which forwards the received voice information to the voice interaction module. The voice interaction module generates a reply instruction according to the received voice information and voice data and remotely sends the reply instruction to the mobile terminal through the wireless communication module. The mobile terminal converts the text content of the reply instruction into speech and interacts with the user in the voice corresponding to the voice data.

With the above solution, the user can interact with the voice interaction module remotely through the mobile terminal, which improves the efficiency and convenience of voice interaction and is more user-friendly.
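The order of the remote steps (authorization first, then the utterance, then a text reply that the terminal voices) can be sketched as below. Everything here is a stand-in: the `"AUTHORIZE"` token, the echo reply and the bracketed voice tag are assumptions, the wireless link is replaced by direct method calls, and already-recognized text stands in for the audio the terminal would actually send.

```python
from dataclasses import dataclass


@dataclass
class Reply:
    """Reply instruction: text content plus the voice it should be spoken in."""
    text: str
    voice: str


class ControlModule:
    """Control-side sketch: authorization module plus voice interaction module."""

    def __init__(self, selected_voice: str) -> None:
        self.selected_voice = selected_voice
        self.authorized = False

    def handle_authorization(self, instruction: str) -> bool:
        # Hypothetical check; the patent does not define the instruction format.
        self.authorized = instruction == "AUTHORIZE"
        return self.authorized

    def handle_utterance(self, recognized_text: str) -> Reply:
        if not self.authorized:
            raise PermissionError("no authorization instruction received")
        # Placeholder reply logic standing in for the real dialogue engine.
        return Reply(text=f"You said: {recognized_text}", voice=self.selected_voice)


class MobileTerminal:
    """Terminal-side sketch: triggers authorization and voices the text reply."""

    def __init__(self, control: ControlModule) -> None:
        self.control = control

    def start_session(self) -> bool:
        return self.control.handle_authorization("AUTHORIZE")

    def ask(self, recognized_text: str) -> str:
        reply = self.control.handle_utterance(recognized_text)
        # Stand-in for text-to-speech in the selected voice.
        return f"[{reply.voice}] {reply.text}"
```

Keeping the reply as text until it reaches the terminal mirrors the description, in which the mobile terminal performs the final conversion to speech.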

Preferably, the control module further includes a distance judgment module, the mobile terminal contains a positioning module, and the distance judgment module is preset with a reference distance value. The positioning module acquires the current location of the mobile terminal and remotely sends the location information to the wireless communication module; the wireless communication module forwards the received location information to the distance judgment module; and the distance judgment module calculates the actual distance between the control module and the mobile terminal and compares it with the reference distance value.

If the actual distance is greater than the reference distance value, the control module directs the voice interaction module to conduct remote voice interaction with the mobile terminal through the wireless communication module. Otherwise, if the actual distance is less than or equal to the reference distance value, the control module directs the voice interaction module to conduct near-end voice interaction with the user through the audio input module and the audio output module.

With the above solution, while the user interacts with the voice interaction module, the distance judgment module calculates the actual distance between the mobile terminal and the control module in real time and compares it with the reference distance value. If the actual distance is greater than the reference distance value, the mobile terminal is far from the control module, and the control module automatically switches the voice interaction module's interaction target to the mobile terminal to enable remote voice interaction. Otherwise, if the actual distance is less than or equal to the reference distance value, the mobile terminal is close to the control module, and the control module switches the interaction target to the audio input module and the audio output module to enable local voice interaction. Switching the interaction target automatically according to the actual distance between the mobile terminal and the control module is more efficient, more convenient and more user-friendly.
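The comparison rule above can be sketched as follows. The patent does not state how the distance is computed from the location information, so the haversine great-circle formula, the mean Earth radius and the function names are all assumptions; only the threshold rule (greater than the reference means remote, otherwise local) comes from the text.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres (assumed constant)


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_M * math.asin(min(1.0, math.sqrt(a)))


def choose_channel(actual_m: float, reference_m: float) -> str:
    """Distance judgment rule: 'remote' if farther than the reference, else 'local'."""
    return "remote" if actual_m > reference_m else "local"
```

A distance exactly equal to the reference value selects the local channel, matching the "less than or equal to" wording of the description.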

Preferably, the positioning module is a GPS module.

With the above solution, the GPS module can perform positioning and navigation in real time anywhere in the world. GPS, the Global Positioning System, is an all-round, all-weather, round-the-clock, high-precision satellite navigation system that provides users worldwide with low-cost, high-precision navigation information such as three-dimensional position, velocity and precise timing. It is a model application of satellite communication technology in the field of navigation.

Preferably, the display module is a display screen or a touch screen.

With the above solution, the display screen or touch screen can present high-definition video images at high quality, improving the user's voice interaction experience.

In summary, the present invention has the following beneficial effects: while interacting with the voice interaction module, the user can select voice data packets in different voice forms, so that the audio output module can talk to the user in different voice modes (for example, the voices of people of different ages), which adapts the system to different application scenarios and is more user-friendly. The audio input module collects the user's voice information, and the audio output module outputs the voice interaction module's reply as speech, thereby realizing voice interaction between the voice interaction module and the user.

Description of drawings

FIG. 1 is a schematic diagram of the system structure of this embodiment.

In the figure: 1. audio input module; 2. control module; 3. audio output module; 4. biometric identification module; 5. display module; 6. mobile terminal; 7. wireless communication module; 8. positioning module.

Detailed description

The present invention is described in further detail below with reference to the accompanying drawing.

This embodiment discloses a multi-voice mode man-machine dialogue system which, as shown in FIG. 1, includes an audio input module 1, a control module 2 and an audio output module 3 coupled in sequence. The control module 2 is a chip with data processing capability, including but not limited to a CPU, ARM processor, MCU, single-chip microcomputer or DSP; the audio input module 1 may be a microphone, a sound pickup or the like; and the audio output module 3 may be a loudspeaker, an electric horn, a buzzer or the like. In this embodiment, the control module 2 includes a voice interaction module, a storage module, a switching module and a retrieval module. The storage module is a chip with data storage capability, including but not limited to a floppy disk, hard disk, optical disc or USB flash drive. The switching module may be a physical button or a virtual button integrated in a touch screen, and it outputs a corresponding switching instruction to the retrieval module in response to an external trigger. The storage module pre-stores a plurality of voice data in different voice forms, corresponding to the voices of different types of people, such as male voices, female voices and voices of different age groups.

More specifically, the audio input module 1 collects the voice information of an external user and sends it to the voice interaction module. The voice interaction module converts the received voice information into a text instruction, intelligently generates a corresponding reply instruction from the text instruction, converts the reply instruction into voice information and sends it to the audio output module 3. The retrieval module retrieves voice data from the storage module and sends it to the voice interaction module, so the user can retrieve the voice data for whichever voice is wanted. The voice interaction module conducts voice interaction with the user through the audio output module 3 according to the received voice information and voice data; during the interaction, the voice interaction module controls the audio output module 3 to speak in the voice mode corresponding to the voice data. The switching module switches the voice data retrieved by the retrieval module: triggering the switching module changes the retrieved voice data and thus the voice mode of the audio output module 3, so that the voice interaction module can interact with the user in multiple voice modes.
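The local processing chain in this paragraph (speech to text instruction, text instruction to reply, reply to speech in the selected voice) can be sketched as a function that composes three pluggable stages. The stage signatures are assumptions; the patent names no speech recognition or synthesis engine, so the stages are passed in as callables rather than tied to any real library.

```python
from typing import Callable


def interact(
    audio: bytes,
    voice: str,
    recognize: Callable[[bytes], str],
    generate_reply: Callable[[str], str],
    synthesize: Callable[[str, str], bytes],
) -> bytes:
    """One turn of local interaction: audio in, audio out in the selected voice."""
    text_instruction = recognize(audio)          # voice information -> text instruction
    reply_text = generate_reply(text_instruction)  # text instruction -> reply instruction
    return synthesize(reply_text, voice)         # reply -> speech in the chosen voice mode
```

Because the voice is a separate argument, triggering the switching module only changes the value passed as `voice`; recognition and reply generation are unaffected.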

Furthermore, the control module 2 also includes a counting module, and the control module 2 is also coupled to a biometric identification module 4, which is a face recognition module. When the audio input module 1 starts to collect the voice information of an external user, the biometric identification module 4 acquires the user's unique biometric information and sends it to the control module 2. The control module 2 creates a user information database under the received unique biometric information and stores the database in the storage module. The counting module counts, under that unique biometric information, the number of times the retrieval module retrieves each type of voice data and stores the counts in the corresponding user information database, so as to track how often each user uses each type of voice data.

When the biometric identification module 4 acquires the unique biometric information of the same user again, the retrieval module can retrieve from the storage module the voice data that currently has the highest retrieval count. In other words, when the user interacts with the voice interaction module again, the retrieval module automatically retrieves the most frequently used voice data from the storage module.

Furthermore, the control module 2 is also coupled to a display module 5, which is a display screen or a touch screen. The control module 2 also includes a numbering module, which numbers each voice data item in the storage module in sequence. A number is any mark that serves as an identifier, including but not limited to numeric, alphabetic, Chinese or English marks. After the retrieval module retrieves a voice data item, the display module 5 can display the number corresponding to that voice data, so that the user knows which voice mode the audio output module 3 is currently using.

Furthermore, the multi-voice mode man-machine dialogue system also includes a mobile terminal 6, which may be a remote communication device with a voice call function, such as a smartphone or tablet computer. The mobile terminal 6 in this embodiment belongs to the user who conducts the voice interaction. The control module 2 also includes an authorization module, and the control module 2 is also coupled to a wireless communication module 7 for remote data exchange with the mobile terminal 6. The wireless communication module 7 may use 4G-LTE, WCDMA, HSPA, Bluetooth, infrared, laser, microwave, CDPD, or a communication technology conforming to the IEEE 802.11 or IEEE 802.16 protocol, so that it can communicate remotely with the outside.

In this embodiment, when the user needs to conduct voice interaction remotely, the user triggers the mobile terminal 6. In response to this external trigger, the mobile terminal 6 remotely sends an authorization instruction to the wireless communication module 7, which forwards the received authorization instruction to the authorization module. The authorization module authorizes the retrieval module to retrieve voice data from the storage module and send it to the voice interaction module. At the same time, the mobile terminal 6 starts collecting the user's voice information and remotely sends it to the wireless communication module 7, which forwards the received voice information to the voice interaction module. The voice interaction module generates a reply instruction according to the received voice information and voice data and remotely sends it to the mobile terminal 6 through the wireless communication module 7. The mobile terminal 6 converts the text content of the reply instruction into speech and interacts with the user in the voice mode corresponding to the voice data. The user can thus interact with the voice interaction module remotely through the mobile terminal 6, which improves the efficiency and convenience of the conversation and is more user-friendly.

Furthermore, the control module 2 also includes a distance judgment module, and the mobile terminal 6 contains a positioning module 8, which is a GPS module. The distance judgment module is preset with a reference distance value. In this embodiment, the positioning module 8 acquires the current location of the mobile terminal 6 and remotely sends the location information to the wireless communication module 7; the wireless communication module 7 forwards the received location information to the distance judgment module; and the distance judgment module calculates the actual distance between the control module 2 and the mobile terminal 6 and compares it with the reference distance value to determine whether the mobile terminal 6 is close to the control module 2. More specifically, if the actual distance is greater than the reference distance value, the mobile terminal 6 is far from the control module 2, and the control module 2 directs the voice interaction module to conduct remote voice interaction with the mobile terminal 6 through the wireless communication module 7, which is more convenient. Conversely, if the actual distance is less than or equal to the reference distance value, the mobile terminal 6 is close to the control module 2, and the control module 2 directs the voice interaction module to conduct near-end voice interaction with the user through the audio input module 1 and the audio output module 3. This reduces the energy consumption of the mobile terminal 6, extends its battery life, and is more energy-saving and environmentally friendly.

This specific embodiment merely explains the present invention and does not limit it. After reading this specification, those skilled in the art may, as needed, make modifications to this embodiment that involve no inventive contribution, and all such modifications are protected by patent law as long as they fall within the scope of the claims of the present invention.

Claims (6)

1. A multi-voice mode man-machine dialogue system, characterized in that: it includes an audio input module (1), a control module (2) and an audio output module (3) coupled in sequence; the control module (2) includes a voice interaction module, a storage module, a switching module and a retrieval module, and the storage module pre-stores a plurality of voice data in different voice forms; the audio input module (1) is used to collect the voice information of an external user and send it to the voice interaction module, the retrieval module is used to retrieve voice data from the storage module and send it to the voice interaction module, and the voice interaction module conducts voice interaction with the user through the audio output module (3) according to the received voice information and voice data; the switching module is used to switch the voice data retrieved by the retrieval module;

the system further includes a mobile terminal (6), the control module (2) further includes an authorization module, and the control module (2) is also coupled to a wireless communication module (7) for remote data exchange with the mobile terminal (6); the mobile terminal (6) can, in response to an external trigger, remotely send an authorization instruction to the wireless communication module (7), the wireless communication module (7) sends the received authorization instruction to the authorization module, and the authorization module authorizes the retrieval module to retrieve voice data from the storage module and send it to the voice interaction module; at the same time, the mobile terminal (6) starts to collect the user's voice information and remotely sends it to the wireless communication module (7), the wireless communication module (7) sends the received voice information to the voice interaction module, the voice interaction module generates a reply instruction according to the received voice information and voice data and remotely sends the reply instruction to the mobile terminal (6) through the wireless communication module (7), and the mobile terminal (6) converts the text content of the reply instruction into speech and conducts voice interaction with the user in the voice corresponding to the voice data;

the control module (2) further includes a distance judgment module, the mobile terminal (6) contains a positioning module (8), and the distance judgment module is preset with a reference distance value; the positioning module (8) is used to acquire the current location information of the mobile terminal (6) and remotely send it to the wireless communication module (7); the wireless communication module (7) sends the received location information to the distance judgment module, and the distance judgment module calculates the actual distance value between the control module (2) and the mobile terminal (6) and compares the calculated actual distance value with the reference distance value; if the actual distance value is greater than the reference distance value, the control module (2) controls the voice interaction module to conduct remote voice interaction with the mobile terminal (6) through the wireless communication module (7); otherwise, if the actual distance value is less than or equal to the reference distance value, the control module (2) controls the voice interaction module to conduct near-end voice interaction with the user through the audio input module (1) and the audio output module (3).

2. The multi-voice mode man-machine dialogue system according to claim 1, characterized in that: the control module (2) further includes a counting module, and the control module (2) is also coupled to a biometric identification module (4); when the audio input module (1) starts to collect the voice information of an external user, the biometric identification module (4) is used to acquire the user's unique biometric information and send it to the control module (2), and the counting module is used to count, under that unique biometric information, the number of times the retrieval module retrieves each type of voice data; when the biometric identification module (4) acquires the unique biometric information of the same user again, the retrieval module can retrieve from the storage module the voice data that currently has the highest retrieval count.

3. 
The multi-voice mode man-machine dialogue system according to claim 2, characterized in that: the control module (2) is also coupled with a display module (5), and the control module (2) also includes a serial number module; the numbering module sequentially numbers each voice data in the storage module; when the call module calls the corresponding voice data, the display module (5) can display the number corresponding to the voice data. 4.根据权利要求2所述的多人声模式人机对话系统,其特征是:所述生物特征辨识模块(4)为人脸识别模块。4. The multi-voice mode man-machine dialogue system according to claim 2, characterized in that: the biometric identification module (4) is a face recognition module. 5.根据权利要求1所述的多人声模式人机对话系统,其特征是:所述定位模块(8)为GPS模块。5. The multi-voice mode man-machine dialogue system according to claim 1, characterized in that: the positioning module (8) is a GPS module. 6.根据权利要求3所述的多人声模式人机对话系统,其特征是:所述显示模块(5)为显示屏或者触摸屏。6. The multi-voice mode man-machine dialogue system according to claim 3, characterized in that: the display module (5) is a display screen or a touch screen.
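For illustration only, the distance comparison of claim 1 and the per-user counting of claim 2 might be sketched as follows. The claims do not say how the actual distance is computed or how the counts are stored; the haversine formula, the assumption that the control module knows its own coordinates, and all names here (`haversine_m`, `choose_channel`, `VoiceUsageCounter`, the `"remote"` and `"near_end"` labels) are hypothetical choices made for this sketch, not part of the patent.

```python
import math
from collections import Counter, defaultdict

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius in metres (approximation)


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees.

    Assumption: the claims only require "an actual distance value";
    haversine is one common way to get it from GPS coordinates.
    """
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def choose_channel(device_pos, terminal_pos, reference_distance_m):
    """Distance judgment: pick remote or near-end voice interaction.

    Greater than the reference distance -> remote interaction through the
    wireless communication module; less than or equal -> near-end
    interaction through the audio input and output modules.
    """
    actual = haversine_m(*device_pos, *terminal_pos)
    return "remote" if actual > reference_distance_m else "near_end"


class VoiceUsageCounter:
    """Counting module: per-user tally of how often each voice was retrieved."""

    def __init__(self):
        # Keyed by a hypothetical user identifier derived from the
        # biometric information; each value counts voice identifiers.
        self._counts = defaultdict(Counter)

    def record(self, user_id, voice_id):
        """Register one retrieval of voice_id for this user."""
        self._counts[user_id][voice_id] += 1

    def preferred_voice(self, user_id, default=None):
        """Return the voice retrieved most often for this user so far."""
        counts = self._counts.get(user_id)
        if not counts:
            return default
        return counts.most_common(1)[0][0]
```

A caller would run `choose_channel` each time the mobile terminal reports a new position, and would call `preferred_voice` when the biometric identification module recognises a returning user. How ties between equally used voices should be resolved is not stated in the claims; this sketch simply returns whichever `Counter.most_common` lists first.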
CN201811524605.5A 2018-12-13 2018-12-13 Multi-voice mode man-machine dialogue system Active CN111402869B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811524605.5A CN111402869B (en) 2018-12-13 2018-12-13 Multi-voice mode man-machine dialogue system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811524605.5A CN111402869B (en) 2018-12-13 2018-12-13 Multi-voice mode man-machine dialogue system

Publications (2)

Publication Number Publication Date
CN111402869A CN111402869A (en) 2020-07-10
CN111402869B true CN111402869B (en) 2023-09-01

Family

ID=71430092

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811524605.5A Active CN111402869B (en) 2018-12-13 2018-12-13 Multi-voice mode man-machine dialogue system

Country Status (1)

Country Link
CN (1) CN111402869B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112151037A (en) * 2020-09-23 2020-12-29 江苏小梦科技有限公司 Man-machine conversation system based on embedded software

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102959884A (en) * 2010-04-14 2013-03-06 迈克尔·博克瑟 Method and apparatus for identifying objects and triggering interactions using close-range coupling of sound-modulated data signals
CN202838711U (en) * 2012-07-06 2013-03-27 北京千家悦网络科技有限公司 Device for interacting via language and interaction system
CN105139850A (en) * 2015-08-12 2015-12-09 西安诺瓦电子科技有限公司 Speech interaction device, speech interaction method and speech interaction type LED asynchronous control system terminal
CN106297783A (en) * 2016-08-05 2017-01-04 易晓阳 A kind of interactive voice identification intelligent terminal
CN106569773A (en) * 2016-10-31 2017-04-19 努比亚技术有限公司 Terminal and voice interaction processing method
CN106998398A (en) * 2017-03-28 2017-08-01 杭州三为电子技术有限公司 Telephone operation machine voice input system
CN207060261U (en) * 2017-08-08 2018-03-02 潍坊歌尔电子有限公司 Intelligent bicycle code table

Also Published As

Publication number Publication date
CN111402869A (en) 2020-07-10

Similar Documents

Publication Publication Date Title
CN103605975B (en) A kind of method, apparatus and terminal device of image processing
CN103730116B (en) Intelligent watch realizes the system and method that intelligent home device controls
CN107708007A (en) A wireless earphone control method, device and wireless earphone
CN104580699B (en) Acoustic control intelligent terminal method and device when a kind of standby
CN109637548A (en) Voice interactive method and device based on Application on Voiceprint Recognition
WO2014008843A1 (en) Method for updating voiceprint feature model and terminal
CN109658927A (en) Wake-up processing method, device and the management equipment of smart machine
CN109949801A (en) A kind of smart home device sound control method and system based on earphone
CN111522524B (en) Presentation control method and device based on conference robot, storage medium and terminal
CN119882999A (en) Information processing method and device, master control equipment and controlled equipment
CN109151789A (en) Interpretation method, device, system and bluetooth headset
US20240386893A1 (en) Smart glasses, system and control method based on generative artificial intelligence large language models
CN109284081A (en) A kind of audio output method, device and audio equipment
CN112912955A (en) Electronic devices and systems providing voice recognition based services
WO2020192215A1 (en) Interactive method and wearable interactive device
WO2021244058A1 (en) Process execution method, device, and readable medium
CN110109608A (en) Text display method, device, terminal and storage medium
CN114765026A (en) Voice control method, device and system
CN111402869B (en) Multi-voice mode man-machine dialogue system
CN114283798A (en) Radio receiving method of handheld device and handheld device
CN106873939A (en) Electronic equipment and its application method
CN106730876A (en) The control system of Intelligent doll
WO2025148664A1 (en) Large language model-based smart glasses control system, method, and smart glasses
WO2018023514A1 (en) Home background music control system
WO2018023523A1 (en) Motion and emotion recognizing home control system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
    Effective date of registration: 2021-10-21
    Address after: 223809 Room 201, building B19, insurance Town, Hubin new area, Suqian City, Jiangsu Province
    Applicant after: Suqian silicon based Intelligent Technology Co.,Ltd.
    Address before: Room 602, Huatong Science Park, No. 66, software Avenue, Yuhuatai District, Nanjing, Jiangsu 210000
    Applicant before: NANJING SILICON INTELLIGENCE TECHNOLOGY Co.,Ltd.
GR01 Patent grant
TR01 Transfer of patent right
    Effective date of registration: 2025-05-28
    Address after: 210000 building 4, No.5 Hengsheng Road, economic development zone, Gaochun District, Nanjing City, Jiangsu Province
    Patentee after: Nanjing baide yuancheng trading Co.,Ltd.
    Country or region after: China
    Address before: 223809 Room 201, building B19, insurance Town, Hubin new area, Suqian City, Jiangsu Province
    Patentee before: Suqian silicon based Intelligent Technology Co.,Ltd.
    Country or region before: China