CN111246008A - Method, system and device for realizing telephone assistant - Google Patents
- Publication number
- CN111246008A (CN202010024599.8A)
- Authority
- CN
- China
- Prior art keywords
- incoming call
- mobile terminal
- voice
- call
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/66—Substation equipment, e.g. for use by subscribers with means for preventing unauthorised or fraudulent calling
- H04M1/663—Preventing unauthorised calls to a telephone set
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72436—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for text messaging, e.g. short messaging services [SMS] or e-mails
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72484—User interfaces specially adapted for cordless or mobile telephones wherein functions are triggered by incoming communication events
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/12—Messaging; Mailboxes; Announcements
- H04W4/14—Short messaging services, e.g. short message services [SMS] or unstructured supplementary service data [USSD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2250/00—Details of telephonic subscriber devices
- H04M2250/74—Details of telephonic subscriber devices with voice recognition means
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computer Security & Cryptography (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Business, Economics & Management (AREA)
- Evolutionary Computation (AREA)
- General Business, Economics & Management (AREA)
- Telephone Function (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Technical Field
The present invention relates to the field of communication technologies, and in particular to a method, system and device for implementing a telephone assistant.
Background
With the development of communication technology, mobile terminals have become part of every aspect of daily life and bring great convenience. At the same time, they also cause users considerable trouble. For example, merchants use harassing calls as a sales channel, calling users frequently to promote products, and some callers go further and use such calls to defraud users of money. Harassing calls have seriously affected users' lives and even pose risks to their personal safety and property.
At present, many technologies similar to a mobile-terminal telephone assistant exist for intercepting harassing calls, for example downloading and installing a security application on the mobile terminal to intercept certain harassing calls, or setting up blacklists on the mobile terminal for precise interception. For calls that genuinely matter, the telephone assistant lets the user take part directly. However, a security application sometimes cannot accurately identify harassing calls because of network or update problems, and because the numbers used for harassing calls change frequently, a blacklist set on the mobile terminal does not solve the problem well. A telephone assistant set up on the mobile terminal therefore does not give users a good experience; in most cases users still have to screen calls manually, which wastes a great deal of their time and energy.
Specifically, mobile terminals intercept and handle harassing calls in the following ways.
Patent application publication No. CN109688276A relates to an incoming-call filtering system and method based on artificial intelligence technology, which addresses false interception in traditional anti-harassment blocking. It comprises a terminal device of user one and a terminal device of user two, both of which contain a communication module, the terminal device of user two having a call forwarding function. The system further comprises a telephone exchange containing an automatic answering system, which includes a number verification module, a voiceprint recognition module, a speech recognition module, a dialogue system module, a speech synthesis module, a dialogue verification module and a storage module. That application describes in detail a process for intelligently intercepting harassing calls, but it verifies incoming calls against values preset by the user, and the conversation with the user then has to be started over.
Patent application publication No. CN109995925A relates to a harassing-call identification method, a terminal and a computer-readable storage medium. When an incoming call is detected, the method acquires identification information of the incoming call, calculates an incoming-call feature value from the identification information and preset identification information, and identifies the incoming call according to the incoming-call feature value and a preset feature value. It addresses the problem that existing methods cannot identify harassing numbers accurately and promptly, which results in a poor user experience. That application thus intercepts harassing calls by means of preset feature values and marking information.
Patent application publication No. CN106302942A relates to a method for intelligently intercepting harassing calls. A voice clip preset in the intelligent interception system is used to detect whether an incoming call is a recorded call, supplemented by measures such as requiring unfamiliar numbers to call a second time, whitelists and blacklists, online lookup and keypress monitoring, to judge and intercept harassing calls. The method can autonomously identify mass-dialed calls and recorded calls, look up the caller's number online, monitor calls from unfamiliar numbers in real time and record them in full, and give a voice reminder when the user presses a key. It can effectively counter recorded-message fraud and mass-dialed calls and protect users from harassing and fraudulent calls. That application is mainly aimed at recorded-message fraud, which the preset recording can intercept effectively.
Patent application publication No. CN109672786A relates to a method and device for answering incoming calls, comprising: step 101, when the terminal receives an incoming call, a voice assistant automatically answers it; step 103, the voice assistant records the caller's number and/or the voice content of the call according to preset slots, fills the relevant content into the slot values corresponding to the preset slots, and converses with the caller. The method lets the voice assistant handle incoming calls intelligently: it can hold multiple rounds of dialogue with the caller, obtain the caller's intent and answer the caller's questions as needed, so that callers with urgent matters receive a timely reply. It helps users deal with harassing calls, makes intelligent call handling more user-friendly and improves the user experience. In that application, the call is handed over entirely to the voice assistant, which talks with the caller on the user's behalf and finally notifies the user of the content of the conversation.
As can be seen, when the above solutions process incoming calls to meet the user's answering needs, they either verify the call against preset information, or use a recorded message or a voice assistant to converse with the caller and then notify the user. This requires the user to take part in and configure the handling before or after the call, which is inconvenient. Furthermore, the recorded message or voice-assistant dialogue is preset for the corresponding incoming call and is therefore not intelligent.
Summary of the Invention
In view of this, embodiments of the present invention provide a method for implementing a telephone assistant, which can process incoming calls intelligently, without requiring user configuration, to meet user needs.
Embodiments of the present invention also provide a system for implementing a telephone assistant, which can process incoming calls intelligently, without requiring user configuration, to meet user needs.
Embodiments of the present invention also provide a device for implementing a telephone assistant, which can process incoming calls intelligently, without requiring user configuration, to meet user needs.
Embodiments of the present invention are implemented as follows:
A method for implementing a telephone assistant, the method comprising:
constructing a telephone assistant application using a trained neural network model;
routing an incoming call of a mobile terminal to the telephone assistant application, the telephone assistant application performing speech recognition and semantic analysis on the content of the incoming call to obtain recognition and analysis information;
the telephone assistant application, based on the recognition and analysis information, simulating the user's voice and conducting multiple rounds of pre-conversation with the caller, obtaining incoming-call intent information, and providing it to the mobile terminal for processing.
Before the telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information, the method further comprises:
setting call interception rules in the telephone assistant application and pre-filtering incoming calls according to those rules: if a call matches the interception rules, it is intercepted directly; if it does not match the interception rules, it is connected directly to the mobile terminal; if it cannot be determined whether the call matches the interception rules, the step in which the telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information is performed.
Providing the information to the mobile terminal for processing comprises:
when the incoming-call intent information confirms that the call is a harassing call or a call falling within a preset threshold, hanging up directly and notifying the mobile terminal by short message;
when the incoming-call intent information confirms that the call is a normal call, having the mobile terminal answer the call directly, with the pre-conversation content displayed as text on the user interface of the mobile terminal.
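The two handling branches above can be sketched as follows. This is a minimal illustration, not the patented implementation: the names `Intent`, `MobileTerminal` and `handle_by_intent` are hypothetical, the terminal is a stand-in that only records what it receives, and the "preset threshold" case is left out because the text does not define the threshold.

```python
from dataclasses import dataclass, field
from enum import Enum, auto


class Intent(Enum):
    """Hypothetical labels for the incoming-call intent information."""
    HARASSING = auto()
    NORMAL = auto()


@dataclass
class MobileTerminal:
    """Stand-in for the user's phone; it only records what it receives."""
    sms_inbox: list = field(default_factory=list)
    screen_text: list = field(default_factory=list)
    ringing: bool = False


def handle_by_intent(intent, caller, transcript, terminal):
    """Dispatch a pre-screened call according to its inferred intent."""
    if intent is Intent.HARASSING:
        # Hang up and tell the user by short message what happened and why.
        terminal.sms_inbox.append(
            f"Blocked call from {caller}: judged to be harassing."
        )
        return "hung_up"
    # Normal call: ring the terminal and show the pre-conversation as text.
    terminal.ringing = True
    terminal.screen_text.extend(transcript)
    return "ringing"
```

Keeping the dispatch separate from the intent inference means the policy (what to do with each kind of call) can be changed without touching the recognition models.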
Constructing the telephone assistant application using the trained neural network model comprises:
training the neural network model using deep-learning-based automatic speech recognition (ASR) and deep-learning-based natural language processing (NLP);
The telephone assistant application performing speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information comprises:
converting the incoming call of the mobile terminal from speech to text using ASR, and then performing language processing using NLP to obtain the recognition and analysis information.
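The ASR-then-NLP ordering described above can be sketched as a two-stage pipeline. The sketch below is an assumption-laden illustration: `fake_asr` and `fake_nlp` are hypothetical placeholders for the trained models (the "audio" is simply UTF-8 text and the analysis is a keyword lookup), and `Analysis` and `recognize_and_analyze` are names invented here.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Analysis:
    """Hypothetical container for the recognition and analysis information."""
    text: str
    keywords: list


def fake_asr(audio: bytes) -> str:
    """Placeholder for a trained ASR model: the 'audio' here is UTF-8 text."""
    return audio.decode("utf-8")


def fake_nlp(text: str) -> Analysis:
    """Placeholder for a trained NLP model: flags a few hard-coded sales terms."""
    sales_terms = ("loan", "insurance", "promotion")
    lowered = text.lower()
    found = [term for term in sales_terms if term in lowered]
    return Analysis(text=text, keywords=found)


def recognize_and_analyze(
    audio: bytes,
    asr: Callable[[bytes], str] = fake_asr,
    nlp: Callable[[str], Analysis] = fake_nlp,
) -> Analysis:
    """Speech-to-text first, then language processing on the resulting text."""
    return nlp(asr(audio))
```

Passing the two stages in as callables lets real models replace the placeholders without changing the pipeline itself.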
A system for implementing a telephone assistant, comprising a mobile terminal, a caller device and an intelligent voice call assistant unit, wherein:
the caller device is configured to place an incoming call to the mobile terminal;
the intelligent voice call assistant unit is configured to construct a telephone assistant application using a trained neural network model and to route the incoming call of the mobile terminal to the telephone assistant application, the telephone assistant application performing speech recognition and semantic analysis on the content of the incoming call to obtain recognition and analysis information and, based on that information, simulating the user's voice and conducting multiple rounds of pre-conversation with the caller, obtaining incoming-call intent information and providing it to the mobile terminal for processing;
the mobile terminal is configured to perform the corresponding processing after receiving the incoming-call intent information.
The intelligent voice call assistant unit is further configured to set call interception rules in the telephone assistant application and to pre-filter incoming calls according to those rules: if a call matches the interception rules, it is intercepted directly; if it does not match the interception rules, it is connected directly to the mobile terminal; if it cannot be determined whether the call matches the interception rules, the step in which the telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information is performed.
The corresponding processing performed by the mobile terminal comprises: when the incoming-call intent information confirms that the call is a harassing call or a call falling within a preset threshold, the call is hung up directly and the mobile terminal is notified by short message; when the incoming-call intent information confirms that the call is a normal call, the mobile terminal answers directly, with the pre-conversation content displayed as text on the user interface of the mobile terminal.
The intelligent voice call assistant unit constructing the telephone assistant application using the trained neural network model comprises: training the neural network model using deep-learning-based ASR and deep-learning-based NLP. The telephone assistant application performing speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information comprises: converting the incoming call of the mobile terminal from speech to text using ASR, and then performing language processing using NLP to obtain the recognition and analysis information.
A device for implementing a telephone assistant, the device comprising an intelligent voice answering module, an intelligent voice analysis and decision module and an intelligent dialogue gist extraction module, wherein:
the intelligent voice answering module is configured to perform speech recognition and semantic analysis on the incoming call of the connected mobile terminal, send the resulting recognition and analysis information to the intelligent voice analysis and decision module, and, as instructed by that module, simulate the user's voice and conduct multiple rounds of pre-conversation with the caller;
the intelligent voice analysis and decision module is configured to make decisions based on the recognition and analysis information, instruct the intelligent voice answering module to simulate the user's voice and conduct multiple rounds of pre-conversation with the caller, and extract the content of the pre-conversation to decide on and obtain the incoming-call intent information;
the intelligent dialogue gist extraction module is configured to provide the pre-conversation content to the mobile terminal as text.
The intelligent voice answering module is further configured to set call interception rules and to pre-filter incoming calls according to those rules: if a call matches the interception rules, it is intercepted directly; if it does not match the interception rules, it is connected directly to the mobile terminal; if it cannot be determined whether the call matches the interception rules, the step in which the telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information is performed.
The intelligent voice answering module performing speech recognition and semantic analysis on the content of the incoming call to obtain the recognition and analysis information comprises: converting the incoming call of the mobile terminal from speech to text using ASR, and then performing language processing using NLP to obtain the recognition and analysis information.
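One way to picture how the three modules of the device cooperate during the multi-round pre-conversation is the sketch below. Everything in it is hypothetical: the class and function names, the fixed replies, the keyword-based intent decision and the `max_rounds` limit are illustrative assumptions, and speech recognition and voice synthesis are stubbed out as plain text.

```python
class AnsweringModule:
    """Recognizes what the caller says and speaks the replies it is told to."""

    def __init__(self):
        self.transcript = []

    def hear(self, caller_utterance: str) -> str:
        # Speech recognition and semantic analysis are stubbed as lowercasing.
        self.transcript.append(f"Caller: {caller_utterance}")
        return caller_utterance.lower()

    def say(self, reply: str) -> None:
        # A real system would synthesize this in the user's voice.
        self.transcript.append(f"Assistant: {reply}")


class DecisionModule:
    """Chooses the next reply and, at the end, the caller's intent."""

    def next_reply(self, turn: int) -> str:
        return "Hello, who is calling?" if turn == 0 else "What is this regarding?"

    def decide_intent(self, analyses) -> str:
        # Toy rule standing in for a trained model.
        return "harassing" if any("promotion" in a for a in analyses) else "normal"


class GistModule:
    """Turns the pre-conversation into text for the mobile terminal."""

    def as_text(self, transcript) -> str:
        return "\n".join(transcript)


def pre_converse(caller_utterances, max_rounds=3):
    """Run up to max_rounds of pre-conversation; return (intent, transcript text)."""
    answering, decision, gist = AnsweringModule(), DecisionModule(), GistModule()
    analyses = []
    for turn, utterance in enumerate(caller_utterances[:max_rounds]):
        analyses.append(answering.hear(utterance))
        answering.say(decision.next_reply(turn))
    return decision.decide_intent(analyses), gist.as_text(answering.transcript)
```

For example, `pre_converse(["Hi, this is a PROMOTION for you"])` yields the intent `"harassing"` together with a two-line transcript.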
As can be seen from the above, embodiments of the present invention construct a telephone assistant application using a trained neural network model and route incoming calls of a mobile terminal to that application. The telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain recognition and analysis information, simulates the user's voice based on that information to conduct multiple rounds of pre-conversation with the caller, obtains incoming-call intent information and provides it to the mobile terminal for processing. Because the present invention applies a deep-learning neural network model to the processing of incoming calls, incoming calls can be processed intelligently without requiring user configuration.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of the process of a method for implementing a telephone assistant provided by an embodiment of the present invention;
FIG. 2 is a flowchart of a method for implementing a telephone assistant provided by an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a system for implementing a telephone assistant provided by an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of a device for implementing a telephone assistant provided by an embodiment of the present invention;
FIG. 5 is a schematic diagram of the implementation of an intelligent voice call assistant unit provided by an embodiment of the present invention;
FIG. 6 is a schematic diagram of a natural language processing process provided by an embodiment of the present invention;
FIG. 7 is a schematic diagram of a speech synthesis process provided by an embodiment of the present invention;
FIG. 8 is a schematic diagram of an intelligent voice analysis and decision process provided by an embodiment of the present invention;
FIG. 9 is a schematic diagram of a basic Recursive NN model provided by an embodiment of the present invention;
FIG. 10 is a schematic diagram of slot-based recognition provided by an embodiment of the present invention;
FIG. 11 is a schematic diagram of a first specific example of an implementation of a telephone assistant provided by an embodiment of the present invention;
FIG. 12 is a schematic diagram of a second specific example of an implementation of a telephone assistant provided by an embodiment of the present invention.
Detailed Description
To make the objectives, technical solutions and advantages of the present invention clearer, the present invention is described in further detail below with reference to the accompanying drawings and embodiments.
Embodiments of the present invention construct a telephone assistant application using a trained neural network model and route incoming calls of a mobile terminal to that application. The telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain recognition and analysis information, simulates the user's voice based on that information to conduct multiple rounds of pre-conversation with the caller, obtains incoming-call intent information and provides it to the mobile terminal for processing.
In this way, because the present invention applies a deep-learning neural network model to the processing of incoming calls, incoming calls can be processed intelligently without requiring user configuration.
The telephone assistant application provided by embodiments of the present invention can, before the user answers, simulate the user's voice and conduct multiple rounds of pre-conversation with the caller, identify the incoming-call intent information through deep learning, and decide intelligently from that information whether the call should be put through. If not, it hangs up and sends a message notifying the user of the caller's intent and the reason for hanging up. When the user of the mobile terminal does need to answer, the mobile terminal pops up the normal answer-call window and the pre-conversation content is shown as text in that window, so that the user can communicate seamlessly with the caller. The overall process is shown schematically in FIG. 1.
Specifically, when the mobile terminal receives an incoming call, the call is routed to the telephone assistant application. A blacklist and a whitelist may also be set in the telephone assistant application, which pre-processes the incoming call against them: if the caller's number is on the blacklist, the call is hung up directly; if the caller's number is on the whitelist, the call is connected and reaches the mobile terminal directly; if the caller's number is on neither list, the telephone assistant application performs speech recognition and semantic analysis on the content of the incoming call to obtain recognition and analysis information and, based on that information, simulates the user's voice and conducts multiple rounds of pre-conversation with the caller to obtain the incoming-call intent information. When the telephone assistant application confirms from the intent information that the call is a harassing call or a call falling within a preset threshold, it hangs up directly and notifies the mobile terminal by short message. When it confirms from the intent information that the call is a normal call, it hands the call to the mobile terminal to be answered directly, and the pre-conversation content is displayed as text on the user interface of the mobile terminal, so that the user knows the purpose of the call in advance and can communicate with the caller more effectively.
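The blacklist and whitelist pre-processing described above amounts to a three-way decision, which can be sketched as follows. The names `PreFilterResult` and `pre_filter` are hypothetical, and checking the blacklist before the whitelist is an assumption made here for numbers that appear on both lists; the text does not address that case.

```python
from enum import Enum, auto


class PreFilterResult(Enum):
    BLOCK = auto()    # on the blacklist: hang up directly
    CONNECT = auto()  # on the whitelist: put through to the mobile terminal
    SCREEN = auto()   # on neither list: hand over to the pre-conversation


def pre_filter(caller_number, blacklist, whitelist):
    """Classify an incoming call by number before any speech processing."""
    if caller_number in blacklist:
        return PreFilterResult.BLOCK
    if caller_number in whitelist:
        return PreFilterResult.CONNECT
    return PreFilterResult.SCREEN
```

Only calls classified as `SCREEN` need the more expensive speech recognition and multi-round pre-conversation.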
In embodiments of the present invention, the telephone assistant application may be called an intelligent voice call assistant and may be deployed in a switch provided by a telecom operator; the switch sits between the mobile terminal and the caller's device and processes incoming calls.
In this way, once the mobile terminal uses the telephone assistant application, incoming calls can be handled efficiently according to the user's own wishes: calls the user is not interested in are hung up directly, and calls the user is interested in are passed to the user. Before the user answers, the pre-conversation content is displayed as text on the user interface of the mobile terminal, so that the user knows the purpose of the call in advance and can respond more appropriately.
FIG. 2 is a flowchart of a method for implementing a telephone assistant provided by an embodiment of the present invention. The specific steps are:
Step 201: construct a telephone assistant application using a trained neural network model;
Step 202: route the incoming call of the mobile terminal to the telephone assistant application, which performs speech recognition and semantic analysis on the content of the incoming call to obtain recognition and analysis information;
Step 203: the telephone assistant application, based on the recognition and analysis information, simulates the user's voice and conducts multiple rounds of pre-conversation with the caller, obtains the incoming-call intent information and provides it to the mobile terminal for processing.
In this method, before the phone assistant application performs speech recognition and semantic analysis on the call content to obtain the recognition and analysis information, the method further includes:
Setting call interception rules in the phone assistant application and pre-filtering incoming calls according to those rules: if a call matches the interception rules, it is intercepted directly; if it does not match the interception rules, it is connected directly to the mobile terminal; if it cannot be determined whether the call matches the interception rules, the step in which the phone assistant application performs speech recognition and semantic analysis on the call content to obtain the recognition and analysis information is executed.
Here, the call interception rules are a configured call interception threshold or priority call threshold, or a blacklist and a whitelist set for user identifiers, and are used to perform the direct pre-filtering operation.
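The three-way pre-filter described above can be sketched as follows. This is a minimal illustration, assuming the rules are a blacklist and a whitelist of caller numbers; the names `PreFilterResult` and `prefilter` are hypothetical and do not come from the patent, and checking the blacklist first is an assumption about how a number on both lists would be treated.

```python
from enum import Enum


class PreFilterResult(Enum):
    BLOCK = "block"      # number is blacklisted: intercept the call directly
    CONNECT = "connect"  # number is whitelisted: connect straight to the terminal
    SCREEN = "screen"    # undetermined: hand over to speech recognition and semantic analysis


def prefilter(caller_number, blacklist, whitelist):
    """Pre-filter an incoming call against a blacklist and a whitelist.

    Assumption: the blacklist takes precedence if a number is on both lists.
    """
    if caller_number in blacklist:
        return PreFilterResult.BLOCK
    if caller_number in whitelist:
        return PreFilterResult.CONNECT
    return PreFilterResult.SCREEN
```

Only calls that come back as `SCREEN` would go on to the ASR and NLP stage, which keeps the more expensive analysis off calls whose handling is already known.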
In this method, providing the call intention information to the mobile terminal for processing includes:
When the call is determined from the call intention information to be a harassing call or a call within a preset threshold, hanging up directly and notifying the mobile terminal by SMS;
When the call is determined from the call intention information to be a normal call, having the mobile terminal answer it directly, with the pre-conversation content displayed as text on the user interface of the mobile terminal.
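The two outcomes above can be sketched as a small dispatch function. This is an illustrative sketch only: the intent labels, the `blocked_intents` set standing in for the "preset threshold", and the action names are all assumptions introduced here, not part of the patent.

```python
def dispatch(intent, transcript, blocked_intents):
    """Return the actions for a screened call as a list of (action, payload) tuples.

    intent          -- hypothetical intent label produced by the pre-conversation
    transcript      -- list of pre-conversation lines (strings)
    blocked_intents -- hypothetical stand-in for the user's preset threshold
    """
    if intent == "harassment" or intent in blocked_intents:
        # Harassing or user-blocked call: hang up and notify the terminal by SMS.
        return [("hang_up", None), ("sms", f"Blocked call, intent: {intent}")]
    # Normal call: forward it and show the pre-conversation as text.
    return [("forward", None), ("show_text", "\n".join(transcript))]
```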
In this method, building the phone assistant application using a trained neural network model includes:
Training the neural network model using deep-learning-based automatic speech recognition (ASR) and deep-learning-based natural language processing (NLP);
The phone assistant application performing speech recognition and semantic analysis on the call content to obtain the recognition and analysis information includes:
Converting the incoming call of the mobile terminal from speech to text using ASR, then processing the text using NLP to obtain the analysis information.
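The ASR-then-NLP ordering can be sketched as a two-stage pipeline with pluggable stages. The patent does not name any concrete model or library, so `fake_asr` and `keyword_nlp` below are hypothetical stand-ins (a byte decoder and a keyword matcher) used only to show how the stages connect; a real system would substitute trained models.

```python
from typing import Callable, Dict, List


def analyze_call(audio: bytes,
                 asr: Callable[[bytes], str],
                 nlp: Callable[[str], Dict]) -> Dict:
    """Run speech-to-text first, then language processing on the resulting text."""
    text = asr(audio)
    return {"text": text, "analysis": nlp(text)}


def fake_asr(audio: bytes) -> str:
    # Stand-in for a trained ASR model: pretends the audio is already UTF-8 text.
    return audio.decode("utf-8")


def keyword_nlp(text: str) -> Dict[str, List[str]]:
    # Stand-in for a trained NLP model: matches a small, made-up keyword list.
    known = ["real estate", "loan", "insurance"]
    return {"keywords": [k for k in known if k in text.lower()]}
```

For example, `analyze_call(b"We have a new Real Estate offer", fake_asr, keyword_nlp)` yields the recognized text together with the keyword list `["real estate"]`.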
FIG. 3 is a schematic structural diagram of a system for implementing a telephone assistant provided by an embodiment of the present invention. The system includes a mobile terminal, a caller device, and an intelligent voice call assistant unit, wherein:
The caller device is configured to initiate a call to the mobile terminal;
The intelligent voice call assistant unit is configured to build a phone assistant application using a trained neural network model and to route the incoming call of the mobile terminal to the phone assistant application; the phone assistant application performs speech recognition and semantic analysis on the call content to obtain recognition and analysis information, simulates the user's voice based on that information to hold a multi-round pre-conversation with the caller, obtains the call intention information, and provides it to the mobile terminal for processing;
The mobile terminal is configured to perform the corresponding processing after receiving the call intention information.
In this system, the intelligent voice call assistant unit is further configured to set call interception rules in the phone assistant application and to pre-filter incoming calls according to those rules: if a call matches the interception rules, it is intercepted directly; if it does not match the interception rules, it is connected directly to the mobile terminal; if it cannot be determined whether the call matches the interception rules, the step in which the phone assistant application performs speech recognition and semantic analysis on the call content to obtain the recognition and analysis information is executed.
In this system, the corresponding processing performed by the mobile terminal includes: when the call is determined from the call intention information to be a harassing call or a call within a preset threshold, the call is hung up directly and the mobile terminal is notified by SMS; when the call is determined from the call intention information to be a normal call, the mobile terminal answers it directly, and the pre-conversation content is displayed as text on the user interface of the mobile terminal.
In this system, the intelligent voice call assistant unit building the phone assistant application using a trained neural network model includes: training the neural network model using deep-learning-based ASR and deep-learning-based NLP. The phone assistant application performing speech recognition and semantic analysis on the call content to obtain the recognition and analysis information includes: converting the incoming call of the mobile terminal from speech to text using ASR, then processing the text using NLP to obtain the analysis information.
In this system, the intelligent voice call assistant unit may be deployed in a switch provided by a telecom operator.
FIG. 4 is a schematic structural diagram of a device for implementing a telephone assistant provided by an embodiment of the present invention. The device is the intelligent voice call assistant unit and includes an intelligent voice answering module, an intelligent voice analysis and decision-making module, and an intelligent dialogue gist extraction module, wherein:
The intelligent voice answering module is configured to perform speech recognition and semantic analysis on the incoming call of the connected mobile terminal, send the resulting recognition and analysis information to the intelligent voice analysis and decision-making module, and, as instructed by that module, simulate the user's voice to hold a multi-round pre-conversation with the caller;
The intelligent voice analysis and decision-making module is configured to make decisions based on the recognition and analysis information, instruct the intelligent voice answering module to simulate the user's voice and hold a multi-round pre-conversation with the caller, and extract the content of the multi-round pre-conversation to decide on the call intention information;
The intelligent dialogue gist extraction module is configured to provide the pre-conversation content to the mobile terminal as text.
In this device, the intelligent voice answering module is further configured to set call interception rules and to pre-filter incoming calls according to those rules: if a call matches the interception rules, it is intercepted directly; if it does not match the interception rules, it is connected directly to the mobile terminal; if it cannot be determined whether the call matches the interception rules, the step in which the phone assistant application performs speech recognition and semantic analysis on the call content to obtain the recognition and analysis information is executed.
In this device, the intelligent voice answering module performing speech recognition and semantic analysis on the call content to obtain the recognition and analysis information includes: converting the incoming call of the mobile terminal from speech to text using ASR, then processing the text using NLP to obtain the analysis information.
FIG. 5 is a schematic diagram of an implementation of the intelligent voice call assistant unit provided by an embodiment of the present invention. As shown in the figure, the intelligent voice answering module in the intelligent voice call assistant unit has three parts: speech recognition, semantic analysis, and speech synthesis. After the caller's number passes number verification, the call first enters speech recognition, where deep-learning-based ASR converts the speech into text. The text then enters deep-learning-based NLP, which applies natural language understanding (NLU) and natural language generation (NLG) to infer the call intention information. The resulting intent text is passed to the intelligent voice analysis and decision-making module, which carries out the next operation.
When the intelligent voice analysis and decision-making module decides that a dialogue with the caller is needed, it sends the reply text back to the intelligent voice answering module, where speech synthesis converts the reply text into reply speech. Speech synthesis is itself a fairly mature technology: the user can choose among different voices for replies according to preference, or record their own voice so that the caller feels they are talking with the user in person.
FIG. 6 is a schematic diagram of the natural language processing procedure provided by an embodiment of the present invention, and FIG. 7 is a schematic diagram of the speech synthesis procedure provided by an embodiment of the present invention. The procedures shown in FIG. 6 and FIG. 7 are applied in the process described above.
The intelligent voice analysis and decision-making module provided by the embodiment of the present invention analyzes the intent text passed from the intelligent voice answering module. If it is a general sentence, the intelligent phone assistant generates a reply text on its own and sends it to the intelligent voice answering module. If it is a sentence within the user's threshold range, the intelligent phone assistant hangs up directly and notifies the mobile terminal by SMS. If the analysis shows a normal call that the user of the mobile terminal needs to answer personally, the assistant initiates a call request to the mobile terminal and displays the conversation text on the incoming-call user interface of the mobile terminal, so that the user and the caller can talk seamlessly. FIG. 8 is a schematic diagram of the intelligent voice analysis and decision-making process provided by an embodiment of the present invention.
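The three branches of the decision module can be sketched as below. This is a simplified illustration under stated assumptions: "general sentences" are modeled as an exact-match lookup table, and the "user threshold range" as a set of blocked keywords. Both representations, and the function name `decide`, are hypothetical; the patent describes a trained model rather than lookup tables.

```python
def decide(intent_text, general_replies, blocked_keywords):
    """Pick the next action for an intent text.

    general_replies  -- hypothetical dict: general sentence -> canned reply text
    blocked_keywords -- hypothetical set standing in for the user's threshold range
    Returns an (action, reply_text) tuple.
    """
    # Branch 1: general sentence, so the assistant replies on its own.
    if intent_text in general_replies:
        return ("reply", general_replies[intent_text])
    # Branch 2: within the user's threshold range, so hang up and send an SMS.
    if any(keyword in intent_text for keyword in blocked_keywords):
        return ("hang_up_and_sms", None)
    # Branch 3: normal call, so ring the user and show the conversation text.
    return ("forward_to_user", None)
```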
In the intelligent dialogue gist extraction module provided by the embodiment of the present invention, when the user of the mobile terminal needs to answer in person, the intelligent voice dialogue module sends the text of the preceding dialogue with the caller to the intelligent dialogue gist extraction module. Drawing on existing large datasets, the intelligent dialogue gist extraction module passes the dialogue text through joint models to obtain its gist. RecNN+Viterbi is used as an example here, as shown in FIG. 9 and FIG. 10: FIG. 9 is a schematic diagram of the basic Recursive NN model provided by an embodiment of the present invention, and FIG. 10 is a schematic diagram of slot-based recognition provided by an embodiment of the present invention.
The input from the call is a single word vector (a later optimization uses a window of word vectors as input). Each part-of-speech tag is treated as a weight vector, so the computation for each word along its path is simply the dot product of the word vector and the part-of-speech weight vector. The squares in the figure are the results of the dot product of the part-of-speech weight vector and the input vector. When a parent node has multiple child branches, its value can be viewed as the sum of the products of each branch with the weight. For example, the path of the word "in" in the semantic parse tree is "IN-PP-NP"; a weighted operation over the output vectors along this path yields the path feature, and the concatenation (concat) of the path features of three words is used as the tri-path feature for slot classification, which gives a prediction for "in".
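The path-feature construction can be illustrated with a toy computation. The text speaks of a dot product but also of output vectors along the path; this sketch assumes an element-wise product so that each node on the path outputs a vector, and it assumes the "weighted operation" is a weighted sum with hypothetical mixing coefficients `mix`. All numbers and names are made up for illustration.

```python
def elementwise(u, v):
    """Element-wise product of two equal-length vectors (assumed node operation)."""
    return [a * b for a, b in zip(u, v)]


def path_feature(word_vec, path_tags, tag_weights, mix):
    """Weighted sum of the node outputs along a parse-tree path such as IN-PP-NP."""
    outputs = [elementwise(word_vec, tag_weights[tag]) for tag in path_tags]
    dim = len(word_vec)
    return [sum(m * out[i] for m, out in zip(mix, outputs)) for i in range(dim)]


def tri_path_feature(prev_feat, cur_feat, next_feat):
    """Concatenate the path features of three neighbouring words."""
    return prev_feat + cur_feat + next_feat


# Toy weights for the tags on the path of the word "in".
tag_weights = {"IN": [1.0, 1.0], "PP": [2.0, 0.0], "NP": [0.0, 1.0]}
feat_in = path_feature([1.0, 2.0], ["IN", "PP", "NP"], tag_weights, [1, 1, 1])
```

With these toy values the node outputs are `[1.0, 2.0]`, `[2.0, 0.0]`, and `[0.0, 2.0]`, so `feat_in` is `[3.0, 4.0]`. A slot classifier, followed by Viterbi decoding over the tag sequence, would then consume the concatenated tri-path feature; that part is not sketched here.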
Three specific examples are given below to describe the embodiments of the present invention in detail.
Example 1: Intercepting an advertising sales call
FIG. 11 is a schematic diagram of specific implementation example 1 of the telephone assistant provided by an embodiment of the present invention, which includes:
Step 1: The phone assistant application receives an incoming call, which first enters the intelligent voice answering module;
Step 2: In the intelligent voice answering module, after ASR recognition, the utterance enters the NLU module, which analyzes its words and sentences to understand its semantics;
Step 3: The utterance enters the NLG module, which renders the parsed semantics as the corresponding keyword text "real estate";
Step 4: The intelligent voice answering module sends the resulting intent text "real estate" to the intelligent voice analysis and decision-making module;
Step 5: The intelligent voice analysis and decision-making module concludes from the user's previous habits that the user wants to filter out calls related to "real estate";
Step 6: The intelligent voice analysis and decision-making module sends the reply text "Thanks, not needed" to the intelligent voice answering module;
Step 7: The intelligent voice answering module uses speech synthesis to turn the reply text "Thanks, not needed" into the user's voice and speaks to the caller;
Step 8: The intelligent voice analysis and decision-making module hangs up and sends a message to the mobile terminal used by the user.
Example 2: Letting a friend's call through
FIG. 12 is a schematic diagram of specific implementation example 2 of the telephone assistant provided by an embodiment of the present invention, which includes:
Step 1: The phone assistant application receives an incoming call, which first enters the intelligent voice answering module;
Step 2: In the intelligent voice answering module, after ASR recognition, the utterance enters the NLU module, which analyzes its words and sentences to understand its semantics;
Step 3: The utterance enters the NLG module, which renders the parsed semantics as the corresponding keyword text "Xiao Liang, are you free tonight";
Step 4: The intelligent voice answering module sends the resulting intent text "Xiao Liang, are you free tonight" to the intelligent voice analysis and decision-making module;
Step 5: The intelligent voice analysis and decision-making module concludes, from extensive prior training, that the text is an ordinary greeting;
Step 6: The intelligent voice analysis and decision-making module replies on its own and sends the reply text "What's up?" to the intelligent voice answering module;
Step 7: The intelligent voice answering module uses speech synthesis to turn the reply text "What's up?" into the user's voice and speaks to the caller;
Step 8: The caller starts the second round: "I'm on a business trip in Nanjing. Tonight let's go near the school and get Xiaowei and Huasheng to come out for a couple of drinks";
Step 9: The intelligent voice answering module extracts the intent text "tonight, near the school, find Xiaowei and Huasheng for drinks";
Step 10: The intelligent voice analysis and decision-making module determines that this is a question that the user needs to answer personally;
Step 11: The intelligent voice analysis and decision-making module sends the dialogue text "Xiao Liang, are you free tonight; tonight, find Xiaowei and Huasheng for drinks" to the intelligent dialogue gist extraction module;
Step 12: The intelligent dialogue gist extraction module runs the intent text from the pre-conversation through a deep learning model to obtain the gist;
Step 13: The intelligent voice analysis and decision-making module transfers the call to the mobile terminal and displays the gist, so that the user of the mobile terminal knows the caller's intention in advance, can make a preliminary judgment, and can talk with the caller seamlessly.
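The multi-round flow of Example 2 can be sketched as a loop that keeps replying until the decision is no longer "reply", collecting the transcript along the way. This is an assumption-laden sketch: `screen_call`, the `decide` callback signature, and the `max_rounds` cap are hypothetical and only illustrate the control flow, not the patent's actual models.

```python
def screen_call(caller_turns, decide, max_rounds=3):
    """Hold a pre-conversation and return (final_action, transcript).

    caller_turns -- iterable of caller utterances, already converted to text
    decide       -- callback: utterance -> (action, reply_text)
    max_rounds   -- hypothetical cap on assistant-handled rounds
    """
    transcript = []
    for round_no, utterance in enumerate(caller_turns, start=1):
        transcript.append(("caller", utterance))
        action, reply = decide(utterance)
        if action != "reply":
            return action, transcript
        if round_no >= max_rounds:
            # Too many rounds: stop replying and let the user take over.
            return "forward_to_user", transcript
        transcript.append(("assistant", reply))
    # Caller said nothing further: hand the call to the user with the transcript.
    return "forward_to_user", transcript
```

The returned transcript is what would be passed on for gist extraction and shown on the mobile terminal before the user picks up.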
Example 3: Proactive phone reminder based on a memo record
The specific process of this example is:
Step 1: From the user's memo record, the phone assistant application knows that the user and a friend, Xiaoliang, are going to have a meal together on Friday;
Step 2: By Thursday, the user has not yet called Xiaoliang to discuss the meal;
Step 3: On Thursday, the phone assistant application proactively calls the user to remind them of Friday's meal;
Step 4: The phone assistant application asks the user whether they want to talk to Xiaoliang;
Step 5: When the user agrees, the phone assistant application proactively calls Xiaoliang and arranges the meal with him.
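The trigger condition in Example 3 can be sketched as a simple date check. The one-day lead time and the `already_contacted` flag are assumptions drawn from the Thursday/Friday scenario above; the function name is hypothetical.

```python
from datetime import date, timedelta


def should_remind(event_date, today, already_contacted, lead_days=1):
    """Remind the user lead_days before the event if no call about it has happened yet."""
    return (not already_contacted) and (event_date - today) == timedelta(days=lead_days)


# Friday meal, checked on the Thursday before, no call made yet.
friday = date(2020, 1, 10)
thursday = date(2020, 1, 9)
remind = should_remind(friday, thursday, already_contacted=False)
```

Here `remind` is `True`; it would be `False` if the user had already called Xiaoliang or if the check ran on another day.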
It can be seen that the voice assistant application provided by the embodiment of the present invention performs speech recognition on the incoming call, uses NLP to obtain the corresponding keyword text, and compares it with the text thresholds already stored in the database. This filters out advertising sales calls and supports user-defined interception, and it also lets the purpose of a normal call be established in a pre-conversation before the user answers, so that the user can talk with the caller more effectively. The intelligent phone assistant greatly facilitates the user and improves the user experience.
The above descriptions are only preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010024599.8A CN111246008A (en) | 2020-01-10 | 2020-01-10 | Method, system and device for realizing telephone assistant |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111246008A true CN111246008A (en) | 2020-06-05 |
Family
ID=70869554
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010024599.8A Pending CN111246008A (en) | 2020-01-10 | 2020-01-10 | Method, system and device for realizing telephone assistant |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111246008A (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200605 |