TWI698756B

TWI698756B - System for inquiry service and method thereof

Info

Publication number: TWI698756B
Application number: TW107139680A
Authority: TW
Inventors: 翁健欽
Original assignee: 中華電信股份有限公司
Priority date: 2018-11-08
Filing date: 2018-11-08
Publication date: 2020-07-11
Also published as: TW202018529A

Abstract

A system for inquiry service and a method thereof are disclosed, including: using an interaction device to automate the conversation with the client and obtaining the inquiry information of the client, the inquiry information is automatically analyzed by the voice recognition device, the keyword analysis and determination device, and the big data analysis device to obtain a recommended store data corresponding to the inquiry information.

Description

System and method for query service

本發明係有關一種查詢服務技術，尤指一種自動化查詢服務之系統與方法。 The present invention relates to a query service technology, especially a system and method for automatic query service.

傳統的電話查詢服務屬高人力密集度且重複性質極高的服務，由查詢服務人員詢問客戶，聽取完問題之後，才能開始進行查詢服務，且過程中有可能會遇到新進人員，若新進人員訓練不足而無法快速找到該問題所對應的答案時，會耽誤了不少查詢服務的時間；其次，當線上之查詢服務人員的話務繁忙時，會導致欲查詢之客戶必須再線上等待查詢服務人員的接通才能進行查詢服務，而浪費不少等待時間。 The traditional telephone enquiry service is a highly labor-intensive and repetitive service. The enquiry service staff asks the customer. After listening to the question, the enquiry service can be started, and new staff may be encountered in the process. If the new staff Insufficient training to quickly find the answer to the question will delay a lot of query service time; secondly, when the online query service staff is busy, it will cause the customer who wants to query to wait for the query service online again Only when the personnel are connected can the inquiry service be performed, and a lot of waiting time is wasted.

因此，如何自動化的進行電話查詢服務且精確且快速地提供答案即為目前所亟待解決的課題之一。 Therefore, how to automatically perform telephone inquiry services and provide answers accurately and quickly is one of the urgent problems to be solved.

為克服習知技術之缺失，本發明係提供查詢服務之系統，係包括：互動裝置，係取得客戶端要求查詢服務所發出的查詢資訊，其中，該查詢資料包含語音查詢資料；語音辨識裝置，係將該互動裝置所取得該查詢資訊中的語音查詢資料轉換成文字查詢資料；關鍵字分析判定裝置，係接收該文字查詢資料，並從字典檔資料庫中找出與該文字查詢資料相同的資料，以得到關鍵字特徵，其中，該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別；以及大數據分析裝置，係接收該關鍵字分析判定裝置所得的該關鍵字特徵，且該大數據分析裝置係包含：工商店家資料庫，係儲存工商店家資料，該工商店家資料包含招牌名稱、縣市名稱、路名或行業別；工商店家比對模組，係從該工商店家資料庫中找出與該關鍵字特徵中的招牌名稱之讀音相同且行業別、縣市名稱及路名之至少一相同者，以得到關聯工商店家資料；歷史查詢資料庫，係儲存該查詢服務的歷史查詢紀錄；及第一關聯店家推薦模組，係透過逆文檔頻率演算法計算該關聯店家資料於該歷史查詢資料庫中的被查詢頻率，以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料，進而推薦該第一類推薦店家資料給該客戶端。 In order to overcome the deficiencies of the conventional technology, the present invention provides a query service system, which includes: an interactive device for obtaining query information sent by the client requesting the query service, wherein the query data includes voice query data; The phonetic recognition device converts the voice query data in the query information obtained by the interactive device into text query data; the keyword analysis and determination device receives the text query data, and finds the text from the dictionary file database Query data with the same data to obtain keyword features, where the keyword features include signboard names, county names, road names, or industry categories; and a big data analysis device that receives the key from the keyword analysis and determination device Character features, and the big data analysis device includes: a storehouse database, which stores storehouse data, the storehouse data includes signboard names, county names, road names or industry categories; the storehouse comparison module, from Find out the same pronunciation of the sign name in the keyword feature and at least one of the industry category, county and city name and road name in the industrial store database to obtain related industrial store information; the historical query database is stored The historical query record of the query service; and the recommendation module of the first associated store, which uses the inverse document frequency algorithm to calculate the query frequency of the associated store data in the historical query database, so that the query frequency is greater than the first predicted Set the threshold value of the associated store information as the first type of recommended store information, and then recommend the first type of recommended store information to the client.

於一實施例中，該歷史查詢紀錄包含客戶滿意度，且該大數據分析裝置更包括：第二關聯店家推薦模組，係當該第一關聯店家推薦模組無法取得該第一類推薦店家資料時，從該歷史查詢資料庫中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄，以得到關聯歷史查詢紀錄，並將該關聯歷史查詢紀錄進行Apriori機器學習演算法計算取得一推薦值，以令該推薦值大於第二預設門檻值的該關聯歷史查詢紀錄為第二類推薦店家資料，進而推薦該第二類推薦店家資料給該客戶端。 In one embodiment, the historical query record includes customer satisfaction, and the big data analysis device further includes: a second associated store recommendation module, when the first associated store recommendation module cannot obtain the first type of recommended store When collecting data, find historical query records that are the same as the keyword and have good customer satisfaction from the historical query database to obtain related historical query records, and perform Apriori machine learning algorithm calculations on the related historical query records Obtain a recommended value so that the recommended value is greater than the second preset threshold The associated history query record is the second type of recommended store information, and then the second type of recommended store information is recommended to the client.

於一實施例中，該大數據分析裝置更包括：反饋資訊模組，係將該客戶端要求該查詢服務至推薦該推薦店家資料給該客戶端之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 In one embodiment, the big data analysis device further includes: a feedback information module, which stores the processing between the client requesting the query service and recommending the recommended store data to the client as the historical query database The historical query record in.

於一實施例中，該查詢資訊更包含該客戶端之話機線路所在縣市，且該文字查詢資料中無縣市名稱或路名時，該關鍵字分析判定裝置將該客戶端之話機線路所在縣市納入該關鍵字特徵中。 In one embodiment, the query information further includes the county or city where the phone line of the client is located, and when there is no county or city name or road name in the text query data, the keyword analysis and determination device is where the phone line of the client is located Counties and cities are included in this keyword feature.

於一實施例中，該字典檔資料庫包含縣市名稱字典檔、路名字典檔、行業名稱字典檔、招牌名稱字典檔、景點名稱字典檔、俗名簡稱字典檔、破音字字典檔或行業別同義詞字典檔。 In one embodiment, the dictionary file database includes a county and city name dictionary file, a road name dictionary file, an industry name dictionary file, a sign name dictionary file, a scenic spot name dictionary file, a common name abbreviation dictionary file, a broken sound word dictionary file, or an industry category Synonym dictionary file.

本發明另提供一種查詢服務之方法，係包括下列步驟：(1)取得客戶端要求查詢服務的查詢資訊，其中，該查詢資料包含語音查詢資料；(2)將該查詢資訊中的語音查詢資料轉換成文字查詢資料；(3)從字典檔資料庫中找出與該文字查詢資料相同的資料，並將該相同的資料中的贅詞過濾，以得到關鍵字特徵，其中，該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別；(4)從該工商店家資料庫中找出與該關鍵字特徵中的招牌名稱之讀音相同且行業別、縣市名稱及路名之至少一相同者，以得到關聯工商店家資料；(5)利用逆文檔頻率演算法計算該關聯店家資料於歷史查詢資料庫中的被查詢頻率，以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料；以及(6)將該推薦店家資料推薦給該客戶端。 The present invention also provides a method for query service, which includes the following steps: (1) obtain query information requested by the client for the query service, wherein the query data includes voice query data; (2) the voice query data in the query information Converted into text query data; (3) Find the same data as the text query data from the dictionary file database, and filter the redundant words in the same data to obtain the keyword features, where the keyword features Including the name of the signboard, the name of the county and city, the name of the road or the industry; (4) Find out the same pronunciation of the name of the sign in the keyword feature and at least the name of the industry, county and city from the database of the shop One is the same to obtain the related store information; (5) Use the inverse document frequency algorithm to calculate the related store information in The queried frequency in the historical query database is such that the associated store information whose frequency of query is greater than the first preset threshold is the first type of recommended store information; and (6) the recommended store information is recommended to the client.

於一實施例中，當該步驟(5)無法利用該逆文檔頻率演算法取得該第一類推薦店家資料時，則從該歷史查詢資料庫中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄，以得到關聯歷史查詢紀錄，進而將該關聯歷史查詢紀錄進行Apriori機器學習演算法計算取得一推薦值，以令該推薦值大於第二預設門檻值的該關聯歷史查詢紀錄為第二類推薦店家資料。 In one embodiment, when the step (5) is unable to use the inverse document frequency algorithm to obtain the first type of recommended store information, then find out from the historical query database the same as the keyword characteristics and customer satisfaction It is a good historical query record to obtain the related historical query record, and then the related historical query record is calculated by the Apriori machine learning algorithm to obtain a recommended value, so that the recommended value is greater than the second preset threshold value of the related historical query The record is the second type of recommended store information.

於一實施例中，該方法更包括下列步驟：(7)將該客戶端要求該查詢服務至推薦該推薦店家資料給該客戶端之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 In one embodiment, the method further includes the following steps: (7) The process between the client requesting the query service and recommending the recommended store information to the client is stored as the history in the historical query database Query records.

於一實施例中，該查詢資訊更包含該客戶端之話機線路所在縣市，且該語音查詢資料中無縣市名稱或路名時，令該關鍵字分析判定部將該客戶端之話機線路所在縣市納入該關鍵字特徵中。 In one embodiment, the query information further includes the county or city where the phone line of the client is located, and when there is no county or city name or road name in the voice query data, the keyword analysis and determination unit is made to make the phone line of the client The county where you live is included in this keyword feature.

由上可知，本發明透過互動裝置進行自動化對答，主動對客戶端問候，並取得客戶端的查詢資訊，降低客戶端等候太久的不良感受，後續再透過語音辨識裝置、關鍵字分析判定裝置及大數據分析裝置將客戶端的查詢資料的自動化分析取得該查詢資訊所對應的推薦店家資料，以改善客服人員訓練程度不一，容易造成查詢錯誤，且人員離職流動率高，造成學習成本大幅提高的問題，且該大數據分析裝置中的反饋資訊模組係同時將系統及客服人員的處理結果回饋至歷史查詢資料庫中，藉此提高由該歷史查詢資料庫中產出推薦店家資料的正確率，並精確且快速地回覆至客戶端。 It can be seen from the above that the present invention uses an interactive device to perform automated answering, actively greet the client, and obtain query information from the client, reducing the client If you wait too long for the bad feelings, then use the voice recognition device, keyword analysis and judgment device and big data analysis device to automatically analyze the query data of the client to obtain the recommended store data corresponding to the query information, so as to improve the training level of the customer service staff. 1. It is easy to cause query errors, and the turnover rate of personnel is high, resulting in a significant increase in learning costs, and the feedback information module in the big data analysis device simultaneously returns the processing results of the system and customer service personnel to the historical query database In this way, the accuracy rate of recommended store information generated from the historical query database is improved, and the response to the client is accurately and quickly returned.

1‧‧‧互動裝置 1‧‧‧Interactive device

2‧‧‧語音辨識裝置 2‧‧‧Voice recognition device

3‧‧‧關鍵字分析判定裝置 3‧‧‧Keyword analysis and judgment device

4‧‧‧大數據分析裝置 4‧‧‧Big data analysis device

5‧‧‧文字轉語音模組 5‧‧‧Text-to-speech module

6‧‧‧客戶端 6‧‧‧Client

7‧‧‧交換機 7‧‧‧Switch

8‧‧‧客服人員 8‧‧‧Customer Service Staff

11‧‧‧中控模組 11‧‧‧Central Control Module

12‧‧‧互動模組 12‧‧‧Interactive Module

21‧‧‧信號處理模組 21‧‧‧Signal processing module

22‧‧‧語言分類模組 22‧‧‧Language Classification Module

23‧‧‧語音轉文字模組 23‧‧‧Voice to text module

24‧‧‧發音字典檔 24‧‧‧ pronunciation dictionary file

31‧‧‧字典檔資料庫 31‧‧‧Dictionary file database

32‧‧‧語意分析模組 32‧‧‧Semantic Analysis Module

33‧‧‧分詞處理模組 33‧‧‧Word Segmentation Processing Module

34‧‧‧關鍵字擷取模組 34‧‧‧Keyword Extraction Module

41‧‧‧工商店家資料庫 41‧‧‧Workshop Home Database

42‧‧‧工商店家比對模組 42‧‧‧Industry shop home comparison module

43‧‧‧歷史查詢資料庫 43‧‧‧History query database

44‧‧‧第一關聯店家推薦模組 44‧‧‧The first related store recommendation module

45‧‧‧第二關聯店家推薦模組 45‧‧‧Recommendation module of the second related store

46‧‧‧反饋資訊模組 46‧‧‧Feedback Information Module

100‧‧‧系統 100‧‧‧System

200‧‧‧方法 200‧‧‧Method

S201~S208‧‧‧步驟 S201~S208‧‧‧Step

第1圖為本發明之查詢服務之系統之示意圖；以及第2圖為本發明之查詢服務之方法之步驟流程圖。 Figure 1 is a schematic diagram of the query service system of the present invention; and Figure 2 is a flowchart of the steps of the query service method of the present invention.

以下藉由特定的具體實施例說明本發明之實施方式，熟悉此技藝之人士可由本說明書所揭示之內容輕易地瞭解本發明之其他優點及功效。 The following specific examples illustrate the implementation of the present invention. Those familiar with the art can easily understand the other advantages and effects of the present invention from the content disclosed in this specification.

須知，本說明書所附圖式所繪示之結構、比例、大小等，均僅用以配合說明書所揭示之內容，以供熟悉此技藝之人士之瞭解與閱讀，並非用以限定本發明可實施之限定條件，故不具技術上之實質意義，任何結構之修飾、比例關係之改變或大小之調整，在不影響本發明所能產生之功效及所能達成之目的下，均應仍落在本發明所揭示之技術內容得能涵蓋之範圍內。同時，本說明書中所引用之如「第一」、「第二」及「一」等之用語，亦僅為便於敘述之明瞭，而非用以限定本發明可實施之範圍，其相對關係之改變或調整，在無實質變更技術內容下，當視為本發明可實施之範疇。 It should be noted that the structures, proportions, sizes, etc. shown in the drawings in this manual are only used to match the contents disclosed in the manual for the understanding and reading of those familiar with the art, and are not intended to limit the implementation of the present invention. Therefore, it does not have any technical significance. Any structural modification, proportional relationship change, or size adjustment should still fall within the scope of the present invention without affecting the effects and objectives that can be achieved. The technical content disclosed by the invention can be covered. At the same time, the terms such as "first", "second" and "one" cited in this manual are only for ease of description. It is not intended to limit the scope of implementation of the present invention, and the change or adjustment of its relative relationship shall be regarded as the scope of implementation of the present invention without substantial changes to the technical content.

請參閱第1圖所示，係本發明之查詢服務之系統100之示意圖。該系統係包括互動裝置1、語音辨識裝置2、關鍵字分析判定裝置3、大數據分析裝置4及文字轉語音模組5，其中，互動裝置1、語音辨識裝置2、關鍵字分析判定裝置3、大數據分析裝置4可實作在相同設備中彼此電性連接，亦可實作在不同設備中彼此電性連接。 Please refer to Figure 1, which is a schematic diagram of the query service system 100 of the present invention. The system includes an interactive device 1, a voice recognition device 2, a keyword analysis and determination device 3, a big data analysis device 4, and a text-to-speech module 5. The interactive device 1, a voice recognition device 2, and a keyword analysis and determination device 3 The big data analysis device 4 can be implemented in the same device to be electrically connected to each other, or implemented in different devices to be electrically connected to each other.

互動裝置1係於客戶端6向系統100要求查詢服務時，與客戶端6進行對話互動，其中，互動裝置1係包含中控模組11及互動模組12。 The interactive device 1 interacts with the client 6 when the client 6 requests a query service from the system 100. The interactive device 1 includes a central control module 11 and an interactive module 12.

於一實施例中，客戶端6係透過交換機7向系統100提出查詢服務的要求，但不以此為限。 In one embodiment, the client 6 requests the system 100 to query the service through the switch 7, but it is not limited to this.

於一實施例中，客戶端6係為市內電話機或行動裝置，但不以此為限。 In one embodiment, the client 6 is a local telephone or a mobile device, but it is not limited to this.

中控模組11，係接收客戶端6向系統100要求該查詢服務所發出的查詢資訊，並負責整個該查詢作業中系統100與客戶端6互動的流程控制。 The central control module 11 receives the inquiry information sent by the client 6 to the system 100 for the inquiry service, and is responsible for the process control of the interaction between the system 100 and the client 6 in the entire inquiry operation.

於一實施例中，該查詢資訊包含客戶端6的進線號碼、話機線路所在縣市、進線裝置、進線查詢時間及語音查詢資料。 In one embodiment, the query information includes the incoming line number of the client terminal 6, the county or city where the phone line is located, the incoming line device, the incoming line query time, and the voice query data.

互動模組12，係內建一對答語句自動生成模型，並令該對答語句自動生成模型產生與客戶端6對話互動的對話字串，該對答語句自動生成模型係內建自然人聊天所需的語料庫，該語料庫內容包含了大量的上下文(context)及回覆(response)，當客戶端6的該語音查詢資料轉換成文字查詢資料並輸入該對答語句自動生成模型時，該對答語句自動生成模型會拆解比對輸入的上下文，並從該語料庫中選擇最佳的回覆為該對話字串。此互動模組12的主要功能是為了與客戶端6保持互動，引導客戶端回答問題。 The interactive module 12 is a built-in one-to-one answer sentence automatic generation model, and makes the answering sentence automatic generation model generate a dialogue interactive with the client 6 Character string, the automatic generation model of the answer sentence is built-in the corpus required for natural persons to chat. The content of the corpus contains a large number of contexts and responses. When the voice query data of the client 6 is converted into text query data When inputting the answer sentence automatic generation model, the answer sentence automatic generation model will disassemble the context of the comparison input, and select the best reply from the corpus as the dialogue string. The main function of the interaction module 12 is to maintain interaction with the client 6 and guide the client to answer questions.

於一實施例中，該對答語句自動生成模型透過機器學習演算法訓練產生，其中，該機器學習演算法包含長短期記憶(LSTN)或遞歸神經網絡(RNN)。 In one embodiment, the answer sentence automatic generation model is generated through machine learning algorithm training, wherein the machine learning algorithm includes long-term short-term memory (LSTN) or recurrent neural network (RNN).

文字轉語音模組5，係接收互動模組12所產生的該對話字串，並將該對話字串轉換成對話語音，以將該對話語音傳送至客戶端6。 The text-to-speech module 5 receives the dialogue string generated by the interactive module 12 and converts the dialogue string into a dialogue voice to transmit the dialogue voice to the client 6.

於一實施例中，文字轉語音模組5係透過交換機7將該回覆語音傳送至客戶端6。 In one embodiment, the text-to-speech module 5 transmits the reply voice to the client 6 through the switch 7.

在客戶端6向系統100提出查詢服務的要求時，中控模組11先呼叫互動模組12產出對話字串，並將該對話字串透過5 TTS文字轉語音元件將該對話字串轉為語音播放至客戶端6，引導客戶回答問題。過程中會不斷呼叫語音辨識裝置2來分析客戶端6的回答，並確認是否有產出辨識結果，並重複此問答過程數次，若語音辨識裝置2辨識結果可產出查詢用的文字查詢資料，則繼續交付後台(即關鍵字分析判定裝置及大數據分析裝置)進行查詢流程；若過程中無法辨識出有效資訊，或者語意分析後的意圖並非查詢，就會將此話務轉交客服人員8服務。 When the client 6 makes a service inquiry request to the system 100, the central control module 11 first calls the interactive module 12 to generate a dialogue string, and converts the dialogue string through the 5 TTS text-to-speech component Play the voice to the client 6 to guide the customer to answer questions. During the process, the voice recognition device 2 will be called continuously to analyze the answer of the client 6 and confirm whether there is a recognition result, and the question and answer process will be repeated several times. If the recognition result of the voice recognition device 2 can produce text query data for query , It will continue to be delivered to the background (that is, the keyword analysis and determination device and the big data analysis device) for the query process; if the effective information cannot be identified in the process, or the intention after the semantic analysis is not the query Inquiry, the call will be forwarded to customer service staff 8 for service.

語音辨識裝置2，係接收互動裝置1所取得該查詢資訊，以將該查詢資訊中的語音查詢資料轉換成該文字查詢資料，其中，語音辨識裝置2包含信號處理模組21、語音轉文字模組23及發音字典檔24。 The voice recognition device 2 receives the query information obtained by the interactive device 1 to convert the voice query data in the query information into the text query data. The voice recognition device 2 includes a signal processing module 21 and a voice-to-text model Group 23 and pronunciation dictionary file 24.

信號處理模組21，係使用數位訊號處理(Digital Signal Processing,DSP)去除掉該語音查詢資料中的雜訊。 The signal processing module 21 uses digital signal processing (DSP) to remove noise in the voice query data.

發音字典檔24，係儲存詞彙的發音組合，以依據該組合提供單字轉音素或音素轉單字的應用，其中，該音素為語言中的最小發音單位，例如國語的注音符號或英語的音標，但不以此為限。 The pronunciation dictionary file 24 stores the pronunciation combinations of words to provide the application of word-to-phoneme or phoneme-to-character conversion according to the combination. The phoneme is the smallest pronunciation unit in the language, such as the phonetic symbols of Mandarin or the phonetic symbols of English. Not limited to this.

於一實施例中，發音字典檔24係為國語或英語的發音字典檔，亦可為其他語系的發音字典檔，但不限於此。 In one embodiment, the pronunciation dictionary file 24 is a pronunciation dictionary file of Mandarin or English, and may also be a pronunciation dictionary file of other languages, but is not limited to this.

語音轉文字模組23，係將已去除掉雜訊的該語音查詢資料轉換成該文字查詢資料。 The voice-to-text module 23 converts the voice query data from which noise has been removed into the text query data.

於一實施例中，語音轉文字模組23係根據己經訓練好的聲學模型(HMM)及發音字典檔24，並配合使用神經網路模型BPNN(Backpropagation Neural Network)建立一第一類神經網路來對已去除掉雜訊的該語音查詢資料轉換成單字組合，接著，依據已訓練好的語言模型及配合該神經網路模型BPNN建立第二類神經網路來對該單字組合轉換成完整的文字句子，並令該完整的文字句子為該文字查詢資料。 In one embodiment, the speech-to-text module 23 builds a first-type neural network based on the trained acoustic model (HMM) and pronunciation dictionary file 24, together with the neural network model BPNN (Backpropagation Neural Network) Lulai converts the voice query data from which the noise has been removed into a single word combination, and then builds a second type of neural network based on the trained language model and the neural network model BPNN to convert the single word combination into a complete And make the complete text sentence as the text query data.

於該第一類神經網路中，神經網路模型BPNN配合聲學模型的作用是將已去除掉雜訊的該語音查詢資料換成複數個該音素，神經網路模型BPNN配合發音字典檔24的作用是將該複數個音素轉換成複數個字或詞的該單字組合。 In the first type of neural network, the neural network model BPNN cooperates with the sound The function of the learning model is to replace the voice query data with noise removed into plural phonemes. The function of the neural network model BPNN with the pronunciation dictionary file 24 is to convert the plural phonemes into plural words or words. Single word combination.

於該第二類神經網路中，神經網路模型BPNN配合語言模型的作用是將字或詞(例如該單字組合)轉換成該文字句子。 In the second type of neural network, the function of the neural network model BPNN in conjunction with the language model is to convert the word or word (for example, the combination of single characters) into the text sentence.

於一實施例中，語音辨識裝置2更包括語言分類模組22，但不以此為限。 In one embodiment, the speech recognition device 2 further includes a language classification module 22, but it is not limited thereto.

語言分類模組22，係依據該第一類神經網路來對已去除掉雜訊的該語音查詢資料進行語言分類，其中，該語言分類為國語或英語，但不限於此。 The language classification module 22 performs language classification on the voice query data from which noise has been removed according to the first type of neural network, wherein the language classification is Mandarin or English, but not limited to this.

於一實施例中，語言分類模組22係整合在語音轉文字模組23，以供該第一類神經網路來對已去除掉雜訊的該語音查詢資料進行該語言分類及產生該單字組合，但不限於此。 In one embodiment, the language classification module 22 is integrated in the speech-to-text module 23 for the first type of neural network to perform the language classification on the voice query data from which noise has been removed and generate the word Combination, but not limited to this.

關鍵字分析判定裝置3，係接收語音轉文字模組23所得的該文字查詢資料，以從該文字查詢資料中找出關鍵字特徵，其中，關鍵字分析判定裝置3包含字典檔資料庫31、語意分析模組32、分詞處理模組33、關鍵字擷取模組34。 The keyword analysis and determination device 3 receives the text query data obtained by the speech-to-text module 23 to find the keyword characteristics from the text query data. The keyword analysis and determination device 3 includes a dictionary file database 31, The semantic analysis module 32, the word segmentation processing module 33, and the keyword extraction module 34.

字典檔資料庫31，係包含縣市名稱字典檔、路名字典檔、行業名稱字典檔、招牌名稱字典檔、景點名稱字典檔、破音字字典檔、俗名簡稱字典檔(例如：行天宮=恩主宮等資訊)或行業別同義詞字典檔，其中，同義詞字典檔的內容就是記錄中文詞彙可能的轉換，例如小吃店的同義詞也可以稱呼為餐廳、飲食店及簡餐，而iPhone的同義詞為哀鳳、愛瘋、智慧手機、蘋果，但不以此為限。 The dictionary file database 31 contains the county and city name dictionary file, the road name dictionary file, the industry name dictionary file, the sign name dictionary file, the scenic spot name dictionary file, the broken sound word dictionary file, the common name abbreviation dictionary file (for example: Xingtian Palace=恩The main palace and other information) or industry-specific synonym dictionary files, where the content of the synonym dictionary file is to record the possible conversion of Chinese vocabulary. For example, synonyms for snack bars can also be used. The names are restaurants, eateries, and simple meals, and the synonyms of iPhone are Aifeng, Crazy, Smart Phone, Apple, but not limited to this.

語意分析模組32，係從該文字查詢資料中辨識客戶端6的查詢意圖，其查詢意圖的辨識方式係從主詞及動詞去判斷，例如：我想要查、請幫我查、我要查、我想查、幫我找或我想找等等，即可認定具有該查詢意圖，但不以此為限。 The semantic analysis module 32 identifies the query intent of the client 6 from the text query data. The method of identifying the query intent is to determine the subject and verb, for example: I want to check, please help me check, I want to check , I want to check, help me find, or I want to find, etc., you can determine the intent of the query, but not limited to this.

由於系統100的任務很明確，就是請客戶端6提供店家名稱或景點名稱，然後幫客戶端6進行查詢相關電話資訊，因此客戶端6問句偏向短問句，諸如：我想要查XXX餐廳的電話、幫我查XX醫院的電話、我要找最近上過新聞的XXX的電話、可以幫我找XXX嗎或我想知道XX的電話等等，問法有很多種，但是其意圖都是一樣的，因此語意分析模組32僅需要進行判斷該文字查詢資料是否為查詢即可，若無法判斷查詢意圖，則由中控模組11控制將與客戶端6進行的對話互動轉交客服人員8處理。 Since the task of the system 100 is very clear, it is to ask the client 6 to provide the store name or the name of the scenic spot, and then help the client 6 to query related telephone information, so the client 6 questions tend to be short questions, such as: I want to check XXX restaurants The phone number of XX, the phone number of XX hospital for me, the phone number of XXX who has been on the news recently, can you find XXX for me or I want to know the phone number of XX, etc. There are many ways to ask, but the intentions are all The same, therefore, the semantic analysis module 32 only needs to determine whether the text query data is a query. If the query intention cannot be determined, the central control module 11 will control the dialogue interaction with the client 6 to the customer service staff 8 deal with.

分詞處理模組33，係對該文字查詢資料進行不同字元數的切詞，以得到不同字元數的詞語組合，然後比對字典檔資料庫31中是否有相符的該詞語組合出現，並將比對相符的該詞語組合輸出為關鍵字陣列。 The word segmentation processing module 33 performs word segmentation with different numbers of characters on the text query data to obtain word combinations with different numbers of characters, and then compares whether there is a matching word combination in the dictionary file database 31, and Output the matched word combination as a keyword array.

關鍵字擷取模組34，係將該關鍵字陣列中的贅詞刪除，以得到該關鍵字特徵，其中，該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別，該贅詞為一般對話裡常出現高頻率詞彙，例如主詞(你、我、他、小姐、先生、你好、)、嘆詞(ㄟ、啊、呀、喔、哦、噢、唷、喲…)、動詞(想要、想查、想知道、幫我找、請幫我、告訴我、…)等等，但不以此為限。關鍵字擷取模組34的目的在於將自然人交談時所用的贅詞過濾，僅留下可能是店家名稱的關鍵字。 The keyword extraction module 34 deletes redundant words in the keyword array to obtain the keyword characteristics, where the keyword characteristics include signboard names, county names, road names, or industry categories, and the redundant words High-frequency words often appear in general conversations, such as subject words (you, me, him, miss, mr, hello,), Interjections (ㄟ, ah, ah, oh, oh, oh, yo, yo...), verbs (want, want to check, want to know, help me find, please help me, tell me,...) etc., but not Limit this. The purpose of the keyword extraction module 34 is to filter the redundant words used by natural persons in conversation, and only leave keywords that may be the name of the store.

於一實施例中，關鍵字分析判定裝置3更包括從語音辨識裝置2接收該查詢資訊，且當分詞處理模組33無法從字典檔資料庫31的縣市名稱字典檔或路名字典檔中比對出相符的該詞語組合時(即該文字查詢資料中沒有縣市名稱或路名)，關鍵字擷取模組34將該查詢資訊中的該客戶端之話機線路所在縣市納入該關鍵字特徵中，但不以此為限。 In one embodiment, the keyword analysis and determination device 3 further includes receiving the query information from the voice recognition device 2, and when the word segmentation processing module 33 is unable to obtain from the county and city name dictionary file or the road name dictionary file in the dictionary file database 31 When the matching word combination is found (that is, there is no county or city name or road name in the text query data), the keyword extraction module 34 includes the county and city where the client's phone line in the query information is located in the key Character features, but not limited to this.

大數據分析裝置4，係接收關鍵字擷取模組34所得的該關鍵字特徵，以依據該關鍵字特徵從大數據分析裝置4內的資料庫中找出推薦店家資料給該客戶端6，其中，大數據分析裝置4包含工商店家資料庫41、工商店家比對模組42、歷史查詢資料庫43、第一關聯店家推薦模組44、第二關聯店家推薦模組45及反饋資訊模組46。 The big data analysis device 4 receives the keyword feature obtained by the keyword extraction module 34, and finds recommended store information from the database in the big data analysis device 4 to the client 6 according to the keyword feature. Among them, the big data analysis device 4 includes a shopkeeper database 41, a shopkeeper comparison module 42, a historical query database 43, a first related shop recommendation module 44, a second related shop recommendation module 45, and a feedback information module 46.

工商店家資料庫41，係儲存工商店家資料，該工商店家資料包含招牌名稱、縣市名稱、路名或行業別。 The industrial store database 41 stores industrial store information, which includes signboard name, county name, road name, or industry category.

於一實施例中，工商店家資料庫41為特殊設計的資訊聚合大表，每日由網路爬蟲程式自動從經濟部商業司取得登記有案的工商店家招牌名稱，並整合比對104查號台內登記為有效使用中，尚未拆機的工商店家資料，由專業人員進行讀音(即注音符號)編排分類以及行業別分類，但不以此為限。 In one embodiment, the shopkeeper database 41 is a specially designed information aggregation table. The web crawler program automatically obtains the registered shopkeeper's name from the Department of Commerce of the Ministry of Economic Affairs every day, and integrates the 104 number check Registered in the station as being in effective use, but not yet dismantled the information of the industrial shop, the professional staff will arrange and classify the pronunciation (that is, the phonetic symbol) and the industry classification, but not Limit this.

工商店家比對模組42，係從工商店家資料庫41中找出與該關鍵字特徵中的招牌名稱之讀音相同的工商店家資料，以得到招牌同音的工商店家資料，且比對出該招牌同音的工商店家資料具有與該關鍵字特徵的行業別、縣市名稱及路名之至少一相同時，令該至少一相同的該招牌同音的工商店家資料為到關聯工商店家資料。 The shop comparison module 42 finds the shop shop data from the shop shop database 41 that has the same pronunciation as the name of the sign in the keyword feature to obtain the shop shop data with the same pronunciation of the sign, and compare it to the sign When the homophonic industrial store data has at least one of the industry category, county name, and road name of the keyword feature, the at least one identical industrial store data of the sign with the same sound is the related industrial store data.

歷史查詢資料庫43，係儲存該查詢服務的歷史查詢紀錄，該歷史查詢紀錄包含客戶端6所查詢的該文字查詢資料與所對應回覆的答案，以及來電總結記錄客戶端6是否對該答案的滿意度。 The historical query database 43 stores the historical query records of the query service. The historical query records include the text query data queried by the client 6 and the corresponding reply answer, and the call summary record whether the client 6 is correct for the answer Satisfaction.

第一關聯店家推薦模組44，係透過逆文檔頻率(IDF)演算法計算該關聯店家資料於歷史查詢資料庫43中的被查詢頻率，以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料，進而將該第一類推薦店家資料透過文字轉語音模組5語音推薦給該客戶端6。 The first related store recommendation module 44 uses the inverse document frequency (IDF) algorithm to calculate the query frequency of the related store data in the historical query database 43, so that the query frequency is greater than the first preset threshold. The associated store information is the first type of recommended store information, and the first type of recommended store information is then voice-recommended to the client 6 through the text-to-speech module 5.

逆文檔頻率(IDF)計算公式如下，其中D表示該歷史查詢紀錄內，行業別為餐廳類的查詢記錄總筆數，Dw表示招牌名稱w出現在多少筆資料中，透過下列公式：

Inverse document frequency (IDF) calculation formula is as follows, where D represents the total number of query records in the historical query record where the industry is restaurant category, and Dw represents the number of records of the signboard name w appearing in the following formula:

第二關聯店家推薦模組45，係當第一關聯店家推薦模組44無法取得該第一類推薦店家資料時，從歷史查詢資料庫43中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄，以得到關聯歷史查詢紀錄，並利用Apriori機器學習演算法計算取得該關聯歷史查詢紀錄的特徵值，且令該特徵值符合預設推薦條件的該關聯歷史查詢紀錄為第二類推薦店家資料，以將該第二類推薦店家資料透過文字轉語音模組5語音推薦給該客戶端6。 The second associated store recommendation module 45, when the first associated store recommendation module 44 is unable to obtain the first type of recommended store data, finds out from the historical query database 43 the same as the keyword feature and the customer satisfaction is Good calendar Historical query records to obtain related historical query records, and use Apriori machine learning algorithm to calculate the feature value of the related historical query record, and make the related historical query record whose feature value meets the preset recommendation condition as the second type of recommendation Store information to recommend the second type of recommended store information to the client 6 through the text-to-speech module 5 by voice.

該Apriori機器學習演算法計算公式如下：

The calculation formula of the Apriori machine learning algorithm is as follows:

X為客戶端6查詢的店家X，Y為從該歷史查詢紀錄中找出的店家Y(即為該第二類推薦店家資料)，support、confidence及Lift為該特徵值，其中，frq是計算店家在該歷史查詢紀錄內被查詢的次數，frq(X)是指店家X在該歷史查詢紀錄內被查詢的次數，frq(X,Y)是指同時包含店家X和店家Y在該歷史查詢紀錄內被查詢的次數，N為該歷史查詢紀錄的所有查詢次數，Support是支持度即代表該所有查詢次數N中，X與Y同時出現的比例，Supp(X)即為在該所有查詢次數N中，店家X出現的比例，Supp(Y)即為在該所有查詢次數N中，店家Y出現的比例，Confidence是信心度即代表所有出現店家X的查詢中所同時伴隨查詢店家Y的比例，Lift是增益值，Lift值小於1代表店家X與Y兩者是負相關，等於1代表店家X與Y兩者是無關，大於1代表店家X與Y兩者是正相關，其中，當Lift值為正相關且support值和confidence值都超過該第二預設門檻值之後，則此關聯規則成立，則可將店家Y進行推薦給客戶端6。 X is the store X queried by the client 6, Y is the store Y found from the historical query record (that is, the second type of recommended store data), support, confidence, and Lift are the characteristic values, where frq is the calculation The number of times the store has been queried in the historical query record, frq(X) refers to the number of times store X has been queried in the historical query record, frq(X,Y) refers to the historical query that contains both store X and store Y The number of queries in the record, N is all the number of queries in the historical query record, Support is the support degree, which represents the proportion of all the number of queries N, X and Y appear at the same time, Supp(X) is the number of all queries In N, the proportion of store X appearing, Supp(Y) is the proportion of store Y appearing in all the number of queries N, Confidence is the degree of confidence that represents the proportion of all queries where store X appears at the same time as the query store Y , Lift is the gain value, Lift value less than 1 means that the store X and Y are negatively correlated, and equal to 1 means that the store X and Y are irrelevant. Greater than 1 means that both store X and Y are positively correlated. When the Lift value is positively correlated and the support value and confidence value both exceed the second preset threshold, then this association rule is established, and store Y can be recommended To the client 6.

當大數據分析裝置4中的各模組無法依據該關鍵字特徵找出該第一類推薦店家資料及第二類推薦店家資料時，則由中控模組11控制將與客戶端6進行的對話互動轉交客服人員8處理，並將處理所得的第三類推薦店家資料告知客戶端6。 When the modules in the big data analysis device 4 cannot find the first type of recommended store information and the second type of recommended store information based on the keyword characteristics, the central control module 11 controls the communication with the client 6 The dialogue interaction is transferred to the customer service staff 8 for processing, and the client 6 is notified of the third-type recommended store information obtained from the processing.

反饋資訊模組46，係將該客戶端6要求該查詢服務至推薦各該推薦店家資料給該客戶端6之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 The feedback information module 46 stores the processing process between the client 6 requesting the query service and recommending each of the recommended store information to the client 6 as the historical query record in the historical query database.

請參閱第2圖，係本發明之查詢服務之方法200之步驟流程圖。該方法包括下列步驟：在步驟S201中，取得該客戶端6要求該查詢服務的查詢資訊，其中，該查詢資料包含客戶端6的進線號碼、話機線路所在縣市、進線裝置、進線查詢時間及語音查詢資料。 Please refer to Figure 2, which is a flowchart of the steps of the method 200 for querying services of the present invention. The method includes the following steps: in step S201, the query information requested by the client 6 for the query service is obtained, wherein the query data includes the incoming line number of the client 6, the county or city where the phone line is located, the incoming device, and the incoming line Query time and voice query data.

於一實施例中，該步驟S201係利用互動模組12生成對話字串，並透過文字轉語音模組5將該對話字串轉換成對話語音傳送至客戶端6，藉此引導客戶端6回答問題，以從該回答問題中取得該查詢資訊，但不以此為限。 In one embodiment, in step S201, the interactive module 12 is used to generate a dialogue string, and the dialogue string is converted into a dialogue voice through the text-to-speech module 5 and sent to the client 6, thereby guiding the client 6 to answer The question is to obtain the query information from the answer question, but not limited to this.

在步驟S202中，將該查詢資訊中的語音查詢資料轉換成文字查詢資料。 In step S202, the voice query data in the query information is converted into text query data.

於一實施例中，該步驟S202係透過語音辨識裝置2 將該查詢資訊中的語音查詢資料轉換成文字查詢資料。 In one embodiment, the step S202 is performed by the voice recognition device 2 Convert the voice query data in the query information into text query data.

在步驟S203中，從字典檔資料庫31中找出與該文字查詢資料相同的資料，並將該相同的資料中的贅詞過濾，以得到關鍵字特徵，其中，該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別。 In step S203, find the same data as the text query data from the dictionary file database 31, and filter the redundant words in the same data to obtain keyword features, where the keyword features include the name of the sign , County and city name, road name or industry category.

該步驟S203係進一步包含下列步驟：利用分詞處理模組33對具有查詢意圖的該文字查詢資料進行不同字元數的切詞，以得到不同字元數的詞語組合，然後比對字典檔資料庫31中是否有相同的該詞語組合出現，並將比對相同的該詞語組合輸出為關鍵字陣列；以及 The step S203 further includes the following steps: use the word segmentation processing module 33 to segment the text query data with query intent to obtain word combinations with different numbers of characters, and then compare the dictionary file database Whether the same word combination appears in 31, and output the same word combination as a keyword array; and

利用關鍵字擷取模組34將該關鍵字陣列中的贅詞刪除，以得到該關鍵字特徵。 The keyword extraction module 34 is used to delete redundant words in the keyword array to obtain the keyword characteristics.

於一實施例中，該步驟S203一開始係利用語意分析模組32從該文字查詢資料中辨識客戶端6的查詢意圖，若有，則進行利用分詞處理模組33之步驟，若無，則回到步驟S201重新取得另一該查詢資訊，但不以此為限。 In one embodiment, at the beginning of step S203, the semantic analysis module 32 is used to identify the query intention of the client 6 from the text query data. If so, the step of using the word segmentation processing module 33 is performed. If not, then Return to step S201 to obtain another query information, but not limited to this.

於一實施例中，當分詞處理模組33無法從字典檔資料庫31的縣市名稱字典檔或路名字典檔中比對出相符的該詞語組合時(即該文字查詢資料中沒有縣市名稱或路名)，令關鍵字擷取模組34將該查詢資訊中的該客戶端之話機線路所在縣市納入該關鍵字特徵中，但不以此為限。 In one embodiment, when the word segmentation processing module 33 cannot compare the word combination from the county and city name dictionary file or the road name dictionary file in the dictionary file database 31 (that is, there is no county or city in the text query data). Name or road name) to make the keyword retrieval module 34 include the county and city where the phone line of the client in the query information is located in the keyword feature, but it is not limited to this.

在步驟S204中，從工商店家資料庫41中找出與該關鍵字特徵中的招牌名稱之讀音相同且行業別、縣市名稱及路名之至少一相同者，以得到關聯工商店家資料。 In step S204, find out from the industrial store database 41 the pronunciation of the sign name in the keyword feature is the same as the name of the industry, county and city, and At least one of the road names is the same in order to obtain the information of the related shops.

於一實施例中，該步驟S204係透過工商店家比對模組42從工商店家資料庫41中找出與該關鍵字特徵中的招牌名稱之讀音相同的工商店家資料，以得到招牌同音的工商店家資料，且比對出該招牌同音的工商店家資料具有與該關鍵字特徵的行業別、縣市名稱及路名之至少一相同時，令該至少一相同的該招牌同音的工商店家資料為到關聯工商店家資料。 In one embodiment, the step S204 is to find from the industrial store database 41 from the store house database 41 through the store house comparison module 42 to obtain the store house data with the same pronunciation of the signboard name in the keyword feature When the store data is compared with the information of the industrial store with the same sound of the signboard and at least one of the industry category, county and city name and road name of the keyword feature is the same, the information of the at least one identical industrial store with the same sound of the signboard is Go to the related store's information.

在步驟S205中，利用逆文檔頻率演算法計算該關聯店家資料於歷史查詢資料庫中的被查詢頻率，以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料。 In step S205, the inverse document frequency algorithm is used to calculate the inquired frequency of the related store information in the historical query database, so that the related store information whose inquired frequency is greater than the first preset threshold is the first type of recommended store data.

於一實施例中，該步驟S205係透過第一關聯店家推薦模組44執行。 In one embodiment, the step S205 is performed through the first associated store recommendation module 44.

在步驟S206中，當該步驟S205無法利用該逆文檔頻率演算法取得該第一類推薦店家資料時，則從該歷史查詢資料庫中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄，以得到關聯歷史查詢紀錄，進而將該關聯歷史查詢紀錄進行Apriori機器學習演算法計算取得一推薦值，以令該推薦值大於第二預設門檻值的該關聯歷史查詢紀錄為第二類推薦店家資料。 In step S206, when the inverse document frequency algorithm cannot be used to obtain the recommended store information of the first type in the step S205, it is found from the historical query database that the feature is the same as the keyword and the customer satisfaction is good Historical query records to obtain associated historical query records, and then perform Apriori machine learning algorithm calculations on the associated historical query records to obtain a recommended value, so that the associated historical query record with the recommended value greater than the second preset threshold is the first The second category recommended store information.

於一實施例中，該步驟S206係透過第二關聯店家推薦模組45執行。 In one embodiment, the step S206 is performed through the second associated store recommendation module 45.

於一實施例中，當該步驟S206無法依據該關鍵字特徵找出該第二類推薦店家資料時，則由中控模組11控制將與客戶端6進行的對話互動轉交客服人員8處理，並將處理所得的第三類推薦店家資料推薦給客戶端6。 In one embodiment, when the step S206 cannot be based on the keyword feature When the second type of recommended store information is found, the central control module 11 will control the dialogue and interaction with the client 6 to the customer service staff 8 for processing, and recommend the processed third type of recommended store information to the client 6 .

在步驟S207中，將該推薦店家資料推薦給該客戶端6。 In step S207, the recommended store profile is recommended to the client 6.

於一實施例中，係由中控模組11將該步驟S205所得的該第一類推薦店家資料推薦給該客戶端6，當該步驟S205無法取得該第一類推薦店家資料時，則將該步驟S206所得的該第二類推薦店家資料推薦給該客戶端6，若該步驟S206也無法取得該第二類推薦店家資料時，則由客服人員8將該第三類推薦店家資料推薦給客戶端6，但不以此為限。 In one embodiment, the central control module 11 recommends the first-type recommended store information obtained in step S205 to the client 6, and when the first-type recommended store information cannot be obtained in step S205, the The second type of recommended store information obtained in step S206 is recommended to the client 6. If the second type of recommended store information cannot be obtained in step S206, the customer service staff 8 recommends the third type of recommended store information to the client 6. Client 6, but not limited to this.

在步驟S208中，將該客戶端要求該查詢服務至推薦該推薦店家資料給該客戶端之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 In step S208, the process between the client requesting the query service and recommending the recommended store information to the client is stored as the historical query record in the historical query database.

於一實施例中，該步驟S208係由中控模組11或客服人員8所執行。 In one embodiment, the step S208 is executed by the central control module 11 or the customer service personnel 8.

本發明之查詢服務之系統與方法係適用於104查號台，下列提供一情境實例說明本發明之查詢服務之系統與方法：客戶(即客戶端6)打電話至104查號台後，話務被交換機7分配到互動裝置1，並呼叫中控模組11啟動標準的對話流程，首先由互動模組12問候客戶並引導客戶回答問題，並由中控模組11從交換機7取得該客戶進線的相關資訊，得到X集合(即該查詢資訊)為={X1客戶進線號碼=0910XXXXXX,X2話機線路所在縣市=台北大安區,X3=手機進線,X4客戶進線查詢時間=20XX/04/23 11：45，X5=客戶透過電話筒所回答的問題語音多媒體檔案}。 The system and method of the query service of the present invention is applicable to the 104 directory. The following provides a scenario example to illustrate the system and method of the query service of the present invention: After the customer (ie client 6) calls the 104 directory, then The service is assigned to the interactive device 1 by the switch 7, and calls the central control module 11 to start a standard dialogue process. First, the interactive module 12 greets the customer and guides the customer to answer questions, and the central control module 11 obtains the customer from the switch 7. Incoming related resources The X set (that is, the query information) obtained is = {X1 customer incoming line number=0910XXXXXX, X2 phone line in the county = Daan District, Taipei, X3=mobile phone incoming line, X4 customer incoming line query time=20XX/04/ 23 11:45, X5=Voice multimedia file of the questions answered by the customer through the phone}.

將X5交由語音辨識裝置2進行處理，透過信號處理模組21處理過後，得到較為清晰且無背景音雜訊的數位訊號X5'，然後呼叫語言分類模組22比對聲學模型、發音字典檔24後，確認X5'是中文發音之後，則將X5'交給語音轉文字模組23處理得到Y集合(即該文字查詢資料)為{Y1客戶問題字串=喂，小姐你好，我想要查中和區中山路上的鼎泰豐餐廳，請幫我查訂位電話，謝謝}。 Hand over X5 to the speech recognition device 2 for processing. After processing through the signal processing module 21, a clearer digital signal X5' without background noise is obtained, and then the language classification module 22 is called to compare the acoustic model and pronunciation dictionary files After 24, after confirming that X5' is the Chinese pronunciation, then X5' will be handed over to the speech-to-text module 23 for processing to obtain the Y set (that is, the text query data) as {Y1 customer question string=Hello, miss hello, I think To check the Ding Tai Fung restaurant on Zhongshan Road, Zhonghe District, please check the reservation phone number for me, thank you}.

將Y集合送到關鍵字分析判定裝置3，先呼叫語意分析模組32對字串集合Y進行語意分析，分析其目的意圖是否為查詢，若是，開始呼叫字典檔資料庫31(路名字典、行業名稱字典、景點名稱字典、招牌名稱字典)、分詞處理模組33進行中文分詞比對作業，將中文語句切割為不同字元數的詞語組合，並透過關鍵字擷取模組34過濾該詞語組合中之自然人交談所使用的主詞、動詞、嘆詞、語助詞等贅詞(例如：喂，小姐你好，我想要查、請幫我查、謝謝)，經過上述過濾、比對處理作業後及各字典檔比對結果可得到下列表1所示之關鍵字特徵：『鼎泰豐』、『頂泰風』、『頂太峰』、『新北市中和區』、『中山路』、『餐廳』。 Send the Y set to the keyword analysis and determination device 3, first call the semantic analysis module 32 to perform semantic analysis on the string set Y, analyze whether its purpose is a query, and if so, start calling the dictionary file database 31 (road name dictionary, Industry name dictionary, scenic spot name dictionary, sign name dictionary), word segmentation processing module 33 performs Chinese word segmentation comparison operations, cuts Chinese sentences into word combinations with different numbers of characters, and filters the words through the keyword extraction module 34 Subjects, verbs, interjections, auxiliary words and other superfluous words used by natural persons in the combination (for example: hello, miss hello, I want to check, please help me check, thank you), after the above filtering and comparison processing operations After the comparison result of each dictionary file, the keyword features shown in Table 1 can be obtained: "Ding Tai Fung", "Ding Tai Feng", "Ding Tai Feng", "New Taipei City Zhonghe District", "Zhongshan Road", " restaurant".

產出查詢資料庫所需使用關鍵字特徵Y'={Y'1店家名稱=鼎泰豐、頂泰風或頂太峰,Y'2店家所在的縣市=新北市中和區,Y'3店家所在的路段名稱=中山路,Y'4店家的行業別=餐廳}；此時將交換機7所得之資訊X集合與查詢用Y'集合輸入至工商店家比對模組42去比對工商店家資料庫41中的資料，由於Y'1工商店家招牌大多屬於中文，若僅透過聲學模型進行音轉字處理，則會因為同音異字、破音字的因素，造成候選組合案例相當多，而不知該輸出哪一個候選組合，因此需預先將Y'1店家名稱先轉為注音符號『ㄉ一ㄥ^ˇㄊㄞ^`ㄈㄥ』，並且將此注音符號比對工商店家資料庫41裡的店家讀音欄位，會發現具有相同讀音的店家有『頂太峰登山用品店』、『頂泰風泰式料理』、『鼎泰豐小籠包』等同音異字的招牌，且客戶詢問的行業別名稱是『餐廳』，為一般性餐飲業的泛稱，並非是工商店家資料庫41所註冊的店家行業別『小籠包』、『登山用品』、『泰式料理』，因此還需要將行業別資料比對字典檔資料庫31內的行業別同義詞字典進行比對。 Keyword feature required to generate query database Y'=(Y'1 store name=Ding Tai Fung, Ding Tai Feng or Ding Tai Feng, Y'2 store in the county and city = New Taipei City Zhonghe District, Y'3 store Road section name=Zhongshan Road, Y'4 store industry type=restaurant}; at this time, the information X set and query obtained by exchange 7 are input into the shop comparison module 42 to compare the shop data The data in library 41, because most Y'1 industrial shop signs are in Chinese, if only the sound-to-word processing is performed through the acoustic model, there will be a lot of candidate combinations due to the factors of homophones and broken sound characters, and the output is unknown. which candidate combination, and therefore the need to advance Y'1 store name into phonetic symbols "ㄉㄞ ^ㄊ ^a` ㄈ eng eng ", and this phonetic alphabet pronunciation of the store field than industrial and commercial stores in the database 41, You will find that the stores with the same pronunciation have "Dingtaifeng Mountaineering Supplies Store", "Dingtai Style Thai Cuisine", "Ding Tai Fung Xiao Long Bao" and equivalent signboards, and the industry name that the customer asked is "restaurant". It is a general term for the general catering industry. It is not the "xiaolongbao", "climbing supplies", and "Thai cuisine" of the store industry registered in the storehouse database 41. Therefore, it is necessary to compare the industry-specific data with the dictionary file data The industry-specific synonym dictionary in the database 31 is compared.

假若客戶所詢問的行業別(即表1所示的餐廳)，透過行業別同義詞字典對照後發現與該相同讀音的店家在工商店家資料庫41內所登記的行業別名稱相同(如餐廳的同義字等於小籠包或料理)，則信心度為60，比對完行業別同義字後接著比對縣市及路名，假若該相同讀音的店家在工商店家資料庫41內所登記的縣市與客戶所詢問的縣市(如表1所示的新北市中和區)相同但不同路名時，則信心度為70，再而，假若該相同讀音的店家在工商店家資料庫41內所登記的縣市及路名與客戶所詢問的縣市及路名(如表1所示的中山路)相同時，則信心度為90，並取信心度最高的該相同讀音的店家於工商店家資料庫41中的資料(即該工商店家資料)為該關聯工商店家資料。 If the industry category that the customer asks (ie the restaurant shown in Table 1), through After comparing the industry synonym dictionary, it is found that the industry name registered in the industry database 41 of the same pronunciation store is the same (for example, the synonym of restaurant is equal to Xiaolongbao or cuisine), the confidence level is 60, and the comparison is complete After the synonymous word of the industry category, the county and city and the road name are compared. If the store with the same pronunciation is registered in the county and city in the industrial store’s home database 41 and the county and city inquired by the customer (as shown in Table 1, Zhonghe, New Taipei City) District) the same but different road names, the confidence is 70, and if the same pronunciation of the store in the industrial store home database 41 registered in the county and road name and the customer inquired about the county, city and road name ( When the Zhongshan Road shown in Table 1 is the same, the confidence level is 90, and the data in the industrial store database 41 of the store with the same pronunciation with the highest confidence level (ie, the store’s store data) is the affiliated store’s store. data.

在本案例中客戶詢問鼎泰豐餐廳，但是工商店家資料庫41內記載，客戶註冊的資料，並沒有鼎泰豐餐廳，經過比對同義詞字典檔之後，發現業別餐廳的同義詞含有小籠包以及料理，因此在招牌名稱的定義裡，鼎泰豐餐廳=鼎泰豐小籠包以及鼎泰豐餐廳=頂泰風泰式料理餐廳。 In this case, the customer inquired about Ding Tai Fung Restaurant, but it was recorded in the industrial store’s home database 41 that there was no Ding Tai Fung Restaurant in the customer’s registered information. After comparing the synonym dictionary files, it was found that the synonyms of Yebei Restaurant contained Xiao Long Bao and cuisine, so In the definition of the signature name, Ding Tai Fung Restaurant = Ding Tai Fung Xiao Long Bao and Ding Tai Fung Restaurant = Ding Tai Feng Thai Restaurant.

一般而言，在同一條路上要出現兩筆同音異字的招牌名稱，機率是相當低，很少會有這樣的情形，此例屬於極端的案例，主要是展示本方法可解決客戶不清楚店家名稱、不清楚店家所在路名的情況下，亦可找出相關店家。甚至客戶開車出門在不知道自己位置的情況下，本發明透過查詢參數可判斷出客戶是透過手機門號查詢，可透過基地台的註冊表，可反查基地台所在資訊，間接得知客戶所在的區域，例如『台北市大安區』。若客戶透過市話進線，則可更精準的知道裝機地址，可更精準的協助客戶，進行的地理位置附近的店家檢索。 Generally speaking, the probability of two signs with homophones and different characters appearing on the same road is quite low. This is rarely the case. This example is an extreme case, mainly to show that this method can solve the customer's unclear store name , If you don’t know the name of the street where the store is located, you can also find the relevant store. Even when the customer drives out without knowing his location, the present invention can determine that the customer is inquiring through the phone number through the query parameters. The registration form of the base station can be used to check the information of the base station and indirectly know where the customer is. of Area, such as "Da'an District, Taipei City". If the customer enters the line through the local call, they can know the installation address more accurately, and can more accurately assist the customer to search for stores near the geographic location.

接下來將這些同音異字的招牌名稱(即該關聯工商店家資料)，輸入至第一關聯店家推薦模組44，透過內建的IDF演算法統計分析這些同音異字的招牌名稱在歷史查詢資料庫43內的被查詢頻率，最後得到結果Z{店家名稱、店家名稱相似度、店家登記電話}的集合，並依照IDF分數來排序，如下列表2所示。 Next, input the name of the signboard of these homophones (that is, the store data of the associated shop) into the first associated store recommendation module 44, and use the built-in IDF algorithm to statistically analyze the signboard names of these homophones in the historical query database 43 The frequency of being queried within, and finally the result Z{store name, store name similarity, store registration phone number} set, and sorted according to IDF score, as shown in Table 2 below.

上述案例產出的結果為{Z1=鼎泰豐小籠包，Z2=10.38},{Z'1=頂泰風泰式料理,Z'2=7.11}；其中{Z1=鼎泰豐小籠包，Z2是IDF數值=10.38}的分數較高，透過綜合評比之後，假設Z1及Z2都超過預設門檻值，第一關聯店家推薦模組44會將Z1的店家名稱以及此店家登記的電話結果送至中控模組11，並透過文字轉語音模組5播放訊息：『您好，您剛詢問位在{新北市}{中和區}{中山路}的{鼎泰豐餐廳}，登記的電話是{02-XXXX-XXXX}』，另外一筆較為相近的是在{新北市}{中和區}{中山路}的{頂泰風泰式料理}，登記的電話是{02-XXXX-YYYY}。 The output of the above case is {Z1=Ding Tai Fung Xiao Long Bao, Z2=10.38}, {Z'1=Ding Tai Feng Thai Cuisine, Z'2=7.11}; where {Z1=Ding Tai Feng Xiao Long Bao, Z2 is IDF value=10.38} has a higher score. After the comprehensive evaluation, assuming that both Z1 and Z2 exceed the preset threshold, the first level The joint store recommendation module 44 will send the Z1 store name and the phone number registered by this store to the central control module 11, and play a message through the text-to-speech module 5: "Hello, you just asked about the location in {新北市}{Ding Tai Fung Restaurant} in {Zhonghe District}{Zhongshan Road}, the registered phone number is {02-XXXX-XXXX}", and another similar one is in {New Taipei City} {Zhonghe District} {Zhongshan Road} {Top Thai Style Thai Cuisine}, the registration phone number is {02-XXXX-YYYY}.

由上可知，本發明透過互動裝置進行自動化對答，主動對客戶端問候，並取得客戶端的查詢資訊，降低客戶端等候太久的不良感受，後續再透過語音辨識裝置、關鍵字分析判定裝置及大數據分析裝置將客戶端的查詢資料的自動化分析取得該查詢資訊所對應的推薦店家資料，以改善客服人員訓練程度不一，容易造成查詢錯誤，且人員離職流動率高，造成學習成本大幅提高的問題，且該大數據分析裝置中的反饋資訊模組係同時將系統及客服人員的處理結果回饋至歷史查詢資料庫中，藉此提高由該歷史查詢資料庫中產出推薦店家資料的正確率。 It can be seen from the above that the present invention uses the interactive device to automate the answering, actively greets the client, and obtains the client's query information, reducing the client's bad feeling of waiting too long, and then using the voice recognition device, the keyword analysis and determination device and the big The data analysis device automatically analyzes the query data of the client to obtain the recommended store data corresponding to the query information, so as to improve the problem of different training levels of customer service personnel, which is easy to cause query errors, and the turnover rate of personnel is high, resulting in a significant increase in learning costs And the feedback information module in the big data analysis device simultaneously feeds back the processing results of the system and customer service personnel to the historical query database, thereby improving the accuracy of recommended store data generated from the historical query database.

1‧‧‧互動裝置 1‧‧‧Interactive device

2‧‧‧語音辨識裝置 2‧‧‧Voice recognition device

4‧‧‧大數據分析裝置 4‧‧‧Big data analysis device

5‧‧‧文字轉語音模組 5‧‧‧Text-to-speech module

6‧‧‧客戶端 6‧‧‧Client

7‧‧‧交換機 7‧‧‧Switch

8‧‧‧客服人員 8‧‧‧Customer Service Staff

11‧‧‧中控模組 11‧‧‧Central Control Module

12‧‧‧互動模組 12‧‧‧Interactive Module

21‧‧‧信號處理模組 21‧‧‧Signal processing module

22‧‧‧語言分類模組 22‧‧‧Language Classification Module

23‧‧‧語音轉文字模組 23‧‧‧Voice to text module

24‧‧‧發音字典檔 24‧‧‧ pronunciation dictionary file

31‧‧‧字典檔資料庫 31‧‧‧Dictionary file database

32‧‧‧語意分析模組 32‧‧‧Semantic Analysis Module

33‧‧‧分詞處理模組 33‧‧‧Word Segmentation Processing Module

34‧‧‧關鍵字擷取模組 34‧‧‧Keyword Extraction Module

41‧‧‧工商店家資料庫 41‧‧‧Workshop Home Database

43‧‧‧歷史查詢資料庫 43‧‧‧History query database

46‧‧‧反饋資訊模組 46‧‧‧Feedback Information Module

100‧‧‧系統 100‧‧‧System

Claims

A query service system includes: an interactive device for obtaining query information sent by a client requesting a query service, wherein the query data includes voice query data; a voice recognition device is the query information obtained by the interactive device The voice query data is converted into text query data; the keyword analysis and determination device receives the text query data, and finds the same data as the text query data from the dictionary file database to obtain the keyword characteristics, where the Keyword features include signboard names, county and city names, road names, or industry categories; and a big data analysis device that receives the keyword features from the keyword analysis and determination device, and the big data analysis device includes: industrial store information The database is to store the data of the industrial store. The store data includes the name of the sign, the name of the county or city, the road name or the industry category; the industrial store comparison module is to find out the characteristics of the keyword from the database of the industrial store The name of the signboard has the same pronunciation and at least one of the industry category, county and city name, and road name is the same to obtain the information of the related industrial store; the historical query database stores the historical query records of the query service; and the recommended model of the first associated store Group, which calculates the inquired frequency of the related store data in the historical inquiry database through the inverse document frequency algorithm, so that the inquired frequency is greater than the first preset The associated store information of the threshold value is the first type of recommended store information, and the first type of recommended store information is then recommended to the client.

For example, the system described in item 1 of the scope of patent application, wherein the historical query record includes customer satisfaction, and the big data analysis device further includes: a second affiliated store recommendation module, which acts as the first affiliated store recommendation module When the first type of recommended store data cannot be obtained, find the historical query records that are the same as the keyword and have good customer satisfaction from the historical query database to obtain the associated historical query records, and the associated historical query The record is calculated by the Apriori machine learning algorithm to obtain the feature value, so that the associated historical query record whose feature value meets the preset recommendation condition is the second type of recommended store information, and then the second type of recommended store information is recommended to the client.

For example, the system described in item 1 or 2 of the scope of patent application, wherein the big data analysis department further includes: a feedback information module, which is between the client requesting the query service and recommending the recommended store information to the client The processing process is stored as the historical query record in the historical query database.

For example, the system described in item 1 or 2 of the scope of patent application, wherein the query information further includes the county or city where the phone line of the client is located, and the text query data does not include the county or road name, the keyword analysis The determining device incorporates the county and city where the phone line of the client is located into the keyword feature.

For example, the system described in item 1 of the scope of patent application, wherein the dictionary file database includes county and city name dictionary files, road name dictionary files, and industry name dictionary files File, sign name dictionary file, scenic spot name dictionary file, common name abbreviation dictionary file, broken sound word dictionary file or industry-specific synonyms dictionary file.

A method for querying services includes the following steps: (1) obtaining query information requested by the client for query services, wherein the query data includes voice query data; (2) converting the voice query data in the query information into text query Data; (3) Find the same data as the text query data from the dictionary file database, and filter the redundant words in the same data to obtain the keyword features, where the keyword features include the name of the sign, County and city name, road name or industry category; (4) Find from the industrial store’s home database that the pronunciation of the sign name in the keyword feature is the same and at least one of the industry category, county name and road name is the same, In order to obtain the related store information; (5) Use the inverse document frequency algorithm to calculate the inquired frequency of the related store data in the historical query database, so that the associated store data whose inquired frequency is greater than the first preset threshold is The first type of recommended store information; and (6) recommend the recommended store information to the client.

For example, the method described in item 6 of the scope of patent application, wherein, when the step (5) can not use the inverse document frequency algorithm to obtain the information of the first type of recommended store, then find out from the historical query database and the The historical query records with the same keyword characteristics and good customer satisfaction are obtained to obtain the related historical query records, and then the related historical query records are Apriori The machine learning algorithm calculates and obtains the feature value, so that the associated historical query record whose feature value meets the preset recommendation condition is the second type of recommended store information.

For example, the method described in item 6 or 7 of the scope of the patent application, wherein the method further includes the following steps: (7) The process between the client requesting the query service and recommending the recommended store information to the client is stored Create the historical query record in the historical query database.

For example, the method described in item 6 or 7 of the scope of patent application, wherein the query information further includes the county or city where the phone line of the client is located, and when the voice query data does not contain the name of the county or city, the keyword The analysis and determination unit includes the county and city where the phone line of the client is located in the keyword feature.

Such as the method described in item 6 of the scope of patent application, wherein the dictionary file database includes a county and city name dictionary file, a road name dictionary file, an industry name dictionary file, a sign name dictionary file, a scenic spot name dictionary file, and a common name abbreviation dictionary file , Poyin word dictionary file or industry-specific synonyms dictionary file.