TWI698756B - System for inquiry service and method thereof - Google Patents
System for inquiry service and method thereof Download PDFInfo
- Publication number
- TWI698756B TWI698756B TW107139680A TW107139680A TWI698756B TW I698756 B TWI698756 B TW I698756B TW 107139680 A TW107139680 A TW 107139680A TW 107139680 A TW107139680 A TW 107139680A TW I698756 B TWI698756 B TW I698756B
- Authority
- TW
- Taiwan
- Prior art keywords
- query
- name
- store
- data
- client
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 238000004458 analytical method Methods 0.000 claims abstract description 29
- 238000007405 data analysis Methods 0.000 claims abstract description 22
- 238000012545 processing Methods 0.000 claims description 28
- 230000002452 interceptive effect Effects 0.000 claims description 23
- 230000008569 process Effects 0.000 claims description 11
- 238000010801 machine learning Methods 0.000 claims description 9
- 230000003993 interaction Effects 0.000 abstract description 7
- 230000011218 segmentation Effects 0.000 description 11
- 238000000605 extraction Methods 0.000 description 9
- 238000013528 artificial neural network Methods 0.000 description 8
- 238000004364 calculation method Methods 0.000 description 5
- 238000003062 neural network model Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 238000012549 training Methods 0.000 description 4
- 241001672694 Citrus reticulata Species 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000007306 turnover Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000009194 climbing Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000004886 process control Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000006403 short-term memory Effects 0.000 description 1
- 235000009561 snack bars Nutrition 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
本發明係有關一種查詢服務技術,尤指一種自動化查詢服務之系統與方法。 The present invention relates to a query service technology, especially a system and method for automatic query service.
傳統的電話查詢服務屬高人力密集度且重複性質極高的服務,由查詢服務人員詢問客戶,聽取完問題之後,才能開始進行查詢服務,且過程中有可能會遇到新進人員,若新進人員訓練不足而無法快速找到該問題所對應的答案時,會耽誤了不少查詢服務的時間;其次,當線上之查詢服務人員的話務繁忙時,會導致欲查詢之客戶必須再線上等待查詢服務人員的接通才能進行查詢服務,而浪費不少等待時間。 The traditional telephone enquiry service is a highly labor-intensive and repetitive service. The enquiry service staff asks the customer. After listening to the question, the enquiry service can be started, and new staff may be encountered in the process. If the new staff Insufficient training to quickly find the answer to the question will delay a lot of query service time; secondly, when the online query service staff is busy, it will cause the customer who wants to query to wait for the query service online again Only when the personnel are connected can the inquiry service be performed, and a lot of waiting time is wasted.
因此,如何自動化的進行電話查詢服務且精確且快速地提供答案即為目前所亟待解決的課題之一。 Therefore, how to automatically perform telephone inquiry services and provide answers accurately and quickly is one of the urgent problems to be solved.
為克服習知技術之缺失,本發明係提供查詢服務之系統,係包括:互動裝置,係取得客戶端要求查詢服務所發出的查詢資訊,其中,該查詢資料包含語音查詢資料;語 音辨識裝置,係將該互動裝置所取得該查詢資訊中的語音查詢資料轉換成文字查詢資料;關鍵字分析判定裝置,係接收該文字查詢資料,並從字典檔資料庫中找出與該文字查詢資料相同的資料,以得到關鍵字特徵,其中,該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別;以及大數據分析裝置,係接收該關鍵字分析判定裝置所得的該關鍵字特徵,且該大數據分析裝置係包含:工商店家資料庫,係儲存工商店家資料,該工商店家資料包含招牌名稱、縣市名稱、路名或行業別;工商店家比對模組,係從該工商店家資料庫中找出與該關鍵字特徵中的招牌名稱之讀音相同且行業別、縣市名稱及路名之至少一相同者,以得到關聯工商店家資料;歷史查詢資料庫,係儲存該查詢服務的歷史查詢紀錄;及第一關聯店家推薦模組,係透過逆文檔頻率演算法計算該關聯店家資料於該歷史查詢資料庫中的被查詢頻率,以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料,進而推薦該第一類推薦店家資料給該客戶端。 In order to overcome the deficiencies of the conventional technology, the present invention provides a query service system, which includes: an interactive device for obtaining query information sent by the client requesting the query service, wherein the query data includes voice query data; The phonetic recognition device converts the voice query data in the query information obtained by the interactive device into text query data; the keyword analysis and determination device receives the text query data, and finds the text from the dictionary file database Query data with the same data to obtain keyword features, where the keyword features include signboard names, county names, road names, or industry categories; and a big data analysis device that receives the key from the keyword analysis and determination device Character features, and the big data analysis device includes: a storehouse database, which stores storehouse data, the storehouse data includes signboard names, county names, road names or industry categories; the storehouse comparison module, from Find out the same pronunciation of the sign name in the keyword feature and at least one of the industry category, county and city name and road name in the industrial store database to obtain related industrial store information; the historical query database is stored The historical query record of the query service; and the recommendation module of the first associated store, which uses the inverse document frequency algorithm to calculate the query frequency of the associated store data in the historical query database, so that the query frequency is greater than the first predicted Set the threshold value of the associated store information as the first type of recommended store information, and then recommend the first type of recommended store information to the client.
於一實施例中,該歷史查詢紀錄包含客戶滿意度,且該大數據分析裝置更包括:第二關聯店家推薦模組,係當該第一關聯店家推薦模組無法取得該第一類推薦店家資料時,從該歷史查詢資料庫中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄,以得到關聯歷史查詢紀錄,並將該關聯歷史查詢紀錄進行Apriori機器學習演算法計算取得一推薦值,以令該推薦值大於第二預設門檻值的該 關聯歷史查詢紀錄為第二類推薦店家資料,進而推薦該第二類推薦店家資料給該客戶端。 In one embodiment, the historical query record includes customer satisfaction, and the big data analysis device further includes: a second associated store recommendation module, when the first associated store recommendation module cannot obtain the first type of recommended store When collecting data, find historical query records that are the same as the keyword and have good customer satisfaction from the historical query database to obtain related historical query records, and perform Apriori machine learning algorithm calculations on the related historical query records Obtain a recommended value so that the recommended value is greater than the second preset threshold The associated history query record is the second type of recommended store information, and then the second type of recommended store information is recommended to the client.
於一實施例中,該大數據分析裝置更包括:反饋資訊模組,係將該客戶端要求該查詢服務至推薦該推薦店家資料給該客戶端之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 In one embodiment, the big data analysis device further includes: a feedback information module, which stores the processing between the client requesting the query service and recommending the recommended store data to the client as the historical query database The historical query record in.
於一實施例中,該查詢資訊更包含該客戶端之話機線路所在縣市,且該文字查詢資料中無縣市名稱或路名時,該關鍵字分析判定裝置將該客戶端之話機線路所在縣市納入該關鍵字特徵中。 In one embodiment, the query information further includes the county or city where the phone line of the client is located, and when there is no county or city name or road name in the text query data, the keyword analysis and determination device is where the phone line of the client is located Counties and cities are included in this keyword feature.
於一實施例中,該字典檔資料庫包含縣市名稱字典檔、路名字典檔、行業名稱字典檔、招牌名稱字典檔、景點名稱字典檔、俗名簡稱字典檔、破音字字典檔或行業別同義詞字典檔。 In one embodiment, the dictionary file database includes a county and city name dictionary file, a road name dictionary file, an industry name dictionary file, a sign name dictionary file, a scenic spot name dictionary file, a common name abbreviation dictionary file, a broken sound word dictionary file, or an industry category Synonym dictionary file.
本發明另提供一種查詢服務之方法,係包括下列步驟:(1)取得客戶端要求查詢服務的查詢資訊,其中,該查詢資料包含語音查詢資料;(2)將該查詢資訊中的語音查詢資料轉換成文字查詢資料;(3)從字典檔資料庫中找出與該文字查詢資料相同的資料,並將該相同的資料中的贅詞過濾,以得到關鍵字特徵,其中,該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別;(4)從該工商店家資料庫中找出與該關鍵字特徵中的招牌名稱之讀音相同且行業別、縣市名稱及路名之至少一相同者,以得到關聯工商店家資料;(5)利用逆文檔頻率演算法計算該關聯店家資料於 歷史查詢資料庫中的被查詢頻率,以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料;以及(6)將該推薦店家資料推薦給該客戶端。 The present invention also provides a method for query service, which includes the following steps: (1) obtain query information requested by the client for the query service, wherein the query data includes voice query data; (2) the voice query data in the query information Converted into text query data; (3) Find the same data as the text query data from the dictionary file database, and filter the redundant words in the same data to obtain the keyword features, where the keyword features Including the name of the signboard, the name of the county and city, the name of the road or the industry; (4) Find out the same pronunciation of the name of the sign in the keyword feature and at least the name of the industry, county and city from the database of the shop One is the same to obtain the related store information; (5) Use the inverse document frequency algorithm to calculate the related store information in The queried frequency in the historical query database is such that the associated store information whose frequency of query is greater than the first preset threshold is the first type of recommended store information; and (6) the recommended store information is recommended to the client.
於一實施例中,當該步驟(5)無法利用該逆文檔頻率演算法取得該第一類推薦店家資料時,則從該歷史查詢資料庫中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄,以得到關聯歷史查詢紀錄,進而將該關聯歷史查詢紀錄進行Apriori機器學習演算法計算取得一推薦值,以令該推薦值大於第二預設門檻值的該關聯歷史查詢紀錄為第二類推薦店家資料。 In one embodiment, when the step (5) is unable to use the inverse document frequency algorithm to obtain the first type of recommended store information, then find out from the historical query database the same as the keyword characteristics and customer satisfaction It is a good historical query record to obtain the related historical query record, and then the related historical query record is calculated by the Apriori machine learning algorithm to obtain a recommended value, so that the recommended value is greater than the second preset threshold value of the related historical query The record is the second type of recommended store information.
於一實施例中,該方法更包括下列步驟:(7)將該客戶端要求該查詢服務至推薦該推薦店家資料給該客戶端之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 In one embodiment, the method further includes the following steps: (7) The process between the client requesting the query service and recommending the recommended store information to the client is stored as the history in the historical query database Query records.
於一實施例中,該查詢資訊更包含該客戶端之話機線路所在縣市,且該語音查詢資料中無縣市名稱或路名時,令該關鍵字分析判定部將該客戶端之話機線路所在縣市納入該關鍵字特徵中。 In one embodiment, the query information further includes the county or city where the phone line of the client is located, and when there is no county or city name or road name in the voice query data, the keyword analysis and determination unit is made to make the phone line of the client The county where you live is included in this keyword feature.
於一實施例中,該字典檔資料庫包含縣市名稱字典檔、路名字典檔、行業名稱字典檔、招牌名稱字典檔、景點名稱字典檔、俗名簡稱字典檔、破音字字典檔或行業別同義詞字典檔。 In one embodiment, the dictionary file database includes a county and city name dictionary file, a road name dictionary file, an industry name dictionary file, a sign name dictionary file, a scenic spot name dictionary file, a common name abbreviation dictionary file, a broken sound word dictionary file, or an industry category Synonym dictionary file.
由上可知,本發明透過互動裝置進行自動化對答,主動對客戶端問候,並取得客戶端的查詢資訊,降低客戶端 等候太久的不良感受,後續再透過語音辨識裝置、關鍵字分析判定裝置及大數據分析裝置將客戶端的查詢資料的自動化分析取得該查詢資訊所對應的推薦店家資料,以改善客服人員訓練程度不一,容易造成查詢錯誤,且人員離職流動率高,造成學習成本大幅提高的問題,且該大數據分析裝置中的反饋資訊模組係同時將系統及客服人員的處理結果回饋至歷史查詢資料庫中,藉此提高由該歷史查詢資料庫中產出推薦店家資料的正確率,並精確且快速地回覆至客戶端。 It can be seen from the above that the present invention uses an interactive device to perform automated answering, actively greet the client, and obtain query information from the client, reducing the client If you wait too long for the bad feelings, then use the voice recognition device, keyword analysis and judgment device and big data analysis device to automatically analyze the query data of the client to obtain the recommended store data corresponding to the query information, so as to improve the training level of the customer service staff. 1. It is easy to cause query errors, and the turnover rate of personnel is high, resulting in a significant increase in learning costs, and the feedback information module in the big data analysis device simultaneously returns the processing results of the system and customer service personnel to the historical query database In this way, the accuracy rate of recommended store information generated from the historical query database is improved, and the response to the client is accurately and quickly returned.
1‧‧‧互動裝置 1‧‧‧Interactive device
2‧‧‧語音辨識裝置 2‧‧‧Voice recognition device
3‧‧‧關鍵字分析判定裝置 3‧‧‧Keyword analysis and judgment device
4‧‧‧大數據分析裝置 4‧‧‧Big data analysis device
5‧‧‧文字轉語音模組 5‧‧‧Text-to-speech module
6‧‧‧客戶端 6‧‧‧Client
7‧‧‧交換機 7‧‧‧Switch
8‧‧‧客服人員 8‧‧‧Customer Service Staff
11‧‧‧中控模組 11‧‧‧Central Control Module
12‧‧‧互動模組 12‧‧‧Interactive Module
21‧‧‧信號處理模組 21‧‧‧Signal processing module
22‧‧‧語言分類模組 22‧‧‧Language Classification Module
23‧‧‧語音轉文字模組 23‧‧‧Voice to text module
24‧‧‧發音字典檔 24‧‧‧ pronunciation dictionary file
31‧‧‧字典檔資料庫 31‧‧‧Dictionary file database
32‧‧‧語意分析模組 32‧‧‧Semantic Analysis Module
33‧‧‧分詞處理模組 33‧‧‧Word Segmentation Processing Module
34‧‧‧關鍵字擷取模組 34‧‧‧Keyword Extraction Module
41‧‧‧工商店家資料庫 41‧‧‧Workshop Home Database
42‧‧‧工商店家比對模組 42‧‧‧Industry shop home comparison module
43‧‧‧歷史查詢資料庫 43‧‧‧History query database
44‧‧‧第一關聯店家推薦模組 44‧‧‧The first related store recommendation module
45‧‧‧第二關聯店家推薦模組 45‧‧‧Recommendation module of the second related store
46‧‧‧反饋資訊模組 46‧‧‧Feedback Information Module
100‧‧‧系統 100‧‧‧System
200‧‧‧方法 200‧‧‧Method
S201~S208‧‧‧步驟 S201~S208‧‧‧Step
第1圖為本發明之查詢服務之系統之示意圖;以及第2圖為本發明之查詢服務之方法之步驟流程圖。 Figure 1 is a schematic diagram of the query service system of the present invention; and Figure 2 is a flowchart of the steps of the query service method of the present invention.
以下藉由特定的具體實施例說明本發明之實施方式,熟悉此技藝之人士可由本說明書所揭示之內容輕易地瞭解本發明之其他優點及功效。 The following specific examples illustrate the implementation of the present invention. Those familiar with the art can easily understand the other advantages and effects of the present invention from the content disclosed in this specification.
須知,本說明書所附圖式所繪示之結構、比例、大小等,均僅用以配合說明書所揭示之內容,以供熟悉此技藝之人士之瞭解與閱讀,並非用以限定本發明可實施之限定條件,故不具技術上之實質意義,任何結構之修飾、比例關係之改變或大小之調整,在不影響本發明所能產生之功效及所能達成之目的下,均應仍落在本發明所揭示之技術內容得能涵蓋之範圍內。同時,本說明書中所引用之如「第一」、「第二」及「一」等之用語,亦僅為便於敘述之明瞭, 而非用以限定本發明可實施之範圍,其相對關係之改變或調整,在無實質變更技術內容下,當視為本發明可實施之範疇。 It should be noted that the structures, proportions, sizes, etc. shown in the drawings in this manual are only used to match the contents disclosed in the manual for the understanding and reading of those familiar with the art, and are not intended to limit the implementation of the present invention. Therefore, it does not have any technical significance. Any structural modification, proportional relationship change, or size adjustment should still fall within the scope of the present invention without affecting the effects and objectives that can be achieved. The technical content disclosed by the invention can be covered. At the same time, the terms such as "first", "second" and "one" cited in this manual are only for ease of description. It is not intended to limit the scope of implementation of the present invention, and the change or adjustment of its relative relationship shall be regarded as the scope of implementation of the present invention without substantial changes to the technical content.
請參閱第1圖所示,係本發明之查詢服務之系統100之示意圖。該系統係包括互動裝置1、語音辨識裝置2、關鍵字分析判定裝置3、大數據分析裝置4及文字轉語音模組5,其中,互動裝置1、語音辨識裝置2、關鍵字分析判定裝置3、大數據分析裝置4可實作在相同設備中彼此電性連接,亦可實作在不同設備中彼此電性連接。
Please refer to Figure 1, which is a schematic diagram of the
互動裝置1係於客戶端6向系統100要求查詢服務時,與客戶端6進行對話互動,其中,互動裝置1係包含中控模組11及互動模組12。
The
於一實施例中,客戶端6係透過交換機7向系統100提出查詢服務的要求,但不以此為限。
In one embodiment, the
於一實施例中,客戶端6係為市內電話機或行動裝置,但不以此為限。
In one embodiment, the
中控模組11,係接收客戶端6向系統100要求該查詢服務所發出的查詢資訊,並負責整個該查詢作業中系統100與客戶端6互動的流程控制。
The
於一實施例中,該查詢資訊包含客戶端6的進線號碼、話機線路所在縣市、進線裝置、進線查詢時間及語音查詢資料。
In one embodiment, the query information includes the incoming line number of the
互動模組12,係內建一對答語句自動生成模型,並令該對答語句自動生成模型產生與客戶端6對話互動的對話
字串,該對答語句自動生成模型係內建自然人聊天所需的語料庫,該語料庫內容包含了大量的上下文(context)及回覆(response),當客戶端6的該語音查詢資料轉換成文字查詢資料並輸入該對答語句自動生成模型時,該對答語句自動生成模型會拆解比對輸入的上下文,並從該語料庫中選擇最佳的回覆為該對話字串。此互動模組12的主要功能是為了與客戶端6保持互動,引導客戶端回答問題。
The
於一實施例中,該對答語句自動生成模型透過機器學習演算法訓練產生,其中,該機器學習演算法包含長短期記憶(LSTN)或遞歸神經網絡(RNN)。 In one embodiment, the answer sentence automatic generation model is generated through machine learning algorithm training, wherein the machine learning algorithm includes long-term short-term memory (LSTN) or recurrent neural network (RNN).
文字轉語音模組5,係接收互動模組12所產生的該對話字串,並將該對話字串轉換成對話語音,以將該對話語音傳送至客戶端6。
The text-to-
於一實施例中,文字轉語音模組5係透過交換機7將該回覆語音傳送至客戶端6。
In one embodiment, the text-to-
在客戶端6向系統100提出查詢服務的要求時,中控模組11先呼叫互動模組12產出對話字串,並將該對話字串透過5 TTS文字轉語音元件將該對話字串轉為語音播放至客戶端6,引導客戶回答問題。過程中會不斷呼叫語音辨識裝置2來分析客戶端6的回答,並確認是否有產出辨識結果,並重複此問答過程數次,若語音辨識裝置2辨識結果可產出查詢用的文字查詢資料,則繼續交付後台(即關鍵字分析判定裝置及大數據分析裝置)進行查詢流程;若過程中無法辨識出有效資訊,或者語意分析後的意圖並非查
詢,就會將此話務轉交客服人員8服務。
When the
語音辨識裝置2,係接收互動裝置1所取得該查詢資訊,以將該查詢資訊中的語音查詢資料轉換成該文字查詢資料,其中,語音辨識裝置2包含信號處理模組21、語音轉文字模組23及發音字典檔24。
The
信號處理模組21,係使用數位訊號處理(Digital Signal Processing,DSP)去除掉該語音查詢資料中的雜訊。 The signal processing module 21 uses digital signal processing (DSP) to remove noise in the voice query data.
發音字典檔24,係儲存詞彙的發音組合,以依據該組合提供單字轉音素或音素轉單字的應用,其中,該音素為語言中的最小發音單位,例如國語的注音符號或英語的音標,但不以此為限。 The pronunciation dictionary file 24 stores the pronunciation combinations of words to provide the application of word-to-phoneme or phoneme-to-character conversion according to the combination. The phoneme is the smallest pronunciation unit in the language, such as the phonetic symbols of Mandarin or the phonetic symbols of English. Not limited to this.
於一實施例中,發音字典檔24係為國語或英語的發音字典檔,亦可為其他語系的發音字典檔,但不限於此。 In one embodiment, the pronunciation dictionary file 24 is a pronunciation dictionary file of Mandarin or English, and may also be a pronunciation dictionary file of other languages, but is not limited to this.
語音轉文字模組23,係將已去除掉雜訊的該語音查詢資料轉換成該文字查詢資料。
The voice-to-
於一實施例中,語音轉文字模組23係根據己經訓練好的聲學模型(HMM)及發音字典檔24,並配合使用神經網路模型BPNN(Backpropagation Neural Network)建立一第一類神經網路來對已去除掉雜訊的該語音查詢資料轉換成單字組合,接著,依據已訓練好的語言模型及配合該神經網路模型BPNN建立第二類神經網路來對該單字組合轉換成完整的文字句子,並令該完整的文字句子為該文字查詢資料。
In one embodiment, the speech-to-
於該第一類神經網路中,神經網路模型BPNN配合聲 學模型的作用是將已去除掉雜訊的該語音查詢資料換成複數個該音素,神經網路模型BPNN配合發音字典檔24的作用是將該複數個音素轉換成複數個字或詞的該單字組合。 In the first type of neural network, the neural network model BPNN cooperates with the sound The function of the learning model is to replace the voice query data with noise removed into plural phonemes. The function of the neural network model BPNN with the pronunciation dictionary file 24 is to convert the plural phonemes into plural words or words. Single word combination.
於該第二類神經網路中,神經網路模型BPNN配合語言模型的作用是將字或詞(例如該單字組合)轉換成該文字句子。 In the second type of neural network, the function of the neural network model BPNN in conjunction with the language model is to convert the word or word (for example, the combination of single characters) into the text sentence.
於一實施例中,語音辨識裝置2更包括語言分類模組22,但不以此為限。
In one embodiment, the
語言分類模組22,係依據該第一類神經網路來對已去除掉雜訊的該語音查詢資料進行語言分類,其中,該語言分類為國語或英語,但不限於此。
The
於一實施例中,語言分類模組22係整合在語音轉文字模組23,以供該第一類神經網路來對已去除掉雜訊的該語音查詢資料進行該語言分類及產生該單字組合,但不限於此。
In one embodiment, the
關鍵字分析判定裝置3,係接收語音轉文字模組23所得的該文字查詢資料,以從該文字查詢資料中找出關鍵字特徵,其中,關鍵字分析判定裝置3包含字典檔資料庫31、語意分析模組32、分詞處理模組33、關鍵字擷取模組34。
The keyword analysis and determination device 3 receives the text query data obtained by the speech-to-
字典檔資料庫31,係包含縣市名稱字典檔、路名字典檔、行業名稱字典檔、招牌名稱字典檔、景點名稱字典檔、破音字字典檔、俗名簡稱字典檔(例如:行天宮=恩主宮等資訊)或行業別同義詞字典檔,其中,同義詞字典檔的內容就是記錄中文詞彙可能的轉換,例如小吃店的同義詞也可 以稱呼為餐廳、飲食店及簡餐,而iPhone的同義詞為哀鳳、愛瘋、智慧手機、蘋果,但不以此為限。 The dictionary file database 31 contains the county and city name dictionary file, the road name dictionary file, the industry name dictionary file, the sign name dictionary file, the scenic spot name dictionary file, the broken sound word dictionary file, the common name abbreviation dictionary file (for example: Xingtian Palace=恩The main palace and other information) or industry-specific synonym dictionary files, where the content of the synonym dictionary file is to record the possible conversion of Chinese vocabulary. For example, synonyms for snack bars can also be used. The names are restaurants, eateries, and simple meals, and the synonyms of iPhone are Aifeng, Crazy, Smart Phone, Apple, but not limited to this.
語意分析模組32,係從該文字查詢資料中辨識客戶端6的查詢意圖,其查詢意圖的辨識方式係從主詞及動詞去判斷,例如:我想要查、請幫我查、我要查、我想查、幫我找或我想找等等,即可認定具有該查詢意圖,但不以此為限。
The semantic analysis module 32 identifies the query intent of the
由於系統100的任務很明確,就是請客戶端6提供店家名稱或景點名稱,然後幫客戶端6進行查詢相關電話資訊,因此客戶端6問句偏向短問句,諸如:我想要查XXX餐廳的電話、幫我查XX醫院的電話、我要找最近上過新聞的XXX的電話、可以幫我找XXX嗎或我想知道XX的電話等等,問法有很多種,但是其意圖都是一樣的,因此語意分析模組32僅需要進行判斷該文字查詢資料是否為查詢即可,若無法判斷查詢意圖,則由中控模組11控制將與客戶端6進行的對話互動轉交客服人員8處理。
Since the task of the
分詞處理模組33,係對該文字查詢資料進行不同字元數的切詞,以得到不同字元數的詞語組合,然後比對字典檔資料庫31中是否有相符的該詞語組合出現,並將比對相符的該詞語組合輸出為關鍵字陣列。 The word segmentation processing module 33 performs word segmentation with different numbers of characters on the text query data to obtain word combinations with different numbers of characters, and then compares whether there is a matching word combination in the dictionary file database 31, and Output the matched word combination as a keyword array.
關鍵字擷取模組34,係將該關鍵字陣列中的贅詞刪除,以得到該關鍵字特徵,其中,該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別,該贅詞為一般對話裡常出現高頻率詞彙,例如主詞(你、我、他、小姐、先生、你好、)、 嘆詞(ㄟ、啊、呀、喔、哦、噢、唷、喲…)、動詞(想要、想查、想知道、幫我找、請幫我、告訴我、…)等等,但不以此為限。關鍵字擷取模組34的目的在於將自然人交談時所用的贅詞過濾,僅留下可能是店家名稱的關鍵字。 The keyword extraction module 34 deletes redundant words in the keyword array to obtain the keyword characteristics, where the keyword characteristics include signboard names, county names, road names, or industry categories, and the redundant words High-frequency words often appear in general conversations, such as subject words (you, me, him, miss, mr, hello,), Interjections (ㄟ, ah, ah, oh, oh, oh, yo, yo...), verbs (want, want to check, want to know, help me find, please help me, tell me,...) etc., but not Limit this. The purpose of the keyword extraction module 34 is to filter the redundant words used by natural persons in conversation, and only leave keywords that may be the name of the store.
於一實施例中,關鍵字分析判定裝置3更包括從語音辨識裝置2接收該查詢資訊,且當分詞處理模組33無法從字典檔資料庫31的縣市名稱字典檔或路名字典檔中比對出相符的該詞語組合時(即該文字查詢資料中沒有縣市名稱或路名),關鍵字擷取模組34將該查詢資訊中的該客戶端之話機線路所在縣市納入該關鍵字特徵中,但不以此為限。
In one embodiment, the keyword analysis and determination device 3 further includes receiving the query information from the
大數據分析裝置4,係接收關鍵字擷取模組34所得的該關鍵字特徵,以依據該關鍵字特徵從大數據分析裝置4內的資料庫中找出推薦店家資料給該客戶端6,其中,大數據分析裝置4包含工商店家資料庫41、工商店家比對模組42、歷史查詢資料庫43、第一關聯店家推薦模組44、第二關聯店家推薦模組45及反饋資訊模組46。
The big
工商店家資料庫41,係儲存工商店家資料,該工商店家資料包含招牌名稱、縣市名稱、路名或行業別。 The industrial store database 41 stores industrial store information, which includes signboard name, county name, road name, or industry category.
於一實施例中,工商店家資料庫41為特殊設計的資訊聚合大表,每日由網路爬蟲程式自動從經濟部商業司取得登記有案的工商店家招牌名稱,並整合比對104查號台內登記為有效使用中,尚未拆機的工商店家資料,由專業人員進行讀音(即注音符號)編排分類以及行業別分類,但不 以此為限。 In one embodiment, the shopkeeper database 41 is a specially designed information aggregation table. The web crawler program automatically obtains the registered shopkeeper's name from the Department of Commerce of the Ministry of Economic Affairs every day, and integrates the 104 number check Registered in the station as being in effective use, but not yet dismantled the information of the industrial shop, the professional staff will arrange and classify the pronunciation (that is, the phonetic symbol) and the industry classification, but not Limit this.
工商店家比對模組42,係從工商店家資料庫41中找出與該關鍵字特徵中的招牌名稱之讀音相同的工商店家資料,以得到招牌同音的工商店家資料,且比對出該招牌同音的工商店家資料具有與該關鍵字特徵的行業別、縣市名稱及路名之至少一相同時,令該至少一相同的該招牌同音的工商店家資料為到關聯工商店家資料。
The
歷史查詢資料庫43,係儲存該查詢服務的歷史查詢紀錄,該歷史查詢紀錄包含客戶端6所查詢的該文字查詢資料與所對應回覆的答案,以及來電總結記錄客戶端6是否對該答案的滿意度。
The historical query database 43 stores the historical query records of the query service. The historical query records include the text query data queried by the
第一關聯店家推薦模組44,係透過逆文檔頻率(IDF)演算法計算該關聯店家資料於歷史查詢資料庫43中的被查詢頻率,以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料,進而將該第一類推薦店家資料透過文字轉語音模組5語音推薦給該客戶端6。
The first related store recommendation module 44 uses the inverse document frequency (IDF) algorithm to calculate the query frequency of the related store data in the historical query database 43, so that the query frequency is greater than the first preset threshold. The associated store information is the first type of recommended store information, and the first type of recommended store information is then voice-recommended to the
逆文檔頻率(IDF)計算公式如下,其中D表示該歷史查詢紀錄內,行業別為餐廳類的查詢記錄總筆數,Dw表示招牌名稱w出現在多少筆資料中,透過下列公式:
第二關聯店家推薦模組45,係當第一關聯店家推薦模組44無法取得該第一類推薦店家資料時,從歷史查詢資料庫43中找出與該關鍵字特徵相同且客戶滿意度為好的歷
史查詢紀錄,以得到關聯歷史查詢紀錄,並利用Apriori機器學習演算法計算取得該關聯歷史查詢紀錄的特徵值,且令該特徵值符合預設推薦條件的該關聯歷史查詢紀錄為第二類推薦店家資料,以將該第二類推薦店家資料透過文字轉語音模組5語音推薦給該客戶端6。
The second associated
該Apriori機器學習演算法計算公式如下:
X為客戶端6查詢的店家X,Y為從該歷史查詢紀錄中找出的店家Y(即為該第二類推薦店家資料),support、confidence及Lift為該特徵值,其中,frq是計算店家在該歷史查詢紀錄內被查詢的次數,frq(X)是指店家X在該歷史查詢紀錄內被查詢的次數,frq(X,Y)是指同時包含店家X和店家Y在該歷史查詢紀錄內被查詢的次數,N為該歷史查詢紀錄的所有查詢次數,Support是支持度即代表該所有查詢次數N中,X與Y同時出現的比例,Supp(X)即為在該所有查詢次數N中,店家X出現的比例,Supp(Y)即為在該所有查詢次數N中,店家Y出現的比例,Confidence是信心度即代表所有出現店家X的查詢中所同時伴隨查詢店家Y的比例,Lift是增益值,Lift值小於1代表店家X與Y兩者是負相關,等於1代表店家X與Y兩者是無關,
大於1代表店家X與Y兩者是正相關,其中,當Lift值為正相關且support值和confidence值都超過該第二預設門檻值之後,則此關聯規則成立,則可將店家Y進行推薦給客戶端6。
X is the store X queried by the client 6, Y is the store Y found from the historical query record (that is, the second type of recommended store data), support, confidence, and Lift are the characteristic values, where frq is the calculation The number of times the store has been queried in the historical query record, frq(X) refers to the number of times store X has been queried in the historical query record, frq(X,Y) refers to the historical query that contains both store X and store Y The number of queries in the record, N is all the number of queries in the historical query record, Support is the support degree, which represents the proportion of all the number of queries N, X and Y appear at the same time, Supp(X) is the number of all queries In N, the proportion of store X appearing, Supp(Y) is the proportion of store Y appearing in all the number of queries N, Confidence is the degree of confidence that represents the proportion of all queries where store X appears at the same time as the query store Y , Lift is the gain value, Lift value less than 1 means that the store X and Y are negatively correlated, and equal to 1 means that the store X and Y are irrelevant.
Greater than 1 means that both store X and Y are positively correlated. When the Lift value is positively correlated and the support value and confidence value both exceed the second preset threshold, then this association rule is established, and store Y can be recommended To the
當大數據分析裝置4中的各模組無法依據該關鍵字特徵找出該第一類推薦店家資料及第二類推薦店家資料時,則由中控模組11控制將與客戶端6進行的對話互動轉交客服人員8處理,並將處理所得的第三類推薦店家資料告知客戶端6。
When the modules in the big
反饋資訊模組46,係將該客戶端6要求該查詢服務至推薦各該推薦店家資料給該客戶端6之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。
The
請參閱第2圖,係本發明之查詢服務之方法200之步驟流程圖。該方法包括下列步驟:在步驟S201中,取得該客戶端6要求該查詢服務的查詢資訊,其中,該查詢資料包含客戶端6的進線號碼、話機線路所在縣市、進線裝置、進線查詢時間及語音查詢資料。
Please refer to Figure 2, which is a flowchart of the steps of the
於一實施例中,該步驟S201係利用互動模組12生成對話字串,並透過文字轉語音模組5將該對話字串轉換成對話語音傳送至客戶端6,藉此引導客戶端6回答問題,以從該回答問題中取得該查詢資訊,但不以此為限。
In one embodiment, in step S201, the
在步驟S202中,將該查詢資訊中的語音查詢資料轉換成文字查詢資料。 In step S202, the voice query data in the query information is converted into text query data.
於一實施例中,該步驟S202係透過語音辨識裝置2
將該查詢資訊中的語音查詢資料轉換成文字查詢資料。
In one embodiment, the step S202 is performed by the
在步驟S203中,從字典檔資料庫31中找出與該文字查詢資料相同的資料,並將該相同的資料中的贅詞過濾,以得到關鍵字特徵,其中,該關鍵字特徵包含招牌名稱、縣市名稱、路名或行業別。 In step S203, find the same data as the text query data from the dictionary file database 31, and filter the redundant words in the same data to obtain keyword features, where the keyword features include the name of the sign , County and city name, road name or industry category.
該步驟S203係進一步包含下列步驟:利用分詞處理模組33對具有查詢意圖的該文字查詢資料進行不同字元數的切詞,以得到不同字元數的詞語組合,然後比對字典檔資料庫31中是否有相同的該詞語組合出現,並將比對相同的該詞語組合輸出為關鍵字陣列;以及 The step S203 further includes the following steps: use the word segmentation processing module 33 to segment the text query data with query intent to obtain word combinations with different numbers of characters, and then compare the dictionary file database Whether the same word combination appears in 31, and output the same word combination as a keyword array; and
利用關鍵字擷取模組34將該關鍵字陣列中的贅詞刪除,以得到該關鍵字特徵。 The keyword extraction module 34 is used to delete redundant words in the keyword array to obtain the keyword characteristics.
於一實施例中,該步驟S203一開始係利用語意分析模組32從該文字查詢資料中辨識客戶端6的查詢意圖,若有,則進行利用分詞處理模組33之步驟,若無,則回到步驟S201重新取得另一該查詢資訊,但不以此為限。
In one embodiment, at the beginning of step S203, the semantic analysis module 32 is used to identify the query intention of the
於一實施例中,當分詞處理模組33無法從字典檔資料庫31的縣市名稱字典檔或路名字典檔中比對出相符的該詞語組合時(即該文字查詢資料中沒有縣市名稱或路名),令關鍵字擷取模組34將該查詢資訊中的該客戶端之話機線路所在縣市納入該關鍵字特徵中,但不以此為限。 In one embodiment, when the word segmentation processing module 33 cannot compare the word combination from the county and city name dictionary file or the road name dictionary file in the dictionary file database 31 (that is, there is no county or city in the text query data). Name or road name) to make the keyword retrieval module 34 include the county and city where the phone line of the client in the query information is located in the keyword feature, but it is not limited to this.
在步驟S204中,從工商店家資料庫41中找出與該關鍵字特徵中的招牌名稱之讀音相同且行業別、縣市名稱及 路名之至少一相同者,以得到關聯工商店家資料。 In step S204, find out from the industrial store database 41 the pronunciation of the sign name in the keyword feature is the same as the name of the industry, county and city, and At least one of the road names is the same in order to obtain the information of the related shops.
於一實施例中,該步驟S204係透過工商店家比對模組42從工商店家資料庫41中找出與該關鍵字特徵中的招牌名稱之讀音相同的工商店家資料,以得到招牌同音的工商店家資料,且比對出該招牌同音的工商店家資料具有與該關鍵字特徵的行業別、縣市名稱及路名之至少一相同時,令該至少一相同的該招牌同音的工商店家資料為到關聯工商店家資料。
In one embodiment, the step S204 is to find from the industrial store database 41 from the store house database 41 through the store
在步驟S205中,利用逆文檔頻率演算法計算該關聯店家資料於歷史查詢資料庫中的被查詢頻率,以令被查詢頻率大於第一預設門檻值的該關聯店家資料為第一類推薦店家資料。 In step S205, the inverse document frequency algorithm is used to calculate the inquired frequency of the related store information in the historical query database, so that the related store information whose inquired frequency is greater than the first preset threshold is the first type of recommended store data.
於一實施例中,該步驟S205係透過第一關聯店家推薦模組44執行。 In one embodiment, the step S205 is performed through the first associated store recommendation module 44.
在步驟S206中,當該步驟S205無法利用該逆文檔頻率演算法取得該第一類推薦店家資料時,則從該歷史查詢資料庫中找出與該關鍵字特徵相同且客戶滿意度為好的歷史查詢紀錄,以得到關聯歷史查詢紀錄,進而將該關聯歷史查詢紀錄進行Apriori機器學習演算法計算取得一推薦值,以令該推薦值大於第二預設門檻值的該關聯歷史查詢紀錄為第二類推薦店家資料。 In step S206, when the inverse document frequency algorithm cannot be used to obtain the recommended store information of the first type in the step S205, it is found from the historical query database that the feature is the same as the keyword and the customer satisfaction is good Historical query records to obtain associated historical query records, and then perform Apriori machine learning algorithm calculations on the associated historical query records to obtain a recommended value, so that the associated historical query record with the recommended value greater than the second preset threshold is the first The second category recommended store information.
於一實施例中,該步驟S206係透過第二關聯店家推薦模組45執行。
In one embodiment, the step S206 is performed through the second associated
於一實施例中,當該步驟S206無法依據該關鍵字特徵
找出該第二類推薦店家資料時,則由中控模組11控制將與客戶端6進行的對話互動轉交客服人員8處理,並將處理所得的第三類推薦店家資料推薦給客戶端6。
In one embodiment, when the step S206 cannot be based on the keyword feature
When the second type of recommended store information is found, the
在步驟S207中,將該推薦店家資料推薦給該客戶端6。
In step S207, the recommended store profile is recommended to the
於一實施例中,係由中控模組11將該步驟S205所得的該第一類推薦店家資料推薦給該客戶端6,當該步驟S205無法取得該第一類推薦店家資料時,則將該步驟S206所得的該第二類推薦店家資料推薦給該客戶端6,若該步驟S206也無法取得該第二類推薦店家資料時,則由客服人員8將該第三類推薦店家資料推薦給客戶端6,但不以此為限。
In one embodiment, the
在步驟S208中,將該客戶端要求該查詢服務至推薦該推薦店家資料給該客戶端之間的處理過程儲存成該歷史查詢資料庫中的該歷史查詢紀錄。 In step S208, the process between the client requesting the query service and recommending the recommended store information to the client is stored as the historical query record in the historical query database.
於一實施例中,該步驟S208係由中控模組11或客服人員8所執行。
In one embodiment, the step S208 is executed by the
本發明之查詢服務之系統與方法係適用於104查號台,下列提供一情境實例說明本發明之查詢服務之系統與方法:客戶(即客戶端6)打電話至104查號台後,話務被交換機7分配到互動裝置1,並呼叫中控模組11啟動標準的對話流程,首先由互動模組12問候客戶並引導客戶回答問題,並由中控模組11從交換機7取得該客戶進線的相關資
訊,得到X集合(即該查詢資訊)為={X1客戶進線號碼=0910XXXXXX,X2話機線路所在縣市=台北大安區,X3=手機進線,X4客戶進線查詢時間=20XX/04/23 11:45,X5=客戶透過電話筒所回答的問題語音多媒體檔案}。
The system and method of the query service of the present invention is applicable to the 104 directory. The following provides a scenario example to illustrate the system and method of the query service of the present invention: After the customer (ie client 6) calls the 104 directory, then The service is assigned to the
將X5交由語音辨識裝置2進行處理,透過信號處理模組21處理過後,得到較為清晰且無背景音雜訊的數位訊號X5',然後呼叫語言分類模組22比對聲學模型、發音字典檔24後,確認X5'是中文發音之後,則將X5'交給語音轉文字模組23處理得到Y集合(即該文字查詢資料)為{Y1客戶問題字串=喂,小姐你好,我想要查中和區中山路上的鼎泰豐餐廳,請幫我查訂位電話,謝謝}。
Hand over X5 to the
將Y集合送到關鍵字分析判定裝置3,先呼叫語意分析模組32對字串集合Y進行語意分析,分析其目的意圖是否為查詢,若是,開始呼叫字典檔資料庫31(路名字典、行業名稱字典、景點名稱字典、招牌名稱字典)、分詞處理模組33進行中文分詞比對作業,將中文語句切割為不同字元數的詞語組合,並透過關鍵字擷取模組34過濾該詞語組合中之自然人交談所使用的主詞、動詞、嘆詞、語助詞等贅詞(例如:喂,小姐你好,我想要查、請幫我查、謝謝),經過上述過濾、比對處理作業後及各字典檔比對結果可得到下列表1所示之關鍵字特徵:『鼎泰豐』、『頂泰風』、『頂太峰』、『新北市中和區』、『中山路』、『餐廳』。 Send the Y set to the keyword analysis and determination device 3, first call the semantic analysis module 32 to perform semantic analysis on the string set Y, analyze whether its purpose is a query, and if so, start calling the dictionary file database 31 (road name dictionary, Industry name dictionary, scenic spot name dictionary, sign name dictionary), word segmentation processing module 33 performs Chinese word segmentation comparison operations, cuts Chinese sentences into word combinations with different numbers of characters, and filters the words through the keyword extraction module 34 Subjects, verbs, interjections, auxiliary words and other superfluous words used by natural persons in the combination (for example: hello, miss hello, I want to check, please help me check, thank you), after the above filtering and comparison processing operations After the comparison result of each dictionary file, the keyword features shown in Table 1 can be obtained: "Ding Tai Fung", "Ding Tai Feng", "Ding Tai Feng", "New Taipei City Zhonghe District", "Zhongshan Road", " restaurant".
產出查詢資料庫所需使用關鍵字特徵Y'={Y'1店家名稱=鼎泰豐、頂泰風或頂太峰,Y'2店家所在的縣市=新北市中和區,Y'3店家所在的路段名稱=中山路,Y'4店家的行業別=餐廳};此時將交換機7所得之資訊X集合與查詢用Y'集合輸入至工商店家比對模組42去比對工商店家資料庫41中的資料,由於Y'1工商店家招牌大多屬於中文,若僅透過聲學模型進行音轉字處理,則會因為同音異字、破音字的因素,造成候選組合案例相當多,而不知該輸出哪一個候選組合,因此需預先將Y'1店家名稱先轉為注音符號『ㄉ一ㄥˇㄊㄞ`ㄈㄥ』,並且將此注音符號比對工商店家資料庫41裡的店家讀音欄位,會發現具有相同讀音的店家有『頂太峰登山用品店』、『頂泰風泰式料理』、『鼎泰豐小籠包』等同音異字的招牌,且客戶詢問的行業別名稱是『餐廳』,為一般性餐飲業的泛稱,並非是工商店家資料庫41所註冊的店家行業別『小籠包』、『登山用品』、『泰式料理』,因此還需要將行業別資料比對字典檔資料庫31內的行業別同義詞字典進行比對。
Keyword feature required to generate query database Y'=(Y'1 store name=Ding Tai Fung, Ding Tai Feng or Ding Tai Feng, Y'2 store in the county and city = New Taipei City Zhonghe District, Y'3 store Road section name=Zhongshan Road, Y'4 store industry type=restaurant}; at this time, the information X set and query obtained by exchange 7 are input into the
假若客戶所詢問的行業別(即表1所示的餐廳),透過 行業別同義詞字典對照後發現與該相同讀音的店家在工商店家資料庫41內所登記的行業別名稱相同(如餐廳的同義字等於小籠包或料理),則信心度為60,比對完行業別同義字後接著比對縣市及路名,假若該相同讀音的店家在工商店家資料庫41內所登記的縣市與客戶所詢問的縣市(如表1所示的新北市中和區)相同但不同路名時,則信心度為70,再而,假若該相同讀音的店家在工商店家資料庫41內所登記的縣市及路名與客戶所詢問的縣市及路名(如表1所示的中山路)相同時,則信心度為90,並取信心度最高的該相同讀音的店家於工商店家資料庫41中的資料(即該工商店家資料)為該關聯工商店家資料。 If the industry category that the customer asks (ie the restaurant shown in Table 1), through After comparing the industry synonym dictionary, it is found that the industry name registered in the industry database 41 of the same pronunciation store is the same (for example, the synonym of restaurant is equal to Xiaolongbao or cuisine), the confidence level is 60, and the comparison is complete After the synonymous word of the industry category, the county and city and the road name are compared. If the store with the same pronunciation is registered in the county and city in the industrial store’s home database 41 and the county and city inquired by the customer (as shown in Table 1, Zhonghe, New Taipei City) District) the same but different road names, the confidence is 70, and if the same pronunciation of the store in the industrial store home database 41 registered in the county and road name and the customer inquired about the county, city and road name ( When the Zhongshan Road shown in Table 1 is the same, the confidence level is 90, and the data in the industrial store database 41 of the store with the same pronunciation with the highest confidence level (ie, the store’s store data) is the affiliated store’s store. data.
在本案例中客戶詢問鼎泰豐餐廳,但是工商店家資料庫41內記載,客戶註冊的資料,並沒有鼎泰豐餐廳,經過比對同義詞字典檔之後,發現業別餐廳的同義詞含有小籠包以及料理,因此在招牌名稱的定義裡,鼎泰豐餐廳=鼎泰豐小籠包以及鼎泰豐餐廳=頂泰風泰式料理餐廳。 In this case, the customer inquired about Ding Tai Fung Restaurant, but it was recorded in the industrial store’s home database 41 that there was no Ding Tai Fung Restaurant in the customer’s registered information. After comparing the synonym dictionary files, it was found that the synonyms of Yebei Restaurant contained Xiao Long Bao and cuisine, so In the definition of the signature name, Ding Tai Fung Restaurant = Ding Tai Fung Xiao Long Bao and Ding Tai Fung Restaurant = Ding Tai Feng Thai Restaurant.
一般而言,在同一條路上要出現兩筆同音異字的招牌名稱,機率是相當低,很少會有這樣的情形,此例屬於極端的案例,主要是展示本方法可解決客戶不清楚店家名稱、不清楚店家所在路名的情況下,亦可找出相關店家。甚至客戶開車出門在不知道自己位置的情況下,本發明透過查詢參數可判斷出客戶是透過手機門號查詢,可透過基地台的註冊表,可反查基地台所在資訊,間接得知客戶所在的 區域,例如『台北市大安區』。若客戶透過市話進線,則可更精準的知道裝機地址,可更精準的協助客戶,進行的地理位置附近的店家檢索。 Generally speaking, the probability of two signs with homophones and different characters appearing on the same road is quite low. This is rarely the case. This example is an extreme case, mainly to show that this method can solve the customer's unclear store name , If you don’t know the name of the street where the store is located, you can also find the relevant store. Even when the customer drives out without knowing his location, the present invention can determine that the customer is inquiring through the phone number through the query parameters. The registration form of the base station can be used to check the information of the base station and indirectly know where the customer is. of Area, such as "Da'an District, Taipei City". If the customer enters the line through the local call, they can know the installation address more accurately, and can more accurately assist the customer to search for stores near the geographic location.
接下來將這些同音異字的招牌名稱(即該關聯工商店家資料),輸入至第一關聯店家推薦模組44,透過內建的IDF演算法統計分析這些同音異字的招牌名稱在歷史查詢資料庫43內的被查詢頻率,最後得到結果Z{店家名稱、店家名稱相似度、店家登記電話}的集合,並依照IDF分數來排序,如下列表2所示。 Next, input the name of the signboard of these homophones (that is, the store data of the associated shop) into the first associated store recommendation module 44, and use the built-in IDF algorithm to statistically analyze the signboard names of these homophones in the historical query database 43 The frequency of being queried within, and finally the result Z{store name, store name similarity, store registration phone number} set, and sorted according to IDF score, as shown in Table 2 below.
上述案例產出的結果為{Z1=鼎泰豐小籠包,Z2=10.38},{Z'1=頂泰風泰式料理,Z'2=7.11};其中{Z1=鼎泰豐小籠包,Z2是IDF數值=10.38}的分數較高,透過綜合評比之後,假設Z1及Z2都超過預設門檻值,第一關
聯店家推薦模組44會將Z1的店家名稱以及此店家登記的電話結果送至中控模組11,並透過文字轉語音模組5播放訊息:『您好,您剛詢問位在{新北市}{中和區}{中山路}的{鼎泰豐餐廳},登記的電話是{02-XXXX-XXXX}』,另外一筆較為相近的是在{新北市}{中和區}{中山路}的{頂泰風泰式料理},登記的電話是{02-XXXX-YYYY}。
The output of the above case is {Z1=Ding Tai Fung Xiao Long Bao, Z2=10.38}, {Z'1=Ding Tai Feng Thai Cuisine, Z'2=7.11}; where {Z1=Ding Tai Feng Xiao Long Bao, Z2 is IDF value=10.38} has a higher score. After the comprehensive evaluation, assuming that both Z1 and Z2 exceed the preset threshold, the first level
The joint store recommendation module 44 will send the Z1 store name and the phone number registered by this store to the
由上可知,本發明透過互動裝置進行自動化對答,主動對客戶端問候,並取得客戶端的查詢資訊,降低客戶端等候太久的不良感受,後續再透過語音辨識裝置、關鍵字分析判定裝置及大數據分析裝置將客戶端的查詢資料的自動化分析取得該查詢資訊所對應的推薦店家資料,以改善客服人員訓練程度不一,容易造成查詢錯誤,且人員離職流動率高,造成學習成本大幅提高的問題,且該大數據分析裝置中的反饋資訊模組係同時將系統及客服人員的處理結果回饋至歷史查詢資料庫中,藉此提高由該歷史查詢資料庫中產出推薦店家資料的正確率。 It can be seen from the above that the present invention uses the interactive device to automate the answering, actively greets the client, and obtains the client's query information, reducing the client's bad feeling of waiting too long, and then using the voice recognition device, the keyword analysis and determination device and the big The data analysis device automatically analyzes the query data of the client to obtain the recommended store data corresponding to the query information, so as to improve the problem of different training levels of customer service personnel, which is easy to cause query errors, and the turnover rate of personnel is high, resulting in a significant increase in learning costs And the feedback information module in the big data analysis device simultaneously feeds back the processing results of the system and customer service personnel to the historical query database, thereby improving the accuracy of recommended store data generated from the historical query database.
1‧‧‧互動裝置 1‧‧‧Interactive device
2‧‧‧語音辨識裝置 2‧‧‧Voice recognition device
3‧‧‧關鍵字分析判定裝置 3‧‧‧Keyword analysis and judgment device
4‧‧‧大數據分析裝置 4‧‧‧Big data analysis device
5‧‧‧文字轉語音模組 5‧‧‧Text-to-speech module
6‧‧‧客戶端 6‧‧‧Client
7‧‧‧交換機 7‧‧‧Switch
8‧‧‧客服人員 8‧‧‧Customer Service Staff
11‧‧‧中控模組 11‧‧‧Central Control Module
12‧‧‧互動模組 12‧‧‧Interactive Module
21‧‧‧信號處理模組 21‧‧‧Signal processing module
22‧‧‧語言分類模組 22‧‧‧Language Classification Module
23‧‧‧語音轉文字模組 23‧‧‧Voice to text module
24‧‧‧發音字典檔 24‧‧‧ pronunciation dictionary file
31‧‧‧字典檔資料庫 31‧‧‧Dictionary file database
32‧‧‧語意分析模組 32‧‧‧Semantic Analysis Module
33‧‧‧分詞處理模組 33‧‧‧Word Segmentation Processing Module
34‧‧‧關鍵字擷取模組 34‧‧‧Keyword Extraction Module
41‧‧‧工商店家資料庫 41‧‧‧Workshop Home Database
42‧‧‧工商店家比對模組 42‧‧‧Industry shop home comparison module
43‧‧‧歷史查詢資料庫 43‧‧‧History query database
44‧‧‧第一關聯店家推薦模組 44‧‧‧The first related store recommendation module
45‧‧‧第二關聯店家推薦模組 45‧‧‧Recommendation module of the second related store
46‧‧‧反饋資訊模組 46‧‧‧Feedback Information Module
100‧‧‧系統 100‧‧‧System
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW107139680A TWI698756B (en) | 2018-11-08 | 2018-11-08 | System for inquiry service and method thereof |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW107139680A TWI698756B (en) | 2018-11-08 | 2018-11-08 | System for inquiry service and method thereof |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW202018529A TW202018529A (en) | 2020-05-16 |
| TWI698756B true TWI698756B (en) | 2020-07-11 |
Family
ID=71895769
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW107139680A TWI698756B (en) | 2018-11-08 | 2018-11-08 | System for inquiry service and method thereof |
Country Status (1)
| Country | Link |
|---|---|
| TW (1) | TWI698756B (en) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2021241602A1 (en) * | 2020-05-28 | 2021-12-02 | Jfeスチール株式会社 | Information search system |
| MX2022014970A (en) | 2020-05-28 | 2023-01-11 | Jfe Steel Corp | Information retrieval system. |
| TWI766457B (en) * | 2020-11-27 | 2022-06-01 | 國立臺北護理健康大學 | Analysis system and upload method for language |
| TWI759003B (en) * | 2020-12-10 | 2022-03-21 | 國立成功大學 | Method for training a speech recognition model |
| TWI799835B (en) * | 2021-04-16 | 2023-04-21 | 兆豐國際商業銀行股份有限公司 | Electronic device and method of assigning service bank branch for store |
| TWI841866B (en) * | 2021-09-14 | 2024-05-11 | 中國信託商業銀行股份有限公司 | Business handling willingness determination method and computing device thereof |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101939740A (en) * | 2007-12-11 | 2011-01-05 | 声钰科技 | Providing a natural language voice user interface in an integrated language navigation service environment |
| TW201115368A (en) * | 2009-10-28 | 2011-05-01 | Chunghwa Telecom Co Ltd | Business information system of interest index using voice for number dearching and method thereof |
| CN103049853A (en) * | 2012-12-19 | 2013-04-17 | 胡绍珠 | Identifying device and verification method for authenticity of shop |
| US8756248B1 (en) * | 2012-06-26 | 2014-06-17 | C. Joseph Rickrode | Rapid access information database (RAID) system and method for mobile entity data aggregation |
| TW201434003A (en) * | 2013-01-31 | 2014-09-01 | 微軟公司 | Activity graphs |
-
2018
- 2018-11-08 TW TW107139680A patent/TWI698756B/en not_active IP Right Cessation
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101939740A (en) * | 2007-12-11 | 2011-01-05 | 声钰科技 | Providing a natural language voice user interface in an integrated language navigation service environment |
| TW201115368A (en) * | 2009-10-28 | 2011-05-01 | Chunghwa Telecom Co Ltd | Business information system of interest index using voice for number dearching and method thereof |
| US8756248B1 (en) * | 2012-06-26 | 2014-06-17 | C. Joseph Rickrode | Rapid access information database (RAID) system and method for mobile entity data aggregation |
| CN103049853A (en) * | 2012-12-19 | 2013-04-17 | 胡绍珠 | Identifying device and verification method for authenticity of shop |
| TW201434003A (en) * | 2013-01-31 | 2014-09-01 | 微軟公司 | Activity graphs |
Also Published As
| Publication number | Publication date |
|---|---|
| TW202018529A (en) | 2020-05-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI698756B (en) | System for inquiry service and method thereof | |
| US11341970B2 (en) | Personal directory service | |
| US10431214B2 (en) | System and method of determining a domain and/or an action related to a natural language input | |
| US9742912B2 (en) | Method and apparatus for predicting intent in IVR using natural language queries | |
| US20030191639A1 (en) | Dynamic and adaptive selection of vocabulary and acoustic models based on a call context for speech recognition | |
| CN110266900B (en) | Method and device for identifying customer intention and customer service system | |
| US20040024754A1 (en) | System and method for automatically processing a user's request by an automated assistant | |
| US7680661B2 (en) | Method and system for improved speech recognition | |
| US20080071544A1 (en) | Integrating Voice-Enabled Local Search and Contact Lists | |
| US20030149566A1 (en) | System and method for a spoken language interface to a large database of changing records | |
| US7627096B2 (en) | System and method for independently recognizing and selecting actions and objects in a speech recognition system | |
| US20060067508A1 (en) | Methods and apparatus for processing foreign accent/language communications | |
| US20020152207A1 (en) | Method and apparatus for automatically processing a user's communication | |
| US20050004799A1 (en) | System and method for a spoken language interface to a large database of changing records | |
| CN110198381B (en) | Method, device, voice assistant, terminal, computer equipment and computer readable storage medium for identifying AI incoming call | |
| US8428241B2 (en) | Semi-supervised training of destination map for call handling applications | |
| US7401023B1 (en) | Systems and methods for providing automated directory assistance using transcripts | |
| US20090019027A1 (en) | Disambiguating residential listing search results | |
| US8423354B2 (en) | Speech recognition dictionary creating support device, computer readable medium storing processing program, and processing method | |
| CN113822029A (en) | Customer service assistance method, device and system | |
| Callejas et al. | Implementing modular dialogue systems: a case study | |
| US11563846B1 (en) | System and method for predicting intelligent voice assistant content | |
| KR20090071635A (en) | Navigate your business listing | |
| CN112860724A (en) | Automatic address deviation rectifying method for man-machine integration customer service system | |
| JP3621922B2 (en) | Sentence recognition apparatus, sentence recognition method, program, and medium |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MM4A | Annulment or lapse of patent due to non-payment of fees |