TWI901377B - Information recommendation system and method - Google Patents
Information recommendation system and methodInfo
- Publication number
- TWI901377B TWI901377B TW113137581A TW113137581A TWI901377B TW I901377 B TWI901377 B TW I901377B TW 113137581 A TW113137581 A TW 113137581A TW 113137581 A TW113137581 A TW 113137581A TW I901377 B TWI901377 B TW I901377B
- Authority
- TW
- Taiwan
- Prior art keywords
- information
- data
- feature data
- information feature
- vector
- Prior art date
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
一種資訊推薦系統包含一處理單元及一與該處理單元電連接且儲存有一資訊資料庫的儲存單元,該資訊資料庫包含多筆候選資訊及多筆對應的資訊特徵資料。該處理單元用於:對一需求描述資料執行一向量轉換處理以產生一以向量形式呈現的需求特徵資料;根據每一資訊特徵資料包含的多個資訊特徵向量,計算每一資訊特徵資料與該需求特徵資料之間的關聯度,並從該等資訊特徵資料中選出一最匹配資訊特徵資料;將該最匹配資訊特徵資料所對應的該候選資訊傳送至一對應於該需求描述資料的使用端電子裝置。An information recommendation system includes a processing unit and a storage unit electrically connected to the processing unit and storing an information database containing multiple candidate information and corresponding information feature data. The processing unit is configured to: perform a vector conversion process on a requirement description to generate requirement feature data in vector form; calculate the correlation between each piece of information feature data and the requirement feature data based on multiple information feature vectors contained in each piece of information feature data; select the most matching piece of information feature data from the information feature data; and transmit the candidate information corresponding to the most matching piece of information feature data to a user electronic device corresponding to the requirement description data.
Description
本發明是有關於一種推薦系統,特別是指一種用於透過網路推薦資訊的資訊推薦系統。本發明還有關於一種用於透過網路推薦資訊的資訊推薦方法。The present invention relates to a recommendation system, and more particularly to an information recommendation system for recommending information via the Internet. The present invention also relates to an information recommendation method for recommending information via the Internet.
在現代社會中,許多營業機構會使用具備自動化推薦功能的推薦系統來向使用者推薦各種數位資訊,例如數位文章、廣告文案以及產品介紹。因此,要如何進一步改善推薦系統的推薦準確度,便成為一個值得探討的問題。In modern society, many businesses use automated recommendation systems to recommend various digital information to users, such as digital articles, advertising copy, and product introductions. Therefore, how to further improve the accuracy of recommendation systems has become a question worth exploring.
因此,本發明的其中一目的,便在於提供一種有助於提升推薦準確度的資訊推薦系統。Therefore, one of the objectives of the present invention is to provide an information recommendation system that helps improve the accuracy of recommendations.
本發明資訊推薦系統包含一處理單元及一與該處理單元電連接的儲存單元。該儲存單元儲存有一資訊資料庫,該資訊資料庫包含多筆候選資訊,以及多筆分別對應於該等候選資訊的資訊特徵資料,每一資訊特徵資料包括多個資訊特徵向量,且每一資訊特徵向量表示該資訊特徵資料所對應之該候選資訊的特徵。其中,該處理單元用於:在獲得一需求描述資料之後,對該需求描述資料執行一向量轉換處理,以產生一以向量形式呈現且表示該需求描述資料之特徵的需求特徵資料;根據每一資訊特徵資料的該等資訊特徵向量,計算每一資訊特徵資料與該需求特徵資料之間的關聯度,並將該等資訊特徵資料中與該需求特徵資料之關聯度最高的該資訊特徵資料作為一最匹配資訊特徵資料;將該最匹配資訊特徵資料所對應的該候選資訊傳送至一對應於該需求描述資料的使用端電子裝置。The information recommendation system of the present invention includes a processing unit and a storage unit electrically connected to the processing unit. The storage unit stores an information database containing a plurality of candidate information and a plurality of information feature data corresponding to each candidate information. Each piece of information feature data includes a plurality of information feature vectors, and each information feature vector represents a characteristic of the candidate information to which the information feature data corresponds. The processing unit is used to: after obtaining a requirement description data, perform a vector conversion process on the requirement description data to generate requirement feature data presented in vector form and representing the characteristics of the requirement description data; calculate the correlation between each information feature data and the requirement feature data based on the information feature vectors of each information feature data, and use the information feature data with the highest correlation with the requirement feature data among the information feature data as the most matching information feature data; and transmit the candidate information corresponding to the most matching information feature data to a user-end electronic device corresponding to the requirement description data.
在本發明資訊推薦系統的一些實施態樣中,該需求描述資料為一由該使用端電子裝置所傳送的即時文字訊息,該需求特徵資料為一個用於描述該即時文字訊息之語意的語意向量,並且,對於每一資訊特徵資料,該處理單元是根據該資訊特徵資料的每一資訊特徵向量及該語意向量,計算出多個分別對應於該等資訊特徵向量的相似度分數,再根據該等相似度分數的平均值計算出該資訊特徵資料與該語意向量之間的關聯度,其中,每一相似度分數表示該相似度分數所對應之該資訊特徵向量與該語意向量之間的相似度。In some embodiments of the information recommendation system of the present invention, the demand description data is an instant text message transmitted by the user electronic device, and the demand feature data is a semantic vector used to describe the semantics of the instant text message. Moreover, for each information feature data, the processing unit calculates multiple similarity scores corresponding to each information feature vector and the semantic vector of the information feature data, and then calculates the correlation between the information feature data and the semantic vector based on the average value of the similarity scores, wherein each similarity score represents the similarity between the information feature vector corresponding to the similarity score and the semantic vector.
在本發明資訊推薦系統的一些實施態樣中,該處理單元在選出該最匹配資訊特徵資料之後,還判斷該最匹配資訊特徵資料與該語意向量的關聯度是否大於等於一關聯度門檻值,並且,該處理單元是在判定該最匹配資訊特徵資料與該語意向量的關聯度大於等於該關聯度門檻值的情況下,才將該最匹配資訊特徵資料所對應的該候選資訊傳送至該使用端電子裝置。In some implementations of the information recommendation system of the present invention, after selecting the best-matching information feature data, the processing unit also determines whether the correlation between the best-matching information feature data and the semantic vector is greater than or equal to a correlation threshold value, and only when the processing unit determines that the correlation between the best-matching information feature data and the semantic vector is greater than or equal to the correlation threshold value does it transmit the candidate information corresponding to the best-matching information feature data to the user-end electronic device.
在本發明資訊推薦系統的一些實施態樣中,該儲存單元還儲存有一門檻值資料,該門檻值資料包含多個門檻值參數,以及多個分別對應於該等門檻值參數的主題類型資料。該處理單元在選出該最匹配資訊特徵資料之後,還從該等主題類型資料中選出其中一筆與該即時文字訊息的語意匹配度最高的最匹配主題類型資料,並將該最匹配主題類型資料所對應之該門檻值參數的數值作為該關聯度門檻值。In some embodiments of the information recommendation system of the present invention, the storage unit further stores threshold data, which includes multiple threshold parameters and multiple topic type data corresponding to the threshold parameters. After selecting the most matching information feature data, the processing unit further selects a most matching topic type data from the topic type data that has the highest semantic match with the instant text message, and uses the value of the threshold parameter corresponding to the most matching topic type data as the relevance threshold.
在本發明資訊推薦系統的一些實施態樣中,該需求描述資料為一被包含於一客戶資料庫且對應於該使用端電子裝置的客戶屬性資料,該需求特徵資料包含多個用於描述客戶偏好的偏好特徵向量,並且,對於每一資訊特徵資料,該處理單元是根據該資訊特徵資料的該等資訊特徵向量及該需求特徵資料的該等偏好特徵向量,計算出多個各自對應於該等資訊特徵向量之其中一者及該等偏好特徵向量之其中一者的相似度分數,再根據該等相似度分數的平均值計算出該資訊特徵資料與該需求特徵資料之間的關聯度,其中,每一相似度分數表示該相似度分數所對應之該資訊特徵向量與該相似度分數所對應之該需求特徵資料之間的相似度。In some embodiments of the information recommendation system of the present invention, the demand description data is customer attribute data contained in a customer database and corresponding to the user-end electronic device, and the demand feature data includes multiple preference feature vectors for describing customer preferences. Moreover, for each information feature data, the processing unit calculates multiple similarity scores corresponding to one of the information feature vectors and one of the preference feature vectors based on the information feature vectors of the information feature data and the preference feature vectors of the demand feature data, and then calculates the correlation between the information feature data and the demand feature data based on the average value of the similarity scores, wherein each similarity score represents the similarity between the information feature vector corresponding to the similarity score and the demand feature data corresponding to the similarity score.
本發明的另一目的,在於提供一種有助於提升推薦準確度的資訊推薦方法。Another object of the present invention is to provide an information recommendation method that helps improve recommendation accuracy.
本發明資訊推薦方法由一資訊推薦系統實施,該資訊推薦系統儲存有一資訊資料庫,該資訊資料庫包含多筆候選資訊,以及多筆分別對應於該等候選資訊的資訊特徵資料,每一資訊特徵資料包括多個資訊特徵向量,且每一資訊特徵向量表示該資訊特徵資料所對應之該候選資訊的特徵。該資訊推薦方法包含:(A)在獲得一需求描述資料之後,對該需求描述資料執行一向量轉換處理,以產生一以向量形式呈現且表示該需求描述資料之特徵的需求特徵資料;(B)根據每一資訊特徵資料的該等資訊特徵向量,計算每一資訊特徵資料與該需求特徵資料之間的關聯度,並將該等資訊特徵資料中與該需求特徵資料之關聯度最高的該資訊特徵資料作為一最匹配資訊特徵資料;(C)將該最匹配資訊特徵資料所對應的該候選資訊傳送至一對應於該需求描述資料的使用端電子裝置。The information recommendation method of the present invention is implemented by an information recommendation system. The information recommendation system stores an information database, which includes multiple candidate information and multiple information feature data corresponding to the candidate information. Each information feature data includes multiple information feature vectors, and each information feature vector represents the characteristics of the candidate information corresponding to the information feature data. The information recommendation method includes: (A) after obtaining a requirement description data, performing a vector conversion process on the requirement description data to generate requirement feature data presented in vector form and representing the characteristics of the requirement description data; (B) based on the information feature vectors of each information feature data, calculating the correlation between each information feature data and the requirement feature data, and determining the information feature data with the highest correlation with the requirement feature data among the information feature data as the best matching information feature data; and (C) transmitting the candidate information corresponding to the best matching information feature data to a user-end electronic device corresponding to the requirement description data.
在本發明資訊推薦方法的一些實施態樣中,在步驟(A)中,該需求描述資料為一由該使用端電子裝置所傳送的即時文字訊息。在步驟(B)中,該需求特徵資料為一個用於描述該即時文字訊息之語意的語意向量,並且,對於每一資訊特徵資料,該資訊推薦系統是根據該資訊特徵資料的每一資訊特徵向量及該語意向量,計算出多個分別對應於該等資訊特徵向量的相似度分數,再根據該等相似度分數的平均值計算出該資訊特徵資料與該語意向量之間的關聯度,其中,每一相似度分數表示該相似度分數所對應之該資訊特徵向量與該語意向量之間的相似度。In some embodiments of the information recommendation method of the present invention, in step (A), the demand description data is an instant text message sent by the user electronic device. In step (B), the demand feature data is a semantic vector used to describe the semantics of the instant text message. For each piece of information feature data, the information recommendation system calculates multiple similarity scores corresponding to each information feature vector and the semantic vector of the information feature data, and then calculates the correlation between the information feature data and the semantic vector based on the average of the similarity scores, wherein each similarity score represents the similarity between the information feature vector corresponding to the similarity score and the semantic vector.
在本發明資訊推薦方法的一些實施態樣中,該資訊推薦方法還包含在(B)及(C)之間的:(D)判斷該最匹配資訊特徵資料與該語意向量的關聯度是否大於等於一關聯度門檻值。在步驟(C)中,該資訊推薦系統是在判定該最匹配資訊特徵資料與該語意向量的關聯度大於等於該關聯度門檻值的情況下,才將該最匹配資訊特徵資料所對應的該候選資訊傳送至該使用端電子裝置。In some embodiments of the information recommendation method of the present invention, the method further includes, between (B) and (C): (D) determining whether the correlation between the best-matching information feature data and the semantic vector is greater than or equal to a correlation threshold. In step (C), the information recommendation system transmits the candidate information corresponding to the best-matching information feature data to the user electronic device only if the correlation between the best-matching information feature data and the semantic vector is determined to be greater than or equal to the correlation threshold.
在本發明資訊推薦方法的一些實施態樣中,該資訊推薦系統還儲存有一門檻值資料,該門檻值資料包含多個門檻值參數,以及多個分別對應於該等門檻值參數的主題類型資料;該資訊推薦方法還包含在(B)及(D)之間的:(E)從該等主題類型資料中選出其中一筆與該即時文字訊息的語意匹配度最高的最匹配主題類型資料,並將該最匹配主題類型資料所對應之該門檻值參數的數值作為該關聯度門檻值。In some embodiments of the information recommendation method of the present invention, the information recommendation system further stores threshold data, which includes multiple threshold parameters and multiple topic type data corresponding to the threshold parameters. The information recommendation method further includes, between (B) and (D): (E) selecting, from the topic type data, a best-matching topic type data that has the highest semantic match with the instant text message, and using the value of the threshold parameter corresponding to the best-matching topic type data as the relevance threshold.
在本發明資訊推薦方法的一些實施態樣中,在步驟(A)中,該需求描述資料為一被包含於一客戶資料庫且對應於該使用端電子裝置的客戶屬性資料。在步驟(B)中,該需求特徵資料包含多個用於描述客戶偏好的偏好特徵向量,並且,對於每一資訊特徵資料,該資訊推薦系統是根據該資訊特徵資料的該等資訊特徵向量及該需求特徵資料的該等偏好特徵向量,計算出多個各自對應於該等資訊特徵向量之其中一者及該等偏好特徵向量之其中一者的相似度分數,再根據該等相似度分數的平均值計算出該資訊特徵資料與該需求特徵資料之間的關聯度,其中,每一相似度分數表示該相似度分數所對應之該資訊特徵向量與該相似度分數所對應之該需求特徵資料之間的相似度。In some implementations of the information recommendation method of the present invention, in step (A), the demand description data is customer attribute data contained in a customer database and corresponding to the user-end electronic device. In step (B), the demand feature data includes multiple preference feature vectors for describing customer preferences, and, for each information feature data, the information recommendation system calculates multiple similarity scores corresponding to one of the information feature vectors and one of the preference feature vectors based on the information feature vectors of the information feature data and the preference feature vectors of the demand feature data, and then calculates the correlation between the information feature data and the demand feature data based on the average value of the similarity scores, wherein each similarity score represents the similarity between the information feature vector corresponding to the similarity score and the demand feature data corresponding to the similarity score.
本發明之功效在於:該資訊推薦系統能在獲得該需求描述資料之後,先將其轉換為向量形式的該需求特徵資料,再將該需求特徵資料與該等候選資訊所對應的該等資訊特徵資料進行關聯度計算,以選出關聯度最高的該最匹配資訊特徵資料,並將該最匹配資訊特徵資料所對應的該候選資訊傳送至該使用端電子裝置,以實現資訊推薦。如此一來,即便該需求描述資料中僅存在與候選資訊的內容相同或相近的詞語,而不存在與候選資訊的內容完全相符的詞語,該資訊推薦系統亦能利用該語言處理模型選出與該需求描述資料之語意關聯度最高的候選資訊來進行推薦,因此,相較於針對每一候選資訊建立對應之詞袋模型(Bag-of-words model)的作法,本發明不會因為該需求描述資料中缺少完全相符的關鍵詞而錯過實際上適合被推薦的候選資訊,故有助於改善資訊推薦的準確度。The utility of the present invention lies in that, after obtaining the demand description data, the information recommendation system can first convert it into the demand feature data in vector form, then calculate the correlation between the demand feature data and the information feature data corresponding to the candidate information to select the most relevant and best-matching information feature data, and then transmit the candidate information corresponding to the best-matching information feature data to the user electronic device to achieve information recommendation. In this way, even if the requirement description data only contains words that are identical or similar to the content of the candidate information, but no words that completely match the content of the candidate information, the information recommendation system can use the language processing model to select the candidate information with the highest semantic relevance to the requirement description data for recommendation. Therefore, compared to the approach of establishing a corresponding bag-of-words model for each candidate information, the present invention will not miss out on candidate information that is actually suitable for recommendation due to the lack of completely matching keywords in the requirement description data, thereby helping to improve the accuracy of information recommendations.
在本發明被詳細描述之前應當注意:在未特別定義的情況下,本專利說明書中所述的「電連接(electrically connected)」是用來描述電腦硬體(例如電子系統、設備、裝置、單元、元件)之間的「耦接(coupled)」關係,且泛指複數電腦硬體之間透過導體/半導體材料彼此實體相連而實現的「有線電連接」,以及利用無線通訊技術(例如但不限於無線網路、藍芽及電磁感應等)而實現無線資料傳輸的「無線電連接」。另一方面,在未特別定義的情況下,本專利說明書中所述的「電連接」也泛指複數電腦硬體之間彼此直接耦接而實現的「直接電連接」,以及複數電腦硬體之間是透過其他電腦硬體間接耦接而實現的「間接電連接」。Before the present invention is described in detail, it should be noted that, unless otherwise specified, the term "electrically connected" as used in this patent specification is used to describe the "coupled" relationship between computer hardware (e.g., electronic systems, devices, apparatuses, units, components), and generally refers to "wired electrical connections" achieved by physically connecting multiple computer hardware components to each other through conductive/semiconductor materials, as well as "radio connections" achieved by wireless data transmission using wireless communication technologies (such as, but not limited to, wireless networks, Bluetooth, and electromagnetic induction). On the other hand, unless otherwise specified, the "electrical connection" described in this patent specification also generally refers to a "direct electrical connection" achieved by directly coupling multiple computer hardware components to each other, and an "indirect electrical connection" achieved by indirectly coupling multiple computer hardware components through other computer hardware components.
本專利說明書中所述的「單元(unit)」是代表電腦硬體而非軟體,舉例來說,「處理單元」是用來代表具備資料處理功能的電腦硬體。另一方面,本專利說明書中所述的「單元」可以是指具備特定功能的單一個電腦硬體,也可以是指具備類似功能的一群電腦硬體。舉例來說,「處理單元」可以是指具備資料處理功能的單一個處理器,但也可以是指一群處理器的集合。The term "unit" as used in this patent specification refers to computer hardware, not software. For example, a "processing unit" is used to refer to computer hardware that performs data processing. On the other hand, a "unit" as used in this patent specification can refer to a single piece of computer hardware that performs a specific function, or a group of computer hardware that performs similar functions. For example, a "processing unit" can refer to a single processor that performs data processing, but it can also refer to a group of processors.
參閱圖1,本發明資訊推薦系統1的一實施例歸屬於一服務機構,而且,本實施例的該資訊推薦系統1適用於與圖1所示的一使用端電子裝置5配合應用。Referring to FIG. 1 , an embodiment of the information recommendation system 1 of the present invention belongs to a service organization. Moreover, the information recommendation system 1 of this embodiment is suitable for use in conjunction with a user-end electronic device 5 shown in FIG. 1 .
在本實施例的應用環境中,該服務機構例如是一家提供金融服務的商業銀行。然而,該服務機構也可以是提供不同服務的其他種類的營業機構,因此,本實施例的應用並不以前述所舉之例為限。In the application environment of this embodiment, the service institution is, for example, a commercial bank that provides financial services. However, the service institution may also be other types of business institutions that provide different services. Therefore, the application of this embodiment is not limited to the aforementioned example.
該使用端電子裝置5例如是一台手機、平板電腦、筆記型電腦或者桌上型電腦,而歸屬於該服務機構的一位客戶。補充說明的是,該資訊推薦系統1實際上能與多台使用端電子裝置配合應用,然而,為了便於描述,本實施例僅會以圖1所示的該使用端電子裝置5進行說明。The user-end electronic device 5 is, for example, a mobile phone, tablet computer, laptop computer, or desktop computer, and belongs to a customer of the service organization. It should be noted that the information recommendation system 1 can actually be used with multiple user-end electronic devices. However, for ease of description, this embodiment will only be described using the user-end electronic device 5 shown in FIG. 1 .
在本實施例中,該資訊推薦系統1是一台伺服設備,而且,該資訊推薦系統1包含一處理單元11,以及一電連接該處理單元11的儲存單元12。In this embodiment, the information recommendation system 1 is a server device, and the information recommendation system 1 includes a processing unit 11 and a storage unit 12 electrically connected to the processing unit 11.
在本實施例中,該處理單元11是一個以積體電路實現且具備資料運算及指令收發功能的處理器。然而,在不同的實施態樣中,該處理單元11也可以是一包括有處理器及電路板的電路組件,或者是多個處理器的集合。該儲存單元12是一台用於儲存數位資料的資料儲存裝置(例如硬碟)。然而,在不同的實施態樣中,該儲存單元12也可以是多個相同或相異種類之儲存裝置的集合。In this embodiment, the processing unit 11 is a processor implemented as an integrated circuit and having data processing and instruction transmission and reception functions. However, in different embodiments, the processing unit 11 may also be a circuit assembly including a processor and a circuit board, or a collection of multiple processors. The storage unit 12 is a data storage device (such as a hard drive) for storing digital data. However, in different embodiments, the storage unit 12 may also be a collection of multiple storage devices of the same or different types.
進一步地,在其他實施例中,該資訊推薦系統1也可被實施為多台彼此電連接的伺服設備,在此情況下,該處理單元11可被實施為該等伺服設備所分別具有之多個處理器/電路組件的集合,而該儲存單元12則可被實施為該等伺服設備所分別具有之多個儲存裝置的集合。基於上述,該資訊推薦系統1在電腦硬體方面的實際實施態樣並不以本實施例為限。Furthermore, in other embodiments, the information recommendation system 1 may also be implemented as multiple electrically connected server devices. In this case, the processing unit 11 may be implemented as a collection of multiple processors/circuit components included in each of the server devices, and the storage unit 12 may be implemented as a collection of multiple storage devices included in each of the server devices. Based on the above, the actual implementation of the information recommendation system 1 in terms of computer hardware is not limited to this embodiment.
該儲存單元12儲存有一語言處理模型M、一資訊資料庫D1、一客戶資料庫D2,以及一門檻值資料D3。The storage unit 12 stores a language processing model M, an information database D1, a customer database D2, and threshold data D3.
該語言處理模型M是一個利用深度學習技術實現且具備自然語言理解功能的大型語言模型(Large Language Model,簡稱LLM)。然而,在不同的實施例中,該語言處理模型M只要是可以利用句嵌入(Sentence Embedding)、詞嵌入(Word Embedding)或文章嵌入(Chapter Embedding)方式來將文字訊息轉換成文意向量的模型即可實施,而並不以大型語言模型為限。The language processing model M is a large language model (LLM) implemented using deep learning technology and capable of natural language understanding. However, in various embodiments, the language processing model M can be any model that can convert text messages into contextual vectors using sentence embedding, word embedding, or chapter embedding, and is not limited to a large language model.
該資訊資料庫D1包含多筆候選資訊D11,以及多筆分別對應於該等候選資訊D11的資訊特徵資料D12。The information database D1 includes a plurality of candidate information D11 and a plurality of information feature data D12 corresponding to the candidate information D11.
每一候選資訊D11的資料形式可以是一文字檔案、一靜態影像檔案、一動態影像檔案、一聲音檔案,或者是一同時包含文字、靜態影像、動態影像及聲音之其中多者的多媒體檔案。並且,每一候選資訊D11的內容例如包含數位文章、數位廣告、服務資訊、產品介紹的其中一或多者,但並不以此為限。Each candidate information D11 may be in the form of a text file, a still image file, a moving image file, an audio file, or a multimedia file containing multiple of these. Furthermore, the content of each candidate information D11 may include, but is not limited to, one or more of a digital article, a digital advertisement, service information, or a product introduction.
舉例來說,該等候選資訊D11中的其中兩筆候選資訊D11例如分別被作為本實施例的一第一候選資訊D11A及一第二候選資訊D11B(示於圖1)。其中,該第一候選資訊D11A可例如是一篇主題為「美元進入緩升格局,商品貨幣等待政策等基本面因素發酵」的數位文章,該第二候選資訊D11B可例如是一篇主題為「出口管制短期對手機與射頻產業影響有限」的數位文章。For example, two pieces of candidate information D11 from the candidate information D11 are designated as first candidate information D11A and second candidate information D11B in this embodiment (shown in FIG1 ). The first candidate information D11A may be a digital article titled "The US dollar enters a slow appreciation pattern, and commodity currencies await the development of policy and other fundamental factors." The second candidate information D11B may be a digital article titled "Export controls have limited short-term impact on the mobile phone and radio frequency industries."
每一資訊特徵資料D12包括多個資訊特徵向量V12(示例性地示於圖3),其中,每一資訊特徵向量V12是用於表示該資訊特徵資料D12所對應之該候選資訊D11的特徵。更具體地說,每一資訊特徵向量V12是一個七百六十維(但不限於此)且能被該語言處理模型M所處理的文意向量,而用來描述對應之該候選資訊D11的內容。延續前例,該第一候選資訊D11A所對應的該資訊特徵資料D12被作為一第一資訊特徵資料D12A,該第一資訊特徵資料D12A例如包括四個資訊特徵向量V12,且該四個資訊特徵向量V12例如是分別用來描述「澳幣」、「加幣」、「美元」及「貨幣政策」等四個與該第一候選資訊D11A對應的關鍵詞。該第二候選資訊D11B所對應的該資訊特徵資料D12被作為一第二資訊特徵資料D12B,該第二資訊特徵資料D12B例如包括三個資訊特徵向量V12,且該三個資訊特徵向量V12例如是分別用來描述「手機」、「半導體」及「貿易戰」等三個與該第二候選資訊D11B對應的關鍵詞。Each piece of information feature data D12 includes multiple information feature vectors V12 (illustrated in FIG3 ), each of which is used to represent the characteristics of the candidate information D11 to which the information feature data D12 corresponds. More specifically, each information feature vector V12 is a 760-dimensional (but not limited to) context vector that can be processed by the language processing model M and is used to describe the content of the corresponding candidate information D11. Continuing with the previous example, the information feature data D12 corresponding to the first candidate information D11A is used as a first information feature data D12A. The first information feature data D12A, for example, includes four information feature vectors V12, and the four information feature vectors V12 are used to describe the four keywords corresponding to the first candidate information D11A, such as "Australian dollar", "Canadian dollar", "US dollar" and "monetary policy". The information feature data D12 corresponding to the second candidate information D11B is used as a second information feature data D12B. The second information feature data D12B, for example, includes three information feature vectors V12, and the three information feature vectors V12 are used to describe the three keywords corresponding to the second candidate information D11B, such as "mobile phone", "semiconductor" and "trade war".
補充說明的是,每一候選資訊D11所對應的關鍵詞可例如是由一利用隱含狄利克雷分布(Latent Dirichlet allocation,簡稱LDA)的主題模型(Topic Model)所決定,或者是由各筆候選資訊D11所屬之相關領域的專業人員來決定。It should be noted that the keywords corresponding to each piece of candidate information D11 can be determined, for example, by a topic model using Latent Dirichlet allocation (LDA), or by professionals in the relevant field to which each piece of candidate information D11 belongs.
該客戶資料庫D2包含多筆分別對應於多位客戶的客戶屬性資料D21。The customer database D2 contains multiple customer attribute data D21 corresponding to multiple customers.
在本實施例中,每一客戶屬性資料D21例如包含一關注特徵部分D211及一資產特徵部分D212。In this embodiment, each customer attribute data D21 includes, for example, a focus feature portion D211 and an asset feature portion D212.
在本實施例中,該關注特徵部分D211指示出對應之客戶近期曾瀏覽過之金融商品的類型(例如「高科技」、「股票型指數基金」、「債券」、「太陽能」)、曾瀏覽過之線上文章的主題(例如「升息」、「房地產」、「貿易戰」、「美債」、「通膨」),以及曾點擊過之線上廣告的主題(例如「優惠」、「信貸」、「手續費」),但並不以此為限。In this embodiment, the attention feature portion D211 indicates the types of financial products that the corresponding customer has recently browsed (e.g., "high-tech," "stock index funds," "bonds," "solar energy"), the topics of online articles that the customer has browsed (e.g., "interest rate hikes," "real estate," "trade war," "U.S. debt," "inflation"), and the topics of online advertisements that the customer has clicked (e.g., "discounts," "credit," "handling fees"), but is not limited to these.
在本實施例中,該資產特徵部分D212指示出對應之客戶當前所持有之資產(包含外幣及各種投資型金融商品)的類型,例如「美元」、「日元」、「半導體」、「健護」,但並不以此為限。In this embodiment, the asset characteristics portion D212 indicates the type of assets (including foreign currencies and various investment-type financial products) currently held by the corresponding customer, such as "US dollars", "Japanese yen", "semiconductors", and "health care", but is not limited thereto.
該門檻值資料D3包含多個門檻值參數D31,以及多個分別對應於該等門檻值參數D31的主題類型資料D32。The threshold data D3 includes a plurality of threshold parameters D31 and a plurality of subject type data D32 corresponding to the threshold parameters D31.
在本實施例中,每一門檻值參數D31的數值介於0與1之間,而可例如為0.5、0.6、0.7等數值,但並不以此為限。In this embodiment, the value of each threshold parameter D31 is between 0 and 1, and may be, for example, 0.5, 0.6, 0.7, etc., but is not limited thereto.
該等主題類型資料D32例如分別指示出多個相關於該等候選資訊D11的主題類型。舉例來說,該等主題類型例如包含一金融類型、一電子商務類型、一數位媒體類型、一政府與公共服務類型,以及一健康與醫療類型。更明確地說,每一主題類型資料D32在本實施例中是一個文意向量,其用來描述該主題類型資料D32所對應的該主題類型,且能夠被該語言處理模型M所處理。The topic category data D32, for example, respectively indicates a plurality of topic categories related to the candidate information D11. For example, the topic categories include a finance category, an e-commerce category, a digital media category, a government and public services category, and a health and medical category. More specifically, in this embodiment, each topic category data D32 is a context vector that describes the topic category corresponding to the topic category data D32 and can be processed by the language processing model M.
特別說明的是,該等門檻值參數D31的數值例如是根據該等候選資訊D11中屬於每一種主題類型之候選資訊D11的數量而被設定。舉例來說,若該資訊資料庫D1中有相對較多的候選資訊D11是屬於該金融類型的資訊,表示在金融類型的資訊推薦方面,可利用相對較高的標準來要求推薦資訊時的準確度。在此情況下,對於指示出金融類型之該主題類型資料D32所對應的該門檻值參數D31,其可被設定成相對較高的數值,例如0.75。而若該資訊資料庫D1中僅有少數的候選資訊D11是屬於該健康與醫療類型的資訊,表示在健康與醫療類型的資訊推薦方面,僅能以相對較低的標準來要求推薦資訊時的準確度。在此情況下,對於指示出健康與醫療類型之該主題類型資料D32所對應的該門檻值參數D31,其可被設定成相對較低的數值,例如0.45。Specifically, the values of the threshold parameters D31 are set based on, for example, the number of candidate information D11 belonging to each topic type within the candidate information D11. For example, if a relatively large number of candidate information D11 belonging to the financial category in the information database D1 is financial, this indicates that a relatively high standard of accuracy can be applied to the recommendation of financial information. In this case, the threshold parameter D31 corresponding to the topic type data D32 indicating the financial category can be set to a relatively high value, such as 0.75. If only a small number of candidate information D11 in the information database D1 belongs to the health and medical category, this means that when recommending health and medical information, only a relatively low standard of accuracy can be required. In this case, the threshold parameter D31 corresponding to the topic type data D32 indicating the health and medical category can be set to a relatively low value, such as 0.45.
以下說明本實施例的該資訊推薦系統1如何實施一資訊推薦方法。The following describes how the information recommendation system 1 of this embodiment implements an information recommendation method.
在本實施例中,該資訊推薦方法包含一第一推薦程序及一第二推薦程序。同時參閱圖1及圖2,以下先說明該資訊推薦系統1如何執行該第一推薦程序。In this embodiment, the information recommendation method includes a first recommendation process and a second recommendation process. Referring to Figures 1 and 2, the following first describes how the information recommendation system 1 performs the first recommendation process.
首先,在步驟S11中,該處理單元11在獲得一對應於該使用端電子裝置5的需求描述資料之後,利用該語言處理模型M對該需求描述資料執行一向量轉換處理,以產生一以向量形式呈現且表示該需求描述資料之特徵的需求特徵資料。First, in step S11, after obtaining the requirement description data corresponding to the user-end electronic device 5, the processing unit 11 uses the language processing model M to perform a vector conversion process on the requirement description data to generate requirement feature data presented in vector form and representing the characteristics of the requirement description data.
為了便於描述,在此將該需求描述資料作為一第一需求描述資料,並且,在此還將該需求特徵資料作為圖3所示的一第一需求特徵資料D10。For the convenience of description, the requirement description data is referred to as a first requirement description data, and the requirement feature data is also referred to as a first requirement feature data D10 shown in FIG. 3 .
在本實施例中,該第一需求描述資料為一由該使用端電子裝置5根據使用者的操作所傳送的即時文字訊息,並且,該處理單元11是透過網路而從該使用端電子裝置5接收該第一需求描述資料。In this embodiment, the first demand description data is an instant text message sent by the user electronic device 5 according to the user's operation, and the processing unit 11 receives the first demand description data from the user electronic device 5 through the network.
舉例來說,該第一需求描述資料可例如是「該如何投資美金?」,或者是「iPhone相關股能買嗎?」,但並不以此為限。For example, the first requirement description data may be, for example, "How to invest in US dollars?" or "Can I buy iPhone-related stocks?", but is not limited thereto.
進一步地,在本實施例中,該處理單元11是利用該語言處理模型M以句嵌入的方式來執行該向量轉換處理,以將該第一需求描述資料(即使用者所輸入的即時文字訊息)轉換為向量形式的該第一需求特徵資料D10。更明確地說,如圖3所示,該第一需求特徵資料D10在本實施例中是一個維度為七百六十維且用於描述該即時文字訊息之語意的語意向量V10(語意向量也稱「語義向量」),但並不以此為限。Furthermore, in this embodiment, the processing unit 11 utilizes the language processing model M to perform the vector conversion process using sentence embedding to convert the first demand description data (i.e., the instant text message input by the user) into the first demand feature data D10 in vector form. More specifically, as shown in FIG3 , the first demand feature data D10 in this embodiment is a 760-dimensional semantic vector V10 (also referred to as a "semantic vector") used to describe the semantics of the instant text message, but the present invention is not limited to this.
補充說明的是,在不同的實施例中,該處理單元11產生該語意向量V10的方式,也可以是以詞嵌入技術或文章嵌入的方式來對該第一需求描述資料執行該向量轉換處理,而並不以本實施例為限。It should be noted that, in different embodiments, the processing unit 11 may generate the semantic vector V10 by performing the vector conversion process on the first requirement description data using word embedding technology or article embedding, and is not limited to this embodiment.
在該處理單元11產生該語意向量V10之後,流程進行至步驟S12。After the processing unit 11 generates the semantic vector V10, the process proceeds to step S12.
在步驟S12中,該處理單元11根據每一資訊特徵資料D12的該等資訊特徵向量V12,計算每一資訊特徵資料D12與該語意向量V10之間的關聯度。In step S12, the processing unit 11 calculates the correlation between each information feature data D12 and the semantic vector V10 based on the information feature vectors V12 of each information feature data D12.
在本實施例中,對於每一資訊特徵資料D12,該處理單元11是先根據該資訊特徵資料D12的每一資訊特徵向量V12及該語意向量V10,計算出多個分別對應於該等資訊特徵向量V12的相似度分數。接著,該處理單元11再根據該等相似度分數的平均值,計算出該資訊特徵資料D12與該語意向量V10之間的關聯度。In this embodiment, for each piece of information feature data D12, the processing unit 11 first calculates a plurality of similarity scores corresponding to each information feature vector V12 and the semantic vector V10 of the information feature data D12. Next, the processing unit 11 calculates the correlation between the information feature data D12 and the semantic vector V10 based on the average of these similarity scores.
每一相似度分數表示該相似度分數所對應之該資訊特徵向量V12與該語意向量V10之間的相似度,而且,在本實施例中,每一個相似度分數的數值例如介於0與1之間。該處理單元11在本實施例中是利用餘弦相似度(Cosine Similarity)算法來計算出該等相似度分數,然而,在不同的實施例中,該處理單元11也可以利用例如歐氏距離(Euclidean Distance)、曼哈頓距離(Manhattan Distance),或者其他能用於評估兩個向量之間的相似度的數學方法來計算該等相似度分數,而並不以本實施例為限。Each similarity score represents the similarity between the information feature vector V12 and the semantic vector V10 to which it corresponds. Furthermore, in this embodiment, the value of each similarity score is, for example, between 0 and 1. In this embodiment, the processing unit 11 calculates the similarity scores using a cosine similarity algorithm. However, in different embodiments, the processing unit 11 may also calculate the similarity scores using, for example, Euclidean distance, Manhattan distance, or other mathematical methods that can be used to evaluate the similarity between two vectors, and is not limited to this embodiment.
配合參閱圖3,以該第一資訊特徵資料D12A為例,該處理單元11會先計算出四個分別對應於該第一資訊特徵資料D12A之該四個資訊特徵向量V12的相似度分數。該四個相似度分數分別用來代表描述「澳幣」之關鍵詞的該資訊特徵向量V12與該語意向量V10之間的相似度(例如為0.43)、描述「加幣」之關鍵詞的該資訊特徵向量V12與該語意向量V10之間的相似度(例如為0.49)、描述「美元」之關鍵詞的該資訊特徵向量V12與該語意向量V10之間的相似度(例如為0.83),以及描述「貨幣政策」之關鍵詞的該資訊特徵向量V12與該語意向量V10之間的相似度(例如為0.72)。接著,則該處理單元11會再將該四個相似度分數的平均值取至小數點後第三位的結果,作為該第一資訊特徵資料D12A與該語意向量V10之間的關聯度(例如為0.617)。3 , taking the first information feature data D12A as an example, the processing unit 11 first calculates the similarity scores of the four information feature vectors V12 corresponding to the first information feature data D12A. The four similarity scores are used to represent the similarity between the information feature vector V12 and the semantic vector V10 describing the keyword "Australian dollar" (for example, 0.43), the similarity between the information feature vector V12 and the semantic vector V10 describing the keyword "Canadian dollar" (for example, 0.49), the similarity between the information feature vector V12 and the semantic vector V10 describing the keyword "US dollar" (for example, 0.83), and the similarity between the information feature vector V12 and the semantic vector V10 describing the keyword "monetary policy" (for example, 0.72). Then, the processing unit 11 takes the average of the four similarity scores to the third decimal place as the correlation degree between the first information feature data D12A and the semantic vector V10 (for example, 0.617).
在該處理單元11計算出每一資訊特徵資料D12與該語意向量V10之間的關聯度之後,流程進行至步驟S13。After the processing unit 11 calculates the correlation between each information feature data D12 and the semantic vector V10, the process proceeds to step S13.
在步驟S13中,該處理單元11從該等資訊特徵資料D12中選出與該語意向量V10之關聯度最高的該資訊特徵資料D12,並將其選出的該資訊特徵資料D12作為一最匹配資訊特徵資料。In step S13, the processing unit 11 selects the information feature data D12 having the highest correlation with the semantic vector V10 from the information feature data D12, and uses the selected information feature data D12 as the best matching information feature data.
延續前例,假設該第一資訊特徵資料D12A是所有資訊特徵資料D12中與該語意向量V10之關聯度最高的一者,則該處理單元11便會將該第一資訊特徵資料D12A選為該最匹配資訊特徵資料。Continuing with the previous example, assuming that the first information feature data D12A is the one with the highest correlation with the semantic vector V10 among all the information feature data D12, the processing unit 11 will select the first information feature data D12A as the most matching information feature data.
在該處理單元11選出該最匹配資訊特徵資料之後,流程進行至步驟S14。After the processing unit 11 selects the most matching information feature data, the process proceeds to step S14.
在步驟S14中,該處理單元11從該等主題類型資料D32中選出其中一筆與該即時文字訊息的語意匹配度最高的最匹配主題類型資料,並將該最匹配主題類型資料所對應之該門檻值參數D31的數值作為本實施例中的一關聯度門檻值。換言之,在本實施例中,該處理單元11是根據該等主題類型資料D32所指示出的該等主題類型中,與該即時文字訊息最為接近的該主題類型來決定該關聯度門檻值。In step S14, the processing unit 11 selects a most matching topic type data item from the topic type data item D32 that has the highest semantic match with the instant text message, and uses the value of the threshold parameter D31 corresponding to the most matching topic type data item as a relevance threshold in this embodiment. In other words, in this embodiment, the processing unit 11 determines the relevance threshold based on the topic type item that is most similar to the instant text message among the topic types indicated by the topic type data item D32.
更具體地說,在本實施例中,該處理單元11例如是利用該語言處理模型M,以餘弦相似度算法計算每一主題類型資料D32與該語意向量V10之間的相似度,並將與該語意向量V10之相似度最高的該主題類型資料D32作為該最匹配主題類型資料,但並不以此為限。More specifically, in this embodiment, the processing unit 11, for example, utilizes the language processing model M to calculate the similarity between each topic type data D32 and the semantic vector V10 using the cosine similarity algorithm, and uses the topic type data D32 with the highest similarity to the semantic vector V10 as the most matching topic type data, but is not limited to this.
在該處理單元11決定出該關聯度門檻值之後,流程進行至步驟S15。After the processing unit 11 determines the correlation threshold, the process proceeds to step S15.
在步驟S15中,該處理單元11判斷該最匹配資訊特徵資料與該語意向量V10之間的關聯度是否大於等於該關聯度門檻值。若該處理單元11的判斷結果為是,流程進行至步驟S16。若該處理單元11的判斷結果為否,流程進行至步驟S17。In step S15, the processing unit 11 determines whether the correlation between the best-matching information feature data and the semantic vector V10 is greater than or equal to the correlation threshold. If the processing unit 11 determines yes, the process proceeds to step S16. If the processing unit 11 determines no, the process proceeds to step S17.
在接續於步驟S15之後的步驟S16中,一旦該處理單元11判定該最匹配資訊特徵資料與該語意向量V10之間的關聯度大於等於該關聯度門檻值,表示該最匹配資訊特徵資料所對應的該候選資訊D11有相對高的機率符合使用者透過該即時文字訊息所表達的需求。在此情況下,該處理單元11將該最匹配資訊特徵資料所對應的該候選資訊D11傳送至該使用端電子裝置5。並且,該第一推薦程序結束。In step S16, following step S15, if the processing unit 11 determines that the correlation between the best-matching information feature data and the semantic vector V10 is greater than or equal to the correlation threshold, it indicates that the candidate information D11 corresponding to the best-matching information feature data has a relatively high probability of meeting the user's needs expressed in the instant text message. In this case, the processing unit 11 transmits the candidate information D11 corresponding to the best-matching information feature data to the user electronic device 5. The first recommendation process then ends.
在接續於步驟S15之後的步驟S17中,一旦該處理單元11判定該最匹配資訊特徵資料與該語意向量V10之間的關聯度未大於等於該關聯度門檻值,表示該最匹配資訊特徵資料所對應的該候選資訊D11符合使用者之需求的機率不夠高。在此情況下,該處理單元11產生一個無匹配資訊通知,並將該無匹配資訊通知傳送至該使用端電子裝置5。並且,該第一推薦程序結束。In step S17, following step S15, if the processing unit 11 determines that the correlation between the best-matching information feature data and the semantic vector V10 is not greater than or equal to the correlation threshold, it indicates that the probability that the candidate information D11 corresponding to the best-matching information feature data meets the user's needs is insufficient. In this case, the processing unit 11 generates a no-matching information notification and transmits it to the user electronic device 5. The first recommendation process then ends.
值得一提的是,由於該等門檻值參數D31的數值在本實施例中是根據該等候選資訊D11中屬於每一種主題類型之候選資訊D11的數量而被設定,且該處理單元11會根據與該即時文字訊息最接近的主題類型即時決定該關聯度門檻值,因此,本實施例的資訊推薦系統1能根據該即時文字訊息所表達出的語意,而自動調整是否要將候選資訊D11提供給使用者的判斷標準。然而,在一些實施例中,該關聯度門檻值也可以是被預設好的單一個固定數值,而無須由該處理單元11根據該即時文字訊息的語意來決定。因此,該關聯度門檻值的態樣並不以本實施例為限。It is worth mentioning that, in this embodiment, the values of the threshold parameters D31 are set based on the number of candidate information D11 belonging to each topic type in the candidate information D11, and the processing unit 11 determines the relevance threshold in real time based on the topic type closest to the instant text message. Therefore, the information recommendation system 1 of this embodiment can automatically adjust the judgment criteria for whether to provide candidate information D11 to the user based on the semantics expressed in the instant text message. However, in some embodiments, the relevance threshold can also be a single fixed value that is preset, and does not need to be determined by the processing unit 11 based on the semantics of the instant text message. Therefore, the state of the relevance threshold is not limited to this embodiment.
以上即為該資訊推薦系統1如何執行該第一推薦程序的說明。The above is an explanation of how the information recommendation system 1 executes the first recommendation process.
同時參閱圖1及圖4,以下說明該資訊推薦系統1如何執行該資訊推薦方法中的該第二推薦程序。Referring to FIG. 1 and FIG. 4 , the following describes how the information recommendation system 1 performs the second recommendation procedure in the information recommendation method.
首先,在步驟S21中,該處理單元11在獲得另一對應於該使用端電子裝置5的需求描述資料之後,利用該語言處理模型M對該需求描述資料執行該向量轉換處理,以產生另一以向量形式呈現且表示該需求描述資料之特徵的需求特徵資料。First, in step S21, after obtaining another requirement description data corresponding to the user-end electronic device 5, the processing unit 11 uses the language processing model M to perform the vector conversion processing on the requirement description data to generate another requirement feature data presented in vector form and representing the characteristics of the requirement description data.
為了便於描述,在此將該另一需求描述資料作為一第二需求描述資料,並且,在此還將該另一需求特徵資料作為圖5所示的一第二需求特徵資料D20。For the convenience of description, the other requirement description data is referred to as a second requirement description data, and the other requirement feature data is also referred to as a second requirement feature data D20 shown in FIG. 5 .
在本實施例中,與該第一需求描述資料不同的是,該第二需求描述資料是該客戶資料庫D2的該等客戶屬性資料D21中與該使用端電子裝置5對應的其中一筆目標客戶屬性資料。具體來說,該目標客戶屬性資料所對應的該客戶是被作為一目標推薦對象,且該使用端電子裝置5例如是由該目標推薦對象所持有。進一步地,在本實施例中,該處理單元11例如是在當前時間到達一被預先設定好的自動推薦時間時自動地對該儲存單元12進行讀取,以從該客戶資料庫D2獲得該第二需求描述資料,但並不以此為限。In this embodiment, unlike the first demand description data, the second demand description data is a target customer attribute data entry corresponding to the user electronic device 5 in the customer attribute data D21 of the customer database D2. Specifically, the customer corresponding to the target customer attribute data is a target recommendation object, and the user electronic device 5 is, for example, owned by the target recommendation object. Furthermore, in this embodiment, the processing unit 11 automatically reads the storage unit 12 to obtain the second demand description data from the customer database D2, for example, when the current time reaches a pre-set automatic recommendation time, but the present invention is not limited to this.
進一步地,配合參閱圖5,與該第一需求特徵資料D10不同的是,該第二需求特徵資料D20包含多個用於描述客戶偏好的偏好特徵向量V20,且每一偏好特徵向量V20的維度為七百六十維,但並不以此為限。具體而言,在本實施例中,該等偏好特徵向量V20是用於描述該第二需求描述資料的特徵,亦即描述該目標推薦對象近期曾瀏覽過之金融商品的類型、曾瀏覽過之線上文章的主題、曾點擊過之線上廣告的主題,及/或當前所持有之資產的類型。Furthermore, referring to FIG. 5 , unlike the first demand profile data D10, the second demand profile data D20 includes multiple preference feature vectors V20 used to describe customer preferences. Each preference feature vector V20 has a dimension of 760, but is not limited to this. Specifically, in this embodiment, these preference feature vectors V20 are used to describe the characteristics of the second demand profile data, namely, the types of financial products recently viewed by the target recommendation recipient, the topics of online articles viewed, the topics of online ads clicked, and/or the types of assets currently held.
舉例來說,如圖5所示,該等偏好特徵向量V20例如分別為一第一偏好特徵向量V20A、一第二偏好特徵向量V20B、一第三偏好特徵向量V20C、一第四偏好特徵向量V20D及一第五偏好特徵向量V20E。其中,該第一偏好特徵向量V20A例如是用來描述該目標推薦對象近期瀏覽過「股票型指數基金」的金融商品,該第二偏好特徵向量V20B例如是用來描述該目標推薦對象近期瀏覽過以「升息」為主題的線上文章,該第三偏好特徵向量V20C例如是用來描述該目標推薦對象近期點擊過「信貸」的線上廣告,該第四偏好特徵向量V20D例如是用來描述該目標推薦對象當前持有「美元」的外幣資產,該第五偏好特徵向量V20E例如是用來描述該目標推薦對象當前持有「半導體」類型的投資型金融商品,但並不以此為限。For example, as shown in FIG5 , the preference eigenvectors V20 are respectively a first preference eigenvector V20A, a second preference eigenvector V20B, a third preference eigenvector V20C, a fourth preference eigenvector V20D, and a fifth preference eigenvector V20E. Among them, the first preference feature vector V20A is used to describe that the target recommendation object recently browsed the financial product "stock index fund", the second preference feature vector V20B is used to describe that the target recommendation object recently browsed an online article with the theme of "interest rate hike", the third preference feature vector V20C is used to describe that the target recommendation object recently clicked on the online advertisement of "credit", the fourth preference feature vector V20D is used to describe that the target recommendation object currently holds "US dollar" foreign currency assets, and the fifth preference feature vector V20E is used to describe that the target recommendation object currently holds "semiconductor" type investment financial products, but it is not limited to this.
在該處理單元11產生該第二需求特徵資料D20之後,流程進行至步驟S22。After the processing unit 11 generates the second demand characteristic data D20, the process proceeds to step S22.
在步驟S22中,該處理單元11根據每一資訊特徵資料D12的該等資訊特徵向量V12,計算每一資訊特徵資料D12與該第二需求特徵資料D20之間的關聯度。In step S22, the processing unit 11 calculates the correlation between each information feature data D12 and the second demand feature data D20 according to the information feature vectors V12 of each information feature data D12.
在本實施例中,對於每一資訊特徵資料D12,該處理單元11是先根據該資訊特徵資料D12的該等資訊特徵向量V12及該第二需求特徵資料D20的該等偏好特徵向量V20,以利用餘弦相似度算法(但不限於此)計算出多個各自對應於該等資訊特徵向量V12之其中一者及該等偏好特徵向量V20之其中一者的相似度分數。接著,該處理單元11再根據該等相似度分數的平均值,計算出該資訊特徵資料D12與該第二需求特徵資料D20之間的關聯度。In this embodiment, for each piece of information feature data D12, the processing unit 11 first calculates a plurality of similarity scores corresponding to one of the information feature vectors V12 and one of the preference feature vectors V20 based on the information feature data D12 and the preference feature vectors V20 of the second demand feature data D20 using a cosine similarity algorithm (but not limited to this algorithm). The processing unit 11 then calculates the correlation between the information feature data D12 and the second demand feature data D20 based on the average of these similarity scores.
每一相似度分數表示該相似度分數所對應之該資訊特徵向量V12與該相似度分數所對應之該第二需求特徵資料D20之間的相似度,且每一個相似度分數的數值例如介於0與1之間,但並不以此為限。Each similarity score represents the similarity between the information feature vector V12 corresponding to the similarity score and the second demand feature data D20 corresponding to the similarity score, and the value of each similarity score is, for example, between 0 and 1, but is not limited thereto.
參閱圖5,以該第二資訊特徵資料D12B中分別用來描述「手機」、「半導體」及「貿易戰」的該三個資訊特徵向量V12為例,對於用來描述「手機」的該資訊特徵向量V12,該處理單元11會計算出其與該第一偏好特徵向量V20A至該第五偏好特徵向量V20E每一者之間的五個相似度分數。因此,該五個相似度分數除了皆對應於用來描述「手機」的該資訊特徵向量V12之外,還分別對應於該第二需求特徵資料D20中的該第一偏好特徵向量V20A至該第五偏好特徵向量V20E。Referring to FIG. 5 , taking the three information feature vectors V12 used to describe "mobile phone," "semiconductor," and "trade war" in the second information feature data D12B as an example, the processing unit 11 calculates five similarity scores between the information feature vector V12 describing "mobile phone" and each of the first through fifth preference feature vectors V20A through V20E. Therefore, these five similarity scores not only correspond to the information feature vector V12 describing "mobile phone," but also to the first through fifth preference feature vectors V20A through V20E in the second demand feature data D20.
同理,對於用來描述「半導體」的該資訊特徵向量V12,該處理單元11會計算出其與該五個偏好特徵向量V20(即該第一偏好特徵向量V20A至該第五偏好特徵向量V20E)每一者之間的另五個相似度分數。因此,該另五個相似度分數除了皆對應於用來描述「半導體」的該資訊特徵向量V12之外,還分別對應於該第一偏好特徵向量V20A至該第五偏好特徵向量V20E。Similarly, for the information feature vector V12 used to describe "semiconductor," the processing unit 11 calculates five additional similarity scores between it and each of the five preference feature vectors V20 (i.e., the first through fifth preference feature vectors V20A through V20E). Therefore, in addition to corresponding to the information feature vector V12 used to describe "semiconductor," the five additional similarity scores also correspond to the first through fifth preference feature vectors V20A through V20E.
同理,對於用來描述「貿易戰」的該資訊特徵向量V12,該處理單元11會計算出其與該五個偏好特徵向量V20之間的再五個相似度分數,該再五個相似度分數皆對應於用來描述「貿易戰」的該資訊特徵向量V12,也分別對應於該第一偏好特徵向量V20A至該第五偏好特徵向量V20E。Similarly, for the information feature vector V12 used to describe the "trade war", the processing unit 11 will calculate five more similarity scores between it and the five preference feature vectors V20. The five more similarity scores all correspond to the information feature vector V12 used to describe the "trade war" and also correspond to the first preference feature vector V20A to the fifth preference feature vector V20E, respectively.
因此,在此例中,對於該第二資訊特徵資料D12B,該處理單元11會根據該第二資訊特徵資料D12B的該三個資訊特徵向量V12,以及該第二需求特徵資料D20的該第一偏好特徵向量V20A至該第五偏好特徵向量V20E,而總共計算出十五個相似度分數。然後,類似於該第一推薦程序,該處理單元11會再將該十五個相似度分數的平均值取至小數點後第三位的結果,作為該第二資訊特徵資料D12B與該第二需求特徵資料D20之間的關聯度。Therefore, in this example, for the second information feature data D12B, the processing unit 11 calculates a total of fifteen similarity scores based on the three information feature vectors V12 of the second information feature data D12B and the first through fifth preference feature vectors V20A to V20E of the second demand feature data D20. Then, similar to the first recommendation process, the processing unit 11 averages these fifteen similarity scores to the third decimal place, which serves as the correlation between the second information feature data D12B and the second demand feature data D20.
在該處理單元11計算出每一資訊特徵資料D12與該第二需求特徵資料D20之間的關聯度之後,流程進行至步驟S23。After the processing unit 11 calculates the correlation between each information feature data D12 and the second demand feature data D20, the process proceeds to step S23.
在步驟S23中,該處理單元11從該等資訊特徵資料D12中選出與該第二需求特徵資料D20之關聯度最高的該資訊特徵資料D12,並將其選出的該資訊特徵資料D12作為另一最匹配資訊特徵資料。In step S23, the processing unit 11 selects the information feature data D12 having the highest correlation with the second demand feature data D20 from the information feature data D12, and uses the selected information feature data D12 as another best matching information feature data.
延續前例,假設該第二資訊特徵資料D12B是所有資訊特徵資料D12中與該第二需求特徵資料D20之關聯度最高的一者,則該處理單元11便會將該第二資訊特徵資料D12B選為該另一最匹配資訊特徵資料。Continuing with the previous example, assuming that the second information feature data D12B is the one with the highest correlation with the second demand feature data D20 among all the information feature data D12, the processing unit 11 will select the second information feature data D12B as the other best matching information feature data.
在該處理單元11選出該另一最匹配資訊特徵資料之後,流程進行至步驟S24。After the processing unit 11 selects the other best matching information feature data, the process proceeds to step S24.
在步驟S24中,該處理單元11判斷該另一最匹配資訊特徵資料與該第二資訊特徵資料D12B之間的關聯度是否大於等於另一關聯度門檻值。其中,該另一關聯度門檻值可例如是該等門檻值參數D31中,對應於該數位金融服務類型之該主題類型資料D32所對應的該門檻值參數D31的數值,但並不以此為限。若該處理單元11的判斷結果為是,流程進行至步驟S25。若該處理單元11的判斷結果為否,流程進行至步驟S26。In step S24, the processing unit 11 determines whether the correlation between the other best-matching information feature data and the second information feature data D12B is greater than or equal to another correlation threshold. The other correlation threshold may be, for example, the value of the threshold parameter D31 corresponding to the subject category data D32 of the digital financial service type, but is not limited thereto. If the processing unit 11 determines yes, the process proceeds to step S25. If the processing unit 11 determines no, the process proceeds to step S26.
在接續於步驟S24之後的步驟S25中,一旦該處理單元11判定該另一最匹配資訊特徵資料與該第二資訊特徵資料D12B之間的關聯度大於等於該另一關聯度門檻值,表示該另一最匹配資訊特徵資料所對應的該候選資訊D11有相對高的機率符合該目標推薦對象的偏好。在此情況下,該處理單元11將該另一最匹配資訊特徵資料所對應的該候選資訊D11傳送至該使用端電子裝置5以供該目標推薦對象瀏覽。並且,該第二推薦程序結束。In step S25, following step S24, if the processing unit 11 determines that the correlation between the other best-matching information feature data and the second information feature data D12B is greater than or equal to the other correlation threshold, it indicates that the candidate information D11 corresponding to the other best-matching information feature data has a relatively high probability of meeting the target recommendation subject's preferences. In this case, the processing unit 11 transmits the candidate information D11 corresponding to the other best-matching information feature data to the user electronic device 5 for viewing by the target recommendation subject. The second recommendation process then ends.
在接續於步驟S24之後的步驟S26中,一旦該處理單元11判定該另一最匹配資訊特徵資料與該第二資訊特徵資料D12B之間的關聯度未大於等於該另一關聯度門檻值,表示該另一最匹配資訊特徵資料所對應的該候選資訊D11符合該目標推薦對象之偏好的機率不夠高。在此情況下,該處理單元11不將任一候選資訊D11傳送至該使用端電子裝置5,且該第二推薦程序結束。In step S26, following step S24, if the processing unit 11 determines that the correlation between the other best-matching information feature data and the second information feature data D12B is not greater than or equal to the other correlation threshold, it indicates that the probability that the candidate information D11 corresponding to the other best-matching information feature data meets the preferences of the target recommendation object is not high enough. In this case, the processing unit 11 does not transmit any candidate information D11 to the user electronic device 5, and the second recommendation process ends.
補充說明的是,在實際的實施態樣中,該處理單元11在步驟S21中實際上可以是從該客戶資料庫D2中一次性地獲得多筆目標客戶屬性資料以分別作為多筆第二需求特徵資料D20,並根據每一第二需求特徵資料D20執行步驟S22至步驟S26。It should be noted that, in an actual implementation, the processing unit 11 may actually obtain multiple target customer attribute data from the customer database D2 at one time in step S21 as multiple second demand feature data D20, and execute steps S22 to S26 based on each second demand feature data D20.
此外,在類似的實施例中,該資訊推薦系統1也可以是根據使用者(例如該服務機構的職員)的近端或遠端操作而開始執行該第二推薦程序。In addition, in a similar embodiment, the information recommendation system 1 can also start executing the second recommendation process based on the local or remote operation of the user (such as the staff of the service agency).
以上即為該資訊推薦系統1如何執行該第二推薦程序的說明。The above is an explanation of how the information recommendation system 1 executes the second recommendation process.
特別說明的是,本實施例的步驟S11至步驟S17、步驟S21至步驟S26及圖2、圖4的流程圖僅是用於示例說明本發明的其中一種可實施方式。應當理解,即便將步驟S11至步驟S17、步驟S21至步驟S26進行合併、拆分或順序調整,若合併、拆分或順序調整之後的流程與本實施例相比是以類似的方式,執行類似的功能,而得到類似的結果,便仍屬於本發明的可實施態樣,因此,本實施例的步驟S11至步驟S17、步驟S21至步驟S26及圖2、圖4的流程圖並非用於限制本發明的可實施範圍。It should be noted that steps S11 to S17, steps S21 to S26 and the flowcharts of FIG. 2 and FIG. 4 of this embodiment are only used to illustrate one possible implementation of the present invention. It should be understood that even if steps S11 to S17 and steps S21 to S26 are merged, split, or reordered, if the process after the merger, split, or reordering performs similar functions in a similar manner and obtains similar results compared to the present embodiment, it still falls within the applicable scope of the present invention. Therefore, steps S11 to S17 and steps S21 to S26 of the present embodiment and the flowcharts of Figures 2 and 4 are not intended to limit the applicable scope of the present invention.
綜上所述,藉由執行該資訊推薦方法,該資訊推薦系統1能在獲得該需求描述資料之後,先將其轉換為向量形式的該需求特徵資料,再將該需求特徵資料與該等候選資訊D11所對應的該等資訊特徵資料D12進行關聯度計算,以選出關聯度最高的該最匹配資訊特徵資料,並將該最匹配資訊特徵資料所對應的該候選資訊D11傳送至該使用端電子裝置5,以實現資訊推薦。如此一來,即便該需求描述資料中僅存在與候選資訊D11的內容相同或相近的詞語,而不存在與候選資訊D11的內容完全相符的詞語,該資訊推薦系統1亦能利用該語言處理模型M選出與需求描述資料之語意關聯度最高的候選資訊D11來進行推薦,因此,相較於針對每一候選資訊D11建立對應之詞袋模型(Bag-of-words model)的作法,本實施例不會因為該需求描述資料中缺少完全相符的關鍵詞而錯過實際上適合被推薦的候選資訊D11,有助於改善資訊推薦的準確度,故確實能達成本發明之目的。In summary, by executing the information recommendation method, the information recommendation system 1 can, after obtaining the demand description data, first convert it into the demand feature data in vector form, and then calculate the correlation between the demand feature data and the information feature data D12 corresponding to the candidate information D11 to select the best matching information feature data with the highest correlation, and transmit the candidate information D11 corresponding to the best matching information feature data to the user-end electronic device 5 to achieve information recommendation. In this way, even if the demand description data only contains words that are identical or similar to the content of the candidate information D11, but no words that completely match the content of the candidate information D11, the information recommendation system 1 can use the language processing model M to select the candidate information D11 with the highest semantic relevance to the demand description data for recommendation. Therefore, compared with the method of establishing a corresponding bag-of-words model for each candidate information D11, this embodiment will not miss the candidate information D11 that is actually suitable for recommendation due to the lack of completely matching keywords in the demand description data, which helps to improve the accuracy of information recommendation and can indeed achieve the purpose of this invention.
惟以上所述者,僅為本發明之實施例而已,當不能以此限定本發明實施之範圍,凡是依本發明申請專利範圍及專利說明書內容所作之簡單的等效變化與修飾,皆仍屬本發明專利涵蓋之範圍內。However, the above description is merely an example of the present invention and should not be used to limit the scope of the present invention. All simple equivalent changes and modifications made within the scope of the patent application and the contents of the patent specification of the present invention are still within the scope of the present patent.
1:資訊推薦系統 11:處理單元 12:儲存單元 M:語言處理模型 D1:資訊資料庫 D11:候選資訊 D11A:第一候選資訊 D11B:第二候選資訊 D12:資訊特徵資料 D12A:第一資訊特徵資料 D12B:第二資訊特徵資料 V12:資訊特徵向量 D2:客戶資料庫 D21:客戶屬性資料 D211:關注特徵部分 D212:資產特徵部分 D3:門檻值資料 D31:門檻值參數 D32:主題類型資料 5:使用端電子裝置 D10:第一需求特徵資料 D20:第二需求特徵資料 V10:語意向量 V20:偏好特徵向量 V20A:第一偏好特徵向量 V20B:第二偏好特徵向量 V20C:第三偏好特徵向量 V20D:第四偏好特徵向量 V20E:第五偏好特徵向量 S11~S17:步驟 S21~S26:步驟 1: Information Recommendation System 11: Processing Unit 12: Storage Unit M: Language Processing Model D1: Information Database D11: Candidate Information D11A: First Candidate Information D11B: Second Candidate Information D12: Information Feature Data D12A: First Information Feature Data D12B: Second Information Feature Data V12: Information Feature Vector D2: Customer Database D21: Customer Attribute Data D211: Attention Feature Data D212: Asset Feature Data D3: Threshold Data D31: Threshold Parameter D32: Topic Type Data 5: User Electronic Device D10: First Requirement Feature Data D20: Second Requirement Feature Data V10: Semantic Vector V20: Preference eigenvector V20A: First preference eigenvector V20B: Second preference eigenvector V20C: Third preference eigenvector V20D: Fourth preference eigenvector V20E: Fifth preference eigenvector S11-S17: Steps S21-S26: Steps
本發明之其他的特徵及功效,將於參照圖式的實施方式中清楚地呈現。Other features and functions of the present invention will be clearly presented in the embodiments with reference to the drawings.
圖1是一方塊示意圖,示例性地表示本發明資訊推薦系統的一實施例。FIG1 is a block diagram illustrating an exemplary embodiment of the information recommendation system of the present invention.
圖2是一流程圖,用於示例性地說明該實施例如何實施一資訊推薦方法中的一第一推薦流程。FIG2 is a flow chart for exemplarily illustrating how the embodiment implements a first recommendation process in an information recommendation method.
圖3是一示意圖,示例性地表示該實施例在該第一推薦流程的執行過程中所利用到的一筆資訊特徵資料及一筆第一需求特徵資料。FIG3 is a schematic diagram exemplarily showing a piece of information feature data and a piece of first demand feature data used in the execution of the first recommendation process of the embodiment.
圖4是一流程圖,用於示例性地說明該實施例如何實施一資訊推薦方法中的一第二推薦流程。FIG4 is a flow chart for exemplarily illustrating how this embodiment implements a second recommendation process in an information recommendation method.
圖5是一示意圖,示例性地表示該實施例在該第二推薦流程的執行過程中所利用到的一筆資訊特徵資料及一筆第二需求特徵資料。FIG5 is a schematic diagram exemplarily showing a piece of information feature data and a piece of second demand feature data used in the execution of the second recommendation process of the embodiment.
1:資訊推薦系統 1: Information recommendation system
11:處理單元 11: Processing unit
12:儲存單元 12: Storage unit
M:語言處理模型 M: Language Processing Model
D1:資訊資料庫 D1: Information Database
D11:候選資訊 D11: Candidate Information
D11A:第一候選資訊 D11A: First Candidate Information
D11B:第二候選資訊 D11B: Second candidate information
D12:資訊特徵資料 D12: Information Feature Data
D12A:第一資訊特徵資料 D12A: First Information Feature Data
D12B:第二資訊特徵資料 D12B: Second information feature data
D2:客戶資料庫 D2: Customer Database
D21:客戶屬性資料 D21: Customer attribute data
D211:關注特徵部分 D211: Focus on the characteristic part
D212:資產特徵部分 D212: Asset Characteristics
D3:門檻值資料 D3: Threshold Data
D31:門檻值參數 D31: Threshold value parameter
D32:主題類型資料 D32: Subject type data
5:使用端電子裝置 5: Using end electronic devices
Claims (2)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW113137581A TWI901377B (en) | 2024-10-01 | 2024-10-01 | Information recommendation system and method |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW113137581A TWI901377B (en) | 2024-10-01 | 2024-10-01 | Information recommendation system and method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| TWI901377B true TWI901377B (en) | 2025-10-11 |
Family
ID=98263880
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW113137581A TWI901377B (en) | 2024-10-01 | 2024-10-01 | Information recommendation system and method |
Country Status (1)
| Country | Link |
|---|---|
| TW (1) | TWI901377B (en) |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TWM579762U (en) * | 2019-03-06 | 2019-06-21 | 蔡昀錚 | Fabric information matching system |
| US10592958B2 (en) * | 2005-01-07 | 2020-03-17 | Sony Corporation | Information processing apparatus, information processing method, and program |
| CN109840321B (en) * | 2017-11-29 | 2022-02-01 | 腾讯科技(深圳)有限公司 | Text recommendation method and device and electronic equipment |
| TWI795707B (en) * | 2021-01-12 | 2023-03-11 | 威聯通科技股份有限公司 | Content recommendation system and content recommendation method |
| TWM641360U (en) * | 2022-12-23 | 2023-05-21 | 伊雲谷數位科技股份有限公司 | Personalized product recommendation system |
| TWM665193U (en) * | 2024-10-01 | 2025-01-01 | 中國信託商業銀行股份有限公司 | Information recommendation system |
-
2024
- 2024-10-01 TW TW113137581A patent/TWI901377B/en active
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10592958B2 (en) * | 2005-01-07 | 2020-03-17 | Sony Corporation | Information processing apparatus, information processing method, and program |
| CN109840321B (en) * | 2017-11-29 | 2022-02-01 | 腾讯科技(深圳)有限公司 | Text recommendation method and device and electronic equipment |
| TWM579762U (en) * | 2019-03-06 | 2019-06-21 | 蔡昀錚 | Fabric information matching system |
| TWI795707B (en) * | 2021-01-12 | 2023-03-11 | 威聯通科技股份有限公司 | Content recommendation system and content recommendation method |
| TWM641360U (en) * | 2022-12-23 | 2023-05-21 | 伊雲谷數位科技股份有限公司 | Personalized product recommendation system |
| TWM665193U (en) * | 2024-10-01 | 2025-01-01 | 中國信託商業銀行股份有限公司 | Information recommendation system |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Xiong et al. | Deep hybrid collaborative filtering for web service recommendation | |
| US11443305B2 (en) | Context augmentation for processing data from multiple sources | |
| TWI409712B (en) | Consumer-recommended social networking method and device | |
| US11741139B2 (en) | Systems and methods for determining a response to a user query | |
| CN104639420B (en) | The information processing method and system of instant messaging | |
| JP2020177670A (en) | Information recommendation method, information recommendation device, equipment and medium | |
| CN107729937A (en) | For determining the method and device of user interest label | |
| CN107247786A (en) | Method, device and server for determining similar users | |
| CN110084658A (en) | The matched method and apparatus of article | |
| You et al. | Customer knowledge discovery from online reviews | |
| CN113592315B (en) | Method and device for processing dispute sheets | |
| CN112765478A (en) | Method, apparatus, device, medium, and program product for recommending content | |
| CN113792134B (en) | User service method and system based on digital twin technology | |
| TWM665193U (en) | Information recommendation system | |
| US20150095194A1 (en) | Systems and methods for search term prioritization | |
| CN110555747A (en) | Method and device for determining target users | |
| CN114996579B (en) | Information push method, device, electronic device and computer readable medium | |
| TWI901377B (en) | Information recommendation system and method | |
| CN116737929B (en) | User classification methods, devices, computer equipment, storage media, and program products | |
| US20240296402A1 (en) | Method, system, and software for tracing paths in graph | |
| CN116756315A (en) | Dialogue intention recognition method, device, electronic device and storage medium | |
| CN116308641A (en) | Product recommendation methods, training methods, devices, electronics and media | |
| Seng et al. | Big data and FinTech | |
| CN113325959A (en) | Input corpus recommendation method and device | |
| CN114912008A (en) | User portrait construction method, system, computer equipment and storage medium |