[go: up one dir, main page]

TWI872368B - A training method of tour group registration prediction model and a system of tour group registration prediction - Google Patents

A training method of tour group registration prediction model and a system of tour group registration prediction Download PDF

Info

Publication number
TWI872368B
TWI872368B TW111134196A TW111134196A TWI872368B TW I872368 B TWI872368 B TW I872368B TW 111134196 A TW111134196 A TW 111134196A TW 111134196 A TW111134196 A TW 111134196A TW I872368 B TWI872368 B TW I872368B
Authority
TW
Taiwan
Prior art keywords
registration
tour group
data
tour
prediction model
Prior art date
Application number
TW111134196A
Other languages
Chinese (zh)
Other versions
TW202411931A (en
Inventor
張志勇
武士戎
程昱翔
游國忠
廖文華
Original Assignee
淡江大學學校財團法人淡江大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 淡江大學學校財團法人淡江大學 filed Critical 淡江大學學校財團法人淡江大學
Priority to TW111134196A priority Critical patent/TWI872368B/en
Publication of TW202411931A publication Critical patent/TW202411931A/en
Application granted granted Critical
Publication of TWI872368B publication Critical patent/TWI872368B/en

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

This invention is clustering to tour group data which are collected to judge registration popularity, according to registration number, and clustering tour group data to groups in which all tours in one group have similar popularities. Because all tour group data have a lot of advertisement features, such as price, date, travel country, scenic spot, travel period, accommodation hotel and other information. A tour group registration prediction model which is trained by tour group advertisement of tour group cluster is able to judge registration popularity of tour group advertisement. Then, a CPU trains to enrolled number data of tour group cluster and obtains a tour group registration prediction number model which is able to predict registration number.

Description

旅行團報名預測模型的訓練方法及旅行團報名預測系統Training method of tour group registration prediction model and tour group registration prediction system

本發明是應用於旅遊業之報名相關業務,尤指一種旅行團報名預測模型的訓練方法及旅行團報名預測系統。The present invention is applied to the registration-related business of the tourism industry, and particularly refers to a training method for a tour group registration prediction model and a tour group registration prediction system.

各家旅行社的企劃部或產品部需要設計吸引旅客的旅遊行程,並刊登廣告或安排人員去招攬顧客。當該旅遊行程沒有顧客參加或是未達他們所預設的出團人數時,旅行社就會變更該旅遊行程的價格、出團日期…等,甚至取消該旅行行程,導致旅行社及旅客的損失。這些損失,包括旅行社在國外的飯店、餐飲、門票、人力資源等損失,並造成客戶在行程、公司工作等安排的混亂。The planning department or product department of each travel agency needs to design travel itineraries that attract tourists, and publish advertisements or arrange personnel to attract customers. When there are no customers participating in the travel itinerary or the number of people in the group does not reach their preset number, the travel agency will change the price, departure date, etc. of the travel itinerary, or even cancel the travel itinerary, resulting in losses for the travel agency and tourists. These losses include the loss of hotels, meals, tickets, human resources, etc. in foreign countries of the travel agency, and cause confusion in the arrangements of customers in their itineraries and company work.

為了避免上述之損失,須事前對旅遊成團率及報名人數進行預測,然而這一直以來是一項困難的課題,很難透過工作人員根據經驗來進行預判,也因此需要想辦法利用電腦的輔助來提高旅遊成團的預測準確度。In order to avoid the above losses, it is necessary to predict the tour group formation rate and the number of registrations in advance. However, this has always been a difficult issue. It is difficult for staff to make predictions based on experience. Therefore, it is necessary to find a way to use computers to assist in improving the accuracy of tour group predictions.

有鑑於先前技術所述不足之處,本發明人提出以下解決方式:In view of the shortcomings of the prior art, the inventors propose the following solutions:

一種旅行團報名預測系統,包括:一報名熱門度預測模型及一處理器,該處理器根據一旅行團廣告資料及該報名熱門度預測模型,判斷該旅行團廣告資料的報名熱門度,得到一熱門度判斷結果。A tour group registration prediction system includes: a registration popularity prediction model and a processor. The processor determines the registration popularity of the tour group advertisement data according to the tour group advertisement data and the registration popularity prediction model to obtain a popularity judgment result.

一種旅行團報名預測模型的訓練方法,包括:一處理器對多個旅行團資料以報名人數資料為基礎進行分群,得到多個旅行團分群;其中該些旅行團資料分別包括一報名人數資料及一旅行團廣告資料;以及該處理器根據該些旅行團分群及該旅行團廣告資料對一第一預測模型進行訓練,得到訓一報名熱門度預測模型。A method for training a tour group registration prediction model includes: a processor groups a plurality of tour group data based on registration number data to obtain a plurality of tour group groups; wherein the tour group data respectively include registration number data and tour group advertising data; and the processor trains a first prediction model based on the tour group groups and the tour group advertising data to obtain a registration popularity prediction model.

本發明主要根據收集來的旅行團資料進行分群,分群方式是根據報名人數來判斷旅行團的報名熱門程度,並將報名熱門程度相近的旅行團分類成同一旅行團分群,由於各旅行團廣告資料中具有各種旅行團的廣告特徵,如價格、日期、旅行國家、景點、遊玩期間、住宿旅館等資訊,因此針對各旅行團分群內的各旅行團廣告資料進行訓練後,所得到的報名熱門度預測模型可以用來判斷旅行團廣告的報名熱門程度。以供旅行社得以事先評估旅行行程的規劃是否符合客戶期待。The invention mainly performs grouping based on the collected tour group data. The grouping method is to judge the registration popularity of the tour group based on the number of registrations, and classify tour groups with similar registration popularity into the same tour group group. Since each tour group advertising data contains advertising features of various tour groups, such as price, date, travel country, scenic spot, travel period, accommodation hotel and other information, after training the tour group advertising data in each tour group group, the registration popularity prediction model obtained can be used to judge the registration popularity of tour group advertisements. This allows travel agencies to evaluate in advance whether the travel itinerary planning meets customer expectations.

除此之外,更進一步還可以對各旅行團分群內的報名人數資料進行訓練,所得到的報名人數預測模型可以用來預測報名人數,例如根據第n天的報名人數可預測第n+1天的報名人數,甚至可以預測報名截止日期前旅行團是否能夠成團,以利旅行社提前進行相對應處理方式,進而降低其損失。In addition, the number of registrations in each tour group group can be further trained, and the resulting registration number prediction model can be used to predict the number of registrations. For example, the number of registrations on the nth day can be used to predict the number of registrations on the n+1th day, and it can even be predicted whether the tour group can be formed before the registration deadline, so that the travel agency can take corresponding measures in advance, thereby reducing its losses.

請參閱圖1所示,依據一實施例,本案旅行團報名預測模型的訓練方法包括:Please refer to FIG. 1 , according to one embodiment, the training method of the tour group registration prediction model includes:

步驟S01:處理器10對多個旅行團資料以報名人數資料為基礎進行分群,得到多個旅行團分群;該些旅行團資料分別包括一報名人數資料、一報名期間、及一旅行團廣告資料。其中,當旅行團廣告刊登後,無論最後是否成團,都可作為該旅行團資料。Step S01: The processor 10 groups a plurality of tour group data based on the number of registered people data to obtain a plurality of tour group groups; the tour group data respectively include a number of registered people data, a registration period, and a tour group advertisement data. Among them, after the tour group advertisement is published, whether or not the tour group is finally formed, it can be used as the tour group data.

其中,報名人數資料包括每日報名人數、每日報名人數累計、報名人數成長數、報名人數成長率等等,報名人數成長數可以是相鄰二天的報名人數成長數、也可以是相鄰一周的報名人數成長數,同樣的,報名人數成長率可以是相鄰二天的報名人數成長率,也可以是相鄰一周的報名人數成長率。在一些實施例中,該些旅行團資料以報名人數資料為基礎進行分群之目的,是為了將該些旅行團資料以報名熱門程度相近者進行分群,因此利用報名人數資料可做為判斷旅行團資料熱門程度的依據,所以舉凡可作為報名熱門程度高低之判斷基礎者,均為本說明書所指之報名人數資料。Among them, the registration number data includes the number of daily registrations, the cumulative number of daily registrations, the number of registration growth, the number of registration growth rate, etc. The number of registration growth can be the number of registration growth in the next two days, or the number of registration growth in the next week. Similarly, the number of registration growth rate can be the number of registration growth rate in the next two days, or the number of registration growth rate in the next week. In some embodiments, the purpose of grouping the tour group data based on the registration number data is to group the tour group data with similar registration popularity. Therefore, the registration number data can be used as a basis for judging the popularity of the tour group data. Therefore, all those that can be used as a basis for judging the popularity of registration are the registration number data referred to in this manual.

該旅行團廣告資料包括複數廣告特徵,廣告特徵則旅行團廣告內所刊載之特徵資訊,且特徵資訊是可影響消費者決定報名參加旅行團與否之資訊,因此廣告特徵包括:價格、出團日期、旅行社知名度、行程、天數、天氣、交通,其中行程又包括:觀光行程、消費行程、住宿等,舉凡可供影響消費者報名參加旅行團意願之資訊,均為本說明書所指之廣告特徵。舉例來說,該廣告特徵是價格時,該價格可根據費用區間訂出低價格(0~20000)、中價格(20001~40000)、高價格(40001~60000);該廣告特徵是出團日期時,該出團日期可細分為旺季、暑假、寒假、過年;該廣告特徵是觀光行程時,該觀光行程可細分為歐洲、日本、澳洲;該廣告特徵是天氣時,該天氣可細分為雨季、颱風;該廣告特徵是旅遊天數時,該旅遊天數可細分為短天期(1~3天)、中天期(4~5天)、長天期(6~9天);該廣告特徵是消費行程時,該消費行程可細分為購物、自費;該廣告特徵是交通時,該交通可細分為轉機與否、早晚班機、廉價航空;該廣告特徵是交通時,該住宿可細分為市中心、五星飯店等。上述各種細分內容僅為參考,實際實施時不以為限,亦可根據使用者需求進行調整或重新設定、變動。上述旅行團資料以數學式來表示則是:D={ , ,…, },其中 表示某一筆的旅行團資料,每一筆的旅行團資料都包括報名人數資料及旅行團廣告資料,故 ,其中 為旅行團廣告資料、 為報名起始日期、 為報名截止日期、 為報名人數資料、 表示旅行團是否成團。另外,假設每個團報名的資料 第j天有 個家庭群報名,資料是: =( 。若 第j天有 個人報名,則 = The tour group advertising materials include multiple advertising features. Advertising features are the characteristic information published in the tour group advertisement, and the characteristic information is the information that can affect consumers' decision to sign up for the tour group. Therefore, advertising features include: price, tour date, travel agency reputation, itinerary, number of days, weather, and transportation. The itinerary includes: sightseeing itinerary, consumption itinerary, accommodation, etc. All information that can influence consumers' willingness to sign up for the tour group is the advertising features referred to in this manual. For example, if the ad feature is price, the price can be set as low price (0~20000), medium price (20001~40000), and high price (40001~60000) according to the cost range; if the ad feature is tour date, the tour date can be divided into peak season, summer vacation, winter vacation, and Chinese New Year; if the ad feature is tourist itinerary, the tourist itinerary can be divided into Europe, Japan, and Australia; if the ad feature is weather, the Weather can be subdivided into rainy season and typhoon; when the advertisement feature is the number of travel days, the travel days can be subdivided into short-term (1-3 days), medium-term (4-5 days), and long-term (6-9 days); when the advertisement feature is a consumption itinerary, the consumption itinerary can be subdivided into shopping and self-funded; when the advertisement feature is transportation, the transportation can be subdivided into whether there is a transfer, early or late flights, and low-cost airlines; when the advertisement feature is transportation, the accommodation can be subdivided into the city center, five-star hotels, etc. The above-mentioned various subdivisions are for reference only and are not limited to them in actual implementation. They can also be adjusted, reset, or changed according to user needs. The above tour group data is expressed in mathematical form as: D={ , ,…, },in Represents a certain tour group data. Each tour group data includes the number of registered persons and tour group advertising data. ,in For tour group advertising materials, The registration start date, The registration deadline is For the number of applicants, Indicates whether the tour group is formed. In addition, assume that the registration data of each group On day j there are Family groups register, the information is: =( .like On day j there are Individual registration: = .

分群方式是以報名人數資料為基礎進行分群,分群方式可以是:k-means、階層式、DB-scan、高斯混合模型GMM、Fuzzy、k-means ++ (低微度)、kernal k means (高維度)。在一些實施例中是以k-means進行分群,分群過程中是先決定好該些旅行團資料的分群數量,再決定分群的依據,如歐式距離、餘弦距離等,在一些實施例中是以餘弦距離進行分群。如此一來,經過k-means分群後會得到多個旅行團分群,每一個旅行團分群內包括多個旅行團資料,且每一個旅行團資料間的報名人數資料相近似,表示彼此之間的報名熱門程度相近似。此外,該些旅行團分群是有排序,以呈現出各旅行團分群的報名熱門程度的不同,舉例來說該些旅行團分群可包括:第一旅行團分群、第二旅行團分群、第三旅行團分群…等,因此,屬於該第一旅行團分群的旅行團資料,其報名熱門程度必定高於屬於該第二旅行團分群的旅行團資料,所以可透過旅行團分群來了解各旅行團資料的報名熱門程度。另外,值得一提的是,沒有成團的旅行團資料也會被分類為一個旅行團分群,以表示報名熱門程度極低難以成團。The clustering method is based on the number of registration data. The clustering method can be: k-means, hierarchical, DB-scan, Gaussian mixture model GMM, Fuzzy, k-means ++ (low-dimensional), kernal k means (high-dimensional). In some embodiments, k-means is used for clustering. In the clustering process, the number of clusters of the tour group data is first determined, and then the basis for clustering is determined, such as Euclidean distance, cosine distance, etc. In some embodiments, cosine distance is used for clustering. In this way, after k-means clustering, multiple tour group clusters will be obtained, each tour group cluster includes multiple tour group data, and the number of registration data between each tour group data is similar, indicating that the popularity of registration between each other is similar. In addition, these tour group groups are sorted to show the different popularity of each tour group group. For example, these tour group groups may include: first tour group group, second tour group group, third tour group group, etc. Therefore, the registration popularity of the tour group data belonging to the first tour group group must be higher than that of the tour group data belonging to the second tour group group. Therefore, the registration popularity of each tour group data can be understood through the tour group group. In addition, it is worth mentioning that the tour group data that has not been formed will also be classified as a tour group group to indicate that the registration popularity is extremely low and it is difficult to form a group.

在一些實施例中,在進行步驟S01前會先進行步驟S02:處理器10對多個旅行團資料進行預處理,旅行團資料進行預處理後,可提升步驟S01的分群效果。In some embodiments, step S02 is performed before step S01: the processor 10 pre-processes the data of a plurality of tour groups. After the tour group data is pre-processed, the grouping effect of step S01 can be improved.

在一些實施例中,預處理包括報名期間正規化或報名人數標準化二者其中之一或二。首先,報名期間正規化是統一各旅行團的報名期間,統一的方式是以其中一個旅行團資料的報名期間為基準對其他不同報名期間的旅行團資料進行報名期間的減縮或延展。在一些實施例中,是以報名期間最長的旅行團資料為基礎,其餘旅行團資料以等比例方式對報名期間進行延展,延展方式則是根據每日累進報名人數進行等比例延展,以下表一為例,當其中一旅行團資料的報名期間為七天,每日累進報名人數是第一天及第二天為0人、第三天至第五天均為18人、第六天及第七天均為41人,如今預將報名期間統一延展成40天時,由於第一天及第二天累進報名人數相同,且共同為報名期間的 ,由於 ,所以第一天及第二天會延展成11天,且如下表二所示每天的累進報名人數為0;同理第三天至第五天累進報名人數相同,且同為報名期間的 ,由於 ,所以第一天及第二天會延展成17天,且如下表二所示每天的累進報名人數為18;第七天及第八天累進報名人數相同,由於剩下的天數為 ,所以第七天及第八天會延展成12天,且如下表二所示每天的累進報名人數為41。 報名天數 第一天 第二天 第三天 第四天 第五天 第六天 第七天 報名人數 0 0 18 18 18 41 41 表一 報名天數 第一天 第二天 第十一天 第十二天 第十三天 報名人數 0 0 0 0 18 18 報名天數 第二十八天 第二十九天 第三十天 第四十天 報名人數 18 18 41 41 41 41 表二 In some embodiments, the pre-processing includes one or both of registration period normalization and registration number normalization. First, registration period normalization is to unify the registration period of each tour group, and the unification method is to reduce or extend the registration period of tour group data with different registration periods based on the registration period of one tour group data. In some embodiments, the registration period of the tour group data with the longest registration period is extended in equal proportion to the registration period of the remaining tour group data. The extension method is to extend the registration period in equal proportion according to the number of daily progressive registrations. For example, in Table 1 below, when the registration period of one of the tour group data is seven days, the number of daily progressive registrations is 0 on the first and second days, 18 on the third to fifth days, and 41 on the sixth and seventh days. Now, when the registration period is uniformly extended to 40 days, since the number of progressive registrations on the first and second days is the same and is the same as the registration period, ,due to Therefore, the first and second days will be extended to 11 days, and the number of progressive registrations per day will be 0 as shown in Table 2 below; similarly, the number of progressive registrations from the third to the fifth day will be the same, and they are also the registration period. ,due to , so the first and second days will be extended to 17 days, and the progressive number of registrations per day is 18 as shown in Table 2 below; the progressive number of registrations on the seventh and eighth days is the same, and the remaining days are Therefore, the seventh and eighth days will be extended to 12 days, and the progressive number of registrants per day is 41 as shown in Table 2 below. Registration days Day 1 the next day Day 3 Day 4 Day 5 Day 6 Day 7 Number of applicants 0 0 18 18 18 41 41 Table 1 Registration days Day 1 the next day Day 11 Day 12 Day 13 Number of applicants 0 0 0 0 18 18 Registration days Day 28 Day 29 Day 30 Day 40 Number of applicants 18 18 41 41 41 41 Table 2

接著,報名人數標準化是降低報名人數的差距,進而提升分群的效果。在一些實施例中,是以統計的標準化公式來完成,其公式為 ,其中 經標準化的結果, 為母體的平均值、 是母體的標準差。在一些實施例中,是利用高斯函數來進行標準化,所使用的高斯函數為 ,其中a、b、c為常數,且a>0,x為報名人數資料。 Next, the number of applicants is standardized to reduce the difference in the number of applicants, thereby improving the effect of grouping. In some embodiments, this is accomplished using a statistical standardization formula, which is: ,in for The standardized results, is the average value of the population, is the standard deviation of the population. In some embodiments, the Gaussian function is used for standardization, and the Gaussian function used is , where a, b, c are constants, and a>0, and x is the number of applicants.

步驟S03:處理器10根據該些旅行團分群及該旅行團廣告資料對一第一預測模型進行訓練,得到一報名熱門度預測模型20。由於同一旅行團分群內的各旅行團資料其報名熱門度相近似,這也表示各旅行團資料之間的旅行團廣告資料也會近似,因此對各旅行團分群以旅行團廣告資料進行訓練後,將可取得各旅行團分群的特徵,利用這些特徵就可以對旅行團所設計的旅遊行程進行報名熱門度預測。舉例來說,當旅行社推出一項全新的旅遊行程,處理器10便可針對該旅遊行程內的廣告特徵,如價格、出團日期、旅行社知名度、行程、天數、天氣、交通等,判斷該旅遊行程與哪個旅行團分群相近似,再根據該旅行團分群便可判斷該旅遊行程的報名熱門程度。Step S03: The processor 10 trains a first prediction model according to the tour group groups and the tour group advertising data to obtain a registration popularity prediction model 20. Since the registration popularity of each tour group data in the same tour group group is similar, it also means that the tour group advertising data between the tour group data will also be similar. Therefore, after training each tour group group with the tour group advertising data, the characteristics of each tour group group can be obtained, and these characteristics can be used to predict the registration popularity of the travel itinerary designed by the tour group. For example, when a travel agency launches a new tour itinerary, the processor 10 can determine which tour group the tour itinerary is similar to based on the advertising features of the tour itinerary, such as price, departure date, travel agency popularity, itinerary, number of days, weather, traffic, etc., and then determine the popularity of the tour itinerary based on the tour group group.

在一些實施例中,處理器是利用命名實體識別法(Named Entity Recognition(NER)將旅遊行程中提取出廣告特徵,如價格、出團日期、旅行社知名度、行程、天數、天氣、交通等,接著配合字符串模糊匹配法(FuzzyWuzzy),將命名實體識別法所提出的廣告特徵之間屬於相似的廣告特徵進行刪除,然後再根據填槽法(Slot Filling)生成特徵句。其中,命名實體識別法是識別旅遊行程內具有特定意義的實體,以本說明書為例,特定有意義的實體為價格、出團日期、旅行社知名度、行程、天數、天氣、交通等;字符串模糊匹配法是一種用於辨識字串之間的相似度並與最相似之字串進行匹配,與命名實體識別法搭配使用,可更有效的提取旅遊行程內的廣告特徵;至於特徵句舉例來說為:出發日期是2022年4月6日;目的地(國家或城市)是D1、D2和D3;景點有A1、A2和A3;旅行費用約為2,000美元;飛行公司是F1;下午5點起飛;旅行社是T;行程天數是12天;酒店是H1、H2、H3、H4和H5。In some embodiments, the processor uses named entity recognition (NER) to extract advertising features from the travel itinerary, such as price, departure date, travel agency popularity, itinerary, number of days, weather, traffic, etc., and then uses fuzzy matching to delete similar advertising features from the advertising features proposed by the named entity recognition method, and then uses slot filling to match the advertising features. Filling) generates feature sentences. Among them, named entity recognition is to identify entities with specific meanings in the travel itinerary. Taking this manual as an example, the specific and meaningful entities are price, departure date, travel agency popularity, itinerary, number of days, weather, traffic, etc.; string fuzzy matching is a method used to identify the similarity between strings and match the most similar strings. When used in conjunction with named entity recognition, it can more effectively extract advertising features in the travel itinerary; as for the feature sentence, for example: the departure date is April 6, 2022; the destination (country or city) is D1, D2 and D3; the attractions are A1, A2 and A3; the travel fee is about US$2,000; the flight company is F1; the flight takes off at 5 pm; the travel agency is T; the number of days in the itinerary is 12 days; the hotels are H1, H2, H3, H4 and H5.

在取得廣告特徵後,處理器10將廣告特徵作為輸入、該些旅行團分群作為輸出對該第一預測模型進行訓練。在一些實施例中,訓練過程是以基於變換器的雙向編碼器表示技術(Bidirectional Encoder Representations from Transformers,BERT)搭配卷積神經網路(Convolutional Neural Network, CNN)進行訓練。其中,基於變換器的雙向編碼器表示技術是用於詞向量的轉換。在一些實施例中,卷積神經網路是依序包括二個卷積層(Convolution)、一個池化層(Pooling)、一個扁平層(flatten)、二個全連接層(Fully Connencted)。另外,在一些實施例中,處理器10對第一預測模型進行訓練時,其隱藏層的神經元數量為256。After obtaining the advertisement features, the processor 10 trains the first prediction model using the advertisement features as input and the tour group groups as output. In some embodiments, the training process is performed using a bidirectional encoder representation technique based on transformers (BERT) in combination with a convolutional neural network (CNN). The bidirectional encoder representation technique based on transformers is used for the conversion of word vectors. In some embodiments, the convolutional neural network sequentially includes two convolution layers, a pooling layer, a flatten layer, and two fully connected layers. In addition, in some embodiments, when the processor 10 trains the first prediction model, the number of neurons in its hidden layer is 256.

以上是介紹報名熱門度預測模型20的訓練方法,藉此對新的旅遊行程進行報名熱門度的預測,接下來是介紹報名人數預測模型30的訓練方法,利用報名人數預測模型30可讓旅行社可根據第n天的報名人數,直接預測接下來每一天的報名人數,在一些實施例中,甚至可以預測至報名期限截止當天的報名人數,或是預測第n天的報名人數就達到出團人數之門檻,因此可以預測旅遊行程是否可以成團,以利旅行社提前進行相對應處理方式,進而降低其損失。The above is an introduction to the training method of the registration popularity prediction model 20, which is used to predict the registration popularity of a new travel itinerary. The next is an introduction to the training method of the registration number prediction model 30. The registration number prediction model 30 allows travel agencies to directly predict the number of registrations for each subsequent day based on the number of registrations on the nth day. In some embodiments, it can even predict the number of registrations on the day when the registration deadline ends, or predict that the number of registrations on the nth day will reach the threshold of the number of people for the group. Therefore, it can be predicted whether the travel itinerary can be formed, so that the travel agency can take corresponding measures in advance, thereby reducing its losses.

步驟S04:該處理器10根據該些旅行團分群及該報名人數資料對一第二預測模型進行訓練,得到一報名人數預測模型30。利用該報名人數預測模型30,該處理器10可以根據前n天的報名人數預測第n+1天的報名人數,甚至可提前預測是否能成團。訓練方式是根據該報名人數資料中任取二天的每日人數資料分別作為輸入及輸出,該每日人數資料包括每日報名人數、每日報名人數累計,在一些實施例中,是以連續二天的每日報名人數累計進行訓練,如以第n天的每日報名人數累計作為輸入、第n+1天的每日報名人數累計作為輸出進行訓練,且隱藏層的神經元之數量為50。在一些實施例中,對該第二預測模型進行訓練是利用長短期記憶法(Long Short-Term Memory,LSTM)進行訓練。Step S04: The processor 10 trains a second prediction model according to the grouping of the tour groups and the registration number data to obtain a registration number prediction model 30. Using the registration number prediction model 30, the processor 10 can predict the registration number of the n+1 day according to the registration number of the previous n days, and can even predict in advance whether a tour group can be formed. The training method is to use the daily number of people data of any two days in the registration number data as input and output respectively, and the daily number of people data includes the daily number of people and the daily number of people cumulative. In some embodiments, the daily number of people cumulative for two consecutive days is used for training, such as using the daily number of people cumulative for the nth day as input and the daily number of people cumulative for the n+1th day as output for training, and the number of neurons in the hidden layer is 50. In some embodiments, the second prediction model is trained using Long Short-Term Memory (LSTM).

以上介紹完報名熱門度預測模型20及報名人數預測模型30的訓練方法,接下來介紹相關應用之系統。The above introduces the training methods of the registration popularity prediction model 20 and the registration number prediction model 30. Next, the related application systems will be introduced.

請參閱圖2所示,是一種旅行團報名預測系統,包括:一報名熱門度預測模型20、一報名人數預測模型30、及一處理器10。該處理器10根據一旅行團廣告資料及該報名熱門度預測模型20,判斷該旅行團廣告資料的報名熱門度,得到一熱門度判斷結果。Please refer to FIG. 2 , which is a tour group registration prediction system, comprising: a registration popularity prediction model 20, a registration number prediction model 30, and a processor 10. The processor 10 determines the registration popularity of the tour group advertisement data according to the tour group advertisement data and the registration popularity prediction model 20, and obtains a popularity determination result.

因此,當旅行社欲推出一旅遊行程時,只要將輸入特徵資料,該處理器10便會利用該報名熱門預測模型根據該特徵資料,便可判斷該旅遊行程屬於哪個旅行團分群,根據該旅行團分群即可預測該旅遊行程的熱門度。甚至,當該旅遊行程被歸屬於無法成團的旅行團分群時,即可判斷該旅遊行程將無法成團,旅行社可以趁早進行行程的修改。Therefore, when a travel agency wants to launch a travel itinerary, as long as the feature data is input, the processor 10 will use the registration popularity prediction model to determine which tour group the travel itinerary belongs to based on the feature data, and the popularity of the travel itinerary can be predicted based on the tour group. Even when the travel itinerary is classified into a tour group that cannot be formed, it can be determined that the travel itinerary will not be formed, and the travel agency can modify the itinerary as soon as possible.

接著,當該旅遊行程推出後,根據報名人數製作成一報名人數資料,在一些實施例中是以每日報名人數累計作為該報名人數資料。接著,該處理器10根據該旅行團廣告資料、該報名人數資料及該報名人數預測模型30進行報名人數的預測,得到一報名人數預測值。舉例來說,當輸入第n天的每日報名人數累計,該處理器10便可利用該報名人數預測模型30預測第n+1天的每日報名人數累計,甚至該處理器10可預測報名截止日期的每日報名人數累計。藉以讓旅行社得以提前知道該旅遊行程是否可以成團,以便提前進行相對應應變處理方式。Then, when the travel itinerary is launched, a registration number data is generated based on the number of applicants. In some embodiments, the daily registration number accumulation is used as the registration number data. Then, the processor 10 predicts the number of applicants based on the tour group advertising data, the registration number data and the registration number prediction model 30 to obtain a registration number prediction value. For example, when the daily registration number accumulation of the nth day is input, the processor 10 can use the registration number prediction model 30 to predict the daily registration number accumulation of the n+1th day, and even the processor 10 can predict the daily registration number accumulation of the registration deadline. This allows the travel agency to know in advance whether the travel itinerary can be formed, so as to carry out corresponding contingency processing methods in advance.

在一些實施例中,該處理器10根據一最低成團人數、該累進報名人數預測值,得到一成團率預測值,透過該成團率預測值以數值具體化呈現成團機率高低,更可具體化讓旅行社人員了解該旅遊行程的成團機率高低。In some embodiments, the processor 10 obtains a group formation rate prediction value based on a minimum number of group members and the progressive registration number prediction value. The group formation rate prediction value is used to concretely present the probability of group formation in a numerical value, and can also specifically let travel agency personnel understand the probability of group formation of the travel itinerary.

雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明之精神和範圍內,當可作些許之修改與變化。因此,只要這些修改與變化是在後附之申請專利範圍及與其同等之範圍內,本發明也將涵蓋這些修改與變化。Although the present invention has been disclosed as above by way of embodiments, it is not intended to limit the present invention. Any person with ordinary knowledge in the relevant technical field may make some modifications and changes without departing from the spirit and scope of the present invention. Therefore, as long as these modifications and changes are within the scope of the attached patent application and its equivalent, the present invention will also cover these modifications and changes.

S01~S04:步驟 10:處理器 20:報名熱門度預測模型 30:報名人數預測模型 S01~S04: Steps 10: Processor 20: Registration popularity prediction model 30: Registration number prediction model

圖1繪示本發明之旅行團報名預測模型訓練方法的步驟流程圖。 圖2繪示本發明之旅行團報名預測系統的功能方塊圖。 FIG1 shows a flow chart of the steps of the tour group registration prediction model training method of the present invention. FIG2 shows a functional block diagram of the tour group registration prediction system of the present invention.

S01~S04:步驟 S01~S04: Steps

Claims (5)

一種旅行團報名預測模型的訓練方法,包括:一處理器對多個旅行團資料以一報名人數資料為基礎進行分群,得到多個旅行團分群;其中該些旅行團資料分別包括該報名人數資料及一旅行團廣告資料;該處理器根據該些旅行團分群及該旅行團廣告資料對一第一預測模型進行訓練,得到一報名熱門度預測模型;該處理器對多個該旅行團資料以該報名人數資料為基礎進行分群的步驟中,係以該報名人數資料及成團與否為基礎對該些旅行團資料進行分群;該旅行團廣告資料包括複數廣告特徵;於該處理器根據該些旅行團分群及該旅行團廣告資料對一預測模型進行訓練之步驟中,該處理器將該些廣告特徵作為輸入、該些旅行團分群作為輸出對該第一預測模型進行訓練;以及該處理器根據該些旅行團分群及該報名人數資料對一第二預測模型進行訓練,得到一報名人數預測模型。 A method for training a tour group registration prediction model comprises: a processor groups a plurality of tour group data based on registration number data to obtain a plurality of tour group groups; wherein the tour group data respectively include the registration number data and tour group advertising data; the processor trains a first prediction model based on the tour group groups and the tour group advertising data to obtain a registration popularity prediction model; in the step of the processor grouping the plurality of tour group data based on the registration number data, the processor trains a first prediction model based on the tour group groups and the tour group advertising data to obtain a registration popularity prediction model; The tour group data are grouped based on the registration number data and whether the tour group is formed; the tour group advertising data includes multiple advertising features; in the step of the processor training a prediction model based on the tour group groups and the tour group advertising data, the processor trains the first prediction model using the advertising features as input and the tour group groups as output; and the processor trains a second prediction model based on the tour group groups and the registration number data to obtain a registration number prediction model. 如請求項1所述之旅行團報名預測模型的訓練方法,其中該報名人數資料包括一每日人數資料;於該處理器根據該些旅行團分群及該報名人數資料對該第二預測模型進行訓練之步驟中,取其中二日的該每日人數資料分別作為輸入及輸出對該第二預測模型進行訓練。 The method for training a tour group registration prediction model as described in claim 1, wherein the registration number data includes daily number data; in the step of the processor training the second prediction model according to the tour group groups and the registration number data, the daily number data of two days are taken as input and output respectively to train the second prediction model. 如請求項2所述之旅行團報名預測模型的訓練方法,其中該處理器對多個旅行團資料以報名人數資料為基礎進行分群之步驟前, 對該些旅行團資料的報名人數進行標準化。 A method for training a tour group registration prediction model as described in claim 2, wherein before the processor performs a step of grouping multiple tour group data based on the number of registration data, the number of registrations of the tour group data is standardized. 如請求項2所述之旅行團報名預測模型的訓練方法,其中該處理器對多個該旅行團資料以該報名人數資料為基礎進行分群之步驟前,對該些旅行團資料的報名天數進行等比例收縮或擴充。 A method for training a tour group registration prediction model as described in claim 2, wherein the processor proportionally shrinks or expands the registration days of the tour group data before grouping the tour group data based on the registration number data. 一種旅行團報名預測系統,包括:一報名熱門度預測模型;一處理器,根據一旅行團廣告資料及該報名熱門度預測模型,判斷該旅行團廣告資料的報名熱門度,得到一熱門度判斷結果;一報名人數預測模型,該處理器根據一報名人數資料、該旅行團廣告資料及該報名人數預測模型,得到一報名人數預測值;以及該處理器根據一最低成團人數、該報名人數預測值,得到一成團率預測值。 A tour group registration prediction system includes: a registration popularity prediction model; a processor, judging the registration popularity of the tour group advertising data and the registration popularity prediction model, and obtaining a popularity judgment result; a registration number prediction model, the processor obtaining a registration number prediction value based on a registration number data, the tour group advertising data and the registration number prediction model; and the processor obtaining a group formation rate prediction value based on a minimum number of group members and the registration number prediction value.
TW111134196A 2022-09-08 2022-09-08 A training method of tour group registration prediction model and a system of tour group registration prediction TWI872368B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW111134196A TWI872368B (en) 2022-09-08 2022-09-08 A training method of tour group registration prediction model and a system of tour group registration prediction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW111134196A TWI872368B (en) 2022-09-08 2022-09-08 A training method of tour group registration prediction model and a system of tour group registration prediction

Publications (2)

Publication Number Publication Date
TW202411931A TW202411931A (en) 2024-03-16
TWI872368B true TWI872368B (en) 2025-02-11

Family

ID=91228119

Family Applications (1)

Application Number Title Priority Date Filing Date
TW111134196A TWI872368B (en) 2022-09-08 2022-09-08 A training method of tour group registration prediction model and a system of tour group registration prediction

Country Status (1)

Country Link
TW (1) TWI872368B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201028951A (en) * 2009-01-23 2010-08-01 Shacom Com Inc System and method of optimum combination mutual-help fund raising and allocation
CN103400321A (en) * 2013-08-21 2013-11-20 苏州换游信息科技有限公司 Service system for travel agency
CN109242594A (en) * 2018-07-27 2019-01-18 国政通科技有限公司 A kind of tourist group's group generation method, device and storage medium
CN109325803A (en) * 2018-09-06 2019-02-12 洪文朗 A kind of Destination Management platform
US20190108546A1 (en) * 2017-10-11 2019-04-11 Mastercard International Incorporated Providing targeted communications based on travel related data
CN111652444A (en) * 2020-06-05 2020-09-11 南京机电职业技术学院 A daily tourist traffic forecast method based on K-means and LSTM
CN111915395A (en) * 2020-07-07 2020-11-10 云境商务智能研究院南京有限公司 Travel bag recommendation method based on multi-view attention mechanism
CN112488746A (en) * 2020-11-03 2021-03-12 广州易起行信息技术有限公司 Secondary marketing method of travel product based on big data user behavior, electronic device and storage medium
CN114091768A (en) * 2021-11-25 2022-02-25 安徽大学 Tourism Demand Prediction Method Based on STL and LSTM with Attention Mechanism

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201028951A (en) * 2009-01-23 2010-08-01 Shacom Com Inc System and method of optimum combination mutual-help fund raising and allocation
CN103400321A (en) * 2013-08-21 2013-11-20 苏州换游信息科技有限公司 Service system for travel agency
US20190108546A1 (en) * 2017-10-11 2019-04-11 Mastercard International Incorporated Providing targeted communications based on travel related data
CN109242594A (en) * 2018-07-27 2019-01-18 国政通科技有限公司 A kind of tourist group's group generation method, device and storage medium
CN109325803A (en) * 2018-09-06 2019-02-12 洪文朗 A kind of Destination Management platform
CN111652444A (en) * 2020-06-05 2020-09-11 南京机电职业技术学院 A daily tourist traffic forecast method based on K-means and LSTM
CN111915395A (en) * 2020-07-07 2020-11-10 云境商务智能研究院南京有限公司 Travel bag recommendation method based on multi-view attention mechanism
CN112488746A (en) * 2020-11-03 2021-03-12 广州易起行信息技术有限公司 Secondary marketing method of travel product based on big data user behavior, electronic device and storage medium
CN114091768A (en) * 2021-11-25 2022-02-25 安徽大学 Tourism Demand Prediction Method Based on STL and LSTM with Attention Mechanism

Also Published As

Publication number Publication date
TW202411931A (en) 2024-03-16

Similar Documents

Publication Publication Date Title
Kang et al. Transfer learning of a deep learning model for exploring tourists’ urban image using geotagged photos
Stepchenkova et al. Cultural differences in pictorial destination images: Russia through the camera lenses of American and Korean tourists
Li et al. A dynamic panel data analysis of climate and tourism demand: Additional evidence
WO2016019857A1 (en) Service distribution system and method
Castillo-Manzano et al. When the mall is in the airport: Measuring the effect of the airport mall on passengers’ consumer behavior
Niu et al. An online-traffic-prediction based route finding mechanism for smart city
CN112000801A (en) Government affair text classification and hot spot problem mining method and system based on machine learning
CN111104498A (en) Semantic understanding method in task type dialogue system
CN113934835B (en) Retrieval type reply dialogue method and system combining keywords and semantic understanding representation
CN110020121A (en) Software crowdsourcing item recommendation method and system based on transfer learning
CN110442662A (en) A method for determining user attribute information and an information push method
Chen et al. Exploring cross-cultural disparities in tourists' perceived images: a text mining and sentiment analysis study using LDA and BERT-BILSTM models
Zheng et al. Research on urban public green space planning based on taxi data: A case study on three districts of Shenzhen, China
CN115600751A (en) Individual trajectory data trip purpose prediction method based on long-term and short-term memory network
Peng et al. Analysis of destination images in the emerging ski market: The case study in the host city of the 2022 Beijing Winter Olympic Games
Yfantidou et al. Experience Economy Perspective on Recreational Fishing Tourism Travelers’ Reviews: A Data Science Approach
Fu et al. Theme exploration and sentiment analysis of online reviews of Wuyishan National park
Dou et al. How do people perceive the quality of urban transport service? New insights from online reviews of Shanghai metro system
TWI872368B (en) A training method of tour group registration prediction model and a system of tour group registration prediction
Du et al. Zone2vec: Distributed representation learning of urban zones
CN115718799B (en) A method for measuring spatiotemporal semantic similarity of public transport passengers' travel behavior
CN113222299A (en) Intelligent vehicle scheduling system and method for scenic spot
González et al. Visitors’ attitudes towards bicycle use in the Teide National Park
CN115100395B (en) A method for urban block function classification integrating POI pre-classification and graph neural network
CN119149833B (en) Tourist attraction recommendation method and system based on big data, equipment and storage medium