[go: up one dir, main page]

CN111639978A - Event-driven demand forecasting method for e-commerce based on Prophet-random forest - Google Patents

Event-driven demand forecasting method for e-commerce based on Prophet-random forest Download PDF

Info

Publication number
CN111639978A
CN111639978A CN202010511718.2A CN202010511718A CN111639978A CN 111639978 A CN111639978 A CN 111639978A CN 202010511718 A CN202010511718 A CN 202010511718A CN 111639978 A CN111639978 A CN 111639978A
Authority
CN
China
Prior art keywords
event
data
prophet
random forest
prediction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010511718.2A
Other languages
Chinese (zh)
Other versions
CN111639978B (en
Inventor
张梦雅
王家宁
任梦婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University of Technology WUT
Original Assignee
Wuhan University of Technology WUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University of Technology WUT filed Critical Wuhan University of Technology WUT
Priority to CN202010511718.2A priority Critical patent/CN111639978B/en
Publication of CN111639978A publication Critical patent/CN111639978A/en
Application granted granted Critical
Publication of CN111639978B publication Critical patent/CN111639978B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明公开了一种基于Prophet‑随机森林的电商事件驱动型需求量预测方法,包括步骤:获取电商平台的历史销售数据,销售数据包括时序数据和购买相关产品用户数据;清洗历史销售数据,提高数据质量;将时序数据进行尺度压缩,降低数据波动性;基于节日活动及电商平台促销事件进行事件建模;为Prophet模型附加事件回归量,并进行Prophet预测,剔除并插值处理属于事件效应范围内的离群点,并进行随机森林预测;组合Prophet预测与随机森林预测结果;进行精度评估以观测模型泛化能力与预测效果。本发明针对电商事件需求及非事件需求特点,分别应用Prophet、随机森林进行预测,提高了电商事件驱动型需求的预测精度。

Figure 202010511718

The invention discloses an event-driven demand forecasting method for e-commerce based on Prophet-random forest, comprising the steps of: acquiring historical sales data of an e-commerce platform, the sales data including time series data and user data for purchasing related products; cleaning the historical sales data , improve data quality; compress time series data to reduce data volatility; conduct event modeling based on festival activities and e-commerce platform promotion events; add event regressors to the Prophet model, and perform Prophet prediction, eliminate and interpolate as events Outliers within the effect range, and perform random forest prediction; combine Prophet prediction and random forest prediction results; perform accuracy evaluation to observe model generalization ability and prediction effect. Aiming at the characteristics of e-commerce event demand and non-event demand, the present invention uses Prophet and random forest for prediction respectively, and improves the prediction accuracy of the event-driven demand of e-commerce.

Figure 202010511718

Description

基于Prophet-随机森林的电商事件驱动型需求量预测方法Event-driven demand forecasting method for e-commerce based on Prophet-random forest

技术领域technical field

本发明属于电商需求预测领域,具体涉及一种基于Prophet-随机森林的电商事件驱动型需求量预测方法。The invention belongs to the field of e-commerce demand forecasting, and in particular relates to an event-driven demand forecast method for e-commerce based on Prophet-random forest.

背景技术Background technique

节日活动及电商平台促销事件之间日期间隔并不固定,并不呈现标准的周期性运动规律,其驱动的短期消费者需求呈现出事件短期内激增、非事件效应影响时段内小幅度波动的状态。但是,当前电商需求预测常用方法为自回归移动平均模型,其进行时序数据预测的前提是数据稳定,无法很好地捕捉不稳定数据的规律,在电商订单需求常常由节日活动及电商平台促销事件驱动且具备突发性特征的情况下,现有的模型并不能表现真实商业环境下的需求。The date interval between festival activities and e-commerce platform promotion events is not fixed, and does not show a standard cyclical movement law. The short-term consumer demand driven by it shows a surge in the short term of the event and a small fluctuation in the non-event effect period. state. However, the current common method for e-commerce demand forecasting is the autoregressive moving average model. The premise of time series data forecasting is that the data is stable, and the law of unstable data cannot be well captured. Under the circumstance that platform promotion events are driven and have the characteristics of emergencies, the existing models cannot represent the needs of the real business environment.

发明内容SUMMARY OF THE INVENTION

本发明的目的是提供一种基于Prophet-随机森林的电商事件驱动型需求量预测方法,本发明针对电商事件需求及非事件需求特点,分别应用Prophet、随机森林进行预测,提高了电商事件驱动型需求的预测精度。The purpose of the present invention is to provide an event-driven demand forecasting method for e-commerce based on Prophet-random forest. According to the characteristics of e-commerce event demand and non-event demand, the present invention uses Prophet and random forest for prediction respectively, which improves the efficiency of e-commerce. Forecast accuracy for event-driven demand.

本发明所采用的技术方案是:The technical scheme adopted in the present invention is:

一种基于Prophet-随机森林的电商事件驱动型需求量预测方法,包括步骤:An event-driven demand forecasting method for e-commerce based on Prophet-random forest, including steps:

步骤1、获取电商平台的历史销售数据,销售数据包括时序数据和购买相关产品用户数据;Step 1. Obtain historical sales data of the e-commerce platform, including time series data and user data for purchasing related products;

时序数据包含不同时间上的销售数据,以描述需求量随时间变化的情况;购买相关产品用户数据应包括产品类别描述与客户信息。Time series data includes sales data at different times to describe the changes in demand over time; user data for purchasing related products should include product category descriptions and customer information.

步骤2、清洗历史销售数据,提高数据质量;Step 2. Clean historical sales data to improve data quality;

数据清洗操作包括无效值处理、缺失值与重复值处理、一致性处理、数据子集筛选和数据排序,其中,无效值处理是用统计分析方法识别出极端值并进行删除或替换,缺失值与重复值处理是用平均值、最大值、最小值或概率估计代替缺失的值并删除重复数据,一致性处理是根据变量合法规则及逻辑将超出正常范围内的数据删除,数据子集筛选是筛选有用信息并减少字段冗余,数据排序是按照销售数据中的时序数据对整体数据进行时序排序以观测数据的时序变化规律。Data cleaning operations include invalid value processing, missing value and duplicate value processing, consistency processing, data subset filtering and data sorting. Among them, invalid value processing is to identify extreme values with statistical analysis methods and delete or replace them. Duplicate value processing is to replace missing values with mean, maximum, minimum or probability estimates and remove duplicate data. Consistency processing is to remove data that are outside the normal range according to the legal rules and logic of variables. Data subset screening is to filter Useful information and reduce field redundancy. Data sorting is to sort the overall data in time series according to the time series data in the sales data to observe the time series change law of the data.

步骤3、将时序数据进行尺度压缩操作,降低数据波动性;Step 3. Scale the time series data to reduce data volatility;

时序数据为经提取的时间戳与订单需求量;时间戳数据列包含日期(YYYY-MM-DD)或具体时间点(YYYY-MM-DD HH:MM:SS);电商订单需求数据列是数值变量;The time series data is the extracted timestamp and order demand; the timestamp data column contains the date (YYYY-MM-DD) or the specific time point (YYYY-MM-DD HH:MM:SS); the e-commerce order demand data column is numeric variable;

尺度压缩是基于时序分解的乘法模型y(t)=T*S*H*I,即将时序分解为趋势项T,季节项S,节日活动及电商平台促销事件项H,随机波动项I;数据进行尺度压缩的方式:Scale compression is a multiplication model y(t)=T*S*H*I based on time series decomposition, that is, the time series is decomposed into trend items T, seasonal items S, festival activities and e-commerce platform promotion event items H, and random fluctuation items I; The way the data is scaled compressed:

y(t)(1)=ln(y(t))y(t) (1) = ln(y(t))

步骤4、基于节日活动及电商平台促销事件进行事件建模;Step 4. Conduct event modeling based on festival activities and e-commerce platform promotion events;

事件建模为不同节日活动及电商平台促销事件Hi设置前后时间窗口值,其中Hi=(H1,H2,...,Hn),并假设节日活动及电商平台促销事件对前后窗口值中的订单需求量的影响呈高斯分布,即k~N(0,γ2),其中,γ表示事件对模型的影响程度。对于具体的事件中的窗口日t,其事件效应E(t)用虚拟变量表示:The event modeling is to set the time window values before and after different festival activities and e-commerce platform promotional events Hi, where Hi = (H 1 , H 2 ,..., H n ) , and assume festival activities and e-commerce platform promotional events The influence on the order demand in the front and back window values is Gaussian distribution, ie k~N(0, γ 2 ), where γ represents the degree of influence of the event on the model. For the window day t in a specific event, its event effect E(t) is represented by a dummy variable:

Figure BDA0002528569820000021
Figure BDA0002528569820000021

步骤5,Step 5,

1、为Prophet模型附加事件回归量,并进行Prophet预测,Prophet预测包括步骤:1. Attach an event regressor to the Prophet model and perform Prophet prediction. Prophet prediction includes steps:

1)创建事件附加回归量,将步骤4中正向影响订单需求量的事件再次进行相同的虚拟变量赋值,但对不同的事件赋予不同γ以进一步区分不同事件影响程度;1) Create an event additional regressor, assign the same dummy variable assignment to the event that positively affects the order demand in step 4, but assign different γ to different events to further distinguish the degree of influence of different events;

2)通过创建事件列表,导入事件附加回归量与步骤3中的事件效应;2) By creating an event list, import the event additional regressor and the event effect in step 3;

3)切割训练集进行模型训练,利用测试集初步评估模型得分。并基于模型训练与测试结果调整参数,确定最终趋势项模型、突变点的位置、个数与增长率,季节性的拟合程度与相关季节因素;3) Cut the training set for model training, and use the test set to preliminarily evaluate the model score. And adjust the parameters based on the model training and testing results to determine the final trend item model, the location, number and growth rate of mutation points, the degree of seasonal fit and related seasonal factors;

4)使用评估指标平均绝对误差、均方根误差、平均绝对百分比误差进行模型预测效果评价。4) Use the evaluation indicators mean absolute error, root mean square error, and mean absolute percentage error to evaluate the model prediction effect.

2、剔除并插值处理属于事件效应的离群点,并进行随机森林预测;2. Eliminate and interpolate outliers belonging to event effects, and perform random forest prediction;

事件效应的离群点是利用Box Plot图与建模事件筛选出的事件效应范围内的时序数据中的离群点,在进行随机森林预测前将其剔除并进行插值处理,以弥补剔除离群点后的缺失项;The outliers of the event effect are the outliers in the time series data within the scope of the event effect screened out by the Box Plot plot and the modeling event, and they are eliminated and interpolated before random forest prediction to make up for the elimination of outliers. missing item after point;

随机森林预测包括步骤:Random forest prediction consists of steps:

1)结合时序数据、事件建模结果与原始数据中的其他信息进行时序特征、产品及用户特征构造;1) Combine time series data, event modeling results and other information in the original data to construct time series features, product and user features;

时序特征包括历史特征、平移特征与滑动窗口;时序历史特征包括相关的星期、月份、年份;星期特征值包括{0,1,2,3,4,5,6},依次对应当前订单日期是星期一至星期日;月份特征值包括{1,2,3,4……12},依次对应订单日期月份一月至十二月;年份特征值依次对应订单日期中的相应年份;平移特征指时间序列向前平移后相应的订单需求量与原始数据的相关性特征;滑动窗口特征是对同一特征在不同时间维度下的体现,由对固定间隔的时间段中对应的订单需求量的统计分析得出;Time series features include historical features, translation features and sliding windows; time series historical features include related weeks, months, and years; week feature values include {0, 1, 2, 3, 4, 5, 6}, which in turn correspond to the current order date. Monday to Sunday; the month feature value includes {1, 2, 3, 4...12}, corresponding to the order date month January to December in turn; the year feature value corresponds to the corresponding year in the order date in turn; the translation feature refers to the time The correlation characteristics between the corresponding order demand and the original data after the sequence is shifted forward; the sliding window feature is the embodiment of the same feature in different time dimensions, which is obtained from the statistical analysis of the corresponding order demand in the fixed interval time period. out;

产品及用户特征以原始数据中的产品类别、用户信息、商品属性信息为基础进行特征构造;The product and user features are constructed based on the product category, user information, and commodity attribute information in the original data;

2)基于基尼指数来评估各特征重要性进行特征排序及筛选;2) Based on the Gini index to evaluate the importance of each feature for feature ranking and screening;

其中,特征筛选指剔除重要性小于一定程度(如1%)的特征,以避免模型过拟合;Among them, feature screening refers to eliminating features whose importance is less than a certain degree (such as 1%) to avoid model overfitting;

3)进行模型训练与参数调整;3) Carry out model training and parameter adjustment;

模型训练前提是切割训练集,切割比例应与Prophet算法切割比例相同;The premise of model training is to cut the training set, and the cutting ratio should be the same as that of the Prophet algorithm;

参数调整是利用测试集初步评估随机森林模型得分,并基于模型得分进行参数调整,调整弱学习器的最大迭代次数与决策树深度等模型拟合参数;Parameter adjustment is to use the test set to initially evaluate the random forest model score, and adjust the parameters based on the model score, and adjust the model fitting parameters such as the maximum number of iterations of the weak learner and the depth of the decision tree;

4)使用平均绝对误差、均方根误差、平均绝对百分比误差评估指标进行模型预测效果评价。4) Use the mean absolute error, root mean square error, and mean absolute percentage error evaluation indicators to evaluate the model prediction effect.

步骤6、组合Prophet预测与随机森林预测结果;Step 6. Combine Prophet prediction and random forest prediction results;

组合结果包括Prophet预测结果中的事件效应预测值与随机森林预测结果中的非事件效应预测值。The combined results include the predicted value of event effects in the prediction results of Prophet and the predicted value of non-event effects in the prediction results of random forest.

步骤7、进行精度评估以观测模型泛化能力与预测效果;Step 7. Carry out accuracy evaluation to observe the generalization ability and prediction effect of the model;

计算Prophet-随机森林组合模型的平均绝对误差、均方根误差、平均绝对百分比误差的评估指标,并分别与Prophet、随机森林模型的预测精度进行比较,评估组合模型的预测效果。Calculate the evaluation indicators of the mean absolute error, root mean square error, and mean absolute percentage error of the Prophet-random forest combination model, and compare with the prediction accuracy of the Prophet and random forest models respectively to evaluate the prediction effect of the combined model.

本发明的有益效果是:The beneficial effects of the present invention are:

本发明针对电商事件需求及非事件需求特点,分别应用Prophet、随机森林进行预测,提高了电商事件驱动型需求的预测精度——Prophet模型主要用于电商事件建模,通过设立附加事件回归量,对电商事件进行敏感捕捉,并对未来的相应事件进行推测,有效解决了常规需求预测方法无法按照平稳的周期性模式对电商节假日及促销事件进行建模的问题,同时利用随机森林模型结合原始数据中产品及用户特征,对非事件的时间序列进行回归预测,弥补了Prophet模型因极端离群点而产生的非事件的时间序列预测区间过于宽泛的问题,同时起到了合理利用非时序特征之外的其他原始数据的作用。Aiming at the characteristics of e-commerce event requirements and non-event requirements, the present invention uses Prophet and random forest for prediction respectively, which improves the prediction accuracy of e-commerce event-driven requirements. The Prophet model is mainly used for e-commerce event modeling. By establishing additional events Regressor, sensitively captures e-commerce events, and speculates on corresponding future events, effectively solving the problem that conventional demand forecasting methods cannot model e-commerce holidays and promotional events according to a stable periodic pattern. The forest model combines product and user characteristics in the original data to perform regression prediction on non-event time series, which makes up for the problem that the non-event time series prediction interval of the Prophet model is too wide due to extreme outliers, and at the same time plays a rational use. The role of other raw data other than non-sequential features.

附图说明Description of drawings

图1是本发明实施例的主要步骤流程图。FIG. 1 is a flow chart of main steps of an embodiment of the present invention.

图2是本发明实施例中电商实际需求量与预测量对比图。FIG. 2 is a comparison diagram of the actual demand and the predicted amount of the e-commerce in the embodiment of the present invention.

具体实施方式Detailed ways

下面结合附图和实施例对本发明作进一步的说明。The present invention will be further described below with reference to the accompanying drawings and embodiments.

以母婴用品的需求量为例,如图1所示,公开了一种基于Prophet-随机森林的电商事件驱动型需求量预测方法,包括步骤:Taking the demand for maternal and infant products as an example, as shown in Figure 1, an event-driven demand forecasting method for e-commerce based on Prophet-Random Forest is disclosed, including the steps:

步骤1、获取电商平台的历史销售数据,销售数据包括时间序列数据与购买相关产品用户数据;Step 1. Obtain the historical sales data of the e-commerce platform, and the sales data includes time series data and user data for purchasing related products;

历史数据使用天池数据实验室网站(https://tianchi.aliyun.com)上传的淘宝及天猫母婴销售数据集。数据集包括两个数据表,分别为婴儿信息表与交易信息表。婴儿信息表数据字段包括婴儿出生日期、性别、用户ID;交易信息表数据字段包括用户ID、用户行为描述、商品序列号、商品二级类目、物品属性描述、用户购买数量、购买日期。其中,两个数据表通过用户ID数据列联系到一起,一行数据表示一个用户在一个购买日期的交易信息。The historical data uses the Taobao and Tmall maternal and child sales data sets uploaded on the Tianchi Data Lab website (https://tianchi.aliyun.com). The dataset includes two data tables, namely the infant information table and the transaction information table. The data fields of the baby information table include the date of birth, gender, and user ID of the baby; the data fields of the transaction information table include user ID, user behavior description, product serial number, product secondary category, item attribute description, user purchase quantity, and purchase date. Among them, the two data tables are linked together by the user ID data column, and one row of data represents the transaction information of a user on a purchase date.

步骤2、清洗历史销售数据,提高数据质量;Step 2. Clean historical sales data to improve data quality;

数据清洗操作包括无效值处理、缺失值与重复值处理、一致性处理、数据子集筛选和数据排序:Data cleaning operations include invalid value processing, missing and duplicate value processing, consistency processing, data subset filtering, and data sorting:

1)无效值处理;1) Invalid value processing;

处理无效婴儿年龄数据。将两张数据表合并后,利用婴儿出生日期与用户购买日期计算用户发生购买行为时的婴儿年龄。考虑到客户输入的婴儿信息可能并不准确,将婴儿年龄无效数据删除,即删除大于28岁与小于0岁中的未处于怀孕周期的数据,并将小于0岁但仍处于怀孕周期内的婴儿年龄改为0岁;同时处理购买数量极端值。对购买数量数据进行描述统计分析,删除最大值10000所在的数据行;Handling invalid baby age data. After combining the two data tables, use the baby's birth date and the user's purchase date to calculate the baby's age when the user made a purchase. Considering that the baby information entered by the customer may be inaccurate, delete the invalid baby age data, that is, delete the data that is older than 28 years old and less than 0 years old and are not in the pregnancy cycle, and delete the baby who is less than 0 years old but still in the pregnancy cycle. Age changed to 0; also handle extreme value of purchase quantity. Perform descriptive statistical analysis on the purchase quantity data, and delete the data row where the maximum value is 10000;

2)缺失值与重复值处理:经过查找筛选,该数据集并无空白值与重复值;2) Handling of missing values and duplicate values: After searching and filtering, the data set has no blank values and duplicate values;

3)一致性处理;3) Consistency processing;

处理异常性别数据。婴儿性别为0或1,分别代表男性或女性。将不符合逻辑的婴儿性别为2的数据删除;同时处理购买日期数据。将购买日期统一修改为标准日期格式;Handling abnormal gender data. Baby gender is 0 or 1, representing male or female, respectively. Remove illogical baby gender 2 data; also process purchase date data. Change the purchase date uniformly to the standard date format;

4)数据子集筛选;4) Data subset screening;

由于本发明所用方法以时间序列分析和产品特征分析为基础,从原始数据集中选择出婴儿年龄、婴儿性别、商品二级类目、用户购买数量、购买日期数据列。其中,物品属性描述数据列过于复杂且无法知晓其中含义,故未考虑该项数据;Since the method used in the present invention is based on time series analysis and product feature analysis, data columns of infant age, infant gender, secondary category of commodities, user purchase quantity, and purchase date are selected from the original data set. Among them, the item attribute description data column is too complicated and its meaning cannot be known, so this data is not considered;

5)数据排序;5) Data sorting;

基于时间序列对数据集进行整体排序。并合并相同购买日期的购买数量。Overall ordering of the dataset based on time series. And combine the purchase quantities of the same purchase date.

步骤3、将时序数据进行尺度压缩,降低数据波动性;Step 3. Scale the time series data to reduce data volatility;

基于时序分解的乘法模型y(t)=T*S*H*I,即将时序分解为趋势项T,季节项S,节假日及促销事件项H,随机波动项I,通过对数变换对数据进行尺度压缩。The multiplication model y(t)=T*S*H*I based on time series decomposition, that is, the time series is decomposed into trend item T, seasonal item S, holiday and promotion event item H, random fluctuation item I, and the data is processed by logarithmic transformation. Scale compression.

步骤4、基于节日活动及电商平台促销事件进行事件建模;Step 4. Conduct event modeling based on festival activities and e-commerce platform promotion events;

根据数据集中所含时间序列,进行了主要电商母婴事件建模。为数据事件范围内重要的电商母婴事件Hi设置前后时间窗口值,其中Hi=(H1,H2,...,H6),并假设电商事件对前后窗口值中的订单需求量的影响呈高斯分布,即k~N(0,γ2),其中,γ取10;对于具体的事件中的窗口日t,其事件效应E(t)用虚拟变量表示;Based on the time series contained in the dataset, the main e-commerce maternal and child events were modeled. Set the before and after time window values for the important e-commerce maternal and child events Hi within the scope of the data event, where Hi =( H 1 , H 2 ,..., H 6 ), and assume that the e-commerce event has a significant impact on The influence of the order demand is Gaussian distribution, namely k~N(0, γ 2 ), where γ is 10; for the window day t in a specific event, the event effect E(t) is represented by a dummy variable;

需要说明的是,在其他可行的实施例中不限制事件前后具体的窗口期,不同的产品、时间序列、电商平台也有不同的事件列表。It should be noted that, in other feasible embodiments, the specific window period before and after the event is not limited, and different products, time series, and e-commerce platforms also have different event lists.

表1电商母婴事件列表Table 1 List of e-commerce maternal and child events

Figure BDA0002528569820000041
Figure BDA0002528569820000041

步骤5,Step 5,

1、为Prophet模型附加事件回归量,并进行Prophet模型预测;1. Attach the event regressor to the Prophet model, and perform the Prophet model prediction;

Prophet预测包括步骤:Prophet prediction consists of steps:

1)创建事件附加回归量,将步骤4中正向影响订单需求量的事件再次进行相同的虚拟变量赋值,但对不同的事件赋予不同γ以进一步区分不同事件影响程度。本实施例中,附加事件为妇女节、母亲节、九九大促、双十一、双十二;附加事件分别设置具体的γ值分别为:10,20,20,40,20;1) Create an event additional regressor, assign the same dummy variable assignment to the event that positively affects the order demand in step 4, but assign different γ to different events to further distinguish the degree of influence of different events. In this embodiment, the additional events are Women's Day, Mother's Day, 99 Promotion, Double Eleven, Double Twelve; the additional events are respectively set with specific γ values: 10, 20, 20, 40, 20;

2)通过创建事件列表,导入事件附加回归量与步骤3中的事件效应;2) By creating an event list, import the event additional regressor and the event effect in step 3;

3)切割训练集进行模型训练,设定训练集所占总数据集比例为0.75。基于模型训练与测试结果调整参数结果如下:最终趋势项模型为分段线性模型,突变点的增长率为0.1,置信区间为0.95;3) Cut the training set for model training, and set the proportion of the training set to the total data set to 0.75. The results of adjusting parameters based on the model training and testing results are as follows: the final trend term model is a piecewise linear model, the growth rate of the mutation point is 0.1, and the confidence interval is 0.95;

4)使用评估指标平均绝对误差(MAE)、均方根误差(RMSE)、平均绝对百分比误差(MAPE)进行模型预测效果评价。4) Use the evaluation indicators Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE) to evaluate the prediction effect of the model.

2、剔除并插值处理属于事件效应的离群点,并进行随机森林预测;2. Eliminate and interpolate outliers belonging to event effects, and perform random forest prediction;

根据Box plot图的,本实施例中时序数据存在55个离群点。本实施例中离群点是电商小型事件如个体店铺促销的驱动结果,属于正常的观测数据。因此为对非事件销售数据进行预测,仅将属于事件效应中的11个离群点剔除,并利用Prophet算法插值处理提出后数据列产生的缺失值。According to the Box plot, there are 55 outliers in the time series data in this example. The outliers in this embodiment are the driving results of small e-commerce events such as individual store promotions, and belong to normal observation data. Therefore, in order to predict the non-event sales data, only 11 outliers belonging to the event effect are eliminated, and the Prophet algorithm is used to interpolate the missing values generated by the proposed data column.

对离群点处理后的数据进行一阶差分处理,以获得更平稳的数据;First-order difference processing is performed on the data after processing outliers to obtain more stable data;

随机森林预测包括步骤:Random forest prediction consists of steps:

1)结合时序数据、事件建模结果与原始数据中的其他信息进行时序特征、产品及用户特征构造;1) Combine time series data, event modeling results and other information in the original data to construct time series features, product and user features;

本实施例主要根据时序历史特征与平移特征创建时序特征,根据步骤2中的筛选子集结果This embodiment mainly creates time-series features based on time-series historical features and translation features, and selects subsets based on the results of step 2.

创建产品与用户特征,并结合步骤4中的事件建模结果增加母婴事件特征。Create product and user features, and add maternal and infant event features based on the event modeling results in step 4.

表2特征创建列表Table 2 Feature Creation List

Figure BDA0002528569820000051
Figure BDA0002528569820000051

2)基于基尼指数来评估各特征重要性进行特征排序及筛选;2) Based on the Gini index to evaluate the importance of each feature for feature ranking and screening;

剔除重要性小于1%的特征以避免模型过拟合。剔除后剩余特征为:Day、Weekday、Month、Year、Event、Good_28、Good_50008168、Good_50014815、Good_38、Good_122650008、Good_50022520、Baby_age;Features less than 1% importance are removed to avoid model overfitting. The remaining features after removal are: Day, Weekday, Month, Year, Event, Good_28, Good_50008168, Good_50014815, Good_38, Good_122650008, Good_50022520, Baby_age;

3)进行模型训练与参数调整;3) Carry out model training and parameter adjustment;

切割训练集进行模型训练,切割比例与Prophet算法切割比例相同,占总数据集比例为0.75。利用测试集初步评估随机森林模型得分。基于模型得分进行参数调整,调整弱学习器的最大迭代次数为200,决策树最大深度为20,切分策略使用最小平方平均误差(MSE);The training set is cut for model training, and the cutting ratio is the same as that of the Prophet algorithm, accounting for 0.75 of the total data set. Use the test set to initially evaluate random forest model scores. Adjust the parameters based on the model score, adjust the maximum number of iterations of the weak learner to 200, the maximum depth of the decision tree to 20, and use the minimum square mean error (MSE) for the segmentation strategy;

4)使用评估指标平均绝对误差(MAE)、均方根误差(RMSE)、平均绝对百分比误差(MAPE)进行模型预测效果评价。4) Use the evaluation indicators Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Mean Absolute Percentage Error (MAPE) to evaluate the prediction effect of the model.

表3模型评估指标对比Table 3 Comparison of model evaluation indicators

Figure BDA0002528569820000061
Figure BDA0002528569820000061

步骤6、组合Prophet预测与随机森林预测结果;Step 6. Combine Prophet prediction and random forest prediction results;

组合结果包括Prophet预测结果中的事件效应预测值与随机森林预测结果中的非事件效应预测值。The combined results include the predicted value of event effects in the prediction results of Prophet and the predicted value of non-event effects in the prediction results of random forest.

步骤7、进行精度评估以观测模型泛化能力与预测效果;Step 7. Carry out accuracy evaluation to observe the generalization ability and prediction effect of the model;

计算Prophet-随机森林组合模型的平均绝对误差、均方根误差、平均绝对百分比误差的评估指标,并分别与Prophet、随机森林模型的预测精度进行比较,评估组合模型的预测效果。如图2所示,本发明的模型评估结果表示,本发明所述Prophet-随机森林算法与Prophet算法和随机森林算法相比,对于电商事件驱动型需求具备更好的预测性能。Calculate the evaluation indicators of the mean absolute error, root mean square error, and mean absolute percentage error of the Prophet-random forest combination model, and compare with the prediction accuracy of the Prophet and random forest models respectively to evaluate the prediction effect of the combined model. As shown in FIG. 2 , the model evaluation result of the present invention shows that the Prophet-Random Forest algorithm of the present invention has better prediction performance for the event-driven demand of e-commerce compared with the Prophet algorithm and the random forest algorithm.

应当理解的是,对本领域普通技术人员来说,可以根据上述说明加以改进或变换,而所有这些改进和变换都应属于本发明所附权利要求的保护范围。It should be understood that, for those skilled in the art, improvements or changes can be made according to the above description, and all these improvements and changes should fall within the protection scope of the appended claims of the present invention.

Claims (10)

1.一种基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:包括步骤,1. an e-commerce event-driven demand forecasting method based on Prophet-random forest, is characterized in that: comprise steps, 步骤1、获取电商平台的历史销售数据,销售数据包括时序数据和购买相关产品用户数据;Step 1. Obtain historical sales data of the e-commerce platform, including time series data and user data for purchasing related products; 步骤2、清洗历史销售数据,提高数据质量;Step 2. Clean historical sales data to improve data quality; 步骤3、将时序数据进行尺度压缩,降低数据波动性;Step 3. Scale the time series data to reduce data volatility; 步骤4、基于节日活动及电商平台促销事件进行事件建模;Step 4. Conduct event modeling based on festival activities and e-commerce platform promotion events; 步骤5、为Prophet模型附加事件回归量,并进行Prophet预测;剔除并插值处理属于事件效应范围内的离群点,并进行随机森林预测;Step 5. Add an event regressor to the Prophet model, and perform Prophet prediction; remove and interpolate outliers that belong to the range of event effects, and perform random forest prediction; 步骤6、组合Prophet预测与随机森林预测结果;Step 6. Combine Prophet prediction and random forest prediction results; 步骤7、进行精度评估以观测模型泛化能力与预测效果。Step 7: Carry out an accuracy evaluation to observe the generalization ability and prediction effect of the model. 2.如权利要求1所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤1中,时序数据包含不同时间上的销售数据,以描述需求量随时间变化的情况;购买相关产品用户数据包括产品类别描述与客户信息。2. The event-driven demand forecasting method for e-commerce based on Prophet-random forest as claimed in claim 1, wherein in step 1, the time series data includes sales data at different times to describe demand over time Changes; purchase-related product user data includes product category descriptions and customer information. 3.如权利要求1所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤2中,数据清洗操作包括无效值处理、缺失值与重复值处理、一致性处理、数据子集筛选和数据排序,其中,无效值处理是用统计分析方法识别出极端值并进行删除或替换,缺失值与重复值处理是用平均值、最大值、最小值或概率估计代替缺失的值并删除重复数据,一致性处理是根据变量合法规则及逻辑将超出正常范围内的数据删除,数据子集筛选是筛选有用信息并减少字段冗余,数据排序是按照销售数据中的时序数据对整体数据进行时序排序以观测数据的时序变化规律。3. The prophet-random forest-based e-commerce event-driven demand forecasting method according to claim 1, wherein in step 2, the data cleaning operation includes invalid value processing, missing value and repeated value processing, consistent Sexual processing, data subset screening and data sorting, among which, invalid value processing is to identify extreme values by statistical analysis and delete or replace them; Replace missing values and delete duplicate data. Consistency processing is to delete data that exceeds the normal range according to the legal rules and logic of variables. Data subset filtering is to filter useful information and reduce field redundancy. Data sorting is based on sales data. Time series data sorts the overall data in time series to observe the time series variation of the data. 4.如权利要求1所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤3中,时序数据为经提取的时间戳和订单需求量,时间戳数据列包含日期或具体的时间点,订单需求量数据列是数值变量,基于时序分解的乘法模型y(t)=T*S*H*I,将时序分解为趋势项T、季节项S、节日活动及电商平台促销事件项H和随机波动项I,通过对数变换对数据进行尺度压缩。4. The prophet-random forest-based e-commerce event-driven demand forecasting method as claimed in claim 1, wherein in step 3, the time series data is the extracted timestamp and order demand, and the timestamp data The column contains dates or specific time points, and the order demand data column is a numerical variable. The multiplication model y(t)=T*S*H*I based on time series decomposition decomposes the time series into trend items T, seasonal items S, festivals Activity and e-commerce platform promotion event item H and random fluctuation item I, the data is scaled compressed by logarithmic transformation. 5.如权利要求4所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤4中,事件建模为不同节日活动及电商平台促销事件Hi设置前后时间窗口值,并假设节日活动及电商平台促销事件Hi对前后窗口值中的订单需求量的影响呈高斯分布,即k~N(0,γ2),其中γ表示影响程度;对于具体的事件中的窗口日t,其事件效应E(t)用虚拟变量表示:5. The prophet-random forest-based e-commerce event-driven demand forecasting method as claimed in claim 4, wherein in step 4, the event modeling is set as different festival activities and e-commerce platform promotional events H i Before and after the time window value, and it is assumed that the influence of festival activities and e-commerce platform promotion events Hi on the order demand in the before and after window values is Gaussian distribution, that is, k ~N(0, γ 2 ), where γ represents the degree of influence; For the window day t in a specific event, its event effect E(t) is represented by a dummy variable:
Figure FDA0002528569810000011
Figure FDA0002528569810000011
6.如权利要求5所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤5中,为Prophet模型附加事件回归量,并进行Prophet预测时,包括步骤:6. The event-driven demand forecasting method for e-commerce based on Prophet-random forest as claimed in claim 5, characterized in that: in step 5, adding an event regressor to the Prophet model, and performing the Prophet prediction, comprising the steps of: : 1)创建事件附加回归量,将步骤4中正向影响订单需求量的事件再次进行相同的虚拟变量赋值,但对不同的事件赋予不同以γ进一步区分不同事件影响程度;1) Create an event additional regressor, assign the same dummy variable assignment to the event that positively affects the order demand in step 4, but assign different events to γ to further distinguish the degree of influence of different events; 2)通过创建事件列表,导入事件附加回归量与步骤3中的事件效应;2) By creating an event list, import the event additional regressor and the event effect in step 3; 3)切割训练集进行模型训练,利用测试集初步评估模型得分,并基于模型训练与测试结果调整参数,确定最终趋势项模型、突变点的位置、个数与增长率,季节性的拟合程度与相关季节因素;3) Cut the training set for model training, use the test set to preliminarily evaluate the model score, and adjust the parameters based on the model training and test results to determine the final trend item model, the location, number and growth rate of mutation points, and the degree of seasonality fitting and related seasonal factors; 4)使用评估指标平均绝对误差、均方根误差、平均绝对百分比误差进行模型预测效果评价。4) Use the evaluation indicators mean absolute error, root mean square error, and mean absolute percentage error to evaluate the model prediction effect. 7.如权利要求5所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤5中,剔除并插值处理属于事件效应范围内的离群点时,离群点处理是利用Box Plot图与建模事件筛选出事件效应范围内的时序数据中的离群点,在进行随机森林预测前将其剔除并进行插值处理,以弥补剔除离群点后的缺失项。7. The event-driven demand forecasting method for e-commerce based on Prophet-random forest as claimed in claim 5, characterized in that: in step 5, when eliminating and interpolating outliers belonging to the event effect range, the Cluster processing is to use Box Plot and modeling events to screen out outliers in the time series data within the scope of the event effect, remove them and perform interpolation before random forest prediction, to make up for the missing after removing outliers. item. 8.如权利要求7所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤5中,进行随机森林预测时,包括步骤:8. The prophet-random forest-based e-commerce event-driven demand forecasting method as claimed in claim 7, wherein in step 5, when performing random forest forecasting, the method comprises the steps: 1)结合时序数据、事件建模结果与原始数据中的其他信息进行时序特征、产品及用户特征构造;1) Combine time series data, event modeling results and other information in the original data to construct time series features, product and user features; 时序特征包括历史特征、平移特征与滑动窗口;时序历史特征包括相关的星期、月份、年份,星期特征值包括{0,1,2,3,4,5,6},依次对应当前订单日期是星期一至星期日,月份特征值包括{1,2,3,4……12},依次对应订单日期月份一月至十二月,年份特征值依次对应订单日期中的相应年份;平移特征指时间序列向前平移后相应的订单需求量与原始数据的相关性特征;滑动窗口特征是对同一特征在不同时间维度下的体现,由对固定间隔的时间段中对应的订单需求量的统计分析得出;Time series features include historical features, translation features, and sliding windows; time series historical features include related weeks, months, and years, and week feature values include {0, 1, 2, 3, 4, 5, 6}, which correspond to the current order date. From Monday to Sunday, the month feature value includes {1, 2, 3, 4...12}, which corresponds to the order date month from January to December, and the year feature value corresponds to the corresponding year in the order date in turn; the translation feature refers to the time The correlation characteristics between the corresponding order demand and the original data after the sequence is shifted forward; the sliding window feature is the embodiment of the same feature in different time dimensions, which is obtained from the statistical analysis of the corresponding order demand in the fixed interval time period. out; 产品及用户特征以原始数据中的产品类别、用户信息、商品属性信息为基础进行特征构造;The product and user features are constructed based on the product category, user information, and commodity attribute information in the original data; 2)基于基尼指数来评估各特征重要性进行特征排序及筛选;2) Based on the Gini index to evaluate the importance of each feature for feature ranking and screening; 特征筛选是剔除重要性小于一定程度的特征,以避免模型过拟合;Feature screening is to remove features whose importance is less than a certain level to avoid model overfitting; 3)进行模型训练与参数调整;3) Carry out model training and parameter adjustment; 模型训练前提是切割训练集,切割比例与Prophet算法切割比例相同;The premise of model training is to cut the training set, and the cutting ratio is the same as that of the Prophet algorithm; 参数调整是利用测试集初步评估随机森林模型得分,并基于模型得分进行参数调整,调整弱学习器的最大迭代次数与决策树深度等模型拟合参数;Parameter adjustment is to use the test set to initially evaluate the random forest model score, and adjust the parameters based on the model score, and adjust the model fitting parameters such as the maximum number of iterations of the weak learner and the depth of the decision tree; 4)使用评估指标平均绝对误差、均方根误差、平均绝对百分比误差进行模型预测效果评价。4) Use the evaluation indicators mean absolute error, root mean square error, and mean absolute percentage error to evaluate the model prediction effect. 9.如权利要求1所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤5中,在步骤6中,组合结果包括Prophet预测结果中的事件效应预测值与随机森林预测结果中的非事件效应预测值。9. The event-driven demand forecast method for e-commerce based on Prophet-random forest as claimed in claim 1, characterized in that: in step 5, in step 6, the combined result includes the event effect prediction in the Prophet prediction result value and the non-event effect predicted value in the random forest prediction results. 10.如权利要求1所述的基于Prophet-随机森林的电商事件驱动型需求量预测方法,其特征在于:在步骤7中,计算Prophet-随机森林组合模型的平均绝对误差、均方根误差、平均绝对百分比误差的评估指标,并分别与Prophet、随机森林模型的预测精度进行比较,评估组合模型的预测效果。10. The prophet-random forest-based e-commerce event-driven demand forecasting method as claimed in claim 1, wherein in step 7, the mean absolute error and the root mean square error of the Prophet-random forest combined model are calculated , the evaluation index of the mean absolute percentage error, and compared with the prediction accuracy of the Prophet and random forest models respectively to evaluate the prediction effect of the combined model.
CN202010511718.2A 2020-06-08 2020-06-08 Electronic commerce event driving type demand forecasting method based on Prophet-random forest Active CN111639978B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010511718.2A CN111639978B (en) 2020-06-08 2020-06-08 Electronic commerce event driving type demand forecasting method based on Prophet-random forest

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010511718.2A CN111639978B (en) 2020-06-08 2020-06-08 Electronic commerce event driving type demand forecasting method based on Prophet-random forest

Publications (2)

Publication Number Publication Date
CN111639978A true CN111639978A (en) 2020-09-08
CN111639978B CN111639978B (en) 2022-07-08

Family

ID=72330383

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010511718.2A Active CN111639978B (en) 2020-06-08 2020-06-08 Electronic commerce event driving type demand forecasting method based on Prophet-random forest

Country Status (1)

Country Link
CN (1) CN111639978B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112396466A (en) * 2020-11-30 2021-02-23 上海明略人工智能(集团)有限公司 E-commerce platform flow prediction method, system, storage medium and electronic equipment
CN112884524A (en) * 2021-03-12 2021-06-01 杉数科技(北京)有限公司 Product demand optimization method and device
CN113313529A (en) * 2021-06-15 2021-08-27 大唐软控(青岛)科技有限公司 Finished oil sales amount prediction method based on time regression sequence
CN114756604A (en) * 2022-06-13 2022-07-15 西南交通大学 A prediction method of monitoring time series data based on Prophet combination model
CN114782065A (en) * 2022-01-21 2022-07-22 北京数势云创科技有限公司 A product sales forecast method, device and storage medium based on model combination
CN116402321A (en) * 2023-06-08 2023-07-07 北京京东乾石科技有限公司 Method and device for determining demand quantity of goods, electronic equipment and storage medium
CN117370744A (en) * 2023-12-07 2024-01-09 国网天津市电力公司营销服务中心 A dynamic cleaning method and system for abnormal power consumption data of electric power users
CN117743814A (en) * 2023-12-20 2024-03-22 昆明理工大学 Ionized layer TEC forecasting method and system and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130173482A1 (en) * 2002-04-02 2013-07-04 Collaborative Agreements, LLC System and Method for Facilitating Transactions Between Two or More Parties
US20180172707A1 (en) * 2015-06-03 2018-06-21 The University Of Lincoln Diagnostic target
CN109040257A (en) * 2018-08-09 2018-12-18 湖南江河机电自动化设备股份有限公司 A kind of power station intelligent alarm system based on big data platform

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130173482A1 (en) * 2002-04-02 2013-07-04 Collaborative Agreements, LLC System and Method for Facilitating Transactions Between Two or More Parties
US20180172707A1 (en) * 2015-06-03 2018-06-21 The University Of Lincoln Diagnostic target
CN109040257A (en) * 2018-08-09 2018-12-18 湖南江河机电自动化设备股份有限公司 A kind of power station intelligent alarm system based on big data platform

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
史金成等: "数据流挖掘研究", 《计算机技术与发展》 *
孙延华等: "基于GBRT树模型分位数回归预测的CPFR补货方法", 《软件导刊》 *
石婧文等: "电商集群的流量预测与不确定性区间估计", 《集成技术》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112396466A (en) * 2020-11-30 2021-02-23 上海明略人工智能(集团)有限公司 E-commerce platform flow prediction method, system, storage medium and electronic equipment
CN112884524A (en) * 2021-03-12 2021-06-01 杉数科技(北京)有限公司 Product demand optimization method and device
CN112884524B (en) * 2021-03-12 2024-02-27 杉数科技(北京)有限公司 Product demand optimization method and device
CN113313529A (en) * 2021-06-15 2021-08-27 大唐软控(青岛)科技有限公司 Finished oil sales amount prediction method based on time regression sequence
CN114782065A (en) * 2022-01-21 2022-07-22 北京数势云创科技有限公司 A product sales forecast method, device and storage medium based on model combination
CN114756604A (en) * 2022-06-13 2022-07-15 西南交通大学 A prediction method of monitoring time series data based on Prophet combination model
CN116402321A (en) * 2023-06-08 2023-07-07 北京京东乾石科技有限公司 Method and device for determining demand quantity of goods, electronic equipment and storage medium
CN116402321B (en) * 2023-06-08 2023-09-22 北京京东乾石科技有限公司 Item demand determination method and device, electronic equipment and storage medium
CN117370744A (en) * 2023-12-07 2024-01-09 国网天津市电力公司营销服务中心 A dynamic cleaning method and system for abnormal power consumption data of electric power users
CN117743814A (en) * 2023-12-20 2024-03-22 昆明理工大学 Ionized layer TEC forecasting method and system and electronic equipment

Also Published As

Publication number Publication date
CN111639978B (en) 2022-07-08

Similar Documents

Publication Publication Date Title
CN111639978A (en) Event-driven demand forecasting method for e-commerce based on Prophet-random forest
CN111160992A (en) Marketing system based on user portrait system
CN107346502A (en) A kind of iteration product marketing forecast method based on big data
CN113962745A (en) Sales forecast method and system based on prophet model and big data
CN108335200A (en) A kind of credit rating method that feature based is chosen
EP2355035A1 (en) Method for automatic development of an art index
Kukhotska Wheat market price dynamics in Ukraine: quantitative exploration and forecasting
CN113724008A (en) Data analysis method for new marketing big data
CN116596580A (en) A Short-term Sales Prediction Method of Seasonal Clothing Based on Weather Forecast Information
Bhambri A CAD system for software effort estimation
CN107292713A (en) A kind of rule-based individual character merged with level recommends method
CN113869950A (en) Ranking method of restaurants on takeaway platform based on sales volume, user preferences and preferential discounts
CN114037137A (en) Object prediction method, system and medium
Wee Mah et al. A comparative study between univariate and bivariate time series models for crude palm oil industry in peninsular Malaysia
Khumaidi et al. Forecasting of Sales Based on Long Short Term Memory Algorithm with Hyperparameter
Aras et al. Forecasting Hotel Room Sales within Online Travel Agencies by Combining Multiple Feature Sets.
Hasan et al. Investigation of influential factors towards predicting birth rate in Bangladesh
CN113377846A (en) Takeaway platform restaurant sequencing method based on sales volume and user preference
CN114780599A (en) Comprehensive analysis system based on wheat quality ratio test data
Akhmad et al. Deep learning for predicting food commodity prices in traditional markets
Naghashi et al. Univariate and multivariate time-series methods to forecast dairy income
Lin et al. A Practical Framework for Forecasting Stock Keeping Unit Level Seasonal Sales
Rahmat et al. Gated Recurrent Unit in Malaria Forecasting with Meteorological and Clinical Data
CN119722129B (en) Shopping basket purchasing behavior-based category optimization method, shopping basket purchasing behavior-based category optimization system and storage medium
CN120298031A (en) A product order sales forecasting method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant