[go: up one dir, main page]

CN106952130A - Common user item based on collaborative filtering recommends method - Google Patents

Common user item based on collaborative filtering recommends method Download PDF

Info

Publication number
CN106952130A
CN106952130A CN201710110168.1A CN201710110168A CN106952130A CN 106952130 A CN106952130 A CN 106952130A CN 201710110168 A CN201710110168 A CN 201710110168A CN 106952130 A CN106952130 A CN 106952130A
Authority
CN
China
Prior art keywords
user
recommendation
attribute
item
collaborative filtering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710110168.1A
Other languages
Chinese (zh)
Other versions
CN106952130B (en
Inventor
周智恒
劳志辉
俞政
黄俊楚
代雨琨
李立军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT filed Critical South China University of Technology SCUT
Priority to CN201710110168.1A priority Critical patent/CN106952130B/en
Publication of CN106952130A publication Critical patent/CN106952130A/en
Application granted granted Critical
Publication of CN106952130B publication Critical patent/CN106952130B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Recommending goods or services
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0623Electronic shopping [e-shopping] by investigating goods or services
    • G06Q30/0625Electronic shopping [e-shopping] by investigating goods or services by formulating product or service queries, e.g. using keywords or predefined options

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Marketing (AREA)
  • Development Economics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明公开了一种基于协同过滤的通用物品推荐方法,该方法利用基于用户的协同过滤方法,通过欧几里得距离计算用户之间的相似度,得到相似用户集并进一步得出不同用户的候选推荐集;之后再通过对用户初始感兴趣物品属性的分类和特征赋值,计算出候选推荐集的推荐分数,从而得到一种适用性和鲁棒性较强的推荐结果,帮助用户更方便的获取感兴趣的内容。相比于传统的基于人口统计学和基于内容的推荐方法相比,本发明更为注重了用户之间的个体差异性,通过分析用户的历史行为数据,挖掘用户的兴趣进行推荐;因此,推荐结果因人而异,更多的考虑了用户的个人喜好,也可以针对不同的推荐场景定制不同的方法参数。

The invention discloses a method for recommending general items based on collaborative filtering. The method utilizes a user-based collaborative filtering method to calculate the similarity between users through Euclidean distance, obtain similar user sets, and further obtain different user profiles. Candidate recommendation set; and then calculate the recommendation score of the candidate recommendation set through the classification and feature assignment of the user's initial interest item attributes, so as to obtain a recommendation result with strong applicability and robustness, and help users more conveniently Get interesting content. Compared with the traditional recommendation methods based on demographics and content, the present invention pays more attention to the individual differences between users, and recommends by analyzing the user's historical behavior data and tapping the user's interest; therefore, recommending The results vary from person to person, more consideration is given to the user's personal preferences, and different method parameters can be customized for different recommendation scenarios.

Description

基于协同过滤的通用物品推荐方法A general item recommendation method based on collaborative filtering

技术领域technical field

本发明涉及信息推荐技术领域,具体涉及一种基于协同过滤的通用物品推荐方法。The invention relates to the technical field of information recommendation, in particular to a method for recommending general items based on collaborative filtering.

背景技术Background technique

如今已经进入了一个数据爆炸的时代,随着Web 2.0的发展,Web已经变成数据分享的平台,那么,如何让人们在海量的数据中想要找到他们需要的信息将变得越来越难。Now that we have entered an era of data explosion, with the development of Web 2.0, the Web has become a platform for data sharing, so it will become more and more difficult for people to find the information they need in the massive data .

在这样的情形下,搜索引擎(Google,Bing,百度等等)成为大家快速找到目标信息的最好途径。在用户对自己需求相对明确的时候,用搜索引擎很方便的通过关键字搜索很快的找到自己需要的信息。但搜索引擎并不能完全满足用户对信息发现的需求,那是因为在很多情况下,用户其实并不明确自己的需要,或者他们的需求很难用简单的关键字来表述。又或者他们需要更加符合他们个人口味和喜好的结果,因此出现了推荐系统,与搜索引擎对应,大家也习惯称它为推荐引擎。Under such circumstances, search engines (Google, Bing, Baidu, etc.) become the best way for everyone to quickly find target information. When users are relatively clear about their needs, it is very convenient to use search engines to quickly find the information they need through keyword searches. However, search engines cannot fully meet users' needs for information discovery, because in many cases, users do not know their needs clearly, or their needs are difficult to express with simple keywords. Or they need results that are more in line with their personal tastes and preferences, so there is a recommendation system that corresponds to a search engine, and everyone is used to calling it a recommendation engine.

随着推荐引擎的出现,用户获取信息的方式从简单的目标明确的数据的搜索转换到更高级更符合人们使用习惯的信息发现。With the emergence of recommendation engines, the way for users to obtain information has changed from simple data searches with specific goals to more advanced information discovery that is more in line with people's usage habits.

如今,随着推荐技术的不断发展,推荐引擎已经在电子商务(E-commerce,例如Amazon,当当网)和一些基于social的社会化站点(包括音乐,电影和图书分享,例如豆瓣,Mtime等)都取得很大的成功。这也进一步的说明了,Web2.0环境下,在面对海量的数据,用户需要这种更加智能的,更加了解他们需求,口味和喜好的信息发现机制。协同过滤推荐算法是诞生最早,并且较为著名的推荐算法。主要的功能是预测和推荐。算法通过对用户历史行为数据的挖掘发现用户的偏好,基于不同的偏好对用户进行群组划分并推荐品味相似的商品。协同过滤推荐算法分为两类,分别是基于用户的协同过滤算法(user-basedcollaboratIve filtering),和基于物品的协同过滤算法(item-based collaborativefiltering)。简单的说就是:人以类聚,物以群分。Today, with the continuous development of recommendation technology, recommendation engines have been used in e-commerce (E-commerce, such as Amazon, Dangdang) and some social-based social sites (including music, movie and book sharing, such as Douban, Mtime, etc.) All achieved great success. This further shows that in the Web2.0 environment, facing massive amounts of data, users need this kind of information discovery mechanism that is more intelligent and better understands their needs, tastes and preferences. Collaborative filtering recommendation algorithm is the earliest and well-known recommendation algorithm. The main functions are prediction and recommendation. The algorithm discovers user preferences through mining historical user behavior data, divides users into groups based on different preferences, and recommends products with similar tastes. Collaborative filtering recommendation algorithms are divided into two categories, namely, user-based collaborative filtering algorithm (user-based collaborative filtering) and item-based collaborative filtering algorithm (item-based collaborative filtering). To put it simply: people of a feather flock together and things of a feather flock together.

发明内容Contents of the invention

本发明的目的是为了解决现有技术中的上述缺陷,提供一种基于协同过滤的通用物品推荐方法。The purpose of the present invention is to provide a method for recommending general items based on collaborative filtering in order to solve the above-mentioned defects in the prior art.

本发明的目的可以通过采取如下技术方案达到:The purpose of the present invention can be achieved by taking the following technical solutions:

一种基于协同过滤的通用物品推荐方法,所述通用物品推荐方法包括下列步骤:A method for recommending general items based on collaborative filtering, the method for recommending general items comprises the following steps:

S1、业务方对推荐物品及用户的属性及特征值进行初始化定义,将用户的行为分类并设置初始搜索引擎的条件,完成初始化设定和推荐引擎APK接入后,向推荐引擎APK发起推荐请求,并将初始化的数据集发送给推荐引擎APK;S1. The business side initializes and defines the attributes and feature values of recommended items and users, classifies user behaviors and sets the conditions of the initial search engine, and initiates a recommendation request to the recommendation engine APK after completing the initialization settings and accessing the recommendation engine APK , and send the initialized data set to the recommendation engine APK;

S2、推荐引擎APK首先根据业务方的初始搜索引擎,将满足搜索条件的数据,汇总形成可推荐候选集A,对可推荐候选集A通过相似度判定得到相似用户集,并根据基于用户的协同过滤思想,进行数据筛选得到可推荐结果集B;S2. The recommendation engine APK first summarizes the data that meets the search conditions according to the initial search engine of the business party to form a recommendable candidate set A, and obtains a similar user set through the similarity judgment for the recommendable candidate set A, and based on user-based collaboration Filter ideas and filter data to get recommendable result set B;

S3、将可推荐结果集B根据物品的属性,对属性的特征值进行划分,根据用户的感兴趣集中各个特征值所占的比例,得到可推荐结果集B的用户中每个属性特征值所占的权重,并根据各个最显著的特征集去得到用户感知最敏感的属性,根据感知最敏感的属性的不同参考权重,将用户的感兴趣集进行排序,得到初步推荐列表C;S3. Divide the recommendable result set B according to the attributes of the items, and divide the feature values of the attributes, and according to the proportion of each feature value in the user's interest set, obtain the proportion of each attribute feature value in the users of the recommendable result set B According to the weight of each most significant feature set to obtain the most sensitive attributes of user perception, according to the different reference weights of the most sensitive attributes, the user's interest set is sorted, and the preliminary recommendation list C is obtained;

S4、将初步推荐结果列表C发送给业务方,业务方根据需求进行重排,最后得到推荐结果列表D。S4. Send the preliminary recommendation result list C to the business party, and the business party rearranges it according to requirements, and finally obtains the recommendation result list D.

进一步地,所述步骤S2的具体过程如下:Further, the specific process of the step S2 is as follows:

S201、将用户的行为分为T1~TK共K类,并对这K类行为分别进行权重赋值w1~wk,根据不同的用户行为区分为正面、负面以及高、中、低六个维度,赋值向量w的取值为w=【-2,-1,0,1,2,3】;S201. Divide user behaviors into K categories T 1 to T K , and assign weights w 1 to w k to these K categories of behaviors, and classify them into positive, negative, high, medium, and low according to different user behaviors. dimension, the value of the assignment vector w is w=[-2,-1,0,1,2,3];

S202、获取用户对物品的行为操作累加值得到用户对物品的喜好度H=∑w;当H>3时,则认为用户对该物品感兴趣;S202. Obtain the accumulated value of the user's behavior and operation on the item to obtain the user's preference for the item H=∑w; when H>3, it is considered that the user is interested in the item;

S203、通过不同用户对各个物品的喜好度H,利用欧几里得距离计算得到用户之间的相似度: S203. Using the Euclidean distance according to the preferences H of different users for each item Calculate the similarity between users:

当两个用户之间的相似度sim(x,y)>k时,其中k由业务方决定,即认为两者相似从而得到相似用户集,并根据基于用户的协同过滤思想,得到各个用户的可推荐结果集B。When the similarity sim(x,y)>k between two users, where k is determined by the business side, that is, the two are considered similar to obtain a similar user set, and according to the idea of user-based collaborative filtering, each user’s Result set B can be recommended.

进一步地,所述步骤S3的具体过程如下:Further, the specific process of step S3 is as follows:

S301、根据可推荐结果集B,对物品的属性和特征值进行划分,设物品的属性向量为:S301. According to the recommendable result set B, divide the attribute and characteristic value of the item, and set the attribute vector of the item as:

属性Si的特征值向量为:The eigenvalue vector of attribute S i is:

S302、通过各个属性的区分度向量和候选推荐集构造属性特征矩阵;S302. Construct an attribute feature matrix through the discrimination degree vector of each attribute and the candidate recommendation set;

对用户A而言,感兴趣物品集QA中某一物品属性Si的特征值vk所占的比例为则在候选推荐集QT中,设该物品属性的特征值vk所占的权重:For user A, the eigenvalue v k of an item attribute S i in the interested item set Q A is The proportion of Then in the candidate recommendation set QT , set the weight of the feature value v k of the item attribute:

另,当(i,k取任意可取值),则认为属性Sx的区分度最强;各个属性的区分度取:Also, when (i, k take arbitrary values), then it is considered that the discrimination degree of attribute S x is the strongest; the discrimination degree of each attribute is taken as:

所以,当给出QA后,可得索性区分度向量:Therefore, when Q A is given, the simple discrimination vector can be obtained:

得到QT后,可得物品的属性特征矩阵:After obtaining Q T , the attribute characteristic matrix of the item can be obtained:

S303、根据属性特征矩阵和属性区分度向量,可以得到候选推荐集QT的各物品推荐分数向量:S303. According to the attribute feature matrix and the attribute discrimination degree vector, the item recommendation score vector of the candidate recommendation set Q T can be obtained:

S304、根据得到的推荐分数向量对可推荐结果集B的排序,确定初步推荐结果列表C。S304. Determine the preliminary recommendation result list C by sorting the recommendable result set B according to the obtained recommendation score vector.

进一步地,所述步骤S2中可推荐候选集A的人数如果不满足最低推荐人数要求,则向业务方请求扩大搜索条件。Further, in the step S2, if the number of people who can recommend the candidate set A does not meet the requirement of the minimum recommended number of people, request the business party to expand the search conditions.

进一步地,所述用户的属性及特征值的定义的方式为【属性-值】键值对。Further, the user's attributes and feature values are defined in the form of [attribute-value] key-value pairs.

进一步地,用户的初始属性由用户在注册时填写,推荐引擎可以根据用户填写的属性值,为初始用户进行特征划分。Furthermore, the user's initial attributes are filled in by the user during registration, and the recommendation engine can perform feature division for the initial user based on the attribute values filled in by the user.

本发明相对于现有技术具有如下的优点及效果:Compared with the prior art, the present invention has the following advantages and effects:

相比于传统的基于人口统计学和基于内容的推荐方法相比,本发明借鉴协同过滤思想和基于内容的推荐思想,更为注重了用户之间的个体差异性,通过分析用户的历史行为数据,挖掘用户的兴趣进行推荐;因此,推荐结果因人而异,更多的考虑了用户的个人喜好,也可以针对不同的推荐场景定制不同的方法参数。Compared with traditional demographic-based and content-based recommendation methods, the present invention draws on the ideas of collaborative filtering and content-based recommendation, and pays more attention to the individual differences among users. By analyzing the historical behavior data of users , mining the interests of users to make recommendations; therefore, the recommendation results vary from person to person, more consideration is given to the user's personal preferences, and different method parameters can be customized for different recommendation scenarios.

附图说明Description of drawings

图1是本发明中公开的基于协同过滤的通用物品推荐方法的流程图。Fig. 1 is a flow chart of a method for recommending general items based on collaborative filtering disclosed in the present invention.

具体实施方式detailed description

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

实施例Example

如附图1所示,本实施例公开了一种基于协调过滤的通用物品推荐方法,该通用物品推荐方法借鉴协同过滤思想和基于内容的推荐思想,设计一套通用物品的推荐引擎。其中,通用物品包括但不限于:书籍、音乐、电影,不同的通用物品的区别在于,喜好度的衡量标准以及物品的属性特征差异可根据不同的推荐场景由业务方调整参数。As shown in FIG. 1 , this embodiment discloses a method for recommending general items based on coordinated filtering. The method for recommending general items draws on ideas of collaborative filtering and content-based recommendation to design a recommendation engine for general items. Among them, general-purpose items include but are not limited to: books, music, and movies. The difference between different general-purpose items is that the measurement standard of preference and the difference in attribute characteristics of items can be adjusted by the business side according to different recommendation scenarios.

该通过物品推荐方法具体包括下列步骤:The item recommendation method specifically includes the following steps:

S1、业务方对推荐物品及用户的属性及特征值进行初始化定义,定义的方式为【属性-值】键值对,例如【身高-170CM】;将用户的行为分类并设置初始搜索引擎的条件,完成初始化设定和推荐引擎APK接入后,向推荐引擎APK发起推荐请求,并将初始化的数据集发送给推荐引擎APK。S1. The business side initializes and defines the attributes and feature values of recommended items and users. The definition method is [attribute-value] key-value pairs, such as [height-170CM]; classify user behavior and set initial search engine conditions , after completing the initialization settings and accessing the recommendation engine APK, initiate a recommendation request to the recommendation engine APK, and send the initialized data set to the recommendation engine APK.

具体应用中,用户的初始属性由用户在注册时填写,推荐引擎可以根据用户填写的属性值,为初始用户进行特征划分。In a specific application, the user's initial attributes are filled in by the user during registration, and the recommendation engine can classify the characteristics of the initial user according to the attribute values filled in by the user.

S2、推荐引擎APK首先根据业务方的初始搜索引擎,将满足搜索条件的数据,汇总形成可推荐候选集A,对可推荐候选集A通过相似度判定得到相似用户集,并根据基于用户的协同过滤思想,进行数据筛选得到可推荐结果集B;如果可推荐候选集A的人数不满足最低推荐人数要求,则向业务方请求扩大搜索条件。S2. The recommendation engine APK first summarizes the data that meets the search conditions according to the initial search engine of the business party to form a recommendable candidate set A, and obtains a similar user set through the similarity judgment for the recommendable candidate set A, and based on user-based collaboration Filter ideas and perform data screening to obtain recommendable result set B; if the number of recommendable candidate set A does not meet the minimum number of recommenders, request the business side to expand the search conditions.

其中,所述进行数据筛选得到可推荐结果集B的具体过程如下:Wherein, the specific process of performing data screening to obtain the recommendable result set B is as follows:

S201、将用户的行为分为T1~TK共K类,并对这K类行为分别进行权重赋值w1~wk,根据不同的用户行为区分为正面、负面以及高、中、低六个维度,赋值向量w的取值为w=【-2,-1,0,1,2,3】;S201. Divide user behaviors into K categories T 1 to T K , and assign weights w 1 to w k to these K categories of behaviors, and classify them into positive, negative, high, medium, and low according to different user behaviors. dimension, the value of the assignment vector w is w=[-2,-1,0,1,2,3];

具体应用中,根据用户在线上的行为数据,定义不同行为的特征。In a specific application, the characteristics of different behaviors are defined according to the user's online behavior data.

S202、获取用户对物品的行为操作累加值得到用户对物品的喜好度H=∑w,当H>3时,则认为用户对该物品感兴趣;S202. Obtain the accumulated value of the user's behavior and operation on the item to obtain the user's preference for the item H=∑w. When H>3, it is considered that the user is interested in the item;

S203、通过不同用户对各个物品的喜好度H,利用欧几里得距离 S203. Using the Euclidean distance according to the preferences H of different users for each item

计算得到用户之间的相似度: Calculate the similarity between users:

当两个用户之间的相似度sim(x,y)>k时,其中k由业务方决定,即认为两者相似从而得到相似用户集,并根据基于用户的协同过滤思想,得到各个用户的可推荐结果集B。When the similarity sim(x,y)>k between two users, where k is determined by the business side, that is, the two are considered similar to obtain a similar user set, and according to the idea of user-based collaborative filtering, each user’s Result set B can be recommended.

S3、将可推荐结果集B根据物品的属性,对属性的特征值进行划分,根据用户的感兴趣集中各个特征值所占的比例,得到可推荐结果集B的用户中每个属性特征值所占的权重,并根据各个最显著的特征集去得到用户感知最敏感的属性,根据感知最敏感的属性的不同参考权重,将用户的感兴趣集进行排序,得到初步推荐列表C。S3. Divide the recommendable result set B according to the attributes of the items, and divide the feature values of the attributes, and according to the proportion of each feature value in the user's interest set, obtain the proportion of each attribute feature value in the users of the recommendable result set B According to the most significant feature sets to obtain the most sensitive attribute of user perception, according to the different reference weights of the most sensitive attribute, the user's interest set is sorted, and the preliminary recommendation list C is obtained.

该步骤具体过程如下:The specific process of this step is as follows:

S301、根据可推荐结果集B,对物品的属性和特征值进行划分,设物品的属性向量为:S301. According to the recommendable result set B, divide the attribute and characteristic value of the item, and set the attribute vector of the item as:

属性Si的特征值向量为:The eigenvalue vector of attribute S i is:

S302、通过各个属性的区分度向量和候选推荐集构造属性特征矩阵;S302. Construct an attribute feature matrix through the discrimination degree vector of each attribute and the candidate recommendation set;

对用户A而言,候选推荐集QT中某一物品属性Si的特征值vk所占的比例为则在候选推荐集QT中,设该物品属性的特征值vk所占的权重:For user A, the feature value v k of an item attribute S i in the candidate recommendation set Q T is The proportion of Then in the candidate recommendation set QT , set the weight of the feature value v k of the item attribute:

另,当(i,k取任意可取值)Also, when (i, k take any possible value)

则认为属性Sx的区分度最强;各个属性的区分度取:Then it is considered that the distinguishing degree of attribute S x is the strongest; the distinguishing degree of each attribute is taken as:

所以,当给出QA后,可得索性区分度向量:Therefore, when Q A is given, the simple discrimination vector can be obtained:

得到QT后,可得物品的属性特征矩阵:After obtaining Q T , the attribute characteristic matrix of the item can be obtained:

S303、根据属性特征矩阵和属性区分度向量,可以得到候选推荐集QT的各物品推荐分数向量:S303. According to the attribute feature matrix and the attribute discrimination degree vector, the item recommendation score vector of the candidate recommendation set Q T can be obtained:

S304、根据得到的推荐分数向量对可推荐结果集B的排序,确定初步推荐结果列表C。S304. Determine the preliminary recommendation result list C by sorting the recommendable result set B according to the obtained recommendation score vector.

S4、将初步推荐结果列表C发送给业务方,业务方根据需求进行适当重排,最后得到推荐结果列表D,即为最终结果。S4. The preliminary recommendation result list C is sent to the business party, and the business party performs appropriate rearrangement according to requirements, and finally obtains the recommendation result list D, which is the final result.

综上所述,本发明借鉴协同过滤思想和基于内容的推荐思想,注重用户之间的个体差异性,通过分析用户的历史行为数据,挖掘用户的兴趣进行推荐;因此,推荐结果因人而异,更多的考虑了用户的个人喜好,也可以针对不同的推荐场景定制不同的方法参数。In summary, the present invention draws on the ideas of collaborative filtering and content-based recommendation, pays attention to the individual differences among users, analyzes the user's historical behavior data, and taps the user's interests to make recommendations; therefore, the recommendation results vary from person to person , more consideration is given to the user's personal preferences, and different method parameters can be customized for different recommendation scenarios.

上述实施例为本发明较佳的实施方式,但本发明的实施方式并不受上述实施例的限制,其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化,均应为等效的置换方式,都包含在本发明的保护范围之内。The above-mentioned embodiment is a preferred embodiment of the present invention, but the embodiment of the present invention is not limited by the above-mentioned embodiment, and any other changes, modifications, substitutions, combinations, Simplifications should be equivalent replacement methods, and all are included in the protection scope of the present invention.

Claims (8)

1.一种基于协同过滤的通用物品推荐方法,其特征在于,所述通用物品推荐方法包括下列步骤:1. A general article recommendation method based on collaborative filtering, characterized in that, the general article recommendation method comprises the following steps: S1、业务方对推荐物品及用户的属性及特征值进行初始化定义,将用户的行为分类并设置初始搜索引擎的条件,完成初始化设定和推荐引擎APK接入后,向推荐引擎APK发起推荐请求,并将初始化的数据集发送给推荐引擎APK;S1. The business side initializes and defines the attributes and feature values of recommended items and users, classifies user behaviors and sets the conditions of the initial search engine, and initiates a recommendation request to the recommendation engine APK after completing the initialization settings and accessing the recommendation engine APK , and send the initialized data set to the recommendation engine APK; S2、推荐引擎APK首先根据业务方的初始搜索引擎,将满足搜索条件的数据,汇总形成可推荐候选集A,对可推荐候选集A通过相似度判定得到相似用户集,并根据基于用户的协同过滤思想,进行数据筛选得到可推荐结果集B;S2. The recommendation engine APK first summarizes the data that meets the search conditions according to the initial search engine of the business party to form a recommendable candidate set A, and obtains a similar user set through the similarity judgment for the recommendable candidate set A, and based on user-based collaboration Filter ideas and filter data to get recommendable result set B; S3、将可推荐结果集B根据物品的属性,对属性的特征值进行划分,根据用户的感兴趣集中各个特征值所占的比例,得到可推荐结果集B的用户中每个属性特征值所占的权重,并根据各个最显著的特征集去得到用户感知最敏感的属性,根据感知最敏感的属性的不同参考权重,将用户的感兴趣集进行排序,得到初步推荐列表C;S3. Divide the recommendable result set B according to the attributes of the items, and divide the feature values of the attributes, and according to the proportion of each feature value in the user's interest set, obtain the proportion of each attribute feature value in the users of the recommendable result set B According to the weight of each most significant feature set to obtain the most sensitive attributes of user perception, according to the different reference weights of the most sensitive attributes, the user's interest set is sorted, and the preliminary recommendation list C is obtained; S4、将初步推荐结果列表C发送给业务方,业务方根据需求进行重排,最后得到推荐结果列表D。S4. Send the preliminary recommendation result list C to the business party, and the business party rearranges it according to requirements, and finally obtains the recommendation result list D. 2.根据权利要求1所述的基于协同过滤的通用物品推荐方法,其特征在于,所述步骤S2的具体过程如下:2. The general article recommendation method based on collaborative filtering according to claim 1, wherein the specific process of the step S2 is as follows: S201、将用户的行为分为T1~TK共K类,并对这K类行为分别进行权重赋值w1~wk,根据不同的用户行为区分为正面、负面以及高、中、低六个维度,赋值向量w的取值为w=【-2,-1,0,1,2,3】;S201. Divide user behaviors into K categories T 1 to T K , and assign weights w 1 to w k to these K categories of behaviors, and classify them into positive, negative, high, medium, and low according to different user behaviors. dimension, the value of the assignment vector w is w=[-2,-1,0,1,2,3]; S202、获取用户对物品的行为操作累加值得到用户对物品的喜好度H=∑w;S202. Obtain the accumulated value of the user's behavior and operation on the item to obtain the user's preference for the item H=∑w; S203、通过不同用户对各个物品的喜好度H,利用欧几里得距离计算得到用户之间的相似度: S203. Using the Euclidean distance according to the preferences H of different users for each item Calculate the similarity between users: 当两个用户之间的相似度sim(x,y)>k时,其中k由业务方决定,即认为两者相似从而得到相似用户集,并根据基于用户的协同过滤思想,得到各个用户的可推荐结果集B。When the similarity sim(x,y)>k between two users, where k is determined by the business side, that is, the two are considered similar to obtain a similar user set, and according to the idea of user-based collaborative filtering, each user’s Result set B can be recommended. 3.根据权利要求1所述的基于协同过滤的通用物品推荐方法,其特征在于,所述步骤S3的具体过程如下:3. The general article recommendation method based on collaborative filtering according to claim 1, wherein the specific process of the step S3 is as follows: S301、根据可推荐结果集B,对物品的属性和特征值进行划分,设物品的属性向量为:S301. According to the recommendable result set B, divide the attribute and characteristic value of the item, and set the attribute vector of the item as: 属性Si的特征值向量为:The eigenvalue vector of attribute S i is: S302、通过各个属性的区分度向量和候选推荐集构造属性特征矩阵;S302. Construct an attribute feature matrix through the discrimination degree vector of each attribute and the candidate recommendation set; 对用户A而言,感兴趣物品集QA中某一物品属性Si的特征值vk所占的比例为则在候选推荐集QT中,设该物品属性的特征值vk所占的权重:For user A, the eigenvalue v k of an item attribute S i in the interested item set Q A is The proportion of Then in the candidate recommendation set QT , set the weight of the feature value v k of the item attribute: qq == qq vv kk ii 另,当(i,k取任意可取值),则认为属性Sx的区分度最强;各个属性的区分度取:Also, when (i, k take arbitrary values), then it is considered that the discrimination degree of attribute S x is the strongest; the discrimination degree of each attribute is taken as: PP ww == maxmax qq vv ww 所以,当给出QA后,可得索性区分度向量:Therefore, when Q A is given, the simple discrimination vector can be obtained: 得到QT后,可得物品的属性特征矩阵:After obtaining Q T , the attribute characteristic matrix of the item can be obtained: AA QQ ee == xx 11 11 ,, xx 11 22 ..................... xx 11 TT xx 22 11 ,, xx 22 22 ..................... xx 22 TT ........................................................... ........................................................... xx ww 11 ,, xx ww 22 ..................... xx ww TT ;; S303、根据属性特征矩阵和属性区分度向量,可以得到候选推荐集QT的各物品推荐分数向量:S303. According to the attribute feature matrix and the attribute discrimination degree vector, the item recommendation score vector of the candidate recommendation set Q T can be obtained: S304、根据得到的推荐分数向量对可推荐结果集B的排序,确定初步推荐结果列表C。S304. Determine the preliminary recommendation result list C by sorting the recommendable result set B according to the obtained recommendation score vector. 4.根据权利要求1所述的基于协同过滤的通用物品推荐方法,其特征在于,4. the general item recommendation method based on collaborative filtering according to claim 1, characterized in that, 所述步骤S2中可推荐候选集A的人数如果不满足最低推荐人数要求,则向业务方请求扩大搜索条件。If the number of people who can recommend the candidate set A in the step S2 does not meet the requirement of the minimum number of recommended people, request the business party to expand the search conditions. 5.根据权利要求1所述的基于协同过滤的通用物品推荐方法,其特征在于,所述用户的属性及特征值的定义的方式为【属性-值】键值对。5. The method for recommending general items based on collaborative filtering according to claim 1, wherein the user's attributes and feature values are defined in the form of [attribute-value] key-value pairs. 6.根据权利要求1所述的基于协同过滤的通用物品推荐方法,其特征在于,用户的初始属性由用户在注册时填写,推荐引擎可以根据用户填写的属性值,为初始用户进行特征划分。6. The general item recommendation method based on collaborative filtering according to claim 1, wherein the user's initial attributes are filled in by the user when registering, and the recommendation engine can perform feature division for the initial user according to the attribute values filled in by the user. 7.根据权利要求2所述的基于协同过滤的通用物品推荐方法,其特征在于,7. the method for recommending general items based on collaborative filtering according to claim 2, characterized in that, 当用户对物品的喜好度H>3时,则认为用户对该物品感兴趣。When the user's preference for an item is H>3, it is considered that the user is interested in the item. 8.根据权利要求1至7任一所述的基于协同过滤的通用物品推荐方法,其特征在于,8. The general item recommendation method based on collaborative filtering according to any one of claims 1 to 7, characterized in that, 所述通用物品包括:书籍、音乐、电影,不同的通用物品的区别在于,喜好度的衡量标准以及物品的属性特征差异可根据不同的推荐场景由业务方调整参数。The general-purpose items include: books, music, and movies. The difference between different general-purpose items is that the measurement standard of preference and the difference in attribute characteristics of the items can be adjusted by the business side according to different recommendation scenarios.
CN201710110168.1A 2017-02-27 2017-02-27 Generic item recommendation method based on collaborative filtering Active CN106952130B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710110168.1A CN106952130B (en) 2017-02-27 2017-02-27 Generic item recommendation method based on collaborative filtering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710110168.1A CN106952130B (en) 2017-02-27 2017-02-27 Generic item recommendation method based on collaborative filtering

Publications (2)

Publication Number Publication Date
CN106952130A true CN106952130A (en) 2017-07-14
CN106952130B CN106952130B (en) 2020-10-27

Family

ID=59468020

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710110168.1A Active CN106952130B (en) 2017-02-27 2017-02-27 Generic item recommendation method based on collaborative filtering

Country Status (1)

Country Link
CN (1) CN106952130B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108038730A (en) * 2017-12-22 2018-05-15 联想(北京)有限公司 Product similarity determination methods, device and server cluster
CN109214893A (en) * 2018-08-31 2019-01-15 深圳春沐源控股有限公司 Method of Commodity Recommendation, recommender system and computer installation
CN109345175A (en) * 2018-10-10 2019-02-15 江苏满运软件科技有限公司 Source of goods method for pushing, system, equipment and storage medium based on driver's matching degree
CN110399185A (en) * 2018-04-24 2019-11-01 华为技术有限公司 A kind of method, terminal and server adjusting intelligent recommendation
CN111400532A (en) * 2019-01-03 2020-07-10 福建天泉教育科技有限公司 Photo recommendation method and terminal
CN111708945A (en) * 2020-06-18 2020-09-25 腾讯科技(深圳)有限公司 Recommended methods and devices for products, electronic equipment, and computer storage media
CN112632405A (en) * 2020-12-31 2021-04-09 数字广东网络建设有限公司 Recommendation method, device, equipment and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6041311A (en) * 1995-06-30 2000-03-21 Microsoft Corporation Method and apparatus for item recommendation using automated collaborative filtering
CN103023977A (en) * 2012-11-19 2013-04-03 华南理工大学 Recommendation system and method based on positional information
US8738467B2 (en) * 2006-03-16 2014-05-27 Microsoft Corporation Cluster-based scalable collaborative filtering
CN104915861A (en) * 2015-06-15 2015-09-16 浙江经贸职业技术学院 An electronic commerce recommendation method for a user group model constructed based on scores and labels
CN105095267A (en) * 2014-05-09 2015-11-25 阿里巴巴集团控股有限公司 User involving project recommendation method and apparatus
CN105976229A (en) * 2016-05-13 2016-09-28 云南大学 Collaborative filtering algorithm based on user and project mixing
CN106326483A (en) * 2016-08-31 2017-01-11 华南理工大学 Collaborative recommendation method with user context information aggregation
CN106339502A (en) * 2016-09-18 2017-01-18 电子科技大学 Modeling recommendation method based on user behavior data fragmentation cluster

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6041311A (en) * 1995-06-30 2000-03-21 Microsoft Corporation Method and apparatus for item recommendation using automated collaborative filtering
US8738467B2 (en) * 2006-03-16 2014-05-27 Microsoft Corporation Cluster-based scalable collaborative filtering
CN103023977A (en) * 2012-11-19 2013-04-03 华南理工大学 Recommendation system and method based on positional information
CN105095267A (en) * 2014-05-09 2015-11-25 阿里巴巴集团控股有限公司 User involving project recommendation method and apparatus
CN104915861A (en) * 2015-06-15 2015-09-16 浙江经贸职业技术学院 An electronic commerce recommendation method for a user group model constructed based on scores and labels
CN105976229A (en) * 2016-05-13 2016-09-28 云南大学 Collaborative filtering algorithm based on user and project mixing
CN106326483A (en) * 2016-08-31 2017-01-11 华南理工大学 Collaborative recommendation method with user context information aggregation
CN106339502A (en) * 2016-09-18 2017-01-18 电子科技大学 Modeling recommendation method based on user behavior data fragmentation cluster

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
尹航: "信息推荐系统中的协同过滤技术研究", 《万方学术论文数据库》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108038730A (en) * 2017-12-22 2018-05-15 联想(北京)有限公司 Product similarity determination methods, device and server cluster
CN110399185A (en) * 2018-04-24 2019-11-01 华为技术有限公司 A kind of method, terminal and server adjusting intelligent recommendation
US11442751B2 (en) 2018-04-24 2022-09-13 Huawei Technologies Co., Ltd. Method for adjusting intelligent recommendation, terminal, and server
CN109214893A (en) * 2018-08-31 2019-01-15 深圳春沐源控股有限公司 Method of Commodity Recommendation, recommender system and computer installation
CN109345175A (en) * 2018-10-10 2019-02-15 江苏满运软件科技有限公司 Source of goods method for pushing, system, equipment and storage medium based on driver's matching degree
CN109345175B (en) * 2018-10-10 2021-08-03 江苏满运软件科技有限公司 Goods source pushing method, system, equipment and storage medium based on driver matching degree
CN111400532A (en) * 2019-01-03 2020-07-10 福建天泉教育科技有限公司 Photo recommendation method and terminal
CN111708945A (en) * 2020-06-18 2020-09-25 腾讯科技(深圳)有限公司 Recommended methods and devices for products, electronic equipment, and computer storage media
CN111708945B (en) * 2020-06-18 2025-05-06 腾讯科技(深圳)有限公司 Product recommendation method and device, electronic device, computer storage medium
CN112632405A (en) * 2020-12-31 2021-04-09 数字广东网络建设有限公司 Recommendation method, device, equipment and storage medium
CN112632405B (en) * 2020-12-31 2024-05-10 数字广东网络建设有限公司 Recommendation method, recommendation device, recommendation equipment and storage medium

Also Published As

Publication number Publication date
CN106952130B (en) 2020-10-27

Similar Documents

Publication Publication Date Title
CN106952130B (en) Generic item recommendation method based on collaborative filtering
US20230252094A1 (en) Computer-implemented system and method for updating user interest profiles
US8843470B2 (en) Meta classifier for query intent classification
US7991715B2 (en) System and method for image classification
US10402917B2 (en) Color-related social networking recommendations using affiliated colors
TWI636416B (en) Method and system for multi-phase ranking for content personalization
CN104317835B (en) The new user of video terminal recommends method
US20110320441A1 (en) Adjusting search results based on user social profiles
US20160132800A1 (en) Business Relationship Accessing
US20100153215A1 (en) Enhanced search result relevance using relationship information
CN102063433A (en) Method and device for recommending related items
CN109146626B (en) Fashionable garment matching recommendation method based on user dynamic interest analysis
CN113836393B (en) Cold start recommendation method based on preference self-adaptive meta-learning
Kanetkar et al. Web-based personalized hybrid book recommendation system
US20140288999A1 (en) Social character recognition (scr) system
CN107563867A (en) A kind of commending system cold start-up method based on multi-arm fruit machine confidence upper limit
CN106709076B (en) Social network recommendation device and method based on collaborative filtering
JP2017054214A (en) Determination device, learning device, information distribution device, determination method, and determination program
US20140351052A1 (en) Contextual Product Recommendation Engine
US20240248944A1 (en) Systems and methods for implementing session cookies for content selection
Dhawan et al. High rating recent preferences based recommendation system
Prando et al. Content-based Recommender System using Social Networks for Cold-start Users.
US20160132902A1 (en) Search and Rank Organizations
JP2017201535A (en) Determination device, learning device, determination method, and determination program
CN106952111A (en) Personalized recommendation method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant