JP2005128673A

JP2005128673A - Information recommendation method, server device, program, and recording medium

Info

Publication number: JP2005128673A
Application number: JP2003361553A
Authority: JP
Inventors: Masakazu Endo; 雅和遠藤; Akira Kitahara; 亮北原; Keiichi Hirota; 啓一廣田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2003-10-22
Filing date: 2003-10-22
Publication date: 2005-05-19

Abstract

【課題】興味が一致するユーザに関連するコンテンツを取得できるようにする。
【解決手段】コンテンツ再生情報をもとに情報探索を要求するユーザのコンテンツに関するユーザ種別を判定し、情報探索を要求するユーザのコンテンツ再生情報に含まれるコンテンツの知名度を数値化し、再生頻度に対し、情報探索要求元ユーザのユーザ種別に従って、コンテンツ情報の知名度をもとに算出される重みを付与したユーザ嗜好ベクトルを生成し、生成したユーザ嗜好ベクトルが情報探索を要求するユーザと類似する、情報探索を要求するユーザ以外のユーザを選択し、選択したユーザのコンテンツ再生情報に含まれるコンテンツの情報を取得し、取得したコンテンツの情報のうち、情報探索を要求するユーザにとって有益な情報を選択し、選択された情報を、情報探索を要求するユーザに提示する。
【選択図】図１PROBLEM TO BE SOLVED: To obtain content related to a user having a common interest.
SOLUTION: Based on content reproduction information, a user type related to content of a user who requests information search is determined, and the degree of popularity of content included in the content reproduction information of the user who requests information search is quantified, and the reproduction frequency is determined. , Generating a user preference vector to which a weight calculated based on the degree of publicity of content information is generated according to the user type of the information search requesting user, and the generated user preference vector is similar to the user requesting the information search A user other than the user who requests the search is selected, content information included in the content reproduction information of the selected user is acquired, and information useful for the user who requests the information search is selected from the acquired content information. The selected information is presented to the user who requests the information search.
[Selection] Figure 1

Description

本発明は、ネットワークを介した情報探索において、興味・関心・意図といったユーザの問題意識およびユーザにより獲得された情報を収集し、類似の問題意識を持った他ユーザに情報を提供することで情報収集活動を支援する情報レコメンドを行う技術に関する。 The present invention collects information on user problems such as interests, interests and intentions and information acquired by users in information search via a network, and provides information to other users who have similar problem awareness. The present invention relates to information recommendation technology that supports collection activities.

従来、興味・関心・意図といったユーザの問題意識およびユーザにより獲得された情報を収集し、類似の問題意識を持った他ユーザに提供することで情報収集活動を支援するための手法として、協調フィルタリングを利用した情報レコメンド方法が存在する。この協調フィルタリングは、複数の人間が存在する環境を前提とし、あるユーザに関する情報の取捨選択の基準を記述するファイルであるＵｓｅｒｐｒｏｆｉ１ｅと、ユーザが情報収集で得た情報を蓄積するファイルであるＲｅｃｏｍｍｅｎｄａｔｉｏｎと、ユーザの特徴を抽出する処理単位であるＵｓｅｒｍｏｄｅｌｅｒと、ユーザ関の類似度を計算する処理単位であるＭａｔｃｈｍａｋｅｒと、個々のユーザに応じて情報のランキングを計算する処理単位であるＳｃｏｒｅｐｒｅｄｉｃａｔｏｒとの５要素から成り立つ。ユーザがＵｓｅｒｐｒｏｆｉｌｅとＲｅｃｏｍｍｅｎｄａｔｉｏｎとを入力することにより、ＵｓｅｒｍｏａｅｌｅｒはＵｓｅｒｐｒｏｆｉｌｅを元にユーザの特徴を数値化し、Ｍａｔｃｈｍａｋｅｒはユーザ特徴が類似する他者を探索し、Ｓｃｏｒｅｐｒｅｄｉｃａｔｏｒが、探索された他者のＲｅｃｏｍｍｅｎｄａｔｉｏｎから個々のユーザに応じて評価された情報を出力することにより、情報のフィルタリングを行う。 Conventionally, collaborative filtering is a method for supporting information gathering activities by collecting user's awareness of interests, interests and intentions and information acquired by users and providing them to other users with similar problem awareness. There is an information recommendation method using. This collaborative filtering is based on an environment where there are a plurality of people, and User profile 1e is a file that describes the criteria for selecting information related to a user, and Recommendation is a file that stores information obtained by the user by collecting information. User modeler, which is a processing unit for extracting user characteristics, Match maker, which is a processing unit for calculating similarity of user relations, and Score predicator, which is a processing unit for calculating ranking of information according to individual users It consists of five elements. When the user inputs User profile and Recommendation, User moerer quantifies the user's characteristics based on the User profile, Match maker searches for others with similar user characteristics, and Score predictor The information is filtered by outputting information evaluated according to each user from the recommendation of the person.

協調フィルタリングシステムの要素の一つであるＭａｔｃｈｍａｋｅｒの処理で行われる、ユーザ間の類似度を計算する処理には様々な手法が提案されている。 Various methods have been proposed for the process of calculating the similarity between users performed in the process of Match maker which is one of the elements of the collaborative filtering system.

現在提案されている手法の中で最も一般的なアルゴリズムであるＧｒｏｕｐＬｅｎｓ（例えば、非特許文献１参照。）における、ユーザ間の類似度算出方法について以下に説明する。 A method for calculating similarity between users in GroupLens (for example, see Non-Patent Document 1), which is the most general algorithm among currently proposed methods, will be described below.

ＧｒｏｕｐＬｅｎｓはユーザの関心を、コンテンツに関するユーザ評価を得点化した数値の配列として扱い、ユーザＫの評価得点配列Ｋと、ユーザＬの評価得点配列Ｌとの類似度ｒ_KLを以下の式で定義する。 GroupLens treats the user's interest as an array of numerical values obtained by scoring the user evaluation regarding the content, and defines the similarity r _KL between the evaluation score array K of the user K and the evaluation score array L of the user L by the following expression. .

ＧｒｏｕｐＬｅｎｓのアルゴリズムを用いた協調フィルタリングにおける課題を解決するための、いくつかの研究がなされているが、その多くは、ＧｒｏｕｐＬｅｎｓのアルゴリズムを基礎としている。 Several studies have been made to solve the problem in collaborative filtering using the GroupLens algorithm, many of which are based on the GroupLens algorithm.

一方、人間社会における情報の流通過程において、その過程に関与するユーザは、その行動特性により複数のユーザ種別に分類されることが報告されている（例えば、非特許文献２参照。）。ユーザ種別の判定方法については、アンケートを行った結果をもとに判定する方法や、Ｗｅｂのアクセスログをもとに判定する方法など様々な研究が進められている。非特許文献３では、音楽や映像といった嗜好コンテンツの再生履歴の特徴をもとにユーザを分類すると、ユーザ種別ごとに、他者からの情報取得形態が異なる傾向があることが報告されている。非特許文献３によれば、コンテンツ再生の特徴と、他者からの情報取得形態の特徴との間には、以下のような関連がある。 On the other hand, in the information distribution process in human society, it has been reported that users involved in the process are classified into a plurality of user types according to their behavior characteristics (see, for example, Non-Patent Document 2). As for the determination method of the user type, various researches such as a determination method based on a result of a questionnaire and a determination method based on a Web access log are being conducted. Non-Patent Document 3 reports that when users are classified based on the characteristics of the playback history of favorite content such as music and video, the information acquisition form from others tends to differ for each user type. According to Non-Patent Document 3, there is the following relationship between the feature of content reproduction and the feature of information acquisition form from others.

（１）コンテンツに対する拘りが強く、特定のコンテンツに限って再生頻度が高いユーザ（以下、Ｍａｎｉａと呼ぶ）は、自分の好きなマイナーな趣味が一致する他者からの情報取得に対する期待が高い。 (1) Users who are strongly concerned with content and have high reproduction frequency only for specific content (hereinafter referred to as “Mania”) have high expectations for information acquisition from others with their favorite minor hobbies.

（２）いくつかのコンテンツを、ある程度の期間連続して再生するユーザ（以下、ＯｐｉｎｉｏｎＬｅａｄｅｒと呼ぶ）は、新しい情報を持つ他者からの情報取得に対する期待が高い。 (2) A user who reproduces some contents continuously for a certain period of time (hereinafter referred to as Opinion Leader) is highly expected to acquire information from others who have new information.

（３）多くのコンテンツを少数回ずつ再生するユーザ（以下、Ｍａｊｏｒｉｔｙと呼ぶ）は、世の中で採用者の多い流行の情報を持つ他者からの情報取得に対する期待が高い。
ＰａｕｌＲｅｓｎｉｃｋ、ＮｅｏｐｈｙｔｏｓＩａｃｏｖｏｕ、ＭｉｔｅｓｈＳｕｃｈａｋ、ＰｅｔｅｒＶｅｒｇｓｔｒｏｍ、ＪｏｈｎＲｉｅｄｌ、“ＧｒｏｕｐＬｅｎｓ：ＡｎＯｐｅｎＡｒｃｈｉｔｅｃｔｕｒｅｆｏｒｃｏｌｌａｂｏｒａｔｉｖｅｆｉｌｔｅｒｉｎｇｏｆｎｅｔｎｅｗｓ”、ＣｏｍｐｕｔｅｒＳｕｐｐｏｒｔｅｄＣｏｏｐｅｒａｔｉｖｅＷｏｒｋ、１９９４Ｅ．Ｍロジャース、”コミュニケーションの科学−マルチメディア社会の基礎理論”、１９９２遠藤、廣田、大竹、山田、”コンテンツ属性情報を利用したコミュニティ形成方法の評価”、情報処理学会第４７回ＧＮ研究会、２００２ (3) A user who reproduces a lot of content a small number of times (hereinafter referred to as “majority”) is highly expected to acquire information from others who have trendy information with many employers in the world.
Paul Resnick, Neophytos Iacovou, Mitsuku Schak, Peter Verstrom, John Riedl, “GroupLens: An Open Architecture for collaborative feeds”. E. M. Rogers, “Science of Communication-Basic Theory of Multimedia Society”, 1992 Endo, Iwata, Otake, Yamada, “Evaluation of Community Formation Method Using Content Attribute Information”, 47th GN Study Group of Information Processing Society, 2002

従来手法では、ユーザ間の類似度計算式は、コンテンツに関するユーザ評価値の分散が分母に含まれるため、評価の分散が小さいユーザ、すなわち、特定のコンテンツに対する拘りがなく、各コンテンツに対する評価に差が付かないユーザとの類似度が高くなる傾向がある。例えば、図１５に示した３ユーザが存在する場合、特定のコンテンツに評価が集中しているＭａｎｉａ傾向のＵｓｅｒＡと、他２ユーザとの類似度を、従来手法により算出すると、ＵｓｅｒＡが高く評価する特定のコンテンツ（ｃｏｎｔｅｎｔ３）に対する興味が一致しているＵｓｅｒＣとの類似度よりも、コンテンツに対する評価のばらつきが少ないＵｓｅｒＢとの類似度が高く算出される。そのため、ＵｓｅｒＡとＵｓｅｒＣとの興味が一致するコンテンツ（ｃｏｎｔｅｎｔ３）が、ＵｓｅｒＡが好むマイナーコンテンツであった場合、Ｍａｎｉａの情報取得形態である、マイナーな趣味が一致する他者からの情報取得が困難である。 In the conventional method, the similarity calculation formula between users includes the variance of user evaluation values related to content in the denominator. Therefore, users with a small variance of evaluation, i.e., there is no concern for specific content, and there is a difference in evaluation for each content. There is a tendency that the degree of similarity with users who do not have is increased. For example, when the three users shown in FIG. 15 exist, if the similarity between User A who has a tendency to concentrate on specific content and the other two users is calculated by the conventional method, User A highly evaluates. The degree of similarity with User B, which has a small evaluation variation with respect to the content, is calculated higher than the degree of similarity with User C that has the same interest in the specific content (content 3). Therefore, if the content (content 3) with the same interest between UserA and UserC is the minor content preferred by UserA, it is difficult to obtain information from others who have minor interests, which is the information acquisition form of Mania. is there.

また、コンテンツに対する評価にばらつきが少ないＭａｊｏｒｉｔｙの傾向のＵｓｅｒＢと、他２ユーザとの類似度について見ると、ＵｓｅｒＡとの類似度と比較して、ＵｓｅｒＣとの類似度が低く算出されている。しかし、ＵｓｅｒＣとＵｓｅｒＢとの評価がゆるやかに一致しているコンテンツ（ｃｏｎｔｅｎｔ８）がメジャーなコンテンツであり、ＵｓｅｒＡとＵｓｅｒＢとの評価がゆるやかに一致しているコンテンツ（ｃｏｔｅｎｔ７）がマイナーなコンテンツであった場合、Ｍａｊｏｒｉｔｙの情報取得形態である、流行の情報を持つ他者からの情報取得が困難である。 In addition, regarding the similarity between User B, which has a tendency of a minority in evaluation of content, and the other two users, the similarity with User C is calculated to be lower than the similarity with User A. However, the content (content 8) in which the evaluations of UserC and UserB are gently matched is the major content, and the content (content 7) in which the evaluations of UserA and UserB are gently matched is the minor content. In this case, it is difficult to acquire information from others who have information on fashion, which is the information acquisition form of the Majority.

本発明はかかる事情に鑑みなされたもので、その目的は、上記課題を解決することができる情報レコメンド方法、サーバ装置、プログラム及びそのプログラムを記録した記録媒体を提供することにある。 The present invention has been made in view of such circumstances, and an object thereof is to provide an information recommendation method, a server device, a program, and a recording medium on which the program is recorded, which can solve the above-described problems.

そこで、前記課題を解決するために、請求項１の情報レコメンド方法は、ユーザがクライアント装置により情報探索要求を行うと、クライアント装置に接続するサーバ装置が情報の探索処理を行い、クライアント装置に情報提供を行う情報レコメンド方法において、ユーザが再生したコンテンツまたはコンテンツの属性と、コンテンツの属性のばらつきと、コンテンツ再生のバースト性とから興味の集中度を求め、この集中度の値に閾値を設けてユーザの種別を決定し、このユーザの種別に応じた方法でユーザの嗜好を表現する嗜好ベクトルを生成し、情報提供を求めるユーザの嗜好ベクトルと一定値以上の類似度を有する嗜好ベクトルを持つユーザ集合を生成し、このユーザ集合に含まれるユーザに対応する情報の中から情報提供を求めるユーザに提供する情報を決定することを特徴とする。 Therefore, in order to solve the above-described problem, the information recommendation method according to claim 1 is configured such that when a user makes an information search request by a client device, a server device connected to the client device performs information search processing, and information is transmitted to the client device. In the information recommendation method to be provided, the concentration of interest is obtained from the content reproduced by the user or the attribute of the content, the variation of the attribute of the content, and the burst property of the content reproduction, and a threshold is set to the value of the concentration A user having a preference vector that determines a user type, generates a preference vector that expresses the user's preference by a method according to the type of the user, and has a similarity greater than a certain value with the user's preference vector that requests information provision Generate a set and request information from the information corresponding to the users in this set And determining the information to be provided to over The.

また、請求項２の情報レコメンド方法は、ユーザがクライアント装置により情報探索要求を行うと、クライアント装置に接続するサーバ装置が情報の探索処理を行い、クライアント装置に情報提供を行う情報レコメンド方法であって、クライアント装置にて、ユーザがコンテンツの再生を行った際に、前記クラインアント装置から、コンテンツ再生情報を受信する過程と、前記受信したコンテンツ再生情報に含まれるコンテンツＩＤに該当するコンテンツ属性情報をコンテンツ情報データベースから取得する過程と、前記コンテンツ再生情報と、前記取得したコンテンツ属性情報とを対応付けてコンテンツ再生情報データベースに記録する過程と、前記クライアント装置からユーザ情報を指定した情報探索要求を受信する過程と、情報探索要求元ユーザのユーザ種別を、前記コンテンツ再生情報データベースに記録されたコンテンツ再生情報をもとに判定する過程と、前記情報探索要求元ユーザのコンテンツ再生情報に含まれるコンテンツまたはコンテンツ属性の知名度を数値として算出する過程と、前記コンテンツ再生情報データベースに記録されたユーザのコンテンツ再生情報と前記コンテンツまたはコンテンツ属性の知名度とをもとに、前記ユーザの種別に応じてユーザの嗜好を表現するベクトルを生成する過程と、前記ベクトルを、ユーザの嗜好ベクトルとして、嗜好ベクトル情報データベースに記録する過程と、前記情報探索要求元ユーザの嗜好ベクトルと、前記嗜好ベクトル情報データベースに記録されたユーザの嗜好ベクトルとの類似度を算出する過程と、前記情報探索要求元ユーザと嗜好ベクトルの類似度が高いユーザを選択し、選択したユーザのコンテンツ再生情報に含まれるコンテンツまたはコンテンツ属性情報から、情報探索要求を送信したユーザにとって有益な情報を抽出する過程と、前記抽出された情報を、前記クライアント装置に対して、情報探索結果として送信する過程と、を有することを特徴とする。 The information recommendation method according to claim 2 is an information recommendation method in which when a user makes an information search request by a client device, a server device connected to the client device performs information search processing and provides information to the client device. In the client device, when the user reproduces the content, the process of receiving the content reproduction information from the client device, and the content attribute information corresponding to the content ID included in the received content reproduction information A content information database, a process of associating the content reproduction information with the acquired content attribute information and recording it in the content reproduction information database, and an information search request designating user information from the client device. Reception process and information search required The process of determining the user type of the original user based on the content reproduction information recorded in the content reproduction information database, and the degree of popularity of the content or content attribute included in the content reproduction information of the information search requesting user Based on the calculation process and the content reproduction information of the user recorded in the content reproduction information database and the well-known degree of the content or content attribute, a vector expressing the user preference according to the type of the user is generated. A process, a process of recording the vector as a user preference vector in a preference vector information database, and a similarity between the preference vector of the information search requesting user and the user preference vector recorded in the preference vector information database The process of calculating the degree and the information Selecting a user having a high similarity between the search requesting user and the preference vector, and extracting information useful for the user who has transmitted the information search request from the content or content attribute information included in the content reproduction information of the selected user; And transmitting the extracted information as an information search result to the client device.

また、請求項３の情報レコメンド方法は、請求項２記載の情報レコメンド方法において、前記ユーザ種別は、前記コンテンツ再生情報データベースに記録されたコンテンツ再生情報をもとに、該当ユーザが再生したコンテンツまたはそのコンテンツ属性のばらつきと、コンテンツの再生が特定の時間範囲内に集中しているか、継続的に再生されているかを数値で表したコンテンツ再生のバースト性とを反映して算出される指標値により判定されることを特徴とする。 The information recommendation method according to claim 3 is the information recommendation method according to claim 2, wherein the user type is a content reproduced by a corresponding user based on content reproduction information recorded in the content reproduction information database, or Based on the index value calculated to reflect the dispersion of the content attributes and the burstiness of content playback that expresses whether content playback is concentrated within a specific time range or is continuously played back It is determined.

また、請求項４の情報レコメンド方法は、請求項２または３記載の情報レコメンド方法において、前記嗜好ベクトルは、ユーザが過去に再生したコンテンツのコンテンツＩＤまたはコンテンツ属性をベクトルの要素とし、要素となるコンテンツＩＤまたはコンテンツ属性を持つコンテンツの再生頻度と、前記コンテンツまたはコンテンツ属性の知名度とをもとに、前記情報探索要求元ユーザのユーザ種別に応じて算出した重みを付与した値を要素の値とすることを特徴とする。 Further, the information recommendation method according to claim 4 is the information recommendation method according to claim 2 or 3, wherein the preference vector is an element having a content ID or content attribute of a content reproduced by the user in the past as a vector element. Based on the reproduction frequency of content having a content ID or content attribute and the familiarity of the content or content attribute, a value given a weight calculated according to the user type of the information search requesting user is set as an element value It is characterized by doing.

また、請求項５のサーバ装置は、ユーザがクライアント装置により情報探索要求を行うと、この要求を受け取り情報の探索処理を行い、クライアント装置に情報提供を行うサーバ装置において、ユーザが再生したコンテンツまたはコンテンツの属性と、コンテンツの属性のばらつきと、コンテンツ再生のバースト性とから興味の集中度を求め、この集中度の値に閾値を設けてユーザの種別を決定し、このユーザの種別に応じた方法でユーザの嗜好を表現する嗜好ベクトルを生成し、情報提供を求めるユーザの嗜好ベクトルと一定値以上の類似度を有する嗜好ベクトルを持つユーザ集合を生成し、このユーザ集合に含まれるユーザに対応する情報の中から情報提供を求めるユーザに提供する情報を決定することを特徴とする。 In addition, when the user makes an information search request by the client device, the server device according to claim 5 receives the request, performs information search processing, and provides information to the client device. The concentration of interest is determined from the content attributes, content attribute variations, and bursts of content playback, and a threshold is set for the concentration value to determine the user type. According to the user type Generate a preference vector that expresses the user's preference by the method, generate a user set having a preference vector that has a certain degree of similarity with the user's preference vector for which information is to be provided, and handle users included in this user set The information to be provided to the user who requests information provision is determined from the information to be provided.

また、請求項６のサーバ装置は、ユーザがクライアント装置により情報探索要求を行うと、この要求を受け取り情報の探索処理を行い、クライアント装置に情報提供を行うサーバ装置であって、クライアント装置にて、ユーザがコンテンツの再生を行った際に、前記クラインアント装置から、コンテンツ再生情報を受信する手段と、前記受信したコンテンツ再生情報に含まれるコンテンツＩＤに該当するコンテンツ属性情報をコンテンツ情報データベースから取得するコンテンツ情報取得手段と、前記コンテンツ再生情報と、前記取得したコンテンツ属性情報とを対応付けてコンテンツ再生情報データベースに記録する再生情報登録手段と、前記クライアント装置からユーザ情報を指定した情報探索要求を受信し、情報探索要求元ユーザのユーザ種別を、前記コンテンツ再生情報データベースに記録されたコンテンツ再生情報をもとに判定するユーザ種別判定手段と、前記情報探索要求元ユーザのコンテンツ再生情報に含まれるコンテンツまたはコンテンツ属性の知名度を数値として算出する知名度算出手段と、前記コンテンツ再生情報データベースに記録されたユーザのコンテンツ再生情報と前記コンテンツまたはコンテンツ属性の知名度とをもとに、前記ユーザの種別に応じてユーザの嗜好を表現するベクトルを生成し、このベクトルをユーザの嗜好ベクトルとして、嗜好ベクトル情報データベースに記録する嗜好ベクトル生成手段と、前記情報探索要求元ユーザの嗜好ベクトルと、前記嗜好ベクトル情報データベースに記録されたユーザの嗜好ベクトルとの類似度を算出する嗜好類似度算出手段と、前記情報探索要求元ユーザと嗜好ベクトルの類似度が高いユーザを選択し、選択したユーザのコンテンツ再生情報に含まれるコンテンツまたはコンテンツ属性情報から、情報探索要求を送信したユーザにとって有益な情報を抽出し、この抽出された情報を、前記クライアント装置に対して、情報探索結果として送信する探索結果抽出手段と、を有することを特徴とする。 According to a sixth aspect of the present invention, when a user makes an information search request by a client device, the server device receives the request, performs information search processing, and provides information to the client device. When the user reproduces content, means for receiving content reproduction information from the client device and content attribute information corresponding to the content ID included in the received content reproduction information are acquired from the content information database. Content information acquisition means for performing, content reproduction information, reproduction information registration means for recording the acquired content attribute information in association with each other, and recording information search request specifying user information from the client device The user of the information search requesting user The user type determination means for determining the type based on the content reproduction information recorded in the content reproduction information database, and the content or the content attribute in the content reproduction information of the information search requesting user as numerical values A vector representing the user's preference according to the type of the user, based on the degree-of-name calculation means for calculating, the content reproduction information of the user recorded in the content reproduction information database and the name of the content or the content attribute. Generating a preference vector generating means for recording this vector as a user preference vector in a preference vector information database, a preference vector of the information search requesting user, and a user preference vector recorded in the preference vector information database Calculate the similarity of A user who has selected a preference similarity calculating unit and a user having a high preference vector similarity with the information search requesting user, and has transmitted an information search request from content or content attribute information included in the content reproduction information of the selected user And a search result extraction unit that extracts information useful for the client and transmits the extracted information as an information search result to the client device.

また、請求項７のサーバ装置は、請求項６記載のサーバ装置において、前記ユーザ種別は、前記コンテンツ再生情報データベースに記録されたコンテンツ再生情報をもとに、該当ユーザが再生したコンテンツまたはそのコンテンツ属性のばらつきと、コンテンツの再生が特定の時間範囲内に集中しているか、継続的に再生されているかを数値で表したコンテンツ再生のバースト性とを反映して算出される指標値により判定されることを特徴とする。 The server device according to claim 7 is the server device according to claim 6, wherein the user type is content reproduced by the corresponding user based on content reproduction information recorded in the content reproduction information database or the content thereof Judged by an index value that is calculated by reflecting the variation in attributes and the burstiness of content playback that expresses whether content playback is concentrated within a specific time range or is continuously played back. It is characterized by that.

また、請求項８のサーバ装置は、請求項６または７記載のサーバ装置において、前記嗜好ベクトルは、ユーザが過去に再生したコンテンツのコンテンツＩＤまたはコンテンツ属性をベクトルの要素とし、要素となるコンテンツＩＤまたはコンテンツ属性を持つコンテンツの再生頻度と、前記コンテンツまたはコンテンツ属性の知名度とをもとに、前記情報探索要求元ユーザのユーザ種別に応じて算出した重みを付与した値を要素の値とすることを特徴とする。 The server device according to claim 8 is the server device according to claim 6 or 7, wherein the preference vector includes a content ID or content attribute of a content reproduced by the user in the past as a vector element, and a content ID serving as an element Alternatively, an element value may be a value assigned with a weight calculated according to a user type of the information search requesting user based on the reproduction frequency of the content having the content attribute and the degree of publicity of the content or the content attribute. It is characterized by.

また、請求項９のプログラムは、前記の請求項１〜８のいずれか１項に記載の情報レコメンド方法またはサーバ装置を、コンピュータプログラムで記載してそれを実行可能にしたことを特徴とする。 A program according to claim 9 is characterized in that the information recommendation method or server device according to any one of claims 1 to 8 is described as a computer program and can be executed.

また、請求項１０の記録媒体は、前記の請求項１〜８のいずれか１項に記載の情報レコメンド方法またはサーバ装置を、コンピュータで実行可能に記載したプログラムを記録したことを特徴とする。 A recording medium according to a tenth aspect is characterized in that the information recommendation method according to any one of the first to eighth aspects or the server device is recorded with a program that can be executed by a computer.

本発明では、ユーザのコンテンツに対する評価から、ユーザをいくつかの種別に分類し、この種別に応じた方法でユーザの嗜好ベクトルを算出しているので、この嗜好ベクトルの類似度が高いユーザに対応する情報の提供をおこなうと、情報提供を要求するユーザにとって興味が一致する有益な情報である可能性が高い。 In the present invention, the user is classified into several types based on the evaluation of the user's content, and the user's preference vector is calculated by a method according to this type. If the information to be provided is provided, there is a high possibility that it is useful information that matches the interest of the user who requests the information provision.

本発明によれば、ユーザにとっては、自分の興味ある情報の属性と、嗜好の偏りを考慮したレコメンド情報が提供されることにより、従来のレコメンド手法と比較して、より趣味嗜好に合ったコンテンツを発見、取得することが可能となる。 According to the present invention, for a user, content that suits hobbies and tastes is provided as compared with the conventional recommendation method by providing recommendation information that takes into account the attributes of the information that the user is interested in and preference bias. Can be found and acquired.

コンテンツ提供者にとっては、特に、これまでユーザの目に付きにくいマイナーなコンテンツ等を、ユーザの嗜好に合わせて効率的に配信することにより、コンテンツの販売を促進させるという効果が期待できる。 For content providers, in particular, it is possible to expect the effect of promoting the sale of content by efficiently distributing minor content, etc., which has not been easily noticed by users, according to user preferences.

以下、図面を用いて本発明の実施例を説明する。 Embodiments of the present invention will be described below with reference to the drawings.

なお、本発明の実施の形態の説明中、コンテンツとは、映像情報、音声情報、画像情報、文字情報等あらゆるコンテンツを含む概念である。 In the description of the embodiment of the present invention, content is a concept including all types of content such as video information, audio information, image information, and character information.

図１に、本発明により実現される情報レコメンドシステムのシステム構成例を図示する。情報レコメンドシステムは複数のクライアント装置１０と、サーバ装置２０とから概略構成され、クライアント装置１０とサーバ装置２０とは、インターネット、ＬＡＮ、公衆回線を介したパソコン通信網、その他任意の有線又は無線を使用したネットワークなどの通信網（図示省略）を介して接続することができる。クライアント装置１０は複数存在するが、説明の簡略化のため、以下では１台のクライアント装置１０について説明する。 FIG. 1 illustrates a system configuration example of an information recommendation system realized by the present invention. The information recommendation system is roughly composed of a plurality of client devices 10 and a server device 20. The client device 10 and the server device 20 are connected to the Internet, a LAN, a personal computer communication network via a public line, or any other wired or wireless. It is possible to connect via a communication network (not shown) such as a used network. Although there are a plurality of client devices 10, for simplicity of explanation, only one client device 10 will be described below.

(クライアント装置１０の構成)
クライアント装置１０は、図１に示すように、ユーザ情報入力手段１１と、コンテンツ再生制御手段１２と、コンテンツＩＤ取得手段１３と、再生情報通知手段１４と、情報探索要求送信手段１５と、探索結果表示手段１６とを備える。 (Configuration of client device 10)
As shown in FIG. 1, the client device 10 includes a user information input unit 11, a content reproduction control unit 12, a content ID acquisition unit 13, a reproduction information notification unit 14, an information search request transmission unit 15, and a search result. Display means 16.

ユーザ情報入力手段１１は、サーバ装置２０が提供するサービスを利用するために必要なユーザＩＤ等のユーザ情報の入力手段を与える。 The user information input unit 11 provides a user information input unit such as a user ID necessary for using the service provided by the server device 20.

コンテンツ再生制御手段１２は、クライアント装置１０のローカルディスクや、ＣＤ等の記憶媒体や、コンテンツ配信サーバ等に保存されるコンテンツファイル１７の再生、停止といった制御を行う。 The content reproduction control means 12 performs control such as reproduction and stop of the content file 17 stored in the local disk of the client device 10, a storage medium such as a CD, a content distribution server, or the like.

コンテンツＩＤ取得手段１３は、コンテンツファイル１７のヘッダ部等に埋め込まれたコンテンツの一意識別子であるコンテンツＩＤを、コンテンツファイル１７から取得する。 The content ID acquisition unit 13 acquires a content ID, which is a unique identifier of content embedded in the header portion of the content file 17, from the content file 17.

再生情報通知手段１４は、ユーザ情報入力手段１１を利用して入力されたユーザ情報と、コンテンツＩＤ取得手段１３を利用して取得したコンテンツＩＤと、コンテンツ再生時刻として現在時刻とを、コンテンツ再生情報としてサーバ装置２０に送信する。 The reproduction information notification unit 14 uses the user information input using the user information input unit 11, the content ID acquired using the content ID acquisition unit 13, the current time as the content reproduction time, and the content reproduction information. To the server device 20.

情報探索要求送信手段１５は、ユーザ情報入力手段１１を利用して入力されたユーザ情報を情報探索要求メッセージとしてサーバ装置２０に送信する。 The information search request transmitting unit 15 transmits user information input using the user information input unit 11 to the server device 20 as an information search request message.

探索結果表示手段１６は、情報探索要求送信手段１５でサーバ装置２０に送信した情報探索要求の応答として、サーバ装置２０から返信される探索結果を受信し、表示する。 The search result display means 16 receives and displays the search result returned from the server apparatus 20 as a response to the information search request transmitted to the server apparatus 20 by the information search request transmission means 15.

(サーバ装置２０の構成)
サーバ装置２０は、図１に示すように、コンテンツ情報取得手段２１と、再生情報登録手段２２と、ユーザ種別判定手段２３と、知名度算出手段２４と、嗜好ベクトル生成手段２５と、嗜好類似度算出手段２６と、探索結果抽出手段２７と、コンテンツ情報ＤＢ２８と、コンテンツ再生情報ＤＢ２９と、嗜好ベクトル情報ＤＢ２ａとを備える。 (Configuration of server device 20)
As shown in FIG. 1, the server device 20 includes a content information acquisition unit 21, a reproduction information registration unit 22, a user type determination unit 23, a name degree calculation unit 24, a preference vector generation unit 25, and a preference similarity calculation. Means 26, search result extraction means 27, content information DB 28, content reproduction information DB 29, and preference vector information DB 2a are provided.

ここで、コンテンツ情報ＤＢ２８のデータ構造の例を図２に示す。コンテンツ情報ＤＢ２８には、コンテンツＩＤ、コンテンツのタイトル、アーティスト名、ジャンル名等のコンテンツ属性情報が格納される。 An example of the data structure of the content information DB 28 is shown in FIG. The content information DB 28 stores content attribute information such as content ID, content title, artist name, and genre name.

次に、コンテンツ再生情報ＤＢ２９のデータ構造の例を図３に示す。コンテンツ再生情報ＤＢ２９には、ユーザＩＤ、コンテンツＩＤ、再生時刻、コンテンツ属性情報などが格納される。 Next, an example of the data structure of the content reproduction information DB 29 is shown in FIG. The content reproduction information DB 29 stores a user ID, content ID, reproduction time, content attribute information, and the like.

次に、嗜好ベクトル情報ＤＢ２ａのデータ構造の例を図４に示す。嗜好ベクトル情報ＤＢ２ａには、ユーザＩＤ、要素番号、要素値、要素名などが格納される。 Next, an example of the data structure of the preference vector information DB 2a is shown in FIG. The preference vector information DB 2a stores a user ID, an element number, an element value, an element name, and the like.

続いて、各構成手段について説明する。コンテンツ情報取得手段２１は、指定されたコンテンツＩＤで管理されるコンテンツのコンテンツ属性情報（コンテンツのタイトル、アーティスト名、ジャンル名等）を、コンテンツ情報ＤＢ２８より取得する。 Subsequently, each component means will be described. The content information acquisition unit 21 acquires content attribute information (content title, artist name, genre name, etc.) of the content managed by the specified content ID from the content information DB 28.

再生情報登録手段２２は、クライアント装置１０から送信されるコンテンツ再生情報（ユーザＩＤ、コンテンツＩＤ、コンテンツ再生時刻等）を受信し、受信したユーザＩＤと、コンテンツＩＤと、コンテンツ再生時刻と、コンテンツ情報取得手段２１を利用して取得したコンテンツ再生情報に含まれるコンテンツＩＤで管理されるコンテンツの属性情報とを、コンテンツ再生情報ＤＢ２９に保存する。 The reproduction information registration unit 22 receives content reproduction information (user ID, content ID, content reproduction time, etc.) transmitted from the client device 10, and receives the received user ID, content ID, content reproduction time, and content information. The attribute information of the content managed by the content ID included in the content reproduction information acquired using the acquisition unit 21 is stored in the content reproduction information DB 29.

ユーザ種別判定手段２３は、クライアント装置１０から情報探索要求としてユーザＩＤを受信し、受信したユーザＩＤに該当するユーザのコンテンツ再生情報（ユーザＩＤ、コンテンツＩＤ、再生時刻、コンテンツ属性情報等）をコンテンツ再生情報ＤＢ２９より取得し、取得したコンテンツ再生情報をもとに、上記ユーザのユーザ種別を判定する。 The user type determination unit 23 receives the user ID as an information search request from the client device 10 and stores the content reproduction information (user ID, content ID, reproduction time, content attribute information, etc.) of the user corresponding to the received user ID as the content. The user type of the user is determined based on the content playback information acquired from the playback information DB 29.

知名度算出手段２４は、計測対象であるコンテンツの流通情報を取得し、指定されたコンテンツＩＤで管理されるコンテンツや、指定されたコンテンツ属性の知名度を、取得したコンテンツの流通情報をもとに算出する。 The famousness calculation means 24 acquires distribution information of the content to be measured, and calculates the content managed by the specified content ID and the famousness of the specified content attribute based on the acquired distribution information of the content To do.

嗜好ベクトル生成手段２５は、指定されたユーザＩＤに該当するユーザのコンテンツ再生情報に含まれる、コンテンツまたはコンテンツ属性の知名度を、知名度算出手段２４より取得し、ユーザ種別判定手段２３より取得した、上記指定されたユーザのユーザ種別に応じた計算方法で、本サービスを享受する全ユーザに対して、ユーザの嗜好をベクトル表現した嗜好ベクトルを、コンテンツ再生情報ＤＢ２９より取得したコンテンツ再生情報と、コンテンツの知名度とをもとに生成し、生成した嗜好ベクトルを嗜好ベクトル情報ＤＢ２ａに保存する。 The preference vector generation unit 25 acquires the degree of popularity of content or content attribute included in the content reproduction information of the user corresponding to the specified user ID from the name calculation unit 24 and the user type determination unit 23. The content reproduction information obtained from the content reproduction information DB 29, and the content reproduction information obtained from the content reproduction information DB 29, for all users who enjoy this service with a calculation method according to the user type of the designated user. The preference vector is generated based on the name recognition degree, and the generated preference vector is stored in the preference vector information DB 2a.

嗜好類似度算出手段２６は、サービスを享受する全ユーザの嗜好ベクトルを、嗜好ベクトル情報ＤＢ２ａより取得し、指定されたユーザＩＤに該当するユーザの嗜好ベクトルと、それ以外のユーザの嗜好ヘクトルとの類似度を算出し、算出した類似度の高いユーザを選択する。 The preference similarity calculating means 26 acquires the preference vectors of all users who enjoy the service from the preference vector information DB 2a, and the user preference vectors corresponding to the specified user ID and the other user preference vectors. The degree of similarity is calculated, and a user with a high degree of similarity calculated is selected.

探索結果抽出手段２７は、指定された複数のユーザＩＤに該当するユーザのコンテンツ再生情報に含まれる全てのコンテンツに関する情報を、コンテンツ再生情報ＤＢ２９より取得し、取得した全ての情報に対して、指定された第二のユーザＩＤに該当するユーザにとっての有益性を数値化した得点を算出し、算出した得点の高い情報を、クライアント装置１０に送信する。 The search result extraction unit 27 acquires information regarding all contents included in the content reproduction information of the user corresponding to the specified plurality of user IDs from the content reproduction information DB 29, and specifies all the acquired information with respect to the specified information. The score obtained by quantifying the benefit for the user corresponding to the second user ID is calculated, and the calculated high score information is transmitted to the client device 10.

次に、本発明の実施の形態による情報レコメンドシステムの全体的な動作について説明する。図５および図６は、本発明の実施の形態に係る情報レコメンドシステムの全体フローチャートである。本発明の対象とするコンテンツには、コンテンツを一意に識別するためのコンテンツＩＤが付与され、コンテンツファイルから付与されたコンテンツＩＤを取得することができ、予めコンテンツ提供者等が登録したコンテンツ属性情報を、コンテンツＩＤをもとに取得できる必要がある。そのため、本システムの処理前に、以下の前処理を行う必要がある。 Next, the overall operation of the information recommendation system according to the embodiment of the present invention will be described. 5 and 6 are overall flowcharts of the information recommendation system according to the embodiment of the present invention. A content ID for uniquely identifying the content is given to the content targeted by the present invention, the content ID given from the content file can be acquired, and content attribute information registered in advance by a content provider or the like Must be obtained based on the content ID. Therefore, before the processing of this system, it is necessary to perform the following preprocessing.

(前処理)
（１）コンテンツ提供者等が、コンテンツ属性情報をコンテンツ情報ＤＢ２８に登録する。 (Preprocessing)
(1) A content provider or the like registers content attribute information in the content information DB 28.

（２）記録されたコンテンツ属性情報に対して、サーバ装置２０が、コンテンツの一意識別子であるコンテンツＩＤを発行する。 (2) For the recorded content attribute information, the server device 20 issues a content ID that is a unique identifier of the content.

（３）発行されたコンテンツＩＤを、コンテンツファイルのヘッダ部等を利用して、コンテンツファイルに埋め込む。 (3) The issued content ID is embedded in the content file using the header portion of the content file.

上記の処理は、サーバ装置２０での処理に限定されるわけではなく、例えば、対象となるコンテンツが音楽の場合、Ｇｒａｃｅｎｏｔｅ社（ｈｔｔｐ：／／ｗｗｗ．ｇｒａｃｅｎｏｔｅ．ｃｏｍ）が提供する音楽データベースである「ＣＤＤＢ」で管理する楽曲のＩＤとコンテンツ属性情報を利用する等、サーバ装置２０以外の装置で行われる処理結果を利用することも可能である。 The above processing is not limited to the processing in the server device 20. For example, when the target content is music, it is a music database provided by Gracenote (http://www.gradenote.com). It is also possible to use a processing result performed by a device other than the server device 20, such as using a song ID and content attribute information managed by "CDDB".

以上の前処理の後、以下のフローで処理を行う。 After the above preprocessing, processing is performed according to the following flow.

(クライアント装置１０のコンテンツ再生情報登録処理フロー)
(ステップＳ０１)ユーザ情報入力処理
本サービスを利用するユーザは、コンテンツの再生情報をサーバ装置２０に送信する。その際、ユーザを一意に識別するためのユーザＩＤ等のユーザ情報を、ユーザ情報入力手段１１を利用して入力する。ユーザ情報を入力する契機は、コンテンツの再生開始時点や、再生プレイヤ起動時点や、コンテンツ再生情報送信時点等が考えられるが、後述するステップＳ０５のコンテンツ再生情報送信処理の以前に行われる必要がある。 (Content reproduction information registration processing flow of the client device 10)
(Step S01) User Information Input Processing A user who uses this service transmits content reproduction information to the server device 20. At that time, user information such as a user ID for uniquely identifying the user is input using the user information input means 11. The trigger for inputting user information may be the content playback start time, playback player activation time, content playback information transmission time, etc., but needs to be performed before the content playback information transmission processing in step S05 described later. .

(ステップＳ０２)コンテンツ再生処理
コンテンツ再生手段１２は、ユーザが指定したコンテンツファイル１７を再生する。コンテンツ再生形式については、クライアント装置１０の記憶装置上に存在するコンテンツファイルや、ＣＤ・ＤＶＤ等に保存されたコンテンツファイルを再生する形式と、コンテンツ配信業者等の運営するコンテンツ配信サーバ上に存在するストリーム配信用コンテンツファイルを再生する形式等があるが、本実施例においてはいずれのコンテンツ再生形式についても適用可能である。 (Step S02) Content Reproduction Processing The content reproduction means 12 reproduces the content file 17 designated by the user. The content reproduction format exists on a content distribution server operated by a content distributor and a format for reproducing a content file existing on a storage device of the client device 10 or a content file stored on a CD / DVD or the like. There are formats for reproducing content files for stream distribution, etc., but any content reproduction format is applicable in this embodiment.

(ステップＳ０３)コンテンツＩＤ取得処理
コンテンツＩＤ取得手段１３は、コンテンツ再生制御手段１２を利用して再生されるコンテンツファイル１７から、前処理（３）にてコンテンツファイルに埋め込んだコンテンツＩＤを取得する。ここで、コンテンツファイル１７からコンテンツＩＤを取得する方法は、上記したコンテンツファイルのヘッダ部に埋め込まれたコンテンツＩＤを読み出す方法や、電子透かし等によってコンテンツファイルに埋め込まれたコンテンツＩＤを読み出す方法や、ＣＤ等の場合は、ＣＤに埋め込まれたＩＤを読み出すための専用ツールを利用してＩＤを読み出す方法等が考えられる。 (Step S03) Content ID Acquisition Processing The content ID acquisition means 13 acquires the content ID embedded in the content file in the preprocessing (3) from the content file 17 reproduced using the content reproduction control means 12. Here, the method of acquiring the content ID from the content file 17 includes a method of reading the content ID embedded in the header portion of the content file described above, a method of reading the content ID embedded in the content file by digital watermarking, In the case of a CD or the like, a method of reading the ID using a dedicated tool for reading the ID embedded in the CD can be considered.

(ステップＳ０４)コンテンツＩＤ埋め込み有無判定
コンテンツファイル１７へのコンテンツＩＤ埋め込みの有無を判定する。 (Step S04) Content ID Embedding Presence Determination The presence or absence of content ID embedding in the content file 17 is determined.

(ステップＳ０５)再生情報送信処理
コンテンツファイル１７にコンテンツＩＤが埋め込まれていた場合、再生情報通知手段１４は、ステップＳ０１にて入力されたユーザＩＤと、ステップＳ０３にて取得したコンテンツファイル１７のコンテンツＩＤと、コンテンツ再生時刻として現在時刻とをコンテンツ再生情報としてサーバ装置２０に送信する。 (Step S05) Reproduction Information Transmission Processing When the content ID is embedded in the content file 17, the reproduction information notification means 14 sends the user ID input in step S01 and the content of the content file 17 acquired in step S03. The ID and the current time as the content playback time are transmitted to the server device 20 as content playback information.

(サーバ装置２０のコンテンツ再生情報登録処理フロー)
(ステップＳ０６)コンテンツ情報取得処理
サーバ装置２０は、ステップＳ０５にてクライアント装置１０から送信されるコンテンツ再生情報（ユーザＩＤ、コンテンツＩＤ、再生時刻等）を受信する。コンテンツ情報取得手段２１は、受信したコンテンツ再生情報に含まれるコンテンツＩＤに該当するコンテンツのコンテンツ属性情報（コンテンツのタイトル、アーティスト名、ジャンル名等）をコンテンツ情報ＤＢ２８より取得する。 (Content reproduction information registration processing flow of server device 20)
(Step S06) Content Information Acquisition Processing The server device 20 receives the content reproduction information (user ID, content ID, reproduction time, etc.) transmitted from the client device 10 in step S05. The content information acquisition unit 21 acquires content attribute information (content title, artist name, genre name, etc.) of the content corresponding to the content ID included in the received content reproduction information from the content information DB 28.

(ステップＳ０７)再生情報登録処理
再生情報登録手段２２は、ステップＳ０５にてクライアント装置１０から送信されるコンテンツ再生情報（ユーザＩＤ、コンテンツＩＤ、再生時刻等）と、ステップＳ０６にて取得したコンテンツ属性情報とを、対応づけてコンテンツ再生情報ＤＢ２９に保存する。 (Step S07) Reproduction Information Registration Process The reproduction information registration means 22 includes the content reproduction information (user ID, content ID, reproduction time, etc.) transmitted from the client device 10 in step S05, and the content attribute acquired in step S06. The information is stored in the content reproduction information DB 29 in association with each other.

(情報探索処理フロー)
(ステップＳ０８)ユーザ情報入力処理
本サービスを利用するユーザは、ユーザ情報入力手段１１を利用して、ユーザを一意に識別するためのユーザＩＤ等のユーザ情報を入力する。クライアント装置１０の情報探索要求送信手段１５は、入力されたユーザ情報を、サーバ装置２０に情報探索要求メッセージとして送信する。 (Information search processing flow)
(Step S08) User Information Input Processing A user who uses this service uses the user information input means 11 to input user information such as a user ID for uniquely identifying the user. The information search request transmission unit 15 of the client device 10 transmits the input user information to the server device 20 as an information search request message.

(ステップＳ０９)ユーザ種別判定処理
サーバ装置２０は、クライアント装置１０から送信される情報探索要求メッセージとしてユーザ情報を受信する。ユーザ種別判定手段２３は、受信したユーザＩＤに該当する情報探索要求元ユーザのコンテンツ再生情報を、コンテンツ再生情報ＤＢ２９より取得し、取得したコンテンツ再生情報をもとに、ユーザが再生したコンテンツまたはそのコンテンツ属性のばらつきと、コンテンツの再生が特定の時間範囲内に集中しているか、継続的に再生されているかを数値で表したコンテンツ再生のバースト性とを反映した指標値を計算する。そして、算出された指標値をもとに、情報探索要求元ユーザのユーザ種別を判定する。 (Step S09) User Type Determination Processing The server device 20 receives user information as an information search request message transmitted from the client device 10. The user type determination unit 23 acquires the content reproduction information of the information search requesting user corresponding to the received user ID from the content reproduction information DB 29, and the content reproduced by the user based on the acquired content reproduction information or its content An index value is calculated that reflects the variation in content attributes and the burstiness of content reproduction that expresses numerically whether content reproduction is concentrated within a specific time range or is being continuously reproduced. Then, based on the calculated index value, the user type of the information search requesting user is determined.

以下の説明では、ユーザ種別として、非特許文献３で報告されているＭａｎｉａ、ＯｐｉｎｉｏｎＬｅａｄｅｒ、Ｍａｊｏｒｉｔｙの３階層のユーザ種別を例にとって説明するが、ユーザ種別として、特願２００３−１５９１１８号「プロファイリングシステム、プロファイリング方法、プログラムおよび記録媒体」で示されているユーザプロファイル等のユーザ種別を利用することも可能である。 In the following description, the user type will be described by taking the three-level user type reported in Non-Patent Document 3 as Mania, Opinion Leader, and Majority as an example, but as the user type, Japanese Patent Application No. 2003-159118 “Profiling System” It is also possible to use a user type such as a user profile indicated by “Profiling method, program and recording medium”.

(ステップＳ１０)知名度算出処理
知名度算出手段２４は、ステップＳ０９で取得した、情報探索要求元ユーザのコンテンツ再生情報に含まれるコンテンツＩＤに該当する全てのコンテンツに対して、コンテンツまたはコンテンツ属性情報の流通情報を取得し、取得した流通情報をもとに、コンテンツまたはコンテンツ属性の知名度を算出する。 (Step S10) Name recognition calculation processing The name recognition means 24 distributes content or content attribute information for all contents corresponding to the content ID included in the content reproduction information of the information search requesting user acquired in step S09. Information is acquired, and based on the acquired distribution information, the degree of popularity of the content or content attribute is calculated.

ここで、コンテンツまたはコンテンツ属性情報の流通情報とは、オリコン社が提供するような音楽ＣＤ販売に関する売り上げ・ランキング情報（ｗｗｗ．ｏｒｉｃｏｎ．ｃｏ．ｊｐ）や、例えばＷｉｎＭＸ（ｈｔｔｐ：／／ｗｗｗ．ｗｉｎｍｘ．ｃｏｍ／）のようなユーザ間でコンテンツを交換するアプリケーションを利用して単位時問当たりに交換されるコンテンツの個数や、対象コンテンツのコンテンツ属性情報を検索エンジンなどで検索した際の検索結果件数や、これらの情報を複数組み合わせた情報といった、コンテンツの流通頻度を数値化した値を指す。もちろん、本システムで取得したコンテンツ再生情報ＤＢ２９に保存されるコンテンツ再生情報をもとに、コンテンツまたはコンテンツ属性情報の流通情報を算出することも可能である。 Here, the distribution information of content or content attribute information is sales / ranking information (www.oricon.co.jp) related to music CD sales such as provided by Oricon, for example, WinMX (http: //www.winmx). .Com /) and the number of search results when searching the content attribute information of the target content using a search engine, etc., using an application for exchanging content between users. Or a value obtained by quantifying the distribution frequency of content, such as information obtained by combining a plurality of such information. Of course, it is also possible to calculate distribution information of content or content attribute information based on the content reproduction information stored in the content reproduction information DB 29 acquired by the present system.

(ステップＳ１１)嗜好ベクトル生成処理
嗜好ベクトル生成手段２５は、サービスを享受する全ユーザのコンテンツ再生情報を、コンテンツ再生情報ＤＢ２９より取得し、取得したコンテンツ再生情報と、ステップＳ０９で判定した情報探索要求元ユーザのユーザ種別と、Ｓ１０で取得したコンテンツまたはコンテンツ属性の知名度とをもとに、ユーザの嗜好を表現するベクトルである嗜好ベクトルを、本サービスを享受する全ユーザについて生成する。生成した嗜好ベクトルは嗜好ベクトル情報ＤＢ２ａに保存する。 (Step S11) Preference Vector Generation Processing The preference vector generation means 25 acquires the content reproduction information of all users who enjoy the service from the content reproduction information DB 29, the acquired content reproduction information, and the information search request determined in step S09 Based on the user type of the original user and the familiarity of the content or content attribute acquired in S10, a preference vector, which is a vector expressing the user's preference, is generated for all users who enjoy this service. The generated preference vector is stored in the preference vector information DB 2a.

嗜好ベクトルは、ユーザが過去に再生したコンテンツのコンテンツＩＤまたはコンテンツ属性情報を要素とし、要素としたコンテンツＩＤまたはコンテンツ属性情報を持つコンテンツの再生回数や再生時間といった再生頻度と、上記コンテンツまたはコンテンツ属性の知名度をもとに、上記情報探索要求元ユーザのユーザ種別に応じて算出した重みとの積を各要素の値とする。 The preference vector includes the content ID or content attribute information of the content that the user has played in the past as an element, the reproduction frequency such as the number of times or the reproduction time of the content having the content ID or content attribute information as the element, and the content or content attribute. The product of the weight calculated according to the user type of the information search requesting user based on the degree of name recognition is the value of each element.

(ステップＳ１２)嗜好類似度算出処理
嗜好類似度算出手段２６は、ステップＳ１１で生成した全ユーザの嗜好ベクトルを、嗜好ベクトル情報ＤＢ２ａより取得し、取得した嗜好ベクトル情報をもとに、情報探索要求元ユーザと、その他のユーザとの嗜好ベクトルの類似度を算出する。 (Step S12) Preference Similarity Calculation Processing The preference similarity calculation means 26 acquires the preference vectors of all users generated in step S11 from the preference vector information DB 2a, and requests information search based on the acquired preference vector information. The similarity of preference vectors between the original user and other users is calculated.

２ユーザ間の嗜好ベクトル類似度の算出方法は、対象となる２ベクトルの余弦を取る方法や、ベクトルの要素値の平均が０になるようにベクトルを正規化した後に余弦を取る方法といった、従来の協調フィルタリングの嗜好類似度の算出方法等を用いることが考えられる。 Conventional methods for calculating the preference vector similarity between two users include a method of taking the cosine of two target vectors, and a method of taking the cosine after normalizing the vectors so that the average of the element values of the vectors becomes zero. It is conceivable to use a preference similarity calculation method for collaborative filtering.

(ステップＳ１３)情報選択処理
探索結果抽出手段２７は、ステップＳ１２で算出した情報探索要求元ユーザの嗜好ベクトルと類似度が高いユーザ集合を生成し、生成したユーザ集合に含まれるユーザのコンテンツ再生情報を、コンテンツ再生情報ＤＢ２９から取得する。取得したコンテンツ再生情報に含まれるコンテンツに対して、上記ユーザ集合に含まれるユーザのコンテンツ再生情報や、情報探索要求元ユーザの再生の有無等を考慮して、そのコンテンツが情報探索要求元ユーザにとって、どのくらい有益であるかを表す得点を算出する。算出した得点の高いコンテンツを、情報探索結果として、クライアント装置１０に送信する。 (Step S13) Information Selection Processing The search result extraction unit 27 generates a user set having a high similarity to the preference vector of the information search requesting user calculated in step S12, and the user content reproduction information included in the generated user set Is acquired from the content reproduction information DB 29. In consideration of the content reproduction information of the user included in the user set, the presence / absence of reproduction of the information search requesting user, and the like for the content included in the acquired content reproduction information, the content is determined by the information search requesting user. , Calculate a score that represents how useful it is. The calculated high score content is transmitted to the client device 10 as an information search result.

(ステップＳ１４)探索結果表示処理
クライアント装置１０の探索結果表示手段１６は、ステップＳ１３にて、サーバ装置２０から送信される情報探索結果を受信し、表示する。 (Step S14) Search Result Display Processing The search result display means 16 of the client device 10 receives and displays the information search result transmitted from the server device 20 in step S13.

次に、図７に従って、上述した全体的な処理フローにおけるステップＳ０９のユーザ種別判定処理について、さらに詳しく説明する。 Next, the user type determination process in step S09 in the overall process flow described above will be described in more detail with reference to FIG.

ユーザ種別判定手段２３は、クライアント装置１０から、情報探索要求メッセージとしてユーザＩＤを受信する（Ｓ０９０１）。 The user type determination unit 23 receives the user ID as an information search request message from the client device 10 (S0901).

受信したユーザＩＤをもとに、情報探索要求元ユーザのコンテンツ再生情報を、コンテンツ再生情報ＤＢ２９より取得する（Ｓ０９０２）。 Based on the received user ID, the content reproduction information of the information search requesting user is acquired from the content reproduction information DB 29 (S0902).

取得した情報探索要求元ユーザのコンテンツ再生情報を、コンテンツＩＤまたはコンテンツ属性情報と、時間とで集計する。これにより、情報探索要求元ユーザが、任意の時刻１から、設定された時間範囲後の時刻２までの間に、どのようなコンテンツをどれぐらい（回数・時間等）再生したかを表すコンテンツ再生情報集計表が作成される。 The acquired content reproduction information of the information search request source user is totaled by content ID or content attribute information and time. As a result, content reproduction indicating what content (how many times, time, etc.) has been reproduced by the information search requesting user from time 1 to time 2 after the set time range. An information summary table is created.

図８に、コンテンツ再生情報からコンテンツ再生情報集計表を作成した例について示す。図８に示した例では、コンテンツ再生情報から得られるコンテンツ再生回数を、コンテンツのアーティスト名と、時間とで集計している（Ｓ０９０３）。 FIG. 8 shows an example in which a content reproduction information tabulation table is created from content reproduction information. In the example shown in FIG. 8, the number of times of content reproduction obtained from the content reproduction information is totaled based on the content artist name and time (S0903).

作成した情報探索要求元ユーザのコンテンツ再生情報集計表の値を正規化したコンテンツ再生頻度表を作成する。コンテンツ再生頻度表は、情報探索要求元ユーザが、あるコンテンツまたはコンテンツ属性に対して、ユーザの全興味中どれくらいの興味を持っているかを表現する。図９に、図８で作成したコンテンツ再生情報集計表からコンテンツ再生頻度表を作成した例について示す。この例では、コンテンツ再生情報集計表の各値を、全コンテンツ再生回数で除算することにより、コンテンツ再生情報集計表を正規化し、コンテンツ再生頻度表を作成している（Ｓ０９０４）。 A content reproduction frequency table is created by normalizing the values in the content reproduction information aggregation table of the created information search request source user. The content reproduction frequency table expresses how much the information search requesting user is interested in a certain content or content attribute among all the interests of the user. FIG. 9 shows an example in which a content reproduction frequency table is created from the content reproduction information tabulation table created in FIG. In this example, by dividing each value of the content reproduction information aggregation table by the total number of times of content reproduction, the content reproduction information aggregation table is normalized to create a content reproduction frequency table (S0904).

作成したコンテンツ再生頻度表の値をもとに、情報探索要求元ユーザのユーザ種別を判定するためのユーザ指標値を算出する。本発明では、ユーザが再生するコンテンツまたはコンテンツ属性のばらつきと、コンテンツの再生が特定の時間範囲内に集中しているか、継続的に再生されているかを数値で表したコンテンツ再生のバースト性とをもとに、ユーザ指標値を算出する。 Based on the value of the created content reproduction frequency table, a user index value for determining the user type of the information search requesting user is calculated. In the present invention, the variation of content or content attributes to be played back by the user and the burstability of content playback that expresses numerically whether content playback is concentrated within a specific time range or continuously played back. Based on this, a user index value is calculated.

図１０に、コンテンツ再生頻度表からユーザ指標値を算出した例を示す、図１０に示した例では、再生コンテンツ属性のばらつきを、コンテンツ再生頻度表の値を各コンテンツ属性（図１０に示した例では、各アーティスト）ごとに集計した値の分散とした。また、コンテンツ再生のバースト性は、コンテンツ再生頻度表の時間軸方向の分散値をコンテンツ属性（図１０に示した例では、アーティスト）ごとに算出し、算出した各分散値の平均として算出した。ユーザ指標値は、再生コンテンツ属性のばらつきと、コンテンツ再生のバースト性との商とした。図１０に示したユーザ指標値算出方法では、特定のコンテンツに限って再生回数が多く、かつ長期問に渡って再生を繰り返すユーザのユーザ指標値が高く算出され、多くの種類のコンテンツを短時間に再生するユーザのユーザ指標値が低く算出される。すなわち、Ｍａｎｉａのユーザ指標値が最も高く、Ｍａｊｏｒｉｔｙのユーザ指標値が最も低く、Ｍａｎｉａのユーザ指標値とＭａｊｏｒｉｔｙのユ一ザ指標値との中間のユーザ指標値をとるユーザがＯｐｉｎｉｏｎＬｅａｄｅｒとなる（Ｓ０９０５）。 FIG. 10 shows an example in which the user index value is calculated from the content reproduction frequency table. In the example shown in FIG. 10, the variation of the reproduction content attribute is shown, and the value of the content reproduction frequency table is shown for each content attribute (shown in FIG. 10). In the example, the distribution of values aggregated for each artist) is used. Further, the burst property of content reproduction is calculated as the average of the calculated variance values by calculating the variance value in the time axis direction of the content reproduction frequency table for each content attribute (in the example shown in FIG. 10, artist). The user index value is a quotient of the variation in the reproduction content attribute and the burst property of the content reproduction. In the user index value calculation method shown in FIG. 10, the user index value of a user who plays a large number of times only for specific content and repeats playback over a long period of time is calculated high, and many types of content are Therefore, the user index value of the user who reproduces is calculated to be low. In other words, a user who has the highest user index value in Maria, the lowest user index value in Majority, and takes an intermediate user index value between the user index value in Mania and the user index value in Majority becomes Opinion Leader (S0905). ).

算出したユーザ指標値と、Ｍａｎｉａ、ＯｐｉｎｉｏｎＬｅａｄｅｒ、Ｍａｊｏｒｉｔｙを判定するための閾値とを比較することにより、ユーザ種別を判定する。ＭａｎｉａとＯｐｉｎｉｏｎＬｅａｄｅｒとを分離するための閾値を「閾値１」とし、ＯｐｉｎｉｏｎＬｅａｄｅｒとＭａｊｏｒｉｔｙを分離するための閾値を閾値２」とする。まず、ユーザ指標値と閾値１とを比較する（Ｓ０９０６）。 The user type is determined by comparing the calculated user index value with thresholds for determining Mania, Opinion Leader, and Majority. The threshold for separating Mania and Opinion Leader is “threshold 1”, and the threshold for separating Opinion Leader and Majority is “threshold 2”. First, the user index value is compared with the threshold value 1 (S0906).

（１）ユーザ指標値が閾値１より小さい場合。 (1) The user index value is smaller than the threshold value 1.

ユーザ指標値と閾値２とを比較する。（Ｓ０９０７）
（１−１）ユーザ指標値が閾値２より小さい場合
情報探索要求元ユーザのユーザ種別を「Ｍａｊｏｒｉｔｙ」と判定する（Ｓ０９０８）。 The user index value and the threshold value 2 are compared. (S0907)
(1-1) When the user index value is smaller than the threshold value 2 The user type of the information search requesting user is determined as “Majority” (S0908).

（１−２）ユーザ指標値が閾値２より大きい場合情報探索要求元ユーザのユーザ種別を「ＯｐｉｎｉｏｎＬｅａｄｅｒ」と判定する（Ｓ０９０９）。 (1-2) When the user index value is larger than the threshold value 2 The user type of the information search requesting user is determined to be “Opinion Leader” (S0909).

（２）ユーザ指標値が閾値１より大きい場合
情報探索要求元ユーザのユーザ種別を「Ｍａｎｉａ」と判定する（Ｓ０９１０）。 (2) When the user index value is larger than the threshold value 1 The user type of the information search requesting user is determined to be “Mania” (S0910).

ユーザ種別を判定するための閾値の設定については、サービス開始時に経験的に決定された固定値を用いる方法や、本システムによって提供される探索結果に対するユーザ評価をフィードバックし、機械学習等を利用して動的に設定する方法等が考えられる。 Regarding the threshold setting for determining the user type, a method using a fixed value determined empirically at the start of the service or a user evaluation for the search result provided by the system is fed back and machine learning or the like is used. For example, a method of dynamically setting can be considered.

なお、図７〜図１０で示したユーザ指標値の算出方法は、本発明の一例についての理解を容易にするために記載されたものであり、本発明は、ユーザ指標値が、再生コンテンツ属性のばらつきと、コンテンツ再生のバースト性とから算出される、ユーザの興味に関する特徴を表した数値であれば、以上説明したユーザ指標値の算出方法に限定されるわけではない。 It should be noted that the user index value calculation methods shown in FIGS. 7 to 10 are described for easy understanding of an example of the present invention. In the present invention, the user index value is a reproduction content attribute. As long as it is a numerical value that represents a feature relating to the user's interest, calculated from the variation of the content and the burst property of content reproduction, the method is not limited to the above-described method of calculating the user index value.

次に、図１１に従って、上述した全体的な処理フローにおけるステップＳ１０のコンテンツ知名度算出処理について、本システムで取得したコンテンツ再生情報ＤＢ２９に保存されるコンテンツ再生情報をコンテンツ流通情報とした場合を例にとって、さらに詳しく説明する。 Next, according to FIG. 11, for the content name recognition calculation process in step S10 in the overall processing flow described above, a case where the content reproduction information stored in the content reproduction information DB 29 acquired by this system is content distribution information is taken as an example. This will be described in more detail.

コンテンツの流通情報として、コンテンツ再生情報ＤＢ２９より取得した、指定した任意の時刻から現在時刻までのコンテンツ再生情報を、コンテンツＩＤまたはコンテンツ属性（図１１に示した例では、アーティスト名）と、ユーザＩＤとで集計する。これにより、指定した任意の時刻から現在までの間に、どのユーザがどのコンテンツまたはどのコンテンツ属性を持つコンテンツ（図１１に示した例では、どのアーティストのコンテンツ）をどれぐらい（回数・時間等）再生したかを示す表（図１１中、コンテンツ再生集計表）が作成される。図１１の例では、コンテンツ知名度を、知名度算出対象のアーティストを属性に持つコンテンツを再生した人数として算出した。 As content distribution information, content reproduction information from the specified arbitrary time to the current time acquired from the content reproduction information DB 29, content ID or content attribute (artist name in the example shown in FIG. 11), and user ID And sum up. Thus, what content (which artist's content in the example shown in FIG. 11) has which content or which content attribute from which user is designated to the present time (how many times, time, etc.) A table (content reproduction totaling table in FIG. 11) indicating whether the content has been reproduced is created. In the example of FIG. 11, the content familiarity is calculated as the number of people who have played back the content having the artist whose name is to be calculated.

図１１で示したコンテンツ知名度の算出方法は、本発明の一例についての理解を容易にするために記載されたものであり、本発明は、以上説明したコンテンツ知名度の算出方法に限定されるわけではない。 The content recognition calculation method shown in FIG. 11 is described in order to facilitate understanding of an example of the present invention, and the present invention is not limited to the content recognition calculation method described above. Absent.

次に、図１２に従って、上述した全体的な処理フローにおけるステップＳ１１の嗜好ベクトル生成処理について、さらに詳しく説明する。嗜好ベクトル生成手段２５は、情報探索要求元ユーザのユーザ種別をユーザ種別判定手段２３より取得する（Ｓ１１０１）。 Next, the preference vector generation process in step S11 in the overall process flow described above will be described in more detail with reference to FIG. The preference vector generation unit 25 acquires the user type of the information search requesting user from the user type determination unit 23 (S1101).

情報探索要求元ユーザのコンテンツ再生情報を、コンテンツ再生情報ＤＢ２９より取得し（Ｓ１１０２）、取得したコンテンツ再生情報に含まれる全コンテンツまたはコンテンツ属性から成る、コンテンツまたはコンテンツ属性の集合Ｃを生成する（Ｓ１１０３）。 The content reproduction information of the information search request source user is acquired from the content reproduction information DB 29 (S1102), and a set C of content or content attributes including all content or content attributes included in the acquired content reproduction information is generated (S1103). ).

コンテンツまたはコンテンツ属性の集合Ｃに含まれる全てのコンテンツまたはコンテンツ属性の知名度を、知名度算出手段２４より取得する（Ｓ１１０４）。 The famousness of all the content or content attributes included in the content or content attribute set C is acquired from the famousness calculation means 24 (S1104).

次に、ループ関数ｊを１に初期化し（Ｓ１１０５）、以下の処理を繰り返す。 Next, the loop function j is initialized to 1 (S1105), and the following processing is repeated.

嗜好ベクトル生成手段２５は、本サービスを享受するユーザｊめコンテンツ再生情報を、コンテンツ再生情報ＤＢ２９より取得する（Ｓ１１０６）。 The preference vector generation means 25 acquires content reproduction information for the user who enjoys this service from the content reproduction information DB 29 (S1106).

取得したユーザｊのコンテンツ再生情報をもとに、ユーザｊのコンテンツｋまたはコンテンツ属性ｋを持つコンテンツの累積再生回数Ｐｊｋを、コンテンツまたはコンテンツ属性の集合Ｃに含まれる全てのコンテンツｋまたはコンテンツ属性ｋについて算出する（Ｓ１１０７）。 Based on the acquired content playback information of the user j, the cumulative playback count Pjk of the content having the content k or content attribute k of the user j is set to all the content k or content attributes k included in the content or content attribute set C. Is calculated (S1107).

算出したコンテンツ累積再生回数をもとに、全てのユーザの嗜好ベクトルを、情報探索要求元のユーザ種別に従って（Ｓ１１０８）、以下の方法で生成する。 Based on the calculated content cumulative reproduction count, preference vectors of all users are generated by the following method according to the user type of the information search request source (S1108).

（１）情報探索要求元ユーザのユーザ種別が「Ｍａｎｉａ」の場合
情報探索要求元ユーザは、自分とマイナーな趣味が一致する他者からの情報に対する期待が高いことから、嗜好ベクトルの要素のうち、希少性の高いコンテンツまたはコンテンツ属性の要素値を強調する必要がある。そのため、本実施例では、ユーザの嗜好ベクトルの各要素値を、コンテンツ累積再生回数と、コンテンツまたはコンテンツ属性の知名度との商として算出する（Ｓ１１０９）。 (1) When the user type of the information search requesting user is “Mania” The information search requesting user has a high expectation for information from others who have minor hobbies with him. It is necessary to emphasize element values of highly rare content or content attributes. Therefore, in this embodiment, each element value of the user's preference vector is calculated as a quotient between the content cumulative number of reproduction times and the content or content attribute name recognition (S1109).

（２）情報探索要求元ユーザのユーザ種別が「ＯｐｉｎｉｏｎＬｅａｄｅｒ」の場合
本実施例では、ユーザの嗜好ベクトルの各要素値を、コンテンツ累積再生回数とする（Ｓ１１１０）。 (2) When the user type of the information search requesting user is “Opinion Leader” In this embodiment, each element value of the user's preference vector is set as the cumulative content reproduction count (S1110).

ここで、情報探索要求元ユーザは、新しい情報に対する期待が高いことから、コンテンツまたはコンテンツ属性の知名度の元になるコンテンツまたはコンテンツ属性情報の流通情報として、現在時刻に近い一定期間に限定した情報を利用し、ユーザの嗜好ベクトルの各要素値を、コンテンツ累積再生回数と、コンテンツまたはコンテンツ属性の知名度の積として算出する等の方法により、より効果的な嗜好ベクトルが生成される可能性がある。 Here, since the information search requesting user has high expectation for new information, the information limited to a certain period close to the current time is distributed as the distribution information of the content or content attribute information that is the origin of the content or the content attribute. There is a possibility that a more effective preference vector may be generated by a method such as calculating each element value of the user's preference vector as a product of the cumulative number of times of content reproduction and the famousness of the content or content attribute.

（３）情報探索要求元ユーザのユーザ種別が「Ｍａｊｏｒｉｔｙ」の場合
情報探索要求元ユーザは、世の中で採用者の多い流行の情報を持つユーザからの情報取得に対する期待が高いことから、嗜好ベクトルの要素のうち、知名度が高く、世の中に多く普及するコンテンツまたはコンテンツ属性の要素値を強調する必要がある。そのため、本実施例では、ユーザの嗜好ベクトルの各要素値を、コンテンツ累積再生回数と、コンテンツまたはコンテンツ属性の知名度との積として算出する（Ｓ１１１１）。 (3) When the user type of the information search requesting user is “Majority” Since the information search requesting user has high expectation for information acquisition from users who have popular information in the world, the preference vector Among the elements, it is necessary to emphasize element values of content or content attributes that are highly known and widely spread in the world. Therefore, in this embodiment, each element value of the user's preference vector is calculated as the product of the content cumulative number of reproduction times and the content or the name of the content attribute (S1111).

上記の操作を、本サービスを享受する全ユーザ（ユーザ総数Ｎ）に対して繰り返す。すなわち、ｊ≧Ｎかを判断し（Ｓ１１１２）、ｊ<Ｎの場合には、ｊを＋１インクリメントして（Ｓ１１１３）、ステップＳ１１０６以降を繰り返す。ｊ≧Ｎになったら、処理を終了する。 The above operation is repeated for all users (the total number N of users) who enjoy this service. That is, it is determined whether j ≧ N (S1112). If j <N, j is incremented by +1 (S1113), and step S1106 and subsequent steps are repeated. When j ≧ N, the process is terminated.

図１２に示した嗜好ベクトルの算出式は、本発明の一例についての理解を容易にするために記載されたものであり、本発明は、以上説明した嗜好ベクトルの算出式に限定されるわけではない。 The preference vector calculation formula shown in FIG. 12 is described for easy understanding of an example of the present invention, and the present invention is not limited to the preference vector calculation formula described above. Absent.

図１３は、図１５で示した評価表の例に対して、計測対象のコンテンツに適当なコンテンツ知名度を付与し、ユーザ種別がＭａｎｉａであるＵｓｅｒＡが情報探索要求元ユーザである場合の、本実施例を利用した各ユーザの嗜好類似度の計算結果である。 FIG. 13 shows the implementation of the example of the evaluation table shown in FIG. 15 in the case where appropriate content name recognition is given to the content to be measured and User A whose user type is “Mania” is the information search requesting user. It is a calculation result of each user's preference similarity using an example.

図１４は、図１５で示した評価表の例に対して、計測対象のコンテンツに適当なコンテンツ知名度を付与し、ユーザ種別がＭａｊｏｒｉｔｙであるＵｓｅｒＢが情報探索要求元ユーザである場合の、本実施例を利用した各ユーザの嗜好類似度の計算結果である。 FIG. 14 shows the implementation of the example of the evaluation table shown in FIG. 15 in the case where appropriate content name recognition is given to the content to be measured and User B whose user type is Majority is the information search requesting user. It is a calculation result of each user's preference similarity using an example.

図１３および図１４の計算結果は、上述した従来技術における課題を解決した結果となっている。 The calculation results of FIGS. 13 and 14 are the results of solving the above-described problems in the prior art.

なお、本発明は、図５〜図１２に示した方法または図１に示した装置の一部または全部の処理機能をプログラムとして構成してコンピュータを用いて実現すること、あるいは図５〜図１２で示した処理手順をプログラムとして構成してコンピュータに実行させることができる。また、コンピュータでその各部の処理機能を実現するためのプログラム、あるいはコンピュータのその処理手順を実行させるためのプログラムを、そのコンピュータが読み取り可能な記録媒体、例えば、フレキシブルディスク、ＭＯ、ＲＯＭ、メモリカード、ＣＤ、ＤＶＤ、リムーバブルディスクなどに記録して、保存したり、提供したりすることが可能であり、また、インターネットのような通信ネットワークを介して配布したりすることが可能である。 Note that the present invention can be realized by using a computer by configuring a part or all of the processing functions of the method shown in FIGS. 5 to 12 or the apparatus shown in FIG. 1 as a program, or FIGS. Can be configured as a program and executed by a computer. Also, a computer-readable recording medium such as a flexible disk, MO, ROM, memory card, or the like is stored in the computer. It can be recorded on a CD, a DVD, a removable disk, etc., stored, provided, and distributed via a communication network such as the Internet.

情報レコメンドシステムのシステム構成の概要を示した図。The figure which showed the outline | summary of the system configuration | structure of an information recommendation system. コンテンツ情報ＤＢのデータ構造例を示した図。The figure which showed the data structure example of content information DB. コンテンツ再生情報ＤＢのデータ構造例を示した図。The figure which showed the data structure example of content reproduction information DB. 嗜好ベクトル情報ＤＢのデータ構造例を示した図。The figure which showed the data structure example of preference vector information DB. 情報レコメンド方法の全体フローチャート。The whole flowchart of an information recommendation method. 情報レコメンド方法の全体フローチャート。The whole flowchart of an information recommendation method. ユーザ種別算出処理の詳細フローチャート。The detailed flowchart of a user classification calculation process. コンテンツ再生情報からコンテンツ再生集計表への変換例を示した図。The figure which showed the example of conversion from content reproduction information to a content reproduction total table. コンテンツ再生集計表からコンテンツ再生頻度表への変換例を示した図。The figure which showed the example of conversion from a content reproduction total table to a content reproduction frequency table. ユーザ指標値の計算例を示した図。The figure which showed the example of calculation of a user index value. コンテンツ知名度の算出方法の例を示す図。The figure which shows the example of the calculation method of content visibility. 嗜好ベクトル生成処理の詳細フローチャート。The detailed flowchart of a preference vector production | generation process. 本実施例による嗜好類似度の算出結果例を示した図。The figure which showed the example of a calculation result of the preference similarity by a present Example. 本実施例による嗜好類似度の算出結果例を示した図。The figure which showed the example of a calculation result of the preference similarity by a present Example. 従来の協調フィルタリングによる嗜好類似度の算出結果例を示した図。The figure which showed the calculation result example of the preference similarity by the conventional collaborative filtering.

Explanation of symbols

１０…クライアント装置
１１…ユーザ情報入力手段
１２…コンテンツ再生制御手段
１３…コンテンツＩＤ取得手段
１４…再生情報通知手段
１５…情報探索要求送信手段
１６…探索結果表示手段
１７…コンテンツファイル
２０…サーバ装置
２１…コンテンツ情報取得手段
２２…再生情報登録手段
２３…ユーザ種別判定手段
２４…知名度算出手段
２５…嗜好ベクトル生成手段
２６…嗜好類似度算出手段
２７…探索結果抽出手段
２８…コンテンツ情報
２９…コンテンツ再生情報
２ａ…嗜好ベクトル情報 DESCRIPTION OF SYMBOLS 10 ... Client apparatus 11 ... User information input means 12 ... Content reproduction control means 13 ... Content ID acquisition means 14 ... Reproduction information notification means 15 ... Information search request transmission means 16 ... Search result display means 17 ... Content file 20 ... Server apparatus 21 ... content information acquisition means 22 ... reproduction information registration means 23 ... user type determination means 24 ... name degree calculation means 25 ... preference vector generation means 26 ... preference similarity calculation means 27 ... search result extraction means 28 ... content information 29 ... content reproduction information 2a ... Preference vector information

Claims

In an information recommendation method in which when a user makes an information search request by a client device, a server device connected to the client device performs an information search process and provides information to the client device.
Determine the concentration of interest from the content played by the user or the content attribute, the content attribute variation, and the burstiness of the content playback, determine the type of user by setting a threshold value for this concentration value,
Generate a preference vector that expresses the user's preference by a method according to the type of the user,
A user set having a preference vector having a similarity equal to or greater than a certain value with a user's preference vector for information provision,
An information recommendation method comprising: determining information to be provided to a user who requests information provision from among information corresponding to users included in the user set.

When a user makes an information search request by a client device, a server device connected to the client device performs an information search process and provides information to the client device.
A process of receiving content playback information from the client device when a user plays back content on a client device;
Acquiring content attribute information corresponding to a content ID included in the received content reproduction information from a content information database;
Recording the content reproduction information and the acquired content attribute information in a content reproduction information database in association with each other;
Receiving an information search request specifying user information from the client device;
Determining the user type of the information search request source user based on the content reproduction information recorded in the content reproduction information database;
A step of calculating the degree of popularity of the content or content attribute included in the content reproduction information of the information search requesting user as a numerical value;
Generating a vector expressing user preferences according to the type of the user based on the content reproduction information of the user recorded in the content reproduction information database and the degree of popularity of the content or content attribute;
Recording the vector as a user preference vector in a preference vector information database;
Calculating the similarity between the preference vector of the information search requesting user and the preference vector of the user recorded in the preference vector information database;
A user having a high similarity between the information search request source user and the preference vector is selected, and information useful for the user who transmitted the information search request is extracted from the content or content attribute information included in the content reproduction information of the selected user. Process,
Transmitting the extracted information as an information search result to the client device;
An information recommendation method characterized by comprising:

The information recommendation method according to claim 2,
The user type is based on the content playback information recorded in the content playback information database, whether the content played back by the user or the content attribute of the user and whether the content playback is concentrated within a specific time range An information recommendation method, characterized in that the determination is based on an index value calculated by reflecting the burst property of content reproduction, which is a numerical value indicating whether the content is continuously reproduced.

The information recommendation method according to claim 2 or 3,
The preference vector has a content ID or content attribute of content that the user has played in the past as an element of the vector, and has a reproduction frequency of content having the content ID or content attribute as an element, and a well-known degree of the content or content attribute. And an information recommendation method, wherein a value given a weight calculated according to a user type of the information search requesting user is used as an element value.

When a user makes an information search request by a client device, the server receives the request, performs information search processing, and provides information to the client device.
Determine the concentration of interest from the content played by the user or the content attribute, the content attribute variation, and the burstiness of the content playback, determine the type of user by setting a threshold value for this concentration value,
Generate a preference vector that expresses the user's preference by a method according to the type of the user,
A user set having a preference vector having a similarity equal to or greater than a certain value with a user's preference vector for information provision,
A server device characterized in that information to be provided to a user who requests information provision is determined from information corresponding to the user included in the user set.

When a user makes an information search request by a client device, the server device receives the request, performs information search processing, and provides information to the client device.
Means for receiving content playback information from the client device when the user plays back the content in the client device;
Content information acquisition means for acquiring content attribute information corresponding to a content ID included in the received content reproduction information from a content information database;
Reproduction information registration means for associating the content reproduction information with the acquired content attribute information and recording it in a content reproduction information database;
A user type determination unit that receives an information search request specifying user information from the client device, and determines a user type of an information search request source user based on content reproduction information recorded in the content reproduction information database;
A well-known degree calculating means for calculating, as a numerical value, a well-known degree of content or content attribute included in the content reproduction information of the information search requesting user;
Based on the content reproduction information of the user recorded in the content reproduction information database and the well-known degree of the content or content attribute, a vector expressing the user's preference is generated according to the type of the user, and this vector is generated by the user. Preference vector generation means for recording in a preference vector information database as a preference vector,
A preference similarity calculating means for calculating a similarity between the preference vector of the information search requesting user and the preference vector of the user recorded in the preference vector information database;
Select a user having a high preference vector similarity with the information search requesting user, and extract information useful for the user who transmitted the information search request from the content or content attribute information included in the content reproduction information of the selected user Search result extraction means for transmitting the extracted information to the client device as an information search result;
The server apparatus characterized by having.

The server device according to claim 6, wherein
The user type is based on the content playback information recorded in the content playback information database, whether the content played back by the user or the content attribute of the user and whether the content playback is concentrated within a specific time range The server apparatus is characterized in that the determination is made based on an index value calculated by reflecting the burst property of content reproduction in which whether the content is continuously reproduced is represented by a numerical value.

The server apparatus according to claim 6 or 7,
The preference vector has a content ID or content attribute of content that the user has played in the past as an element of the vector, and has a reproduction frequency of content having the content ID or content attribute as an element, and a well-known degree of the content or content attribute. In addition, the server apparatus is characterized in that a value given a weight calculated according to a user type of the information search requesting user is used as an element value.

A program characterized in that the information recommendation method or server device according to any one of claims 1 to 8 is written in a computer program and is executable.

A recording medium on which the information recommendation method or server device according to any one of claims 1 to 8 is recorded so as to be executable by a computer.