JP2013020290A

JP2013020290A - Pattern extraction device, pattern extraction method and pattern extraction program

Info

Publication number: JP2013020290A
Application number: JP2011150634A
Authority: JP
Inventors: Seiichi Konya; 精一紺谷; Akimichi Tanaka; 明通田中; Masashi Uchiyama; 匡内山
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: NTT Inc
Priority date: 2011-07-07
Filing date: 2011-07-07
Publication date: 2013-01-31
Anticipated expiration: 2031-07-07
Also published as: JP5545889B2

Abstract

【課題】クラス分類を適切に行なうことができるパターン抽出装置を提供する。
【解決手段】学習データ間の第１の類似度と、前記学習データおよび新規データ間の第２の類似度とを計算する類似度計算部１０２と、学習データの、同じクラスに属するデータの組について、元の空間で類似度の高いものは変換後の距離が小さくなるように集約度を計算するクラス内集約度計算部１０３と、異なるクラスに属するデータの組について、元の空間で類似度の低いものはデータ変換後の距離が大きくなるように分離度を計算するクラス間分離度計算部１０４と、前記クラス内のデータ集約度およびクラス間のデータ分離度が大きくなる特徴空間への変換情報を計算する射影情報計算部１０５と、新規データを、前記第２の類似度および変換情報を用いて、特徴空間に変換するデータ変換部１０７と、前記変換情報および変換された新規データを出力する結果出力部１０８と、を備える。
【選択図】図１A pattern extraction apparatus capable of appropriately classifying a class is provided.
A similarity calculation unit that calculates a first similarity between learning data and a second similarity between the learning data and new data, and a set of data belonging to the same class of learning data In the original space, a high degree of similarity in the original space calculates the degree of aggregation so that the distance after the conversion becomes small, and the degree of similarity is calculated in the original space for a set of data belonging to different classes. For those having a low level, an interclass separation degree calculation unit 104 that calculates a separation degree so that a distance after data conversion becomes large, and conversion to a feature space in which the data aggregation degree in the class and the data separation degree between classes become large A projection information calculation unit 105 that calculates information; a data conversion unit 107 that converts new data into a feature space using the second similarity and conversion information; and the conversion information and the conversion information It was provided with a result output unit 108 for outputting the new data.
[Selection] Figure 1

Description

本発明は教師あり機械学習に関し、特にテキストや画像などのデータをクラス分類するためのパターン抽出装置、方法に関する。 The present invention relates to supervised machine learning, and more particularly to a pattern extraction apparatus and method for classifying data such as text and images.

従来、パターン抽出方法として、非特許文献１に示すように線形判別分析という手法があった。線形判別分析は次のように、与えられたｎ組のデータｘ_i∈Ｒ^m及びそのラベルｙ_i∈｛１，…，ｃ}からクラス分類が行いやすいパターンｚ_i∈Ｒ^c-1，ｃ＜ｍを抽出する。 Conventionally, as a pattern extraction method, there has been a technique called linear discriminant analysis as shown in Non-Patent Document 1. In the linear discriminant analysis, patterns z _i ∈R ^c−1 , c that are easy to classify from given n sets of data x _i ∈R ^m and their labels y _i ∈ {1,. <M is extracted.

ｚ_i＝Ｗ^Tｘ_i，ｉ＝１，…，ｎ（１）
ここで、Ｗ＝（ｗ₁，…，ｗ_c-1），ｗ_i∈Ｒ^m。尚ｃはクラス数、ｍは抽出する特徴の次元数（データの次元）、Ｒは空間を各々示している。 z _i = W ^T x _i , i = 1,..., n (1)
Here, W = (w ₁ ,..., W _c-1 ), w _i ∈R ^m . Note that c represents the number of classes, m represents the number of dimensions of the features to be extracted (data dimensions), and R represents the space.

ｗ_iは、クラス間分散とクラス内分散の比Ｊ（ｗ）を最大化するように選ぶ。 w _i is selected to maximize the ratio J (w) of the interclass variance to the intraclass variance.

すなわち、ｗ＝ａｒｇｍａｘ_wＪ（ｗ）。 That is, w = argmax _w J (w).

これは下記の一般化固有値問題を解くことで求められる。 This can be obtained by solving the following generalized eigenvalue problem.

ｗ_iは、上位ｃ−１個の固有値の固有ベクトルとなる。 w _i is the eigenvector of the upper c-1 eigenvalues.

例えば、入力空間におけるデータ集合を表す図４のようにｃｌａｓｓ１，ｃｌａｓｓ２のデータが与えられると、ｗは図４のｗで示す直線として求められる。この例は、クラス数ｃ＝２で、１（＝ｃ−１）次元の特徴が得られる。抽出されるパターンは図５となる。図５によれば、ｃｌａｓｓ１，ｃｌａｓｓ２がｘ軸上で上手く分離されている事が分かる（図５ではｃｌａｓｓ１，ｃｌａｓｓ２の結果が見やすいように、ｙ軸の値をずらしてある）。 For example, when data of class 1 and class 2 are given as shown in FIG. 4 representing a data set in the input space, w is obtained as a straight line indicated by w in FIG. In this example, the number of classes c = 2, and 1 (= c-1) -dimensional features are obtained. The extracted pattern is shown in FIG. According to FIG. 5, it can be seen that class 1 and class 2 are well separated on the x-axis (in FIG. 5, the y-axis values are shifted so that the results of class 1 and class 2 can be easily seen).

Ｃ．Ｍ．ビショップ，「パターン認識と機械学習上」，シュプリンガー・ジャパン株式会社，ｐｐ．１７７−１９０C. M.M. Bishop, “Pattern Recognition and Machine Learning,” Springer Japan, pp. 177-190

上述した線形判別分析では、
１．各クラスの平均値を分離する特徴しか得られない
２．クラス数−１の次元の特徴しか得られない
３．各クラスがガウス分布を仮定しているため、多峰性のデータにフィットしない
という問題があった。 In the linear discriminant analysis described above,
1. Only features that separate the average value of each class are obtained. 2. Only dimensional features of class number -1 can be obtained. Since each class assumed a Gaussian distribution, there was a problem that it did not fit multimodal data.

特に、図６のようなｃｌａｓｓ１，ｃｌａｓｓ２のデータが与えられると、ｃｌａｓｓ１，ｃｌａｓｓ２の平均値が等しくなるため、Ｓ_Bがゼロ行列となり、前記式（８）の解が求まらないため、クラスを分離する特徴が得られない。 In particular, when data of class 1 and class 2 as shown in FIG. 6 are given, the average values of class 1 and class 2 become equal, so that S _B becomes a zero matrix, and the solution of equation (8) is obtained. As a result, the class separating feature cannot be obtained.

本発明は、上記問題を解決するものであり、クラス分類を適切に行なうことができるパターン抽出装置、方法、プログラムを提供することを目的としている。 SUMMARY OF THE INVENTION The present invention solves the above-described problems, and an object thereof is to provide a pattern extraction apparatus, method, and program that can perform class classification appropriately.

上記課題を解決するための本発明のパターン抽出装置は、入力データをクラス分類するためのパターン抽出装置であって、学習時に入力された学習データ間の第１の類似度と、前記学習データおよび分類時に入力された新規データ間の第２の類似度とを計算する類似度計算手段と、学習時に入力された、同じクラスに属するデータの組について、元の空間で類似度の高いものはデータ変換後の距離が小さくなるように集約度を計算するクラス内集約度計算手段と、学習時に入力された、異なるクラスに属するデータの組について、元の空間で類似度の低いものはデータ変換後の距離が大きくなるように分離度を計算するクラス間分離度計算手段と、前記クラス内集約度計算手段により計算されたクラス内のデータ集約度およびクラス間分離度計算手段により計算されたクラス間のデータ分離度が大きくなる特徴空間への変換情報を計算する射影情報計算手段と、分類時に入力された新規データを、前記類似度計算手段によって計算された第２の類似度および射影情報計算手段によって計算された変換情報を用いて、クラス内のデータの集約度およびクラス間のデータの分離度が大きくなる特徴空間に変換するデータ変換手段と、前記射影情報計算手段によって計算された変換情報およびデータ変換手段によって変換された新規データを出力する結果出力手段と、を備えたことを特徴としている。 A pattern extraction device of the present invention for solving the above-mentioned problem is a pattern extraction device for classifying input data, wherein the first similarity between learning data input during learning, the learning data, Similarity calculation means for calculating the second similarity between new data input at the time of classification, and data sets belonging to the same class that are input at the time of learning are those having high similarity in the original space Intraclass aggregation degree calculation means for calculating the degree of aggregation so that the distance after conversion becomes small, and a set of data belonging to different classes input during learning, those with low similarity in the original space are after data conversion Class separation degree calculating means for calculating the degree of separation so that the distance between the classes increases, and the data aggregation degree and the class separation degree within the class calculated by the intra-class aggregation degree calculation means Projection information calculation means for calculating conversion information into a feature space in which the degree of data separation between classes calculated by the calculation means is large, and second data calculated by the similarity calculation means for new data input at the time of classification. Using the conversion information calculated by the similarity and projection information calculation means, the data conversion means for converting into a feature space in which the degree of aggregation of data within a class and the degree of separation of data between classes are large, and the projection information calculation And a result output means for outputting the conversion information calculated by the means and the new data converted by the data conversion means.

上記構成によれば、クラス内集約度計算手段は各クラスの分散ではなく、元の空間で類似度の高いものは変換後の距離が小さくなるように評価するため、多峰性などの非ガウス分布のデータに対応できる。また、本発明のクラス間分離度計算手段は各クラスの平均ではなく、元の空間で類似度の低いものは変換後の距離が大きくなるように評価するため、各クラスの平均値が等しい場合にも対応できる。 According to the above configuration, the intra-class aggregation degree calculation means is not a variance of each class, but evaluates an object with high similarity in the original space so that the distance after the conversion becomes small. Can handle distribution data. In addition, since the interclass separation degree calculation means of the present invention is not the average of each class, but evaluates an object having a low similarity in the original space so that the distance after conversion becomes large, the average value of each class is equal. Can also be supported.

また、本発明の射影情報計算手段は、各クラスの平均値間の関係ではなく、類似度計算手段で求めたデータ間の関係を基に特徴空間を計算するため、クラス数以上の特徴を求めることができる。 In addition, the projection information calculation means of the present invention calculates the feature space based on the relationship between the data obtained by the similarity calculation means, not the relationship between the average values of each class, and thus obtains a feature that exceeds the number of classes. be able to.

さらに、本発明のデータ変換手段は、射影情報計算手段で計算した情報を用いてデータを特徴量空間に変換する。こうすることで、新規のデータについてもクラス分類に適した特徴量空間に変換できる。 Furthermore, the data conversion means of the present invention converts the data into the feature amount space using the information calculated by the projection information calculation means. In this way, new data can be converted into a feature amount space suitable for classification.

本発明によれば、次のような効果が得られる。
（１）各クラスの平均値に依存せず、平均値が等しくても分離することができる。
（２）抽出する特徴の次元は次元数ｋ（≦ｍ）によって変えることができ、分類に有効な特徴が得られる。
（３）ガウス分布の過程をしていないので、多峰性のデータにもフィットする。 According to the present invention, the following effects can be obtained.
(1) It does not depend on the average value of each class and can be separated even if the average values are equal.
(2) The dimension of the feature to be extracted can be changed by the number of dimensions k (≦ m), and a feature effective for classification can be obtained.
(3) Since the process of Gaussian distribution is not performed, it fits to multimodal data.

本発明の一実施形態例によるパターン抽出装置のブロック図。1 is a block diagram of a pattern extraction device according to an example embodiment of the present invention. 本発明の一実施形態例によるパターン抽出方法を示す学習時のフローチャート。The flowchart at the time of the learning which shows the pattern extraction method by one Embodiment of this invention. 本発明の一実施形態例によるパターン抽出方法を示す分類時のフローチャート。The flowchart at the time of the classification | category which shows the pattern extraction method by one Example of this invention. 従来法を説明する第１の図であり、入力空間における２クラスのデータ集合を表す説明図。It is the 1st figure explaining the conventional method, and is explanatory drawing showing the data set of 2 classes in input space. 従来法を説明する第２の図であり、抽出されるパターンの説明図。It is a 2nd figure explaining the conventional method, and is explanatory drawing of the pattern extracted. 従来法の問題点を説明する図であり、入力空間における２クラスのデータ集合を表す説明図。It is a figure explaining the problem of the conventional method, and explanatory drawing showing the data set of 2 classes in input space. 本発明の動作を説明する第１の図であり、射影情報α¹を表す説明図。It is a 1st figure explaining operation | movement of this invention, and is explanatory drawing showing projection information (alpha) ¹ . 本発明の動作を説明する第２の図であり、新規データに対してクラス分類に有効な特徴が得られているようすを示す説明図。It is the 2nd figure explaining operation | movement of this invention, and is explanatory drawing which shows that the characteristic effective for a classification is obtained with respect to new data.

以下、図面を参照しながら本発明の実施の形態を説明するが、本発明は下記の実施形態例に限定されるものではない。本発明のパターン抽出装置は、図１のブロック図に示すように、入力される各種データおよびパラメータを取り込むデータ入力部１０１、類似度計算手段としての類似度計算部１０２、クラス内集約度計算手段としてのクラス内集約度計算部１０３、クラス間分離度計算手段としてのクラス間分離度計算部１０４、射影情報計算手段としての射影情報計算部１０５、各種データおよびパラメータが格納される蓄積部１０６、データ変換手段としてのデータ変換部１０７および結果出力手段としての結果出力部１０８を備えている。 Hereinafter, embodiments of the present invention will be described with reference to the drawings, but the present invention is not limited to the following embodiments. As shown in the block diagram of FIG. 1, the pattern extraction apparatus of the present invention includes a data input unit 101 for capturing various input data and parameters, a similarity calculation unit 102 as a similarity calculation unit, and an intra-class aggregation level calculation unit. An intra-class aggregation degree calculation unit 103, an inter-class separation degree calculation unit 104 as an inter-class separation degree calculation unit, a projection information calculation unit 105 as a projection information calculation unit, a storage unit 106 in which various data and parameters are stored, A data conversion unit 107 as data conversion means and a result output unit 108 as result output means are provided.

図１のパターン抽出装置で実施されるパターン抽出のアルゴリズムは以下の手順からなる。
Ａｌｇｏｒｉｔｈｍ１学習時の手順
Ｒｅｑｕｉｒｅ：Ｘ，ｙ，ｋ，τ，ζ，η
１：類似度行列の計算
２：クラス内集約度の計算
３：クラス間分離度の計算
４：射影情報の計算
５：Ｘ，ｋ，τ，α^j，ｂの蓄積
６：α^jの出力

Ａｌｇｏｒｉｔｈｍ２分類時の手順
Ｒｅｑｕｉｒｅ：Ｚ
１：類似度行列の計算
２：データ変換の実行
３：ｆ（ｚ_u）の出力

上記「Ａｌｇｏｒｉｔｈｍ１学習時の手順」は図２のフローチャートのステップＳ２０１〜Ｓ２０７で示され、「Ａｌｇｏｒｉｔｈｍ２分類時の手順」は図３のフローチャートのステップＳ３０１〜Ｓ３０４で示される。 The pattern extraction algorithm implemented by the pattern extraction apparatus of FIG.
Algorithm 1 Learning Procedure Required: X, y, k, τ, ζ, η
1: Calculation of similarity matrix 2: Calculation of intra-class aggregation degree 3: Calculation of separation between classes 4: Calculation of projection information 5: Accumulation of X, k, τ, α ^j , b 6: Output of α ^j

Algorithm 2 Classification Request: Z
1: Calculation of similarity matrix 2: Execution of data conversion 3: Output of f (z _u )

The above “procedure for learning Algorithm 1” is shown in steps S201 to S207 in the flowchart of FIG. 2, and the “procedure for sorting Algorithm 2” is shown in steps S301 to S304 of the flowchart in FIG.

図１のパターン抽出装置は、例えばコンピュータにより構成され、通常のコンピュータのハードウェアリソース、例えばＲＯＭ，ＲＡＭ，ＣＰＵ、入力装置、出力装置、通信インターフェース、ハードディスク、記録媒体およびその駆動装置を備えている。 The pattern extraction apparatus of FIG. 1 is configured by a computer, for example, and includes hardware resources of a normal computer such as a ROM, a RAM, a CPU, an input device, an output device, a communication interface, a hard disk, a recording medium, and a driving device thereof. .

このハードウェアリソースとソフトウェアリソース（ＯＳ、アプリケーションなど）との協働の結果、本実施形態例のパターン抽出装置は、図１に示すように、データ入力部１０１、類似度計算部１０２、クラス内集約度計算部１０３、クラス間分離度計算部１０４、射影情報計算部１０５、蓄積部１０６、データ変換部１０７および結果出力部１０８を実装する。 As a result of the cooperation between the hardware resource and the software resource (OS, application, etc.), the pattern extraction apparatus according to the present embodiment has a data input unit 101, a similarity calculation unit 102, an in-class as shown in FIG. An aggregation degree calculation unit 103, an interclass separation degree calculation unit 104, a projection information calculation unit 105, a storage unit 106, a data conversion unit 107, and a result output unit 108 are mounted.

前記蓄積部１０６は、ハードディスクあるいはＲＡＭなどの保存手段・記憶手段で構成されているものとする。 It is assumed that the storage unit 106 includes a storage unit / storage unit such as a hard disk or a RAM.

次に上記のように構成された装置の詳細を具体的に説明する。本実施形態例では、学習時に、データＸ＝（ｘ₁，…，ｘ_n），ｘ_i∈Ｒ^m，ラベルｙ＝（ｙ₁，…，ｙ_n），ｙ_i∈{１，…，ｃ}（ｃはクラス数），抽出する特徴の次元数ｋ（０＜ｋ＜ｍ），類似度計算のパラメータτ＞０，クラス内集約度のパラメータζ≧０，及びクラス間分離度のパラメータη≧０を入力とし、射影情報計算部１０５により計算された射影情報α^j，ｂ_j，ｊ＝１，…，ｋを出力し、分類時に、新規データＺを入力とし、データ変換部１０７によって変換された新規データｆ（ｚ_u）を出力する。 Next, the details of the apparatus configured as described above will be specifically described. In this embodiment, at the time of learning, data X = (x ₁ ,..., X _n ), x _i ∈R ^m , label y = (y ₁ ,..., Y _n ), y _i ∈ {1,. } (C is the number of classes), dimension number k of extracted features (0 <k <m), similarity calculation parameter τ> 0, intraclass aggregation parameter ζ ≧ 0, and interclass separation parameter η ≧ 0 is input, projection information α ^j , b _j , j = 1,..., K calculated by the projection information calculation unit 105 is output. At the time of classification, new data Z is input and converted by the data conversion unit 107 The new data f (z _u ) thus output is output.

データ入力部１０１は、ネットワークまたはファイルなどから、学習時には学習データＸ，ラベルｙ，パラメータｋ，τ，ζ，ηを入力し、分類時には新規データＺを入力する。 The data input unit 101 inputs learning data X, label y, parameters k, τ, ζ, and η during learning from a network or a file, and inputs new data Z during classification.

蓄積部１０５には、データ入力部１０１から入力されたデータＸ，パラメータｋ，τ，及び射影情報計算部１０５により計算された射影情報α^j，ｂ_j，ｊ＝１，…，ｋが蓄積される。 The storage unit 105 stores data X, parameters k and τ input from the data input unit 101, and projection information α ^j , b _j , j = 1,..., K calculated by the projection information calculation unit 105. The

類似度計算部１０２は学習時に入力された、学習データＸの類似度行列Ω（第１の類似度）、及び学習データＸと分類時に入力された新規データＺとの類似度行列Ω^new（第２の類似度）を計算する。 The similarity calculation unit 102 inputs the similarity matrix Ω (first similarity) of the learning data X input at the time of learning and the similarity matrix Ω ^new (the first similarity data between the learning data X and the new data Z input at the time of classification). 2).

ω_ij＝ｅｘｐ{−τ‖ｘ_i−ｘ_j‖} （９）
は、学習データｘ_iと学習データｘ_jの類似度である。 ω _ij = exp {−τ‖x _i −x _j ‖} (9)
Is the similarity between the learning data x _i and the learning data x _j .

ω_ij ^new＝ｅｘｐ{−τ‖ｘ_i−ｚ_j‖} （１０）
は、学習データｘ_iと新規データｚ_jの類似度である。 ω _ij ^new = exp {−τ‖x _i −z _j ‖} (10)
Is the similarity between the learning data x _i and the new data z _j .

ここで、‖・‖はユークリッドノルムである。 Here, ‖ and ‖ are Euclidean norms.

クラス内集約度計算部１０３は、学習時に入力された、同じクラスに属するデータの組について、元の空間で類似度の高いものはデータ変換後の距離が小さくなるように集約度を計算するものであり、クラス内集約度行列Ｌ^W、すなわち同一のクラスに属するデータを近くに配置するための項、を計算する。 The intra-class aggregation degree calculation unit 103 calculates the degree of aggregation so that the distance between the data sets that belong to the same class input at the time of learning is high in the original space so that the distance after data conversion becomes small And calculate an intra-class intensity matrix L ^W , that is, a term for arranging data belonging to the same class nearby.

Ｌ^W＝Ｄ^W−Ｓ^W （１１）
ここで、 L ^W = D ^W −S ^W (11)
here,

は、同一クラスに属するデータ間の類似度を示す。 Indicates the similarity between data belonging to the same class.

また、 Also,

は、同一クラス内で密集した（すなわち同じクラス内のデータから高い類似度で参照される）データを重視するための項である。 Is a term for emphasizing data that is dense within the same class (that is, is referenced with high similarity from data within the same class).

クラス間分離度計算部１０４は、学習時に入力された、異なるクラスに属するデータの組について、元の空間で類似度の低いものはデータ変換後の距離が大きくなるように分離度を計算するものであり、クラス間分離度行列Ｌ^B、すなわち異なるクラスに属するデータを遠くに配置するための項、を計算する。 The interclass separation degree calculation unit 104 calculates the degree of separation so that the distance after data conversion becomes large for a set of data belonging to different classes input at the time of learning and having a low similarity in the original space The interclass separation matrix L ^B , that is, a term for disposing data belonging to different classes far away is calculated.

Ｌ^B＝Ｄ^B−Ｓ^B （１４）
ここで、 L ^B = D ^B −S ^B (14)
here,

は、異なるクラスに属するデータ間の非類似度（距離が大きいものほど値が大きく、距離の小さいものは値が小さい）を示す。 Indicates the degree of dissimilarity between data belonging to different classes (the value increases as the distance increases, and the value decreases as the distance decreases).

また、 Also,

は、クラス間で隔たりが大きい（すなわち異なるクラスのデータから遠い距離で参照される）データを重視するための項である。 Is a term for emphasizing data having a large gap between classes (that is, referred to at a distance far from data of different classes).

射影情報計算部１０５は、前記クラス内集約度計算部１０３により計算されたクラス内のデータ集約度およびクラス間分離度計算部１０４により計算されたクラス間のデータ分離度が大きくなるような特徴空間を求め、その特徴空間への変換情報、すなわち射影情報α^j，ｂ_j，ｊ＝１，…，ｋを計算する。 The projection information calculation unit 105 has a feature space in which the data intensity within the class calculated by the intra-class aggregation degree calculation unit 103 and the data separation degree between classes calculated by the inter-class separation degree calculation unit 104 are increased. And conversion information into the feature space, that is, projection information α ^j , b _j , j = 1,..., K is calculated.

ここで、α^jは、クラス内のデータの集約度、およびクラス間のデータの分離度が大きくなる空間に変換した学習データである。 Here, α ^j is learning data converted into a space in which the degree of aggregation of data within a class and the degree of separation of data between classes are increased.

また、ｂ_jは、新規データをクラス内のデータの集約度、およびクラス間のデータの分離度が大きくなる空間に変換するためのパラメータである。 Further, b _j is a parameter for converting new data into a space in which the degree of aggregation of data in a class and the degree of separation of data between classes are increased.

これは、下記の固有値問題を解くことで求められる。 This can be obtained by solving the following eigenvalue problem.

ＬＭΩα＝λα （１７）
α^jは、上位ｋ個の固有値に対応した固有ベクトルとなる。 LMΩα = λα (17)
α ^j is an eigenvector corresponding to the top k eigenvalues.

ここで、 here,

また、 Also,

データ変換部１０７は、類似度計算部１０２によって計算された学習データＸと新規データＺとの類似度行列Ω^newと、射影情報計算部１０５によって計算された射影情報α^j，ｂ_jを用いて、 The data converter 107 uses the similarity matrix Ω ^new between the learning data X and the new data Z calculated by the similarity calculator 102 and the projection information α ^j and b _j calculated by the projection information calculator 105. ,

結果出力部１０８は、ネットワーク、またはファイルなどに、学習時は、蓄積部１０６に蓄積された射影情報α^j，ｂ_j，ｊ＝１，…，ｋを出力し、分類時はデータ変換部１０７によって変換された新規データｆ（ｚ₁）、…、ｆ（ｚ_l）を出力する。 The result output unit 108 outputs projection information α ^j , b _j , j = 1,..., K stored in the storage unit 106 during learning to a network or a file, and the data conversion unit 107 during classification. The new data f (z ₁ ),..., F (z ₁ ) converted by the above is output.

次に、上記のように構成された装置の動作を、学習時のフローチャートを示す図２、および分類時のフローチャートを示す図３とともに説明する。 Next, the operation of the apparatus configured as described above will be described with reference to FIG. 2 showing a flowchart for learning and FIG. 3 showing a flowchart for classification.

＜学習時＞
学習時にデータ入力部１０１は、例えば表１に示す学習データＸ、及びラベルｙを入力する（図６と同じもの）。ここで、データの次元ｍ＝２、データ数ｎ＝２０。 <During learning>
At the time of learning, the data input unit 101 inputs, for example, the learning data X and the label y shown in Table 1 (the same as in FIG. 6). Here, the data dimension m = 2 and the number of data n = 20.

また、他のパラメータ
ｋ＝１（２３）
τ＝１（２４）
ζ＝１（２５）
η＝１（２６）
を入力する（ステップＳ２０１）。 Other parameters k = 1 (23)
τ = 1 (24)
ζ = 1 (25)
η = 1 (26)
Is input (step S201).

次に類似度計算部１０２は、前記式（９）に従って学習データＸの類似度行列Ωを計算する（ステップＳ２０２）。 Next, the similarity calculation unit 102 calculates the similarity matrix Ω of the learning data X according to the equation (9) (step S202).

次にクラス内集約度計算部１０３は、前記式（１１）に従ってクラス内集約度Ｌ^Wを計算する（ステップＳ２０３）。 Next, the intra-class aggregation degree calculation unit 103 calculates the intra-class aggregation degree L ^W according to the equation (11) (step S203).

次にクラス間分離度計算部１０４は、前記式（１４）に従ってクラス間分離度Ｌ^Bを計算する（ステップＳ２０４）。 Next, the inter-class separation degree calculation unit 104 calculates the inter-class separation degree L ^B according to the equation (14) (step S204).

次に射影情報計算部１０５は、前記式（１７）の固有値問題を解いて射影情報を計算する（ステップＳ２０５）。ｋ＝１なので、最大固有値に対応した固有ベクトルを求める。 Next, the projection information calculation unit 105 calculates the projection information by solving the eigenvalue problem of the equation (17) (step S205). Since k = 1, the eigenvector corresponding to the maximum eigenvalue is obtained.

最大固有値は、λ₁＝６．６４９９８と計算され、またその最大固有値に対応する固有ベクトルα¹を表２に示す。 The maximum eigenvalue is calculated as λ ₁ = 6.664998, and the eigenvector α ¹ corresponding to the maximum eigenvalue is shown in Table 2.

次に、前記式（２１）に従ってｂ₁を計算すると、ｂ₁＝−７．５１５４９×１０^-16となる。 Next, when b ₁ is calculated according to the equation (21), b ₁ = −7.51549 × 10 ⁻¹⁶ is obtained.

次にステップＳ２０６において、データＸ，パラメータｋ，τ，及び射影情報α¹，ｂ₁が蓄積部１０６に蓄積される。 In step S <b> 206, the data X, parameters k, τ, and projection information α ¹ , b ₁ are stored in the storage unit 106.

次に結果出力部１０８は、前記射影情報α¹を出力する（ステップＳ２０７）。 Next, the result output unit 108 outputs the projection information α ¹ (step S207).

射影情報α¹を図７に示す。射影情報α¹はクラス内のデータの集約度およびクラス間のデータの分離度が大きくなる空間に変換した学習データであり、図７によれば、ｃｌａｓｓ１とｃｌａｓｓ２がｘ軸で上手く分離されていることが分かる（尚図７では、ｃｌａｓｓ１とｃｌａｓｓ２の結果が見やすいように、ｙ軸の値をずらしてある）。 The projection information α ¹ is shown in FIG. The projection information α ¹ is learning data converted into a space in which the degree of aggregation of data within a class and the degree of separation of data between classes are large. According to FIG. 7, class 1 and class 2 are well separated on the x axis. (In FIG. 7, the y-axis value is shifted so that the results of class 1 and class 2 are easy to see).

＜分類時＞
分類時にデータ入力部１０１は、新規データＺ＝（ｚ₁，…，ｚ_l）を入力する（ステップＳ３０１）。Ｚは[−１，４]×[−１，４]の点とする。 <At the time of classification>
At the time of classification, the data input unit 101 inputs new data Z = (z ₁ ,..., Z _l ) (step S301). Z is a point of [-1, 4] x [-1, 4].

次に類似度計算部１０２は、前記式（１０）に従って学習データＸと新規データＺの類似度行列Ω^newを計算する（ステップＳ３０２）。学習データＸ，及びτは蓄積部１０６から取得する。 Next, the similarity calculation unit 102 calculates a similarity matrix Ω ^new between the learning data X and the new data Z according to the equation (10) (step S302). The learning data X and τ are acquired from the storage unit 106.

次にデータ変換部１０７は、入力データｚ_uを前記式（２２）に従ってｆ（ｚ_u）∈Ｒ^kに変換する（ステップＳ３０３）。この際、ｋ，α，ｂは蓄積部１０６から取得し、Ω^newは類似度計算部１０２から取得する。この例ではｋ＝１である。 Next, the data converter 107 converts the input data z _u into f (z _u ) εR ^k according to the equation (22) (step S303). At this time, k, α, and b are acquired from the storage unit 106, and Ω ^new is acquired from the similarity calculation unit 102. In this example, k = 1.

新規データＺに対応した値ｆ（Ｚ）を図８に示す。データ変換部１０７は、入力された新規データｚ_uをクラス内のデータの集約度およびクラス間のデータの分離度が大きくなる空間Ｒ^kに変換しているため、図８のようにｃｌａｓｓ１の近くの領域で値が０より大きく（白に近く）、ｃｌａｓｓ２の近くの領域で値が０より小さく（黒に近く）なっている。 A value f (Z) corresponding to the new data Z is shown in FIG. Since the data conversion unit 107 converts the input new data z _u into a space R ^{k in} which the degree of aggregation of the data in the class and the degree of separation of the data between the classes are increased, the data of the class 1 as shown in FIG. The value is larger than 0 (close to white) in the nearby region, and the value is smaller than 0 (close to black) in the region near class 2.

したがって図８によれば、新規データに対してもクラス分類に有効な特徴が得られているのが分かる。 Therefore, according to FIG. 8, it can be seen that features effective for classification are obtained even for new data.

また、本実施形態のパターン抽出装置における各手段の一部もしくは全部の機能をコンピュータのプログラムで構成し、そのプログラムをコンピュータを用いて実行して本発明を実現することができること、本実施形態のパターン抽出方法における手順をコンピュータのプログラムで構成し、そのプログラムをコンピュータに実行させることができることは言うまでもなく、コンピュータでその機能を実現するためのプログラムを、そのコンピュータが読み取り可能な記録媒体、例えばＦＤ（Ｆｌｏｐｐｙ（登録商標）Ｄｉｓｋ）や、ＭＯ（Ｍａｇｎｅｔｏ−Ｏｐｔｉｃａｌｄｉｓｋ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、メモリカード、ＣＤ（ＣｏｍｐａｃｔＤｉｓｋ）−ＲＯＭ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）−ＲＯＭ、ＣＤ−Ｒ、ＣＤ−ＲＷ、ＨＤＤ、リムーバブルディスクなどに記録して、保存したり、配布したりすることが可能である。また、上記のプログラムをインターネットや電子メールなど、ネットワークを通して提供することも可能である。 In addition, a part or all of the functions of each unit in the pattern extraction apparatus of the present embodiment can be configured by a computer program, and the program can be executed using the computer to realize the present invention. It goes without saying that the procedure in the pattern extraction method can be configured by a computer program and the program can be executed by the computer, and the program for realizing the function by the computer can be read by a computer-readable recording medium such as an FD. (Floppy (registered trademark) Disk), MO (Magneto-Optical disk), ROM (Read Only Memory), memory card, CD (Compact Disk) -ROM, DVD (Digital Versati) e Disk) -ROM, CD-R, CD-RW, HDD, and recorded in a removable disk, or stored, it is possible or distribute. It is also possible to provide the above program through a network such as the Internet or electronic mail.

１０１…データ入力部
１０２…類似度計算部
１０３…クラス内集約度計算部
１０４…クラス間分離度計算部
１０５…射影情報計算部
１０６…蓄積部
１０７…データ変換部
１０８…結果出力部 DESCRIPTION OF SYMBOLS 101 ... Data input part 102 ... Similarity calculation part 103 ... Intraclass aggregation degree calculation part 104 ... Interclass separation degree calculation part 105 ... Projection information calculation part 106 ... Accumulation part 107 ... Data conversion part 108 ... Result output part

Claims

A pattern extraction device for classifying input data,
Similarity calculating means for calculating a first similarity between learning data input during learning and a second similarity between the learning data and new data input during classification;
With respect to a set of data belonging to the same class input at the time of learning, an intra-class aggregation degree calculation means for calculating an aggregation degree so that a distance after data conversion is small in a high similarity in the original space,
For a set of data belonging to different classes input at the time of learning, an inter-class separability calculating means for calculating a separability so that the distance after data conversion is large for those with low similarity in the original space,
Projection information calculation means for calculating conversion information into a feature space in which the data intensity within the class calculated by the intra-class aggregation degree calculation means and the data separation degree between classes calculated by the class separation degree calculation means are large. When,
Using the second similarity calculated by the similarity calculation means and the conversion information calculated by the projection information calculation means, the degree of data aggregation within the class and the data between classes are used. Data conversion means for converting into a feature space in which the degree of separation of
A result output means for outputting the conversion information calculated by the projection information calculation means and the new data converted by the data conversion means;
A pattern extraction apparatus comprising:

The learning data label is y, the dimension number of features to be extracted is k, the similarity calculation parameter is τ, the intra-class aggregation parameter is ζ, and the inter-class separation parameter is η,
The similarity calculation means obtains the first similarity by calculating a similarity matrix Ω of learning data X input at the time of learning, and the similarity between the learning data X and new data Z input at the time of classification By calculating the degree matrix Ω ^new , the second similarity is obtained,
The intra-class aggregation degree calculation means calculates the similarity between data belonging to the same class.
age,
A section for emphasizing dense data within the same class
And the intra-class aggregation matrix L ^W = D ^W −S ^W
The interclass separation degree calculation means calculates the dissimilarity between data belonging to different classes.
age,
A term for emphasizing data with large gaps between classes
And the interclass separation matrix L ^B = D ^B −S ^B is calculated as the separation,
The projection information calculation means has the following eigenvalue problem, that is,
LMΩα = λα (17)
The learning data α ^j converted to a feature space that increases the degree of data aggregation and the degree of data separation between classes, and the new data A parameter b _j (where j = 1,..., K) for conversion into a feature space with a high degree of separation is calculated as the conversion information;
The data conversion means includes
The pattern extraction apparatus according to claim 1, wherein the new data is converted into a feature space R ^{k in} which the degree of aggregation of data within a class and the degree of separation of data between classes are increased by calculating

A pattern extraction method for classifying input data,
A first similarity calculating step in which a similarity calculating means calculates a first similarity between learning data input at the time of learning;
Intra-class aggregation level calculation means, for a set of data belonging to the same class that was input during learning, those with high similarity in the original space calculate the aggregation level so that the distance after data conversion is small An aggregation calculation step;
Inter-class separability calculation means calculates the separability of data sets that belong to different classes that were input during learning so that the distance after data conversion is greater for those with low similarity in the original space A separation degree calculating step;
Projection information calculation means includes conversion information to a feature space in which the data intensity within the class calculated by the intra-class aggregation degree calculation means and the data separation degree between classes calculated by the inter-class separation degree calculation means are increased. A projection information calculation step to calculate,
A first result output step in which the result output means outputs the conversion information calculated by the projection information calculation means;
A second similarity calculating step in which a similarity calculating means calculates a second similarity between the learning data and the new data input at the time of classification;
The data conversion means uses the second similarity calculated by the similarity calculation means and the conversion information calculated by the projection information calculation means for the new data input at the time of classification, and the degree of data aggregation in the class And a data conversion step for converting to a feature space that increases the degree of separation of data between classes,
A second result output step in which the result output means outputs the data converted by the data conversion means;
A pattern extraction method characterized by comprising:

The learning data label is y, the dimension number of features to be extracted is k, the similarity calculation parameter is τ, the intra-class aggregation parameter is ζ, and the inter-class separation parameter is η,
In the first similarity calculation step, the first similarity is obtained by calculating a similarity matrix Ω of learning data X input during learning,
In the intra-class aggregation degree calculation step, similarity between data belonging to the same class is calculated.
age,
A section for emphasizing dense data within the same class
And the intra-class aggregation matrix L ^W = D ^W −S ^W
In the interclass separation degree calculation step, dissimilarity between data belonging to different classes is calculated.
age,
A term for emphasizing data with large gaps between classes
And the interclass separation matrix L ^B = D ^B −S ^B is calculated as the separation,
The projection information calculation step includes the following eigenvalue problem:
LMΩα = λα (17)
The learning data α ^j converted to a feature space that increases the degree of data aggregation and the degree of data separation between classes, and the new data A parameter b _j (where j = 1,..., K) for conversion into a feature space with a high degree of separation is calculated as the conversion information;
The first result output step outputs conversion information α ^j and b _j calculated by the projection information calculation means,
In the second similarity calculation step, the second similarity is obtained by calculating a similarity matrix Ω ^new between the learning data X and the new data Z input at the time of classification,
The data conversion step includes
To convert the new data into a feature space R ^k that increases the degree of aggregation of data within the class and the degree of data separation between classes,
The pattern extraction method according to claim 3, wherein the second result output step outputs new data converted by the data conversion unit.

A pattern extraction program for causing a computer to function as each means according to claim 1.