JP2019061494A

JP2019061494A - INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM

Info

Publication number: JP2019061494A
Application number: JP2017185470A
Authority: JP
Inventors: 友貴藤森; Tomoki Fujimori
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2017-09-26
Filing date: 2017-09-26
Publication date: 2019-04-18

Abstract

【課題】識別精度の高い識別モデルを生成できるようにすること。
【解決手段】本発明は、複数の特徴量の中から組み合わせを異ならせながら複数回特徴量を選択する。そして、特徴量の組み合わせを異ならせたそれぞれでノイズデータであるかを判断し、その判断結果を統合してノイズデータを決定する。
【選択図】図５PROBLEM TO BE SOLVED: To generate a discrimination model with high discrimination accuracy.
According to the present invention, a plurality of feature quantities are selected while making a combination different among a plurality of feature quantities. Then, different combinations of feature amounts are used to determine whether the data is noise data, and the determination results are integrated to determine noise data.
[Selected figure] Figure 5

Description

本発明は、識別モデルの学習で使用される学習データを決定する技術に関する。 The present invention relates to a technique for determining learning data used in learning of a discrimination model.

対象物を撮影した画像から画素値の平均や分散といった多様な特徴量群を用いて予め生成した識別モデルにより、良否判定（良品と不良品の２クラス判別）を行う手法がある。識別モデルを生成する際、識別モデルを生成するための学習データのラベルを誤って設定してしまった場合、識別モデルを適切に生成することができず、良否判定の精度が低下するという問題があった。 There is a method of performing pass / fail judgment (two-class judgment of non-defective product and defective product) based on an identification model generated in advance using various feature amount groups such as average and variance of pixel values from an image obtained by photographing an object. When generating a classification model, if the label of learning data for generating a classification model is set incorrectly, the classification model can not be generated properly, and the accuracy of the pass / fail judgment is lowered. there were.

非特許文献１で開示されている手法では、予め設定した特徴量を抽出した上で、データセットを複数のデータセットに分割し、データセットごとに学習用データと検証用データに分ける。そして、分割したデータセットごとに学習用データを用いて識別器を学習し、各データの認識スコアを算出する。さらに、データごとの認識スコアを統合し、閾値処理によりノイズデータ及びグッドデータを決定する。ここで、ノイズデータとは、正常データであるのにもかかわらず異常データのラベルが付いているデータであり、グッドデータとは、正常データの中でより正常らしいデータである。そして、決定したノイズデータとグッドデータを学習データから除去する。以上の処理を複数回繰り返すことで、学習データから決定したノイズデータとグッドデータを除去し続ける。最後に、除去されたグッドデータを元に戻すことにより、学習データを決定する。 In the method disclosed in Non-Patent Document 1, after extracting a feature amount set in advance, a data set is divided into a plurality of data sets, and divided into learning data and verification data for each data set. Then, a classifier is learned using learning data for each divided data set, and a recognition score of each data is calculated. Furthermore, the recognition score for each data is integrated, and noise data and good data are determined by threshold processing. Here, the noise data is data that is labeled as abnormal data despite being normal data, and the good data is data that seems more normal among the normal data. Then, the determined noise data and good data are removed from the learning data. By repeating the above processing a plurality of times, noise data and good data determined from the learning data are continuously removed. Finally, the learning data is determined by restoring the removed good data.

特許第５４１４４１６号公報Patent No. 5414416 gazette

Ｘ．Ｚｈｕ，Ｘ．Ｗｕ，Ｑ．Ｃｈｅｎら，“Ｅｌｉｍｉｎａｔｉｎｇｃｌａｓｓｎｏｉｓｅｉｎｌａｒｇｅｄａｔａｓｅｔｓ，”第２０回Ｉｎｔ．Ｃｏｎｆ．ＭａｃｈｉｎｅＬｅａｒｎｉｎｇ，ワシントンＤＣ，２００３年８月，９２０〜９２７頁．X. Zhu, X. Wu, Q. Chen et al., "Eliminating class noise in large datasets," 20th Int. Conf. Machine Learning, Washington, DC, August 2003, pages 920-927.

しかしながら、非特許文献１では、特徴量を固定化した上で認識スコアを求めてノイズデータとグッドデータを決定している。ここでの、ノイズデータ、グッドデータの決定は特徴量に依存しているので、特徴量を固定化した場合、１つの基準でしかノイズデータ及びグッドデータを決定しないこととなる。このようにして決定されたノイズデータ及びグッドデータに基づいて学習データを決定する手法では、その学習データに基づいて生成される識別モデルによる識別精度が低いという問題があった。 However, in Non-Patent Document 1, after fixing the feature amount, the recognition score is obtained to determine the noise data and the good data. Since the determination of the noise data and the good data here depends on the feature amount, when the feature amount is fixed, the noise data and the good data are determined by only one reference. In the method of determining learning data based on the noise data and the good data determined in this manner, there is a problem that the identification accuracy by the identification model generated based on the learning data is low.

本発明は、第１の生成手段が入力データを識別するための第１の識別モデルを生成する際に用いられる学習データを決定するための情報処理装置であって、複数の学習データを取得する取得手段と、前記取得した複数の学習データそれぞれから複数種類の特徴量を抽出する抽出手段と、前記学習データのそれぞれから抽出した複数種類の特徴量の中から１以上の特徴量を選択する選択処理を実行する選択手段と、前記選択した特徴量に基づいて第２の識別モデルを生成する生成処理を実行する第２の生成手段と、前記生成された第２の識別モデルの認識スコアを算出する算出処理を実行する算出手段と、前記選択処理、前記生成処理、前記算出処理をそれぞれ複数回実行することにより求められる複数の前記認識スコアに基づいて、前記複数の学習データの中から第１の識別モデルを生成する際に用いられる学習データを決定する決定手段と、を有することを特徴とする。 The present invention is an information processing apparatus for determining learning data used when the first generation means generates a first identification model for identifying input data, and acquires a plurality of learning data. Selection means for selecting one or more feature amounts from among a plurality of types of feature amounts extracted from each of the learning data, an extraction means for extracting a plurality of types of feature amounts from each of the plurality of acquired learning data Selecting means for executing processing, second generation means for executing generation processing for generating a second identification model based on the selected feature amount, and calculating a recognition score of the generated second identification model Calculation means for executing calculation processing, the selection processing, the generation processing, and the plurality of the plurality of recognition scores obtained by performing the calculation processing a plurality of times respectively; And having a determining means for determining a training data used in generating the first identification model from the training data.

本発明によれば、識別精度の高い識別モデルを生成することができるようになる。 According to the present invention, a discrimination model with high discrimination accuracy can be generated.

第１の実施形態に係る対象物の正常異常判定を行うシステムの概略を示す図。BRIEF DESCRIPTION OF THE DRAWINGS The figure which shows the outline of the system which performs normal abnormality determination of the target object which concerns on 1st Embodiment. 第１の実施形態に係る情報処理装置の機能構成を示す概略ブロック図。FIG. 1 is a schematic block diagram showing a functional configuration of an information processing apparatus according to a first embodiment. 第１の実施形態に係る識別モデル生成の処理の詳細を示すフローチャート。6 is a flowchart showing details of identification model generation processing according to the first embodiment. 第１の実施形態においてハール・ウェーブレット変換を説明するための概略図。BRIEF DESCRIPTION OF THE DRAWINGS The schematic for demonstrating a Haar wavelet transform in 1st Embodiment. 第１の実施形態に係るラベルノイズクレンジングの詳細を示すフローチャート。6 is a flowchart showing details of label noise cleansing according to the first embodiment. 第２の実施形態に係るラベルノイズクレンジングの詳細を示すフローチャート。The flowchart which shows the detail of the label noise cleansing which concerns on 2nd Embodiment. 第３の実施形態に係るラベルノイズクレンジングの詳細を示すフローチャート。The flowchart which shows the detail of the label noise cleansing which concerns on 3rd Embodiment. 第２の実施形態に係る処理選択処理を示すフローチャート。The flowchart which shows the process selection process which concerns on 2nd Embodiment. 第４の実施形態に係るラベルノイズクレンジングの詳細を示すフローチャート。The flowchart which shows the detail of the label noise cleansing which concerns on 4th Embodiment. 第５の実施形態に係るラベルノイズクレンジングの詳細を示すフローチャート。The flowchart which shows the detail of the label noise cleansing which concerns on 5th Embodiment. 第６の実施形態に係る識別モデル生成の処理の詳細を示すフローチャート。A flow chart which shows details of processing of discernment model generation concerning a 6th embodiment.

［第１の実施形態］
以下、本発明の第１の実施形態の詳細について図面を参照しつつ説明する。本実施形態では、対象物の良否判定（正常異常判定）に用いられる識別モデルを生成する際に使用される学習データ（学習画像）の中から誤ってラベル付けされたデータを検出し、除去する構成について説明を行う。 First Embodiment
Hereinafter, the details of the first embodiment of the present invention will be described with reference to the drawings. In the present embodiment, erroneously labeled data is detected and removed from learning data (learning image) used when generating an identification model used for object quality determination (normality / abnormality determination). The configuration will be described.

図１は、生成された識別モデル（識別器）を用いて対象物の正常異常判定を行う情報処理システムの概略を示す図である。同図において、１０１は対象物を示しており、本システムは対象物１０１に対し正常異常判定を行う。１０２は画像撮影装置（カメラ）であり、対象物１０１の画像を撮影する。１０３は情報処理装置であり、画像撮影装置１０２で撮影された画像から予め設定された特徴量を抽出し、抽出した特徴量と予め生成してある識別器とに基づいて対象物体が正常であるか異常であるかの判定を行う。１０４は表示装置であり、情報処理装置１０３で判定した結果を表示する。１０５は光源であり、欠陥の可視化のために光源１０５から対象物１０１に光を照射するようになっており、この状態で画像撮影装置１０２は対象物の画像を撮影する。 FIG. 1 is a diagram schematically illustrating an information processing system that performs normal / abnormal determination of an object using the generated identification model (classifier). In the figure, reference numeral 101 denotes an object, and the present system performs normal / abnormal judgment on the object 101. An image capturing apparatus (camera) 102 captures an image of the object 101. An information processing apparatus 103 extracts a predetermined feature amount from an image captured by the image capturing device 102, and the target object is normal based on the extracted feature amount and a classifier generated in advance. It is judged whether it is abnormal or not. Reference numeral 104 denotes a display device, which displays the result determined by the information processing apparatus 103. Reference numeral 105 denotes a light source, which emits light from the light source 105 to the object 101 for visualizing defects. In this state, the image capturing apparatus 102 captures an image of the object.

次に、上述した識別モデルを予め生成するための生成処理について説明を行う。ここでは、図１に示した、実際に正常異常判定を行うシステムにより、識別モデルの生成する構成について説明をする。しかし、識別モデルの生成は、実際の正常異常判定を行うシステムとは別のシステムによって行われるものであってもよい。本実施形態のシステムにおける情報処理装置１０３は、画像撮影装置１０２で撮影された画像に対して人手によって付与されたラベルが正しいか否かの判定を行う。そして、ラベルが誤っていると判定した場合、その画像は識別モデルを生成する際の学習画像として用いないようにする（除去する）。このようにして決定された学習画像を用いて、正常異常判定を行うための識別モデルを学習により生成する。 Next, generation processing for generating the above-described identification model in advance will be described. Here, the configuration for generating the identification model by the system shown in FIG. 1 that actually performs the normal / abnormal determination will be described. However, the generation of the identification model may be performed by a system other than the system that makes the actual normal / abnormal determination. The information processing apparatus 103 in the system of the present embodiment determines whether the label manually attached to the image captured by the image capturing apparatus 102 is correct. Then, when it is determined that the label is incorrect, the image is not used (removed) as a learning image at the time of generating the identification model. Using the learning image determined in this manner, a discrimination model for performing normal / abnormal determination is generated by learning.

情報処理装置１０３は、ＣＰＵ、ＲＯＭ、ＲＡＭ、ＨＤＤ等のハードウェア構成を備え、ＣＰＵがＲＯＭやＨＤ等に格納されたプログラムを実行することにより、例えば、後述する各機能構成やフローチャートの処理が実現される。ＲＡＭは、ＣＰＵがプログラムを展開して実行するワークエリアとして機能する記憶領域を有する。ＲＯＭは、ＣＰＵが実行するプログラム等を格納する記憶領域を有する。ＨＤＤは、ＣＰＵが処理を実行する際に要する各種のプログラム、閾値に関するデータ等を含む各種のデータを格納する記憶領域を有する。 The information processing apparatus 103 includes a hardware configuration such as a CPU, a ROM, a RAM, and an HDD, and the CPU executes a program stored in the ROM, the HD, or the like to process, for example, each functional configuration and flowchart described later To be realized. The RAM has a storage area that functions as a work area in which the CPU develops and executes a program. The ROM has a storage area for storing programs to be executed by the CPU. The HDD has a storage area for storing various data including various programs required when the CPU executes a process, data on a threshold, and the like.

図２は、本実施形態に係る情報処理装置１０３の機能構成を示す概略ブロック図である。学習データ設定部２０１は、対象物１０１の撮影された画像に対して人（ユーザ）が決定したラベルの情報を取得し、その情報を画像に対して付与する。ここでは、その画像に含まれる対象物１０１が正常である（第１の情報）か、異常である（第２の情報）かという２値の情報である。また、学習データ設定部２０１は、ラベルの付与された画像（学習画像）から特徴量を算出する。なお、学習データ設定部２０１は、自身がデータに対してラベルを付与して学習データを生成することによって、学習データを取得するという構成のほか、既にラベルの付与された学習データを他の構成から取得してもよい。いずれにしても、学習データ設定部２０１は、ラベルの付与された学習データを取得する取得手段として機能する。また、上述のとおり、学習データから特徴量を抽出する抽出手段としても機能する。 FIG. 2 is a schematic block diagram showing a functional configuration of the information processing apparatus 103 according to the present embodiment. The learning data setting unit 201 acquires information of a label determined by a person (user) with respect to a photographed image of the object 101, and adds the information to the image. Here, it is binary information indicating whether the object 101 included in the image is normal (first information) or abnormal (second information). Also, the learning data setting unit 201 calculates the feature amount from the image (learning image) to which the label is attached. In addition to the configuration in which the learning data setting unit 201 acquires learning data by generating labels by adding labels to data, the learning data setting unit 201 may be configured to obtain learning data to which labels have already been added. It may be obtained from In any case, the learning data setting unit 201 functions as an acquiring unit that acquires learning data to which a label is attached. Also, as described above, it also functions as an extraction unit that extracts a feature amount from learning data.

クレンジング部２０２は、学習データ設定部２０１で付与したラベルが誤っていると判定した学習データ（画像）を除去することで、識別モデルを生成する際に使用する学習画像を選択、決定する。特徴量選択部２０３は、学習データ設定部２０１で抽出された特徴量に対して特徴量選択を行うことにより、特徴量の順位付けを行う。 The cleansing unit 202 selects and determines a learning image to be used when generating the identification model by removing the learning data (image) determined that the label given by the learning data setting unit 201 is incorrect. The feature amount selection unit 203 performs feature amount selection by performing feature amount selection on the feature amounts extracted by the learning data setting unit 201.

識別モデル学習部２０４は、クレンジング部２０２が決定した学習データを用いて、識別モデルを学習する。パラメータ設定部２０５は、特徴量選択部２０３により順位付けされた特徴量からの選択数、及び識別モデル学習部２０４により学習された識別モデルのパラメータを交差確認法を用いて決定する。正常異常判定部２０６は、パラメータ設定部２０５で決定した特徴量選択数に基づいて、テストデータ（テスト画像）から特徴量を抽出する。また、パラメータ設定部２０５で決定したパラメータに基づいて、予め学習しておいた識別モデルにより正常異常判定を行う。 The identification model learning unit 204 learns the identification model using the learning data determined by the cleansing unit 202. The parameter setting unit 205 determines the number of selections from the feature quantities ranked by the feature quantity selection unit 203 and the parameters of the identification model learned by the identification model learning unit 204 using the cross-validation method. The normal / abnormality determination unit 206 extracts a feature amount from test data (test image) based on the feature amount selection number determined by the parameter setting unit 205. Further, based on the parameters determined by the parameter setting unit 205, the normal / abnormal determination is performed by the identification model learned in advance.

図３は、本実施形態における識別モデル生成の処理の詳細を示すフローチャートである。 FIG. 3 is a flowchart showing the details of the identification model generation process in the present embodiment.

（ステップＳ３０１：学習データに対する特徴量抽出）
ステップＳ３０１では、学習データ設定部２０１が、ラベルの付与された学習画像の対象領域から特徴量を抽出する。複数の特徴量を用いる場合は、例えば学習画像の対象領域に対して、ハール・ウェーブレット（ＨａａｒＷａｖｅｌｅｔ）変換をかけて、階層的に画像を生成する。ハール・ウェーブレット変換とは、簡単に述べると、位置情報を保持したまま周波数変換する処理である。 (Step S301: Feature extraction for learning data)
In step S301, the learning data setting unit 201 extracts a feature amount from a target region of a learning image to which a label is attached. When a plurality of feature quantities are used, for example, Haar-Wavelet (Haar Wavelet) transformation is performed on a target region of a learning image to hierarchically generate an image. The Haar wavelet transform is a process of performing frequency conversion while holding position information.

図４は、ハール・ウェーブレット変換の概略図である。まず、学習画像の対象領域に対して４種類のフィルタ（数式１−１〜１−４）を用意する。 FIG. 4 is a schematic view of the Haar wavelet transform. First, four types of filters (Equation 1-1 to 1-4) are prepared for the target region of the learning image.

数式１−１が縦方向数成分フィルタ（ＨＬ）、数式１−２が横方向数成分フィルタ（ＬＨ）、数式１−３が対角方向数成分フィルタ（ＨＨ）、数式１−４が低周波数成分フィルタ（ＬＬ）を示す。対象領域の２×２の画素に対して、上記のフィルタで内積をとる。２×２の領域を重ね合わせることなく移動させて、解像度が２分の１になるように、縦方向成分画像、横方向成分画像、対角方向成分画像、低周波成分画像の４種類の画像を生成する。そして、生成された低周波成分画像から、次の階層の縦方向成分画像、横方向成分画像、対角方向成分画像、低周波成分画像の４種類の画像を生成する。 Equation 1-1 is the vertical direction number component filter (HL), equation 1-2 is the horizontal direction number component filter (LH), equation 1-3 is the diagonal direction number component filter (HH), and equation 1-4 is low frequency The component filter (LL) is shown. For the 2 × 2 pixels of the target area, the inner product is taken with the above filter. Four types of images, vertical component image, horizontal component image, diagonal direction component image, and low frequency component image, so that the 2 × 2 region is moved without overlapping and the resolution becomes half. Generate Then, from the generated low frequency component image, four types of images of the vertical direction component image, the horizontal direction component image, the diagonal direction component image, and the low frequency component image of the next layer are generated.

このような画像生成を繰り返すことによって、各階層で縦方向成分画像、横方向成分画像、対角方向成分画像、低周波成分画像の４種類の成分画像を生成する。この際、解像度は２分の１になるので、例えばハール・ウェーブレット変換を８回繰り返すのであれば、画像サイズは２の８乗の倍数に設定しておくことが好ましい。結果としてハール・ウェーブレット変換を８回行い、各階層から４画像生成されるので、３２画像が生成される。これに加えて入力画像が追加されるので、以下の合計３３画像が生成される。
１）入力画像
２）第１〜第８階層の各階層の縦方向成分画像
３）第１〜第８階層の各階層の横方向成分画像
４）第１〜第８階層の各階層の対角方向成分画像
５）第１〜第８階層の各階層の低周波成分画像
１枚の画像から生成された合計３３種類の画像に対して、最大値、最大値−最小値及び以下の数式２から数式６で示す７つの特徴量を抽出する。結果的に、１枚の画像に対して、７×３３＝２３１個の特徴量を抽出する（以下、１枚の画像から抽出される特徴量の個数をＮ個とする）。画素値の平均は数式２で、分散は数式３で、尖度は数式４で、歪度は数式５で、相乗平均は数式６で算出される。なお、画像のサイズは垂直方向ａ画素、水平方向ｂ画素の画像とし、水平ｉ番目、垂直ｊ番目の画素値をｐ（ｉ，ｊ）と表す。 By repeating such image generation, four types of component images of vertical direction component image, horizontal direction component image, diagonal direction component image, and low frequency component image are generated in each layer. At this time, since the resolution is halved, it is preferable to set the image size to a multiple of 2 to the eighth power, for example, when repeating Haar wavelet transform eight times. As a result, Haar wavelet transform is performed eight times, and four images are generated from each layer, so 32 images are generated. In addition to this, since the input image is added, the following total 33 images are generated.
1) Input image 2) Longitudinal component image of each hierarchy of first to eighth hierarchy 3) Horizontal component image of each hierarchy of first to eighth hierarchy 4) Diagonal of each hierarchy of first to eighth hierarchy Direction component image 5) Low frequency component image of each of the first to eighth layers For a total of 33 types of images generated from one image, maximum value, maximum value-minimum value, and the following equation 2 Seven feature quantities shown in Equation 6 are extracted. As a result, 7 × 33 = 231 feature quantities are extracted from one image (hereinafter, the number of feature quantities extracted from one image is N). The average of the pixel values is calculated by Equation 2, the variance by Equation 3, the kurtosis by Equation 4, the skewness by Equation 5, and the geometric average by Equation 6. Note that the size of the image is an image of a pixel in the vertical direction and b pixels in the horizontal direction, and the i-th horizontal pixel value and the j-th vertical pixel value are p (i, j).

ここでは、ハール・ウェーブレット変換を用いる手法について述べたが、ウェーブレット変換、エッジ抽出、フーリエ変換、ガボール変換といったその他の変換手法を用いても良い。また、他の統計特徴量として、最大値から最小値を引いた値、標準偏差といった統計量を用いても良い。以上の処理により、学習画像から複数種類の特徴量を抽出することができる。 Here, the method using the Haar-wavelet transform has been described, but other transform methods such as wavelet transform, edge extraction, Fourier transform, and Gabor transform may be used. Further, as other statistical feature quantities, statistics such as a value obtained by subtracting the minimum value from the maximum value and a standard deviation may be used. By the above processing, a plurality of types of feature quantities can be extracted from the learning image.

（ステップＳ３０２：ラベルノイズクレンジング）
ステップＳ３０２では、クレンジング部２０２が、Ｓ３０１で学習画像から抽出した複数の特徴量を利用して、学習画像のラベルノイズクレンジングを行う。すなわち、クレンジング部２０２は、学習画像がノイズデータ（誤ったラベルが付与されているデータ）であるか否かの判定を行い、ノイズデータと判定された学習画像を除外する。これにより、入力画像を識別するための識別モデルの生成の際に使用される学習データが決定される。このステップにおける処理の詳細については後述する。 (Step S302: Label Noise Cleansing)
In step S302, the cleansing unit 202 cleans the label image of the learning image using the plurality of feature amounts extracted from the learning image in step S301. That is, the cleansing unit 202 determines whether the learning image is noise data (data to which an incorrect label is added), and excludes the learning image determined as noise data. This determines the training data used in generating the identification model to identify the input image. Details of the process in this step will be described later.

（ステップＳ３０３：特徴量選択）
ステップＳ３０３では、特徴量選択部２０３が、ステップＳ３０２でラベルノイズクレンジングを行った学習画像に対し、特徴量の順位付けを行う。特徴量の順位付けを行う方法として、特許文献１には、入力データから抽出される複数の特徴量から特徴量間の組合せの相性を考慮し、入力データの分類に適した特徴量を選択する技術が開示されている。具体的には、特徴量を組み合わせて第１評価値を算出し、第１評価値同士を比較して上位のＩ個の特徴量にのみ特徴量間の組合せの相性を示す第２評価値を投票し、第２評価値に基づいて特徴量に順位付けする。 (Step S303: Feature Selection)
In step S303, the feature amount selection unit 203 performs feature amount ranking on the learning image subjected to label noise cleansing in step S302. As a method of ranking feature amounts, Patent Document 1 selects feature amounts suitable for classification of input data, taking into consideration the compatibility between combinations of feature amounts from a plurality of feature amounts extracted from input data. Technology is disclosed. Specifically, the first evaluation value is calculated by combining the feature amounts, the first evaluation values are compared, and the second evaluation value indicating the compatibility between the feature amounts only for the upper I feature amounts is used. Vote and rank feature quantities based on the second evaluation value.

（ステップＳ３０４：識別モデルの生成）
ステップＳ３０４では、識別モデル学習部２０４が、Ｓ３０３で順位付けされた特徴量を用いて、識別モデルの生成を行う。この識別モデルは未知の入力画像を識別するための識別モデル（第１の識別モデル）であり、本実施形態においては入力画像に含まれる対象物の良否判定を行うための識別モデルである。ここでは、部分空間法のひとつである投影距離法を識別モデルの生成に用いる。投影距離とは、簡単に述べると、それぞれの特徴量を軸とする特徴空間における特徴ベクトルと、パターンの分布の分散が最大となる向きを持つ超平面（主平面）との最短距離である。以下、数式を用いて具体的に説明する。 (Step S304: Generation of Identification Model)
In step S304, the identification model learning unit 204 generates an identification model using the feature quantities ranked in step S303. The identification model is an identification model (first identification model) for identifying an unknown input image, and in the present embodiment is an identification model for performing quality determination of an object included in the input image. Here, a projection distance method, which is one of the subspace methods, is used to generate a discrimination model. The projection distance is simply the shortest distance between the feature vector in the feature space whose axis is the respective feature amount and the hyperplane (principal plane) having the direction in which the distribution of the pattern is maximized. Hereinafter, this will be specifically described using formulas.

正常データの平均ベクトルｍと共分散行列Σは、正常データの数ｎと特徴ベクトルｘ_ｉを用いて示すことができる。正常データの平均ベクトルｍを数式７に、共分散行列Σを数式８に示す。 The mean vector m of normal data and the covariance matrix Σ can be shown using the number n of normal data and the feature vector x _i . An average vector m of normal data is shown in Equation 7 and a covariance matrix Σ is shown in Equation 8.

ここで、Σの第ｉ番目の固有値、固有ベクトルをそれぞれλ_ｉ、φ_ｉとし、固有値は降順で並んでいるものとする。本実施形態では、学習用の正常データを用いて、正常データの平均ベクトルｍと共分散行列Σから識別モデルを生成する。 Here, it is assumed that the i-th eigenvalue and eigenvector of Σ are λ _i and φ _i , respectively, and the eigenvalues are arranged in descending order. In this embodiment, a discrimination model is generated from the mean vector m of the normal data and the covariance matrix て using the normal data for learning.

（ステップＳ３０５：交差確認法によるパラメータ決定）
ステップＳ３０５では、パラメータ設定部２０５が、交差確認法を用いて、ステップＳ３０３における特徴量選択により順位付けされた特徴量の選択数、およびステップＳ３０４における部分空間の射影次元を決定する。 (Step S305: Parameter Determination by Cross Confirmation Method)
In step S305, the parameter setting unit 205 determines the number of feature quantities selected by the feature quantity selection in step S303 and the projection dimension of the subspace in step S304 using the cross-validation method.

具体的には、ここでは、ｋ−Ｆｏｌｄ交差確認法を用いてパラメータを決定する。すなわち、学習データをランダムにｋ分割し、ｋ分割したデータセットのうち、（ｋ−１）個のデータセットで識別モデルの生成を行い、１つのデータセットで検証する。そして、決定すべきパラメータ（特徴量の選択数と部分空間の次元数）を順次変えながら、認識率の性能評価を行い、ＡＵＣ（エリアアンダーカーブ：認識性能曲線の下部面積）が最も良いパラメータを選択する。なお、ｋ＝５程度に設定するのが適当である。 Specifically, parameters are determined here using k-Fold cross validation. That is, the learning data is randomly divided into k, and of the k-divided data sets, the identification model is generated using (k-1) data sets, and is verified with one data set. Then, the performance of the recognition rate is evaluated while sequentially changing the parameters to be determined (the number of selected features and the number of dimensions of the subspace), and the AUC (area under curve: area under the recognition performance curve) is the best parameter. select. Incidentally, it is appropriate to set about k = 5.

（ステップＳ３０６：テストデータに対する正常異常判定）
ステップＳ３０６では、正常異常判定部２０６が、ステップＳ３０３で選択した特徴量を用いて、テストデータに対する特徴量抽出を行い、ステップＳ３０４で生成した識別モデルを用いて、ステップＳ３０５で決定したパラメータにより正常異常判定を行う。数式７で算出された主平面と、数式８で算出された平均ベクトルｍを用いて、射影次元数ｌとテストデータの特徴ベクトル (Step S306: Normal / abnormal judgment on test data)
In step S306, the normality / abnormality determination unit 206 performs feature amount extraction on test data using the feature amount selected in step S303, and using the discrimination model generated in step S304, the normality is determined according to the parameters determined in step S305. Perform an anomaly judgment. Using the principal plane calculated by Equation 7 and the average vector m calculated by Equation 8, the projection dimension number l and the feature vector of the test data

により、投影距離ｄ（ｘ）は以下の数式９で表される。ここでは、数式９で表された投影距離を算出し、閾値処理をした上で正常異常判定を行う。 Thus, the projection distance d (x) is expressed by Equation 9 below. Here, the projection distance represented by Formula 9 is calculated, threshold processing is performed, and the normal / abnormal determination is performed.

なお、本実施形態では、部分空間法を用いて識別モデルを生成する構成について述べてきたが、ＳＶＭ等の他の識別器を用いて識別モデルを生成してもよい。 In the present embodiment, although the configuration for generating the identification model using the subspace method has been described, the identification model may be generated using another identifier such as SVM.

ここでは、本実施形態の情報処理装置１０３が、ラベルノイズクレンジング処理以降の、実際の入力データに対して識別を行う際の識別モデルの学習も行うものとしたが、実際の入力データに対して識別を行う際の識別モデルの学習は別の装置で行ってもよい。 Here, although the information processing apparatus 103 according to the present embodiment performs learning of the identification model when performing identification on actual input data after the label noise cleansing process, the information processing apparatus 103 also performs processing on the actual input data. The learning of the identification model at the time of identification may be performed by another device.

ここで、ステップＳ３０２におけるラベルノイズクレンジングの処理の詳細について述べる。図５は、本実施形態のラベルノイズクレンジングの処理の詳細を示すフローチャートである。 Here, the details of the label noise cleansing process in step S302 will be described. FIG. 5 is a flowchart showing the details of the label noise cleansing process of this embodiment.

（ステップＳ５０１：学習データの設定）
ステップＳ５０１では、Ｍ個の画像データ（学習画像）を用意（設定）する。ここで、学習データはＭ個用意するものとし、各学習データからは上述したようにＮ個の特徴量が抽出されている。なお、ここで設定される学習データは上述のＳ３０１で取得した学習データの全部、または一部である。 (Step S501: setting of learning data)
In step S501, M pieces of image data (learning images) are prepared (set). Here, it is assumed that M learning data are prepared, and N feature quantities are extracted from each learning data as described above. The learning data set here is all or part of the learning data acquired in S301 described above.

（ステップＳ５０２：特徴量選択）
ステップＳ５０２では、ステップＳ５０１で設定したＭ個の学習データそれぞれから抽出したＮ個の特徴量のうちランダムにＲ個の特徴量を選択する選択処理を実行する（Ｒは１以上の整数）。このとき特徴量をランダムに選択するので、特徴量選択にかかる計算時間を軽減することができ、繰り返し処理が多数になったとしても計算時間の増大を抑制できる。なお、各学習画像において選択されるＲ個の特徴量の組み合わせはどれも同じである。 (Step S502: Feature Selection)
In step S502, a selection process is performed to select R feature quantities at random among the N feature quantities extracted from each of the M learning data set in step S501 (R is an integer of 1 or more). At this time, since the feature quantities are randomly selected, it is possible to reduce the calculation time required to select feature quantities, and to suppress an increase in calculation time even if the number of repetitive processes is large. The combination of R feature amounts selected in each learning image is the same.

（ステップＳ５０３：データ分割）
ステップＳ５０３では、Ｍ個用意してある学習データをランダムにＬ個のデータセット（グループ）に分割する（Ｌは１以上の整数）。そして、Ｌ−１個のデータセットを学習用データに設定し、１個のデータセットを検証用データに設定する。そして、組み合わせを変えながら、学習用データと検証用データの組み合わせをＬ個生成する。なお、ここでは、Ｌ個のデータセットに分割することを前提とすると述べたが、分割することなく、用意したＭ個の学習画像を学習用データと検証用データの両方に用いるようにしても良い。なお、ステップＳ５０２とＳ５０３は処理の順番は逆であってもよい。 (Step S503: Data Division)
In step S503, M pieces of learning data prepared are randomly divided into L data sets (groups) (L is an integer of 1 or more). Then, L-1 data sets are set as learning data, and one data set is set as verification data. Then, while changing the combination, L combinations of learning data and verification data are generated. Although it has been stated here that division into L data sets is premised, it is possible to use the prepared M learning images for both learning data and verification data without division. good. The order of processing in steps S502 and S503 may be reversed.

（ステップＳ５０４：識別モデル生成）
ステップＳ５０４では、ステップＳ５０３で学習用データとして設定されたデータセットのデータを用いて、部分空間法により識別モデルを生成する生成処理を実行する。このステップで生成する識別モデルは、先に説明をした入力画像を識別するための識別モデルと異なり、ラベルノイズクレンジングにおいて使用される識別モデル（第２の識別モデル）である。なお、第１の識別モデルと第２の識別モデルとは異なる種類のモデルであってもよいし、本実施形態のように同じ種類のモデルであってもよい。 (Step S504: Identification Model Generation)
In step S504, using the data of the data set set as learning data in step S503, a generation process of generating a discrimination model by a subspace method is executed. The discrimination model generated in this step is a discrimination model (second discrimination model) used in label noise cleansing, unlike the discrimination model for identifying the input image described above. Note that the first identification model and the second identification model may be different types of models, or may be the same type of model as in this embodiment.

（ステップＳ５０５：認識スコア算出）
ステップＳ５０５では、ステップＳ５０４で生成した識別モデルを利用して、検証用データとして設定された各データの認識スコアを算出する算出処理を実行する。ここで、認識スコアはそのデータが正常らしさ（正常であることの尤度）を示すスコアである。すなわち、学習データに付与されているラベルが正常である（第１の情報）か、異常である（第２の情報）かという２値の場合に、認識スコアは第１の情報であることの尤度を示すものである。なお、本実施形態においては、識別モデル生成と認識スコアの算出はステップＳ３０４で述べた投影距離法を用いるが、ステップＳ３０４と異なる手法を利用するようにしても良い。 (Step S505: Calculation of recognition score)
In step S505, calculation processing is performed to calculate the recognition score of each data set as verification data using the identification model generated in step S504. Here, the recognition score is a score indicating that the data is normal (likelihood of being normal). That is, in the case where the label attached to the learning data is normal (first information) or abnormal (second information), the recognition score is the first information. It indicates the likelihood. In the present embodiment, the projection distance method described in step S304 is used for identification model generation and recognition score calculation, but a method different from step S304 may be used.

（ステップＳ５０６：認識スコア統合）
ステップＳ５０６では、ステップＳ５０５で算出したデータごとの認識スコアを足し合わせて、スコアを統合する。具体的には、データごとに算出した認識スコアを足し合わせて、認識スコアの総和を求める。なお、検証用データの認識スコアの総和を求めると述べたが、学習用データと検証用データの認識スコアの両方を用いて、認識スコアの総和を算出してもよい。 (Step S506: recognition score integration)
In step S506, the recognition scores for each piece of data calculated in step S505 are added together to integrate the scores. Specifically, the recognition scores calculated for each data are added to obtain the total of the recognition scores. Although it has been stated that the sum of recognition scores of verification data is determined, the sum of recognition scores may be calculated using both learning data and recognition scores of verification data.

（ステップＳ５０７：ノイズデータ候補決定）
ステップＳ５０７では、ステップＳ５０６で算出した認識スコアを用いて、ノイズデータ候補（除外すべき学習画像の候補）を決定する。各データで算出された認識スコアから、正常のラベルが付与されているデータのみのスコアを抽出し、正常のラベルが付与されているデータのスコアの平均値及び標準偏差を算出する。そして、そのスコアの平均値に対してスコアの標準偏差をａ倍した値を加算し、第１の閾値とする。また、スコアの平均値に対してスコアの標準偏差をａ倍した値を減算し、第２の閾値とする。正常のラベルが付与されているデータのスコアが第１、第２の閾値に挟まれた値であるかを判断し、挟まれたデータであるならば、本当に正常データであると判断する。一方、それ以外の値であればノイズデータであると判断する。正常のノイズデータは正常データの分布の外部にあることを考慮し、例えばａ＝３に設定して閾値処理を行い、正常のラベルが付与されているデータが、正常データであるかノイズデータであるかを判定する。 (Step S507: noise data candidate determination)
In step S507, noise data candidates (candidates of learning images to be excluded) are determined using the recognition score calculated in step S506. From the recognition score calculated for each data, the score of only the data to which the normal label is attached is extracted, and the average value and the standard deviation of the scores of the data to which the normal label is attached are calculated. Then, a value obtained by multiplying the standard deviation of the scores by a is added to the average value of the scores to obtain a first threshold. Further, a value obtained by multiplying the standard deviation of the score by a times the average value of the score is subtracted to obtain a second threshold. It is determined whether the score of the data to which the normal label is assigned is a value between the first and second threshold values, and if it is the sandwiched data, it is determined that the data is truly normal data. On the other hand, if the value is other than that, it is determined that the data is noise data. In consideration of the fact that normal noise data is outside the distribution of normal data, threshold processing is performed by setting, for example, a = 3, and data labeled as normal is normal data or noise data. Determine if there is.

異常のラベルが付与されているデータに対しても、同様に閾値処理を行う。まず異常のラベルが付与されているデータのスコアの平均値及び標準偏差を算出し、スコアの平均値に対してスコアの標準偏差をｂ倍した値を加算し、第３の閾値とする。また、スコアの平均値に対してスコアの標準偏差をｂ倍した値を減算し、第４の閾値とする。異常のラベルが付与されているデータのスコアが第３、第４の閾値に挟まれた値であるかを判断し、挟まれたデータであるならば、ノイズデータであると判断する。一方、それ以外の値であれば本当に異常データであると判断する。異常のノイズデータは正常データの分布の内部にあることを考慮し、例えば、ｂ＝１に設定し、閾値処理を行い、異常のラベルが付与されているデータが、異常データであるかノイズデータであるかを判定する。 The threshold processing is similarly performed on data to which an abnormal label is attached. First, the average value and the standard deviation of the scores of the data labeled with the abnormality are calculated, and a value obtained by multiplying the standard deviation of the scores by b is added to the average value of the scores to obtain a third threshold. In addition, a value obtained by multiplying the standard deviation of the score by b with respect to the average value of the score is subtracted to obtain a fourth threshold. It is determined whether the score of the data to which the abnormal label is attached is a value between the third and fourth threshold values, and if it is the sandwiched data, it is determined that the data is noise data. On the other hand, if the value is other than that, it is determined that the data is really abnormal data. Consider that the noise data of the abnormality is inside the normal data distribution, for example, set b = 1, perform threshold processing, and indicate whether the data with the label of the abnormality is abnormal data or noise data Determine if it is.

（ステップＳ５０８：終了条件を満たすかの確認）
ステップＳ５０８では、ステップＳ５０２からステップＳ５０７までの処理を繰り返し、終了条件を満たすかの確認を行う。終了条件としては、例えば、ステップＳ５０２からＳ５０７の処理を所定回数（例えば、１００回）以上繰り返し実行したか否か等が挙げられる。 (Step S508: Confirmation of End Condition)
In step S508, the process from step S502 to step S507 is repeated to check whether the end condition is satisfied. As the termination condition, for example, whether or not the processing of steps S502 to S507 is repeatedly performed a predetermined number of times (for example, 100 times) or more can be mentioned.

本実施形態では、Ｓ５０２において特徴量をランダムに選択をして、その選択された特徴量に基づいて識別モデルが生成されてノイズデータが決定される。そのため、予め決まった１つの特徴量に基づいてノイズデータを決定する構成よりも、ロバスト性よくノイズデータを決定することができる。したがって、このノイズデータを除去した学習データに基づいて生成される、対象物を識別するための識別モデルでは精度よく識別を行うことができるようになる。また、特徴量をランダムに選択することにより、繰り返し処理にかかる計算時間を軽減させることができる。 In the present embodiment, feature amounts are randomly selected in S502, and a discrimination model is generated based on the selected feature amounts to determine noise data. Therefore, the noise data can be determined more robustly than the configuration in which the noise data is determined based on one predetermined feature amount. Therefore, in the identification model for identifying the object, which is generated based on the learning data from which the noise data is removed, the identification can be performed with high accuracy. In addition, by selecting the feature amounts at random, it is possible to reduce the calculation time required for the repetitive processing.

なお、本実施形態では、ステップＳ５０２において特徴量をランダムに選択したが、Ｓ５０２〜Ｓ５０８が複数回実行される際、各回で組合せが異なるように特徴量が選択されればよく、必ずしもランダムに選択するようにしなくともよい。 In the present embodiment, the feature amount is randomly selected in step S502. However, when S502 to S508 are executed a plurality of times, the feature amount may be selected so that the combination is different each time. You do not have to do it.

（ステップＳ５０９：ノイズデータの決定）
ステップＳ５０９では、ステップＳ５０２からステップＳ５０７までの繰り返し処理の回数に対してノイズデータ候補であると決定した割合を基に、ノイズデータを決定する。本実施形態では、ステップＳ５０２からステップＳ５０７までの繰り返し処理の回数を１００としたとき、ｘ％の割合でノイズデータ候補であると判定したデータをノイズデータであると決定する。ここでｘ＝５０程度に設定するのが好ましい。そして、ここでノイズデータとして決定された学習データ（学習画像）はＳ３０５における識別モデルの生成には利用されないよう対象のデータから除外される。 (Step S509: Determination of noise data)
In step S509, noise data is determined based on the ratio determined to be a noise data candidate with respect to the number of times of repetitive processing from step S502 to step S507. In the present embodiment, assuming that the number of times of repetitive processing from step S502 to step S507 is 100, data determined as noise data candidates at a rate of x% is determined as noise data. Here, it is preferable to set about x = 50. And the learning data (learning image) determined as noise data here is excluded from the data of object so that it may not be utilized for the production | generation of the identification model in S305.

以上、本実施形態にかかる情報処理装置によれば、複数の特徴量の中から組み合わせを異ならせながら複数回特徴量を選択する。そして、特徴量の組み合わせを異ならせたそれぞれでノイズデータであるかを判断し、その判断結果を統合してノイズデータを決定する。これにより、ロバストに精度よくノイズデータを決定することができるため、識別精度の高い識別モデルを生成することができるようになる。 As described above, according to the information processing apparatus according to the present embodiment, the feature amount is selected a plurality of times while making the combination different among the plurality of feature amounts. Then, different combinations of feature amounts are used to determine whether the data is noise data, and the determination results are integrated to determine noise data. As a result, noise data can be determined robustly and accurately, so that a discrimination model with high discrimination accuracy can be generated.

［第２の実施形態］
次に、本発明の第２の実施形態について説明する。第１の実施形態では、ステップＳ５０２において特徴量をランダムに選択する構成を示した。本実施形態は、より精度良くノイズデータを決定するために、繰り返し処理の各処理において、ノイズデータ候補を決定する処理が正しく行われていることを確認するものである。なお、第１の実施形態で既に説明をした構成については同一の符号を付し、その説明を省略する。 Second Embodiment
Next, a second embodiment of the present invention will be described. In the first embodiment, the configuration in which the feature amount is randomly selected in step S502 has been described. In the present embodiment, in order to determine noise data more accurately, it is confirmed that the process of determining noise data candidates is correctly performed in each process of the iterative process. The components already described in the first embodiment are denoted by the same reference numerals, and the description thereof is omitted.

図６は、本実施形態におけるラベルノイズクレンジングの処理の詳細を示すフローチャートである。なお、図６のステップＳ６０１〜Ｓ６０６、Ｓ６０８における各処理は、第１の実施形態で示した図５のステップＳ５０１〜Ｓ５０６、ステップＳ５０８の各処理と同様であるため説明を省略する。 FIG. 6 is a flowchart showing details of label noise cleansing processing in the present embodiment. The processes in steps S601 to S606 and S608 in FIG. 6 are the same as the processes in steps S501 to S506 and step S508 in FIG. 5 described in the first embodiment, and thus the description thereof is omitted.

（ステップＳ６０７：ノイズデータ候補とグッドデータ候補を決定）
ステップＳ６０７では、正常データと異常データそれぞれからグッドデータ候補とノイズデータ候補を決定する。 (Step S 607: Determine noise data candidates and good data candidates)
In step S607, good data candidates and noise data candidates are determined from the normal data and the abnormal data.

最初に、正常のラベルが付与されているデータに対して、ノイズデータ候補を決定する。ここでのノイズデータ候補の決定は、例えば、第１の実施形態で説明したような第１、第２の閾値を用いた閾値処理によって行えばよい。次に、異常のラベルが付与されているデータに対して、ノイズデータ候補を決定する。ここでのノイズデータ候補の決定は、例えば、第１の実施形態で説明したような第３、第４の閾値を用いた閾値処理によって行えばよい。以上により、正常のラベルが付与されているデータ及び異常のラベルが付与されているデータのそれぞれからノイズデータ候補を決定する。 First, noise data candidates are determined for data labeled as normal. The determination of the noise data candidate here may be performed by, for example, threshold processing using the first and second thresholds as described in the first embodiment. Next, noise data candidates are determined for the data to which the abnormal label is attached. The determination of the noise data candidate here may be performed by, for example, threshold processing using the third and fourth thresholds as described in the first embodiment. As described above, the noise data candidate is determined from each of the data to which the normal label is attached and the data to which the abnormal label is attached.

次に、正常のラベルが付与されているデータに対して、グッドデータ候補を決定する。具体的には、まず正常のラベルが付与されているデータのみのスコアを抽出し、正常のラベルが付与されているデータのスコアの平均値及び標準偏差を算出する。そして、スコアの平均値に対してスコアの標準偏差をｃ倍した値を加算し、第５の閾値とする。また、スコアの平均値に対してスコアの標準偏差をｃ倍した値を減算し、第６の閾値とする。正常のラベルが付与されているデータのスコアが第５、第６の閾値に挟まれた値であるかどうかを判断し、挟まれたデータであるならばグッドデータであると判断する。例えば、正常のグッドデータは正常データの分布の内部にあることを考慮して、ｃ＝１に設定した上で閾値処理を行い、正常のラベルが付与されているデータからグッドデータ候補を決定する。 Next, good data candidates are determined for the data to which the normal label is assigned. Specifically, first, the score of only the data to which the normal label is attached is extracted, and the average value and the standard deviation of the scores of the data to which the normal label is attached are calculated. Then, a value obtained by multiplying the standard deviation of the score by c with respect to the average value of the scores is added to obtain a fifth threshold. Further, a value obtained by multiplying the standard deviation of the score by c with respect to the average value of the score is subtracted to obtain a sixth threshold. It is determined whether the score of the data to which the normal label is assigned is a value between the fifth and sixth threshold values, and if it is the sandwiched data, it is determined that the data is good data. For example, in consideration of the fact that normal good data are in the distribution of normal data, threshold processing is performed after setting c = 1, and good data candidates are determined from data to which normal labels are attached. .

最後に、異常のラベルが付与されているデータに対して、グッドデータ候補を決定する。各データで算出されたスコアから、異常のラベルが付与されているデータのみのスコアを抽出する。異常のラベルが付与されているデータのスコアの平均値及び標準偏差を算出し、スコアの平均値に対してスコアの標準偏差をｄ倍した値を加算し、第７の閾値とする。また、スコアの平均値に対してスコアの標準偏差をｄ倍した値を減算し、第８の閾値とする。異常のラベルが付与されているデータのスコアが第７、第８の閾値に挟まれていなければ、異常データのグッドデータであると判定する。異常のグッドデータは正常データの分布の外部にあることを考慮して、例えば、ｆ＝３に設定した上で閾値処理を行い、異常のラベルが付与されているデータからグッドデータ候補を決定する。 Finally, good data candidates are determined for the data that is labeled as abnormal. From the scores calculated for each data, the score of only the data to which the abnormal label is assigned is extracted. The average value and the standard deviation of the scores of the data labeled as abnormal are calculated, and a value obtained by multiplying the standard deviation of the scores by d is added to the average value of the scores to obtain a seventh threshold. Further, a value obtained by multiplying the standard deviation of the score by d with respect to the average value of the score is subtracted to obtain an eighth threshold. If the score of the data labeled as abnormal is not between the seventh and eighth threshold values, it is determined that the data is good data of the abnormal data. In consideration of the fact that the good data of abnormality is outside the distribution of normal data, for example, threshold processing is performed after setting f = 3, and good data candidates are determined from the data to which the label of abnormality is attached. .

以上の説明からわかるように、グッドデータ候補とは、各学習画像に付与されているラベルを正しく認識できている可能性が相対的に高い学習データである。逆に、ノイズデータ候補とは、各学習画像に付与されているラベルを正しく認識できている可能性が相対的に低い学習データである。 As understood from the above description, the good data candidate is learning data having a relatively high possibility of correctly recognizing the label attached to each learning image. Conversely, the noise data candidate is learning data that is relatively low in the possibility of correctly recognizing the label attached to each learning image.

（ステップＳ６０９：ノイズデータの設定）
ステップＳ６０９では、ステップＳ６０７で選ばれたノイズデータ候補からノイズデータを決定する。 (Step S609: setting of noise data)
In step S609, noise data is determined from the noise data candidates selected in step S607.

ステップＳ６０２からステップＳ６０７までの繰り返し処理の回数に対してノイズデータ候補であると決定した割合を基に、ノイズデータを決定する。ここでは、各データに対し、ノイズデータ候補であるかどうかの集計を行い、ステップＳ６０２からステップＳ６０７までの繰り返しにおいて、ｘ％の割合でノイズデータと判定した場合、ノイズデータであると設定する。 Noise data is determined based on a ratio determined to be a noise data candidate with respect to the number of times of repetitive processing from step S602 to step S607. Here, each data is summed up whether it is a noise data candidate or not, and in the repetition from step S602 to step S607, if it is determined as noise data at a rate of x%, it is set as noise data.

（ステップＳ６１０：ノイズデータの確認）
ステップＳ６１０では、ステップＳ６０９でノイズデータが正しく決定できているかを確認する。ステップＳ６０９においてノイズデータであると判定したデータに関し、ステップＳ６０２からステップＳ６０７までの繰り返しにおいて、ステップＳ６０７で１度でもグッドデータ候補と判断されている場合には、正しいノイズデータではないと判断する。 (Step S610: Confirmation of noise data)
In step S610, it is checked in step S609 whether the noise data can be correctly determined. With regard to the data determined to be noise data in step S609, if it is determined as a good data candidate even once in step S607 in the repetition of steps S602 to S607, it is determined that the data is not correct noise data.

（ステップＳ６１１：ノイズデータの再設定）
ステップＳ６１１では、ステップＳ６０９で決定したノイズデータから、ステップＳ６１０で正しいノイズデータでないと判断したものを除外することにより、ノイズデータを再設定する。 (Step S611: Resetting noise data)
In step S611, the noise data is reset by excluding the noise data determined in step S609 that is determined not to be the correct noise data in step S610.

本実施形態においては、ラベルノイズクレンジングにおいてグッドデータは用いていないが、グッドデータ候補からグッドデータを決定し、非特許文献１と同様にノイズデータとグッドデータによりデータクレンジングを行うようにしてもよい。 In the present embodiment, good data is not used in label noise cleansing, but good data may be determined from good data candidates and data cleansing may be performed using noise data and good data as in Non-Patent Document 1. .

本実施形態にかかる情報処理装置によれば、取捨選択した設定基準を満たす、繰り返し処理におけるノイズデータ候補とグッドデータ候補を用いて、ノイズデータの再設定を行う。これにより、ロバストに精度よくノイズデータを決定することができるため、識別精度の高い識別モデルを生成することができるようになる。 According to the information processing apparatus according to the present embodiment, the noise data is reset using the noise data candidate and the good data candidate in the iterative process which satisfy the selected setting criteria. As a result, noise data can be determined robustly and accurately, so that a discrimination model with high discrimination accuracy can be generated.

［第３の実施形態］
次に、本発明の第３の実施形態について説明する。本実施形態は、繰り返し処理の中から、文字列の順番に関する類似度を算出する距離尺度を用いてノイズデータ候補を決定するためにどの回の処理を用いるかを選択する。そして、選択した処理のノイズデータ候補からノイズデータを決定する。なお、第１、第２の実施形態で既に説明をした構成については同一の符号を付し、その説明を省略する。 Third Embodiment
Next, a third embodiment of the present invention will be described. In the present embodiment, from among the iterative processes, which process is used to determine noise data candidates is selected using a distance scale that calculates the degree of similarity related to the order of character strings. Then, noise data is determined from the noise data candidates of the selected process. The components already described in the first and second embodiments are denoted by the same reference numerals, and the description thereof will be omitted.

図７は、本実施形態におけるラベルノイズクレンジングの処理の詳細を示すフローチャートである。なお、図７のステップＳ６０１〜Ｓ６０６における各処理は、第１の実施形態で示した図５のステップＳ５０１〜Ｓ５０６の各処理と同様であるため説明を省略する。 FIG. 7 is a flowchart showing details of label noise cleansing processing in the present embodiment. The processes in steps S601 to S606 in FIG. 7 are the same as the processes in steps S501 to S506 in FIG. 5 shown in the first embodiment, and therefore the description thereof is omitted.

（ステップＳ７０７：認識スコアの順番を記憶）
ステップＳ７０７では、繰り返し処理の各回のステップＳ７０５でデータに付与されたインデックスを利用して、各データに対して算出された認識スコアから認識スコアの順番を記憶する。 (Step S 707: memorize the order of recognition score)
In step S 707, the index assigned to the data in step S 705 of each iteration of the repetitive processing is used to store the order of the recognition score from the recognition score calculated for each data.

（ステップＳ７０８：終了条件を満たすかの確認）
ステップＳ７０８では、ステップＳ７０２からステップＳ７０７までの処理を繰り返し、終了条件を満たすかの確認を行う。ステップＳ７０２からステップＳ７０８までの処理を繰り返し、選んだ特徴量の組み合わせに対応するノイズデータ候補を決定する。 (Step S 708: Confirmation of End Condition)
In step S 708, the processing from step S 702 to step S 707 is repeated to check whether the end condition is satisfied. The processing from step S702 to step S708 is repeated to determine noise data candidates corresponding to the selected combination of feature amounts.

（ステップＳ７０９：順番に関する類似度を算出し、繰り返し処理を選択）
ステップＳ７０９では、ステップＳ７０７で記憶した認識スコアの順番に基づいて、順番に関する類似度を算出し、繰り返し処理の中からどの回の処理を使用するかを選択する。詳細に関しては、図８を用いて後述する。 (Step S 709: Calculate the degree of similarity regarding the order and select the repetitive process)
In step S 709, based on the order of the recognition scores stored in step S 707, the similarity relating to the order is calculated, and among the repetitive processes, which process to use is selected. Details will be described later with reference to FIG.

（ステップＳ７１０：ノイズデータの決定）
ステップＳ７１０では、ステップＳ７０９で選択した処理のみを利用して、ノイズデータ候補からノイズデータを決定する。各データに対し、選択した回の処理を利用して、ノイズデータ候補であるかどうかの集計を行い、ｂ％の割合でノイズデータと判定した場合、ノイズデータであると決定する。ここでは、ｂ＝５０程度に設定するのが好ましい。 (Step S710: Determination of noise data)
In step S710, noise data is determined from the noise data candidates using only the process selected in step S709. Each data is subjected to processing of selected times to count whether it is a noise data candidate or not, and when it is determined as noise data at a ratio of b%, it is determined to be noise data. Here, it is preferable to set b = about 50.

図８は、繰り返し処理の中からノイズデータを判断するために用いる処理を選択するための処理を示すフローチャートである。以下に、ステップＳ８０１からステップＳ８０２までの処理を説明する。 FIG. 8 is a flowchart showing a process for selecting a process to be used to determine noise data from among the iterative processes. The processes from step S801 to step S802 will be described below.

（ステップＳ８０１：順番に関する類似度行列の算出）
ステップＳ８０１では、文字列の類似度に用いる評価値を用いて、その評価値を要素とする、順番に関する類似度行列を算出する。順番に関する類似度を求めるために、文字列の類似度に用いる評価値を用いる。例えば、評価値として、最小編集距離等を用いる。最小編集距離とは、並んだ数字列に対してどれだけ変更を加えれば別の数字列になるか、ということに基づいて求められる距離である。変更の回数が多ければ多いほど、距離は大きくなる。ここでは、認識スコアの降順に並んだインデックスに対し、異なる繰り返し処理で並んだ数字列を比較し、「挿入」「削除」「置換」の処理を行い、同一の数字列になるようにする。例えば、繰り返し処理１「３，６，７，１」と繰り返し処理２「５，６，８」に並んだ数字列に変更する場合、繰り返し処理１の「３」を「５」に置換し、「７」を「８」に置換し、「１」を削除する。そうすれば繰り返し処理１の数字列が、繰り返し処理２の数字列に変更される。このときの処理の回数が３回なので、最小編集距離は３となる。このようにして、異なる繰り返し処理の認識スコアを比較し、最小編集距離を算出する。なお、ここでは、最小編集距離を用いることを前提としたが、そのほかの距離尺度として、レーベンシュタイン距離を用いても良い。最小編集距離では、「挿入」「削除」「置換」のそれぞれの処理で同一のコストを与えるが、レーベンシュタイン距離は、「挿入」「削除」「置換」に対し、それぞれ異なるコストを与える。距離の尺度を用いて、繰り返し処理の各処理のスコアを算出する。 (Step S801: Calculation of similarity matrix regarding order)
In step S801, using the evaluation value used for the similarity of the character string, a similarity matrix regarding order is calculated using the evaluation value as an element. The evaluation value used for the similarity of character strings is used to obtain the similarity regarding the order. For example, the minimum editing distance or the like is used as the evaluation value. The minimum editing distance is a distance determined based on how much change is made to the arranged number string to become another number string. The greater the number of changes, the greater the distance. Here, for the indexes arranged in descending order of recognition score, the numeral strings arranged in different repetition processing are compared, and the processes of “insertion”, “deletion” and “replacement” are performed so that they become identical numeral strings. For example, when changing to the numeral string in which repeat process 1 “3, 6, 7, 1” and repeat process 2 “5, 6, 8” are arranged, “3” in repeat process 1 is replaced with “5”, Replace "7" with "8" and delete "1". Then, the numeral string of the repeat process 1 is changed to the numeral string of the repeat process 2. Since the number of times of processing at this time is three, the minimum editing distance is three. Thus, the recognition scores of different repeated processes are compared to calculate the minimum editing distance. Although it is assumed here that the minimum editing distance is used, Levenshtein distance may be used as another distance scale. The minimum editing distance gives the same cost in each of the "insert", "delete" and "replace" processes, but the Levenshtein distance gives different costs to "insert", "delete" and "replace". The distance measure is used to calculate the score of each process of the iterative process.

最小編集距離を利用して、繰り返し処理ｉと繰り返し処理ｊを比較し、繰り返し処理ｉと繰り返し処理ｊの類似度ＤＩＳＴ_ｉｊを算出する。そして、順番に関する類似度行列Ｙを数式１０を用いて算出する。 Using the minimum edit distance, the iterative process i is compared with the iterative process j to calculate the similarity DIST _ij of the iterative process i and the iterative process j. Then, the similarity matrix Y related to the order is calculated using Equation 10.

（ステップＳ８０２：繰り返し処理の各処理のスコアの算出）
ステップＳ８０２では、ステップＳ８０１で算出した類似度行列を用いて、繰り返し処理の各回の処理のスコアを算出する。以下の数式１１に示すように、類似度行列Ｙに対し、列ごとに類似度Ｙ_ｉｊを加算し、総和を求め、繰り返し処理ｉに対するスコアＤＩＳＴ＿ＳＵＭ（ｉ）を算出する。 (Step S802: Calculation of Scores of Repetitive Processing)
In step S802, using the similarity matrix calculated in step S801, the score of each process of the iterative process is calculated. As shown in Equation 11 below, the similarity score Y _ij is added to the similarity matrix Y for each column to obtain the sum, and the score DIST_SUM (i) for the iterative process i is calculated.

このようにして、繰り返し処理の各回の処理に対応するスコアを算出できたので、スコアを降順に並べ、上位ａ％は利用しないようにして、ステップＳ７０１からステップＳ７０６までの繰り返し処理を選択する。このときａ＝１０程度に設定する。 In this way, since the score corresponding to each process of the iterative process has been calculated, the score is arranged in descending order, and the top a% is not used, and the iterative process from step S701 to step S706 is selected. At this time, it is set to about a = 10.

本実施形態にかかる情報処理装置によれば、文字列の順番に関する類似度を算出する距離尺度を用いて、ノイズデータ候補を決定するための処理の回を選択し、選択した回の処理のノイズデータ候補からノイズデータを決定する。これにより、精度よくノイズデータを決定することができる。 According to the information processing apparatus according to the present embodiment, using the distance measure for calculating the similarity with respect to the order of the character string, the process of determining the noise data candidate is selected, and the noise of the selected process is selected. Determine noise data from data candidates. Thereby, the noise data can be determined with high accuracy.

［第４の実施形態］
次に、本発明の第４の実施形態について説明する。本実施形態は、繰り返し処理に負荷のかからない特徴量選択手法を組み合わせることにより、ノイズデータ候補を決定し、精度良くノイズデータを決定するものである。なお、第１〜第３の実施形態で既に説明をした構成については同一の符号を付し、その説明を省略する。 Fourth Embodiment
Next, a fourth embodiment of the present invention will be described. In the present embodiment, noise data candidates are determined and noise data is determined with high accuracy by combining a feature amount selection method that does not require a load on repetitive processing. The same reference numerals are given to the configurations already described in the first to third embodiments, and the descriptions thereof will be omitted.

図９は、本実施形態におけるラベルノイズクレンジングの処理の詳細を示すフローチャートである。なお、図９のステップＳ９０１、Ｓ９０３〜Ｓ９０９における各処理は、第１の実施形態で示した図５のステップＳ５０１、Ｓ５０３〜Ｓ５０９の各処理と同様であるため説明を省略する。 FIG. 9 is a flowchart showing details of label noise cleansing processing in the present embodiment. The processes in steps S901 and S903 to S909 in FIG. 9 are the same as the processes in steps S501 and S503 to S509 in FIG. 5 described in the first embodiment, and thus the description thereof is omitted.

（ステップＳ９０２：特徴量選択手法を選択し特徴量選択）
ステップＳ９０２では、ステップＳ９０１で設定した学習データに対し、繰り返し処理に負荷のかからない特徴量選択手法を選択し、特徴量選択を行う。計算負荷がかからない特徴量選択手法として、評価値基準（ベイズ誤り確率推定値やクラス内分散・クラス間分散比）により、１つずつ特徴量を評価し、評価値が良い特徴量から順に特徴量を選択するといった手法がある。 (Step S902: Select a feature selection method and select a feature)
In step S902, for the learning data set in step S901, a feature amount selection method that does not impose a load on repetitive processing is selected, and feature amount selection is performed. As a feature quantity selection method that does not require calculation load, feature quantities are evaluated one by one according to the evaluation value criteria (Bayesian error probability estimated value or intraclass variance / interclass variance ratio), and feature quantities are ordered in order from the best There is a method of selecting

ここでは、評価値の例の１つとして、ベイズ誤り確率推定値について述べる。ここで、正常のクラス、異常のクラスのそれぞれをｗ_１、ｗ_２とし、ｎ個の特徴をもつベクトルをＸ＝［ｘ_１，・・・，ｘ_ｎ］^ｔとする。正常クラスｗ_１、異常クラスｗ_２に属する確率の分布に対応するｗ_１とｗ_２における条件付き確率分布Ｐ（ｘ｜ｗ_１）、Ｐ（ｘ｜ｗ_２）をヒストグラムで表現し、そこから事後確率分布Ｐ（ｗ_１｜ｘ）、Ｐ（ｗ_２｜ｘ）を算出する。事後確率分布Ｐ（ｗ_ｉ｜ｘ）を数式１２に示す。 Here, the Bayesian error probability estimated value will be described as one of the examples of the evaluation value. Here, let w ₁ and w ₂ denote a normal class and an abnormal class, respectively, and let X = [x ₁ ,..., X _n ] ^t be a vector having n features. The conditional probability distributions P (x | w ₁ ) and P (x | w ₂ ) in w ₁ and w ₂ corresponding to the distributions of the probability belonging to the normal class w ₁ and the abnormal class w ₂ are expressed by a histogram, from which The posterior probability distributions P (w ₁ | x) and P (w ₂ | x) are calculated. Posterior probability distribution P | a _(w i x) shown in Equation 12.

そして、事後確率分布Ｐ（ｗ_１｜ｘ）、Ｐ（ｗ_２｜ｘ）の重なりに対応するベイズ誤り確率推定値を数式１３を用いて算出する。 Then, a Bayesian error probability estimated value corresponding to the overlap of the posterior probability distributions P (w ₁ | x) and P (w ₂ | x) is calculated using Expression 13.

Ｂａｙｅｓ＝∫ｍｉｎ｛Ｐ（ｗ_１│ｘ），Ｐ（ｗ_２│ｘ）｝ｄｘ（数式１３）
この確率推定値の計算を、Ｎ個の特徴量の組み合わせそれぞれに対して行う。ここで算出するベイズ誤り確率推定値は、値が低いほど良品と不良品との分類に適している組み合わせとみなすことが出来る。 Bayes = ∫min {P (w ₁ │x), P (w ₂ │x)} dx (Equation 13)
The calculation of the probability estimated value is performed on each of the N feature quantities. The Bayesian error probability estimated value calculated here can be regarded as a combination suitable for the classification of the non-defective product and the non-defective product as the value is lower.

次に、クラス内分散・クラス間分散比について詳細に述べる。例えば２クラス問題の場合、２つのクラスをｗ_１、ｗ_２とし、観測される特徴をｘ０＝［ｘ_１，ｘ_２，・・・，ｘ_ｋ，・・・，ｘ_Ｎ］とするとき、特徴量ｘ_ｋに関するクラス内分散・クラス間分散比を求める。また、クラスｗ_ｉに属するパターン数をｎ_ｉ、クラスｗ_ｉに属するパターンのｘ_ｋの平均をｍ_ｉとする。さらに、全パターンのｘ_ｋの平均をｍとする。このとき、クラス内分散 Next, the in-class variance / inter-class variance ratio will be described in detail. For example, in the case of 2-class problem, the two classes as _w 1, _{w 2,} characterized observed _{_{x0 = [x 1, x 2}} , ···, x k, ···, x N] when the, The intra-class variance / inter-class variance ratio regarding the feature quantity x _k is determined. Also, let n _{i be} the number of patterns belonging to class w _i , and let m _i be the average of x _k of patterns belonging to class w _i . Furthermore, let m be the average of x _k of all patterns. At this time, intraclass dispersion

とクラス間分散 And interclass distribution

は数式１４及び数式１５のように算出することができる。 Can be calculated as Equation 14 and Equation 15.

数式１４及び数式１５から、クラス内分散・クラス間分散比は From Equation 14 and Equation 15, the intraclass variance / interclass variance ratio is

で算出することができる。このようにして、クラス内分散・クラス間分散比を求め、値が大きい順に特徴量を選択する。ここでは、評価値として、ベイズ誤り確率推定値とクラス内分散・クラス間分散比を用いることを述べたが、ガウス分布のずれに基づく評価値を用いてもよい。 It can be calculated by In this way, the intraclass variance / interclass variance ratio is determined, and feature quantities are selected in descending order of values. Here, as the evaluation value, the use of the Bayesian error probability estimated value and the in-class variance / inter-class variance ratio has been described, but an evaluation value based on the deviation of the Gaussian distribution may be used.

次に計算負荷がかからない手法として、２つずつ特徴量を評価して、評価基準の良い特徴量から２つずつ特徴量を選択していく手法がある。このときも同様に、ベイズ誤り確率推定値もしくはクラス内分散・クラス間分散比を用いて特徴選択を行う。 Next, as a method that does not apply a calculation load, there is a method of evaluating the feature quantities two by two and selecting the feature quantities two by two from the feature quantities having good evaluation criteria. Also in this case, feature selection is performed using the Bayesian error probability estimated value or the intraclass variance / interclass variance ratio.

また、最後に計算負荷がかからない手法として、特許文献１で開示されている手法がある。特許文献１では、特徴量間の組み合わせの相性を評価し、特徴量ごとにスコアを算出し、特徴量を選択する順序を決定する手法である。ここでも同様に、ベイズ誤り確率推定値もしくはクラス内分散・クラス間分散比を用いて特徴量選択を行う。 Lastly, there is a method disclosed in Patent Document 1 as a method that does not require a calculation load. Patent Document 1 is a method of evaluating the compatibility of combinations of feature amounts, calculating a score for each feature amount, and determining the order of selecting the feature amounts. Here too, feature value selection is performed using the Bayesian error probability estimated value or the intraclass variance / interclass variance ratio.

以上述べた複数の特徴量選択手法の中から、特徴量選択手法を予め設定しておくか、もしくは処理ごとにランダムに特徴量選択手法を選択して特徴量選択を行う。 Among the plurality of feature amount selection methods described above, the feature amount selection method is set in advance, or the feature amount selection method is randomly selected for each process to perform feature amount selection.

本実施形態にかかる情報処理装置によれば、計算負荷のかからない特徴量選択の手法を用いて特徴量選択を行う。このようにして、特徴量選択にかかる計算時間を軽減させて、ノイズデータ候補を選択する繰り返し処理を行う。これにより、精度よくノイズデータを決定することができる。 According to the information processing apparatus of the present embodiment, feature amount selection is performed using a feature amount selection method that does not require a calculation load. In this manner, the calculation time for selecting the feature amount is reduced, and the iterative process of selecting the noise data candidate is performed. Thereby, the noise data can be determined with high accuracy.

［第５の実施形態］
次に、本発明の第５の実施形態について説明する。本実施形態は、分割したデータセットごとに選択する特徴量を変えてノイズデータ候補を決定することにより、処理が正しく行われていることを確認するものである。なお、第１〜第４の実施形態で既に説明をした構成については同一の符号を付し、その説明を省略する。 Fifth Embodiment
Next, a fifth embodiment of the present invention will be described. In the present embodiment, it is confirmed that the processing is correctly performed by changing the feature amount to be selected for each divided data set to determine the noise data candidate. The same reference numerals are given to the configurations that have already been described in the first to fourth embodiments, and the descriptions thereof will be omitted.

図１０は、本実施形態におけるラベルノイズクレンジングの処理の詳細を示すフローチャートである。なお、図９のステップＳ１００４〜Ｓ１００９における各処理は、第１の実施形態で示した図５のステップＳ５０４〜Ｓ５０９の各処理と同様であるため説明を省略する。 FIG. 10 is a flowchart showing details of label noise cleansing processing in the present embodiment. The processes in steps S1004 to S1009 in FIG. 9 are the same as the processes in steps S504 to S509 in FIG. 5 described in the first embodiment, and thus the description thereof is omitted.

（ステップＳ１００２：データ分割）
ステップＳ１００２では、ステップＳ１００１で設定したデータを、複数個のデータセットに分割する。選択したＲ個の特徴量に関し、学習データをランダムにＬ個のデータセットに分割する。このとき、学習用データと検証用データに分けるが、Ｌ−１個のデータセットを学習用データに設定し、１個のデータセットを検証用データに設定する。そして、組み合わせを変えながら、Ｌ個の学習用データと検証用データの組み合わせを生成する。 (Step S1002: Data Division)
In step S1002, the data set in step S1001 is divided into a plurality of data sets. The learning data is randomly divided into L data sets for the selected R feature amounts. At this time, although divided into learning data and verification data, L-1 data sets are set as learning data, and one data set is set as verification data. Then, while changing the combination, a combination of L pieces of learning data and verification data is generated.

（ステップＳ１００３：特徴量選択）
ステップＳ１００３では、ステップＳ１００２で分割したＬ個の学習用データと検証用データの組み合わせに対応する特徴量をランダムに選択する。これにより、Ｌ個のランダムな特徴量のセットが生成される。 (Step S1003: feature amount selection)
In step S1003, a feature amount corresponding to the combination of L pieces of learning data and verification data divided in step S1002 is randomly selected. This generates L sets of random feature amounts.

本実施形態にかかる情報処理装置によれば、分割したデータセットごとに選択する特徴量を変えて、ノイズデータ候補を決定することにより、処理が正しく行われていることを確認する。これにより、精度よくノイズデータを決定することができる。 According to the information processing apparatus according to the present embodiment, it is confirmed that the processing is correctly performed by changing the feature amount to be selected for each divided data set and determining the noise data candidate. Thereby, the noise data can be determined with high accuracy.

［第６の実施形態］
次に、本発明の第６の実施形態について説明する。上述の各実施形態では、対象物の良否判定（正常異常判定）に用いられる識別モデルを生成する場合を例に説明してきた。本実施形態は、画像の診断に用いられる識別モデルを生成する場合を示す。なお、第１〜第５の実施形態で既に説明をした構成については同一の符号を付し、その説明を省略する。 Sixth Embodiment
Next, a sixth embodiment of the present invention will be described. In each of the above-mentioned embodiments, the case of generating the identification model used for the quality determination (normal / abnormal determination) of the object has been described as an example. This embodiment shows the case of generating a discrimination model used for image diagnosis. The same reference numerals are given to the configurations already described in the first to fifth embodiments, and the descriptions thereof will be omitted.

本実施形態に係る情報処理システムでは、画像取得装置により取得された医療画像に特徴的な異常部分を検出する。そのため、システムは画像取得装置と情報処理装置とを含み、情報処理装置は画像取得装置１１０１により取得した医療画像、ここでは医療画像に特徴的な異常部分があるかどうかを判定する。なお、本実施形態に係る情報処理装置のハードウェア構成、機能構成は第１の実施形態と同様である。 The information processing system according to the present embodiment detects an abnormal portion characteristic of a medical image acquired by the image acquisition device. Therefore, the system includes an image acquisition apparatus and an information processing apparatus, and the information processing apparatus determines whether a medical image acquired by the image acquisition apparatus 1101, here, a medical image has a characteristic abnormal part. The hardware configuration and the functional configuration of the information processing apparatus according to the present embodiment are the same as those of the first embodiment.

図１１は、本実施形態に係る識別モデル生成の処理の詳細を示すフローチャートである。 FIG. 11 is a flowchart showing details of identification model generation processing according to the present embodiment.

（ステップＳ１１０１：学習データに対する特徴量抽出）
ステップＳ１１０１では、学習データ設定部２０１が、取得した医療画像から学習データを生成する。例えば、眼底画像において、糖尿病に特徴的な異常部分を予め人手でマーキングした領域を異常データと判断し、局所特徴量を抽出する。そして、マーキングしていない領域を正常データと判断し、局所特徴量を抽出する。特徴量としては、サイズ不変、回転不変な特徴量のひとつであるＳＩＦＴ特徴量を用いる。ここでは、正常部分、異常部分を含む各領域に対して、Ｎ次元のＳＩＦＴ特徴量を利用して学習データを生成する。 (Step S1101: feature amount extraction for learning data)
In step S1101, the learning data setting unit 201 generates learning data from the acquired medical image. For example, in the fundus image, a region in which an abnormal portion characteristic of diabetes is manually marked in advance is determined as abnormal data, and a local feature amount is extracted. Then, the region not marked is judged as normal data, and the local feature amount is extracted. As feature quantities, SIFT feature quantities which are one of size invariant and rotation invariant feature quantities are used. Here, learning data is generated using N-dimensional SIFT feature quantities for each region including normal and abnormal parts.

（ステップＳ１１０２：ラベルノイズクレンジング）
ステップＳ１１０２では、クレンジング部２０２が、ステップＳ１１０１で生成した学習データに対し、ラベルノイズクレンジング技術を用いて、学習データのクレンジングを行う。ラベルノイズクレンジングを行う手法は、第１の実施形態と同様である。 (Step S1102: Label Noise Cleansing)
In step S1102, the cleansing unit 202 cleans the learning data from the learning data generated in step S1101 using a label noise cleansing technique. The method of performing label noise cleansing is the same as that of the first embodiment.

（ステップＳ１１０３：識別モデルの生成）
ステップＳ１１０３では、識別モデル学習部２０４が、ステップＳ１１０２で生成した学習データに対して、ＳＶＭを利用して識別モデルの生成を行う。 (Step S1103: Generation of Identification Model)
In step S1103, the identification model learning unit 204 generates an identification model using the SVM for the learning data generated in step S1102.

（ステップＳ１１０４：テストデータに対する画像診断）
ステップＳ１１０４では、正常異常判定部２０６が、テスト画像から局所領域を切り出しＳＩＦＴ特徴量で特徴抽出し、ステップＳ１１０３で生成した識別モデルを用いて異常部分があるかの判断を行う。 (Step S1104: Diagnostic imaging on test data)
In step S1104, the normal / abnormality determination unit 206 extracts local regions from the test image, extracts features using SIFT feature amounts, and determines whether there is an abnormal part using the discrimination model generated in step S1103.

本実施形態にかかる情報処理装置によれば、医療画像の診断において、特徴的な異常部分の局所領域を検出する際に用いる学習画像の局所領域の特徴量を利用して、誤ってラベル付けされたデータを除去するラベルノイズクレンジングを行う。かかる構成により、精度の高い画像診断を行うことができる。 According to the information processing apparatus according to the present embodiment, in diagnosis of a medical image, erroneous labeling is performed using the feature amount of the local area of the learning image used when detecting the local area of the characteristic abnormal portion. Perform label noise cleansing to remove out-of-date data. With this configuration, it is possible to perform highly accurate image diagnosis.

［その他の実施形態］
本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１以上のプロセッサがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 Other Embodiments
The present invention supplies a program that implements one or more functions of the above-described embodiments to a system or apparatus via a network or storage medium, and one or more processors in a computer of the system or apparatus read and execute the program. Processing is also feasible. It can also be implemented by a circuit (eg, an ASIC) that implements one or more functions.

２０１学習データ設定部
２０２クレンジング部
２０３特徴量選択部
２０４識別モデル学習部
２０５パラメータ設定部
２０６正常異常判定部 201 Learning data setting unit 202 Cleansing unit 203 Feature amount selection unit 204 Identification model learning unit 205 Parameter setting unit 206 Normal abnormality judgment unit

Claims

An information processing apparatus for determining learning data used when the first generation means generates a first identification model for identifying input data, the information processing apparatus comprising:
Acquisition means for acquiring a plurality of learning data;
Extracting means for extracting a plurality of types of feature quantities from each of the plurality of acquired learning data;
Selection means for executing a selection process of selecting one or more feature amounts from a plurality of types of feature amounts extracted from each of the learning data;
Second generation means for executing generation processing for generating a second identification model based on the selected feature amount;
Calculation means for executing calculation processing for calculating a recognition score of the generated second identification model;
It is used when generating a first identification model from among the plurality of learning data based on the plurality of recognition scores obtained by performing the selection process, the generation process, and the calculation process a plurality of times. Determining means for determining learning data;
An information processing apparatus comprising:

The apparatus further comprises dividing means for dividing the plurality of learning data into a plurality of groups,
The second generation unit generates the second identification model using learning data of a part of the plurality of groups.
The information processing apparatus according to claim 1, wherein the calculation unit calculates the recognition score using learning data of remaining groups of the plurality of groups.

The information processing apparatus according to claim 1, wherein the selection unit randomly selects one or more feature amounts from the plurality of types of feature amounts.

The information processing apparatus according to claim 1, wherein the selection unit selects a feature amount from the plurality of types of feature amounts based on evaluation values of the plurality of types of feature amounts.

The information processing apparatus according to claim 4, wherein the evaluation value is a Bayesian error probability estimated value, an in-class variance, an inter-class variance ratio, or a deviation of distribution in a Gaussian distribution.

The information processing apparatus according to claim 4, wherein the evaluation value is a distance measure when the recognition score is assigned a unique index, and the index is rearranged.

The information processing apparatus according to claim 6, wherein the distance measure is a minimum editing distance or a Levenshtein distance.

8. The information processing apparatus according to claim 6, wherein the selection unit selects a feature amount from the plurality of types of feature amounts using a matrix having the distance measure as an element.

8. The information processing apparatus according to claim 6, wherein the selection unit selects a feature quantity from the plurality of types of feature quantities by converting a matrix having the distance measure as an element into a vector. .

The information processing apparatus according to any one of claims 1 to 5, wherein the second generation unit generates the second identification model using an SVM or a subspace method.

The determination means determines whether or not the learning data is excluded based on each of the plurality of recognition scores, and excludes the learning data based on a ratio determined to be excluded. The information processing apparatus according to any one of 1 to 10.

The selection unit determines whether the data is relatively likely to be correctly recognized, or the data is relatively unlikely to be likely, based on each of the plurality of recognition scores. The information processing apparatus according to any one of claims 1 to 10, wherein it is determined whether or not the learning data is excluded.

The learning data is provided with a label indicating whether it is the first information or the second information,
The information processing apparatus according to any one of claims 1 to 12, wherein the recognition score indicates a likelihood that the label attached to learning data is the first information.

The information according to any one of claims 1 to 13, further comprising the first generation means for generating the first identification model based on the learning data determined by the determination means. Processing unit.

An information processing method for determining learning data used when generating a first identification model for identifying input data, comprising:
Acquiring a plurality of learning data;
Extracting a plurality of types of feature quantities from each of the plurality of acquired learning data;
Executing a selection process of selecting one or more feature amounts from a plurality of types of feature amounts extracted from each of the learning data;
Executing a generation process of generating a second identification model based on the selected feature amount;
Executing a calculation process for calculating a recognition score of the generated second identification model;
It is used when generating a first identification model from among the plurality of learning data based on the plurality of recognition scores obtained by performing the selection process, the generation process, and the calculation process a plurality of times. Determining learning data;
An information processing method characterized by comprising:

A program for causing a computer to function as the information processing apparatus according to any one of claims 1 to 13.