JP2019200554A

JP2019200554A - Generation method, generation program, and generation device

Info

Publication number: JP2019200554A
Application number: JP2018094189A
Authority: JP
Inventors: 直紀高橋; Naoki Takahashi
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2018-05-16
Filing date: 2018-05-16
Publication date: 2019-11-21
Anticipated expiration: 2038-05-16
Also published as: JP7133351B2

Abstract

【課題】文章が入力されないことが出力に与える影響を抑制することを目的とする。【解決手段】生成方法は、コンピュータが、学習情報に基づき、第１の単語、及び第１の文章に応じた第１の文章とは異なる形式の第１の中間情報の組みを、第１の情報に応じた情報に変換する第１の変換パラメータと、第１の単語を、第１の中間情報に変換する第２の変換パラメータとを生成し、第２の入力項目に対応付けられた文章の入力を受け付けずに、第１の入力項目に対応付けられた第２の単語の入力を受け付けると、第２の変換パラメータにより第２の単語を変換して、形式の第２の中間情報を生成するとともに、第１の変換パラメータにより第２の単語及び第２の中間情報の組みを変換して、第２の情報を生成する。【選択図】図１１PROBLEM TO BE SOLVED: To suppress the influence of no input of a sentence on output. According to a generation method, a computer determines a first word and a first set of intermediate information in a format different from the first sentence according to the first sentence based on learning information as a first set. A sentence generated by generating a first conversion parameter for converting into information according to the information and a second conversion parameter for converting the first word into the first intermediate information, and correlating with the second input item. When the input of the second word associated with the first input item is accepted without accepting the input of, the second word is converted by the second conversion parameter to obtain the second intermediate information of the format. Along with the generation, the second word and the second intermediate information set are converted by the first conversion parameter to generate the second information. [Selection diagram] Fig. 11

Description

本発明は、生成方法、生成プログラムおよび生成装置に関する。 The present invention relates to a generation method, a generation program, and a generation apparatus.

機械学習を用いて、入力情報に基づいて出力情報を生成する分類器等のシステムを生成する技術が用いられている。しかし、入力情報に欠損がある場合、出力情報の精度が低下する可能性があるため、欠損した情報を予測する技術が用いられる。 A technique for generating a system such as a classifier that generates output information based on input information using machine learning is used. However, when the input information is missing, the accuracy of the output information may be reduced, so a technique for predicting the missing information is used.

関連する技術として、トレーニングデータをデータ学習プログラムに入力して分類器を作成し、欠損値を含むデータから生成した入力データを分類器に入力して、欠損値を予測する技術が提案されている（例えば、特許文献１を参照）。 As a related technique, a technique has been proposed in which training data is input to a data learning program to create a classifier, and input data generated from data containing missing values is input to the classifier to predict missing values. (For example, see Patent Document 1).

また、関連する技術として、欠損値を含むフィールドが複数ある場合に欠損値を含むフィールド以外のフィールドのデータから欠損値を予測し、補間する技術が提案されている（例えば、特許文献２を参照）。 Further, as a related technique, a technique has been proposed in which when there are a plurality of fields containing missing values, the missing values are predicted from the data of fields other than the fields containing missing values and interpolated (see, for example, Patent Document 2). ).

また、関連する技術として、iterative inversion法と誤差逆伝搬により、不完全なセンサ情報の欠損値を推定しモデル獲得を行う技術が提案されている（例えば、特許文献３を参照）。 Further, as a related technique, a technique for estimating a missing value of incomplete sensor information and acquiring a model by using an iterative inversion method and error back propagation has been proposed (see, for example, Patent Document 3).

また、関連する技術として、欠損値のうちの予測したい目的変数を受け付け、重回帰分析の結果をもとに、目的変数を含むレコードの各説明変数の値及び係数から目的変数の値を計算する技術が提案されている（例えば、特許文献４を参照）。 Also, as a related technique, accept the objective variable that you want to predict among the missing values, and calculate the value of the objective variable from the value and coefficient of each explanatory variable of the record including the objective variable based on the result of multiple regression analysis Techniques have been proposed (see, for example, Patent Document 4).

特開２００７−３３４７１９号公報JP 2007-334719 A 特開２０００−４００７９号公報JP 2000-40079 特開平８−２１２１８４号公報JP-A-8-212184 特開２０１４−６３４３２号公報JP 2014-63432 A

例えば、入力情報のうち文章が欠損している場合、入力情報内の他の情報を用いて文章を予測することが考えられる。しかし、文章内の単語の数が可変であり、単語の順番を考慮することになるため、欠損している文章を直接予測することは困難である。そして、欠損した文章の予測精度が低下すると、出力情報の精度も低下する。 For example, when a sentence is missing in the input information, it is conceivable to predict the sentence using other information in the input information. However, since the number of words in the sentence is variable and the order of the words is taken into account, it is difficult to directly predict the missing sentence. And if the prediction precision of the missing sentence falls, the precision of output information will also fall.

１つの側面として、本発明は、文章が入力されないことが出力に与える影響を抑制することを目的とする。 As one aspect, an object of the present invention is to suppress the influence on the output that a sentence is not input.

１つの態様では、生成方法は、コンピュータが、第１の入力項目に対応付けて入力された第１の単語と、第２の入力項目に対応付けて入力された第１の文章と、前記第１の単語及び前記第１の文章の組みに応じた第１の情報と、を含む学習情報を取得し、前記学習情報に基づき、前記第１の単語、及び前記第１の文章に応じた前記第１の文章とは異なる形式の第１の中間情報の組みを、前記第１の情報に応じた情報に変換する第１の変換パラメータと、前記第１の単語を、前記第１の中間情報に変換する第２の変換パラメータとを生成し、前記第２の入力項目に対応付けられた文章の入力を受け付けずに、前記第１の入力項目に対応付けられた第２の単語の入力を受け付けると、前記第２の変換パラメータにより前記第２の単語を変換して、前記形式の第２の中間情報を生成するとともに、前記第１の変換パラメータにより前記第２の単語及び前記第２の中間情報の組みを変換して、第２の情報を生成する、処理を実行する。 In one aspect, the generation method includes: a first word input in association with a first input item; a first sentence input in association with a second input item; Learning information including one word and first information corresponding to the first sentence set, and based on the learning information, the first word and the first sentence according to the first sentence A first conversion parameter for converting a set of first intermediate information in a format different from that of the first sentence into information corresponding to the first information; and the first word is converted into the first intermediate information. The second conversion parameter to be converted into the second input parameter is generated, and the second word associated with the first input item is input without accepting the input of the sentence associated with the second input item. If accepted, the second word is converted by the second conversion parameter, Generating the second intermediate information in the format, and generating the second information by converting the combination of the second word and the second intermediate information by the first conversion parameter To do.

１つの側面によれば、文章が入力されないことが出力に与える影響を抑制することができる。 According to one aspect, it is possible to suppress the influence on the output that a sentence is not input.

生成装置の一例を示す図である。It is a figure which shows an example of a production | generation apparatus. 入力情報および出力情報の一例を示す図である。It is a figure which shows an example of input information and output information. 各情報およびパラメータの例を示す図である。It is a figure which shows the example of each information and a parameter. パラメータ生成方法の一例を示す図である。It is a figure which shows an example of the parameter production | generation method. 中間情報の例を示す図である。It is a figure which shows the example of intermediate information. 中間情報に変換するパラメータの生成方法の例を示す図である。It is a figure which shows the example of the production | generation method of the parameter converted into intermediate information. 入力情報から中間情報への変換方法の例を示す図である。It is a figure which shows the example of the conversion method from input information to intermediate information. 他の入力情報に変換するパラメータの生成方法の第１の例を示す図である。It is a figure which shows the 1st example of the production | generation method of the parameter converted into other input information. 他の入力情報に変換するパラメータの生成方法の第２の例を示す図である。It is a figure which shows the 2nd example of the production | generation method of the parameter converted into other input information. 入力情報から他の入力情報への変換方法の例を示す図である。It is a figure which shows the example of the conversion method from input information to other input information. 学習時における処理の一例を示すフローチャートである。It is a flowchart which shows an example of the process at the time of learning. 運用時における処理の一例を示すフローチャートである。It is a flowchart which shows an example of the process at the time of operation. 入力情報を示す行列式の例を示す図である。It is a figure which shows the example of the determinant which shows input information. 実施例における各情報およびパラメータを示す図である。It is a figure which shows each information and parameter in an Example. 欠損および外れ値を含む情報の例を示す図である。It is a figure which shows the example of the information containing a defect | deletion and an outlier. 生成装置のハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions of a production | generation apparatus.

以下、図面を参照して、実施形態について説明する。図１は、生成装置１の一例を示す図である。生成装置１は、コンピュータの一例である。生成装置１は、例えば、サーバ、パーソナルコンピュータ、またはタブレット端末等である。 Hereinafter, embodiments will be described with reference to the drawings. FIG. 1 is a diagram illustrating an example of the generation device 1. The generation device 1 is an example of a computer. The generation device 1 is, for example, a server, a personal computer, a tablet terminal, or the like.

生成装置１は、取得部１１と第１生成部１２と第２生成部１３と判定部１４と変換部１５と第３生成部１６と記憶部１７とを含む。 The generation device 1 includes an acquisition unit 11, a first generation unit 12, a second generation unit 13, a determination unit 14, a conversion unit 15, a third generation unit 16, and a storage unit 17.

取得部１１は、第１の入力項目に対応付けて入力された第１の単語と、第２の入力項目に対応付けて入力された第１の文章と、第１の単語及び第１の文章の組みに応じた第１の情報と、を含む学習情報を取得する。 The acquisition unit 11 includes a first word input in association with the first input item, a first sentence input in association with the second input item, the first word, and the first sentence. The learning information including the first information corresponding to the set of is acquired.

第１生成部１２は、学習情報に基づき、第１の単語、及び第１の文章に応じた第１の文章とは異なる形式の第１の中間情報の組みを、第１の情報に応じた情報に変換する第１の変換パラメータを生成する。 The first generation unit 12 sets, based on the learning information, a set of first intermediate information in a format different from the first word and the first sentence corresponding to the first sentence according to the first information. A first conversion parameter to be converted into information is generated.

第２生成部１３は、第１の単語を、第１の中間情報に変換する第２の変換パラメータを生成する。第１生成部１２および第２生成部１３は、パラメータ生成部の一例である。 The second generation unit 13 generates a second conversion parameter for converting the first word into the first intermediate information. The first generation unit 12 and the second generation unit 13 are an example of a parameter generation unit.

判定部１４は、入力情報に欠損があるか判定する。なお、生成装置１は、入力情報が学習情報に存在しない情報を含む場合等に、すなわち、入力情報に外れ値がある場合にも、欠損がある場合と同様の処理を行うこととする。 The determination unit 14 determines whether there is a defect in the input information. Note that the generation device 1 performs the same processing as when there is a defect even when the input information includes information that does not exist in the learning information, that is, when there is an outlier in the input information.

変換部１５は、第２の入力項目に対応付けられた文章の入力を受け付けずに、第１の入力項目に対応付けられた第２の単語の入力を受け付けると、第２の変換パラメータにより第２の単語を変換して、第２の中間情報を生成する。 When the conversion unit 15 receives the input of the second word associated with the first input item without receiving the input of the sentence associated with the second input item, the conversion unit 15 receives the input of the second word associated with the first input item. The second word is converted to generate second intermediate information.

第３生成部１６は、第１の変換パラメータにより第２の単語及び第２の中間情報の組みを変換して、第２の情報を生成する。変換部１５および第３生成部１６は、情報生成部の一例である。 The third generation unit 16 converts the combination of the second word and the second intermediate information using the first conversion parameter to generate the second information. The conversion unit 15 and the third generation unit 16 are an example of an information generation unit.

記憶部１７は、生成された各種パラメータ、中間情報等を記憶する。 The storage unit 17 stores various generated parameters, intermediate information, and the like.

図２は、入力情報および出力情報の一例を示す図である。図２に示す入力情報および出力情報は、映画に関する情報である。図２に示す入力情報の入力項目は、映画のジャンル（ｘ_１）、製作会社（ｘ_２）、映画の概要（ｘ_３）を含む。映画のジャンル（ｘ_１）、製作会社（ｘ_２）、は、単語で表現され、映画の概要（ｘ_３）は、文章で表現される。出力情報の項目は、面白さ（ｙ）を含む。なお、以下の説明において、各項目を、記号（ｘ_１、ｘ_２、ｘ_３、ｙ）を用いて表現することがある。 FIG. 2 is a diagram illustrating an example of input information and output information. The input information and output information shown in FIG. 2 are information regarding movies. The input items of the input information shown in FIG. 2 include a movie genre (x ₁ ), a production company (x ₂ ), and a movie overview (x ₃ ). The movie genre (x ₁ ) and the production company (x ₂ ) are expressed by words, and the movie overview (x ₃ ) is expressed by sentences. The item of output information includes interest (y). In the following description, each item may be expressed using symbols (x ₁ , x ₂ , x ₃ , y).

図３は、各情報およびパラメータの例を示す図である。図３に示すように、入力情報ｘ_１は、パラメータｗ_１を用いた分類器により中間情報ｘ_１’に変換される。同様に、入力情報ｘ_２は、パラメータｗ_２を用いた分類器により中間情報ｘ_２’に変換される。また、入力情報ｘ_３は、パラメータｗ_３を用いた分類器により中間情報ｘ_３’に変換される。また、中間情報ｘ_１’、ｘ_２’、ｘ_３’は、パラメータｗ_ｈを用いた分類器によりｙに変換される。 FIG. 3 is a diagram illustrating examples of information and parameters. As shown in FIG. 3, the input information x ₁ is converted into intermediate information x ₁ ′ by a classifier using the parameter w ₁ . Similarly, the input information x ₂ is converted into intermediate information x ₂ ′ by a classifier using the parameter w ₂ . Further, the input information x ₃ is converted into intermediate information x ₃ ′ by a classifier using the parameter w ₃ . The intermediate information _{_{x 1 ', x 2',}} x 3 ' is converted into y by classifier with parameters _{w h.}

生成装置１は、学習情報としてｘ_１〜ｘ_３、ｙを取得した場合、学習情報に基づいて各パラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）を生成する。また、生成装置１は、未知の出力情報を生成するために入力情報ｘ_１〜ｘ_３を取得した場合、入力情報ｘ_１〜ｘ_３と生成した各パラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）とを用いてｙを生成する。 When the generation apparatus 1 acquires x ₁ to x ₃ and y as learning information, the generation apparatus 1 generates each parameter (w _{1 to} w ₃ , w _h ) based on the learning information. Moreover, when the generation apparatus 1 acquires the input information x _{1 to} x ₃ to generate unknown output information, the generation information ₁ includes the input information x _{1 to} x ₃ and the generated parameters (w _{1 to} w ₃ , w _h ). To generate y.

図４は、パラメータ生成方法の一例を示す図である。図４に示す例では、学習情報として、ｘ_１〜ｘ_３、ｙが与えられているとする。そして、第１生成部１２は、ｘ_１〜ｘ_３、ｙに基づいて、各パラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）を生成する。第１生成部１２は、ｘ_１〜ｘ_３、ｙと初期パラメータに基づいて、ｙの予測値ｐｒｅｄ＿ｙを生成し、ｐｒｅｄ＿ｙがｙに近づくように、パラメータを調整する。そして、第１生成部１２は、あるパラメータを用いて生成した予測値ｐｒｅｄ＿ｙがｙに最も近い場合、そのパラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）を記憶部１７に記憶する。記憶部１７に記憶されたパラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）は、運用時において、入力情報から未知の出力情報を生成するために用いられる。 FIG. 4 is a diagram illustrating an example of a parameter generation method. In the example illustrated in FIG. 4, it is assumed that x _{1 to} x ₃ and y are given as learning information. The first generator _12, based on x 1 _~x 3, y, generates the parameters _{_{_{(w 1 ~w 3, w h}}} ). The first generation unit 12 generates a predicted value pred_y of y based on x _{1 to} x ₃ , y and the initial parameter, and adjusts the parameter so that pred_y approaches y. When the predicted value pred_y generated using a certain parameter is closest to y, the first generation unit 12 stores the parameter (w _{1 to} w ₃ , w _h ) in the storage unit 17. The parameters (w _{1 to} w ₃ , w _h ) stored in the storage unit 17 are used for generating unknown output information from input information during operation.

図５は、中間情報の例を示す図である。図５に示す例では、中間情報ｘ_３’は、行列で表される。ｘ_３が文章である場合、中間情報ｘ_３’は、例えば、文章の意味を示す。中間情報ｘ_３’は、入力情報ｘ_３より少ない次元数の行列であるとする。よって、生成装置１は、入力情報ｘ_３が欠損している場合に、入力情報ｘ_３を予測するよりも容易に中間情報ｘ_３’を予測することができる。 FIG. 5 is a diagram illustrating an example of intermediate information. In the example illustrated in FIG. 5, the intermediate information x ₃ ′ is represented by a matrix. If x ₃ is the sentence, intermediate information x _{3 ',} for example, indicating the meaning of a sentence. It is assumed that the intermediate information x ₃ ′ is a matrix having a smaller number of dimensions than the input information x ₃ . Therefore, the generation device 1 can predict the intermediate information x ₃ ′ more easily than when the input information x ₃ is predicted when the input information x ₃ is missing.

図６は、中間情報に変換するパラメータの生成方法の例を示す図である。図６に示す例では、第２生成部１３は、ｘ_１を取得し、ｘ_１から中間情報ｘ_３’を生成するためのパラメータｗ’_３１を生成する。第２生成部１３は、例えば、ｘ_１と初期のパラメータｗ’_３１に基づいて、中間情報ｘ_３’の予測値であるｐｒｅｄ１＿ｘ_３’を生成する。そして、第２生成部１３は、ｐｒｅｄ１＿ｘ_３’とｘ_３’との差が最も小さくなるようにｗ’_３１を生成する。ｘ_３’は、第１生成部１２が学習情報に基づいて生成したパラメータｗ_３と入力情報とｘ_３を用いて、第２生成部１３が予め生成した値である。 FIG. 6 is a diagram illustrating an example of a method for generating a parameter to be converted into intermediate information. In the example shown in FIG. 6, the second generation unit 13 obtains the _{x 1,} to produce a ₃₁ 'parameter w for generating' intermediate information _{x 3} from _{x 1.} For example, the second generation unit 13 generates pred1_x ₃ ′ that is a predicted value of the intermediate information x ₃ ′ based on x ₁ and the initial parameter w ′ ₃₁ . Then, the second generation unit 13 generates w ′ ₃₁ so that the difference between pred1_x ₃ ′ and x ₃ ′ is the smallest. x ₃ ′ is a value generated in advance by the second generation unit 13 using the parameter w ₃ generated by the first generation unit 12 based on the learning information, the input information, and x ₃ .

第２生成部１３は、同様に、ｘ_２を取得し、ｘ_２から中間情報ｘ_３’を生成するためのパラメータｗ’_３２を生成する。 Second generating unit 13 similarly acquires _{x 2,} to produce a ₃₂ 'parameter w for generating' intermediate information _{x 3} from _{x 2.}

変換部１５は、入力情報ｘ_３が欠損している場合、ｗ’_３１を用いて、ｘ_１からｘ_３’を生成する。または、変換部１５は、入力情報ｘ_３が欠損している場合、ｗ’_３２を用いて、ｘ_２からｘ_３’を生成する。 When the input information x ₃ is missing, the conversion unit 15 generates x ₃ ′ from x ₁ using w ′ ₃₁ . Or, converter 15, when the input information _{x 3} is deficient, 'with _32, _{x 3} from _{x 2'} w generates a.

図６におけるｘ_１またはｘ_２は、第１の単語の一例である。ｘ_３は、第１の文章の一例である。ｗ’_３１またはｗ’_３２は、第２の変換パラメータの一例である。 X ₁ or x ₂ in FIG. 6 is an example of the first word. x ₃ is an example of the first sentence. w ′ ₃₁ or w ′ ₃₂ is an example of a second conversion parameter.

図７は、入力情報から中間情報への変換方法の例を示す図である。運用時の入力情報にｘ_３が欠損している場合、変換部１５は、第２生成部１３が生成したｗ’_３１を用いて、ｘ_１からｘ_３’＿１を生成する。さらに、変換部１５は、第２生成部１３が生成したｗ’_３２を用いて、ｘ_２からｘ_３’＿２を生成する。 FIG. 7 is a diagram illustrating an example of a method for converting input information into intermediate information. If the _{x 3} input information during operation are deficient, converting unit 15, 'with _31, _{x 3} from _{x 1'} w the second generator 13 has generated to produce a _1. Furthermore, the conversion unit 15 generates x ₃ ′ _2 from x ₂ using w ′ ₃₂ generated by the second generation unit 13.

変換部１５は、ｘ_１から生成したｘ_３’＿１とｘ_２から生成したｘ_３’＿２とに基づいて、ｘ_３’を生成する。変換部１５は、例えば、ｘ_３’＿１とｘ_３’＿２との平均値を、ｘ_３’とする。変換部１５は、ｘ_３’＿１とｘ_３’＿２とのいずれかをｘ_３’としてもよい。 Converting unit 15, based on the _2 _{'x 3} is generated from _1 and _{x 2'} _{of x 3} generated from _{x 1,} to generate the _{x 3} '. Conversion unit 15, for example, the average value of the _{x 3} '_1 and _{x 3'} _2, and _{x 3} '. The conversion unit 15 may set x ₃ ′ _1 and x ₃ ′ _2 as x ₃ ′.

図７におけるｘ_１、ｘ_２は、第２の単語の一例であり、ｘ_３’は、第２の中間情報の一例である。ｘ_１、ｘ_２、ｘ_３’から生成されるｙは、第２の情報の一例である。 In FIG. 7, x ₁ and x ₂ are examples of second words, and x ₃ ′ is an example of second intermediate information. y generated from _{_{_{x 1, x 2, x 3}}} ' is an example of the second information.

図８は、他の入力情報に変換するパラメータの生成方法の第１の例を示す図である。図８に示す例では、第２生成部１３は、ｘ_１を取得し、ｘ_１からｘ_２を生成するためのパラメータｗ_２１を生成する。第２生成部１３は、例えば、ｘ_１と初期のパラメータｗ_２１に基づいて、ｘ_２の予測値であるｐｒｅｄ１＿ｘ_２を生成する。そして、第２生成部１３は、ｐｒｅｄ１＿ｘ_２とｘ_２との差が最も小さくなるようにｗ_２１を生成する。 FIG. 8 is a diagram illustrating a first example of a method for generating a parameter to be converted into other input information. In the example shown in FIG. 8, the second generation unit 13 obtains the _{x 1,} generates parameters _{w 21} for generating _{x 2} from _{x 1.} Second generating unit 13, for example, based on _{x 1} and the initial parameter _{w 21,} to produce a Pred1_x ₂ is the predicted value of _{x 2.} Then, the second generation unit 13 generates w ₂₁ so that the difference between pred1_x ₂ and x ₂ is minimized.

第２生成部１３は、同様に、ｘ_３を取得し、ｘ_３からｘ_２を生成するためのパラメータｗ_２３を生成する。 Second generating unit 13 similarly acquires _{x 3,} it generates parameters _{w 23} for generating _{x 2} from _{x 3.}

図９は、他の入力情報に変換するパラメータの生成方法の第２の例を示す図である。第２生成部１３は、図８に示した方法と同様の方法により、各入力情報を他の入力情報に変換するためのパラメータを生成する。 FIG. 9 is a diagram illustrating a second example of a method for generating a parameter to be converted into other input information. The second generation unit 13 generates parameters for converting each input information into other input information by a method similar to the method shown in FIG.

例えば、第２生成部１３は、ｘ_１をｘ_２に変換するためのパラメータｗ_２１、およびｘ_１をｘ_３に変換するためのパラメータｗ_３１を生成する。また、第２生成部１３は、ｘ_２をｘ_１に変換するためのパラメータｗ_１２、およびｘ_２をｘ_３に変換するためのパラメータｗ_３２を生成する。また、第２生成部１３は、ｘ_３をｘ_１に変換するためのパラメータｗ_１３、およびｘ_３をｘ_２に変換するためのパラメータｗ_２３を生成する。 For example, the second generation unit 13 generates a parameter w ₂₁ for converting x ₁ into x ₂ and a parameter w ₃₁ for converting x ₁ into x ₃ . The second generation unit 13 generates a parameter w ₁₂ for converting x ₂ to x ₁ and a parameter w ₃₂ for converting x ₂ to x ₃ . The second generation unit 13 generates a parameter w ₁₃ for converting x ₃ to x ₁ and a parameter w ₂₃ for converting x ₃ to x ₂ .

図１０は、入力情報から他の入力情報への変換方法の例を示す図である。運用時の入力情報にｘ_２が欠損している場合、変換部１５は、第１生成部１２が生成したｗ_２１を用いて、ｘ_１からｘ_２＿１を生成する。さらに、変換部１５は、第１生成部１２が生成したｗ_２３を用いて、ｘ_２からｘ_２＿３を生成する。 FIG. 10 is a diagram illustrating an example of a conversion method from input information to other input information. _{If x 2} input information during operation is defective, the converting unit 15 uses the _{w 21} of the first generator 12 has generated, to produce an _x 2 _1 from _{x 1.} Furthermore, the conversion unit 15 generates x ₂ _ ₃ from x ₂ using w ₂₃ generated by the first generation unit 12.

変換部１５は、ｘ_１から生成したｘ_２＿１とｘ_３から生成したｘ_２＿３とに基づいて、ｘ_２を生成する。変換部１５は、例えば、ｘ_２＿１とｘ_２＿３との平均値を、ｘ_２とする。変換部１５は、例えば、ｘ_２＿１とｘ_２＿３とのいずれかをｘ_２としてもよい。 Converting unit 15, based on the _x 2 _3 generated from _x 2 _1 and _{x 3} generated from _{x 1,} to generate the _{x 2.} Conversion unit 15, for _example, the average value of the _x 2 _1 and _x 2 _3, and _{x 2.} For example, the conversion unit 15 may set x ₂ _ 1 and x ₂ _ ₃ as x ₂ .

図１１は、学習時における処理の一例を示すフローチャートである。取得部１１は、分類器のパラメータを生成するための学習情報を取得する（ステップＳ１０１）。学習情報は、例えば、第１の入力項目（例えば、ジャンル、製作会社等）に対応付けて入力された第１の単語と、第２の入力項目（例えば、概要を示す文章）に対応付けて入力された第１の文章を含む。また、学習情報は、第１の単語及び第１の文章の組みに応じた第１の情報（例えば、面白さ）を含む。 FIG. 11 is a flowchart illustrating an example of processing during learning. The acquisition unit 11 acquires learning information for generating a classifier parameter (step S101). The learning information is associated with, for example, a first word input in association with a first input item (for example, genre, production company, etc.) and a second input item (for example, a sentence indicating an outline). Contains the input first sentence. The learning information includes first information (for example, fun) according to the combination of the first word and the first sentence.

第１生成部１２は、第１の単語及び第１の中間情報の組みを、第１の情報に応じた情報に変換するパラメータを生成する（ステップＳ１０２）。ステップＳ１０２で生成されるパラメータは、第１の変換パラメータの一例である。ステップＳ１０２で生成されるパラメータは、例えば、図３に示すＷ_１、Ｗ_２、およびＷ_ｈの組である。 The 1st production | generation part 12 produces | generates the parameter which converts the combination of a 1st word and 1st intermediate information into the information according to 1st information (step S102). The parameter generated in step S102 is an example of a first conversion parameter. The parameter generated in step S102 is, for example, a set of W ₁ , W ₂ , and W _h shown in FIG.

第２生成部１３は、取得部１１が取得した入力情報の入力項目毎に繰り返し処理を開始する（ステップＳ１０３）。第２生成部１３は、処理対象の項目の入力情報が文章であるか判定する（ステップＳ１０４）。 The 2nd generation part 13 starts a repetition process for every input item of input information which acquisition part 11 acquired (Step S103). The second generation unit 13 determines whether the input information of the processing target item is a sentence (step S104).

入力情報が文章である場合（ステップＳ１０４でＹＥＳ）、第２生成部１３は、入力情報を他の入力情報に応じた中間情報に変換するためのパラメータを生成する（ステップＳ１０５）。第２生成部１３は、例えば、ｘ_１を、ｘ_３の中間情報であるｘ_３’に変換するためのパラメータＷ’_３１を生成する（図６参照）。ステップＳ１０５で生成されるパラメータは第２のパラメータの一例である。 When the input information is a sentence (YES in step S104), the second generation unit 13 generates a parameter for converting the input information into intermediate information according to other input information (step S105). Second generating unit 13, for example, the _{x 1,} generates the ₃₁ 'parameters W for converting the' _{x 3} is the intermediate information of _{x 3} (see FIG. 6). The parameter generated in step S105 is an example of a second parameter.

入力情報が文章でない場合（ステップＳ１０４でＮＯ）、第２生成部１３は、入力情報を他の入力情報に変換するためのパラメータを生成する（ステップＳ１０６）。第２生成部１３は、例えば、ｘ_１を、ｘ_２に変換するためのパラメータＷ_２１を生成する（図８参照）。 If the input information is not a sentence (NO in step S104), the second generation unit 13 generates a parameter for converting the input information into other input information (step S106). For example, the second generation unit 13 generates a parameter W ₂₁ for converting x ₁ into x ₂ (see FIG. 8).

第２生成部１３は、取得部１１が取得した入力情報の全項目についてステップＳ１０４〜Ｓ１０６の処理を実行した場合、繰り返し処理を終了する（ステップＳ１０７）。 The 2nd production | generation part 13 complete | finishes a repetition process, when the process of step S104-S106 is performed about all the items of the input information which the acquisition part 11 acquired (step S107).

図１２は、運用時における処理の一例を示すフローチャートである。取得部１１は、入力情報を受け付ける（ステップＳ２０１）。入力情報は、例えば、単語および文章である。 FIG. 12 is a flowchart illustrating an example of processing during operation. The acquisition unit 11 receives input information (step S201). The input information is, for example, words and sentences.

判定部１４は、入力情報に欠損があるか判定する（ステップＳ２０２）。判定部１４は入力情報のうちの一部または全部が不足している場合、入力情報が欠損していると判定する。 The determination unit 14 determines whether there is a defect in the input information (step S202). The determination unit 14 determines that the input information is missing when some or all of the input information is insufficient.

入力情報に欠損があると判定された場合（ステップＳ２０２でＹＥＳ）、変換部１５は、欠損した入力情報毎の繰り返し処理を開始する（ステップＳ２０３）。 When it is determined that the input information is missing (YES in step S202), the conversion unit 15 starts a repetition process for each missing input information (step S203).

変換部１５は、欠損している情報が文章であるか判定する（ステップＳ２０４）。欠損している情報が文章である場合（ステップＳ２０４でＹＥＳ）、変換部１５は、第２生成部１３が生成した第２のパラメータを用いて入力情報を変換して、欠損した入力情報の中間情報を生成する（ステップＳ２０５）。中間情報は、入力情報とは異なる形式であるとする。変換部１５は、例えば、図７の例に示すように、ｗ’_３１を用いて、ｘ_１からｘ_３’＿１を生成する。 The conversion unit 15 determines whether the missing information is a sentence (step S204). When the missing information is a sentence (YES in step S204), the conversion unit 15 converts the input information using the second parameter generated by the second generation unit 13, and intermediate the missing input information. Information is generated (step S205). It is assumed that the intermediate information has a format different from that of the input information. For example, as illustrated in the example of FIG. 7, the conversion unit 15 generates x ₃ ′ _1 from x ₁ using w ′ ₃₁ .

欠損している情報が文章でない場合（ステップＳ２０４でＮＯ）、変換部１５は、第１生成部１２が生成したパラメータを用いて、入力情報を欠損した入力情報に変換する（ステップＳ２０６）。変換部１５は、繰り返し処理を終了する（ステップＳ２０７）。変換部１５は、例えば、第１生成部１２が生成したｗ_２１を用いて、ｘ_１からｘ_２＿１を生成する。 When the missing information is not a sentence (NO in step S204), the conversion unit 15 converts the input information into the missing input information using the parameter generated by the first generation unit 12 (step S206). The conversion unit 15 ends the repetition process (step S207). For example, the conversion unit 15 generates x ₂ _ ₁ from x ₁ using w ₂₁ generated by the first generation unit 12.

第３生成部１６は、欠損していた情報が文章であるか判定する（ステップＳ２０８）。文章が欠損していた場合、ステップＳ２０５の処理で中間情報が生成されている。よって、ステップＳ２０８でＹＥＳの場合、第３生成部１６は、第１のパラメータを用いて、入力情報および中間情報の組に応じた出力情報を生成する（ステップＳ２０９）。第３生成部１６は、例えば、入力情報（ｘ_１、ｘ_２）と、中間情報ｘ_３’と、学習時に得られたＷ_ｈ、Ｗ_１、Ｗ_２、とに基づいて出力情報ｙを生成する。 The third generation unit 16 determines whether the missing information is a sentence (step S208). If the sentence is missing, intermediate information is generated in the process of step S205. Therefore, in the case of YES in step S208, the third generation unit 16 generates output information corresponding to the set of input information and intermediate information using the first parameter (step S209). For example, the third generation unit 16 generates the output information y based on the input information (x ₁ , x ₂ ), the intermediate information x ₃ ′, and W _h , W ₁ , W ₂ obtained at the time of learning. To do.

ステップＳ２０２、またはステップＳ２０８でＮＯの場合、第３生成部１６は、第１のパラメータを用いて入力情報に応じた出力情報を生成する（ステップＳ２１０）。第３生成部１６は、例えば、入力情報（ｘ_１、ｘ_２、ｘ_３）と、学習時に得られたＷ_ｈ、Ｗ_１、Ｗ_２、Ｗ_３とに基づいて出力情報ｙを生成する。 In the case of NO in step S202 or step S208, the third generation unit 16 generates output information corresponding to the input information using the first parameter (step S210). For example, the third generation unit 16 generates output information y based on input information (x ₁ , x ₂ , x ₃ ) and W _h , W ₁ , W ₂ , W ₃ obtained at the time of learning.

なお、生成装置１は、学習時に学習情報が欠損を含む場合、欠損を除く学習情報に対して、図１１に示す処理を実行し、パラメータを生成する。これにより、生成装置１は、欠損がノイズとなり、学習精度が低下することを防ぐことができる。また、生成装置１は、学習情報の欠損について、図１２のステップＳ２０３〜Ｓ２０７と同様の処理を実行し、入力情報または中間情報を生成する。これにより、生成装置１は、学習情報の欠損を補うことができるので、学習情報の量を増やし、出力情報の精度を向上させることができる。 In addition, when learning information contains a defect at the time of learning, the production | generation apparatus 1 performs the process shown in FIG. 11 with respect to the learning information except a defect, and produces | generates a parameter. Thereby, the generation device 1 can prevent the loss from becoming noise and reducing the learning accuracy. Further, the generation device 1 performs the same processing as steps S203 to S207 in FIG. 12 for learning information loss, and generates input information or intermediate information. Thereby, since the production | generation apparatus 1 can compensate the defect | deletion of learning information, it can increase the quantity of learning information and can improve the precision of output information.

以上のように本実施形態の生成装置１は、学習時に、単語から文章に応じた中間情報を生成するためのパラメータを生成する。そして、生成装置１は、運用時に、文章が入力されない（欠損している）場合に、生成したパラメータを用いて中間情報を生成し、その中間情報を用いて出力情報を生成する。そのため、生成装置１は、文章のように予測が困難な情報が欠損していたとしても、中間情報を用いて出力情報を生成することができる。 As described above, the generation apparatus 1 of the present embodiment generates parameters for generating intermediate information corresponding to sentences from words during learning. And the production | generation apparatus 1 produces | generates intermediate information using the produced | generated parameter, and produces | generates output information using the intermediate information, when a sentence is not input at the time of an operation | movement (it is missing). Therefore, the generation device 1 can generate output information using intermediate information even if information that is difficult to predict, such as a sentence, is missing.

また、中間情報は、入力情報より次元が少ない情報であるため、生成装置１は、他の情報を用いて容易に中間情報を予測することできる。 Further, since the intermediate information is information having a smaller dimension than the input information, the generation device 1 can easily predict the intermediate information using other information.

＜実施例＞
図１３は、入力情報を示す行列式の例を示す図である。図１３に示す入力情報は、図２に示す入力情報に対応している。図１３（ａ）は、”ａｃｔｉｏｎ”を示す入力情報ｘ_１を示す行列式である。図１３（ａ）の行数は、映画ジャンルの数に対応している。また、２行目が”ａｃｔｉｏｎ”を示すとする。よって、入力情報ｘ_１が”ａｃｔｉｏｎ”である場合、行列式の２行目の値が１となる。 <Example>
FIG. 13 is a diagram illustrating an example of a determinant indicating input information. The input information shown in FIG. 13 corresponds to the input information shown in FIG. Figure 13 (a) is a determinant indicating the input information _{x 1} showing the "action". The number of rows in FIG. 13A corresponds to the number of movie genres. The second line indicates “action”. Therefore, when the input information x ₁ is “action”, the value in the second row of the determinant is 1.

図１３（ｂ）は、”Ｄｉｓｎｅｙ”を示す入力情報ｘ_２を示す行列式である。図１３（ｂ）の行数は、製作会社の数に対応している。また、１行目が”Ｄｉｓｎｅｙ”を示すとする。よって、入力情報ｘ_２が”Ｄｉｓｎｅｙ”である場合、行列式の１行目の値が１となる。 Figure 13 (b) is a matrix equation representing the input information _{x 2} indicating the "Disney". The number of rows in FIG. 13B corresponds to the number of production companies. Also, assume that the first line indicates “Disney”. Therefore, when the input information _{x 2} is "Disney", the value of the first row of the determinant is 1.

図１３（ｃ）は、”ダイナミックな映画”を示す入力情報ｘ_３を示す行列式である。図１３（ｃ）の行数は、登録単語数に対応し、列数（次元数）は、入力情報内の単語数に対応している。また、２行目が”ダイナミック”を示し、４行目が”な”を示し、５行目が”映画”を示す。 FIG. 13C is a determinant indicating the input information x ₃ indicating “dynamic movie”. The number of rows in FIG. 13C corresponds to the number of registered words, and the number of columns (number of dimensions) corresponds to the number of words in the input information. The second line indicates “dynamic”, the fourth line indicates “na”, and the fifth line indicates “movie”.

なお、実際の運用において登録単語数は、例えば、約１０万語であり、入力情報内の単語数は約１０語であるとする。その場合、パターン数は、１０万×１０＝１００万パターンとなる。このパターン数は、入力情報が単語である場合のパターン数と比べると大きく異なるため、単語から文章を直接予測することは困難である。 In actual operation, the number of registered words is, for example, about 100,000 words, and the number of words in the input information is about 10. In that case, the number of patterns is 100,000 × 10 = 1 million patterns. Since the number of patterns is greatly different from the number of patterns when the input information is a word, it is difficult to predict a sentence directly from the word.

しかし、中間情報ｘ_３’は、図５に示すように１次元の情報である。すなわち、入力情報から中間情報ｘ_３’に変換すると含まれる単語の順序は失われるが、単語の意味は失われない。 However, the intermediate information x ₃ ′ is one-dimensional information as shown in FIG. That is, when the input information is converted to the intermediate information x ₃ ′, the order of the included words is lost, but the meaning of the words is not lost.

（１）学習時における第１のパラメータ生成処理の例
以下、学習時における第１生成部１２によるパラメータ生成の例について説明する。図１４は、実施例における各情報およびパラメータを示す図である。ｈは、中間情報ｘ_１’、ｘ_２’、ｘ_３’の組を示す。ｈ以外の各情報、パラメータは図３に示す例と同様である。 (1) Example of First Parameter Generation Processing During Learning Hereinafter, an example of parameter generation by the first generation unit 12 during learning will be described. FIG. 14 is a diagram illustrating information and parameters in the embodiment. h represents a set of intermediate information x ₁ ′, x ₂ ′, x ₃ ′. Each information and parameter other than h is the same as the example shown in FIG.

中間情報ｘ_１’、ｘ_２’、ｘ_３’は、下記の式（１）〜（３）のように表される。 The intermediate information x ₁ ′, x ₂ ′, x ₃ ′ is expressed as the following formulas (1) to (3).

また、中間情報ｘ_１’、ｘ_２’、ｘ_３’の組を示すｈは、下記の式（４）のように表される。 Further, h indicating a set of the intermediate information x ₁ ′, x ₂ ′, and x ₃ ′ is expressed as the following formula (4).

また、出力情報ｙの予測値であるｐｒｅｄ＿ｙは、下記の式（５）のように表される。 Also, pred_y, which is the predicted value of the output information y, is expressed as in the following equation (5).

なお、式（５）のsoftmax関数は、以下のように定義される。
softmax(x)＝exp(x)/sum(exp(x)) Note that the softmax function of Equation (5) is defined as follows.
softmax (x) = exp (x) / sum (exp (x))

第１生成部１２は、ｐｒｅｄ＿ｙとｙとが近づくように、各パラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）を調整する。第１生成部１２は、例えば、バックプロパゲーションを用いて学習する。ｐｒｅｄ＿ｙとｙとの差（ｌｏｓｓ）としては、SoftMaxCrossEntropy、またはMean Squared Error（ＭＳＥ）等が用いられる。 The first generator 12, so as to approach and the pred_y and y, adjusting each parameter _{_{_{(w 1 ~w 3, w h}}} ). The 1st production | generation part 12 learns using back propagation, for example. As the difference (loss) between pred_y and y, SoftMaxCrossEntropy, Mean Squared Error (MSE), or the like is used.

運用時において、第３生成部１６は、第１生成部１２が生成した各パラメータ（ｗ_１〜ｗ_３、ｗ_ｈ）を用いてｐｒｅｄ＿ｙを算出し、算出されたｐｒｅｄ＿ｙから予測した値を出力情報ｙとして出力する。例えば、式（５）のｐｒｅｄ＿ｙでは、１行目の０．９が最も大きく、１行目が”ｖｅｒｙｇｏｏｄ”を示す場合、第３生成部１６は、ｙが”ｖｅｒｙｇｏｏｄ”であると予測する。 During operation, the third generation unit 16 calculates pred_y using each parameter (w _{1 to} w ₃ , w _h ) generated by the first generation unit 12 and outputs a value predicted from the calculated pred_y as output information. Output as y. For example, in pred_y of Expression (5), when 0.9 in the first line is the largest and the first line indicates “very good”, the third generation unit 16 predicts that y is “very good”. To do.

（２）学習時における第２のパラメータ生成処理の例
次に、学習時において、第２生成部１３が中間情報ｘ_３’を生成するためのパラメータを生成する方法の例を示す。以下の説明は、図６に示す処理の実施例に相当する。ｘ_１に基づく中間情報ｘ_３’の予測値であるｐｒｅｄ１＿ｘ_３’は、下記の式（６）のように表される。 (2) Example of Second Parameter Generation Processing During Learning Next, an example of a method in which the second generation unit 13 generates a parameter for generating the intermediate information x ₃ ′ during learning will be described. The following description corresponds to the embodiment of the process shown in FIG. pred1_x ₃ ′, which is a predicted value of the intermediate information x ₃ ′ based on x ₁ , is expressed as the following Expression (6).

また、ｐｒｅｄ１＿ｘ_３’とｘ_３’との差は、ＭＳＥを用いた場合、下記の式（７）のように表される。 Further, the difference between pred1_x ₃ ′ and x ₃ ′ is expressed as in the following formula (7) when MSE is used.

第２生成部１３は、式（７）に示すｌｏｓｓが最も小さくなるようにパラメータｗ’_３１を生成する。同様に、第２生成部１３は、ｘ_２から中間情報ｘ_３’を生成するためのパラメータｗ’_３２を生成する。 The second generation unit 13 generates the parameter w ′ ₃₁ so that the loss shown in Expression (7) is minimized. Similarly, the second generation unit 13 generates a parameter w ′ ₃₂ for generating intermediate information x ₃ ′ from x ₂ .

（３）運用時に文章（ｘ_３）が欠損していた場合の出力情報生成処理の例
次に運用時において、文章（ｘ_３）が欠損していた場合の出力情報生成処理を説明する。第３生成部１６は、入力情報ｘ_１と学習時に生成されたｗ_１とに基づいて、下記の式（８）のように中間情報ｘ_１’を生成する。 (3) in the example then during operation of the output information generation process when the sentence (x ₃₎ were missing at the time of operation, illustrating the output information generation process when the sentence (x ₃₎ was missing. The third generation unit 16 generates intermediate information x ₁ ′ based on the input information x ₁ and w ₁ generated at the time of learning as shown in the following formula (8).

また、第３生成部１６は、入力情報ｘ_２と学習時に生成されたｗ_２とに基づいて、下記の式（９）のように中間情報ｘ_２’を生成する。 The third generator 16, based on the input information x ₂ and w ₂ that is generated at the time of learning, to generate intermediate information x _{2 'as} in the following formula (9).

そして、変換部１５は、学習時に第２生成部１３が生成したｗ’_３１に基づいて、下記の式（１０）のように、入力情報ｘ_１をｘ_３’＿１に変換する。 Then, the conversion unit 15 'based on the _31, as in the following equation (10), the input information _{x 1} _{x 3'} w the second generator 13 has generated at the time of learning to convert _1.

そして、変換部１５は、学習時に第２生成部１３が生成したｗ’_３２に基づいて、下記の式の式（１１）のように、入力情報ｘ_２をｘ_３’＿２に変換する。 Then, the conversion unit 15 'based on the _32, as shown in equation (11) in the following equation, the input information _{x 2} _{x 3'} w the second generator 13 has generated at the time of learning to convert _2.

そして、変換部１５は、下記の式（１２）のように、ｘ_１から生成したｘ_３’＿１とｘ_２から生成したｘ_３’＿２との平均値を、ｘ_３’とする。 Then, the conversion unit 15, as in the following equation (12), the average value of the _2 _{'x 3} is generated from _1 and _{x 2'} _{of x 3} generated from _{x 1,} and _{x 3} '.

中間情報の組ｈは、上記の式（１０）〜（１２）のように生成されたｘ_１’、ｘ_２’、ｘ_３’を用いると、下記の式（１３）のように表される。 The set h of intermediate information is represented by the following equation (13) when x ₁ ′, x ₂ ′, and x ₃ ′ generated as in the above equations (10) to (12) are used. .

第３生成部１６は、学習時に生成されたＷ_ｈと上記のように算出されたｈとに基づいて下記の式（１４）のように出力情報ｙを生成する。この出力情報ｙは、第２の情報の一例である。 The third generation unit 16 generates output information y based on W _h generated at the time of learning and _h calculated as described above, as in Expression (14) below. This output information y is an example of second information.

（４）運用時に単語（ｘ_２）が欠損していた場合の出力情報生成処理の例
次に運用時において、単語（ｘ_２）が欠損していた場合の出力情報生成処理を説明する。変換部１５は、学習時に生成されたｗ_２１に基づいて、下記の式（１５）のように入力情報ｘ_１を入力情報ｘ_２の予測値であるｐｒｅｄ１＿ｘ_２に変換する。 (4) at the time of output information generating process instances then operation when a word (x ₂₎ were missing at the time of operation, illustrating the output information generation process in the case where the word (x ₂₎ was missing. Converting unit 15, based on the _{w 21} generated at the time of learning, into a Pred1_x ₂ is the predicted value of the input information _{x 2} input information _{x 1} as the following equation (15).

また、変換部１５は、学習時に生成されたｗ_２３に基づいて、下記の式（１６）のように入力情報ｘ_３を入力情報ｘ_２の予測値であるｐｒｅｄ３＿ｘ_２に変換する。 The conversion unit 15, based on the _{w 23} generated at the time of learning, into a Pred3_x ₂ is the predicted value of the input information _{x 2} input information _{x 3} as in the following formula (16).

式（１６）におけるＷ_２３ｓｕｍ（ｘ_３）は、下記の式（１７）のように計算される。 W ₂₃ sum (x ₃ ) in equation (16) is calculated as in equation (17) below.

変換部１５は、上記のように生成されたｐｒｅｄ１＿ｘ_２、およびｐｒｅｄ３＿ｘ_２の平均値を計算することにより、下記の式（１８）のようにｘ_２を生成する。 Converting unit 15, by calculating the Pred1_x _2, and pred3_x average value of ₂ is generated as described above, to generate a _{x 2} as in the following formula (18).

上記ｘ_２のうち、３行目の０．４２が最も大きい値であるため、変換部１５は、０．４２を１に変換し、他の数字を０に変換する。変換部１５は、下記の式（１９）のようなｘ_２を得る。 Of the above _{x 2,} for 0.42 in the third row it is the largest value, the converting unit 15 converts 0.42 to 1, to convert the other numbers 0. Conversion unit 15 obtains the _{x 2} as the following equation (19).

第３生成部１６は、上記の計算で得られたｘ_２を含む入力情報（ｘ_１、ｘ_２、ｘ_３）と、学習時に得られたＷ_ｈ、Ｗ_１、Ｗ_２、Ｗ_３とに基づいて出力情報ｙを生成する。 The third generation unit 16 uses the input information (x ₁ , x ₂ , x ₃ ) including x ₂ obtained by the above calculation and the W _h , W ₁ , W ₂ , W ₃ obtained at the time of learning. Based on this, output information y is generated.

図１５は、欠損および外れ値を含む情報の例を示す図である。図１５（ａ）は、欠損を含む入力情報の例である。図１５（ａ）のｘ_１は、全ての要素が０であるため、判定部１４は、欠損があると判定する。 FIG. 15 is a diagram illustrating an example of information including missing and outliers. FIG. 15A shows an example of input information including a defect. Figure 15 _{x 1} in (a) determines that since all elements are 0, the determination unit 14 is defective.

図１５（ｂ）は、外れ値を含む入力情報の例である。図１５（ｂ）のｘ_１およびｘ_２の組は、学習時の入力情報に存在しなかった組み合わせであるとする。判定部１４は、このような入力情報に含まれる値を、外れ値であるとみなしてもよい。 FIG. 15B is an example of input information including outliers. The set of x ₁ and x ₂ in FIG. 15 (b), and a combination that did not exist in the input information at the time of learning. The determination unit 14 may regard a value included in such input information as an outlier.

図１５（ｃ）は、外れ値を含む入力情報の例である。図１５（ｃ）のｘ_３は、学習時の入力情報に存在しなかった情報であるとする。判定部１４は、このような入力情報に含まれる値を、外れ値であるとみなしてもよい。 FIG. 15C is an example of input information including an outlier. Figure 15 x ₃ in (c) is assumed to be information that did not exist in the input information at the time of learning. The determination unit 14 may regard a value included in such input information as an outlier.

次に、生成装置１のハードウェア構成の一例を説明する。図１６は、生成装置１のハードウェア構成の一例を示す図である。図１６の例に示すように、生成装置１において、バス１００に、プロセッサ１１１とメモリ１１２と補助記憶装置１１３と通信インタフェース１１４と媒体接続部１１５と入力装置１１６と出力装置１１７とが接続される。 Next, an example of the hardware configuration of the generation apparatus 1 will be described. FIG. 16 is a diagram illustrating an example of a hardware configuration of the generation apparatus 1. As shown in the example of FIG. 16, in the generation device 1, a processor 111, a memory 112, an auxiliary storage device 113, a communication interface 114, a medium connection unit 115, an input device 116, and an output device 117 are connected to the bus 100. .

プロセッサ１１１は、メモリ１１２に展開されたプログラムを実行する。実行されるプログラムには、実施形態における処理を行う生成プログラムが適用されてもよい。 The processor 111 executes the program expanded in the memory 112. A generation program that performs the processing in the embodiment may be applied to the program to be executed.

メモリ１１２は、例えば、Random Access Memory(RAM)である。補助記憶装置１１３は、種々の情報を記憶する記憶装置であり、例えばハードディスクドライブや半導体メモリ等が適用されてもよい。補助記憶装置１１３に実施形態の処理を行う生成プログラムが記憶されていてもよい。 The memory 112 is, for example, a random access memory (RAM). The auxiliary storage device 113 is a storage device that stores various information, and for example, a hard disk drive, a semiconductor memory, or the like may be applied. The auxiliary storage device 113 may store a generation program that performs the processing of the embodiment.

通信インタフェース１１４は、Local Area Network（LAN）、Wide Area Network（WAN）等の通信ネットワークに接続され、通信に伴うデータ変換等を行う。 The communication interface 114 is connected to a communication network such as a local area network (LAN) or a wide area network (WAN), and performs data conversion associated with communication.

媒体接続部１１５は、可搬型記録媒体１１８が接続可能なインタフェースである。可搬型記録媒体１１８には、光学式ディスク（例えば、Compact Disc(CD)やDigital Versatile Disc(DVD))、半導体メモリ等が適用されてもよい。可搬型記録媒体１１８に実施形態の処理を行う生成プログラムが記録されていてもよい。 The medium connection unit 115 is an interface to which a portable recording medium 118 can be connected. As the portable recording medium 118, an optical disc (for example, Compact Disc (CD) or Digital Versatile Disc (DVD)), a semiconductor memory, or the like may be applied. A generation program for performing the processing of the embodiment may be recorded on the portable recording medium 118.

入力装置１１６は、例えば、キーボード、ポインティングデバイス等であり、ユーザからの指示及び情報等の入力を受け付ける。 The input device 116 is, for example, a keyboard, a pointing device, and the like, and receives input from the user such as instructions and information.

出力装置１１７は、例えば、表示装置、プリンタ、スピーカ等であり、ユーザへの問い合わせ又は指示、及び処理結果等を出力する。生成装置１は、出力装置１１７を用いて、生成したパラメータ、および出力情報等を出力（表示または印刷等）してもよい。 The output device 117 is, for example, a display device, a printer, a speaker, or the like, and outputs an inquiry or instruction to the user, a processing result, and the like. The generation apparatus 1 may output (display or print) the generated parameters, output information, and the like using the output apparatus 117.

図１に示す記憶部１７は、メモリ１１２、補助記憶装置１１３または可搬型記録媒体１１８等により実現されてもよい。図１に示す取得部１１、第１生成部１２、第２生成部１３、判定部１４、変換部１５、および第３生成部１６は、メモリ１１２に展開された生成プログラムをプロセッサ１１１が実行することにより実現されてもよい。 The storage unit 17 illustrated in FIG. 1 may be realized by the memory 112, the auxiliary storage device 113, the portable recording medium 118, or the like. In the acquisition unit 11, the first generation unit 12, the second generation unit 13, the determination unit 14, the conversion unit 15, and the third generation unit 16 illustrated in FIG. 1, the processor 111 executes the generation program expanded in the memory 112. May be realized.

メモリ１１２、補助記憶装置１１３および可搬型記録媒体１１８は、コンピュータが読み取り可能であって非一時的な有形の記憶媒体であり、信号搬送波のような一時的な媒体ではない。 The memory 112, the auxiliary storage device 113, and the portable recording medium 118 are computer-readable, non-temporary tangible storage media, and are not temporary media such as signal carriers.

＜その他＞
本実施形態は、以上に述べた実施の形態に限定されるものではなく、本実施形態の要旨を逸脱しない範囲内で様々な変更、追加、省略が適用可能である。 <Others>
The present embodiment is not limited to the embodiment described above, and various modifications, additions, and omissions can be applied without departing from the gist of the present embodiment.

１生成装置
１１取得部
１２第１生成部
１３第２生成部
１４判定部
１５変換部
１６第３生成部
１７記憶部
１００バス
１１１プロセッサ
１１２メモリ
１１３補助記憶装置
１１４通信インタフェース
１１５媒体接続部
１１６入力装置
１１７出力装置
１１８可搬型記録媒体 DESCRIPTION OF SYMBOLS 1 Generation apparatus 11 Acquisition part 12 1st generation part 13 2nd generation part 14 Judgment part 15 Conversion part 16 3rd generation part 17 Memory | storage part 100 Bus 111 Processor 112 Memory 113 Auxiliary storage device 114 Communication interface 115 Medium connection part 116 Input device 117 output device 118 portable recording medium

Claims

Computer
A set of a first word input in association with the first input item, a first sentence input in association with the second input item, and the first word and the first sentence. And learning information including the first information in response,
Based on the learning information, a set of first intermediate information in a format different from the first sentence according to the first word and the first sentence is used as information according to the first information. Generating a first conversion parameter for conversion and a second conversion parameter for converting the first word into the first intermediate information;
If an input of a second word associated with the first input item is received without receiving an input of a sentence associated with the second input item, the second conversion parameter causes the second To generate second intermediate information of the format, and to convert a set of the second word and the second intermediate information according to the first conversion parameter, thereby generating second information Generate
A generation method characterized by executing processing.

The generation method according to claim 1, wherein the first intermediate information is information having a format different from that of the first sentence, which indicates a meaning of the first sentence.

3. The generation method according to claim 1, wherein the number of dimensions of the matrix indicating the first intermediate information is smaller than the number of dimensions of the matrix indicating the first sentence.

A set of a first word input in association with the first input item, a first sentence input in association with the second input item, and the first word and the first sentence. And learning information including the first information in response,
Based on the learning information, a set of first intermediate information in a format different from the first sentence according to the first word and the first sentence is used as information according to the first information. Generating a first conversion parameter for conversion and a second conversion parameter for converting the first word into the first intermediate information;
If an input of a second word associated with the first input item is received without receiving an input of a sentence associated with the second input item, the second conversion parameter causes the second To generate second intermediate information of the format, and to convert a set of the second word and the second intermediate information according to the first conversion parameter, thereby generating second information Generate
A generation program that causes a computer to execute processing.

A set of a first word input in association with the first input item, a first sentence input in association with the second input item, and the first word and the first sentence. An acquisition unit for acquiring learning information including the first information in response,
Based on the learning information, a set of first intermediate information in a format different from the first sentence according to the first word and the first sentence is used as information according to the first information. A parameter generation unit for generating a first conversion parameter to be converted and a second conversion parameter for converting the first word into the first intermediate information;
If an input of a second word associated with the first input item is received without receiving an input of a sentence associated with the second input item, the second conversion parameter causes the second To generate second intermediate information of the format, and to convert a set of the second word and the second intermediate information according to the first conversion parameter to obtain second information An information generation unit for generating
A generating apparatus comprising: