JP2019194759A

JP2019194759A - Dialogue system reinforcement device and computer program

Info

Publication number: JP2019194759A
Application number: JP2018088218A
Authority: JP
Inventors: 阿部　憲幸; Noriyuki Abe; 憲幸阿部; 可奈子大西; Kanako Onishi; 健太郎鳥澤; Kentaro Torisawa; 豊木俵; Yutaka Kidawara
Original assignee: National Institute of Information and Communications Technology
Current assignee: National Institute of Information and Communications Technology
Priority date: 2018-05-01
Filing date: 2018-05-01
Publication date: 2019-11-07
Anticipated expiration: 2038-05-01
Also published as: JP7044245B2

Abstract

【課題】既存のシステムを、特定の目的に沿った特徴にしたがって動作するように補強できる対話システム補強装置を提供する。【解決手段】対話システム５２とともに動作し、対話システム５２による対話に特徴を与える対話システム補強装置５０は、対話システム５２に与えられる入力５４を受け、対話システム５２に代わって当該入力に対する応答を出力すべきか否かを判定する応答可否判定部６４と、応答可否判定部６４による判定が肯定であることに応答して、入力に基づいて予め準備された応答生成方法により応答を生成するための応答生成部７２と、応答生成部７２の出力と、対話システム５２の出力とを受けるように接続され、応答可否判定部６４の判定結果が肯定か否かにしたがって、応答生成部７２の出力と、対話システム５２の出力とを選択的に出力する選択部７４とを含む。【選択図】図１PROBLEM TO BE SOLVED: To provide an interactive system reinforcing device capable of reinforcing an existing system so as to operate in accordance with characteristics according to a specific purpose. SOLUTION: A dialogue system reinforcing device 50 which operates together with a dialogue system 52 and gives a characteristic to dialogue by the dialogue system 52 receives an input 54 given to the dialogue system 52, and outputs a response to the input on behalf of the dialogue system 52. A response propriety determination unit 64 that determines whether or not to respond, and a response that responds to the determination by the response propriety determination unit 64 being affirmative to generate a response by a response generation method prepared in advance based on the input. A generator 72, an output of the response generator 72, and an output of the dialogue system 52 are connected so as to receive the output of the response generator 72 according to whether or not the determination result of the response availability determination unit 64 is affirmative. And a selector 74 for selectively outputting the output of the dialogue system 52. [Selection diagram] Fig. 1

Description

この発明は対話システムに関し、特に、既存の対話システムと協調動作して、既存の対話システムに特徴を与える技術に関する。 The present invention relates to a dialogue system, and more particularly, to a technique for giving a feature to an existing dialogue system by cooperating with the existing dialogue system.

近年、機械学習及び深層学習等の技術の発達により、対話システムの開発が盛んに行なわれている。すでに様々な対話システムが利用可能となっている。例えばスマートフォン、パーソナルコンピュータ等、既存のシステムでこれらが利用可能になっているだけでなく、実体はパーソナルコンピュータと同様でありながら、モニタもキーボードも持たず、音声の入出力のみでユーザとの対話を行うようにした装置も市販されている。 In recent years, dialogue systems have been actively developed due to the development of technologies such as machine learning and deep learning. Various dialogue systems are already available. For example, smartphones and personal computers are not only available in existing systems, but they are similar to personal computers but do not have a monitor or keyboard. There are also commercially available devices that perform the above.

これらの対話システムは、用途を限っていないことから、どんな発言に対してもそれなりの応答を返すことができる。しかし、特定の用途に使用することを考えると改善の余地が多分にある。 Since these interactive systems are not limited in use, they can return a reasonable response to any utterance. However, there is plenty of room for improvement when considering use for specific purposes.

例えば子供に使って欲しい対話システムについて考える。そうしたシステムは、ただ漫然と子供と対話を行うだけでなく、対話をする中で、例えば両親、学校の先生のように、様々な知識を子供に伝えられるものであることが望ましい。すなわち、教育向けに特化した対話システムがあると望ましい。 For example, consider a dialogue system that children want to use. It is desirable for such a system not only to communicate with children in a casual manner, but also to communicate various knowledge to children, such as parents and school teachers, for example. In other words, it is desirable to have a dialogue system specialized for education.

このような教育向けの対話システムを開発する場合、既存の汎用的な対話システムがあったとしても、データベースを入れ替える等して応答の内容を大幅に変更したり、対話システム自体を開発し直したりする必要がある。対話システム自体を開発し直すためには高コストが必要となる。また既存の対話システムにおいて応答の内容を大幅に変更したりすると、せっかく開発した対話システムの良さを捨てることにもなりかねない。 When developing such a dialogue system for education, even if there is an existing general-purpose dialogue system, the content of the response can be changed drastically by replacing the database, or the dialogue system itself can be re-developed. There is a need to. In order to redevelop the dialogue system itself, high costs are required. In addition, if the content of the response is changed significantly in the existing dialog system, the goodness of the dialog system that has been developed may be thrown away.

したがって、既存の対話システムを用いて、教育向け等の特定のシーンに合致するような対話を可能にする技術が必要である。 Therefore, there is a need for a technology that enables dialogue that matches a specific scene such as for education using an existing dialogue system.

既存の対話システムを利用して特定のシーンに合う対話を可能とする新たな対話システムを開発する従来技術が後掲の特許文献１に開示されている。特許文献１は、予め複数の特定のシーンにそれぞれ対応する複数のデータベースを準備し、人間からの指示に基づいて対話のシーンを選択する対話システムを開示している。この対話システムは、選択されたシーンに応じたデータベースを用いて、そのシーンに即した自然対話を構成する応答文を作成し、それを音声合成して出力する。 Japanese Laid-Open Patent Publication No. 2003-228561 discloses a conventional technique for developing a new dialog system that enables a dialog suitable for a specific scene using an existing dialog system. Patent Document 1 discloses a dialogue system that prepares a plurality of databases respectively corresponding to a plurality of specific scenes and selects a dialogue scene based on an instruction from a human. This dialogue system uses a database corresponding to a selected scene to create a response sentence that constitutes a natural dialogue according to the scene, and synthesizes it as a speech and outputs it.

また後掲の特許文献２には、内部構成が明らかでないものを含む複数の対話システムを含む対話応答装置において、発話入力に適した対話システムから出力される応答候補を適切に選択するものが開示されている。 Further, Patent Document 2 listed below discloses an apparatus that appropriately selects response candidates output from an interactive system suitable for speech input in an interactive response apparatus including a plurality of interactive systems including those whose internal configuration is not clear. Has been.

特開2004-219714号公報JP 2004-219714 A 特開2016-212541号公報JP 2016-212541 A

しかし、従来技術には解決すべき以下のような問題が残っている。特許文献１に開示された技術の場合、既存のシステムがあっても利用できないという問題が有る。既存の対話システムが活用できないため、ゼロから各シーンに適した対話が可能な対話システムを開発する必要がある。既に対話システムを保有しているもの、又は既存の対話システムを活用したいものにとっては、対話システムの開発コストが高すぎるため、特許文献１の手法は採用できない。また、既存の対話システムで既にサービスの提供を開始している場合には、全く新しい対話システムを投入することはコスト面の問題を引き起こすだけでなく、ユーザから見たときの対話システムの動作に大きな変化が生じやすく、ユーザにとって使いにくくなるという問題もある。 However, the following problems remain to be solved in the prior art. In the case of the technique disclosed in Patent Document 1, there is a problem that it cannot be used even if there is an existing system. Since an existing dialog system cannot be used, it is necessary to develop a dialog system capable of dialog suitable for each scene from scratch. For those who already have a dialogue system or who want to utilize an existing dialogue system, the method of Patent Document 1 cannot be adopted because the development cost of the dialogue system is too high. In addition, if an existing interactive system has already started providing services, introducing a completely new interactive system will not only cause cost problems, but will also affect the operation of the interactive system as seen by the user. There is also a problem that large changes are likely to occur and are difficult for the user to use.

一方、特許文献２は、既存の対話システムを活用する有用な方法を提案している。しかし、特許文献２では、既存の対話システムが再利用できる範囲は、その既存の対話システムがもともと目的としている活用範囲内に止まっているという問題がある。例えば、既存の対話システムにとって目的外のジャンルにその対話システムを特化させることはできない。 On the other hand, Patent Document 2 proposes a useful method of utilizing an existing dialogue system. However, in Patent Document 2, there is a problem that the range in which an existing dialogue system can be reused is within the range of utilization originally intended by the existing dialogue system. For example, the dialog system cannot be specialized in a genre that is not intended for the existing dialog system.

したがって本発明の目的は、既存のシステムを、特定の目的に沿った特徴にしたがって動作するように補強できる対話システム補強装置及びそのためのプログラムを提供することである。 Accordingly, an object of the present invention is to provide an interactive system reinforcement device and a program therefor that can reinforce an existing system so as to operate according to characteristics according to a specific purpose.

本発明の第１の局面に係る対話システム補強装置は、対話システムとともに動作し、対話システムによる対話に特徴を与える対話システム補強装置であって、対話システムに与えられる入力を受け、対話システムに代わって当該入力に対する応答を出力すべきか否かを判定する第１の判定手段と、第１の判定手段による判定が肯定であることに応答して、入力に基づいて予め準備された応答生成方法により応答を生成するための第１の応答生成手段と、第１の応答生成手段の出力と、対話システムの出力とを受けるように接続され、第１の判定手段の判定結果が肯定か否かにしたがって、第１の応答生成手段の出力と、対話システムの出力とを選択的に出力する第１の選択手段とを含む。 An interactive system reinforcement device according to a first aspect of the present invention is an interactive system reinforcement device that operates together with an interactive system and gives a characteristic to an interactive operation by the interactive system, receives an input given to the interactive system, and replaces the interactive system. A first determination unit that determines whether or not to output a response to the input, and a response generation method prepared in advance based on the input in response to a positive determination by the first determination unit It is connected to receive the first response generation means for generating the response, the output of the first response generation means, and the output of the dialogue system, and whether or not the determination result of the first determination means is affirmative Accordingly, it includes first selection means for selectively outputting the output of the first response generation means and the output of the dialogue system.

好ましくは、第１の判定手段は、単語をキーとして当該単語に関連する応答文候補を検索できるように複数の応答文候補を記憶した応答文候補記憶手段と、入力から当該入力に含まれる単語を抽出する単語抽出手段と、単語抽出手段により抽出された単語をキーとして応答文候補記憶手段から応答文候補を検索し取出すための応答文候補取出手段と、応答文候補取出手段により取出された応答文候補があったか否かにしたがって、対話システムに代わって入力に対する応答を出力すべきか否かを判定するための手段とを含む。 Preferably, the first determination unit includes a response sentence candidate storage unit that stores a plurality of response sentence candidates so that a response sentence candidate related to the word can be searched using the word as a key, and a word included in the input from the input Extracted by the word extraction means, the response sentence candidate extraction means for searching for and extracting the response sentence candidates from the response sentence candidate storage means using the word extracted by the word extraction means as a key, and the response sentence candidate extraction means Means for determining whether or not to output a response to the input on behalf of the dialogue system according to whether or not there is a response sentence candidate.

より好ましくは、第１の応答生成手段は、応答文候補取出手段により取出された応答文候補の各々について、特徴を表している程度を示す特徴スコアを算出するためのスコア算出手段と、応答文候補取出手段により取出された応答文候補の内で、スコア算出手段により最も高い特徴スコアが算出された応答文候補を選択し第１の選択手段に出力するための手段とを含む。 More preferably, the first response generation means includes a score calculation means for calculating a feature score indicating a degree representing a feature for each of the response sentence candidates extracted by the response sentence candidate extraction means, and a response sentence Means for selecting a response sentence candidate having the highest feature score calculated by the score calculation means from among the response sentence candidates extracted by the candidate extraction means, and outputting the response sentence candidate to the first selection means.

さらに好ましくは、第１の応答生成手段は、単語をキーとして当該単語に関連する応答文候補を検索できるように複数の応答文候補を記憶した応答文候補記憶手段と、入力から当該入力に含まれる単語を抽出する単語抽出手段と、単語抽出手段により抽出された単語をキーとして応答文候補記憶手段から応答文候補を検索し取出すための応答文候補取出手段と、応答文候補取出手段により取出された応答文候補の各々について、特徴を表している程度を示す特徴スコアを算出するためのスコア算出手段と、応答文候補取出手段により取出された応答文候補の内で、スコア算出手段により最も高い特徴スコアが算出された応答文候補を選択し第１の選択手段に出力するための手段とを含む。 More preferably, the first response generation unit includes a response sentence candidate storage unit that stores a plurality of response sentence candidates so that a response sentence candidate related to the word can be searched using the word as a key, and is included in the input from the input. Word extraction means for extracting a word to be read, response sentence candidate extraction means for searching for and extracting response sentence candidates from the response sentence candidate storage means using the word extracted by the word extraction means as a key, and extraction by the response sentence candidate extraction means For each of the response sentence candidates, a score calculation means for calculating a feature score indicating a degree representing the feature and a response sentence candidate extracted by the response sentence candidate extraction means, the score calculation means most Means for selecting a response sentence candidate for which a high feature score has been calculated and outputting it to the first selection means.

好ましくは、第１の応答生成手段は、複数の特定対話データ項目を記憶した特定対話データ項目記憶手段を含み、特定対話データ項目の各々は、第１の応答文と、当該第１の応答文の後に出力が予定される第２の応答文と、少なくとも第１の応答文に含まれる第１の単語とを含み、第１の応答生成手段はさらに、入力に対する対話システムの出力するシステム応答文を受信するよう対話システムに接続され、当該システム応答文に含まれる単語を抽出するための単語抽出手段と、単語抽出手段により抽出された単語を第１の単語として含む特定対話データ項目を特定対話データ項目記憶手段から取出すためのデータ項目取出手段と、データ項目取出手段により取出された特定対話データ項目に含まれる第１の応答文を出力して第１の選択手段に入力するための手段と、データ項目取出手段により特定対話データ項目が取出されたか否かを判定する第２の判定手段と、第１の応答文を出力したことを示す出力履歴を記憶するための履歴記憶手段とを含み、第１の判定手段は、対話システムに入力が与えられたことに応答して、履歴記憶手段を参照し、当該入力が第１の応答文の出力された直後の入力であるか否かにしたがって、対話システムに代わって当該入力に対する応答を出力すべきか否かを判定する第３の判定手段を含み、対話システム補強装置はさらに、第２の判定手段により、対話システムに与えられた入力が第１の応答文の出力された直後の入力であると判定されたことに応答して、第１の応答文と同じ特定対話データ項目内の第２の応答文に基づいて応答を生成し、出力するための第２の応答生成手段を含み、第１の選択手段は、第１の応答生成手段の出力と、対話システムの出力と、第２の応答生成手段の出力とを受けるように接続され、第２の判定手段の判定結果と第３の判定手段による判定結果との組合せにしたがって、第１の応答生成手段の出力と、第２の応答生成手段の出力と、対話システムの出力とを選択的に出力する第２の選択手段を含む。 Preferably, the first response generation unit includes a specific dialogue data item storage unit that stores a plurality of specific dialogue data items, and each of the specific dialogue data items includes a first response sentence and the first response sentence. A first response sentence to be output after, and at least a first word included in the first response sentence, and the first response generation means further includes a system response sentence output by the dialog system for the input Is connected to the dialogue system so as to receive the word, and the word extraction means for extracting the word included in the system response sentence, and the specific dialogue data item including the word extracted by the word extraction means as the first word Data item extraction means for extracting from the data item storage means, and first selection means for outputting a first response sentence included in the specific dialog data item extracted by the data item extraction means Means for inputting, second determination means for determining whether or not the specific dialogue data item has been extracted by the data item extraction means, and for storing an output history indicating that the first response sentence has been output The first determination means refers to the history storage means in response to the input being given to the dialogue system, and the input is input immediately after the first response sentence is output. And determining whether or not to output a response to the input in place of the dialog system, the dialog system reinforcement device further includes a dialog system by the second determination means. In response to determining that the input given to the input is the input immediately after the output of the first response sentence, based on the second response sentence in the same specific interaction data item as the first response sentence Generate a response Second response generating means for performing the operation, wherein the first selecting means is connected to receive the output of the first response generating means, the output of the dialogue system, and the output of the second response generating means. The output of the first response generating means, the output of the second response generating means, and the output of the dialogue system according to the combination of the determination result of the second determination means and the determination result of the third determination means Second selection means for selectively outputting is included.

本発明の第２の局面に係るコンピュータプログラム（以下単に「プログラム」という。）は、コンピュータを、上記したいずれかの装置の各手段として機能させる。 A computer program according to the second aspect of the present invention (hereinafter simply referred to as “program”) causes a computer to function as each unit of any of the above-described apparatuses.

上記した本願発明の技術的範囲及びその均等範囲は、以下の実施の形態の説明と図面とにより一層明らかになる。 The technical scope of the present invention and the equivalent scope thereof will become more apparent from the following description of the embodiments and the drawings.

図１は、本願発明の第１の実施の形態に係る対話システム補強装置の概略ブロック図である。FIG. 1 is a schematic block diagram of an interactive system reinforcement device according to a first embodiment of the present invention. 図２は、第１の実施の形態で使用されるニュースＤＢの構成を示す模式図である。FIG. 2 is a schematic diagram showing the configuration of the news DB used in the first embodiment. 図３は、第１の実施の形態に係る対話システム補強装置としてコンピュータを機能させるプログラムの制御構造を示すフローチャートである。FIG. 3 is a flowchart illustrating a control structure of a program that causes a computer to function as the interactive system reinforcement device according to the first embodiment. 図４は、第１の実施の形態において各応答候補のニュースらしさを示すスコアを算出するための判定モデルの学習システムを示すブロック図である。FIG. 4 is a block diagram illustrating a determination model learning system for calculating a score indicating the likelihood of news of each response candidate in the first embodiment. 図５は、図４に示す判定モデルを構成するマルチカラム畳み込みニューラルネットワーク（Multi Column Convolutional Neural Network：ＭＣＮＮ）の概略構造を示す図である。FIG. 5 is a diagram showing a schematic structure of a multi-column convolutional neural network (MCNN) constituting the determination model shown in FIG. 図６は、図５に示すＭＣＮＮの１カラムの概略構成及び動作原理を説明する模式図である。FIG. 6 is a schematic diagram illustrating a schematic configuration and an operation principle of one column of MCNN illustrated in FIG. 図７は、図５及び図６に示す判定モデルに入力される素性である第２の単語ベクトル列の生成方法を示す模式図である。FIG. 7 is a schematic diagram illustrating a method of generating a second word vector sequence that is a feature input to the determination model illustrated in FIGS. 5 and 6. 図８は、図５及び図６に示す判定モデルに入力される素性である第３の単語ベクトル列の生成方法を示す模式図である。FIG. 8 is a schematic diagram illustrating a method for generating a third word vector sequence that is a feature input to the determination model illustrated in FIGS. 5 and 6. 図９は、図５及び図６に示す判定モデルに入力される素性である第４の単語ベクトル列の生成方法を示す模式図である。FIG. 9 is a schematic diagram illustrating a method of generating a fourth word vector sequence that is a feature input to the determination model illustrated in FIGS. 5 and 6. 図１０は、コンピュータを、本発明の第１の実施の形態に係る対話システムにおけるニュースタイトルのスコアリング部として機能させるプログラムのフローチャートである。FIG. 10 is a flowchart of a program that causes a computer to function as a news title scoring unit in the interactive system according to the first embodiment of the present invention. 図１１は、コンピュータを第１の実施の形態に係る対話システムにおける応答生成部として機能させるプログラムのフローチャートである。FIG. 11 is a flowchart of a program that causes a computer to function as a response generation unit in the interactive system according to the first embodiment. 図１２は、本発明の第２の実施の形態に係る対話システム補強装置である質問回答システムの概略構成を示すブロック図である。FIG. 12 is a block diagram showing a schematic configuration of a question answering system which is an interactive system reinforcement device according to the second embodiment of the present invention. 図１３は、図１２に示す質問回答ＤＢの概略構成を示す模式図である。FIG. 13 is a schematic diagram showing a schematic configuration of the question answer DB shown in FIG. 図１４は、図１２に示す質問処理部の構成を示すブロック図である。FIG. 14 is a block diagram showing a configuration of the question processing unit shown in FIG. 図１５は、図１２に示す状態判定部の構成を示すブロック図である。15 is a block diagram showing a configuration of the state determination unit shown in FIG. 図１６は、図１２に示す回答処理部の構成を示すブロック図である。FIG. 16 is a block diagram showing a configuration of the answer processing unit shown in FIG. 図１７は、コンピュータを図１２に示す質問処理部として機能させるプログラムの制御構造を示すフローチャートである。FIG. 17 is a flowchart showing a control structure of a program that causes a computer to function as the question processing unit shown in FIG. 図１８は、コンピュータを図１２に示す状態判定部及び回答処理部として機能させるプログラムの制御構造を示すフローチャートである。FIG. 18 is a flowchart illustrating a control structure of a program that causes a computer to function as the state determination unit and the answer processing unit illustrated in FIG. 図１９は、第１及び第２の実施の形態を実現するコンピュータシステムの外観を示す図である。FIG. 19 is a diagram illustrating an external appearance of a computer system that implements the first and second embodiments. 図２０は、図１９に示すコンピュータの内部構成を示すブロック図である。FIG. 20 is a block diagram showing an internal configuration of the computer shown in FIG.

以下の説明及び図面では、同一の部品には同一の参照番号を付してある。したがって、それらについての詳細な説明は繰返さない。 In the following description and drawings, the same parts are denoted by the same reference numerals. Therefore, detailed description thereof will not be repeated.

＜第１の実施の形態＞
［概要］
第１の実施の形態に係る対話システム補強装置は、既存の対話システムと協調動作することにより、既存の対話システムがニュースに特化した対話を行うようにする。具体的には、この対話システム補強装置は、ユーザの発話がニュースで使われているような話題だった場合に、ニュースの情報を与え、同時にそのニュースに関する応答文を出力する。これを実現するために、まず入力された発話文から話題となる語を抽出する。その語が予め準備したニュース情報が入ったデータベース又はそれに類するもの（以下単に「データベース」と呼ぶ。）に含まれている場合には、そのデータベースから関連する情報を取得する。例えば入力に「テロ」という話題が含まれていた場合、「仏財務相：テロ資金の監視、日本と協力確認」というニュースタイトルを取得できる。この文を、ユーザの発話文とみなして既存の対話システムへ入力し、応答文を取得する。取得された応答文と、データベースから取得した文とを用いて、例えば「『仏財務相：テロ資金の監視、日本と協力確認』っていうニュースがあるみたい。テロ組織に対する監視を一層強化し、捜査当局は、ロンドンやマドリードで起きたような無差別テロを食い止めるらしい。」という応答文を生成し出力する。その結果、既存の対話システムがニュースに特化した対話システムとして利用できる。 <First Embodiment>
[Overview]
The dialogue system reinforcement apparatus according to the first embodiment cooperates with an existing dialogue system so that the existing dialogue system performs a dialogue specialized for news. Specifically, when the user's utterance is a topic used in news, this dialog system reinforcement device gives information about the news and simultaneously outputs a response sentence related to the news. In order to realize this, first, a topic word is extracted from the inputted utterance sentence. If the word is contained in a database containing previously prepared news information or the like (hereinafter simply referred to as “database”), relevant information is acquired from the database. For example, if the input includes the topic of “terrorism”, the news title “French Finance Minister: Monitoring of Terrorist Funds, Confirming Cooperation with Japan” can be acquired. This sentence is regarded as a user's utterance sentence and is input to an existing dialogue system to obtain a response sentence. Using the acquired response sentence and the sentence obtained from the database, for example, there seems to be a news saying “French Finance Minister: Monitoring of Terrorist Funds, Confirming Cooperation with Japan”. The investigative authorities seem to stop the indiscriminate terrorism that occurred in London and Madrid. " As a result, the existing dialog system can be used as a dialog system specialized in news.

［構成］
《全体構成》
図１を参照して、第１の実施の形態に係る対話システム補強装置５０は、既存の対話システム５２と協調動作し、対話システム５２をニュースに特化した対話システムとして機能させる。対話システム補強装置５０は、ニュースＤＢ６６と、対話履歴ＤＢ６８とを含む。ニュースＤＢ６６の構成については後述するが、単語を検索キーとして、その単語を含むニュースタイトルが取り出せるように予め準備したデータベースである。ニュースＤＢ６６にはニュースタイトルが多数格納されている。 [Constitution]
"overall structure"
With reference to FIG. 1, the dialogue system reinforcement device 50 according to the first embodiment operates in cooperation with an existing dialogue system 52 and causes the dialogue system 52 to function as a dialogue system specialized in news. The dialogue system reinforcement device 50 includes a news DB 66 and a dialogue history DB 68. Although the structure of the news DB 66 will be described later, it is a database prepared in advance so that a news title including the word can be taken out using the word as a search key. The news DB 66 stores a large number of news titles.

対話システム補強装置５０はさらに、自然言語文からなる入力５４を受けて、入力５４からトピックを抽出するためのトピック抽出部６０と、トピック抽出部６０が抽出したトピックと一致する単語を持つニュースタイトルをニュースＤＢ６６から取得するためのニュースタイトル取得部６２と、ニュースタイトル取得部６２によるニュースタイトルの取得結果に基づいて、入力５４に対して対話システム補強装置５０の機能を用いて応答するか、対話システム５２に応答させるかを示す選択信号７６を出力するための応答可否判定部６４とを含む。本実施の形態では、選択信号７６は対話システム補強装置５０の機能を用いて応答すべきときには第１の値をとり、そうでないときには第２の値をとるものとする。応答可否判定部６４は、対話システム補強装置５０の機能を用いて応答すべきときには、ニュースタイトル取得部６２により選択されたニュースタイトル７８を出力する。 The dialogue system reinforcement device 50 further receives a topic 54 that is a natural language sentence, extracts a topic from the input 54, and a news title having a word that matches the topic extracted by the topic extraction unit 60. Is received from the news DB 66 and based on the news title acquisition result by the news title acquisition unit 62, the input 54 is responded by using the function of the dialog system reinforcement device 50, or the dialog A response availability determination unit 64 for outputting a selection signal 76 indicating whether the system 52 is to respond. In the present embodiment, the selection signal 76 takes a first value when it should respond using the function of the interactive system reinforcement device 50, and takes a second value otherwise. The response availability determination unit 64 outputs the news title 78 selected by the news title acquisition unit 62 when it should respond using the function of the dialog system reinforcement device 50.

なお、トピックとは、入力５４に含まれる名詞のことを言い、会話中に頻繁に出現するような名詞を除いた、トピックを特定するのにふさわしい名詞のことを言う。１つの入力５４から複数のトピックを抽出してもよい。その場合には、ニュースＤＢ６６からそれらトピックの各々をキーとしてニュースタイトルを取得し、それらの和集合を使用する。またニュースタイトルを取得する代わりに、ニュース本文を取得し、それらの和集合を使用しても良い。 The topic refers to a noun included in the input 54, and refers to a noun suitable for identifying a topic, excluding nouns that frequently appear during conversation. A plurality of topics may be extracted from one input 54. In that case, a news title is acquired from the news DB 66 using each of the topics as a key, and the union of these is used. Further, instead of acquiring the news title, the news text may be acquired and the union of them may be used.

対話システム補強装置５０はさらに、選択信号７６に応答して、選択信号７６が第１の値のときには応答可否判定部６４が出力するニュースタイトル７８を選択し、第２の値のときには入力５４を選択して対話システム５２に選択後の入力８０として入力するための選択部７０と、選択信号７６が第１の値のときには対話システム５２が出力する応答８２と応答可否判定部６４が出力するニュースタイトル７８とを用いて応答８４を生成し出力するための応答生成部７２と、選択信号７６が第１の値のときには応答８４を、選択信号７６が第２の値のときには対話システム５２の出力する応答８２を、それぞれ選択してこの対話システムの応答５６として出力するための選択部７４とを含む。 Further, in response to the selection signal 76, the dialog system reinforcement device 50 selects the news title 78 output by the response availability determination unit 64 when the selection signal 76 is the first value, and inputs the input 54 when the selection signal 76 is the second value. A selection unit 70 to select and input as an input 80 after selection to the dialog system 52, a response 82 output from the dialog system 52 and a news output from the response availability determination unit 64 when the selection signal 76 is the first value. The response generator 72 for generating and outputting a response 84 using the title 78, and the response 84 when the selection signal 76 is the first value, and the output of the dialogue system 52 when the selection signal 76 is the second value. And a selection unit 74 for selecting and outputting the responses 82 as responses 56 of the dialog system.

《ニュースＤＢ６６》
図２を参照して、ニュースＤＢ６６は、複数のレコードを含む。各レコードは、キーと、そのキーを検索のためのキーとして取出可能なニュースタイトルとを含む。各レコードのキーは、対応するニュースタイトル中の単語（トピック）からなる。１つのレコードは複数のキーを持つことが可能であり、何れかのキーが検索のためのキーワードと一致すればそのレコードをニュースＤＢ６６から取出すことができる。 <News DB 66>
Referring to FIG. 2, news DB 66 includes a plurality of records. Each record includes a key and a news title that can be extracted using the key as a search key. The key of each record consists of a word (topic) in the corresponding news title. One record can have a plurality of keys, and if any key matches a keyword for search, the record can be taken out from the news DB 66.

《プログラム構成》
コンピュータをこの対話システム補強装置５０として機能させるためのプログラムは以下のような制御構造を持つ。図３を参照して、このプログラムは、入力５４からトピックである単語を抽出するステップ９０と、ステップ９０で抽出したトピックを検索キーとしてニュースＤＢ６６を検索することにより、そのトピックをキーに持つレコードのニュースタイトルをニュースＤＢ６６から取得するステップ９２と、ステップ９２において取得されたニュースタイトルの各々に対して以下の処理９６を実行するステップ９４とを含む。なお、ステップ９２において１つもレコードが見つからなかった場合には、処理９６は１回も実行されず制御はステップ９４の次（後述するステップ９８）に進む。 <Program structure>
A program for causing a computer to function as the interactive system reinforcement device 50 has the following control structure. Referring to FIG. 3, this program extracts a word as a topic from input 54, and searches news DB 66 using the topic extracted in step 90 as a search key, and records having the topic as a key. Are acquired from the news DB 66, and a step 94 for executing the following processing 96 on each of the news titles acquired in step 92 is included. If no record is found in step 92, the process 96 is not executed once, and the control proceeds to the step after step 94 (step 98 described later).

処理９６は、対話履歴ＤＢ６８を参照することにより、処理対象のニュースタイトルが最近の履歴にあるか否かを判定し、結果に応じて制御の流れを分岐させるステップ１３０と、ステップ１３０の判定が否定であるときに、処理対象のニュースタイトルのニュースらしさを示すスコアを算出するステップ１３２と、ステップ１３０の判定が肯定であるときに、処理対象のニュースタイトルのニュースらしさスコアを０とするステップ１３４とを含む。ステップ１３２又はステップ１３４が終了すると処理対象のニュースタイトルに対する処理９６が終了する。なお、ステップ９２で取得されたニュースタイトルがない場合には、前述の通り処理９６は１回も実行されず、したがってスコアがついたニュースタイトルは得られない。 The process 96 refers to the dialog history DB 68 to determine whether or not the news title to be processed is in the recent history, and the determination of step 130 and branching of the control flow according to the result is performed. When the determination is negative, step 132 calculates a score indicating the news likeness of the news title to be processed, and when the determination at step 130 is affirmative, step 134 sets the news likelihood score of the processing target news title to zero. Including. When step 132 or step 134 ends, the process 96 for the news title to be processed ends. If there is no news title acquired in step 92, the process 96 is not executed once as described above, and therefore a news title with a score cannot be obtained.

このプログラムはさらに、ステップ９４の結果、ニュースらしさスコア＞０であるニュースレコードがあるか否かを判定し、結果にしたがって判定結果を示す選択信号７６を出力し、制御の流れを分岐させるステップ９８を含む。判定結果が肯定であれば選択信号７６は第１の値となり、そうでなければ選択信号７６は第２の値となる。このプログラムはさらに、ステップ９８の判定が肯定であるときに、スコア最大のニュースタイトルを選択して図１に示すニュースタイトル７８として選択部７０に出力するステップ１００と、ステップ９８の判定結果が肯定であるときに、対話システム５２が出力する応答８２と、ニュースタイトル７８とを組合せることにより応答８４を生成し選択部７４に出力するステップ１０２と、ステップ９８の判定が否定であるときに、入力５４を選択して対話システム５２に与え、その出力である応答８２を得るステップ１０４とを含む。 This program further determines whether or not there is a news record with a news likelihood score> 0 as a result of step 94, outputs a selection signal 76 indicating the determination result according to the result, and branches the flow of control 98. including. If the determination result is affirmative, the selection signal 76 has a first value, and if not, the selection signal 76 has a second value. In addition, when the determination in step 98 is affirmative, the program selects a news title with the highest score and outputs it to the selection unit 70 as the news title 78 shown in FIG. 1, and the determination result in step 98 is affirmative. When the response 82 output from the dialogue system 52 and the news title 78 are combined to generate a response 84 and output it to the selection unit 74, and when the determination at step 98 is negative, Selecting and providing the input 54 to the dialog system 52 to obtain its output response 82.

このプログラムはさらに、ステップ９８での判定が肯定のときにはステップ１０２で生成された応答を、そうでないときにはステップ１０４で得られた応答を、それぞれ選択して応答５６として出力するステップ１０６と、入力５４、応答５６、及びニュースタイトル７８を対話履歴ＤＢ６８に履歴として登録するステップ１０８とを含む。応答可否判定部６４での判定が否定であるときにはニュースタイトル７８はない。したがってこの場合、対話履歴ＤＢ６８には入力５４、応答５６、及び空の文字列、又はニュースタイトルが選択されなかったことを示すシンボルが履歴として登録される。 The program further selects a response generated in step 102 when the determination in step 98 is affirmative, and otherwise selects the response obtained in step 104 and outputs it as a response 56. , A response 56, and a step 108 of registering the news title 78 in the dialog history DB 68 as a history. When the determination by the response availability determination unit 64 is negative, there is no news title 78. Therefore, in this case, the input history 54, the response 56, and an empty character string or a symbol indicating that no news title has been selected are registered in the dialog history DB 68 as a history.

《ニュースらしさの算出のためのモデル学習》
図３のステップ１３２におけるスコアの算出は、図４に示すニュースらしさ判定モデル１６８を使用して行われる。このニュースらしさ判定モデル１６８として、本実施の形態では、いわゆるディープニューラルネットワークの一種である畳み込みニューラルネットワーク、より具体的には、マルチカラム畳み込みニューラルネットワーク（Multicolumn Convolutional Neural Network：ＭＣＮＮ）を用いる。《Model learning for news-likeness calculation》
The calculation of the score in step 132 of FIG. 3 is performed using the news likelihood determination model 168 shown in FIG. In the present embodiment, a convolutional neural network which is a kind of so-called deep neural network, more specifically, a multicolumn convolutional neural network (MCNN) is used as the news likelihood determination model 168.

〈モデル学習部１６２〉
図４にこのニュースらしさ判定モデル１６８の学習を行うためのモデル学習装置１５０の構成を示す。また図５にはニュースらしさ判定モデル１６８の概略構成を示す。図６にはニュースらしさ判定モデル１６８の１カラムの構成を示す。図７、図８及び図９には、このニュースらしさ判定モデル１６８の各カラムへの入力となる素性の導出方法を説明するための図を示す。 <Model learning unit 162>
FIG. 4 shows the configuration of the model learning device 150 for learning the news likelihood determination model 168. FIG. 5 shows a schematic configuration of the news likelihood determination model 168. FIG. 6 shows the configuration of one column of the news likelihood determination model 168. 7, 8, and 9 are diagrams for explaining a method of deriving a feature that becomes an input to each column of the news likelihood determination model 168.

図４を参照して、モデル学習装置１５０は、ニュース用語辞書１６４及び接続語辞書１６６と、これらを参照して、大規模テキストデータベース１６０から接続語辞書１６６に記憶された接続語のいずれかと、ニュース用語辞書１６４に記憶されたニュース用語のいずれかが連続して現れるテキストを抽出して各テキストにそのテキストが含むニュース用語をラベルとして付して出力し、あわせて、接続語とニュース用語とが連続したものが何も含まれないテキストをラベルを付さずに抽出するテキスト抽出・分類部１９０と、テキスト抽出・分類部１９０から出力されるラベル付きのテキスト及びラベルなしのテキストをニュースらしさ判定モデル１６８の学習用として記憶する学習用テキスト記憶部１９２と、学習用テキスト記憶部１９２に記憶された学習用テキストを用いてニュースらしさ判定モデル１６８の学習を行う学習処理部１９４とを含む。テキストにラベルが付いている場合にはそのテキストはニュースタイトルであり、ラベルが付されていない場合にはそのテキストはニュースタイトルではない。学習用テキスト記憶部１９２に記憶される学習用テキストでは、これらはそれぞれニュースタイトルを含むテキストの正例及び負例としてのラベルが付される。例えばこのラベルは、正例は（１，０）であり、負例が（０，１）である。学習処理部１９４によるニュースらしさ判定モデル１６８の学習は、モデルがマルチカラム畳み込みニューラルネットワークである点、及び素性として何を使用するかという点を除き、通常のニューラルネットワークと同様、誤差逆伝搬法により行われる。 Referring to FIG. 4, model learning apparatus 150 refers to news term dictionary 164 and connected word dictionary 166, and referring to these, any of connected words stored in connected word dictionary 166 from large-scale text database 160, A text in which any of the news terms stored in the news term dictionary 164 continuously appears is extracted and a news term included in the text is attached to each text as a label. The text extraction / classification unit 190 that extracts text without any continuous text without labeling, and the labeled text and the unlabeled text output from the text extraction / classification unit 190 are like news. A learning text storage unit 192 that stores the determination model 168 for learning, and a learning text storage unit 19 Using the stored learning text to include a learning processing unit 194 to perform the learning of the news likeness decision model 168. If the text is labeled, the text is a news title; if the text is not labeled, the text is not a news title. In the learning texts stored in the learning text storage unit 192, these are respectively labeled as positive examples and negative examples of texts including news titles. For example, this label has a positive example of (1, 0) and a negative example of (0, 1). Learning of the news likeness determination model 168 by the learning processing unit 194 is performed by the error back propagation method as in a normal neural network except that the model is a multi-column convolutional neural network and what is used as a feature. Done.

図４を参照して、図３に示すステップ１３２は、図３のステップ９２で取出されたニュースタイトルを記憶するためのニュースタイトル記憶部１７０と、ニュースらしさ判定モデル１６８を使用して各ニュースタイトルのニュースらしさスコアを算出しスコア付ニュースタイトル１７４を出力するためのニュースタイトルスコアリング部１７２とを用いて行われる。 Referring to FIG. 4, step 132 shown in FIG. 3 uses a news title storage unit 170 for storing the news title extracted in step 92 of FIG. The news title scoring unit 172 for calculating the news-likeness score and outputting the scored news title 174 is performed.

本実施の形態では、テキスト抽出・分類部１９０は、プログラムにより実現される。このプログラムは、接続語辞書１６６に記憶された接続語のいずれかの後にニュース用語辞書１６４に記憶されたニュース用語のいずれかが現れる単語列を含むテキストを大規模テキストデータベース１６０において検索するための検索式を生成し、この検索式を用いて大規模テキストデータベース１６０から条件を満たすテキストを全て抽出するステップと、抽出されたテキストの各々に対し、そのテキストの内、接続語の前までを抽出し、検索されたニュース用語をラベルとして付したテキストを学習データとして学習用テキスト記憶部１９２に格納するステップとを含む。 In the present embodiment, the text extraction / classification unit 190 is realized by a program. This program searches the large text database 160 for text that includes a word string in which any of the news terms stored in the news term dictionary 164 appears after any of the connected words stored in the connection word dictionary 166. Generate a search expression, extract all the text that satisfies the condition from the large-scale text database 160 using this search expression, and for each extracted text, extract the text before the connected word And storing the text with the searched news term as a label in the learning text storage unit 192 as learning data.

図５を参照して、ニュースらしさ判定モデル１６８を構成するＭＣＮＮ２２０は、第１〜第４の畳み込みニューラルネットワーク群２６０、２６２、２６４、２６６からなるニューラルネットワーク層２３０と、ニューラルネットワーク層２３０内の各ニューラルネットワークの出力を線形に連結する連結層２３２と、連結層２３２の出力するベクトルに対してSoftmax関数を適用して、ニュースタイトル候補が真のニュースタイトルか否かを０〜１の間のスコアで評価し出力するSoftmax層２３４とを含む。 Referring to FIG. 5, MCNN 220 constituting news likelihood determination model 168 includes a neural network layer 230 including first to fourth convolutional neural network groups 260, 262, 264, and 266, and each of the neural network layers 230. A connection layer 232 that linearly connects the outputs of the neural network, and a softmax function applied to a vector output from the connection layer 232, a score between 0 and 1 indicating whether the news title candidate is a true news title. And a Softmax layer 234 that is evaluated and output at

第１の畳み込みニューラルネットワーク群２６０は、後述する第１の単語ベクトル列を受ける第１カラムのサブネットワークを含む。第２の畳み込みニューラルネットワーク群２６２は、後述する３つの第２の単語ベクトル列をそれぞれ受ける第２、第３及び第４カラムのサブネットワークを含む。第３の畳み込みニューラルネットワーク群２６４は、後述する４つの第３の単語ベクトル列をそれぞれ受ける第５、第６、第７、及び第８カラムのサブネットワークを含む。第４の畳み込みニューラルネットワーク群２６６は、後述する３つの第４の単語ベクトル列を受ける第９、第１０、及び第１１カラムのサブネットワークを含む。これらサブネットワークは、いずれも畳み込みニューラルネットワークである。 The first convolutional neural network group 260 includes a first column sub-network that receives a first word vector sequence to be described later. The second convolutional neural network group 262 includes second, third, and fourth column sub-networks that respectively receive three second word vector sequences to be described later. The third convolutional neural network group 264 includes fifth, sixth, seventh, and eighth column sub-networks that respectively receive four third word vector sequences described later. The fourth convolutional neural network group 266 includes ninth, tenth, and eleventh column sub-networks that receive three fourth word vector sequences to be described later. These sub-networks are all convolutional neural networks.

ニューラルネットワーク層２３０の各畳み込みニューラルネットワークの出力は連結層２３２で単純に線形に連結され、Softmax層２３４への入力ベクトルとなる。 The output of each convolutional neural network in the neural network layer 230 is simply linearly connected in the connection layer 232 and becomes an input vector to the Softmax layer 234.

ＭＣＮＮ２２０についてその機能をより詳細に説明する。図６に、代表として第３の畳み込みニューラルネットワーク群２６４の１カラムを構成するサブネットワーク２８０の構成を示す。第３の畳み込みニューラルネットワーク群２６４内の他のカラム及び他の畳み込みニューラルネットワーク群２６０、２６２、及び２６６に含まれる各カラムも、図６に示すものと同じ構造を持つ。ここでは、説明を分かりやすくするために、サブネットワーク２８０が、入力層２９０、畳み込み層２９２、及びプーリング層２９４のみからなっているものとするが、この３つの層を複数個備えているものでもよい。 The function of the MCNN 220 will be described in detail. FIG. 6 shows the configuration of the sub-network 280 that constitutes one column of the third convolutional neural network group 264 as a representative. The other columns in the third convolutional neural network group 264 and the columns included in the other convolutional neural network groups 260, 262, and 266 have the same structure as that shown in FIG. Here, in order to make the explanation easy to understand, it is assumed that the sub-network 280 includes only the input layer 290, the convolution layer 292, and the pooling layer 294. However, the sub-network 280 may include a plurality of these three layers. Good.

入力層２９０には、学習データの内、ニュースタイトルから得られた素性の各単語を単語ベクトルに変換して得られた単語ベクトル列Ｘ_１、Ｘ_２、…、Ｘ_|ｔ|が入力される。この単語ベクトル列Ｘ_１、Ｘ_２、…、Ｘ_|ｔ|は、行列Ｔ＝［Ｘ_１、Ｘ_２、…、Ｘ_|ｔ|］^Ｔとして表される。この行列Ｔに対して、Ｍ個の素性マップが適用される。素性マップはベクトルであって、各素性マップの要素であるベクトルＯは連続する単語ベクトルからなるＮグラムに対してｆ_j（１≦ｊ≦Ｍ）で示されるフィルタを適用しながらＮグラム３１０を移動させることにより計算される。Ｎは任意の自然数だが、本実施の形態ではＮ＝３とする。すなわちＯは次の式により表される。 In the input layer 290, word vector strings X ₁ , X ₂ ,..., X _{| t |} obtained by converting each word of the feature obtained from the news title into a word vector in the learning data are input. . The word vector sequence _{_{X 1, X 2, ...,}} X | t | , the matrix _{_{T = [X 1, X 2}} , ..., X | t |] expressed as ^T. M feature maps are applied to this matrix T. The feature map is a vector, and the vector O, which is an element of each feature map, applies an N-gram 310 while applying a filter indicated by f _j (1 ≦ j ≦ M) to an N-gram consisting of consecutive word vectors. Calculated by moving. N is an arbitrary natural number, but N = 3 in this embodiment. That is, O is represented by the following formula.

なお、素性マップの全体にわたりＮを等しくしてもよいし、異なるものがあってもよい。Ｎとしては、２、３、４及び５程度が適当である。本実施の形態では、重み行列は各カラムセット内の畳み込みニューラルネットワークにおいて等しくしてある。これらは互いに異なっていても良いが、実際、このように等しくした方が、各重み行列を独立に学習する場合より精度が高くなる。なお、畳み込みニューラルネットワークで使用するフィルタとしては、どのようなものを用いることもできる。画像処理で用いるフィルタを流用してもよい。

Note that N may be equal throughout the entire feature map, or there may be different ones. As N, 2, 3, 4 and 5 are appropriate. In the present embodiment, the weight matrix is equal in the convolutional neural network in each column set. These may be different from each other, but in fact, the equality is more accurate than the case where each weight matrix is learned independently. Any filter can be used as a filter used in the convolutional neural network. A filter used in image processing may be used.

この素性マップの各々について、次のプーリング層２９４は、いわゆるマックスプーリングを行う。すなわち、プーリング層２９４は、例えば素性マップｆ_Ｍの要素の内、最大の要素３２０を選択し、要素３３０として取出す。これを素性マップの各々に対して行うことによって、要素３３０、…、３３２を取出し、これらをｆ_１からｆ_Ｍの順番に連接して連結層２３２にベクトル３４２として出力する。各畳み込みニューラルネットワークからはこのようにして得られたベクトル３４０、…、３４２、…、３４４が連結層２３２に出力される。連結層２３２は、ベクトル３４０、…、３４２、…、３４４を単純に線形に連結してSoftmax層２３４に与える。なお、プーリング層２９４としてはマックスプーリングを行うものの方が平均値を採用するものよりも精度が高いと言われている。しかし、もちろん平均値を採用するようにしてもよいし、下位の層の性質をよく表現するものであれば、他の代表値を用いるようにしてもよい。 For each feature map, the next pooling layer 294 performs so-called max pooling. That is, the pooling layer 294 selects, for example, the largest element 320 among the elements of the feature map f _M and takes it out as the element 330. By performing this for each of the feature maps, the elements 330,..., 332 are taken out and connected in the order of f ₁ to f _M to be output as a vector 342 to the connection layer 232. Each of the convolutional neural networks outputs the vectors 340,..., 342,. The connection layer 232 simply connects the vectors 340,..., 342,. As the pooling layer 294, it is said that the one that performs max pooling is more accurate than the one that employs an average value. However, of course, an average value may be adopted, and other representative values may be used as long as the properties of lower layers are well expressed.

〈学習のための単語ベクトル列〉
学習データは、上記した１１種類の単語ベクトル列と、その単語ベクトル列が得られたテキストが正例か負例かを示すラベルとからなる。 <Word vector sequence for learning>
The learning data includes the above 11 types of word vector strings and labels indicating whether the text from which the word vector strings are obtained is a positive example or a negative example.

ＭＣＮＮへの入力となる単語列は以下のとおりである。ただしこの表で、ＴＮＰはニュースタイトル候補に含まれる名詞を含む文節を表し、ＰＲＥＤはニュースタイトル候補に含まれる述語テンプレートの述語を含む文節を表す。述語テンプレートとは、名詞と結び付いてフレーズを構成する言語単位のことをいう。具体例は後述する。なお、負例の場合には、ニュース用単語に相当するものに代えて例えば文の主語を使用して単語ベクトル列を生成する。 The word strings to be input to MCNN are as follows. In this table, TNP represents a clause including a noun included in a news title candidate, and PRED represents a clause including a predicate of a predicate template included in the news title candidate. A predicate template refers to a language unit that is combined with a noun and constitutes a phrase. Specific examples will be described later. In the case of a negative example, a word vector string is generated using, for example, the subject of a sentence instead of the word corresponding to the news word.

これら単語列はいずれも、文全体から抽出される。なお、本実施の形態では、ニュースタイトル候補のテキストは所定の複数文という場合もある。したがってＴＮＰとＰＲＥＤが別文に存在することもあり得る。本実施の形態では、そうした場合でも上記した単語列が得られるようにするために、これら複数文の全ての係り受け木のルートを仮想的なルートノードで互いに結合し、１つの係り受け木を用いて上記単語列を得るようにした。

All of these word strings are extracted from the entire sentence. In this embodiment, the news title candidate text may be a predetermined plurality of sentences. Therefore, TNP and PRED may exist in different sentences. In this embodiment, in order to obtain the above-described word string even in such a case, the roots of all the dependency trees of the plurality of sentences are connected to each other by a virtual root node, and one dependency tree is obtained. Used to obtain the above word string.

図７を参照して、第１の単語列は、ニュースタイトル候補３６０から抽出されたニュース用語の名詞を含む文節３７０と述語テンプレートの述語３７２との対からなる。本実施の形態では、単語の出現順序を保存し、かつ演算量を少なくするために以下の全ての単語ベクトルとして単語埋め込みベクトルを使用する。 Referring to FIG. 7, the first word string includes a pair of a clause 370 including a noun of a news term extracted from a news title candidate 360 and a predicate 372 of a predicate template. In the present embodiment, word embedding vectors are used as all of the following word vectors in order to preserve the appearance order of words and reduce the amount of calculation.

図７を参照して、第２の種類の単語ベクトル列であるカラム番号２−４の単語ベクトル列は、ニュースタイトル候補３６０中での単語列の出現順序に基づき、文頭からニュースタイトル候補の名詞を含む文節３７０の直前までの単語列３８０、文節３７０と述語３７２の間の単語列３８２、及び述語３７２の後の単語から文末までの単語列３８４を含む。したがって、第２の単語ベクトル列は３つの単語埋め込みベクトル列として得られる。 Referring to FIG. 7, the word vector sequence of column numbers 2-4, which is the second type of word vector sequence, is based on the order of appearance of the word sequence in the news title candidate 360 from the beginning of the sentence to the news title candidate noun. Includes a word string 380 immediately before the clause 370 including, a word string 382 between the clause 370 and the predicate 372, and a word string 384 from the word after the predicate 372 to the end of the sentence. Therefore, the second word vector sequence is obtained as three word embedded vector sequences.

図８を参照して、第３の単語ベクトル列であるカラム５−８の単語ベクトル列は、ニュースタイトル候補３６０の係り受け木に基づき、ニュースタイトル候補の名詞を含む文節３７０に係る部分木４２０、述語３７２の係り先の部分木４１０、文節３７０と述語３７２の間の係り受けパス４２２、及びその他４２４からそれぞれ得られる単語列を含む。したがってこの例では第３の単語ベクトル列は４つの単語埋め込みベクトル列として得られる。 Referring to FIG. 8, the word vector sequence in column 5-8, which is the third word vector sequence, is based on the dependency tree of news title candidate 360, and subtree 420 related to clause 370 including the noun of the news title candidate. , A word tree obtained from the subtree 410 of the destination of the predicate 372, the dependency path 422 between the clause 370 and the predicate 372, and the other 424. Therefore, in this example, the third word vector sequence is obtained as four word embedded vector sequences.

図９を参照して、第４の単語ベクトル列であるカラム９−１１の単語ベクトル列は、ニュースタイトル候補３６０において、述語３７２を含む文節内の単語列と、述語３７２の前の単語列４４０と、後の単語列４４２とを含む。したがってこの場合、第４の単語ベクトル列は３つの単語埋め込みベクトル列として得られる。 Referring to FIG. 9, the word vector sequence in column 9-11, which is the fourth word vector sequence, is the word sequence in the phrase including predicate 372 and word sequence 440 before predicate 372 in news title candidate 360. And a subsequent word string 442. Therefore, in this case, the fourth word vector sequence is obtained as three word embedded vector sequences.

学習時には、ＭＣＮＮ２２０の各カラムにはそれぞれの各カラムへの入力ベクトルが与えられ、ＭＣＮＮ２２０の出力がそのテキストのラベルと比較され、通常の誤差逆伝播方式によって、誤差関数の値が小さくなる方向にＭＣＮＮ２２０を構成する各ウェイトとバイアスの値が修正される。 At the time of learning, each column of MCNN 220 is given an input vector to each column, the output of MCNN 220 is compared with the label of the text, and the value of the error function is decreased in the normal error back propagation method. Each weight and bias value constituting the MCNN 220 is corrected.

なお、本実施の形態では、ＭＣＮＮ２２０の出力は２つである。出力はSoftMax関数により実装されているため、第１の出力は、ニュースタイトル候補がニュースタイトルである確率を表し、第２の出力はニュースタイトル候補がニュースタイトルでない確率を表す。本実施の形態では、ニュースタイトルである確率を各ニュースタイトルのスコアとして用いる。 In the present embodiment, the MCNN 220 has two outputs. Since the output is implemented by the SoftMax function, the first output represents the probability that the news title candidate is a news title, and the second output represents the probability that the news title candidate is not a news title. In the present embodiment, the probability of being a news title is used as the score of each news title.

図３に示すステップ１３２、すなわち図４に示すニュースタイトルスコアリング部１７２は、コンピュータと協働するプログラムにより実現される。図１０を参照して、このプログラムは、ニュースタイトル記憶部１７０に記憶された各ニュースタイトルに対して以下の処理４６２を実行するステップ４６０を含む。 Step 132 shown in FIG. 3, that is, the news title scoring unit 172 shown in FIG. 4, is realized by a program that cooperates with the computer. Referring to FIG. 10, the program includes a step 460 for executing the following processing 462 for each news title stored in news title storage unit 170.

処理４６２は、ニュースらしさ判定モデル１６８への入力パラメータ（第１〜第４の単語ベクトル列）を生成するステップ４７０、これらパラメータをニュースらしさ判定モデル１６８に入力するステップ４７２、及びニュースらしさ判定モデル１６８の出力するラベル（ニュースタイトルである確率）をニュースタイトルにスコアとして付すステップ４７４を含む。 The process 462 includes a step 470 of generating input parameters (first to fourth word vector sequences) to the news likelihood determination model 168, a step 472 of inputting these parameters to the news likelihood determination model 168, and a news likelihood determination model 168. Step 474 of attaching a label (probability of being a news title) to the news title as a score.

《応答生成部７２》
図１に示す応答生成部７２もコンピュータとの協働によりコンピュータを応答生成部７２として機能させるプログラムにより実現される。図１１を参照して、このプログラムは、予め準備されている応答のための応答フォームのいずれかをランダムに選択するステップ５００を含む。この応答フォームは、例えば「［第１スロット］というニュースがありますよ。［第２スロット］」のようなものである。第１スロットにはニュースタイトルが代入され、第２スロットには対話システム５２の出力が代入される。 << Response generator 72 >>
The response generation unit 72 shown in FIG. 1 is also realized by a program that causes the computer to function as the response generation unit 72 in cooperation with the computer. Referring to FIG. 11, the program includes a step 500 of randomly selecting one of response forms for responses prepared in advance. This response form is, for example, “There is news [First slot]. [Second slot]”. The news title is assigned to the first slot, and the output of the dialogue system 52 is assigned to the second slot.

このプログラムはさらに、応答フォームの第１スロットにニュースタイトルを代入するステップ５０２、及び応答フォームの第２スロットに対話システム５２の出力した応答を代入するステップ５０４を含む。 The program further includes a step 502 for substituting the news title in the first slot of the response form, and a step 504 for substituting the response output by the dialog system 52 in the second slot of the response form.

［動作］
第１の実施の形態に係る対話システム補強装置５０は以下の様に動作する。対話システム補強装置５０は、入力５４のトピックに適合するニュースタイトルがニュースＤＢ６６に格納されているとき（適合時）とそうでないとき（非適合時）とで異なる。以下、最初に適合時、次に非適合時の対話システム補強装置５０の動作を説明する。なお、この動作に先立ち、図４に示すニュースらしさ判定モデル１６８の学習は完了しているものとする。 [Operation]
The dialogue system reinforcement device 50 according to the first embodiment operates as follows. The dialog system reinforcement device 50 differs depending on whether or not a news title that matches the topic of the input 54 is stored in the news DB 66 (when it conforms) and when it does not (when it does not conform). Hereinafter, the operation of the interactive system reinforcement device 50 when first adapted and then non-conforming will be described. Prior to this operation, it is assumed that learning of the news likelihood determination model 168 shown in FIG. 4 has been completed.

《適合時の対話システム補強装置５０の動作》
図１を参照して、適合時には対話システム補強装置５０は以下のように動作する。入力５４が入力されると、トピック抽出部６０は入力５４からトピックを抽出しニュースタイトル取得部６２に与える（図３（以下同様）のステップ９０）。ニュースタイトル取得部６２は、トピック抽出部６０が抽出したトピックと一致する単語を持つニュースタイトルをニュースＤＢ６６から取得し応答可否判定部６４に与える（ステップ９２）。応答可否判定部６４は、ニュースタイトル取得部６２によるニュースタイトルの取得結果に基づいて、入力５４に対して対話システム補強装置５０の機能を用いて応答するか、対話システム５２に応答させるかを示す選択信号７６を出力する。 << Operation of Dialog System Reinforcing Device 50 When Adapted >>
Referring to FIG. 1, the dialog system reinforcing device 50 operates as follows at the time of adaptation. When the input 54 is input, the topic extraction unit 60 extracts the topic from the input 54 and gives it to the news title acquisition unit 62 (step 90 in FIG. 3 (the same applies hereinafter)). The news title acquisition unit 62 acquires a news title having a word that matches the topic extracted by the topic extraction unit 60 from the news DB 66 and gives it to the response availability determination unit 64 (step 92). The response availability determination unit 64 indicates whether to respond to the input 54 using the function of the dialog system reinforcement device 50 or to respond to the dialog system 52 based on the news title acquisition result by the news title acquisition unit 62. A selection signal 76 is output.

応答可否判定部６４はこのとき、以下のように動作する。図３を参照して、応答可否判定部６４は、ニュースタイトル取得部６２から与えられたニュースタイトルの各々について、そのニュースタイトルを含む履歴が対話履歴ＤＢ６８にあるか否かを判定する（ステップ１３０）。もしあれば（ステップ１３０でＹＥＳ）そのニュースタイトルのニュースらしさスコアを０とする（ステップ１３４）。もしなければ（ステップ１３０でＮＯ）、そのニュースタイトルのニュースらしさスコアを図４に示すニュースらしさ判定モデル１６８及びニュースタイトルスコアリング部１７２を用いて算出し、そのニュースタイトルに付して出力する。 At this time, the response availability determination unit 64 operates as follows. Referring to FIG. 3, response availability determination unit 64 determines whether or not a history including the news title exists in dialog history DB 68 for each news title given from news title acquisition unit 62 (step 130). ). If there is any (YES in step 130), the news-likeness score of the news title is set to 0 (step 134). If not (NO in step 130), the news-likeness score of the news title is calculated using the news-likeness determination model 168 and the news title scoring unit 172 shown in FIG. 4, and attached to the news title and output.

再び図３を参照して、スコア＞０のニュースタイトルがあれば（ステップ９８でＹＥＳ）、スコア最大のニュースタイトルを対話システム５２に入力する（ステップ１００）。適合時にはステップ９８の判定はＹＥＳとなり、ステップ１００が実行される。このとき、図１に示す選択信号７６は第１の値となる。選択部７０は入力５４ではなくニュースタイトル７８を選択して選択後の入力８０として対話システム５２に与える。対話システム５２は選択後の入力８０に応答して応答８２を出力する。 Referring to FIG. 3 again, if there is a news title with a score> 0 (YES in step 98), the news title with the highest score is input to the dialog system 52 (step 100). At the time of adaptation, the determination in step 98 is YES, and step 100 is executed. At this time, the selection signal 76 shown in FIG. 1 becomes the first value. The selection unit 70 selects the news title 78 instead of the input 54 and gives it to the dialogue system 52 as the input 80 after selection. The dialogue system 52 outputs a response 82 in response to the selected input 80.

図３のステップ１０２において、応答生成部７２はランダムに選択した応答フォームの第１スロットにニュースタイトル７８を、第２スロットに応答８２を、それぞれ代入して応答８４を生成し選択部７４に与える。選択部７４は選択信号７６の値が第１の値なので応答８２ではなく応答８４を選択し、応答５６として出力する。 In step 102 of FIG. 3, the response generation unit 72 generates a response 84 by assigning the news title 78 to the first slot and the response 82 to the second slot of the randomly selected response form, and gives the response 84 to the selection unit 74. . Since the value of the selection signal 76 is the first value, the selection unit 74 selects the response 84 instead of the response 82 and outputs it as the response 56.

すなわち適合時には、入力５４に対する対話システム５２の応答が出力されるのではない。そうではなく、入力５４に含まれるトピックに適合するニュースタイトルがニュースＤＢ６６から選択され、そのニュースタイトル７８が入力５４の代わりに対話システム５２に与えられる。このときの対話システム５２の応答が応答生成部７２に与えられ、ニュースタイトル７８と組合されて応答８４が生成され、出力される。 That is, at the time of adaptation, the response of the dialogue system 52 to the input 54 is not output. Rather, a news title that matches the topic contained in the input 54 is selected from the news DB 66 and that news title 78 is provided to the dialog system 52 instead of the input 54. The response of the dialog system 52 at this time is given to the response generation unit 72, and the response 84 is generated and output in combination with the news title 78.

例えば入力５４が「テロは怖いね」というものであり、ニュースタイトルに「仏財務相：テロ資金の監視、日本と協力確認」というものである場合を想定する。また、ランダムに選択された応答フォームが「［第１スロット］っていうニュースがあるみたい。［第２スロット］」というものであり、このときの対話システム５２からの応答８２が「テロ組織に対する監視を一層強化し、捜査当局は、ロンドンやマドリードで起きたような無差別テロを食い止めるらしい。」というものである場合を想定する。すると、最終的に得られる応答は「『仏財務相：テロ資金の監視、日本と協力確認』っていうニュースがあるみたい。テロ組織に対する監視を一層強化し、捜査当局は、ロンドンやマドリードで起きたような無差別テロを食い止めるらしい。」となる。 For example, assume that the input 54 is “I'm afraid of terrorism” and the news title is “French finance minister: monitoring terrorist funds, confirming cooperation with Japan”. In addition, a randomly selected response form is “[First slot] seems to be news. [Second slot]”, and the response 82 from the dialogue system 52 at this time is “monitoring terrorist organizations”. The investigation authorities seem to stop the indiscriminate terrorism that happened in London and Madrid. " Then, the final response is “News from the French Finance Minister: Monitoring of Terrorist Funds, Confirming Cooperation with Japan”. It seems to stop such indiscriminate terrorism. "

《非適合時の対話システム補強装置５０の動作》
非適合時には、対話システム補強装置５０は以下のように動作する。入力５４が入力されてから応答可否判定部６４の動作までは上記適合時と同様である。しかし非適合時には、図３のステップ９８の判定がＮＯとなり、図１に示す選択信号７６は第２の値となる。その結果、選択部７０はニュースタイトル７８ではなく入力５４を選択し、選択後の入力８０として対話システム５２に入力する。選択信号７６が第２の値なので応答生成部７２は何も動作しない。同様に選択信号７６が第２の値なので選択部７４は応答８４ではなく応答８２を選択し、応答５６として出力する。 << Operation of Dialog System Reinforcing Device 50 at Non-conformance >>
At the time of non-conformity, the dialogue system reinforcement device 50 operates as follows. The operation from the input 54 to the operation of the response availability determination unit 64 is the same as that at the time of adaptation. However, at the time of non-conformity, the determination at step 98 in FIG. 3 is NO, and the selection signal 76 shown in FIG. 1 has the second value. As a result, the selection unit 70 selects the input 54 instead of the news title 78 and inputs it to the dialogue system 52 as the input 80 after selection. Since the selection signal 76 is the second value, the response generator 72 does not operate. Similarly, since the selection signal 76 is the second value, the selection unit 74 selects the response 82 instead of the response 84 and outputs it as the response 56.

すなわちこの場合には、仮に関連したニュースタイトルが見つからない場合には、「テロ組織に対する監視を一層強化し、捜査当局は、ロンドンやマドリードで起きたような無差別テロを食い止めるらしい。」という応答が出力されることになる。 In other words, in this case, if a related news title is not found, the response is “strengthening surveillance against terrorist organizations and investigating authorities to stop indiscriminate terrorism that occurred in London and Madrid”. Will be output.

以上のようにこの実施の形態に係る対話システム補強装置５０によれば、既存の対話システムに対話システム補強装置５０を追加することにより、ニュースのような特定の分野での応答に特徴を持つ対話システムを実現できる。ここではニュースについての実施の形態を挙げたが、例えばスポーツ関係のニュース、芸能関係のニュース、科学技術分野のニュース、政治関係のニュース等、さらにニュースの分野を細分化したものにも本実施の形態を適用できる。さらに、ニュースに限らず、特定の地域・時代に関してニュースタイトル的にまとめた電子データが利用可能であれば、そうした情報に特化した対話システムとして既存の対話システムを動作させることもできる。この際、既存の対話システムを改造したり、新たに作ったりする必要はない。 As described above, according to the dialog system reinforcing device 50 according to this embodiment, by adding the dialog system reinforcing device 50 to the existing dialog system, the dialog characterized by the response in a specific field such as news. A system can be realized. Although an embodiment of the news has been described here, for example, sports-related news, entertainment-related news, science and technology field news, political news, etc. Applicable form. Furthermore, as long as electronic data summarized as a news title regarding a specific region / era is available, not only news, an existing dialog system can be operated as a dialog system specialized for such information. At this time, it is not necessary to modify the existing dialog system or make a new one.

＜第２の実施の形態＞
［概要］
本願発明の第２の実施の形態に係る対話システム補強装置は、既存の対話システムと協調動作して、既存の対話システムが教育に特化した対話を行うようにする。具体的には、対話の最中に教育的に重要な単語が出現した際に、それに関する問題を出題し、ユーザに回答させる。問題を出題可能とするため、予め教科書データを用いて質問回答データベースを作成しておく。ユーザの発話から話題（トピック）を抽出する。トピックに含まれる名詞が質問回答データベースにキーとして含まれる場合、その単語に関連する質問文を質問回答データベースから抽出する。ユーザの発話文自体は既存の対話システムへ入力し、応答文を取得する。質問回答データベースから抽出した質問文と、対話システムからの応答文を組合せて応答文を生成し、出力する。質問文を出力した後のユーザの入力に対しては、トピックとなる単語を抽出し、その単語が直前に出題した問題（質問文）の回答かどうかを質問回答データベースで確認し、正解か不正解に応じて文を生成する。また、質問回答データベースから該当の回答文も抽出する。正解か不正解かに応じた文と、質問回答データベースから抽出した回答文を組合せて応答文を生成し、出力する。 <Second Embodiment>
[Overview]
The dialogue system reinforcement apparatus according to the second embodiment of the present invention cooperates with an existing dialogue system so that the existing dialogue system performs a dialogue specialized for education. Specifically, when an educationally important word appears in the middle of a dialogue, a question related to the word is presented and the user is made to answer. In order to be able to ask questions, a question answer database is created in advance using textbook data. A topic is extracted from the user's utterance. When a noun included in a topic is included as a key in the question answer database, a question sentence related to the word is extracted from the question answer database. The user's utterance sentence itself is input to an existing dialogue system to obtain a response sentence. A response sentence is generated by combining the question sentence extracted from the question answer database and the response sentence from the dialogue system, and output. In response to user input after the question text is output, a word that is a topic is extracted, and whether or not the word is the answer to the question (question text) that was presented immediately before is checked. A sentence is generated according to the correct answer. Also, the corresponding answer sentence is extracted from the question answer database. A response sentence is generated and output by combining a sentence according to whether the answer is correct or incorrect and an answer sentence extracted from the question answer database.

［構成］
《全体構成》
図１２を参照して、第２の実施の形態に係る対話システム補強装置５３０は、既存の対話システム５２への入力５４及びこの入力５４に対する対話システム５２の出力（応答）を受けるように接続され、対話システム５２による対話の最中に教育的に重要な単語が出現した際に、それに関する問題をユーザに対して出題し、ユーザに回答させる処理を行う。さらに対話システム補強装置５３０は、この出題の直後にユーザから入力された入力５４が直前の問題に対して正しい回答か否かを判定し、正解か不正解かに応じた文と、質問に対する回答文を組合せて応答文を生成し、出力する。すなわち、第１の実施の形態に係る対話システム補強装置５０と異なり、この第２の実施の形態に係る対話システム補強装置５３０は、ユーザとの２回の対話により、ユーザに対して何らかの教育的な知識を伝えるよう既存の対話システム５２の機能を補強する。 [Constitution]
"overall structure"
Referring to FIG. 12, dialogue system reinforcement apparatus 530 according to the second embodiment is connected to receive input 54 to existing dialogue system 52 and the output (response) of dialogue system 52 to this input 54. When an educationally important word appears during the dialogue by the dialogue system 52, a question related to the word is given to the user, and the user is made to answer. Further, the dialog system reinforcement device 530 determines whether the input 54 input from the user immediately after this question is a correct answer to the immediately preceding question, and a sentence according to whether the answer is correct or incorrect and an answer to the question A response sentence is generated by combining sentences and output. That is, unlike the interactive system reinforcement device 50 according to the first embodiment, the interactive system reinforcement device 530 according to the second embodiment is capable of providing some kind of educational to the user through two dialogues with the user. The function of the existing dialogue system 52 is reinforced so as to convey knowledge.

対話システム補強装置５３０は、入力５４が与えられたことに応答して、入力５４が質問に対する回答か否かを判定し、判定信号５６０を出力する状態判定部５４２を含む。判定信号５６０は入力５４が質問に対する回答であるときには第１の値、そうでないときには第２の値であるものとする。対話システム補強装置５３０はさらに、対話システム５２による対話の履歴を保存している対話履歴ＤＢ５４４と、教科書データ等から作成した、質問とその回答との対を多数記憶する質問回答ＤＢ５４８と、判定信号５６０に応答して、判定信号５６０が第１の値のときには入力５４を入力５６４に、そうでないときには入力５６２に、それぞれ分岐させるための分岐部５４０とを含む。 The dialogue system reinforcement device 530 includes a state determination unit 542 that determines whether the input 54 is an answer to the question in response to the input 54 being given, and outputs a determination signal 560. The determination signal 560 is a first value when the input 54 is an answer to the question, and is a second value otherwise. The dialogue system reinforcement device 530 further includes a dialogue history DB 544 that stores a history of dialogues by the dialogue system 52, a question answer DB 548 that stores a large number of pairs of questions and their answers created from textbook data, etc., and a determination signal In response to 560, a branching unit 540 for branching is included, which includes branching input 54 to input 564 when determination signal 560 is the first value, and branching to input 562 otherwise.

対話システム補強装置５３０はさらに、判定信号５６０の値が第２の値であるときに、対話システム５２が入力５６２に対して出力する応答５６６を受け、対話履歴ＤＢ５４４及び質問回答ＤＢ５４８を用いて、ユーザに対して入力５４に関連した質問を提示することが可能か否かを判定し、可能なときには質問５７０を生成するとともに判定信号５７２（第１の値）を出力し、可能でないときには判定信号（第２の値）を出力するための質問処理部５５０と、判定信号５６０が第１の値のときに分岐部５４０から入力５６４を受けたことに応答して、対話履歴ＤＢ５４４及び質問回答ＤＢ５４８を用いて直前に対話システム補強装置５３０が出力した質問に対する回答５７６を生成し出力するための回答処理部５７４と、対話システム５２からの応答５６６、質問処理部５５０からの応答（質問）５７０、及び回答処理部５７４からの回答５７６を受け、判定信号５６０と判定信号５７２との値の組合せに応じてこれらのいずれかを選択し応答５３２として出力する選択部５５４とを含む。 The dialog system reinforcement device 530 further receives a response 566 that the dialog system 52 outputs to the input 562 when the value of the determination signal 560 is the second value, and uses the dialog history DB 544 and the question answer DB 548, It is determined whether or not a question related to the input 54 can be presented to the user. When possible, a question 570 is generated and a determination signal 572 (first value) is output. In response to receiving the input 564 from the branching unit 540 when the determination signal 560 is the first value and the question processing unit 550 for outputting (second value), the dialogue history DB 544 and the question answer DB 548 An answer processing unit 574 for generating and outputting an answer 576 to the question output by the dialog system reinforcement device 530 immediately before, and the dialog system 52 The response 566, the response (question) 570 from the question processing unit 550, and the response 576 from the answer processing unit 574 are selected, and one of these is selected according to the combination of the values of the determination signal 560 and the determination signal 572 And a selection unit 554 that outputs the response 532.

図１３を参照して、質問回答ＤＢ５４８は、複数のレコードを含む。各レコードは、質問文と、その質問文に対する回答と、その質問文のトピックと、その質問文に対する回答文とを含む。本実施の形態では、各レコードは、美術・歴史に関する質問文と回答文とを含んでいる。回答は、質問文に対する回答を端的に名詞で示したものである。トピックは、質問文に含まれる名詞のことを指す。この実施の形態でも、トピックとしては、使用頻度が高い単語は使用せず、トピックとして使用するにふさわしい単語のみをトピックとして使用する。回答文は、質問文に対する回答文としてユーザに提示するのに適した形式となっている。 Referring to FIG. 13, question answer DB 548 includes a plurality of records. Each record includes a question sentence, an answer to the question sentence, a topic of the question sentence, and an answer sentence to the question sentence. In the present embodiment, each record includes a question sentence and an answer sentence regarding art / history. The answer is simply a noun indicating the answer to the question sentence. A topic refers to a noun included in a question sentence. Also in this embodiment, words that are frequently used are not used as topics, and only words that are suitable for use as topics are used as topics. The answer sentence has a format suitable for being presented to the user as an answer sentence to the question sentence.

例えば１番目のレコードは、「誰がモナ・リザを描いたか知っていますか？」であり、回答文は「ダ・ビンチがモナ・リザを描きました。」である。このレコードの回答は「ダ・ビンチ」であり、トピックは「モナ・リザ」である。２番目以下のレコードの構成も同様である。なお、この質問回答ＤＢ５４８の各レコードは、例えば教科書のテキストデータから適切なパターンの文を抽出することで容易に生成できる。もちろん、これらレコードを手作業により準備してもよい。 For example, the first record is "Do you know who drew Mona Lisa?" And the answer is "Da Vinci drew Mona Lisa." The answer to this record is “Da Vinci” and the topic is “Mona Lisa”. The configuration of the second and subsequent records is the same. Each record in the question answer DB 548 can be easily generated by extracting a sentence with an appropriate pattern from text data of a textbook, for example. Of course, these records may be prepared manually.

対話履歴ＤＢ５４４には、例えば直近の複数個の質問文及びユーザ入力が蓄積されている。この対話履歴ＤＢ５４４は、直前の応答５３２が質問文だったか否かに関する判定に使用される。対話履歴ＤＢ５４４はまた、同じ質問を過去に行ったか否かに関する判定を行うためにも使用される。過去に行った質問と同じ質問が対話システム補強装置５３０により生成された場合には、その質問は出力しないようにする。同じ質問を繰返すことで対話システムの動作が堂々巡りになることを避けるためである。ただしこの機能については本願発明と直接の関係がないのでその細部についてはここでは説明しない。しかしこの機能はいわゆる当業者であれば容易に実現できる筈である。 In the dialogue history DB 544, for example, a plurality of recent questions and user inputs are accumulated. This dialogue history DB 544 is used for determination as to whether or not the immediately preceding response 532 is a question sentence. The dialogue history DB 544 is also used to make a determination as to whether the same question has been made in the past. When the same question as the question made in the past is generated by the dialogue system reinforcement device 530, the question is not output. This is to prevent the dialogue system from moving around repeatedly by repeating the same question. However, since this function is not directly related to the present invention, its details are not described here. However, this function should be easily realized by those skilled in the art.

《質問処理部５５０》
図１４を参照して、質問処理部５５０は、応答５６６からトピックとしての単語を抽出するためのトピック抽出部６００と、質問処理部５５０が抽出した単語をトピックに含む質問回答のレコードを質問回答ＤＢ５４８から取得するための質問回答取得部６０２と、質問回答取得部６０２が抽出した質問回答のレコードに基づいて、対話システム補強装置５３０が質問を生成して出力すべきか、対話システム５２の応答を出力するべきかを判定するための応答可否判定部６０４と、応答可否判定部６０４により質問を生成すべきと判定されたときに、質問回答取得部６０２により取得された質問回答のレコードに基づいて対話システム補強装置５３０による応答（質問）５７０を生成するための応答生成部６０６とを含む。 << Question Processing Unit 550 >>
Referring to FIG. 14, question processing unit 550 includes topic extraction unit 600 for extracting a word as a topic from response 566, and a question answer record including a question answer record including the word extracted by question processing unit 550 as a topic. Based on the question answer acquisition unit 602 to be acquired from the DB 548 and the question answer record extracted by the question answer acquisition unit 602, whether or not the dialog system reinforcement device 530 should generate and output a question is output as a response of the dialog system 52. Based on the record of the question answer acquired by the question answer acquiring unit 602 when it is determined that the question should be generated by the response enable / disable determining unit 604 for determining whether to output, and the response enable / disable determining unit 604 And a response generation unit 606 for generating a response (question) 570 by the dialog system reinforcement device 530.

《状態判定部５４２》
図１５を参照して、状態判定部５４２は、入力５４からトピックを表す単語を抽出するためのトピック抽出部６３０と、トピック抽出部６３０により抽出された単語をトピックとして持つ質問回答のレコードを質問回答ＤＢ５４８から取得し、対話履歴ＤＢ５４４に保存された対話履歴と、取得された質問回答のレコードとに基づいて、対話システム補強装置５３０による応答（質問）の生成を行うべきか否かを示す判定信号５６０を出力する回答判定部６３２とを含む。回答判定部６３２は、対話履歴ＤＢ５４４により、直前の対話システム補強装置５３０の出力が質問でないとき、又は直前の対話システム補強装置５３０の出力が質問であったが、そのトピックと入力５４からトピック抽出部６３０が抽出したトピックとが一致しないときには、入力５４が質問に対する回答ではないと判定し、判定信号５６０を第２の値とする。直前の対話システム補強装置５３０の出力が質問であり、そのトピックと入力５４からトピック抽出部６３０が抽出したトピックとが一致したときには判定信号５６０を第１の値とする。すなわち、判定信号５６０は、対話システム補強装置５３０による質問を生成すべきと判定されたときには第１の値となり、そうでないときには第２の値となる。 << State determination unit 542 >>
Referring to FIG. 15, state determination unit 542 extracts a topic extraction unit 630 for extracting a word representing a topic from input 54, and a question answer record having the word extracted by topic extraction unit 630 as a topic. Judgment indicating whether or not to generate a response (question) by the dialogue system reinforcement device 530 based on the dialogue history acquired from the answer DB 548 and stored in the dialogue history DB 544 and the acquired record of the question answer And an answer determination unit 632 that outputs a signal 560. The answer determination unit 632 extracts a topic from the topic and the input 54 based on the conversation history DB 544 when the output of the previous dialog system reinforcement device 530 is not a question or when the output of the previous dialog system reinforcement device 530 is a question. When the topic extracted by the unit 630 does not match, it is determined that the input 54 is not an answer to the question, and the determination signal 560 is set to the second value. The output of the previous dialog system reinforcement device 530 is a question, and when the topic matches the topic extracted by the topic extraction unit 630 from the input 54, the determination signal 560 is set to the first value. That is, the determination signal 560 has a first value when it is determined that the question should be generated by the interactive system reinforcement device 530, and has a second value otherwise.

《回答処理部５７４》
図１６を参照して、回答処理部５７４は、入力５６４が与えられたことに応答して、対話履歴ＤＢ５４４及び質問回答ＤＢ５４８を参照して直前に対話システム補強装置５３０が生成した質問と対になっている回答文を質問回答ＤＢ５４８から抽出するための回答文抽出部６６０と、回答文抽出部６６０により抽出された回答文と対話履歴ＤＢ５４４に記憶された前回の質問とを対比することで回答が正しいか否かを判定し、判定結果に応じた適切な形で回答５７６を生成する応答生成部６６２とを含む。 <Answer processing unit 574>
Referring to FIG. 16, in response to the input 564 being given, the answer processing unit 574 refers to the dialog history DB 544 and the question answer DB 548 to pair with the question generated by the dialog system reinforcement device 530 immediately before. The answer sentence extraction unit 660 for extracting the answer sentence that has been created from the question answer DB 548, the answer sentence extracted by the answer sentence extraction unit 660, and the previous question stored in the dialogue history DB 544 are compared with each other. And a response generation unit 662 that generates an answer 576 in an appropriate form according to the determination result.

《選択部５５４》
選択部５５４は、判定信号５６０の値が第２の値であり、判定信号５７２の値が第２の値であるときには応答５６６を選択する。選択部５５４は、判定信号５６０の値が第２の値であり、判定信号５７２の値が第１の値であるときには応答（質問）５７０を選択する。選択部５５４は、判定信号５６０の値が第１の値であるときには、判定信号５７２の値にかかわらず回答５７６を選択する。 << Selection Unit 554 >>
The selection unit 554 selects the response 566 when the value of the determination signal 560 is the second value and the value of the determination signal 572 is the second value. The selection unit 554 selects the response (question) 570 when the value of the determination signal 560 is the second value and the value of the determination signal 572 is the first value. When the value of determination signal 560 is the first value, selection unit 554 selects answer 576 regardless of the value of determination signal 572.

《コンピュータによる質問処理部５５０の実現》
図１４に示した質問処理部５５０は、コンピュータとの協働によりコンピュータを質問処理部５５０として機能させるプログラムにより実現される。図１７を参照して、このプログラムは、対話システム５２の出力する応答５６６（図１２を参照）を取得するステップ６９０と、応答５６６からトピックを表す単語を抽出するステップ６９２と、ステップ６９２で取得されたトピックを検索キーとして質問回答ＤＢ５４８を検索し該当するレコードを取出すステップ６９４と、ステップ６９４で取出されたレコードがあるか否かを判定して制御の流れを分岐させるステップ６９６とを含む。 << Realization of Question Processing Unit 550 by Computer >>
The question processing unit 550 illustrated in FIG. 14 is realized by a program that causes a computer to function as the question processing unit 550 in cooperation with the computer. Referring to FIG. 17, the program obtains in step 690 that obtains response 566 (see FIG. 12) output from dialogue system 52, step 692 that extracts a word representing a topic from response 566, and step 692. Step 694 that searches the question answer DB 548 by using the retrieved topic as a search key and extracts the corresponding record, and step 696 that branches whether the control flow is branched by determining whether or not there is a record extracted in step 694.

このプログラムはさらに、ステップ６９６の判定が肯定のときに実行され、ステップ６９４で取出された各レコードの質問文のスコアを計算するステップ６９８と、対話履歴ＤＢ５４４に対話履歴が保存されていない質問文で、かつステップ６９８で計算されたスコアが最高である質問文を選択するステップ７００と、対話システム５２からの応答５６６とステップ７００で選択された質問文とを用いて応答文を生成し出力するステップ７０２と、ステップ６９６の判定が否定のときに、対話システム５２の応答５６６を選択して出力するステップ７０４とを含む。なお、ステップ６９８での質問文のスコアは、第１の実施の形態におけるニュースらしさのスコアの算出と同様の手法により算出できる。したがってここではその詳細は繰返さない。 This program is further executed when the determination in step 696 is affirmative, and calculates the score of the question sentence of each record extracted in step 694, and the question sentence in which no conversation history is stored in the conversation history DB 544 And a step 700 for selecting the question sentence having the highest score calculated in step 698, a response 566 from the dialog system 52, and the question sentence selected in step 700, and generating and outputting a response sentence. Step 702 and step 704 for selecting and outputting the response 566 of the interactive system 52 when the determination at Step 696 is negative. The score of the question sentence in step 698 can be calculated by the same method as the calculation of the news-likeness score in the first embodiment. Therefore, details thereof will not be repeated here.

《コンピュータによる状態判定部５４２及び回答処理部５７４の実現》
図１５及び図１６に示した状態判定部５４２及び回答処理部５７４は、コンピュータとの協働によりコンピュータを状態判定部５４２及び回答処理部５７４として機能させるプログラムにより実現される。 << Realization of State Determination Unit 542 and Answer Processing Unit 574 by Computer >>
The state determination unit 542 and the answer processing unit 574 illustrated in FIGS. 15 and 16 are realized by a program that causes the computer to function as the state determination unit 542 and the answer processing unit 574 in cooperation with the computer.

図１８を参照して、このプログラムは、対話履歴ＤＢ５４４から対話履歴を取得するステップ７４０と、ステップ７４０で取得した対話履歴に基づき、直前の応答が質問文だったか否かを判定し制御の流れを分岐させるステップ７４２と、ステップ７４２の判定が否定のときに、対話システム５２に入力５４を入力するステップ７５８と、ステップ７５８の処理の結果、対話システム５２から得られる応答を取得するステップ７６０とを含む。 Referring to FIG. 18, this program obtains a dialogue history from dialogue history DB 544 and, based on the dialogue history obtained in step 740, determines whether or not the immediately preceding response is a question sentence, and the control flow Branch 742, step 758 for inputting the input 54 to the dialog system 52 when the determination in step 742 is negative, and step 760 for obtaining a response obtained from the dialog system 52 as a result of the process of step 758; including.

このプログラムはさらに、ステップ７４２の判定が肯定のときに、分岐部５４０からの入力５６４の、質問に対する回答らしさを示すスコアを算出するステップ７４４と、ステップ７４４で算出されたスコアがしきい値より大きいか否かを判定し、判定にしたがって制御の流れを分岐させるステップ７４６とを含む。ステップ７４４のスコア算出は、第１の実施の形態におけるニュースらしさのスコアと同様の手法で計算できる。ステップ７４６での判定が否定のときには制御は上記したステップ７５８に進む。 The program further calculates a score indicating the likelihood of answer to the question of the input 564 from the branching unit 540 when the determination in step 742 is affirmative, and the score calculated in step 744 is based on the threshold value. And step 746 for branching the control flow according to the determination. The score calculation in step 744 can be calculated by the same method as the news-likeness score in the first embodiment. If the determination in step 746 is negative, control proceeds to step 758 described above.

このプログラムはさらに、ステップ７４６の判定が肯定のときに入力５６４からトピックとして単語を抽出するステップ７４８と、ステップ７４８で抽出されたトピックを持つレコードを質問回答ＤＢ５４８から読出しその回答文を取出すステップ７５０を含む。 The program further extracts a word as a topic from the input 564 when the determination at step 746 is affirmative, and reads a record having the topic extracted at step 748 from the question answer DB 548 and retrieves the answer sentence 750. including.

このプログラムはさらに、ステップ７５０で抽出されたトピックが、直前に対話システム補強装置５３０が出力した質問のトピックと一致するか否かを判定し、判定結果にしたがって制御を分岐させるステップ７５２と、ステップ７５２の判定が肯定のときに、入力５４が質問に対する正解である旨の応答をステップ７５０で抽出された回答文を用いて生成するステップ７５４と、ステップ７５２の判定が否定であるときに、入力５４が質問に対する誤答であり、正解はステップ７５０で抽出された回答文である旨の応答を生成するステップ７５６と、ステップ７６０、ステップ７５４、及びステップ７５６の後にこれらステップで生成された応答を図１２に示す応答（質問）５７０として出力するステップ７６２とを含む。 The program further determines whether the topic extracted in step 750 matches the topic of the question output by the dialog system reinforcement device 530 immediately before, and branches control according to the determination result, step 752; When the determination in 752 is affirmative, step 754 for generating a response that the input 54 is a correct answer to the question using the answer sentence extracted in step 750, and input when the determination in step 752 is negative 54 is a wrong answer to the question, and a response that the correct answer is the answer sentence extracted in step 750 is generated. Step 756, and the response generated in these steps after step 760, step 754, and step 756 And step 762 of outputting as a response (question) 570 shown in FIG.

［動作］
この第２の実施の形態に係る対話システム補強装置５３０は、大きく分けて以下の３通りのいずれかにより動作する。第１は、入力５４に対して特に質問を行うことなく応答する場合である（通常時）。第２は、入力５４に対して何らかの質問を行う場合である（質問時）。第３は、質問を行った直後の入力５４に対して回答となる応答を行う場合である（回答時）。以下、これら３つの場合について説明する。 [Operation]
The dialogue system reinforcement device 530 according to the second embodiment operates roughly in any of the following three ways. The first is a case of responding to the input 54 without particularly asking a question (normal time). The second is a case where a question is asked to the input 54 (at the time of a question). The third is a case where a response is made as an answer to the input 54 immediately after the question is made (when answering). Hereinafter, these three cases will be described.

《通常時》
通常時、入力５４が状態判定部５４２と分岐部５４０とに与えられる。このとき、状態判定部５４２の出力する判定信号５６０の値は第２の値となる。すなわち、図１５を参照して、トピック抽出部６３０が入力５４からトピックを抽出する。回答判定部６３２はこのトピックに合致する質問回答レコードを質問回答ＤＢ５４８から読出す。通常時には対話システム補強装置５３０が直前に出力した応答は質問ではない。したがって回答判定部６３２は判定信号５６０を第２の値とする。 "Normal time"
Normally, the input 54 is given to the state determination unit 542 and the branching unit 540. At this time, the value of the determination signal 560 output from the state determination unit 542 becomes the second value. That is, with reference to FIG. 15, the topic extraction unit 630 extracts topics from the input 54. The answer determination unit 632 reads a question answer record that matches this topic from the question answer DB 548. In normal times, the response output immediately before by the dialog system reinforcement device 530 is not a question. Therefore, the answer determination unit 632 sets the determination signal 560 to the second value.

したがって分岐部５４０は入力５４を入力５６２として対話システム５２に与える。対話システム５２はこの入力５６２に対して何らかの応答５６６を出力する。この応答５６６は質問処理部５５０及び選択部５５４に与えられる。 Accordingly, branching unit 540 provides input 54 to dialog system 52 as input 562. The dialog system 52 outputs some response 566 in response to this input 562. This response 566 is given to the question processing unit 550 and the selection unit 554.

図１４を参照して、トピック抽出部６００が応答５６６からトピックを表す単語を抽出する。質問回答取得部６０２は、トピック抽出部６００が抽出した単語をトピックに含む質問回答のレコードを質問回答ＤＢ５４８から取得する。応答可否判定部６０４は、質問回答取得部６０２が抽出した質問回答のレコードに基づいて、対話システム補強装置５３０が質問を生成して出力すべきか、対話システム５２の応答を出力するべきかを判定する。この例は通常時であるため、応答可否判定部６０４は対話システム５２の応答を出力すべきと判定する。この結果、応答可否判定部６０４の出力する判定信号５７２の値は第２の値となる。 Referring to FIG. 14, topic extraction unit 600 extracts a word representing a topic from response 566. The question answer acquisition unit 602 acquires from the question answer DB 548 a question answer record that includes the word extracted by the topic extraction unit 600 in the topic. The response availability determination unit 604 determines whether the dialog system reinforcement device 530 should generate and output a question or output the response of the dialog system 52 based on the question answer record extracted by the question answer acquisition unit 602. To do. Since this example is a normal time, the response availability determination unit 604 determines that the response of the interactive system 52 should be output. As a result, the value of the determination signal 572 output from the response availability determination unit 604 becomes the second value.

判定信号５６０の値が第２の値、判定信号５７２の値が第２の値なので、選択部５５４は応答５６６を選択して応答５３２として出力する。 Since the value of the determination signal 560 is the second value and the value of the determination signal 572 is the second value, the selection unit 554 selects the response 566 and outputs it as the response 532.

《質問時》
質問時にも状態判定部５４２の動作は通常時と同様に動作する。 <At the time of question>
Even at the time of a question, the operation of the state determination unit 542 operates in the same manner as in the normal time.

したがって入力５４は入力５６２として対話システム５２に与えられ、対話システム５２は応答５６６を出力する。応答５６６は質問処理部５５０に与えられる。 Input 54 is thus provided as input 562 to dialog system 52, which outputs response 566. The response 566 is given to the question processing unit 550.

図１４を参照して、トピック抽出部６００が応答５６６からトピックを表す単語を抽出する。質問回答取得部６０２は、トピック抽出部６００が抽出した単語をトピックに含む質問回答のレコードを質問回答ＤＢ５４８から取得する。ここでは、対話履歴ＤＢ５４４に保存された対話履歴により、直前の対話システム補強装置５３０の出力は質問ではないことが分かる。また、質問回答ＤＢ５４８から質問回答レコードが抽出されたものとする。すると、応答可否判定部６０４は、質問回答取得部６０２が抽出した質問回答のレコードに基づいて、対話システム補強装置５３０が質問を生成して出力すべきと判定する。したがって判定信号５７２が第１の値となる。応答生成部６０６は、質問回答取得部６０２により質問回答ＤＢ５４８から抽出した質問回答レコードと入力５４とから質問文を生成し、応答（質問）５７０として出力する。 Referring to FIG. 14, topic extraction unit 600 extracts a word representing a topic from response 566. The question answer acquisition unit 602 acquires from the question answer DB 548 a question answer record that includes the word extracted by the topic extraction unit 600 in the topic. Here, it is understood from the dialogue history stored in the dialogue history DB 544 that the output of the previous dialogue system reinforcement device 530 is not a question. It is assumed that a question answer record is extracted from the question answer DB 548. Then, the response availability determination unit 604 determines that the dialogue system reinforcement device 530 should generate and output a question based on the question answer record extracted by the question answer acquisition unit 602. Therefore, the determination signal 572 becomes the first value. The response generation unit 606 generates a question sentence from the question answer record extracted from the question answer DB 548 by the question answer acquisition unit 602 and the input 54, and outputs it as a response (question) 570.

例えば入力５４が「そういえばモナ・リザがみたいな」というものであったと想定する。これに対する対話システム５２の応答５６６が「モナ・リザの周りにはいつも人が多くいて見るのが大変なようですよ。」というものであったとする。一方、トピックとして「モナ・リザ」が抽出される。この結果、質問回答取得部６０２により抽出された質問回答レコードの質問文が「誰がモナ・リザを描いたか知っていますか。」というものであったとする。この場合、例えば応答生成部６０６は応答５６６とこの質問文との間を「ところで」等の接続語でつなぎ、「モナ・リザの周りにはいつも人が多くいて見るのが大変なようですよ。ところで、誰がモナ・リザを描いたか知っていますか。」という質問文を生成し応答（質問）５７０として出力する。 For example, assume that the input 54 is “Like a Mona Lisa”. Assume that the response 566 of the dialogue system 52 is “There are always so many people around Mona Lisa that it seems difficult to see.” On the other hand, “Mona Lisa” is extracted as a topic. As a result, it is assumed that the question sentence of the question answer record extracted by the question answer acquisition unit 602 is “Do you know who drew Mona Lisa?”. In this case, for example, the response generation unit 606 connects the response 566 and the question sentence with a connection word such as “by the way”, and “It seems to be hard to see around Mona Lisa. “Do you know who drew Mona Lisa?” Is generated and output as a response (question) 570.

選択部５５４において、判定信号５６０が第２の値、判定信号５７２が第１の値となる。判定信号５６０の値が第２の値であり、判定信号５７２の値が第１の値であるときには選択部５５４は応答（質問）５７０を選択する。入力５４と、この応答（質問）５７０と、その際に使用された質問応答レコードを特定する情報とが対話履歴ＤＢ５４４に記録される。 In the selection unit 554, the determination signal 560 is a second value, and the determination signal 572 is a first value. When the value of the determination signal 560 is the second value and the value of the determination signal 572 is the first value, the selection unit 554 selects the response (question) 570. The input 54, this response (question) 570, and information for specifying the question response record used at that time are recorded in the dialogue history DB 544.

《回答時》
回答時には、直前の質問に関する情報が対話履歴ＤＢ５４４に記録されている。図１５と図１８を参照して、回答判定部６３２は、入力５４が与えられると、対話履歴ＤＢ５４４を参照して、直前の対話システム補強装置５３０の応答が質問文であったか否かを判定する（図１８の（以下同じ）ステップ７４２）。この例の場合には直前の応答が質問であったと判定される（ステップ７４２でＹＥＳ）。さらに回答判定部６３２は、入力５４が質問文に対する回答か否かを判定する（ステップ７４４及び７４６）。この設例ではステップ７４６の判定はＹＥＳとなる。したがってトピック抽出部６３０は入力５４からトピックを表す単語を抽出する（ステップ７４８）。回答判定部６３２は、トピック抽出部６３０が抽出したトピックを検索キーに質問回答ＤＢ５４８から質問回答レコードを取出し、回答文を抽出する（ステップ７５０）。この回答文のトピックが直前の質問文のトピックと一致するか否かを判定する（ステップ７５２）。両者が一致すればステップ７５２の判定はＹＥＳとなる。ステップ７５４で正解時の応答を回答文から生成しステップ７６２で出力する。両者が一致しないときには誤答時の応答を回答文から生成しステップ７６２で出力する。 <When answering>
At the time of answering, information related to the immediately previous question is recorded in the dialogue history DB 544. Referring to FIGS. 15 and 18, when input 54 is given, answer determination unit 632 refers to dialog history DB 544 to determine whether or not the previous response from dialog system reinforcement device 530 is a question sentence. (Step 742 in FIG. 18 (hereinafter the same)). In this example, it is determined that the previous response was a question (YES in step 742). Furthermore, the answer determination unit 632 determines whether or not the input 54 is an answer to the question sentence (steps 744 and 746). In this example, the determination in step 746 is YES. Accordingly, the topic extraction unit 630 extracts words representing topics from the input 54 (step 748). The answer determination unit 632 extracts a question answer record from the question answer DB 548 using the topic extracted by the topic extraction unit 630 as a search key, and extracts an answer sentence (step 750). It is determined whether the topic of this answer sentence matches the topic of the immediately preceding question sentence (step 752). If they match, the determination in step 752 is YES. In step 754, a response at the time of correct answer is generated from the answer sentence, and output in step 762. If they do not match, a response at the time of an incorrect answer is generated from the answer sentence and output in step 762.

＜実施の形態の効果＞
以上のように上記各実施の形態によれば、既存の対話システムには手を加えることなく、教育及びニュースのような、あるカテゴリに特化した対話システムに変換できる。既存の対話システムを改造したり、新たな対話システムを開発したりする必要がない。そのため、開発コストを低くできる。また、ユーザは基本的には既存の対話システムと対話をすることになり、ある条件が充足されたときにだけ特化した応答を受け取るようになる。そのため、既存の対話システムをそのまま使用していたときと比較して対話システムの動作が変化したという感覚を持つ可能性を小さくできる。その結果、既存の対話システムを利用していたユーザに違和感を生じさせることなく、既存のシステムに特定の新たな機能を追加できる。 <Effect of Embodiment>
As described above, according to each of the above embodiments, the dialogue system can be converted into a dialogue system specialized in a certain category such as education and news without modifying the existing dialogue system. There is no need to modify the existing dialogue system or develop a new dialogue system. Therefore, the development cost can be reduced. In addition, the user basically interacts with an existing dialog system, and receives a specialized response only when a certain condition is satisfied. Therefore, it is possible to reduce the possibility of having a sense that the operation of the dialogue system has changed compared to when the existing dialogue system is used as it is. As a result, a specific new function can be added to the existing system without causing the user who used the existing dialog system to feel uncomfortable.

なお、上記第２の実施の形態では、問題を美術・歴史から採用しているが本発明はその様な実施の形態には限定されない。一般的に教育に関するものであればどのような学科に対してもこの実施の形態の対話システム補強装置が有効であることは言うまでもない。また教育に関するものだけでなく、いわゆる豆知識的な問題を対話において提供することもできる。さらに、上記実施の形態では回答も文の形式で与えた。しかし本発明はその様な実施の形態には限定されない。例えばいわゆる３択、４択等の形式の問題を出すこともできるし、ＹＥＳ・ＮＯ形式の問題を出すようにすることもできる。 In the second embodiment, the problem is adopted from the art and history, but the present invention is not limited to such an embodiment. It goes without saying that the dialogue system reinforcing apparatus of this embodiment is effective for any department that generally relates to education. In addition to education-related issues, it is also possible to provide so-called knowledgeable problems in dialogue. Further, in the above embodiment, the answer is also given in the form of a sentence. However, the present invention is not limited to such an embodiment. For example, it is possible to issue a so-called 3-choice, 4-choice problem, or a YES / NO problem.

＜コンピュータによる実現＞
上記した各実施の形態に係る対話システム補強装置５０及び対話システム補強装置５３０の各機能部は、それぞれコンピュータハードウェアと、そのハードウェア上でＣＰＵ（中央演算処理装置）及びＧＰＵ（Graphics Processing Unit）により実行される、対話システムに対するアドオンとして準備されるプログラムとにより実現できる。図１９及び図２０に上記各装置及びシステムを実現するコンピュータハードウェアを示す。ＧＰＵは通常は画像処理を行うために使用されるが、このようにＧＰＵを画像処理ではなく通常の演算処理に使用する技術をＧＰＧＰＵ（General-purpose computing on graphics processing units）と呼ぶ。ＧＰＵは同種の複数の演算を同時並列的に実行できる。一方、ニューラルネットワークの訓練時及び動作時には、各ノードの重み演算は単純な積和演算であり、しかもそれらは同時に実行できることが多い。したがって、図４〜図６に示すＭＣＮＮ２２０、並びに図１７に示す質問文のスコア計算及び図１８の回答らしさのスコア計算に使用される図示しないＭＣＮＮ等の訓練と動作にはＧＰＧＰＵが適している。 <Realization by computer>
The functional units of the dialogue system reinforcement device 50 and the dialogue system reinforcement device 530 according to each of the above embodiments are respectively computer hardware, and a CPU (Central Processing Unit) and a GPU (Graphics Processing Unit) on the hardware. And a program prepared as an add-on to the dialogue system. FIG. 19 and FIG. 20 show computer hardware that realizes each of the above apparatuses and systems. A GPU is usually used for performing image processing, and a technique for using a GPU for normal arithmetic processing instead of image processing in this way is called GPGPU (General-purpose computing on graphics processing units). The GPU can simultaneously execute a plurality of operations of the same type in parallel. On the other hand, at the time of training and operation of the neural network, the weight calculation of each node is a simple product-sum operation, and they can often be executed simultaneously. Accordingly, GPGPU is suitable for the training and operation of the MCNN 220 shown in FIGS. 4 to 6 and the MCNN (not shown) used for the score calculation of the question sentence shown in FIG. 17 and the score of the likelihood of answer shown in FIG.

図１９を参照して、このコンピュータシステム８３０は、メモリポート８５２及びＤＶＤ（Digital Versatile Disk）ドライブ８５０を有するコンピュータ８４０と、キーボード８４６と、マウス８４８と、モニタ８４２とを含む。 Referring to FIG. 19, the computer system 830 includes a computer 840 having a memory port 852 and a DVD (Digital Versatile Disk) drive 850, a keyboard 846, a mouse 848, and a monitor 842.

図２０を参照して、コンピュータ８４０は、メモリポート８５２及びＤＶＤドライブ８５０に加えて、ＣＰＵ８５６及びＧＰＵ８５８と、ＣＰＵ８５６、ＧＰＵ８５８、メモリポート８５２及びＤＶＤドライブ８５０に接続されたバス８６６と、ブートプログラム等を記憶する読出専用メモリであるＲＯＭ８６０と、バス８６６に接続され、プログラム命令、システムプログラム及び作業データ等を記憶するコンピュータ読出可能な記憶媒体であるランダムアクセスメモリ（ＲＡＭ）８６２と、ハードディスク８５４を含む。コンピュータ８４０はさらに、いずれもバス８６６に接続され、他端末との通信を可能とするネットワーク８６８への接続を提供するネットワークインターフェイス（Ｉ／Ｆ）８４４と、外部との音声信号の入出力を行うための音声Ｉ／Ｆ８７０とを含む。 Referring to FIG. 20, in addition to the memory port 852 and the DVD drive 850, the computer 840 includes a CPU 856 and a GPU 858, a bus 866 connected to the CPU 856, the GPU 858, the memory port 852 and the DVD drive 850, a boot program, and the like. It includes a ROM 860 that is a read-only memory for storing, a random access memory (RAM) 862 that is a computer-readable storage medium that is connected to the bus 866 and stores program instructions, system programs, work data, and the like, and a hard disk 854. The computer 840 is further connected to the bus 866, and inputs / outputs audio signals to / from a network interface (I / F) 844 that provides a connection to a network 868 that enables communication with other terminals. Voice I / F 870.

コンピュータシステム８３０を上記した実施の形態に係る各装置及びシステムの各機能部として機能させるためのプログラムは、ＤＶＤドライブ８５０又はメモリポート８５２に装着される、いずれもコンピュータ読出可能な記憶媒体であるＤＶＤ８７２又はリムーバブルメモリ８６４に記憶され、さらにハードディスク８５４に転送される。又は、プログラムはネットワーク８６８を通じてコンピュータ８４０に送信されハードディスク８５４に記憶されてもよい。プログラムは実行の際にＲＡＭ８６２にロードされる。ＤＶＤ８７２から、リムーバブルメモリ８６４から又はネットワーク８６８を介して、直接にＲＡＭ８６２にプログラムをロードしてもよい。また、上記処理に必要なデータは、ハードディスク８５４、ＲＡＭ８６２、ＣＰＵ８５６又はＧＰＵ８５８内のレジスタ等の所定のアドレスに記憶され、ＣＰＵ８５６又はＧＰＵ８５８により処理され、プログラムにより指定されるアドレスに格納される。最終的に訓練が終了したＭＣＮＮのパラメータは、例えばハードディスク８５４に格納されたり、ＤＶＤドライブ８５０及びメモリポート８５２をそれぞれ介してＤＶＤ８７２又はリムーバブルメモリ８６４に格納されたりする。又は、ネットワークＩ／Ｆ８４４を介してネットワーク８６８に接続された他のコンピュータ又は記憶装置に送信される。 A program for causing the computer system 830 to function as each functional unit of each device and system according to the above-described embodiments is a DVD 872 which is a computer-readable storage medium that is mounted on the DVD drive 850 or the memory port 852. Alternatively, it is stored in the removable memory 864 and further transferred to the hard disk 854. Alternatively, the program may be transmitted to the computer 840 through the network 868 and stored in the hard disk 854. The program is loaded into the RAM 862 when executed. The program may be loaded directly from the DVD 872 to the RAM 862 from the removable memory 864 or via the network 868. Data necessary for the above processing is stored in a predetermined address such as a hard disk 854, RAM 862, CPU 856 or a register in the GPU 858, processed by the CPU 856 or GPU 858, and stored in an address designated by the program. The MCNN parameters that have been finally trained are stored in, for example, the hard disk 854 or in the DVD 872 or the removable memory 864 via the DVD drive 850 and the memory port 852, respectively. Alternatively, the data is transmitted to another computer or storage device connected to the network 868 via the network I / F 844.

このプログラムは、コンピュータ８４０を、上記実施の形態に係る各装置及びシステムとして機能させるための複数の命令からなる命令列を含む。上記各装置及びシステムにおける数値演算処理は、ＣＰＵ８５６及びＧＰＵ８５８を用いて行う。ＣＰＵ８５６のみを用いてもよいがＧＰＵ８５８を用いる方が高速である。コンピュータ８４０にこの動作を行わせるのに必要な基本的機能のいくつかはコンピュータ８４０上で動作するオペレーティングシステム若しくはサードパーティのプログラム又はコンピュータ８４０にインストールされる、ダイナミックリンク可能な各種プログラミングツールキット又はプログラムライブラリにより提供される。したがって、このプログラム自体はこの実施の形態のシステム、装置及び方法を実現するのに必要な機能全てを必ずしも含まなくてよい。このプログラムは、命令の内、所望の結果が得られるように制御されたやり方で適切な機能又はプログラミングツールキット又はプログラムライブラリ内の適切なプログラムを実行時に動的に呼出すことにより、上記したシステム、装置又は方法としての機能を実現する命令のみを含んでいればよい。もちろん、プログラムのみで必要な機能を全て提供してもよい。 This program includes an instruction sequence including a plurality of instructions for causing the computer 840 to function as each device and system according to the above-described embodiment. Numerical calculation processing in each of the above devices and systems is performed using the CPU 856 and the GPU 858. Only the CPU 856 may be used, but using the GPU 858 is faster. Some of the basic functions necessary to cause computer 840 to perform this operation are an operating system or third party program running on computer 840 or various dynamically linked programming toolkits or programs installed on computer 840. Provided by the library. Therefore, this program itself does not necessarily include all the functions necessary for realizing the system, apparatus, and method of this embodiment. The program is a system as described above, by dynamically calling the appropriate program in the appropriate function or programming toolkit or program library at run time in a controlled manner to obtain the desired result. It only needs to include instructions that implement the functions of the apparatus or method. Of course, all necessary functions may be provided only by the program.

今回開示された実施の形態は単に例示であって、本発明が上記した実施の形態のみに制限されるわけではない。本発明の範囲は、発明の詳細な説明の記載を参酌した上で、特許請求の範囲の各請求項によって示され、そこに記載された文言と均等の意味及び範囲内での全ての変更を含む。 The embodiment disclosed herein is merely an example, and the present invention is not limited to the above-described embodiment. The scope of the present invention is indicated by each claim of the claims after taking into account the description of the detailed description of the invention, and all modifications within the meaning and scope equivalent to the wording described therein are included. Including.

５０、５３０対話システム補強装置
５２対話システム
６０、６００、６３０トピック抽出部
６２ニュースタイトル取得部
６４、６０４応答可否判定部
６６ニュースＤＢ
６８、５４４対話履歴ＤＢ
７０、７４、５５４選択部
７２、６０６、６６２応答生成部
７８ニュースタイトル
１５０モデル学習装置
１６０大規模テキストデータベース
１６２モデル学習部
１６４ニュース用語辞書
１６６接続語辞書
１６８ニュースらしさ判定モデル
１７０ニュースタイトル記憶部
１７２ニュースタイトルスコアリング部
１７４スコア付ニュースタイトル
１９０テキスト抽出・分類部
１９２学習用テキスト記憶部
１９４学習処理部
２２０ＭＣＮＮ
２６０、２６２、２６４、２６６畳み込みニューラルネットワーク群
２８０サブネットワーク
３６０ニュースタイトル候補
５４０分岐部
５４２状態判定部
５４８質問回答ＤＢ
５５０質問処理部
５７４回答処理部
６０２質問回答取得部
６３２回答判定部
６６０回答文抽出部 50, 530 Dialog system reinforcement device 52 Dialog system 60, 600, 630 Topic extraction unit 62 News title acquisition unit 64, 604 Response availability determination unit 66 News DB
68,544 Dialog history DB
70, 74, 554 Selection unit 72, 606, 662 Response generation unit 78 News title 150 Model learning device 160 Large text database 162 Model learning unit 164 News term dictionary 166 Connection word dictionary 168 News likelihood determination model 170 News title storage unit 172 News title scoring unit 174 Scored news title 190 Text extraction / classification unit 192 Learning text storage unit 194 Learning processing unit 220 MCNN
260, 262, 264, 266 Convolutional neural network group 280 Subnetwork 360 News title candidate 540 Branch unit 542 State determination unit 548 Question answer DB
550 Question processing unit 574 Answer processing unit 602 Question answer acquisition unit 632 Answer determination unit 660 Answer sentence extraction unit

Claims

A dialogue system reinforcement device that operates with a dialogue system and gives a feature to dialogue by the dialogue system,
First determination means for receiving an input given to the dialog system and determining whether or not to output a response to the input on behalf of the dialog system;
In response to a positive determination by the first determination unit, a first response generation unit for generating a response by a response generation method prepared in advance based on the input;
The output of the first response generation means is connected to receive the output of the first response generation means and the output of the dialogue system, and the output of the first response generation means is determined according to whether the determination result of the first determination means is affirmative. And a first selection means for selectively outputting the output of the dialogue system.

The first determination means includes
A response sentence candidate storage unit storing a plurality of response sentence candidates so that a response sentence candidate related to the word can be searched using the word as a key;
Word extraction means for extracting words included in the input from the input;
Response sentence candidate extracting means for searching for and extracting response sentence candidates from the response sentence candidate storage means using the word extracted by the word extraction means as a key;
And means for determining whether or not to output a response to the input on behalf of the dialogue system according to whether or not there is a response sentence candidate extracted by the response sentence candidate extraction means. The dialogue system reinforcement device described.

The first response generation means includes:
For each response sentence candidate extracted by the response sentence candidate extraction means, a score calculation means for calculating a feature score indicating the degree representing the feature;
Means for selecting a response sentence candidate having the highest feature score calculated by the score calculation means from among the response sentence candidates extracted by the response sentence candidate extraction means and outputting the response sentence candidate to the first selection means; The dialog system reinforcement apparatus of Claim 2 containing.

The first response generation means includes:
A response sentence candidate storage unit storing a plurality of response sentence candidates so that a response sentence candidate related to the word can be searched using the word as a key;
Word extraction means for extracting words included in the input from the input;
Response sentence candidate extracting means for searching for and extracting response sentence candidates from the response sentence candidate storage means using the word extracted by the word extraction means as a key;
For each response sentence candidate extracted by the response sentence candidate extraction means, a score calculation means for calculating a feature score indicating the degree representing the feature;
Means for selecting a response sentence candidate having the highest feature score calculated by the score calculation means from among the response sentence candidates extracted by the response sentence candidate extraction means, and outputting the response sentence candidate to the first selection means; The dialog system reinforcement apparatus of Claim 1 containing this.

The first response generation means includes:
A specific dialogue data item storage means for storing a plurality of specific dialogue data items;
Each of the specific dialogue data items includes a first response sentence, a second response sentence to be output after the first response sentence, and a first word included in at least the first response sentence. Including
The first response generation means further includes:
Word extraction means connected to the dialogue system for receiving a system response sentence output by the dialogue system in response to the input and for extracting a word included in the system response sentence;
Data item extraction means for extracting from the specific dialogue data item storage means a specific dialogue data item including the word extracted by the word extraction means as the first word;
Means for outputting the first response text included in the specific dialogue data item retrieved by the data item retrieving means and inputting it to the first selection means;
Second determination means for determining whether or not the specific dialog data item has been extracted by the data item extraction means;
History storage means for storing an output history indicating that the first response sentence has been output;
The first determination means refers to the history storage means in response to the input being given to the dialogue system, and is the input immediately after the output of the first response sentence? A third determining means for determining whether or not to output a response to the input on behalf of the dialogue system according to whether or not
The dialog system reinforcing device is further responsive to the second determination means determining that the input given to the dialog system is an input immediately after the output of the first response sentence. Second response generation means for generating and outputting a response based on the second response text in the same specific interaction data item as the first response text,
The first selection means is connected to receive the output of the first response generation means, the output of the dialogue system, and the output of the second response generation means, and the second determination means According to the combination of the determination result and the determination result by the third determination unit, the output of the first response generation unit, the output of the second response generation unit, and the output of the dialog system are selectively selected. The interactive system reinforcement device according to claim 1, further comprising second selection means for outputting.

A computer program for causing a computer to function as each means of the interactive system reinforcement device according to any one of claims 1 to 5.