JP2005182400A

JP2005182400A - Natural language processor, processing method and program, and portable terminal loaded with natural language processor

Info

Publication number: JP2005182400A
Application number: JP2003421294A
Authority: JP
Inventors: Hiroaki Kiso; 宏顕木曽
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2003-12-18
Filing date: 2003-12-18
Publication date: 2005-07-07

Abstract

<P>PROBLEM TO BE SOLVED: To provide a natural language processor, processing method and a program and a portable terminal loaded with the natural language processor for easily browsing opposite party information in an arranged form by automatically extracting from the electronic mail the information (attributes and conditions, etc.) of an opposite party with whom electronic mail is exchanged, and storing the information as a database even for a small-sized device with limited hardware resources. <P>SOLUTION: The natural language processor is provided with: a morpheme analysis part for performing a the processing of dividing a sentence included in received electronic mail into word units; a relation analysis part for performing the processing of extracting the opposite party information on the basis of the analysis result of the morpheme analysis part; a dictionary for morpheme analysis/relation analysis storing necessary minimum words and rules for the relation analysis to be required when performing registration in an opposite party information database as the opposite party information and to be referred to when the morpheme analysis part and the relation analysis part perform analysis/extraction processing; and a control part. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は自然言語処理装置、方法、及びプログラム、並びに自然言語処理装置を搭載した携帯端末に関し、特に、電子メールをやりとりしている相手の情報（属性や状況など）を電子メールから自動的に抽出してデータベースとして蓄積することを可能とする、自然言語処理装置、方法、及びプログラム、並びに自然言語処理装置を搭載した携帯端末に関する。 The present invention relates to a natural language processing apparatus, method and program, and a portable terminal equipped with a natural language processing apparatus, and in particular, information (attributes, status, etc.) of a partner with whom an electronic mail is exchanged is automatically transmitted from the electronic mail. The present invention relates to a natural language processing apparatus, method, and program that can be extracted and stored as a database, and a portable terminal equipped with the natural language processing apparatus.

近年、パーソナルコンピュータや携帯端末を利用した電子メールが広く普及してきたため、電子メールをやりとりしている相手の情報（属性や状況など）を容易に得られるユーザインタフェースの提供が望まれるようになってきている。 In recent years, e-mail using personal computers and portable terminals has become widespread, and it has become desirable to provide a user interface that can easily obtain information (attributes, status, etc.) of the partner with whom e-mail is exchanged. ing.

しかしながら、電子メールをやりとりしている相手の情報を、その電子メールを元に得ようとする場合、まずは膨大な量の過去の電子メールを保存しておくことが前提となり、更には、それら過去の電子メールから必要な情報を一つずつ検索して確認するという作業が必要である。 However, in order to obtain information on the other party with whom an email is exchanged based on the email, it is premised that a huge amount of past emails must be saved, It is necessary to search and confirm necessary information one by one from the e-mail.

電子メールには、その送受信者間或いは関係者間にとって有用な情報が多く含まれているが、その有用な情報を電子メールを元にして得ようとする場合には、上述したように、膨大な量の過去の電子メールを保存しておくことが前提となり、それら過去の電子メールから、得ようとする有用な情報を一つずつ検索して確認するといった作業が必要となってくる。 The e-mail contains a lot of useful information for the sender / receiver or between the parties concerned. However, when trying to obtain the useful information based on the e-mail, as described above, it is enormous. It is assumed that a large amount of past e-mails is stored, and it is necessary to search and confirm useful information to be obtained one by one from the past e-mails.

膨大な量の電子メールを、後に効率的に確認できるようにするために、これら電子メールをテーマ別にデータベース化して保存・蓄積しておくことが一つの解決策として提案されてきている（例えば、特許文献１参照。）。 In order to be able to efficiently check a large amount of e-mails later, it has been proposed as a solution to store and store these e-mails in a database by theme (for example, (See Patent Document 1).

上述した特許文献１に記載の「情報管理装置、通信端末、情報管理システム、情報管理方法、記録媒体及びプログラム」には、特定の人物間でやりとりされた業務文書（電子メール）を、業務課題毎にデータベースで別に保管することが記載されており、このことにより、データベースに保管された業務文書を、その業務課題の関係者間で共有することができるようになるとしている。 In the “information management device, communication terminal, information management system, information management method, recording medium, and program” described in Patent Document 1 described above, business documents (e-mail) exchanged between specific persons are It is described that the database is stored separately in each database, and this allows business documents stored in the database to be shared among the parties involved in the business task.

また、電子メールから有用な情報を一つずつ検索して確認するといった作業を、作業負担の増加を招くことなく自動的に行う電子メール装置も提案されている（例えば、特許文献２参照。）。 There has also been proposed an e-mail apparatus that automatically performs an operation of searching and checking useful information one by one from an e-mail without causing an increase in work load (see, for example, Patent Document 2). .

上述した特許文献２に記載の「電子メール装置、及び電子メールプログラムが記録された記録媒体」には、電子メール装置のＣＰＵが、受信した電子メールから所定の部分に記載される文字列を抽出し、この抽出された文字列を、例えば住所録プログラムによって管理されている住所録データ等の分類に応じた所定の項目に登録する技術が記載されている。このことにより、住所録データによって分類して記憶される複数の項目に応じて、受信した電子メールの所定の部分（シグネチャなど）に記載される部分から必要な情報が抽出されて登録されるために、特別な編集作業等を行うことなく電子メールの内容を有効に利用することができるようになるとしている。 In the above-mentioned “recording medium on which an electronic mail apparatus and an electronic mail program are recorded” described in Patent Document 2, the CPU of the electronic mail apparatus extracts a character string described in a predetermined portion from the received electronic mail. A technique is described in which the extracted character string is registered in a predetermined item corresponding to the classification of address book data managed by, for example, an address book program. As a result, necessary information is extracted and registered from a part described in a predetermined part (e.g., signature) of received e-mail according to a plurality of items classified and stored by address book data. In addition, the contents of the e-mail can be effectively used without special editing work or the like.

さらに、電子メールから有用な情報（属性情報）を抽出するにあたり、電子メールのテキストを解析して単語分割や単語情報を取得し、辞書を用いてテキストから属性情報を抽出する電子メールシステムも提案されている（例えば、特許文献３参照。）。 Furthermore, when extracting useful information (attribute information) from e-mails, we also propose an e-mail system that analyzes e-mail texts to obtain word segmentation and word information, and extracts attribute information from texts using a dictionary. (For example, see Patent Document 3).

上述した特許文献３に記載の「電子メールシステム」には、辞書を用いて、メール中のテキストを解析して単語分割や単語情報を取得するテキスト解析処理部と、これを用いてヘッダ部および本文部より属性情報を抽出する属性情報抽出部と、属性情報を電子メールのヘッダ部に拡張情報として追加するヘッダ更新部と、を備える電子メールシステムが記載されている。このシステムにより、電子メールの属性情報を抽出して表示したりできるようになるものとしている。 The “e-mail system” described in Patent Document 3 described above uses a dictionary to analyze a text in an email to obtain word division and word information, and to use this to analyze a header and An e-mail system is described that includes an attribute information extraction unit that extracts attribute information from a body part, and a header update unit that adds the attribute information as extension information to the header part of the e-mail. With this system, it is possible to extract and display e-mail attribute information.

特開２００３−０１６０１９号公報（第５−１２頁、図１−３０）JP 2003-016019 A (page 5-12, FIG. 1-30) 特開２０００−１７２５８７号公報（第３−６頁、図１−４）JP 2000-172587 A (page 3-6, FIGS. 1-4) 特開平１０−０７８９２３号公報（第３−６頁、図１−１２）Japanese Patent Laid-Open No. 10-078923 (page 3-6, FIG. 1-12)

上述した従来の情報管理装置や電子メールシステムにおいては、受信した電子メールをテーマ別にデータベース化して蓄積・保存すること、或いは、受信した電子メールから住所録データ等の有用な情報を抽出して必要箇所に登録して利用すること、また、電子メールのテキストを解析して電子メールの属性情報を抽出すること、などが実現できるようになり、膨大な量の電子メールから有用な情報を自動的に抽出・保存することが可能となっている。 In the above-described conventional information management apparatus and e-mail system, it is necessary to store and store the received e-mail as a database by theme, or to extract useful information such as address book data from the received e-mail. It is possible to register and use in locations, analyze email text and extract email attribute information, etc., and automatically collect useful information from a huge amount of email It is possible to extract and save.

しかし、それらの電子メールから、電子メールをやりとりしている相手の属性や状況などの情報を、自動的に抽出して保存するユーザインタフェースを提供することができれば、従来行われていた、受信した電子メールから必要な情報を手動でピックアップして保存するといった大変な手間が削減されるようになり、ユーザの利便性が向上するものとなる。 However, if it is possible to provide a user interface that automatically extracts and saves information such as the attributes and status of the person with whom the email is exchanged from those emails, it has been received in the past The trouble of manually picking up and storing necessary information from the e-mail is reduced, and the convenience for the user is improved.

また、電子メールのテキスト解析処理が、例えば上述の特許文献３に記載の電子メールシステムより簡便な方式で実現できれば、近年普及してきた携帯端末などの、ハードウェアリソースに制限を持つ小型装置であっても、容易に搭載可能となり、ユーザの利便性がより一層向上するものとなる。 In addition, if e-mail text analysis processing can be realized by a simpler method than the e-mail system described in Patent Document 3, for example, it is a small device with limited hardware resources, such as portable terminals that have become popular in recent years. However, it can be easily mounted, and the convenience for the user is further improved.

本発明は上述した事情を改善するために成されたものであり、従って本発明の目的は、電子メールをやりとりしている相手の情報（属性や状況など）を電子メールから自動的に抽出してデータベースとして蓄積することを可能とし、また、ハードウェアリソースに制限を持つ小型装置であっても容易に搭載可能とすることにより、相手情報を容易にまとまった形で閲覧することを可能とする、自然言語処理装置、方法、及びプログラム、並びに自然言語処理装置を搭載した携帯端末を提供することにある。 The present invention has been made in order to improve the above-described circumstances. Therefore, the object of the present invention is to automatically extract information (attributes, status, etc.) of the partner with whom an email is exchanged from the email. Can be stored as a database, and even small devices with limited hardware resources can be easily installed, so that partner information can be easily viewed in a unified form. Another object of the present invention is to provide a natural language processing apparatus, method and program, and a portable terminal equipped with the natural language processing apparatus.

本発明の自然言語処理装置は、電子メールをやりとりしている相手の属性や状況などからなる相手情報を受信電子メールから抽出して相手情報データベースに登録する自然言語処理装置であって、
前記自然言語処理装置は、受信電子メールに含まれる文章を単語単位に分割する処理を行う形態素解析部と、
前記形態素解析部の解析結果を元に相手情報を抽出する処理を行う関係解析部と、
相手情報として前記相手情報データベースに登録する上で必要となる必要最小限の単語と関係解析用ルールとを記憶し前記形態素解析部及び前記関係解析部が解析・抽出処理を行う際に参照する形態素解析・関係解析用辞書と、
受信電子メールの電子メール本文を前記形態素解析部に形態素解析させる制御と、前記形態素解析部の解析結果から前記関係解析部に関係解析させて相手情報の抽出処理を実行させる制御と、前記関係解析部の抽出処理の結果に前記受信電子メールの電子メールヘッダ情報のメールアドレスとタイムスタンプの情報を付与して前記相手情報データベースに登録する制御と、を行う制御部と、
を備えることを特徴とする。 The natural language processing apparatus of the present invention is a natural language processing apparatus that extracts partner information including attributes and conditions of a partner with whom an email is exchanged from a received email and registers it in the partner information database.
The natural language processing device includes a morpheme analyzer that performs processing to divide a sentence included in a received email into words.
A relation analysis unit that performs processing to extract partner information based on the analysis result of the morphological analysis unit;
A morpheme that is stored as the partner information in the partner information database and stores the minimum necessary words and relation analysis rules and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing. A dictionary for analysis and relation analysis,
Control for causing the morpheme analysis unit to analyze the email body of the received email, control for causing the relationship analysis unit to perform relationship analysis based on the analysis result of the morpheme analysis unit, and execution of the partner information extraction process, and the relationship analysis A control unit for performing the control of adding the e-mail address and time stamp information of the e-mail header information of the received e-mail to the result of the extraction process of the unit and registering the information in the counterpart information database;
It is characterized by providing.

また、前記自然言語処理装置は、更に、相手情報を履歴として管理する相手情報履歴管理部を備え、前記制御部は、前記関係解析部の抽出処理の結果を得た際に、前記相手情報履歴管理部に前記抽出処理の結果と同一カテゴリの相手情報が既に前記相手情報データベースに登録されているかを確認させ、同一カテゴリの相手情報が既に登録されていた場合には既存の相手情報を前記抽出処理の結果に置き換えて登録させる制御を行わせる、ことを特徴とする。 The natural language processing device further includes a partner information history management unit that manages partner information as a history, and the control unit obtains the result of the extraction process of the relation analysis unit, and the partner information history The management unit confirms whether the partner information of the same category as the result of the extraction process has already been registered in the partner information database. If the partner information of the same category has already been registered, the existing partner information is extracted. It is characterized in that control is performed to replace the result of processing for registration.

さらに、前記自然言語処理装置は、入力装置から相手情報の閲覧要求情報を得た際に、前記相手情報データベースを検索して前記閲覧要求情報に合致する相手情報を抽出し出力装置に出力する、ことを特徴とする。 Further, when the natural language processing device obtains the browsing request information of the partner information from the input device, it searches the partner information database, extracts the partner information that matches the browsing request information, and outputs it to the output device. It is characterized by that.

本発明の自然言語処理方法は、電子メールをやりとりしている相手の属性や状況などからなる相手情報を受信電子メールから抽出して相手情報データベースに登録する自然言語処理装置において、
前記自然言語処理装置は、受信電子メールに含まれる文章を単語単位に分割する処理を行う形態素解析部と、
前記形態素解析部の解析結果を元に相手情報を抽出する処理を行う関係解析部と、
相手情報として前記相手情報データベースに登録する上で必要となる必要最小限の単語と関係解析用ルールとを記憶し前記形態素解析部及び前記関係解析部が解析・抽出処理を行う際に参照する形態素解析・関係解析用辞書と、を備え、
前記自然言語処理装置が、受信電子メールの電子メール本文を前記形態素解析部に形態素解析させるステップと、前記形態素解析部の解析結果から前記関係解析部に関係解析させて相手情報の抽出処理を実行させるステップと、前記関係解析部の抽出処理の結果に前記受信電子メールの電子メールヘッダ情報のメールアドレスとタイムスタンプの情報を付与して前記相手情報データベースに登録するステップと、を実行することを特徴とする。 The natural language processing method of the present invention is a natural language processing apparatus that extracts partner information including attributes and conditions of a partner with whom an email is exchanged from a received email and registers it in the partner information database.
The natural language processing device includes a morpheme analyzer that performs processing to divide a sentence included in a received email into words.
A relation analysis unit that performs processing to extract partner information based on the analysis result of the morphological analysis unit;
A morpheme that is stored as the partner information in the partner information database and stores the minimum necessary words and relation analysis rules and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing. And a dictionary for analysis and relation analysis,
The natural language processing device causes the morphological analysis unit to analyze the email body of the received e-mail, and causes the relationship analysis unit to analyze the relationship from the analysis result of the morpheme analysis unit, and executes partner information extraction processing And a step of adding a mail address and time stamp information of the e-mail header information of the received e-mail to the result of the extraction process of the relation analysis unit and registering the result in the partner information database. Features.

また、前記自然言語処理装置は、更に、相手情報を履歴として管理する相手情報履歴管理部を備え、前記自然言語処理装置が、前記関係解析部の抽出処理の結果を得た際に、前記相手情報履歴管理部に前記抽出処理の結果と同一カテゴリの相手情報が既に前記相手情報データベースに登録されているかを確認させ、同一カテゴリの相手情報が既に登録されていた場合には既存の相手情報を前記抽出処理の結果に置き換えて登録させる、ことを特徴とする。 The natural language processing device further includes a partner information history management unit that manages partner information as a history, and when the natural language processing device obtains a result of the extraction process of the relation analysis unit, Let the information history management unit check whether the partner information of the same category as the result of the extraction process has already been registered in the partner information database, and if the partner information of the same category has already been registered, the existing partner information is The result of registration is replaced with the result of the extraction process.

さらに、前記自然言語処理装置は、入力装置から相手情報の閲覧要求情報を得た際に、前記相手情報データベースを検索して前記閲覧要求情報に合致する相手情報を抽出し出力装置に出力するステップを実行する、ことを特徴とする。 Further, when the natural language processing device obtains the browsing request information of the partner information from the input device, the natural language processing device searches the partner information database, extracts the partner information that matches the browsing request information, and outputs the partner information to the output device. It is characterized by performing.

本発明のプログラムは、受信電子メールに含まれる文章を単語単位に分割する処理を行う形態素解析部と、
前記形態素解析部の解析結果を元に電子メールをやりとりしている相手の属性や状況などからなる相手情報を抽出する処理を行う関係解析部と、
相手情報として相手情報データベースに登録する上で必要となる必要最小限の単語と関係解析用ルールとを記憶し前記形態素解析部及び前記関係解析部が解析・抽出処理を行う際に参照する形態素解析・関係解析用辞書と、を備え、受信電子メールから相手情報を抽出して前記相手情報データベースに登録する自然言語処理装置のコンピュータに、
受信電子メールの電子メール本文を前記形態素解析部に形態素解析させる処理と、前記形態素解析部の解析結果から前記関係解析部に関係解析させて相手情報の抽出処理を実行させる処理と、前記関係解析部の抽出処理の結果に前記受信電子メールの電子メールヘッダ情報のメールアドレスとタイムスタンプの情報を付与して前記相手情報データベースに登録する処理と、を実行させることを特徴とする。 The program of the present invention includes a morpheme analyzer that performs a process of dividing a sentence included in a received e-mail into word units;
A relational analysis unit that performs processing to extract partner information consisting of attributes and status of a partner who is exchanging e-mails based on the analysis result of the morphological analysis unit;
Morphological analysis that is stored in the partner information database as partner information and stores the minimum necessary words and relation analysis rules, and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing A relational analysis dictionary, and a computer of a natural language processing device that extracts partner information from received e-mail and registers it in the partner information database;
A process for causing the morphological analysis unit to analyze the body of the email of the received e-mail, a process for causing the relational analysis unit to perform a relational analysis based on an analysis result of the morpheme analysis part, and a process for executing the partner information extraction process. A process of adding the mail address and time stamp information of the e-mail header information of the received e-mail to the result of the part extraction process and registering the result in the partner information database.

また、前記自然言語処理装置は、更に、相手情報を履歴として管理する相手情報履歴管理部を備え、前記プログラムは、前記コンピュータが前記関係解析部から抽出処理の結果を得た際に、前記相手情報履歴管理部に前記抽出処理の結果と同一カテゴリの相手情報が既に前記相手情報データベースに登録されているかを確認させ、同一カテゴリの相手情報が既に登録されていた場合には既存の相手情報を前記抽出処理の結果に置き換えて登録させる処理を実行させる、ことを特徴とする。 The natural language processing apparatus further includes a partner information history management unit that manages partner information as a history, and the program receives the result of the extraction process from the relation analysis unit when the computer obtains a result of the extraction process. Let the information history management unit check whether the partner information of the same category as the result of the extraction process has already been registered in the partner information database, and if the partner information of the same category has already been registered, the existing partner information is It is characterized in that a process of registering with the result of the extraction process is executed.

さらに、前記プログラムは、入力装置から相手情報の閲覧要求情報を得た際に、前記相手情報データベースを検索して前記閲覧要求情報に合致する相手情報を抽出し出力装置に出力する処理を実行させる、ことを特徴とする。 Further, when the program obtains the browsing request information of the partner information from the input device, the program executes a process of searching the partner information database to extract partner information that matches the browsing request information and outputting it to the output device. It is characterized by that.

本発明の自然言語処理装置を搭載した携帯端末は、電子メールをやりとりしている相手の属性や状況などからなる相手情報を受信電子メールから抽出して相手情報データベースに登録する自然言語処理装置を搭載した携帯端末であって、
前記自然言語処理装置は、受信電子メールに含まれる文章を単語単位に分割する処理を行う形態素解析部と、
前記形態素解析部の解析結果を元に相手情報を抽出する処理を行う関係解析部と、
相手情報として前記相手情報データベースに登録する上で必要となる必要最小限の単語と関係解析用ルールとを記憶し前記形態素解析部及び前記関係解析部が解析・抽出処理を行う際に参照する形態素解析・関係解析用辞書と、
受信電子メールの電子メール本文を前記形態素解析部に形態素解析させる制御と、前記形態素解析部の解析結果から前記関係解析部に関係解析させて相手情報の抽出処理を実行させる制御と、前記関係解析部の抽出処理の結果に前記受信電子メールの電子メールヘッダ情報のメールアドレスとタイムスタンプの情報を付与して前記相手情報データベースに登録する制御と、を行う制御部と、
を備えることを特徴とする。 A mobile terminal equipped with the natural language processing device of the present invention is a natural language processing device that extracts partner information including attributes and conditions of a partner with whom an email is exchanged from a received email and registers it in the partner information database. An on-board mobile device,
The natural language processing device includes a morpheme analyzer that performs processing to divide a sentence included in a received email into words.
A relation analysis unit that performs processing to extract partner information based on the analysis result of the morphological analysis unit;
A morpheme that is stored as the partner information in the partner information database and stores the minimum necessary words and relation analysis rules and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing. A dictionary for analysis and relation analysis,
Control for causing the morpheme analysis unit to analyze the email body of the received email, control for causing the relationship analysis unit to perform relationship analysis based on the analysis result of the morpheme analysis unit, and execution of the partner information extraction process, and the relationship analysis A control unit for performing the control of adding the e-mail address and time stamp information of the e-mail header information of the received e-mail to the result of the extraction process of the unit and registering the information in the counterpart information database;
It is characterized by providing.

本発明の自然言語処理装置、方法、及びプログラム、並びに自然言語処理装置を搭載した携帯端末は、受信電子メールから相手情報を自動的に抽出し、相手情報を相手情報データベースに蓄積できるので、膨大な量の過去の電子メールを保存およびその検索をすることなしに、相手情報を容易にまとまった形で閲覧することが可能となるという効果を有すると共に、ユーザが手動で受信電子メールから相手情報を抽出する作業が必要なくなり、ユーザの手間を削減可能となり、ユーザの利便性が向上する、という効果を有している。 The natural language processing device, method and program of the present invention, and a portable terminal equipped with the natural language processing device can automatically extract the partner information from the received e-mail and store the partner information in the partner information database. It is possible to easily view the partner information in a collective form without storing and retrieving a large amount of past emails, and the user can manually read the partner information from the received email. This eliminates the need for the work of extracting, so that the user's labor can be reduced and the convenience of the user is improved.

また、本発明において、形態素解析処理に関係解析処理を追加しており、関係解析処理として必要となる程度の形態素解析処理にとどめることにより、そして、形態素解析／関係解析用辞書には、関係解析処理に必要な最低限の単語とそのルールだけを登録することにより、ハードウェアリソースに制限のある携帯端末などの小型装置であっても、自然言語処理装置を容易に搭載可能となると共に、自然言語処理の速度が向上する、という効果を奏することができる。 Further, in the present invention, a relational analysis process is added to the morphological analysis process. By limiting the morphological analysis process to a degree necessary for the relational analysis process, the morphological analysis / relationship analysis dictionary includes a relational analysis. By registering only the minimum words and rules necessary for processing, a natural language processing device can be easily installed even in a small device such as a portable terminal with limited hardware resources. The effect that the speed of language processing improves can be produced.

さらに、本発明において、電子メールが持つタイムスタンプを利用して、相手情報を時系列的に相手情報データベースに登録することにより、相手情報を時系列的に閲覧可能となる、という効果を奏することができる。 Furthermore, in the present invention, by registering the partner information in the partner information database in time series using the time stamp of the e-mail, there is an effect that the partner information can be browsed in time series. Can do.

次に、本発明の実施の形態について図面を参照して説明する。 Next, embodiments of the present invention will be described with reference to the drawings.

図１は、本発明の自然言語処理装置を搭載した携帯端末の一実施形態を示すブロック図である。 FIG. 1 is a block diagram showing an embodiment of a portable terminal equipped with the natural language processing apparatus of the present invention.

図１に示す本実施の形態は、携帯電話機やＰＨＳ（ＰｅｒｓｏｎａｌＨａｎｄｙｐｈｏｎｅＳｙｓｔｅｍ：パーソナルハンディホンシステム）端末、或いは、ＰＤＡ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＡｓｓｉｓｔａｎｃｅ：携帯情報端末）などで構成される携帯端末９に、自然言語処理装置を搭載した場合の実施形態を示すものである。携帯端末９は、図示しない電子メール送受信装置を搭載しているものとする。 The present embodiment shown in FIG. 1 is applied to a mobile terminal 9 including a mobile phone, a PHS (Personal Handyphone System) terminal, or a PDA (Personal Digital Assistance). 1 shows an embodiment in which a processing apparatus is mounted. The portable terminal 9 is assumed to be equipped with an electronic mail transmission / reception device (not shown).

図１において、携帯端末９は、キーボードやキー入力装置等の入力装置１と、ディスプレイ装置や印刷装置等の出力装置２と、受信した電子メールから相手情報の抽出処理を行う相手情報抽出処理装置３と、受信した電子メールを蓄積・保存する受信電子メールデータベース４と、相手情報抽出処理装置３が抽出した相手情報が登録される相手情報データベース５と、から構成されている。 In FIG. 1, a portable terminal 9 includes an input device 1 such as a keyboard and a key input device, an output device 2 such as a display device and a printing device, and a partner information extraction processing device that performs processing for extracting partner information from received electronic mail. 3, a received e-mail database 4 that stores and stores received e-mails, and a partner information database 5 in which partner information extracted by the partner information extraction processing device 3 is registered.

ここで、相手情報とは、電子メールの送信者、つまり相手の属性や状況を示す情報である。そして、相手の属性とは、相手の住所、氏名、年齢、生年月日、職業、電話番号、電子メールアドレスなどの特徴・性質情報であり、相手の状況とは、相手が旅行に行ったとか、うれしいことがあったなど、受信電子メールから知ることができる相手の近況などに関するイベント情報である。 Here, the partner information is information indicating the attribute and status of the sender of the electronic mail, that is, the partner. The other party's attributes are information on characteristics / characteristics such as the other party's address, name, age, date of birth, occupation, telephone number, email address, etc. The situation of the other party means that the other party has traveled This is event information about the recent situation of the other party that can be known from the received e-mail.

相手情報抽出処理装置３は、相手情報抽出処理装置３全体の動作制御を行う制御部３１と、制御部３１の制御により受信電子メールの形態素解析を行う形態素解析部３２と、制御部３１の制御により形態素解析部３２の解析結果を元に相手情報を解析し抽出する関係解析部３３と、形態素解析部３２及び関係解析部３３が解析を行う際に参照する形態素解析／関係解析用辞書３４と、制御部３１の制御プログラムを記憶・格納する記憶部３５と、から構成されている。 The partner information extraction processing device 3 includes a control unit 31 that controls the operation of the partner information extraction processing device 3 as a whole, a morpheme analysis unit 32 that performs morphological analysis of received e-mails under the control of the control unit 31, and control of the control unit 31. Based on the analysis result of the morpheme analysis unit 32, the relationship analysis unit 33 that analyzes and extracts the partner information, the morpheme analysis / relation analysis dictionary 34 that the morpheme analysis unit 32 and the relationship analysis unit 33 refer to when performing analysis, The storage unit 35 stores and stores the control program of the control unit 31.

制御部３１は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ：中央処理装置）を備え、記憶部３５に格納されている制御プログラムによって動作するよう構成されている。なお、形態素解析部３２及び関係解析部３３の処理も記憶部３５に格納されている制御プログラムによって動作するよう構成してもよい。 The control unit 31 includes a CPU (Central Processing Unit) and is configured to operate according to a control program stored in the storage unit 35. Note that the processes of the morphological analysis unit 32 and the relationship analysis unit 33 may be configured to operate according to a control program stored in the storage unit 35.

制御部３１は、受信電子メールデータベース４と相手情報データベース５のアクセス制御、形態素解析部３２及び関係解析部３３の動作制御、入力装置１からの入力情報制御、出力装置２への出力情報制御、を行う。 The control unit 31 controls access to the received email database 4 and the partner information database 5, operation control of the morphological analysis unit 32 and the relationship analysis unit 33, input information control from the input device 1, output information control to the output device 2, I do.

形態素解析部３２は、形態素解析／関係解析用辞書３４を使用して、受信電子メールに対して形態素解析する処理を行う。形態素解析処理とは、受信電子メールに含まれる文章を単語（形態素）単位に分割する処理である。 The morpheme analysis unit 32 uses the morpheme analysis / relationship analysis dictionary 34 to perform morpheme analysis on the received e-mail. The morpheme analysis process is a process of dividing a sentence included in the received electronic mail into words (morphemes).

関係解析部３３は、形態素解析部３２の解析結果と形態素解析／関係解析用辞書３４を使用して、相手に関する属性や近況などの状況、すなわち、相手情報を抽出する処理を行う。関係解析部３３で抽出された相手情報は、制御部３１を介して相手の属性や状況などのカテゴリに分類され相手情報データベース５に登録される。 The relationship analysis unit 33 uses the analysis result of the morpheme analysis unit 32 and the morpheme analysis / relationship analysis dictionary 34 to perform processing for extracting attributes, status, and other information related to the partner, that is, partner information. The partner information extracted by the relationship analysis unit 33 is classified into a category such as the attribute and status of the partner via the control unit 31 and registered in the partner information database 5.

形態素解析／関係解析用辞書３４は、形態素解析部３２及び関係解析部３３が解析処理を行う際に必要となる単語及び関係解析用ルールを記憶・格納している辞書である。解析処理を行う際に必要となる単語とは、例えば、名詞、動詞、助詞、助動詞、など文章を構成する品詞であり、具体的には、「私」（代名詞）、「は」（副助詞）、「大学生」（名詞）、「に」（格助詞）、「なりました」（本実施形態における簡略化された単語）などの単語である。なお、「なりました」は、通常であれば、「なり」（「なる」：自動詞）、「まし」（「ます」：助動詞・特殊型）、「た」（助動詞・特殊型）と、更に細分化される単語であるが、本実施形態においては単語の簡略化を図るため、「なりました」を一つの単語として使用するようにしている。このことにより、形態素解析／関係解析用辞書３４への登録単語数が少なくなり、辞書として必要となるメモリ容量の削減を図るものである。 The morpheme analysis / relationship analysis dictionary 34 is a dictionary that stores and stores words and relationship analysis rules that are required when the morpheme analysis unit 32 and the relationship analysis unit 33 perform analysis processing. Words necessary for analysis processing are, for example, parts of speech that make up sentences such as nouns, verbs, particles, and auxiliary verbs. Specifically, “I” (pronoun), “ha” (adjunct particle) ), “University student” (noun), “ni” (case particle), “naru” (simplified word in this embodiment), and the like. In addition, “Naru” is usually “Nari” (“Naru”: intransitive verb), “Mashi” (“Masu”: auxiliary verb / special type), “Ta” (auxiliary verb / special type), Although the word is further subdivided, in this embodiment, “Nana” is used as one word in order to simplify the word. As a result, the number of registered words in the morphological analysis / relationship analysis dictionary 34 is reduced, thereby reducing the memory capacity required for the dictionary.

また、解析処理を行う際に必要となる関係解析用ルールとは、相手情報を抽出する際の基準を示すものであり、例えば、「ｘは大学生になりました」という文章からは、「ｘ＝大学生」という相手情報を抽出する、というルールである。これは、左辺のｘに当てはまる単語（行為者）が「ｘ＝大学生」という関係を持つことを表すものである。 Also, the relational analysis rules necessary for performing the analysis process indicate the criteria for extracting the partner information. For example, from the sentence “x has become a college student”, “x The rule is that the partner information “= University student” is extracted. This indicates that a word (actor) that applies to x on the left side has a relationship of “x = university student”.

関係解析用ルールは、また、抽出された相手情報を相手情報データベース５へ登録する際の基準をも示すものであり、例えば、「職業：大学生」というルールは、「大学生」を相手情報データベース５の「職業カテゴリ」として登録する、という基準を示すものである。また、「私：メール送信者」というルールは、行為者が「私」である場合には、相手情報データベース５に登録する、という基準を示すものである。 The relation analysis rule also indicates a standard for registering the extracted partner information in the partner information database 5. For example, the rule “profession: university student” indicates “college student” as the partner information database 5. This indicates the standard of registering as a “profession category”. Further, the rule “I: mail sender” indicates a criterion for registering in the partner information database 5 when the actor is “I”.

次に、図２を参照して、図１に示した本実施形態の動作について説明する。 Next, the operation of the present embodiment shown in FIG. 1 will be described with reference to FIG.

図２は、図１の実施形態の動作を説明するフローチャートである。 FIG. 2 is a flowchart for explaining the operation of the embodiment of FIG.

まず、電子メールの受信をきっかけとして、相手情報抽出処理装置３の制御部３１が、受信電子メールデータベース４から受信した電子メールの取り込みを行う（図２のステップＡ１）。取り込んだ電子メール（電子メールヘッダ情報と電子メール本文とから構成されている）のうち、電子メール本文は形態素解析部３２に渡し、メールアドレス、タイムスタンプなどの電子メールヘッダ情報は、相手情報データベース５への登録の際に使用するため、制御部３１で保持する。 First, triggered by the reception of the email, the control unit 31 of the partner information extraction processing device 3 takes in the email received from the received email database 4 (step A1 in FIG. 2). Of the captured e-mail (consisting of e-mail header information and e-mail body), the e-mail body is passed to the morpheme analyzer 32, and e-mail header information such as e-mail address and time stamp is stored in the partner information database. 5 is used by the control unit 31 to be used when registering to 5.

制御部３１は、形態素解析部３２を起動し、形態素解析部３２では、制御部３１から電子メール本文を受け取り、形態素解析／関係解析用辞書３４を使用して形態素解析処理を行う（ステップＡ２）。形態素解析処理とは、前述したように、受信電子メールに含まれる文章を単語（形態素）単位に分割する処理であるが、本実施形態においては、相手情報として登録したい単語が抽出されれば良いため、この後実行される関係解析部３３の解析処理に支障をきたさない程度の精度で形態素解析を行う。具体的には、前述したように、「なりました」を一つの単語として扱うようにするなど、である。形態素解析部３２で形態素解析された結果は、制御部３１に渡される。 The control unit 31 activates the morpheme analysis unit 32. The morpheme analysis unit 32 receives the email text from the control unit 31 and performs morpheme analysis processing using the morpheme analysis / relationship analysis dictionary 34 (step A2). . As described above, the morpheme analysis process is a process of dividing a sentence included in the received e-mail into words (morpheme) units. However, in the present embodiment, it is only necessary to extract a word to be registered as partner information. Therefore, the morphological analysis is performed with an accuracy that does not hinder the analysis processing of the relationship analysis unit 33 to be executed thereafter. Specifically, as described above, “Naru” is handled as one word. The result of the morphological analysis performed by the morphological analysis unit 32 is passed to the control unit 31.

形態素解析された結果を渡された制御部３１は、次に、関係解析部３３を起動し、形態素解析された結果を関係解析部３３に渡す。関係解析部３３では、形態素解析された結果を制御部３１から受け取り、形態素解析／関係解析用辞書３４を使用して関係解析処理を行い、相手情報の抽出処理を行う（ステップＡ３）。関係解析処理とは、前述したように、相手に関する属性や近況などの状況を抽出する処理であり、形態素解析／関係解析用辞書３４には、相手情報を抽出する上で最低限必要となる単語と、関係解析用ルールとが格納されているため、関係解析部３３は、形態素解析された結果と形態素解析／関係解析用辞書３４を使用して相手情報を抽出することができる。関係解析部３３で関係解析された結果は、制御部３１に渡される。 The control unit 31 to which the result of the morphological analysis is passed next activates the relationship analysis unit 33 and passes the result of the morphological analysis to the relationship analysis unit 33. The relationship analysis unit 33 receives the result of the morphological analysis from the control unit 31, performs the relationship analysis processing using the morphological analysis / relation analysis dictionary 34, and performs the partner information extraction processing (step A3). As described above, the relationship analysis process is a process for extracting attributes and status of the other party, and the morphological analysis / relationship analysis dictionary 34 has a minimum word required for extracting the partner information. And the relation analysis rules are stored, the relation analysis unit 33 can extract the partner information using the result of the morphological analysis and the morphological analysis / relation analysis dictionary 34. The result of the relationship analysis performed by the relationship analysis unit 33 is passed to the control unit 31.

制御部３１は、関係解析された結果すなわち得られた相手情報を、メールアドレス、タイムスタンプ等の情報を付与して、相手情報データベース５に登録する（ステップＡ４）。相手情報データベース５へは、相手情報を属性や状況などのカテゴリに分類し、かつ、タイムスタンプを利用して相手情報を時系列的に並べ、データベース内の一レコードとして追加して登録する。 The control unit 31 registers the result of the relationship analysis, that is, the obtained partner information in the partner information database 5 with information such as a mail address and a time stamp added (step A4). In the partner information database 5, the partner information is classified into categories such as attributes and situations, and the partner information is arranged in time series using a time stamp and added and registered as one record in the database.

以上の処理の後、入力装置１を操作して、相手情報データベース５をアクセスすることにより、制御部３１は相手情報データベース５から相手情報を検索して出力装置２に出力する（ステップＡ５）。このことにより、相手情報をまとまった形で閲覧することが可能となる。 After the above processing, by operating the input device 1 and accessing the partner information database 5, the control unit 31 searches the partner information database 5 for partner information and outputs it to the output device 2 (step A5). This makes it possible to browse the partner information in a collective form.

次に、具体例を用いて、図１の実施形態の動作について再度説明する。 Next, the operation of the embodiment of FIG. 1 will be described again using a specific example.

例えば、受信電子メールデータベース４に、「私は大学生になりました！」という電子メール本文を含む電子メールがあるものとする。また、形態素解析／関係解析用辞書３４に、「ｘは大学生になりました」という文章から「ｘ＝大学生」という相手情報を抽出するというルールと、「職業：大学生」（「大学生」を相手情報データベース５の「職業カテゴリ」として登録する）というルールと、「私：メール送信者」（行為者が「私」である場合には相手情報データベース５に登録する）というルールと、が記憶・格納されているものとする。 For example, it is assumed that the received e-mail database 4 has an e-mail including an e-mail text “I am a college student!”. Also, in the dictionary 34 for morphological analysis / relationship analysis, the rule that the partner information “x = university student” is extracted from the sentence “x has become a university student” and “profession: university student” (“university student” The rule “Register as“ profession category ”in the information database 5” and the rule “I: Email sender” (register in the partner information database 5 if the actor is “I”) are stored Assume that it is stored.

まず、相手情報抽出処理装置３の制御部３１が電子メールの取り込みを行う（図２のステップＡ１）。その電子メールに対して形態素解析処理を行う（ステップＡ２）。電子メール本文が「私は大学生になりました！」の場合は、「私／は／大学生／に／なりました／！」のように単語（形態素）単位に分割される。なお、本実施形態においては、前述したように、「なりました」を一単語として扱うようにしている。 First, the control unit 31 of the partner information extraction processing device 3 takes in an e-mail (step A1 in FIG. 2). A morphological analysis process is performed on the electronic mail (step A2). If the e-mail text is "I am a college student!", It is divided into word (morpheme) units such as "I / ha / college student / become /! /!". In the present embodiment, as described above, “Naru” is handled as one word.

この形態素解析の結果を使用して、関係解析処理を行う（ステップＡ３）。「私／は／大学生／に／なりました／！」という形態素解析結果を受けて、まず、「ｘは大学生になりました」という文章から「ｘ＝大学生」という相手情報を抽出するというルールにより、「私＝大学生」という情報が抽出される。また、「私：メール送信者」というルールにより、「私＝大学生」という情報が相手情報データベース５に登録される対象情報であることがわかる。さらに、「職業：大学生」というルールにより、「大学生」は相手情報データベース５の「職業カテゴリ」として登録されるものであることがわかる。 Using the result of this morphological analysis, a relational analysis process is performed (step A3). Based on the morphological analysis result of “I / ha / university student / ni / now /!”, First, the rule that “x = university student” is extracted from the sentence “x became a college student” Thus, the information “I = University student” is extracted. Further, the rule “I: mail sender” indicates that the information “I = university student” is the target information registered in the partner information database 5. Furthermore, it can be understood that the “college student” is registered as the “profession category” in the partner information database 5 by the rule of “occupation: university student”.

その結果、相手情報データベース５には、メール送信者は大学生（職業カテゴリ）である、という相手情報がタイムスタンプを付して登録される（ステップＡ４）。 As a result, the partner information that the mail sender is a college student (profession category) is registered in the partner information database 5 with a time stamp (step A4).

以上の処理の後、入力装置１を操作して、相手情報データベース５をアクセスすることにより、今回のメール送信者は大学生である、という情報を出力装置２を通じて閲覧することが可能となる（ステップＡ５）。 After the above processing, by operating the input device 1 and accessing the partner information database 5, it is possible to browse through the output device 2 information that the current mail sender is a college student (step). A5).

以上説明したように、本実施形態によれば、受信電子メールから相手情報を自動的に抽出し、相手情報を相手情報データベース５に蓄積できるので、膨大な量の過去の電子メールを保存およびその検索をすることなしに、相手情報を容易にまとまった形で閲覧することが可能となるという効果を有すると共に、ユーザが手動で受信電子メールから相手情報を抽出する作業が必要なくなり、ユーザの手間を削減可能となり、ユーザの利便性が向上する、という効果を有している。 As described above, according to the present embodiment, the partner information can be automatically extracted from the received electronic mail, and the partner information can be stored in the partner information database 5. There is an effect that it is possible to easily view the partner information in a collective form without performing a search, and the user does not need to manually extract the partner information from the received e-mail. Can be reduced, and the convenience of the user is improved.

また、本実施形態において、形態素解析処理に関係解析処理を追加しており、関係解析処理として必要となる程度の形態素解析処理にとどめることにより、そして、形態素解析／関係解析用辞書３４には、関係解析処理に必要な最低限の単語とそのルールだけを登録することにより、ハードウェアリソースに制限のある携帯端末などの小型装置であっても、自然言語処理装置を容易に搭載可能となると共に、自然言語処理の速度が向上する、という効果を奏することができる。 Further, in the present embodiment, a relational analysis process is added to the morphological analysis process, and the morphological analysis / relationship analysis dictionary 34 includes the morpheme analysis process to the extent necessary for the relational analysis process. By registering only the minimum words and rules necessary for relation analysis processing, it is possible to easily mount a natural language processing device even in a small device such as a portable terminal with limited hardware resources. The natural language processing speed can be improved.

さらに、本実施形態において、電子メールが持つタイムスタンプを利用して、相手情報を時系列的に相手情報データベース５に登録することにより、相手情報を時系列的に閲覧可能となる、という効果を奏することができる。 Further, in the present embodiment, by registering the partner information in the partner information database 5 in time series using the time stamp of the e-mail, the partner information can be browsed in time series. Can play.

次に、図３、図４を参照して、自然言語処理装置を搭載した携帯端末の第２の実施形態について説明する。 Next, with reference to FIG. 3 and FIG. 4, a second embodiment of the mobile terminal equipped with the natural language processing device will be described.

図３は、本発明の自然言語処理装置を搭載した携帯端末の第２の実施形態を示すブロック図である。 FIG. 3 is a block diagram showing a second embodiment of a portable terminal equipped with the natural language processing apparatus of the present invention.

図３に示す第２の実施形態は、図１に示した第１の実施形態における相手情報データベース５に代えて、相手情報履歴処理装置５０を有する点でのみ第１の実施形態と異なるものである。従って、図３において図１に示す構成要素に対応するものは同一の参照数字または符号を付し、その説明を省略するものとする。 The second embodiment shown in FIG. 3 differs from the first embodiment only in that it has a partner information history processing device 50 instead of the partner information database 5 in the first embodiment shown in FIG. is there. Therefore, in FIG. 3, the same reference numerals or symbols are assigned to the components corresponding to those shown in FIG. 1, and the description thereof is omitted.

図３において、制御部３１に接続される相手情報履歴処理装置５０は、相手情報を履歴として管理し保存する機能を備えており、相手情報履歴管理部５１と相手情報データベース５２とから構成されている。 In FIG. 3, the partner information history processing device 50 connected to the control unit 31 has a function of managing and storing partner information as a history, and includes a partner information history management unit 51 and a partner information database 52. Yes.

相手情報履歴管理部５１は、相手情報における履歴を管理し、同一カテゴリに属する情報はその最新情報だけを管理し、最新情報だけを相手情報データベース５２に登録する。相手情報データベース５２は、相手情報が登録されるデータベースである。 The partner information history management unit 51 manages the history of the partner information, manages only the latest information of information belonging to the same category, and registers only the latest information in the partner information database 52. The partner information database 52 is a database in which partner information is registered.

次に、図４を参照して、図３に示した第２の実施形態の動作について説明する。 Next, the operation of the second embodiment shown in FIG. 3 will be described with reference to FIG.

図４は、図３の第２の実施形態の動作を説明するフローチャートである。図４に示す動作は、図２で示した第１の実施形態の動作に、相手情報履歴管理部５１の動作が追加されている点でのみ、第１の実施形態の動作と異なるものである。従って、図４において図２に示す構成要素に対応するものは同一の参照数字または符号を付し、その詳細な説明を省略するものとする。 FIG. 4 is a flowchart for explaining the operation of the second embodiment of FIG. The operation shown in FIG. 4 is different from the operation of the first embodiment only in that the operation of the partner information history management unit 51 is added to the operation of the first embodiment shown in FIG. . Therefore, in FIG. 4, the same reference numerals or symbols are assigned to the components corresponding to those shown in FIG. 2, and the detailed description thereof is omitted.

図４において、電子メールの受信をきっかけとして、相手情報抽出処理装置３の制御部３１が、受信電子メールデータベース４から受信した電子メールの取り込みを行い（図４のステップＡ１）、形態素解析部３２では、制御部３１から電子メール本文を受け取り、形態素解析／関係解析用辞書３４を使用して形態素解析処理を行い（ステップＡ２）、関係解析部３３では、形態素解析された結果を制御部３１から受け取り、形態素解析／関係解析用辞書３４を使用して関係解析処理を行い、相手情報の抽出処理を行う（ステップＡ３）。そして、関係解析部３３で関係解析された結果は、制御部３１に渡される。ここまでの動作は、図２で示したステップＡ１〜ステップＡ３までの動作と同一である。 In FIG. 4, triggered by reception of the email, the control unit 31 of the partner information extraction processing device 3 takes in the email received from the received email database 4 (step A <b> 1 in FIG. 4), and the morphological analysis unit 32. Then, the electronic mail text is received from the control unit 31 and morpheme analysis processing is performed using the morphological analysis / relationship analysis dictionary 34 (step A2), and the relationship analysis unit 33 displays the result of the morpheme analysis from the control unit 31. Receiving, the morphological analysis / relationship analysis dictionary 34 is used to perform the relationship analysis process, and the partner information extraction process is performed (step A3). Then, the result of the relationship analysis performed by the relationship analysis unit 33 is passed to the control unit 31. The operation so far is the same as the operation from step A1 to step A3 shown in FIG.

次に、制御部３１は関係解析された結果としての相手情報を、相手情報履歴管理部５１に渡し、相手情報履歴管理部５１は相手情報の履歴処理を行う（ステップＢ１）。相手情報の履歴処理とは、関係解析された結果としての相手情報（最新相手情報と称することとする）を相手情報データベース５２に登録する際に、最新相手情報と同一カテゴリに属する既存情報が、相手情報データベース５２に登録されているかを確認し、もし既存情報が登録されていた場合には、該既存情報に代えて最新相手情報を登録するものである（ステップＡ４）。このことにより、相手情報として過去の相手情報に代えて最新相手情報だけを閲覧できるようになる。既存情報が登録されていなかった場合には、第１の実施形態の動作と同様に、最新相手情報を相手情報データベース５２に追加して登録する（ステップＡ４）。 Next, the control unit 31 passes the partner information as a result of the relationship analysis to the partner information history management unit 51, and the partner information history management unit 51 performs history processing of the partner information (step B1). The history processing of the partner information means that when the partner information (hereinafter referred to as the latest partner information) as a result of the relationship analysis is registered in the partner information database 52, existing information belonging to the same category as the latest partner information is It is confirmed whether it is registered in the partner information database 52. If the existing information is registered, the latest partner information is registered instead of the existing information (step A4). As a result, only the latest partner information can be viewed as partner information instead of the past partner information. If the existing information has not been registered, the latest partner information is added and registered in the partner information database 52 as in the operation of the first embodiment (step A4).

以上の処理の後、入力装置１を操作して、相手情報データベース５２をアクセスすることにより、制御部３１は相手情報データベース５２から相手情報を検索して出力装置２に出力する（ステップＡ５）。このことにより、同一カテゴリに属する相手情報は、常に最新の相手情報だけを閲覧することが可能となる。 After the above processing, by operating the input device 1 and accessing the partner information database 52, the control unit 31 searches the partner information database 52 for partner information and outputs it to the output device 2 (step A5). As a result, the partner information belonging to the same category can always browse only the latest partner information.

次に、具体例を用いて、第２の実施形態の動作について再度説明する。 Next, the operation of the second embodiment will be described again using a specific example.

例えば、相手情報データベース５２には、「職業カテゴリ」として「高校生」を有する既存の相手情報が登録されているものとする。 For example, it is assumed that existing partner information having “high school student” as “profession category” is registered in the partner information database 52.

そして、第１の実施形態の具体例で示したと同様に、受信電子メールデータベース４には、「私は大学生になりました！」という電子メール本文を含む電子メールがあるものとする。また、形態素解析／関係解析用辞書３４に、「ｘは大学生になりました」という文章から「ｘ＝大学生」という相手情報を抽出するというルールと、「職業：大学生」（「大学生」を相手情報データベース５の「職業カテゴリ」として登録する）というルールと、「私：メール送信者」（行為者が「私」である場合には相手情報データベース５に登録する）というルールと、が記憶・格納されているものとする。 As in the specific example of the first embodiment, it is assumed that the received e-mail database 4 includes an e-mail including an e-mail text “I am a college student!”. Also, in the dictionary 34 for morphological analysis / relationship analysis, the rule that the partner information “x = university student” is extracted from the sentence “x has become a university student” and “profession: university student” (“university student” The rule “Register as“ profession category ”in the information database 5” and the rule “I: Email sender” (register in the partner information database 5 if the actor is “I”) are stored Assume that it is stored.

図４のステップＡ１〜ステップＡ３の動作により、「私＝大学生」という情報が抽出される。また、「私：メール送信者」というルールにより、「私＝大学生」という情報が相手情報データベース５２に登録される対象情報であることがわかる。さらに、「職業：大学生」というルールにより、「大学生」は相手情報データベース５２の「職業カテゴリ」として登録されるものであることがわかる。 The information “I = university student” is extracted by the operations of step A1 to step A3 in FIG. In addition, the rule “I: mail sender” indicates that the information “I = university student” is the target information registered in the partner information database 52. Furthermore, it can be understood that the “college student” is registered as the “profession category” in the partner information database 52 by the rule of “profession: university student”.

この結果、相手情報履歴処理装置５０における相手情報履歴管理部５１では、相手情報データベース５２の「職業カテゴリ」に対して既存情報が登録されているかの確認処理を行う。このとき「職業カテゴリ」として「高校生」が登録されていることが判明するが、そのタイムスタンプにより、過去の相手情報であると判断する（図４のステップＢ１）。 As a result, the partner information history management unit 51 in the partner information history processing apparatus 50 performs confirmation processing to check whether existing information is registered for the “profession category” in the partner information database 52. At this time, it becomes clear that “high school student” is registered as the “profession category”, but based on the time stamp, it is determined that it is the past partner information (step B1 in FIG. 4).

このことにより、相手情報データベース５２の「職業カテゴリ」において、過去の相手情報である「高校生」を削除し、代わりに、最新相手情報であるところの「大学生」として登録しなおす（図４のステップＡ４）。 As a result, in the “profession category” of the partner information database 52, the “high school student” that is the past partner information is deleted, and is registered again as the “university student” that is the latest partner information (step in FIG. 4). A4).

以上の処理の後、入力装置１を操作して、相手情報データベース５２をアクセスすることにより、今回のメール送信者は、現在、高校生ではなく、大学生であるという情報のみを出力装置２を通じて閲覧することが可能となる（図４のステップＡ５）。 After the above processing, by operating the input device 1 and accessing the partner information database 52, the current mail sender browses only the information that he is not a high school student but a college student through the output device 2. (Step A5 in FIG. 4).

以上説明したように、本発明の第２の実施形態によれば、相手情報履歴処理装置５０によって相手情報の履歴処理を行っているため、相手情報を閲覧するに際し、最新の相手情報のみを閲覧することが可能となる、という効果を奏することができる。 As described above, according to the second embodiment of the present invention, since the partner information history processing device 50 performs the partner information history process, when browsing the partner information, only the latest partner information is browsed. The effect that it becomes possible can be produced.

本発明の自然言語処理装置を搭載した携帯端末の一実施形態を示すブロック図である。It is a block diagram which shows one Embodiment of the portable terminal carrying the natural language processing apparatus of this invention. 図１の実施形態の動作を説明するフローチャートである。It is a flowchart explaining operation | movement of embodiment of FIG. 本発明の自然言語処理装置を搭載した携帯端末の第２の実施形態を示すブロック図である。It is a block diagram which shows 2nd Embodiment of the portable terminal carrying the natural language processing apparatus of this invention. 図３の第２の実施形態の動作を説明するフローチャートである。It is a flowchart explaining the operation | movement of 2nd Embodiment of FIG.

Explanation of symbols

１入力装置
２出力装置
３相手情報抽出処理装置
３１制御部
３２形態素解析部
３３関係解析部
３４形態素解析／関係解析用辞書
３５記憶部
４受信電子メールデータベース
５相手情報データベース
５０相手情報履歴処理装置
５１相手情報履歴管理部
５２相手情報データベース
９携帯端末
DESCRIPTION OF SYMBOLS 1 Input device 2 Output device 3 Counterpart information extraction processing device 31 Control part 32 Morphological analysis part 33 Relational analysis part 34 Morphological analysis / relational analysis dictionary 35 Storage part 4 Received e-mail database 5 Counterpart information database 50 Counterpart information history processing apparatus 51 Partner information history management unit 52 Partner information database 9 Mobile terminal

Claims

A natural language processing device for extracting partner information including attributes and status of a partner with whom an email is exchanged from a received email and registering it in the partner information database,
The natural language processing device includes a morpheme analyzer that performs processing to divide a sentence included in a received email into words.
A relation analysis unit that performs processing to extract partner information based on the analysis result of the morphological analysis unit;
A morpheme that is stored as the partner information in the partner information database and stores the minimum necessary words and relation analysis rules and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing. A dictionary for analysis and relation analysis,
Control for causing the morpheme analysis unit to analyze the email body of the received email, control for causing the relationship analysis unit to perform relationship analysis based on the analysis result of the morpheme analysis unit, and execution of the partner information extraction process, and the relationship analysis A control unit for performing the control of adding the e-mail address and time stamp information of the e-mail header information of the received e-mail to the result of the extraction process of the unit and registering the information in the counterpart information database;
A natural language processing apparatus comprising:

The natural language processing apparatus further includes a partner information history management unit that manages partner information as a history, and when the control unit obtains a result of the extraction process of the relation analysis unit, the partner information history management unit To check if the partner information of the same category as the result of the extraction process has already been registered in the partner information database, and if the partner information of the same category has already been registered, the existing partner information is The natural language processing apparatus according to claim 1, wherein control is performed so that the result is registered in place of the result.

When the natural language processing device obtains the browsing request information of the partner information from the input device, it searches the partner information database, extracts the partner information that matches the browsing request information, and outputs it to the output device. The natural language processing apparatus according to claim 1, wherein the natural language processing apparatus is characterized.

In a natural language processing apparatus that extracts partner information consisting of attributes and status of a partner with whom an email is exchanged from a received email and registers it in the partner information database.
The natural language processing device includes a morpheme analyzer that performs processing to divide a sentence included in a received email into words.
A relation analysis unit that performs processing to extract partner information based on the analysis result of the morphological analysis unit;
A morpheme that is stored as the partner information in the partner information database and stores the minimum necessary words and relation analysis rules and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing. And a dictionary for analysis and relation analysis,
The natural language processing device causes the morphological analysis unit to analyze the email body of the received e-mail, and causes the relationship analysis unit to analyze the relationship from the analysis result of the morpheme analysis unit, and executes partner information extraction processing And a step of adding a mail address and time stamp information of the e-mail header information of the received e-mail to the result of the extraction process of the relation analysis unit and registering the result in the partner information database. A natural language processing method characterized.

The natural language processing device further includes a partner information history management unit that manages partner information as a history, and the partner information history is obtained when the natural language processing device obtains the extraction processing result of the relation analysis unit. The management unit confirms whether the partner information of the same category as the result of the extraction process has already been registered in the partner information database. If the partner information of the same category has already been registered, the existing partner information is extracted. The natural language processing method according to claim 4, wherein the registration is performed by replacing the result of the processing.

When the natural language processing device obtains the browsing request information of the partner information from the input device, the natural language processing device performs a step of searching the partner information database to extract partner information that matches the browsing request information and outputting it to the output device The natural language processing method according to claim 4, wherein the natural language processing method is any one of claims 4 and 5.

A morpheme analyzer that performs processing to divide a sentence included in the received e-mail into words;
A relational analysis unit that performs processing to extract partner information consisting of attributes and status of a partner who is exchanging e-mails based on the analysis result of the morphological analysis unit;
Morphological analysis that is stored in the partner information database as partner information and stores the minimum necessary words and relation analysis rules, and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing A relational analysis dictionary, and a computer of a natural language processing device that extracts partner information from received e-mail and registers it in the partner information database;
A process for causing the morphological analysis unit to analyze the body of the email of the received e-mail, a process for causing the relational analysis unit to perform a relational analysis based on an analysis result of the morpheme analysis part, and a process for executing the partner information extraction process And a process of registering in the partner information database with the mail address and time stamp information of the e-mail header information of the received e-mail added to the result of the part extraction process.

The natural language processing apparatus further includes a partner information history management unit that manages partner information as a history, and the program stores the partner information history when the computer obtains a result of extraction processing from the relation analysis unit. The management unit confirms whether the partner information of the same category as the result of the extraction process has already been registered in the partner information database. If the partner information of the same category has already been registered, the existing partner information is extracted. The program according to claim 7, wherein a program for performing a registration by replacing with a result of the processing is executed.

The program executes a process of searching the partner information database, extracting partner information that matches the browse request information, and outputting the partner information database to the output device when the partner information browse request information is obtained from the input device. The program according to any one of claims 7 and 8.

A mobile terminal equipped with a natural language processing device that extracts partner information including attributes and status of the partner with whom the email is exchanged from the received email and registers it in the partner information database,
The natural language processing device includes a morpheme analyzer that performs processing to divide a sentence included in a received email into words.
A relation analysis unit that performs processing to extract partner information based on the analysis result of the morphological analysis unit;
A morpheme that is stored as the partner information in the partner information database and stores the minimum necessary words and relation analysis rules and is referred to when the morpheme analysis unit and the relationship analysis unit perform analysis / extraction processing. A dictionary for analysis and relation analysis,
Control for causing the morpheme analysis unit to analyze the email body of the received email, control for causing the relationship analysis unit to perform relationship analysis based on the analysis result of the morpheme analysis unit, and execution of the partner information extraction process, and the relationship analysis A control unit for performing the control of adding the e-mail address and time stamp information of the e-mail header information of the received e-mail to the result of the extraction process of the unit and registering the information in the counterpart information database;
A portable terminal equipped with a natural language processing device.

The natural language processing apparatus further includes a partner information history management unit that manages partner information as a history, and when the control unit obtains a result of the extraction process of the relation analysis unit, the partner information history management unit To check if the partner information of the same category as the result of the extraction process has already been registered in the partner information database, and if the partner information of the same category has already been registered, the existing partner information is The mobile terminal equipped with the natural language processing device according to claim 10, wherein control is performed to register the result by replacing the result.

When the natural language processing device obtains the browsing request information of the partner information from the input device, it searches the partner information database, extracts the partner information that matches the browsing request information, and outputs it to the output device. A portable terminal equipped with the natural language processing device according to any one of claims 10 and 11.