JPH01166258A

JPH01166258A - Mechanical translation device

Info

Publication number: JPH01166258A
Application number: JP62326391A
Authority: JP
Inventors: Youji Fukumochi; 福持　陽士; Hitoshi Suzuki; 等鈴木; Hidezo Kugimiya; 釘宮　秀造; Ichiko Sada; いち子佐田; Noriyuki Hirai; 平井　徳行
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1987-12-23
Filing date: 1987-12-23
Publication date: 1989-06-30

Abstract

PURPOSE:To interpret with higher accuracy whether an original sentence is equal to a sentence or a title and to ensure correct mechanical translation by dividing the original sentence into morpheme strings and extracting corresponding translated words. CONSTITUTION:A read original sentence is divided into morpheme strings by a dictionary consulting morpheme analyzing par 11 via a dictionary of a memory 6. Grammar information and the translated word are obtained for each word and tense, person, numeral, etc., are analyzed. A syntax structure analyzing part 12 decides a syntax structure analyzing tree showing the active/passive relation between words according to grammar rules and by reference to the dictionary of the memory 6. A converting par 13 converts the structure of the syntax structure analyzing tree corresponding to the input original sentence into the structure of the syntax structure analyzing tree corresponding to the translated sentence based on the tree structure converting rule of the memory 6. Then a translated sentence producing part 14 adds proper postpositional words and auxiliary verbs for production of a translated sentence of a target language. The translated sentence is supplied to a main memory 2 and a CRT 3 respectively via a translation module 5.

Description

【発明の詳細な説明】〈産業上の利用分野〉この発明は、入力された単語列が文であるか表題である
かを判別して翻訳することができる機械翻訳装置に関す
る。DETAILED DESCRIPTION OF THE INVENTION <Industrial Application Field> The present invention relates to a machine translation device that can determine whether an input word string is a sentence or a title and translate it.

〈従来の技術〉従来、機械翻訳装置としては次のようなものがある。こ
の機械翻訳装置は、キーボードから入力されたソース言
語を、メインＣＰｔＪの制御により翻訳モジュールに入
力し、この入力されたソース言語を、翻訳モジュールに
よってメモリーに記憶されている辞書、文法規則および
本構造変換規則を利用して、ターゲット言語に翻訳する
ものである。上記文法規則は、入力された文章（以下、
原文という）の構文構造を決定する際に用いられる。<Prior Art> Conventionally, there are the following machine translation devices. This machine translation device inputs the source language input from the keyboard into the translation module under the control of the main CPtJ, and the translation module transfers the input source language to the dictionary, grammar rules and book structure stored in the memory. It uses conversion rules to translate into the target language. The above grammar rules apply to the input sentence (hereinafter,
It is used to determine the syntactic structure of the original text.

〈発明が解決しようとする問題点〉しかしながら、上記従来の機械翻訳装置で翻訳する場合
は、メモリーに記憶されている文法規則に従って原文の
構文構造を決定するようにしており、また、処理中の原
文が文であるか表題（概ね、名詞句）であるかを判別す
ることができないので、入力された原文が表題であるに
もかかわらず、上記メモリーに記憶されている通常文の
文法規則が適用されて原文の解釈を間違う場合が多いと
いう問題がある。また、表題を表わす単語列中の各単語
の先頭は通常大文字であるため、辞書引きの際に表題中
の各単語と辞書中の登録語がマツチせず、上記単語は固
有名詞の扱いを受けたり、未知語と判定されて、当該単
語の品詞の同定が難しく、誤解析を生じやすいという問
題がある。<Problems to be Solved by the Invention> However, when translating with the above-mentioned conventional machine translation device, the syntactic structure of the original text is determined according to the grammatical rules stored in memory, and the Since it is not possible to determine whether the original text is a sentence or a title (generally a noun phrase), even though the input original text is a title, the grammar rules for normal sentences stored in the memory are The problem is that when applied, the original text is often misinterpreted. Additionally, since the first letter of each word in the word string representing the title is usually capitalized, when looking up a dictionary, each word in the title is not matched with a registered word in the dictionary, and the above words are treated as proper nouns. There is a problem in that the word is determined to be an unknown word, making it difficult to identify the part of speech of the word, and erroneous analysis is likely to occur.

そこで、この発明の目的は、原文が文であるか表題であ
るかを判別し、その判別結果に基づいて°原文の解釈を
より正しく行うことを可能にした翻訳装置を提供するこ
とにある。SUMMARY OF THE INVENTION Therefore, an object of the present invention is to provide a translation device that can determine whether an original text is a sentence or a title, and can more accurately interpret the original text based on the result of the determination.

〈問題点を解決するための手段〉上記目的を達成するため、この発明は、入力された文章
を辞書を用いて形態素に分割して品詞等の情報を得る形
態素解析部と、上記形態素解析部によって分割された形
態素列の構文を辞書と文法規則を用いて解析する構文解
析部と、上記構文解析部で得られた構文の構造を目標言
語における構文の構造に変換する変換部と、上記変換部
によって得られた目標言語の構文の構造に従って翻訳文
を生成する翻訳文生成部を有する機械翻訳装置において
、入力された形態素の先頭が大文字か否かを判別する大
文字判別手段と、上記大文字判別手段によって先頭が大
文字であると判別された形態素を記憶するメモリー手段
と、上記メモリー手段に記憶された形態素に続く形態素
が終端の形態素であり、かつ、ピリオドまたはクエスチ
ョン・マークでなければ、上記メモリー手段に記憶され
た形態素は表題であることを特定する表題特定手段と、
上記表題特定手段が上記メモリー手段に記憶された形態
素は表題であることを特定すると、上記メモリー手段に
記憶された形態素の先頭の大文字を小文字に変換する大
文字／小文字変換手段を備えて、表題に応じた翻訳をな
し得るようにしたことを特徴としている。<Means for Solving the Problems> In order to achieve the above object, the present invention includes a morphological analysis unit that divides an input sentence into morphemes using a dictionary and obtains information such as parts of speech, and the morphological analysis unit described above. a syntactic analysis unit that analyzes the syntax of the morpheme sequence divided by using a dictionary and grammar rules, a conversion unit that converts the syntactic structure obtained by the syntactic analysis unit into a syntactic structure in the target language, and the conversion unit A machine translation device having a translated sentence generation section that generates a translated sentence according to the syntactic structure of the target language obtained by the section, a capital letter discrimination means for determining whether or not the beginning of an input morpheme is a capital letter; a memory means for storing a morpheme determined by the means to have a capital letter at the beginning; and a memory means for storing a morpheme whose first character is a capital letter; title identifying means for identifying that the morpheme stored in the means is a title;
When the title specifying means specifies that the morpheme stored in the memory means is a title, the morpheme stored in the memory means is provided with an upper case/lower case converting means for converting the first capital letter of the morpheme into a lower case letter, and the morpheme is changed to a title. It is characterized by being able to perform translations according to the user's needs.

〈作用〉入力された文章が形態素解析部によって形態素に分割さ
れ、この形態素の先頭が大文字か否かが大文字判別手段
によって判別される。その結果、先頭が大文字であると
判別された形態素は全てメモリー手段に記憶される。そ
して、上記メモリー手段に記憶された形態素に続く形態
素が終端の形ｒｓＴ：であり、かつ、ピリオドまたはク
エスチョン・マークでなければ、表題特定手段によって
上記メモリー手段に記憶された形態素は表題であると特
定される。そうすると、上記メモリー手段に記憶された
形ＢＴ：の先頭の大文字が、大文字／小文字変換手段に
よって小文字に変換される。そして、その後、辞書を用
いて上記小文字に変換された形Ｂ素の品詞等の情報が得
られる。<Operation> The input sentence is divided into morphemes by the morpheme analysis section, and the capitalization determining means determines whether the beginning of the morpheme is a capital letter. As a result, all morphemes determined to have a capital letter at the beginning are stored in the memory means. If the morpheme following the morpheme stored in the memory means has the final form rsT: and is not a period or a question mark, the title specifying means determines that the morpheme stored in the memory means is a title. be identified. Then, the first capital letter of the form BT: stored in the memory means is converted to a lower case letter by the case/lower case conversion means. Thereafter, information such as the part of speech of the form B element converted to lowercase letters is obtained using a dictionary.

したがって、入力された文章が表題であるか否かを判別
して、表題であるときは表題に応じた翻訳を行うことが
可能である。Therefore, it is possible to determine whether or not the input sentence is a title, and if it is a title, to perform translation according to the title.

〈実施例〉以下、この発明を図示の実施例により詳細に説明する。<Example> Hereinafter, the present invention will be explained in detail with reference to illustrated embodiments.

第１図はこの発明の機械翻訳装置における一実施例のブ
ロック図である。ｌはメインＣＰＵ（中央処理装置）、
２はメインメモリー、３はＣＲＴ（カソード・レイ・デ
ユープ）、４はキーボード、５は翻訳モジュール、６は
翻訳用の辞書１文法規則および木構造変換規則等を格納
しているメモリーである。FIG. 1 is a block diagram of an embodiment of a machine translation apparatus according to the present invention. l is the main CPU (central processing unit),
2 is a main memory, 3 is a CRT (Cathode Ray Duplex), 4 is a keyboard, 5 is a translation module, and 6 is a memory that stores a dictionary 1 for translation, grammatical rules, tree structure conversion rules, etc.

上記翻訳モジュール５は、ソース言語が入力されると、
それを翻訳してターゲット言語を出力するものである。When the source language is input, the translation module 5
It translates it and outputs the target language.

すなわち、キーボード４から入力されたソース言語はメ
インＣＰＵＩの制御により翻訳モジュール５に送られる
。翻訳モジュール５はメモリー６に記憶されている辞書
９文法規問および木構造変換規則等を用いて、後に詳述
するようにして入力されたソース言語をターゲット言語
に翻訳する。その結果は、メインメモリー２に一旦記憶
されるとともに、ＣＲＴ３に表示される。That is, the source language input from the keyboard 4 is sent to the translation module 5 under the control of the main CPUI. The translation module 5 uses the dictionary 9 grammar questions and tree structure conversion rules stored in the memory 6 to translate the input source language into the target language as will be detailed later. The results are temporarily stored in the main memory 2 and displayed on the CRT 3.

本実施例の翻訳モジュール５の具体的な構成を第２図に
示す。FIG. 2 shows a specific configuration of the translation module 5 of this embodiment.

以下、第１図、第２図を用いて翻訳モジュールの動作を
説明する。The operation of the translation module will be explained below using FIGS. 1 and 2.

まず、読み込まれた原文は辞書引き形態素解析部ＩＩに
より、メモリー６の辞書を用いて、各形態素列（単語列
）に分割されてこの各単語に対する品詞等の文法情報お
よび訳語を得、さらに、時制・人称・数等が解析される
。構文解析部１２ではメモリー６の辞書と文法規則にし
たがって、各単語間の係り受は関係を示す構造解析木が
後述するようにして第５図に示すように決定される。First, the read original text is divided into each morpheme string (word string) by the dictionary reference morphological analysis unit II using the dictionary in the memory 6, and grammatical information such as part of speech and translation for each word are obtained, and further, Tense, person, number, etc. are analyzed. In the syntactic analysis unit 12, in accordance with the dictionary in the memory 6 and the grammatical rules, the dependencies between each word are determined as shown in FIG. 5 by creating a structural analysis tree indicating the relationship as will be described later.

変換部Ｉ３では、メモリー６の木構造変換規則を用いて
、入力された原文に対する構文解析木の構造が翻訳文に
対する構文解析木の構造に変換される。そして、翻訳文
生成部１４でターゲット言語にゼける翻訳文として適切
な助詞や助動詞をつけて、得られた翻訳文は翻訳モジュ
ール５から出力され、メインメモリー２に格納されると
ともに、ＣＲＴ３に表示される。The conversion unit I3 uses the tree structure conversion rules in the memory 6 to convert the structure of the parse tree for the input original text into the structure of the parse tree for the translated text. Then, the translated sentence generation unit 14 adds particles and auxiliary verbs suitable for the translated sentence in the target language, and the obtained translated sentence is output from the translation module 5, stored in the main memory 2, and displayed on the CRT 3. be done.

次に、上記翻訳モジュール５の辞書引き形態素解析部１
１の動作において、入力される文章が表題か否かを判別
する表題判別処理について詳細に説明する。ここで、上
記メモリー６には第１表に示す文法規則のみが格納され
ているものとする。Next, the dictionary lookup morphological analysis unit 1 of the translation module 5
In operation 1, title determination processing for determining whether an input sentence is a title will be described in detail. Here, it is assumed that the memory 6 stores only the grammar rules shown in Table 1.

第１表上記第１表の文法規則は次のことを表わしている。Table 1 The grammar rules in Table 1 above represent the following:

すなわち、文法規則（ａ）は、名詞句は名詞と名詞から
できていることを表わす。文法規則（ｂ）は、名詞句は
名詞と名詞と名詞からできていることを表わす。文法規
則（Ｃ）は、動詞句は動詞からできていることを表わす
。文法規則（ｄ）は、平叙文は名詞句と動詞句からでき
ていることを表わす。文法規則（ｅ）は、文は平叙文と
文末からできていることを表わす。文法規則（「）は、
文は名詞句と文末からできていることを表わす。That is, grammar rule (a) indicates that a noun phrase is made up of nouns and nouns. Grammar rule (b) indicates that a noun phrase is made up of nouns, nouns, and nouns. Grammar rule (C) indicates that verb phrases are made up of verbs. Grammar rule (d) indicates that a declarative sentence is made up of a noun phrase and a verb phrase. Grammar rule (e) indicates that a sentence is made up of a declarative sentence and a sentence ending. The grammar rule (``) is
Indicates that a sentence is made up of a noun phrase and the end of the sentence.

第３図は、本実施例による表題判別処理のフローチャー
トである。以下、第３図にしたがって表題判別処理の動
作を説明する。FIG. 3 is a flowchart of title discrimination processing according to this embodiment. The operation of the title discrimination process will be explained below with reference to FIG.

ステップＳ１で、ポインタが先頭の形態素にセットされ
る。In step S1, a pointer is set to the first morpheme.

ステップＳ２で、上記ステップＳ１でポインタがセット
された形態素（単語）の先頭が大文字か否かが判別され
る。その結果、大文字であればステップＳ３に進み、そ
うでなければステップＳＩＯに進む。In step S2, it is determined whether the beginning of the morpheme (word) to which the pointer was set in step S1 is a capital letter. As a result, if the character is an uppercase character, the process proceeds to step S3; otherwise, the process proceeds to step SIO.

ステップＳ３で、先頭が大文字の単語がバッファにスト
アされる。In step S3, words with initial capital letters are stored in the buffer.

ステップＳ４で、ポインタが次の形ｐ４素にセラ・トさ
れる。In step S4, the pointer is set to the next p4 element.

ステップＳ５で、ポインタがセットされている形態素は
終端を表わす形態素が否かが判別される。In step S5, it is determined whether the morpheme to which the pointer is set is a morpheme representing the end.

その結集、終端の形態素であればステップＳ６に進みそ
うでなければステップＳ２に戻る。If the morpheme is the final morpheme, the process advances to step S6, otherwise the process returns to step S2.

ステップＳ６で、上記ステップＳ５で終端であると判別
された形態素は、ピリオドまたはクエスチョン・マーク
であるか否かが判別される。その結果、ピリオドまたは
クエスチョン・マークであればステップＳＩＯに進み、
そうでなければステップＳ７に進む。In step S6, it is determined whether the morpheme determined to be the terminal in step S5 is a period or a question mark. If the result is a period or question mark, proceed to step SIO,
Otherwise, the process proceeds to step S7.

ステップＳ７で、上記ステップＳ５．ステップＳ６でピ
リオドまたはクエスチョン・マークでなはないと判別さ
れた終端の形態素がバッファにストアされる。In step S7, step S5. The terminal morpheme determined to be not a period or a question mark in step S6 is stored in the buffer.

ステップＳ８で、上記バッファにストアされた形態素は
表題であることが特定されて、表題フラグがセットされ
る。In step S8, the morpheme stored in the buffer is identified as a title, and a title flag is set.

ステップＳ９で、上記バッファにストアされた各単語の
先頭の大文字が小文字に変換される。In step S9, the first uppercase letter of each word stored in the buffer is converted to lowercase letter.

ステップＳＩＯで、先頭が小文字の単語に従って辞書引
きが行なわれて品詞等の情報が得られる。In step SIO, a dictionary lookup is performed according to words whose first letters are lowercase letters to obtain information such as part of speech.

ステップＳｔｔで、処理が終了か否かが判別され、その
結果、終了であれば表題特定動作を終了し、そうでなけ
ればステップＳ１２に進む。In step Stt, it is determined whether or not the process is finished. If the process is finished, the title specifying operation is finished, and if not, the process proceeds to step S12.

ステップＳ１２で、ポインタが次の形痙素にセットされ
る。In step S12, the pointer is set to the next shape spasticity.

上述の表題特定処理に基づく表題翻訳処理動作を具体的
な入力文によりさらに詳細に説明する。The title translation processing operation based on the above-mentioned title identification processing will be explained in more detail using a specific input sentence.

ここで、入力文として、（ａ）　　ｒ　Ｓ　ｕｒｆａｃｅ　　Ｔ　ｅｍｐｅｒａ
ｔｕｒｃｓ　　Ｒｉｓｅ　　Ｊ（ｂ）　　ｒ　Ｓ　ｕｒ
ｒａｃｅ　　ｔｅｍｐｅｒａｔｕｒｅｓ　　ｒｉｓｅ、
　Ｊが入力されたものと仮定する。Here, as an input sentence, (a) r Surface Tempera
turcs Rise J(b) r S ur
race temperatures rise,
Assume that J is input.

入力文（ａ）が入力されると、各単語の先頭が大文字で
あるので、第３図のフローチャートのステップＳ２．Ｓ
３．Ｓ４．Ｓ５によって各単語ｒ　Ｓ　ｕｒｆａｃｅ上
ｒ　Ｔ　ｅｍｐｅｒａｔｕｒｅｓ　ＪおよびｒＲｉｓｅ
Ｊがバッファにストアされる。When the input sentence (a) is input, since the beginning of each word is a capital letter, step S2. of the flowchart of FIG. S
3. S4. By S5 each word r S surface r T emperatures J and r Rise
J is stored in the buffer.

、　次に、’ｒＲｉｓｅＪの後にスペースが続くので、
ステップＳ５．Ｓ６によって、すでにバッファにストア
されている単語列は表題であると認識される。, then 'rRiseJ is followed by a space, so
Step S5. At S6, the word string already stored in the buffer is recognized as a title.

そして、ステップＳ８で表題フラグがセットされ、ステ
ップＳ９で上記バッファにストアされた各単語の先頭の
大文字Ｓ、Ｔ、ｒＬが小文字に変換される。その後、ス
テップＳ１０で先頭が小文字に変換された単語を用いて
辞書引きが実行される。その結果、得られた品詞情報が
、第４図に示すように翻訳モジュール５内の辞書引き結
果バッファにストアされる。その際に、辞書引き形態素
解析部１１の処理を容易にするために、入力文（ａ）に
存在しない文末をバッファに加える。Then, in step S8, a title flag is set, and in step S9, the first capital letters S, T, and rL of each word stored in the buffer are converted to lowercase letters. Thereafter, in step S10, a dictionary lookup is performed using the word whose first letter is converted to lower case. As a result, the obtained part-of-speech information is stored in the dictionary lookup result buffer within the translation module 5, as shown in FIG. At this time, in order to facilitate the processing of the dictionary lookup morphological analysis unit 11, a sentence ending that does not exist in the input sentence (a) is added to the buffer.

次に、構文解析部！２は、第４図に示す辞書引き結果バ
ッファの内容と、第１表に示す文法規則の（ａ）、　（
ｃ）、　（ｄ）および（ｅ）とによって第５図（ａ）に
示すように構文解析を行う。そしてさらに、横文解析部
１２は第３図のステップＳ８でセットされた表題フラグ
をチエツクする。そうすると、表題フラグはセットされ
ているので入力文（ａ）は表題であると判別し、横文解
析部１２は第５図ｒａ）に示すように動詞句を用いて解
析した横文解析を失敗させる。その結果、構文解析部１
２ではバックトラックが機能し、第４図に示す辞書引き
結果バッファにストアされたｒｒｉｓｅＪの品詞を２番
目の品詞「名詞」に、また、文法規則を第１表に示す文
法規則の（ｂ）および（ｒ）に変更することによって、
第５図（ｂ）に示すように構文解析を行う。そうすると
、出来上がった構文の構造は名詞句であるので構文解析
に成功し、第５図（ｂ）のソース言語の構文解析木が決
定される。以後、変換部１３．翻訳文生成部１４へ処理
が移り、入力文（ａ）の翻訳結果が第６図（ａ）に示す
ように出力されて表題翻訳処理が終了する。Next, the syntax analysis section! 2 is the content of the dictionary lookup result buffer shown in Figure 4 and the grammar rules (a) and (a) shown in Table 1.
c), (d) and (e), the syntax is analyzed as shown in FIG. 5(a). Further, the horizontal text analysis unit 12 checks the title flag set in step S8 of FIG. Then, since the title flag is set, the input sentence (a) is determined to be a title, and the lateral sentence analysis unit 12 fails in the lateral sentence analysis using the verb phrase, as shown in Figure 5 (ra). let As a result, the parser 1
In 2, backtracking works, and the part of speech of rriseJ stored in the dictionary lookup result buffer shown in Figure 4 is changed to the second part of speech "noun", and the grammar rule is changed to (b) of the grammar rule shown in Table 1. and (r) by changing
Syntax analysis is performed as shown in FIG. 5(b). Then, since the structure of the resulting syntax is a noun phrase, the syntactic analysis is successful, and the source language parse tree shown in FIG. 5(b) is determined. Thereafter, the conversion unit 13. The process moves to the translated sentence generation section 14, the translation result of the input sentence (a) is outputted as shown in FIG. 6(a), and the title translation process ends.

一方、入力文（ｂ）が入力されると、単語ｒ　Ｓ　ｕｒ
ｆａｃｅＪの先頭は大文字であるため、第３図のフロー
チャートのステップＳ２．Ｓ３によって単語ｒ　Ｓ　ｕ
ｒｆａｃｃＪがバッファにストアされる。次の単語ｒｔ
ｅｍｐｅｒａｔｕｒｅｓ　Ｊの先頭は小文字であるため
、ステップＳ２で単語ｒ　ｔｅＩＩｌｐｅｒａｔｕｒｅ
ｓ　Ｊは表題であると判別されず、ステップＳ８をスキ
ップするので表題フラグはセットされない。そして、バ
ッファにストアされている単語ｒ　Ｓ　ｕｒｆａｃｅＪ
と共にステップＳＩＯで辞書引きが行なわれる。さらに
、ステップＳ２．ＳＩＯで次の単語ｒｒＩｓｅＪの辞書
引きが行なわれる。その結果、構文解析部１２により第
５図（ａ）に示すような構文解析木が決定される。そし
て、この構文解析木に基づいて変換処理および翻訳文生
成処理が実行される。そして、翻訳結果が第６図（ｂ）
に示すように出力されて翻訳処理が終了する。On the other hand, when input sentence (b) is input, the word r S ur
Since the beginning of faceJ is a capital letter, step S2. of the flowchart of FIG. Word r S u by S3
rfccJ is stored in the buffer. next word rt
Since the beginning of empires J is a lowercase letter, the word r teIIlperature is entered in step S2.
Since sJ is not determined to be a title and step S8 is skipped, the title flag is not set. Then, the word r SurfaceJ stored in the buffer
At the same time, a dictionary lookup is performed in step SIO. Furthermore, step S2. A dictionary lookup for the next word rrIseJ is performed at SIO. As a result, the syntax analysis section 12 determines a syntax analysis tree as shown in FIG. 5(a). Conversion processing and translated sentence generation processing are then performed based on this parse tree. The translation result is shown in Figure 6(b).
The translation process ends with output as shown in .

このように、表題は概ね名詞句であり、表題を表わす単
語列中の各単語の先頭は通常大文字であることを利用し
て、入力された形態素の先頭が大文字であるか否かと、
終端の形態素がピリオドまたはクエスチタン・マークで
あるか否かを判別することによって、入力された原文が
表題であるか文であるかを判別する。したがって、その
判別結果に基づいて、表題であれば各単語の先頭の大文
字を小文字に変換して表題に応じた翻訳を行うことを可
能にしたので、表題が入力されたにもかかわらず通常文
に誤訳したり、表題中の単語の誤解析を行うことなく、
常に正しく翻訳することができる。In this way, by using the fact that a title is generally a noun phrase and that the first letter of each word in the word string representing the title is usually capitalized, we can determine whether the input morpheme starts with a capital letter or not.
By determining whether the final morpheme is a period or a question mark, it is determined whether the input original text is a title or a sentence. Therefore, based on the discrimination results, if it is a title, the first capital letter of each word is converted to lower case, and translation according to the title can be performed. without mistranslating or incorrectly parsing the words in the title.
Can always translate correctly.

〈発明の効果〉以上より明らかなように、この発明は、形＠索解Ｉｌ′
ｒ部、構文解析部、変換部および翻訳文生成部を備えた
機械翻訳装置に、大文字判別手段、メモリー手段１表題
特定手段および大文字／小文字変換手段を設けて、上記
大文字判別手段によって先頭が大文字であると判別され
た形態素をメモリー手段に記憶し、上記表題特定手段に
よって上記メモリー手段に記憶された形態素は表題であ
ると特定されると、表題に応じた翻訳を実行可能にした
ので、表題と通常文が混在した文章が入力されても構文
解析を正確に行うことができ、精度の高い翻訳が可能に
なる。<Effects of the invention> As is clear from the above, this invention has the form @search solution Il'
A machine translation device including an r section, a syntax analysis section, a conversion section, and a translated sentence generation section is provided with an upper case discrimination means, a memory means 1 title identification means, and an uppercase/lower case conversion means. The morpheme determined to be a title is stored in the memory means, and when the title identifying means identifies the morpheme stored in the memory means as a title, translation according to the title can be executed. Even if a sentence is input that contains a mixture of ``and'' normal sentences, the syntax can be parsed accurately, making highly accurate translation possible.

また、表題を翻訳する際に、上記大文字／小文字変換手
段によってメモリー手段に記憶された形態素の先頭の大
文字が小文字に変換されるので、単語列中の大文字で始
まる単語を辞書中の登録語とマツチさ仕ることができ、
表題中の単語が誤まった品詞の同定を受けたり固何名詞
扱いを受けてしまってすすることがなく、精度の高い翻
訳を行うことができる。Furthermore, when translating a title, the case converting means converts the first capital letter of the morpheme stored in the memory means to a lower case letter, so that the word starting with a capital letter in the word string is treated as a registered word in the dictionary. Matsushi can serve,
Highly accurate translation can be performed without the words in the title being identified as incorrect parts of speech or being treated as proper nouns.

[Brief explanation of the drawing]

第１図はこの発明の機械翻訳装置の一実施例のブロック
図、第２図は上記実施例における翻訳モジュールの作用
構成図、第３図は上記実施例による表題判別処理のフロ
ーチャート、下４図は辞書引き結寒バッファの内容を示
す図、第５図は決定された構文解析木の一例を示す図、
第６図は第１図の機械翻訳装置への入力文例とそれに対
する出力文（翻訳文）例を示す図である。１・・・メインＯｒ’Ｕ、　　　２・・・メインメモリ
ー、３・・・ＣＲＴ、　　　　　４・・・キーボード、
５・・・翻訳モジュール、６・・・メモリー、■ト・・
辞書引き形！＠素解析部、１２・・・構文解析部、　　１３・・・変換部、１４・
・・翻訳文生成部。第４図第６図第５図（ａ）ブ（ｂ）フFig. 1 is a block diagram of an embodiment of the machine translation device of the present invention, Fig. 2 is a functional block diagram of the translation module in the above embodiment, Fig. 3 is a flowchart of title discrimination processing according to the above embodiment, and the lower 4 figures is a diagram showing the contents of the dictionary lookup buffer, and Figure 5 is a diagram showing an example of the determined parse tree.
FIG. 6 is a diagram showing an example of an input sentence to the machine translation device of FIG. 1 and an example of an output sentence (translated sentence) therefor. 1... Main Or'U, 2... Main memory, 3... CRT, 4... Keyboard,
5...Translation module, 6...Memory, ■...
Dictionary guide! @Elementary analysis unit, 12...Syntax analysis unit, 13...Conversion unit, 14.
...Translation generation unit. Figure 4 Figure 6 Figure 5 (a) Bu (b) Fu

Claims

[Claims]

(1) A morphological analysis unit that divides the input sentence into morphemes using a dictionary to obtain information such as parts of speech, and analyzes the syntax of the morpheme string divided by the morphological analysis unit using a dictionary and grammar rules. a syntactic analysis unit; a conversion unit that converts the syntactic structure obtained by the syntactic analysis unit into a syntactic structure in the target language; and a translation unit that generates a translated sentence according to the syntactic structure of the target language obtained by the conversion unit. A machine translation device having a sentence generation unit, comprising: an uppercase character determination means for determining whether or not the beginning of an input morpheme is an uppercase letter; and a memory means for storing a morpheme determined to have an uppercase character at the beginning by the uppercase character determination means. , title specifying means for specifying that the morpheme stored in the memory means is a title if the morpheme following the morpheme stored in the memory means is a terminal morpheme and is not a period or a question mark; , when the title specifying means specifies that the morpheme stored in the memory means is a title, case converting means converts the first uppercase letter of the morpheme stored in the memory means into a lowercase letter; A machine translation device characterized by being able to perform translations according to.