JPS59221732A

JPS59221732A - Kana-kanji conversion processing device

Info

Publication number: JPS59221732A
Application number: JP58096681A
Authority: JP
Inventors: Yasuyuki Numata; 泰之沼田; Katsuhiko Fujita; 克彦藤田; Yoshitoshi Yamauchi; 佐敏山内
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1983-05-31
Filing date: 1983-05-31
Publication date: 1984-12-13

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】技術分野本発明はカナ漢字変換処理装置に関し、特に単語２文節
等による区切りを有しｙ−、ｂ）、ｂ）１）ゆるべた書
き入力力４式のカナ漢字変換処理装置において、変換候
補（以下、単に「候補」という）単語抽出に際して各単
語の文中における位置を考慮することにより、候補単語
抽出の妥当性を向」ニさせるとともに確定処理の速度を
向上させることを可能としたカナ漢字変換処理装置に関
するものである。[Detailed Description of the Invention] Technical Field The present invention relates to a kana-kanji conversion processing device, and in particular, to a kana-kanji conversion processing device that has divisions such as two words and clauses, y-, b), b) 1) Yurubeta writing input power, and 4 types of kana-kanji conversion. To improve the validity of candidate word extraction and the speed of confirmation processing by considering the position of each word in a sentence when extracting conversion candidate (hereinafter simply referred to as "candidate") words in a processing device. The present invention relates to a kana-kanji conversion processing device that enables the conversion of kana to kanji.

従来技術従来のカナ漢字変換処理装置においては、選択抽出さ、
ｂた単語列の確定を行う場合に、選択抽出された単語列
を構成する単語数あるいは文字数を用いるものが知られ
ている。しかしながら、この装置においては、例えは、
「わたくしは」（単語数２、文字数５）というような、
あいまいさの非常に少ない文においても、ある一定の解
析長を得るまでは、ｒわたくシ」さえも確定でさないと
いう問題があった。また、上記単語数あるいは文字数等
の、いわゆる評価値は、候補単語の選択抽出には用いら
れておらず、候補単語の選択抽出には他の評価値を用い
ているため、両評価値を求めるための手段が別々に必要
になるという問題もあった。Prior Art In conventional kana-kanji conversion processing devices, selective extraction,
It is known to use the number of words or characters constituting the selected and extracted word string when determining the selected word string. However, in this device, for example,
Something like "I am" (2 words, 5 characters)
Even in sentences with very little ambiguity, there was a problem in that even ``r watakushi'' was not confirmed until a certain parsing length was obtained. In addition, the so-called evaluation values such as the number of words or the number of characters are not used for selecting and extracting candidate words, but other evaluation values are used for selecting and extracting candidate words, so both evaluation values are calculated. There was also the problem that separate means were required for this purpose.

更に、確定単位が単語であるため、助詞、助動詞等も一
つの単位として出力され、オペレータの負担が大きいと
いう問題もあった。Furthermore, since the definite unit is a word, particles, auxiliary verbs, etc. are also output as one unit, which poses a problem of heavy burden on the operator.

目　　　　　的本発明は上記事情に鑑みてなされたもので、その目的と
するところは、従来のカナ漢字変換処理装置における上
述の如き問題をＭ消し、簡単な構成で、確定処理の速度
を向上させるとともにオペレータの負担を減少させるこ
とを可能どしたカナ漢字変換処理装置を提供することに
ある。Purpose The present invention has been made in view of the above circumstances, and its purpose is to eliminate the above-mentioned problems in conventional kana-kanji conversion processing devices and improve the speed of confirmation processing with a simple configuration. Another object of the present invention is to provide a kana-kanji conversion processing device that makes it possible to reduce the burden on an operator.

構　　　成以下、本発明の構成を実施例により５）−相に説明する
。Configuration The configuration of the present invention will be explained below in terms of 5)-phase by way of examples.

まず、最尤候補１１１語の選択抽出について説明する。First, the selection and extraction of the 111 most likely candidate words will be explained.

本実施例においては、辞１Ｆ内に、後述する！！１１さ
各単語の文中における出現１ｉｌＩ′率（以下、単に「
出現確率」という）を格納してあＺ′Ｉｌ’ｌ’）’ｉ
に特徴を有するものである。上記出現確率は、辞書検索
手段により検索された単語とこれに先行する鵬語との接
続可否を、上記検索された単語の文中における位置を考
慮して定めたものであり、他のパラメータと組合わせて
各単語の評価値を形成する。最尤候補単語選択抽出はこ
の評価値に基づいて行われる。In this embodiment, the following will be described later in the first floor. ! 11 The appearance rate of each word in a sentence (hereinafter simply ``
Z'Il'l')'i
It has the following characteristics. The above probability of occurrence is determined by considering the position of the searched word in the sentence to determine whether or not the word searched by the dictionary search means can be connected with the preceding Peng word, and is combined with other parameters. Together, they form an evaluation value for each word. Maximum likelihood candidate word selection and extraction is performed based on this evaluation value.

第１図は本発明の一実施例であるカナ漢字変換処理装置
の最尤候補単語抽出部の構成例を示すブロック図である
。図において、１は入力文字列バッファ、２は単語辞＠
１１を検索して候補単語を抽出する辞書検索部、３は該
辞８倹銀部２が抽出した単語を保持する単語バッファ、
４は上記辞書検索部２が抽出して単語バッファ３に格納
した単語について、接続可否検定部１２を参照して先行
する単語との接続可否の検定を行う接続可否検定部であ
る。また、５は上記接続可否検定部４により接続可と判
定さ扛た単語を格納している接続可ｎに単語バソンア、
６は出現確率表１３を参照して上記接続可能Ｊ、１４語
バッファ５中の各単語に出現確率の値を（１加する出現
確率認定部、７は上記出現確率値付き単語を格納してい
るＱｉ語バッファ・、８は出現確率総合、７勤部で；ｂ
　’Ｊ・９（よ出力文ｑニジ１］ノヘソフア、１０は前
記出現確率値および後述する総合評価値付き単記を格納
している候補単語保持用バッファである。FIG. 1 is a block diagram showing an example of the configuration of a maximum likelihood candidate word extraction section of a kana-kanji conversion processing device according to an embodiment of the present invention. In the figure, 1 is the input string buffer, 2 is the word dictionary @
11 is a dictionary search unit that searches for and extracts candidate words; 3 is a word buffer that holds the words extracted by the dictionary unit 2;
Reference numeral 4 denotes a connectability test section that refers to the connectability test section 12 to test whether the word extracted by the dictionary search section 2 and stored in the word buffer 3 can be connected to a preceding word. In addition, 5 is a word bassona in a connectable n which stores words that are determined to be connectable by the connectability testing unit 4.
6 refers to the appearance probability table 13 and adds the appearance probability value (1) to each word in the connectable J and 14 word buffers 5. 7 stores the words with appearance probability values. Qi word buffer, 8 is the total appearance probability, 7th shift; b
9 (output sentence qniji 1) 10 is a buffer for holding candidate words that stores the appearance probability value and a single word with an overall evaluation value, which will be described later.

第２図は上記出現確率認定部６が用いる出＠確率表１３
の内容の一例を示すものである。木表においては、単語
抽出対象文字列の位置と、先行する単語の品詞と字種を
組合わせた語順が行に、上記語順が列に配されている。Figure 2 shows the appearance probability table 13 used by the appearance probability recognition section 6.
This shows an example of the content. In the tree table, the word order, which is a combination of the position of the character string to be extracted and the part of speech and character type of the preceding word, is arranged in rows, and the word order is arranged in columns.

第３図は単語辞書１１の内容の一例を示すものである。FIG. 3 shows an example of the contents of the word dictionary 11.

頻度ランクは、例えば、前記使用頻度をクラス分けして
得た数値で、使用頻度の高いもの程大きい数値になって
いる。The frequency rank is, for example, a numerical value obtained by classifying the frequency of use, and the higher the frequency of use, the larger the numerical value.

上述の如く構成された本実施例の最尤候補単語抽出部の
動作を以下説明する。The operation of the maximum likelihood candidate word extraction section of this embodiment configured as described above will be described below.

第１の例として、入力文字列バッファ１に、文頭から「
このしやしんがほしい」という入力があった場合を考え
る。As a first example, in input string buffer 1, from the beginning of the sentence "
Let's consider a case where there is an input saying, "I want this shiyashin."

辞＠険索部２は上記入力文の先頭がら静間びさを行い、
「子Ｊ＋　’９ｍＪ、ｒこの」等を抽出して、こＡしを
Ｊ枦ｒＥｌｔハソファ３に格納する。単語バッファ３に
格納された各単語は、接続可否検定部４で接続可否の検
定を受ける。接続可否検定は、先行する単語の品詞と当
該単語の接続の可否を１’、’Ｏ’で表現したテーブル
を参照することによって行われる。この例では、先に挙
げた単語はどれも接続可なので、すべて接続可能Ｑ弓Ｒ
バッファ５に送られる。The dictionary@seasearch section 2 performs Shizuma bisa from the beginning of the above input sentence,
``ChiJ+'9mJ, rkono'' etc. are extracted and this A is stored in the JElt hash sofa 3. Each word stored in the word buffer 3 is tested by a connectability testing section 4 to see if it can be connected. The connectivity test is performed by referring to a table in which the part of speech of the preceding word and the connectivity of the word are expressed as 1' and 'O'. In this example, all of the words listed above can be connected, so all can be connected.
Sent to buffer 5.

次に出現確率の認定を行う。出現確率のバ、忍定には、
前述の出現確率表１３を用いる。第２図に示した如く、
単語が文頭にある場合には、連体詞である「この」のみ
に出現確率′１′が与えられており、他の名詞ｒ子」、
「弧」の出現確率は′Ｏ′となっている。これらの値を
ｆづ加された単語は、出現確率付き単語を格納する単語
バッファ７に送られる。Next, the appearance probability is determined. The probability of appearance, Shinoda, is
The aforementioned appearance probability table 13 is used. As shown in Figure 2,
When a word is at the beginning of a sentence, only the adnominal ``kono'' is given an appearance probability of '1', and other nouns are
The probability of appearance of an "arc" is 'O'. The words with these values added by f are sent to the word buffer 7 which stores words with appearance probabilities.

出現確率総合評価部８は、上記ｔｌｉ語バッファ７内の
ＩＪＩ語を１語ずつ取出して次のように出現確率総合評
価を行う。すなわち、出現確率総合評価値（以下、単に
「評価値」という）Ｆを、Ｆ（Ｑ、ｆ、ｃ）＝Ｉ＋、＋
ｆ＋２ｅと定義する。ここて、ｐは前記読み長、ｆは前記頻度ラ
ンク、ｅは上記出現確率の値である。上記評価１直Ｆの
値が最高の単語を出力するわけである。The appearance probability comprehensive evaluation section 8 extracts the IJI words from the tli word buffer 7 one by one and performs an appearance probability comprehensive evaluation as follows. That is, the appearance probability comprehensive evaluation value (hereinafter simply referred to as "evaluation value") F is expressed as F (Q, f, c) = I +, +
Define it as f+2e. Here, p is the reading length, f is the frequency rank, and e is the value of the appearance probability. This means that the word with the highest value of the above-mentioned evaluation 1st round F is output.

ここでは、Ｆ子」には、　　　　Ｉ　＋２　＋２　Ｘ　Ｏ＝　３「
弧」には、　　　　１＋１＋２ＸＯ＝２「この」には、
　　　２　＋　３　＋　２　Ｘ　］、　＝　７の如く数
値が与えられる。これにより、「この」が出力文字列バ
ッファ９に送られ、「子」と「弧」は前記候補単語保持
バッファ１０に貯えられて、後のバンクトラフ９等に対
処でさるようになる。Here, "F child" has I +2 +2 X O= 3"
For "arc", 1+1+2XO=2 for "this",
2 + 3 + 2 X], = 7 is given as a numerical value. As a result, "kono" is sent to the output character string buffer 9, and "child" and "arc" are stored in the candidate word holding buffer 10, so that they can be stored in the bank trough 9, etc. later.

次に、第２の例として、「女の」まで抽出が済んでいる
状態で、「このしやしん」と入力があった場合を考える
。この場合、出現確率の認定以前の動作は」二記第１の
例と同じであるので、説明は省略する。Next, as a second example, consider a case where "Konoshiyashin" is input in a state where "Onna no" has already been extracted. In this case, the operation before the recognition of the appearance probability is the same as in the first example of Section 2, so the explanation will be omitted.

第２図に示される如く、ｉ各助詞「の」に続く単語の出
現確率は、「子」と「弧」に′１・が、また、「この」
には０′がそ」しぞ、ｌｔ与えられている。As shown in Figure 2, the probability of appearance of the word following each particle ``i'' is ``1・'' in ``child'' and ``arc'', and ``kono''
is given as 0'.

従って、前記５・［Ｉ価値Ｆは、「子」に１よ、　　　　１　＋２　＋　２　Ｘ　ｌ　”
　５「弧」には、　　　　］＋］　＋２　Ｘ　１　＝　
４「この」には、　　　２　＋３　＋　２　Ｘ　Ｏ＝　
５の如く数値が与えられる。この場合には、「子」と「
この」が同じ数値になるので、ｊＳ初の’Ｉ−３Ｑ　ｒ
子」が出力文字列バッファ９に送られる。Therefore, the above 5.[I value F is 1 for “child”, 1 + 2 + 2 X l”
5 “Arc” has ]+] +2 X 1 =
4 “This” has 2 + 3 + 2 X O=
A numerical value such as 5 is given. In this case, "child" and "
This' is the same value, so jS's first 'I-3Q r
child' is sent to the output string buffer 9.

上述°の如く、本実施例装置によれば、文中での即諾の
位置によって、異なる単語が抽出されるようになるので
、最尤候補単語の選択抽出における妥当性が著しく向」
ニすることになる。As mentioned above, according to the device of this embodiment, different words are extracted depending on the position of the immediate consent in the sentence, so the validity of selecting and extracting the most likely candidate word is significantly improved.
I will do this.

なお、」−２評価値Ｆの内容は、」二記実施例に示した
ものに限られるものではなく、他の種々の表現が可能で
ある。例えば、頻度ランクの代りに頻度を直接用いるこ
と、出現確率表の記載を１′。Note that the contents of the "-2 evaluation value F" are not limited to those shown in the second embodiment, and various other expressions are possible. For example, use frequency directly instead of frequency rank, and write appearance probability table 1'.

０′でなく他の適切な値どすること等が可能である。It is possible to use other suitable values instead of 0'.

次に、」二連の如き方法によって選択抽出された最尤候
補単語から成る単語列（以下、「最尤候補単語列」とい
う）の確定について説明する。Next, the determination of a word string (hereinafter referred to as a "maximum likelihood candidate word string") consisting of the maximum likelihood candidate words selected and extracted by a method such as ``two series'' will be explained.

最尤候補単語列の確定は、各最先候補単語の内自立語で
あるものの数を累Ｏ１シてゆき、こＪＬが予め定めた閾
値に達した１１３′点で行うものとする。なお、本実施
例においては、この際、上述の閾値とは異なる、別に予
め定めた間直に対応する最力銀捕単語列が確定さ汎るこ
とになる。The most likely candidate word string is determined by accumulating the number of independent words among each of the first candidate words, and at point 113' when JL reaches a predetermined threshold. In addition, in this embodiment, at this time, the most powerful silver proverb string corresponding to a separately predetermined time, which is different from the above-mentioned threshold value, is determined.

以下、これを実施例に基づいて詳細に説明する。This will be explained in detail below based on examples.

第４図は本発明の一実施例であるカナ漢字変換処理装置
の要部を示すブロック図である。図において、２１は入
力文字列な保持する入力文字列保持部、２２は前述のノ
ｕ尤候補単詔Ｊ１１Ｊ出部、２３は該最尤候補単語抽出
部２２によって抽出された最尤候補！１ｉｆ？ｆを保持
する単語抽出結果作持部、２４は」二記最先候補’ｌ′
Ｌ：：ｄ抽出部２２により抽出された単語の品詞解釈を
行う品詞ｆＷ釈部、２５は該品詞解釈部２４による品詞
解釈の結果が自立語である場合にカラン１−アップされ
る自立語数カウンタである。本実施例においては」二記
自立語数カウンタの内容が３′になった０３５点で確定
処理を行い、前記自立語数カウンタの値は再び０′に切
期化される。なお、２６は上記自立語数カウンタ２５に
より起動され未確定エリア内のｒト語列在文節単位で確
定する確定処理部、２７は」二記未確定エリア内の単語
列を文節単位に保持する未１１℃定単語列保持部、２８
は」二記確定処理によっ゛Ｃ確定さＡした単語列を出力
するディスプレイ篇−の出力部である。FIG. 4 is a block diagram showing the main parts of a kana-kanji conversion processing device according to an embodiment of the present invention. In the figure, 21 is an input character string holding unit that holds the input character string, 22 is the above-mentioned Nou likely candidate single edict J11J output unit, and 23 is the maximum likelihood candidate extracted by the maximum likelihood candidate word extraction unit 22! 1 if? The word extraction result creation unit that holds f, 24, is the first candidate 'l'
L::d A part-of-speech fW interpretation unit that interprets the part-of-speech of the word extracted by the extraction unit 22, and 25 is an independent word number counter that is incremented by 1 when the result of part-of-speech interpretation by the part-of-speech interpretation unit 24 is an independent word. It is. In this embodiment, the determination process is performed at point 035, when the content of the independent word counter becomes 3', and the value of the independent word counter is reset to 0' again. In addition, 26 is a confirmation processing unit that is activated by the independent word counter 25 and confirms the word string in the unconfirmed area in units of phrases, and 27 is an unconfirmation processing unit that maintains the word strings in the unconfirmed area in units of clauses. 11°C constant word string holding section, 28
is an output section of the display section which outputs the word string that has been determined by the second determination process.

以下、本実施例装置の動作を説明する。The operation of the apparatus of this embodiment will be explained below.

前述の最尤候補単語抽出部２２によって得られたｌ、語
の各々について、品詞解釈部２４により品詞解釈を行う
。そして、品詞解釈の結果、自立語（体言、用に）と判
定されたものについては、その数を上記自立語数カウン
タ２５によりカラン１〜する。A part-of-speech interpretation unit 24 performs part-of-speech interpretation for each of the l words obtained by the maximum likelihood candidate word extraction unit 22 described above. As a result of the part-of-speech interpretation, for words that are determined to be independent words (indicative, idiom), the independent word number counter 25 counts them from 1 to 1.

上記自立語数カウンタ２５の内容が３′に達すると、前
記確定処理部２６が起動さ」し、上記３つの自立語を含
む少なくとも３つの文節の内、ｊ−初の文節を確定する
。When the content of the independent word number counter 25 reaches 3', the determination processing section 26 is activated and determines the j-first phrase among at least three phrases including the three independent words.

例えば、下記の如き文をｔえる。For example, consider the following sentences.

へいそはかくべつのおひきたて・・− この例文においでは、アンダーラインを付した自立語の
内、３つ口の「引立て」が抽出された時点で確定条件が
成立し、これにより、１つ目の自立語である「平素」を
含む「、’ｑＬ素は」という文節を１１１′定すること
になる。In this example sentence, the confirmation condition is met when three of the underlined independent words are extracted, and as a result, 1 The phrase ``,'qL element wa'' containing the second independent word ``heimoto'' is defined as 111'.

なお、この場合、」二記「引立て１以後における単語抽
出不能等によるバックトラック範凹が、上記第１文節よ
り後、すなわち、第２文節以１ｍに設定されることにな
る。In this case, the backtrack range due to the inability to extract words after the first phrase is set after the first phrase, that is, 1 m after the second phrase.

なお、」二記出力部２８への表示を、例えは、表示文字
列中の確定部分は固定表示に、また、未確定部分はブリ
ンク表示にする笠の方法を探ることにより、オペレータ
に現在のＭ　ｔＪ７状況を知らせることが可能であるこ
とは７；うまでしない。In addition, by exploring Kasa's method of displaying the display on the output section 28, for example, displaying the confirmed part in the display character string in a fixed display, and displaying the unconfirmed part in a blinking display, the operator can be informed of the current information. It is not possible to inform MtJ7 of the situation.

第５図は本発明の他の実施側々示すものであり、図にお
いて、記号］、、９，１１〜１３は第１図に示したと同
じ構成要素を示している。また、３０はカナ漢字変換制
御部、４０は総会５”ｊ’　ｆｌｌｆｉ　Ｍ単語バッフ
ァを示している。カナ漢字変換部３０は、第１図におい
ては個々の手段どして示した、辞書検索手段、接続可否
検定手段、出現確率；、Ｂ定手段および出現確率総合評
価手段等を含むものでｂる。FIG. 5 shows another embodiment of the present invention, and in the figure, symbols ], 9, 11 to 13 indicate the same components as shown in FIG. Further, 30 indicates a kana-kanji conversion control unit, and 40 indicates a general meeting 5''j' fullfi M word buffer. , a connection possibility test means, an appearance probability; a B-determined means, and an appearance probability comprehensive evaluation means.

本実施例装置の動作は、まず、入力文字列に対し単語辞
書１１を用いて辞店検索を行い、そこで単語が１つ児つ
かったら該単語に則して接続可否検定を」二記接続可否
検定表１２を参照して行う。The operation of the device of this embodiment is as follows: First, a dictionary search is performed on the input character string using the word dictionary 11, and when one word is found, a connectability test is performed based on the word. Perform this with reference to Test Table 12.

更に、接続可の単語に対して出現確率表１３を用いて出
現確率を求め、該出現確率と読み長、頻度ランクとを用
いた総合評価を行う。その結果を単語辞書１１の内容と
ともに、前記総合評価済ｑｔ語バッファ４０に格納する
。Furthermore, the appearance probabilities of connectable words are determined using the appearance probability table 13, and a comprehensive evaluation is performed using the appearance probabilities, reading lengths, and frequency ranks. The results are stored in the comprehensively evaluated qt word buffer 40 together with the contents of the word dictionary 11.

上記処理は、入力文字列の同じ位置からマツチする単語
が見つからなくなるまで行い、単語が見つかるごとに上
記検定、ｌｉｆ価、格納の手続きを繰り返す。これによ
って賀ら４した」二記総合評価済単語バッファ４０内の
単πｔ１の中から、Ｊｆ価値が最大のものを選択してこ
れを出力文字列バッファ９に　　　□送り出す。The above processing is performed until no matching word is found from the same position in the input character string, and the above verification, lif value, and storage procedures are repeated each time a word is found. As a result, from among the single πt1 in the comprehensively evaluated word buffer 40, the one with the largest Jf value is selected and sent to the output character string buffer 9.

本発明は上記実施例に限定せれるべきものではなく、種
々の変形が可能であること＋′ｌｌ：ｇうまてもないこ
とであり、本発明はそのような変形をも包含するもので
あることも３うまでもない・効　　　果以上述べた如く、本発明によれは、候補単語抽出に際し
て各単語の文中における位置を考慮することにより、筒
Ｊｌｊなイ１．７成で、確定処理の速度を向上させるこ
とが可能なカナ漢字変換処理装部を実現できるという顕
著な効果を奏するものである。The present invention should not be limited to the above-described embodiments, but may be modified in various ways, and the present invention includes such modifications. As mentioned above, the present invention takes into consideration the position of each word in the sentence when extracting candidate words, thereby increasing the speed of the confirmation process by a factor of 1.7. This has the remarkable effect of realizing a kana-kanji conversion processing unit that can improve the performance of the kana-kanji conversion processing unit.

[Brief explanation of drawings]

第１図は最尤候補単語抽出部の構成例を示す図、第２図
は出現確率表の内容の一例を示す図、第３図は単語辞書
の内容の一例を示す図、第４図、第５図は本発明の実施
例を示すブロツクＩ７１である。に人力文字列バッファ、２：辞７）検京部、３　；　１
１１゜詔バッファ、４＝接続可否検定部、５：接続可能
単語バッファ、６：出現確率認定部、７：単語バッファ
、８：出現確率総な評価部、９：出力文字列バッファ、
１０：候補単語保持用バッファ、１１：単語辞書、１２
：接続可否検定表、１３：出現確イ１表、２１：人力文
字列作持部、２２：最先候補単語抽出部、２３：卯語抽
出結果保持部、２４：品ｆｌｉ’１ｆｌｉ：釈部、２５
：自立語数カウンタ、２６：確定処方■部、２７：未確
定単語列作持部、２８：出力部、３０：カナ漢字変換部
、４０：総合ｉｉ’Ｉ’価済単５５ノくノファＣFIG. 1 is a diagram showing an example of the configuration of the maximum likelihood candidate word extraction unit, FIG. 2 is a diagram showing an example of the contents of an appearance probability table, FIG. 3 is a diagram showing an example of the contents of a word dictionary, and FIG. FIG. 5 is block I71 showing an embodiment of the present invention. Human string buffer, 2: 7) Kenkyobu, 3; 1
11゜Edict buffer, 4 = Connectability test unit, 5: Connectable word buffer, 6: Appearance probability recognition unit, 7: Word buffer, 8: Appearance probability total evaluation unit, 9: Output string buffer,
10: Buffer for holding candidate words, 11: Word dictionary, 12
: Connectability test table, 13: Appearance confirmation table 1, 21: Human string creation section, 22: First candidate word extraction section, 23: Uji extraction result holding section, 24: Product fli'1fli: Interpretation section , 25
: Independent word counter, 26: Determined prescription ■ part, 27: Undetermined word string creation part, 28: Output part, 30: Kana-kanji conversion part, 40: Comprehensive ii'I' valued single 55 kunofa C

Claims

[Claims]

(1) In a kana-kanji conversion processing device, means for giving an evaluation value unique to each searched word, means for determining a maximum likelihood candidate word based on the evaluation value, and means for determining a maximum likelihood candidate word based on the evaluation value. Provided is an independent word number counter that selects an independent word if it is an independent word, and a means for comparing the value of the independent word number counter with a predetermined threshold, and when the content of the independent word number counter exceeds the threshold, 1. A kana-kanji conversion processing device comprising: determining at least the first -- clause of an undetermined maximum likelihood candidate word string.

(2) The kana-kanji conversion processing device according to claim 1, wherein the evaluation value is an evaluation value that is calculated by multiplying the probability of appearance of the word depending on its position in a sentence.