JP2550012B2

JP2550012B2 - Pattern cutting and recognition method

Info

Publication number: JP2550012B2
Application number: JP59216139A
Authority: JP
Inventors: 康明中野; ▲浩▼道藤沢; 修国崎; 清道栗野; 彰三門田
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1984-10-17
Filing date: 1984-10-17
Publication date: 1996-10-30
Anticipated expiration: 2011-10-30
Also published as: JPS6195481A

Description

【発明の詳細な説明】〔発明の利用分野〕本発明はパターン切り出し及び認識方法に関し、たと
えば光学文字読み取り装置において、自然な筆記条件で
書かれた帳票のパターンを切り出す方法及びそれを認識
する方法に関するものである。Description: BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a pattern cutout and recognition method, for example, a method for cutting out a pattern of a form written under natural writing conditions and a method for recognizing the pattern in an optical character reading device. It is about.

[Background of the Invention]

従来の文字読み取り装置（以下、OCRと略する）に読
ませる文字は、第１図（ａ）に示すように、文字ごとに
設定された文字枠11内に正しく筆記する必要がある。そ
の場合、多少の枠からのはみ出しは許容されているが、
その程度は第１図（ｂ）に示すように、上下方向につい
ては1.0〜1.5mm程度、左右方向については隣の枠に入ら
ない程度である。Characters to be read by a conventional character reading device (hereinafter abbreviated as OCR) must be correctly written in a character frame 11 set for each character, as shown in FIG. 1 (a). In that case, the protrusion from the frame is allowed,
As shown in FIG. 1 (b), the degree is about 1.0 to 1.5 mm in the up-down direction, and it is not in the adjacent frame in the left-right direction.

ところで、OCRを益々普及させるためには、上記のよ
うにOCR独特な文字枠内部に文字を筆記させることな
く、第２図，第３図に示すように文字枠にあまりこだわ
らず、通常我々が記入しているような筆記条件を可能に
することが必要である。By the way, in order to spread OCR more and more, we do not have to write the characters inside the OCR unique character frame as described above, and as shown in Figs. It is necessary to enable the writing conditions as written.

従来の文字枠は、寸法が大きいとともに、文字枠間ギ
ヤツプ５が1.0〜1.5mmであるのに対し、条件の緩和され
た文字枠は第２図の12で示すように、寸法が小さくなる
とともに、文字枠間ギヤツプ6,7が0mmとなる。その結果
として、文字の枠12からのはみ出しが大きくなり、また
文字相互が縦方向にオーバラツプしたり、あるいは文字
相互間が接触し易くなるという問題が生ずる。さらに文
字パターン成分が分離しているような場合、例えば、第
２図における文字「化」等では、その成分の大部分が隣
の枠に入ることがあり、文字読取上困難な問題となる。The conventional character frame has a large size and the gap 5 between character frames is 1.0 to 1.5 mm, while the character frame with relaxed conditions has a smaller size as shown by 12 in FIG. , Gears between character boxes 6, 7 are 0 mm. As a result, there is a problem in that the characters protrude from the frame 12, and the characters overlap each other in the vertical direction, or the characters easily come into contact with each other. Further, in the case where the character pattern components are separated, for example, in the case of the character "ka" or the like in FIG. 2, most of the components may be in the adjacent frame, which is a problem in character reading.

従来技術では、例えば「特開昭59−98283号公報」に
おいて、１文字パターンへの切り出しの際に、１文字パ
ターンを区切る境界の判断に曖昧性が生じている場合に
は、パターン間の境界に複数の仮説を作つて、各仮説の
単位パターンを認識部に送り、該認識部では各仮説の単
位パターンの認識結果を総合的に判断して各仮説の中か
ら単一の仮説を選択する方法が開示され、英字・数字・
片仮名の範囲では有効であることが知られている。しか
し、認識対象を漢字にまで拡張すると、この方法では第
３図に示すように漢字「化学」の中の文字「化」の偏と
旁が大きく離れて書かれた場合、切り出し・認識部がこ
れを片仮名の「イヒ」と切り出し部及び認識部がこれを
片仮名の「イヒ」と切り出し・認識する仮説は妥当であ
ると判断され、漢字としての認識結果「化」を単一の仮
説として選択できない問題があつた。In the prior art, for example, in Japanese Unexamined Patent Publication No. 59-98283, when there is ambiguity in the determination of the boundary separating one character pattern when cutting out into one character pattern, the boundary between patterns is determined. Create a plurality of hypotheses and send the unit pattern of each hypothesis to the recognition unit, and the recognition unit comprehensively judges the recognition result of the unit pattern of each hypothesis and selects a single hypothesis from each hypothesis. The method is disclosed, and letters, numbers, and
It is known to be effective in the range of katakana. However, if the recognition target is expanded to kanji, this method will cause the cutout / recognition part to cut when the deviation of the character "ka" in the kanji "chemistry" and the stroke are written far apart, as shown in FIG. It is judged that the hypothesis that Katakana “Ihi” is cut out and recognized by the cutout unit and recognition unit as Katakana “Ihi” is valid, and the recognition result “Kana” as Kanji is selected as a single hypothesis. There was a problem that I could not do.

[Object of the Invention]

本発明の目的は、このような従来の問題を解決するた
め文字枠から大きくはみ出したり、隣接文字と接触して
いるような自然な筆記条件で書かれた帳票上の漢字を含
むパターンを切り出し、それらのパターンを認識する手
段を提供することにある。An object of the present invention is to largely protrude from a character frame in order to solve such a conventional problem, or to cut out a pattern including kanji on a form written under natural writing conditions such that it is in contact with an adjacent character, It is to provide a means for recognizing those patterns.

[Outline of Invention]

本発明は、１文字単位の文字枠を有しない帳票を入力
し、切り出し部により電気的信号に変換された２次元映
像パターンから文字を含むパターンを切り出し、認識部
において切り出されたパターンをパターン辞書内の各パ
ターンと比較して認識するパターン切り出し及び認識方
法であって、上記２次元映像パターンの１文字行分に対
応する領域を対象として連続する黒画素の領域をパター
ン成分として抽出し、パターン成分の並びに関して順序
付けを行い、上記パターン成分を要素とする成分リスト
を作成するとともに、上記パターン成分についてその属
性を算出し、算出されたパターン成分の属性を参照し
て、文字を構成する１または複数のパターン成分の組み
合わせについての複数の仮説を立て、複数の仮説のそれ
ぞれに基づいて、上記成分リストから認識すべきパター
ン成分の組み合わせを構成してパターン辞書内の各パタ
ーンと照合して認識を行い、複数の仮説に基づく認識結
果に対し、予め定めた規制により単一の仮説がアクセプ
トとされた場合には、当該認識結果を最終的な認識結果
として出力し、複数の仮説がアクセプトされた場合に
は、複数の仮説に基づく各認識結果を得て、各認識結果
に対して言語辞書の内容と照合することにより確からし
い一の仮説に基づく認識結果を最終的な認識結果として
出力することを特徴とする。The present invention inputs a form that does not have a character frame for each character, cuts out a pattern including characters from the two-dimensional video pattern converted into an electric signal by the cutout unit, and cuts out the pattern in the recognition unit into a pattern dictionary. A pattern segmentation and recognition method that is performed by comparing each pattern in FIG. 1 with each pattern in FIG. The components are arranged in order, a component list having the pattern component as an element is created, the attribute of the pattern component is calculated, and the attribute of the calculated pattern component is referred to to construct a character 1 or Make multiple hypotheses about combinations of multiple pattern components, and based on each of the multiple hypotheses, A combination of pattern components to be recognized is constructed from the min list and matched against each pattern in the pattern dictionary for recognition.For a recognition result based on multiple hypotheses, a single hypothesis is accepted as a predetermined regulation. If a plurality of hypotheses are accepted, each recognition result based on the plurality of hypotheses is obtained, and the language dictionary is obtained for each recognition result. It is characterized by outputting a recognition result based on a certain hypothesis as a final recognition result by collating with the content of.

Example of Invention

以下、本発明の原理及び実施例を図面により説明す
る。Hereinafter, the principle and embodiments of the present invention will be described with reference to the drawings.

本発明の原理は、次の三つの点にある。すなわち、
（１）パターンの切り出しにおいて、曖昧性が生じた場
合は、無理に判断をすることなく、まず複数の仮説を立
てて、各々の仮説による単位パターンを認識部に送る。
（２）認識部では部分パターンや接触パターンの識別を
行い、総合的判断にもとづいて認識し、単一の候補カテ
ゴリが決定できない場合は各単位パターンに対して複数
の候補カテゴリを言語処理部に送る。（３）言語処理部
では複数の候補カテゴリの複数の仮説の系列から言語知
識を用いて総合判断を行い、その結果から切り出し及び
認識の妥当性のチエツクを行い曖昧性を解消する。The principle of the present invention lies in the following three points. That is,
(1) When ambiguity occurs in pattern cutting, a plurality of hypotheses are first set and a unit pattern based on each hypothesis is sent to the recognition unit without making a forcible judgment.
(2) The recognition unit identifies partial patterns and contact patterns and recognizes them based on comprehensive judgment. If a single candidate category cannot be determined, multiple candidate categories for each unit pattern are sent to the language processing unit. send. (3) The language processing unit makes a comprehensive judgment from a sequence of a plurality of hypotheses of a plurality of candidate categories by using language knowledge, and cuts out the result and checks the validity of recognition to eliminate the ambiguity.

第４図は、隣接文字パターンの種々の状態を示す図で
ある。FIG. 4 is a diagram showing various states of adjacent character patterns.

まず、第４図（ａ）ではパターン31と32とが縦方向に
オーバーラツプしている。この場合には、連続した黒領
域をパターン成分として抽出すれば正しいパターンを切
り出すことができる。連続した黒領域をパターン成分と
して抽出する方法は従来より知られており、枠内に正し
く文字が書かれている場合は勿論のこと、単純にオーバ
ーラツプしている場合でも、黒領域に沿つて枠外にはみ
出している部分まで抽出するので、単位パターンに正し
く切り出すことができる。First, in FIG. 4A, the patterns 31 and 32 overlap each other in the vertical direction. In this case, a correct pattern can be cut out by extracting a continuous black area as a pattern component. A method of extracting a continuous black area as a pattern component has been known in the past, and not only when the characters are correctly written in the frame, but also when the text is simply overlapped, the area along the black area is outside the frame. Since even the portion that protrudes is extracted, it is possible to correctly cut out into a unit pattern.

次に、第４図（ｂ）では、パターンが部分33と34に分
離していて、分離した成分34の大部分が隣接の枠に入つ
ている。パターン34が枠21に属するのか、枠22に属する
のか不明の場合は、双方をあり得るケースとして多重の
仮説を作る。そして、双方のケースを別個に認識部さら
に言語処理部に送つて、その結果からどちらの仮説が正
しかつたかを決定する。Next, in FIG. 4 (b), the pattern is separated into portions 33 and 34, and most of the separated components 34 are contained in the adjacent frames. When it is unclear whether the pattern 34 belongs to the frame 21 or the frame 22, a multiple hypothesis is created by considering both cases as possible cases. Then, both cases are separately sent to the recognition unit and the language processing unit, and which hypothesis is correct is determined from the result.

次に、第４図（ｃ）では、分離文字パターン36が接触
したケースである。Next, FIG. 4 (c) shows a case where the separated character pattern 36 contacts.

第４図（ｄ）では、分離パターン相互で接触したケー
スである。すなわち第４図（ｂ）の場合は、分離パター
ンが文字「非」のみであるのに対し、第４図（ｄ）の場
合は、文字「非」と「凡」の両方が分離パターンであ
り、それらの分離パターンが接触している。FIG. 4D shows a case where the separation patterns are in contact with each other. That is, in the case of FIG. 4B, the separation pattern is only the character “non”, whereas in the case of FIG. 4D, both the characters “non” and “normal” are separation patterns. , Their separation patterns are in contact.

第４図では、完備なパターン相互が接触したケースで
ある。つまり、分離していないパターンであるが、隣接
パターンが接触している場合である。In FIG. 4, complete patterns are in contact with each other. That is, the patterns are not separated, but the adjacent patterns are in contact with each other.

第４図（ｆ）では、（ｂ）と同様であるが、分離パタ
ーン40,41自体が正しい文字パターン「イ」，「ヒ」と
して存在し、これらを統合したパターンも文字「化」と
して存在する場合である。In FIG. 4 (f), the same as in (b), but the separation patterns 40, 41 themselves exist as the correct character patterns "a" and "hi", and the pattern obtained by integrating these also exists as the character "ka". This is the case.

第４図（ｂ）〜（ｆ）のケースに対して認識する方法
を次に説明する。A method of recognizing the cases shown in FIGS. 4B to 4F will be described below.

第５図〜第９図は、それぞれ本発明の認識原理を示す
図であつて、第５図は、切り出し部が複数の仮説を立て
た場合の動作説明図、第６図は、分離したパターン成分
が隣接文字と接触した場合の認識結果を示す図、第７図
は、分離したパターン成分相互で接触したときの動作説
明図、第８図は、完備なパターン相互で接触したときの
認識結果を示す図である。第９図は、分離したパターン
も、統合したパターンも文字として存在する場合の動作
説明図である。5 to 9 are diagrams showing the recognition principle of the present invention. FIG. 5 is a diagram for explaining the operation when the cutout unit makes a plurality of hypotheses, and FIG. 6 is a separated pattern. FIG. 7 is a diagram showing a recognition result when a component touches an adjacent character, FIG. 7 is an operation explanatory diagram when the separated pattern components touch each other, and FIG. 8 shows a recognition result when the complete pattern touches each other. FIG. FIG. 9 is an operation explanatory diagram when the separated pattern and the integrated pattern exist as characters.

まず、第４図（ｂ）のように、分離したパターン成分
34が隣接枠に入つている場合の認識方法を第５図により
説明する。First, as shown in FIG. 4 (b), the separated pattern components
A recognition method when 34 is in the adjacent frame will be described with reference to FIG.

第５図において、51,53は切り出し部が出力した二つ
の仮説である。200は認識部、100はパターン辞書、101
〜104はパターン辞書内の部分辞書である。認識部200は
二つの仮説51,53を入力して文字認識を行い、それぞれ
に対する認識結果52,54を出力する。すなわち、第一の
仮説に対する結果は（SP.非）と（RJ.?）である。ここ
で（SP.非）は、「非」のサブ・パターンの意味であ
り、部分パターンの辞書101を参照して認識されたもの
であり、また（RJ.?）はリジエクト（不読）である。さ
らに、第二の仮説に対する結果は、（AC.非）と（AC.
凡）であつて、文字「非」あるいは「凡」としてアクセ
プト（受理）したことを意味する。したがつて、第二の
仮説が妥当であり、認識結果は、文字「非、凡」とな
る。なお、パターン辞書100に設けられる四つの部分辞
書101〜104は、新しく設けられたものであつて、従来は
正常なパターンの辞書104のみが設けられていた。部分
辞書101は、部分パターンの辞書、部分辞書102は、部分
パターンと他の文字とが接触したパターンの辞書、部分
辞書103は、接触文字パターンの辞書である。In FIG. 5, 51 and 53 are two hypotheses output by the clipping unit. 200 is a recognition unit, 100 is a pattern dictionary, 101
˜104 are partial dictionaries in the pattern dictionary. The recognition unit 200 inputs the two hypotheses 51 and 53, performs character recognition, and outputs recognition results 52 and 54 for each. That is, the results for the first hypothesis are (SP.non) and (RJ.?). Here, (SP.non) means "non" sub-pattern, which is recognized by referring to the partial pattern dictionary 101, and (RJ.?) Is a reject (unread). is there. Furthermore, the results for the second hypothesis are (AC. Non) and (AC.
It means that it has been accepted (accepted) as the character "non" or "normal". Therefore, the second hypothesis is valid, and the recognition result is the character "non, ordinary". Note that the four partial dictionaries 101 to 104 provided in the pattern dictionary 100 are newly provided, and conventionally only the dictionary 104 having a normal pattern was provided. The partial dictionary 101 is a dictionary of partial patterns, the partial dictionary 102 is a dictionary of patterns in which a partial pattern contacts another character, and the partial dictionary 103 is a dictionary of contact character patterns.

次に、第４図（ｃ）の分離文字パターン成分が隣接文
字に接触している場合の認識方法を説明する。Next, a recognition method when the separated character pattern component of FIG. 4 (c) is in contact with an adjacent character will be described.

この場合、第６図に示すように、切り出し結果は55の
ようになり、認識結果56は（SP.非）、（SC.非、凡）と
なる。ここで、（SC.非、凡）が文字「非」の部分パタ
ーンと文字「凡」が接触したものであることを意味し、
部分パターン辞書102を参照して認識したものである。
この結果から、読み取り文字は文字「非、凡」であるこ
とが判断できる。In this case, as shown in FIG. 6, the cutout result is 55, and the recognition result 56 is (SP. Non), (SC. Non, general). Here, (SC.non, ordinary) means that the partial pattern of the character "non" and the character "normal" are in contact,
It is recognized by referring to the partial pattern dictionary 102.
From this result, it can be determined that the read character is the character “non, ordinary”.

次に、第４図（ｄ）の分離パターン成分相互で接触し
ている場合の認識方法を説明する。Next, a recognition method when the separated pattern components shown in FIG. 4D are in contact with each other will be described.

この場合には、第７図（ａ）に示すように、二つの仮
説57,59が立ち、認識結果58,60を得る。また、この場合
には、特にサブ・パターン61、すなわち第４図（ｄ）の
38を単独で認識して、その結果62の（SS.非、凡）を得
る。仮説57はサブ・パターン38が右側に付加されたもの
と仮定した場合であり、仮説59はサブ・パターン38が左
側に付加されたものと仮定した場合である。（SP.非）
（RJ.?）は「非」のサブ・パターンとリジエクト（全く
不明）であり、（RJ.?）（SP.凡）はリジエクト（全く
不明）と「凡」のサブ・パターンである。また、（SS.
非、凡）は「非」のサブ・パターンと文字「凡」のサブ
・パターンの接触したパターンであることを意味し、部
分パターンと他の文字とが接触したパターンの辞書102
が参照される。これらの結果を総合することにより、答
えは文字「非」、「凡」であると判断される。In this case, as shown in FIG. 7 (a), two hypotheses 57 and 59 stand and recognition results 58 and 60 are obtained. Further, in this case, in particular, the sub-pattern 61, that is, in FIG.
Recognize 38 alone, resulting in 62 (SS.non, mediocre). Hypothesis 57 is the case where sub pattern 38 is assumed to be added on the right side, and hypothesis 59 is the case where sub pattern 38 is assumed to be added to the left side. (SP. Non)
(RJ.?) Is the "non" sub-pattern and Rigid (totally unknown), and (RJ.?) (SP. Ron) is the Rigid (totally unknown) and "Robo" sub-pattern. Also, (SS.
(Non, ordinary) means that the sub pattern of “non” and the sub pattern of the character “normal” are in contact with each other, and the dictionary 102 of patterns in which the partial pattern and other characters are in contact with each other.
Is referred to. By summing up these results, the answer is judged to be the letters "non" and "normal".

次に、第４図（ｅ）の完備パターン相互が接触した場
合の認識方法を説明する。Next, a recognition method when the complete patterns of FIG. 4 (e) contact each other will be described.

この場合、第８図に示すように、無理に分割せずに全
体を認識部に送り、部分パターン辞書103を参照して同
じものを捜し、認識する。その結果（CC.大、山）が得
られたが、これは文字の「大」と「山」が接触したもの
であることが判断できる。In this case, as shown in FIG. 8, the whole is sent to the recognition unit without being forcibly divided, and the same pattern is searched for and recognized by referring to the partial pattern dictionary 103. The result (CC. Dai, Yama) was obtained, but it can be judged that this is the contact between the letters "Large" and "Mountain."

次に、第４図（ｆ）の、部分パターン自体も部分パタ
ーンを統合したパターンも、文字パターンとして存在す
る場合の認識方法を説明する。Next, a recognition method in the case where the partial pattern itself and the pattern obtained by integrating the partial patterns in FIG. 4 (f) exist as character patterns will be described.

この場合、第９図に示すように、63,64の二つの仮説
が立ち、63の認識結果として（AC・イ）、（AC・ヒ）、
（AC・学）が、64の認識結果として（AC・化）、（AC・
学）が、得られる。この両者は認識結果としては対等で
あり、いずれを採用すべきかはこの段階では判断できな
い。そのため言語辞書を参照して、「イヒ学」という単
語は存在しないが、「化学」という単語は存在すること
から、認識対象文字（群）は「化学」であることが判断
できる。言語辞書としては、単語のみならず文法，修
辞，語用など各種の知識をデータ化したものが利用でき
る。In this case, as shown in FIG. 9, two hypotheses of 63 and 64 stand, and the recognition results of 63 are (AC · a), (AC · hi),
(AC / gaku) has 64 recognition results (AC / a), (AC / g)
Science) is obtained. Both of them are equal in recognition result, and it is not possible to judge which one should be adopted at this stage. Therefore, referring to the linguistic dictionary, although the word “Ichaku” does not exist, but the word “chemistry” exists, it can be determined that the recognition target character (group) is “chemistry”. As the language dictionary, it is possible to use not only words but also data of various knowledge such as grammar, rhetoric, and vocabulary.

上述のように、本発明の動作原理として、認識結果と
言語処理結果を総合的に判断して最終的な答を出す方法
の概略説明を行つたが、実際には次のような規制に従つ
て処理することにより実現される。As described above, as the operating principle of the present invention, the outline of the method of comprehensively judging the recognition result and the language processing result and giving the final answer has been described. It is realized by processing.

まず、第４図（ａ）〜（ｆ）に対して、第５図〜第９
図で処理したことを整理すると、次のようになる。First, with respect to FIGS. 4A to 4F, FIGS.
The following is a summary of what was processed in the figure.

（ａ）（AC.大）（AC.山）→（AC.大）（AC.山）（ｃ）（SP.非）（SC.非、凡）→（AC.非）（AC.凡）（ｅ）（CC.大、山）→（AC.大）（AC.山）左辺の仮説ごとの認識結果コードは、左辺のような認
識結果コードに書換えがなされる。(A) (AC. Large) (AC. Mountain) → (AC. Large) (AC. Mountain) (C) (SP. Non) (SC. Non, general) → (AC. Non) (AC. General) (E) (CC. Large, Mountain) → (AC. Large) (AC. Mountain) The recognition result code for each hypothesis on the left side is rewritten to the recognition result code on the left side.

これらを一般化して法則にしたものを、書換え規則
（rewriting rules）と呼ぶことにする。The generalized rules of these are called rewriting rules.

本発明による切り出し方式では、書換え規則が次のよ
うになる。In the cutout method according to the present invention, the rewriting rule is as follows.

R1:（▲▼.a）（▲▼.b）＋（AC.a）（AC.b）
→（AC.a）（AC.b） R2:（SP.a）（SC.a,b）→（AC.a）（AC.b） R3:（SP.a）（▲▼．＊）＋（▲▼．＊）（SP.b）→（RCG） R4:（SP.a）（▲▼．＊）＋（▲▼．＊）（SP.b）＋（SS.a.b）→（AC.a）
（AC.b） R5:（CC.a,b）→（AC.a）（AC.b） R6:（AC.p）（AC.q）（SP.b）＋（AC.a）（AC.b）→（DCT）ここで、▲▼はACでないことを意味し、AC以外の
すべてを示す。また、＊は任意の値を取り得る。（RC
G）は切り出しの曖昧性を与えているサブ・パターンの
みを認識せよという意味である。（DCT）は認識で曖昧
性が残つている場合、言語規則を参照して決定せよとい
う意味である。規則R1は、前式の（ａ）と（ｂ）に対応
するもので、a,bをアクセプト（認識）していない場所
があつても、他に一つでもアクセプトした場所があれ
ば、認識できたことにする。規則R2は、前式の（ｃ）に
対応するもので、ａのサブ・パターンが認識される一
方、ａのサブ・パターンとｂのパターンの接触が認識さ
れたときには、ａとｂがアクセプト（認識）されたこと
にする。規則R3は、前式の（ｄ）に対応するもので、ａ
のサブ・パターンが認識され、アクセプト以外の例えば
リジエクタで任意の値の候補が与えられる一方、ｂのサ
ブ・パターンが認識され、アクセプト以外の任意の値の
候補が与えられる場合には、分離されているサブ・パタ
ーンのみを認識してみることを指示する。また、規則R4
も、（ｄ）に対応するものであり、R3の規則によつて処
理されたサブ・パターンのみを認識結果を含めて、総合
的に認識する場合を示している。すなわち、ａのサブ・
パターンと認識できないパターン、及びｂのサブ・パタ
ーンと認識できないパターン、及びａのサブ・パターン
とｂのパターンの接触したパターンの三つが認識された
場合には、総合的な認識によりａアクセプト、ｂアクセ
プトとなる。規則R5は、（ｅ）に対応するもので、ａと
ｂの接触したパターンは、ａアクセプト、ｂアクセプト
となることを示す。規則R6は、（ｆ）に対応するもの
で、ｐアクセプト、ｑアクセプト、ｂアクセプトなる認
識結果が与えられる場合と、ａアクセプト、ｂアクセプ
トなる認識結果が与えられる場合との二つの仮説が肯定
された場合には、言語的な知識を参照してａアクセプ
ト、ｂアクセプトとなる。R1: (▲ ▼ .a) (▲ ▼ .b) + (AC.a) (AC.b)
→ (AC.a) (AC.b) R2: (SP.a) (SC.a, b) → (AC.a) (AC.b) R3: (SP.a) (▲ ▼. *) + (▲ ▼. *) (SP.b) → (RCG) R4: (SP.a) (▲ ▼. *) + (▲ ▼. *) (SP.b) + (SS.ab) → (AC. a)
(AC.b) R5: (CC.a, b) → (AC.a) (AC.b) R6: (AC.p) (AC.q) (SP.b) + (AC.a) (AC .b) → (DCT) where ▲ ▼ means not AC and shows all except AC. Also, * can take any value. (RC
G) means to recognize only the sub-pattern that gives the ambiguity of cutout. (DCT) means that if there is ambiguity in recognition, make a decision by referring to the language rules. Rule R1 corresponds to the above expressions (a) and (b). Even if there is a place where a and b are not accepted (recognized), if there is another accepted place, it is recognized. I will do it. The rule R2 corresponds to the above equation (c), and when the sub pattern of a is recognized and the contact between the sub pattern of a and the pattern of b is recognized, a and b are accepted ( Be recognized). Rule R3 corresponds to the above equation (d), and
When the sub-pattern of b is recognized and a candidate of any value other than accept is given, it is separated. Instruct to recognize only the sub-pattern that has Also, rule R4
Also corresponds to (d), and shows a case where only the sub-pattern processed according to the rule of R3 is comprehensively recognized including the recognition result. That is, the sub of a
When three patterns, that is, a pattern that cannot be recognized as a pattern, a pattern that cannot be recognized as a sub pattern of b, and a pattern in which the sub pattern of a and the pattern of b are in contact, are recognized, a acceptance, b Become an accept. Rule R5 corresponds to (e), and shows that the pattern in which a and b are in contact is a accept or b accept. The rule R6 corresponds to (f), and two hypotheses are affirmed, that is, a recognition result of p accept, q accept, and b accept is given, and a recognition result of a accept and b accept is given. In this case, the linguistic knowledge is referred to for a accept and b accept.

第10図は本発明の実施例を示す文字読み取り装置のブ
ロツク図である。FIG. 10 is a block diagram of a character reading device showing an embodiment of the present invention.

この文字読み取り装置は、パターン観測部800、切り
出し部900、帳票フオーマツト辞書950、パターン認識部
200、パターン辞書100、認識結果最終判定部400、認識
結果書換え規則辞書300、言語辞書500、言語処理部600
から構成される。This character reading device consists of a pattern observation unit 800, a cutout unit 900, a form format dictionary 950, and a pattern recognition unit.
200, pattern dictionary 100, recognition result final determination unit 400, recognition result rewriting rule dictionary 300, language dictionary 500, language processing unit 600
Consists of

帳票75には、第２図に示すような自然な筆記条件で文
字が記入されている。帳票75がパターン観測部800に入
力され、光電変換及び前処理（二値化，帳票スキユー補
正）を受けると、二次元映像パターンが電気的信号とし
てパターン切り出し部900に送出される。パターン切り
出し部900では、帳票フオーマツト辞書950からの枠位置
パラメータを参照して、一枚の帳票の映像から一文字に
該当すると判断されるパターンを一組ずつ切り出してパ
ターン認識部200に送出する。パターン認識部200では、
入力された一文字分のパターン（前述のようにサブ・パ
ターンや接触した二文字分のパターンの場合もある）
と、第５図に示したパターン辞書100に記憶されている
各辞書パターンと比較照合し、認識結果を最終判定部40
0に送出する。最終判定部400は、認識結果に対して書換
え規則辞書300内の各書換え規則を適用できる書換え規
則がなくなるまで順次適用し、書換えの結果に応じた処
理を行う。すなわち、前記規則R1〜R5の条件の中から記
号化された認識結果がこれに合致するものを選択適用
し、その結果を採用する。言語処理部600は、認識結果
に対して言語辞書500を参照し、未確定のまま残つてい
る場合すなわち前記規則R6に相当する結果に対する処理
を行う。Characters are written on the form 75 under natural writing conditions as shown in FIG. When the form 75 is input to the pattern observation unit 800 and subjected to photoelectric conversion and preprocessing (binarization, form skew correction), the two-dimensional image pattern is sent to the pattern cutout unit 900 as an electric signal. The pattern cutout unit 900 refers to the frame position parameter from the form format dictionary 950 and cuts out a set of patterns determined to correspond to one character from the image of one form and sends them to the pattern recognition unit 200. In the pattern recognition unit 200,
Pattern for one character that was input (may be a sub-pattern or a pattern for two characters that touched as described above)
Is compared with each dictionary pattern stored in the pattern dictionary 100 shown in FIG.
Send to 0. The final determination unit 400 sequentially applies each rewriting rule in the rewriting rule dictionary 300 to the recognition result until there are no more rewriting rules, and performs processing according to the rewriting result. That is, from the conditions of the rules R1 to R5, the symbolized recognition result that matches this is selected and applied, and the result is adopted. The language processing unit 600 refers to the language dictionary 500 for the recognition result, and when it remains undetermined, that is, performs the process for the result corresponding to the rule R6.

第10図のうち、パターン観測部800は公知の技術で実
現できるので説明を省略する。In FIG. 10, the pattern observing section 800 can be realized by a known technique, and therefore its explanation is omitted.

パターン切り出し部900以降の処理を、さらに詳しく
説明する。The process after the pattern cutout unit 900 will be described in more detail.

第11図は、第10図の切り出し処理及び認識処理のフロ
ーチヤートと対応するデータの内容を示す図である。FIG. 11 is a diagram showing the content of data corresponding to the cutout processing and recognition processing flowchart of FIG.

ステツプ701では、帳票１枚分の映像パターン711より
１行分の映像パターン712を切り出す。次に、ステツプ7
02では、黒地パターンの連続性を利用して、黒地ごとの
パターン成分を抽出し、横方向に関して順序付けを行つ
た後、成分リスト713を作成する。さらに、各成分の属
性を計算し、成分属性リスト714を作成する。なお、成
分の属性とは、各成分の上下端，左右端の座標，輪郭総
長等である。In step 701, one line of image pattern 712 is cut out from one form of image pattern 711. Next, step 7
In 02, the pattern component for each black background is extracted by utilizing the continuity of the black background pattern, and after ordering in the horizontal direction, the component list 713 is created. Furthermore, the attribute of each component is calculated, and the component attribute list 714 is created. The component attributes include the upper and lower ends of each component, the coordinates of the left and right ends, the total length of the contour, and the like.

次に、ステツプ703では、成分属性リスト714と、帳票
フオーマツト辞書950の情報から文字間の境界の仮説を
立て、文字リスト715を作成する。このリスト715は各文
字パターンがどの成分から構成されているかを示すもの
で、第11図では、第一の仮説は順序１、２、５はそれぞ
れ一つの文字、３と４とを合わせて一つの文字となり、
第二の仮説は順序５のみで一つの文字、１と２、３と４
はそれぞれ合わせて一つの文字と仮定する。Next, in step 703, a hypothesis of a boundary between characters is established from the component attribute list 714 and the information in the form format dictionary 950 to create a character list 715. This list 715 shows which component each character pattern is composed of. In FIG. 11, the first hypothesis is that the orders 1, 2, and 5 are one character, 3 and 4, respectively. Becomes one character,
The second hypothesis is the order 5 only, one letter, 1 and 2, 3 and 4
Is assumed to be one character in total.

次に、認識部のステツプ704では、成分リスト713、成
分属性リスト714及び文字リスト715を入力し、文字リス
ト715に含まれる成分を集めてパターン整合を行い、そ
の結果を結果リスト716に書き込む。整合結果を表す結
果コードは、（SP.a）、（SC.a,b）、（SS.a,b）、（C
C.a,b）、（AC.a）、（RJ.a）等の記号形式をとる。こ
れらの意味は、前述のようにそれぞれカテゴリａのサブ
・パターン、カテゴリａのサブ・パターンとカテゴリｂ
のパターンの接触したもの、カテゴリａとカテゴリｂの
サブ・パターン相互が接触したもの、カテゴリａとカテ
ゴリｂの接触したもの、カテゴリａのパターン、候補は
カテゴリａであるがリジエクトという意味を持つてい
る。Next, in step 704 of the recognition unit, the component list 713, the component attribute list 714, and the character list 715 are input, the components included in the character list 715 are collected and pattern matching is performed, and the result is written in the result list 716. The result code indicating the matching result is (SP.a), (SC.a, b), (SS.a, b), (C
It takes a symbolic form such as (Ca, b), (AC.a), (RJ.a). As described above, these meanings are the sub-pattern of category a, the sub-pattern of category a and the category b, respectively.
Pattern, contact between sub-patterns of category a and category b, contact between category a and category b, pattern of category a. Candidate category is category a but has the meaning of "reject". There is.

ステツプ705では、結果リスト716に対して、書換え規
則辞書300内部のすべての規則を参照し、適用できる規
則がなくなるまで順次適用し、最終的に得られた結果に
応じた処理を行い、その結果を結果リスト717に書き込
む。In step 705, all rules in the rewriting rule dictionary 300 are referenced to the result list 716, sequentially applied until there are no applicable rules, and the processing according to the finally obtained result is performed. Into the result list 717.

ステツプ706では、結果リスト717を調べて、ステツプ
705で（DCT）なる判定が下されていたか否かを判定し、
（DCT）が存在したときには、言語辞書500との整合を行
つて、整合した結果を結果リスト718に書き込む。結果
リスト718が最終出力となる。In step 706, the result list 717 is examined and
It is determined whether or not the determination (DCT) has been made in 705,
When (DCT) is present, the matching with the language dictionary 500 is performed and the matching result is written in the result list 718. The result list 718 is the final output.

ステツプ707では、帳票上のすべての行が終了したか
否かを判断し、終了していなければステツプ701に戻つ
て終了するまで以上の処理を繰り返し行う。In step 707, it is judged whether or not all the lines on the form are completed, and if not completed, the process returns to step 701 and the above processing is repeated until it is completed.

〔The invention's effect〕

以上説明したどとく、本発明によれば帳票の筆記条件
が緩和されて、隣接パターン相互がオーバラツプした場
合、パターンの一部が本来の位置より大幅にずれて存在
する場合、さらに隣接パターン相互が接触した場合等で
も、文字読取装置において妥当なパターンの切出し及び
認識ができるので、ユーザにとりきわめて便利となり、
益々OCRを普及させることが可能となる。As described above, according to the present invention, the writing condition of the form is relaxed, and when the adjacent patterns overlap each other, when a part of the patterns exists with a large deviation from the original position, the adjacent patterns are further separated from each other. Even in the case of contact, the character reading device can cut out and recognize a proper pattern, which is extremely convenient for the user.
It becomes possible to spread OCR more and more.

なお実施例では、文字読取装置における文字の切出し
及び認識について説明したが、本発明は文字に限らず、
音声等一般のパターンにも適用可能であることは勿論で
ある。In addition, in the embodiment, the cutting and recognition of the character in the character reading device is described, but the present invention is not limited to the character,
Of course, it can be applied to general patterns such as voice.

【図面の簡単な説明】第１図は従来のOCR用帳票の文字枠を示す図、第２図は
制限を緩和したときの文字枠を示す図、第３図は漢字パ
ターンの状態を示す図、第４図は隣接文字パターンの種
々の状態を示す図、第5,6,7,8,9図はそれぞれ本発明の
認識原理を示す説明図、第10図は本発明の実施例を示す
文字読取装置の機能ブロツク図、第11図は第10図の切出
し及び認識処理の流れを示す図である。 75……帳票、100……パターン辞書、200……パターン認
識部、300……認識結果書換え規則辞書、400……認識結
果最終判定部、500……言語辞書、600……言語処理部、
800……パターン観測部、900……パターン切出し部、95
0……帳票フオーマツト辞書。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 shows a character frame of a conventional OCR form, FIG. 2 shows a character frame when restrictions are relaxed, and FIG. 3 shows a state of a kanji pattern. FIG. 4 is a diagram showing various states of adjacent character patterns, FIGS. 5, 6, 7, 8, and 9 are explanatory diagrams showing the recognition principle of the present invention, and FIG. 10 is an embodiment of the present invention. FIG. 11 is a functional block diagram of the character reading device, and FIG. 11 is a diagram showing a flow of the cutting and recognition processing of FIG. 75 …… Form, 100 …… Pattern dictionary, 200 …… Pattern recognition section, 300 …… Recognition result rewriting rule dictionary, 400 …… Recognition result final judgment section, 500 …… Language dictionary, 600 …… Language processing section,
800 …… Pattern observation section, 900 …… Pattern cutting section, 95
0 …… Form format dictionary.

フロントページの続き (72)発明者国崎修横浜市戸塚区吉田町292番地株式会社日立製作所マイクロエレクトロニクス機器開発研究所内 (72)発明者栗野清道小田原市国府津2880番地株式会社日立製作所小田原工場内 (72)発明者門田彰三小田原市国府津2880番地株式会社日立製作所小田原工場内 (56)参考文献特開昭59−91582（ＪＰ，Ａ) 特開昭59−112367（ＪＰ，Ａ) 特開昭58−48182（ＪＰ，Ａ) 特開昭57−209574（ＪＰ，Ａ)Front page continued (72) Inventor Osamu Kunisaki 292 Yoshida-cho, Totsuka-ku, Yokohama City Hitachi Ltd., Microelectronics Device Development Laboratory (72) Inventor Kiyomichi Kurino 2880 Kozu, Odawara City Hitachi Ltd. Odawara Plant ( 72) Inventor Shozo Kadota 2880, Kozu, Odawara City, Odawara Plant, Hitachi Ltd. (56) References JP 59-91582 (JP, A) JP 59-112367 (JP, A) JP 58- 48182 (JP, A) JP-A-57-209574 (JP, A)

Claims

(57) [Claims]

1. A form which does not have a character frame for each character is input, a pattern including characters is cut out from a two-dimensional video pattern converted into an electric signal by a cutout unit, and the pattern cut out by the recognition unit is used as a pattern. A pattern cutout and recognition method that is performed by comparing with each pattern in a dictionary, wherein a region of continuous black pixels is extracted as a pattern component for a region corresponding to one character line of the two-dimensional video pattern, The order of the arrangement of the pattern components is performed, a component list having the pattern components as elements is created, the attributes of the pattern components are calculated, and characters are formed by referring to the attributes of the calculated pattern components. A plurality of hypotheses about a combination of one or more pattern components On the basis of the above, a combination of pattern components to be recognized is constructed from the component list and is recognized by collating with each pattern in the pattern dictionary. If one hypothesis is accepted, the recognition result is output as the final recognition result, and if multiple hypotheses are accepted, each recognition result based on multiple hypotheses is obtained and A pattern segmentation and recognition method characterized by outputting a recognition result based on one probable hypothesis as a final recognition result by comparing the recognition result with the contents of a language dictionary.

2. The pattern cutout and recognition according to claim 1, wherein the attributes of the pattern component to be calculated include the upper and lower ends of the pattern component, the coordinates of the left and right ends, and the total contour length. Method.

3. A partial pattern of a pattern according to claim 1, wherein in forming a combination of pattern components and matching with each pattern in a pattern dictionary, (1) the pattern component represents a one-character category. If it is determined that the pattern component is a partial pattern, and the recognition result is a signal indicating the category name of the character category, (2) the pattern component is a complete pattern representing one character category. If it is judged that there is a pattern, a display indicating a complete pattern and a signal indicating the category name of the character category that is the recognition result, and (3) a pattern in which the pattern component represents a plurality of character categories When it is determined that the touched pattern is, the display indicating the touched pattern and the Pattern extraction and recognition characterized by outputting signals indicating category names and applying a predetermined regulation to the output signals for each pattern component that is the target of the above combination to test the validity of the hypothesis Method.