JP2003162689A

JP2003162689A - Similar character recognition program and similar character recognition method

Info

Publication number: JP2003162689A
Application number: JP2001360520A
Authority: JP
Inventors: Koji Kurokawa; 浩司黒川
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2001-11-27
Filing date: 2001-11-27
Publication date: 2003-06-06

Abstract

(57) Abstract: PROBLEM TO BE SOLVED: The present invention relates to a similar character recognition program and a character recognition method for recognizing characters based on an image read from the characters to be recognized. Its object is to improve the recognition rate of similar characters by increasing the number of determination conditions: a similar character to be recognized is converted to a given character, based on information such as the positions of the reference line and the character rectangle and the rectangles that make up one character, when that character is among the conversion candidates.
SOLUTION: A similar character recognition program and a similar character recognition method cause a computer to function as: means for performing character recognition based on an image read from the characters to be recognized and generating candidates; means for determining the character to be "j" when the read character image lies below the reference line and the candidates include the character "j"; and means for determining the character to be "i" when the character image corresponding to one character is separated into two or more parts and the candidates include the character "i".

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a similar character recognition program and a similar character recognition method for recognizing characters based on an image read from the characters to be recognized.

[0002]

[Prior Art] Conventionally, similar characters among alphabetic characters have been identified, as shown in FIG. 5(a), by converting between similar characters according to features such as whether the character rectangle sits on the reference line (the line tangent to the bottoms of the characters) and how many rectangles make up one character. For example, when distinguishing "i" from "j", the character was taken to be "j" if the character rectangle extended below the reference line (see (a-) in FIG. 5). When distinguishing "i" from "l" (ell), the character was taken to be "i" if the rectangles within the one-character rectangle were separated, as in (a-2) of FIG. 5, and "l" (ell) if the one-character rectangle was not separated, as in (a-3) of FIG. 5. Only simple conversions of this kind were performed.
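
As a rough illustration of these conventional rules, the following Python sketch decides among "j", "i", and "l" from geometry alone. The `Rect` type, the `conventional_guess` function, and the convention that y grows downward are assumptions made for illustration and do not come from the source. The single-rectangle case returns "l" (ell), as the i/l discrimination implies.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Rect:
    """Vertical extent of one rectangle; y grows downward (assumed convention)."""
    top: int
    bottom: int


def conventional_guess(parts: List[Rect], baseline_y: int) -> str:
    """Prior-art style decision using geometry only (hypothetical helper).

    parts      -- rectangles found inside one character rectangle
    baseline_y -- y coordinate of the reference line
    """
    # Any rectangle reaching below the reference line is read as a descender.
    if any(p.bottom > baseline_y for p in parts):
        return "j"
    # Two or more separated rectangles: a dot above a stem.
    if len(parts) >= 2:
        return "i"
    # One unbroken rectangle resting on the reference line.
    return "l"
```

Because no recognition candidates are consulted, a degraded glyph whose dot has merged with its stem is always read as "l" here, which is the weakness the later paragraphs describe.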

[0003] In addition, the attributes of the characters surrounding the target character were determined, and the target character was converted to a character with the same attribute as its neighbors. For example, when distinguishing "1" (the digit one) from "l" (ell), several characters before and after the target were examined to see whether they were letters or digits; the target was converted to "1" (one) if they were digits and to "l" (ell) if they were letters.
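
The neighbor-attribute rule might look like the following sketch. The function name `resolve_one_or_ell`, the window of two characters on each side, and the majority vote are assumptions; the source only says that several characters before and after the target are examined.

```python
def resolve_one_or_ell(text: str, index: int, window: int = 2) -> str:
    """Choose "1" or "l" for text[index] from the attributes of its neighbors."""
    start = max(0, index - window)
    neighbors = text[start:index] + text[index + 1:index + 1 + window]
    digits = sum(ch.isdigit() for ch in neighbors)
    letters = sum(ch.isalpha() for ch in neighbors)
    if digits > letters:
        return "1"          # surrounded mostly by digits
    if letters > digits:
        return "l"          # surrounded mostly by letters
    return text[index]      # no clear majority: leave the character unchanged
```

For instance, `resolve_one_or_ell("20l5", 2)` sees only digits around the target and returns "1", while every ambiguous character inside an all-letter word stays a letter, which is why this rule cannot separate "i", "j", and "l" from one another.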

[0004]

[Problems to Be Solved by the Invention] As a result, there was a problem that changes in character shape could not be handled when the input image was a degraded image such as the one shown in FIG. 5(b).

[0005] In addition, converting similar characters based on information such as the position and number of one-character rectangles yields reasonably good results, but erroneous conversions are frequent, so the approach is not practical.

[0006] Furthermore, because the conventional methods described above use few determination conditions, conversion cannot be performed when, for example, the one-character rectangles have the same position and the same number. There was also the problem that, for example, similar characters within the alphabet all have the attribute "alphabetic", so these methods are unsuited to distinguishing similar characters within the alphabet.

[0007] To solve these problems, an object of the present invention is to improve the recognition rate of similar characters by increasing the number of determination conditions: a similar character to be recognized is converted to a given character, based on information such as the positions of the reference line and the character rectangle and the rectangles that make up one character, when that character is among the conversion candidates.

[0008]

[Means for Solving the Problems] The means for solving the problems will be described with reference to FIG. 1.

[0009] In FIG. 1, a character recognition system 1 recognizes characters based on an image read from the characters to be recognized, and comprises a character recognition means 2, a similar character recognition means 3, and other components.

[0010] The character recognition means 2 searches the character recognition dictionary 4 for the image read from the character to be recognized, performs character recognition, and generates candidates.

[0011] The similar character recognition means 3 determines similar characters based on the image corresponding to one read character.

[0012] Next, the operation will be described. The character recognition means 2 of the character recognition system 1 searches the character recognition dictionary 4 for the image read from the character to be recognized, performs character recognition, and generates candidates. The similar character recognition means 3 then determines the character to be "j" when the read character image lies below the reference line and the candidates include the character "j", and determines the character to be "i" when the character image corresponding to one character is separated into two or more parts and the candidates include the character "i".

[0013] In addition, when the image corresponding to one character is not separated and the candidates include the character "l" (ell), the character is determined to be "l" (ell).

[0014] Thus, a similar character to be recognized is converted to a given character, based on information such as the positions of the reference line and the character rectangle and the rectangles that make up one character, when that character is among the conversion candidates. This increases the number of determination conditions and makes it possible to improve the recognition rate of similar characters.

[0015]

[Embodiments of the Invention] Next, embodiments and operation of the present invention will be described in detail, in order, with reference to FIGS. 1 to 4.

[0016] FIG. 1 shows a system configuration diagram of the present invention. In FIG. 1, the character recognition system 1 executes various processes according to a program. Here it recognizes characters based on an image read from the characters to be recognized, and comprises the character recognition means 2, the similar character recognition means 3, and other components.

[0017] The character recognition means 2 searches the character recognition dictionary 4 for the image of a character to be recognized, read from a document or the like by the image input device 6, performs character recognition, and generates candidates.

[0018] The similar character recognition means 3 determines similar characters based on the image corresponding to one read character.

[0019] The character recognition dictionary 4 registers the information used for character recognition (character information corresponding to images).

[0020] The output file 5 stores the recognition results. The image input device 6, for example a scanner, reads the images of the characters to be recognized printed on a document.

[0021] The output device 7 outputs the recognition results and is, for example, a display device or a printing device.
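
One way to picture how the numbered components of FIG. 1 fit together is the sketch below. It is not the patented implementation: the class and method names are hypothetical, and a glyph image is stood in for by a plain string key so that the dictionary lookup stays trivial.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class CharacterRecognitionSystem:
    """Hypothetical wiring of the components numbered 1 to 7 in FIG. 1."""
    # Character recognition dictionary 4: image key -> ranked candidate characters.
    dictionary: Dict[str, List[str]]
    # Similar character recognition means 3: picks one character from the candidates.
    resolve_similar: Callable[[List[str]], str]
    # Output file 5: stores the recognition results.
    output_file: List[str] = field(default_factory=list)

    def recognize(self, image_key: str) -> List[str]:
        """Character recognition means 2: look the image up in the dictionary."""
        return self.dictionary.get(image_key, [])

    def run(self, scanned: List[str]) -> str:
        """Process the glyph images delivered by the image input device 6."""
        for image_key in scanned:
            candidates = self.recognize(image_key)
            if candidates:
                self.output_file.append(self.resolve_similar(candidates))
        # The joined text is what the output device 7 would display or print.
        return "".join(self.output_file)
```

Keeping the similar character step as an injected callable mirrors the separation between means 2 and means 3 in the figure: candidate generation and the final choice can be changed independently.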

[0022] Next, the configuration of FIG. 1 will be described in detail with reference to FIGS. 3 and 4, following the order of the flowchart of FIG. 2.

[0023] FIG. 2 shows a flowchart explaining the operation of the present invention. In FIG. 2, in S1, the image is read by the image input device and character recognition is performed. That is, the image input device of FIG. 1, for example a scanner, reads the image of the characters to be recognized printed on a document, and the character recognition dictionary 4 is searched for the read image to perform character recognition. At this time, the reference line of the character image, which is used in later processing, is obtained and stored. As a result, the image of the characters to be recognized has been read from the document, and candidate characters have been generated by searching the character recognition dictionary 4 based on that image.
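
The source does not say how the reference line is obtained. One simple possibility, shown purely as an assumption, is to take the median bottom edge of the character rectangles in a text line, since most characters rest on the line and descenders such as "j" are in the minority. The `Box` layout and the function name `estimate_baseline` are hypothetical.

```python
from statistics import median
from typing import List, Tuple

# (left, top, right, bottom) in image coordinates; y grows downward (assumed).
Box = Tuple[int, int, int, int]


def estimate_baseline(boxes: List[Box]) -> float:
    """Estimate the reference line of one text line as the median bottom edge."""
    if not boxes:
        raise ValueError("at least one character box is required")
    return median(bottom for _, _, _, bottom in boxes)


def extends_below(box: Box, baseline_y: float, tolerance: float = 1.0) -> bool:
    """True when the box reaches below the reference line by more than tolerance."""
    return box[3] > baseline_y + tolerance
```

The `tolerance` parameter is likewise an assumption; it only keeps one-pixel scanning noise from being read as a descender.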

[0024] In S2, the following processing is performed on characters whose recognition result was I (uppercase i), l (ell), j, or i.

[0025] In S3, it is determined whether the character rectangle lies below the reference line. If YES, the process proceeds to S31. If NO, the process proceeds to S4.

[0026] In S31, it is determined whether "j" is among the candidates. If YES, the character is determined to be "j" in S32. If NO, the process proceeds to S4.

[0027] In process 1, which consists of S2, S3, S31, and S32 above, the character is determined to be "j" when the character rectangle lies below the reference line and "j" is among the candidates. Increasing the number of recognition conditions in this way makes it possible to improve recognition accuracy (for details, see the details of process 1 in FIG. 3, described later).
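
Process 1 can be sketched as a small function that requires both conditions before answering "j". The function name `process_1`, its parameters, and the convention that y grows downward are assumptions for illustration.

```python
from typing import List, Optional


def process_1(box_bottom: float, baseline_y: float,
              candidates: List[str]) -> Optional[str]:
    """S3, S31, S32: return "j" only when both conditions hold.

    Returns None when the flow should continue to S4. With y growing
    downward, "below the reference line" means a larger y value.
    """
    below_baseline = box_bottom > baseline_y       # S3
    if below_baseline and "j" in candidates:       # S31
        return "j"                                 # S32
    return None
```

With candidates "i" and "j" and a rectangle below the line the result is "j"; with candidates "i" and "l" the same geometry yields `None`, matching the YES and NO examples given for S31.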

[0028] Next, in S4, it is determined whether the recognition result is reliable. If YES, the reliable character is adopted: for example, when the correct-reading rate of the character recognition result is 80% or more, the recognized character is adopted as the result. If NO, the process proceeds to S5.

[0029] In process 2, which consists of S4 above, the recognition result is adopted after process 1 when it is reliable (for example, a correct-reading rate of 80% or more) (for details, see the details of process 2 in FIG. 3, described later).
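
A minimal sketch of process 2 follows. The 80% figure comes from the text, where it is given only as an example; the `Candidate` tuple layout and the function name `process_2` are assumptions.

```python
from typing import List, Optional, Tuple

# (character, correct-reading probability between 0 and 1); assumed layout.
Candidate = Tuple[str, float]


def process_2(candidates: List[Candidate],
              threshold: float = 0.80) -> Optional[str]:
    """S4: adopt the best candidate when its probability reaches the threshold.

    Returns None when the flow should continue to S5.
    """
    if not candidates:
        return None
    char, probability = max(candidates, key=lambda c: c[1])
    return char if probability >= threshold else None
```

Using the figures from the process 2 examples, candidates of "i" at 80% and "j" at 50% give "i", while "i" at 50% and "j" at 40% give `None`.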

[0030] In S5, it is determined whether the one-character rectangle of the recognition result is separated into two or more rectangles. If YES, the process proceeds to S51. If NO, the process proceeds to S6.

[0031] In S51, it is further determined whether "i" is among the candidates. If YES, the character is determined to be "i" in S52. If NO, the process proceeds to S6.

[0032] In process 3, which consists of S5, S51, and S52 above, the character is determined to be "i" when the one-character rectangle of the recognition result is separated into two or more rectangles and "i" is among the candidates. Increasing the number of recognition conditions in this way makes it possible to improve recognition accuracy (for details, see the details of process 3 in FIG. 3, described later).
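
The source does not state how the separated rectangles are found. Counting connected components of the binarized glyph is one common way and is used below as an assumption. The bitmap format (equal-length strings with `#` for ink) and the names `count_components` and `process_3` are hypothetical.

```python
from typing import List, Optional


def count_components(bitmap: List[str]) -> int:
    """Count 8-connected groups of '#' pixels in a small glyph bitmap."""
    rows = len(bitmap)
    cols = len(bitmap[0]) if rows else 0
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if bitmap[r][c] != "#" or seen[r][c]:
                continue
            count += 1                      # found a new component
            seen[r][c] = True
            stack = [(r, c)]
            while stack:                    # iterative flood fill
                y, x = stack.pop()
                for ny in range(max(0, y - 1), min(rows, y + 2)):
                    for nx in range(max(0, x - 1), min(cols, x + 2)):
                        if bitmap[ny][nx] == "#" and not seen[ny][nx]:
                            seen[ny][nx] = True
                            stack.append((ny, nx))
    return count


def process_3(bitmap: List[str], candidates: List[str]) -> Optional[str]:
    """S5, S51, S52: return "i" when the glyph is split and "i" is a candidate."""
    if count_components(bitmap) >= 2 and "i" in candidates:
        return "i"
    return None                             # continue to S6


# A dot separated from its stem, and an unbroken stem.
DOTTED = [".#.", "...", ".#.", ".#.", ".#."]
SOLID = [".#.", ".#.", ".#.", ".#.", ".#."]
```

A real system would more likely reuse the rectangles already produced during character segmentation; the flood fill only keeps the example self-contained.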

[0033] In S6, it is determined whether "f" or "t" is among the candidates. If YES, the character is determined to be "f" or "t" in S61. If NO, the process proceeds to S7.

[0034] In process 4, which consists of S6 and S61 above, the character is determined to be "f" or "t" when "f" or "t" is present in the recognition result.

[0035] In S7, it is determined whether the recognition result is "i" even though the rectangle is not separated. If YES, the process proceeds to S71. If NO, the process proceeds to S8.

[0036] In S71, it is further determined whether "l" (ell) is among the candidates. If YES, the character is determined to be "l" (ell) in S72. If NO, the process proceeds to S8.

[0037] In process 5, which consists of S7, S71, and S72 above, the character is determined to be "l" (ell) when the one-character rectangle is not separated and "l" (ell) is present in the recognition result.

[0038] In S8, the result determined in processes 1 to 5 above is output. By executing steps S1 to S7 above in order, many further conditions are applied in sequence to the recognized candidates and the character that satisfies them is selected. Basing the determination on many conditions in this way makes it possible to improve recognition accuracy.
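
The flow from S2 to S8 can be sketched end to end as follows. This is an illustration under stated assumptions, not the patented implementation: the `Glyph` container and `resolve` function are hypothetical, the candidate list is assumed to be non-empty and ranked best first, the first of "f" or "t" in rank order wins at S61, and the top candidate is output at S8 when no rule fires.

```python
from dataclasses import dataclass
from typing import List, Tuple

TARGETS = {"I", "l", "j", "i"}   # S2: recognition results that are re-examined


@dataclass
class Glyph:
    """Inputs for one character (hypothetical container)."""
    candidates: List[Tuple[str, float]]  # ranked (character, probability 0..1)
    below_baseline: bool                 # rectangle lies below the reference line
    component_count: int                 # rectangles inside the one-character box


def resolve(glyph: Glyph, threshold: float = 0.80) -> str:
    """Apply processes 1 to 5 in flowchart order and return the output of S8."""
    chars = [ch for ch, _ in glyph.candidates]
    top_char, top_prob = glyph.candidates[0]
    if top_char not in TARGETS:                        # S2
        return top_char
    if glyph.below_baseline and "j" in chars:          # S3, S31 -> S32
        return "j"
    if top_prob >= threshold:                          # S4
        return top_char
    if glyph.component_count >= 2 and "i" in chars:    # S5, S51 -> S52
        return "i"
    for ch in chars:                                   # S6 -> S61
        if ch in ("f", "t"):
            return ch
    if (glyph.component_count < 2 and top_char == "i"
            and "l" in chars):                         # S7, S71 -> S72
        return "l"
    return top_char                                    # S8 (assumed fallback)
```

The order of the checks carries the logic: a confident top candidate at S4 short-circuits the later geometric rules, whereas the "j" rule at S3 is applied before confidence is considered at all.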

[0039] FIGS. 3 and 4 are explanatory diagrams of the present invention. The details of process 1 through process 5 in FIGS. 3 and 4 show specific examples of process 1 through process 5 in the flowchart of FIG. 2 described above.

[0040] Details of process 1: for the character rectangle shown on the left, an example of YES at S31 of FIG. 2 is first candidate "i" and second candidate "j", for which the determination result at S32 is "j".

[0041] An example of NO at S31 of FIG. 2 is first candidate "i" and second candidate "l", for which the process proceeds to S4.

[0042] Details of process 2: for the character rectangle shown on the left, an example of YES at S4 of FIG. 2 is first candidate "i" with a correct-reading probability of 80% and second candidate "j" with a correct-reading probability of 50%, for which the determination result is "i".

[0043] An example of NO at S4 of FIG. 2 is first candidate "i" with a correct-reading probability of 50% and second candidate "j" with a correct-reading probability of 40%, for which the process proceeds to S5.

[0044] Details of process 3: for the character rectangle shown on the left, an example of YES at S51 of FIG. 2 is first candidate "j" with a correct-reading probability of 70% and second candidate "i" with a correct-reading probability of 50%, for which the determination result at S52 is "i".

[0045] An example of NO at S51 of FIG. 2 is first candidate "j" with a correct-reading probability of 70% and second candidate "l" with a correct-reading probability of 50%. Because "i" is not among the candidates, the process proceeds to S6.

[0046] Details of process 4: an example of YES at S6 of FIG. 2 is first candidate "i" and second candidate "t", for which the determination result at S61 is "t".

[0047] A second example of YES at S6 of FIG. 2 is first candidate "i" and second candidate "f", for which the determination result at S61 is "f".

[0048] An example of NO at S6 of FIG. 2 is first candidate "i" and second candidate "l". Because neither "t" nor "f" is present, the process proceeds to S7.

[0049] Details of process 5: for the character rectangle shown on the left, an example of YES at S71 of FIG. 2 is first candidate "i" and second candidate "l" (ell), for which the determination result at S72 is "l" (ell).

[0050] An example of NO at S71 of FIG. 2 is first candidate "i" and second candidate "j", for which the process proceeds to S8.

[0051]

[Effects of the Invention] As described above, according to the present invention, a similar character to be recognized is converted to a given character, based on information such as the positions of the reference line and the character rectangle and the rectangles that make up one character, when that character is among the conversion candidates. This increases the number of determination conditions and makes it possible to improve the recognition rate of similar characters.

[Brief description of drawings]

FIG. 1 is a system configuration diagram of the present invention.

FIG. 2 is a flowchart explaining the operation of the present invention.

FIG. 3 is an explanatory diagram (part 1) of the present invention.

FIG. 4 is an explanatory diagram (part 2) of the present invention.

FIG. 5 is an explanatory diagram of the prior art.

[Explanation of symbols]

1: Character recognition system
2: Character recognition means
3: Similar character determination means
4: Character recognition dictionary
5: Output file
6: Image input device
7: Output device

Claims

[Claims]

1. A similar character recognition program for recognizing a character based on an image read from a character to be recognized, the program causing a computer to function as: means for performing character recognition based on the image read from the character to be recognized and generating candidates; means for determining the character to be the character j when the read character image to be recognized is below a reference line and the character j is included in the candidates; and means for determining the character to be the character i when the character image corresponding to one character of the read recognition target is separated into two or more parts and the character i is included in the candidates.

2. The similar character recognition program according to claim 1, further causing the computer to function as means for determining the character to be the character l (ell) when the image corresponding to one character of the read recognition target is not separated and the character l (ell) is included in the candidates.

3. A similar character recognition method for recognizing a character based on an image read from a character to be recognized, the method comprising: a step of performing character recognition based on the image read from the character to be recognized and generating candidates; a step of determining the character to be the character j when the read character image to be recognized is below a reference line and the character j is included in the candidates; and a step of determining the character to be the character i when the character image corresponding to one character of the read recognition target is separated into two or more parts and the character i is included in the candidates.