JP2006343797A

JP2006343797A - Character recognition device, character recognition method, and computer program

Info

Publication number: JP2006343797A
Application number: JP2005166416A
Authority: JP
Inventors: Masayuki Kawasaki; 正幸川崎; Kazuhiro Shibuya; 和宏澁谷; Tomotaka Takarabe; 倫孝財部
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 2005-06-07
Filing date: 2005-06-07
Publication date: 2006-12-21

Abstract

PROBLEM TO BE SOLVED: To realize higher character recognition accuracy in a structure equipped with a user dictionary. SOLUTION: Document image is input, and character recognition is performed by collating selected one of a plurality of used dictionaries with an updating dictionary GU. The updating dictionary GU has a data structure which uses groups of each item F1, F2, F3, F4 of "character code", "correspondence dictionary identification information", "number of times of updating", and "feature quantity" as one record. Such one record expresses one character. COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、文字画像の認識を行なう技術に関する。 The present invention relates to a technique for recognizing a character image.

文字認識装置は、文字画像と文字コードとを結びつける認識辞書を備え、文字画像を認識辞書と照合することにより文字認識を行なっていた。従来より、文字認識装置には、フォント別等によって複数種類の認識辞書を備える構成のものがある（以下の特許文献１）。 The character recognition device includes a recognition dictionary that links a character image and a character code, and performs character recognition by collating the character image with the recognition dictionary. 2. Description of the Related Art Conventionally, some character recognition devices have a configuration including a plurality of types of recognition dictionaries depending on fonts (Patent Document 1 below).

上記複数の認識辞書は、予め用意された、すなわちシステム内蔵の認識辞書（以下、「システム辞書」と呼ぶ）であるが、文字認識装置は、システム辞書のほかに、作業者自身が管理できる認識辞書（以下、「ユーザ辞書」と呼ぶ）を備える構成のものもある（以下の特許文献２を参照）。ユーザ辞書を備えることで、認識精度の向上が図られている。 The plurality of recognition dictionaries are prepared in advance, that is, recognition dictionaries built in the system (hereinafter referred to as “system dictionaries”). The character recognition device recognizes that the operator can manage in addition to the system dictionaries. Some have a dictionary (hereinafter referred to as a “user dictionary”) (see Patent Document 2 below). By providing the user dictionary, recognition accuracy is improved.

特開平１１−８５９０８号公報JP-A-11-85908 特開平６−２６６９０２号公報JP-A-6-266902

しかしながら、上記ユーザ辞書を備えた技術では、ユーザ辞書に文字画像を追加することにより、いままで認識できていた類似の文字が認識不能となる場合があり、文字認識の精度を必ずしも充分に高めることができないという問題があった。というのも、誤認識があったためにユーザ辞書に文字画像と正しい文字コードを追加した場合に、その追加した文字画像が、誤認識を行なった辞書とは異なる辞書に登録された文字パターンと類似することがあり、いままでこの異なる辞書で認識できていた文字が認識不能となるためである。 However, in the technology provided with the above user dictionary, adding a character image to the user dictionary may make it impossible to recognize similar characters that have been recognized so far, and the accuracy of character recognition is necessarily sufficiently increased. There was a problem that could not. This is because when a character image and a correct character code are added to the user dictionary due to misrecognition, the added character image is similar to a character pattern registered in a dictionary different from the dictionary that performed the misrecognition. This is because the characters that could be recognized by this different dictionary until now become unrecognizable.

本発明の解決しようとする課題は、ユーザ辞書を備えた構成において、より高い文字認識精度を実現することにある。 The problem to be solved by the present invention is to realize higher character recognition accuracy in a configuration including a user dictionary.

前述した課題の少なくとも一部を解決するための手段として、以下に示す構成をとった。 As means for solving at least a part of the problems described above, the following configuration is adopted.

本発明の文字認識装置は、
文字認識用の辞書として、予め登録内容の固定された第１の辞書と登録内容を更新可能な第２の辞書とを備え、文書画像を入力して、該文書画像に表わされる文字画像を前記第１の辞書および第２の辞書と照合することにより文字認識を行なう文字認識装置であって、
前記第１の辞書として、文字画像の特徴量と文字コードとを対応付けて格納する複数種類の辞書を記憶する第１の辞書記憶手段と、
前記第２の辞書を更新する第２の辞書更新手段と
を備え、
前記第２の辞書更新手段は、
前記文書画像から１文字の文字画像を切り出す文字画像切り出し手段と、
前記文字画像切り出し手段により切り出された文字画像に対して文字コードを指定する文字コード指定手段と、
前記第１の辞書記憶手段に記憶された複数種類の辞書から一の辞書を選択する辞書選択手段と、
前記文字画像切り出し手段により切り出された文字画像の特徴量を得る特徴量取得手段と、
前記辞書選択手段により選択された辞書から前記文字コード指定手段により指定された文字コードに対応する特徴量を読み出して、該読み出された特徴量を、前記特徴量取得手段により得た特徴量でもって補正する特徴量補正手段と、
前記文字コード指定手段により指定された文字コードと、前記特徴量補正手段による補正後の特徴量と、前記辞書選択手段により選択された辞書を識別するための識別情報との組を１文字分のデータとして前記第２の辞書に追加するデータ追加手段と
を備えることを特徴としている。 The character recognition device of the present invention is
As a dictionary for character recognition, a first dictionary whose registration contents are fixed in advance and a second dictionary that can update the registration contents are provided. A document image is input, and the character image represented in the document image is converted into the character image. A character recognition device that performs character recognition by collating with a first dictionary and a second dictionary,
A first dictionary storage means for storing a plurality of types of dictionaries that store character image feature quantities and character codes in association with each other as the first dictionary;
Second dictionary updating means for updating the second dictionary,
The second dictionary updating means includes
A character image cutting means for cutting out a character image of one character from the document image;
A character code specifying means for specifying a character code for the character image cut out by the character image cut-out means;
Dictionary selection means for selecting one dictionary from a plurality of types of dictionaries stored in the first dictionary storage means;
Feature quantity acquisition means for obtaining a feature quantity of the character image cut out by the character image cutout means;
The feature quantity corresponding to the character code designated by the character code designation means is read from the dictionary selected by the dictionary selection means, and the read feature quantity is the feature quantity obtained by the feature quantity acquisition means. A feature amount correcting means for correcting by,
A set of a character code designated by the character code designation unit, a feature amount after correction by the feature amount correction unit, and identification information for identifying the dictionary selected by the dictionary selection unit is set for one character. And a data adding means for adding the data to the second dictionary as data.

上記構成の文字認識装置によれば、文字画像は第１の辞書および第２の照合することにより文字認識がなされるが、さらに、第１の辞書としては、複数の辞書を備え、第２の辞書は更新がなされる。その第２の辞書の更新は、文字画像の切り出しと、その切り出した文字画像に対する文字コードの指定と、第１の辞書としての複数種類の辞書から一の辞書を選択することを受けて行なわれる。その切り出した文字画像の特徴量を取得して、上記選択された辞書から前記指定された文字コードに対応する特徴量を読み出して、その読み出された特徴量を前記取得した特徴量でもって補正を行なう。 According to the character recognition device having the above configuration, the character image is recognized by collating with the first dictionary and the second dictionary. The first dictionary further includes a plurality of dictionaries, The dictionary is updated. The update of the second dictionary is performed in response to the extraction of the character image, the specification of the character code for the extracted character image, and the selection of one dictionary from a plurality of types of dictionaries as the first dictionary. . The feature amount of the cut character image is acquired, the feature amount corresponding to the specified character code is read from the selected dictionary, and the read feature amount is corrected with the acquired feature amount. To do.

このために、辞書別に特徴量の補正がなされることから、誤認識があったために第２の辞書に文字画像と正しい文字コードを追加した場合に、その追加した文字画像が、誤認識を行なった辞書とは異なる辞書を用いた文字認識で利用されることがない。したがって、第２の辞書に文字画像を追加することにより、いままで認識できていた類似の文字が認識不能となることがないことから、文字認識の精度を充分に高めることができる。 For this reason, since the feature amount is corrected for each dictionary, when a character image and a correct character code are added to the second dictionary due to erroneous recognition, the added character image performs erroneous recognition. It is not used for character recognition using a dictionary different from the dictionary. Therefore, by adding a character image to the second dictionary, similar characters that have been recognized so far will not become unrecognizable, so that the accuracy of character recognition can be sufficiently increased.

前記構成の文字認識装置において、前記文字コード指定手段により指定された文字コードと、前記辞書選択手段により選択された辞書を識別するための識別情報とを備える１文字分のデータが前記第２の辞書に登録済みであるか否かを判定する登録判定手段と、前記登録判定手段により登録済みでないと判定されたときに、該登録済みのデータを前記特徴量補正手段による補正後の特徴量でもって修正するデータ修正手段とを備える構成としてもよい。 In the character recognition device having the above configuration, the data for one character including the character code specified by the character code specifying means and the identification information for identifying the dictionary selected by the dictionary selecting means is the second data. A registration determination unit that determines whether or not the dictionary has already been registered, and when the registration determination unit determines that it has not been registered, the registered data is used as a feature amount corrected by the feature amount correction unit. It is good also as a structure provided with the data correction means corrected by it.

この構成によれば、文字コードと辞書の識別情報をキーとして第２の辞書に既に登録がなされている場合に、その登録がなされている情報の修正を行なうことができる。 According to this configuration, when the registration is already made in the second dictionary using the character code and the dictionary identification information as a key, the registered information can be corrected.

また、前記文字認識装置において、前記文字コード指定手段は、作業者の操作指令に応じて入力された文字を表わす文字コードを指定する構成であり、前記辞書選択手段は、作業者の操作指令に応じて指定された辞書を選択する構成としてもよい。 In the character recognition device, the character code designating unit is configured to designate a character code representing a character input in response to an operator operation command, and the dictionary selecting unit is configured to respond to the operator operation command. It is good also as a structure which selects the dictionary designated according to it.

この構成によれば、作業者の意図する文字コードと辞書の識別情報を第２の辞書に登録することが容易にできる。 According to this configuration, it is possible to easily register the character code intended by the worker and the identification information of the dictionary in the second dictionary.

前記文字認識装置において、前記文書画像から１文字の文字画像を順次切り出して、該切り出された文字画像の特徴量を演算して、前記第１の辞書記憶手段に記憶された複数の辞書から選択された使用辞書と前記第２の辞書の中から前記演算して求めた特徴量に近い文字コードを選択することにより前記文字認識を行なう文字認識手段を備え、前記辞書選択手段は、前記使用辞書を選択した辞書とする構成としてもよい。 In the character recognition device, a character image of one character is sequentially cut out from the document image, a feature amount of the cut-out character image is calculated, and selected from a plurality of dictionaries stored in the first dictionary storage means A character recognition unit that performs character recognition by selecting a character code that is close to the feature amount obtained by the calculation from the used dictionary and the second dictionary, and the dictionary selection unit includes the use dictionary It is good also as a structure made into the selected dictionary.

この構成によれば、文字認識において使用した辞書を直ちに指定することができることから、構成が簡単である。 According to this configuration, since the dictionary used in character recognition can be immediately specified, the configuration is simple.

前記特徴量補正手段は、前記辞書から読み出された特徴量と前記特徴量取得手段により得た特徴量との平均値を求める平均化手段を備える構成としてもよい。この構成によれば、特徴量の補正が容易である。 The feature amount correction unit may include an averaging unit that calculates an average value of the feature amount read from the dictionary and the feature amount obtained by the feature amount acquisition unit. According to this configuration, the feature amount can be easily corrected.

本発明の文字認識方法は、
文字認識用の辞書として、予め登録内容の固定された第１の辞書と登録内容を更新可能な第２の辞書とを用意して、文書画像を入力して、該文書画像に表わされる文字画像を前記第１の辞書および第２の辞書と照合することにより文字認識を行なう文字認識方法であって、
（ａ）前記第１の辞書として、文字画像の特徴量と文字コードとを対応付けて格納する複数種類の辞書を記憶装置に予め記憶する行程と、
（ｂ）前記第２の辞書を更新する行程と
を備え、
前記行程（ｂ）は、
（ｂ−１）前記文書画像から１文字の文字画像を切り出す行程と、
（ｂ−２）前記行程（ｂ−１）により切り出された文字画像に対して文字コードを指定する行程と、
（ｂ−３）前記第１の辞書としての複数種類の辞書から一の辞書を選択する行程と、
（ｂ−４）前記行程（ｂ−１）により切り出された文字画像の特徴量を得る行程と、
（ｂ−５）前記行程（ｂ−３）により選択された辞書から前記行程（ｂ−２）により指定された文字コードに対応する特徴量を読み出して、該読み出された特徴量を、前記行程（ｂ−４）により得た特徴量でもって補正する行程と、
（ｂ−６）前記行程（ｂ−２）により指定された文字コードと、前記行程（ｂ−５）による補正後の特徴量と、前記行程（ｂ−３）により選択された辞書を識別するための識別情報との組を１文字分のデータとして前記第２の辞書に追加する行程と
を備えることを特徴としている。 The character recognition method of the present invention includes:
As a dictionary for character recognition, a first dictionary whose registration contents are fixed in advance and a second dictionary whose registration contents can be updated are prepared, a document image is input, and a character image represented in the document image A character recognition method for recognizing characters by comparing them with the first dictionary and the second dictionary,
(A) a step of previously storing a plurality of types of dictionaries for storing the character image feature quantity and the character code in association with each other as the first dictionary;
(B) a step of updating the second dictionary;
The step (b) is:
(B-1) a step of cutting out one character image from the document image;
(B-2) a step of designating a character code for the character image cut out in the step (b-1);
(B-3) a step of selecting one dictionary from a plurality of types of dictionaries as the first dictionary;
(B-4) a step of obtaining a feature amount of the character image cut out in the step (b-1);
(B-5) A feature amount corresponding to the character code specified in the step (b-2) is read from the dictionary selected in the step (b-3), and the read feature amount is A process of correcting with the feature amount obtained by the process (b-4);
(B-6) The character code designated by the step (b-2), the corrected feature amount by the step (b-5), and the dictionary selected by the step (b-3) are identified. And a step of adding a set of the identification information to the second dictionary as data for one character.

本発明のコンピュータプログラムは、
文書画像を入力して、該文書画像に表わされる文字画像を、予め登録内容の固定された第１の辞書および登録内容を更新可能な第２の辞書と照合することにより文字認識を行なうためのコンピュータプログラムであって、
（ａ）前記第１の辞書として、文字画像の特徴量と文字コードとを対応付けて格納する複数種類の辞書を記憶装置に予め記憶する機能と、
（ｂ）前記第２の辞書を更新する機能と
をコンピュータに実現させるとともに、
前記機能（ｂ）は、
（ｂ−１）前記文書画像から１文字の文字画像を切り出す機能と、
（ｂ−２）前記機能（ｂ−１）により切り出された文字画像に対して文字コードを指定する機能と、
（ｂ−３）前記第１の辞書としての複数種類の辞書から一の辞書を選択する機能と、
（ｂ−４）前記機能（ｂ−１）により切り出された文字画像の特徴量を得る機能と、
（ｂ−５）前記機能（ｂ−３）により選択された辞書から前記機能（ｂ−２）により指定された文字コードに対応する特徴量を読み出して、該読み出された特徴量を、前記機能（ｂ−４）により得た特徴量でもって補正する機能と、
（ｂ−６）前記機能（ｂ−２）により指定された文字コードと、前記機能（ｂ−５）による補正後の特徴量と、前記機能（ｂ−３）により選択された辞書を識別するための識別情報との組を１文字分のデータとして前記第２の辞書に追加する機能と
を備えることを特徴としている。 The computer program of the present invention is:
For performing character recognition by inputting a document image and collating the character image represented in the document image with a first dictionary whose registration contents are fixed in advance and a second dictionary whose registration contents can be updated. A computer program,
(A) a function of preliminarily storing in the storage device a plurality of types of dictionaries that store character image feature quantities and character codes in association with each other as the first dictionary;
(B) causing the computer to realize the function of updating the second dictionary;
The function (b) is
(B-1) a function of cutting out one character image from the document image;
(B-2) a function of designating a character code for the character image cut out by the function (b-1);
(B-3) a function of selecting one dictionary from a plurality of types of dictionaries as the first dictionary;
(B-4) a function of obtaining a feature amount of the character image cut out by the function (b-1);
(B-5) A feature amount corresponding to the character code designated by the function (b-2) is read from the dictionary selected by the function (b-3), and the read feature amount is A function of correcting with the feature amount obtained by the function (b-4);
(B-6) The character code specified by the function (b-2), the feature value after correction by the function (b-5), and the dictionary selected by the function (b-3) are identified. And a function of adding a set of the identification information to the second dictionary as data for one character.

本発明の文字認識方法およびコンピュータプログラムによっても、本発明の文字認識装置と同様に、登録内容の固定された第１の辞書に対して登録内容を更新可能な第２の辞書を備えた構成において、より高い文字認識精度を実現することができるという効果を奏する。 According to the character recognition method and the computer program of the present invention, as in the character recognition device of the present invention, the second dictionary capable of updating the registered contents with respect to the first dictionary with the fixed registered contents is provided. As a result, it is possible to achieve higher character recognition accuracy.

本発明の記録媒体は、本発明のコンピュータプログラムを記録したコンピュータ読み取り可能な記録媒体を特徴としている。本記録媒体は、本発明のコンピュータプログラムと同様な作用・効果を有している。 The recording medium of the present invention is characterized by a computer-readable recording medium that records the computer program of the present invention. This recording medium has the same operations and effects as the computer program of the present invention.

本発明は、以下のような他の態様も含んでいる。その第１の態様は、本発明のコンピュータプログラムを通信経路を介して供給するプログラム供給装置としての態様である。この第１の態様では、コンピュータプログラムをコンピュータネットワーク上のサーバなどに置き、通信経路を介して、必要なプログラムをコンピュータにダウンロードし、これを実行することで、上記の方法や装置を実現することができる。 The present invention includes other aspects as follows. The 1st aspect is an aspect as a program supply apparatus which supplies the computer program of this invention via a communication path. In this first aspect, the above-described method and apparatus are realized by placing a computer program on a server or the like on a computer network, downloading a necessary program to a computer via a communication path, and executing the program. Can do.

本発明を実施するための最良の形態を実施例に基づき説明する。この実施例を、次の順序に従って説明する。
１．装置の構成：
２．コンピュータ処理：
３．作用・効果
４．他の実施形態： The best mode for carrying out the present invention will be described based on examples. This embodiment will be described in the following order.
1. Device configuration:
2. Computer processing:
3. Action / effect 4. Other embodiments:

１．装置の構成：
図１は、本発明の一実施例を適用するコンピュータシステムの概略構成を示す説明図である。この実施例のコンピュータシステムは、本発明の文字認識装置を構成するパーソナルコンピュータ１０を中心に備え、その周辺装置として、ディスプレイ２０とキーボード２２とマウス２４を備える。さらに、パーソナルコンピュータ１０には、スキャナ２６とプリンタ２８が接続されている。 1. Device configuration:
FIG. 1 is an explanatory diagram showing a schematic configuration of a computer system to which an embodiment of the present invention is applied. The computer system of this embodiment mainly includes a personal computer 10 constituting the character recognition device of the present invention, and includes a display 20, a keyboard 22, and a mouse 24 as peripheral devices. Furthermore, a scanner 26 and a printer 28 are connected to the personal computer 10.

パーソナルコンピュータ１０は、中央演算処理装置としてのＣＰＵ１１を中心にバス１２により相互に接続されたメモリ１３、ハードディスクドライブ１４、入力制御ユニット１５、表示制御ユニット１６等を備える。メモリ１３は、各種データ等を記憶するもので、ＣＰＵ１１の作業領域となる。 The personal computer 10 includes a memory 13, a hard disk drive 14, an input control unit 15, a display control unit 16, and the like connected to each other by a bus 12 around a CPU 11 as a central processing unit. The memory 13 stores various data and becomes a work area of the CPU 11.

ハードディスクドライブ１４は、文字認識装置のソフトウェアとしてのコンピュータプログラムＰＲと、文字認識用の辞書を記憶する。文字認識用辞書としては、予め用意されたシステム辞書と、作業者自身が管理できるユーザ辞書とを備える。システム辞書は、予め登録内容の固定されたもので、本発明の「第１の辞書」に相当する。ユーザ辞書は、登録内容を更新可能なもので、本発明の「第２の辞書」に相当する。システム辞書としては、基本認識辞書Ｇ１、つぶれ文字認識辞書Ｇ２、かすれ文字認識辞書Ｇ３、明朝体認識辞書Ｇ４、ゴシック体認識辞書Ｇ５等のｎ個（ｎは複数）の辞書が用意されている。すなわち、システム辞書としては、基本的な認識辞書Ｇ１と、文字のつぶれ・かすれの程度を変えたつぶれ文字認識辞書Ｇ２、かすれ文字認識辞書Ｇ３等と、文字のフォントを変えた明朝体認識辞書Ｇ４、ゴシック体認識辞書Ｇ５等が用意されている。ユーザ辞書は、更新辞書ＧＵである。なお、このハードディスクドライブ１４が、本発明の「第１の辞書記憶手段」に相当する。 The hard disk drive 14 stores a computer program PR as software of the character recognition device and a dictionary for character recognition. The character recognition dictionary includes a system dictionary prepared in advance and a user dictionary that can be managed by the operator. The system dictionary has registered contents fixed in advance, and corresponds to the “first dictionary” of the present invention. The user dictionary can update the registered contents, and corresponds to the “second dictionary” of the present invention. As the system dictionary, n (n is a plurality) dictionaries such as a basic recognition dictionary G1, a collapsed character recognition dictionary G2, a faint character recognition dictionary G3, a Mincho recognition dictionary G4, and a Gothic recognition dictionary G5 are prepared. . That is, as a system dictionary, a basic recognition dictionary G1, a collapsed character recognition dictionary G2 in which the degree of character crushing / fading is changed, a blurred character recognition dictionary G3, etc., and a Mincho recognition dictionary in which the font of characters is changed. G4, Gothic recognition dictionary G5, etc. are prepared. The user dictionary is an update dictionary GU. The hard disk drive 14 corresponds to the “first dictionary storage means” of the present invention.

辞書Ｇ１〜Ｇｎは、文字画像の特徴量と文字コードとを文字毎に対応付けて格納するものである。「特徴量」とは、文字画像の形状を数値化して表現するためのものである。特徴量としては、この実施例では、周知のメッシュ特徴を利用している。メッシュ特徴は、文字領域を複数の小領域に分割し、その小領域に含まれる黒画素の数を特徴量にするものである。 The dictionaries G1 to Gn store character image feature quantities and character codes in association with each character. The “feature amount” is for expressing the shape of the character image in numerical form. In this embodiment, a known mesh feature is used as the feature amount. In the mesh feature, a character region is divided into a plurality of small regions, and the number of black pixels included in the small region is used as a feature amount.

更新辞書ＧＵは、システム辞書としての辞書Ｇ１〜Ｇｎに登録されている文字についての特徴量を補正するための辞書で、ユーザによって文字の登録（追加）・修正がなされる。更新辞書ＧＵの構成については後述する。また、ハードディスクドライブ１４は、単語辞書ＧＷと登録画像情報ファイルＰＦを記憶するが、これらについても後述する。 The update dictionary GU is a dictionary for correcting the feature amount of characters registered in the dictionaries G1 to Gn serving as system dictionaries. Characters are registered (added) and corrected by the user. The configuration of the update dictionary GU will be described later. The hard disk drive 14 stores a word dictionary GW and a registered image information file PF, which will be described later.

スキャナ２６は、写真、文書などの原稿を光学的に走査（スキャン）して、その反射光や透過光の強度を計り、これをアナログ−デジタル変換して画像（イメージ）をコンピュータに取込むためのデバイスである。ここでは、文書を原稿として、文書画像データＤＤを得るために用いられる。 The scanner 26 optically scans a manuscript such as a photograph or document, measures the intensity of the reflected light or transmitted light, and converts the light into analog-digital data to capture an image (image) into a computer. Device. Here, it is used to obtain document image data DD using a document as a document.

入力制御ユニット１５は、キーボード２２やマウス２４から入力操作を取り込むと共に、スキャナ２６から文書画像データＤＤを取り込む制御部である。表示制御ユニット１６は、ディスプレイ２０への信号出力を制御する制御部である。 The input control unit 15 is a control unit that captures input operations from the keyboard 22 and the mouse 24 and captures document image data DD from the scanner 26. The display control unit 16 is a control unit that controls signal output to the display 20.

コンピュータプログラムＰＲは、もともとは、記録媒体としてのＣＤ−ＲＯＭ（図示せず）に記憶されており、所定のインストールプログラムを起動することで、そのＣＤ−ＲＯＭからハードディスクドライブ１４にインストールされたものである。システム辞書としての辞書Ｇ１〜Ｇｎも同様に、ＣＤ−ＲＯＭからハードディスクドライブ１４にインストールされたものである。上記コンピュータプログラムＰＲをＣＰＵ１１が実行することにより、本発明の文字認識装置の各種構成要件は実現される。 The computer program PR is originally stored in a CD-ROM (not shown) as a recording medium, and is installed in the hard disk drive 14 from the CD-ROM by starting a predetermined installation program. is there. Similarly, dictionaries G1 to Gn as system dictionaries are installed in the hard disk drive 14 from the CD-ROM. Various constituent requirements of the character recognition device of the present invention are realized by the CPU 11 executing the computer program PR.

図１では、各種構成要件が、ＣＰＵ１１の内部で実現される機能のブロックによって示されている。ＣＰＵ１１は、コンピュータプログラムＰＲを実行することにより、文書画像データＤＤを入力して、文書画像データＤＤに表わされる文字画像を辞書Ｇ１〜Ｇｎおよび更新辞書ＧＵと照合することにより文字認識を行なうと共に、更新辞書ＧＵの更新を行なう。更新辞書ＧＵの更新は、第２の辞書更新部１８を用いて行なう。第２の辞書更新部１８は、文字画像切り出し部１８ａ、文字コード指定部１８ｂ、辞書選択部１８ｃ、特徴量取得部１８ｄ、特徴量補正部１８ｅ、データ追加部１８ｆを用いて行なわれる。各部１８ａ〜１８ｆの働きは後ほど詳述する。 In FIG. 1, various constituent requirements are indicated by functional blocks implemented in the CPU 11. The CPU 11 inputs the document image data DD by executing the computer program PR, and performs character recognition by collating the character image represented in the document image data DD with the dictionaries G1 to Gn and the update dictionary GU. The update dictionary GU is updated. The update dictionary GU is updated using the second dictionary update unit 18. The second dictionary update unit 18 is performed using a character image cutout unit 18a, a character code designation unit 18b, a dictionary selection unit 18c, a feature amount acquisition unit 18d, a feature amount correction unit 18e, and a data addition unit 18f. The function of each part 18a-18f will be described in detail later.

なお、コンピュータプログラムＰＲは、ＣＤ−ＲＯＭに替えて、フレキシブルディスク、光磁気ディスク、ＩＣカード等の他の携帯型記録媒体（可搬型記録媒体）に格納された構成として、これらから提供されたものとすることができる。また、このコンピュータプログラムＰＲは、外部のネットワークに接続される特定のサーバから、ネットワークを介して提供されたものとすることもできる。上記ネットワークとしては、インターネットであってもよく、特定のホームページからダウンロードして得たコンピュータプログラムであってもよい。あるいは、電子メールの添付ファイルの形態で供給されたコンピュータプログラムであってもよい。 The computer program PR is provided as a configuration stored in another portable recording medium (portable recording medium) such as a flexible disk, a magneto-optical disk, and an IC card in place of the CD-ROM. It can be. The computer program PR can also be provided via a network from a specific server connected to an external network. The network may be the Internet or a computer program obtained by downloading from a specific homepage. Alternatively, it may be a computer program supplied in the form of an email attachment.

２．コンピュータ処理：
コンピュータプログラムＰＲに従う制御処理について、以下詳細に説明する。図２は、この制御処理のメインルーチンを示すフローチャートである。このメインルーチンは、コンピュータプログラムＰＲを起動させる旨の指示がなされたときに、実行開始される。図示するように、処理が開始されると、ＣＰＵ１１は、まず、作業用のウィンドウＷＤをＣＲＴディスプレイ１２に表示する処理を行なう（ステップＳ１００）。 2. Computer processing:
The control process according to the computer program PR will be described in detail below. FIG. 2 is a flowchart showing a main routine of this control process. This main routine is started when an instruction to start the computer program PR is given. As shown in the figure, when the process is started, the CPU 11 first performs a process of displaying a work window WD on the CRT display 12 (step S100).

図３は、このウィンドウＷＤを示す説明図である。図示するように、ウィンドウＷＤには、メニューバーＢＲ１とツールバーＢＲ２とが設けられている。ツールバーＢＲ２には、作業手順に従った順にコマンドを実行するための「スキャン」、「認識」、「保存」のボタンＢＴ１，ＢＴ２，ＢＴ３が備えられている。ツールバーＢＲ２の下方には、文書画像表示用のフィールドＦＤ１と認識結果表示用のフィールドＦＤ２とが設けられている。各フィールドＦＤ１，ＦＤ２は、図では表示がなされた状態であるが、ステップＳ１００の時点では空欄である。 FIG. 3 is an explanatory diagram showing the window WD. As illustrated, the window WD is provided with a menu bar BR1 and a tool bar BR2. The toolbar BR2 is provided with “scan”, “recognition”, and “save” buttons BT1, BT2, and BT3 for executing commands in the order according to the work procedure. Below the toolbar BR2, a document image display field FD1 and a recognition result display field FD2 are provided. Each field FD1, FD2 is displayed in the figure, but is blank at the time of step S100.

図２に戻って、次いで、ＣＰＵ１１は、ツールバーＢＲ２上の「スキャン」のボタンＢＴ１が、マウス２４によってクリックされる操作を受けて、文字認識の対象となる文書をスキャンするスキャン処理を実行する（ステップＳ１１０）。この結果、文書のイメージを光学的に読み取った文書画像データＤＤが、メモリ１３に記憶されるとともに、ウィンドウＷＤの文書画像表示用のフィールドＦＤ１に表示される。続いて、ＣＰＵ１１は、ステップＳ１１０で得られた文書画像データＤＤに表わされる文字画像の認識を行なう文字認識処理を実行する（ステップＳ１２０）。 Returning to FIG. 2, the CPU 11 then executes a scan process for scanning a document that is a character recognition target in response to an operation in which the “scan” button BT 1 on the toolbar BR 2 is clicked by the mouse 24 ( Step S110). As a result, the document image data DD obtained by optically reading the document image is stored in the memory 13 and displayed in the document image display field FD1 of the window WD. Subsequently, the CPU 11 executes a character recognition process for recognizing the character image represented in the document image data DD obtained in step S110 (step S120).

図４は、ステップＳ１２０で実行される文字認識処理の詳細を表わすフローチャートである。図示するように、処理が開始されると、ＣＰＵ１１は、文書画像データＤＤから１つの文字画像を切り出す処理を行なう（ステップＳ２００）。詳細には、ＣＰＵ１１は、文書の文字の並びが横方向か縦方向かをみて、横方向の場合には行を、縦方向の場合には列を切り出して、その切り出した並びから１文字に相当する１つの文字画像を切り出す。 FIG. 4 is a flowchart showing details of the character recognition process executed in step S120. As shown in the figure, when the process is started, the CPU 11 performs a process of cutting out one character image from the document image data DD (step S200). Specifically, the CPU 11 checks whether the character sequence of the document is horizontal or vertical, cuts out a row in the horizontal direction and a column in the vertical direction, and converts the row into one character from the cut-out sequence. One corresponding character image is cut out.

次いで、ＣＰＵ１１は、ツールバーＢＲ２上の「認識」のボタンＢＴ２が、マウス２４によってクリックされる操作を受けて、ステップＳ２００で切り出した文字画像の認識に用いる使用辞書を、システム辞書としての辞書Ｇ１〜Ｇｎのうちから一つ選択する処理を行なう（ステップＳ２１０）。詳細には、ステップＳ２００で切り出した文字画像における文字の太さを見ることで、つぶれ文字認識辞書Ｇ２、かすれ文字認識辞書Ｇ３が使用辞書に適しているかを判断することができる。また、基本認識辞書Ｇ１と一旦照合（マッチング）を行なって、類似度が高いと判断された文字コードを検出して、その文字コードに対応した文字画像の特徴量を、明朝体認識辞書Ｇ４、ゴシック体認識辞書Ｇ５等のフォント別の認識辞書からそれぞれ読み出して、その読み出した特徴量がステップＳ２００で切り出した文字画像の特徴量と似通っているかを判定する。そうして最も類似度が高い文字画像を備える辞書を使用辞書として選択する。使用辞書の選択は、こうした手法に限るわけではなく、辞書Ｇ１〜Ｇｎの種類によって別の手法によるものとすることもできる。要は、ステップＳ２００で切り出した文字画像の認識に最も適した辞書Ｇ１〜Ｇｎを選択できるものであれば、どのような手法であってもよい。 Next, in response to an operation in which the “recognition” button BT2 on the tool bar BR2 is clicked by the mouse 24, the CPU 11 uses a dictionary used as a system dictionary for the character image clipped in step S200. A process of selecting one of Gn is performed (step S210). Specifically, by looking at the thickness of the characters in the character image cut out in step S200, it is possible to determine whether the collapsed character recognition dictionary G2 and the blurred character recognition dictionary G3 are suitable for the use dictionary. Further, the character code determined to have a high degree of similarity is once compared (matched) with the basic recognition dictionary G1, and the feature amount of the character image corresponding to the character code is determined as the Mincho character recognition dictionary G4. Then, it is read from each font-specific recognition dictionary such as the Gothic recognition dictionary G5, and it is determined whether the read feature value is similar to the character image feature value cut out in step S200. Then, the dictionary including the character image having the highest similarity is selected as the use dictionary. The selection of the use dictionary is not limited to such a method, and can be made by another method depending on the type of the dictionary G1 to Gn. In short, any method may be used as long as it can select dictionaries G1 to Gn most suitable for recognition of the character image cut out in step S200.

次いで、ＣＰＵ１１は、ステップＳ２００で切り出した文字画像を、ステップＳ２１０で使用辞書として選択した辞書（Ｇ１〜Ｇｎのいずれか一つ）および更新辞書ＧＵと照合（マッチング）することで、その文字画像の文字認識を行なう（ステップＳ２２０）。詳細には、切り出した文字画像の特徴量を演算して、使用辞書および更新辞書ＧＵの中からその演算して求めた特徴量に近い文字コードを選択して、その選択された文字コードを認識結果として出力する。「使用辞書および更新辞書ＧＵの中から」とは、両辞書に対してそれぞれマッチングを行ない、両辞書の中から特徴量の近い文字コードを選択するものである。なお、認識結果としては、最も特徴量が近い（類似度が高い）文字コードだけではなく、複数の候補が優先順位（類似度が高いほど優先順位が高い）と共に得られる。 Next, the CPU 11 collates the character image cut out in step S200 with the dictionary (any one of G1 to Gn) selected as the use dictionary in step S210 and the update dictionary GU, thereby matching the character image. Character recognition is performed (step S220). More specifically, the feature amount of the extracted character image is calculated, a character code close to the calculated feature amount is selected from the use dictionary and the update dictionary GU, and the selected character code is recognized. Output as a result. “From the use dictionary and the update dictionary GU” is to perform matching on both dictionaries and select a character code having a similar feature amount from both dictionaries. As a recognition result, not only a character code having the closest feature amount (highest similarity) but also a plurality of candidates are obtained with priority (higher priority is higher priority).

その後、ＣＰＵ１１は、ステップＳ２２０で得られた文字認識結果をメモリ１３に記憶する（ステップＳ２３０）。次いで、ＣＰＵ１１は、ステップＳ２００で切り出した文字画像が、文書画像データＤＤに含まれる文字画像の中の最後のものであるか否かを判定する（ステップＳ２４０）。ここで、最後のものでないと判定されると、ステップＳ２００に処理を戻して、次の文字画像の切り出しを行なって文字認識を行なう。一方、ステップＳ２３０で、最後の文字画像であると判定されると、文書画像データＤＤに含まれる全ての文字画像の認識が終了したとして、ＣＰＵ１１は、ステップＳ２５０に処理を進める。ステップＳ２５０では、ＣＰＵ１１は、認識結果、すなわち、ステップＳ２３０で記憶してきた第１位候補の文字コードの列を、ウィンドウＷＤの認識結果表示用のフィールドＦＤ２に表示する処理を行なう。ステップＳ２５０の実行後、ＣＰＵ１１は、「リターン」に処理を進めて、この文字認識処理のルーチンを抜ける。 Thereafter, the CPU 11 stores the character recognition result obtained in step S220 in the memory 13 (step S230). Next, the CPU 11 determines whether or not the character image cut out in step S200 is the last character image included in the document image data DD (step S240). If it is determined that it is not the last one, the process returns to step S200 to cut out the next character image and perform character recognition. On the other hand, if it is determined in step S230 that the character image is the last character image, the CPU 11 proceeds to step S250, assuming that all character images included in the document image data DD have been recognized. In step S250, the CPU 11 performs a process of displaying the recognition result, that is, the character code string of the first candidate stored in step S230, in the recognition result display field FD2 of the window WD. After execution of step S250, the CPU 11 advances the process to “return” and exits the routine of this character recognition process.

こうして図２においてステップＳ１２０の文字認識処理を抜けると、ＣＰＵ１１は、ステップＳ１３０に処理を進める。ステップＳ１３０では、ＣＰＵ１１は、ステップＳ１２０で得られた文字認識結果を補正する補正処理を行なう。ここで実行される補正処理としては、自動学習補正、半自動学習補正、手動補正の３種類がある。 In this way, after exiting the character recognition process of step S120 in FIG. 2, the CPU 11 advances the process to step S130. In step S130, the CPU 11 performs a correction process for correcting the character recognition result obtained in step S120. There are three types of correction processing executed here: automatic learning correction, semi-automatic learning correction, and manual correction.

図５は、上記３種類の学習補正を補正結果の一例を用いて説明するための説明図である。図中の第１段目には、ステップＳ１１０によるスキャン処理の結果得られた文字画像の配列が示されており、第３段目ないし第６段目には、ステップＳ１２０による文字認識処理の結果として得られた第１位候補から第４位候補までが示されている。図中の第２段目には、ステップＳ１３０による補正処理の結果が示されている。 FIG. 5 is an explanatory diagram for explaining the above three types of learning correction using an example of correction results. The first row in the figure shows the arrangement of character images obtained as a result of the scan processing in step S110, and the third to sixth rows show the result of character recognition processing in step S120. From the first candidate to the fourth candidate obtained as shown in FIG. The second level in the figure shows the result of the correction process in step S130.

自動学習補正は、作業者の手を借りずに、単語辞書ＧＷを用いて文字認識結果を自動的に補正するものである。すなわち、ステップＳ１２０の文字認識処理によって得られた認識結果としての第１候補は必ずしも正しいとは限らないことから、文字認識結果が、単語辞書ＧＷに格納された単語として成り立つ文字コードを、認識結果の第２候補以下から探し、正解として指定することで正しい結果に補正するものである。なお、この正解の文字コードの指定が、自動学習補正時における文字コード指定部１８ｂ（図１参照）を構成する。図示の例では、第２番目の文字が、自動学習補正によるものである。文字認識処理では、"書"が第２位候補として認識されるが、単語辞書ＧＷに「辞書」という単語が予め登録してあるので、ステップＳ１２０で単語辞書ＧＷを利用した自動学習補正による補正処理を行なうことで、認識結果として"書"が採用される。 In the automatic learning correction, the character recognition result is automatically corrected using the word dictionary GW without the help of an operator. That is, since the first candidate as the recognition result obtained by the character recognition processing in step S120 is not necessarily correct, the character recognition result is obtained from the recognition result obtained as a word stored in the word dictionary GW. Are searched from the second candidate and below and designated as the correct answer to correct the result. The designation of the correct character code constitutes the character code designation unit 18b (see FIG. 1) at the time of automatic learning correction. In the illustrated example, the second character is due to automatic learning correction. In the character recognition process, “book” is recognized as the second candidate, but since the word “dictionary” is registered in advance in the word dictionary GW, correction by automatic learning correction using the word dictionary GW in step S120. By performing the process, "book" is adopted as the recognition result.

半自動学習補正は、文字認識結果として第２位候補以下として得られた複数の文字コードの中から一つを、作業者の操作指令に応じて選択して、その選択された文字コードに補正するものである。図示の例では、第７番目の文字が、半自動学習補正によるものである。文字認識処理では、第１位候補として"に"が、第２位候補として"は"が認識されるが、文章としては"に"であっても問題ないため単語辞書ＧＷを用いた補正が行われず、文字認識処理の結果がそのままとなる。そこで、作業者の手を借りて、"に"を"は"に補正する半自動学習補正が行なわれることになる。 The semi-automatic learning correction selects one of a plurality of character codes obtained as the second or lower candidate as a character recognition result in accordance with the operator's operation command and corrects the selected character code. Is. In the illustrated example, the seventh character is due to semi-automatic learning correction. In the character recognition process, “ni” is recognized as the first candidate and “ha” is recognized as the second candidate, but even if it is “ni” as the sentence, there is no problem, so correction using the word dictionary GW is performed. It is not performed, and the result of the character recognition process remains as it is. Therefore, with the help of the operator, semi-automatic learning correction for correcting “to” to “to” is performed.

図６は、半自動学習補正時における認識結果表示用のフィールドＦＤ２の一例を示す説明図である。図示するように、ウィンドウＷＤに備えられた認識結果表示用のフィールドＦＤ２には、第７番目の誤り文字を含む認識結果が表示されている。作業者は、以下の手順で複数の候補文字から正しい文字を指定している。作業者は、まず、マウス２４を操作して、その誤り文字（"に"の文字）にカーソルを移動する。この結果、認識結果表示用のフィールドＦＤ２の最下部には、複数の候補文字が、類似度が高いものほど左側に位置するように表示される。作業者は、次いで、それら表示された候補文字から正しい文字をクリックする操作を行なう。図示は、「は」の候補文字ＴＣがクリックされたものであり、上記誤り文字（"に"の文字）は、クリックされた候補文字ＴＣ（"は"の文字）に補正される。この正しい文字コードをクリックする指定が、半自動学習補正時における文字コード指定部１８ｂ（図１参照）を構成する。 FIG. 6 is an explanatory diagram showing an example of a recognition result display field FD2 at the time of semi-automatic learning correction. As shown in the drawing, the recognition result including the seventh error character is displayed in the recognition result display field FD2 provided in the window WD. The operator designates a correct character from a plurality of candidate characters according to the following procedure. First, the operator operates the mouse 24 to move the cursor to the error character (the character “”). As a result, a plurality of candidate characters are displayed at the bottom of the recognition result display field FD2 such that the higher the similarity is, the more left the character is. Next, the operator performs an operation of clicking a correct character from the displayed candidate characters. In the figure, the candidate character TC “ha” is clicked, and the error character (“character” in “) is corrected to the clicked candidate character TC (“ character ”in“). The designation of clicking the correct character code constitutes the character code designation unit 18b (see FIG. 1) at the time of semi-automatic learning correction.

手動補正は、文字認識処理によって得られた候補文字以外の文字に補正を行ないたいときに、作業者から直接文字の入力を受け付けて、その入力された文字に補正するものである。図５に示した例では、第１８番目の文字が、手動補正によるものである。文字認識処理では、"書"の文字が候補として出ていない。この場合「辞書」という単語を候補文字の中から構成できないために補正ができず、間違った結果が出力される。そこで、作業者の手を借りて、"事"を"書"に補正する手動補正が行なわれることになる。 In manual correction, when it is desired to correct a character other than the candidate character obtained by the character recognition process, a character input is directly received from the operator and the input character is corrected. In the example shown in FIG. 5, the 18th character is due to manual correction. In the character recognition process, the characters “call” are not displayed as candidates. In this case, since the word “dictionary” cannot be constructed from the candidate characters, it cannot be corrected and an incorrect result is output. Therefore, manual correction for correcting “thing” to “call” is performed with the help of the operator.

作業者は、以下の手順で間違った認識結果を補正する処理を行なう。作業者は、まず、マウス２４を操作して、ウィンドウＷＤの認識結果表示用のフィールドＦＤ２に表示された文字列における誤り文字にカーソルを移動する。次いで、ウィンドウＷＤのツールバーＢＲ２上の「手動補正」のボタンＢＴ４（図３参照）をマウス２４によってクリックして、手動補正用のダイアログボックスＤＢを表示する。 The operator performs processing for correcting an incorrect recognition result in the following procedure. First, the operator operates the mouse 24 to move the cursor to an error character in the character string displayed in the recognition result display field FD2 of the window WD. Next, a “manual correction” button BT4 (see FIG. 3) on the tool bar BR2 of the window WD is clicked with the mouse 24 to display a dialog box DB for manual correction.

図７は、手動補正用のダイアログボックスＤＢを示す説明図である。このダイアログボックスＤＢには、「文字画像」と「一覧」の表示フィールドＦＤ１１，ＦＤ１２と、「文字」と「辞書」の入力フィールドＦＤ１３，ＦＤ１４とが備えられている。「文字画像」の表示フィールドＦＤ１１には、ウィンドウＷＤの文書画像表示用のフィールドＦＤ１に表示されている文書画像データＤＤがそのまま表示される。このとき、認識結果表示用のフィールドＦＤから先に指定した誤り文字は、赤枠ＲＦで囲まれている。作業者は、マウス２４を操作して、「辞書」の入力フィールドＦＤ１４から辞書を識別するための識別情報（ＩＤ）を入力すると共に、キーボード２２を操作して、「文字」の入力フィールドＦＤ１３から文字をキー入力する。その後、［追加］のボタンＢＴ１１をクリックする。この結果、「文字」の入力フィールドＦＤ１３から入力された文字を表わす文字コードが指定されることになり、上記誤り文字は、その指定された文字コード（"書"の文字コード）に補正される。なお、「文字」と「辞書」の入力フィールドＦＤ１３，ＦＤ１４から作業者の操作指令に応じて正しい文字の文字コードと選択する辞書を指定する構成が、手動補正時における文字コード指定部１８ｂおよび辞書選択部１８ｃ（図１参照）の一部を構成する。 FIG. 7 is an explanatory view showing a dialog box DB for manual correction. The dialog box DB includes “character image” and “list” display fields FD11 and FD12, and “character” and “dictionary” input fields FD13 and FD14. In the “character image” display field FD11, the document image data DD displayed in the document image display field FD1 of the window WD is displayed as it is. At this time, the error character specified earlier from the recognition result display field FD is surrounded by a red frame RF. The operator operates the mouse 24 to input identification information (ID) for identifying the dictionary from the “dictionary” input field FD14, and operates the keyboard 22 to input the “character” input field FD13. Key in characters. Thereafter, the “Add” button BT11 is clicked. As a result, the character code representing the character input from the “character” input field FD13 is designated, and the error character is corrected to the designated character code (character code of “book”). . Note that the configuration for designating the correct character code and the dictionary to be selected from the input fields FD13 and FD14 of “character” and “dictionary” in accordance with the operator's operation command is the character code designating portion 18b and dictionary at the time of manual correction. It constitutes a part of the selector 18c (see FIG. 1).

ステップＳ１３０の補正処理では、自動学習補正を行なう処理が必ず実行され（もちろん、文字認識結果が正しい場合には文字の変更はない）、さらに、作業者の上述した操作指令を受けた場合に半自動学習補正もしくは手動補正がなされる。なお、自動学習補正、半自動学習補正および手動補正のうちのいずれかで文字の変更がなされる毎に、更新辞書ＧＵを更新する処理がなされる。更新辞書ＧＵの更新処理について、次に詳述する。 In the correction process of step S130, a process of performing automatic learning correction is always executed (of course, there is no change of the character when the character recognition result is correct), and further, semi-automatic when receiving the above-described operation command from the operator. Learning correction or manual correction is performed. Each time a character is changed by any one of automatic learning correction, semi-automatic learning correction, and manual correction, a process of updating the update dictionary GU is performed. Next, the update processing of the update dictionary GU will be described in detail.

図８は、更新辞書ＧＵと登録画像情報ファイルＰＦのデータ構造を示す説明図である。図示するように、更新辞書ＧＵは、「文字コード」、「対応辞書識別情報」、「更新回数」、「特徴量」の各項目Ｆ１、Ｆ２、Ｆ３、Ｆ４の組を１レコードとするデータ構造を備えている。この１レコードで１文字が表わされている。こうした構成の更新辞書ＧＵによれば、「対応辞書識別情報」の項目Ｆ２に格納されている識別情報によって特定される辞書（Ｇ１〜Ｇｎのうちのいずれか）を補正の対象として、その辞書の中の「文字コード」の項目Ｆ１に格納されている文字コードと一致する文字についての補正がなされていることがわかり、その補正量としての特徴量が、「特徴量」の項目Ｆ４に格納されていることがわかる。 FIG. 8 is an explanatory diagram showing the data structures of the update dictionary GU and the registered image information file PF. As shown in the figure, the update dictionary GU has a data structure in which a set of items F1, F2, F3, and F4 of “character code”, “corresponding dictionary identification information”, “update count”, and “feature” is one record. It has. One character is represented by one record. According to the update dictionary GU having such a configuration, a dictionary (any one of G1 to Gn) specified by the identification information stored in the item F2 of “corresponding dictionary identification information” is used as a correction target, and It can be seen that a correction is made for the character that matches the character code stored in the item “Character code” F1, and the feature amount as the correction amount is stored in the item “F4” item F4. You can see that

一方、登録画像情報ファイルＰＦは、「文字コード」、「更新辞書の対応レコード番号」、「文字画像のファイル名」の各項目Ｅ１、Ｅ２、Ｅ３の組を１レコードとするデータ構造を備えている。「文字画像のファイル名」の項目Ｅ３は、ｍ個（ｍは正数）取り得る。「更新辞書の対応レコード番号」の項目Ｅ１に格納されるデータは、更新辞書ＧＵの１レコードを指定するためのものである。項目Ｅ１によって指定された更新辞書ＧＵの１レコードの「文字コード」の項目Ｆ１に格納されるデータは、「文字コード」の項目Ｅ１に格納されるデータと一致する。この登録画像情報ファイルＰＦは、更新辞書ＧＵを更新する度に、その更新に用いた文字画像を登録するためのものである。 On the other hand, the registered image information file PF has a data structure in which a set of items E1, E2, and E3 of “character code”, “update dictionary corresponding record number”, and “character image file name” is one record. Yes. The item “E3” of “character image file name” can be m (m is a positive number). The data stored in the item “E1 corresponding record number of the update dictionary” is for designating one record of the update dictionary GU. The data stored in the “character code” item F1 of one record of the update dictionary GU specified by the item E1 matches the data stored in the “character code” item E1. The registered image information file PF is for registering the character image used for the update every time the update dictionary GU is updated.

更新辞書ＧＵの更新処理においては、登録画像情報ファイルＰＦを直接的に用いることはない。登録画像情報ファイルＰＦは、更新辞書ＧＵが損傷して再作成する際に利用したり、更新辞書ＧＵに登録した文字画像を一覧として表示する際に利用される。 In the update process of the update dictionary GU, the registered image information file PF is not directly used. The registered image information file PF is used when the update dictionary GU is damaged and re-created, or when displaying the character images registered in the update dictionary GU as a list.

図９は、更新辞書ＧＵの更新処理を示すフローチャートである。この更新辞書ＧＵの更新処理は、自動学習補正、半自動学習補正および手動補正のうちのいずれかで文字の変更がなされたときに実行される。図示するように、処理が開始されると、ＣＰＵ１１は、まず、上述したいずれかの補正によって変更された文字コードＣＣを取り込む処理を行なう（ステップＳ３００）。次いで、ＣＰＵ１１は、変更前の文字画像の特徴量ＣＶ１を取り込む処理を行なう（ステップＳ３１０）。自動学習補正、半自動学習補正、手動補正の各処理においては、文書画像データＤＤから切り出した１文字の文字画像の特徴量を演算していることから、ステップＳ３１０では、この特徴量を取り込んでいる。この特徴量を取り込む構成が、特徴量取得部１８ｄ（図１参照）に対応する。 FIG. 9 is a flowchart showing an update process of the update dictionary GU. This update processing of the update dictionary GU is executed when a character is changed by any one of automatic learning correction, semi-automatic learning correction, and manual correction. As shown in the figure, when the process is started, the CPU 11 first performs a process of taking in the character code CC changed by any of the above-described corrections (step S300). Next, the CPU 11 performs processing for taking in the feature amount CV1 of the character image before the change (step S310). In each of the automatic learning correction, semi-automatic learning correction, and manual correction processes, the feature amount of the character image of one character cut out from the document image data DD is calculated. In step S310, the feature amount is captured. . The configuration for capturing the feature amount corresponds to the feature amount acquisition unit 18d (see FIG. 1).

次いで、ＣＰＵ１１は、自動、半自動学習補正の場合には、ステップＳ２１０で選択された使用辞書を識別するための識別情報を対応辞書識別情報Ｙとして記憶し、手動補正の場合には、「辞書」の入力フィールドＦＤ１４から操作者によって入力された辞書の識別情報を対応辞書識別情報Ｙとして記憶する（ステップＳ３２０）。なお、自動、半自動学習補正時においては、文字認識処理のステップＳ２１０で選択された使用辞書の識別情報を受け取る構成が、辞書選択部１８ｃ（図１参照）に対応し、手動補正時においては、上述したダイアログボックスＤＢに備えられた「辞書」の入力フィールドＦＤ１４と、この入力フィールドＦＤ１４から操作者によって入力された辞書の識別情報を取り込む構成が、辞書選択部１８ｃ（図１参照）に対応する。 Next, in the case of automatic or semi-automatic learning correction, the CPU 11 stores identification information for identifying the used dictionary selected in step S210 as corresponding dictionary identification information Y, and in the case of manual correction, “dictionary”. The dictionary identification information input by the operator from the input field FD14 is stored as corresponding dictionary identification information Y (step S320). In the automatic and semi-automatic learning correction, the configuration for receiving the identification information of the used dictionary selected in step S210 of the character recognition process corresponds to the dictionary selection unit 18c (see FIG. 1), and in the manual correction, The “dictionary” input field FD14 provided in the dialog box DB described above, and the configuration for capturing the dictionary identification information input by the operator from the input field FD14 correspond to the dictionary selection unit 18c (see FIG. 1). .

その後、ＣＰＵ１１は、更新辞書ＧＵに、「文字コード」の項目Ｆ１がステップＳ３００で取り込んだ文字コードＣＣと一致し、且つ「対応辞書識別情報」の項目Ｆ２がステップＳ３２０で読み込んだ対応辞書識別情報Ｙと一致するレコードが登録済みか否かを判定する（ステップＳ３３０）。ここで、否定判定、すなわち未登録と判定された場合には、ステップＳ３４０に処理を進める。 Thereafter, the CPU 11 stores in the updated dictionary GU, the corresponding character identification information that the item “F1” of “character code” matches the character code CC captured in step S300, and the item “F2” of “corresponding dictionary identification information” is read in step S320. It is determined whether or not a record matching Y is registered (step S330). Here, if a negative determination is made, that is, it is determined that it is not registered, the process proceeds to step S340.

ステップＳ３４０では、ＣＰＵ１１は、ステップＳ３２０で読み込んだ対応辞書識別情報Ｙによって特定される辞書から、ステップＳ３００で取り込んだ文字コードＣＣに対応する特徴量ＣＶ０を読み出す（ステップＳ３４０）。 In step S340, the CPU 11 reads the feature amount CV0 corresponding to the character code CC captured in step S300 from the dictionary specified by the corresponding dictionary identification information Y read in step S320 (step S340).

その後、更新辞書ＧＵに新しいレコードを追加する処理を行なう（ステップＳ３５０）。詳細には、ＣＰＵ１１は、「文字コード」の項目Ｆ１にはステップＳ３００で取り込んだ文字コードＣＣを転送し、「対応辞書識別情報」の項目Ｆ２にはステップＳ３２０で読み込んだ対応辞書識別情報Ｙを転送し、「更新回数」の項目Ｆ３には値１を転送し、「特徴量」の項目Ｆ４には、ステップＳ３４０で対応辞書から読み出した特徴量ＣＶ０とステップＳ３１０で取り込んだ特徴量ＣＶ１との平均値、すなわち、（ＣＶ０＋ＣＶ１）／２の演算結果を転送して作成した新しいレコードを更新辞書ＧＵに追加する。ステップＳ３５０の実行後、「リターン」に抜けてこの更新辞書ＧＵの更新処理を一旦終了する。 Thereafter, a process of adding a new record to the update dictionary GU is performed (step S350). Specifically, the CPU 11 transfers the character code CC fetched in step S300 to the item F1 of “character code”, and the correspondence dictionary identification information Y read in step S320 to the item F2 of “corresponding dictionary identification information”. The value 1 is transferred to the “update count” item F3, and the feature value CV0 read from the correspondence dictionary in step S340 and the feature value CV1 captured in step S310 are stored in the “feature value” item F4. A new record created by transferring the average value, that is, the calculation result of (CV0 + CV1) / 2 is added to the update dictionary GU. After execution of step S350, the process returns to “Return” to temporarily end the update process of the update dictionary GU.

一方、ステップＳ３３０で、肯定判定、すなわち登録済みと判定された場合には、上記一致すると判定されたレコード、すなわち、「文字コード」の項目Ｆ１がステップＳ３００で取り込んだ文字コードＣＣと一致し、且つ「対応辞書識別情報」の項目Ｆ２がステップＳ３２０で読み込んだ対応辞書識別情報Ｙと一致するレコードを修正する処理を行なう（ステップＳ３６０）。この修正の詳細は、一致すると判定されたレコードの「更新回数」の項目Ｆ３を、値１だけインクリメントするとともに、「特徴量」の項目Ｆ４に格納されている値を、ステップＳ３１０で取り込んだ特徴量ＣＶ１を用いて平均化するものである。この平均化は、「特徴量」の項目Ｆ４に格納されている値に「更新回数」の項目Ｆ３の値（上記インクリメント後の値）を乗算して、この乗算結果にステップＳ３１０で取り込んだ特徴量ＣＶ１を加算して、その加算結果を、「更新回数」の項目Ｆ３の値に値１を加えた値で除算することにより行なわれる。なお、「文字コード」の項目Ｆ１および「対応辞書識別情報」の項目Ｆ２は変更しない。ステップＳ３６０の実行後、「リターン」に抜けてこの更新辞書ＧＵの更新処理を一旦終了する。 On the other hand, if it is determined affirmative in step S330, that is, if it is registered, the record determined to match, that is, the item “F1” of “character code” matches the character code CC captured in step S300, In addition, a process for correcting the record in which the item F2 “corresponding dictionary identification information” matches the corresponding dictionary identification information Y read in step S320 is performed (step S360). The details of the correction are as follows. The item F3 of “update count” of the record determined to match is incremented by a value 1, and the value stored in the item F4 of “feature” is acquired in step S310. It is averaged using the quantity CV1. In this averaging, the value stored in the item F4 “feature” is multiplied by the value of the item F3 “number of updates” (the value after the increment), and the multiplication result is captured in step S310. This is performed by adding the amount CV1 and dividing the addition result by a value obtained by adding the value 1 to the value of the item F3 of “update count”. The “character code” item F1 and the “corresponding dictionary identification information” item F2 are not changed. After executing step S360, the process returns to “Return” to temporarily end the update process of the update dictionary GU.

ステップＳ３５０およびＳ３６０の詳細について、例をあげてさらに説明する。更新辞書ＧＵに新レコードとして追加されたとき、すなわち第１回目の辞書の更新時には、ステップＳ３１０で取り込まれた特徴量ＣＶ１をＶ１とすると、「更新回数」の項目Ｆ３は値１であり、「特徴量」の項目Ｆ４は（ＣＶ０＋Ｖ１）／２となる。 Details of steps S350 and S360 will be further described by way of examples. When a new record is added to the update dictionary GU, that is, when the dictionary is updated for the first time, assuming that the feature value CV1 captured in step S310 is V1, the item F3 of “number of updates” has a value of “ The item F4 of “feature” is (CV0 + V1) / 2.

第２回目の更新時には、ステップＳ３１０で取り込まれた特徴量ＣＶ１をＶ２とすると、「更新回数」の項目Ｆ３は値２であり、「特徴量」の項目Ｆ４は（ＣＶ０＋Ｖ１＋Ｖ２）／３となる。第３回目の更新時には、ステップＳ３１０で取り込まれた特徴量ＣＶ１をＶ３とすると、「更新回数」の項目Ｆ３は値３であり、「特徴量」の項目Ｆ４は（ＣＶ０＋Ｖ１＋Ｖ２＋Ｖ３）／４となる。すなわち、第ｋ回目（ｋは正数）の更新時には、ステップＳ３１０で取り込まれた特徴量ＣＶ１をＶｋとすると、「更新回数」の項目Ｆ３はｋであり、「特徴量」の項目Ｆ４は（ＣＶ０＋Ｖ１＋Ｖ２＋...＋Ｖｋ）／（ｋ＋１）となる。 At the time of the second update, assuming that the feature value CV1 captured in step S310 is V2, the “number of updates” item F3 is the value 2, and the “feature” item F4 is (CV0 + V1 + V2) / 3. At the time of the third update, assuming that the feature value CV1 captured in step S310 is V3, the “update count” item F3 is the value 3, and the “feature” item F4 is (CV0 + V1 + V2 + V3) / 4. That is, at the k-th update (k is a positive number), if the feature value CV1 captured in step S310 is Vk, the “number of updates” item F3 is k, and the “feature value” item F4 is ( CV0 + V1 + V2 + ... + Vk) / (k + 1).

上述した更新辞書ＧＵの更新処理は、前述したようにステップＳ１３０において、自動学習補正、半自動学習補正および手動補正のうちのいずれかで文字の変更がなされる度に行なわれるが、図２に戻って、ステップＳ１３０の補正処理を抜けると、ＣＰＵ１１は、ステップＳ１４０に処理を進める。ステップＳ１４０では、ＣＰＵ１１は、ツールバーＢＲ２上の「保存」のボタンＢＴ３が、マウス２４によってクリックされる操作を受けて、上記認識結果の文字列をハードディスクドライブ１４等の記憶装置に保存する処理を実行する。この保存は、キーボード２２からのファイル名の入力を受けてなされる。その後、ＣＰＵ１１は、ウィンドウＷＤを閉じて（ステップＳ１５０）、この制御処理を終了する。 The update processing of the update dictionary GU described above is performed every time a character is changed in any one of automatic learning correction, semi-automatic learning correction, and manual correction in step S130 as described above. After exiting the correction process in step S130, the CPU 11 advances the process to step S140. In step S140, the CPU 11 executes a process of saving the character string of the recognition result in a storage device such as the hard disk drive 14 in response to an operation in which the “save” button BT3 on the toolbar BR2 is clicked by the mouse 24. To do. This storage is performed upon receiving a file name input from the keyboard 22. Thereafter, the CPU 11 closes the window WD (step S150) and ends this control process.

ＣＰＵ１１で実行されるステップＳ１３０の処理が、図２で示した第２の辞書更新部１８に対応する。第２の辞書更新部１８に備えられる文字画像切り出し部１８ａ、文字コード指定部１８ｂ、辞書選択部１８ｃは自動学習補正、半自動学習補正、手動補正のそれぞれの中で実現されており、特徴量取得部１８ｄはステップＳ３１０の処理で実現され、特徴量補正部１８ｅおよびデータ追加部１８ｆは、ステップＳ３５０の処理で実現されている。 The process of step S130 executed by the CPU 11 corresponds to the second dictionary update unit 18 shown in FIG. The character image cutout unit 18a, the character code designation unit 18b, and the dictionary selection unit 18c included in the second dictionary update unit 18 are realized in each of automatic learning correction, semi-automatic learning correction, and manual correction, and feature amount acquisition is performed. The unit 18d is realized by the process of step S310, and the feature amount correcting unit 18e and the data adding unit 18f are realized by the process of step S350.

以上のようにして追加・修正がなされる更新辞書ＧＵは、文字認識処理におけるステップＳ２２０（図４）において用いられる。このステップＳ２２０では、使用辞書の識別情報と、更新辞書ＧＵの「対応辞書識別情報」の項目Ｆ２に格納されている識別情報とを比較して、一致する項目Ｆ２を備えるレコードだけを更新辞書ＧＵから選択して、その選択されたレコード内容だけをマッチングに用いるようにしている。 The updated dictionary GU that is added / modified as described above is used in step S220 (FIG. 4) in the character recognition process. In this step S220, the identification information of the used dictionary and the identification information stored in the item F2 of the “corresponding dictionary identification information” of the update dictionary GU are compared, and only the record having the matching item F2 is updated. And only the selected record content is used for matching.

３．作用・効果
以上のように構成された本実施例によれば、更新辞書ＧＵによって、辞書別に特徴量の補正がなされることになる。このために、誤認識があったために更新辞書ＧＵに文字画像と正しい文字コードを追加した場合に、その追加した文字画像が、誤認識を行なった辞書とは異なる辞書を用いた文字認識で利用されることがない。したがって、更新辞書ＧＵに文字画像を追加することにより、いままで認識できていた類似の文字が認識不能となることがないことから、文字認識の精度を充分に高めることができる。 3. Action / Effect According to the present embodiment configured as described above, the feature amount is corrected for each dictionary by the updated dictionary GU. For this reason, when a character image and a correct character code are added to the updated dictionary GU due to misrecognition, the added character image is used for character recognition using a dictionary that is different from the erroneously recognized dictionary. It will not be done. Therefore, by adding a character image to the update dictionary GU, similar characters that have been recognized so far do not become unrecognizable, so that the accuracy of character recognition can be sufficiently increased.

また、この実施例では、文字認識結果の補正を、自動学習補正、半自動学習補正、手動補正によって様々な認識結果の状況に合わせて行なうことができ、その上で、各補正時に更新辞書ＧＵの更新を行なうことができ、操作性に優れている。手動補正時には、更新辞書ＧＵの更新も手動で、すなわち、文字コードと辞書とを手動で指定することもでき、作業者の意図する更新を容易に行なうことができる。 In this embodiment, the correction of the character recognition result can be performed according to various recognition result situations by automatic learning correction, semi-automatic learning correction, and manual correction. It can be updated and has excellent operability. At the time of manual correction, the update dictionary GU can be updated manually, that is, the character code and the dictionary can be manually specified, and the update intended by the operator can be easily performed.

４．他の実施形態：
本発明は上記の実施例や実施形態に限られるものではなく、本発明の要旨を逸脱しない範囲において種々なる態様にて実施することが可能であり、例えば次のような変形も可能である。 4). Other embodiments:
The present invention is not limited to the above-described examples and embodiments, and can be implemented in various modes without departing from the gist of the present invention. For example, the following modifications are possible.

（１）前記実施例では、特徴量補正部１８ｅを、対応辞書から読み出した特徴量ＣＶ０と誤認識をした文字画像の特徴量ＣＶ１との単純平均を求めることにより構成していたが、これに換えて、重み付け平均等の他の手法により、辞書から読み出した特徴量を、誤認識をした文字画像の特徴量でもって補正する構成としてもよい。重み付け平均の場合には、対応辞書から読み出した特徴量ＣＶ０を重要視した重み付けを行なうことで、学習前の認識精度に大きな影響を与えることなく補正を行なうことができる。 (1) In the above-described embodiment, the feature amount correcting unit 18e is configured by obtaining a simple average between the feature amount CV0 read from the corresponding dictionary and the feature amount CV1 of the character image that has been erroneously recognized. Alternatively, the feature amount read from the dictionary may be corrected with the feature amount of the character image that has been erroneously recognized by another method such as weighted averaging. In the case of weighted average, weighting that places importance on the feature amount CV0 read from the corresponding dictionary can be performed without significantly affecting the recognition accuracy before learning.

（２）また、上記他の手法の１つとして、ＣＶ０とＣＶ１との比較を、その特徴量を表わす小領域毎に行なうことで、文字画像の中で最も特徴を表わしている部分（ＣＶ０とＣＶ１との差が大きい部分）を抽出して、その抽出された部分だけを補正する構成とすることもできる。 (2) Further, as one of the other methods described above, a comparison between CV0 and CV1 is performed for each small region representing the feature amount, so that a portion (CV0 It is also possible to extract a portion having a large difference from CV1 and correct only the extracted portion.

（３）前記実施例では、更新辞書ＧＵにおいて、同一の文字コード、同一の対応辞書識別情報についての修正が多数回になると、多くのデータを利用して平均化していくことになり、特徴のない辞書データとなる虞がある。このために、同一の文字コード、同一の対応辞書識別情報に対する補正回数に制限を設けて、補正回数がその制限を超えた場合には、例え同一の文字コード、同一の対応辞書識別情報であってもレコードを替えて登録するよう構成してもよい。この構成によれば、認識精度をより一層高めることができる。 (3) In the above-described embodiment, when the same character code and the same corresponding dictionary identification information are corrected a large number of times in the updated dictionary GU, a large amount of data is used for averaging. There is a risk that there will be no dictionary data. For this reason, if the number of corrections for the same character code and the same corresponding dictionary identification information is limited, and the number of corrections exceeds the limit, the same character code and the same corresponding dictionary identification information are used. However, the record may be changed and registered. According to this configuration, the recognition accuracy can be further enhanced.

本発明の一実施例を適用するコンピュータシステムの概略構成を示す説明図である。It is explanatory drawing which shows schematic structure of the computer system to which one Example of this invention is applied. 制御処理のメインルーチンを示すフローチャートである。It is a flowchart which shows the main routine of a control process. ディスプレイ２０に表示されるウィンドウＷＤを示す説明図である。12 is an explanatory diagram showing a window WD displayed on the display 20. FIG. 図２のステップＳ１２０で実行される文字認識処理の詳細を表わすフローチャートである。It is a flowchart showing the detail of the character recognition process performed by step S120 of FIG. 自動学習補正、半自動学習補正および手動補正を補正結果の一例を用いて説明するための説明図である。It is explanatory drawing for demonstrating automatic learning correction | amendment, semiautomatic learning correction | amendment, and manual correction using an example of a correction result. 半自動学習補正時における認識結果表示用のフィールドＦＤ２の一例を示す説明図である。It is explanatory drawing which shows an example of the field FD2 for a recognition result display at the time of semiautomatic learning correction | amendment. 手動補正用のダイアログボックスＤＢを示す説明図である。It is explanatory drawing which shows dialog box DB for manual correction | amendment. 更新辞書ＧＵと登録画像情報ファイルＰＦのデータ構造を示す説明図である。It is explanatory drawing which shows the data structure of the update dictionary GU and the registration image information file PF. 更新辞書ＧＵの更新処理を示すフローチャートである。It is a flowchart which shows the update process of the update dictionary GU.

Explanation of symbols

１０...パーソナルコンピュータ
１１...ＣＰＵ
１２...バス
１３...メモリ
１４...ハードディスクドライブ
１５...入力制御ユニット
１６...表示制御ユニット
１８...第２の辞書更新部
１８ａ...文字画像切り出し部
１８ｂ...文字コード指定部
１８ｃ...辞書選択部
１８ｄ...特徴量取得部
１８ｅ...特徴量補正部
１８ｆ...データ追加部
２０...ディスプレイ
２２...キーボード
２４...マウス
２６...スキャナ
２８...プリンタ
ＷＤ...ウィンドウ
ＢＲ１...メニューバー
ＢＲ２...ツールバー
ＢＴ１，ＢＴ２，ＢＴ３、ＢＴ４...ボタン
ＤＤ...文書画像データ
ＦＤ１...文書画像表示用のフィールド
ＦＤ２...認識結果表示用のフィールド
Ｇ１...基本認識辞書
Ｇ２...つぶれ文字認識辞書
Ｇ３...かすれ文字認識辞書
Ｇ４...明朝体認識辞書
Ｇ５...ゴシック体認識辞書
ＧＵ...更新辞書
ＧＷ...単語辞書
ＰＦ...登録画像情報ファイル
ＰＲ...コンピュータプログラム
ＤＢ...ダイアログボックス
ＦＤ１１...「文字画像」の表示フィールド
ＦＤ１２...「一覧」の表示フィールド
ＦＤ１３...「文字」の入力フィールド
ＦＤ１４...「辞書」の入力フィールド 10 ... Personal computer 11 ... CPU
12 ... Bus 13 ... Memory 14 ... Hard disk drive 15 ... Input control unit 16 ... Display control unit 18 ... Second dictionary update unit 18a ... Character image cutout unit 18b. ..Character code designation part 18c ... Dictionary selection part 18d ... Feature quantity acquisition part 18e ... Feature quantity correction part 18f ... Data addition part 20 ... Display 22 ... Keyboard 24 ... Mouse 26 ... Scanner 28 ... Printer WD ... Window BR1 ... Menu bar BR2 ... Toolbar BT1, BT2, BT3, BT4 ... Button DD ... Document image data FD1 ... Document Image display field FD2 ... Recognition result display field G1 ... Basic recognition dictionary G2 ... Shattered character recognition dictionary G3 ... Blurred character recognition dictionary G4 ... Mincho body recognition dictionary G5 .. .Gothic recognition dictionary GU ... update dictionary GW ... word dictionary PF ... registration Image information file PR ... Computer program DB ... Dialog box FD11 ... "Character image" display field FD12 ... "List" display field FD13 ... "Character" input field FD14 ... Dictionary input field

Claims

As a dictionary for character recognition, a first dictionary whose registration contents are fixed in advance and a second dictionary that can update the registration contents are provided. A document image is input, and the character image represented in the document image is converted into the character image. A character recognition device that performs character recognition by collating with a first dictionary and a second dictionary,
A first dictionary storage means for storing a plurality of types of dictionaries that store character image feature quantities and character codes in association with each other as the first dictionary;
Second dictionary updating means for updating the second dictionary,
The second dictionary updating means includes
A character image cutting means for cutting out a character image of one character from the document image;
A character code specifying means for specifying a character code for the character image cut out by the character image cut-out means;
Dictionary selection means for selecting one dictionary from a plurality of types of dictionaries stored in the first dictionary storage means;
Feature quantity acquisition means for obtaining a feature quantity of the character image cut out by the character image cutout means;
The feature quantity corresponding to the character code designated by the character code designation means is read from the dictionary selected by the dictionary selection means, and the read feature quantity is the feature quantity obtained by the feature quantity acquisition means. A feature amount correcting means for correcting by,
A set of a character code designated by the character code designation unit, a feature amount after correction by the feature amount correction unit, and identification information for identifying the dictionary selected by the dictionary selection unit is set for one character. A character recognizing device comprising: data adding means for adding data to the second dictionary as data.

The character recognition device according to claim 1,
Whether or not data for one character including the character code specified by the character code specifying means and identification information for identifying the dictionary selected by the dictionary selecting means has been registered in the second dictionary. Registration determination means for determining whether or not
A character recognition apparatus comprising: a data correction unit that corrects the registered data with the feature amount corrected by the feature amount correction unit when the registration determination unit determines that the registration is not completed.

The character recognition device according to claim 1 or 2,
The character code specifying means is configured to specify a character code representing a character input in response to an operator's operation command,
The said dictionary selection means is a character recognition apparatus which is the structure which selects the dictionary designated according to the operator's operation command.

The character recognition device according to claim 1 or 2,
A character image of one character is sequentially cut out from the document image, a feature amount of the cut out character image is calculated, and a use dictionary selected from a plurality of dictionaries stored in the first dictionary storage means, Character recognition means for recognizing the character by selecting a character code close to the feature value obtained by the calculation from the second dictionary;
The said dictionary selection means is a character recognition apparatus which is the structure which uses the said use dictionary as the selected dictionary.

The character recognition device according to any one of claims 1 to 4,
The feature amount correcting means includes
A character recognition device comprising: averaging means for obtaining an average value of a feature value read from the dictionary and a feature value obtained by the feature value acquisition means.

As a dictionary for character recognition, a first dictionary whose registration contents are fixed in advance and a second dictionary whose registration contents can be updated are prepared, a document image is input, and a character image represented in the document image A character recognition method for recognizing characters by comparing them with the first dictionary and the second dictionary,
(A) a step of previously storing a plurality of types of dictionaries for storing the character image feature quantity and the character code in association with each other as the first dictionary;
(B) a step of updating the second dictionary;
The step (b) is:
(B-1) a step of cutting out one character image from the document image;
(B-2) a step of designating a character code for the character image cut out in the step (b-1);
(B-3) a step of selecting one dictionary from a plurality of types of dictionaries as the first dictionary;
(B-4) a step of obtaining a feature amount of the character image cut out in the step (b-1);
(B-5) A feature amount corresponding to the character code specified in the step (b-2) is read from the dictionary selected in the step (b-3), and the read feature amount is A process of correcting with the feature amount obtained by the process (b-4);
(B-6) The character code designated by the step (b-2), the corrected feature amount by the step (b-5), and the dictionary selected by the step (b-3) are identified. And a step of adding a set of identification information to the second dictionary as data for one character.

For performing character recognition by inputting a document image and collating the character image represented in the document image with a first dictionary whose registration contents are fixed in advance and a second dictionary whose registration contents can be updated. A computer program,
(A) a function of preliminarily storing in the storage device a plurality of types of dictionaries that store character image feature quantities and character codes in association with each other as the first dictionary;
(B) causing the computer to realize the function of updating the second dictionary;
The function (b) is
(B-1) a function of cutting out one character image from the document image;
(B-2) a function of designating a character code for the character image cut out by the function (b-1);
(B-3) a function of selecting one dictionary from a plurality of types of dictionaries as the first dictionary;
(B-4) a function of obtaining a feature amount of the character image cut out by the function (b-1);
(B-5) The feature amount corresponding to the character code designated by the function (b-2) is read from the dictionary selected by the function (b-3), and the read feature amount is A function of correcting with the feature amount obtained by the function (b-4);
(B-6) The character code designated by the function (b-2), the feature value after the correction by the function (b-5), and the dictionary selected by the function (b-3) are identified. And a function of adding a set of identification information to the second dictionary as data for one character.

The computer program according to claim 7, further comprising:
(B-7) Data for one character comprising the character code designated by the function (b-2) and identification information for identifying the dictionary selected by the function (b-3) A function for determining whether or not it is already registered in the dictionary 2;
(B-8) a function of correcting the registered data with the feature amount corrected by the function (b-5) when it is determined that the function (b-7) has not been registered. Computer program to be realized.

A computer program according to claim 7 or 8,
The function (b-5) is
A computer program comprising a function for obtaining an average value of a feature value read from the dictionary selected by the function (b-3) and a feature value obtained by the function (b-4).

10. A computer-readable recording medium on which the computer program according to claim 7 is recorded.