JPH0676102A

JPH0676102A - Determining method for setting classifing process

Info

Publication number: JPH0676102A
Application number: JP5028788A
Authority: JP
Inventors: Jr Henry W Sang; ダブリュー．サング、ジュニアヘンリー; Dasari Lakshmi; ダサリラクシミ; M Pagaro Julia; エム．パガロジュリア
Original assignee: Xerox Corp
Current assignee: Xerox Corp
Priority date: 1992-02-26
Filing date: 1993-02-18
Publication date: 1994-03-18

Abstract

PURPOSE: To provide a form recognition system for identifying the kind of an original in a new fixed form, clarifying a characteristic, using it, and adding it to a data base. CONSTITUTION: This method is provided with a step for inputting a master original at plural times, and generating plural master bit maps (200), step for extracting a characteristic from the plural master bit maps (300), step for generating an attribute by combining the characteristic with random model parameter distribution, and generating the plural attributes in the larger number of that of the plural master bit maps, step for operating cluster analysis from all the master originals to the attributes, and generating plural super classes expressing the family of the master originals and plural non-super classes expressing the master originals which are not the members of the family, and step for operating classification analysis into the super classes and the non-super classes, and generating the setting of a classification processing enabling the recognition of the master original.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、フォーム認識システム
に関し、より詳細には、認識度を増加させるためにラン
ダムノイズおよび系統的ノイズの歪みモデルを用いるフ
ォーム認識システムに関する。FIELD OF THE INVENTION This invention relates to form recognition systems, and more particularly to form recognition systems that use random noise and systematic noise distortion models to increase recognition.

【０００２】[0002]

【従来の技術】フォームからデータを抽出するためにコ
ンピュータシステムを使用する１つの手法が、R.G.Case
y およびD.R.Fergusonによる論文「インテリジェントフ
ォーム処理(Intelligent Forms Processing)」（IBM Sy
stems Journal 、巻29、第３号、1990年、435 −450
頁）に述べられている。そこで開示される手法は、フォ
ームが走査されて画像の電子的なビットマップ表現を生
成するフォーム認識処理である。種々の画像処理技法を
用いるとき、そのビットマップフォームは、テンプレー
ト（原型）としてデータベースに記憶された他のビット
マップフォームと一致するまで分析及び比較される。一
致しない場合、ユーザは、コンピュータと対話し、テン
プレートのデータベースに新規の追加テンプレートを特
定することによって、新しい種類のフォームを入力しな
ければならない。この開示文献が自動的なフォームの認
識、すなわち、自動的なデータの抽出を扱うものである
としても、フォームが認識されてデータがうまく抽出さ
れる以前に、そのシステムはフォームに関して予め知識
をもつことが前提とされる。また、この文献において扱
われるものは、誤った位置合わせの走査によって発生す
る画像のスキューとオフセットを補正する概念である。
しかしながら、システムにおいて発生するかもしれない
その他の定誤差および確率的誤差に適応しない。したが
って、このシステムに取り込まれない誤差が不認識の誤
り率を高くさせる。より一層複雑なシステムが要求され
るのである。BACKGROUND OF THE INVENTION One method of using computer systems to extract data from forms is RGCase.
y and DR Ferguson, "Intelligent Forms Processing," IBM Sy
stems Journal, Volume 29, No. 3, 1990, 435-450
Page). The technique disclosed therein is a form recognition process in which the form is scanned to produce an electronic bitmap representation of the image. When using various image processing techniques, the bitmap form is analyzed and compared until it matches another bitmap form stored in the database as a template. If there is no match, the user must enter the new type of form by interacting with the computer and identifying the new additional template in the template's database. Even though this disclosure deals with automatic form recognition, i.e. automatic data extraction, the system has a prior knowledge of the form before it is recognized and the data is successfully extracted. Is assumed. Further, what is dealt with in this document is a concept of correcting an image skew and an offset caused by an erroneous alignment scanning.
However, it does not accommodate other constant and stochastic errors that may occur in the system. Therefore, errors not captured in this system increase the unrecognized error rate. More complex systems are required.

【０００３】Casey およびFergusonによる論文に類似し
て、米国特許第4,933,979 号はフォームシートの読み取
り装置を開示する。このシステムではフォームが走査さ
れて、ユーザのために図形表示端末に表示される電子的
なビットマップを生じることとなる。次に、ユーザは図
形入力インタフェースを用いて、表示されたビットマッ
プ上でブランクフォームを構成する項目を選択する。例
えば、ユーザはフォームに関して出された疑問に答える
情報を有するボックスの概略を描いてもよい。そのユー
ザの指示は第１ファイルに記憶され、後にテンプレート
として使用される。フィールドにデータを有するその原
稿は再度走査されて第２ファイルに保管（セーブ）され
る。そして、第１ファイルのビットマップ、即ち、テン
プレートが第２ファイルのビットマップから差し引か
れ、その結果、抽出され、さらに分析される非テンプレ
ート情報、即ち、データ、のみが残される。いったんフ
ォームがテンプレートとして記憶されると、走査される
その後のフォームがいくつかの記憶されたテンプレート
と照合され、データが抽出される。しかしながら、米国
特許第4,933,979 号は、画像のスキューを補償するだけ
で、システムにおいて発生するかもしれないその他の定
誤差および確率的誤差をアドレス指定しないだけでなく
補償もしていない。また、米国特許第4,933,979 号は、
フィールドを検出するためにフォーム上に少なくとも１
つの線があることを必要とし、それによって、フィール
ドを指定する線を持たないフォームを認識する可能性を
除去する。Similar to the article by Casey and Ferguson, US Pat. No. 4,933,979 discloses a foam sheet reader. In this system, the form will be scanned to produce an electronic bitmap that will be displayed on the graphical display terminal for the user. Next, the user uses the graphic input interface to select the items constituting the blank form on the displayed bitmap. For example, the user may sketch a box with information that answers the questions raised about the form. The user's instructions are stored in the first file and later used as a template. The original having data in the field is scanned again and saved in the second file. Then, the bitmap of the first file, i.e. the template, is subtracted from the bitmap of the second file, so that only non-template information, i.e. data, is extracted and further analyzed. Once the form is stored as a template, subsequent forms that are scanned are matched against some stored template and data is extracted. However, U.S. Pat. No. 4,933,979 only compensates for image skew and does not address or compensate for other constant and stochastic errors that may occur in the system. Also, U.S. Pat.No. 4,933,979
At least one on the form to detect fields
It requires that there be two lines, thereby eliminating the possibility of recognizing forms that do not have lines that specify fields.

【０００４】したがって、新規の固定形式の原稿の種類
を識別し、特徴を明らかにして利用し、データベースに
追加するための新しい手法は利点であり、かつ有益であ
る。その新しい手法は、フォームの再設計を必要とせ
ず、経時収集および外国フォームに使用することがで
き、異なる種類のフォームを追加および削除するのに拡
張しやすい。また、多くの数（1000より多い）の異なる
種類のフォームを識別して分類することができ、オフィ
ス環境に発見される定誤差および確率的誤差の共通の発
生源を利用し、補正して耐久性を確実とする。Therefore, a new approach for identifying, characterizing, utilizing, and adding new fixed format manuscript types to databases is both an advantage and a benefit. The new approach does not require form redesign, can be used for time-gathering and foreign forms, and is easy to extend to add and remove different types of forms. It can also identify and classify a large number (more than 1000) of different types of foam, take advantage of common sources of constant and stochastic errors found in the office environment, and compensate and endure them. Ensure sex.

【０００５】[0005]

【課題を解決するための手段と作用】本発明は、ランタ
イム動作の間、および、どのフォームをシステムが認識
すべきかをシステムに教える（学習モード）間、系統的
ノイズおよびランダムノイズの発生源を利用するフォー
ム認識システムを提供する。まず最初にフォームをビッ
トマップにディジタル化し、認識処理を用いて公知のフ
ォームとしてビットマップを識別することによって、情
報がフォームから抽出される。このステップの後、ディ
ジタル化されたフォームにおける情報が、フォームの種
類に関する知識によって支援される認識処理によって抽
出される。さらに、システムは訓練することが可能であ
り、ランダムノイズの異なる発生源を取り扱い、多くの
数のフォームの種類の存在において良好な認識精度を達
成するように設計される。SUMMARY OF THE INVENTION The present invention identifies sources of systematic and random noise during run-time operation and during teaching the system which form the system should recognize (learn mode). Provide a form recognition system to use. Information is extracted from the form by first digitizing the form into a bitmap and using a recognition process to identify the bitmap as a known form. After this step, the information in the digitized form is extracted by a recognition process supported by knowledge about the form type. Moreover, the system is trainable and designed to handle different sources of random noise and achieve good recognition accuracy in the presence of large numbers of foam types.

【０００６】本発明は、学習モードの間原稿認識システ
ムにおいて使用される、マスター原稿の認識を可能にす
る分類処理の設定を決定するための方法を提供する。そ
の方法は、１つのマスター原稿を複数回入力することに
よってそのマスター原稿を表現する複数のマスタービッ
トマップを生じるステップと、複数のマスタービットマ
ップから特徴を抽出するステップと、特徴をランダムモ
デルパラメータ分布と組み合わせることによって属性を
生成し、その結果として、複数のマスタービットマップ
の数より実質的に多い数の複数の属性を生じるステップ
と、すべてのマスター原稿に対して上述のステップを繰
り返すステップと、すべてのマスター原稿からの属性に
クラスタ分析を行って、クラスタ分析がマスター原稿の
ファミリー（族）を表現する複数のスーパークラスと複
数の非スーパークラスを発生するステップと、スーパー
クラスと非スーパークラスに分類分析を行い、その結果
として、マスター原稿の認識を可能にする分類処理設定
を生じるステップと、を備える。The present invention provides a method for determining a classification process setting that enables recognition of a master document used in a document recognition system during a learning mode. The method includes the steps of generating a plurality of master bitmaps expressing the master original by inputting one master original a plurality of times, extracting a feature from the plurality of master bitmaps, and distributing the features to a random model parameter distribution. Generating an attribute by combining with, resulting in a plurality of attributes that is substantially greater than the number of master bitmaps, and repeating the above steps for all master manuscripts, Performing a cluster analysis on attributes from all master manuscripts, where the cluster analysis yields multiple superclasses and multiple non-superclasses representing a family of master manuscripts, with superclasses and non-superclasses Performs a classification analysis and, as a result, the master Comprising the steps of: causing the classification process settings that allow recognition of a document, the.

【０００７】さらに、本発明は、ランタイムモードの
間、固定形式の原稿を表現する分類番号を、公知の組か
ら決定する方法を提供し、その方法は、電子的なビット
マップを生成するためにその固定形式の原稿を入力する
ステップと、そのビットマップを処理して分類の特徴だ
けを含む第２ビットマップが生成されるステップと、第
２ビットマップから分類の特徴を抽出するステップと、
分類の特徴をランダムモデルパラメータ分布と組み合わ
せて第２ビットマップに関する複数の属性を生成するス
テップと、第２ビットマップの属性を分類し、その結果
として、少なくとも１つのクラス番号を生じるステップ
と、を備える。Further, the present invention provides a method for determining a classification number representing a fixed-format manuscript from a known set during run-time mode, the method for generating an electronic bitmap. Inputting the fixed format manuscript, processing the bitmap to generate a second bitmap containing only classification features, and extracting classification features from the second bitmap;
Combining the classification features with a random model parameter distribution to generate a plurality of attributes for the second bitmap, and classifying the attributes of the second bitmap, resulting in at least one class number. Prepare

【０００８】[0008]

【実施例】図１を参照すると、本発明を利用する一般的
なハードウェアのシステム構成が示される。システム１
は、原稿を提供し、ビットマップの形式でその原稿のデ
ィジタル電子的表現を生成する構内（局所）入力サービ
ス２を備える。また、ビットマップ画像を生成する遠隔
入力サービス３が示され、それは、画像をディジタル化
してその画像のディジタル表現を電話回線あるいは同等
のものを用いて遠隔地へ送信するファクシミリ装置の形
式であってもよい。あるいは、遠隔入力サービス３は、
ページ記述言語(PDL) の形式で表現する他のコンピュー
タから情報を得ることもでき、したがって、そのページ
記述言語を使用可能なビットマップに変換する。構内入
力サービス２および遠隔入力サービス３によって生成さ
れるビットマップは、ワークステーション４によって受
信される。ワークステーション４は、オペレータがその
ビットマップを見ることを可能にし、また、編集、テン
プレートの生成、およびファイリングのようなビットマ
ップに関する種々のその他の管理機能を行うことを可能
にする。ワークステーション４は、後述されるようなビ
ットマップへの処理機能を実行する計算機サービス５に
接続される。また、計算機サービス５は、プリンター
６、データベース記憶ユニット９、および保存記憶ユニ
ット８のようなシステム１におけるその他の機能的ユニ
ットに対して、ビットマップを含む情報を分配するため
のディストリビュータとして作用する。上述したシステ
ム１は、１つの可能なハードウェアシステム構成の例で
あり、図１に示される構成要素に制限されることを意味
するものではない。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT Referring to FIG. 1, a general hardware system configuration utilizing the present invention is shown. System 1
Comprises a local (local) input service 2 which provides a manuscript and produces a digital electronic representation of the manuscript in the form of a bitmap. Also shown is a remote input service 3 that produces a bitmap image, which is in the form of a facsimile machine that digitizes the image and sends a digital representation of the image to a remote location using a telephone line or equivalent. Good. Alternatively, the remote input service 3
Information can also be obtained from other computers that represent in the form of a page description language (PDL), thus converting that page description language into a usable bitmap. The bitmaps generated by the local input service 2 and the remote input service 3 are received by the workstation 4. The workstation 4 allows the operator to view the bitmap and perform various other management functions regarding the bitmap, such as editing, template generation, and filing. The workstation 4 is connected to a computer service 5 that performs a bitmap processing function as described below. The computer service 5 also acts as a distributor for distributing information including bitmaps to other functional units in the system 1 such as the printer 6, the database storage unit 9, and the storage storage unit 8. The system 1 described above is an example of one possible hardware system configuration and is not meant to be limited to the components shown in FIG.

【０００９】ここで図２を参照すると、フォーム認識シ
ステム１０の概略が示される。図２に示されるそれぞれ
のサブシステムのブロックは、以下で図３から図１０を
参照してより詳細に説明される。明らかなように、シス
テム１０は訓練モードおよびランタイムモードで作動さ
れる。訓練モードあるいは学習モードは、システム１０
が認識する必要がある固定形式原稿をシステム１０に教
えるためのモードである。ランタイムモードでの動作
は、原稿を認識することを可能とし、またその原稿上の
情報を自動的に抽出することを可能とする。Referring now to FIG. 2, a schematic of form recognition system 10 is shown. The blocks of each subsystem shown in FIG. 2 are described in more detail below with reference to FIGS. 3-10. As will be appreciated, system 10 is operated in a training mode and a run time mode. The training mode or the learning mode is the system 10
This is a mode for teaching the system 10 a fixed format manuscript that the user needs to recognize. The operation in the run-time mode makes it possible to recognize the document and also to automatically extract the information on the document.

【００１０】システム１０の概略が示されるように、開
始ブロック１００においては、図１の構内入力サービス
２あるいは遠隔入力サービス３のようなディジタル化ソ
ースからの圧縮されたビットマップ画像が圧縮解除され
て、前処理ブロック２００に供給される。また、そのビ
ットマップ画像はどの外部入力デバイスから入力された
かに関する符号化された情報を含む。前処理ブロック２
００は、スキューの補正および解像度の変換のような画
像操作を実行し、データ抽出ブロック３００および原稿
認識ブロック４００によって処理するためのビットマッ
プを準備する。原稿認識ブロック４００は、テンプレー
トとして記憶されたどのフォームが走査されたビットマ
ップのフォームに一致するかを決定し、分類番号あるい
はそのテンプレートに属する分類番号の順位づけられた
リストをデータ抽出ブロック３００に送出する。分類番
号は、どのフォームが使用されるかを識別し、そのフォ
ームのテンプレートを識別し、抽出されたデータを処理
するのに必要な認識システムの種類と共に、ビットマッ
プ上のどの領域かを識別して、データを抽出するためテ
ンプレート上のそれぞれのフィールドを見つける。例え
ば、テンプレートは、テキストのフィールドはどこを見
つけるか、そのフィールドのテキストデータのテキスト
認識はどこに適用するか、を示す。この分類番号を用い
ると、データ抽出ブロック３００は、ビットマップから
情報を抽出し、またいくつかのより小さいビットマップ
と実際のデータの内の少なくとも一方を表現することが
できて、後のデータベースへの記憶となる。As shown schematically of system 10, at start block 100, a compressed bitmap image from a digitized source, such as local input service 2 or remote input service 3 of FIG. 1, is decompressed. , To the pre-processing block 200. Also, the bitmap image contains coded information about which external input device the input is from. Preprocessing block 2
00 performs image operations such as skew correction and resolution conversion, and prepares the bitmap for processing by the data extraction block 300 and the document recognition block 400. The manuscript recognition block 400 determines which form stored as a template matches the scanned bitmap form and places a ranked list of classification numbers or classification numbers belonging to that template in the data extraction block 300. Send out. The classification number identifies which form is used, identifies the template for that form, and identifies the area on the bitmap, along with the type of recognition system needed to process the extracted data. Find each field on the template to extract the data. For example, the template indicates where a field of text is found and where the text recognition of the textual data for that field applies. Using this classification number, the data extraction block 300 can extract information from the bitmap and represent some of the smaller bitmaps and / or the actual data to a later database. Will be the memory of.

【００１１】データ検証ブロック５００は、いくつかの
より小さいビットマップから抽出される情報が有効かど
うかを決定する。もし有効でなければ、そのデータはオ
ペレータによる追加の検査のために手動補正ブロック６
００に送出される。もしビットマップから抽出されたデ
ータが完全かつ正確であれば、データベースへの記憶の
ためにデータベース挿入ブロック７００に送出される。
もしシステムに新規のフォーム形式を有するビットマッ
プがシステムに入力される必要がある場合は、そのフィ
ールドを認識しデータを抽出を可能にするために、この
新規のフォームのためのテンプレートを追加する特殊処
理が実行されなければならない。これは学習モードの間
に発生する。この場合、認識初期化システムブロック１
０００が、システムにテンプレートを追加あるいは削除
するのに必要な機能を実行し、後にランタイム動作の間
の自動認識およびデータ抽出を可能にする。さらに、認
識初期化システムブロック１０００は、システム１０の
全体にわたって使用されるランダムモデルパラメータお
よび系統的モデル補正係数を導出する。The data verification block 500 determines if the information extracted from some smaller bitmaps is valid. If not valid, the data is manually corrected block 6 for additional inspection by the operator.
Sent to 00. If the data extracted from the bitmap is complete and accurate, it is sent to the database insert block 700 for storage in the database.
If the system needs a bitmap that has a new form format to be entered into the system, a special template that adds a template for this new form to recognize the field and allow data extraction. Processing must be performed. This happens during the learning mode. In this case, the recognition initialization system block 1
000 performs the functions required to add or remove templates to the system, allowing automatic recognition and data extraction later during runtime operation. In addition, the recognition initialization system block 1000 derives random model parameters and systematic model correction factors used throughout the system 10.

【００１２】図７から図１０を参照すると、認識初期化
システムブロック１０００の詳細なフロー図が示され
る。認識初期化システムブロック１０００は、新規フォ
ーム導入ブロック１１００、新規フォーム訓練ブロック
１５００、スキャナー較正（キャリブレーション）ブロ
ック１８００、の３つの部分からなる。認識初期化シス
テムブロック１０００の機能は、耐久性のある自動化さ
れた方法を用いてフォーム認識のための新規のテンプレ
ートをシステム１０に追加することである。また、認識
初期化システムブロック１０００を履行することはシス
テムの学習モードとして参照される。一般的に、学習を
可能にするには、オペレータが新規のブランクフォーム
あるいはマスターを走査してこれらのマスターのビット
マップを生成する。以下で明らかになるように、ランダ
ムモデルパラメータおよび系統的モデル補正係数が適用
される。特徴の抽出、属性の生成およびビットマップの
分類は、ランタイム動作の間に使用される分類処理の訓
練とともに実行される。マスターを生成するのに使用さ
れる装置の較正が、定誤差を決定するために実行され、
それは学習処理に含まれる。処理が完了すると、関連す
る分類番号および他の有益な情報を備えるマスターの表
現がテンプレートとしてテンプレート記憶部に追加され
る。Referring to FIGS. 7-10, a detailed flow diagram of the recognition initialization system block 1000 is shown. The recognition initialization system block 1000 consists of three parts: a new form introduction block 1100, a new form training block 1500, and a scanner calibration block 1800. The function of the recognition initialization system block 1000 is to add a new template to the system 10 for form recognition using a durable and automated method. Also, implementing the recognition initialization system block 1000 is referred to as the learning mode of the system. Generally, to enable learning, an operator scans new blank forms or masters to generate bitmaps for these masters. Random model parameters and systematic model correction factors are applied, as will become apparent below. Feature extraction, attribute generation and bitmap classification are performed with training in the classification process used during run-time operations. Calibration of the equipment used to generate the master is performed to determine the constant error,
It is included in the learning process. When the process is complete, the master's representation with associated classification numbers and other useful information is added to the template store as a template.

【００１３】一般的に、新規フォーム導入ブロック１１
００は、データベース情報ブロック１１８０、新規フォ
ームビットマップ１００４、および、較正シート１００
６、から入力を得て、システム１０に追加されるべきマ
スターを表現するビットマップに新規フォーム訓練ブロ
ック１５００を準備する。スキャナー較正ブロック１８
００は、較正シート１００６からの情報を用いて系統的
モデル補正係数１８６０およびランダムモデルパラメー
タ１８８０を提供する。新規フォーム訓練ブロック１５
００は、辞典記憶部１２４０、テンプレート記憶部１２
６０、スーパー分類１５４４、および、分類処理設定１
５５４を出力する。スキャナー較正システム１８００
は、分類処理の訓練に使用されるランダムモデルパラメ
ータ１８８０とともに、スキャナーおよびプリンターの
ための系統的モデル補正係数１８６０を出力する。以下
は認識初期化システム１０００の詳細な説明である。Generally, a new foam introduction block 11
00 is a database information block 1180, a new form bitmap 1004, and a calibration sheet 100.
Take the input from 6, and prepare the new form training block 1500 with a bitmap representing the master to be added to the system 10. Scanner calibration block 18
00 provides systematic model correction factors 1860 and random model parameters 1880 using information from calibration sheet 1006. New Form Training Block 15
00 is a dictionary storage unit 1240 and a template storage unit 12
60, super classification 1544, and classification processing setting 1
554 is output. Scanner calibration system 1800
Outputs systematic model correction factors 1860 for scanners and printers along with random model parameters 1880 used to train the classification process. The following is a detailed description of the recognition initialization system 1000.

【００１４】特に、図１０を参照すると、ディリースキ
ャナー較正ブロック１８００（図７）の詳細なフロー図
が示される。ブロック１８００の目的は、スキャナー較
正シート走査ブロック１８１０を用いて較正シート１０
０６（図７）を走査し、基準マーキング抽出ブロック１
８２０を用いて結果となるビットマップ１８１２からデ
ータを抽出し、ブロック１８３０を用いて系統的歪みモ
デルを導出し、ブロック１８４０を用いてランダム歪み
モデルを導出し、かつ、以下で詳細に記述されるように
認識システムのその他の部分で使用するため、系統的モ
デル補正係数１８６０とランダムモデルパラメータ１８
８０を出力する、ことである。In particular, referring to FIG. 10, a detailed flow diagram of the daily scanner calibration block 1800 (FIG. 7) is shown. The purpose of block 1800 is to scan the calibration sheet 10 using the scanner calibration sheet scan block 1810.
06 (FIG. 7), reference marking extraction block 1
820 is used to extract data from the resulting bitmap 1812, block 1830 is used to derive a systematic distortion model, block 1840 is used to derive a random distortion model, and are described in detail below. Systematic model correction factors 1860 and random model parameters 18 for use in the rest of the recognition system.
80 is output.

【００１５】系統的モデル補正係数１８６０の導出を容
易にするために、基準マークを備えたシートが較正シー
ト１００６として使用される。そのようなシートは、手
動あるいは機械測定のために、一連の容易に識別可能な
マーク、即ち、基準を含む。例えば、そのようなページ
は、ページ全体にわたってＸ方向とＹ方向の双方にそれ
ぞれ0.25インチ(0.635cm) の網目（グリッド）模様の一
連の小さな十字を含んでもよい。較正シート１００６
は、使用されるスキャナーの精度より高い配置精度を有
する装置によって測定される必要がある。To facilitate the derivation of the systematic model correction factor 1860, a sheet with fiducial marks is used as the calibration sheet 1006. Such sheets include a series of easily identifiable marks, or fiducials, for manual or mechanical measurement. For example, such a page may include a series of small crosses in a 0.25 inch (0.635 cm) grid pattern in both the X and Y directions throughout the page. Calibration sheet 1006
Must be measured by a device that has a higher placement accuracy than the accuracy of the scanner used.

【００１６】プリンター歪み情報モデル１８０２、ファ
クシミリ歪み情報モデル１８０３、写真複写機歪み情報
モデル１８０４、および種々の系統的補正係数を用い
て、ランダム歪みモデル導出ブロック１８４０はランダ
ムモデルパラメータ１８８０を提供する。システムに関
するランダムノイズは、機器のランダムな動作および機
器のランダムな使用からくるものである。したがって、
ランダムノイズの測定は種々の異なる方法でなされても
よい。ランダムな機器のバリエーション（変動）の場
合、較正測定あるいは種々の機器で行われるテストのい
ずれかにおいて収集されたデータは、機器の系統的な動
作からの偏差量あるいは偏差パターン（モデル）を生成
するのに使用されてよい。例えば、スキャナー較正の場
合、最小二乗法にあてはまる線から基準マークの行ある
いは列へのそれぞれの点の偏差がノイズの測度を与え
る。十分なデータ（数百のサンプル）によって分布関数
が決定されてもよい。ランダムノイズ分布に関するモデ
ルおよびモデルパラメータを生成するのに使用するた
め、曲線あてはめ関数がこのデータに適用されてもよ
い。Using printer distortion information model 1802, facsimile distortion information model 1803, photocopier distortion information model 1804, and various systematic correction factors, random distortion model derivation block 1840 provides random model parameters 1880. Random noise on the system comes from the random behavior of the equipment and the random use of the equipment. Therefore,
The measurement of random noise may be done in a variety of different ways. In the case of random instrument variations, the data collected, either in calibration measurements or tests performed on different instruments, will produce an amount of deviation or deviation pattern (model) from the systematic operation of the instrument. May be used for. For example, in the case of scanner calibration, the deviation of each point from the line fitting the least squares method to the row or column of fiducial marks gives a measure of noise. The distribution function may be determined by sufficient data (hundreds of samples). A curve fitting function may be applied to this data for use in generating the model and model parameters for the random noise distribution.

【００１７】ランダムノイズの第２の発生源は、異なる
装置の使用、あるいは較正が不明な装置の使用、または
異なる装置の使用の不明な順序、から生じる。ある無作
為の日に３つのプリンターの１つに印刷されたフォーム
を考える。そのフォームはその後複写機で複写され、送
出されてシステム１０の入力として走査される。せいぜ
い補正できるものは、そのフォームをシステム１０に走
査するのに使用されるスキャナーだけである。したがっ
て、フォームにおける変動は非常に意義があり、また多
くの未知の要素の結果として非常にランダムなものであ
る。A second source of random noise results from the use of different equipment, or the use of equipment of unknown calibration, or the unknown order of use of different equipment. Consider a form printed on one of three printers on a random day. The form is then copied on a copier, delivered and scanned as input to system 10. At best, only the scanner used to scan the form into the system 10 can be corrected. Therefore, the variation in form is very significant and is very random as a result of many unknown factors.

【００１８】ランダムノイズのこの第２の発生源の測定
は、使用される種々の機器で測定される系統的な歪みに
関するデータを編集することによって実現されてもよ
い。これらの機器のいくつかはシステム１０のオペレー
タの制御におえないものであり、省略時の設定に頼るこ
とを必要とする。これらのノイズ発生源のモデルを作成
するために、一定の期間にわたって、１日のさまざまな
時刻に機器に関する一連の測定を行う。また、データは
保守の間に位置合わせ許容範囲に携わる製造業者から収
集してもよい。このデータから、装置で起こる変動をプ
ロットすることができ、これらの変動を機器の仕様書に
決められたような範囲にとどめるようにしてもよい。ノ
イズの真のモデルを構成するには多くのデータが必要と
され（一般的に、モデルは構成されない）、モデルは提
供可能な測定の範囲といくつかの測定点とを備えると考
えられる。さらに、一般的なシナリオは、グローバル
（大域）ノイズモデルを捜し出すために、いくつかの異
なる装置からのデータを組み合わせることを包含する。
システムオペレータはこれらのモデルに対する遮断を変
更することができる。それぞれのモデルは、必要な位置
合わせの誤り率によって増強あるいは弛緩することがで
きる。The measurement of this second source of random noise may be accomplished by compiling data on systematic distortion measured on the various instruments used. Some of these instruments are out of the control of the operator of system 10 and require relying on default settings. To create a model of these noise sources, a series of measurements on the instrument are made at various times of the day over a period of time. Data may also be collected from manufacturers involved in alignment tolerances during maintenance. From this data, the variations that occur in the device can be plotted and these variations may be kept in the range as determined by the instrument specifications. Much data is needed to construct a true model of noise (generally the model is not constructed) and the model is considered to have a range of measurements that can be provided and some measurement points. Furthermore, a common scenario involves combining data from several different devices to find a global noise model.
The system operator can change the cutoff for these models. Each model can be augmented or relaxed with the required registration error rate.

【００１９】グローバルノイズモデルの生成は、ランダ
ムモデルパラメータを決定する１つの提案された手段で
ある。この方法においては、測定されるべき歪みを標準
化する（例えば、水平オフセットおよび垂直オフセット
と線形スケーリング）。次に、測定は、使用されること
が期待される、あるいは使用可能な機器の一般的状態を
示す、いくつかのプリンター、ファクシミリ装置および
複写機で行われる。それぞれの個々の機器は、上述した
較正シートおよび系統的なデータを抽出するための手順
を使用する。次に、これらのテストは一定の期間にわた
って行われる。複写機の使用が評価され、複写機の分布
とのたたみ込みによってプリンターのデータが増加され
る。データの増加が得られると、プリンターのデータに
たたみ込まれたプリンターおよび複写機の組み合わせ
が、プリンターのデータセット全体を生成するのに使用
される。一般的には、そのデータは、プリンターの比率
に応じて重みづけされる（もしプリンターＡが作業の50
％に使用され、プリンターＢおよびＣが作業の残りの部
分に使用される場合、プリンターＡのデータはＢとＣの
２倍の係数で重みづけされる）。適切なランダムモデル
は、このデータのプロットに基づいて選択されてもよ
い。次に、プログラムに適するモデルが使用されて、デ
ータをこのランダムモデルに適合させてもよい。ゆえ
に、ランダムモデルパラメータ１８８０が生成される。
ランダムモデルパラメータは、システム１０のすべての
ものが影響される可能性のある誤差の分布あるいは範囲
として表現される。Global noise model generation is one proposed means of determining random model parameters. In this method, the distortion to be measured is standardized (eg horizontal and vertical offsets and linear scaling). The measurements are then taken with a number of printers, fax machines and copiers that indicate the general condition of the equipment that is expected to be used or is available. Each individual instrument uses the calibration sheet and procedure for extracting systematic data described above. These tests are then run for a period of time. Copier use is evaluated and printer data is augmented by convolution with the copier distribution. Once the data growth is obtained, the combination of printer and copier convolved with the printer data is used to generate the entire printer data set. Generally, the data is weighted according to the proportion of printers (if printer A
%, And printers B and C are used for the rest of the work, the data for printer A is weighted by a factor of 2 times B and C). A suitable random model may be selected based on a plot of this data. A model suitable for the program may then be used to fit the data to this random model. Therefore, the random model parameter 1880 is generated.
Random model parameters are expressed as a distribution or range of errors that all of system 10 can affect.

【００２０】画像の歪みは画像全体に伝播することに注
意したい。例えば、スキューされた画像は、ページの上
部よりページの下部においてより大きな歪みを有する。
また、ここで論議される歪みは、事実上、線形あるいは
非線形のいずれであってもよいと仮定される。非線形の
歪みの例として、ページの長さ方向に101 ％の線形スケ
ーリングをまた幅の両端に99.5％の線形スケーリングを
示すプリンターが、位置(100,100) および位置(2000,20
00) にあると仮定される２つの画素を置き、それらの画
素を(101,99.5)および(2020,1990) の新しい場所にそれ
ぞれ移動することを考える。もしこのプリントアウト
が、両方向に98％の線形スケーリングを有する複写機に
よって写真複写されれば、２つの元の位置は(99.0,97.
5) および(1979.6,1950.2) にそれぞれシフトされるで
あろう。一方の点はそれが出発した場所からあまり遠く
に離れてはいないが、他方の点は相当の距離で離れてい
る。ゆえに、非線形の場合には累積誤差があることがわ
かる。もし何かが同一の量によって線形に増大かつ減少
されれば誤差はなくなる。しかしながら、そのことは非
線形の誤差に関しては真実ではなく、したがって、その
誤差を反転するために同一の相対する非線形関数が必要
とされる。Note that image distortion propagates throughout the image. For example, a skewed image has more distortion at the bottom of the page than at the top of the page.
Also, it is assumed that the distortions discussed here may be either linear or non-linear in nature. As an example of non-linear distortion, a printer that shows 101% linear scaling along the page length and 99.5% linear scaling across the width of the page has a position (100,100) and a position (2000,20).
Consider placing two pixels assumed to be at (00) and moving them to new locations at (101,99.5) and (2020,1990) respectively. If this printout is photocopied by a copier with 98% linear scaling in both directions, the two original positions are (99.0,97.
5) and (1979.6,1950.2), respectively. One point is not too far from where it left off, but the other is a considerable distance away. Therefore, it can be seen that there is a cumulative error in the nonlinear case. If something increases and decreases linearly with the same amount, the error goes away. However, that is not true for non-linear errors, so the same opposing non-linear function is needed to invert that error.

【００２１】ここで図８を参照すると、新規フォーム導
入ブロック１１００の詳細なフロー図が示される。新規
フォーム導入ブロック１１００の目的は、新規のフォー
ム、即ち、マスター（すなわち、ブランクフォーム）を
テンプレートの形態でシステム１０に追加することであ
る。マスターはデータによってうめられたフィールドを
持たない新規のフォームであるが、すべての構造（例え
ば、線、ボックスなど）およびフィールドを包含する。
テンプレートはマスターから生成される。そのテンプレ
ートは、ランタイム動作中に、フォームを認識しこれら
のフォームから情報を抽出するのを助けるために使用さ
れる。テンプレートはマスターの決まったフィールドを
指定する座標に関する情報および属性情報を含み、認識
システムがそれぞれのフィールド内の情報を識別するの
に使用される。線１１０１はブロック１１００の機能上
の区画を指定する。線１１０１の左側の機能は、システ
ム１０に記憶されようとするマスターのビットマップを
補正することである。線１１０１の右側の機能は、いっ
たん入力するビットマップが補正されると実行される動
作である。Referring now to FIG. 8, a detailed flow diagram of the new form introduction block 1100 is shown. The purpose of the new form introduction block 1100 is to add a new form, i.e. a master (i.e. blank form), to the system 10 in the form of a template. A master is a new form that has no fields populated by data, but includes all structures (eg lines, boxes, etc.) and fields.
The template is generated from the master. The template is used during run-time operation to help recognize forms and extract information from these forms. The template contains information about the coordinates that specify the master's fixed fields and attribute information that the recognition system uses to identify the information in each field. Line 1101 specifies the functional partition of block 1100. The function to the left of line 1101 is to correct the master bitmap that is about to be stored in system 10. The function to the right of line 1101 is the operation performed once the incoming bitmap has been corrected.

【００２２】また、マスターのプリントアウトによって
プリンター較正シートが印刷される。プリンター較正シ
ートから得られる情報は、系統的なプリンターの誤差を
補償する１セットの処理ルーチンに入る。プリンター較
正は、ビットマップ１１１２を提供するプリンター較正
走査ブロック１１１０によって行われる。そして、較正
シートを表現するビットマップ１１１２は、プリンター
補正係数が決定されるプリンター補正導出ブロック１１
２０に伝送される。同様の方法で、スキャナー較正が、
較正シートの電子的なバージョンを表現するビットマッ
プ１１３２を提供するスキャナー較正走査ブロック１１
３０によって実現される。ビットマップ１１３２が生成
され、系統的およびランダムスキャナー補正係数１１４
２が導出されるスキャナー補正導出ブロック１１４０に
渡される。この場合でのスキャナーおよびプリンターを
較正するのに使用される原稿は、図１０のブロック１８
００に関連して記述される原稿に類似する。Further, the printer calibration sheet is printed by the printout of the master. The information obtained from the printer calibration sheet enters a set of processing routines that compensate for systematic printer error. Printer calibration is performed by the printer calibration scan block 1110, which provides a bitmap 1112. The bitmap 1112 representing the calibration sheet is used in the printer correction derivation block 11 in which the printer correction coefficient is determined.
20 is transmitted. In a similar way, scanner calibration
Scanner calibration scan block 11 providing a bitmap 1132 representing an electronic version of a calibration sheet
It is realized by 30. A bitmap 1132 is generated and systematic and random scanner correction factors 114
2 is passed to the scanner correction derivation block 1140, where 2 is derived. The original used to calibrate the scanner and printer in this case is block 18 in FIG.
Similar to the manuscript described in connection with 00.

【００２３】較正は、一般的には、少なくとも１つの較
正シートによって行われる。ベルトの異なる個所は異な
る結果を有するため、ベルトシステムを用いる大きなプ
リンターに対してはいくつかのシートが必要とされる。
したがって、マスターに特有の系統的歪み較正モデルと
ランダム歪み較正モデルを生成するのを助ける複数のプ
リントアウト、および複数回の（例えば、５回）走査は
好ましい。その方法は、好ましくは少なくとも１つの較
正シートをプリントアウトし、スキャナーを較正し、プ
リンター較正シートを走査し、スキャナーの補正に基づ
いてプリンターの補正を導出し、系統的部分およびラン
ダムな部分の双方を有する全体の補正係数を導出するこ
とである。これらのランダムパラメータは、テンプレー
ト記憶部１２６０に入るものを導出する助けとなる。シ
ステムにおいては、実際には２つの異なるセットのラン
ダムパラメータが存在することに注意したい。全体補正
ブロック１１５０からのマスターに特有の系統的歪み較
正モデルおよびランダム歪み較正モデルは、較正に特有
のモデルが較正のみによって決定されるのに対して、シ
ステムモデルは多くの他の要因を考慮する点において、
システムモデルと異なる。ランダム較正パラメータは、
スキャナー較正およびプリンター較正のランダム補正係
数から導出され、また走査するときに使用されるプリン
トアウトの複数のマスターコピーの使用から導出され
る。これによって、較正されるときに、そのマスターの
セットに特有の独特の高度に同調されたランダム較正パ
ラメータをテンプレートツール１１９０に与える。これ
らのパラメータは日々変化する。Calibration is generally performed by at least one calibration sheet. Because different parts of the belt have different results, several sheets are needed for large printers that use belt systems.
Therefore, multiple printouts and multiple (eg, 5) scans to help generate master-specific systematic and random distortion calibration models are preferred. The method preferably prints out at least one calibration sheet, calibrates the scanner, scans the printer calibration sheet, derives the printer's corrections based on the scanner's corrections, both systematic and random parts. Is to derive the overall correction factor with. These random parameters help derive what goes into the template store 1260. Note that in the system there are actually two different sets of random parameters. The master-specific systematic distortion calibration model and the random distortion calibration model from the global correction block 1150 are calibrated only, while the system model considers many other factors. In terms of
Different from the system model. The random calibration parameters are
Derived from random correction factors for scanner and printer calibrations, and from the use of multiple master copies of the printout used when scanning. This provides the template tool 1190 with unique, highly tuned random calibration parameters that are unique to the set of masters when calibrated. These parameters change daily.

【００２４】全体補正ブロック１１５０は、スキャナー
補正係数とプリンター補正係数とを結合して、画像再位
置合わせブロック１１７０によって使用される１セット
の補正係数、系統的歪み較正モデル、およびランダム歪
み較正モデルを生じる。スキャナー補正係数１１４２お
よびプリンター補正係数１１２２がわかるため、新規フ
ォーム走査ブロック１１６０を用いて数回新規のマスタ
ーを走査し、その結果であるビットマップ１１６２を受
け取り、その日のこれら予め定められた系統的較正係数
によって、画像再位置合わせブロック１１７０を用い、
ビットマップ１１６２の画像を再位置合わせする。ま
た、このほとんど理想的なマスター画像を生成する処理
である画像再位置合わせブロック１１７０は、ランタイ
ム動作において使用されるマスターからの位置合わせ係
数３０８（図５）を生じる。さらに、画像再位置合わせ
ブロック１１７０はランダム歪み較正パラメータを、較
正システムで見つけられるランダム補正係数の範囲ある
いは分布を補償するため、テンプレートツール１１９０
に渡す。The global correction block 1150 combines the scanner correction factor and the printer correction factor into a set of correction factors used by the image realignment block 1170, a systematic distortion calibration model, and a random distortion calibration model. Occurs. Since the scanner correction factor 1142 and the printer correction factor 1122 are known, the new form scan block 1160 is used to scan the new master several times and the resulting bitmap 1162 is received, these predetermined systematic calibrations for the day. Depending on the coefficients, the image realignment block 1170 is used,
Re-register the image of bitmap 1162. The image realignment block 1170, which is the process of producing this nearly ideal master image, also produces the registration factor 308 (FIG. 5) from the master used in run-time operation. In addition, the image re-registration block 1170 uses template tools 1190 to compensate the random distortion calibration parameters to the range or distribution of random correction factors found in the calibration system.
Pass to.

【００２５】ビットマップ１１７２の形態である再位置
合わせされたマスターは、ランダム歪み較正パラメータ
とともにテンプレートツール１１９０に渡される。テン
プレートツール１１９０は、そのマスターからシステム
１０の認識部分およびデータ抽出部分で使用されるテン
プレートを組み立てるためのものである。テンプレート
ツール１１９０は、マスターのビットマップをグラフィ
ックワークステーションに表示させることができるオペ
レータによって手動で実行される。オペレータは、ある
フィールドがビットマップのどこに存在するかをグラフ
ィックワークステーションの画面に指定することができ
る。The realigned master in the form of bitmap 1172 is passed to the template tool 1190 along with random distortion calibration parameters. The template tool 1190 is for assembling the template used in the recognition part and the data extraction part of the system 10 from its master. The template tool 1190 is manually executed by an operator who can display the master bitmap on a graphics workstation. The operator can specify on the screen of the graphic workstation where a field is in the bitmap.

【００２６】テンプレートツール１１９０を用いて座標
を指定するとき、オペレータはビットマップのフィール
ドの角のような座標(x,y) を選択してもよい。しかし、
ランダムモデルパラメータによれば、座標(x + 2, y +
4)もまた同一のフィールドの同一の角を表現する。した
がって、オペレータによって指定されたフィールドのい
ずれの座標に対しても、ランダム較正パラメータが適用
されなければならない。ランダム較正パラメータの全範
囲（あるいは分布）の適用は、本質的に、それぞれのマ
スターに対していくつかのテンプレートを指定し、それ
ぞれのテンプレートはオペレータによって指定された元
のテンプレートから一定の量だけシフトされている。換
言すれば、オペレータによって指定される座標を有する
テンプレートならば、そのテンプレートに統計的なラン
ダム較正パラメータの分布を適用することによって、い
くつかの可能なテンプレートに同等のものがすべて同一
のマスターを表現することになる。基本的に、モデルは
マスターの指定されたフィールドの回りにファジイボッ
クスを画定する。ランダム較正パラメータは、ぼやけた
（ファジィ）層がボックスの回りにどの程度の厚さであ
るかを定義する。ぼやけは、ものがより確かになるとそ
のぼやけた層はより薄くなるような不確かさに等しい。
ここで使用されるぼやけ（ファジィ）係数は、認識に関
してよりクリッパーに関するものである。認識システム
はシステムランダムモデルパラメータを使用して、特徴
および属性によって不確かさを定義する。ただ１つのテ
ンプレートが記憶されるが、ランダム分布の記憶が多く
のテンプレートを記憶させる効果をもたらす。When designating coordinates using the template tool 1190, the operator may select coordinates (x, y) such as the corners of a field in a bitmap. But,
According to the random model parameters, the coordinates (x + 2, y +
4) also represents the same corner of the same field. Therefore, for any coordinate in the field specified by the operator, a random calibration parameter must be applied. Applying the full range (or distribution) of random calibration parameters essentially specifies several templates for each master, and each template is shifted by a certain amount from the original template specified by the operator. Has been done. In other words, if the template has coordinates specified by the operator, then applying the distribution of statistical random calibration parameters to that template, the equivalent of some possible templates all represent the same master. Will be done. Basically, the model defines a fuzzy box around a designated field in the master. The random calibration parameter defines how thick the fuzzy layer is around the box. Blur is equal to the uncertainty that the blurred layer becomes thinner as the object becomes more certain.
The fuzzy factors used here are more about the clipper with respect to recognition. The recognition system uses system random model parameters to define the uncertainty by features and attributes. Although only one template is stored, the random distribution memory has the effect of storing many templates.

【００２７】マスターはそれ自体に基準を持たない可能
性が高いため、位置合わせの機能が必要とされる。ゆえ
に、システムはフォームに特定のマーキングを見つけ
て、それを位置合わせの点（例えば、線の交点など）と
して使用する。次に、システムは基準のセットを見つけ
て、原稿を移動して位置合わせするために、特定のフォ
ームに対してカスタマイズされた基準のセットを使用す
ることができる。したがって、認識処理の間で、認識さ
れるべきフォームはテンプレートの統計的な誤差の範囲
内であり、かつマスターと同じフォームの種類であるも
のとしてそのまま認識される。Since the master is likely to have no reference on its own, a registration function is required. Therefore, the system finds a particular marking on the form and uses it as an alignment point (eg, line intersection, etc.). The system can then find the set of fiducials and use the customized set of fiducials for the particular form to move and align the document. Thus, during the recognition process, the form to be recognized is still within the statistical error of the template and is of the same type as the master.

【００２８】ビットマップ１１９２が、テンプレートツ
ール１１９０から位置合わせ特徴処理ブロック１２１０
に渡される。この位置合わせ特徴処理ブロック１２１０
において、上述されたランダム較正パラメータを補償す
るためにオペレータが指定したテンプレートに補正がな
される。A bitmap 1192 is created from the template tool 1190 and the alignment feature processing block 1210 is used.
Passed to. This alignment feature processing block 1210
At, the template specified by the operator is corrected to compensate for the random calibration parameters described above.

【００２９】テンプレートツール１１９０は、対応する
ランダム較正パラメータとともにオペレータが画定した
テンプレート１１９６と、テンプレートフォーム識別１
１９８と、をテンプレート記憶部１２６０に記憶する。
テンプレート特徴処理ブロック１２１０は、位置合わせ
パラメータ１２１２をテンプレート記憶部１２６０に記
憶する。位置合わせパラメータ１２１２は、ランダム較
正パラメータから導出するテンプレートに関する許容誤
差範囲の分布である。したがって、テンプレート記憶部
１２６０に記憶されるそれぞれのテンプレートに対し
て、「最善の場合の」テンプレートと、そのテンプレー
トに適用される許容誤差範囲の分布が記憶される。別の
方法は、テンプレート記憶部に記憶する前にランダム較
正パラメータを理想的なテンプレートに適用することに
よって、それぞれのマスターに対していくつかのテンプ
レートが記憶される。The template tool 1190 includes an operator-defined template 1196 with corresponding random calibration parameters and a template form identification 1
And 198 are stored in the template storage unit 1260.
The template feature processing block 1210 stores the alignment parameter 1212 in the template storage unit 1260. The registration parameter 1212 is the distribution of the allowable error range regarding the template derived from the random calibration parameter. Therefore, for each template stored in the template storage unit 1260, the “best case” template and the distribution of the allowable error range applied to the template are stored. Another method stores some templates for each master by applying random calibration parameters to the ideal template before storing it in the template store.

【００３０】マスターのフォームが生成される前に、し
ばしば、マスターのフォームを生成するものは、フォー
ムにどのような種類のフィールドが必要とされるかを知
っており、またいくつかの場合、フォームが使用される
とそれらのフィールドにどのような情報が供給されるか
を知っている。例えば、フォームがそのフォームを使用
する人の性別を指定するフィールドを必要とすれば、フ
ォームを生成するものはキーワードの「男性」あるいは
「女性」がフィールドに表示されることを期待できる。
コンピュータデータベース１１８０はよく、フォームの
それぞれのフィールドからの情報を容易に記憶するため
の記憶領域を有し、かつ、どのような種類のデータを期
待するかに関する情報を包含するように設定される。シ
ステム１０のランタイム認識動作においてわかるよう
に、フォームのフィールドからの情報が抽出されてデー
タベース１１８０に記憶される。ビットマップ１１９２
の知的使用を行うために、データの抽出に先立って辞典
ツール１２００はデータベース１１８０に記憶された情
報を使用する。辞典ツール１２００は、データベース予
想が完成されたフォームで使用されるキーワードを使用
して、ビットマップ１１９２からフィールドを検索する
ために設定される。これらのキーワードは辞典記憶部１
２４０に記憶され、後に認識システムの後処理機能によ
って使用される。Before the master form is generated, the ones that generate the master form often know what kind of fields are required in the form, and in some cases, the form. Know what information will be provided in those fields when used. For example, if a form requires a field that specifies the gender of the person using the form, then the creator of the form can expect the keywords "male" or "female" to appear in the field.
Computer database 1180 is often set up to have storage areas for easily storing information from the respective fields of the form, and to include information regarding what kind of data to expect. Information from the fields of the form is extracted and stored in the database 1180, as can be seen in the runtime recognition operation of the system 10. Bitmap 1192
To make intelligent use of the dictionary tool 1200 uses the information stored in the database 1180 prior to extracting the data. The dictionary tool 1200 is configured to retrieve fields from the bitmap 1192 using the keywords used in the completed database prediction form. These keywords are dictionary storage 1
Stored at 240 for later use by post processing functions of the recognition system.

【００３１】特質認識処理ブロック１２２０はビットマ
ップ１１９４を使用して、テンプレートとともに記憶さ
れるべき有用な情報を決定する。例えば、テンプレート
のフィールドが検査ボックス認識のために指定されれ
ば、特質認識処理ブロック１２２０はそのフィールド内
のビットがどれだけマスターに設定されるかを決定す
る。その情報によって、検査ボックス認識システムは入
力されるフォームからのフィールドにどれだけのビット
が設定されているかをカウントし、マスターにどれだけ
のビットが設定されているか減算するだけで、そのフィ
ールドのボックスが設定されているかどうかを決定す
る。もう１つの例は、ＯＣＲ処理のために予定されたフ
ィールドが色づけされた背景を持っていればＯＣＲプリ
プロセッサに警告することである。最後の例は、フィー
ルドに入力されるデータの種類（例えば、電話番号）で
従来から知られているものである。多くの他の特質認識
処理機能が考えられるが、ブロック１２２０は、ここで
例として記述されたものに制限されない。The feature recognition processing block 1220 uses the bitmap 1194 to determine useful information to be stored with the template. For example, if a field of the template is designated for check box recognition, the feature recognition processing block 1220 determines how many bits in that field are set to master. With that information, the inspection box recognition system counts how many bits are set in the field from the form being filled in, and simply subtracts how many bits are set in the master to get the box for that field. Determines if is set. Another example is to alert the OCR preprocessor if a field intended for OCR processing has a colored background. The last example is the type of data entered in the field (eg a telephone number), which is conventionally known. Many other feature recognition processing functions are possible, but block 1220 is not limited to those described herein as examples.

【００３２】画像再位置合わせブロック１１７０からの
ビットマップ１１７４は、画像訓練（トレーニング）記
憶部１２８０を生成するのに使用され、ここで画像が新
規フォーム訓練ブロック１５００（図９）を訓練するた
めに保持される。明らかなように、テンプレートツール
１１９０には１つのビットマップだけが入力されるのに
対して、マスターの複数回の走査から生成される複数の
ビットマップが、画像訓練記憶部１２８０に入力され
る。画像訓練記憶部１２８０が図９を用いて詳細に記述
される。The bitmap 1174 from the image realignment block 1170 is used to generate an image training (training) store 1280 where the image is trained in the new form training block 1500 (FIG. 9). Retained. As is apparent, only one bitmap is input to the template tool 1190, whereas multiple bitmaps generated from multiple scans of the master are input to the image training store 1280. The image training storage unit 1280 will be described in detail with reference to FIG.

【００３３】図９に戻ると、新規フォーム訓練ブロック
１５００の詳細なフロー図が示される。新規フォーム訓
練ブロック１５００の目的は、認識可能なフォームの追
加あるいは削除を可能とし、またランタイム動作中に使
用される分類処理のための訓練セットを生成することで
ある。認識システムからマスターが追加あるいは削除さ
れれば、マスターが追加あるいは削除されるたびに分類
処理は再訓練されなければならない。訓練のために使用
される特徴および属性が変わらないままである限りは、
新規のマスターが追加あるいは削除されるたびに処理を
再実行する代わりに、特徴抽出の中間結果が中間結果記
憶部１５２１に保管されてもよい。画像訓練記憶部１２
８０は、フォーム認識システムが所定時に認識しなけれ
ばならないすべてのマスターフォームのビットマップを
包含する。再訓練するのに必要な作業の量は、使用され
る分類処理の種類によって何度も変化する。Returning to FIG. 9, a detailed flow diagram of the new form training block 1500 is shown. The purpose of the new form training block 1500 is to allow the addition or deletion of recognizable forms and to generate a training set for the classification process used during runtime operation. If a master is added or removed from the recognition system, the classification process must be retrained each time a master is added or removed. As long as the features and attributes used for training remain unchanged,
Instead of re-executing the process each time a new master is added or deleted, the intermediate result of feature extraction may be stored in the intermediate result storage unit 1521. Image training storage unit 12
80 contains a bitmap of all master forms that the form recognition system must recognize at a given time. The amount of work required to retrain will change many times depending on the type of classification process used.

【００３４】画像訓練記憶部１２８０において画像の完
全な訓練セットを有するために、マスターの複数のコピ
ーが走査され、そして数回（例えば、３から５回）印刷
され、図８を用いて説明されたようにビットマップが記
憶される。説明のために、単一のフォームに対する複数
回走査された画像の複数のビットマップは、以下では
「生ファミリーセット(raw family set)」として参照さ
れる。特徴抽出ブロック１５１０は、画像処理機能を利
用して生ファミリーセットのそれぞれのビットマップか
ら分類のための特徴を抽出する。例えば、抽出される特
徴が水平な線であれば、水平線以外のすべてのものをビ
ットマップから削除するために、形態フィルター(morph
ological filter)が使用されてもよい。もう１つの例
は、ビットマップのテキストの傾きの方向を決定するた
めに、ＯＣＲを使用することである。分類において使用
される特徴は、媒体上の画像の位置モード（人物画に対
する風景画）、水平線の数、垂直線の数、中間調領域の
数とそれらの領域がページに分布されている場所、テキ
スト領域の数とそれらが分布されている場所、ボックス
の分布、ボックスの中のボックス、ボックスのマトリッ
クス、などを含んでもよい。明らかなように、特徴の多
くの他の種類が考えられまた利用されてもよい。特徴及
び属性辞書１５１１が、どのような特徴と属性が訓練セ
ット全体で使用されるかを定義する。To have a complete training set of images in the image training store 1280, multiple copies of the master were scanned and printed several times (eg 3 to 5 times) and are described using FIG. The bitmap is stored as described above. For purposes of explanation, multiple bitmaps of images scanned multiple times for a single form are referred to below as a "raw family set". The feature extraction block 1510 extracts a feature for classification from each bitmap of the raw family set using the image processing function. For example, if the features to be extracted are horizontal lines, then to remove everything but the horizontal lines from the bitmap, use the morph filter (morph
(ological filter) may be used. Another example is to use OCR to determine the direction of text tilt in a bitmap. The features used in the classification are the position mode of the image on the medium (landscape versus portrait), the number of horizontal lines, the number of vertical lines, the number of halftone areas and where those areas are distributed on the page, It may include the number of text areas and where they are distributed, the distribution of boxes, the boxes within a box, the matrix of boxes, etc. Obviously, many other types of features may be considered and utilized. The feature and attribute dictionary 1511 defines what features and attributes are used throughout the training set.

【００３５】関連する特徴だけを含むビットマップが生
成されると、特徴抽出ブロック１５１０が関連する特徴
を抽出し、分類処理の属性を有するデータセット生成ブ
ロック１５２０にその特徴情報を送出する。ブロック１
５２０は、生ファミリーセットから抽出された特徴を受
け取り、ランダムモデルパラメータを使用して特徴のよ
り大きなデータセット、「全ファミリー特徴セット(tot
al family feature set)」を生成する。それぞれの特徴
セットはマスターフォームの表現である。ランダムモデ
ルパラメータ１８８０からのランダム分布を生ファミリ
ーセットから抽出された特徴に適用することによって、
その特徴セットは、例えば50個の特徴セットを含む全フ
ァミリー特徴セットに増大される。例えば、抽出される
特徴が水平線であり、生データセットにおけるビットマ
ップの１つがｘ＝10で座標を有するとする。ランダムモ
デルは、誤りが２つの座標点に及んでもよいこと、ラン
ダムモデルがその特徴に適用されること、および、ｘ＝
11およびｘ＝12での水平線もまたマスターの水平線の許
容定義であること、を示す。全ファミリー特徴セットを
生成するためにランダム分布が、訓練記憶部１２８０に
おけるそれぞれの生ファミリーセットから抽出されたそ
れぞれの特徴に適用される。ランダム分布は、ビットマ
ップから抽出された特徴に適用される代わりに、より多
くのビットマップを生成する生ファミリーセットのビッ
トマップに適用されてもよかったことに注意したい。When the bit map including only the related features is generated, the feature extraction block 1510 extracts the related features and sends the feature information to the data set generation block 1520 having the attribute of the classification process. Block 1
520 receives the features extracted from the raw family set and uses a random model parameter to generate a larger data set of features, the "total family feature set (tot
al family feature set) "is generated. Each feature set is a representation of a masterform. By applying a random distribution from the random model parameters 1880 to the features extracted from the raw family set,
The feature set is expanded to an entire family feature set, including, for example, 50 feature sets. For example, suppose the feature to be extracted is a horizon and one of the bitmaps in the raw data set has coordinates at x = 10. The random model is such that the error may span two coordinate points, the random model is applied to the features, and x =
It indicates that the horizon at 11 and x = 12 is also an acceptable definition of the master's horizon. A random distribution is applied to each feature extracted from each raw family set in the training store 1280 to generate a full family feature set. Note that the random distribution could have been applied to the raw family set bitmaps that produced more bitmaps, instead of being applied to the features extracted from the bitmaps.

【００３６】特徴を抽出し全ファミリー特徴セットを生
成する処理は、システムが認識する必要のあるそれぞれ
のマスターに対して繰り返される。通常、追加のみが、
特徴と属性を得るために全処理を必要とする。なぜな
ら、以前に処理されたマスターすべてからの特徴抽出の
結果は中間結果記憶部１５２１に記憶されるからであ
る。システムを使用することは同一の特徴を使用するた
め、以前に処理されたマスターを再実行（ラン）する理
由はない。この場合、ただ分類処理を再実行する必要が
あるだけである。しかし、抽出に関する新しい特徴等の
新規の分類パラメータが追加されれば、以前に処理され
たすべてのマスターは再実行される必要がある。The process of extracting features and generating a full family feature set is repeated for each master that the system needs to recognize. Usually only add,
It takes the whole process to get the features and attributes. This is because the results of feature extraction from all previously processed masters are stored in the intermediate result storage unit 1521. There is no reason to rerun a previously processed master as using the system uses the same features. In this case, it is only necessary to re-execute the classification process. However, if new classification parameters, such as new features for extraction, are added, all previously processed masters will need to be redone.

【００３７】画像訓練記憶部１２８０において表現され
るすべてのマスターからのすべての全ファミリー特徴セ
ットは、一貫性検査ブロック１５３０およびクラスタ分
析ブロック１５４０によって利用される。一貫性検査ブ
ロック１５３０は、生ファミリーセットにおいて分類さ
れるすべてのビットマップが、実際に同一のマスターか
らのものかを確認するための検査を行う。その生ファミ
リーセットに関する一貫性の検査は、抽出された特徴を
分析することによって行われてもよい。例えば、もし、
５つの生ファミリーセットのビットマップの内の１つが
水平線を含まないが他の４つが水平線を含んでいるなら
ば、一貫性検査ブロック１５３０は人間の介入のためデ
ータ誤りブロック１５３２を介して誤りを報告する。All full family feature sets from all masters represented in image training store 1280 are utilized by consistency check block 1530 and cluster analysis block 1540. The consistency check block 1530 performs a check to ensure that all bitmaps classified in the raw family set are actually from the same master. A consistency check on the raw family set may be done by analyzing the extracted features. For example, if
If one of the five raw family set bitmaps does not contain horizontal lines but the other four contain horizontal lines, the consistency check block 1530 will fail via the data error block 1532 due to human intervention. To report.

【００３８】理論的には、同一のマスターから導出され
るそれぞれのビットマップは、それらがわずかに異なっ
ていても一緒に集められるべきである。クラスタ分析ブ
ロック１５４０はクラスタ分析を用いてどのフォームが
同一であると考えられるかを判断する。すなわち、非常
によく似たフォームの集合をスーパークラスに重ね合わ
せる。ここで使用されるクラスタ分析は、ＯＣＲシステ
ムにおいて使用されるクラスタ分析に類似している。ク
ラスタ分析ブロック１５４０の出力はスーパークラス１
５４２および非スーパークラス１５４３である。集合化
（クラスタ化）処理の目的は、訓練のためのマスターフ
ォームである類似する対象物の編成、即ち、クラスタを
生成することと、また非スーパークラス１５４３である
他のクラスタ中の対象物に類似しない対象物からなる独
立した別個のクラスタを生成することである。Theoretically, each bitmap derived from the same master should be put together, even if they are slightly different. Cluster analysis block 1540 uses cluster analysis to determine which forms are considered identical. That is, superimpose a set of very similar forms on a superclass. The cluster analysis used here is similar to the cluster analysis used in OCR systems. The output of the cluster analysis block 1540 is superclass 1
542 and non-superclass 1543. The purpose of the clustering process is to organize similar objects that are master forms for training, ie, to create clusters, and also to objects in other clusters that are non-superclass 1543. The goal is to generate independent and distinct clusters of dissimilar objects.

【００３９】分類処理訓練ブロック１５５０は、クラス
タ分析ブロック１５４０から生成されるスーパークラス
１５４２および非スーパークラス１５４３を受け取る。
次に、この情報は、ランタイム動作中に図４の分類処理
４４０によって使用される分類処理設定１５５４に変換
される。また分類処理訓練ブロック１５５０は分類処理
システムを包含する。この分類処理は分類処理４４０と
同一の種類（ベイズ法(Bayesian)、デシジョンツリーな
ど）でなければならない。分類処理設定１５５４が分類
処理４４０の階層構造に追加されれば、画像訓練記憶部
１２８０において表現されるすべてのマスターは、ラン
タイム動作中に分類処理４４０によって認識される。ま
た、手動オーバーライドが可能であって、もし何らかの
理由でオペレータがその方法にかかわらず望むならば、
オペレータは実際に２つの分類を１つのスーパークラス
に強制することができる。The classification process training block 1550 receives the superclass 1542 and non-superclass 1543 generated from the cluster analysis block 1540.
This information is then translated into the classification process settings 1554 used by the classification process 440 of FIG. 4 during runtime operation. The classification processing training block 1550 also includes a classification processing system. This classification process must be of the same type as the classification process 440 (Bayesian method, decision tree, etc.). Once the classification process settings 1554 are added to the hierarchical structure of the classification process 440, all masters represented in the image training store 1280 will be recognized by the classification process 440 during runtime operation. Also, a manual override is possible, and if for some reason the operator wants to
The operator can actually force two classifications into one superclass.

【００４０】分類処理設定１５５４が導出されると、シ
ステム１０が適切に訓練されたかどうかを検査すること
は有益となる。検査を容易にする１つの方法は、分類処
理の属性を有するデータセット生成ブロック１５２０か
らのデータのランダムな部分をクラスタ分析ブロック１
５４０および分類処理訓練ブロック１５５０に送り、残
りのデータを分類処理検査ブロック１５６０に送ること
である。一般的に、もしシステムがデータのランダムサ
ンプルで訓練されれば、システムは残りのデータを低い
認識率で認識することができる。システムは、システム
が訓練されたデータと同一のデータでは検査されないこ
とに注意したい。さらに、いくつかの検査が行われて、
分類システムがどの程度良好に作動しているかを決定す
る。誤り率及び特質ブロック１５６４が１つの訓練セッ
ションの誤り率を報告する。訓練処理で使用される分類
処理がランタイム認識処理で使用されるものと同一でな
ければならないとしても、パラメータは同一でなければ
ならないので、異なる種類の分類処理が検査に使用され
てもよい。異なる種類の分類処理を有することは、認識
システムの耐久性を改善し認識の誤り率を最小限に維持
する。Once the classification process settings 1554 are derived, it is useful to check if the system 10 has been properly trained. One way to facilitate inspection is to use the cluster analysis block 1 to extract a random portion of the data from the dataset generation block 1520 with the attributes of the classification process.
540 and the classification process training block 1550 and the rest of the data to the classification process check block 1560. In general, if the system is trained on a random sample of data, it can recognize the rest of the data with a low recognition rate. Note that the system is not tested on the same data on which it was trained. In addition, some tests were done,
Determine how well the classification system is performing. Error rate and quality block 1564 reports the error rate for one training session. Even though the classification process used in the training process must be the same as that used in the runtime recognition process, different types of classification processes may be used in the test because the parameters must be the same. Having different types of classification processes improves the durability of the recognition system and keeps the recognition error rate to a minimum.

【００４１】概略すると、新規フォーム訓練ブロック１
５００は、集合化技術に従ったそれ自体の分類処理を利
用して分類処理設定１５５４を決定する。次に、分類処
理設定１５５４は図４の分類処理４４０における情報の
更新に使用される。それによって、以前に新規フォーム
訓練ブロック１５００によって処理されたマスターのよ
うな構造を有するフォームがランタイム中に導入される
とき、原稿認識ブロック４００がそのフォームを認識す
ることができる。ゆえに、対応する分類番号を供給する
ことができる。新規フォーム訓練ブロック１５００の出
力は、分類処理設定１５５４、スーパークラス１５４
４、分類セッションを検査することの結果として生じる
誤り率及び特質１５６４、及び一貫性検査からのデータ
誤り１５３２である。In summary, the new form training block 1
The 500 determines the classification process settings 1554 using its own classification process according to the aggregation technique. Next, the classification process setting 1554 is used to update the information in the classification process 440 of FIG. Thereby, when a form having a master-like structure previously processed by the new form training block 1500 is introduced at runtime, the document recognition block 400 can recognize the form. Therefore, the corresponding classification number can be provided. The output of the new form training block 1500 is the classification process setting 1554, the super class 154.
4, error rates and attributes 1564 resulting from checking the classification session, and data errors 1532 from the consistency check.

【００４２】図２に一時的に戻って参照すると、認識初
期化システム１０００が作動されて分類処理の訓練が完
了すると、システム１０はランタイム動作、即ち、モー
ド、の準備が整う。ランタイムモードにおいては、完成
したフォームがシステムに供給され、それによってフォ
ームの自動認識およびデータの自動抽出が行われる。一
日あたり数千のフォームが処理を必要とする。以下で詳
細に記述される次のサブシステムが、ランタイムモード
中に作動される。それは、開始ブロック１００、画像前
処理ブロック２００、原稿認識ブロック４００、データ
抽出ブロック３００、データ検証ブロック５００、手動
補正ブロック６００およびデータベース挿入ブロック７
００である。Referring again briefly to FIG. 2, once the recognition initialization system 1000 is activated and the classification process has been trained, the system 10 is ready for run-time operation, or mode. In run-time mode, the completed form is provided to the system for automatic recognition of the form and automatic extraction of data. Thousands of forms per day require processing. The following subsystems, which are described in detail below, are run during run-time mode. It is a start block 100, an image preprocessing block 200, a document recognition block 400, a data extraction block 300, a data verification block 500, a manual correction block 600 and a database insertion block 7.
00.

【００４３】ここで図３を参照すると、画像前処理ブロ
ック２００の詳細なフロー図が示される。画像前処理ブ
ロック２００の目的は、追加の処理と認識のために入力
するビットマップを準備することである。ランタイムモ
ード中、画像前処理ブロック２００は、どの入力デバイ
スからそのビットマップが入力されたかを記述する情報
とともに、ビットマップの形式で開始ブロック１００か
らデータを受信する。これらのビットマップは、その完
成されたデータフィールドによってフォームを表現す
る。上述したようにこれらのビットマップは、スキャナ
ーあるいはファクシミリから入力されるか、あるいはビ
ットマップに変換されるＰＤＬ（図形記述言語）記述の
結果として入力される。Referring now to FIG. 3, a detailed flow diagram of the image preprocessing block 200 is shown. The purpose of the image preprocessing block 200 is to prepare the input bitmap for additional processing and recognition. During run-time mode, the image pre-processing block 200 receives data from the start block 100 in the form of a bitmap, with information describing from which input device the bitmap was input. These bitmaps represent the form with its completed data fields. As described above, these bitmaps are input from a scanner or a facsimile, or as a result of a PDL (graphic description language) description converted into a bitmap.

【００４４】開始ブロック１００から入力されるビット
マップ画像はノイズフィルタリングブロック２１０によ
って受信される。ノイズフィルタリングブロック２１０
は、ビットマップから、ごま塩ノイズ等、あるいは写真
複写によってしばしば引き起こされる画像全体にわたる
誤った小さな斑点のような不純物を取り除く。またノイ
ズフィルタリングブロック２１０は、ぎざぎざを除去す
るためにエッジ強化動作を行うか、またはノイズ除去に
使用されるその他のあらゆる画像処理機能を行う。フィ
ルタ処理された結果としてのビットマップ２１２は、次
に入力依存補正ブロック２２０によって処理される。補
正ブロック２２０は、ビットマップが生成された入力デ
バイスに関する情報を含んだビットマップ２１２の符号
化された情報を読み取り、その情報を用いて追加の補正
を適用する。補正ブロック２２０は、予め決定された系
統的モデル補正係数１８６０から補正データを得る。シ
ステムにおけるそれぞれの入力デバイスに関連するオフ
セットおよびスケーリング誤りなどの系統的な補正情報
が利用される。これらの系統的な誤りに対する補正は、
ビットマップが生成された入力デバイスに関連する補正
係数を利用して、ビットマップに対して行われる。たと
え補正係数が渡されても、それらはいつも使用されるわ
けではないことに注意したい。例えば、画像が拡大縮小
されれば、それを再度拡大縮小することで線の分割が生
じ、だれもこの点での再度の拡大縮小は望まないが、後
の使用のために補正情報を保持することを望むであろ
う。The bitmap image input from start block 100 is received by noise filtering block 210. Noise filtering block 210
Removes impurities from the bitmap, such as salt and pepper noise, or false small spots throughout the image often caused by photocopying. The noise filtering block 210 also performs edge enhancement operations to remove jaggedness, or any other image processing function used for noise removal. The resulting filtered bitmap 212 is then processed by the input dependent correction block 220. Correction block 220 reads the encoded information in bitmap 212 that contains information about the input device for which the bitmap was generated and uses that information to apply additional corrections. Correction block 220 obtains correction data from a predetermined systematic model correction factor 1860. Systematic correction information such as offset and scaling errors associated with each input device in the system is utilized. The correction for these systematic errors is
It is performed on the bitmap using a correction factor associated with the input device for which the bitmap was generated. Note that even if correction factors are passed, they are not always used. For example, if the image is scaled, rescaling it will result in line splitting, and no one wants to scale again at this point, but retain the correction information for later use. Would hope.

【００４５】また、補正ブロック２２０は必要ならば解
像度の変換を行う（例えば、400spiで走査された画像を
300spiに変換する）。結果として生じるビットマップ２
２２は、ビットマップの小さな角度のスキューが自動的
に検出され補正されるスキュー補正ブロック２４０に進
められる。Further, the correction block 220 performs resolution conversion if necessary (for example, an image scanned at 400 spi is
Convert to 300spi). The resulting bitmap 2
22 is advanced to skew correction block 240, where small angle skews in the bitmap are automatically detected and corrected.

【００４６】サブ画像準備ブロック２６０が、必要なら
ば（例えば、誤り検査）後に使用するためにオペレータ
によってグラフィックワークステーション上で観察する
ため、ビットマップ２２２を変更する。また、サブ画像
準備ブロック２６０は、走査された画像のアイコンを供
給してもよく、あるいは、望ましい他の表示方向機能の
中から同時に観察される１０個のサブ画像を含む画像を
供給してもよい。画像前処理ブロック２００は、補正さ
れたビットマップ２４２をデータ抽出ブロック３００お
よび原稿認識ブロック４００に出力するとともに、補正
ブロック２２０および補正ブロック２４０からの発信元
のスキャナーに関連する系統的補正係数２３０を出力す
る。The sub-image preparation block 260 modifies the bitmap 222 for viewing on the graphics workstation by the operator for later use if necessary (eg, error checking). The sub-image preparation block 260 may also provide an icon for the scanned image, or it may provide an image containing 10 sub-images to be viewed simultaneously among other desirable viewing orientation features. Good. The image preprocessing block 200 outputs the corrected bitmap 242 to the data extraction block 300 and the document recognition block 400, and also outputs the systematic correction factors 230 from the correction block 220 and the correction block 240 that are associated with the originating scanner. Output.

【００４７】ここで図４を参照すると、原稿認識ブロッ
ク４００の詳細なフロー図が示される。図４に描写され
る動作は、図９に描写される動作によく似ていることに
注意したい。原稿認識ブロック４００の目的は、現在ど
の固定形式の原稿の種類がシステムを通過しているかを
決定することである。原稿認識ブロック４００は、補正
されたビットマップ２４２を図３の画像前処理ブロック
２００から受信する。原稿認識ブロック４００は、入力
するビットマップに存在する２つの種類の特徴をアドレ
ス指定する。その１つは分類に使用される特徴であり、
もう１つは、入力するビットマップをテンプレートとし
て記憶された関係するマスターに位置合わせするのに使
用される特徴（すなわち、位置合わせ係数）である。補
正されたビットマップ２４２とともに、補正係数２３０
は、最初に、画像に特有の特徴手順ブロック４１０によ
って使用される。ブロック４１０は、形態フィルタなど
の種々の画像処理機能を用いて、関連する特定の特徴を
フィルタ処理し、画像の残りを削除することによって、
ビットマップ２４２を処理するための分類特徴を決定す
る。分類体系において使用される特徴は、図９に関連し
てすでに述べられている。Referring now to FIG. 4, a detailed flow diagram of the document recognition block 400 is shown. Note that the operation depicted in FIG. 4 is very similar to the operation depicted in FIG. The purpose of the manuscript recognition block 400 is to determine which fixed-form manuscript type is currently passing through the system. The document recognition block 400 receives the corrected bitmap 242 from the image preprocessing block 200 of FIG. The manuscript recognition block 400 addresses two types of features present in the incoming bitmap. One is the features used for classification,
The other is the feature (ie, the alignment factor) used to align the incoming bitmap with the associated master stored as a template. The correction coefficient 230 together with the corrected bitmap 242
Is first used by the image-specific feature procedure block 410. Block 410 uses various image processing functions, such as morphological filters, to filter out certain relevant features and remove the rest of the image,
Determine classification features for processing the bitmap 242. The features used in the classification scheme have already been described in connection with FIG.

【００４８】関連する特徴だけを含むビットマップが生
成されると、特徴抽出ブロック４２０が関連する特徴を
抽出し、その特徴に関する情報を属性生成ブロック４３
０に送出する。また、特徴抽出ブロック４２０は、将来
使用するために、処理されるフォームからの位置合わせ
係数４２４を供給する。テンプレートを処理されるフォ
ームに「一列に並べる（ラインアップ）」ことを試みる
とき、位置合わせ係数は非常に重要になる。例えば、線
のような特徴が抽出されると、位置合わせ係数はその線
に影響を与える。１つの例は、マスターテンプレートに
従って線が(x,y) から開始するなら、しかし走査中の誤
った位置合わせによって、認識およびデータ抽出を可能
にするために、(x + 2, y + 4)において位置合わせ係数
が使用されてテンプレートを調整してもよいことがわか
る。Once the bitmap containing only the relevant features is generated, the feature extraction block 420 extracts the relevant features and provides information about the features to the attribute generation block 43.
Send to 0. The feature extraction block 420 also provides the alignment factor 424 from the processed form for future use. The alignment factor becomes very important when trying to "line up" the template on the processed form. For example, when a feature such as a line is extracted, the alignment factor affects that line. One example is if the line starts at (x, y) according to the master template, but (x + 2, y + 4) to allow recognition and data extraction due to misalignment during scanning. It can be seen that the alignment factor may be used in to adjust the template.

【００４９】属性生成ブロック４３０が抽出された特徴
を受け取り、特徴を処理して属性を決定する。属性は、
少なくとも１つを越える特徴を、ただ１つの特徴を、あ
るいはすでに使用された特徴を使用してもよい。（ベイ
ズ分類処理が適切に作動するように、すべての入力は独
立あるいは直交しなければならない。）しかしながら、
いくつかの属性は特徴の異なる部分から作られてもよ
い。主要な点は、複数の特徴が１つの属性になり、２つ
の異なった属性が同一の特徴を使用してもよいことであ
る。例えば、垂直線および水平線の交点が、あるいはペ
ージの最初の1/4の線の数が、属性として使用されても
よい。もちろん、属性を生成するための多くの他の特徴
の組み合わせが考えられる。さらに、特徴生成ブロック
４３０は補正係数２３０を属性に適用して、スキャナー
のオフセットあるいはスキュー等のいかなる系統的な歪
みをも補正する。特徴の抽出および属性の生成は、訓練
システムに関して議論されたように、数回発生する。Attribute generation block 430 receives the extracted features and processes the features to determine the attributes. The attributes are
At least more than one feature, only one feature, or features already used may be used. (All inputs must be independent or orthogonal for the Bayesian classification process to work properly.) However,
Some attributes may be made up of different parts of the feature. The main point is that multiple features may be one attribute and two different attributes may use the same feature. For example, the intersection of vertical and horizontal lines, or the number of lines in the first quarter of the page may be used as an attribute. Of course, many other feature combinations for generating attributes are possible. In addition, the feature generation block 430 applies the correction factor 230 to the attribute to correct any systematic distortion such as scanner offset or skew. Feature extraction and attribute generation occur several times, as discussed for the training system.

【００５０】分類処理４４０は、属性生成ブロック４３
０から属性を受け取り、分類番号４４２と、元のビット
マップのフォームの種類に対するテンプレートに対応す
る分類番号の正しさの確率と、を決定する。分類処理ブ
ロック４４０は、以前に述べたどのような種類の分類処
理であってもよく、あるいは、それが訓練システムで使
用されたものと同じ種類の分類処理であるかぎりは、以
下で述べるような階層的分類処理であってもよい。階層
的分類処理は、１つの分類番号の代わりに、いくつかの
可能な分類の分類リストを提供するであろう。分類リス
トは、最初に最良の選択、つぎに第２の選択など、によ
る統計的なリスト化である。分類リストの使用は正しい
テンプレートを識別するのを助けるために好都合で、し
たがって、識別の誤り率を減少させる。例えば、所定の
入力に対して分類リストが生成され、それによって、リ
ストの第１の分類番号が90％の確かさであり、かつリス
トの第２の分類番号が10％の確かさであると決定されれ
ば、システムは第１の分類番号を用いて入力するビット
マップを自動的に処理する。しかしながら、リストの第
１の分類番号が55％の確かさであり、かつリストの第２
の分類番号が45％の確かさであれば、システムは、信頼
度検査機能４４１を介して、どの分類番号を処理のため
に使用するかの選択をオペレータに任せるであろう。ま
た、いつ自動的に分類番号が選択され、いつその選択が
信頼度検査４４１によってオペレータに任されるかを決
定するために、スレショルド値を設定してもよいことは
明らかである。さらに、ベイズ分類処理等のある種の他
の分類処理もまた分類リストを生成することに注意した
い。The classification process 440 is performed by the attribute generation block 43.
It receives attributes from 0 and determines the classification number 442 and the probability of correctness of the classification number corresponding to the template for the original bitmap form type. The classification process block 440 may be any type of classification process previously described, or as long as it is the same type of classification process used in the training system, as described below. It may be a hierarchical classification process. The hierarchical classification process would provide a classification list of several possible classifications instead of one classification number. The sorted list is a statistical listing by first best choice, then second choice, and so on. The use of a sorted list is convenient to help identify the correct template, and thus reduces the identification error rate. For example, a classification list may be generated for a given input, such that the first classification number in the list is 90% certain and the second classification number in the list is 10% certain. If so, the system automatically processes the incoming bitmap using the first classification number. However, the first classification number on the list is 55% certainty and the second classification number on the list is
If the classification number of is 45% certainty, the system will allow the operator, via the confidence checking function 441, to select which classification number to use for processing. It is also clear that the threshold value may be set to determine when the classification number is automatically selected and when the choice is left to the operator by the confidence check 441. Furthermore, note that some other classification processes, such as the Bayes classification process, also generate a classification list.

【００５１】階層的分類処理は、個別の分類処理から成
るツリー構造セットである。ツリーのそれぞれのノード
に分類処理システムの構成要素が存在する。それぞれの
構成要素は、図９の分類処理訓練ブロック１５５０によ
って、所定のセットの特徴と分類とを用いて訓練され
る。内部ノードのそれぞれの外向きのエッジは、分類セ
ットの外側の１つの分類によって識別され、内部ノード
における構成要素に対して有効である。階層的分類処理
は、ツリーを下向きに横切るときにフォームの分類を改
良（細分化）するための手順と考えてもよい。The hierarchical classification process is a tree structure set consisting of individual classification processes. There is a component of the classification processing system at each node of the tree. Each component is trained by the classification process training block 1550 of FIG. 9 with a predetermined set of features and classifications. Each outward edge of an interior node is identified by a classification outside the classification set and is valid for components at interior nodes. The hierarchical classification process may be thought of as a procedure for refining (subdividing) form classification as it traverses down the tree.

【００５２】階層的分類処理では、フォームの認識は次
のように起こる。最初に、フォームは、ルートノードに
おける構成要素に有効な分類と特徴とを用いて分類され
る。例えば、それは水平線の数であってもよい。ルート
ノードがリーフであれば、階層的分類処理は構成要素に
よって予測される分類あるいは分類リストに戻り、そう
でなければ、それは構成要素によって予測される分類に
よって識別される（ラベル付けされた）エッジを下向き
に進む。その分類の処理は、分類リストが決定されるま
で再帰的に適用される。In the hierarchical classification process, form recognition occurs as follows. First, the form is classified using the valid classifications and features for the components at the root node. For example, it may be the number of horizontal lines. If the root node is a leaf, the hierarchical classification process returns to the classification or classification list predicted by the component, otherwise it is the edge (labeled) identified by the classification predicted by the component. Go down. The classification process is applied recursively until the classification list is determined.

【００５３】それぞれの構成要素において同じ種類の分
類処理を、あるいはそれぞれの構成要素において異なる
種類の分類処理を使用してもよい。分類処理の種類は、
デシジョンツリー、ベイズ分類法、最近接点分類処理を
含んでもよい。この応用では、フォームの経路特徴はツ
リーの上位レベル（例えば、ページの水平線あるいは垂
直線の数）にあり、細かい特徴はツリーのより低いレベ
ル（例えば、微細なグリッドの線の数）にある。分類番
号あるいは分類番号のリストが識別されると、この情報
がデータ抽出ブロック３００に送出される。階層的分類
処理が開示されてはいるが、分類処理を組み合わせるた
めどのような種類の方式でも使用できることは明らかで
ある。The same type of classification processing may be used in each component, or different types of classification processing may be used in each component. The type of classification process is
It may include a decision tree, a Bayesian classification method, and a nearest neighbor classification process. In this application, form path features are at a higher level of the tree (eg, the number of horizontal or vertical lines on the page), and fine features are at a lower level of the tree (eg, the number of lines in the fine grid). Once the classification number or list of classification numbers is identified, this information is sent to the data extraction block 300. Although the hierarchical classification process are disclosed, it is clear that can be used in any type of system for combining the classification process.

【００５４】もう１つの手段である属性生成ブロック４
３０は、ビットマップに適用されるべきランダム補正を
表現するランダムモデルパラメータ１８８０を用いるこ
とによって属性を生成することができる。属性生成ブロ
ック４３０は、ランダムモデル補正係数１８８０を得る
ことができ、そのランダム分布を１つの属性のセットに
適用し、適用されたランダム分布の量だけが異なるいく
つかの属性のセットを提供する。フォームに対するラン
ダムノイズを加えることによって生成されるすべての属
性のセットが、分類処理ブロック４４０で検査される。
また、分類リストは、不適切な分類番号の割り当てによ
る誤り率を減少させる助けとなる。処理されるフォーム
にノイズを加えるならば、結果としての分類はｘあるい
はｚであってもよいと考えられる。分類リストは単に選
択の可能性を高めて、単一の分類番号の変化が誤った変
化であることを小さくする。Attribute generation block 4 which is another means
30 can generate attributes by using a random model parameter 1880 that represents a random correction to be applied to the bitmap. The attribute generation block 430 can obtain a random model correction factor 1880, applies the random distribution to one set of attributes, and provides several sets of attributes that differ only in the amount of random distribution applied. The set of all attributes generated by adding random noise to the form is examined at classification processing block 440.
The classification list also helps reduce the error rate due to improper allocation of classification numbers. It is believed that the resulting classification may be x or z, provided that it adds noise to the processed form. The taxonomy list simply increases the chances of selection, reducing changes in a single taxonomy number to be false changes.

【００５５】また、図５を参照すると、データ抽出ブロ
ック３００の詳細なフロー図が示される。一般的に、デ
ータ抽出システム３００によって供給される情報は、ど
の種類のフォームからその情報が抽出されたかの識別
と、処理されるフォームからの抽出されたデータの識別
と、どのフィールドからその抽出されたデータが入力さ
れたかの識別と、を含む。抽出されたデータは、後に、
記憶のためにデータベース１１８０（図７）に送出され
る。Referring also to FIG. 5, a detailed flow diagram of the data extraction block 300 is shown. In general, the information provided by the data extraction system 300 identifies the type of form from which the information was extracted, the identification of the extracted data from the processed form, and the field from which it was extracted. And whether the data was entered. The extracted data will later be
It is sent to the database 1180 (FIG. 7) for storage.

【００５６】データ抽出ブロック３００への入力は、補
正係数２３０と、ビットマップ２４２と、処理されるフ
ォームからの位置合わせ係数４２４と、マスターからの
位置合わせ係数３０８と、分類番号４４２（あるいは、
分類リスト）である。分類番号４４２は、ビットマップ
２４２に関連する正しいテンプレートを見つけるため
に、テンプレート記憶部１２６０によって使用される。
そのテンプレートは、フォームのどこに決まったフィー
ルドがあるかを知らせて、データ抽出を可能にする。上
述したように、テンプレート記憶部は、分類番号、フィ
ールド位置、抽出パラメータ、処理規則等のそれぞれの
テンプレートの関連情報とともに、システムに知られる
テンプレートのすべてが記憶される場所である。また、
それぞれのフィールド内のデータを復号化するのにどの
ような種類の認識システムが使用されるかがテンプレー
ト情報に含まれる。The inputs to the data extraction block 300 are a correction factor 230, a bitmap 242, an alignment factor 424 from the form to be processed, an alignment factor 308 from the master, and a classification number 442 (or
Classification list). The classification number 442 is used by the template store 1260 to find the correct template associated with the bitmap 242.
The template tells you where to find certain fields on the form and enables data extraction. As described above, the template storage unit is a place where all the templates known to the system are stored together with the related information of each template such as the classification number, the field position, the extraction parameter, and the processing rule. Also,
The template information includes what type of recognition system is used to decode the data in each field.

【００５７】画像再位置合わせブロック３１０は、処理
されるフォームからの位置合わせ係数４２４およびマス
ターからの位置合わせ係数３０８を利用して、ビットマ
ップ２４２をテンプレート記憶部１２６０から供給され
るフォームテンプレートに再位置合わせし、その位置合
わせされたビットマップをクリッパー３１４に送出す
る。ビットマップの再位置合わせは、正しい座標を見つ
けて使用するためにクリッパーにとって必要である。ク
リッパー３１４は、テンプレート情報で供給されるフィ
ールドの座標を使用してビットマップを分割し、いくつ
かのより小さいビットマップ３５２、３５４、３５６、
３６０、および３６２を生成する。これらのより小さい
クリップされたビットマップは、それぞれのテンプレー
トに関連するテンプレート記憶部１２６０に、テンプレ
ートからの情報を用いて、設定された命令に従い、異な
る認識システムに出力される。クリッパーはボックスの
外側の輪郭をなぞる。例えば、もしだれかが許容される
空間の外側に書き込み、あなたがその情報のすべてを得
たいとき、クリッパーが、すべての情報を捕獲する指定
されたボックスの境界線を越えたマークの輪郭をなぞ
る。The image realignment block 310 realigns the bitmap 242 into the form template supplied from the template store 1260 utilizing the registration factors 424 from the processed form and the registration factor 308 from the master. It aligns and sends the aligned bitmap to the clipper 314. Bitmap realignment is necessary for the clipper to find and use the correct coordinates. The clipper 314 uses the coordinates of the fields provided in the template information to split the bitmap into several smaller bitmaps 352, 354, 356,
Generate 360 and 362. These smaller clipped bitmaps are output to different recognition systems in the template store 1260 associated with each template, using the information from the templates, according to set instructions. The clipper traces the outer contour of the box. For example, if someone writes outside the allowed space and you want to get all of that information, the clipper will trace the outline of the mark beyond the boundaries of the designated box that captures all the information. .

【００５８】クリッピングの後、テキスト情報を含んだ
ビットマップ３５２は、ＯＣＲ処理を妨害する可能性の
あるノイズを除去するＯＣＲ前処理ユニット３２２に送
出される。１つの例は、色づけされた背景によって引き
起こされるノイズである。ノイズを補正されたビットマ
ップ３６４は、復号化のためにＯＣＲブロック３２６に
送出される。ＯＣＲブロック３２６は、可能であればテ
キスト情報の有効性を検査するＯＣＲ後処理ブロック３
３０にテキスト情報を引き渡す。ＯＣＲ後処理ブロック
３３０からの検査されたテキスト３８８が、抽出収集ブ
ロック３４２によって収集される。また、辞典記憶部１
２４０がＯＣＲブロック３２６およびＯＣＲ後処理ブロ
ック３３０に情報を供給する。また、ＯＣＲ後処理ブロ
ック３３０のようなシステムにおいて、状態情報が供給
されてもよい。ＯＣＲの場合、１つの文字が補正される
確率は有益な情報である。想像できるように、その他の
認識システムは、抽出された情報が供給される独自の余
分なデータを有する。抽出収集ブロック３４２は、デー
タベースにファイルされるため抽出されたデータを送出
する前に、その抽出されたデータを記憶するための一時
的な記憶領域である。After clipping, the bitmap 352 containing the textual information is sent to an OCR pre-processing unit 322 which removes noise that may interfere with OCR processing. One example is the noise caused by a colored background. The noise corrected bitmap 364 is sent to the OCR block 326 for decoding. The OCR block 326 is an OCR post-processing block 3 for checking the validity of the text information if possible.
Pass the text information to 30. The inspected text 388 from the OCR post processing block 330 is collected by the extract collection block 342. Also, the dictionary storage unit 1
240 provides information to OCR block 326 and OCR post-processing block 330. Also, status information may be provided in a system such as the OCR post-processing block 330. For OCR, the probability that one character will be corrected is useful information. As one can imagine, other recognition systems have their own extra data that is populated with the extracted information. The extraction collection block 342 is a temporary storage area for storing the extracted data before sending the extracted data as it is filed in the database.

【００５９】クリッパー３１４からのビットマップ３５
４はボックス検査情報を含み、その情報がボックス検査
ブロック３３４に送出される。ボックス検査ブロック３
３４はどのボックスが検査されたかを決定し、その情報
を論理(Logic) ３６８の種類でボックス検査後処理ブロ
ック３３８に送出する。ボックス検査後処理ブロック３
３８は、検査された１つのボックスあるいは複数のボッ
クスに関する追加情報を提供する。例えば、第３番目の
ボックスが検査され、ボックス検査後処理ブロック３３
８が第３番目のボックスはその月の第３火曜日を意味す
ることを知れば、それがその月の第３火曜日であること
が付加的な情報であり、その情報は値３８９のフォーム
で抽出収集ブロック３４２に送出される。Bitmap 35 from clipper 314
4 contains box inspection information, which is sent to box inspection block 334. Box inspection block 3
34 determines which boxes have been inspected and sends that information to the Box Inspect Post Processing block 338 in the type of Logic 368. Box inspection post-processing block 3
38 provides additional information about the box or boxes inspected. For example, the third box is inspected and the box inspect post-processing block 33.
If 8 knows that the third box means the third Tuesday of the month, then the additional information is that it is the third Tuesday of the month, and that information is extracted in the form of the value 389. It is sent to the collection block 342.

【００６０】クリッパー３１４からのビットマップ３５
６は、フィールドがマークされたかどうか決定するマー
ク感知ブロック３４６に送出される。また、画像クリー
ンアップ、特定フィールドによるボックスの外側のマー
クの検出、および、どのボックスが書き込まれている
か、が検出される。マーク感知ブロック３４６の結果は
論理３７０であり、その論理３７０がマーク感知後処理
ブロック３５０に送出される。マーク感知後処理ブロッ
ク３５０は誤りを検査し、マーク感知論理３８６および
ビットマップ３８４を抽出収集ブロック３４２に送出す
る。Bitmap 35 from clipper 314
6 is sent to the mark sensing block 346 which determines if the field has been marked. It also detects image cleanup, detection of marks outside the box due to specific fields, and which box is written. The result of mark sense block 346 is logic 370, which is sent to post mark sense processing block 350. Post-mark-sensing processing block 350 checks for errors and sends mark-sensing logic 386 and bitmap 384 to extract-gather block 342.

【００６１】同様にして、ビットマップ３６０が、フィ
ールドに何が書かれているかを検査するハンドプリント
認識ブロック３３６に送られ、追加の処理のためにテキ
スト３７２およびビットマップ３７４の両方をハンドプ
リント認識後処理ブロック３９０に送る。次に、テキス
ト３７８およびビットマップ３８０が抽出収集ブロック
３４２に送られる。ビットマップ３６２は、バーコード
のように機械が読み取ることが可能な符号化情報を含
み、機械読み取り可能コードブロック３４０によって解
読される。機械読み取り可能コード(MRC) ブロック３４
０は、追加の処理のためにＭＲＣ後処理ブロック３９４
に渡されるテキスト／番号情報３７６を決定する。次
に、テキスト／番号情報３７６は抽出収集ブロック３４
２に送られる。上述された後処理機能で利用される種々
の情報は、図７の認識初期化システム１０００によって
供給されてもよい。例えば、辞典記憶部１２４０からの
情報は、ＯＣＲ後処理において、あるいはその他の後処
理機能に対して、データを検査するのに有効である。Similarly, the bitmap 360 is sent to a handprint recognition block 336 which examines what is written to the field and handprints both the text 372 and the bitmap 374 for further processing. Send to post processing block 390. The text 378 and bitmap 380 are then sent to the extract collection block 342. Bitmap 362 contains machine-readable encoded information, such as a barcode, which is decoded by machine-readable code block 340. Machine readable code (MRC) block 34
0 is post MRC post processing block 394 for additional processing.
Determine the text / number information 376 passed to. Next, the text / number information 376 is stored in the extraction collection block 34.
Sent to 2. Various information utilized in the post-processing functions described above may be provided by the recognition initialization system 1000 of FIG. For example, the information from the dictionary storage 1240 is useful for inspecting data in OCR post-processing or for other post-processing functions.

【００６２】最後に図６を参照すると、システム１０
（図２）の検査ブロック５００、手動補正ブロック６０
０、およびデータベース挿入ブロック７００が示され
る。１つのフォームからデータが抽出されると、データ
ベースに入力する前に誤りが発生した場合、手動でデー
タを検査することは好都合である。データ抽出ブロック
３００からの情報は、検査ブロック５００の範囲検査ブ
ロック５１０に送られ、そこでは例えば、分かれば番号
と名前の範囲が検査されてもよい。追加の検査のため
に、一貫性検査ブロック５２０が情報の間の一貫性を検
査する。例えば、ある認識された名前に関連する有効な
番号の検査が行われてもよい。もし問題が見つかれば、
データ６０４が手動補正ブロック６００に送出され、そ
こでオペレータがデータを観察し、誤りの決定と補正の
内少なくとも一方をおこなう。誤りがフォームのどこで
発生したかのヒントをオペレータに与えるように、シス
テムが設定される。これは、図形的ユーザインタフェー
スに関連する領域を、あるいはその誤りを明瞭に指示す
る他の多くの方法を、強調することによって実現され
る。データが完全に検査されれば、データベース挿入ブ
ロック７００によってデータはデータベースに挿入され
る。テンプレート記憶部１２６０は、オペレータを支援
するために、手動補正ブロック６００と、一貫性検査ブ
ロック５２０と、範囲検査ブロック５１０と、に情報を
提供していることに注意したい。Finally, referring to FIG. 6, the system 10
(FIG. 2) inspection block 500, manual correction block 60
0, and the database insert block 700 is shown. Once the data is extracted from one form, it is convenient to manually inspect the data if an error occurs before entering it in the database. The information from the data extraction block 300 is sent to the range check block 510 of the check block 500, where, for example, a range of numbers and names may be checked if known. For additional checking, consistency check block 520 checks the consistency between the information. For example, a check for valid numbers associated with a recognized name may be performed. If you find a problem,
The data 604 is sent to the manual correction block 600, where the operator views the data to make error determinations and / or corrections. The system is set up to give the operator a hint as to where the error occurred on the form. This is accomplished by emphasizing the area associated with the graphical user interface, or many other ways to clearly indicate the error. If the data is fully examined, the insert database block 700 inserts the data into the database. Note that the template store 1260 provides information to the manual correction block 600, the consistency check block 520, and the range check block 510 to assist the operator.

[Brief description of drawings]

【図１】本発明を組み込んだ一般的なハードウェアシス
テムの構成を示す概略図である。FIG. 1 is a schematic diagram showing the configuration of a general hardware system incorporating the present invention.

【図２】本発明を組み込んだ主要なサブシステムを描写
するフォーム認識システムの最高位のフロー図である。FIG. 2 is a top level flow diagram of a form recognition system depicting the major subsystems incorporating the present invention.

【図３】画像処理サブシステムのフロー図である。FIG. 3 is a flow diagram of an image processing subsystem.

【図４】原稿認識サブシステムのフロー図である。FIG. 4 is a flowchart of a document recognition subsystem.

【図５】ランタイムデータ抽出サブシステムのフロー図
である。FIG. 5 is a flow diagram of a runtime data extraction subsystem.

【図６】検査サブシステム、手動補正サブシステム、お
よびデータベース挿入サブシステムのフロー図である。FIG. 6 is a flow diagram of an inspection subsystem, a manual correction subsystem, and a database insertion subsystem.

【図７】認識初期化サブシステムのフロー図である。FIG. 7 is a flow diagram of a recognition initialization subsystem.

【図８】新規フォーム導入サブシステムのフロー図であ
る。FIG. 8 is a flowchart of a new form introduction subsystem.

【図９】新規フォーム訓練サブシステムのフロー図であ
る。FIG. 9 is a flow diagram of a new form training subsystem.

【図１０】ディリースキャナー較正サブシステムのフロ
ー図である。FIG. 10 is a flow diagram of a daily scanner calibration subsystem.

フロントページの続き (72)発明者ラクシミダサリアメリカ合衆国 94306 カリフォルニア州パロアルトナンバー383 アルマストリート 3375 (72)発明者ジュリアエム．パガロアメリカ合衆国 95014 カリフォルニア州クパティーノストニーデイルドライブ 10181Front Page Continuation (72) Inventor Laximi Dasari United States 94306 Palo Alto, California Number 383 Alma Street 3375 (72) Inventor Julia Em. Pagaro United States 95014 Cupertino Stoneydale Drive 10181 California

Claims

[Claims]

1. A method used in a document recognition system for determining settings for a classification process to enable recognition of a master document, the method comprising: a) inputting one master document a plurality of times and, as a result, Producing a plurality of master bitmaps representing a master document, b) extracting features from the plurality of master bitmaps, and c) generating attributes by combining the features with a random model parameter distribution, Resulting in a number of said plurality of attributes substantially greater than the number of said plurality of master bitmaps, d) repeating steps a) to c) for all said master documents, and e) all Cluster analysis is performed on the attributes from the master manuscript of the Generating a plurality of superclasses that represent a master script and a plurality of non-superclasses that represent a master manuscript that is not a member of the family; , A step of generating a classification process setting that enables recognition of the master document, and