JP2006023946A - Image processing apparatus, control method therefor, and program

Info

Publication number: JP2006023946A
Application number: JP2004200808A
Authority: JP
Inventors: Kazuyo Ikeda; 和世池田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2004-07-07
Filing date: 2004-07-07
Publication date: 2006-01-26

Abstract

PROBLEM TO BE SOLVED: To provide an image processor capable of properly and safely searching for electronic documents composed of contents suited to a user's access level.

SOLUTION: Image data consisting of a plurality of kinds of component elements is stored in a storage part and managed with an access level set for each component element. The access level of the input user information is obtained. Second image data corresponding to first image data, which is obtained by reading a document, is retrieved from the storage part. Based on the obtained access level of the user information, a determination is made as to whether each component element of the retrieved second image data is permitted to be output. The second image data consisting of the component elements permitted to be output is then output.

COPYRIGHT: (C)2006, JPO&NCIPI

Description

The present invention relates to an image processing apparatus that searches for desired image data, a control method therefor, and a program.

In recent years, the paperless office has been advocated, but paper documents have advantages that electronic documents lack: they cause less eye strain, are easy to take in at a glance, and can be written on. In today's offices, electronic documents and paper documents are therefore used side by side, each for its own merits. In this situation, to reduce the difference in handling between the two, an environment is needed in which documents can move easily in both directions, from electronic to paper and from paper to electronic.

The transition from an electronic document to a paper document is easily achieved by printing the electronic document with a printer. The transition from a paper document to an electronic document, on the other hand, is generally performed by reading the paper document with a scanner and digitizing it as image data.

However, when an original electronic document exists for a paper document, it is preferable to move to that original electronic document rather than to an electronic document obtained by scanning the paper document.

As a method for realizing this, Patent Document 1, for example, converts an original electronic document into a raster image and stores the raster image in association with the electronic document. The raster image data obtained by scanning a paper document is then compared with the raster image data of the original electronic documents on the basis of the similarity of their feature amounts, which makes it possible to search for the original electronic document.

For example, when material used for a presentation at a meeting has been distributed to attendees as a paper document obtained by printing the original electronic document, the original electronic document can be acquired from the distributed paper document. The attendees can then edit the original electronic document or distribute it to people who did not attend the meeting.

Meanwhile, there has been growing momentum in recent years toward strict security for information, and techniques have been proposed for switching which users can access each component of a document (text, table, figure, photograph, and so on).

For example, Patent Document 2 proposes a technique in which, when a document image is transmitted, the document image is divided into objects and each object is encrypted, so that one recipient can view only the character regions while another recipient can view both the character regions and the photograph regions.

In addition, with the spread of the Internet and WEB browsers, HTML documents account for a growing share of the electronic documents being created, and some HTML documents are generated dynamically by a program such as CGI. In such HTML documents, the objects to be displayed are also switched depending on who is accessing the document.

When an original electronic document is to be obtained from a paper document for such a dynamically generated electronic document, the technique proposed in Patent Document 1 requires that every pattern that can be dynamically generated be converted into a raster image and stored in association with the dynamically generated electronic document.

Patent Document 1: JP 2001-256256 A
Patent Document 2: JP 2002-318535 A

Because a printed paper document can easily be passed from person to person, the following can happen with a dynamically generated electronic document as described above. If Mr. B searches for the original electronic document based on a printout of the electronic document that Mr. A viewed, the conventional technique returns the electronic document as Mr. A viewed it, which may differ from the information that Mr. B is permitted to view. In such a case it is often preferable for Mr. B to obtain an electronic document containing the information that Mr. B can view, but the conventional technique does not provide it.

When searching for electronic documents, it is common to display a list of thumbnail images of the candidate documents in the search result and to obtain the desired document by selecting it from the displayed thumbnails. When the original electronic document is searched for from a paper document, the user looks through the list for the thumbnail that matches the input paper document. However, if the original electronic document that is acquired differs in content from the input paper document, the thumbnail shows content different from the input paper document, the correspondence with the input paper document is lost, and the desired electronic document becomes hard to find.

Furthermore, because paper documents can easily be passed from person to person, a person with a low access level may, for example, come into possession of a paper document containing information that the person is not permitted to access. If that person searches for and obtains the original electronic document based on such a paper document, the original electronic document will contain information that the person cannot access. Electronic documents are easier than paper documents to reuse, for example by duplication and transmission, so there is a risk that access-restricted information will spread.

In addition, the conventional technique cannot detect that access restrictions have been violated in this way and that a paper document containing inaccessible information has changed hands, nor is the administrator notified, so the information leak cannot be dealt with.

The present invention has been made to solve the above problems, and its object is to provide an image processing apparatus, a control method therefor, and a program capable of appropriately and safely searching for an electronic document composed of contents suited to a user's access level.

To achieve the above object, an image processing apparatus according to the present invention has the following configuration. Namely,
an image processing apparatus for searching for desired image data comprises:
storage means for storing and managing image data composed of a plurality of types of components, with an access level set for each component;
input means for inputting user information;
acquisition means for acquiring the access level of the user information input by the input means;
reading means for reading a document;
search means for searching the storage means for second image data corresponding to first image data obtained by the reading means;
determination means for determining, based on the access level of the user information acquired by the acquisition means, whether each component of the second image data retrieved by the search means may be output; and
output means for outputting second image data including the components whose output is permitted by the determination means.
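The arrangement above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the class names `Component` and `StoredDocument`, the functions `search` and `retrieve_for_user`, the sample data, and the substring-based matcher are all hypothetical. It also assumes that a component may be output when its access level is less than or equal to the user's access level.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Component:
    kind: str          # e.g. "text", "table", "photo"
    content: str
    access_level: int  # assumed: minimum user level needed to output it


@dataclass(frozen=True)
class StoredDocument:
    doc_id: int
    components: Tuple[Component, ...]


def search(storage: List[StoredDocument], scanned_text: str) -> StoredDocument:
    """Toy stand-in for the search means: pick the stored document with the
    most components whose content appears in the scanned text."""
    return max(
        storage,
        key=lambda d: sum(c.content in scanned_text for c in d.components),
    )


def retrieve_for_user(storage: List[StoredDocument],
                      user_levels: Dict[str, int],
                      user_name: str,
                      scanned_text: str) -> List[Component]:
    user_level = user_levels[user_name]        # acquisition means
    document = search(storage, scanned_text)   # search means
    # Determination means: keep only components this user may be shown.
    return [c for c in document.components if c.access_level <= user_level]


# Hypothetical sample data.
STORAGE = [
    StoredDocument(1, (Component("text", "Quarterly summary", 1),
                       Component("table", "Salary table", 3))),
    StoredDocument(2, (Component("text", "Cafeteria menu", 1),)),
]
USER_LEVELS = {"A": 3, "B": 1}

# Mr. A printed document 1 with every component; Mr. B scans that printout.
scan = "Quarterly summary Salary table"
for_b = retrieve_for_user(STORAGE, USER_LEVELS, "B", scan)
for_a = retrieve_for_user(STORAGE, USER_LEVELS, "A", scan)
print([c.content for c in for_b])
print([c.content for c in for_a])
```

In this sketch the same scanned printout yields different outputs for the two users, which is the behavior the arrangement is aiming for: the lookup is driven by the scan, while the output is driven by the access level of whoever is logged in.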

Preferably, the search means includes comparison means for comparing the first image data obtained by the reading means with the image data stored in the storage means, and
searches for the second image data corresponding to the first image data based on the comparison result of the comparison means.

Preferably, the comparison means includes extraction means for extracting a feature amount of the first image data, and
compares the feature amount extracted by the extraction means with the feature amounts of the image data stored in the storage means.

Preferably, the extraction means extracts one or both of an image feature amount and a character feature amount of the first image data.
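As a rough illustration of comparing feature amounts, the following sketch computes one possible image feature (average brightness per grid cell) and one possible character feature (the set of words) and scores their similarity. The specific features and similarity measures are assumptions chosen for brevity; the text does not specify which feature amounts are used, and all function names here are hypothetical.

```python
from typing import List, Set


def image_feature(pixels: List[List[int]], grid: int = 2) -> List[float]:
    """Average brightness of each cell in a grid x grid partition of a
    grayscale image given as rows of 0-255 values."""
    h, w = len(pixels), len(pixels[0])
    feats = []
    for gy in range(grid):
        for gx in range(grid):
            rows = range(gy * h // grid, (gy + 1) * h // grid)
            cols = range(gx * w // grid, (gx + 1) * w // grid)
            cell = [pixels[y][x] for y in rows for x in cols]
            feats.append(sum(cell) / len(cell))
    return feats


def image_similarity(f1: List[float], f2: List[float]) -> float:
    """1.0 for identical features, falling to 0.0 as the mean absolute
    difference approaches the full brightness range."""
    diff = sum(abs(a - b) for a, b in zip(f1, f2)) / len(f1)
    return 1.0 - diff / 255.0


def text_feature(text: str) -> Set[str]:
    """Character feature: the set of lower-cased words (e.g. from OCR)."""
    return set(text.lower().split())


def text_similarity(w1: Set[str], w2: Set[str]) -> float:
    """Jaccard similarity of two word sets."""
    if not w1 and not w2:
        return 1.0
    return len(w1 & w2) / len(w1 | w2)


scanned = [[0, 255], [255, 0]]
inverted = [[255, 0], [0, 255]]
same = image_similarity(image_feature(scanned), image_feature(scanned))
opposite = image_similarity(image_feature(scanned), image_feature(inverted))
words = text_similarity(text_feature("Salary table 2004"),
                        text_feature("salary table"))
print(same, opposite, words)
```

A search step could rank stored image data by either score, or by a weighted combination of both, and present the top candidates.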

Preferably, the apparatus further comprises display means for displaying a list of candidate image data that are candidates for the second image data retrieved by the search means, and
the determination means determines, based on the access level acquired by the acquisition means, whether each component of the candidate image data selected as the second image data from the list displayed by the display means may be output.

Preferably, the determination means comprises:
setting means for setting an access level for the second image data retrieved by the search means;
access level comparison means for comparing the access level set by the setting means with the access level of the user information acquired by the acquisition means; and
warning information output means for outputting warning information based on the comparison result of the access level comparison means.

Preferably, when the access level of the user information acquired by the acquisition means is higher than the access level set by the setting means, the determination means determines whether each component of the second image data retrieved by the search means may be output based on the access level set by the setting means.

Preferably, when the access level of the user information acquired by the acquisition means is equal to or lower than the access level set by the setting means, the determination means determines whether each component of the second image data retrieved by the search means may be output based on the access level of the user information acquired by the acquisition means.
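Taken together, the two cases above amount to using the lower of the two access levels for the per-component decision. The sketch below shows this under two labeled assumptions: a component is permitted when its level does not exceed the effective level, and warning information is raised when the user's level is below the level set for the retrieved data. The text states only that the warning depends on the comparison result, so the warning condition is a guess, and `decide_output` is a hypothetical name.

```python
from typing import List, Tuple


def decide_output(user_level: int, set_level: int,
                  component_levels: List[int]) -> Tuple[List[bool], bool]:
    """Return (per-component permission flags, warning flag)."""
    # User level higher than the set level -> use the set level;
    # otherwise use the user's own level. Both cases reduce to min().
    effective = min(user_level, set_level)
    permitted = [level <= effective for level in component_levels]
    # Assumption: warn when the user holds data set above their own level.
    warn = user_level < set_level
    return permitted, warn


print(decide_output(3, 2, [1, 2, 3]))
print(decide_output(1, 2, [1, 2, 3]))
```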

Preferably, the storage means stores and manages image data composed of a plurality of types of components with an access level set for each component, and also stores and manages a feature amount for each component.

Preferably, when the storage means stores and manages a feature amount for each component in this way, identical feature amounts are stored and managed using an identifier that indicates the feature amount.
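One way to manage identical feature amounts through an identifier is to intern each distinct feature once and let components refer to it by ID. The sketch below is a hypothetical illustration; the class `FeatureStore` and its methods are not named in the text.

```python
from typing import Dict, Sequence, Tuple


class FeatureStore:
    """Stores each distinct feature amount once and hands out an identifier."""

    def __init__(self) -> None:
        self._id_by_feature: Dict[Tuple[float, ...], int] = {}
        self._feature_by_id: Dict[int, Tuple[float, ...]] = {}

    def intern(self, feature: Sequence[float]) -> int:
        """Return the identifier for this feature, registering it if new."""
        key = tuple(feature)
        if key not in self._id_by_feature:
            feature_id = len(self._feature_by_id) + 1
            self._id_by_feature[key] = feature_id
            self._feature_by_id[feature_id] = key
        return self._id_by_feature[key]

    def get(self, feature_id: int) -> Tuple[float, ...]:
        return self._feature_by_id[feature_id]


store = FeatureStore()
logo_on_page_1 = store.intern([0.2, 0.8, 0.5])
logo_on_page_2 = store.intern([0.2, 0.8, 0.5])   # same feature, same ID
photo = store.intern([0.9, 0.1, 0.3])
print(logo_on_page_1, logo_on_page_2, photo)
```

Components that share a feature, such as a logo repeated across documents, then store only the small identifier rather than a copy of the feature.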

To achieve the above object, a method for controlling an image processing apparatus according to the present invention has the following configuration. Namely,
a method for controlling an image processing apparatus that searches for desired image data in a storage unit, which stores and manages image data composed of a plurality of types of components with an access level set for each component, comprises:
an input step of inputting user information;
an acquisition step of acquiring the access level of the user information input in the input step;
a reading step of reading a document;
a search step of searching the storage unit for second image data corresponding to first image data obtained in the reading step;
a determination step of determining, based on the access level of the user information acquired in the acquisition step, whether each component of the second image data retrieved in the search step may be output; and
an output step of outputting second image data including the components whose output is permitted in the determination step.

To achieve the above object, a program according to the present invention has the following configuration. Namely,
a program for realizing control of an image processing apparatus that searches for desired image data in a storage unit, which stores and manages image data composed of a plurality of types of components with an access level set for each component, comprises:
program code for an input step of inputting user information;
program code for an acquisition step of acquiring the access level of the user information input in the input step;
program code for a reading step of reading a document;
program code for a search step of searching the storage unit for second image data corresponding to first image data obtained in the reading step;
program code for a determination step of determining, based on the access level of the user information acquired in the acquisition step, whether each component of the second image data retrieved in the search step may be output; and
program code for an output step of outputting second image data including the components whose output is permitted in the determination step.

As described above, the present invention can provide an image processing apparatus, a control method therefor, and a program capable of appropriately and safely searching for an electronic document composed of contents suited to a user's access level.

Embodiments of the present invention will now be described in detail with reference to the drawings.

FIG. 1 is a block diagram showing a configuration example of an image processing system according to an embodiment of the present invention.

Connected to a network 103 are an MFP (multi-function printer) 100 that provides a plurality of functions (copy function, print function, transmission function, and so on), a management PC 101 that controls the MFP 100, a client PC 102, and a document management server 106.

A database 105 that stores document data is connected to the document management server 106. The document management server 106 can transmit the document data stored in the database 105 to the client PC 102 by means of a WEB application (for example, a WEB browser).

On the client PC 102, the transmitted document data is viewed using the WEB application and printed as necessary.

In the present invention, the MFP 100 has an image reading unit that electronically reads a paper document (printed matter) serving as the original, and an image processing unit that performs image processing on the image signal obtained from the image reading unit. This image signal can be transmitted to the management PC 101 over a LAN 104.

The management PC 101 is an ordinary PC (personal computer) and internally has an image storage unit, an image processing unit, a display unit, and an input unit, some of which are integrated into the MFP 100.

The network 103 is a so-called communication network, typically realized by any one or a combination of the Internet, a LAN, a WAN, a telephone line, a dedicated digital line, an ATM or frame relay line, a communication satellite line, a cable television line, a wireless line for data broadcasting, and the like; it need only be capable of transmitting and receiving data.

Each of the terminals such as the management PC 101, the client PC 102, and the document management server 106 has the standard components of a general-purpose computer (for example, a CPU, RAM, ROM, a hard disk, an external storage device, a network interface, a display, a keyboard, and a mouse).

Next, the detailed configuration of the MFP 100 will be described with reference to FIG. 2.

FIG. 2 is a block diagram showing the detailed configuration of the MFP according to the embodiment of the present invention.

In FIG. 2, an image input unit 110 is an image reading unit composed of, for example, a scanner or a reader; when it is composed of a scanner or reader, it additionally includes an auto document feeder (ADF). The image input unit 110 illuminates a stack of originals or a single original with a light source (not shown), forms the reflected image of the original on a solid-state image sensor through a lens, and obtains from the solid-state image sensor raster scan image data as a raster image of a predetermined density (600 DPI, for example).

Besides a scanner or reader, the image input unit 110 may be any device capable of inputting raster image data, such as an imaging device (a digital camera or digital video camera), an information processing device having a CPU (a PC or PDA), or a communication device (a mobile communication terminal or FAX machine).

Next, the main functions of the MFP 100 are described below.

"Copy function"
The MFP 100 has a copy function for printing an image corresponding to scanned image data on a recording medium with a printing unit 112. When making a single copy of an original image, a data processing unit 115 (composed of a CPU, RAM, ROM, and so on) applies image processing including various corrections to the scanned image data to generate print data, and the printing unit 112 prints the print data on the recording medium. When making multiple copies of an original image, one page of print data is first stored in a storage unit 111 and then output sequentially to the printing unit 112 and printed on the recording medium.

It is also possible, without holding the print data in the storage unit 111, for the data processing unit 115 to apply the image processing including various corrections to the scanned image data, generate the print data, and have the printing unit 112 print it directly on the recording medium.

"Save function"
The MFP 100 saves in the storage unit 111 the scanned image data from the image input unit 110, or the scanned image data after image processing.

"Transmission function"
In the transmission function via a network I/F 114, scanned image data obtained from the image input unit 110, or scanned image data saved in the storage unit 111 by the save function, is converted into an image file in a compressed image file format such as TIFF or JPEG or in a vector data file format such as PDF, and is output from the network I/F 114. The output image file is transmitted, for example, to the document management server 106 via the network 103.

Although not shown, a configuration is also possible in which a FAX I/F is used to transmit the scanned image data by facsimile over a telephone line. It is also possible, without saving the scanned image data in the storage unit 111, to transmit it directly after the data processing unit 115 has applied the image processing required for transmission.

"Print function"
In the print function of the printing unit 112, the data processing unit 115 receives, for example, print data output from the client PC 102 via the network I/F 114. The data processing unit 115 converts the print data into raster data printable by the printing unit 112, one page of recording data is temporarily stored in the storage unit 111, and the printing unit 112 then forms the image on the print medium.

An operator's instructions to the MFP 100 for executing the above functions are given from an input unit 113, which consists of a key operation unit provided on the MFP 100 and a keyboard and mouse connected to the management PC 101. This series of operations is controlled by a control unit (not shown) in the data processing unit 115. The status of operation inputs and the image data being processed are displayed on a display unit 116.

The storage unit 111 is also controlled from the management PC 101. Data transmission and reception and control between the MFP 100 and the management PC 101 are performed via a network I/F 117 and the LAN 104.

In the image processing system described above, each electronic document managed by the document management server 106 is stored and managed with an access level set for each of its constituent elements (character blocks, image blocks, table blocks, graph blocks, and so on). When an electronic document is viewed from the WEB browser of the client PC 102, the elements that are output (displayed) for the same electronic document therefore differ according to the user's access level.

In addition, on instruction from the management PC 101, a paper document produced by printing an electronic document managed by the document management server 106 can be read by the image input unit 110 of the MFP 100, the original electronic document of the read paper document can be searched for, and the original electronic document can be acquired from the document management server 106.

The image processing system of the present invention is described in detail below.

First, the print processing of this embodiment, in which an electronic document (document data) stored in the database 105 is viewed from the client PC 102 and printed, will be described with reference to FIG. 3.

FIG. 3 is a flowchart showing the print processing according to the embodiment of the present invention.

First, in step S301, the user starts the WEB browser on the client PC 102, specifies the URL (Uniform Resource Locator) of the TOP page of the WEB application on the document management server 106, and logs in to the document management system from the WEB browser.

When the URL of the TOP page is specified, a page for entering a user name and password is displayed in the WEB browser, and the user enters the user name and password with the keyboard. In step S302, the user information shown in FIG. 4 is looked up by the entered user name and the entered password is checked.

The user information will now be described with reference to FIG. 4.

FIG. 4 is a diagram showing an example of the user information according to the embodiment of the present invention.

As shown in FIG. 4, the user information associates a user name, a password for logging in to the document management system, a user ID that uniquely identifies the user, and an access level that determines whether the components of an electronic document may be output (displayed). This user information is managed by, for example, the document management server 106, and the administrator can change it as necessary.

Returning to the description of FIG. 3.

If the entered user name and password are not valid (NO in step S302), the processing ends. If they are valid (YES in step S302), the document management server 106 issues a session ID, and session management is performed in the subsequent processing. Session management in WEB applications, for example by means of cookies, is well known and widely practiced, so a detailed description is omitted.

In step S303, the access level corresponding to the user name is acquired from the user information of FIG. 4. The access level acquired here is hereinafter called the user access level.
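Steps S302 and S303 can be sketched as a lookup in a user-information table followed by session issuance. The table contents, field names, and function names below are hypothetical; FIG. 4 is not reproduced here, so the rows are made-up examples. The password is held in plain text only to mirror the table described above; the comparison uses the standard library's constant-time `hmac.compare_digest`, and the session ID comes from `secrets.token_hex`.

```python
import hmac
import secrets
from typing import Dict, Optional

# Hypothetical rows in the shape described for the user information:
# user name -> password, user ID, access level.
USER_INFO: Dict[str, Dict[str, object]] = {
    "tanaka": {"password": "secret1", "user_id": 1, "access_level": 2},
    "suzuki": {"password": "secret2", "user_id": 2, "access_level": 1},
}

SESSIONS: Dict[str, str] = {}  # session ID -> user name


def login(user_name: str, password: str) -> Optional[str]:
    """Step S302: check the password; issue a session ID when it is valid."""
    record = USER_INFO.get(user_name)
    if record is None:
        return None
    stored = str(record["password"]).encode()
    if not hmac.compare_digest(stored, password.encode()):
        return None
    session_id = secrets.token_hex(16)
    SESSIONS[session_id] = user_name
    return session_id


def user_access_level(session_id: str) -> int:
    """Step S303: look up the user access level for the session's user."""
    return int(USER_INFO[SESSIONS[session_id]]["access_level"])


sid = login("tanaka", "secret1")
print(sid is not None, user_access_level(sid))
print(login("tanaka", "wrong"))
```

In a WEB application the returned session ID would typically be sent back to the browser in a cookie and presented on each later request.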

続いて、ステップＳ３０４において、文書データの一覧を表示する。文書データの一覧は、図５に示す文書情報を元にＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）文書を生成する。 In step S304, a list of document data is displayed. The list of document data generates an HTML (Hyper Text Markup Language) document based on the document information shown in FIG.

ここで、文書情報について、図５を用いて説明する。 Here, the document information will be described with reference to FIG.

図５は本発明の実施形態の文書情報の一例を示す図である。 FIG. 5 is a diagram showing an example of document information according to the embodiment of the present invention.

図５に示されるように、文書情報には、個々の文書データに関する情報として、文書データの識別子である文書ＩＤと、その文書データを閲覧できるレベルを示すアクセスレベル（以降、文書アクセスレベルと呼ぶ）と、その文書データの名前である文書名、さらにその文書データの管理者とを対応させて格納している。この文書情報は、例えば、文書管理サーバ１０６で管理されており、必要に応じて、管理者は、このユーザ情報を変更することができる。 As shown in FIG. 5, the document information stores, for each item of document data, a document ID that identifies the document data, an access level indicating the level at which the document data can be browsed (hereinafter referred to as the document access level), a document name, and the administrator of the document data, all in association with one another. The document information is managed by, for example, the document management server 106, and the administrator can change the user information as necessary.

また、各文書データに設定されているアクセスレベルは、文書データに対するアクセスの制限であり、アクセスレベルの値が大きいほど、アクセスできる人が少なくなることを意味している。また、このアクセスレベルの値は、後述する文書データ内のブロックに対するアクセスレベルから求めることができる。 Also, the access level set for each document data is a restriction on access to the document data, and means that the larger the access level value, the fewer people can access. Further, the value of this access level can be obtained from the access level for a block in document data to be described later.

図３の説明に戻る。 Returning to the description of FIG. 3.

特に、ステップＳ３０４では、ステップＳ３０３で取得したユーザアクセスレベルと文書情報中の文書アクセスレベルとを比較し、ユーザアクセスレベルと等しいか、もしくは、小さい文書アクセスレベルに対応した文書データのリストを、文書名のリストとして生成する。文書データの一覧を表示する文書データ一覧画面の例を、図２９に示す。 In particular, in step S304, the user access level acquired in step S303 is compared with each document access level in the document information, and the document data whose document access level is equal to or lower than the user access level is listed as a list of document names. FIG. 29 shows an example of the document data list screen that displays this list.

ここで、文書データ一覧画面について、図２９を用いて説明する。 Here, the document data list screen will be described with reference to FIG. 29.
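The comparison in step S304 can be illustrated with a minimal Python sketch. The `DocumentInfo` type, the field names, and the assumption that selectable levels start at 1 are hypothetical and introduced only for illustration; the source states only that documents whose access level does not exceed the user access level are listed, and that the list box is capped at the user access level.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DocumentInfo:
    """One row of the document information (hypothetical field names)."""
    doc_id: str
    access_level: int
    name: str


def listable_documents(documents, user_access_level):
    """Return names of documents whose access level is <= the user's level."""
    return [d.name for d in documents if d.access_level <= user_access_level]


def selectable_levels(user_access_level, lowest_level=1):
    """Levels offered in the list box; the maximum is the user's own level.

    The lowest level of 1 is an assumption, not stated in the source.
    """
    return list(range(lowest_level, user_access_level + 1))


docs = [
    DocumentInfo("0001", 1, "report"),
    DocumentInfo("0002", 3, "plan"),
]
visible = listable_documents(docs, user_access_level=2)
levels = selectable_levels(2)
```

With these sample rows, a user at level 2 would see only the first document and could request it at level 1 or 2.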

図２９は本発明の実施形態の文書データ一覧画面の一例を示す図である。 FIG. 29 is a diagram showing an example of a document data list screen according to the embodiment of the present invention.

図２９では、各文書名の左側にラジオボタンを配置し、生成する文書データのアクセスレベルをリストボックスから選択できるようになっている。このリストボックスの中に表示されるアクセスレベルの最大値は、ステップＳ３０３で取得したユーザアクセスレベルの値となるようにする。更に、閲覧ボタンが構成されている。 In FIG. 29, a radio button is arranged on the left side of each document name, and the access level of the generated document data can be selected from a list box. The maximum value of the access level displayed in this list box is set to the value of the user access level acquired in step S303. In addition, a browse button is configured.

図３の説明に戻る。 Returning to the description of FIG. 3.

続いて、ステップＳ３０５において、処理対象の文書を指定する。 In step S305, a document to be processed is designated.

具体的には、ユーザは、文書データ一覧画面において、所望の文書名に対するラジオボタンをマウスでクリックすることによって選択し、取得したい文書のアクセスレベルをリストボックスを用いて選択し、閲覧ボタンをマウスでクリックする。 Specifically, on the document data list screen, the user selects the desired document by clicking its radio button with the mouse, selects the access level of the document to be acquired from the list box, and then clicks the browse button.

これにより、ラジオボタンに対応付けられた文書ＩＤ、選択されたアクセスレベルが、文書管理サーバ１０６上の文書生成プログラムに送信され、ステップＳ３０６において、選択された文書ＩＤとアクセスレベルに対応した文書データを生成する。ＨＴＭＬ文書で定義されるＦＯＲＭタグを用いることで、このような動作を行うＨＴＭＬ文書は、ステップＳ３０４において容易に作成することができる。 As a result, the document ID associated with the radio button and the selected access level are transmitted to the document generation program on the document management server 106, and in step S306, document data corresponding to the selected document ID and access level is generated. An HTML document that behaves in this way can easily be created in step S304 by using the FORM tag defined in HTML.

尚、ステップＳ３０６の文書生成処理は、図９を用いて後で説明する。この処理によって生成した文書データは、ＨＴＭＬ文書データであり、ステップＳ３０７において、クライアントＰＣ１０２に送信することで、クライアントＰＣ１０２のＷＥＢブラウザ上に表示される。 Note that the document generation processing in step S306 will be described later with reference to FIG. 9. The document data generated by this processing is HTML document data, and in step S307 it is transmitted to the client PC 102 and displayed on the WEB browser of the client PC 102.

続いて、ステップＳ３０８で、ユーザの指示に基づき、クライアントＰＣ１０２のＷＥＢブラウザに表示されているＨＴＭＬ文書データを、ＭＦＰ１００に出力して印刷する。 Subsequently, in step S308, based on a user instruction, the HTML document data displayed on the WEB browser of the client PC 102 is output to the MFP 100 and printed.

次に、文書データを構成する構成要素であるブロックに関する文書ブロック情報について、図６を用いて説明する。 Next, document block information relating to blocks, which are the constituent elements of document data, will be described with reference to FIG. 6.

図６は本発明の実施形態の文書ブロック情報の一例を示す図である。 FIG. 6 is a diagram showing an example of document block information according to the embodiment of the present invention.

尚、上述の図５の文書情報、以下の図６の文書ブロック情報及び図７のオブジェクト情報は、文書管理サーバ１０６で管理され、データベース１０５に格納されている文書データに関する情報である。特に、本実施形態では、簡単のために、各文書データは、１ページから構成されているものとしているが、複数ページから構成される文書データを扱う拡張は、容易に行うことができる。 Note that the document information in FIG. 5 described above, the following document block information in FIG. 6 and the object information in FIG. 7 are information on document data managed by the document management server 106 and stored in the database 105. In particular, in the present embodiment, each document data is assumed to be composed of one page for the sake of simplicity. However, extension that handles document data composed of a plurality of pages can be easily performed.

図６に示されるように、各文書データのページは、複数のブロック（矩形領域）から構成されており、図６の文書ブロック情報には、文書ＩＤに対するブロックの情報が格納されている。 As shown in FIG. 6, each document data page is composed of a plurality of blocks (rectangular areas), and the block information for the document ID is stored in the document block information in FIG.

文書ブロック情報の中では、１つのブロックの情報が、その識別子であるブロックＩＤと、そのブロックが含まれる文書データの文書ＩＤ、ブロックの位置（例えば、文書データのページの左上を原点として、ブロックの左上の点の座標）、ブロックのサイズ（縦、横の長さ）とを対応させて格納されている。各ブロックの内容は、図７のオブジェクト情報で定義される。 In the document block information, the information for one block is stored as an association of its identifier (the block ID), the document ID of the document data that contains the block, the position of the block (for example, the coordinates of the upper left point of the block, taking the upper left corner of the page as the origin), and the size of the block (its vertical and horizontal lengths). The content of each block is defined by the object information in FIG. 7.

ここで、オブジェクト情報について、図７を用いて説明する。 Here, the object information will be described with reference to FIG. 7.

図７は本発明の実施形態のオブジェクト情報の一例を示す図である。 FIG. 7 is a diagram showing an example of object information according to the embodiment of the present invention.

オブジェクト情報では、各ブロックの内容が、その識別子であるオブジェクトＩＤと、その内容が格納されているファイルのアドレスを示すブロック内容と、その内容の属性（画像、テキスト、表など）と、その内容を含むブロックのブロックＩＤと、そのブロックが含まれる文書ＩＤと、その内容に対するアクセスを制限するアクセスレベル（以降、オブジェクトアクセスレベルと呼ぶ）とが対応づけられて管理されている。 In the object information, the content of each block is managed as an association of its identifier (the object ID), the block content (the address of the file in which the content is stored), the attribute of the content (image, text, table, and so on), the block ID of the block that contains the content, the document ID of the document that contains the block, and an access level that restricts access to the content (hereinafter referred to as the object access level).

図７に示されるように、各ブロックの内容は、アクセスレベルによって異なっている。ひとつのブロックに対して、アクセスレベルが変わると、その内容を変更する場合や、アクセスレベルによってブロックの内容がなくなる場合などがある。図７のオブジェクト情報の例に基づき、文書ＩＤが０００１の文書データに対して、アクセスレベルによって、文書の内容が異なる例を、図８に示す。 As shown in FIG. 7, the content of each block differs depending on the access level. For a given block, a change in access level may change the content of the block or may leave the block with no content at all. Based on the example object information in FIG. 7, FIG. 8 shows how the content of the document data with document ID 0001 differs depending on the access level.

図８は本発明の実施形態の文書データの出力例を示す図である。 FIG. 8 is a diagram showing an example of document data output according to the embodiment of the present invention.

図８では、文書ＩＤが０００１の文書データに対して、アクセスレベルが４の場合には、図８（ａ）に示すように、「ｄｏｇ」、「ｃａｒ」、「ａａａ」及び「ｂｂｂ」の４つのオブジェクトの内容が含まれる。また、図８（ｂ）に示すように、アクセスレベルが３と２の場合には、「ｄｏｇ」、「ｃａｒ」及び「ｂｂｂ」の３つのオブジェクトの内容が含まれる。更に、アクセスレベルが１の場合には、図８（ｃ）に示すように、「ｃａｔ」及び「ｂｂｂ」の２つのオブジェクトの内容が含まれる。つまり、アクセスレベルによって、含まれる（出力（表示）される）オブジェクトの内容がなくなり、別のオブジェクトに変更されている。 In FIG. 8, for the document data with document ID 0001, when the access level is 4, the contents of the four objects "dog", "car", "aaa", and "bbb" are included, as shown in FIG. 8(a). When the access level is 3 or 2, the contents of the three objects "dog", "car", and "bbb" are included, as shown in FIG. 8(b). When the access level is 1, the contents of the two objects "cat" and "bbb" are included, as shown in FIG. 8(c). In other words, depending on the access level, the content of an included (output (displayed)) object is dropped or replaced with a different object.

このように、文書管理サーバ１０６で管理される文書データは、同じ文書ＩＤを持つ文書データでも、アクセスレベルによって含まれる（出力（表示）される）内容が異なる。換言すれば、アクセスレベルによって、文書データの構成要素の出力の可否が制御される。 As described above, the document data managed by the document management server 106 includes different contents (output (displayed)) depending on the access level even if the document data has the same document ID. In other words, whether to output the constituent elements of the document data is controlled according to the access level.

尚、図５の文書情報における文書アクセスレベルは、図７のオブジェクト情報に格納されているオブジェクトアクセスレベルから求めることができる。具体的には、オブジェクト情報の中で文書ＩＤに対して最も小さいオブジェクトアクセスレベルが、その文書の文書アクセスレベルになる。 The document access level in the document information of FIG. 5 can be obtained from the object access level stored in the object information of FIG. Specifically, the lowest object access level for the document ID in the object information is the document access level for the document.
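The rule above, that the document access level is the smallest object access level recorded for the document ID, can be sketched as follows. The record layout (dictionaries with `doc_id` and `access_level` keys) and the sample values are hypothetical and used only for illustration.

```python
def document_access_level(object_records, doc_id):
    """Return the smallest object access level among a document's objects."""
    levels = [r["access_level"] for r in object_records if r["doc_id"] == doc_id]
    if not levels:
        raise ValueError(f"no objects registered for document {doc_id!r}")
    return min(levels)


# Hypothetical object information rows for two documents.
records = [
    {"doc_id": "0001", "access_level": 1},
    {"doc_id": "0001", "access_level": 2},
    {"doc_id": "0001", "access_level": 4},
    {"doc_id": "0002", "access_level": 3},
]
level_0001 = document_access_level(records, "0001")
level_0002 = document_access_level(records, "0002")
```

Taking the minimum means a document becomes listable as soon as the user may see at least one of its objects.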

次に、ステップＳ３０６の文書生成処理の詳細について、図９を用いて説明する。 Next, details of the document generation processing in step S306 will be described with reference to FIG. 9.

図９は本発明の実施形態の文書生成処理の詳細を示すフローチャートである。 FIG. 9 is a flowchart showing details of the document generation processing according to the embodiment of the present invention.

ステップＳ９０１において、文書ブロック情報から、ステップＳ３０５で指定された文書データの文書ＩＤに対応したブロックＩＤをひとつ取得する。 In step S901, one block ID corresponding to the document ID of the document data designated in step S305 is acquired from the document block information.

ステップＳ９０２において、指定された文書データの文書ＩＤに対応した文書ブロック情報中のすべてのブロックＩＤの取得が終了したか否かを判定する。取得が終了した場合（ステップＳ９０２でＹＥＳ）、ステップＳ９０６に進む。一方、取得が終了していない場合（ステップＳ９０２でＮＯ）、ステップＳ９０３に進む。 In step S902, it is determined whether or not acquisition of all block IDs in the document block information corresponding to the document ID of the designated document data has been completed. When the acquisition is completed (YES in step S902), the process proceeds to step S906. On the other hand, if the acquisition has not ended (NO in step S902), the process proceeds to step S903.

ステップＳ９０３において、オブジェクト情報から、対象の文書ＩＤ、ブロックＩＤに対して、ステップＳ３０５で指定されたアクセスレベルに対応したブロックのブロック内容を取得する。 In step S903, the block contents of the block corresponding to the access level specified in step S305 are acquired from the object information for the target document ID and block ID.

ここで、ステップＳ３０５で指定されたアクセスレベルの値と同じ値のオブジェクトアクセスレベルがあれば、そのオブジェクトアクセスレベルに対応したブロックのブロック内容を取り出す。そうでなければ、指定されたアクセスレベルよりも小さく、最大の値を持つオブジェクトアクセスレベルを探し、そのオブジェクトアクセスレベルに対応したブロックのブロック内容を取得する。 If there is an object access level having the same value as the access level specified in step S305, the block contents of the block corresponding to the object access level are extracted. Otherwise, an object access level that is smaller than the specified access level and has the maximum value is searched, and the block contents of the block corresponding to the object access level are acquired.
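The selection rule in step S903 (use the object whose level equals the requested level, otherwise the object with the largest level below it) amounts to taking the maximum object access level that does not exceed the requested level. The following Python sketch illustrates this under stated assumptions: the `ObjectRecord` type, the block IDs `B1` to `B4`, and the sample rows are hypothetical, chosen only to be consistent with the FIG. 8 description of document 0001, and are not the actual contents of FIG. 7.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ObjectRecord:
    """One row of the object information (hypothetical field names)."""
    object_id: str
    doc_id: str
    block_id: str
    access_level: int
    content: str


def select_block_content(objects, doc_id, block_id, requested_level) -> Optional[ObjectRecord]:
    """Pick the object with the highest access level <= requested_level."""
    eligible = [
        o for o in objects
        if o.doc_id == doc_id and o.block_id == block_id
        and o.access_level <= requested_level
    ]
    # No eligible object means the block has no content at this level.
    return max(eligible, key=lambda o: o.access_level, default=None)


def generate_contents(objects, doc_id, block_ids, requested_level):
    """Steps S901 to S905: collect the content of every block that has one."""
    contents = {}
    for block_id in block_ids:
        chosen = select_block_content(objects, doc_id, block_id, requested_level)
        if chosen is not None:
            contents[block_id] = chosen.content
    return contents


# Hypothetical rows consistent with the FIG. 8 description.
EXAMPLE_OBJECTS = [
    ObjectRecord("O1", "0001", "B1", 1, "cat"),
    ObjectRecord("O2", "0001", "B1", 2, "dog"),
    ObjectRecord("O3", "0001", "B2", 2, "car"),
    ObjectRecord("O4", "0001", "B3", 4, "aaa"),
    ObjectRecord("O5", "0001", "B4", 1, "bbb"),
]
BLOCKS = ["B1", "B2", "B3", "B4"]
level_3 = generate_contents(EXAMPLE_OBJECTS, "0001", BLOCKS, 3)
```

Here a request at level 3 falls back to the level 2 objects for blocks `B1` and `B2`, and block `B3` is omitted because its only object requires level 4.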

ステップＳ９０４において、ブロック内容を取得したか否かを判定する。取得していない場合（ステップＳ９０４でＮＯ）、次のブロックＩＤを取得するために、ステップＳ９０１に戻る。一方、取得した場合（ステップＳ９０４でＹＥＳ）、ステップＳ９０５に進む。 In step S904, it is determined whether the block content has been acquired. If not acquired (NO in step S904), the process returns to step S901 to acquire the next block ID. On the other hand, if acquired (YES in step S904), the flow advances to step S905.

ステップＳ９０５において、ステップＳ３０５で指定された文書の文書ＩＤと、ステップＳ９０１で取得したブロックＩＤと、ステップＳ９０３で取得したブロック内容とを対応させてメモリに一時記憶する。その後、ステップＳ９０１に戻る。 In step S905, the document ID of the document specified in step S305, the block ID acquired in step S901, and the block content acquired in step S903 are temporarily stored in the memory in association with each other. Thereafter, the process returns to step S901.

ステップＳ９０２において、すべてのブロックＩＤの取得が終了した場合、ステップＳ９０６において、ステップＳ９０５で一時記憶したブロックとそのブロック内容を元に、文書データを生成する。生成する文書データは、ＨＴＭＬ文書データであり、ＣＳＳ（ＣａｓｃａｄｉｎｇＳｔｙｌｅｓｈｅｅｔs）を用いて、ブロック情報に格納されているブロックの位置サイズ情報に基づいて、ブロックの内容をレイアウトする。 If acquisition of all block IDs is completed in step S902, document data is generated in step S906 based on the blocks and block contents temporarily stored in step S905. The generated document data is HTML document data, and the block contents are laid out with CSS (Cascading Style Sheets) according to the position and size information of each block stored in the block information.

次に、ＭＦＰ１００を用いて文書管理サーバ１０６で管理されている文書データを検索するための前準備として、マネージメントＰＣ１０１の制御のもとに、文書データをＭＦＰ１００内のデータベース１１８に登録する登録処理の概要について、図１０を用いて説明する。 Next, an outline of the registration process will be described with reference to FIG. 10. In this process, as preparation for using the MFP 100 to search the document data managed by the document management server 106, the document data is registered in the database 118 in the MFP 100 under the control of the management PC 101.
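One way to picture the layout in step S906 is to emit each stored block as an absolutely positioned element. The sketch below is only an illustration: the dictionary keys, the use of pixel units, and the use of `div` elements with inline styles are assumptions, since the source says only that CSS is used with the stored position and size information.

```python
from html import escape


def render_html(blocks):
    """Lay out each block with CSS absolute positioning (assumed pixel units)."""
    parts = ["<html><body>"]
    for b in blocks:
        style = (
            f"position:absolute;left:{b['x']}px;top:{b['y']}px;"
            f"width:{b['width']}px;height:{b['height']}px"
        )
        # Text content is escaped so that markup characters are shown literally.
        parts.append(f'<div style="{style}">{escape(b["content"])}</div>')
    parts.append("</body></html>")
    return "\n".join(parts)


# Hypothetical blocks with position, size, and text content.
page = render_html([
    {"x": 10, "y": 20, "width": 200, "height": 50, "content": "bbb"},
    {"x": 10, "y": 90, "width": 200, "height": 50, "content": "a < b"},
])
```

An image block would instead reference the stored file address, for example through an `img` element; that case is left out of this sketch.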

図１０は本発明の実施形態の登録処理を示すフローチャートである。 FIG. 10 is a flowchart showing registration processing according to the embodiment of the present invention.

まず、ステップＳ１００１において、文書管理サーバ１０６から取得して得られる文書情報を元に、文書ＩＤをひとつ取得する。 First, in step S1001, one document ID is acquired based on the document information acquired from the document management server 106.

ステップＳ１００２において、すべての文書ＩＤの取得が終了したか否かを判定する。取得が終了した場合（ステップＳ１００２でＹＥＳ）、ステップＳ１００９に進む。一方、取得が終了していない場合（ステップＳ１００２でＮＯ）、ステップＳ１００３に進む。 In step S1002, it is determined whether or not all document IDs have been acquired. If the acquisition is completed (YES in step S1002), the process proceeds to step S1009. On the other hand, if the acquisition has not ended (NO in step S1002), the process proceeds to step S1003.

ステップＳ１００３において、アクセスレベルの最大値から順番に設定する。ステップＳ１００４で、すべてのアクセスレベルの設定が終了したか否かを判定する。設定が終了した場合（ステップＳ１００４でＹＥＳ）、ステップＳ１００１に戻る。一方、設定が終了していない場合（ステップＳ１００４でＮＯ）、ステップＳ１００５に進む。 In step S1003, the access level is set, taking the levels one at a time in order from the maximum value. In step S1004, it is determined whether or not all access levels have been set. If all have been set (YES in step S1004), the process returns to step S1001. Otherwise (NO in step S1004), the process proceeds to step S1005.

ステップＳ１００５において、ステップＳ３０６と同様にして、文書管理サーバ１０６から取得して得られる文書ブロック情報とオブジェクト情報を元に、ステップＳ１００１で取得した文書ＩＤとステップＳ１００３で設定したアクセスレベルに対する文書データを生成する。 In step S1005, similarly to step S306, based on the document block information and object information obtained from the document management server 106, the document ID obtained in step S1001 and the document data for the access level set in step S1003 are obtained. Generate.

続いて、ステップＳ１００６において、ステップＳ１００５で生成した文書データと同一の文書データが生成済であるか否かを判定する。生成済である場合（ステップＳ１００６でＹＥＳ）、ステップＳ１００８に進む。一方、生成済でない場合（ステップＳ１００６でＮＯ）、ステップＳ１００７に進む。 Subsequently, in step S1006, it is determined whether document data identical to the document data generated in step S1005 has already been generated. If it has (YES in step S1006), the process proceeds to step S1008. If it has not (NO in step S1006), the process proceeds to step S1007.

尚、ステップＳ１００６において、生成した文書データと同一の文書データが生成済であるか否かの判定は、図１１に示す生成文書情報と図１２に示す生成文書ブロック情報を参照して実現する。 In step S1006, whether or not the same document data as the generated document data has been generated is determined with reference to the generated document information shown in FIG. 11 and the generated document block information shown in FIG.

ここで、生成文書情報及び生成文書ブロック情報について、図１１及び図１２を用いて説明する。 Here, the generated document information and the generated document block information will be described with reference to FIGS. 11 and 12.

図１１は本発明の実施形態の生成文書情報の一例を示す図である。また、図１２は本発明の実施形態の生成文書ブロック情報の一例を示す図である。 FIG. 11 is a diagram illustrating an example of generated document information according to the embodiment of this invention. FIG. 12 is a diagram showing an example of generated document block information according to the embodiment of the present invention.

生成文書情報と生成文書ブロック情報は、文書ＩＤに対して、異なるアクセスレベルごとに生成される文書情報を格納する情報である。生成文書情報は、文書管理サーバ１０６で管理されている全ての文書データに対して、アクセスレベルによって生成される文書データのバリエーションを格納する情報であり、図１１に示すように、文書管理サーバ１０６で管理されている全ての文書ＩＤに対して、アクセスレベルごとに生成される文書データのＩＤである生成文書ＩＤと対応付けて管理される。 The generated document information and the generated document block information store, for each document ID, the document information generated at each different access level. The generated document information holds the variations of document data produced by the access levels for all document data managed by the document management server 106. As shown in FIG. 11, every document ID managed by the document management server 106 is managed in association with generated document IDs, which identify the document data generated for each access level.

また、異なるアクセスレベルでも、生成される文書データが同一の場合は、同じ生成文書ＩＤが付与される。 Also, even when the access levels are different, the same generated document ID is assigned when the generated document data is the same.

一方、生成文書ブロック情報は、図１２に示すように、生成文書ＩＤに対して、その文書データに含まれるブロック、およびそのブロック内容であるオブジェクトＩＤとが対応付けて管理される。 On the other hand, as shown in FIG. 12, the generated document block information is managed by associating a generated document ID with a block included in the document data and an object ID that is the content of the block.

図１０の説明に戻る。 Returning to the description of FIG. 10.

ステップＳ１００６において、生成した文書データと同一の文書データが生成済でない場合、ステップＳ１００７において、図１２に示されるように、生成した文書データに対する生成文書ＩＤを新たに発行し、ステップＳ１００５で生成した文書データのブロックの情報（属性、位置、サイズ）とブロックの内容を示すオブジェクトＩＤとを生成文書ＩＤと対応させて、データベース１１８上の生成文書ブロック情報に格納する。 If the same document data as the generated document data has not been generated in step S1006, a generated document ID for the generated document data is newly issued in step S1007 and generated in step S1005, as shown in FIG. Document data block information (attribute, position, size) and object ID indicating the contents of the block are stored in the generated document block information on the database 118 in association with the generated document ID.

また、図１１に示されるように、生成文書ＩＤと文書ＩＤとアクセスレベルを対応させて、データベース１１８上の生成文書情報に格納する。また、生成文書情報には、生成した文書データに対して、ラスタ画像を作成して縮小することでサムネイル画像を作成し、そのサムネイル画像のアドレス情報を格納する。その後、ステップＳ１００１に戻る。 Further, as shown in FIG. 11, the generated document ID, the document ID, and the access level are stored in association with one another in the generated document information on the database 118. In addition, a thumbnail image is created by rendering the generated document data as a raster image and reducing it, and the address information of that thumbnail image is stored in the generated document information. Thereafter, the process returns to step S1001.

一方、ステップＳ１００６において、生成した文書データと同一の文書データが生成済である場合、ステップＳ１００８において、生成文書情報の同一の生成文書データの生成文書ＩＤに対して、ステップＳ１００３で設定したアクセスレベルを追加する。その後、ステップＳ１００１に戻る。 On the other hand, if document data identical to the generated document data has already been generated in step S1006, then in step S1008 the access level set in step S1003 is added to the entry for the generated document ID of that identical document data in the generated document information. Thereafter, the process returns to step S1001.
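The deduplication in steps S1003 to S1008 can be sketched as a nested loop over document IDs and access levels. In this illustrative Python sketch, the `generate` callback stands in for the document generation of step S1005, the `G0001`-style IDs and dictionary layout are hypothetical, and "identical" is assumed to mean the same block-to-object mapping within the same document ID; the source does not spell out the comparison.

```python
def register_variants(doc_ids, access_levels, generate):
    """Register one generated document per distinct variant of each document."""
    generated_docs = []
    for doc_id in doc_ids:
        # Step S1003: take the access levels from the maximum downward.
        for level in sorted(access_levels, reverse=True):
            blocks = generate(doc_id, level)  # step S1005
            match = next(
                (g for g in generated_docs
                 if g["doc_id"] == doc_id and g["blocks"] == blocks),
                None,
            )
            if match is None:
                # Step S1007: issue a new generated document ID.
                generated_docs.append({
                    "generated_id": f"G{len(generated_docs) + 1:04d}",
                    "doc_id": doc_id,
                    "access_levels": [level],
                    "blocks": blocks,
                })
            else:
                # Step S1008: reuse the ID and record the extra level.
                match["access_levels"].append(level)
    return generated_docs


def example_generate(doc_id, level):
    """Hypothetical variants consistent with the FIG. 8 description."""
    full = {"B1": "dog", "B2": "car", "B3": "aaa", "B4": "bbb"}
    mid = {"B1": "dog", "B2": "car", "B4": "bbb"}
    low = {"B1": "cat", "B4": "bbb"}
    return dict({4: full, 3: mid, 2: mid, 1: low}[level])


variants = register_variants(["0001"], [1, 2, 3, 4], example_generate)
```

For the example document, levels 3 and 2 produce the same content, so they would share one generated document ID and three variants would be registered in total.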

ステップＳ１００２において、すべての文書ＩＤの取得が終了した場合（ステップＳ１００２でＹＥＳ）、ステップＳ１００９において、文書管理サーバ１０６から取得して得られるオブジェクト情報を元に、各オブジェクトの種別に応じて特徴量情報の抽出を行う。 If acquisition of all document IDs is completed in step S1002 (YES in step S1002), a feature amount corresponding to the type of each object is obtained based on the object information obtained from the document management server 106 in step S1009. Extract information.

例えば、画像ブロックについては、色に関する画像特徴量を抽出する。文字ブロックについては、その文字ブロック中の文字列を文字特徴量として抽出する。各オブジェクトの種別に応じて抽出した特徴量の内、画像特徴量は、図１３に示す色特徴量情報として、文字特徴量は、図１４に示す文字特徴量情報としてそれぞれデータベース１１８上に登録する。その後、登録処理を終了する。 For example, for an image block, an image feature amount related to color is extracted. For a character block, a character string in the character block is extracted as a character feature amount. Of the feature quantities extracted in accordance with the type of each object, the image feature quantity is registered on the database 118 as the color feature quantity information shown in FIG. 13, and the character feature quantity is registered on the database 118 as the character feature quantity information shown in FIG. . Thereafter, the registration process is terminated.

次に、紙文書から対応するオリジナルの文書データを検索し、印刷等の各種処理を実行する検索処理について、図１５を用いて説明する。 Next, a search process that retrieves the original document data corresponding to a paper document and executes various processes such as printing will be described with reference to FIG. 15.

図１５は本発明の実施形態の検索処理を示すフローチャートである。 FIG. 15 is a flowchart showing search processing according to the embodiment of the present invention.

まず、ステップＳ１５０１において、マネージメントＰＣ１０１からオリジナル文書検索システムのログインを行う。ステップＳ３０１と同様に、キーボードから、ユーザ名とパスワードを入力する。入力されたユーザ名とパスワードは、文書管理サーバ１０６へ送信され、ステップＳ３０１と同様に、ユーザ名とパスワードの正当性が判定され、正当性が確認されるとセッションが開始される。 First, in step S1501, the user logs in to the original document search system from the management PC 101. As in step S301, the user name and password are entered from the keyboard. The entered user name and password are transmitted to the document management server 106, where their validity is determined as in step S301, and a session is started once they are confirmed to be valid.

続いて、ステップＳ１５０２において、ステップＳ３０３と同様にして、文書管理サーバ１０６において、ユーザのアクセスレベルを取得する。 In step S1502, the document management server 106 acquires the user access level in the same manner as in step S303.

アクセスレベルを取得すると、ユーザ名とパスワードの正当性の判定結果が、文書管理サーバ１０６から返信され、正当である場合は、ステップＳ１５０３において、マネージメントＰＣ１０１は、ＭＦＰ１００のＡＤＦを含む画像入力部１１０を動作させ、紙文書をラスタ状に走査してラスタ画像を記憶部１１１に読み込む。 When the access level is acquired, the result of determination of the validity of the user name and password is returned from the document management server 106. If the result is valid, the management PC 101 displays the image input unit 110 including the ADF of the MFP 100 in step S1503. In operation, the paper document is scanned in a raster shape, and a raster image is read into the storage unit 111.

ステップＳ１５０４において、記憶部１１１に格納されている１ページ分のラスタ画像に対して、ブロックセレクション（ＢＳ）処理を行う。この処理は、マネージメントＰＣ１０１の制御によって実行する。 In step S1504, block selection (BS) processing is performed on the raster image for one page stored in the storage unit 111. This process is executed under the control of the management PC 101.

具体的には、マネージメントＰＣ１０１のＣＰＵは、記憶部１１１に格納された処理対象のラスタ画像を、まず、文字／線画部分とハーフトーン画像部分とに領域分割し、文字／線画部分は更に段落で塊として纏まっているブロック毎に、あるいは線で構成された表、図形毎に分割する。 Specifically, the CPU of the management PC 101 first divides the raster image to be processed, which is stored in the storage unit 111, into a character/line drawing portion and a halftone image portion. The character/line drawing portion is further divided into blocks, each of which is either a paragraph-like cluster of text or a table or figure composed of lines.

一方、ハーフトーン画像部分は、矩形に分離されたブロックの画像部分、背景部分等のブロックに分割する。 On the other hand, the halftone image part is divided into blocks such as an image part of a block separated into a rectangle and a background part.

ここで、ブロックセレクション処理を行う前のラスタ画像とブロックセレクションの結果の対応の例を、図１６に示す。 Here, FIG. 16 shows an example of the correspondence between a raster image before block selection processing and the result of block selection.

次に、ステップＳ１５０５において、各ブロックの種別に応じてブロック毎に特徴量抽出を行う。 Next, in step S1505, feature amount extraction is performed for each block according to the type of each block.

この特徴量抽出においては、画像ブロックについては、その位置とサイズを対応させ、さらに、色に関する画像特徴量を対応させて画像特徴量情報（紙文書画像特徴量情報）とする。紙文書画像特徴量情報の例を図１７に示す。また、文字ブロックに対しては、その位置とサイズを対応させ、さらに、ＯＣＲをかけて抽出された文字コードを文字特徴量（紙文書文字特徴量情報）とする。ここで、紙文書文字特徴量情報の例を図１８に示す。 In this feature amount extraction, each image block is associated with its position and size and with an image feature amount relating to color, and the result is used as image feature amount information (paper document image feature amount information). An example of the paper document image feature amount information is shown in FIG. 17. Each character block is likewise associated with its position and size, and the character codes extracted from it by OCR are used as the character feature amount (paper document character feature amount information). An example of the paper document character feature amount information is shown in FIG. 18.

そして、これらの紙文書文字特徴量情報と紙文書画像特徴量情報は、記憶部１１１内に一時的に記憶される。 The paper document character feature amount information and the paper document image feature amount information are temporarily stored in the storage unit 111.

次に、ステップＳ１５０６において、比較処理を行う。データベース１１８に格納されている生成文書ブロック情報を元に、生成文書ＩＤをひとつずつ順番に処理し、生成文書ＩＤに対応したブロック情報（位置、サイズ、属性）と、そのブロック（オブジェクトＩＤ）に対応したデータベース１１８中の色特徴量情報とテキスト特徴量情報を、記憶部１１１内に格納されている紙文書画像特徴量情報と紙文書文字特徴量情報から類似度を算出し、所定の閾値よりも高い文書データを、文書候補リストに登録する。 Next, in step S1506, comparison processing is performed. Based on the generated document block information stored in the database 118, the generated document IDs are processed one at a time in order. For each generated document ID, a similarity is calculated between the block information (position, size, attribute) corresponding to that ID, together with the color feature amount information and text feature amount information in the database 118 for each of its blocks (object IDs), and the paper document image feature amount information and paper document character feature amount information stored in the storage unit 111. Document data whose similarity exceeds a predetermined threshold is registered in the document candidate list.

文書候補リストは、生成文書ＩＤと類似度を対応させたリストであり、その一例を、図１９に示す。文書候補リストの中では、類似度の値によって、生成文書ＩＤが降順にソーティングされる。 The document candidate list associates each generated document ID with its similarity, and an example is shown in FIG. 19. Within the document candidate list, the generated document IDs are sorted in descending order of similarity.
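The construction of the document candidate list can be sketched as a filter followed by a descending sort. How the similarity itself is computed is not covered here; the similarity values, the threshold, and the generated document IDs below are hypothetical sample data.

```python
def build_candidate_list(similarities, threshold):
    """Keep generated documents above the threshold, most similar first.

    `similarities` maps a generated document ID to its similarity score.
    """
    candidates = [
        (generated_id, score)
        for generated_id, score in similarities.items()
        if score > threshold
    ]
    # Sort by similarity in descending order, as in the candidate list.
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    return candidates


scores = {"G0001": 72.0, "G0002": 95.5, "G0003": 40.0, "G0004": 88.0}
candidate_list = build_candidate_list(scores, threshold=70.0)
```

With this sample data, `G0003` would be dropped and the remaining IDs would be ordered `G0002`, `G0004`, `G0001`.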

ステップＳ１５０７において、候補表示／選択を行う。文書候補リストに格納されている生成文書ＩＤに対する文書データのサムネイルを表示し、複数の候補の中からオペレータがオリジナルの文書データの生成文書ＩＤの特定を行う。サムネイルは、データベース１１８中に格納された生成文書情報から生成文書ＩＤに対応したサムネイル画像を得ることができる。 In step S1507, candidate display / selection is performed. A thumbnail of the document data corresponding to the generated document ID stored in the document candidate list is displayed, and the operator specifies the generated document ID of the original document data from a plurality of candidates. A thumbnail image corresponding to the generated document ID can be obtained from the generated document information stored in the database 118.

マネージメントＰＣ１０１のディスプレイに、生成文書ＩＤに対応したサムネイル画像の一覧を表示し、オペレータは、その中からサムネイル画像の一つを選択すると、選択されたサムネイル画像に対応した生成文書ＩＤが選択されることになる。 A list of thumbnail images corresponding to the generated document ID is displayed on the display of the management PC 101. When the operator selects one of the thumbnail images from the list, the generated document ID corresponding to the selected thumbnail image is selected. It will be.

続いて、ステップＳ１５０８において、ステップＳ１５０７で選択された生成文書ＩＤに対して、取得する文書ＩＤを設定し、さらに、取得する際に必要となるアクセスレベルの設定を行う。 In step S1508, a document ID to be acquired is set for the generated document ID selected in step S1507, and an access level necessary for acquisition is set.

続いて、ステップＳ１５０９において、ステップＳ１５０８で設定された文書ＩＤに対して、設定されたアクセスレベルに対応した文書データを文書管理サーバ１０６から取得する。これは、文書管理サーバ１０６の文書生成プログラムに、ステップＳ１５０１でログインしたセッションで、文書ＩＤとアクセスレベルを送信し、文書ＩＤとアクセスレベルに対応した文書データを生成させて取得する。この処理は、ステップＳ３０６と同様である。 In step S1509, document data corresponding to the set access level is acquired from the document management server 106 for the document ID set in step S1508. This is obtained by transmitting the document ID and the access level to the document generation program of the document management server 106 in the session logged in in step S1501, and generating the document data corresponding to the document ID and the access level. This process is the same as step S306.

ステップＳ１５１０において、ステップＳ１５０９で取得した文書データに対して、オペレータの指示に基づいて、印刷、配信、編集のいずれかの処理を行う。 In step S1510, any one of printing, distribution, and editing is performed on the document data acquired in step S1509 based on an instruction from the operator.

以下、各処理の詳細について説明する。 Details of each process will be described below.

まず、ステップＳ１５０４のブロックセレクション処理の詳細について説明する。 First, details of the block selection process in step S1504 will be described.

ブロックセレクション処理とは、例えば、図１６（ａ）の示すラスタ画像を、図１６（ｂ）のように、意味のある各オブジェクト毎の塊として認識し、該ブロック各々の属性（文字（ＴＥＸＴ）／図画（ＰＩＣＴＵＲＥ）／写真（ＰＨＯＴＯ）／線（ＬＩＮＥ）／表（ＴＡＢＬＥ）等）を判定し、異なる属性を持つブロックに分割する処理である。 In the block selection process, for example, the raster image shown in FIG. 16A is recognized as a block for each meaningful object as shown in FIG. 16B, and the attribute (character (TEXT)) of each block is recognized. / Picture (PICTURE) / Photo (PHOTO) / Line (LINE) / Table (TABLE), etc.), and is divided into blocks having different attributes.

ブロックセレクション処理の実施形態を以下に説明する。 An embodiment of the block selection process will be described below.

まず、入力画像を白黒に二値化し、輪郭線追跡を行って黒画素輪郭で囲まれる画素の塊を抽出する。面積の大きい黒画素の塊については、内部にある白画素に対しても輪郭線追跡を行って白画素の塊を抽出、さらに一定面積以上の白画素の塊の内部からは再帰的に黒画素の塊を抽出する。 First, the input image is binarized into black and white, and contour tracking is performed to extract a block of pixels surrounded by a black pixel contour. For a black pixel block with a large area, the white pixel block is extracted by tracing the outline of the white pixel inside, and a black pixel is recursively extracted from the white pixel block with a certain area or more. Extract the lump.

このようにして得られた黒画素の塊を、大きさ及び形状で分類し、異なる属性を持つブロックへ分類していく。例えば、縦横比が１に近く、大きさが一定の範囲のブロックは文字相当の画素塊とし、さらに近接する文字が整列良くグループ化可能な部分を文字ブロック、扁平な画素塊を線ブロック、一定大きさ以上でかつ矩形の白画素塊を整列よく内包する黒画素塊の占める範囲を表ブロック、不定形の画素塊が散在している領域を写真ブロック、それ以外の任意形状の画素塊を図画ブロックとする。 The clusters of black pixels obtained in this way are classified by size and shape into blocks having different attributes. For example, a cluster whose aspect ratio is close to 1 and whose size falls within a certain range is treated as a pixel cluster corresponding to a character, and a region in which neighboring characters can be grouped in good alignment is treated as a character block. A flat pixel cluster is treated as a line block. The area occupied by a black pixel cluster that is at least a certain size and contains well-aligned rectangular white pixel clusters is treated as a table block. A region in which irregularly shaped pixel clusters are scattered is treated as a photo block, and a pixel cluster of any other shape is treated as a picture block.

次に、ステップＳ１５０５の特徴量抽出処理の詳細について説明する。 Next, details of the feature amount extraction processing in step S1505 will be described.

尚、特徴量抽出は、画像、文字で処理方法が異なるので、それぞれ別に説明する。 It should be noted that the feature amount extraction will be described separately because the processing method differs between images and characters.

ここで、画像ブロックは、図１６（ｂ）の例の場合、写真ブロックと図画ブロックとするが、用途や目的に応じて、画像ブロックを写真ブロック及び図画ブロックの少なくとも一方にすることも可能である。 Here, in the example of FIG. 16B, the image block is a photographic block and a graphic block, but the image block may be at least one of a photographic block and a graphic block depending on the application and purpose. is there.

まず、画像ブロックに対する特徴量抽出処理について説明する。 First, feature amount extraction processing for an image block will be described.

尚、１文書に複数の画像ブロックが存在する場合は、その総数分、以下の処理を繰り返す。 If there are a plurality of image blocks in one document, the following processing is repeated for the total number of image blocks.

本実施形態では、一例として、画像の色に関する色特徴量を抽出する色特徴量情報抽出処理を行う。 In this embodiment, as an example, color feature amount information extraction processing for extracting a color feature amount related to the color of an image is performed.

この色特徴量情報抽出処理の詳細について、図２０を用いて説明する。 Details of the color feature amount information extraction processing will be described with reference to FIG. 20.

図２０は本発明の実施形態の色特徴量情報抽出処理の詳細を示すフローチャートである。 FIG. 20 is a flowchart showing details of color feature amount information extraction processing according to the embodiment of the present invention.

尚、この処理では、処理対象画像を複数のメッシュブロックに分割した各メッシュブロックの色ヒストグラム中の最頻色を有する色と各メッシュブロックの位置情報を対応づけた情報を色特徴情報として抽出する。 In this processing, the processing target image is divided into a plurality of mesh blocks, and information associating the most frequent color in the color histogram of each mesh block with the position information of that mesh block is extracted as color feature information.

まず、ステップＳ２０２０で、画像を複数のメッシュブロックに分割する。本実施形態では、図２１に示すように、画像を縦横をそれぞれ９メッシュブロックに分割する。特に、本実施形態では、表記の都合上９×９＝８１メッシュブロックに分割している例を示しているが、実際には、１５×１５＝２２５メッシュブロック程度であることが好ましい。 First, in step S2020, the image is divided into a plurality of mesh blocks. In the present embodiment, as shown in FIG. 21, the image is divided into 9 mesh blocks in the vertical and horizontal directions. In particular, in the present embodiment, an example in which it is divided into 9 × 9 = 81 mesh blocks is shown for convenience of description, but actually, it is preferable that the number is about 15 × 15 = 225 mesh blocks.

次に、ステップＳ２０３０で、処理対象となる着目メッシュブロックを左上端のブロックに設定する。尚、この着目メッシュブロックの設定は、例えば、図２２に示すように、予め処理順序が決定された順序決定テーブルを参照して行う。本実施形態では、左上端から右へ走査し、その行を終えると次の行の左端から右へスキャンする走査例を示している。 In step S2030, the target mesh block to be processed is set as the upper left block. For example, as shown in FIG. 22, the target mesh block is set with reference to an order determination table in which the processing order is determined in advance. In the present embodiment, a scanning example is shown in which scanning is performed from the upper left end to the right, and when the line is finished, scanning is performed from the left end of the next line to the right.

ステップＳ２０４０で、未処理の着目メッシュブロックの有無を判定する。未処理の着目メッシュブロックがない場合（ステップＳ２０４０でＮＯ）、処理を終了する。一方、未処理の着目メッシュブロックがある場合（ステップＳ２０４０でＹＥＳ）、ステップＳ２０５０に進む。 In step S2040, the presence / absence of an unprocessed target mesh block is determined. If there is no unprocessed target mesh block (NO in step S2040), the process ends. On the other hand, if there is an unprocessed target mesh block (YES in step S2040), the process proceeds to step S2050.

ステップＳ２０５０で、着目メッシュブロックの全画素の各濃度値を、図２３の色空間を分割して作った部分空間である色ビンへ射影し、色ビンに対する色ヒストグラムを生成する。 In step S2050, the density values of all the pixels of the target mesh block are projected onto a color bin, which is a partial space created by dividing the color space of FIG. 23, and a color histogram for the color bin is generated.

尚、本実施形態では、図２３に示すように、ＲＧＢ色空間を３×３×３＝２７に分割した色ビンへ着目メッシュブロックの全画素の濃度値を射影する場合を示しているが、実際には、ＲＧＢ色空間を６×６×６＝２１６に分割した色ビンへ着目メッシュブロックの全画素の濃度値を射影するほうが好ましい。 In the present embodiment, as shown in FIG. 23, the density values of all the pixels of the target mesh block are projected onto the color bin obtained by dividing the RGB color space into 3 × 3 × 3 = 27. Actually, it is preferable to project the density values of all the pixels of the target mesh block onto the color bin obtained by dividing the RGB color space into 6 × 6 × 6 = 216.

ステップＳ２０６０で、色ヒストグラムの最頻色ビンの色ビンＩＤをその着目メッシュブロックの代表色と決定し、その着目メッシュブロックとその位置に対応づけて記憶部１１１に記憶する。 In step S2060, the color bin ID of the most frequent color bin of the color histogram is determined as the representative color of the target mesh block, and stored in the storage unit 111 in association with the target mesh block and its position.

ステップＳ２０７０で、図２２の順序決定テーブルを参照して、次の処理対象となる着目メッシュブロックを設定する。その後、ステップＳ２０４０に戻り、未処理の着目メッシュブロックがなくなるまで、ステップＳ２０４０〜ステップＳ２０７０の処理を繰り返す。 In step S2070, the target mesh block to be processed next is set with reference to the order determination table of FIG. 22. The process then returns to step S2040, and steps S2040 to S2070 are repeated until no unprocessed target mesh block remains.

以上の処理によって、処理対象画像（画像ブロック）のメッシュブロック毎の代表色と各メッシュブロックの位置情報が対応付けられた情報を色特徴量情報として抽出することができる。 Through the above processing, information in which the representative color for each mesh block of the processing target image (image block) and the position information of each mesh block are associated can be extracted as color feature amount information.
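
The extraction in steps S2020 to S2070 can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the function names, the representation of an image as nested lists of 8-bit (r, g, b) tuples, and the uniform quantization used to number the color bins are all assumptions made for the example. The defaults follow the 9 × 9 mesh and 3 × 3 × 3 bins used in the figures.

```python
from collections import Counter


def color_bin_id(pixel, bins_per_axis=3):
    """Map an 8-bit (r, g, b) pixel to a color bin ID in [0, bins_per_axis**3).

    Uniform quantization per axis is an assumption; FIG. 23 may number bins differently.
    """
    r, g, b = (min(c * bins_per_axis // 256, bins_per_axis - 1) for c in pixel)
    return (r * bins_per_axis + g) * bins_per_axis + b


def extract_color_features(image, mesh=9, bins_per_axis=3):
    """Return {(row, col): representative color bin ID} for every mesh block.

    `image` is a list of rows, each row a list of (r, g, b) tuples.
    """
    height, width = len(image), len(image[0])
    features = {}
    # Raster order: left to right within a row, then the next row down.
    for row in range(mesh):
        y0, y1 = row * height // mesh, (row + 1) * height // mesh
        for col in range(mesh):
            x0, x1 = col * width // mesh, (col + 1) * width // mesh
            # Color histogram over all pixels of the target mesh block.
            histogram = Counter(
                color_bin_id(image[y][x], bins_per_axis)
                for y in range(y0, y1)
                for x in range(x0, x1)
            )
            if histogram:
                # The most frequent bin becomes the representative color.
                features[(row, col)] = histogram.most_common(1)[0][0]
    return features
```

Keying the result by mesh position keeps the representative color and the block position associated, which is the pairing the comparison stage relies on.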

次に、文字ブロックに対する文字特徴量情報抽出処理について説明する。 Next, character feature information extraction processing for character blocks will be described.

尚、１文書に複数の文字ブロックが存在する場合は、その総数分、以下の処理を繰り返す。 If there are a plurality of character blocks in one document, the following processing is repeated for the total number of character blocks.

文字ブロックに対する文字特徴量情報は、その文字ブロックにＯＣＲ（文字認識）処理を施して得られる文字コードとする。 Character feature amount information for a character block is a character code obtained by subjecting the character block to OCR (character recognition) processing.

ＯＣＲ（文字認識）処理は、文字ブロックから文字単位で切り出された文字画像に対し、パターンマッチングの一手法を用いて文字認識を行い、対応する文字コードを取得する。 In the OCR (character recognition) process, character recognition is performed on a character image cut out in character units from a character block using a pattern matching method, and a corresponding character code is acquired.

この文字認識処理は、文字画像から得られる特徴を数十次元の数値列に変換した観測特徴ベクトルと、あらかじめ字種毎に求められている辞書特徴ベクトルとを比較し、最も距離の近い字種を認識結果とするものである。 This character recognition process compares an observed feature vector, obtained by converting features of the character image into a numerical sequence of several tens of dimensions, with dictionary feature vectors obtained in advance for each character type, and takes the character type with the shortest distance as the recognition result.

特徴ベクトルの抽出には種々の公知手法があり、例えば、文字をメッシュ状に分割し、各メッシュブロック内の文字線を方向別に線素としてカウントしたメッシュ数次元ベクトルを特徴とする方法がある。 There are various known methods for extracting a feature vector. For example, there is a method characterized by dividing a character into meshes and using a mesh number-dimensional vector obtained by counting character lines in each mesh block as line elements according to directions.

ブロックセレクション処理（ステップＳ１５０４）で抽出された文字ブロックに対して文字認識を行う場合は、まず、該当文字ブロックに対し横書き／縦書きの判定を行い、各々対応する方向に文字列を切り出し、その後、文字列から文字を切り出して文字画像を取得する。 When character recognition is performed on the character block extracted in the block selection process (step S1504), first, horizontal writing / vertical writing is determined for the corresponding character block, and a character string is cut out in the corresponding direction. The character image is obtained by cutting out the character from the character string.

横書き／縦書きの判定は、該当文字ブロック内で画素値に対する水平／垂直の射影を取り、水平射影の分散が大きい場合は横書き、垂直射影の分散が大きい場合は縦書きと判定する。文字列及び文字への分解は、横書きの文字ブロックである場合には、その水平方向の射影を利用して行を切り出し、さらに切り出された行に対する垂直方向の射影から、文字を切り出すことで行う。一方、縦書きの文字ブロックに対しては、水平と垂直を逆にすれば良い。 The horizontal / vertical writing is determined by taking a horizontal / vertical projection of the pixel value in the corresponding character block. If the horizontal projection has a large variance, the horizontal writing is determined, and if the vertical projection has a large variance, the vertical writing is determined. If the block is a horizontally written character block, the character string and character are decomposed by cutting out the line using the horizontal projection and cutting out the character from the vertical projection of the cut line. . On the other hand, for vertically written character blocks, horizontal and vertical may be reversed.
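
The writing-direction test and the projection-based segmentation described above can be sketched as follows. The function names and the binary-pixel representation (1 for a black pixel) are assumptions for illustration, and the tie-breaking rule (equal variances are treated as horizontal writing) is not specified in the description.

```python
from statistics import pvariance


def writing_direction(block):
    """Classify a character block as 'horizontal' or 'vertical' writing.

    `block` is a list of rows of 0/1 values, where 1 is a black pixel.
    """
    horizontal_projection = [sum(row) for row in block]        # one total per pixel row
    vertical_projection = [sum(col) for col in zip(*block)]    # one total per pixel column
    # Text lines separated by blank gaps make the projection across them vary strongly.
    if pvariance(horizontal_projection) >= pvariance(vertical_projection):
        return "horizontal"
    return "vertical"


def split_runs(projection):
    """Return (start, end) index pairs of consecutive non-zero runs in a projection.

    Applied to the horizontal projection this yields text lines; applied to the
    vertical projection of one line it yields individual characters.
    """
    runs, start = [], None
    for i, value in enumerate(projection):
        if value and start is None:
            start = i
        elif not value and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(projection)))
    return runs
```

For a vertically written block the same two helpers apply with the roles of the projections exchanged.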

次に、ステップＳ１５０６の比較処理の詳細について、図２４を用いて説明する。 Next, details of the comparison processing in step S1506 will be described with reference to FIG. 24.

図２４は本発明の実施形態の比較処理の詳細を示すフローチャートである。 FIG. 24 is a flowchart showing details of the comparison processing according to the embodiment of the present invention.

まず、ステップＳ２４１０で、比較先文書を管理する生成文書情報の先頭から文書ＩＤを順番に取得する。 First, in step S2410, document IDs are acquired in order from the top of generated document information for managing comparison target documents.

次に、ステップＳ２４２０で、すべての文書ＩＤの取得が終了したか否かを判定する。取得が終了した場合（ステップＳ２４２０でＮＯ）、ステップＳ２４７０に進む。一方、取得が終了していない場合（ステップＳ２４２０でＹＥＳ）、ステップＳ２４３０に進む。 In step S2420, it is determined whether all document IDs have been acquired. If the acquisition is complete (NO in step S2420), the process advances to step S2470. On the other hand, if acquisition has not ended (YES in step S2420), the flow advances to step S2430.

次に、ステップＳ２４３０で、レイアウトの比較を行う。ここで、レイアウトとは、ブロック情報にあるブロックの属性、サイズ、位置のことである。 In step S2430, layouts are compared. Here, the layout refers to the attribute, size, and position of the block in the block information.

ここで、ステップＳ１５０４で抽出したブロックセレクション処理で得られたレイアウトの情報は、属性が画像であるブロックのサイズと位置は、紙文書画像特徴量情報に格納されており、属性が文字であるブロックのサイズと位置は、紙文書テキスト特徴量情報に格納されている。 Here, as for the layout information obtained by the block selection process in step S1504, the size and position of each block whose attribute is image are stored in the paper document image feature amount information, and the size and position of each block whose attribute is character are stored in the paper document text feature amount information.

具体的には、各ブロックの属性、サイズ、位置と、ステップＳ２４１０で取得した文書ＩＤに対応した生成文書ブロック情報中の各ブロックの属性、サイズ、位置を比較し、レイアウトが同じであるか否かを判定する。 Specifically, the attribute, size, and position of each block are compared with the attribute, size, and position of each block in the generated document block information corresponding to the document ID acquired in step S2410, and the layout is the same. Determine whether.

そして、比較元画像（紙文書）と比較先画像（オリジナル文書）のレイアウトが同じである場合（ステップＳ２４３０でＹＥＳ）、ステップＳ２４４０に進む。一方、比較元画像と比較先画像のレイアウトが同じでない場合（ステップＳ２４３０でＮＯ）、ステップＳ２４１０に戻る。 If the comparison source image (paper document) and the comparison destination image (original document) have the same layout (YES in step S2430), the process advances to step S2440. On the other hand, if the layouts of the comparison source image and the comparison destination image are not the same (NO in step S2430), the process returns to step S2410.

次に、ステップＳ２４４０で、比較元画像（紙文書）と比較先画像（オリジナル文書）のページ同士の比較を行うページ比較処理を実行する。この比較は、ブロックの属性に合わせ、文字、画像それぞれに応じた特徴量を用いて、複合的に比較を行い、ページの類似度を算出する。この処理の詳細については後述する。 Next, in step S2440, page comparison processing for comparing pages of the comparison source image (paper document) and the comparison destination image (original document) is executed. This comparison is performed in a composite manner using feature amounts corresponding to characters and images according to the block attributes, and the similarity of pages is calculated. Details of this processing will be described later.

次に、ステップＳ２４５０で、算出された類似度が閾値以上であるか否かを判定する。閾値未満である場合（ステップＳ２４５０でＮＯ）、ステップＳ２４１０に戻る。一方、閾値以上である場合（ステップＳ２４５０でＹＥＳ）、ステップＳ２４６０に進む。 Next, in step S2450, it is determined whether the calculated similarity is greater than or equal to a threshold value. If it is less than the threshold value (NO in step S2450), the process returns to step S2410. On the other hand, if it is equal to or greater than the threshold value (YES in step S2450), the process proceeds to step S2460.

次に、ステップＳ２４６０で、現在処理中の文書ＩＤに対応した文書候補リストの類似度合計に、ステップＳ２４４０で算出された類似度を累積加算する。その後、ステップＳ２４１０へ戻り、比較先文書となる次の文書ＩＤを生成文書情報から取得する。 Next, in step S2460, the similarity calculated in step S2440 is cumulatively added to the similarity total of the document candidate list corresponding to the currently processed document ID. Thereafter, the process returns to step S2410, and the next document ID to be the comparison target document is acquired from the generated document information.

一方、ステップＳ２４２０において、すべての文書ＩＤの取得が終了した場合、ステップＳ２４７０に進み、文書候補リストに登録されている文書ＩＤを対応する類似度合計の値によって降順にソートし、比較処理を終了する。 On the other hand, if all the document IDs have been acquired in step S2420, the process proceeds to step S2470, where the document IDs registered in the document candidate list are sorted in descending order of their total similarity, and the comparison process ends.
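
The loop of steps S2410 to S2470 can be sketched as follows. This is an illustration under assumptions: the function name, the tuple layout of each entry, and the `page_similarity` callback standing in for step S2440 are all hypothetical.

```python
def rank_candidates(query_layout, entries, page_similarity, threshold):
    """Return [(doc_id, total_similarity), ...] sorted by descending total.

    `entries` is an iterable of (doc_id, layout, page) taken in order from the
    generated document information; `page_similarity(page)` returns the page
    similarity against the scanned paper document.
    """
    totals = {}
    for doc_id, layout, page in entries:
        # S2430: skip comparison targets whose layout differs.
        if layout != query_layout:
            continue
        # S2440 and S2450: compute the page similarity and apply the threshold.
        similarity = page_similarity(page)
        if similarity < threshold:
            continue
        # S2460: accumulate into the total for this document ID.
        totals[doc_id] = totals.get(doc_id, 0.0) + similarity
    # S2470: sort candidates by total similarity, highest first.
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)
```

Checking the layout first means the more expensive page comparison runs only for entries that could plausibly match.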

次に、ステップＳ２４４０の特徴量比較処理の詳細について、図２５を用いて説明する。 Next, details of the feature amount comparison processing in step S2440 will be described with reference to FIG. 25.

図２５は本発明の実施形態の特徴量比較処理の詳細を示すフローチャートである。 FIG. 25 is a flowchart showing details of the feature amount comparison processing according to the embodiment of the present invention.

まず、ステップＳ２５１０で、生成文書ブロック情報を参照し、処理対象となる文書ＩＤに対応する文書データ中で、未比較のブロックの有無を判定する。未比較のブロックがない場合（ステップＳ２５１０でＮＯ）、ステップＳ２５７０に進む。一方、未比較のブロックがある場合（ステップＳ２５１０でＹＥＳ）、ステップＳ２５２０に進む。 First, in step S2510, the generated document block information is referenced to determine whether there is an uncompared block in the document data corresponding to the document ID to be processed. If there is no uncompared block (NO in step S2510), the process advances to step S2570. On the other hand, if there are uncompared blocks (YES in step S2510), the flow advances to step S2520.

次に、ステップＳ２５２０で、比較対象のブロックの属性を判定する。属性が画像ブロックである場合、ステップＳ２５４０に進む。一方、属性が文字ブロックである場合、ステップＳ２５６０に進む。 Next, in step S2520, the attribute of the comparison target block is determined. If the attribute is an image block, the process proceeds to step S2540. On the other hand, if the attribute is a character block, the process proceeds to step S2560.

属性が画像ブロックである場合、ステップＳ２５４０で、色に関する特徴量情報で比較元ブロックと比較先ブロックとの類似比較である色特徴量情報比較処理を行う。この処理の詳細については後述する。これによって得られる類似度は、比較先の文書ＩＤ、ブロックＩＤに対応させて記憶部１１１に一時記憶する。 If the attribute is an image block, in step S2540, color feature amount information comparison processing, which is a similarity comparison between the comparison source block and the comparison target block, is performed with the feature amount information about color. Details of this processing will be described later. The degree of similarity thus obtained is temporarily stored in the storage unit 111 in correspondence with the document ID and block ID of the comparison destination.

一方、属性が文字ブロックである場合、ステップＳ２５６０で、文字の特徴量情報での比較元ブロックと比較先ブロックとの類似比較である文字特徴量情報比較処理を行う。この処理の詳細については後述する。また、これによって得られる類似度は、比較先の文書ＩＤ、ブロックＩＤに対応させて記憶部１１１に一時記憶する。 On the other hand, if the attribute is a character block, in step S2560, a character feature amount information comparison process, which is a similarity comparison between the comparison source block and the comparison target block in the character feature amount information, is performed. Details of this processing will be described later. Further, the similarity obtained thereby is temporarily stored in the storage unit 111 in correspondence with the document ID and block ID of the comparison destination.

次に、ステップＳ２５１０において、全てのブロックとの比較が終了した場合（ステップＳ２５１０でＮＯ）、ステップＳ２５７０に進み、ステップＳ２５４０及びステップＳ２５６０の処理によって記憶部１１１に記憶されている、比較先文書（オリジナル文書）のページに含まれる全てのブロックの類似度を統合し、検索条件である紙文書とオリジナル文書中のページとの類似度を算出する統合処理を行う。 Next, when the comparison with all the blocks has been completed in step S2510 (NO in step S2510), the process proceeds to step S2570, where an integration process is performed: the similarities of all blocks included in the page of the comparison destination document (original document), which have been stored in the storage unit 111 by the processing of steps S2540 and S2560, are integrated to calculate the similarity between the paper document serving as the search condition and the page in the original document.

次に、ステップＳ２５４０の色特徴量情報比較処理の詳細について、図２６を用いて説明する。 Next, details of the color feature amount information comparison processing in step S2540 will be described with reference to FIG. 26.

図２６は本発明の実施形態の色特徴量情報比較処理の詳細を示すフローチャートである。 FIG. 26 is a flowchart showing details of color feature amount information comparison processing according to the embodiment of the present invention.

まず、ステップＳ２６１０で、比較元画像と比較先画像の色特徴量を色特徴量情報から読み出す。 First, in step S2610, the color feature amounts of the comparison source image and the comparison destination image are read from the color feature amount information.

次に、ステップＳ２６２０で、処理対象とする画像中の着目メッシュブロックを先頭に設定する。ステップＳ２６３０で、比較元画像の色特徴量と、比較対象の色特徴量の類似度を示す類似距離を０にリセットする。 In step S2620, the target mesh block in the image to be processed is set at the head. In step S2630, the similarity distance indicating the similarity between the color feature value of the comparison source image and the color feature value to be compared is reset to zero.

ステップＳ２６４０で、未比較の着目メッシュブロックの有無を判定する。未比較の着目メッシュブロックがない場合（ステップＳ２６４０でＮＯ）、ステップＳ２６８０に進む。一方、未比較の着目メッシュブロックがある場合（ステップＳ２６４０でＹＥＳ）、ステップＳ２６５０に進む。 In step S2640, the presence / absence of an uncompared target mesh block is determined. If there is no uncompared target mesh block (NO in step S2640), the process advances to step S2680. On the other hand, if there is an uncompared target mesh block (YES in step S2640), the process advances to step S2650.

ステップＳ２６５０で、比較元画像と比較先画像のそれぞれの色特徴量から、それぞれの着目メッシュブロックの色ビンＩＤを取得する。 In step S2650, the color bin ID of each target mesh block is acquired from each color feature amount of the comparison source image and the comparison destination image.

ステップＳ２６６０で、図２７の色ビンペナルティマトリックスを参照して、取得した色ビンＩＤ間に対応する着目メッシュブロックの局所的類似距離を取得し、これを直前の処理で取得している類似距離に累積加算する。そして、この類似距離は記憶部１１１に記憶する。 In step S2660, the local similarity distance of the target mesh block corresponding to the acquired color bin ID is acquired with reference to the color bin penalty matrix of FIG. 27, and this is used as the similar distance acquired in the immediately preceding process. Cumulative addition. The similarity distance is stored in the storage unit 111.

ここで、色ビンペナルティマトリックスについて、図２７を用いて説明する。 Here, the color bin penalty matrix will be described with reference to FIG. 27.

図２７は本発明の実施形態の色ビンペナルティマトリックスの構成を示す図である。 FIG. 27 is a diagram showing the configuration of the color bin penalty matrix according to the embodiment of the present invention.

色ビンペナルティマトリックスは、色ビンＩＤ同士の局所的類似距離を管理するマトリックスである。図２７によれば、色ビンペナルティマトリックスは、同一色ビンＩＤではその類似距離は０となり、色ビンＩＤ同士の差が大きくなるほど、つまり、類似度が低くなるほど、その類似距離は大きくなるように構成されている。また、同一色ビンＩＤの対角位置は全て、その類似距離は０で、それを境に対称性を持っている。 The color bin penalty matrix is a matrix that manages the local similarity distance between the color bin IDs. According to FIG. 27, the color bin penalty matrix has a similarity distance of 0 for the same color bin ID, and the similarity distance increases as the difference between the color bin IDs increases, that is, as the similarity decreases. It is configured. In addition, all the diagonal positions of the same color bin ID have a similarity distance of 0 and have symmetry with respect to the boundary.

このように、本実施形態では、色ビンペナルティマトリックスを参照するだけで、色ビンＩＤ同士の類似距離を取得することができるので、処理の高速化を図ることができる。 Thus, in this embodiment, the similarity distance between the color bin IDs can be acquired only by referring to the color bin penalty matrix, so that the processing can be speeded up.

そして、ステップＳ２６７０で、図２２の順序決定テーブルを参照して、次の処理対象となる着目メッシュブロックを設定する。その後、ステップＳ２６４０に戻る。 In step S2670, the target mesh block to be processed next is set with reference to the order determination table of FIG. 22. Thereafter, the process returns to step S2640.

そして、ステップＳ２６４０で、未比較の着目メッシュブロックがない場合（ステップＳ２６４０でＮＯ）、ステップＳ２６８０に進み、記憶部１１１に記憶されている類似距離を類似度に変換し、ブロックＩＤと対にして出力する。 In step S2640, if there is no uncompared target mesh block (NO in step S2640), the process proceeds to step S2680, and the similarity distance stored in the storage unit 111 is converted into the similarity and paired with the block ID. Output.

尚、類似度への変換は、例えば、類似距離が最小値のときを類似度１００％、類似距離が最大値のときを類似度０％として、その範囲内の類似距離に対する類似度は、最小値あるいは最大値に対する差に基づいて算出するようにすれば良い。 The conversion to similarity may be performed, for example, by taking the similarity as 100% when the similarity distance is at its minimum value and 0% when it is at its maximum value, and calculating the similarity for a distance within that range based on its difference from the minimum or maximum value.
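
The accumulation in steps S2620 to S2680 can be sketched as follows. The actual values of the penalty matrix in FIG. 27 are not reproduced in the text, so the matrix built here is a hypothetical stand-in: it uses the Manhattan distance between bin coordinates, which is zero on the diagonal and symmetric as described. The linear distance-to-similarity mapping and the function names are likewise assumptions.

```python
def build_penalty_matrix(bins_per_axis=3):
    """Build a hypothetical color bin penalty matrix (not the values of FIG. 27).

    Entry [i][j] is the Manhattan distance between the (r, g, b) bin coordinates
    of bin i and bin j, so it is 0 on the diagonal and symmetric.
    """
    n = bins_per_axis
    coords = [(r, g, b) for r in range(n) for g in range(n) for b in range(n)]
    return [[sum(abs(p - q) for p, q in zip(a, b)) for b in coords] for a in coords]


def color_similarity(source_bins, target_bins, penalty):
    """Return a similarity in percent from two equal-length lists of color bin IDs.

    Both lists hold one representative bin ID per mesh block, in the same order.
    """
    if not source_bins or len(source_bins) != len(target_bins):
        raise ValueError("bin lists must be non-empty and of equal length")
    # S2660: look up the local distance per mesh block and accumulate it.
    distance = sum(penalty[s][t] for s, t in zip(source_bins, target_bins))
    # S2680: map distance 0 to 100% and the largest possible distance to 0%.
    max_distance = max(max(row) for row in penalty) * len(source_bins)
    return 100.0 * (1 - distance / max_distance)
```

Because the matrix is computed once, each mesh block costs a single table lookup at comparison time.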

次に、ステップＳ２５６０の文字特徴量情報比較処理の詳細について説明する。 Next, details of the character feature amount information comparison processing in step S2560 will be described.

この処理では、比較元画像と比較先画像中のそれぞれの文字ブロック内の各文字コード同士の比較を行い、その一致度から類似度を算出する。 In this process, the character codes in the character blocks in the comparison source image and the comparison target image are compared with each other, and the similarity is calculated from the matching degree.

尚、検索条件とする紙文書とオリジナル文書との比較である場合、類似度は１００％となるのが理想的であるが、実際には、検索条件となる紙文書中の文字ブロックに対するＯＣＲ処理では誤認識が発生する場合があるので、オリジナル文書との比較であっても、類似度は１００％にならないことはあるが、かなり１００％に近い値となる。 It should be noted that when comparing a paper document as a search condition with an original document, the similarity is ideally 100%. However, in actuality, OCR processing is performed on a character block in a paper document as a search condition. In some cases, misrecognition may occur. Therefore, even when compared with the original document, the degree of similarity may not be 100%, but is a value close to 100%.
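
The description does not specify how the degree of matching between character codes is measured. As one possible sketch, the following uses the sequence-matching ratio from Python's standard `difflib` module; this choice of measure and the function name are assumptions, not the method of the embodiment.

```python
from difflib import SequenceMatcher


def character_similarity(source_text, target_text):
    """Return a similarity in percent between two recognized character strings.

    Uses difflib's ratio 2*M/T, where M is the number of matching characters
    and T is the total length of both strings.
    """
    if not source_text and not target_text:
        return 100.0
    return 100.0 * SequenceMatcher(None, source_text, target_text).ratio()
```

A measure of this kind degrades gradually, so a few OCR misrecognitions in the scanned paper document lower the score only slightly rather than causing a mismatch.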

次に、ステップＳ２５７０の統合処理の詳細について説明する。 Next, details of the integration processing in step S2570 will be described.

この統合処理では、比較先画像であるオリジナル文書内で占めている割合の大きいブロックの類似度が、オリジナル文書全体の類似度としてより大きく反映されるような、算出されたブロック毎の類似度の統合を行う。 In this integration processing, the similarity of the calculated block-by-block similarity is such that the similarity of the block that accounts for a large proportion in the original document that is the comparison target image is more largely reflected as the similarity of the entire original document. Perform integration.

例えば、オリジナル文書中のブロックＢ１〜Ｂ６に対し、ブロック毎の類似率がｎ１〜ｎ６と算出されたとする。このときオリジナル文書全体の総合類似率Ｎは、以下の式で表現される。 For example, it is assumed that the similarity ratio for each block is calculated as n1 to n6 for the blocks B1 to B6 in the original document. At this time, the overall similarity N of the entire original document is expressed by the following equation.

Ｎ＝ｗ１＊ｎ１＋ｗ２＊ｎ２＋ｗ３＊ｎ３＋・・・＋ｗ６＊ｎ６　（１）
ここで、ｗ１〜ｗ６は、各ブロックの類似率を評価する重み係数である。重み係数ｗ１〜ｗ６は、ブロックのオリジナル文書内占有率により算出する。例えば、ブロック１〜６のサイズをＳ１〜Ｓ６とすると、ブロック１の占有率ｗ１は、
ｗ１＝Ｓ１／（Ｓ１＋Ｓ２＋・・・＋Ｓ６）　（２）
として算出することができる。
N = w1 * n1 + w2 * n2 + w3 * n3 + ... + w6 * n6 (1)
Here, w1 to w6 are weighting factors for evaluating the similarity ratio of each block. The weighting factors w1 to w6 are calculated from the occupancy ratio of each block in the original document. For example, if the sizes of blocks 1 to 6 are S1 to S6, the occupancy ratio w1 of block 1 can be calculated as
w1 = S1 / (S1 + S2 + ... + S6) (2)

このような占有率を用いた重み付け処理により、オリジナル文書内で大きな領域を占めるブロックの類似度がより、オリジナル文書全体の類似度に反映することができる。 By this weighting process using the occupancy ratio, the similarity of a block that occupies a large area in the original document is reflected more strongly in the similarity of the entire original document.
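
Equations (1) and (2) can be sketched for any number of blocks as follows. The function name and the representation of each block as a (size, similarity) pair are assumptions for illustration.

```python
def overall_similarity(blocks):
    """Return the overall similarity N from (size, similarity) pairs, one per block.

    Each weight is the block's share of the total block size, as in equation (2),
    and N is the weighted sum of the block similarities, as in equation (1).
    """
    total_size = sum(size for size, _ in blocks)
    if total_size <= 0:
        raise ValueError("total block size must be positive")
    return sum(size / total_size * similarity for size, similarity in blocks)
```

Since the weights sum to 1, the result stays on the same scale as the block similarities.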

次に、ステップＳ１５０８のアクセスレベル設定処理の詳細について、図２８を用いて説明する。 Next, details of the access level setting process in step S1508 will be described with reference to FIG. 28.

図２８は本発明の実施形態のアクセスレベル設定処理の詳細を示すフローチャートである。 FIG. 28 is a flowchart showing details of access level setting processing according to the embodiment of this invention.

ステップＳ２８０１において、ステップＳ１５０７で選択した生成文書ＩＤに対する文書アクセスレベルを取得し、ステップＳ１５０２で取得したユーザアクセスレベルと比較する。 In step S2801, the document access level for the generated document ID selected in step S1507 is acquired and compared with the user access level acquired in step S1502.

文書アクセスレベルは、生成文書情報から生成文書ＩＤに対応したアクセスレベルを取得でき、ひとつの生成文書ＩＤに対して、複数のアクセスレベルが存在する可能性がある。そして、ユーザアクセスレベルと同じ値が、生成文書情報から得られるアクセスレベルに含まれていれば、文書アクセスレベルを変更せずに、アクセスレベル設定処理を終了する。 As the document access level, an access level corresponding to the generated document ID can be acquired from the generated document information, and there is a possibility that a plurality of access levels exist for one generated document ID. If the same value as the user access level is included in the access level obtained from the generated document information, the access level setting process is terminated without changing the document access level.

一方、ユーザアクセスレベルが、生成文書情報から得られる文書アクセスレベルの最大値よりも大きい場合は、ステップＳ２８０２に進む。 On the other hand, if the user access level is higher than the maximum value of the document access level obtained from the generated document information, the process proceeds to step S2802.

ステップＳ２８０２において、ユーザに、取得する文書データのアクセスレベルを指定させる。これは、ステップＳ１５０３で読み込んだ紙文書のアクセスレベルが、ユーザのアクセスレベルよりも低い場合であり、ユーザが、欲しい文書データとして、スキャンした紙文書と同じアクセスレベルの文書データが欲しい場合と、自分のアクセスレベルに応じた文書データが欲しい場合の両方があり、その両方に対応するためである。 In step S2802, the user is allowed to specify the access level of the document data to be acquired. This is a case where the access level of the paper document read in step S1503 is lower than the access level of the user, and the user wants document data having the same access level as the scanned paper document as the desired document data. This is because there are both cases where document data corresponding to the access level of the user is desired, and both of them are supported.

また、この場合、読み込んだ紙文書よりも低いアクセスレベルの文書データを選択することも、読み込んだ紙文書よりも高く、ユーザよりも低いアクセスレベルの文書をデータ選択することも可能である。但し、ユーザよりも高いアクセスレベルの文書データを選択することはできないとする。 In this case, it is possible to select document data with an access level lower than that of the read paper document, or document data with an access level higher than that of the read paper document and lower than that of the user. However, document data with an access level higher than that of the user cannot be selected.

そこで、ステップＳ２８０２では、選択可能なアクセスレベルをユーザに提示し、ユーザはその中からアクセスレベルを選択することにより、必要に応じて、ステップＳ２８０１で取得した文書アクセスレベルを変更する。その後、処理を終了する。 Therefore, in step S2802, the selectable access level is presented to the user, and the user selects the access level from among them, thereby changing the document access level acquired in step S2801 as necessary. Thereafter, the process ends.

ステップＳ２８０１において、ユーザアクセスレベルが、生成文書情報から得られる文書アクセスレベルの最小値よりも小さい場合は、ステップＳ２８０３に進む。 If the user access level is smaller than the minimum value of the document access level obtained from the generated document information in step S2801, the process proceeds to step S2803.

ステップＳ２８０３において、ステップＳ１５０７で選択した生成文書ＩＤから、対応する文書ＩＤを取得し、その文書データを管理している管理者の管理者端末に、その文書データにアクセスできないユーザが、紙文書を利用しようとしていることを示す警告情報を出力する。これは、生成文書情報から選択された生成文書ＩＤに対する文書ＩＤを取得し、それに対応する文書情報から文書ＩＤに対応した管理者を取得することができる。 In step S2803, the corresponding document ID is acquired from the generated document ID selected in step S1507, and warning information indicating that a user who cannot access the document data is attempting to use the paper document is output to the administrator terminal of the administrator who manages that document data. The administrator can be identified by acquiring the document ID for the selected generated document ID from the generated document information and then acquiring the administrator corresponding to that document ID from the corresponding document information.

管理者への通知は、例えば、管理者に対して電子メールアドレスを対応付けておき、電子メールで、ステップＳ１５０１でスキャンしたユーザ名とアクセスレベルと文書名、文書ＩＤ等の必要な情報を通知する。あるいは、専用の通知画面を用いて、通知しても良い。 For notification to the administrator, for example, an e-mail address is associated with the administrator, and necessary information such as the user name, access level, document name, and document ID scanned in step S1501 is notified by e-mail. To do. Alternatively, notification may be made using a dedicated notification screen.

続いて、ステップＳ２８０４において、ステップＳ２８０１で取得した文書アクセスレベルの値を、ユーザアクセスレベルに変更する。その後、処理を終了する。 In step S2804, the value of the document access level acquired in step S2801 is changed to the user access level. Thereafter, the process ends.
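
The branching of steps S2801 to S2804 can be sketched as follows. The function name, its parameters, and the two callbacks are hypothetical. Access levels are assumed to be comparable numbers where a larger value means a higher level. The case where the user's level lies between the minimum and maximum document levels without matching any of them is not covered by the description, so the sketch returns None there.

```python
def set_access_level(user_level, document_levels, all_levels, choose_level, notify_admin):
    """Return the list of access levels to use when generating the document data.

    `document_levels` are the levels registered for the generated document ID,
    `all_levels` are the levels that exist in the system, `choose_level(options)`
    asks the user to pick one, and `notify_admin(user_level, document_levels)`
    sends the warning of step S2803.
    """
    # S2801: the user's level matches one of the document's levels; no change.
    if user_level in document_levels:
        return list(document_levels)
    # S2802: the user outranks the paper document and picks any level up to their own.
    if user_level > max(document_levels):
        selectable = sorted(level for level in all_levels if level <= user_level)
        chosen = choose_level(selectable)
        if chosen not in selectable:
            raise ValueError("chosen level is not selectable")
        return [chosen]
    # S2803 and S2804: the user is below every document level; warn and downgrade.
    if user_level < min(document_levels):
        notify_admin(user_level, list(document_levels))
        return [user_level]
    return None  # not specified in the description
```

Passing the prompt and the notification as callbacks keeps the decision logic separate from the operation panel and mail handling.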

以上説明したように、本実施形態によれば、ユーザに応じて動的に生成される文書データに対して、その印刷物から別のユーザがオリジナルの電子文書データを検索した場合でも、検索した文書データを取得する際に、検索者に応じた文書データを生成するので、検索者に応じた情報を含む文書データを取得することができる。 As described above, according to the present embodiment, even when another user searches the original electronic document data from the printed material for the document data dynamically generated according to the user, the searched document Since document data corresponding to the searcher is generated when data is acquired, document data including information corresponding to the searcher can be acquired.

また、ユーザに応じて動的に生成される文書データに対して、その印刷物から別のユーザがオリジナルの電子文書データを検索した場合に、検索結果を確認しやすくし、かつ、検索者に応じた情報を含むオリジナルの電子文書データを取得することができる。 In addition, for document data that is dynamically generated according to the user, when another user searches the original electronic document data from the printed matter, it is easy to check the search results and also according to the searcher. The original electronic document data including the information can be acquired.

また、紙文書からオリジナルの電子文書データを検索する場合に、アクセスできない情報を含む紙文書から検索が行われたことを検知することにより、アクセスできない情報が拡散することを防ぐようにすることができる。また、検知した後、その情報の管理者に通知することにより、情報漏洩の対応を促進することができる。 Further, when original electronic document data is searched for from a paper document, detecting that a search has been performed from a paper document containing inaccessible information makes it possible to prevent the inaccessible information from spreading. In addition, notifying the administrator of that information after detection can prompt a response to the information leak.

＜その他の実施形態＞ <Other embodiments>
上記実施形態において、図１０の登録処理では、特徴量を共用して使用するメモリを削減するためにオブジェクト毎に特徴量を登録していたが、実際の特徴量を比較し、同一の特徴量を共用するようにしてもよい。このようにした場合、例えば、生成文書ブロック情報、色特徴量情報、文字特徴量情報は、図３１、図３２、図３３に示されるように、オブジェクトＩＤの代わりに特徴量を識別するための特徴量ＩＤが格納される。 In the above embodiment, the registration process of FIG. 10 registers a feature amount for each object so that feature amounts can be shared and memory usage reduced; alternatively, the actual feature amounts may be compared and identical feature amounts shared. In this case, for example, the generated document block information, the color feature amount information, and the character feature amount information store a feature amount ID for identifying the feature amount instead of the object ID, as shown in FIGS. 31, 32, and 33.

そして、図１０の登録処理は、図３０の登録処理を実行することになる。 In this case, the registration process of FIG. 30 is executed in place of the registration process of FIG. 10.

ここで、他の実施形態における登録処理について、図３０を用いて説明する。 Here, a registration process in another embodiment will be described with reference to FIG. 30.

図３０は本発明の他の実施形態の登録処理を示すフローチャートである。 FIG. 30 is a flowchart showing registration processing according to another embodiment of the present invention.

尚、図３０において、図１０と同一のステップについては、同一のステップ番号を付加して、その詳細については省略する。 In FIG. 30, the same steps as those in FIG. 10 are denoted by the same step numbers, and the details thereof are omitted.

図１０では、ステップＳ１００７でオブジェクトＩＤを生成文書ブロック情報に格納していたが、図３０のステップＳ３００７では、その格納を行わない。また、図１０のステップＳ１００９で特徴量登録を行っていた代わりに、ステップＳ３００９において特徴量登録を実行する。 In FIG. 10, the object ID is stored in the generated document block information in step S1007. However, in step S3007 in FIG. 30, the storage is not performed. Also, instead of performing feature amount registration in step S1009 in FIG. 10, feature amount registration is executed in step S3009.

まず、ステップＳ１００５で生成した文書データの各ブロックに対して、その属性に応じて特徴量の抽出処理を切り替え、画像ブロックに対しては、色特徴量を抽出し、文字ブロックに関しては文字特徴量を抽出する。 First, for each block of the document data generated in step S1005, the feature amount extraction processing is switched according to the attribute, the color feature amount is extracted for the image block, and the character feature amount for the character block. To extract.

そして、抽出した各特徴量に対し、画像ブロックは、図３２に示す色特徴量情報の中から、同じ特徴量を探し、同じ特徴量が存在すれば、対応する特徴量ＩＤを図３１に示す生成文書ブロック情報の対象のブロックに対応した特徴量ＩＤに格納する。 Then, for each extracted feature quantity, the image block searches for the same feature quantity from the color feature quantity information shown in FIG. 32. If the same feature quantity exists, the corresponding feature quantity ID is shown in FIG. Stored in the feature amount ID corresponding to the target block of the generated document block information.

一方、同じ特徴量が存在しなければ、新たに特徴量ＩＤを発行し、色特徴量情報に、特徴量ＩＤと色特徴量を対応させて追加する。 On the other hand, if the same feature amount does not exist, a new feature amount ID is issued, and the feature amount ID and the color feature amount are added in correspondence with the color feature amount information.

同様に、文字ブロックは、図３３に示す文字特徴量情報の中から、同じ特徴量を探し、同じ特徴量が存在すれば、対応する特徴量ＩＤを図３１に示す生成文書ブロック情報の対象のブロックに対応した特徴量ＩＤに格納する。 Similarly, the character block is searched for the same feature amount from the character feature amount information shown in FIG. 33, and if the same feature amount exists, the corresponding feature amount ID is the target of the generated document block information shown in FIG. Stored in the feature amount ID corresponding to the block.

一方、同じ特徴量が存在しなければ、新たに特徴量ＩＤを発行し、文字特徴量情報に、特徴量ＩＤとテキスト特徴量を対応させて追加する。 On the other hand, if the same feature quantity does not exist, a new feature quantity ID is issued, and the feature quantity ID and the text feature quantity are associated with each other and added to the character feature quantity information.

このようにすることで、同一の特徴量が共用されるようになる。 In this way, the same feature amount is shared.
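
The feature sharing performed in step S3009 can be sketched as follows. The class and method names are hypothetical, feature amounts are assumed to be sequences of hashable values, and "the same feature amount" is taken to mean exact equality, which the description does not define further.

```python
class FeatureStore:
    """Stores each distinct feature amount once and hands out feature amount IDs."""

    def __init__(self):
        self._id_by_feature = {}   # feature amount (as a tuple) -> feature amount ID
        self._next_id = 1

    def register(self, feature):
        """Return the ID of an identical stored feature, or issue a new ID."""
        key = tuple(feature)
        if key not in self._id_by_feature:
            self._id_by_feature[key] = self._next_id
            self._next_id += 1
        return self._id_by_feature[key]

    def __len__(self):
        return len(self._id_by_feature)
```

One store would be kept for color feature amounts and another for character feature amounts, and the returned ID is what the generated document block information records for the block.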

また、上記実施形態において、生成文書ブロック情報において、ブロックの情報（属性、位置、サイズ）を、生成文書ＩＤに対してブロック毎に直接記述していたが、図３１等から明らかなように、生成文書ブロック情報の中には、ブロックの属性、位置、サイズが同じものが存在する。 In the above embodiment, in the generated document block information, the block information (attribute, position, size) is directly described for each block with respect to the generated document ID. As is clear from FIG. Some pieces of generated document block information have the same attribute, position, and size.

Accordingly, if an ID is assigned to each piece of block information, the ID is stored in the generated document block information, and the block information itself is managed in a separate table, duplication of block information can be avoided.

Similarly, if the block layout information excluding the image feature amounts and character feature amounts (the number of blocks in a page and the attribute, position, and size of each block) is also managed in a separate table, duplication of layout information can be avoided as well.
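Moving the block information into its own table might look like the sketch below. The `BlockInfo` fields, the `block_info_id` helper, and the sample rows are assumptions made for illustration; the actual table layout of the embodiment is the one shown in the figures.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class BlockInfo:
    """Attribute, position, and size of a block (hypothetical field names)."""
    attribute: str
    x: int
    y: int
    width: int
    height: int


# Separate table: one entry per distinct block information.
block_info_table: Dict[BlockInfo, int] = {}


def block_info_id(info: BlockInfo) -> int:
    """Return the ID of `info`, registering it on first use."""
    return block_info_table.setdefault(info, len(block_info_table) + 1)


# Generated document block rows now hold only an ID:
# (generated document ID, block ID, block information ID)
rows: List[Tuple[int, int, int]] = []
samples = [
    (1, 1, BlockInfo("text", 10, 10, 200, 40)),
    (1, 2, BlockInfo("image", 10, 60, 200, 150)),
    (2, 1, BlockInfo("text", 10, 10, 200, 40)),  # same layout as document 1, block 1
]
for doc_id, block_id, info in samples:
    rows.append((doc_id, block_id, block_info_id(info)))
```

Because the dataclass is frozen, equal block information hashes to the same key, so the third row reuses the ID issued for the first.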

As described above, according to this other embodiment, in addition to the effects described in the above embodiment, duplicate feature amounts of the document data registered for searching original electronic document data from a paper document are consolidated and stored only once. The storage capacity required for storing the feature amounts can therefore be reduced.

In the above embodiment, the color feature information extraction processing extracts the most frequent color of the image to be processed as the color feature information. The present invention is not limited to this; for example, the average color may be extracted as the color feature information.
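The difference between the two choices can be illustrated with a short sketch. It assumes the image region is given as a flat list of RGB tuples and works on raw pixel values rather than on the color bins used by the embodiment; the function names are hypothetical.

```python
from collections import Counter
from typing import List, Tuple

Pixel = Tuple[int, int, int]


def most_frequent_color(pixels: List[Pixel]) -> Pixel:
    """Return the color that occurs most often in the region."""
    return Counter(pixels).most_common(1)[0][0]


def average_color(pixels: List[Pixel]) -> Pixel:
    """Return the per-channel integer mean of the region."""
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) // n for c in range(3))


# Two red pixels and one blue pixel (sample data).
region = [(255, 0, 0), (255, 0, 0), (0, 0, 255)]
mode_feature = most_frequent_color(region)  # (255, 0, 0)
mean_feature = average_color(region)        # (170, 0, 85)
```

The most frequent color ignores minority colors entirely, while the average color blends them in, so the two features can rank the same pair of images differently.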

Although a color feature amount is used as the image feature amount, the present invention is not limited to this. For example, any one of, or an arbitrary combination of, a plurality of types of image feature amounts may be used, such as luminance feature amounts (most frequent luminance, average luminance, and the like), texture feature amounts expressed by a co-occurrence matrix, contrast, entropy, a Gabor transform, and the like, and shape feature amounts such as edges and Fourier descriptors.

Although character codes are used as the character feature amount, words may instead be matched against a word dictionary in advance to extract their parts of speech, and the words that are nouns may be used as the character feature amount.

In the above embodiment, when document data managed by the document management server 106 is registered in the database 118 in the MFP 100, the block information, character codes, and images are extracted directly from the electronic file. Alternatively, the electronic file may first be converted into a raster image, and the block information, character feature amounts, and image feature amounts may then be acquired from the raster image in the same manner as in steps S1504 to S1505.

In the above embodiment, the document data generated in step S306 is HTML document data. However, any format that can lay out the positions of objects such as images and characters in the generated document, for example PDF (Portable Document Format), may be used.

In the above embodiment, access to document data is controlled using the concept of an access level, and the set of accessible objects grows or shrinks as the access level rises or falls. Alternatively, the individuals and groups permitted to access each object may be described for that object, so that access is controlled separately for each individual and for each group to which a user belongs.
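The two access control styles described above can be contrasted in a small sketch. Everything here is an assumption for illustration: the `User` and `DocObject` classes, their field names, and the convention that a larger number means a higher access level are not taken from the embodiment.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class User:
    name: str
    access_level: int  # assumption: larger value = higher level
    groups: FrozenSet[str] = frozenset()


@dataclass(frozen=True)
class DocObject:
    object_id: str
    required_level: int
    allowed_users: FrozenSet[str] = frozenset()
    allowed_groups: FrozenSet[str] = frozenset()


def visible_by_level(user: User, objects: List[DocObject]) -> List[str]:
    """Level-based control: output an object if the user's level is high enough."""
    return [o.object_id for o in objects if user.access_level >= o.required_level]


def visible_by_acl(user: User, objects: List[DocObject]) -> List[str]:
    """Per-object control: output an object if the user or one of the user's groups is listed."""
    return [
        o.object_id
        for o in objects
        if user.name in o.allowed_users or user.groups & o.allowed_groups
    ]


objects = [
    DocObject("title", 1, allowed_groups=frozenset({"staff"})),
    DocObject("budget_table", 3, allowed_users=frozenset({"alice"})),
    DocObject("photo", 2, allowed_groups=frozenset({"design"})),
]
alice = User("alice", 1, frozenset({"staff"}))

by_level = visible_by_level(alice, objects)  # ["title"]
by_acl = visible_by_acl(alice, objects)      # ["title", "budget_table"]
```

With per-object lists, a low-level user can be granted a single sensitive object without raising that user's level for every document.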

The above embodiment has been described using, as an example, the MFP 100 in which the image reading unit 110 (scanner) and the printing unit 112 (printer) are integrated. However, the image reading unit 110 and the printing unit 112 may be separate devices connected to the LAN 103, the client PC 102, or the like. In this case, the remaining components of the MFP 100 are included in the management PC 101, and the image reading unit 110 (scanner) and the printing unit 112 (printer) are connected through the network I/F 114 via the LAN 103.

Although embodiments have been described in detail above, the present invention can be embodied as, for example, a system, an apparatus, a method, a program, or a storage medium. Specifically, it may be applied to a system composed of a plurality of devices or to an apparatus consisting of a single device.

The present invention also includes the case where it is achieved by supplying a software program that implements the functions of the above-described embodiments (in the embodiments, a program corresponding to the flowcharts shown in the drawings) to a system or apparatus directly or remotely, and by having a computer of the system or apparatus read out and execute the supplied program code.

Accordingly, the program code itself that is installed in a computer in order to implement the functional processing of the present invention on that computer also implements the present invention. In other words, the present invention includes the computer program itself for implementing the functional processing of the present invention.

In that case, as long as it has the functions of the program, it may take the form of object code, a program executed by an interpreter, script data supplied to an OS, or the like.

Examples of recording media for supplying the program include a floppy (registered trademark) disk, hard disk, optical disk, magneto-optical disk, MO, CD-ROM, CD-R, CD-RW, magnetic tape, nonvolatile memory card, ROM, and DVD (DVD-ROM, DVD-R).

As another method of supplying the program, a browser on a client computer may be used to connect to a website on the Internet, and the computer program of the present invention itself, or a compressed file including an automatic installation function, may be downloaded from the website to a recording medium such as a hard disk. The program can also be supplied by dividing the program code constituting the program of the present invention into a plurality of files and downloading each file from a different website. That is, a WWW server that allows a plurality of users to download the program files for implementing the functional processing of the present invention on a computer is also included in the present invention.

It is also possible to encrypt the program of the present invention, store it on a storage medium such as a CD-ROM, and distribute it to users, to allow users who satisfy predetermined conditions to download key information for decryption from a website via the Internet, and to have those users execute the encrypted program using the key information and install it on a computer.

The functions of the above-described embodiments are implemented when the computer executes the read program. In addition, an OS or the like running on the computer may perform part or all of the actual processing based on instructions of the program, and the functions of the above-described embodiments can also be implemented by that processing.

Furthermore, after the program read from the recording medium is written into a memory provided on a function expansion board inserted into the computer or in a function expansion unit connected to the computer, a CPU or the like provided on the function expansion board or in the function expansion unit may perform part or all of the actual processing based on instructions of the program, and the functions of the above-described embodiments are also implemented by that processing.

A block diagram showing a configuration example of the image processing system according to the embodiment of the present invention.
A block diagram showing the detailed configuration of the MFP according to the embodiment of the present invention.
A flowchart showing the print processing according to the embodiment of the present invention.
A diagram showing an example of user information according to the embodiment of the present invention.
A diagram showing an example of document information according to the embodiment of the present invention.
A diagram showing an example of document block information according to the embodiment of the present invention.
A diagram showing an example of object information according to the embodiment of the present invention.
A diagram showing an output example of document data according to the embodiment of the present invention.
A flowchart showing details of the document generation processing according to the embodiment of the present invention.
A flowchart showing the registration processing according to the embodiment of the present invention.
A diagram showing an example of generated document information according to the embodiment of the present invention.
A diagram showing an example of generated document block information according to the embodiment of the present invention.
A diagram showing an example of color feature amount information according to the embodiment of the present invention.
A diagram showing an example of character feature amount information according to the embodiment of the present invention.
A flowchart showing the search processing according to the embodiment of the present invention.
A diagram explaining the concept of the block selection processing according to the embodiment of the present invention.
A diagram showing an example of paper document image feature amount information according to the embodiment of the present invention.
A diagram showing an example of paper document character feature amount information according to the embodiment of the present invention.
A diagram showing an example of a document candidate list according to the embodiment of the present invention.
A flowchart showing details of the color feature amount information extraction processing according to the embodiment of the present invention.
A diagram showing an example of image mesh block division according to the embodiment of the present invention.
A diagram showing an example of the order determination table according to Embodiment 1 of the present invention.
A diagram showing an example of the configuration of color bins in the color space according to Embodiment 1 of the present invention.
A flowchart showing details of the comparison processing according to the embodiment of the present invention.
A flowchart showing details of the feature amount comparison processing according to the embodiment of the present invention.
A flowchart showing details of the color feature amount information comparison processing according to the embodiment of the present invention.
A diagram showing an example of the configuration of the color bin penalty matrix according to the embodiment of the present invention.
A flowchart showing details of the access level setting processing according to the embodiment of the present invention.
A diagram showing an example of the document data list screen according to the embodiment of the present invention.
A flowchart showing the registration processing according to another embodiment of the present invention.
A diagram showing an example of generated document block information according to another embodiment of the present invention.
A diagram showing an example of color feature amount information according to another embodiment of the present invention.
A diagram showing an example of character feature amount information according to another embodiment of the present invention.

Explanation of symbols

100 MFP
101 Management PC
102 Client PC
103 Network
104 LAN
105 Database
106 Document management server
110 Image input unit
111 Storage unit
112 Printing unit
113 Input unit
114, 117 Network I/F
115 Data processing unit
116 Display unit

Claims

An image processing apparatus for searching for desired image data,
Storage means for storing and managing image data composed of a plurality of types of components by setting an access level for each component;
An input means for inputting user information;
Obtaining means for obtaining an access level of the user information input by the input means;
Reading means for reading a document;
Search means for searching the storage means for second image data corresponding to the first image data obtained by the reading means;
A determination unit that determines whether or not each component of the second image data searched by the search unit can be output based on an access level of the user information acquired by the acquisition unit;
An image processing apparatus comprising: output means for outputting second image data including components permitted to be output by the determination means.

The image processing apparatus according to claim 1, wherein the search means includes comparison means for comparing the first image data obtained by the reading means with the image data stored in the storage means,
and searches for the second image data corresponding to the first image data based on a comparison result of the comparison means.

The comparison means includes an extraction means for extracting a feature amount of the first image data,
The image processing apparatus according to claim 2, wherein the feature amount extracted by the extraction unit is compared with a feature amount of image data stored in the storage unit.

The image processing apparatus according to claim 3, wherein the extraction means extracts one or both of an image feature amount and a character feature amount of the first image data.

The image processing apparatus according to claim 1, further comprising display means for displaying a list of candidate image data that are candidates for the second image data searched by the search means,
wherein the determination means determines, based on the access level acquired by the acquisition means, whether or not each component of the candidate image data selected as the second image data from the list displayed by the display means can be output.

The determination means includes a setting means for setting an access level for the second image data searched by the search means;
An access level comparing means for comparing the access level set by the setting means with the access level of the user information acquired by the acquiring means;
The image processing apparatus according to claim 1, further comprising: warning information output means for outputting warning information based on a comparison result of the access level comparison means.

The image processing apparatus according to claim 6, wherein, when the access level of the user information acquired by the acquisition means is higher than the access level set by the setting means, the determination means determines, based on the access level set by the setting means, whether or not each component of the second image data searched by the search means can be output.

The image processing apparatus according to claim 6, wherein, when the access level of the user information acquired by the acquisition means is the same as or lower than the access level set by the setting means, the determination means determines, based on the access level of the user information acquired by the acquisition means, whether or not each component of the second image data searched by the search means can be output.

The image processing apparatus according to claim 1, wherein the storage means stores and manages image data composed of a plurality of types of components by setting an access level for each component, and stores and manages a feature amount for each component.

The image processing apparatus according to claim 1, wherein the storage means stores and manages image data composed of a plurality of types of components by setting an access level for each component, and stores and manages the feature amount of each component by using an identifier indicating the feature amount.

A method for controlling an image processing apparatus that searches for desired image data from a storage unit that stores and manages image data including a plurality of types of components by setting an access level for each component,
An input process for inputting user information;
An acquisition step of acquiring an access level of the user information input in the input step;
A reading process for reading a document;
A search step of searching the storage unit for second image data corresponding to the first image data obtained by the reading step;
A determination step of determining whether or not each component of the second image data searched in the search step can be output based on the access level of the user information acquired in the acquisition step;
An output step of outputting second image data including components permitted to be output in the determination step.

A program that realizes control of an image processing apparatus that searches for desired image data from a storage unit that stores and manages image data including a plurality of types of components by setting an access level for each component,
A program code of an input process for inputting user information;
A program code of an acquisition step for acquiring an access level of the user information input in the input step;
A program code for a reading process for reading a document;
A program code of a search step for searching the storage unit for second image data corresponding to the first image data obtained by the reading step;
Based on the access level of the user information acquired in the acquisition step, the program code of the determination step for determining whether to output each component of the second image data searched in the search step,
And a program code of an output process for outputting second image data including the components permitted to be output in the determination process.