JP2019068406A

JP2019068406A - INFORMATION PROCESSING APPARATUS, METHOD, AND PROGRAM

Info

Publication number: JP2019068406A
Application number: JP2018139489A
Authority: JP
Inventors: 欽也本田; Kinya Honda
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2017-10-03
Filing date: 2018-07-25
Publication date: 2019-04-25

Abstract

【課題】検索可能なファイルであることを通知するためのコマンドを用いることなく、ファイルが検索可能なファイルであるかを高精度に判定すること。【解決手段】本発明は、ファイルに含まれる画像オブジェクト内の文字領域を抽出する第１抽出手段と、前記第１抽出手段により抽出された文字領域と、前記ファイルに含まれる文字オブジェクトが描画された領域との位置関係に基づき、前記ファイルが、画像と、該画像に対応する文字認識の結果とを含む検索可能なファイルであるか否かを判定する第１判定手段と、前記第１判定手段により、前記ファイルが検索可能なファイルでないと判定されたことに応じて、前記画像オブジェクトに対する文字認識処理を実行し、該文字認識処理の結果を含むファイルを生成する生成手段とを有することを特徴とする情報処理装置である。【選択図】図４To accurately determine whether a file is a searchable file without using a command for notifying that the file is a searchable file. According to the present invention, first extracting means for extracting a character area in an image object included in a file, a character area extracted by the first extracting means, and a character object included in the file are drawn. Determination means for determining whether or not the file is a searchable file including an image and a character recognition result corresponding to the image, based on a positional relationship with the region, and the first determination Means for executing character recognition processing on the image object in response to determining that the file is not a searchable file, and generating a file including a result of the character recognition processing. This is a characteristic information processing device. [Selection diagram] Figure 4

Description

本発明は、ファイル内の画像中の文字列を認識して、検索可能なファイルを生成する情報処理装置、方法、及びプログラムに関する。 The present invention relates to an information processing apparatus, method, and program for recognizing a character string in an image in a file and generating a searchable file.

従来、文字列を検索可能なサーチャブルＰＤＦファイル（以下ＳＰＤＦファイル）を作成するソフトウェアがある。図２にＳＰＤＦファイルの一例を示す。ＳＰＤＦファイル２０１には、スキャン画像２０２が含まれる。さらに、ＳＰＤＦファイル２０１には、スキャン画像に重なるように、該スキャン画像内の文字画像の位置に合わせて透明な文字列が埋め込まれている。例えば、スキャン画像内の文字画像２０３の前面または背面に、透明な文字列「２００，０００」が配置されるように、その文字列の文字コードがＰＤＦ（ＰｏｒｔａｂｌｅＤｏｃｕｍｅｎｔＦｏｒｍａｔ）ファイルに記述される。また、図示していないが、スキャン画像内の文字画像２０３以外の全ての文字画像に対応する透明な文字列の文字コードもＰＤＦファイルに記述されている。スキャン画像内の文字画像だけでは、文字列の検索を行うことはできないが、文字認識処理の結果得られる文字コードを透明な文字列として、文字画像と対応する位置に埋め込むことで、スキャン画像の見た目を変更することなく、文字列の検索を可能とする。このように、ＳＰＤＦファイルを作成するソフトウェアは、スキャン画像に対して文字認識処理を行い、その結果得られる文字コードを透明な文字列としてＰＤＦファイルに追加する。 Conventionally, there is software for creating a searchable PDF file (hereinafter referred to as SPDF file) that can search for character strings. An example of the SPDF file is shown in FIG. The SPDF file 201 includes a scan image 202. Furthermore, in the SPDF file 201, a transparent character string is embedded according to the position of the character image in the scan image so as to overlap the scan image. For example, the character code of the character string is described in a PDF (Portable Document Format) file so that the transparent character string “200,000” is disposed on the front or back of the character image 203 in the scan image. Also, although not shown, a transparent character string character code corresponding to all character images other than the character image 203 in the scan image is also described in the PDF file. Although it is not possible to search for a character string using only the character image in the scanned image, the character code obtained as a result of character recognition processing is embedded as a transparent character string at a position corresponding to the character image. Enables searching of strings without changing the appearance. As described above, software that creates an SPDF file performs character recognition processing on a scanned image, and adds the character code obtained as a result to the PDF file as a transparent character string.

ＳＰＤＦファイルを作成するソフトウェアがインストールされたＰＣは、複写機やソフトウェアで作成されたスキャン画像、または画像を含むＰＤＦファイルを受け取り、該受け取ったデータを用いてＳＰＤＦファイルを作成する。しかし、例えば複写機によっては複写機自体がＳＰＤＦファイルを作成することも可能であり、ＳＰＤＦファイルを作成するソフトウェアを実行するＰＣへの入力データとして、複写機を含む該ＰＣ以外の装置で作成されたＳＰＤＦファイルが入力される場合がある。尚、本明細書で「入力データ」とは、ＳＰＤＦファイルを作成する際の作成元のデータを意味し、入力データに基づいてＳＰＤＦファイルが作成される。入力データとして例えば、複写機が作成するスキャン画像、又は、スキャン画像等の画像を含むＰＤＦファイル若しくはＳＰＤＦファイル等が考えられる。 The PC on which the software for creating the SPDF file is installed receives the scanned image created by the copying machine or software, or the PDF file including the image, and creates the SPDF file using the received data. However, for example, depending on the copying machine, it is possible for the copying machine itself to create the SPDF file, and the input data to the PC that executes the software for creating the SPDF file is created by an apparatus other than the PC including the copying machine SPDF files may be input. In the present specification, “input data” means data of the creation source when creating the SPDF file, and the SPDF file is created based on the input data. As input data, for example, a scanned image created by a copying machine, or a PDF file or an SPDF file including an image such as a scanned image can be considered.

ここで、ＳＰＤＦのファイル拡張子は、サーチャブル化されていないＰＤＦのファイル拡張子と同じく「ｐｄｆ」であるので、ファイル拡張子に基づいて、入力データのＰＤＦファイルがＳＰＤＦファイルであるか否か判定することができないという問題が生じる。 Here, since the file extension of SPDF is “pdf” as well as the file extension of PDF that is not searchable, it is determined whether the PDF file of the input data is an SPDF file based on the file extension. The problem arises that you can not

この問題を解決するための１つの方法として、特許文献１には、検索可能なファイルであることを通知するためのコマンドをファイルに埋め込むことで、ＰＤＦファイルがＳＰＤＦファイルであるか否かの判定を可能とする技術が開示されている。 As one method for solving this problem, Patent Document 1 determines whether the PDF file is an SPDF file by embedding a command for notifying that the file is a searchable file in the file. Technology that makes it possible to

特開２０１１−０１０２３２号公報JP, 2011-010232, A

しかしながら、ＳＰＤＦファイルを作成する既存のソフトウェアや複写機は、ＳＰＤＦファイルであることを通知するためのコマンドをファイルに埋め込む特許文献１のような機能を有さない。その理由は、ＰＤＦフォーマットの定義の中に、ＳＰＤＦファイルであるか否かを識別するための定義がないためである。このようなコマンドをファイルに埋め込む機能を有さない既存のソフトウェア等を用いて作成、送信されたＰＤＦファイルについては、該ＰＤＦファイルが入力された装置側で入力データのＰＤＦファイルがＳＰＤＦファイルであるか否かを判定することはできない。 However, existing software and copiers that create SPDF files do not have the function as in Patent Document 1 that embeds a command for notifying that they are SPDF files in the file. The reason is that in the definition of the PDF format, there is no definition for identifying whether it is an SPDF file or not. About the PDF file created and sent using the existing software etc. which do not have the function to embed such a command in a file, the PDF file of the input data is the SPDF file on the device side where the PDF file is input It can not be determined.

また、単純に、入力データ内に透明文字列があるか否かを基準として、入力データがＳＰＤＦファイルであるか否かを判定することも考えられるが、以下の理由からその判定基準は適切ではない。図３は、画像の内容と関係のない透明文字列が埋め込まれている一例を示す図である。ユーザーの運用によっては、ユーザーを識別するためのＩＤ等の情報を、透明文字列３０２として画像に埋め込むことがある。このように、スキャン画像内の文字画像とは関係のない透明文字列がファイル内に存在するケースがあるため、入力データ内に透明文字列があるか否かを基準として、入力データがＳＰＤＦファイルであるか否かを判定することは、精度の観点から適切ではない。 Also, it is conceivable to simply determine whether or not the input data is an SPDF file based on whether or not there is a transparent character string in the input data, but for the following reasons, the determination criterion is appropriate. Absent. FIG. 3 is a view showing an example in which a transparent character string unrelated to the content of the image is embedded. Depending on the operation of the user, information such as an ID for identifying the user may be embedded in the image as a transparent character string 302. As described above, since there is a case in which a transparent character string not related to the character image in the scanned image exists in the file, the input data is the SPDF file based on whether or not the transparent character string exists in the input data. It is not appropriate in terms of accuracy to determine whether or not.

そこで本発明は、上記の課題に鑑み、検索可能なファイルであることを通知するためのコマンドを用いることなく、ファイルが検索可能なファイルであるかを高精度に判定することを目的とする。 Therefore, in view of the above problems, it is an object of the present invention to determine with high accuracy whether a file is a searchable file without using a command for notifying that the file is a searchable file.

本発明は、ファイルに含まれる画像オブジェクト内の文字領域を抽出する第１抽出手段と、前記第１抽出手段により抽出された文字領域と、前記ファイルに含まれる文字オブジェクトが描画された領域との位置関係に基づき、前記ファイルが、画像と、該画像に対応する文字認識の結果とを含む検索可能なファイルであるか否かを判定する第１判定手段と、前記第１判定手段により、前記ファイルが検索可能なファイルでないと判定されたことに応じて、前記画像オブジェクトに対する文字認識処理を実行し、該文字認識処理の結果を含むファイルを生成する生成手段とを有することを特徴とする情報処理装置である。 According to the present invention, a first extraction unit for extracting a character area in an image object included in a file, a character area extracted by the first extraction unit, and an area in which a character object included in the file is drawn A first determination unit that determines whether the file is a searchable file including an image and a result of character recognition corresponding to the image based on a positional relationship; and the first determination unit Information characterized by performing character recognition processing on the image object in response to the determination that the file is not a searchable file, and generating a file including the result of the character recognition processing It is a processing device.

本発明により、検索可能なファイルであることを通知するためのコマンドを用いることなく、ファイルが検索可能なファイルであるかを高精度に判定することが可能になる。 According to the present invention, it is possible to determine with high accuracy whether a file is a searchable file without using a command for notifying that the file is a searchable file.

実施例１におけるシステム等の構成を示すブロック図。FIG. 1 is a block diagram showing the configuration of a system etc. in a first embodiment. ＳＰＤＦファイルの一例。An example of a SPDF file. 画像の内容と無関係な透明文字列が埋め込まれている一例。An example in which a transparent string unrelated to the content of the image is embedded. 実施例１における、ＰＤＦファイル判定処理を伴うＳＰＤＦファイル作成処理のフローチャート。10 is a flowchart of SPDF file creation processing accompanied by PDF file determination processing in the first embodiment. 実施例２における処理のフローチャート。10 is a flowchart of processing in Embodiment 2. ＳＰＤＦファイルを作成し直すかどうかを問うダイアログの一例。An example of a dialog asking if you want to recreate the SPDF file. 実施例３における処理のフローチャート。10 is a flowchart of processing in a third embodiment. 透明文字列を削除するかどうかを選択する画面の一例。An example of a screen to select whether to delete the transparent string. 実施例４における処理のフローチャート。10 is a flowchart of processing in a fourth embodiment. ＳＰＤＦファイルを作成する際のオプションを設定する画面の一例。An example of a screen to set options when creating an SPDF file. 実施例５における処理のフローチャート。16 is a flowchart of processing in a fifth embodiment. 認識セルの説明図。Explanatory drawing of a recognition cell. 実施例６における検索処理で表示される画面の例。16 shows an example of a screen displayed in the search process in the sixth embodiment. 実施例６における検索制御処理のフローチャート。16 is a flowchart of search control processing in a sixth embodiment.

［実施例１］
＜システムの構成について＞
以下、本実施例におけるシステムの構成について、図１を用いて説明する。 Example 1
<About the system configuration>
Hereinafter, the configuration of the system in the present embodiment will be described with reference to FIG.

図１（ａ）は、本実施例におけるシステムを示すブロック図である。図示するように、システム１００は、画像形成可能な複写機１１０と、情報処理装置１２０とを有する。複写機１１０と情報処理装置１２０とは有線又は無線で通信可能に接続されており、互いにデータを送受信することが可能である。 FIG. 1A is a block diagram showing a system in the present embodiment. As shown, the system 100 includes an image formable copying machine 110 and an information processing apparatus 120. The copying machine 110 and the information processing apparatus 120 are communicably connected in a wired or wireless manner, and can mutually transmit and receive data.

図１（ｂ）は、複写機１１０の機能構成を示すブロック図である。複写機１１０は、スキャナ部１１１と、送受信部１１２と、プリンタ部１１３と、制御部１１４とを有する。 FIG. 1B is a block diagram showing a functional configuration of the copying machine 110. As shown in FIG. The copying machine 110 includes a scanner unit 111, a transmission / reception unit 112, a printer unit 113, and a control unit 114.

図１（ｃ）は、情報処理装置１２０の機能構成を示すブロック図である。情報処理装置１２０は、送受信部１２１と、保存部１２２と、制御部１２３と、表示部１２４と、受付部１２５とを有する。情報処理装置１２０は、ＣＰＵと、ＲＯＭやハードディスク等の記憶媒体と、ＲＡＭ等の一時記憶媒体とを内部に有し、該ＲＯＭに格納されているプログラムが該ＲＡＭにロードされ、該ＣＰＵが該ロードされたプログラムを実行する。これにより、図１（ｃ）に示す各構成が実現され、当該各構成による処理が実行される。尚、受付部１２５は、キーボードとマウスとを含む形態が一般的であるが、この形態に限られない。またここでは、表示部１２４と受付部１２５とは別個の構成としているが、表示部１２４及び受付部１２５は、例えばタッチパネル等によって一体的に実現されても良い。 FIG. 1C is a block diagram showing a functional configuration of the information processing apparatus 120. The information processing apparatus 120 includes a transmission / reception unit 121, a storage unit 122, a control unit 123, a display unit 124, and a reception unit 125. The information processing apparatus 120 internally has a CPU, a storage medium such as a ROM and a hard disk, and a temporary storage medium such as a RAM, and a program stored in the ROM is loaded into the RAM, and the CPU Run the loaded program. Thereby, each configuration shown in FIG. 1C is realized, and the processing by each configuration is executed. The receiving unit 125 generally includes a keyboard and a mouse, but the present invention is not limited to this. Further, although the display unit 124 and the reception unit 125 are separately configured here, the display unit 124 and the reception unit 125 may be integrally realized by, for example, a touch panel or the like.

＜文書のスキャンからＳＰＤＦファイルの作成までの流れについて＞
以下、複写機１１０で文書をスキャンして、情報処理装置１２０でＳＰＤＦファイルを作成する流れについて説明する。複写機１１０のスキャナ部１１１が文書をスキャンすると、スキャン画像（スキャン画像データとも言う。）が作成される。作成されたスキャン画像を、送受信部１１２が情報処理装置１２０に送信する。すると、情報処理装置１２０の送受信部１２１がスキャン画像を受信し、制御部１２３は、該受信したスキャン画像を保存部１２２に保存する。 <Flow from document scanning to creation of SPDF file>
The flow of scanning a document by the copying machine 110 and creating an SPDF file by the information processing apparatus 120 will be described below. When the scanner unit 111 of the copier 110 scans a document, a scan image (also referred to as scan image data) is created. The transmission / reception unit 112 transmits the created scan image to the information processing apparatus 120. Then, the transmission / reception unit 121 of the information processing apparatus 120 receives the scan image, and the control unit 123 stores the received scan image in the storage unit 122.

ユーザーが受付部１２５を介して、保存部１２２に保存されている１又は複数のスキャン画像の中から、１つのスキャン画像を選択する。すると、制御部１２３は、そのスキャン画像を表示部１２４に表示する。 The user selects one scan image from the one or more scan images stored in the storage unit 122 via the reception unit 125. Then, the control unit 123 displays the scanned image on the display unit 124.

ユーザーが受付部１２５を介して、表示部１２４に表示されたスキャン画像に基づくＳＰＤＦファイルの作成を指示する。すると、制御部１２３は、スキャン画像に対し、下記の処理、即ち（１）〜（５）に示す領域分割、（６）に示す文字認識、及び（７）に示す透明文字列の埋め込みを実行することでＳＰＤＦファイルを作成し、該作成したＳＰＤＦファイルを保存部１２２に保存する。 The user instructs creation of the SPDF file based on the scan image displayed on the display unit 124 via the reception unit 125. Then, the control unit 123 executes the following processing, ie, area division shown in (1) to (5), character recognition shown in (6), and embedding of a transparent character string shown in (7), on the scanned image. By doing this, an SPDF file is created, and the created SPDF file is stored in the storage unit 122.

＜スキャン画像に基づくＳＰＤＦファイル作成処理について＞
以下、スキャン画像に基づきＳＰＤＦファイルを作成する処理について詳しく説明する。 <About SPDF file creation processing based on scanned image>
The process of creating the SPDF file based on the scanned image will be described in detail below.

（１）２値化
制御部１２３は、スキャン画像に対して２値化を行うことにより、黒画素と白画素のみから成る２値画像を得る。２値化により、スキャン画像における所定の閾値以上の濃度値を有する画素は黒画素、該所定の閾値未満の濃度値を有する画素は白画素となる。尚、以下では、スキャン画像が１００ＤＰＩであるものとして説明を行うが、スキャン画像がこの解像度に限られないことは言うまでもない。 (1) The binarization control unit 123 binarizes the scan image to obtain a binary image consisting of only black pixels and white pixels. By binarization, a pixel having a density value equal to or higher than a predetermined threshold value in the scan image is a black pixel, and a pixel having a density value lower than the predetermined threshold value is a white pixel. Although the following description will be made on the assumption that the scan image is 100 DPI, it goes without saying that the scan image is not limited to this resolution.

（２）黒画素塊の検出
制御部１２３は、２値画像に対して８連結で繋がる黒画素の輪郭を追跡することにより、８方向の何れかの方向で連続して存在する黒画素の塊（黒画素塊とする。）を検出する。８連結とは、左上、左、左下、下、右下、右、右上、上の８つの方向の何れかで、同色（今回のケースでは黒）の画素が連続しているという意味である。一方、４連結とは、左、下、右、上の４つの方向の何れかで同色の画素が連続しているという意味である。本実施例では、８方向に存在する８つの隣接画素の何れもが黒画素ではない単独の黒画素が、黒画素塊として検出されることはない。一方、８方向に存在する８つの隣接画素の何れか１つにでも黒画素が存在する黒画素は、その隣接する黒画素と共に、黒画素塊として検出されることになる。図１２の符号１２０１は、制御部１２３が検出した黒画素塊の一例を示す。 (2) The black pixel block detection control unit 123 tracks the outlines of the black pixels connected in eight connections with the binary image, thereby continuously displaying black pixel blocks in any of eight directions. Detect (set as black pixel block). The 8-connected means that pixels of the same color (black in this case) are continuous in any of eight directions: upper left, left, lower left, lower, lower right, upper right, upper right. On the other hand, 4-connected means that pixels of the same color are continuous in any of the four directions of left, lower, right and upper. In this embodiment, a single black pixel which is not a black pixel in any of eight adjacent pixels present in eight directions is not detected as a black pixel block. On the other hand, a black pixel in which a black pixel exists in any one of eight adjacent pixels existing in eight directions is detected as a black pixel block together with the adjacent black pixel. Reference numeral 1201 in FIG. 12 denotes an example of the black pixel block detected by the control unit 123.

また制御部１２３は、検出した黒画素塊の外接矩形の位置情報、具体的には、外接矩形の四頂点夫々のＸ座標、Ｙ座標を導出する。尚、Ｘ軸は右方向に伸び、Ｙ軸は下方向に伸びているものとする。図１２の符号１２０２は黒画素塊１２０１の外接矩形を示す。尚、本明細書で特に断り無く「矩形」と表現したときは、斜め向きの矩形は含まれず、四辺の全てがＸ軸とＹ軸との何れかと平行な矩形を表すものとする。 The control unit 123 also derives position information of the circumscribed rectangle of the detected black pixel block, specifically, the X coordinate and Y coordinate of each of the four vertices of the circumscribed rectangle. The X axis extends in the right direction, and the Y axis extends in the lower direction. Reference numeral 1202 in FIG. 12 denotes a circumscribed rectangle of the black pixel block 1201. Incidentally, when expressed as “rectangle” in the present specification, unless it is expressed as a “rectangle”, a diagonal rectangle is not included, and all the four sides represent a rectangle parallel to either the X axis or the Y axis.

（３）表領域の検出
制御部１２３は、検出した黒画素塊の夫々について、以下に示す３つの条件を全て満たすか判定し、３つの条件を全て満たす黒画素塊を、表の枠線を構成する黒画素塊と判定する。以下、３つの条件について説明する。 (3) The detection control unit 123 of the table area determines whether all the three conditions shown below are satisfied for each of the detected black pixel groups, and the black pixel group satisfying all the three conditions is set as the frame line of the table. It is determined that the black pixel block to be configured. The three conditions are described below.

第１の条件は、黒画素塊の外接矩形の幅が所定の閾値以上であり、かつ、該外接矩形の高さが所定の閾値以上であることである。本明細書で、「幅」はＸ方向の長さを、「高さ」はＹ方向の長さを指すものとする。尚、本実施例では、幅及び高さがともに、１００画素に相当する０．２５ｃｍ以上であるか判定する。 The first condition is that the width of the circumscribed rectangle of the black pixel block is equal to or greater than a predetermined threshold, and the height of the circumscribed rectangle is equal to or greater than the predetermined threshold. In the present specification, “width” refers to the length in the X direction, and “height” refers to the length in the Y direction. In this embodiment, it is determined whether the width and the height are both 0.25 cm or more, which corresponds to 100 pixels.

第２の条件は、外接矩形内部における黒画素塊の充填率が所定の閾値以下であることである。尚、本実施例では、黒画素塊の外接矩形に占める割合が２０％以下であるか判定する。 The second condition is that the filling factor of the black pixel block inside the circumscribed rectangle is equal to or less than a predetermined threshold. In this embodiment, it is determined whether the proportion of black pixel blocks in the circumscribed rectangle is 20% or less.

第３の条件は、黒画素塊の最大幅と外接矩形の幅との間の差、及び、黒画素塊の最大高さと外接矩形の高さとの間の差が何れも小さいことである。具体的には、黒画素塊の最大幅と外接矩形の幅との間の差が所定の閾値以下であり、かつ、黒画素塊の最大高さと外接矩形の高さとの間の差が所定の閾値以下であることである。尚、本実施例では、黒画素塊の最大幅と外接矩形の幅との間の差、及び、黒画素塊の最大高さと外接矩形の高さと間の差がともに、１０画素以下であるか判定する。 The third condition is that the difference between the maximum width of the black pixel block and the width of the circumscribed rectangle and the difference between the maximum height of the black pixel block and the height of the circumscribed rectangle are both small. Specifically, the difference between the maximum width of the black pixel block and the width of the circumscribed rectangle is less than or equal to a predetermined threshold, and the difference between the maximum height of the black pixel block and the height of the circumscribed rectangle is predetermined It is below the threshold. In the present embodiment, whether the difference between the maximum width of the black pixel block and the width of the circumscribed rectangle and the difference between the maximum height of the black pixel block and the height of the circumscribed rectangle are both 10 pixels or less judge.

制御部１２３は、黒画素塊の夫々について上述の第１〜第３の条件を全て満たすかの判定を行うことで、表の枠線を構成する黒画素塊であるかの判定を行い、表の枠線を構成する黒画素塊の外接矩形の位置情報を保存部１２２に保存する。このようにして保存された位置情報を持つ外接矩形の領域を表領域と呼ぶ。尚、図１２に示すケースでは、表領域の検出の結果、黒画素塊１２０１が、表の枠線を構成する黒画素塊と判定され、外接矩形１２０２の領域が、表領域として検出されたものとする。尚、本実施例では、上述の第１〜第３の条件を全て満たす黒画素塊を、表の枠線を構成する黒画素塊と判定したが、判定条件はこれに限られない。例えば、第１〜第３の条件のうちの少なくとも１つを満たす黒画素塊を、表の枠線を構成する黒画素塊と判定しても良い。 The control unit 123 determines whether each of the black pixel blocks satisfies the first to third conditions described above to determine whether it is a black pixel block forming a frame of the table, The storage unit 122 stores the position information of the circumscribed rectangle of the black pixel block constituting the frame line of The area of the circumscribed rectangle having the position information stored in this manner is called a table area. In the case shown in FIG. 12, as a result of the detection of the front area, the black pixel block 1201 is determined to be a black pixel block that constitutes the border of the table, and the area of the circumscribed rectangle 1202 is detected as the front area. I assume. In the present embodiment, the black pixel block satisfying all of the above-described first to third conditions is determined as the black pixel block forming the frame of the table, but the determination condition is not limited to this. For example, a black pixel block satisfying at least one of the first to third conditions may be determined as a black pixel block constituting a frame line of the table.

（４）認識セルの特定
制御部１２３は、表領域内部の認識セルを特定する。ここで「認識セル」とは、表領域内部の白画素塊の外接矩形である。認識セルを特定するためには、表領域内部の白画素の輪郭を追跡することにより、白画素塊を検出する必要がある。その上で、制御部１２３は、検出した白画素塊の夫々について、以下に示す３つの条件を満たすか判定し、３つの条件を全て満たす白画素塊の外接矩形を、認識セルとして特定する。 (4) Identification Control Unit of Recognition Cell The recognition control unit 123 identifies the recognition cell in the front area. Here, the "recognition cell" is a circumscribed rectangle of a white pixel block inside the front area. In order to specify the recognition cell, it is necessary to detect a white pixel block by tracking the outline of the white pixel inside the front area. Then, the control unit 123 determines whether the following three conditions are satisfied for each of the detected white pixel blocks, and specifies the circumscribed rectangle of the white pixel block satisfying all the three conditions as a recognition cell.

第１の条件は、白画素塊の外接矩形の幅が所定の閾値以上であり、かつ該外接矩形の高さが所定の閾値以上であることである。尚、本実施例では、幅及び高さがともに、２０画素以上であるか判定する。 The first condition is that the width of the circumscribed rectangle of the white pixel block is equal to or greater than a predetermined threshold, and the height of the circumscribed rectangle is equal to or greater than the predetermined threshold. In this embodiment, it is determined whether the width and height are both 20 pixels or more.

第３の条件は、白画素塊の最大幅と外接矩形の幅との間の差、及び、白画素塊の最大高さと外接矩形の高さとの間の差が何れも小さいことである。具体的には、白画素塊の最大幅と外接矩形の幅との間の差が所定の閾値以下であり、かつ、白画素塊の最大高さと外接矩形の高さとの間の差が所定の閾値以下であることである。尚、本実施例では、白画素塊の最大幅と外接矩形の幅との間の差、及び、白画素塊の最大高さと外接矩形の高さとの間の差がともに、５画素以下であるか判定する。 The third condition is that the difference between the maximum width of the white pixel block and the width of the circumscribed rectangle and the difference between the maximum height of the white pixel block and the height of the circumscribed rectangle are both small. Specifically, the difference between the maximum width of the white pixel block and the width of the circumscribed rectangle is less than or equal to a predetermined threshold, and the difference between the maximum height of the white pixel block and the height of the circumscribed rectangle is predetermined It is below the threshold. In the present embodiment, the difference between the maximum width of the white pixel block and the width of the circumscribed rectangle and the difference between the maximum height of the white pixel block and the height of the circumscribed rectangle are both 5 or less. Determine if

図１２の符号１２０３、１２０４は、制御部１２３が特定した認識セルを示す。制御部１２３は、特定した認識セルの位置情報を保存部１２２に保存する。 Reference numerals 1203 and 1204 in FIG. 12 denote recognition cells identified by the control unit 123. The control unit 123 stores the position information of the identified recognition cell in the storage unit 122.

尚、本実施例では、上述の第１〜第３の条件を全て満たす白画素塊の外接矩形を、認識セルとして特定したが、判定条件はこれに限られない。例えば、第１〜第３の条件のうちの少なくとも１つを満たす白画素塊の外接矩形を、認識セルとして特定しても良い。 In the present embodiment, the circumscribed rectangle of the white pixel block satisfying all of the above-described first to third conditions is specified as the recognition cell, but the determination condition is not limited to this. For example, a circumscribed rectangle of a white pixel block satisfying at least one of the first to third conditions may be specified as a recognition cell.

（５）認識セル内の文字領域の特定
制御部１２３は、各認識セルの内部に、その各認識セルに内接する白画素塊によって囲まれた黒画素塊があるか判定する。そして、黒画素塊があると判定した場合、あると判定された全ての黒画素塊に対する外接矩形を設定する。 (5) The specification control unit 123 of the character area in the recognition cell determines whether or not there is a black pixel block surrounded by a white pixel block inscribed in each recognition cell inside each recognition cell. Then, when it is determined that there is a black pixel block, circumscribed rectangles for all the black pixel blocks determined to be present are set.

さらに、制御部１２３は、１つの認識セルの中に複数の外接矩形を設定した場合に、外接矩形同士の距離が所定の閾値以下であるか判定する。具体的には、制御部１２３は、外接矩形を１つずつ選択し、選択した外接矩形からの距離が所定の閾値以下となる外接矩形を検出する。例えば、外接矩形同士の距離が２０画素以下であるか判定する。 Furthermore, when a plurality of circumscribed rectangles are set in one recognition cell, the control unit 123 determines whether the distance between the circumscribed rectangles is equal to or less than a predetermined threshold. Specifically, the control unit 123 selects circumscribed rectangles one by one and detects a circumscribed rectangle whose distance from the selected circumscribed rectangle is equal to or less than a predetermined threshold. For example, it is determined whether the distance between circumscribed rectangles is 20 pixels or less.

さらに、制御部１２３は、そのような外接矩形を検出した場合、検出した外接矩形と、選択した外接矩形とを統合する。つまり、制御部１２３は、これら両方の外接矩形に外接する新たな外接矩形を設定するとともに、選択した外接矩形と、検出した外接矩形とを削除する。 Furthermore, when such a circumscribed rectangle is detected, the control unit 123 integrates the detected circumscribed rectangle and the selected circumscribed rectangle. That is, the control unit 123 sets a new circumscribed rectangle circumscribing the both circumscribed rectangles, and deletes the selected circumscribed rectangle and the detected circumscribed rectangle.

新たな外接矩形の設定、２つの外接矩形の削除が完了した後、制御部１２３は、その認識セル内の外接矩形をまた初めから１つずつ選択し、互いの間の距離が所定の閾値以下の外接矩形同士を統合していく。以上の処理を繰り返す。即ち、互いの間の距離が所定の閾値以下の外接矩形が無くなるまで、外接矩形同士の統合が繰り返される。 After setting of a new circumscribed rectangle and deletion of two circumscribed rectangles are completed, the control unit 123 selects the circumscribed rectangles in the recognition cell one by one from the beginning, and the distance between each other is less than a predetermined threshold. Integrate the circumscribed rectangles of Repeat the above processing. That is, integration of circumscribed rectangles is repeated until there is no circumscribed rectangle in which the distance between each other is equal to or less than a predetermined threshold.

以上の通り、本実施例では、１つの認識セルの内部に存在する外接矩形同士の統合を行うが、認識セルをまたぐ外接矩形同士の統合を行わない。 As described above, in this embodiment, the circumscribed rectangles present inside one recognition cell are integrated, but the circumscribed rectangles straddling the recognition cell are not integrated.

以上の処理が終わって依然として設定されている外接矩形は、文字領域（又はテキストブロック）と呼ばれる。以上の処理を認識セル内の文字領域の特定と呼ぶ。制御部１２３は、認識セルの内部に存在する文字領域の位置情報を、該認識セルに関連付けて保存部１２２に保存する。 The circumscribed rectangle that has been set up after the above processing is called a character area (or text block). The above processing is called identification of a character area in a recognition cell. The control unit 123 stores the position information of the character area present inside the recognition cell in the storage unit 122 in association with the recognition cell.

図１２のケースでは、符号１２０５、１２０６が文字領域を示している。文字領域１２０５の位置情報は、認識セル１２０３に関連付けられて保存部１２２に保存される。また、文字領域１２０６の位置情報は、認識セル１２０４に関連付けられて保存部１２２に保存される。 In the case of FIG. 12, reference numerals 1205 and 1206 indicate character areas. The position information of the character area 1205 is stored in the storage unit 122 in association with the recognition cell 1203. The position information of the character area 1206 is stored in the storage unit 122 in association with the recognition cell 1204.

（６）文字認識
制御部１２３は、各文字領域に対して文字認識を行うことによって、各文字領域に対応する認識文字を取得する。ここで「認識文字」とは、オリジナル文字（スキャン画像中の文字画像のこと）を文字認識することで得られる文字コードであり、認識文字が透明文字列としてＰＤＦファイルに埋め込まれる。文字認識で取得された認識文字は、対応する文字領域と関連付けられて保存部１２２に保存される。そうすることによって、認識文字は、文字領域と予め関連付けられている認識セルとも関連付けられることになる。文字認識を行わない場合、もしくは文字認識に失敗した場合は、文字領域に関連付けられる認識文字はない。 (6) The character recognition control unit 123 performs character recognition on each character area to obtain recognized characters corresponding to each character area. Here, "recognized character" is a character code obtained by character recognition of an original character (character image in a scanned image), and the recognized character is embedded in the PDF file as a transparent character string. The recognized characters acquired by character recognition are stored in the storage unit 122 in association with the corresponding character area. By doing so, the recognition character will also be associated with the recognition cell previously associated with the character area. If character recognition is not performed or character recognition fails, there is no recognized character associated with the character area.

また、文字認識を行う際に、認識率も取得する場合がある。認識率は、文字を正しく認識できたかを示す指標であり、認識文字と関連付けられて保存部１２２に保存される。 Moreover, when performing character recognition, a recognition rate may be acquired. The recognition rate is an index indicating whether the character has been correctly recognized, and is stored in the storage unit 122 in association with the recognized character.

（７）透明文字列の埋め込み
制御部１２３は、入力データが画像の場合は、まず新規にＰＤＦファイルを作成して、その中にスキャン画像を格納する一方、入力データがＰＤＦファイルの場合は、ＰＤＦファイルに含まれる透明文字列を一旦全て削除する。その後、制御部１２３は、上述の（６）文字認識で取得した認識文字を、対応する文字領域の位置に、透明文字列としてＰＤＦファイルに追加する。 (7) When the input data is an image, the transparent character string embedding control unit 123 first creates a PDF file newly and stores a scan image in it, while when the input data is a PDF file, Delete all the transparent strings contained in the PDF file. Thereafter, the control unit 123 adds the recognized character acquired by the above-described (6) character recognition to the PDF file as a transparent character string at the position of the corresponding character area.

尚、ここでは上述の（１）〜（７）の処理を、情報処理装置１２０の制御部１２３が実行する場合について説明したが、これらの処理を、複写機１１０の制御部１１４で実行しても構わない。その場合は、複写機１１０がＳＰＤＦファイルを作成することになる。 Here, although the case where the control unit 123 of the information processing apparatus 120 executes the above-described processes (1) to (7) has been described, these processes are executed by the control unit 114 of the copying machine 110. I don't care. In that case, the copying machine 110 will create an SPDF file.

＜ＰＤＦファイル判定処理を伴うＳＰＤＦファイル作成処理について＞
以下、入力データに基づくＳＰＤＦファイル作成をユーザーが指示した場合に実行される、入力データのＰＤＦファイルがＳＰＤＦファイルか判定するＰＤＦファイル判定処理を伴うＳＰＤＦファイル作成処理について、図４を用いて説明する。 <About SPDF file creation processing with PDF file determination processing>
Hereinafter, SPDF file creation processing accompanied by PDF file determination processing for determining whether the PDF file of input data is an SPDF file, which is executed when the user instructs creation of an SPDF file based on input data, will be described using FIG. .

ステップＳ４０１において、制御部１２３は、入力データがＰＤＦファイルの場合に、該ＰＤＦファイル内にページサイズ相当の画像が含まれているか判定することで、該ＰＤＦファイルにスキャン画像が含まれているか判定する。ＰＤＦファイルにページサイズ相当の画像が含まれている場合、該ＰＤＦファイルにスキャン画像が含まれていると判定してステップＳ４０４に進む。一方、ＰＤＦファイルにページサイズ相当の画像が含まれていない場合、該ＰＤＦファイルにスキャン画像が含まれていないと判定してステップＳ４０２に進む。具体的に本実施例では、ステップＳ４０１において、まずＰＤＦファイルからページの幅ｘ［ｍｍ］と高さｙ［ｍｍ］とを取得した上で、次に、幅がｘ−５［ｍｍ］以上であって、かつ高さがｙ−５［ｍｍ］以上の画像がＰＤＦファイル内にあるか判定する。ここで５は、許容可能な誤差である。このように本ステップでは、スキャン画像からＰＤＦファイルを作成する際に余白が入る場合があることを考慮している。つまり、ページサイズと画像サイズとが完全に一致しなくても、ある程度ページサイズに近いサイズの画像があれば、ＰＤＦファイルにスキャン画像が含まれていると判定する。尚、本実施例では、許容可能な誤差を５［ｍｍ］に設定したが、この値はスキャン環境に応じて適宜変更して構わない。 In step S401, when the input data is a PDF file, the control unit 123 determines whether the PDF file includes a scanned image by determining whether the PDF file includes an image equivalent to the page size. Do. If the PDF file includes an image of a page size, it is determined that the PDF file includes a scan image, and the process advances to step S404. On the other hand, when the PDF file does not include the image of the page size, it is determined that the PDF file does not include the scan image, and the process proceeds to step S402. Specifically, in the present embodiment, in step S401, the width x [mm] and height y [mm] of the page are first acquired from the PDF file, and then the width is x -5 [mm] or more. It is determined whether there is an image having a height of y-5 [mm] or more in the PDF file. Here, 5 is an allowable error. As described above, in this step, it is considered that a margin may be included when creating a PDF file from a scanned image. That is, even if the page size and the image size do not completely match, if there is an image having a size close to the page size to some extent, it is determined that the PDF file includes the scan image. Although the allowable error is set to 5 [mm] in this embodiment, this value may be changed as appropriate depending on the scanning environment.

まず、ＰＤＦファイル内にページサイズ相当の画像が含まれていない（ステップＳ４０１でＮＯの場合）について説明する。この場合、ステップＳ４０２において、制御部１２３は、入力データのＰＤＦファイルがＳＰＤＦファイルではないと判定する。次いで、ステップＳ４０３において、制御部１２３は、入力データに対してＯＣＲ処理を実行し、その結果得られる文字コードを透明文字列として埋め込むことでＳＰＤＦファイルを作成し、該作成したＳＰＤＦファイルを保存部１２２に保存する。 First, the case where the image corresponding to the page size is not included in the PDF file (in the case of NO at step S401) will be described. In this case, in step S402, the control unit 123 determines that the PDF file of the input data is not an SPDF file. Next, in step S403, the control unit 123 executes an OCR process on the input data, and embeds the character code obtained as a result as a transparent character string, thereby creating a SPDF file, and storing the created SPDF file Save to 122

続いて、ＰＤＦファイル内にページサイズ相当の画像が含まれている場合（ステップＳ４０１でＹＥＳ）について説明する。この場合、ステップＳ４０４において、制御部１２３は、画像に対して領域分割を実行することで文字領域を抽出する。本ステップで得た文字領域の総数をＮ個とする。 Next, the case where the image of the page size is included in the PDF file (YES in step S401) will be described. In this case, in step S404, the control unit 123 extracts a character area by performing area division on the image. The total number of character areas obtained in this step is N.

ステップＳ４０５において、制御部１２３は、透明文字列と位置及びサイズが一致する文字領域をカウントするためのパラメータＸを初期化して０に設定する。 In step S405, the control unit 123 initializes a parameter X for counting character areas whose positions and sizes match the transparent character string, and sets the parameter X to zero.

ステップＳ４０６において、制御部１２３は、未チェックの文字領域の１つに着目する（着目文字領域の選択）。 In step S406, the control unit 123 focuses on one of the unchecked character areas (selection of a focused character area).

ステップＳ４０７において、制御部１２３は、位置及びサイズが着目文字領域のそれらと一致する透明文字列が存在するか判定する。本ステップでは、画像オブジェクト内の文字領域と、文字オブジェクト（具体的には透明文字列）が描画された領域との位置関係を見ている。ステップＳ４０７の判定結果が真の場合、ステップＳ４０８に進む一方、該判定結果が偽の場合、ステップＳ４０９に進む。尚、本ステップにおいて、スキャンのズレ等も考慮し、位置及びサイズが完全に一致していなくても、着目文字領域の位置及びサイズが透明文字列の位置及びサイズと一致すると判定しても良い。例えば、±１ｍｍ程度の範囲内の差異があったとしても、一致すると判定しても良い。 In step S407, the control unit 123 determines whether there is a transparent character string whose position and size match those of the target character area. In this step, the positional relationship between the character area in the image object and the area where the character object (specifically, a transparent character string) is drawn is seen. If the determination result in step S407 is true, the process proceeds to step S408, while if the determination result is false, the process proceeds to step S409. In this step, it is also possible to determine that the position and size of the target character area match the position and size of the transparent character string even if the position and size do not completely match, in consideration of scan deviations and the like. . For example, even if there is a difference in the range of about ± 1 mm, it may be determined that they match.

ステップＳ４０７でＹＥＳの場合、ステップＳ４０８において、制御部１２３は、Ｘをインクリメントし、ステップＳ４０９に進む。これに対し、ステップＳ４０７でＮＯの場合、Ｘをインクリメントすることなく、ステップＳ４０９に進む。 If YES in step S407, the control unit 123 increments X in step S408, and proceeds to step S409. On the other hand, if NO in step S407, the process proceeds to step S409 without incrementing X.

ステップＳ４０９において、制御部１２３は、全ての文字領域をチェックしたか判定する。ステップＳ４０９の判定結果が真の場合、ステップＳ４１０に進む一方、該判定結果が偽の場合、ステップＳ４０６に戻る。 In step S409, the control unit 123 determines whether all the character areas have been checked. If the determination result in step S409 is true, the process proceeds to step S410, while if the determination result is false, the process returns to step S406.

このように本実施例では、ステップＳ４０５〜ステップＳ４０９において、文字領域のうち、位置及びサイズが透明文字列と一致するものが何個あるかをカウントしている。 As described above, in this embodiment, in steps S405 to S409, it is counted how many positions and sizes of the character areas match the transparent character string.

ステップＳ４１０において、制御部１２３は、以下の式（１）を満たすか判定する。ステップＳ４１０の判定結果が真の場合、ステップＳ４１１に進む一方、該判定結果が偽の場合、ステップＳ４１３に進む。 In step S410, the control unit 123 determines whether the following formula (1) is satisfied. If the determination result in step S410 is true, the process proceeds to step S411, while if the determination result is false, the process proceeds to step S413.

式（１）において、Ｓは、入力データがＳＰＤＦファイルであるか判定するための閾値であって、位置及びサイズが透明文字列と一致する文字領域の数がいくつより大きいときに、入力データがＳＰＤＦファイルと判定するための閾値である。本実施例では、Ｓを０．９に設定した。Ｓを０．９に設定したということは、文字領域のうちの、位置及びサイズが透明文字列と一致する文字領域の割合が９０％より大きければ、入力データがＳＰＤＦファイルと判定されることを意味する。Ｓを１．０ではなく０．９とした理由は、環境に応じたスキャン精度の低下等が原因で、常に１００％の認識率で文字を認識できるわけではないことを考慮しているためである。尚、Ｓの値は当然、環境に応じて変更して構わない。 In Equation (1), S is a threshold value for determining whether the input data is a SPDF file, and the input data is a threshold when the number of character areas whose position and size match the transparent character string is greater than This is a threshold for determining an SPDF file. In the present embodiment, S is set to 0.9. Setting S to 0.9 means that the input data is determined to be an SPDF file if the ratio of the character area whose position and size match the transparent character string among the character areas is greater than 90%. means. The reason why S is set to 0.9 instead of 1.0 is because it takes into consideration that characters can not always be recognized with a recognition rate of 100% because of a decrease in scan accuracy according to the environment, etc. is there. Of course, the value of S may be changed according to the environment.

式（１）を満たす場合（ステップＳ４１０でＹＥＳ）、ステップＳ４１１において、制御部１２３は、入力データがＳＰＤＦファイルと判定する。次いで、ステップＳ４１２において、制御部１２３は、ＳＰＤＦファイルを作成し直すことなく、入力データのＳＰＤＦファイルをそのまま保存部１２２に保存し、一連の処理は終了する。 If equation (1) is satisfied (YES in step S410), in step S411, control unit 123 determines that the input data is a SPDF file. Next, in step S412, the control unit 123 stores the SPDF file of the input data as it is in the storage unit 122 without recreating the SPDF file, and the series of processing ends.

これに対し、式（１）を満たさない場合（ステップＳ４１０でＮＯ）、ステップＳ４１３において、制御部１２３は、入力データがＳＰＤＦファイルではないと判定する。次いで、ステップＳ４１４において、制御部１２３は、入力データに対してＯＣＲ処理を実行し、ＯＣＲ処理の結果取得する文字コードを透明文字列として埋め込むことでＳＰＤＦファイルを作成し、該作成したＳＰＤＦファイルを保存部１２２に保存する。以上が、本実施例における、ＰＤＦファイル判定処理を伴うＳＰＤＦファイル作成処理の内容である。 On the other hand, when the equation (1) is not satisfied (NO in step S410), in step S413, the control unit 123 determines that the input data is not an SPDF file. Next, in step S414, the control unit 123 executes an OCR process on the input data, and embeds the character code acquired as a result of the OCR process as a transparent character string to create a SPDF file, and the created SPDF file It is stored in the storage unit 122. The above is the contents of the SPDF file creation process accompanied by the PDF file determination process in the present embodiment.

本実施例により、検索可能なファイルであることを通知するためのコマンドを用いることなく、入力データのＰＤＦファイルがＳＰＤＦファイルであるかを高精度に判定することができる。また、入力データが既にＳＰＤＦファイルであった場合は、該ＳＰＤＦファイルをそのまま保存部に保存しＳＰＤＦファイルを作成し直すことがない。従って、無駄な処理を減らすことができる。 According to this embodiment, it is possible to determine with high accuracy whether the PDF file of the input data is an SPDF file without using a command for notifying that the file is a searchable file. If the input data is already an SPDF file, the SPDF file is stored as it is in the storage unit and the SPDF file is not re-created. Therefore, unnecessary processing can be reduced.

［実施例２］
本実施例では、入力データがＳＰＤＦファイルと判定された場合（ステップＳ４１０でＹＥＳ→ステップＳ４１１）に、ＳＰＤＦファイルを作成し直すか否かをユーザーに選択させる。尚、以下では既述の実施例との差分について主に説明し、既述の実施例と同様の内容については説明を適宜省略する。 Example 2
In the present embodiment, when it is determined that the input data is an SPDF file (YES in step S410 → step S411), the user is made to select whether or not to re-create the SPDF file. In the following, differences from the above-described embodiment will be mainly described, and the description of the same contents as the above-described embodiment will be appropriately omitted.

入力データが既にＳＰＤＦファイルだったとしても、情報処理装置１２０で文字認識をやり直すことで、文字認識の精度が向上する場合がある。例えば情報処理装置１２０に搭載されている文字認識エンジンが優れていれば、情報処理装置１２０で文字認識をやり直すことで、文字認識の精度が向上する可能性がある。一方で、元々ＳＰＤＦファイルを作成した文字認識エンジンの方が情報処理装置１２０に搭載されている文字認識エンジンより優れていれば、情報処理装置１２０で文字認識をやり直しても文字認識の精度が向上することはない。また、文字認識は処理にある程度の時間を要するため、パフォーマンスの観点から、常にＳＰＤＦファイルを作成し直すことは好ましくない。このように、ＳＰＤＦファイルを作成し直すことには一長一短があり、そのメリット及びデメリットもユーザー環境に依存するため、ＳＰＤＦファイルを作成し直すか否かをユーザーが選択できるようにする。 Even if the input data is already an SPDF file, the character recognition accuracy may be improved by re-doing the character recognition in the information processing apparatus 120. For example, if the character recognition engine installed in the information processing apparatus 120 is excellent, the character recognition accuracy may be improved by performing the character recognition again by the information processing apparatus 120. On the other hand, if the character recognition engine that originally created the SPDF file is superior to the character recognition engine installed in the information processing apparatus 120, the accuracy of the character recognition improves even if the information processing apparatus 120 re-executes character recognition There is nothing to do. In addition, since character recognition requires a certain amount of time for processing, it is not preferable to constantly re-create the SPDF file from the viewpoint of performance. As described above, there is an advantage and disadvantage in re-creating the SPDF file, and the merits and demerits also depend on the user environment, so the user can select whether or not to re-create the SPDF file.

以下、本実施例における処理について、図５を用いて説明する。 Hereinafter, the process in the present embodiment will be described with reference to FIG.

ステップＳ４１１で、入力データがＳＰＤＦファイルと判定された場合、ステップＳ５０１において、制御部１２３は、ＳＰＤＦファイルを作成し直すか否かをユーザーに選択させるためのダイアログを、表示部１２４に表示する。図６は、本ステップで表示されるダイアログの一例を示す図である。ダイアログ６０１は、ＳＰＤＦファイルを作成し直すか否かを問うメッセージ６０２を有する。また、ダイアログ６０１は、ＳＰＤＦファイルを作成する場合にユーザーが押下するＹｅｓボタン６０３、及び、ＳＰＤＦファイルを作成しない場合にユーザーが押下するＮｏボタン６０４を有する。 If it is determined in step S411 that the input data is an SPDF file, in step S501, the control unit 123 displays on the display unit 124 a dialog for causing the user to select whether or not to re-create the SPDF file. FIG. 6 is a diagram showing an example of the dialog displayed in this step. The dialog 601 has a message 602 asking whether to re-create the SPDF file. The dialog 601 also includes a Yes button 603 pressed by the user when creating the SPDF file, and a No button 604 pressed by the user when the SPDF file is not created.

ステップＳ５０２において、制御部１２３は、ダイアログ６０１のＹｅｓボタン６０３が押下されたか判定する。ステップＳ５０２の判定結果が真の場合、ステップＳ５０３に進む一方、該判定結果が偽の場合（即ち、Ｎｏボタン６０４が押下された場合）、ステップＳ４１２に進む。 In step S502, the control unit 123 determines whether the Yes button 603 of the dialog 601 is pressed. If the determination result in step S502 is true, the process proceeds to step S503, while if the determination result is false (ie, the No button 604 is pressed), the process proceeds to step S412.

ＹＥＳボタン６０３が押下された場合（ステップＳ５０２でＹＥＳ）、ステップＳ５０３において、制御部１２３は、ＳＰＤＦファイルを作成し直し、該作成し直したＳＰＤＦファイルを保存部１２２に保存する。 When the YES button 603 is pressed (YES in step S502), in step S503, the control unit 123 re-creates the SPDF file, and saves the re-created SPDF file in the saving unit 122.

本実施例により、入力データがＳＰＤＦファイルと判定された場合に、ＳＰＤＦファイルを作成し直すかどうかをユーザーに選択させることが可能になる。 According to the present embodiment, when the input data is determined to be an SPDF file, it is possible to allow the user to select whether to re-create the SPDF file.

［実施例３］
本実施例では、ＳＰＤＦファイルを作成し直す際に、画像の内容と無関係な透明文字列を削除せずに残す。 [Example 3]
In this embodiment, when the SPDF file is re-created, the transparent character string irrelevant to the content of the image is left without being deleted.

入力データのＳＰＤＦファイルに基づいてＳＰＤＦファイルを作成し直す場合は通常、元々埋め込まれている透明文字列を一旦削除し、新たに取得した文字コードを透明文字列として埋め込む。しかしこのような処理を図３のような画像の内容と無関係な透明文字列が存在するＳＰＤＦファイルに対して実行した場合、該透明文字列は、削除されたまま消えてしまうことになる。そのため本実施例では、透明文字列を一旦削除する際に、画像の内容と無関係な透明文字列を削除しないようにする。以下、本実施例におけるＳＰＤＦファイルを作成し直す処理について、図７を用いて説明する。 When recreating the SPDF file based on the SPDF file of the input data, normally, the originally embedded transparent character string is once deleted, and the newly acquired character code is embedded as the transparent character string. However, when such a process is performed on an SPDF file having a transparent character string unrelated to the content of the image as shown in FIG. 3, the transparent character string will be deleted and disappear. Therefore, in the present embodiment, when the transparent character string is once deleted, the transparent character string unrelated to the content of the image is not deleted. Hereinafter, the process of creating the SPDF file again in the present embodiment will be described with reference to FIG.

まずステップＳ７０１において、制御部１２３は、入力データの画像に対して領域分割、文字認識を行い、その結果得られる情報を保存部１２２に保存しておく。 First, in step S701, the control unit 123 performs area division and character recognition on an image of input data, and stores information obtained as a result in the storage unit 122.

ステップＳ７０２において、制御部１２３は、未チェックの透明文字列の１つに着目する（着目透明文字列の選択）。 In step S702, the control unit 123 focuses on one unchecked transparent character string (selection of a target transparent character string).

ステップＳ７０３において、制御部１２３は、着目透明文字列の位置及びサイズがステップＳ７０１で取得した文字領域の何れかの位置及びサイズと一致するか判定する。ステップＳ７０３の判定結果が真の場合、ステップＳ７０４に進む一方、該判定結果が偽の場合、ステップＳ７０６に進む。尚、本ステップにおいて、スキャンのズレ等も考慮し、位置及びサイズが完全に一致していなくても、着目透明文字列の位置及びサイズが文字領域の何れかの位置及びサイズと一致すると判定しても良い。 In step S703, the control unit 123 determines whether the position and size of the target transparent character string match any position and size of the character area acquired in step S701. If the determination result in step S703 is true, the process proceeds to step S704, while if the determination result is false, the process proceeds to step S706. In this step, it is determined that the position and the size of the target transparent character string coincide with any position and the size of the character area even if the position and the size do not completely match, considering the shift of the scan, etc. It is good.

着目透明文字列の位置及びサイズが文字領域の１つと一致する場合（ステップＳ７０３でＹＥＳ）、ステップＳ７０４において、制御部１２３は、着目透明文字列が画像の内容と関係する透明文字列、つまり検索用の透明文字列と判定する。次いで、ステップＳ７０５において、制御部１２３は、着目透明文字列を削除し、ステップＳ７０７に進む。 If the position and size of the target transparent character string match one of the character areas (YES in step S703), in step S704, the control unit 123 determines that the target transparent character string is a transparent character string related to the content of the image, that is, the search Determined to be a transparent string for Next, in step S705, the control unit 123 deletes the target transparent character string, and proceeds to step S707.

これに対し、着目透明文字列の位置及びサイズと一致する位置及びサイズを有する文字領域が存在しない場合（ステップＳ７０３でＮＯ）、ステップＳ７０６において、制御部１２３は、着目透明文字列が画像の内容と無関係な透明文字列と判定する。次いで、ステップＳ７０７に進む。 On the other hand, when there is no character area having a position and size matching the position and size of the target transparent character string (NO in step S703), in step S706, the control unit 123 determines that the target transparent character string is the content of the image. It is judged as a transparent string unrelated to. Then, it progresses to step S707.

ステップＳ７０７において、制御部１２３は、全ての透明文字列をチェックしたか判定する。ステップＳ７０７の判定結果が真の場合、ステップＳ７０８に進む一方、該判定結果が偽の場合、ステップＳ７０２に戻る。 In step S 707, the control unit 123 determines whether all the transparent character strings have been checked. If the determination result in step S707 is true, the process proceeds to step S708, while if the determination result is false, the process returns to step S702.

ステップＳ７０８において、制御部１２３は、文字認識で取得した文字コードを透明文字列として追加する。以上が、本実施例におけるＳＰＤＦファイルを作成し直す処理の内容である。 In step S 708, the control unit 123 adds the character code acquired by character recognition as a transparent character string. The above is the contents of the process of creating the SPDF file again in the present embodiment.

本実施例により、ＳＰＤＦファイルを作成し直す際に、画像の内容と無関係な透明文字列を削除せずに残すことができる。 According to this embodiment, when the SPDF file is re-created, it is possible to leave the transparent character string unrelated to the content of the image without deleting it.

［実施例４］
本実施例では、画像の内容と無関係な透明文字列を削除せずに残すか否かをユーザーが選択できるようにする。 Example 4
In this embodiment, it is possible for the user to select whether or not the transparent character string irrelevant to the content of the image is left without being deleted.

図８は、画像の内容と無関係な透明文字列を削除するか否かをユーザーに選択させる際に、制御部１２３が表示部１２４に表示するプロンプト画面の一例を示す図である。図８のケースでは、制御部１２３が画像の内容と無関係な透明文字列３０２を検出し、その外接矩形８０１を表示することで、透明文字列３０２を強調している。また、図示していないが、一時的に透明文字列３０２を半透明にして表示することで、ユーザーが透明文字列３０２を視認できるようにしている。さらに、透明文字列３０２の近くにチェックボックス８０２を表示している。ユーザーがこのチェックボックス８０２にチェックをつけた場合、透明文字列３０２はＳＰＤＦファイルを作成し直す際に削除されることになる。 FIG. 8 is a diagram showing an example of a prompt screen displayed on the display unit 124 by the control unit 123 when the user selects whether to delete a transparent character string unrelated to the content of the image. In the case of FIG. 8, the control unit 123 detects the transparent character string 302 irrelevant to the content of the image and displays the circumscribed rectangle 801 to emphasize the transparent character string 302. Further, although not shown, the user can visually recognize the transparent character string 302 by temporarily displaying the transparent character string 302 as translucent. Furthermore, a check box 802 is displayed near the transparent character string 302. If the user checks this check box 802, the transparent string 302 will be deleted when the SPDF file is recreated.

＜画面作成処理について＞
以下、本実施例における、図８に例示したような画面を作成する処理について、図９（ａ）を用いて説明する。 <About screen creation process>
Hereinafter, the process of creating the screen as illustrated in FIG. 8 in the present embodiment will be described with reference to FIG.

本実施例における画面作成処理では、ステップＳ７０６で着目透明文字列が画像の内容と無関係な透明文字列と判定された後、ステップＳ９０１に進む。そして、ステップＳ９０１において、制御部１２３は、着目透明文字列に対応する外接矩形と、チェックボックスとを、表示部１２４に表示するとともに、着目透明文字列を半透明にして表示する。 In the screen creation processing in the present embodiment, after it is determined in step S706 that the target transparent character string is a transparent character string unrelated to the content of the image, the processing proceeds to step S901. Then, in step S901, the control unit 123 displays the circumscribed rectangle corresponding to the target transparent character string and the check box on the display unit 124, and displays the target transparent character string as semitransparent.

＜ＳＰＤＦファイル作成処理について＞
以下、本実施例におけるＳＰＤＦファイル作成処理について、図９（ｂ）を用いて説明する。 <About SPDF file creation process>
Hereinafter, the SPDF file creation processing in the present embodiment will be described using FIG. 9 (b).

本実施例におけるＳＰＤＦファイル作成処理では、ステップＳ７０６で着目透明文字列が画像の内容と無関係な透明文字列と判定された後、ステップＳ９０２に進む。そして、ステップＳ９０２において、制御部１２３は、着目透明文字列に対応するチェックボックスにチェックがつけられたか判定する。ステップＳ９０２の判定結果が真の場合、ステップＳ７０５に進み、着目透明文字列を削除する一方、該判定結果が偽の場合、ステップＳ７０７に進む。この場合、着目透明文字列は削除されない。つまり本実施例では、画像の内容と無関係な透明文字列を必ず削除するのではなく、ユーザーの選択に応じて該透明文字列を削除する。 In the SPDF file creation process in the present embodiment, after it is determined in step S706 that the target transparent character string is a transparent character string unrelated to the content of the image, the process proceeds to step S902. Then, in step S902, the control unit 123 determines whether the check box corresponding to the target transparent character string is checked. If the determination result in step S902 is true, the process proceeds to step S705, and while the target transparent character string is deleted, the process proceeds to step S707 if the determination result is false. In this case, the target transparent character string is not deleted. That is, in this embodiment, the transparent character string irrelevant to the content of the image is not necessarily deleted, but the transparent character string is deleted according to the user's selection.

本実施例により、画像の内容と無関係な透明文字列を削除せずに残すか否かをユーザーが選択することが可能になる。 The present embodiment allows the user to select whether or not to keep transparent strings irrelevant to the content of the image without deleting them.

［実施例５］
本実施例では、ＳＰＤＦファイルを作成し直す際に、認識率の高い文字列のみを透明文字列で置き換える。上述したように、ＳＰＤＦファイルを作成し直す際には、既存の透明文字列を一旦削除してから文字認識を行い、透明文字列を追加する。しかし、認識率が低い場合、既存の透明文字列を取得した際よりも認識精度が落ちてしまう可能性が高い。そのため、本実施例では、認識率が閾値以上の文字列のみを透明文字列で置き換えるようにする。 [Example 5]
In this embodiment, only the character string having a high recognition rate is replaced with the transparent character string when the SPDF file is re-created. As described above, when the SPDF file is re-created, the existing transparent character string is once deleted, then character recognition is performed, and the transparent character string is added. However, when the recognition rate is low, there is a high possibility that the recognition accuracy will be lower than when the existing transparent character string is acquired. Therefore, in the present embodiment, only the character string whose recognition rate is equal to or more than the threshold value is replaced with the transparent character string.

図１０は、ＳＰＤＦファイルを作成する際のオプションを設定するためのダイアログの一例を示す図である。オプション設定ダイアログ１００１は、全ての透明文字を置き換える場合にチェックをつけるチェックボックス１００４と、認識率の高い文字列のみ置き換える場合にチェックをつけるチェックボックス１００５とを有する。ユーザーは、チェックボックス１００４とチェックボックス１００５との何れかにチェックをつける。 FIG. 10 is a diagram showing an example of a dialog for setting options when creating an SPDF file. The option setting dialog 1001 has a check box 1004 for checking when replacing all transparent characters, and a check box 1005 for checking when replacing only a character string with a high recognition rate. The user checks either the check box 1004 or the check box 1005.

＜ＳＰＤＦファイル作成処理について＞
以下、本実施例における、オプション設定ダイアログ１００１内のチェックボックス１００５にチェックが入っていた場合に実行されるＳＰＤＦファイル作成処理について、図１１を用いて説明する。 <About SPDF file creation process>
Hereinafter, the SPDF file creation process executed when the check box 1005 in the option setting dialog 1001 is checked in the present embodiment will be described using FIG.

本実施例では、ステップＳ７０４の次にステップＳ１１０１に進み、ステップＳ１１０１において、制御部１２３は、着目透明文字列に対応する認識文字の認識率が所定の閾値以上か判定する。ステップＳ１１０１の判定結果が真の場合、ステップＳ７０５に進んで着目透明文字列を削除する一方、該判定結果が偽の場合、ステップＳ７０７に進む。このように本ステップでは、後続のステップＳ１１０２で認識率が高い文字列のみを透明文字列で置き換えるために、認識率が高い既存の透明文字列を予め削除しておく。 In the present embodiment, the process proceeds to step S1101 after step S704, and in step S1101 the control unit 123 determines whether the recognition rate of the recognized character corresponding to the target transparent character string is equal to or more than a predetermined threshold. If the determination result in step S1101 is true, the process proceeds to step S705 to delete the target transparent character string, while if the determination result is false, the process proceeds to step S707. As described above, in this step, in order to replace only a character string having a high recognition rate with a transparent character string in the subsequent step S1102, an existing transparent character string having a high recognition rate is deleted in advance.

また、本実施例では、ステップＳ７０７の次にステップＳ１１０２に進み、ステップＳ１１０２において、制御部１２３は、認識文字のうち、認識率が所定の閾値以上のものを透明文字列として追加し、一連の処理は終了する。以上が、本実施例におけるＳＰＤＦファイル作成処理の内容である。 Further, in the present embodiment, the process proceeds to step S1102 after step S707, and in step S1102, the control unit 123 adds, among the recognized characters, characters whose recognition rate is equal to or more than a predetermined threshold as a transparent character string. The process ends. The above is the contents of the SPDF file creation process in this embodiment.

本実施例により、ＳＰＤＦファイルを作成し直す際に、認識率の高い文字列のみを透明文字列で置き換えることができる。 According to this embodiment, when the SPDF file is re-created, only the character string having a high recognition rate can be replaced with the transparent character string.

［実施例６］
本実施例では、上述の実施例で説明した方法で生成したＳＰＤＦファイルの活用方法の一例について述べる。本実施例では、保存部１２２などに保存された各ファイルを詳細に検索する仕組みを提供する。また、検索で見つかったファイルを，複写機１１０を用いて印刷する仕組みを提供する。図１３（ａ）及び図１３（ｂ）は、情報処理装置１２０の表示部１２４に表示される画面の例を示す図である。図１３（ａ）は、検索ワードを入力する画面の一例を示す図であり、図１３（ｂ）は、検索結果の一覧を示す画面の一例を示す図である。図１３（ａ）及び図１３（ｂ）に示す画面は、生成されたＳＰＤＦファイルや、コンピュータ内部の文書ファイルなどを管理する文書管理アプリケーションが提供する画面である。ユーザーは、図１３（ａ）及び図１３（ｂ）の画面を介してファイルの属性検索、全文検索、概念検索などを行うことができる。 [Example 6]
In this embodiment, an example of a method of utilizing the SPDF file generated by the method described in the above-described embodiment will be described. In the present embodiment, a mechanism is provided for searching in detail each file stored in the storage unit 122 or the like. Also, a mechanism for printing the file found by the search using the copying machine 110 is provided. FIGS. 13A and 13B illustrate examples of screens displayed on the display unit 124 of the information processing apparatus 120. FIG. FIG. 13A is a view showing an example of a screen for inputting a search word, and FIG. 13B is a view showing an example of a screen showing a list of search results. The screens illustrated in FIGS. 13A and 13B are screens provided by a document management application that manages the generated SPDF file, a document file in the computer, and the like. The user can perform file attribute search, full-text search, concept search and the like through the screens of FIGS. 13 (a) and 13 (b).

本実施例では、ＳＰＤＦを含むファイルの全文検索について説明する。領域１３０１は、検索キーワードを入力する領域である。ユーザーは、領域１３０１に検索したいキーワードを入力することができる。検索方式のチェックボックスは、各キーワードをＡＮＤで検索するかＯＲで検索するかを指定するためのチェックボックスである。Ｓｅａｒｃｈキー１３０２は、検索を行う際に押下されるキーである。 In this embodiment, full-text search of files including SPDF will be described. An area 1301 is an area for inputting a search keyword. The user can enter a keyword that he / she wants to search in the area 1301. The search method check box is a check box for specifying whether each keyword is to be searched by AND or OR. The Search key 1302 is a key pressed when performing a search.

図１３（ｂ）は、検索結果の一例を示す画面である。ここでは、「Ｋａｗａｓａｋｉ」、「ＢｌａｃｋＢｌｕｅ」の２つの検索キーワードを含む３つのファイル見つかり、リストに列挙されているケースを例示している。符号１３０３は、見つかった３つのファイルのうち、図２のＳＰＤＦファイルが指定されている様子を示している。ユーザーは、図１３（ｂ）の画面を介して検索で見つかったファイルを開いたり、印刷したりすることができる。 FIG. 13B is a screen showing an example of the search result. Here, three files including two search keywords of “Kawasaki” and “BlackBlue” are found and illustrated in the list. Reference numeral 1303 indicates that the SPDF file in FIG. 2 is specified among the three found files. The user can open and print the file found by the search through the screen of FIG. 13 (b).

具体的な制御方法について図１４を用いて説明する。図１４のフローチャートは、文書管理アプリケーションの管理画面を介してファイル検索のユーザー操作を受け付けたことに従って実行される。 A specific control method will be described with reference to FIG. The flowchart of FIG. 14 is executed according to the user operation of file search being accepted via the management screen of the document management application.

ステップＳ１４０１において、制御部１２３は、受付部１２５を介して検索キーワードの入力を受け付ける。検索キーワードの入力を受け付けた後に、Ｓｅａｒｃｈキー１３０２が押下されたことを検知すると、ステップＳ１４０２に進む。 In step S1401, the control unit 123 receives an input of a search keyword via the reception unit 125. If it is detected that the Search key 1302 has been pressed after receiving the search keyword input, the process proceeds to step S1402.

ステップＳ１４０２において、制御部１２３は、ステップＳ１４０１で入力されたキーワードに基づき、検索対象のストレージ内にあるファイル内の文字列を検索する。ファイル内の文字列にキーワードが含まれているファイルがあれば、表示対象として当該ファイルの属性情報を取得する。ストレージ内のファイルの検索が完了すると、ステップＳ１４０３に進む。尚、本実施例では、ＳＰＤＦファイルやＷＯＲＤファイル、エクセルファイル等を直接参照し、検索を行う場合を想定しているが、これに限定されるものではない。例えば、ファイル内の文字列をインデックス化したデータベースを予め生成しておき、該生成したデータベースを参照するようにしても良い。 In step S1402, the control unit 123 searches for a character string in the file in the storage to be searched based on the keyword input in step S1401. If there is a file in which a character string in the file includes a keyword, attribute information of the file is acquired as a display target. When the search of the file in the storage is completed, the process proceeds to step S1403. In the present embodiment, it is assumed that the search is performed by directly referring to the SPDF file, the WORD file, the Excel file or the like, but the present invention is not limited to this. For example, a database in which character strings in a file are indexed may be generated in advance, and the generated database may be referred to.

ステップＳ１４０３において、制御部１２３は、ステップＳ１４０２で見つかった検索結果の一覧を示す画面を表示する。図１３（ｂ）は、検索結果の一覧を示す画面の一例である。 In step S1403, the control unit 123 displays a screen showing a list of search results found in step S1402. FIG. 13B is an example of a screen showing a list of search results.

ステップＳ１４０４において、制御部１２３は、検索結果の一覧から文書を選択するユーザー操作を受け付ける。本ステップにおける文書の選択では、ユーザーは、複数の文書ファイルを選択することもできる。 In step S1404, the control unit 123 receives a user operation of selecting a document from the list of search results. In the document selection in this step, the user can also select a plurality of document files.

ステップＳ１４０５において、制御部１２３は、選択した文書を印刷するユーザー操作を受け付けたか判定する。本ステップの判定結果が真の場合、ステップＳ１４０６に進む一方、該判定結果が偽の場合、ステップＳ１４０７に進む。 In step S1405, the control unit 123 determines whether a user operation for printing the selected document has been accepted. If the determination result in this step is true, the process proceeds to step S1406, while if the determination result is false, the process proceeds to step S1407.

ステップＳ１４０６において、制御部１２３は、選択された文書に基づく印刷データを生成し、ＭＦＰ（例えば、複写機１１０）に該生成した印刷データを送信する。この印刷データを受信したＭＦＰは、受信した印刷データに基づき用紙等の記録媒体に画像を形成する。 In step S1406, the control unit 123 generates print data based on the selected document, and transmits the generated print data to the MFP (for example, the copying machine 110). The MFP that has received the print data forms an image on a recording medium such as a sheet based on the received print data.

ステップＳ１４０７において、制御部１２３は、選択文書を開くユーザー操作を受け付けたか判定する。本ステップの判定結果が真の場合、ステップＳ１４０８に進む一方、該判定結果が偽の場合、ステップＳ１４０４に戻る。 In step S1407, the control unit 123 determines whether a user operation for opening the selected document has been received. If the determination result in this step is true, the process proceeds to step S1408, while if the determination result is false, the process returns to step S1404.

ステップＳ１４０８では、制御部１２３は、選択された文書の拡張子に関連付けられたアプリケーションで、選択された文書を開く。 In step S1408, the control unit 123 opens the selected document with an application associated with the selected document extension.

以上説明したように、本実施例では、情報処理装置１２０内部に記憶されたＳＰＤＦ形式の文書ファイルの全文探索を行うことができる。従って、ユーザーが必要とする資料を簡単に探すことができるようになる。また、探索で見つかった文書ファイルについて、複写機１１０と連携して印刷することができる。従って、情報処理装置上で管理されている多種多様の文書ファイルから、所望のファイルを簡単にサーチし、再印刷することができる。従ってユーザーの利便性を高めることができる。 As described above, in the present embodiment, the full text search of the SPDF format document file stored in the information processing apparatus 120 can be performed. Thus, the user can easily find the material he needs. Also, the document file found by the search can be printed in cooperation with the copying machine 110. Therefore, a desired file can be easily searched and reprinted from various document files managed on the information processing apparatus. Therefore, the convenience of the user can be enhanced.

［その他の実施形態］
上述の各実施形態では、情報処理装置１２０と複写機１１０が有線又は無線で接続されている場合を例示しているがこれに限定されるものではない。例えば、ＳＰＤＦを生成したり、各文書ファイルを管理したりする情報処理装置１２０の各機能は、クラウド上のサーバとして実現することもできる。この場合、複写機などで読み取られたデータは、クラウド上のサーバに送信され、必要に応じてＳＰＤＦファイルに変換されてクラウドストレージ上に記憶される。また、クラウド上のサーバは、クライアントＰＣのＷｅｂブラウザ等からアクセス可能なＷｅｂアプリケーションを提供する。この場合、ユーザーは、Ｗｅｂアプリケーションを経由して、実施例６で説明した全文探索の機能や、探索で見つかった文書ファイルの印刷機能を利用するものとする。 Other Embodiments
In the above-described embodiments, the information processing apparatus 120 and the copying machine 110 are connected by wire or wireless, but the present invention is not limited to this. For example, each function of the information processing apparatus 120 that generates an SPDF or manages each document file can also be realized as a server on the cloud. In this case, data read by a copying machine or the like is transmitted to a server on the cloud, converted to an SPDF file as necessary, and stored on the cloud storage. Also, the server on the cloud provides a web application accessible from the web browser or the like of the client PC. In this case, the user uses the full-text search function described in the sixth embodiment and the print function of the document file found in the search via the Web application.

また、上述の実施形態では、検索可能に変換するファイルの一例としてＰＤＦを例示したがこれに限定されるものではない。その他ファイルフォーマット（例えば、Ｍｉｃｒｏｓｏｆｔが提唱するＸＰＳフォーマットや、Ｘｅｒｏｘが提唱するＤｏｃｕＷｏｒｋｓ文書で用いられるＸＤＷフォーマットのファイルを解析し、検索可能なファイルに変換することもできる。 Moreover, in the above-mentioned embodiment, although PDF was illustrated as an example of a file which transforms into searchable, it is not limited to this. Other file formats (for example, XPS format proposed by Microsoft, XDW format file used in DocuWorks document proposed by Xerox) can be analyzed and converted into searchable files.

本発明は、上述の実施形態の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。 The present invention supplies a program that implements one or more functions of the above-described embodiments to a system or apparatus via a network or storage medium, and one or more processors in a computer of the system or apparatus read and execute the program. Can also be realized. It can also be implemented by a circuit (eg, an ASIC) that implements one or more functions.

Claims

First extracting means for extracting a character area in an image object included in the file;
Based on the positional relationship between the character area extracted by the first extraction means and the area where the character object included in the file is drawn, the file includes an image and a result of character recognition corresponding to the image. A first determination unit that determines whether or not the file is a searchable file that includes;
Generating means for executing a character recognition process on the image object in response to the first determination means determining that the file is not a searchable file, and generating a file including a result of the character recognition process; An information processing apparatus comprising:

The information processing apparatus according to claim 1, wherein the first determination unit determines that the file is the searchable file when the following expression is satisfied.

In the above equation, X is the number of character areas in which the positional relationship between the character area in the image object and the area in which the character object is drawn satisfies a predetermined condition, and N is the first extraction. It is the total number of character areas extracted by the means, and S is a predetermined threshold.

The file from which the character area is extracted by the first extraction means is a PDF file, and the file generated by the generation means is a searchable PDF file to which the result of the character recognition processing is added as a transparent character object The information processing apparatus according to claim 1 or 2, wherein

The apparatus further comprises a second determination unit that determines whether the PDF file includes an image equivalent to a page size before the determination by the first determination unit is performed,
When the second determination unit determines that the PDF file includes an image equivalent to a page size, the first determination unit determines whether the PDF file is the searchable PDF file. on the other hand,
When the second determination means determines that the PDF file does not contain an image of a page size, the first determination means determines whether the PDF file is the searchable PDF file. The information processing apparatus according to claim 3, wherein the information processing apparatus is not provided.

When the second determination unit determines that the PDF file includes an image equivalent to a page size, the first extraction unit extracts a character region by dividing the image.
The character recognition process is performed on the character area extracted by the first extraction unit.
The generation unit generates a searchable PDF file to which a transparent character object indicating a character string obtained as a result of the character recognition process is added at the same position as the character area extracted by the first extraction unit. The information processing apparatus according to claim 4, wherein:

When the first determination unit determines that the PDF file input to the information processing apparatus is a searchable PDF file, the input does not generate a searchable PDF file based on the input PDF file. The information processing apparatus according to claim 3, wherein the stored PDF file is stored.

A second extracting unit is further provided for extracting a plurality of files including saved searchable PDF files as a search target, and extracting a file in which a character string in the search target file includes a keyword designated by the user And
4. The information processing apparatus according to claim 3, wherein a list of files extracted by the second extraction unit is displayed on a display unit.

The information processing apparatus is communicably connected to the image forming apparatus.
The print data generation method further includes generation means for generating print data based on the selected file in response to receiving a user operation for printing a file selected from the list, and the print data generated by the generation means The information processing apparatus according to claim 7, wherein the information processing apparatus transmits the image data to the image forming apparatus.

When the first determination means determines that the PDF file input to the information processing apparatus is a searchable PDF file, the user is asked whether the searchable PDF file is to be recreated based on the input PDF file. The information processing apparatus according to claim 3, wherein a prompt for causing selection is displayed on the display unit.

When recreating a searchable PDF file based on the PDF file input to the information processing apparatus, the information processing apparatus further includes a third determination unit that determines whether the transparent character string in the input PDF file is irrelevant to the content of the image. The information processing apparatus according to claim 9, characterized in that:

10. The information processing apparatus according to claim 9, wherein a prompt for causing the user to select whether to delete a transparent character string unrelated to the content of the image is displayed on the display unit.

10. The information processing apparatus according to claim 9, wherein only a character string having a high recognition rate is replaced with a transparent character string.

The first determination means determines the total number of character areas in the image object included in the file, and of the character areas, the character area whose position and size match the position and size of the transparent character object in the file. The information processing apparatus according to any one of claims 1 to 12, wherein whether or not the file is a searchable file is determined based on the number.

An extraction step of extracting a character area in an image object included in the file;
The file includes an image and a result of character recognition corresponding to the image based on the positional relationship between the character area extracted in the extracting step and the area where the character object included in the file is drawn. A determination step of determining whether or not the file is a searchable file;
Generating a file including a result of the character recognition process by executing character recognition processing on the image object in response to the determination that the file is not a searchable file in the determination step; A method characterized by having.

A program for making a computer execute the method according to claim 14.