JP2019068323A - Device for setting file name for scanned image, control method thereof and program

Info

Publication number: JP2019068323A
Application number: JP2017193615A
Authority: JP
Inventors: 憲鴇田; Ken Tokita
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2017-10-03
Filing date: 2017-10-03
Publication date: 2019-04-25

Abstract

To enhance user convenience when setting a file name or the like by using a character string obtained by performing OCR processing on a scanned image. SOLUTION: A device for performing predetermined processing on a scanned image obtained by scanning a document includes display control means for displaying a UI screen for performing the predetermined processing, on which a unit area presumed to be a continuous character string in the scanned image is displayed so as to be distinguishable by a user, and setting means for performing OCR processing on the unit area selected by the user via the UI screen and setting incidental information for the predetermined processing by using the character string extracted by the OCR processing. The display control means displays the unit area on the UI screen so as to be distinguishable by the user according to the size of the unit area in the direction intersecting the direction along the text direction of the document. SELECTED DRAWING: Figure 10

Description

The present invention relates to a technique for setting a file name or the like for a scanned image by using the character recognition results of that scanned image.

Scanning paper documents to digitize them has long been widely used as a document management method. One known technique performs character recognition processing (OCR processing) on the scanned image data when a document is digitized and uses the recognized characters in the file name of the digitized document file. For example, in the document scanner described in Patent Document 1, when a user designates a position in an image obtained by scanning a document, an extraction area is determined based on that position. OCR processing is then performed on the extraction area to acquire metadata (character string information).

Japanese Translation of PCT International Application Publication No. 2007-503032

However, in Patent Document 1, the extraction area that is determined may contain characters other than the character string intended by the user, or may contain only part of the intended character string, so the appropriate character string intended by the user sometimes cannot be acquired. Alternatively, the user could be allowed to select any of the character strings in the image. In that case, if the user selects a character string by mistake, the user may have to delete the erroneously selected character string and then select the desired character string again. In either case, user convenience may be reduced.

The present invention has been made in view of the above problems, and its object is to improve user convenience when a file name or the like is set using a character string obtained by performing OCR processing on a scanned image.

An apparatus according to the present invention that solves the above problem is an apparatus that performs predetermined processing on a scanned image obtained by scanning a document, the apparatus comprising: display control means for displaying a UI screen for performing the predetermined processing, on which unit areas each presumed to be a continuous character string in the scanned image are displayed so as to be identifiable by a user; and setting means for performing OCR processing on a unit area selected by the user via the UI screen and setting incidental information for the predetermined processing by using the character string extracted by the OCR processing, wherein the display control means displays each unit area on the UI screen so as to be identifiable by the user according to the size of the unit area in a direction intersecting the direction along the text direction of the document.

According to the present invention, user convenience can be improved when a file name or the like is set using a character string obtained by performing OCR processing on a scanned image.

Diagram showing the overall configuration of the image processing system
Hardware configuration diagram of the MFP
Hardware configuration diagram of the file server
Software configuration diagram of the MFP
Flowchart showing the flow of operation control from generation of a scanned image to upload
Diagram showing an example of the scan setting screen
Diagram showing an example of the preview screen
Diagram showing the preview screen in a state where a file name has been generated
Diagram showing an example of the upload setting screen
Flowchart showing details of the file name generation processing
Flowchart showing details of the character string area display processing
Diagram showing an example of the preview screen according to the second embodiment
Flowchart showing details of the file name generation processing
Flowchart showing the display processing of non-displayed character string areas according to the third embodiment

Hereinafter, embodiments of the present invention will be described with reference to the drawings. The following embodiments do not limit the invention according to the claims, and not all combinations of the features described in the embodiments are essential to the solution provided by the invention.

[First Embodiment]
<Configuration of Image Processing System>
FIG. 1 is a diagram showing the overall configuration of an image processing system according to the present embodiment. The image processing system includes an MFP 110 and a file server 120, which are communicably connected to each other via a LAN (Local Area Network).

The MFP (Multi Function Printer) 110 is a multifunction peripheral having a plurality of functions, such as a scanner and a printer, and is an example of an image processing apparatus. The file server 120 is an example of an external server that stores and manages digitized document files. The image processing system of the present embodiment consists of the MFP 110 and the file server 120, but is not limited to this configuration. For example, the MFP 110 may also serve as the file server 120. The connection may also be made via the Internet or the like instead of the LAN. In addition, the MFP 110 is connected to a PSTN (Public Switched Telephone Network) and can exchange image data by facsimile with a facsimile apparatus (not shown).

<Hardware Configuration of MFP>
FIG. 2 is a hardware configuration diagram of the MFP 110. The MFP 110 includes a control unit 210, an operation unit 220, a printer unit 221, a scanner unit 222, and a modem 223. The control unit 210 includes the units 211 to 219 described below and controls the overall operation of the MFP 110. The CPU 211 reads out the control program stored in the ROM 212 and executes and controls the various functions of the MFP 110, such as reading, printing, and communication. The RAM 213 is used as a temporary storage area, such as the main memory and work area of the CPU 211. In the present embodiment, one CPU 211 executes each process shown in the flowcharts described later using one memory (the RAM 213 or the HDD 214), but other configurations are possible. For example, a plurality of CPUs and a plurality of RAMs or HDDs may cooperate to execute each process. The HDD 214 is a large-capacity storage unit that stores image data and various programs.

The operation unit I/F 215 is an interface that connects the operation unit 220 and the control unit 210. The operation unit 220 is provided with a liquid crystal display unit having a touch panel function, a keyboard, and the like, and serves as a reception unit that receives operations, inputs, and instructions from the user. These user operations may be received as touches on the liquid crystal panel or as operations of the keyboard, buttons, and the like. The printer I/F 216 is an interface that connects the printer unit 221 and the control unit 210. Image data for printing is transferred from the control unit 210 to the printer unit 221 via the printer I/F 216 and printed on a recording medium. The scanner I/F 217 is an interface that connects the scanner unit 222 and the control unit 210. The scanner unit 222 reads a document set on a document table or an ADF (Auto Document Feeder), neither of which is shown, to generate image data, and inputs the image data to the control unit 210 via the scanner I/F 217.

The MFP 110 can print out (copy) the image data generated by the scanner unit 222 from the printer unit 221, and can also send it as a file or by e-mail. The modem I/F 218 is an interface that connects the modem 223 and the control unit 210. The modem 223 exchanges image data by facsimile with a facsimile apparatus on the PSTN. The network I/F 219 is an interface that connects the control unit 210 (MFP 110) to the LAN. The MFP 110 uses the network I/F 219 to transmit image data and information to an external device on the LAN (such as the file server 120) and to receive various information.

<Hardware Configuration of File Server>
FIG. 3 is a hardware configuration diagram of the file server 120. The file server 120 includes a CPU 311, a ROM 312, a RAM 313, an HDD 314, and a network I/F 315. The CPU 311 controls the overall operation of the file server 120 by reading out the control program stored in the ROM 312 and executing various processes. The RAM 313 is used as a temporary storage area, such as the main memory and work area of the CPU 311. The HDD 314 is a large-capacity storage unit that stores image data and various programs. The network I/F 315 is an interface that connects the file server 120 to the LAN. The file server 120 uses the network I/F 315 to exchange various information with other devices on the LAN (for example, the MFP 110).

<Software Configuration of MFP>
FIG. 4 is a software configuration diagram of the MFP 110. The software of the MFP 110 is broadly divided into two parts: a native function module 410 and an additional function module 420. The units included in the native function module 410 are function modules provided in the MFP 110 as standard. In contrast, the units included in the additional function module 420 are function modules additionally provided in the MFP 110, for example function modules realized by installing an application. The additional function module 420 in the present embodiment is a Java (registered trademark) based application, which makes it easy to add functions to the MFP 110. Other applications (not shown) may also be installed in the MFP 110. The units that make up the two function modules 410 and 420 are described below.

The application display unit 423 performs display control to display, on the touch panel of the operation unit 220, a UI screen for receiving various operations by the user. Details of the UI screen to be displayed will be described later. The scan instruction unit 421 transmits a scan request containing scan setting and transfer setting information to the scan execution unit 411 in accordance with a user instruction input via the UI screen, and causes the scan execution unit 411 to execute scan processing. The scan request also includes a request ID used to identify whether image data received by the application reception unit 422, described later, corresponds to that scan request.
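The role of the request ID can be illustrated with a minimal sketch. The class name `ScanRequest`, its field names, the example settings, and the use of a random hexadecimal ID are all assumptions made for illustration; the source only states that the request carries scan settings, transfer settings, and a request ID.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class ScanRequest:
    """Hypothetical scan request: settings plus an ID used to match results."""
    scan_settings: dict
    transfer_settings: dict
    # Assumption: the ID format is not specified in the source; a random hex
    # string is used here only so that each request is distinguishable.
    request_id: str = field(default_factory=lambda: uuid.uuid4().hex)


def matches_request(request: ScanRequest, received_request_id: str) -> bool:
    """Return True when received image data belongs to the given request."""
    return request.request_id == received_request_id


request = ScanRequest(
    scan_settings={"color": "mono", "resolution_dpi": 300},
    transfer_settings={"protocol": "ftp"},
)
print(matches_request(request, request.request_id))  # True
print(matches_request(request, "some-other-id"))     # False
```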

The scan execution unit 411 receives a scan request from the scan instruction unit 421, controls the scanner unit 222 via the scanner I/F 217, and executes scan processing. As a result, the document is read and image data is generated. The image data generated by the scan (hereinafter referred to as "scan image data") is passed to the transfer unit 412 together with the transfer setting information that specifies its transfer destination. The transfer unit 412 transfers the scan image data received from the scan execution unit 411 in accordance with the transfer setting. The file server 120, a PC (not shown) on the LAN, or the like can be set as the transfer destination of the scan image data. In the present embodiment, however, all scan image data generated by the scan execution unit 411 is assumed to be set to be transferred first to the additional function module 420. The transfer unit 412 has an FTP (File Transfer Protocol) client function and transfers the scan image data by FTP to the application reception unit 422, which has an FTP server function.

The application reception unit 422 receives the scan image data internally transferred from the transfer unit 412 and passes it to the application transfer unit 424. The application transfer unit 424 passes the received scan image data to the image analysis unit 425.

The image analysis unit 425 performs image analysis processing on the received scan image data, such as identifying character string areas, recognizing character strings, and identifying the text direction. The scan image data and its analysis result are then passed to the preview display unit 426 via the application transfer unit 424. Based on the scan image data and its analysis result, the preview display unit 426 performs display control such as generating a preview screen, which is a UI screen on which the user sets the file name, and displaying it on the touch panel of the operation unit 220. Details of the preview screen to be displayed will be described later.

The upload instruction unit 427 generates a UI screen on which the user sets the folder path, and displays it on the touch panel of the operation unit 220. The application transfer unit 424 transfers (transmits) the scan image data to the file server 120. When the transfer is completed, the application transfer unit 424 notifies the application display unit 423 of the completion. On receiving the notification from the application transfer unit 424, the application display unit 423 updates the display content. The application transfer unit 424 also has an SMB (Server Message Block) client function, which it uses to perform file and folder operations over SMB on the file server 120, which has an SMB server function. The protocol used is not limited to SMB and may be WebDAV, FTP, SMTP, or the like. Protocols not intended for file transmission, such as SOAP and REST, can also be used.

Next, the operation control performed when the MFP 110 generates a scanned image from a document and uploads it to the file server 120 will be described. FIG. 5 is a flowchart showing the flow of control from generation of scan image data to upload. This series of processes is realized by the control unit 210 executing a control program stored in the HDD 214. The details are described below.

In step 501, the application display unit 423 displays, on the touch panel of the operation unit 220, a UI screen for making various settings for scan processing (hereinafter referred to as the "scan setting screen"). FIG. 6 is a diagram showing an example of the scan setting screen. The scan setting screen 600 of FIG. 6 has five setting buttons 601 to 605. The [Color Setting] button 601 is used to set color or monochrome for scanning a document. The [Resolution Setting] button 602 is used to set the resolution for scanning a document. The [Duplex Reading Setting] button 603 is used when both sides of a document are to be scanned. The [Mixed Original Setting] button 604 is used when documents of different sizes are to be scanned together. The [Image Format Setting] button 605 is used to specify the storage format of the scan image data. When a setting is made using the setting buttons 601 to 605, the candidates (options) that can be set within the range supported by the MFP 110 are displayed, and the user selects the desired one from the displayed candidates. The setting buttons described above are only examples; not all of these setting items need to be present, and other setting items may be present.

The user makes detailed settings for scan processing via the scan setting screen 600. The [Cancel] button 620 is used to cancel the scan settings. The [Start Scan] button 621 is used to instruct the start of scan processing for the document set on the document table or the like.

In step 502, the application display unit 423 branches the processing according to the button operated by the user. When pressing of the [Start Scan] button 621 is detected, the application display unit 423 notifies the scan instruction unit 421 of the contents set with the setting buttons 601 to 605 and of the acceptance of the instruction to start scan processing. When pressing of the [Cancel] button 620 is detected, this processing ends.

In step 503, the scan execution unit 411 executes scan processing in response to the instruction from the scan instruction unit 421. In step 504, the scan image data thus obtained is internally transferred by FTP to the application reception unit 422 through the transfer unit 412, and is then sent to the image analysis unit 425 via the application transfer unit 424.

In step 505, the image analysis unit 425 executes analysis processing of the scan image data. Specifically, the scan image data is first corrected into a state that is easy to analyze by binarizing the image signal, correcting any skew of the document that occurred during scanning, rotating the image so that the document is upright, and so on. Edge extraction or the like is then performed on the corrected scanned image (binary image) to identify the character string areas in the image, that is, the blocks (unit areas) each presumed to be a continuous character string. For each character string area, the coordinates and the sizes in the width direction (horizontal direction) and the height direction (vertical direction) are identified. The width direction (horizontal direction) of a character string area is the direction along the text direction of the document. The height direction (vertical direction) of a character string area is a direction intersecting the direction along the text direction of the document, for example the direction orthogonal to it. Whether the text in the document is written horizontally or vertically is also identified. One method that can be used for this is to take the vertical and horizontal projections of the scanned image (binary image) and determine the one with the lower variance to be the line direction.

Table 1 below shows an example of the result of performing the image analysis processing on part of the scanned image of a quotation.
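The projection-based text direction test mentioned in step 505 can be sketched as follows. This is a minimal illustration rather than the implementation used in the embodiment: the function name `text_direction`, the list-of-rows image representation, and the reading of "projection" as the ink-count profile along each axis are assumptions.

```python
from statistics import pvariance


def text_direction(binary_image):
    """Guess the line direction of a binarized page.

    binary_image is a list of rows; 1 marks a black (ink) pixel.
    Returns "horizontal" or "vertical".
    """
    # Profile along the X axis: ink count in each column.
    column_profile = [sum(col) for col in zip(*binary_image)]
    # Profile along the Y axis: ink count in each row.
    row_profile = [sum(row) for row in binary_image]
    # Text lines alternate with blank gaps across the line direction, so the
    # profile taken along the line direction is the flatter (lower-variance) one.
    if pvariance(column_profile) <= pvariance(row_profile):
        return "horizontal"
    return "vertical"


# Toy page (made-up data): two horizontal text lines separated by blank rows.
page = [
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 0, 0, 0, 0],
]
print(text_direction(page))  # horizontal
```

Transposing the toy page swaps the two profiles, so the same function reports "vertical" for it.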

In Table 1 above, [Number] is a number that uniquely identifies each identified character string area. In this example, serial numbers 1 to 10 are assigned in order of recognition. The coordinate system has its origin (0, 0) at the upper left of the preview area 710, with the X axis pointing right and the Y axis pointing down. [X Coordinate of Area] is the X coordinate of the upper-left corner of each identified character string area. [Y Coordinate of Area] is the Y coordinate of the upper-left corner of each identified character string area. Hereinafter, unless otherwise specified, the "coordinates" of a character string area mean the position coordinates of its upper-left corner. [Width of Area] is the distance from the left side to the right side of each identified character string area. [Height of Area] is the distance from the top side to the bottom side of each identified character string area. In the present embodiment, [X Coordinate of Area], [Y Coordinate of Area], [Width of Area], and [Height of Area] are all expressed in pixels, but they may be expressed in points, inches, or the like. The information on each character string area extracted from the scanned image (hereinafter referred to as "character string area information") is passed to the application transfer unit 424 as image analysis data.

The image analysis data is, for example, in CSV or XML format, but may be in another format. It may also be stored temporarily in the HDD 214 and passed on at a predetermined timing.
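As a rough illustration of character string area information carried as CSV, the sketch below parses such data into records. The column names, the class name `CharStringArea`, and the sample values are assumptions made for this example and are not taken from Table 1.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class CharStringArea:
    number: int   # serial number assigned in order of recognition
    x: int        # X coordinate of the upper-left corner (pixels)
    y: int        # Y coordinate of the upper-left corner (pixels)
    width: int    # distance from the left side to the right side (pixels)
    height: int   # distance from the top side to the bottom side (pixels)


def parse_areas(csv_text):
    """Parse CSV text with a header row into CharStringArea records."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        CharStringArea(**{key: int(value) for key, value in row.items()})
        for row in reader
    ]


# Made-up sample data; the header names are hypothetical.
sample = "number,x,y,width,height\n1,55,5,20,10\n2,5,20,30,5\n"
areas = parse_areas(sample)
print(areas[0])
```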

In step 506, the application transfer unit 424 displays a preview of the scanned image read in step 503 on the touch panel of the operation unit 220. The screen data for this preview display is generated and provided by the application display unit 423. The user can set the file name of the scan image data via the UI screen on which the scanned image is previewed (hereinafter referred to as the "preview screen"). FIG. 7(a) is a diagram showing an example of the preview screen.

On the preview screen 700, the preview area 710 at the center of the screen displays, together with the scanned image, a plurality of buttons 711 to 714 for changing its display state. The buttons 711 and 712 appear when the entire scanned image cannot be displayed, and are used to scroll the display area vertically. The touch panel of the MFP 110 is usually not very large. For this reason, when the scanned image is, for example, of an A4 portrait document written horizontally, the initial setting is to reduce the image and align it to the top so that its entire width (short side) just fits in the preview area 710. In other words, in the initial setting, the lower part of the A4 portrait scanned image is not displayed in the preview area 710. In this case, pressing the "↓" button 712 scrolls the display area down so that the lower part can be displayed. If the scanned image is, for example, A4 landscape or A3, buttons for scrolling the display area horizontally may also be provided. The buttons 713 and 714 are used to enlarge and reduce the display area: pressing the "+" button 713 zooms in, and pressing the "-" button 714 zooms out. These button operations may instead be realized by finger operations on the preview screen, such as swipe, pinch-out, and pinch-in. In addition, the preview area 710 displays the character string areas identified by the image analysis processing in step 505, based on the character string area information described above, in a form that the user can identify and select. FIG. 7(b) is a diagram showing the preview screen 700 of FIG. 7(a) in a state where each character string area is displayed identifiably by an enclosing frame.

Details of the character string area display processing will be described later. The user selects (for example, touches with a finger) any character string area from among the character string areas displayed identifiably in the preview area 710. In response to this selection, the character string contained in the area is displayed, that is, automatically entered, in the file name input field 701 and becomes part of the character string that makes up the file name. The [Back] button 720 is used to cancel the preview display. The [Next] button 721 is used to move to the screen for setting the upload destination of the read scan image data. The button 702 is used to set the file name format and the like. The types of buttons and the ways of displaying and selecting the character string areas described above are only examples and are not limiting. For example, there may be buttons for correcting or changing the character string displayed in the file name input field 701 or for confirming the file name.
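Selecting a character string area by touch amounts to a point-in-rectangle test against the character string area information. The following is a minimal sketch under the assumption that touch positions are reported in the same coordinate system as the areas (origin at the upper left of the preview area 710); the function name `area_at`, the tuple layout, and the sample values are hypothetical.

```python
def area_at(areas, touch_x, touch_y):
    """Return the first area containing the touch point, or None.

    Each area is a tuple (number, x, y, width, height), where (x, y) is the
    upper-left corner and all values are in pixels.
    """
    for area in areas:
        number, x, y, width, height = area
        if x <= touch_x < x + width and y <= touch_y < y + height:
            return area
    return None


# Made-up areas for illustration only.
areas = [(1, 55, 5, 20, 10), (2, 5, 20, 30, 5)]
print(area_at(areas, 60, 8))     # (1, 55, 5, 20, 10)
print(area_at(areas, 100, 100))  # None
```

A touch that falls outside every area returns `None`, which a caller could simply ignore so that stray touches do not change the file name.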

In step 507, the preview display unit 426 generates a file name for the read scanned image. FIG. 8 shows the state of the preview screen 700 after the file name has been generated. In this example, the character string areas corresponding to "Quotation" (見積書), "Tokyo Corporation" (東京株式会社), and "April 14, 2017" (2017年04月14日) have been selected in sequence, so that the character string "Quotation_Tokyo Corporation_April 14, 2017" is displayed and set in the file name input field 701. In the preview area 710, the frame around each character string area touched by the user has changed to a thick frame to indicate that the area has been selected as part of the file name. Details of this file name generation processing will be described later. When the desired file name has been generated and the user presses the [Next] button 721, the process proceeds to step 508.

In step 508, as in step 502, the preview display unit 426 branches the processing according to the button operation performed by the user. When pressing of the [Next] button 721 is detected, the file name being displayed in the file name input field 701 (the character string set as the file name) is sent to the upload instruction unit 427, and the process proceeds to step 509. On the other hand, when pressing of the [Back] button 720 is detected, the process returns to step 501 (display of the scan setting screen).

In step 509, the upload instruction unit 427 displays, on the touch panel of the operation unit 220, a UI screen for setting the transfer destination of the scanned image data (hereinafter referred to as the "upload setting screen"). Through this upload setting screen, the user makes detailed settings for uploading to the file server 120. FIG. 9 shows an example of the upload setting screen. In the [Folder path] input field 901, the user enters the name of the folder path (path name) to be used for external transfer to the file server 120. In the example of FIG. 9, "2017_04_14" has been entered as the path name. As an input method, for example, a sub-window (not shown) of a keyboard screen may be displayed in response to a tap on the [Folder path] input field 901, and the user may enter the path name via the keyboard screen. Alternatively, the path name may be selected from an address book held in the HDD 214. The [Back] button 920 is used to cancel the detailed upload settings. The [Upload] button 921 is used to instruct uploading to the folder path set in the [Folder path] input field 901.

In step 510, as in step 508, the upload instruction unit 427 branches the processing according to the button operation performed by the user. When pressing of the [Upload] button 921 is detected, the process proceeds to step 511. At this time, the path name entered in the [Folder path] input field 901, the file name generated in step 507, and the file server settings constitute the information necessary for storing the scanned image data in the file server 120. Specifically, the file server settings include information such as the host name, the starting point of the folder path, and the user name and password for login. On the other hand, when pressing of the [Back] button 920 is detected, the process returns to step 506 (display of the preview screen).

In step 511, the application transfer unit 424 generates the storage destination path of the scanned image data based on the information received from the upload instruction unit 427. Specifically, the path is generated by appending the folder path to the file server settings (the host name of the file server 120 and the starting point of the folder path). As a result, a storage destination path such as "\\server01\Share\2017_04_14" is generated. Then, in step 512, the application transfer unit 424 accesses the file server 120. At this time, the user name and password included in the file server settings are transmitted to the file server 120. On receiving the user name and password, the file server 120 executes authentication processing.
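The path assembly in step 511 can be illustrated with a minimal sketch. It assumes the file server settings supply a host name and a starting folder as plain strings; the function name `build_storage_path` and its parameters are hypothetical and do not appear in the specification.

```python
def build_storage_path(host: str, root_folder: str, folder_path: str) -> str:
    """Join host, starting folder, and user-entered folder path into a UNC-style path.

    All names here are illustrative assumptions, not part of the described device.
    """
    # Remove stray separators so the components can be joined uniformly.
    parts = [p.strip("\\/") for p in (host, root_folder, folder_path)]
    # Skip empty components so a blank folder path leaves no trailing separator.
    return "\\\\" + "\\".join(p for p in parts if p)


if __name__ == "__main__":
    print(build_storage_path("server01", "Share", "2017_04_14"))
```

With the values from the example above, the sketch reproduces the storage destination path quoted in the text.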

In step 513, the application transfer unit 424 branches the processing according to the authentication result from the file server 120. That is, when a notification of successful authentication is received from the file server 120, the process proceeds to step 514. On the other hand, when a notification of failed authentication is received, this processing ends.

In step 514, the application transfer unit 424 externally transfers the scanned image data to the folder indicated by the storage destination path generated in step 511, and the data is stored in the file server 120.

The above is the operation control from generation of a scanned image to its upload according to the present embodiment. In the present embodiment, the processing in steps 505 to 507 is assumed to be performed on one page of image data generated by scanning. However, for example, a button for performing image analysis of the next page may be provided in the preview screen 700, the next page obtained by that analysis may be previewed, and character strings constituting the file name may be set from character string areas on the second and subsequent pages.

<Generation of file name>
FIG. 10 is a flowchart showing details of the file name generation processing (step 507) in the preview display unit 426. The following description follows the flow of FIG. 10.

In step 1001, the preview screen 700 displayed on the touch panel is monitored for a touch operation by the user. When a touch operation is detected, the process proceeds to step 1002. In step 1002, the processing branches according to the content of the touch operation. If the touch operation is pressing of the [Next] button 721 or the [Back] button 720, this processing ends. Otherwise, the process proceeds to step 1003.

In step 1003, the position coordinates (x, y) of the touch operation on the touch panel are acquired. In step 1004, it is determined whether the touched position coordinates overlap any of the character string areas displayed in a manner selectable and identifiable by the user. For example, the determination is made according to whether the touched position coordinates fall inside any character string area displayed in that manner in the preview area 710, that is, inside the rectangle specified by the position coordinates of the four corners of the character string area. If the touched position coordinates overlap one of these character string areas, the process proceeds to step 1005. If not, the process returns to step 1001.
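The containment check in step 1004 can be sketched as follows. This is only an illustration under assumptions: each character string area is taken to be an axis-aligned rectangle given by its top-left corner, width, and height, and the names `Region` and `find_touched_region` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Region:
    """Axis-aligned rectangle of one character string area (assumed representation)."""
    x: int
    y: int
    width: int
    height: int

    def contains(self, px: int, py: int) -> bool:
        # A touch on the border is treated as inside the area.
        return (self.x <= px <= self.x + self.width
                and self.y <= py <= self.y + self.height)


def find_touched_region(regions: Sequence[Region], px: int, py: int) -> Optional[Region]:
    """Return the first selectable area containing the touch point, or None."""
    for region in regions:
        if region.contains(px, py):
            return region
    return None
```

Only the areas currently displayed as selectable would be passed in `regions`; a `None` result corresponds to returning to step 1001.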

In step 1005, the character string of the character string area overlapping the touched position coordinates is acquired. The character string is acquired by having the image analysis unit 425 execute OCR processing on the character string area selected by the touch operation (hereinafter referred to as the "selected character string area") and extract the character string. Then, in step 1006, the file name currently displayed in the file name input field 701 is acquired. In step 1007, it is determined whether the acquired file name is "empty". Here, an "empty" file name means a blank state in which no character string is displayed in the file name input field 701. If the acquired file name is "empty", the process proceeds to step 1009. If it is not "empty", the process proceeds to step 1008.

In step 1008, a predetermined delimiter is appended to the end of the file name acquired in step 1006. An underscore is used here as an example of the predetermined delimiter, but the delimiter is not limited to this. For example, a symbol or character other than an underscore, such as a hyphen, may be used, and even something without a visible form, such as a space, may be used.

In step 1009, the character string acquired in step 1005 (the character string extracted from the selected character string area) is set as a component of the file name. If a character string has already been set, the new character string is appended to its end. Then, in step 1010, the character string set at that point is displayed (automatically input) in the file name input field 701.

The above is the file name generation processing. Through this processing, the file name of the scanned image is generated with a delimiter inserted between the character strings of the plurality of character string areas selected by the user.
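The append logic of steps 1006 to 1009 can be sketched in a few lines. This is a minimal illustration, assuming the OCR result is already available as a string; the function name `append_to_file_name` is hypothetical, and the sample strings are English stand-ins for the text on the scanned document.

```python
def append_to_file_name(current: str, extracted: str, delimiter: str = "_") -> str:
    """Append one extracted character string to the file name being built."""
    if current:
        # Step 1007 "not empty": step 1008 adds the delimiter first.
        current += delimiter
    # Step 1009: the extracted string becomes the next component.
    return current + extracted


if __name__ == "__main__":
    name = ""
    for text in ("Quotation", "Tokyo Corporation", "April 14, 2017"):
        name = append_to_file_name(name, text)
    print(name)
```

Because the delimiter is added only when the field already holds text, the first selection never produces a leading delimiter.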

<Display of character string area>
FIG. 11 is a flowchart showing details of the preview screen display processing (step 506) in the preview display unit 426. The following description follows the flow of FIG. 11.

In step 1101, the preview display unit 426 acquires, via the application transfer unit 424, the information on the text direction identified by the image processing in step 505.

In step 1102, the preview display unit 426 acquires the display condition for character string areas stored in advance in the HDD 214 or the like. The display condition is a condition for determining whether a character string area is to be displayed on the preview screen. For example, the aspect ratio of the character string area can be used as the determination value for display or non-display (for example, as an upper limit, a lower limit, or a reference value). The determination value can be changed according to the document size.

The determination value differs depending on the text direction. When the text (character string) is written horizontally, the value obtained by dividing the size of the character string area in the width (horizontal) direction by its size in the height (vertical) direction is used as the determination value. When the text is written vertically, the value obtained by dividing the size of the character string area in the height direction by its size in the width direction is used. In the present embodiment, the upper limit for displaying a character string area on the preview screen is 15. The reason is that, assuming the width and height of one character are almost the same, an area with this ratio can be expected to contain about 15 characters, and a character string of 15 characters is likely to be running text such as the body of the document and unlikely to be used in a file name. The determination value may, for example, be stored in advance in the HDD 214. In this case, the determination value can also be set according to the document size. Alternatively, a sub-window (not shown) of a display condition setting screen may be displayed in response to pressing of a setting button on the scan setting screen, and the user may enter the determination value via that screen.

In step 1103, the preview display unit 426 determines one character string area from among the plurality of character string areas obtained by the image analysis unit 425.

In step 1104, for the character string area determined in step 1103, the preview display unit 426 acquires, via the application transfer unit 424, the character string area information identified by the image processing in step 505.

In step 1105, the preview display unit 426 calculates the aspect ratio of the character string area based on the text direction information acquired in step 1101 and the character string area information acquired in step 1104. When the scanned image is obtained by reading a horizontally written document, the aspect ratio can be calculated by dividing the size of the character string area in the width (horizontal) direction by its size in the height (vertical) direction. When the scanned image is obtained by reading a vertically written document, the aspect ratio can be calculated by dividing the size of the character string area in the height (vertical) direction by its size in the width (horizontal) direction. For example, the aspect ratio of the character string area corresponding to number 1 in Table 1 is obtained by dividing the width of 20 by the height of 10, which gives 2. On the other hand, the aspect ratio of the character string area corresponding to number 10 in Table 1 is obtained by dividing the width of 80 by the height of 5 and is given as 18.
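The direction-dependent calculation in step 1105 can be written as a small helper. This is a sketch under assumptions: the text direction is represented by the strings "horizontal" and "vertical", and the function name `aspect_ratio` is hypothetical.

```python
def aspect_ratio(width: float, height: float, text_direction: str = "horizontal") -> float:
    """Ratio of the area's size along the text direction to its size across it."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    if text_direction == "horizontal":
        return width / height   # horizontally written text: width / height
    if text_direction == "vertical":
        return height / width   # vertically written text: height / width
    raise ValueError(f"unknown text direction: {text_direction!r}")


if __name__ == "__main__":
    # Area number 1 in the text: width 20, height 10.
    print(aspect_ratio(20, 10))
```

Dividing the size along the text direction by the size across it keeps the value comparable between horizontally and vertically written documents, so the same upper limit can be applied to both.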

In step 1106, the preview display unit 426 determines whether the character string area satisfies the character string area display condition for being displayed on the touch panel in a manner selectable and identifiable by the user. This determination is made based on the result calculated in step 1105 and the character string area display condition acquired in step 1102. The following describes, as an example, the case where the determination value acquired in step 1102 is used as the upper limit for displaying a character string area identifiably to the user. If the aspect ratio calculated in step 1105 satisfies the display condition, that is, if it is equal to or smaller than the determination value acquired in step 1102, the process proceeds to step 1107. If the aspect ratio does not satisfy the display condition, that is, if it is larger than the determination value, the process skips step 1107, in which the character string area is displayed identifiably with a surrounding frame on the preview screen, and proceeds to step 1108. For example, when the determination value is 15, as shown in Table 2 below, the areas numbered 1 to 9 have an aspect ratio (width / height) of 15 or less and are determined to satisfy the display condition. In contrast, the area numbered 10 has an aspect ratio (width / height) of 18, which is larger than 15, and is determined not to satisfy the display condition. When the condition is not satisfied in step 1106, no surrounding frame is displayed on the preview screen 700 at a location where a character string is written, such as the one indicated by reference numeral 722.

In step 1107, the preview display unit 426 displays the character string area determined in step 1103 in the preview area 710 of the preview screen 700 in a manner selectable and identifiable by the user. For example, a frame surrounding the character string area is displayed, as shown in the preview area 710 of FIG. 8.

In step 1108, the preview display unit 426 determines whether any unprocessed character string area remains. If one remains, the process returns to step 1103, and the processing of steps 1104 to 1108 is performed. If none remains, this processing ends.

Because character string areas are displayed on the preview screen in a selectable and identifiable manner based on the character string area display condition, the character string areas displayed on the preview screen can be limited compared with the case where no such condition exists, which improves convenience for the user.
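The loop of steps 1103 to 1108 amounts to filtering the detected areas by the upper limit. The sketch below is an illustration only: areas are represented as hypothetical (width, height) pairs rather than the actual entries of Table 1, and the function name `selectable_regions` is an assumption.

```python
from typing import List, Sequence, Tuple


def selectable_regions(
    regions: Sequence[Tuple[float, float]],
    text_direction: str = "horizontal",
    upper_limit: float = 15.0,
) -> List[Tuple[float, float]]:
    """Return the (width, height) areas that would be drawn with a selectable frame."""
    shown = []
    for width, height in regions:          # step 1103: take the next area
        if text_direction == "horizontal":  # step 1105: direction-dependent ratio
            ratio = width / height
        else:
            ratio = height / width
        if ratio <= upper_limit:            # step 1106: display condition
            shown.append((width, height))   # step 1107: display with a frame
    return shown                            # step 1108: no unprocessed area remains


if __name__ == "__main__":
    # Hypothetical sizes: ratios of 2, 18, and 15 respectively.
    print(selectable_regions([(20, 10), (90, 5), (75, 5)]))
```

An area whose ratio equals the upper limit is still shown, matching the "equal to or smaller than" wording of step 1106.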

Here, the process by which the file name "Quotation_Tokyo Corporation_April 14, 2017" shown in FIG. 8 is generated will be described with reference to Table 1 above. First, the serial numbers 1 to 10 in Table 1 are associated with the character string areas as follows.
1: "Quotation" (見積書)
2: "Tokyo Corporation" (東京株式会社)
3: "Messrs." (御中)
4: "Quotation number" (見積もり番号)
5: "R-1234"
6: "Desired delivery date" (納品希望日)
7: "April 14, 2017" (2017年04月14日)
8: "Yokohama Corporation" (横浜株式会社)
9: "1000-1 Yokohama City, Kanagawa Prefecture" (神奈川県横浜市1000-1)
10: "Thank you very much for your continued patronage." (平素は格別のご高配を賜り厚く御礼申し上げます。)

First, when image analysis processing is performed on the scanned image data and the character string areas are acquired, the aspect ratios of all the acquired character string areas are calculated. Then, only the character string areas that satisfy the character string display condition are displayed in the preview area of the preview screen in a manner selectable and identifiable by the user. If the display condition is that the aspect ratio of the character string area is 15 or less, a frame surrounding the character string area, which is one example of a selectable and identifiable manner of display, is displayed in the preview area for each of the character string areas numbered 1 to 9. In contrast, the character string area numbered 10 has an aspect ratio of 18 and does not satisfy the display condition, so no surrounding frame is displayed for it in the preview area.

Next, when a touch operation on the touch panel is detected, the character string area overlapping the touched coordinates is determined (steps 1001 to 1004). When a touch operation on the "Quotation" character string area is detected, OCR processing is performed on that area and the characters "Quotation" are acquired (steps 1005 to 1006). At this stage, no character string is displayed in the file name input field 701 and it is "empty" (Yes in step 1007), so "Quotation" is displayed in the file name input field 701 (step 1010). In the preview area 710, the frame around the "Quotation" character string area changes to a thick frame to indicate that the area has been selected as part of the file name by the user's touch operation.

Next, when a touch operation on the "Tokyo Corporation" character string area is detected (steps 1001 to 1004), OCR processing is performed on that area and the characters "Tokyo Corporation" are acquired (steps 1005 to 1006). At this stage, the characters "Quotation" are displayed in the file name input field 701 and it is not "empty" (No in step 1007). Therefore, in the file name input field 701, the underscore "_" serving as the delimiter is appended after the characters "Quotation" (step 1008). At this point the file name is "Quotation_". The characters "Tokyo Corporation" are then appended (step 1009), "Quotation_Tokyo Corporation" is displayed in the file name input field 701 (step 1010), and the file name becomes "Quotation_Tokyo Corporation".

Then, when a touch operation on the "April 14, 2017" character string area is detected (steps 1001 to 1004), OCR processing is performed on that area and the characters "April 14, 2017" are acquired (steps 1005 to 1006). Because the file name input field 701 is not "empty", the underscore "_" serving as the delimiter is appended after the characters "Tokyo Corporation" (step 1008). At this point the file name is "Quotation_Tokyo Corporation_". The characters "April 14, 2017" are then appended (step 1009), and "Quotation_Tokyo Corporation_April 14, 2017" is displayed in the file name input field 701 (step 1010). The file name "Quotation_Tokyo Corporation_April 14, 2017" is thus finally obtained. In the preview area 710, as with the "Quotation" character string area, the frames around the "Tokyo Corporation" and "April 14, 2017" character string areas also change to thick frames.

The above is the file name generation processing. In the present embodiment, the case of setting a file name using the character recognition results of character string areas in a scanned image has been described as an example, but the scope of application of the present invention is not limited to setting a file name. For example, the invention may be applied to setting a data transfer destination, such as the upload destination of a scanned image, or to setting the destination of a FAX or e-mail transmission. In this case, for example, on the upload setting screen 900 of FIG. 9 described above, the character recognition results of the character string areas in the scanned image are displayed as selectable candidates for the folder path, so that the path name can be set according to the user's selection. Likewise, on a destination setting screen (not shown), the character recognition results of the character string areas in the scanned image are displayed as selectable candidates for a FAX number or e-mail address, so that the FAX number or e-mail address can be set according to the user's selection. In this way, the present invention is widely applicable to situations other than file name setting in which various information related to a scanned image (incidental information) is set, such as setting a data transfer destination.

Thus, according to the present embodiment, the character string areas that satisfy the display condition are displayed in a manner selectable and identifiable by the user. Compared with displaying all the character string areas of the scanned image in a selectable manner, this prevents erroneous operations by the user and improves convenience for the user.

Because OCR processing is performed only on the character string area selected by the user, character strings can be acquired from the scanned image data more efficiently than when OCR processing is performed on the entire scanned image data.

[Second Embodiment]
Next, as a second embodiment, a mode will be described in which a character string display condition based on the size of the character string area in the direction orthogonal to the text direction is provided, so that character string areas are displayed identifiably to the user without calculating their aspect ratios. Description of content common to the first embodiment, such as the flow of control of the scan operation in the MFP 110, is omitted, and the following description focuses on the display processing of character string areas, which is the feature of the present embodiment.

In the present embodiment, for example, the character string area display determination of FIG. 11 uses, instead of the aspect ratio of the character string area, a range of sizes in the height direction (or width direction) of the character string area (the range between an upper limit and a lower limit) corresponding to the document size of the scanned image data.

When the size of a character string area in the height direction (or width direction) is within the size range in the height direction (or width direction) corresponding to the document size, the character string area is displayed in the preview area of the preview screen in a manner selectable and identifiable by the user. On the other hand, when the size of the character string area in the height direction (or width direction) is outside that range, the character string area is not displayed in the preview area in a selectable and identifiable manner.
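The range check of this embodiment can be sketched as a lookup keyed by document size. The specification gives no concrete limits, so the values in `SIZE_RANGES`, the units, and the function name `is_selectable_by_size` are all hypothetical.

```python
from typing import Dict, Tuple

# Hypothetical (lower limit, upper limit) of the area size across the text
# direction, per document size. These numbers are placeholders for illustration.
SIZE_RANGES: Dict[str, Tuple[float, float]] = {
    "A4": (8.0, 40.0),
    "A3": (11.0, 57.0),
}


def is_selectable_by_size(
    cross_size: float,
    document_size: str,
    ranges: Dict[str, Tuple[float, float]] = SIZE_RANGES,
) -> bool:
    """True when the area's size across the text direction lies within the range.

    For horizontally written text, cross_size is the height of the area;
    for vertically written text, it is the width.
    """
    lower, upper = ranges[document_size]
    return lower <= cross_size <= upper
```

Unlike the first embodiment, no division is needed: a single measured size is compared against two limits selected by document size.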

The above is the content of the character string area display processing. This makes it possible to suppress the display of character string areas whose characters are so small or so large that they are clearly presumed not to be used for a file name or the like. Character string areas that the user is unlikely to select are thus hidden by the character string area display condition, while only character string areas of a specific size that are likely to be used for a file name or the like are displayed, which prevents erroneous selection by the user.
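The size-range determination of this embodiment can be sketched as a simple range check. This is a minimal sketch under stated assumptions, not the implementation of the embodiment: the `SIZE_RANGES` table, its pixel values, the inclusive bounds, and the function name are all hypothetical, because the text does not give concrete limits for each document size.

```python
# Hypothetical lower/upper limits (in pixels) on character string area height,
# keyed by document size. The values are illustrative assumptions only.
SIZE_RANGES = {
    "A4": (20, 120),
    "A3": (28, 170),
}


def is_selectable_by_height(area_height, document_size, ranges=SIZE_RANGES):
    """Return True if the area height lies within the range for the document size.

    Bounds are treated as inclusive, which is an assumption of this sketch.
    """
    lower, upper = ranges[document_size]
    return lower <= area_height <= upper


# Heights of three hypothetical character string areas on an A4 scan.
heights = [8, 40, 300]
shown = [h for h in heights if is_selectable_by_height(h, "A4")]
```

Only the middle area falls inside the assumed A4 range, so it alone would be drawn with an enclosing frame; the very small and very large areas would stay hidden.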

[Third Embodiment]
Next, a third embodiment is described in which, when there are multiple input areas (input fields), character string areas are displayed identifiably to the user based on the character string area display condition corresponding to each area. Specifically, when the screen has a file name input field and a folder name input field whose input is based on a character string area display condition different from that of the file name input field, the display condition corresponding to the input area is selected automatically, without the user having to choose it each time. Description of content common to the first embodiment, such as the flow of control of the scan operation in the MFP 110, is omitted. The following focuses on the processing for generating a file name and the like when there are multiple input areas with different input conditions, which is the feature of the present embodiment.

FIG. 12A is a diagram showing an example of a preview screen provided with multiple input areas. On the preview screen 1200, an input area 1202 for entering a file name and an input area 1203 for entering a folder name are displayed above the preview area 1201 in the center of the screen. The input area 1202 and the input area 1203 are displayed vertically adjacent to each other. FIG. 12B is a diagram showing the preview screen 1200 of FIG. 12A in a state where each character string area is displayed identifiably by an enclosing frame. In addition, on the preview screen 1200, the border of the folder name input area 1203 touched by the user has changed to a thick frame to indicate that the area is selected.

FIG. 13 is a flowchart showing details of the processing by which the preview display unit 426 redisplays the character string areas when the input area being operated is switched by a user operation. The description below follows the flowchart of FIG. 13.

In step 1301, the presence or absence of a touch operation by the user on the preview screen 1200 displayed on the touch panel is monitored. When a touch operation is detected, the process proceeds to step 1302. In step 1302, processing branches according to the content of the touch operation. If the touch operation is a press of the [Next] button or the [Back] button, this processing ends. If it is not a press of the [Next] button 1321 or the [Back] button, the process proceeds to step 1303.

In step 1303, the position coordinates (x, y) on the touch panel at which the touch operation was performed are acquired. In step 1304, it is determined whether the touched position coordinates overlap an input area. For example, the determination is made according to whether the touched position coordinates are included inside the file name input area 1202 or the folder name input area 1203 (inside the rectangle specified by the position coordinates of its four corners). If the touched position coordinates overlap either the file name input area 1202 or the folder name input area 1203, the process proceeds to step 1305. If they do not overlap, the process returns to step 1301.
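The overlap determination of step 1304 amounts to a point-in-rectangle test against each input area. The following is a minimal sketch, assuming axis-aligned rectangles; the `Rect` type, the `INPUT_AREAS` table, and all screen coordinates are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    """Axis-aligned rectangle given by its corner coordinates."""
    left: int
    top: int
    right: int
    bottom: int

    def contains(self, x, y):
        """Return True if the point (x, y) lies inside or on the rectangle."""
        return self.left <= x <= self.right and self.top <= y <= self.bottom


# Hypothetical screen coordinates for the two input areas.
INPUT_AREAS = {
    "file_name": Rect(100, 20, 700, 60),     # input area 1202
    "folder_name": Rect(100, 70, 700, 110),  # input area 1203
}


def find_touched_input_area(x, y, areas=INPUT_AREAS):
    """Return the name of the input area containing (x, y), or None."""
    for name, rect in areas.items():
        if rect.contains(x, y):
            return name
    return None
```

A `None` result corresponds to the branch that returns to step 1301; any other result identifies the selected input area used in step 1305.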

In step 1305, the character string area display condition is acquired for the input area that overlaps the touched position coordinates and was selected by the touch operation (hereinafter, the "selected input area"). This character string area display condition is set in advance for each input area.

Table 3 shows an example of the character string area display condition table held for each input area.

For example, for the file name input area (No. 1), into which a relatively long character string may be entered, 15 is set as the threshold for the aspect ratio of the character string area. For the folder name input area (No. 2), into which a long character string is unlikely to be entered and which receives only shorter character strings than the file name input area, 10 is set as the threshold for the aspect ratio of the character string area. The character string area display conditions are stored in advance in the HDD 214 or the like. When the preview display unit 426 accepts the selection of an input area, it acquires the character string area display condition corresponding to the selected input area from the HDD 214 or the like.

Subsequently, character string area display processing is performed from step 1306 onward. The processing in steps 1306 to 1311 is the same as steps 1103 to 1108 shown in FIG. 11 of the first embodiment. For example, the character string area display condition of the folder name input area (No. 2) is an aspect ratio threshold (upper limit) of 10. The aspect ratios of the character string areas corresponding to Nos. 1 to 8 in Table 2 are each 10 or less, satisfying the character string area display condition acquired in step 1305, so these areas are displayed in a manner identifiable to the user. The aspect ratios of the character string areas corresponding to Nos. 9 and 10 in Table 2, on the other hand, exceed 10, so these areas are not displayed in the preview area of the preview screen in a manner identifiable to the user. No enclosing frame is displayed where a character string such as the one indicated by reference numeral 1304 appears.

The above is the content of the character string area display processing when there are multiple input areas with different character string display conditions. This makes it possible to display, in a manner identifiable to the user, only the character string areas that can be entered into the input area. In other words, the character string areas to be displayed can be switched according to the input area currently selected.
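The switching of displayed areas by selected input area can be sketched as a lookup followed by a filter. This is an illustrative sketch only: the thresholds 15 and 10 are the values described for Table 3, but the dictionary keys, function names, and the three sample areas are hypothetical (they are not the values of Table 2), and horizontal text is assumed so that the aspect ratio is width divided by height.

```python
# Aspect ratio thresholds (upper limits) per input area, as described for Table 3.
DISPLAY_CONDITIONS = {"file_name": 15, "folder_name": 10}


def aspect_ratio(width, height):
    """Size along the text direction divided by the size across it.

    Horizontal text is assumed, so this is width / height.
    """
    return width / height


def areas_to_display(areas, selected_input_area, conditions=DISPLAY_CONDITIONS):
    """Return the areas whose aspect ratio does not exceed the area's threshold."""
    threshold = conditions[selected_input_area]
    return [a for a in areas if aspect_ratio(a["width"], a["height"]) <= threshold]


# Hypothetical character string areas in pixels; not the values of Table 2.
areas = [
    {"id": 1, "width": 200, "height": 40},  # ratio 5.0
    {"id": 2, "width": 480, "height": 40},  # ratio 12.0
    {"id": 3, "width": 800, "height": 40},  # ratio 20.0
]

folder_ids = [a["id"] for a in areas_to_display(areas, "folder_name")]
file_ids = [a["id"] for a in areas_to_display(areas, "file_name")]
```

With these sample areas, selecting the folder name input area would frame only area 1, while selecting the file name input area would frame areas 1 and 2; area 3 exceeds both thresholds and stays hidden.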

Note that, when the number of characters that can be entered into an input area is fixed, the character string areas displayed in a selectable and identifiable manner to the user can also be changed according to the number of characters already entered into the input area.

[Fourth Embodiment]
Next, a fourth embodiment is described in which a character string area that does not satisfy the character string display condition is displayed in a manner that the user can select and identify. Description of content common to the first embodiment, such as the flow of control of the scan operation in the MFP 110, is omitted; the following focuses on the display processing of the character string area, which is the feature of the present embodiment.

The present embodiment concerns, for example, the processing performed when a specific touch operation (for example, a long press or a double tap) different from those of the first to third embodiments described above is detected.

FIG. 14 is a flowchart showing details of the character string area display processing performed when a specific user operation is detected. The description below follows the flowchart of FIG. 14.

In step 1401, the presence or absence of a specific touch operation by the user on the preview screen displayed on the touch panel is monitored. Examples of the specific touch operation include touch operations, such as a long press on the screen or a double tap, that can be detected separately from the operation of pressing a button once. When a specific touch operation such as a long press is detected, the process proceeds to step 1402. In step 1402, processing branches according to the content of the touch operation. If the touch operation is a press of the [Next] button 721 or the [Back] button 720, this processing ends. If it is not a press of the [Next] button 721 or the [Back] button 720, the process proceeds to step 1403.

In step 1403, the position coordinates (x, y) on the touch panel at which the touch operation was performed are acquired. In step 1404, it is determined whether the touched position coordinates overlap a character string area that is not displayed in a manner selectable and identifiable by the user (hereinafter, a "hidden character string area"). For example, the determination is made according to whether the touched position coordinates are included inside a hidden character string area (inside the rectangle specified by the position coordinates of the four corners of the character string area). If the touched position coordinates overlap a hidden character string area, the process proceeds to step 1405. If they do not overlap, the process returns to step 1401.

In step 1405, the character string area overlapping the touched position coordinates is displayed on the preview screen in a manner selectable and identifiable by the user.

The above is the content of the display processing for hidden character string areas. Through this processing, even a character string area that was determined not to satisfy the character string area display condition and was placed in the hidden (unselectable) state can be displayed on the preview screen in a manner selectable and identifiable by the user. A desired character string can therefore be extracted from a scanned image containing characters by a simple operation, without requiring complicated work.
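Steps 1403 to 1405 can be sketched as a hit test restricted to areas that are currently hidden. This is a minimal sketch under stated assumptions: the dictionary layout with `rect` and `visible` keys, the function name, and the sample coordinates are hypothetical, and detection of the long press or double tap itself is assumed to have happened already.

```python
def reveal_hidden_area(areas, x, y):
    """Make the hidden area containing (x, y) selectable; return its id or None.

    Each area is a dict with "id", "rect" as (left, top, right, bottom),
    and "visible" (True if already shown with an enclosing frame).
    """
    for area in areas:
        left, top, right, bottom = area["rect"]
        inside = left <= x <= right and top <= y <= bottom
        if not area["visible"] and inside:
            area["visible"] = True  # corresponds to step 1405
            return area["id"]
    return None  # no hidden area was touched; keep monitoring


# Hypothetical areas: one already shown, one hidden by the display condition.
areas = [
    {"id": 1, "rect": (50, 50, 250, 90), "visible": True},
    {"id": 2, "rect": (50, 120, 900, 160), "visible": False},
]

revealed = reveal_hidden_area(areas, 400, 140)
```

A touch on an already visible area returns `None` here, since this sketch covers only the reveal path and leaves ordinary selection to the processing of the earlier embodiments.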

[Modification]
A modification of the present embodiment is described below.

It is also possible to generate a file name by adding, to the head of the file name displayed in the file name input field, a delimiter and the character string selected by the touch operation, in that order starting from the file name side.
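This modification can be illustrated with a short helper. It is a sketch only: the function name, the choice of an underscore as the delimiter, and the handling of an empty input field are assumptions, since the text above does not specify them.

```python
def prepend_to_file_name(current_name, selected, delimiter="_"):
    """Add the selected string and a delimiter to the head of the file name.

    Working outward from the existing name, the delimiter comes first and the
    selected string comes before it. If the input field is still empty, the
    selected string is used as is (an assumption of this sketch).
    """
    if not current_name:
        return selected
    return selected + delimiter + current_name


name = prepend_to_file_name("Invoice", "2017")
name = prepend_to_file_name(name, "Canon")
```

Each newly selected character string therefore ends up in front of everything entered so far.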

[Other Embodiments]
The present invention can also be realized by processing in which a program that implements one or more functions of the above-described embodiments is supplied to a system or apparatus via a network or a storage medium, and one or more processors in a computer of the system or apparatus read and execute the program. It can also be realized by a circuit (for example, an ASIC) that implements one or more functions.

110 MFP
120 File server
210 Control unit
211 CPU
212 ROM
213 RAM
214 HDD
410 Native function module
411 Scan execution unit
420 Additional function module
426 Preview display unit

Claims

An apparatus for performing predetermined processing on a scanned image obtained by scanning a document, comprising:
display control means for displaying a UI screen for performing the predetermined processing, on which a unit area presumed to be a continuous character string in the scanned image is displayed so as to be distinguishable by a user; and
setting means for performing OCR processing on a unit area selected by the user via the UI screen, and setting incidental information for the predetermined processing using a character string extracted by the OCR processing,
wherein the display control means displays the unit area on the UI screen so as to be distinguishable by the user according to the size of the unit area in a direction intersecting the text direction of the document.

The apparatus according to claim 1, wherein the display control means displays the unit area on the UI screen so as to be distinguishable by a user according to an aspect ratio of the unit area.

The apparatus according to claim 2, wherein the display control means displays, on the UI screen so as to be distinguishable by the user, the unit area for which the value obtained by dividing the size of the unit area in the direction along the text direction of the document by the size of the unit area in the direction intersecting the text direction of the document is equal to or less than a determination value.

The apparatus according to any one of claims 1 to 3, wherein the display control means displays the unit area on the UI screen together with the scanned image.

The apparatus according to any one of claims 1 to 4, wherein the UI screen has a plurality of input areas that accept input, and
the display control means displays, on the UI screen so as to be distinguishable by the user, the unit area corresponding to the input area selected by the user.

The apparatus according to any one of claims 1 to 4, wherein the UI screen has a plurality of input areas that accept input, and
the display control means switches the unit area displayed on the UI screen so as to be distinguishable by the user according to the input area selected by the user.

The apparatus according to any one of claims 1 to 6, wherein the display control means makes unselectable any unit area that is not displayed so as to be distinguishable by the user.

The apparatus according to claim 7, wherein, when the user performs a specific operation on a unit area that is not selectable by the user, the display control means displays the unit area on which the specific operation was performed on the UI screen so as to be selectable by the user.

The predetermined process is a process of storing data of the scan image,
The apparatus according to any one of claims 1 to 8, wherein the incidental information is a file name when the scanned image is stored.

The predetermined process is a process of storing data of the scan image,
The apparatus according to any one of claims 1 to 8, wherein the incidental information is a folder name for storing the scan image.

The predetermined process is a process of uploading the data of the scanned image to a predetermined place,
The apparatus according to any one of claims 1 to 8, wherein the incidental information is a path name indicating an upload destination.

The predetermined process is a process of faxing data of the scanned image,
The apparatus according to any one of claims 1 to 8, wherein the incidental information is a fax number.

The predetermined process is a process of transmitting data of the scan image by e-mail,
The apparatus according to any one of claims 1 to 8, wherein the incidental information is a mail address.

An apparatus for performing predetermined processing on a scanned image obtained by scanning a document, comprising:
display control means for displaying a UI screen for performing the predetermined processing, on which a unit area presumed to be a continuous character string in the scanned image is displayed so as to be selectable by a user; and
setting means for performing OCR processing on a unit area selected by the user via the UI screen, and setting incidental information for the predetermined processing using a character string extracted by the OCR processing,
wherein the display control means displays the unit area on the UI screen so as to be selectable by the user according to the size of the unit area in a direction intersecting the text direction of the document.

A control method of an apparatus for performing predetermined processing on a scanned image obtained by scanning a document, comprising:
a display control step in which display control means displays a UI screen for performing the predetermined processing, on which a unit area presumed to be a continuous character string in the scanned image is displayed so as to be selectable by a user; and
a setting step in which setting means performs OCR processing on a unit area selected by the user via the UI screen, and sets incidental information for the predetermined processing using a character string extracted by the OCR processing,
wherein, in the display control step, the unit area is displayed on the UI screen so as to be distinguishable by the user according to the size of the unit area in a direction intersecting the text direction of the document.

A program for causing a computer to function as the apparatus according to any one of claims 1 to 14.