JP4882523B2

JP4882523B2 - Document impression evaluation apparatus and document impression evaluation program

Info

Publication number: JP4882523B2
Application number: JP2006153882A
Authority: JP
Inventors: 吉嗣広瀬; 一志小澤
Original assignee: Fuji Xerox Co Ltd; Fujifilm Business Innovation Corp
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2006-06-01
Filing date: 2006-06-01
Publication date: 2012-02-22
Anticipated expiration: 2026-06-01
Also published as: JP2007323435A

Description

本発明は、ドキュメントから取得される視覚的情報に基づいて、当該ドキュメントが人間に与える印象に関する情報を提示するドキュメント印象評価装置及びドキュメント印象評価プログラムに関する。 The present invention relates to a document impression evaluation apparatus and a document impression evaluation program that present information related to an impression that a document gives to a person based on visual information acquired from the document.

企業のコーポレートマーク、商品、プレゼンテーション資料やウェブサイトのページ等の近年の情報処理の高速化及び大容量化に伴って、プレゼンテーション資料や各種のウェブサイトにおいて多数の色彩を施されたドキュメントが利用されるようになっている。これらのドキュメントにおいては、ドキュメント中に用いられる色の選択、配置、面積及び面積比などによって、人間に与える視覚的な印象が大きく異なる。 With the recent increase in information processing speed and capacity of corporate marks, products, presentation materials, website pages, etc., many colored documents are used in presentation materials and various websites. It has become so. In these documents, the visual impression given to humans varies greatly depending on the selection, arrangement, area, and area ratio of colors used in the documents.

しかしながら、実際のドキュメント作成の多くは一般的なユーザに委ねられている。ドキュメント作成に不慣れな一般的なユーザは、ドキュメントをいかに着色及び配色することによって、自分の意図する雰囲気、美観及び視認性が得ることができるか分からないことが多い。また、ドキュメント作成の専門家にとっても、作成したドキュメントがユーザに与える印象を客観的に評価する手段が必要とされている。 However, much of the actual document creation is left to the general user. A general user who is unfamiliar with document creation often does not know how to color and color a document to obtain his intended atmosphere, aesthetics and visibility. In addition, a means for objectively evaluating an impression given to a user by a created document is also required for a document creation specialist.

この問題を解決するために、人間の感性に基づいた処理やドキュメント評価の研究が盛んに行われており、なかでも人間の感性を評価する感性評価手段と、その結果とドキュメントの特徴とを解析するドキュメント解析手段とを結びつけた感性的ドキュメント作成支援手段が数多く提案されている。 In order to solve this problem, research on processing and document evaluation based on human sensitivity has been actively conducted, and in particular, sensitivity evaluation means for evaluating human sensitivity, and the results and document characteristics are analyzed. Many sensuous document creation support means have been proposed that combine document analysis means.

特許文献１及び特許文献２には、対話型の配色支援方法及びそれを用いた配色支援装置が開示されている。この配色支援方法では、ユーザが画像データを編集する際に、画像の背景、文字又はグラフなどの配色の適正を具体的な数値として提示したり、配色に適した色彩を候補として提示したりする。 Patent Document 1 and Patent Document 2 disclose an interactive color arrangement support method and a color arrangement support apparatus using the same. In this color arrangement support method, when a user edits image data, the appropriateness of the color arrangement such as the background of the image, characters or graphs is presented as a specific numerical value, or a color suitable for the color arrangement is presented as a candidate. .

また、特許文献３には、画像に関する視覚的な特徴量に対する条件と画像を見た人が感じる印象を表す表現情報とを関連付けたデータベースを参照して、評価対象となる画像から特徴量を抽出し、抽出された特徴量が満たす条件と関連付けられた表現情報を抽出し、その表現情報を画像が人間に与える印象に関する情報として出力画面に表示する画像印象評価装置が開示されている。 In Patent Document 3, a feature amount is extracted from an image to be evaluated with reference to a database in which a condition for a visual feature amount related to an image is associated with expression information representing an impression felt by a person who views the image. An image impression evaluation apparatus is disclosed that extracts expression information associated with a condition that the extracted feature amount satisfies, and displays the expression information on an output screen as information about an impression that the image gives to a person.

特開平７−３０６９３５号公報JP-A-7-306935 特開平８−２４９４４１号公報JP-A-8-249441 特開２００４−３０３７７号公報JP 2004-30377 A

しかしながら、上記従来技術では、評価対象とされるドキュメントの表現の目的に応じてドキュメントが人に与える印象を評価していなかった。例えば、公告等のドキュメントに含まれる商品の種類やサービスの内容、ホームページ等の情報の提供先の業種、対象者の性別や年齢等が異なることによってそのドキュメントが人に与える印象が異なるが、このようなドキュメントの表現の目的（情報の内容や情報の提供先）に応じた評価は行うことができなかった。 However, in the above prior art, an impression given to a person by a document is not evaluated according to the purpose of expressing the document to be evaluated. For example, the impression that the document gives to people varies depending on the type of product included in the document such as the announcement, the content of the service, the type of business to which information such as the homepage is provided, the gender and age of the target person, etc. Evaluation according to the purpose of document expression (information content and information provider) could not be performed.

本発明は、上記従来の技術の課題を鑑み、評価対象ドキュメントの表現の目的に応じてドキュメントの印象の評価を行うためのドキュメント印象評価装置及びドキュメント印象評価プログラムを提供することを目的とする。 SUMMARY OF THE INVENTION The present invention has been made in view of the above-described problems of the related art, and an object thereof is to provide a document impression evaluation apparatus and a document impression evaluation program for evaluating a document impression according to the purpose of expressing an evaluation target document.

本発明は、ドキュメントから抽出される視覚的情報に基づいて、当該ドキュメントが人間に与える印象に関する情報を提示するドキュメント印象評価装置であって、ドキュメントの内容に関するキーワードと分類区分とを関連付けた分類用データベースを参照して、評価対象となる被評価ドキュメントに含まれるキーワードに関連付けられている分類区分を判別するドキュメント判別手段と、前記被評価ドキュメントから少なくとも１つの特徴量を抽出する特徴量抽出手段と、前記分類区分毎に、ドキュメントに関する少なくとも１つの視覚的な特徴量に対する条件と、ドキュメントを見た人間が感じる印象を表す表現情報と、を関連付けた表現情報データベースを参照して、前記ドキュメント判別手段で得られた前記被評価ドキュメントの分類区分的及び前記被評価ドキュメントの特徴量が満たす条件、と関連付けられた表現情報を抽出する表現情報抽出手段と、前記抽出された表現情報を、前記被評価ドキュメントとなるドキュメントが人間に与える印象に関する情報として出力する出力手段と、を備え、前記被評価ドキュメントは、商品の広告であって、前記分類区分は、商品の種類、サービスの内容、又は閲覧者の特性の少なくとも１つで表されることを特徴とする。 The present invention relates to a document impression evaluation apparatus that presents information related to an impression given to a human by a document based on visual information extracted from the document. The document impression evaluation apparatus relates a keyword related to a document content to a classification category. A document discriminating unit that discriminates a classification category associated with a keyword included in an evaluated document to be evaluated with reference to a database; and a feature amount extracting unit that extracts at least one feature amount from the evaluated document; The document discriminating unit refers to an expression information database in which a condition for at least one visual feature amount related to a document is associated with expression information representing an impression felt by a person who views the document for each classification category. Of the document to be evaluated Impression class division and the conditions characteristic of the evaluation document satisfies the expression information extraction means for extracting the expression information associated with the representation of the extracted information, the the object to be evaluated document document has on the human And the output document is an advertisement of a product, and the classification is represented by at least one of a product type, service content, or viewer characteristics. It is characterized by that .

これら本発明に係るドキュメント印象評価装置はコンピュータで実行されるプログラムによって実現することができる。具体的には、ドキュメントから抽出される視覚的情報に基づいて、当該ドキュメントが人間に与える印象に関する情報を提示するドキュメント印象評価を行うためのドキュメント印象評価プログラムであって、コンピュータを、ドキュメントの内容に関するキーワードと分類区分とを関連付けた分類用データベースを参照して、評価対象となる被評価ドキュメントに含まれるキーワードに関連付けられている分類区分を判別するドキュメント判別手段と、前記被評価ドキュメントから少なくとも１つの特徴量を抽出する特徴量抽出手段と、前記分類区分毎に、ドキュメントに関する少なくとも１つの視覚的な特徴量に対する条件と、ドキュメントを見た人間が感じる印象を表す表現情報と、を関連付けた表現情報データベースを参照して、前記ドキュメント判別手段で得られた前記被評価ドキュメントの分類区分及び前記被評価ドキュメントの特徴量が満たす条件、と関連付けられた表現情報を抽出する表現情報抽出手段と、前記抽出された表現情報を、前記被評価ドキュメントとなるドキュメントが人間に与える印象に関する情報として出力する出力手段と、として機能させ、前記被評価ドキュメントは、商品の広告であって、前記分類区分は、商品の種類、サービスの内容、又は閲覧者の特性の少なくとも１つで表されることを特徴とする、ドキュメント印象評価プログラムによって実現することができる。 These document impression evaluation apparatuses according to the present invention can be realized by a program executed by a computer. Specifically, it is a document impression evaluation program for performing a document impression evaluation that presents information related to impressions given to human beings based on visual information extracted from a document, the computer including the contents of the document A document discriminating means for discriminating a classification category associated with a keyword included in the evaluated document to be evaluated with reference to a classification database in which keywords related to the classification category are associated, and at least one from the evaluated document An expression in which feature quantity extraction means for extracting one feature quantity, a condition for at least one visual feature quantity relating to a document, and expression information representing an impression felt by a person who sees the document are associated with each classification category Browse the information database before Conditions characteristic of classification categories and the evaluation target document obtained the evaluation target document meets the document discrimination means, and the expression information extraction means for extracting the expression information associated, the extracted expression information, the Functioning as an output means for outputting information about an impression given to a human by a document to be evaluated, wherein the evaluated document is an advertisement of a product, and the classification category includes a product type, a service content, Alternatively, it can be realized by a document impression evaluation program characterized by being represented by at least one of the viewer characteristics .

なお、上記ドキュメント印象評価プログラムによって実現される各手段は本発明の実施の形態におけるフローチャートの各ステップ又は各ステップの組み合わせと同等である。また、上記ドキュメント印象評価プログラムによって実現される各手段は１つのコンピュータによって実現する必要はなく、ネットワーク等によって情報伝達可能に接続された複数のコンピュータによって実現されてもよい。 Note that each means realized by the document impression evaluation program is equivalent to each step or combination of steps in the flowchart in the embodiment of the present invention. Each means realized by the document impression evaluation program need not be realized by one computer, and may be realized by a plurality of computers connected so as to be able to transmit information through a network or the like.

ここで、前記出力手段は、出力画面の領域を複数の分割画面領域に分割し、前記分割画面領域毎に前記条件を割り当て、前記被評価ドキュメントの特徴量が満たす前記条件に対応する前記分割画面領域に前記抽出された表現情報を表示することが好適である。また、前記出力手段は、出力画面の領域を複数の分割画面領域に分割し、前記分割画面領域毎に前記条件を割り当て、前記被評価ドキュメントの特徴量が満たす前記条件に対応する前記分割画面領域に前記被評価ドキュメントの特徴量に関する情報を表示することも好適である。 Here, the output means divides the area of the output screen into a plurality of divided screen areas, assigns the condition for each of the divided screen areas, and the divided screen corresponding to the condition that the feature amount of the evaluated document satisfies. It is preferable to display the extracted expression information in a region. The output means divides an output screen area into a plurality of divided screen areas, assigns the condition to each of the divided screen areas, and the divided screen area corresponding to the condition satisfied by the feature amount of the evaluated document. It is also preferable to display information on the feature amount of the document to be evaluated.

このように、出力画面を分割してチャート状に表現情報や特徴量に関する情報を表示させることによって、被評価ドキュメントの印象をユーザにわかり易く提示することができる。 In this way, by dividing the output screen and displaying the information related to the expression information and the feature amount in the form of a chart, the impression of the evaluated document can be presented to the user in an easily understandable manner.

本発明によれば、評価対象ドキュメントの表現の目的に応じて適切なドキュメントの印象の評価を行うことができる。 According to the present invention, it is possible to evaluate the impression of an appropriate document according to the purpose of expressing the evaluation target document.

以下に、本発明の実施の形態について図を参照して詳細に説明する。本発明の実施の形態における印象評価装置は、図１のように、制御部１０、内部記憶部１２、大容量記憶部１４、出力部１６、標準入力部１８及びドキュメント入力部２０から基本的に構成される。制御部１０、内部記憶部１２、大容量記憶部１４、出力部１６、標準入力部１８及びドキュメント入力部２０はバス２４を介して情報（データ）伝達可能に接続される。 Embodiments of the present invention will be described below in detail with reference to the drawings. As shown in FIG. 1, an impression evaluation apparatus according to an embodiment of the present invention basically includes a control unit 10, an internal storage unit 12, a large-capacity storage unit 14, an output unit 16, a standard input unit 18, and a document input unit 20. Composed. The control unit 10, the internal storage unit 12, the mass storage unit 14, the output unit 16, the standard input unit 18, and the document input unit 20 are connected via a bus 24 so that information (data) can be transmitted.

印象評価装置は、さらに、インターフェース部２２を含んでなることが好適である。インターフェース部２２は、他の構成要素とバス２４を介して情報（データ）伝達可能に接続される。インターフェース部２２は、ネットワーク２６を介して、印象評価装置の外部のサーバからウェブページ等のドキュメントデータを読み込むために用いることができる。 It is preferable that the impression evaluation apparatus further includes an interface unit 22. The interface unit 22 is connected to other components via a bus 24 so that information (data) can be transmitted. The interface unit 22 can be used for reading document data such as a web page from a server external to the impression evaluation apparatus via the network 26.

制御部１０は、内部記憶部１２に保持されているドキュメント印象評価プログラムを実行し、大容量記憶部１４に格納されているドキュメントデータ又はネットワーク２６を介してドキュメントデータを適宜読み込んで、そのドキュメントデータにより表現されるドキュメントの印象評価を行う。印象評価の結果は、出力部１６又は内部記憶部１２へ出力される。制御部１０には、一般的なコンピュータのＣＰＵを用いることができる。 The control unit 10 executes the document impression evaluation program held in the internal storage unit 12 and appropriately reads the document data stored in the large-capacity storage unit 14 or the document data via the network 26, and the document data Evaluate the impression of the document expressed by. The result of impression evaluation is output to the output unit 16 or the internal storage unit 12. As the control unit 10, a CPU of a general computer can be used.

内部記憶部１２は、制御部１０で実行される印象評価プログラム、評価処理に用いられる各パラメータ及び処理結果である印象評価値等を格納及び保持する。内部記憶部１２に保持されたデータは、バス２４を介して、制御部１０から適宜参照することができる。内部記憶部１２は、一般的な半導体メモリ、ハードディスク装置、光磁気ディスク装置等を適宜選択して用いることができる。 The internal storage unit 12 stores and holds an impression evaluation program executed by the control unit 10, each parameter used for evaluation processing, an impression evaluation value that is a processing result, and the like. The data held in the internal storage unit 12 can be referred to as appropriate from the control unit 10 via the bus 24. As the internal storage unit 12, a general semiconductor memory, a hard disk device, a magneto-optical disk device, or the like can be appropriately selected and used.

大容量記憶部１４は、制御部１０での印象評価の対象となるドキュメント（以下、被評価ドキュメントという）のドキュメントデータを格納及び保持する。ドキュメントデータは、ビットマップ形式、ｊｐｅｇ形式、ｔｉｆｆ形式、ＨＴＭＬ形式などの様々なデータ形式で保持される。大容量記憶部１４に保持されたドキュメントデータは、バス２４を介して、制御部１０から適宜参照することができる。大容量記憶部１４は、一般的なハードディスク装置、光磁気ディスク装置等の大容量の記憶装置を選択して用いることができる。 The large-capacity storage unit 14 stores and holds document data of a document (hereinafter referred to as an “evaluated document”) subjected to impression evaluation by the control unit 10. Document data is held in various data formats such as a bitmap format, a jpeg format, a tiff format, and an HTML format. The document data held in the large-capacity storage unit 14 can be referred to as appropriate from the control unit 10 via the bus 24. The large-capacity storage unit 14 can select and use a large-capacity storage device such as a general hard disk device or a magneto-optical disk device.

出力部１６は、制御部１０から出力された処理結果をユーザが確認可能な情報として出力する。また、印象評価の処理に必要な制御コマンドやパラメータをユーザが入力する際のユーザインターフェースとしても用いられる。出力部１６は、ディスプレイ装置、プリンタ又はタッチパネル等を適宜選択して用いることができる。 The output unit 16 outputs the processing result output from the control unit 10 as information that can be confirmed by the user. It is also used as a user interface when the user inputs control commands and parameters necessary for impression evaluation processing. As the output unit 16, a display device, a printer, a touch panel, or the like can be appropriately selected and used.

標準入力部１８は、ドキュメントの評価を行う際の制御コマンドやパラメータをユーザが入力するために用いられる。例えば、制御部１０に対して、内部記憶部１２に保持された印象評価プログラムの実行開始を命令する制御コマンドを入力する際に用いられる。標準入力部１８は、キーボード、マウス又はタッチパネル等を適宜選択して用いることができる。 The standard input unit 18 is used by a user to input control commands and parameters for document evaluation. For example, it is used when a control command for instructing the control unit 10 to start executing an impression evaluation program held in the internal storage unit 12 is used. The standard input unit 18 can be used by appropriately selecting a keyboard, a mouse, a touch panel, or the like.

ドキュメント入力部２０は、印象評価の対象となるドキュメントデータの入力を行う。ドキュメント入力部２０から入力されたドキュメントデータは、大容量記憶部１４又は内部記憶部１２に格納及び保持される。ドキュメント入力部２０は、スキャナ、デジタルカメラ又はデジタルビデオなどの一般的なドキュメント入力装置を適宜選択して用いることができる。 The document input unit 20 inputs document data to be subjected to impression evaluation. Document data input from the document input unit 20 is stored and held in the mass storage unit 14 or the internal storage unit 12. The document input unit 20 can appropriately select and use a general document input device such as a scanner, a digital camera, or a digital video.

本実施の形態の印象評価装置は、一般的なコンピュータによって基本的に構成することができる。また、上記構成に限定されるものではなく、適宜構成要素を付加、削除又は変更することが好適である。 The impression evaluation apparatus according to the present embodiment can be basically configured by a general computer. Moreover, it is not limited to the said structure, It is suitable to add, delete, or change a component suitably.

＜ドキュメント印象評価方法＞
以下に、上記印象評価装置を用いたドキュメントの印象評価の処理方法について、図を参照して説明する。図２に、第１のドキュメントの印象評価方法におけるフローチャートを示す。 <Document impression evaluation method>
A document impression evaluation processing method using the impression evaluation apparatus will be described below with reference to the drawings. FIG. 2 shows a flowchart of the first document impression evaluation method.

本印象評価方法は、図２のように、被評価ドキュメントの入力（ステップＳ１）、被評価ドキュメントの表現の目的の判定（ステップＳ２）、被評価ドキュメントの分割（ステップＳ３）、物理的特徴量を取得するサブルーチン（ステップＳ４）、印象評価の抽出（ステップＳ５）及び評価結果の出力（ステップＳ６）を含み、各ステップを順次行うことによって実行される。具体的には、各ステップをコンピュータで実行可能な印象評価プログラムとして内部記憶部１２に格納及び保持し、制御部１０によってその印象評価プログラムを実行することによって行われる。 As shown in FIG. 2, the present impression evaluation method inputs an evaluated document (step S1), determines the purpose of expressing the evaluated document (step S2), divides the evaluated document (step S3), and physical features. Including a subroutine (step S4) for acquiring the image, extraction of impression evaluation (step S5), and output of the evaluation result (step S6), which are executed by sequentially performing each step. Specifically, each step is stored and held in the internal storage unit 12 as an impression evaluation program that can be executed by a computer, and the impression evaluation program is executed by the control unit 10.

ユーザが、標準入力部１８から印象評価プログラムの実行開始の制御コマンドを入力することによって、制御部１０は内部記憶部１２に保持された印象評価プログラムを実行する。その結果、直ちにステップＳ１へ処理が移行される。 When the user inputs a control command for starting execution of the impression evaluation program from the standard input unit 18, the control unit 10 executes the impression evaluation program held in the internal storage unit 12. As a result, the process immediately proceeds to step S1.

ステップＳ１では、ドキュメント入力部２０を用いて、印象の評価対象となるドキュメントのドキュメントデータが取得される。例えば、ネットワーク２６を介して、ウェブページ等のディスプレイ上に表示されるドキュメントデータが取得される。また、ドキュメント入力部２０を用いて、紙等の印刷媒体上に表示されるドキュメントデータが取得される。ドキュメントデータは、大容量記憶部１４又は内部記憶部１２に格納及び保持され、制御部１０によって適宜読み出されて処理に供される。 In step S <b> 1, using the document input unit 20, document data of a document that is an impression evaluation target is acquired. For example, document data displayed on a display such as a web page is acquired via the network 26. In addition, document data displayed on a print medium such as paper is acquired using the document input unit 20. The document data is stored and held in the large-capacity storage unit 14 or the internal storage unit 12, and is appropriately read out by the control unit 10 for processing.

ステップＳ２では、被評価ドキュメントの表現の目的が判定される。本実施の形態では、被評価ドキュメントに含まれる文字データや画像データを解析して、被評価ドキュメントの表現の目的を判定して、ドキュメントを表現の目的の種類毎に分類する。 In step S2, the purpose of expressing the evaluated document is determined. In the present embodiment, character data and image data included in the document to be evaluated are analyzed, the purpose of expression of the document to be evaluated is determined, and the document is classified for each type of expression purpose.

被評価ドキュメントの広告等の特定の情報を閲覧者（ユーザ）に呈示することを目的する場合、被評価ドキュメントのコンテンツの内容に基づいて分類を行う。被評価ドキュメントのコンテンツに含まれる商品やサービスを表す情報を抽出し、抽出された情報に基づいて被評価ドキュメントを分類する。 When it is intended to present specific information such as an advertisement of the evaluated document to the viewer (user), classification is performed based on the content of the content of the evaluated document. Information representing a product or service included in the content of the evaluated document is extracted, and the evaluated document is classified based on the extracted information.

被評価ドキュメントの内容に基づいて分類する場合、ドキュメントの分類区分毎に１つ又は複数のキーワードを設定しておき、被評価ドキュメントから抽出された文字データと一致するキーワードが設定されている分類区分を調査し、一致するキーワードが設定されている分類区分にその被評価ドキュメントを分類することができる。 When classifying based on the contents of the evaluated document, one or more keywords are set for each document classification category, and the classification category is set with a keyword that matches the character data extracted from the evaluated document. And the evaluated document can be classified into classification categories for which matching keywords are set.

例えば、「食品」の分類区分には“肉”，“野菜”，“果物”・・・等のキーワードを関連付け、「電化製品」の分類区分には“冷蔵庫”，“洗濯機”，“掃除機”・・・等のキーワードを関連付けドキュメント分類用データベースとして記憶部１２に登録する。そして、既存の単語抽出アルゴリズム等を用いて被評価ドキュメントのコンテンツからキーワードを抽出し、ドキュメント分類用データベースにおいて抽出されたキーワードに関連付けられている分類区分をその被評価ドキュメントのコンテンツの内容とする。例えば、被評価ドキュメントに“冷蔵庫”というキーワードが含まれている場合には、その被評価ドキュメントのコンテンツの内容は「電化製品」であると判定される。 For example, keywords such as “meat”, “vegetables”, “fruits”, etc. are associated with the “food” category, and “refrigerator”, “washing machine”, “cleaning” are associated with the “electrical appliance” category. A keyword such as “machine” is registered in the storage unit 12 as an associated document classification database. Then, a keyword is extracted from the content of the evaluated document using an existing word extraction algorithm or the like, and a classification category associated with the keyword extracted in the document classification database is set as the content of the content of the evaluated document. For example, when the keyword “refrigerator” is included in the evaluated document, it is determined that the content of the content of the evaluated document is “electric appliance”.

なお、ドキュメント分類用データベースに登録するキーワードと分類区分との関連付けを変えることによって、被評価ドキュメントの内容に基づいて被評価ドキュメントを「人物」、「風景」、「物」・・・の区分に分類することもできる。 In addition, by changing the association between the keywords registered in the document classification database and the classification categories, the documents to be evaluated are classified into categories of “person”, “landscape”, “object”, etc. based on the contents of the document to be evaluated. It can also be classified.

また、被評価ドキュメントが企業のウェブページ等の特定の特性を有する閲覧者に情報を呈示することを表現の目的とする場合、被評価ドキュメントの閲覧対象者の特性に基づいて分類を行うことも好適である。被評価ドキュメントを閲覧するであろうユーザの性別、年齢（年代）、居住地域、身体的特徴（身長、体重等）、経済的特徴（資産額、収入額、所得額等）等を抽出し、抽出された情報に基づいて被評価ドキュメントを分類する。 In addition, when the evaluated document is intended to present information to a viewer having specific characteristics, such as a company web page, the classification may be performed based on the characteristics of the target audience of the evaluated document. Is preferred. Extract the gender, age (age), residential area, physical characteristics (height, weight, etc.), economic characteristics (assets, income, income, etc.) of the users who will be browsing the evaluated document, Classify the evaluated document based on the extracted information.

具体的には、被評価ドキュメントのコンテンツに含まれる情報の提供先となる閲覧者（ユーザ）の分類区分毎にキーワードを関連付けてドキュメント分類用データベースとして記憶部１２に登録しておき、既存の単語抽出アルゴリズム等を用いて被評価ドキュメントに含まれているキーワードを抽出し、ドキュメント分類用データベースにおいて抽出されたキーワードに関連付けられている分類区分をその被評価ドキュメントのユーザの特性とする。 Specifically, a keyword is associated with each classification category of a viewer (user) who is a destination of information included in the content of the evaluated document, and is registered in the storage unit 12 as a document classification database. A keyword included in the evaluated document is extracted using an extraction algorithm or the like, and a classification category associated with the extracted keyword in the document classification database is set as a user characteristic of the evaluated document.

例えば、「男性・２０代」の分類区分には“自動車”、“コンピュータ”・・・等のキーワードを関連付け、「女性・２０代」の分類区分には“携帯電話”、“化粧品”・・・等のキーワードを関連付けてドキュメント分類用データベースとして記憶部１２に登録する。他の性別と他の年代の組み合わせについても同様にキーワードを関連付けてドキュメント分類用データベースに登録する。そして、被評価ドキュメントに含まれるキーワードを抽出し、その抽出されたキーワードが“自動車”であれば、ドキュメント分類用データベースから「男性・２０代」という分類区分が被評価ドキュメントの閲覧者の特性として決定される。 For example, keywords such as “automobile”, “computer”,... Are associated with the category of “male / 20s”, and “mobile phone”, “cosmetics”,. The keywords such as are associated and registered in the storage unit 12 as a document classification database. Similarly, other genders and other age combinations are associated with keywords and registered in the document classification database. Then, a keyword included in the evaluated document is extracted, and if the extracted keyword is “automobile”, the classification category “male / 20s” from the document classification database is a characteristic of the viewer of the evaluated document. It is determined.

なお、標準入力部１８を用いて被評価ドキュメントの表現の目的をユーザが入力することによって被評価ドキュメントの表現の目的を手動で設定してもよい。 The purpose of expressing the evaluated document may be manually set by the user inputting the purpose of expressing the evaluated document using the standard input unit 18.

ステップＳ３では、被評価ドキュメントの分割処理が行われる。ステップＳ１で取得されるドキュメントデータは、図３に示すようなウェブサイト等のドキュメント全体であってもよいし、ヘッダ部３０、タイトル部３１、見出し部３２、リンクボタン部３３、イメージ画像部３４及びテキスト部３５等に領域分割されたドキュメントであってもよい。また、Ｋ平均アルゴリズムによるクラスタリング等に代表される画像領域分割アルゴリズムを適用してドキュメントを自動的に領域分割して、それぞれを被対象ドキュメントとすることも好適である。 In step S3, a process for dividing the document to be evaluated is performed. The document data acquired in step S1 may be an entire document such as a website as shown in FIG. 3, or a header part 30, a title part 31, a heading part 32, a link button part 33, and an image image part 34. Also, the document may be divided into regions such as a text portion 35. It is also preferable to automatically divide the document into regions by applying an image region dividing algorithm typified by clustering based on the K-average algorithm, and to make each a target document.

ステップＳ４では、ドキュメント領域ごとに物理的な特徴量が算出される。ステップＳ４は、図４に示すように、サブルーチンとして処理される。以下では、ウェブサイト等のドキュメントを幾つかのドキュメント領域に分割して各領域を被評価ドキュメントとして処理する場合について説明する。 In step S4, a physical feature amount is calculated for each document area. Step S4 is processed as a subroutine as shown in FIG. Hereinafter, a case where a document such as a website is divided into several document areas and each area is processed as an evaluated document will be described.

ここで、物理的な特徴量とは、ドキュメント領域の面積、ドキュメント領域の幅ｗ及び高さｈ、左上の位置ｘ，ｙ、ドキュメント領域全体の色平均、Ｌ^＊成分の平均値、ａ^＊成分の平均値、ｂ^＊成分の平均値、ドキュメント全体の色平均とドキュメント領域全体の色平均の色差ΔＬ^＊ａ^＊ｂ^＊、ドキュメント領域と近接するドキュメント領域との色平均の色差Δ_ｎＬ^＊ａ^＊ｂ^＊、背景とドキュメント領域との色平均の色差Δ_ｂＬ^＊ａ^＊ｂ^＊、ドキュメント領域内における色の分散、同色数などのドキュメントの特徴をいう。 Here, the physical feature amounts are the area of the document area, the width w and height h of the document area, the upper left positions x and y, the color average of the entire document area, the average value of the L ^* component, and the a ^* component. mean values of, b ^* average component, color difference color average and the document area overall color average of the entire document ΔL * ^a * ^b ^*, color differences in color average of the document area to be close to the document region Δ _n L ^* a ^* B ^* , the color difference of the average color between the background and the document area [Delta _] bL ^* a ^* b ^* , the color dispersion in the document area, the number of the same color, and the like.

以下、各特徴量の算出方法について説明するが、本実施の形態では総ての特徴量について算出する必要はなく、ステップＳ４以降においてドキュメントの評価に使用される特徴量のみについて算出すればよい。 Hereinafter, although the calculation method of each feature-value is demonstrated, it is not necessary to calculate about all the feature-values in this Embodiment, What is necessary is just to calculate only about the feature-value used for document evaluation in step S4 and subsequent steps.

ステップＳ４１では、分割された各ドキュメント領域のドキュメントデータの色空間がＲＧＢ空間からＬ^＊ａ^＊ｂ^＊空間に変換される。通常、コンピュータで利用されるドキュメントデータは、不均等色空間であるＲＧＢ色空間によって表現されていることが多く、人の感覚に近い分析ができない。そこで、不均等色空間であるＲＧＢ色空間から均等色空間であるＬ^＊ａ^＊ｂ^＊色空間に変換することによって、ドキュメントをより人の感性に近い印象として分析・評価することができる。また、Ｌ^＊ａ^＊ｂ^＊空間はドキュメントのダイナミックレンジを表現できる点からも好適である。勿論、ドキュメントデータが既にＬ^＊ａ^＊ｂ^＊色空間で表現されている場合には変換をする必要はない。 In step S41, the color space of the document data of each divided document area is converted from the RGB space to the L ^* a ^* b ^* space. In general, document data used in a computer is often expressed in an RGB color space, which is a non-uniform color space, and cannot be analyzed close to the human sense. Therefore, by converting the RGB color space, which is a non-uniform color space, to the L ^* a ^* b ^* color space, which is a uniform color space, the document can be analyzed and evaluated as an impression closer to human sensitivity. The L ^* a ^* b ^* space is also preferable because it can express the dynamic range of the document. Of course, if the document data is already expressed in the L ^* a ^* b ^* color space, no conversion is necessary.

図５に、均等色空間を一般的なマンセル色相環として表現した例を示す（図面上において、異なる色は、異なるハッチングによって表現する）。通常、マンセル色相環はさらに多色の色分類を行っているが、図５にはその一部を示した。以下、色空間を色相及び明度ともに１０分割したマンセル色相環を用いて説明する。勿論、さらに色空間を細分化したカラーパレットを用いることが好適である。 FIG. 5 shows an example in which the uniform color space is expressed as a general Munsell hue ring (different colors are expressed by different hatching in the drawing). Usually, the Munsell hue ring performs multi-color classification, but a part of them is shown in FIG. In the following, description will be made using a Munsell hue ring in which the color space is divided into 10 for both hue and brightness. Of course, it is preferable to use a color palette in which the color space is further subdivided.

ステップＳ４２では、色空間変換された各ドキュメント領域のドキュメントデータを大容量記憶部１４又は内部記憶部１２に格納及び保持する。 In step S42, the document data of each document area subjected to color space conversion is stored and held in the large-capacity storage unit 14 or the internal storage unit 12.

ステップＳ４３では、各ドキュメント領域に対して位置、面積、色平均、色の分散及び色数の特徴量の抽出を行う。大容量記憶部１４又は内部記憶部１２に保持されている各ドキュメント領域のドキュメントデータを順次選択し、それぞれのドキュメント領域に対する各特徴量を求める。このとき、全ドキュメント領域の中において最大面積を有するドキュメント領域を背景領域と定義する。 In step S43, the feature quantity of position, area, color average, color dispersion, and number of colors is extracted for each document area. The document data of each document area held in the large-capacity storage unit 14 or the internal storage unit 12 is sequentially selected, and each feature amount for each document area is obtained. At this time, a document area having the maximum area among all document areas is defined as a background area.

例えば、位置は、各ドキュメント領域に対して左上の画素の位置ｘ，ｙを求めることにより決定する。面積は、各ドキュメント領域の幅ｗ及び高さｈを抽出することによって決定する。色平均及び色の分散は、各ドキュメント領域内の全画素のＬ^＊成分、ａ^＊成分及びｂ^＊成分に対して、それぞれ算術平均及び分散値を求めることによって決定する。また、色数は、各ドキュメント領域内の全画素に対して、Ｌ^＊成分、ａ^＊成分及びｂ^＊成分の全てが異なる色の数を累積することによって決定する。同様に、ドキュメント全体及び背景領域の色平均、色の分散及び色数を求めることができる。 For example, the position is determined by obtaining the position x, y of the upper left pixel for each document area. The area is determined by extracting the width w and height h of each document area. The color average and color variance are determined by calculating the arithmetic mean and variance value for the L ^* component, a ^* component, and b ^* component of all pixels in each document area, respectively. The number of colors is determined by accumulating the number of colors in which all of the L ^* component, a ^* component, and b ^* component are different for all pixels in each document area. Similarly, the color average, color dispersion, and number of colors of the entire document and the background area can be obtained.

ステップＳ４４では、各ドキュメント領域とドキュメント全体及び背景領域との色差を求める。まず、各ドキュメント領域についてドキュメント全体との各色成分の差ΔＬ_ｉ ^＊，Δａ_ｉ ^＊及びΔｂ_ｉ ^＊を求める。各色成分の差ΔＬ_ｉ ^＊，Δａ_ｉ ^＊及びΔｂ_ｉ ^＊は数式（１）で表される。
（数１）
ΔＬ_ｉ ^＊＝ドキュメント領域ｉのＬ^＊成分の色平均−ドキュメント全体のＬ^＊成分の色平均
Δａ_ｉ ^＊＝ドキュメント領域ｉのａ^＊成分の色平均−ドキュメント全体のａ^＊成分の色平均
Δｂ_ｉ ^＊＝ドキュメント領域ｉのｂ^＊成分の色平均−ドキュメント全体のｂ^＊成分の色平均
ここで、ｉはドキュメント領域の識別符号・・・・・・・・・・・・・（１） In step S44, the color difference between each document area, the entire document, and the background area is obtained. First, differences ΔL _i ^* , Δa _i ^*, and Δb _i ^* of the respective color components with respect to the entire document are obtained for each document area. Differences ΔL _i ^* , Δa _i ^*, and Δb _i ^* between the color components are expressed by Equation (1).
(Equation 1)
[Delta] L _i ^* = color average of ^{the L *} component of the document area i - color average of the entire document ^{L *} component .DELTA.a _i ^* = color average of ^{a *} component of the document area i - entire document ^{a *} component of the color average [Delta] b _i ^* = B of the document area i ^* color average of the component-b of the entire document ^* color average of the component where i is an identification code of the document area (1)

同様に、各ドキュメント領域について背景領域との各色成分の差ΔＬ_ｊ ^＊，Δａ_ｊ ^＊及びΔｂ_ｊ ^＊を求める。各色成分の差ΔＬ_ｊ ^＊，Δａ_ｊ ^＊及びΔｂ_ｊ ^＊は数式（２）で表される。
（数２）
ΔＬ_ｊ ^＊＝ドキュメント領域ｊのＬ^＊成分の色平均−背景領域のＬ^＊成分の色平均
Δａ_ｊ ^＊＝ドキュメント領域ｊのａ^＊成分の色平均−背景領域のａ^＊成分の色平均
Δｂ_ｊ ^＊＝ドキュメント領域ｊのｂ^＊成分の色平均−背景領域のｂ^＊成分の色平均
ここで、ｊはドキュメント領域の識別符号・・・・・・・・・・・・（２） Similarly, differences ΔL _j ^* , Δa _j ^*, and Δb _j ^* between the color components of the document area and the background area are obtained. Differences ΔL _j ^* , Δa _j ^*, and Δb _j ^* between the color components are expressed by Equation (2).
(Equation 2)
ΔL _j ^* = color average of ^{the L *} component of the document area j - color average of ^{the L *} component of the background area .DELTA.a _j ^* = color average of ^{a *} component of the document area j - the background area ^{a *} component of the color average [Delta] b _j ^* = B of the document area j ^* color average of the component-b of the background area ^* color average of the component where j is an identification code of the document area (2)

各ドキュメント領域とドキュメント全体との色差Δ_ｉＬ^＊ａ^＊ｂ^＊及び各ドキュメント領域と背景領域との色差Δ_ｂｊＬ^＊ａ^＊ｂ^＊は数式（３）を用いて求めることができる。
（数３）
Δ_ｉＬ^＊ａ^＊ｂ^＊＝｛ΔＬ_ｉ ^＊２＋Δａ_ｉ ^＊２＋Δｂ_ｉ ^＊２｝^１／２
Δ_ｂjＬ^＊ａ^＊ｂ^＊＝｛ΔＬ_ｊ ^＊２＋Δａ_ｊ ^＊２＋Δｂ_ｊ ^＊２｝^１／２
ここで、ｉ，ｊはドキュメント領域の識別子・・・・・・・・・・・・・（３） The color difference Δ _i L ^* a ^* b ^* between each document area and the entire document and the color difference Δ _bj L ^* a ^* b ^* between each document area and the background area can be obtained using Equation (3).
(Equation 3)
Δ _i L ^* a ^* b ^* = {ΔL _i ^{* 2} + Δa _i ^{* 2} + Δb _i ^{* 2} } ^1/2
Δ _bj L ^* a ^* b ^* = {ΔL _j ^{* 2} + Δa _j ^{* 2} + Δb _j ^{* 2} } ^1/2
Here, i and j are document area identifiers (3).

ステップＳ４５では、各ドキュメント領域とその領域に隣接するドキュメント領域との色差を求める。ドキュメント領域ｉがドキュメント領域ｊと隣接していた場合には、数式（４）を用いて各色成分の差ΔＬ_ｉｊ ^＊，Δａ_ｉｊ ^＊及びΔｂ_ｉｊ ^＊を求める。
（数４）
ΔＬ_ｉｊ ^＊＝ドキュメント領域ｉのＬ^＊成分の色平均−ドキュメント領域ｊのＬ^＊成分の色平均
Δａ_ｉｊ ^＊＝ドキュメント領域ｉのａ^＊成分の色平均−ドキュメント領域ｊのａ^＊成分の色平均
Δｂ_ｉｊ ^＊＝ドキュメント領域ｉのｂ^＊成分の色平均−ドキュメント領域ｊのｂ^＊成分の色平均
・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・（４） In step S45, the color difference between each document area and the document area adjacent to that area is obtained. When the document area i is adjacent to the document area j, the differences ΔL _ij ^* , Δa _ij ^*, and Δb _ij ^* of the respective color components are obtained using Equation (4).
(Equation 4)
[Delta] L _ij ^* = color average of ^{the L *} component of the document area i - color average of ^{the L *} component of the document area j .DELTA.a _ij ^* = color average of ^{a *} component of the document area i - of the ^{a *} component of the document area j color average [Delta] b _ij ^* = average color of ^{the b *} component of the document area i - of the document area j ^{b *} component of the color average ........................... (4)

ドキュメント領域ｉとドキュメント領域ｊとの色差Δ_ｎｉＬ^＊ａ^＊ｂ^＊は、数式（５）を用いて求めることができる。
（数５）
Δ_ｎｉＬ^＊ａ^＊ｂ^＊＝｛ΔＬ_ｉｊ ^＊２＋Δａ_ｉｊ ^＊２＋Δｂ_ｉｊ ^＊２｝^１／２
・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・（５） The color difference Δ _ni L ^* a ^* b ^* between the document area i and the document area j can be obtained using Equation (5).
(Equation 5)
_Δni L ^* a ^* b ^* = {ΔL _ij ^{* 2} + Δa _ij ^{* 2} + Δb _ij ^{* 2} } ^1/2
・・・・・・・・・・・・・・・・・・・・・・・・・・（5）

ドキュメント領域が複数のドキュメント領域と隣接していた場合には、隣接する全てのドキュメント領域に対する各色成分の差ΔＬ_ｉｊ ^＊，Δａ_ｉｊ ^＊及びΔｂ_ｉｊ ^＊を求め、それらの算術平均を行うことによって色差を求めることが好適である。 If the document area is adjacent to a plurality of document areas, the difference ΔL _ij ^* , Δa _ij ^* and Δb _ij ^* of each color component with respect to all adjacent document areas is obtained, and the color difference is obtained by performing an arithmetic average thereof. Is preferably obtained.

図６に、各ドキュメント領域に対して各特徴量を求めた結果を例示する。これらの特徴量は、各ドキュメント領域と対応付けられて大容量記憶部１４又は内部記憶部１２に格納及び保持される。各特徴量の抽出を終了すると、メインルーチンのステップＳ５に処理を移行する。 FIG. 6 illustrates the result of obtaining each feature amount for each document area. These feature amounts are stored and held in the mass storage unit 14 or the internal storage unit 12 in association with each document area. When the extraction of each feature amount is completed, the process proceeds to step S5 of the main routine.

ステップＳ５では、ステップＳ４で抽出された各特徴量に基づいてドキュメントの印象に対する評価を求める。具体的には、ドキュメントの特徴量の値とドキュメントの印象を表現する表現情報とを関連付けた表現情報データベースを予め準備しておき、ステップＳ４で抽出された特徴量に対応する表現情報を抽出する。表現情報は、テキスト情報のみならず、画像情報や音声情報とすることも好適である。 In step S5, an evaluation for the impression of the document is obtained based on each feature amount extracted in step S4. Specifically, an expression information database in which document feature value values and expression information expressing the impression of the document are associated is prepared in advance, and expression information corresponding to the feature values extracted in step S4 is extracted. . The expression information is preferably not only text information but also image information and audio information.

表現情報データベースには、すくなくとも１つの特徴量に関する条件と、ドキュメントを見た人間が感じる印象に関する表現情報とが関連付けられて登録される。このとき、同じ特徴量の条件であっても被評価ドキュメントの表現の目的の違いに応じた表現情報が関連付けられている。表現情報データベースは、内部記憶部１２又は大容量記憶部１４に格納及び保持される。 In the expression information database, a condition relating to at least one feature amount and expression information relating to an impression felt by a person who views the document are associated and registered. At this time, expression information corresponding to the difference in the purpose of expression of the evaluated document is associated even under the same feature amount condition. The expression information database is stored and held in the internal storage unit 12 or the mass storage unit 14.

ここでは、例として、図７に示す表現情報データベースを用いて説明を行う。本データベースでは、ドキュメント内の平均色のＬ^＊成分、ａ^＊成分及びｂ^＊成分の組み合わせを条件として情報の内容を表す商品種毎にそれぞれ表現情報が登録されている。すなわち、ここでの表現の目的は、ドキュメントを閲覧するユーザに対して示されるコンテンツに含まれる商品種である。 Here, as an example, description will be made using the expression information database shown in FIG. In this database, expression information is registered for each product type representing the content of information on the condition that the combination of the L ^* component, a ^* component, and b ^* component of the average color in the document. That is, the purpose of the expression here is the product type included in the content shown to the user who browses the document.

例えば、背景領域（面積が最も大きいドキュメント領域）のＬ^＊成分が６、ａ^＊成分が２０及びｂ^＊成分が１０である場合、ステップＳ２において被評価ドキュメントが表す商品が“電化製品”であると判定されている場合には、条件Ａ２に該当して「都会的な」という表現情報が抽出され、被評価ドキュメントが“食品”であると判定されている場合には、条件Ｂ２に該当して「ゴージャスな」という表現情報が抽出される。取得された特徴量が他の条件に合致する場合には、その条件に関連付けられた表現情報が抽出される。 For example, if the L ^* component of the background region (the document region having the largest area) is 6, the a ^* component is 20 and the b ^* component is 10, the product represented by the evaluated document in step S2 is “electric appliance”. If it is determined that the expression information “urban” is extracted under the condition A2, and the document to be evaluated is determined as “food”, the condition B2 is satisfied. Thus, the expression information “gorgeous” is extracted. When the acquired feature amount matches other conditions, expression information associated with the conditions is extracted.

また、上記のように、被評価ドキュメントの閲覧者の特性に基づいて処理を行う場合、図８に示す表現情報データベースを用いることができる。本データベースでは、ドキュメント内の平均色のＬ^＊成分、ａ^＊成分及びｂ^＊成分の組み合わせを条件として情報の閲覧者となるユーザの特性毎にそれぞれ表現情報が登録されている。すなわち、ここでの表現の目的は、ドキュメントを閲覧するユーザの性別及び年代である。 Further, as described above, when processing is performed based on the characteristics of the viewer of the evaluated document, the expression information database shown in FIG. 8 can be used. In this database, expression information is registered for each characteristic of the user who becomes the viewer of information on the condition that the combination of the L ^* component, a ^* component, and b ^* component of the average color in the document. That is, the purpose of the expression here is the gender and age of the user viewing the document.

例えば、背景領域（面積が最も大きいドキュメント領域）のＬ^＊成分が６、ａ^＊成分が２０及びｂ^＊成分が１０である場合、ステップＳ２において被評価ドキュメントの閲覧者となるユーザが“男性・２０代”であると判定されている場合には、条件Ａ２に該当して「ゴージャスな」という表現情報が抽出され、被評価ドキュメントが“女性・２０代”であると判定されている場合には、条件Ｂ２に該当して「かわいい」という表現情報が抽出される。取得された特徴量が他の条件に合致する場合には、その条件に関連付けられた表現情報が抽出される。 For example, if the L ^* component of the background area (the document area having the largest area) is 6, the a ^* component is 20 and the b ^* component is 10, the user who becomes the viewer of the evaluated document in step S2 is “male. If it is determined that it is “20's”, the expression information “gorgeous” corresponding to the condition A2 is extracted, and the document to be evaluated is determined to be “female / 20's” The expression information “cute” corresponding to the condition B2 is extracted. When the acquired feature amount matches other conditions, expression information associated with the conditions is extracted.

なお、ドキュメントの印象に関する情報を抽出するために用いることができる特徴量の条件はドキュメント内の平均色のＬ^＊成分、ａ^＊成分及びｂ^＊成分の組み合わせに限られるものではなく、例えば、背景領域の色平均（Ｌ^＊成分、ａ^＊成分及びｂ^＊成分）、背景領域からの色差が所定値以上で面積が最も大きいドキュメント領域（メイン領域）の色成分に関する特徴量、面積が所定値以下で背景領域からの色差が最も大きいドキュメント領域（アクセント領域）の色成分に関する特徴量を用いてもよい。また、特徴量としてはドキュメント領域の面積、ドキュメント領域の幅ｗ及び高さｈ、左上の位置ｘ，ｙ、ドキュメント領域全体の色平均、Ｌ^＊成分の平均値、ａ^＊成分の平均値、ｂ^＊成分の平均値、Ｒ成分の平均値、Ｇ成分の平均値、Ｂ成分の平均値、ドキュメント全体の色平均とドキュメント領域全体の色平均の色差ΔＬ^＊ａ^＊ｂ^＊、ドキュメント領域と近接するドキュメント領域との色平均の色差Δ_ｎＬ^＊ａ^＊ｂ^＊、背景とドキュメント領域との色平均の色差Δ_ｂＬ^＊ａ^＊ｂ^＊、ドキュメント領域内における色の分散、同色数を用いてもよい。 Note that the feature amount condition that can be used to extract information related to the impression of the document is not limited to the combination of the L ^* component, a ^* component, and b ^* component of the average color in the document. Area color average (L ^* component, a ^* component and b ^* component), feature amount related to color components of document area (main area) with color difference from background area greater than or equal to a predetermined value, and area less than predetermined value The feature amount relating to the color component of the document area (accent area) having the largest color difference from the background area may be used. The feature amount includes the area of the document area, the width w and height h of the document area, the upper left positions x and y, the color average of the entire document area, the average value of the L ^* component, the average value of the a ^* component, b ^* Average value of components, average value of R components, average value of G components, average value of B components, color difference ΔL ^* a ^* b ^* between the color average of the entire document and the color average of the entire document area, and close to the document area chrominance color average of the document region _{^{^{Δ n L * a * b *}}} , the background and the color difference of the color average of the document region _{^{^{Δ b L * a * b *}}} , the color of the dispersion in the document area, even with the same color number Good.

また、これらの特徴量を組み合わせた条件とドキュメントの印象に関する表現情報とを関連付けた多次元の表現情報データベースを用いてもよい。このように、出来る限り多くの特徴量を組み合わせることにより、ドキュメントが人間に与える印象をより詳細なものとすることができる。特に、ドキュメント領域間の色差など、異なるドキュメント領域間の特徴量の差異を含んだ条件を用いることにより、被評価ドキュメントの中で強いコントラストを持つ箇所等から受ける印象を表す表現情報を抽出することも可能となる。 Further, a multidimensional expression information database in which conditions combining these feature quantities and expression information related to the impression of a document are associated may be used. In this way, by combining as many feature quantities as possible, the impression that the document gives to humans can be made more detailed. In particular, by using conditions that include differences in feature quantities between different document areas, such as color differences between document areas, to extract expression information that represents impressions received from places with strong contrast in the evaluated document Is also possible.

また、特徴量の条件と組み合わせられる表現の目的についても、複数の属性を組み合わせてもよい。ドキュメントのコンテンツの種別を複数組み合わせて、それぞれの組み合わせについて特徴量の条件に関連付けて表現情報を登録してもよい。例えば、“電化製品”と“自動車”の組み合わせ、“食品”と“衣料品”の組み合わせ・・・に対して、それぞれ特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に関連付けられた表現情報を表現情報データベースに登録する等してもよい。 Moreover, you may combine a some attribute also about the objective of the expression combined with the conditions of a feature-value. A plurality of document content types may be combined, and the expression information may be registered in association with the feature amount condition for each combination. For example, for the combination of “electric appliance” and “automobile”, the combination of “food” and “clothing”, etc., the L ^* component, the a ^* component, and the b ^* component of the feature amount are combined. The associated expression information may be registered in the expression information database.

また、被評価ドキュメントを「人物」、「風景」、「物」・・・の区分に分類した場合、これらのコンテンツの種別毎に特徴量の条件と表現情報との関係を表現情報データベースとして登録して用いてもよい。 In addition, when the document to be evaluated is classified into “person”, “landscape”, “object”, etc., the relationship between the feature amount condition and the expression information is registered in the expression information database for each content type. May be used.

また、ドキュメントの受け手となるユーザの属性を単独又は複数組み合わせて、それぞれについて特徴量の条件に関連付けて表現情報を登録してもよい。例えば、“性別”と“年代”と“居住地域”の組み合わせに対して、それぞれ特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に関連付けられた表現情報を表現情報データベースに登録する等してもよい。 Further, the expression information may be registered in association with the feature amount condition for each of the attributes of the user who will receive the document, alone or in combination. For example, for the combination of “gender”, “age”, and “residential area”, the expression information associated with the combination of the L ^* component, the a ^* component, and the b ^* component of the feature amount is stored in the expression information database. You may register.

ステップＳ６では、出力部１６において、取得された特徴量に関する情報及び抽出された表現情報を表示する。出力は、ドキュメントに関する情報がユーザに把握し易い態様で行うことが好適である。 In step S <b> 6, the output unit 16 displays information about the acquired feature amount and the extracted expression information. The output is preferably performed in such a manner that the user can easily grasp information about the document.

例えば、図９に示すように、被評価ドキュメント９０、被評価ドキュメントに含まれる各ドキュメント領域の面積及び平均色の分布を示すドキュメント色彩情報表示部９１、特徴量に基づいて抽出された表現情報を示すカラーイメージ９２及び主なドキュメント領域の平均色を示す表示部９３を含むことが好適である。 For example, as shown in FIG. 9, a document 90 to be evaluated, a document color information display unit 91 indicating the distribution and average color distribution of each document area included in the document to be evaluated, and expression information extracted based on the feature amount It is preferred to include a color image 92 to be shown and a display 93 to show the average color of the main document area.

ドキュメント色彩情報表示部９１は、図１０に示すように、被評価ドキュメントに含まれるドキュメント領域毎に、各ドキュメント領域の平均色に最も近似するカラーパレット上の色を求め、カラーパレットの色毎に、その色に近似する平均色を有するドキュメント領域の合計面積を求め、その面積に比例した領域をそのカラーパレットの色で塗り潰して表示する。 As shown in FIG. 10, the document color information display unit 91 obtains, for each document area included in the evaluated document, a color on the color palette that most closely approximates the average color of each document area, and for each color of the color palette. Then, the total area of the document areas having an average color approximate to the color is obtained, and the area proportional to the area is filled with the color of the color palette and displayed.

具体的には、図１１に示すように、ドキュメント色彩情報表示部９１の画面位置とカラーパレットに含まれる色成分範囲の条件とを関連付けたデータベースを内部記憶部１２又は大容量記憶部１４に格納及び保持しておき、内部記憶部１２又は大容量記憶部１４に保持された各ドキュメント領域の平均色の各色成分（Ｌ^＊成分、ａ^＊成分及びｂ^＊成分）及び面積を参照して、各色成分範囲の条件と一致する平均色を有するドキュメント領域の合計面積を求め、その色成分範囲の条件に関連付けられた画面位置に、その合計面積に比例する領域をそのカラーパレットの色で塗り潰したドキュメント情報を表示する。 Specifically, as shown in FIG. 11, a database in which the screen position of the document color information display unit 91 and the condition of the color component range included in the color palette are associated is stored in the internal storage unit 12 or the large capacity storage unit 14. Each color is referred to by referring to each color component (L ^* component, a ^* component and b ^* component) and area of the average color of each document area held in the internal storage unit 12 or the large-capacity storage unit 14 A document in which the total area of a document area having an average color that matches the component range condition is obtained, and the area proportional to the total area is filled with the color palette color at the screen position associated with the color component range condition. Display information.

このドキュメント色彩情報表示部９１によって、ユーザは、評価対象となったドキュメントに含まれる色（平均色）の分布を一見して確認することができる。 The document color information display unit 91 allows the user to confirm at a glance the distribution of colors (average colors) included in the document to be evaluated.

カラーイメージ９２は、図１２に示すように、出力画面の領域を複数の分割画面領域９５として分割し、分割画面領域９５毎にステップＳ４のデータベースに含まれる特徴量に対する条件を予め割り当てておき、ステップＳ４においてドキュメントの印象に関する情報を抽出する際に用いた条件が割り当てられた分割画面領域９５に、その条件から抽出された表現情報を表示する。 As shown in FIG. 12, the color image 92 divides the area of the output screen into a plurality of divided screen areas 95, and assigns a condition for the feature amount included in the database in step S4 to each divided screen area 95 in advance. In step S4, the expression information extracted from the condition is displayed in the divided screen area 95 to which the condition used when extracting the information related to the impression of the document is assigned.

具体的には、図１３に示すように、各分割画面領域９５の位置情報と、図７に示すデータベースに含まれる特徴量に対する条件と、を対応付けた表示用データベースを予め内部記憶部１２又は大容量記憶部１４に保持しておき、このデータベースを参照して、一致する条件の画面の位置に表現情報を表示する。このとき、分割画面領域９５に条件を割り当てる際に、人間がドキュメントから感じる印象が近い条件ほど、互いに近接する分割画面領域９５に割り当てることが好適である。 Specifically, as shown in FIG. 13, a display database in which the position information of each divided screen area 95 and the condition for the feature amount included in the database shown in FIG. The expression information is displayed at the position of the screen of the matching condition with reference to the database by storing in the large-capacity storage unit 14. At this time, when assigning conditions to the divided screen area 95, it is preferable to assign conditions closer to the impression that humans feel from the document to the divided screen areas 95 that are close to each other.

例えば、図１４のカラーイメージ９２では、“暖かい”印象を与える条件ほどカラーイメージ９２の左側に割り当て、逆に、“冷たい”印象を与える条件ほどカラーイメージ９２の右側に割り当てている。また、“柔らかい”印象を与える条件ほどカラーイメージ９２の上側に割り当て、逆に、“硬い”印象を与える条件ほどカラーイメージ９２の下側に割り当てている。但し、これらに限られるものではなく、ユーザが必要とする情報に応じて割り当てを変更しても良い。 For example, in the color image 92 of FIG. 14, conditions that give a “warm” impression are assigned to the left side of the color image 92, and conversely, conditions that give a “cold” impression are assigned to the right side of the color image 92. In addition, a condition that gives a “soft” impression is assigned to the upper side of the color image 92, and conversely, a condition that gives a “hard” impression is assigned to the lower side of the color image 92. However, the assignment is not limited to these, and the assignment may be changed according to information required by the user.

例えば、画面領域の左側から右側に向けてａ^＊成分が増加し、画面領域の下側から上側に向けてｂ^＊成分が増加するような条件を割り当てることによって、人間に与える印象が近い条件を、より近接する分割画面領域９５に割り当てることができる。他の特徴量についても同様である。このように、条件を傾向付けて割り当てることによって、カラーイメージ９２に表示された情報の位置や集中度から評価対象となったドキュメントの印象に対する傾向を知ることができる。 For example, by assigning a condition in which the a ^* component increases from the left side to the right side of the screen area and the b ^* component increases from the lower side to the upper side of the screen area, , It can be assigned to a divided screen area 95 that is closer. The same applies to other feature amounts. In this way, by assigning conditions with a tendency, it is possible to know the tendency for the impression of the document to be evaluated from the position and concentration of the information displayed on the color image 92.

また、ドキュメントの表現の目的毎に表示用データベースを登録し、ステップＳ２で判別された被評価ドキュメントの表現の目的に応じて参照する表示用データベースを変更して処理を行う。本実施の形態では、図１３のように情報の内容が“電化製品”、“食品”・・・であるドキュメントに対する表示用データベースを準備する。 Further, a display database is registered for each purpose of document expression, and processing is performed by changing the display database to be referred to according to the purpose of expression of the evaluated document determined in step S2. In the present embodiment, as shown in FIG. 13, a display database is prepared for documents whose information content is “electric appliances”, “food”,.

例えば、被評価ドキュメントの１つであるドキュメント２の情報の内容が“電化製品”であって、ドキュメント２から取得された特徴量が図７の条件Ａ２に合致して「都会的な」という表現情報が抽出された場合、図１３の表示用データベースを参照して、図９及び図１４のように、カラーイメージ９２における条件Ａ１が割り当てられた分割画面領域９６に表現情報である「都会的な」という文字を表示させる。 For example, the information content of the document 2 which is one of the evaluated documents is “electrical appliance”, and the feature amount acquired from the document 2 matches the condition A2 in FIG. When the information is extracted, referring to the display database in FIG. 13, as shown in FIGS. 9 and 14, the expression information is displayed in the divided screen area 96 to which the condition A <b> 1 in the color image 92 is assigned. "Is displayed.

さらに、条件に合致した特徴量を示す情報を同時に表示することが好適である。例えば、図９及び図１４に示すように、被評価ドキュメントであるドキュメント２について条件Ａ１に合致した被評価ドキュメントの面積に比例した領域９６を被評価ドキュメントの平均色で塗り潰して表示する。このように、カラーイメージ９２上にドキュメントの印象を示すテキスト情報と特徴量を示す情報とを同時に表示することによって、ユーザは、どのような特徴量に基づいてドキュメントの印象が評価されたかを、その特徴量に対応付けて容易に認識することが可能となる。 Furthermore, it is preferable to simultaneously display information indicating feature amounts that match the conditions. For example, as shown in FIG. 9 and FIG. 14, an area 96 proportional to the area of the document to be evaluated that matches the condition A1 is displayed with the average color of the document to be evaluated. Thus, by simultaneously displaying text information indicating the impression of the document and information indicating the feature amount on the color image 92, the user can determine what feature amount is used to evaluate the impression of the document. It can be easily recognized in association with the feature amount.

もちろん、ステップＳ５において複数の表現の目的を組み合わせて処理を行った場合、ステップＳ６での処理も複数の表現の目的を組み合わせた表示用データベースを用いて行うことができる。例えば、ステップＳ５において、図１５に示すように、“電化製品”と“自動車”の組み合わせ、“食品”と“衣料品”の組み合わせ・・・に対して、それぞれ特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に表現情報を関連付けた表現情報データベースを用いて被評価ドキュメントに対する表現情報を抽出することができる。この場合、ステップＳ６では、図１６に示すように、“電化製品”と“自動車”の組み合わせ、“食品”と“衣料品”の組み合わせ・・・に対して、特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に表示情報の表示位置を関連付けて登録した表示用データベースを用いて処理を行うことによって、ステップＳ５において表現情報の抽出処理に利用された特徴量の条件に関連付けられている表示位置にステップＳ５において抽出された表現情報を呈示することができる。 Of course, when processing is performed by combining a plurality of expressions in step S5, the processing in step S6 can also be performed using a display database that combines a plurality of expressions. For example, in step S5, as shown in FIG. 15, a combination of "Electronics" and "car", "food" and for the combination ... of "clothing", each characteristic quantity ^{L *} component, a Expression information for the document to be evaluated can be extracted using an expression information database in which expression information is associated with a condition combining the ^* component and the b ^* component. In this case, in step S6, as shown in FIG. 16, a combination of "Electronics" and "car", the combination ... for "food" and "clothing", the feature of ^{the L *} component, a By performing processing using the display database registered by associating the display position of the display information with the combination of the ^* component and the b ^* component, the feature amount condition used for the expression information extraction processing in step S5 is obtained. The expression information extracted in step S5 can be presented at the associated display position.

また、被評価ドキュメントを「人物」、「風景」、「物」・・・の区分に分類した場合も同様に処理を行うことができる。例えば、ステップＳ５において、図１７に示すように、“人物”、“風景”、“物”・・・に対して、それぞれ特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に表現情報を関連付けた表現情報データベースを用いて被評価ドキュメントに対する表現情報を抽出することができる。この場合、ステップＳ６では、図１８に示すように、“人物”、“風景”、“物”・・・に対して、特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に表示情報の表示位置を関連付けて登録した表示用データベースを用いて処理を行うことによって、ステップＳ５において表現情報の抽出処理に利用された特徴量の条件に関連付けられている表示位置にステップＳ５において抽出された表現情報を呈示することができる。 The same processing can be performed when the document to be evaluated is classified into “person”, “landscape”, “object”,... For example, in step S5, as shown in FIG. 17, a condition that combines the L ^* component, a ^* component, and b ^* component of the feature quantity with respect to “person”, “scenery”, “object”,. Expression information for the document to be evaluated can be extracted using an expression information database in which expression information is associated with. In this case, in step S6, as shown in FIG. 18, a condition in which the L ^* component, a ^* component, and b ^* component of the feature quantity are combined with “person”, “landscape”, “object”. By performing processing using the display database registered in association with the display position of the display information, the display position associated with the feature amount condition used in the expression information extraction process in step S5 is set in step S5. The extracted expression information can be presented.

また、被評価ドキュメントを情報の提供先に応じて分類して印象評価を行った場合も同様に処理を行うことができる。すなわち、図１９に示すように、情報の提供先毎に特徴量の条件に対して表現情報の表示位置を関連付けた表示用データベースを用いることが好適である。この場合、ステップＳ５において表現情報の抽出に用いられた特徴量の条件に関連付けられている表示位置にステップＳ５において抽出された表現情報を呈示する。 The same processing can also be performed when impression evaluation is performed by classifying evaluated documents according to information providing destinations. That is, as shown in FIG. 19, it is preferable to use a display database in which the display position of the expression information is associated with the feature amount condition for each information providing destination. In this case, the expression information extracted in step S5 is presented at the display position associated with the feature amount condition used for extraction of expression information in step S5.

また、ドキュメントの受け手となる閲覧者（ユーザ）の属性を単独又は複数組み合わせて、それぞれについて特徴量の条件に関連付けて表現情報を登録してもよい。例えば、“性別”と“年代”と“居住地域”の組み合わせに対して、それぞれ特徴量のＬ^＊成分、ａ^＊成分及びｂ^＊成分を組み合わせた条件に関連付けられた表現情報を表現情報データベースに登録する等してもよい。 Further, the expression information may be registered by associating the attributes of the viewer (user) serving as a document recipient alone or in combination with each other and associating them with the feature amount condition. For example, for the combination of “gender”, “age”, and “residential area”, the expression information associated with the combination of the L ^* component, the a ^* component, and the b ^* component of the feature amount is stored in the expression information database. You may register.

また、上記本実施の形態では、ドキュメントの表現の目的毎に異なる表現情報データベース及び表示用データベースを用いたが、ドキュメントの表現の目的によらず同じ表現情報データベース及び表示用データベースを用いて処理を行うこともできる。例えば、ステップＳ４で抽出された特徴量をドキュメントの表現の目的に応じて補正し、表現情報データベースを参照して補正された特徴量が満たす条件に対応付けられた表現情報を抽出し、また、表示用データベースを参照して補正された特徴量が満たす条件に対応付けられたカラーイメージ上の分割画面領域に表現情報等を表示させる。 Further, in the present embodiment, different expression information databases and display databases are used for each purpose of document expression, but processing is performed using the same expression information database and display database regardless of the purpose of document expression. It can also be done. For example, the feature amount extracted in step S4 is corrected according to the purpose of document expression, the expression information associated with the condition that the corrected feature amount satisfies is extracted with reference to the expression information database, Expression information or the like is displayed in the divided screen area on the color image associated with the condition that the feature amount corrected with reference to the display database satisfies.

＜第２のドキュメント印象評価方法＞
第２のドキュメント印象評価方法を以下に説明する。第２のドキュメント印象評価方法では、予め複数の色を含んだカラーパレットを準備し、そのカラーパレットに含まれる各色を組み合わせた条件にドキュメントの印象を表す表現情報を割り当てたデータベースを準備し、被評価ドキュメントから色に関する特徴量を抽出し、その特徴量に対応するカラーパレット上の色に割り当てられた表現情報を選択して表示する。 <Second document impression evaluation method>
The second document impression evaluation method will be described below. In the second document impression evaluation method, a color palette including a plurality of colors is prepared in advance, a database is prepared in which expression information representing the impression of the document is assigned to a combination of the colors included in the color palette. A feature amount relating to the color is extracted from the evaluation document, and the expression information assigned to the color on the color palette corresponding to the feature amount is selected and displayed.

以下に、カラーパレットを用いたドキュメントの印象評価の処理方法について、図を参照して説明する。上記第１のドキュメント印象評価方法と同様に、図２に示すように、被評価ドキュメントの入力（ステップＳ１）、被評価ドキュメントの表現の目的の判定（ステップＳ２）、被評価ドキュメントの分割（ステップＳ３）、物理的特徴量を取得するサブルーチン（ステップＳ４）、印象評価の抽出（ステップＳ５）及び評価結果の出力（ステップＳ６）を含み、各ステップを順次行うことによって実行される。 A document impression evaluation processing method using a color palette will be described below with reference to the drawings. As in the first document impression evaluation method, as shown in FIG. 2, input of the evaluated document (step S1), determination of the purpose of expressing the evaluated document (step S2), division of the evaluated document (step S3), including a subroutine for obtaining physical features (step S4), impression evaluation extraction (step S5), and evaluation result output (step S6), and is executed by sequentially performing each step.

ステップＳ１〜Ｓ３では、上記第１のドキュメント印象評価方法と同様に処理が行われるため、説明を省略する。 In steps S1 to S3, processing is performed in the same manner as in the first document impression evaluation method, and a description thereof will be omitted.

ステップＳ４では、ドキュメント領域毎に特徴量が算出され、その中の色に関する特徴量とカラーパレット上の色への対応付けが行われる。ステップＳ４は、図２０に示すように、サブルーチンとして処理される。 In step S4, a feature amount is calculated for each document area, and the feature amount relating to the color in the document region is associated with the color on the color palette. Step S4 is processed as a subroutine as shown in FIG.

ステップＳ４１では、各ドキュメント領域のドキュメントデータの色空間がＲＧＢ空間からＬ^＊ａ^＊ｂ^＊空間に変換される。ステップＳ４２では、色空間変換された各ドキュメント領域のドキュメントデータを大容量記憶部１４又は内部記憶部１２に格納及び保持する。これらの処理は、上記第１のドキュメント印象評価方法と同様であるため、説明は省略する。 In step S41, the color space of the document data in each document area is converted from the RGB space to the L ^* a ^* b ^* space. In step S42, the document data of each document area subjected to color space conversion is stored and held in the large-capacity storage unit 14 or the internal storage unit 12. Since these processes are the same as those in the first document impression evaluation method, description thereof will be omitted.

ステップＳ４７では、各ドキュメント領域に対して位置、面積、色の分散及び色数の特徴量の抽出を行うと共に、さらに各ドキュメント領域の色平均の特徴量に最も近似するカラーパレット上の色を特徴量として求める。 In step S47, the feature quantity of the position, area, color dispersion, and number of colors is extracted for each document area, and the color on the color palette that approximates the feature quantity of the color average of each document area is further characterized. Calculate as a quantity.

各ドキュメント領域の位置、面積、色の分散及び色数についての特徴量の算出方法は、上記第１のドキュメント印象評価方法と同様であるので、説明は省略する。 Since the calculation method of the feature amount regarding the position, area, color dispersion, and number of colors of each document area is the same as the first document impression evaluation method, description thereof is omitted.

カラーパレット上の色の特徴量は、上記第１のドキュメント印象評価方法と同様に算出した各ドキュメント領域の色平均に基づいてパターンマッチングを行うことによって、その色平均に最も近似するカラーパレット上の色を求める。 The feature amount of the color on the color palette is obtained by performing pattern matching based on the color average of each document area calculated in the same manner as the first document impression evaluation method, so that the color palette is closest to the color average. Find the color.

ステップＳ４８では、ドキュメント領域毎に求められたカラーパレット上の色と、そのドキュメント領域の面積に基づいて、ベースカラー、サブカラー及びアクセントカラーを算出する。ここで、ベースカラーとは、被評価ドキュメントの中で最大面積用いられている色をいう。サブカラーとは、ベースカラーからの色差が所定値以上であり、かつ、より広い面積に用いられている色を順に２色選択したものをいう。アクセントカラーとは、用いられている面積が被評価ドキュメントの面積に対して所定割合以下であり、かつベースカラーからの色差が大きいものから順に２色選択したものをいう。図２１に、選択されたベースカラー１色、サブカラー２色及びアクセントカラー２色を例示する。但し、色の選択条件はこれらに限られるものではなく、被評価ドキュメントの特徴を明確に示す色であれば良い。 In step S48, the base color, sub color, and accent color are calculated based on the color palette color obtained for each document area and the area of the document area. Here, the base color is a color used in the maximum area in the document to be evaluated. The sub color is a color in which the color difference from the base color is not less than a predetermined value and two colors used in a wider area are selected in order. The accent color is a color selected from two colors in order from the one in which the area used is equal to or less than a predetermined ratio to the area of the document to be evaluated and the color difference from the base color is large. FIG. 21 illustrates one selected base color, two sub-colors, and two accent colors. However, the color selection conditions are not limited to these, and any color that clearly shows the characteristics of the document to be evaluated may be used.

ステップＳ５では、選択された色、すなわちベースカラー、サブカラー及びアクセントカラーに基づいて、被評価ドキュメントが人間に与える印象に関する情報を抽出する。具体的には、予めベースカラー、サブカラー又はアクセントカラーを組み合わせた条件と、その組合せから人間が受ける印象に関する表現情報とを関連付けた表現情報データベースを作成しておき、選択されたベースカラー、サブカラー及びアクセントカラーの組合せと一致する条件に関連付けられた表現情報を抽出する。 In step S5, based on the selected color, that is, the base color, the sub color, and the accent color, information on the impression given to the human by the evaluated document is extracted. Specifically, an expression information database is created in advance by associating conditions that combine base colors, sub-colors, or accent colors with expression information related to impressions received by humans from the combinations, and select the selected base color and sub-color. Expression information associated with a condition that matches the combination of color and accent color is extracted.

このとき、本実施の形態においても、ベースカラー、サブカラー又はアクセントカラーを組み合わせた条件に対して被評価ドキュメントのタイプに応じて異なる表現情報を関連付けた表現情報データベースを用いて評価を行う。 At this time, also in the present embodiment, the evaluation is performed using the expression information database in which different expression information is associated with the condition combining the base color, the sub color, or the accent color according to the type of the document to be evaluated.

例として、図２２に示すデータベースを用いて説明する。本データベースでは、ベースカラーとサブカラー２色の組合せを条件としたデータベースと、アクセントカラー２色の組合せを条件としたデータベースを含んでいる。但し、ベースカラー、サブカラー及びアクセントカラーの組合せの条件はこれらに限られるものではなく、その他の組合せを条件としても良い。 An example will be described using the database shown in FIG. This database includes a database that uses a combination of base colors and two sub-colors as a condition, and a database that uses a combination of two accent colors. However, the combination conditions of the base color, the sub color, and the accent color are not limited to these, and other combinations may be used as the conditions.

本実施の形態においても被評価ドキュメントの表現の目的に応じて、同じ特徴量の値に対して異なる表現情報を関連付けた表現情報データベースを用いて評価を行う。これらのデータベースを参照すると、例えば、ステップＳ２において被評価ドキュメントが表す商品が“電化製品”であると判定されており、図２１のように被評価ドキュメントから選択されたベースカラー、サブカラー及びアクセントカラーが抽出された場合には、「クラシックな」というテキスト情報が抽出される。 Also in the present embodiment, the evaluation is performed using an expression information database in which different expression information is associated with the same feature value depending on the purpose of expression of the evaluated document. Referring to these databases, for example, it is determined in step S2 that the product represented by the evaluated document is an “electric appliance”, and the base color, sub color, and accent selected from the evaluated document as shown in FIG. When the color is extracted, the text information “classic” is extracted.

ステップＳ６では、出力部１６において、選択されたベースカラー、サブカラー及びアクセントカラーや抽出された表現情報を表示する。出力は、ドキュメントに関する情報がユーザに把握し易い態様で行うことが好適である。 In step S6, the output unit 16 displays the selected base color, sub color, accent color, and extracted expression information. The output is preferably performed in such a manner that the user can easily grasp information about the document.

例えば、図２３のように、被評価ドキュメント１６０、被評価ドキュメントに含まれるカラーパレット上の色の面積を示すドキュメント色彩情報表示部１６１、選択されたカラーから抽出された表現情報を表示するカラーイメージ１６２及び選択されたカラーを表示する表示部１６３を含むことが好適である。 For example, as shown in FIG. 23, a document 160 to be evaluated 160, a document color information display unit 161 indicating the color area on the color palette included in the document to be evaluated, and a color image that displays expression information extracted from the selected color. It is preferable to include a display unit 163 that displays 162 and the selected color.

ドキュメント色彩情報表示部１６１は、上記第１のドキュメント印象評価方法と同様に、被評価ドキュメント内に含まれるカラーパレット上の色の面積を、その面積に比例した画面領域をそのカラーパレット上の色で塗り潰して表示される。 Similarly to the first document impression evaluation method, the document color information display unit 161 displays the color area on the color palette included in the document to be evaluated and the screen area proportional to the area on the color palette. It is displayed filled with.

カラーイメージ１６２は、出力画面の領域を複数の分割画面領域に分割し、分割画面領域毎にベースカラー、サブカラー及びアクセントカラーの組合せからなる条件を割り当て、表現情報を抽出するときに用いた条件と一致する条件が割り当てられた領域に、その条件から抽出された表現情報を表示する。また、併せて、表現情報の抽出に用いられたベースカラー、サブカラー及びアクセントカラーを表示することも好適である。 The color image 162 divides an output screen area into a plurality of divided screen areas, assigns a condition composed of a combination of a base color, a sub color, and an accent color to each divided screen area, and is used when extracting expression information The expression information extracted from the condition is displayed in the area to which the condition that matches is assigned. In addition, it is also preferable to display the base color, the sub color, and the accent color used for extracting the expression information.

このとき、第１の実施の形態と同様に、ドキュメントの表現の目的毎に表示用データベースを準備して、ステップＳ２で決定された被評価ドキュメントの表現の目的に応じた表示用データベースを参照して表現情報やカラーを表示する画面位置を決定する。具体的な方法については、上記第１のドキュメント印象評価方法と同様であるので、ここでの説明は省略する。 At this time, as in the first embodiment, a display database is prepared for each purpose of document expression, and the display database corresponding to the purpose of expression of the evaluated document determined in step S2 is referred to. To determine the screen position for displaying expression information and color. Since the specific method is the same as the first document impression evaluation method, the description thereof is omitted here.

以上のように、第１及び第２のドキュメント印象評価方法によれば、ドキュメントが人間に与える印象を評価するために適した特徴量及びその組合せが満たす条件から、ドキュメントが人間に与える印象に関する情報を自動で提示することができる。したがって、ドキュメント作成の経験が少ないユーザであっても、ドキュメントデータを入力するだけで、的確な印象に関する情報を得ることができる。また、ドキュメントが人間に与える印象に関する情報のみならず、評価に用いられた特徴量等を、ユーザが把握し易い態様で提示することができ、ユーザはドキュメントに関する全般的な情報を容易に把握することができる。 As described above, according to the first and second document impression evaluation methods, information on the impression that the document gives to the human from the feature amount suitable for evaluating the impression that the document gives to the human and the condition that the combination satisfies. Can be presented automatically. Therefore, even a user who has little experience in document creation can obtain information regarding an accurate impression only by inputting document data. In addition to the information related to the impression that the document gives to humans, the feature quantities used in the evaluation can be presented in an easy-to-understand manner for the user, and the user can easily grasp general information about the document. be able to.

さらに、ドキュメントの表現の目的に応じて評価に適したデータベースを用いたり、ドキュメントの表現の目的に応じて特徴量を補正したりすることによって、ドキュメントの表現の目的に応じてドキュメントの印象を適切に評価することができる。 In addition, by using a database that is suitable for evaluation according to the purpose of document representation, or by correcting features according to the purpose of document representation, the impression of the document can be made appropriate for the purpose of document representation. Can be evaluated.

＜第３のドキュメント印象評価方法＞
ハイパーテキスト形式等のコンテンツが構造化された画像やドキュメントの場合には、それらを構成しているテキスト情報の構文を解析することによって、ドキュメントの領域分割及び特徴量の抽出を自動で行うことができる。 <Third document impression evaluation method>
In the case of images and documents in which content such as hypertext format is structured, document segmentation and feature extraction can be automatically performed by analyzing the syntax of the text information that composes the content. it can.

以下に、構造化されたドキュメントの印象評価の処理方法について、図を参照して説明する。上記第１のドキュメント印象評価方法と同様に、被評価ドキュメントの入力（ステップＳ１）、被評価ドキュメントの表現の目的の判定（ステップＳ２）、被評価ドキュメントの分割（ステップＳ３）、物理的特徴量を取得するサブルーチン（ステップＳ４）、印象評価の抽出（ステップＳ５）及び評価結果の出力（ステップＳ６）を含み、各ステップを順次行うことによって実行される。 Hereinafter, a processing method for impression evaluation of a structured document will be described with reference to the drawings. Similar to the first document impression evaluation method, input of the evaluated document (step S1), determination of the purpose of expression of the evaluated document (step S2), division of the evaluated document (step S3), physical feature amount Including a subroutine (step S4) for acquiring the image, extraction of impression evaluation (step S5), and output of the evaluation result (step S6), which are executed by sequentially performing each step.

ステップＳ１は、第１のドキュメント印象評価方法と同様に被評価ドキュメントが取得される。ステップＳ２では、被評価ドキュメントの表現の目的が判別される。ここでの処理は、第１の実施の形態と同様であるので説明を省略する。 In step S1, an evaluated document is acquired in the same manner as in the first document impression evaluation method. In step S2, the purpose of expressing the evaluated document is determined. Since the process here is the same as that of the first embodiment, a description thereof will be omitted.

ステップＳ３では、被評価ドキュメントの分割処理が行われる。ここでは、被評価ドキュメントを複数の領域に分割する際に、構造化されたドキュメントデータを解析し、構造化されているコンテンツ毎に分割ドキュメント領域として分割する。例えば、ネットワーク２６から読み込まれたＨＴＭＬ形式で記述されたドキュメントが評価対象である場合は、そのドキュメントデータに含まれる各タグで括られるコンテンツを１つの分割ドキュメント領域とする。例えば、＜ｈｅａｄ＞タグと＜／ｈｅａｄ＞タグの間をヘッダ部として分割することができる。 In step S3, a process for dividing the document to be evaluated is performed. Here, when the document to be evaluated is divided into a plurality of regions, the structured document data is analyzed and divided into divided document regions for each structured content. For example, when a document described in the HTML format read from the network 26 is an evaluation target, the content enclosed by each tag included in the document data is set as one divided document area. For example, a portion between a <head> tag and a </ head> tag can be divided as a header portion.

ステップＳ４では、ステップＳ２で分割された各ドキュメント領域から特徴量を抽出する。すなわち、各ドキュメント領域として分割されたコンテンツ毎の色、大きさ、配置等の情報を特徴量として抽出する。 In step S4, a feature amount is extracted from each document area divided in step S2. That is, information such as color, size, and arrangement for each content divided as each document area is extracted as a feature amount.

ステップＳ５及びステップＳ６では、第１のドキュメント印象評価方法と同様に処理を行うため、説明を省略する。 In step S5 and step S6, since the process is performed in the same manner as the first document impression evaluation method, the description is omitted.

以上のように、第３のドキュメント印象評価方法によれば、ハイパーテキスト形式等の構造化されたドキュメントデータから特徴量を抽出し、そのドキュメントが人間に与える印象に関する情報を提示することができる。さらに、被評価ドキュメントのビットマップ形式のドキュメントデータと、上記構文解析を組み合わせることによって、より正確な領域分割を行うこともできる。 As described above, according to the third document impression evaluation method, it is possible to extract feature amounts from structured document data such as a hypertext format and present information related to impressions given to humans by the document. Furthermore, by combining the document data in the bitmap format of the document to be evaluated and the above parsing, more accurate area division can be performed.

本実施の形態におけるドキュメント印象評価装置の構成のブロック図である。It is a block diagram of the structure of the document impression evaluation apparatus in this Embodiment. 本実施の形態におけるドキュメント印象評価方法のフローチャートを示す図である。It is a figure which shows the flowchart of the document impression evaluation method in this Embodiment. 評価対象となるドキュメントの例を示す図である。It is a figure which shows the example of the document used as evaluation object. 本発明の実施の形態における第１のドキュメント印象評価方法のステップＳ４のフローチャートを示す図である。It is a figure which shows the flowchart of step S4 of the 1st document impression evaluation method in embodiment of this invention. 均等色空間におけるマンセル色相環を示す図である。It is a figure which shows the Munsell hue ring in uniform color space. 本発明の実施の形態におけるドキュメント印象評価方法で取得された特徴量の例を示す図である。It is a figure which shows the example of the feature-value acquired with the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法で用いられる表現情報データベースの例を示す図である。It is a figure which shows the example of the expression information database used with the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法で用いられる表現情報データベースの別例を示す図である。It is a figure which shows another example of the expression information database used with the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法の評価結果の出力例を示す図である。It is a figure which shows the example of an output of the evaluation result of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のドキュメント色彩情報表示部を表示する方法の説明図である。It is explanatory drawing of the method of displaying the document color information display part of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のドキュメント色彩情報表示部を表示する方法の説明図である。It is explanatory drawing of the method of displaying the document color information display part of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のカラーイメージを表示する方法の説明図である。It is explanatory drawing of the method of displaying the color image of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のカラーイメージを表示するための表示用データベースの例である。It is an example of the database for a display for displaying the color image of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のカラーイメージの表示例を示す図である。It is a figure which shows the example of a display of the color image of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法で用いられる表現情報データベースの別例を示す図である。It is a figure which shows another example of the expression information database used with the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のカラーイメージを表示するための表示用データベースの別例である。It is another example of the database for a display for displaying the color image of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法で用いられる表現情報データベースの別例を示す図である。It is a figure which shows another example of the expression information database used with the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のカラーイメージを表示するための表示用データベースの別例である。It is another example of the database for a display for displaying the color image of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態におけるドキュメント印象評価方法のカラーイメージを表示するための表示用データベースの別例である。It is another example of the database for a display for displaying the color image of the document impression evaluation method in embodiment of this invention. 本発明の実施の形態における第２のドキュメント印象評価方法のステップＳ４のフローチャートを示す図である。It is a figure which shows the flowchart of step S4 of the 2nd document impression evaluation method in embodiment of this invention. 本発明の実施の形態における第２のドキュメント印象評価方法で取得されたベースカラー、サブカラー及びアクセントカラーの例を示す図である。It is a figure which shows the example of the base color, sub color, and accent color which were acquired with the 2nd document impression evaluation method in embodiment of this invention. 本発明の実施の形態における第２のドキュメント印象評価方法で用いられる表現情報データベースの例を示す図である。It is a figure which shows the example of the expression information database used with the 2nd document impression evaluation method in embodiment of this invention. 本発明の実施の形態における第２のドキュメント印象評価方法の評価結果の出力例を示す図である。It is a figure which shows the example of an output of the evaluation result of the 2nd document impression evaluation method in embodiment of this invention.

Explanation of symbols

１０制御部、１２内部記憶部、１４大容量記憶部、１６出力部、１８標準入力部、２０ドキュメント入力部、２２インターフェース部、２４バス、２６ネットワーク、３０ヘッダ部、３１タイトル部、３２見出し部、３３リンクボタン部、３４イメージ画像部、３５テキスト部、９０被評価ドキュメント、９１ドキュメント色彩情報表示部、９２カラーイメージ、９３平均色表示部、９５分割画面領域、９６表示領域、１６０被評価ドキュメント、１６１ドキュメント色彩情報表示部、１６２カラーイメージ、１６３カラー表示部。 10 control section, 12 internal storage section, 14 large capacity storage section, 16 output section, 18 standard input section, 20 document input section, 22 interface section, 24 bus, 26 network, 30 header section, 31 title section, 32 header section , 33 Link button part, 34 Image image part, 35 Text part, 90 Document to be evaluated, 91 Document color information display part, 92 Color image, 93 Average color display part, 95 Split screen area, 96 Display area, 160 Document to be evaluated 161 Document color information display section, 162 color image, 163 color display section.

Claims

A document impression evaluation device that presents information about an impression given to a human by the document based on visual information extracted from the document,
A document discriminating means for discriminating a classification category associated with a keyword included in an evaluated document to be evaluated with reference to a classification database in which keywords related to document contents are associated with classification categories ;
Feature quantity extraction means for extracting at least one feature quantity from the evaluated document;
For each classification category , the document discrimination means refers to an expression information database that associates a condition for at least one visual feature amount related to a document with expression information representing an impression felt by a person who views the document. Expression information extraction means for extracting expression information associated with the classification category of the obtained document to be evaluated and the condition that the feature amount of the document to be evaluated satisfies,
An output means for outputting the extracted expression information as information on an impression given to a human by the document to be evaluated;
Equipped with a,
The evaluated document is an advertisement of a product,
The classification is represented by at least one of a product type, service content, or viewer characteristics,
Document impression evaluation device.

The document impression evaluation device according to claim 1 ,
The output means divides the area of the output screen into a plurality of divided screen areas, assigns the condition for each of the divided screen areas, and assigns the condition to the divided screen area corresponding to the condition satisfied by the feature amount of the evaluated document. A document impression evaluation apparatus characterized by displaying extracted expression information.

The document impression evaluation device according to claim 1 or 2 ,
The output means divides the area of the output screen into a plurality of divided screen areas, assigns the condition for each of the divided screen areas, and assigns the condition to the divided screen area corresponding to the condition satisfied by the feature amount of the evaluated document. A document impression evaluation apparatus characterized by displaying information on a feature amount of a document to be evaluated.

A document impression evaluation program for performing a document impression evaluation based on visual information extracted from a document and presenting information on an impression given to a human by the document,
Computer
A document discriminating means for discriminating a classification category associated with a keyword included in an evaluated document to be evaluated with reference to a classification database in which keywords related to document contents are associated with classification categories ;
Feature quantity extraction means for extracting at least one feature quantity from the evaluated document;
For each classification category , the document discrimination means refers to an expression information database that associates a condition for at least one visual feature amount related to a document with expression information representing an impression felt by a person who views the document. Expression information extraction means for extracting expression information associated with the classification category of the obtained document to be evaluated and the condition that the feature amount of the document to be evaluated satisfies,
An output means for outputting the extracted expression information as information on an impression given to a human by the document to be evaluated;
To function as,
The evaluated document is an advertisement of a product,
The classification is represented by at least one of a product type, service content, or viewer characteristics.
Document impression evaluation program characterized by that.