WO2008035394A1

WO2008035394A1 - Electronic watermark embedding device

Info

Publication number: WO2008035394A1
Application number: PCT/JP2006/318492
Authority: WO
Inventors: Taizo Anan; Kensuke Kuraki; Motoo Masui; Shohei Nakagata; Hideaki Ishii
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2006-09-19
Filing date: 2006-09-19
Publication date: 2008-03-27
Anticipated expiration: 2009-03-19

Abstract

Information embedded in a document image is converted to a watermark symbol sequence expressed by a dot pattern, each watermark symbol is embedded in a subject embedding region, and embedding advisability is judged by checking whether it is detected. As a result, the watermark symbol is embedded in a region where embedding is possible. Further, the watermark symbol sequence is divided into a plurality of blocks, which are embedded in a document image as a plurality of watermark packets.

Description

明細書 Specification

電子透かし埋め込み装置 Digital watermark embedding device

技術分野 Technical field

[0001] 本発明は、文書データへ人間の目にはわかりにくい形で情報を埋め込む電子透かし埋め込み装置に関する。 The present invention relates to an electronic watermark embedding apparatus that embeds information in document data in a form that is difficult for human eyes to understand.

背景技術 Background art

[0002] 近年、会社内で保管する情報の電子化が進み、将来的にも紙文書による保管は減つていくことが予想される。し力しながら、紙文書はなくなるわけではないため、必然的に発生し続ける紙文書と電子文書の混在環境になっているのが現状である。また、企業や公共機関が取り扱う顧客データ等の個人情報の流出'漏洩が懸念されており、より厳格な管理システムの導入が求められている。実際に、情報漏洩の約半数は印刷物を媒体として発生しているとのデータもあり、印刷物からの情報漏洩対策が重要になっている。 [0002] In recent years, the digitization of information stored in the company has progressed, and it is expected that the storage by paper documents will decrease in the future. However, since paper documents are not lost, the current situation is that paper and electronic documents are inevitably mixed. In addition, there is concern about the leakage or leakage of personal information such as customer data handled by corporations and public institutions, and the introduction of a more strict management system is required. In fact, there is data that about half of information leaks occur on printed materials, and measures to prevent information leaks from printed materials are important.

[0003] このような状況に対応するためのセキュリティ技術の 1つとして、電子透かしが挙げられる。電子透かしとは、画像、文書、音声等のデータに著作権者名や複製履歴等の情報を埋め込む技術である。会社内で取り扱う重要なデータや会社外へ提供する製品に情報を埋め込むことで、情報漏洩防止、複製防止、原本保証等を可能にする [0003] One of the security technologies for dealing with such a situation is digital watermarking. Digital watermarking is a technology that embeds information such as the copyright holder name and copy history in data such as images, documents, and audio. By embedding important data handled within the company and products provided outside the company, it is possible to prevent information leakage, copy protection, original guarantee, etc.

[0004] 文書データは印刷されて紙文書となるが、その紙文書をスキャナで読み込み、ソフトウエアによって解析することで、埋め込まれた透力し情報を検出することが可能である。この技術を用いて印刷物に印刷者の氏名、 ID、印刷日時等を埋め込むことによつて、万が一印刷物が会社外等に持ち出された場合でも、印刷物のコピーもしくは紙の一部が何らかの形で回収されれば、透力し情報を検出することでだれが印刷したのかを調べることができる。すなわち、印刷物力の情報漏洩元追跡技術としてその応用が期待されている。 [0004] The document data is printed and becomes a paper document. The paper document is read by a scanner and analyzed by software, so that embedded transparency information can be detected. By embedding the printer's name, ID, printing date, etc. in the printed material using this technology, even if the printed material is taken outside the company, a copy of the printed material or a part of the paper is collected in some form. If so, you can check who printed by detecting the information. In other words, it is expected to be applied as a technology for tracking information leakage sources for printed materials.

[0005] このように、電子化されたデータに対する電子透かしについての研究は盛んであり、各種の電子透力しが製品化されている。し力しながら、 2値の文書データ、特に一度印刷出力された紙媒体から、埋め込まれた情報を抽出するのは困難である。地紋透力しと呼ばれる電子透力は、文書の背景に地紋パターンを付加し、そのノターンを微妙に変化させることにより、情報を埋め込む技術である。地紋透力しをセキュリティ用途で用いる場合、地紋透かしが満たすべき性能としては、下記のものが重要であると言われている。 [0005] As described above, research on digital watermarking for digitized data is vigorous, and various types of electronic permeability have been commercialized. However, it is difficult to extract embedded information from binary document data, especially paper media that has been printed once. Electronic transparency, known as tint pattern transparency, is a technology that embeds information by adding a tint block pattern to the background of a document and changing its pattern slightly. It is said that the following is important as the performance that the ground pattern watermark should satisfy when using the ground pattern transparency for security purposes.

(1)コピー耐性 (1) Copy resistance

地紋が付加された文書を数世代コピーした後でも、埋め込まれた透力情報を正確に抽出することが必要である。なぜなら、情報漏洩者が業者等に情報を売却する場合、コピーを渡す場合が多いからである。 Even after copying several generations of a document with a tint block, it is necessary to accurately extract the embedded transparency information. This is because when an information leaker sells information to a business operator, etc., a copy is often given.

(2)埋め込み情報量 (2) Amount of embedded information

十分な量の情報を埋め込める必要がある。すなわち、印刷者の名前、 ID、印刷日時等、犯人を特定するための十分な情報を埋め込める必要がある。 It is necessary to embed a sufficient amount of information. In other words, it is necessary to embed sufficient information for identifying the criminal, such as the name, ID, and date of printing.

(3)切り取り耐性 (3) Cutting resistance

文書の一部力もでも透力情報を検出できる必要がある。切り取られる形は様々であり、文書のどこを切り取っても、透かし情報を検出可能な仕組みを達成する必要がある。 It is necessary to be able to detect the permeability information even with a part of the document. There are various forms to be cut out, and it is necessary to achieve a mechanism that can detect watermark information no matter where the document is cut.

[0006] 下記の特許文献 1の方法では、図 1に示すように、透力し情報を表すシンボルをラスタ走査によって繰り返し挿入することで透カゝし画像を作成し、その後、文書を画像化したものに透カゝし画像を重ねることで、透かし入り文書画像が生成される。 [0006] In the method of Patent Document 1 below, as shown in FIG. 1, a transparent image is created by repeatedly inserting symbols representing the transparent information by raster scanning, and then the document is converted into an image. A watermarked document image is generated by overlaying the transparent image on the converted image.

[0007] この例では、人の名前が埋め込み情報として用いられ、そのバイナリ表現である" 1 , 0, 1, 1, 1, 0, 0, 1, が透力し情報 103として、文書画像 104内に敷き詰められる。透かし情報 103を構成する透かしシンボル 101および 102は、符号" 0"および" 1 "をそれぞれ表す。透かし情報 103の 9個のシンボルを" A, B, C, D, E, F, G, H, I "と表記した場合、例えば、これらのシンボルを図 2に示すように繰り返し挿入することも可能である。 In this example, a person's name is used as embedded information, and its binary representation “1, 0, 1, 1, 1, 0, 0, 1, is transparent and is used as information 103 as a document image 104. The watermark symbols 101 and 102 constituting the watermark information 103 represent the codes “0” and “1”, respectively.The nine symbols of the watermark information 103 are represented by “A, B, C, D, E”. , F, G, H, I ", for example, these symbols can be repeatedly inserted as shown in FIG.

[0008] 透かしシンボルの抽出時には、図 3に示すように、繰り返し挿入されたそれぞれの透力し情報力もシンボルを抽出し、抽出されたシンボルの多数決をとることによって、埋め込み情報が復元される。 [0009] 例えば、 4つの透かし情報から抽出されたシンボル列 301、 302、 303、および 304 において、それぞれ" C"、 "H"、 "E"、および" D"のシンボルが誤抽出された場合でも、多数決をとることにより正しいシンボル列 305が求められる。 At the time of extracting a watermark symbol, as shown in FIG. 3, the embedded information is restored by extracting the symbol of each of the repeatedly inserted transparent information power and taking the majority of the extracted symbols. [0009] For example, in the case of symbol strings 301, 302, 303, and 304 extracted from four watermark information, the symbols "C", "H", "E", and "D" are erroneously extracted, respectively. However, the correct symbol string 305 is obtained by taking the majority vote.

[0010] しかし、このような方法では、透力しシンボルを覆、隠す文字や図等が文書画像に多く含まれている場合に、正確に透カゝしシンボルを抽出し、透かし情報を復元することが困難になる。例えば、図 4に示す文書画像では、文書画像 401の 2行目および 3 行目の透力し情報 103の上に文字が重なるため、透力しシンボルを抽出することができない。 [0010] However, in such a method, when a character image or figure that covers and hides a transparent symbol is included in the document image, the transparent symbol is accurately extracted and the watermark information is restored. Difficult to do. For example, in the document image shown in FIG. 4, since the characters overlap the transparency information 103 in the second and third lines of the document image 401, the transparency symbol cannot be extracted.

[0011] また、ラスタ走査による単純な埋め込みのため、文書画像の任意の切り取り画像から透力情報を検出することは困難である。例えば、図 5に示す文書画像 501では、横方向の切り取り画像 502から抽出されたシンボル列 504には、すべてのシンボルが正しい順序で含まれているため、切り取り画像 502から透かレ f青報 103を検出することは可能である。し力し、縦方向の切り取り画像 503から抽出されたシンボル列 505 にはシンボル" G"が含まれておらず、他のシンボルも正し!/、順序で並んで!/、な!/、ため、切り取り画像 503から透力し情報 103を検出するのは難しい。 [0011] In addition, because of simple embedding by raster scanning, it is difficult to detect permeability information from an arbitrary cut image of a document image. For example, in the document image 501 shown in FIG. 5, all symbols are included in the correct order in the symbol sequence 504 extracted from the horizontal cut image 502. It is possible to detect 103. The symbol sequence 505 extracted from the vertically cut image 503 does not include the symbol “G”, and the other symbols are also correct! /, Arranged in order! /, Na! /, Therefore, it is difficult to detect the information 103 from the cut image 503 with transparency.

[0012] そこで、特許文献 2の方法では、図 6に示すように、印刷物の文字領域を避けながら透力しシンボルを埋め込んでいる。そして、透力し情報の検出時には、文字領域と透かしシンボル領域を判別しながら透かしシンボルを抽出し、抽出されたシンボル列 601および 602に対して多数決を行うことで、正しいシンボル列 603が求められる。 [0012] Therefore, in the method of Patent Document 2, as shown in FIG. 6, the symbol is embedded with transparency while avoiding the character area of the printed matter. When detecting the transparent information, the watermark symbol is extracted while discriminating the character area and the watermark symbol area, and a majority decision is performed on the extracted symbol strings 601 and 602, whereby the correct symbol string 603 is obtained. .

[0013] ただし、この方法では、検出時に文字領域と透力しシンボル領域とを正確に判別することが前提となっている。この判別に失敗すると、図 7に示すように、抽出結果に文字が含まれ、例えば、シンボル列 701および 702が抽出されてしまう。この場合、シンボル列 701に 3つの文字が含まれて!/、るため、本来ならシンボル列 701に含まれるはずのシンボル" G"、 "H"、および "I"が、次のシンボル列 702の先頭に移動している。 However, in this method, it is assumed that the character area and the symbol area are accurately discriminated at the time of detection. If this determination fails, as shown in FIG. 7, characters are included in the extraction result, and for example, symbol strings 701 and 702 are extracted. In this case, the symbol string 701 contains three characters! /, So the symbols “G”, “H”, and “I” that were originally included in the symbol string 701 are the next symbols. Moved to the beginning of column 702.

[0014] このため、 2つのシンボル列の多数決を取る際に、透かし情報 103の先頭 (A)と先頭 (A)、 2番目 (B)と 2番目（B)、 · · ·、 6番目（F)と 6番目（F)のように、正し!/ヽシンポル同士を比較することができず、透かし検出性能の著しい劣化を引き起こす。このような劣化を、便宜上、多数決の位相ずれと呼んでいる。例えば、数世代にわたるコピ一等を行うと、印刷物の劣化は激しぐ文字領域と透力シンボル領域を正確に分離することが難しくなる。 [0014] Therefore, when taking the majority of two symbol sequences, the beginning (A) and the beginning (A) of the watermark information 103, the second (B) and the second (B), ..., the sixth Like (F) and 6th (F), correct! / ヽ symbols cannot be compared with each other, and the watermark detection performance is significantly degraded. Such deterioration is called the majority phase shift for convenience. For example, copy over several generations If it is done for the first time, it will be difficult to accurately separate the character area and the transparency symbol area where the deterioration of the printed matter will be severe.

[0015] シンボルを埋め込む領域に文字が含まれるか否かは、シンボルと同じ面積を持つ領域中に含まれる黒画素の割合により、判定される。一般には、黒画素の割合が大きい領域ほど、透力しが黒画素に重なってしまい、検出性能が劣化すると考えられる。そこで、黒画素の割合が 50%未満であれば、文字が含まれていない (透かしが検出される）と判定し、逆に 50%以上であれば、文字が含まれる（透力しが検出されない）と判定する。 [0015] Whether or not a character is included in the region in which the symbol is embedded is determined by the ratio of black pixels included in the region having the same area as the symbol. In general, it is considered that as the ratio of black pixels is larger, the permeability overlaps with the black pixels and the detection performance deteriorates. Therefore, if the percentage of black pixels is less than 50%, it is determined that characters are not included (watermarks are detected), and conversely if it is 50% or more, characters are included (transmittance is detected). It is not).

[0016] 例えば、図 8の領域 A〜Dに透かしシンボルを埋め込み可能か否かは、黒画素率 R を用いて以下のように判定される。 For example, whether or not a watermark symbol can be embedded in the areas A to D in FIG. 8 is determined using the black pixel rate R as follows.

領域 A:R= 14Z81 < 50%→埋め込み可 Area A: R = 14Z81 <50% → can be embedded

領域 B :R= 14Z81く 50%→埋め込み可 Area B: R = 14Z81 50% → embeddable

領域 C :R=4lZ81 > 50%→埋め込み不可 Area C: R = 4lZ81> 50% → cannot be embedded

領域 D :R=4lZ81 > 50%→埋め込み不可 Area D: R = 4lZ81> 50% → Cannot be embedded

しかし、実際に透力を埋め込んで、検出時に透力しが誤検出される確率は、必ずしも黒画素の割合に比例しな、と、う事実がある。 However, there is a fact that the probability that the permeability is actually detected and the permeability is erroneously detected at the time of detection is not always proportional to the proportion of black pixels.

[0017] 特許文献 2の例では、図 9に示すように、透かしシンボル 901および 902の波の方向によって" 0"または" 1"の符号を対応付けて、透かし検出を行っている。例えば、波の方向が左上がりである透かしシンボル 901には" 0"が対応付けられ、右上がりである透かしシンボル 902には" 1"が対応付けられる。このような透かしシンボルが文字領域と重なった場合、その領域の黒画素率 Rが 50%未満であれば、透力しが検出されると判定され、 50%以上であれば、透力しが検出されないと判定される。 In the example of Patent Document 2, as shown in FIG. 9, a watermark detection is performed by associating a code of “0” or “1” with the wave directions of watermark symbols 901 and 902. For example, “0” is associated with the watermark symbol 901 with the wave direction rising to the left, and “1” is associated with the watermark symbol 902 with the wave rising to the right. When such a watermark symbol overlaps with a character area, if the black pixel ratio R of the area is less than 50%, it is determined that a watermark is detected, and if it is 50% or more, the watermark is detected. It is determined that it is not detected.

[0018] 図 8の領域 A〜Dに図 9の透かしシンボルを埋め込むと、図 10に示すようになる。ただし、領域 Aおよび Cには透かしシンボル 901を埋め込み、領域 Bおよび Dには透かしシンボル 902を埋め込んでいる。それぞれの領域の（）内の数字は、埋め込まれた透かしシンボルに対応する符号を表して、る。 When the watermark symbol of FIG. 9 is embedded in the areas A to D of FIG. 8, the result is as shown in FIG. However, watermark symbols 901 are embedded in areas A and C, and watermark symbols 902 are embedded in areas B and D. The numbers in parentheses in each area represent the code corresponding to the embedded watermark symbol.

[0019] この場合、透かし検出時には、矢印で示すような波の方向が検出される。つまり、黒画素率 Rにより埋め込み可と判定された領域 Aおよび Bのうち、 "0"が埋め込まれた領域 Aからは、 "1"を示す波の方向が検出され、 "1"が埋め込まれた領域 Bからは、 " 0"を示す波の方向が検出される。したがって、実際には、領域 Aには" 0"を示す透かしシンボル 901を埋め込むことはできず、領域 Bには" 1"を示す透かしシンボル 902 を埋め込むことはできな、。 In this case, at the time of watermark detection, the wave direction as indicated by the arrow is detected. In other words, among the areas A and B that are determined to be embeddable by the black pixel ratio R, “0” is embedded. From the region A, the direction of the wave indicating “1” is detected, and from the region B in which “1” is embedded, the direction of the wave indicating “0” is detected. Therefore, in practice, the watermark symbol 901 indicating “0” cannot be embedded in the area A, and the watermark symbol 902 indicating “1” cannot be embedded in the area B.

[0020] これに対して、埋め込み不可と判定された領域 Cおよび Dにおいては、検出される波の方向が埋め込まれた透力しシンボルのそれと同じであるため、埋め込まれた透力しが正確に検出できるという現象が生じる。それぞれの領域に透力しシンボル 901 および 902を実際に埋め込む場合の埋め込み可否をまとめると、以下のようになる。領域 A: "0"→埋め込み不可， "1"→埋め込み可 [0020] On the other hand, in regions C and D that are determined to be non-embedded, the direction of the detected wave is the same as that of the embedded permeable symbol, so the embedded permeable The phenomenon that it can be detected occurs. The following is a summary of the embedding possibility when the symbols 901 and 902 are actually embedded through the respective areas. Area A: "0" → cannot be embedded, "1" → can be embedded

領域 B : "0"→埋め込み可， "1"→埋め込み不可 Area B: "0" → embeddable, "1" → not embedding

領域 C : "0"→埋め込み可， "1"→埋め込み不可 Area C: "0" → embeddable, "1" → not embedding

領域 D : "0"→埋め込み不可， "1"→埋め込み可 Area D: "0" → cannot be embedded, "1" → can be embedded

このように、黒画素率が 50%未満でも埋め込み不可の場合があるし、逆に 50%以上でも埋め込み可の場合がある。黒画素率による透かし埋め込み可否の判定は、上記のような矛盾した問題を含んでおり、透かし検出の安定度の面で問題がある。 Thus, even if the black pixel rate is less than 50%, embedding may not be possible. Conversely, embedding may be possible even if the black pixel ratio is 50% or more. The determination of whether or not to embed a watermark based on the black pixel rate includes the above contradictory problems, and there is a problem in the stability of watermark detection.

[0021] 以上説明した従来技術の問題点をまとめると、次のようになる。 [0021] The problems of the prior art described above are summarized as follows.

(1)透力しシンボルのドットパターンと文書画像の文字、図、写真等の領域が重なる場合に、透かし検出能力が低下する。 (1) The watermark detection capability decreases when the dot pattern of the transparent symbol overlaps the character image, figure, photo, or other area of the document image.

(2)単純に文字、図、写真等の領域を避けて透力シンボルを埋め込んだ場合、透力しシンボルと文字領域等を正確に分類する必要があり、これを誤ると多数決の位相ずれを生じる。このため、透かし検出能力が著しく低下する。 (2) When embedding the transparency symbol simply avoiding the area of characters, diagrams, photographs, etc., it is necessary to accurately classify the transparency symbol and the character area etc. Arise. For this reason, the watermark detection capability is significantly reduced.

(3)文字、図、写真等の領域を避けるための手段として、黒画素率を計算して閾値処理を行うと、透かしシンボルを埋め込める領域であっても、埋め込み不可と判定されたり、逆に埋め込めない領域であっても、埋め込み可と判定される場合があり、上記（ 2)と同様の問題を引き起こす。 (3) As a means to avoid areas such as characters, diagrams, and photographs, if threshold processing is performed by calculating the black pixel ratio, even if the watermark symbol can be embedded, it may be determined that embedding is impossible. Conversely, even in an area that cannot be embedded, it may be determined that embedding is possible, causing the same problem as in (2) above.

(4)透力しシンボルが埋め込まれた文書画像のどの領域を切り出しても、埋め込まれた情報を正確に復元できることが望まれる。つまり、文書画像内に一様に透かレ f青報を埋め込む必要がある。 [0022] 下記の特許文献 3は、複数枚の部分画像を正確な配置で結合することの可能な、情報埋め込み方法および情報抽出方法に関し、非特許文献 1は、モルフォロジ一変換の一種であるエロージョン（erosion )およびダイレーシヨン（dilation)に関する。特許文献 1：特開 2006 - 101161号公報 (4) It is desirable that the embedded information can be accurately restored by cutting out any region of the document image in which the transparent symbol is embedded. In other words, it is necessary to uniformly embed transparent blueprints in the document image. [0022] Patent Document 3 below relates to an information embedding method and an information extraction method capable of combining a plurality of partial images in an accurate arrangement, and Non-Patent Document 1 is an erosion that is a kind of morphological transformation. (Erosion) and dilation. Patent Document 1: Japanese Patent Laid-Open No. 2006-101161

特許文献 2：特開 2003 - 209676号公報 Patent Document 2: Japanese Patent Laid-Open No. 2003-209676

特許文献 3：特開 2005 - 333359号公報 Patent Document 3: Japanese Patent Laid-Open No. 2005-333359

非特許文献 1 :小畑秀文著， "モルフォロジ一"，コロナ社， p. 72- 77, 1996年 11月発明の開示 Non-Patent Document 1: Hidefumi Obata, "Morphology", Corona, p. 72-77, November 1996 Disclosure of Invention

[0023] 本発明の課題は、文字、図、写真等の領域を含む電子文書に地紋透かしを付与する際に、コピー耐性や切り取り耐性を有し、安定した検出が可能な地紋パターンにより、情報を埋め込むことである。 [0023] An object of the present invention is to provide a copy-forgery-inhibited pattern that has copy resistance and cut-out resistance and can be stably detected when a watermark is applied to an electronic document including areas such as characters, diagrams, and photographs. Information is embedded.

[0024] 本発明の第 1の局面において、電子透かし埋め込み装置は、文書画像入力部、透かし変換部、透力埋め込み可否領域判定部、および透力埋め込み部を備え、ドットパターンを用いて文書画像の背景領域に情報を埋め込む。 [0024] In the first aspect of the present invention, the digital watermark embedding device includes a document image input unit, a watermark conversion unit, a permeability embedding area determination unit, and a permeability embedding unit, and uses a dot pattern. Information is embedded in the background area of the document image.

[0025] 文書画像入力部は、文書画像を入力し、透カゝし変換部は、埋め込まれる情報に、それぞれ異なるドットパターンで表現される複数の透力しシンボルを割り当てて、その情報を透力シンボルの列に変換する。透かし埋め込み可否領域判定部は、文書画像内の領域に透かしシンボルを埋め込んで、埋め込まれた透かしシンボルが検出されるか否かをチェックし、透力しシンボルが検出される場合にその領域を埋め込み可と判定し、透力しシンボルが検出されない場合にその領域を埋め込み不可と判定する。 The document image input unit inputs a document image, and the transparency conversion unit assigns a plurality of transparent symbols each represented by a different dot pattern to the embedded information, and transmits the information. Force Converts to a sequence of symbols. The watermark embedding area determination unit embeds a watermark symbol in an area in the document image, checks whether or not the embedded watermark symbol is detected, and if the watermark symbol is detected, the area is determined. Judgment is made that embedding is possible, and if the symbol is not detected due to transparency, the area is judged not to be embedding.

[0026] 透かし埋め込み部は、文書画像内の埋め込み対象領域の位置情報と、透かしシンボルの列力も選択された透力しシンボルの情報を、透かし埋め込み可否領域判定部に渡して判定結果を受け取る。そして、埋め込み可と判定された埋め込み対象領域に選択された透カゝしシンボルを埋め込んで、透かし入り文書画像を生成する。 [0026] The watermark embedding unit passes the position information of the embedding target region in the document image and the information of the transparent symbol, in which the column power of the watermark symbol is also selected, to the watermark embedding availability determining unit, and receives the determination result. . Then, the selected transparent symbol is embedded in the embedding target area determined to be embeddable to generate a watermarked document image.

[0027] 透力し変換部により、埋め込まれる情報が透力しシンボルの列に変換され、透かし埋め込み部により、その透力しシンボルの列が文書画像に埋め込まれる。このとき、透力し埋め込み部は、各透力しシンボルを埋め込み対象領域に埋め込み可能力否かを、透力埋め込み可否領域判定部に問い合わせ、透かし埋め込み可否領域判定部は、その領域にその透力しシンボルを埋め込んでみて、それが検出される力否かをテストする。 [0027] The information to be embedded is transparently converted into a symbol column by the transparency converting unit, and the watermark symbol is embedded in the document image by the watermark embedding unit. At this time, the permeable embedding unit determines whether each permeable symbol can be embedded in the region to be embedded. The watermark embedding / non-embedding area determining unit tests whether or not the force is detected by trying to embed the transparent symbol in the area.

[0028] 透力しシンボルが検出された場合は、埋め込み可の判定結果が透力し埋め込み部に通知され、その領域に透かしシンボルが埋め込まれる。一方、透かしシンボルが検出されなかった場合は、埋め込み不可の判定結果が通知され、透力しシンボルとは異なるドットパターンで表現されるシンボル、または、ランダムに選択された透力しシンボルが、その領域に埋め込まれる。 [0028] When a transparent symbol is detected, a determination result indicating whether embedding is possible is transmitted to the embedding unit, and a watermark symbol is embedded in the area. On the other hand, when a watermark symbol is not detected, a determination result indicating that embedding is not possible is notified, and a symbol expressed by a dot pattern different from the transparent symbol or a randomly selected transparent symbol is displayed. Embedded in that area.

[0029] このように、透力しシンボルを埋め込む際に検出可能力否かのテストを行うことで、文字、図、写真等が多く含まれる文書であっても、確実に検出可能な領域に情報を埋め込むことができる。したがって、印刷文書を数世代コピーした後でも、埋め込まれた情報が安定して検出される。 [0029] In this way, by performing a test of whether or not detectability is possible when embedding a transparent symbol, even a document containing a large number of characters, drawings, photographs, etc. can be reliably detected. Information can be embedded. Therefore, embedded information can be detected stably even after several generations of copies of a printed document.

[0030] 本発明の第 2の局面において、電子透かし埋め込み装置は、文書画像入力部、透かし変換部、透かしパケットィ匕部、および透力し埋め込み部を備え、ドットパターンを用いて文書画像の背景領域に情報を埋め込む。 [0030] In the second aspect of the present invention, the digital watermark embedding device includes a document image input unit, a watermark conversion unit, a watermark packetizer unit, and a watermark embedding unit, and uses a dot pattern as a document image. Embed information in the background area.

[0031] 文書画像入力部は、文書画像を入力し、透カゝし変換部は、埋め込まれる情報に、それぞれ異なるドットパターンで表現される複数の透力しシンボルを割り当てて、その情報を透力しシンボルの列に変換する。透かしパケットィ匕部は、透力しシンボルの列を複数の透力しブロックに分割し、各透力しブロックの先頭に、先頭を示す先頭シンボルと、複数の透かしブロックにおける各透かしブロックの順序を示す 1つ以上のシンボルとを付加して、透かしパケットを生成する。透力し埋め込み部は、複数の透かしブロック力も生成された複数の透かしパケットを文書画像に埋め込む。 [0031] The document image input unit inputs a document image, and the transparency conversion unit assigns a plurality of transparent symbols each represented by a different dot pattern to the embedded information, and transmits the information. And convert it to a sequence of symbols. The watermark packet part divides the transparent symbol row into a plurality of transparent blocks, and at the beginning of each transparent block, a head symbol indicating the head, and the order of the watermark blocks in the plurality of watermark blocks. A watermark packet is generated by adding one or more symbols indicating. The transparent embedding unit embeds a plurality of watermark packets in which a plurality of watermark blocking forces are also generated in a document image.

[0032] 透力し変換部により、埋め込まれる情報が透力しシンボルの列に変換され、透かしパケットィ匕部により、透力しシンボルの列が複数の透力しブロックに分割されて、それぞれ異なる透力しブロックを含む複数の透かしパケットが生成される。そして、透かし埋め込み部により、それらの透かしパケットが文書画像に埋め込まれる。 [0032] The information to be embedded is transparently converted into a symbol string by the permeability converter, and the watermark symbol is divided into a plurality of transparent blocks by the watermark packet part. A plurality of watermark packets including different transparent blocks are generated. The watermark embedding unit embeds these watermark packets in the document image.

[0033] このように、情報を複数の部分に分割して文書画像に埋め込み、それぞれの部分に順序を示す情報を付加しておくことで、紙片し力残って、な、印刷物であっても、すべての部分が含まれて、れば、元の情報を復元することが可能になる。図面の簡単な説明 [0033] As described above, the information is divided into a plurality of parts and embedded in the document image, and information indicating the order is added to each part. , If all parts are included, the original information can be restored. Brief Description of Drawings

[図 1]従来の透かし入り文書画像を示す図である。 FIG. 1 is a diagram showing a conventional watermarked document image.

[図 2]繰り返し挿入された透カゝしシンボルを示す図である。 FIG. 2 is a diagram showing a transparent symbol inserted repeatedly.

[図 3]透力しシンボルの多数決処理を示す図である。 FIG. 3 is a diagram showing the majority processing of a transparent symbol.

圆 4]透力情報の上に重なった文字を示す図である。圆 4] Permeability This is a figure showing the characters superimposed on the information.

圆 5]文書力も切り取られた画像を示す図である。 [5] It is a diagram showing an image in which the document power is also cut off.

圆 6]多数決が成功する場合を示す図である。 [6] This is a diagram showing the case where the majority vote is successful.

圆 7]多数決が失敗する場合を示す図である。 [7] This is a diagram showing a case where the majority vote fails.

[図 8]黒画素率の異なる領域を示す図である。 FIG. 8 is a diagram showing regions with different black pixel ratios.

[図 9]透かしシンボルの波の方向を示す図である。 FIG. 9 is a diagram showing the wave direction of a watermark symbol.

[図 10]黒画素率による埋め込み可否判定を示す図である。 FIG. 10 is a diagram showing embedding possibility determination based on a black pixel rate.

圆 11]第 1の文書画像を示す図である。 [11] FIG. 11 shows a first document image.

[図 12]穴を有する透力しシンボルを示す図である。 FIG. 12 is a diagram showing a permeable symbol having a hole.

[図 13]第 1の透かしシンボルの埋め込み結果を示す図である。 FIG. 13 is a diagram showing a result of embedding a first watermark symbol.

[図 14]第 2の透かしシンボルの埋め込み結果を示す図である。 FIG. 14 is a diagram showing a result of embedding a second watermark symbol.

[図 15]透かしパケットの生成方法を示す図である。 FIG. 15 is a diagram showing a method of generating a watermark packet.

[図 16]透かしパケットの配置方法を示す図である。 FIG. 16 is a diagram showing a method for arranging a watermark packet.

[図 17]第 1の電子透力埋め込み装置の構成図である。 FIG. 17 is a configuration diagram of a first electronic permeability embedding device.

[図 18]透かし変換処理のフローチャートである。 FIG. 18 is a flowchart of watermark conversion processing.

[図 19]透力変換処理を示す図である。 FIG. 19 is a diagram showing a permeability conversion process.

[図 20]透かしパケット化処理のフローチャートである。 FIG. 20 is a flowchart of watermark packetization processing.

[図 21]透力し埋め込み可否領域判定処理のフローチャートである。 FIG. 21 is a flowchart of a permeability and embeddability determination process.

[図 22]透力しシンボルの穴と構造要素を示す図である。 FIG. 22 is a diagram showing a hole and a structural element of a transparent symbol.

[図 23]透かしシンボル埋め込み可の場合を示す図である。 FIG. 23 is a diagram showing a case where a watermark symbol can be embedded.

[図 24]透かしシンボル埋め込み不可の場合を示す図である。 FIG. 24 is a diagram showing a case where a watermark symbol cannot be embedded.

[図 25]第 1の透かし埋め込み処理のフローチャートである。 FIG. 25 is a flowchart of a first watermark embedding process.

[図 26]透力埋め込み処理の第 1の状態を示す図である。 [図 27]透力埋め込み処理の第 2の状態を示す図である。 FIG. 26 is a diagram showing a first state of the permeability embedding process. FIG. 27 is a diagram showing a second state of the permeability embedding process.

[図 28]透力埋め込み処理の第 3の状態を示す図である。 FIG. 28 is a diagram showing a third state of the permeability embedding process.

[図 29]透力埋め込み処理の第 4の状態を示す図である。 FIG. 29 is a diagram showing a fourth state of the permeability embedding process.

[図 30]透力埋め込み処理の第 5の状態を示す図である。 FIG. 30 is a diagram showing a fifth state of the permeability embedding process.

[図 31]透かしシンボルの地紋パターンを示す図である。 FIG. 31 is a diagram showing a background pattern of a watermark symbol.

[図 32]地紋パターンにおける空白領域を示す図である。 FIG. 32 is a diagram showing a blank area in a tint block pattern.

圆 33]構造要素を示す図である。 [33] It is a diagram showing a structural element.

[図 34]エロージョン処理の進行過程を示す図である。 FIG. 34 is a diagram showing the progress of erosion processing.

[図 35]エロージョン処理の結果を示す図である。 FIG. 35 is a diagram showing a result of erosion processing.

[図 36]エロージョン処理の結果のブロック分割を示す図である, FIG. 36 is a diagram showing block division as a result of erosion processing.

[図 37]第 2の電子透力し埋め込み装置の構成図である。 FIG. 37 is a block diagram of a second electronically permeable embedding device.

[図 38]カテゴリー抽出処理のフローチャート（その 1)である。 FIG. 38 is a flowchart (part 1) of category extraction processing.

[図 39]カテゴリー抽出処理のフローチャート（その 2)である。 FIG. 39 is a second flowchart of category extraction processing.

[図 40]カテゴリー抽出処理のフローチャート（その 3)である。 FIG. 40 is a flowchart (part 3) of the category extraction process.

[図 41]カテゴリー抽出処理のフローチャート（その 4)である。 FIG. 41 is a flowchart (part 4) of category extraction processing.

[図 42]第 2の文書画像を示す図である。 FIG. 42 shows a second document image.

[図 43]ラベル画像を示す図である。 FIG. 43 is a diagram showing a label image.

[図 44]透かしパケットを示す図である。 FIG. 44 is a diagram showing a watermark packet.

[図 45]第 1の検査配列を示す図である。 FIG. 45 is a diagram showing a first test arrangement.

[図 46]第 1の AND演算を示す図である。 FIG. 46 is a diagram showing a first AND operation.

[図 47]第 2の AND演算を示す図である。 FIG. 47 is a diagram showing a second AND operation.

[図 48]ラベル画像の第 1の更新結果を示す図である。 FIG. 48 is a diagram showing a first update result of a label image.

[図 49]第 2の検査配列を示す図である。 FIG. 49 is a diagram showing a second test arrangement.

[図 50]第 3の AND演算を示す図である。 FIG. 50 is a diagram showing a third AND operation.

[図 51]第 4の AND演算を示す図である。 FIG. 51 is a diagram showing a fourth AND operation.

[図 52]ラベル画像の第 2の更新結果を示す図である。 FIG. 52 is a diagram showing a second update result of the label image.

[図 53]第 3の検査配列を示す図である。 FIG. 53 is a diagram showing a third test arrangement.

[図 54]第 5の AND演算を示す図である。 [図 55]第 6の AND演算を示す図である。 FIG. 54 is a diagram showing a fifth AND operation. FIG. 55 is a diagram showing a sixth AND operation.

[図 56]カテゴリー分類された文書画像を示す図である。 FIG. 56 is a view showing document images classified into categories.

[図 57]反転前の透かしパケットを示す図である。 FIG. 57 shows a watermark packet before inversion.

[図 58]反転された透かしパケットを示す図である。 FIG. 58 shows an inverted watermark packet.

[図 59]反転透かしパケットの埋め込みを示す図である。 FIG. 59 is a diagram showing embedding a reverse watermark packet.

[図 60]第 2の透かし埋め込み処理のフローチャートである。 FIG. 60 is a flowchart of a second watermark embedding process.

[図 61]ラベル画像と透力生成用画像を用いた埋め込み処理を示す図である。 FIG. 61 is a diagram showing an embedding process using a label image and a permeability generation image.

[図 62]電子透かし検出装置の構成図である。 FIG. 62 is a block diagram of a digital watermark detection apparatus.

[図 63]先頭シンボル探索処理のフローチャートである。 FIG. 63 is a flowchart of head symbol search processing.

[図 64]3つのシンボルに対するエロージョン処理を示す図である。 FIG. 64 is a diagram showing an erosion process for three symbols.

[図 65]パケット切り出し処理のフローチャートである。 FIG. 65 is a flowchart of packet cut-out processing.

[図 66]透かし復元処理のフローチャートである。 FIG. 66 is a flowchart of watermark restoration processing.

[図 67]透かし復元処理を示す図である。 FIG. 67 shows a watermark restoration process.

[図 68]情報処理装置の構成図である。 FIG. 68 is a block diagram of an information processing apparatus.

[図 69]プログラムおよびデータの提供方法を示す図である。 FIG. 69 is a diagram showing a program and data providing method.

発明を実施するための最良の形態 BEST MODE FOR CARRYING OUT THE INVENTION

[0035] 以下、図面を参照しながら、本発明を実施するための最良の形態を詳細に説明する。 Hereinafter, the best mode for carrying out the present invention will be described in detail with reference to the drawings.

本発明の実施形態では、黒画素率の閾値で透かし埋め込み可否を判断するのではなぐ実際に透力情報を埋め込んで検出が可能か否かをテストし、確実に透力情報が検出できる領域のみに透力し情報を埋め込む。これにより、透力しシンボルと文字等の領域が重なる場合でも、安定して透かレ f青報を検出できるようになる。 In the embodiment of the present invention, whether or not the watermark can be embedded is determined based on the threshold value of the black pixel ratio. Instead, it is tested whether or not the transparency information is actually embedded and can be detected, and only the area where the permeability information can be reliably detected. Embed information with transparency. As a result, even when the transparent area is overlapped with the area such as the symbol and the character, the transparent watermark can be detected stably.

[0036] 一例として、図 11のような文章画像に、図 12のような透かしシンボル 1201および 1 202を埋め込む場合について説明する。図 12の透かしシンボルでは、透かしシンポル中に開けられた空白領域（穴）の中心位置とその形状によって、シンボルが区別される。透力しシンボル 1201および 1202は、符号" 0"および" 1"をそれぞれ表している。 As an example, a case will be described in which watermark symbols 1201 and 1202 as shown in FIG. 12 are embedded in a sentence image as shown in FIG. In the watermark symbol in Fig. 12, the symbol is distinguished by the center position and shape of the blank area (hole) opened in the watermark symbol. The permeable symbols 1201 and 1202 represent the signs “0” and “1”, respectively.

[0037] 図 13および図 14は、図 11の文章画像にそれぞれ透かしシンボル 1201および 12 02を埋め込んだ結果を示している。この場合、透かし検出の可否は、透かしシンボル 1201および 1202の穴が文字によって塞がれているか否かに依存している。図 13の例では、領域 Aのみに透かしシンボル 1201を埋め込むことができ、図 14の例では、領域 Cおよび Dのみに透かしシンボル 1202を埋め込むことができることがわ力る。 FIG. 13 and FIG. 14 show the watermark symbols 1201 and 12 respectively on the sentence image of FIG. The result of embedding 02 is shown. In this case, whether or not the watermark can be detected depends on whether or not the holes of the watermark symbols 1201 and 1202 are closed by characters. In the example of FIG. 13, the watermark symbol 1201 can be embedded only in the region A, and in the example of FIG. 14, the watermark symbol 1202 can be embedded only in the regions C and D.

[0038] 透力埋め込み可否のテストは、実際に透力を埋め込んで検出する処理を経なくとも、文書画像のみを用いて、透力しシンボルの穴が文字等によって塞がれるか否かの簡易的なテストを行うことで代替可能である。これにより、テストに要する計算量を肖 IJ減することがでさる。 [0038] Permeability The embedding / non-embedding test determines whether or not a hole in a symbol is filled with characters or the like by using only a document image without actually performing processing for embedding the permeability and detecting it. It can be replaced by a simple test. As a result, the amount of computation required for the test can be reduced by IJ.

[0039] また、図 15に示すように、透力し情報は複数の部分に分割されてパケットィ匕された後に、文書画像に埋め込まれる。まず、右上部に穴を有し、符号" 0"を表す透力シンボル 1501と、左下部に穴を有し、符号" 1"を表す透かしシンボル 1502を用いて、埋め込まれる情報が透カゝし情報 1503に変換される。 Further, as shown in FIG. 15, the permeability information is divided into a plurality of portions and packetized, and then embedded in the document image. First, information to be embedded is transparent using a transparent symbol 1501 having a hole in the upper right portion and representing the sign “0” and a watermark symbol 1502 having a hole in the lower left portion and representing the sign “1”. Converted to fraud information 1503

[0040] 次に、透力レ f青報 1503力それぞれ 3個の透かしシンボルからなる 3つの透かしブロック 1504〜1506に分割される。そして、それぞれの透かしブロックの先頭に、先頭を示すシンボル 1510と、分割された情報の何番目のブロックに相当するのかを示す識別情報 (パケット ID) 1511が付加され、透力しブロックがパケットィ匕される。これにより、 1番目〜 3番目の力しブ Pック 1504〜1506力ら、それぞれ力しノゲット 150 7〜1509力生成される。 [0040] Next, the transparency level f blue information 1503 force is divided into three watermark blocks 1504 to 1506 each consisting of three watermark symbols. Then, at the head of each watermark block, a symbol 1510 indicating the head and identification information (packet ID) 1511 indicating the numbered block of the divided information are added. Is done. As a result, the first to third force blocks 1504 to 1506 are generated, and force nogets 150 7 to 1509 are generated.

[0041] 図 16に示すように、パケットィ匕された情報を、乱数によって決められる順序に従って文書画像 1601にランダムに配置することによって、透力し情報が文書画像 1601のどの領域にも一様に埋め込まれ、どの部分を切り出しても透力情報を検出できるようになる。したがって、横方向の切り取り画像のみならず、縦方向の切り取り画像からも透かレ隋報を検出することが可能である。 [0041] As shown in FIG. 16, the packetized information is randomly placed on the document image 1601 according to the order determined by the random numbers, so that the information is uniformly transmitted to any region of the document image 1601. It is embedded and the permeability information can be detected by cutting out any part. Therefore, it is possible to detect the transparency information not only from the cropped image in the horizontal direction but also from the cropped image in the vertical direction.

[0042] また、透かしパケットの先頭を示すシンボル 1510として一意のドットパターンを採用することで、透力し情報と文字領域等を正確に判別することができ、検出時の多数決で位相のずれを生じることなぐ透力し情報を安定して検出することが可能になる。 [0042] In addition, by adopting a unique dot pattern as the symbol 1510 indicating the head of the watermark packet, it is possible to accurately determine the transparent information and the character area, etc., and the phase shift is determined by majority decision at the time of detection. It is possible to stably detect information that does not occur.

[0043] 透かしパケットは、文字、図、写真等が存在する領域にも埋め込むことが可能である力透力し埋め込み可否のテストを行った後、埋め込み可と判定された領域のみに埋め込んでもよい。 [0043] The watermark packet can be embedded even in an area where characters, drawings, photographs, etc. exist. May be embedded.

[0044] 透かしパケットにカ卩えて、透かしパケットの管理情報を表す透力し管理パケット 151 2を、文書画像 1601に埋め込んでおくことも有効である。透力し管理パケット 1512は、先頭を示すシンボル 1513と、透力し管理パケットを示す識別情報 (パケット ID) 15 14と、 1つの透かしブロックのサイズ 1515と、透かし情報の分割数（ブロック数） 1516 とを含む。 It is also effective to embed a transparency management packet 1512 representing the management information of the watermark packet in the document image 1601 in addition to the watermark packet. The transparent management packet 1512 includes a symbol 1513 indicating the head, identification information (packet ID) 15 14 indicating the transparent management packet, the size 1515 of one watermark block, and the number of watermark information divisions (number of blocks). Including 1516.

[0045] このような透力し管理パケット 1512を埋め込んでおくことで、透力し情報の分割数やブロックサイズ等の埋め込み方法を文書毎に変更することが可能になる。検出時には、このパケットを解読することにより、複数の透かしパケットを組み合わせて、埋め込まれた透かし情報を復元できる。 By embedding such a permeability management packet 1512, it is possible to change the embedding method such as the number of divisions and block size of the permeability information for each document. At the time of detection, the embedded watermark information can be restored by decoding this packet and combining multiple watermark packets.

[0046] 図 17は、このような透かしパケットを用いた電子透力し埋め込み装置の構成例を示している。この電子透力埋め込み装置は、文書画像入力部 1701、透かし変換部 1 702、透かしパケットィ匕部 1703、透力し埋め込み可否領域判定部 1704、および透力埋め込み部 1705を備える。以下では、各部の動作を具体的に説明する。 FIG. 17 shows a configuration example of an electronically permeable embedding device using such a watermark packet. This electronic permeability embedding device includes a document image input unit 1701, a watermark conversion unit 1702, a watermark packet unit 1703, a permeability and embedding area determination unit 1704, and a transparency embedding unit 1705. Below, operation | movement of each part is demonstrated concretely.

[1]文書画像入力部 [1] Document image input section

文書画像入力部 1701は、例えば、特許出願の明細書のような文書をビットマップ等に変換して得られる、文書画像を入力する。 Windows (登録商標)オペレーティングシステム等では、文書の印刷時に MS Officeや PowerPoint (登録商標）等のデータは、 EMF (Enhanced Metafile Format)フォーマットと呼ばれる印刷データに変換される。したがって、 EMFフォーマットをー且画像化して得られた文書画像を入力することが可能である。 The document image input unit 1701 inputs a document image obtained by converting a document such as a specification of a patent application into a bitmap or the like, for example. In the Windows (registered trademark) operating system, data such as MS Office and PowerPoint (registered trademark) is converted to print data called EMF (Enhanced Metafile Format) format when printing a document. Therefore, it is possible to input a document image obtained by converting the EMF format into an image.

[2]透かし変換部 [2] Watermark converter

透力変換部 1702は、文書画像中に埋め込まれる秘密情報を電子透かしィ匕する。埋め込まれる秘密情報は、 N種類の符号力なる符号列に変換され、それぞれの符号に対応する透力しシンボルを割り当てることで、透力し情報が生成される。以下の説明では、簡単のため、 N = 2としている力 Nは 2以上の整数であれば何でもよい。 N= 2の場合は、秘密情報は" 0"ど' 1"の 2進数で表現され、一般には N進数で表現される。 [0047] 図 18は、このような透かし変換処理のフローチャートである。透かし変換部 1702は、まず、図 19に示すように、符号" 0"および" 1"を表す透かしシンボル 1501および 1 502と、先頭を表す先頭シンボル 1901を入力する（ステップ 1801)。次に、秘密情報を" 1, 0, 1, 1, 1, 0, 0, 1, 1"のような符号列に変換し、各符号を対応する透かしシンボルの画像に置き換えて、透力し情報 1503を生成する (ステップ 1802)。 The transparency conversion unit 1702 digitally watermarks confidential information embedded in the document image. The secret information to be embedded is converted into a code string having N types of code powers, and by assigning a power symbol corresponding to each code, the power information is generated. In the following explanation, for simplicity, N = 2 force N can be any integer greater than or equal to 2. In the case of N = 2, the secret information is expressed in binary numbers of “0” or “1”, and is generally expressed in N-digit numbers. FIG. 18 is a flowchart of such a watermark conversion process. First, as shown in FIG. 19, watermark conversion section 1702 inputs watermark symbols 1501 and 1502 representing codes “0” and “1” and leading symbol 1901 representing the beginning (step 1801). Next, the secret information is converted into a code string such as “1, 0, 1, 1, 1, 0, 0, 1, 1”, and each code is replaced with a corresponding watermark symbol image. Information 1503 is generated (step 1802).

[3]透かしパケット化部 [3] Watermark packetizer

図 20は、透かしパケットィ匕部 1703が行うパケットィ匕処理のフローチャートである。透かしパケットィ匕部 1703は、まず、透力し情報 1503を、例えば、 3シンボル（3ビット）ずつのブロックに分割し、透力しブロック 1504〜 1506を生成する（ステップ 2001)。 FIG. 20 is a flowchart of the packet key process performed by the watermark packet key unit 1703. The watermark packet unit 1703 first divides the transparency information 1503 into, for example, three blocks (3 bits), and generates the transparency blocks 1504 to 1506 (step 2001).

[0048] 次に、それぞれの透かしブロックに、先頭シンボル 1901の画像を付カ卩し、 2ビットのパケット IDを透かしシンボルの画像に置き換えて、先頭シンボル 1901と透かしブロックの【こ挿人する（ステップ 2002)。こうして、ノケッ卜 ID1511、 1911、および 1912 をそれぞれ含む、透力しノゲット 1507、 1508、および 1509力 S生成される。 [0048] Next, an image of the leading symbol 1901 is attached to each watermark block, the 2-bit packet ID is replaced with the watermark symbol image, and the leading symbol 1901 and the watermark block are inserted. (Step 2002). In this way, the permeable forces 1507, 1508, and 1509 forces S are generated, including the socket IDs 1511, 1911, and 1912, respectively.

[0049] 次に、図 15に示したような透かし管理パケット 1512を生成する（ステップ 2003)。 Next, a watermark management packet 1512 as shown in FIG. 15 is generated (step 2003).

透かし管理パケット 1512には、ブロックサイズ 1515として 3 (2進表現： 11)が設定され、分割数 1516として 3 (2進表現： 11)が設定されている。したがって、元の透かし情報 1503が 3つの透かしパケットに分割され、各透かしパケットには 3シンボルの長さを持つ透力しブロックが含まれて、ることを示して、る。 In the watermark management packet 1512, 3 (binary representation: 11) is set as the block size 1515, and 3 (binary representation: 11) is set as the division number 1516. Therefore, it is shown that the original watermark information 1503 is divided into three watermark packets, and each watermark packet includes a transparent block having a length of 3 symbols.

[4]透かし埋め込み可否領域判定部 [4] Watermark embedding area determination unit

透力し埋め込み可否領域判定部 1704は、図 10、図 13、および図 14に示したように、文書画像に透力しシンボルを埋め込んだ場合、検出時にそれが正確に検出可能か否かを判断する。 As shown in FIGS. 10, 13, and 14, the transparent and embeddable / unembedable area determination unit 1704 determines whether or not it can be accurately detected at the time of detection when a transparent image is embedded in a document image. Judging.

[0050] 図 21は、このような埋め込み可否領域判定処理のフローチャートである。この処理では、例えば、図 22に示すように、 m画素 X n画素の大きさのシンボル領域を 4分割し、分割された領域内の穴の位置により、透かしシンボル 1501または 1502が埋め込まれる。逆に、透かし情報の検出時には、穴の位置を検出し、その位置情報に基づいて埋め込まれた透かしシンボルの値が判定される。 FIG. 21 is a flowchart of such an embeddability determination process. In this process, for example, as shown in FIG. 22, a symbol area having a size of m pixels × n pixels is divided into four, and a watermark symbol 1501 or 1502 is embedded depending on the position of the hole in the divided area. Conversely, when detecting watermark information, the position of the hole is detected, and the value of the embedded watermark symbol is determined based on the position information.

[0051] 透力し埋め込み可否領域判定部 1704は、まず、実際に、シンボルを埋め込む位置にシンボルを埋め込んでみる（ステップ 2101)。このとき、文書画像およびシンポル画像中の黒画素の値を" 0"、白画素の値を" 1"として、文書画像および埋め込まれたシンボル画像の論理積 (AND)演算が行われる。 [0051] The transparent and embeddable area determination unit 1704 first determines the position in which the symbol is actually embedded. Try embedding a symbol in the box (step 2101). At this time, the logical product (AND) operation of the document image and the embedded symbol image is performed by setting the value of the black pixel in the document image and the symbol image to “0” and the value of the white pixel to “1”.

[0052] 例えば、図 23に示すように、文書画像 2301のシンボル埋め込み位置 Qに、透かしシンボル 1501を埋め込む場合を考えると、 AND演算の結果として画像 2301が得られる。同様にして、シンボル埋め込み位置 Qに透かしシンボル 1502を埋め込む場合、図 24に示すように、 AND演算により画像 2401が得られる。 For example, as shown in FIG. 23, considering a case where a watermark symbol 1501 is embedded at a symbol embedding position Q of a document image 2301, an image 2301 is obtained as a result of an AND operation. Similarly, when the watermark symbol 1502 is embedded at the symbol embedding position Q, an image 2401 is obtained by AND operation as shown in FIG.

[0053] 次に、得られた画像に対して、モルフォロジ一変換の一種であるエロージョン処理を行う（ステップ 2102)。このとき、図 22に示すような、構造要素と呼ばれる図形 220 1を用いて各画素の値を変換し、実際に埋め込まれたシンボルの穴の位置を検出する。 Next, an erosion process, which is a type of morphological transformation, is performed on the obtained image (step 2102). At this time, the value of each pixel is converted using a graphic 2201 called a structural element as shown in FIG. 22, and the position of the hole of the symbol that is actually embedded is detected.

[0054] エロージョン処理は、 AND演算により得られた画像に対して、シンボル埋め込み位置 Q力も透かしシンボル画像の面積の範囲で行われる。そして、構造要素 2201をシンボル画像の範囲内で移動させながら、構造要素 2201全体がすっぽり入る空白領域を検出する。構造要素 2201がそつくり入る空白領域では、構造要素 2201の中心の描く軌跡が白くなり、他の画素は黒く塗りつぶされる。このような演算処理は、モルフォロジー数学においてエロージョン演算と呼ばれ、例えば、前述した非特許文献 1 に記載されている。 In the erosion process, the symbol embedding position Q force is also applied to the image obtained by the AND operation within the range of the area of the watermark symbol image. Then, while moving the structural element 2201 within the range of the symbol image, a blank area in which the entire structural element 2201 completely enters is detected. In the blank area into which the structural element 2201 enters, the locus drawn by the center of the structural element 2201 becomes white, and the other pixels are painted black. Such calculation processing is called erosion calculation in morphology mathematics, and is described in, for example, Non-Patent Document 1 described above.

[0055] 図 23の画像 2301に対して構造要素 2201を用いたエロージョン処理を行うと、画像 2302力得られ、図 24の画像 2401に対して同様のエロージョン処理を行うと、画像 2402が得られる。 When the erosion process using the structural element 2201 is performed on the image 2301 in FIG. 23, an image 2302 force is obtained, and when the same erosion process is performed on the image 2401 in FIG. 24, the image 2402 is obtained. can get.

[0056] 次に、エロージョン処理により得られた画像における空白領域の位置座標から、埋め込まれた透カゝしシンボルの値を識別し (ステップ 2103)、埋め込まれた値と検出された値を比較する (ステップ 2104)。そして、両者が一致すれば埋め込み可と判定し (ステップ 2105)、一致しなければ埋め込み不可と判定する (ステップ 2106)。 [0056] Next, the value of the embedded transparent symbol is identified from the position coordinates of the blank area in the image obtained by the erosion process (step 2103), and the embedded value is detected. Compare the values (step 2104). If they match, it is determined that embedding is possible (step 2105), and if they do not match, it is determined that embedding is impossible (step 2106).

[0057] 図 23の画像 2302では、空白領域 2303がシンボル画像の右上部に存在するので、シンボルの値は" 0"と識別される。この値は、埋め込まれた透かしシンボル 1501の値と一致するため、埋め込み可と判定される。 [0058] これに対して、図 24の画像 2402では、シンボル画像の範囲内に空白領域が存在しないので、シンボルの値は" 0"でもなぐ "1"でもないと判定される。この場合、シンボルの値は埋め込まれた透かしシンボル 1502の値と一致しないため、埋め込み不可と判定される。 In the image 2302 of FIG. 23, since the blank area 2303 exists in the upper right part of the symbol image, the value of the symbol is identified as “0”. Since this value matches the value of the embedded watermark symbol 1501, it is determined that embedding is possible. On the other hand, in the image 2402 in FIG. 24, since there is no blank area in the range of the symbol image, it is determined that the symbol value is neither “0” nor “1”. In this case, since the value of the symbol does not match the value of the embedded watermark symbol 1502, it is determined that embedding is impossible.

[0059] なお、図 21のステップ 2102において、エロージョン処理の代わりに、 AND演算により得られた画像を単純に走査して、白画素で構成される空白領域の有無をチェックしてちよい。 Note that, in step 2102 of FIG. 21, instead of erosion processing, the image obtained by the AND operation may be simply scanned to check for the presence of a blank area composed of white pixels.

[5]透かし埋め込み部 [5] Watermark embedding part

図 25は、透かし埋め込み部 1705が行う埋め込み処理のフローチャートである。透力し埋め込み部 1705は、まず、透かしパケットィ匕部 1703により生成された複数の透力しパケットの中から 1つをランダムに選択する（ステップ 2501)。ここでは、図 19の力しノゲッ卜 1507〜1509の中力ら、 f列; tは、、力しノゲッ卜 1507力 S}gキ尺される。 FIG. 25 is a flowchart of the embedding process performed by the watermark embedding unit 1705. The transparent embedding unit 1705 first randomly selects one of the plurality of transparent packets generated by the watermark packet unit 1703 (step 2501). Here, the intermediate force and the like in FIG. 19, the force column 1507 to 1509, f row; t is the force force 1507 force S} g scale.

[0060] 次に、文書画像中で透力し情報を埋め込もうとしている位置に着目し、その位置に先頭シンボルが埋め込み可能か否かをチェックする（ステップ 2502)。具体的には、先頭シンボル 1901とシンボル埋め込み位置の情報を、透かし埋め込み可否領域判定部 1704に渡して、埋め込み可否の判定を依頼し、判定結果を受け取る。 Next, paying attention to the position where the information is to be embedded in the document image, it is checked whether or not the leading symbol can be embedded at that position (step 2502). Specifically, the head symbol 1901 and the symbol embedding position information are passed to the watermark embedding availability determination unit 1704 to request the embedding permission and receive the determination result.

[0061] 例えば、図 26に示すようなシンボル埋め込み位置 Qの情報を受け取ると、透かし埋め込み可否領域判定部 1704は、図 21の処理により実際に先頭シンボル 1901を埋め込み、その位置 Qに埋め込み可能力否かを判定する。 For example, when information on symbol embedding position Q as shown in FIG. 26 is received, watermark embedding availability determining unit 1704 actually embeds the first symbol 1901 by the process of FIG. It is determined whether or not it is possible to embed.

[0062] 次に、透かし埋め込み可否領域判定部 1704から受け取った判定結果をチックする（ステップ 2503)。判定結果が埋め込み可であれば、図 26に示すように、シンボル埋め込み位置 Qに先頭シンボル 1901を埋め込む（ステップ 2505)。 Next, the determination result received from the watermark embedding availability determining unit 1704 is checked (step 2503). If the determination result is embeddable, the leading symbol 1901 is embedded at the symbol embedding position Q as shown in FIG. 26 (step 2505).

[0063] 次に、シンボル埋め込み位置 Qを 1シンボル分右に移動し、選択された透かしパケットの次のシンボルが埋め込み可能か否かをチェックする（ステップ 2506)。このとき、透力しノケッ卜 1507の先頭シンポノレ 1901に続く透力しシンポノレ 1501と、更新されたシンボル埋め込み位置 Qの情報力透力し埋め込み可否領域判定部 1704に渡される。 Next, the symbol embedding position Q is moved to the right by one symbol, and it is checked whether or not the next symbol of the selected watermark packet can be embedded (step 2506). At this time, the permeable symphonor 1501 following the leading symphonor 1901 of the permeable pad 1507 and the updated information power of the symbol embedding position Q are transmitted to the embeddability determination unit 1704.

[0064] 次に、透かし埋め込み可否領域判定部 1704から受け取った判定結果をチェックする（ステップ 2507)。判定結果が埋め込み可であれば、図 27に示すように、シンボル埋め込み位置 Qに透かしシンボル 1501を埋め込む（ステップ 2508)。 Next, the determination result received from the watermark embedding availability determining unit 1704 is checked. (Step 2507). If the determination result can be embedded, a watermark symbol 1501 is embedded at the symbol embedding position Q as shown in FIG. 27 (step 2508).

[0065] 次に、透かしパケット全体を埋め込んだか否かをチェックし (ステップ 2509)、未処理のシンボルが残っていれば、ステップ 2506以降の処理を繰り返す。そして、 1つの透かしパケットの埋め込みが完了すると、ステップ 2501以降の処理を繰り返す。 Next, it is checked whether or not the entire watermark packet has been embedded (step 2509). If unprocessed symbols remain, the processing from step 2506 is repeated. When the embedding of one watermark packet is completed, the processing after step 2501 is repeated.

[0066] 1つの透かしパケットの途中で、図 28に示すように、透かしシンボルを埋め込めない場合は、ステップ 2507において埋め込み不可の判定結果を受け取る。そこで、その透かしパケットの埋め込みを中止して、ステップ 2501以降の処理を繰り返す。これにより、埋め込めなかった位置から処理が再開され、新たに選択された透かしパケットが埋め込み可能力否かがチヱックされる。 If it is not possible to embed a watermark symbol in the middle of one watermark packet as shown in FIG. 28, a determination result indicating that embedding cannot be performed is received in step 2507. Therefore, the embedding of the watermark packet is stopped, and the processing after step 2501 is repeated. As a result, the processing is resumed from the position where the watermark could not be embedded, and whether or not the newly selected watermark packet can be embedded is checked.

[0067] ステップ 2503において、先頭シンボル 1901の判定結果が埋め込み不可であっても、図 29に示すように、その位置 Qに先頭シンボル 1901を強制的に挿入する（ステップ 2504)。そして、ステップ 2501に戻って新たな透かしパケットを選択した後、再度、先頭シンボルの挿入を試みる。したがって、図 30に示すように、複数の先頭シンボル 1901が連続して埋め込まれる場合もある。 In step 2503, even if the determination result of the leading symbol 1901 cannot be embedded, the leading symbol 1901 is forcibly inserted at the position Q as shown in FIG. 29 (step 2504). Then, after returning to step 2501 and selecting a new watermark packet, an attempt is made to insert the first symbol again. Therefore, as shown in FIG. 30, a plurality of head symbols 1901 may be embedded continuously.

[0068] 以上の処理を繰り返すことによって、透力しシンボルが検出可能な領域には、透かしパケットの一部もしくは全部が埋め込まれ、透力しシンボルが検出不可能な領域には、先頭シンボルのみが埋め込まれる。したがって、先頭シンボルから次の先頭シンボルまでの範囲に埋め込まれた透かしシンボルを抽出すれば、透かしパケットの情報を検出することができる。 [0068] By repeating the above process, a part or all of the watermark packet is embedded in the area where the transparent symbol can be detected, and the area where the transparent symbol cannot be detected is embedded at the head. Only symbols are embedded. Therefore, if the watermark symbol embedded in the range from the first symbol to the next first symbol is extracted, the information of the watermark packet can be detected.

[0069] なお、ステップ 2504において、先頭シンボル 1901の代わりに、透かしシンボル 15 01および 1502のいずれかをランダムに選択して埋め込んでもよぐこれらのシンポルとは異なるドットパターンを持つシンボルを埋め込んでもよい。 [0069] In step 2504, instead of the first symbol 1901, either one of the watermark symbols 1501 and 1502 may be randomly selected and embedded, or a symbol having a dot pattern different from these symbols may be embedded. Good.

[0070] ここで、図 31から図 36までを参照しながら、図 21のエロージョン処理による透かしシンボルの識別方法について、より具体的に説明する。 Here, with reference to FIGS. 31 to 36, the watermark symbol identifying method by the erosion process of FIG. 21 will be described more specifically.

数理形態学にぉ、て、集合 Xと集合 Yのミンコフスキー和もしくはミンコフスキー差を求める際、集合 Xを処理対象の図形とした場合の集合 Yを、構造要素と呼ぶ。構造要素は行列や関数として扱うことが可能であり、本実施形態では構造要素を行列として扱う。視覚的には、所定形状を有する小領域として構造要素を表現することができる In mathematical morphology, when obtaining the Minkowski sum or Minkowski difference between set X and set Y, set Y when set X is the figure to be processed is called a structural element. Structural elements can be handled as matrices and functions. In this embodiment, structural elements are used as matrices. deal with. Visually, a structural element can be expressed as a small region having a predetermined shape

[0071] エロージョンとは、画像中の着目画素を中心に構造要素を配置して、構造要素の定義域に対応する範囲内の画素の輝度を求め、着目画素の輝度をそれらの輝度の最小値で置き換える操作である。 [0071] Erosion is the arrangement of structural elements around the pixel of interest in the image to determine the luminance of the pixels within the range corresponding to the definition area of the structural element, and the luminance of the pixel of interest is the minimum value of those luminances. This is a replacement operation.

[0072] 図 31に示す 4つの地紋パターン 3101〜3104は、それぞれ異なる透かしシンボルを表している。これらの地紋パターン内には、図 32に示すように、パターン毎に特徴の異なる空白領域が形成されるように複数のドットが配置されている。ここで、空白領域の特徴とは、空白領域の数、面積、形状、画素の平均値等を意味する。各地紋パターンに対してエロージョン処理を行うことで、ノターン内部の空白領域を検出することがでさる。 [0072] The four copy-forgery-inhibited pattern patterns 3101 to 3104 shown in FIG. 31 represent different watermark symbols. In these tint block patterns, as shown in FIG. 32, a plurality of dots are arranged so that blank areas having different features are formed for each pattern. Here, the characteristics of the blank area mean the number, area, shape, average value of pixels, etc. of the blank area. By performing erosion processing on each pattern pattern, it is possible to detect blank areas inside the no pattern.

[0073] なお、文書画像全体の濃度を一様にして、人の目で地紋パターンを判別しに《するために、それぞれの地紋パターンを構成するドットの数は同じにすることが望ましい図 31の 4つの地紋パターンから、図 32のような円で囲まれた部分の空白領域を抽出する場合、図 33に示すような菱形の構造要素 3301を用いてエロージョン処理が行われる。例えば、地紋パターン 3101に対してエロージョン処理を行うと、図 34に示すように、図 32に示した左上部の空白領域を含む変換結果 3501が得られる。 It should be noted that in order to make the density of the entire document image uniform and to discriminate the tint block pattern with human eyes, it is desirable that the number of dots constituting each tint block pattern be the same. When extracting a blank area surrounded by a circle as shown in FIG. 32 from the four tint block patterns of 31, erosion processing is performed using a diamond-shaped structural element 3301 as shown in FIG. For example, when the erosion process is performed on the tint block pattern 3101, as shown in FIG. 34, a conversion result 3501 including the blank area at the upper left shown in FIG. 32 is obtained.

[0074] この処理では、構造要素 3301の中心を基準位置として、その位置を透力しシンポル画像中の着目画素に重ねて構造要素 3301が配置される。そして、構造要素の定義域内の画素の輝度が求められ、着目画素の輝度がそれらの輝度の最小値で置き換えられる。構造要素 3301により画像全体をラスタ走査しながら、このような置換を繰り返すことで、変換結果 3501が得られる。 In this processing, with the center of the structural element 3301 as a reference position, the structural element 3301 is arranged so as to be transparent and overlapped with the pixel of interest in the symbol image. Then, the luminance of the pixels within the definition area of the structural element is obtained, and the luminance of the pixel of interest is replaced with the minimum value of those luminances. The conversion result 3501 is obtained by repeating such replacement while raster scanning the entire image by the structural element 3301.

[0075] 同様にして、地紋パターン 3102、 3103、および 3104からは、図 35に示すような変換結果 3502、 3503、および 3504力それぞれ得られる。 Similarly, conversion results 3502, 3503, and 3504 forces as shown in FIG. 35 are obtained from the tint block patterns 3102, 3103, and 3104, respectively.

これらの変換結果から、構造要素 3301を用いることで、地紋パターン 3101〜310 4内の一定サイズ未満の空白領域は黒く塗りつぶされ、一定サイズ以上の空白領域のみを検出できることがわかる。こうして得られた画像の変換結果において、シンボル画像の範囲内における空白領域の位置情報に基づいて、シンボルの値が識別される。 From these conversion results, it can be seen that by using the structural element 3301, blank areas of a background pattern of less than a certain size in the background pattern 3101 to 3104 are painted black, and only blank areas of a certain size or more can be detected. In the conversion result of the image obtained in this way, the symbol The symbol value is identified based on the position information of the blank area within the image.

[0076] 具体的には、シンボル領域を複数のブロックに分割し、各ブロックに含まれる空白領域の数に基づいてシンボルを識別する。図 35に示した変換結果の場合は、図 36 に示すように、シンボル領域が 2 X 2のブロックに分割され、空白領域が存在するプロックの位置力もシンボルが識別される。 [0076] Specifically, the symbol area is divided into a plurality of blocks, and the symbols are identified based on the number of blank areas included in each block. In the case of the conversion result shown in FIG. 35, as shown in FIG. 36, the symbol area is divided into 2 × 2 blocks, and the symbol is also identified by the position power of the block in which the blank area exists.

[0077] ところで、モルフォロジ一変換によれば、地紋パターンの空白領域の代わりに、輝度の小さな黒色領域を抽出することも可能である。地紋パターンがパターン毎に特徴の異なる黒色領域を有する場合、パターン内部の黒色領域を抽出することで、埋め込まれた情報を検出することができる。 By the way, according to the morphological transformation, it is also possible to extract a black region having a low luminance instead of the blank region of the tint block pattern. When the tint block pattern has a black area having a different characteristic for each pattern, the embedded information can be detected by extracting the black area inside the pattern.

[0078] これらの地紋パターンから黒色領域を検出する場合、エロージョンの代わりにダイレーシヨンが用いられる。ダイレーシヨンとは、画像中の着目画素を中心に構造要素を配置して、構造要素の定義域に対応する範囲内の画素の輝度を求め、着目画素の輝度をそれらの輝度の最大値で置き換える操作である。 When a black area is detected from these tint block patterns, a dilation is used instead of erosion. Dilation is an operation in which structural elements are placed around the pixel of interest in the image, the luminance of pixels within the range corresponding to the definition area of the structural element is obtained, and the luminance of the pixel of interest is replaced with the maximum value of those luminances. It is.

[0079] 次に、文書画像中の領域を複数のカテゴリーに分類し、カテゴリー毎に透力し情報の埋め込み方法を変更する実施形態について説明する。領域に応じた埋め込み方法を採用することにより、透力情報を効率的に埋め込むことができる。また、透かし情報の検出時にも、より安定した透かし検出が可能となり、計算量も削減される。 Next, an embodiment will be described in which an area in a document image is classified into a plurality of categories and the information embedding method is changed for each category to change the information embedding method. By adopting the embedding method according to the area, the permeability information can be embedded efficiently. In addition, more stable watermark detection is possible when detecting watermark information, and the amount of calculation is reduced.

[0080] 図 37は、このようなカテゴリー分類を用いた電子透力し埋め込み装置の構成例を示している。この電子透力埋め込み装置は、文書画像入力部 3701、透かし変換部 3702、透かしパケットィ匕部 3703、透力し埋め込み可否領域判定部 3704、透かしカテゴリー抽出部 3705、および透力し埋め込み部 3706を備える。 FIG. 37 shows an example of the configuration of an electronic permeability embedding device using such a category classification. This electronic transparency embedding device includes a document image input unit 3701, a watermark conversion unit 3702, a watermark packet input unit 3703, a transparent and embedding area determination unit 3704, a watermark category extraction unit 3705, and a transparent embedding unit 3706. .

[0081] このうち、文書画像入力部 3701、透かし変換部 3702、透かしパケットィ匕部 3703、および透力埋め込み可否領域判定部 3704の動作は、図 17の文書画像入力部 17 01、透かし変換部 1702、透かしパケットィ匕部 1703、および透かし埋め込み可否領域判定部 1704の動作と同様である。以下では、透かしカテゴリー抽出部 3705および透力埋め込み部 3706の動作を具体的に説明する。 Among these, the operations of the document image input unit 3701, the watermark conversion unit 3702, the watermark packet unit 3703, and the transparency embedding / non-permission area determination unit 3704 are the same as those of the document image input unit 1701 and the watermark conversion unit 1702 in FIG. The operations of the watermark packet unit 1703 and the watermark embedding availability determination unit 1704 are the same. Hereinafter, the operations of the watermark category extraction unit 3705 and the permeability embedding unit 3706 will be specifically described.

[1]透かしカテゴリー抽出部透かしカテゴリー抽出部 3705は、文書画像を参照しながら、シンボルを埋め込める領域をカテゴリ一別に分類する。各領域は、例えば、次の 4つのカテゴリーに分類される。 [1] Watermark category extraction unit The watermark category extraction unit 3705 classifies the area in which the symbol can be embedded by category while referring to the document image. Each area is classified into the following four categories, for example.

カテゴリー 1：全種類の透かしパケットを挿入可能な領域 Category 1: Area where all types of watermark packets can be inserted

カテゴリー 2： 1種類の透かしパケットを完全に挿入可能な領域 Category 2: Area where one type of watermark packet can be completely inserted

カテゴリー 3： 1種類の透かしパケットの一部を挿入可能な領域 Category 3: Area where a part of one kind of watermark packet can be inserted

カテゴリー 4：上記、ずれのカテゴリーにも該当しな、領域 Category 4: Areas that do not fall under the above category

カテゴリー 3の領域としては、少なくとも、透かしパケットの先頭シンボルおよびパケッ HD力なるヘッダ部を挿入可能な領域が抽出される。ヘッダ部に加えて、透かしブロック内の 1つ以上のシンボルが埋め込み可能な領域も、カテゴリー 3に分類される。 As the category 3 area, at least the area where the header symbol which is the head symbol of the watermark packet and the packet HD force can be inserted is extracted. In addition to the header part, the area where one or more symbols in the watermark block can be embedded is also classified as Category 3.

[0082] 図 38〜図 41は、透かしカテゴリー抽出部 3705が行うカテゴリー抽出処理のフローチャートである。この処理では、文書画像が上記カテゴリーに従って領域分割され、それぞれの領域のカテゴリーを示すラベル画像が生成される。 FIG. 38 to FIG. 41 are flowcharts of category extraction processing performed by the watermark category extraction unit 3705. In this process, the document image is divided into regions according to the categories, and a label image indicating the category of each region is generated.

[0083] 透かしカテゴリー抽出部 3705は、まず、図 42に示すように、文書画像の大きさを k 画素 X I画素、透かしシンボルの大きさを m画素 X n画素として、図 43に示すような（k Zm)画素 X (lZn)画素の大きさを持つラベル画像の記憶領域を用意する。そして、透かしシンボルの大きさを 1ブロックとして、文書画像を、 (k/m) X (lZn)個のブロックに分割する (ステップ 3801)。このラベル画像に、カテゴリーを示すラベル値が書き込まれていく。 As shown in FIG. 42, the watermark category extraction unit 3705 first sets the size of the document image to k pixels XI pixels and the size of the watermark symbol to m pixels Xn pixels as shown in FIG. 43 ( A storage area for a label image having a size of (k Zm) pixel X (lZn) pixel is prepared. Then, the size of the watermark symbol is set to 1 block, and the document image is divided into (k / m) X (lZn) blocks (step 3801). The label value indicating the category is written into this label image.

[0084] 次に、得られたブロック毎に文書画像をラスタ走査し、各ブロックに透力しシンボルが埋め込み可能力否かをチェックする (ステップ 3802)。具体的には、判定用シンポルと埋め込み対象ブロックの情報を、透かし埋め込み可否領域判定部 3704に渡して、埋め込み可否の判定を依頼し、判定結果を受け取る。判定用シンボルとしては、例えば、先頭シンボルと全種類の透かしシンボルの空白領域の論理和（OR)を、空白領域として持つようなシンボルが用いられる。 Next, the document image is raster-scanned for each obtained block, and it is checked whether or not the symbol is embeddable by transmitting through each block (step 3802). Specifically, the determination symbol and embedding target block information are passed to the watermark embedding availability determination unit 3704 to request the determination of embedding and receive the determination result. As the determination symbol, for example, a symbol having, as a blank area, a logical sum (OR) of blank areas of the head symbol and all types of watermark symbols is used.

[0085] 次に、透かし埋め込み可否領域判定部 3704から受け取った判定結果をチェックする（ステップ 3803)。判定結果が埋め込み可であれば、図 43のラベル画像の対応する画素にラベル値" 0"を書き込み (ステップ 3806)、埋め込み不可であれば、ラベル値" 1"を書き込む（ステップ 3804)。 Next, the determination result received from the watermark embedding availability determining unit 3704 is checked (step 3803). If the judgment result is embeddable, the label image shown in Fig. 43 The label value “0” is written to the pixel to be printed (step 3806), and if embedding is impossible, the label value “1” is written (step 3804).

[0086] 次に、すべてのブロックの走査を行ったか否かをチェックし (ステップ 3805)、未処理のブロックがあれば、ステップ 3802以降の処理を繰り返す。こうして、例えば、図 4Next, it is checked whether or not all blocks have been scanned (step 3805). If there is an unprocessed block, the processing from step 3802 is repeated. Thus, for example, Figure 4

3に示したようなラベル画像が得られる。 A label image as shown in Fig. 3 is obtained.

[0087] 次に、図 39の処理に移行し、カテゴリー 1の領域を抽出する。ここでは、図 44に示すように、それぞれ S個のシンボル力もなる P種類の透かしパケットが、文書画像に埋め込まれるものとする。 S個のシンボルのうち、先頭の H個のシンボルがヘッダ部に対応する。この例では、 P = 3、 S = 6、 H = 3である。 Next, the processing shifts to the processing in FIG. 39, and the category 1 region is extracted. Here, as shown in FIG. 44, it is assumed that P types of watermark packets each having S symbol powers are embedded in the document image. Of the S symbols, the first H symbols correspond to the header. In this example, P = 3, S = 6, H = 3.

[0088] また、図 45に示すように、 P画素 X S画素の大きさの検査配列が用意される。この検查配列の各要素には、すべて" 1"が書き込まれている。ラベル画像においてラベル値が" 0"の領域のうち、検査配列がすっぽり収まる領域力カテゴリー 1の領域に決定される。 Further, as shown in FIG. 45, an inspection array having a size of P pixel X S pixel is prepared. "1" is written in each element of this verification array. Of the areas where the label value is “0” in the label image, the area is determined as the area of category 1 where the test arrangement fits perfectly.

[0089] 透かしカテゴリー抽出部 3705は、まず、ラベル画像をラスタ走査し、着目画素を起点として、検査配列とラベル画像の AND演算を行う（ステップ 3901)。そして、得られた画像において、検査配列の範囲内にラベル値" 1"が存在するか否かをチェックする (ステップ 3902)。検査配列の範囲内に" 1"が存在すれば、着目画素の位置を隣の画素にシフトして（ステップ 3906)、ステップ 3901以降の処理を繰り返す。 The watermark category extraction unit 3705 first raster-scans the label image, and performs an AND operation on the inspection array and the label image starting from the pixel of interest (step 3901). Then, in the obtained image, it is checked whether or not the label value “1” exists within the range of the inspection array (step 3902). If “1” exists in the range of the inspection array, the position of the pixel of interest is shifted to the adjacent pixel (step 3906), and the processing after step 3901 is repeated.

[0090] ステップ 3902において、検査配列の範囲内に" 1"が存在しなければ、検査配列の範囲内のラベル値をすベて" 2"に書き換える（ステップ 3903)。そして、着目画素の位置を検査配列の横幅に相当する S画素分移動させる (ステップ 3904)。これにより、次の検査は、 S画素分移動した点を起点として行われる。 In step 3902, if “1” does not exist within the range of the check array, all label values within the range of the check array are rewritten to “2” (step 3903). Then, the position of the pixel of interest is moved by S pixels corresponding to the horizontal width of the inspection array (step 3904). Thus, the next inspection is performed starting from the point moved by S pixels.

[0091] 次に、全画素を走査したか否かをチェックし (ステップ 3905)、未処理の画素があれば、ステップ 3901以降の処理を繰り返す。 Next, it is checked whether or not all the pixels have been scanned (step 3905). If there is an unprocessed pixel, the processing from step 3901 is repeated.

例えば、図 46に示すように、ラベル画像の左上の画素 4601を着目画素として AN D演算を行うと、演算結果にはラベル値" 1"が含まれているので、着目画素の位置が右隣の画素にシフトする。 For example, as shown in FIG. 46, when the ND calculation is performed with the pixel 4601 at the upper left of the label image as the target pixel, the calculation result includes the label value “1”, so the position of the target pixel is adjacent to the right Shift to the next pixel.

[0092] その後、図 47に示すように、ラベル画像の 3行 1列の画素 4701を着目画素として A ND演算が行われると、演算結果にはラベル値" 1"が含まれていないので、検査配列の範囲内のラベル値が" 2"に書き換えられる。その結果、着目画素の位置は、右へ S画素分離れた画素 4702に移動する。 [0092] Then, as shown in FIG. 47, the pixel 4701 in the 3rd row and the 1st column of the label image is set as the target pixel A. When the ND operation is performed, the label value “1” is not included in the operation result, so the label value in the inspection array range is rewritten to “2”. As a result, the position of the target pixel moves to the pixel 4702 separated by S pixels to the right.

[0093] こうして、ラベル画像が更新され、図 48に示すような結果が得られる。図 48において、ラベル値" 2"が書き込まれた領域力 P個の透かしパケットをすベて埋め込める力テゴリー 1の領域となる。 In this way, the label image is updated, and a result as shown in FIG. 48 is obtained. In FIG. 48, the area power field in which the label value “2” is written is the field of power category 1 that can embed all P watermark packets.

[0094] 次に、図 40の処理に移行し、カテゴリー 2の領域を抽出する。ここでは、更新されたラベル画像と、図 49に示すような S画素 X 1画素の大きさの検査配列が用いられる。この検査配列の各要素には、あら力じめ" 1"が書き込まれている。ラベル画像においてラベル値が" 0"の領域のうち、検査配列がすっぽり収まる領域力カテゴリー 2の領域に決定される。 Next, the process proceeds to the processing of FIG. 40, and the category 2 region is extracted. Here, the updated label image and a test array of S pixel X 1 pixel size as shown in FIG. 49 are used. In each element of this check array, a brilliant “1” is written. Of the areas where the label value is "0" in the label image, the area force is determined to be the area of category 2 where the test arrangement fits completely.

[0095] 透かしカテゴリー抽出部 3705は、まず、ラベル画像をラスタ走査し、着目画素のラベル値をチェックする（ステップ 4001)。ラベル値カ '2"であれば、着目画素の位置を S画素分シフトし (ステップ 4007)、ステップ 4001以降の処理を繰り返す。 The watermark category extraction unit 3705 first raster-scans the label image and checks the label value of the pixel of interest (step 4001). If the label value is “2”, the position of the pixel of interest is shifted by S pixels (step 4007), and the processing after step 4001 is repeated.

[0096] ラベル値力 ' 2"でなければ、着目画素を起点として、ラベル画像と検査配列の AN D演算を行い (ステップ 4002)、検査配列の範囲内にラベル値" 1"が存在するか否かをチヱックする（ステップ 4003)。検査配列の範囲内に" 1"が存在すれば、着目画素の位置を隣の画素にシフトして (ステップ 4008)、ステップ 4001以降の処理を繰り返す。 [0096] If the label value force is not '2', the NAND operation is performed on the label image and the test array from the target pixel (step 4002), and the label value "1" exists within the test array range. (Step 4003) If “1” exists in the range of the inspection array, the position of the target pixel is shifted to the next pixel (Step 4008), and the processing after Step 4001 is repeated. .

[0097] ステップ 4003において、検査配列の範囲内に" 1"が存在しなければ、検査配列の範囲内のラベル値をすベて" 3"に書き換え (ステップ 4004)、着目画素の位置を S画素分移動させる (ステップ 4005)。 [0097] In step 4003, if "1" does not exist within the inspection array range, all label values within the inspection array range are rewritten to "3" (step 4004), and the position of the pixel of interest is set to S. Move the pixel part (step 4005).

[0098] 次に、全画素を走査したか否かをチェックし (ステップ 4006)、未処理の画素があれば、ステップ 4001以降の処理を繰り返す。 Next, it is checked whether or not all the pixels have been scanned (step 4006). If there is an unprocessed pixel, the processing from step 4001 is repeated.

例えば、図 50に示すように、ラベル画像の左上の画素 5001を着目画素として AN For example, as shown in FIG. 50, the upper left pixel 5001 of the label image is used as the target pixel.

D演算を行うと、演算結果にはラベル値" 1"が含まれているので、着目画素の位置が右隣の画素にシフトする。 When D calculation is performed, the label value “1” is included in the calculation result, so the position of the pixel of interest shifts to the pixel on the right.

[0099] その後、図 51に示すように、ラベル画像の 6行 1列の画素 5101を着目画素として A ND演算が行われると、演算結果にはラベル値" 1"が含まれていないので、検査配列の範囲内のラベル値が" 3"に書き換えられる。その結果、着目画素の位置は、右へ S画素分離れた画素 5102に移動する。 [0099] After that, as shown in FIG. 51, the pixel 5101 in the 6th row and the 1st column of the label image is set as the target pixel A. When the ND operation is performed, the label value “1” is not included in the operation result, so the label value in the inspection array range is rewritten to “3”. As a result, the position of the pixel of interest moves to the pixel 5102 separated by S pixels to the right.

[0100] こうして、ラベル画像が更新され、図 52に示すような結果が得られる。図 52において、ラベル値" 3"が書き込まれた領域力 1つの透かしパケットを埋め込めるカテゴリ一 2の領域となる。 [0100] In this way, the label image is updated, and a result as shown in FIG. 52 is obtained. In FIG. 52, the area power in which the label value “3” is written is the area of category 1 in which one watermark packet can be embedded.

[0101] 次に、図 41の処理に移行し、カテゴリー 3の領域を抽出する。ここでは、更新されたラベル画像と、図 53に示すような H画素 X 1画素の大きさの検査配列が用いられる。この検査配列の各要素には、あら力じめ" 1"が書き込まれている。ラベル画像においてラベル値が" 0"の領域のうち、検査配列がすっぽり収まる領域力カテゴリー 3の領域に決定される。 Next, the processing shifts to the processing in FIG. 41, and the category 3 region is extracted. Here, the updated label image and a test array having a size of H pixel X 1 pixel as shown in FIG. 53 are used. In each element of this check array, a brilliant “1” is written. Of the areas where the label value is “0” in the label image, the area is determined as the area of category 3 where the test arrangement fits completely.

[0102] 透かしカテゴリー抽出部 3705は、まず、ラベル画像をラスタ走査し、着目画素のラベル値をチェックする（ステップ 4101)。ラベル値カ '2"または" 3"であれば、着目画素の位置を H画素分シフトし (ステップ 4107)、ステップ 4101以降の処理を繰り返す [0102] The watermark category extraction unit 3705 first raster-scans the label image to check the label value of the pixel of interest (step 4101). If the label value is '2' or '3', the position of the pixel of interest is shifted by H pixels (step 4107) and the processing from step 4101 is repeated.

[0103] ラベル値が" 2"または" 3"でなければ、着目画素を起点として、ラベル画像と検査配列の AND演算を行い（ステップ 4102)、検査配列の範囲内にラベル値" 1"が存在するか否かをチェックする（ステップ 4103)。検査配列の範囲内に" 1"が存在すれば、着目画素の位置を隣の画素にシフトして（ステップ 4108)、ステップ 4101以降の処理を繰り返す。 [0103] If the label value is not "2" or "3", an AND operation is performed on the label image and the test array starting from the target pixel (step 4102), and the label value "1" is within the range of the test array. It is checked whether or not it exists (step 4103). If “1” exists within the range of the inspection array, the position of the pixel of interest is shifted to the adjacent pixel (step 4108), and the processing after step 4101 is repeated.

[0104] ステップ 4103において、検査配列の範囲内に" 1"が存在しなければ、着目画素以降の連続するラベル値" 0"をすベて" 4"に書き換える (ステップ 4104)。これにより、 H画素〜（S— 1)画素の領域のラベル値が書き換えられる。そして、着目画素の位置を、書き換えられた領域の幅分移動させる (ステップ 4105)。 In step 4103, if “1” does not exist within the range of the inspection array, all the continuous label values “0” after the pixel of interest are rewritten to “4” (step 4104). As a result, the label value in the region from H pixel to (S-1) pixel is rewritten. Then, the position of the pixel of interest is moved by the width of the rewritten area (step 4105).

[0105] 次に、全画素を走査したか否かをチェックし (ステップ 4106)、未処理の画素があれば、ステップ 4101以降の処理を繰り返す。 Next, it is checked whether or not all pixels have been scanned (step 4106). If there are unprocessed pixels, the processing from step 4101 is repeated.

例えば、図 54に示すように、ラベル画像の左上の画素 5401を着目画素として AN D演算を行うと、演算結果にはラベル値" 1"が含まれているので、着目画素の位置が右隣の画素にシフトする。 For example, as shown in FIG. 54, when the ND calculation is performed with the pixel 5401 at the upper left of the label image as the target pixel, the calculation result includes the label value “1”. Shift to the next pixel to the right.

[0106] その後、図 55に示すように、ラベル画像の 1行 5列の画素 5501を着目画素として A ND演算が行われると、演算結果にはラベル値" 1"が含まれていないので、画素 550 1から 4画素分の領域のラベル値が" 4"に書き換えられる。その結果、着目画素の位置は、 4画素分離れた画素 5502に移動する。 Thereafter, as shown in FIG. 55, when the AND calculation is performed using the pixel 5501 in the first row and the fifth column of the label image as the pixel of interest, the calculation result does not include the label value “1”. The label value of the area from pixel 550 1 to 4 pixels is rewritten to "4". As a result, the position of the target pixel moves to a pixel 5502 that is separated by four pixels.

[0107] こうして、ラベル画像が更新され、ラベル値" 4"が書き込まれた領域力少なくとも透かしパケットのヘッダ部を埋め込めるカテゴリー 3の領域となる。そして、更新されたラベル画像において、ラベル値" 0"または" 1"が残っている領域は、カテゴリー 4の領域となる。 In this way, the label image is updated, and the area force in which the label value “4” is written becomes at least a category 3 area in which the header portion of the watermark packet can be embedded. In the updated label image, the area where the label value “0” or “1” remains is the category 4 area.

[0108] 以上の処理を行うことで、文書画像がカテゴリー 1〜カテゴリー 4の領域に分類される。 [0108] By performing the above processing, the document image is classified into categories 1 to 4.

図 56は、透かしカテゴリー抽出部 3705により分類された、文書画像の領域を示している。この文書画像において、パターン 5601により示される 1ライン以上の領域は、カテゴリー 1に対応し、パターン 5602により示される 1ラインの領域は、カテゴリー 2 に対応し、パターン 5603により示される領域は、カテゴリー 3に対応する。そして、文字領域および残りの空白領域は、カテゴリー 4に対応する。 FIG. 56 shows a document image area classified by the watermark category extraction unit 3705. In this document image, an area of one or more lines indicated by pattern 5601 corresponds to category 1, an area of one line indicated by pattern 5602 corresponds to category 2, and an area indicated by pattern 5603 corresponds to category 3. Corresponding to The character area and the remaining blank area correspond to Category 4.

[2]透かし埋め込み部 [2] Watermark embedding part

透力し埋め込み部 3706は、透かしカテゴリー抽出部 3705により生成されたラベル画像に従って、それぞれの領域に次のようなシンボルを埋め込む。 The transparency embedding unit 3706 embeds the following symbols in each area according to the label image generated by the watermark category extraction unit 3705.

(1)カテゴリー 1の領域 (1) Category 1 area

全種類の透かしパケットを乱数等に従って配置する。 All types of watermark packets are arranged according to random numbers or the like.

(2)カテゴリー 2の領域 (2) Category 2 area

1つの透かしパケットを乱数等により選択し、選択した透かしパケットを埋め込む。 One watermark packet is selected by a random number or the like, and the selected watermark packet is embedded.

(3)カテゴリー 3の領域 (3) Category 3 area

1つの透かしパケットを乱数等により選択し、選択した透かしパケットの先頭から、埋め込み可能な最大数のシンボルを埋め込む。このとき、所定の確率で、先頭シンポル以外のシンボルの埋め込み順序を反転させ、反転したことを示すために、先頭シンボルを別の地紋パターンに置き換える。 [0109] 例えば、図 57に示すような透かしパケットの場合、図 58に示すように、先頭シンポノレ 5701力 Uのシンポノレ 5801に変更され、それに続くシンポノレ歹 IJ"A, B, C, D, E, F, G"が反転されて、 "G, F, E, D, C, B, A"に変更される。 One watermark packet is selected by random numbers, etc., and the maximum number of symbols that can be embedded is embedded from the beginning of the selected watermark packet. At this time, the embedding order of symbols other than the leading symbol is reversed with a predetermined probability, and the leading symbol is replaced with another tint block pattern to indicate that the symbol has been reversed. For example, in the case of a watermark packet as shown in FIG. 57, as shown in FIG. "E, F, G" is inverted and changed to "G, F, E, D, C, B, A".

[0110] すべての透かしパケットを反転させずに埋め込むと、図 59の文書画像 5901のように、埋め込まれた透かしパケットのシンボルのうち、後半部分の" E"、 "F"、 "G"はほとんど文書画像 5901中に埋め込まれない。これに対して、例えば、 1Z2の確率で図 58の反転透かしパケットを埋め込むことで、文書画像 5902のように、透かしパケットの前半部分と後半部分がバランスよく配分される。 [0110] When all the watermark packets are embedded without being inverted, as shown in the document image 5901 in Fig. 59, among the embedded watermark packet symbols, "E", "F", and "G" in the latter half are Mostly it is not embedded in the document image 5901. On the other hand, for example, by embedding the inverted watermark packet in FIG. 58 with a probability of 1Z2, the first half and the second half of the watermark packet are distributed in a well-balanced manner as in the document image 5902.

(4)カテゴリー 4の領域 (4) Category 4 area

透かしパケットの本体である透かしブロックを埋め込むことができな、ため、先頭シンボルを埋め込む力、または、透かしシンボルのいずれかをランダムに選択して埋め込む。あるいは、これらのシンボルとは異なるドットパターンを持つシンボルを埋め込んでもよい。 Since the watermark block that is the body of the watermark packet cannot be embedded, either the power to embed the leading symbol or the watermark symbol is selected at random and embedded. Or you may embed symbols with a different dot pattern from these symbols.

[0111] 図 60は、透かし埋め込み部 3706が行う埋め込み処理のフローチャートである。この処理では、図 61に示すように、文書画像と同じ大きさを持つ透力生成用画像 61 02が用意され、ラベル画像 6101を参照しながら、透かし生成用画像 6102にシンポルが埋め込まれていく。 FIG. 60 is a flowchart of the embedding process performed by the watermark embedding unit 3706. In this processing, as shown in FIG. 61, a transparency generating image 6102 having the same size as the document image is prepared, and a symbol is embedded in the watermark generating image 6102 while referring to the label image 6101. To go.

[0112] 透力し埋め込み部 3706は、まず、ラベル画像をラスタ走査し、ラベル画像の着目画素のラベル値をチェックする（ステップ 6001〜6004)。 The transparent embedding unit 3706 first raster-scans the label image and checks the label value of the pixel of interest in the label image (steps 6001 to 6004).

ラベル値が" 0"または" 1"であれば、透力し生成用画像 6102の対応するブロックに If the label value is "0" or "1", the transparency is applied to the corresponding block in the generation image 6102.

、先頭シンボルを埋め込む（ステップ 6007)。 The first symbol is embedded (step 6007).

[0113] ラベル値力 ' 2"であれば、全種類の透かしパケットが挿入可能なので、透かし生成用画像 6102の対応する領域に、 P種類の透かしパケットを並べて埋め込む (ステツプ 6008)。 [0113] If the label value power is '2', all types of watermark packets can be inserted, and therefore P types of watermark packets are embedded side by side in the corresponding region of the watermark generation image 6102 (step 6008).

[0114] ラベル値が" 3"であれば、乱数により 1つの透かしパケットを選択し、透かし生成用画像 6102の対応する領域に、その透かしパケットを埋め込む（ステップ 6009)。ラベル値が" 4"であれば、乱数により 1つの透かしパケットを選択し、その透かしパケットのシンボルの順序を 1Z2の確率で反転させる（ステップ 6010)。そして、透かし生成用画像 6102の対応する領域に、その透かしパケットのシンボルを埋め込めるだけ埋め込む（ステップ 6011)。 If the label value is “3”, one watermark packet is selected by a random number, and the watermark packet is embedded in the corresponding area of the watermark generation image 6102 (step 6009). If the label value is “4”, one watermark packet is selected by a random number, and the order of the symbols of the watermark packet is reversed with a probability of 1Z2 (step 6010). And watermark The watermark packet symbol is embedded only in the corresponding area of the generation image 6102 (step 6011).

[0115] 次に、全画素を走査したか否かをチェックし (ステップ 6005)、未処理の画素があれば、ステップ 6001以降の処理を繰り返す。全画素の走査を終了すると、透かし生成用画像 6102全体にシンボルが埋め込まれた画像 6103が得られる。 Next, it is checked whether or not all the pixels have been scanned (step 6005). If there are unprocessed pixels, the processing from step 6001 is repeated. When scanning of all pixels is completed, an image 6103 in which symbols are embedded in the entire watermark generation image 6102 is obtained.

[0116] その後、文書画像と得られた透力生成用画像を重ねることで、透かし入り文書画像を生成する (ステップ 6006)。ただし、両者の画素値が異なる場合には、文書画像の画素値が優先される。 [0116] After that, the document image and the obtained image for generating transparency are overlapped to generate a watermarked document image (step 6006). However, if the pixel values are different, the pixel value of the document image is given priority.

[0117] 次に、図 17または図 37の電子透力埋め込み装置により埋め込まれた情報を検出する、電子透かし検出装置の構成および動作について説明する。 Next, the configuration and operation of a digital watermark detection apparatus that detects information embedded by the electronic permeability embedding apparatus of FIG. 17 or FIG. 37 will be described.

図 62は、電子透かし検出装置の構成例を示している。この電子透かし検出装置は FIG. 62 shows a configuration example of the digital watermark detection apparatus. This digital watermark detection device

、透かし入り文書画像入力部 6201、先頭シンボル探索部 6202、透かし復元部 620, Watermarked document image input unit 6201, head symbol search unit 6202, watermark restoration unit 620

3、およびパケット切り出し部 6204を備える。以下では、各部の動作を具体的に説明する。 3 and a packet cutout unit 6204 are provided. In the following, the operation of each part will be described specifically.

[1]透かし入り文書画像入力部 [1] Watermarked document image input section

透かし入り文書画像入力部 6201は、透かしパケットが埋め込まれた文書画像を入力する。 A watermarked document image input unit 6201 inputs a document image in which a watermark packet is embedded.

[2]先頭シンボル探索部 [2] Lead symbol search part

先頭シンボル探索部 6202は、文書画像に埋め込まれて!/、る先頭シンボルを探索することで、透かしパケットの位置を検出する。 The leading symbol search unit 6202 detects the position of the watermark packet by searching for the leading symbol embedded in the document image.

[0118] 図 63は、先頭シンボル探索部 6202が行う探索処理のフローチャートである。先頭シンボル探索部 6202は、まず、文書画像をラスタ走査して、着目画素を含むシンポル画像の範囲を対象とするエロージョン処理を行!ヽ、シンボルの穴の位置を検出する（ステップ 6301)。 FIG. 63 is a flowchart of the search process performed by the head symbol search unit 6202. First, symbol search unit 6202 performs raster erosion on the document image, performs erosion processing on the range of the symbol image including the pixel of interest, and detects the position of the symbol hole (step 6301). .

[0119] 図 64に示すように、対象領域内に先頭シンボル 1901が含まれていれば、構造要素 2201を用いたエロージョン処理により、中心に穴を有する画像 6401が得られる。次に、検出された穴の位置をチェックする（ステップ 6302)。そして、シンボルの中心に穴があれば、そのシンボルを先頭シンボル 1901とみなして、着目画素の位置を先頭シンボル位置としてパケット切り出し部 6204に渡す (ステップ 6305)。 As shown in FIG. 64, if the first symbol 1901 is included in the target region, an image 6401 having a hole at the center is obtained by erosion processing using the structural element 2201. Next, the position of the detected hole is checked (step 6302). If there is a hole in the center of the symbol, the symbol is regarded as the first symbol 1901 and the position of the target pixel is determined. It is passed to the packet cutout unit 6204 as the head symbol position (step 6305).

[0120] パケット切り出し部 6204からパケット切り出しの終了通知を受け取ると、全画素を走查した力否かをチェックし (ステップ 6303)、未処理の画素があれば、着目画素の位置をシンボル画像のサイズ分（1シンボル分）離れた画素にシフトして (ステップ 6306[0120] Upon receiving a packet cut-out completion notification from the packet cut-out unit 6204, it is checked whether or not the power of all pixels has been crushed (step 6303). Shift to a pixel that is one size away (one symbol) (step 6306

)、ステップ 6301以降の処理を繰り返す。 ), Repeat the process after step 6301.

[0121] ステップ 6302において、シンボルの中心に穴がなければ、ステップ 6303以降の処理を行う。そして、全画素の走査を終了すると、探索が終了したことを透かし復元部 6[0121] If there is no hole at the center of the symbol in step 6302, the processing from step 6303 is performed. When the scanning of all pixels is finished, the watermark restoration unit 6 indicates that the search is finished.

203に通知する（ステップ 6304)。 203 is notified (step 6304).

[3]パケット切り出し部 [3] Packet clipping unit

パケット切り出し部 6204は、先頭シンボル探索部 6202から受け取った先頭シンポル位置に基づいて、透かしパケットの情報を抽出する。 The packet cutout unit 6204 extracts watermark packet information based on the start symbol position received from the start symbol search unit 6202.

[0122] 図 65は、パケット切り出し部 6204が行う切り出し処理のフローチャートである。パケット切り出し部 6204は、まず、先頭シンボル位置により示される着目画素の位置を 1 シンボル分シフトし、着目画素を含むシンボル画像の範囲を対象とするエロージョン処理を行う（ステップ 6501)。これにより、先頭シンボルの次のシンボルに対するエロ一ジョン処理が行われる。 FIG. 65 is a flowchart of the clipping process performed by the packet clipping unit 6204. The packet cutout unit 6204 first shifts the position of the target pixel indicated by the head symbol position by one symbol, and performs an erosion process on the range of the symbol image including the target pixel (step 6501). As a result, the erosion process for the next symbol after the first symbol is performed.

[0123] 図 64に示すように、対象領域内に透かしシンボル 1501が含まれていれば、構造要素 2201を用いたエロージョン処理により、右上部に穴を有する画像 6402が得られ、透かしシンボル 1502が含まれていれば、左下部に穴を有する画像 6403が得られる As shown in FIG. 64, if watermark symbol 1501 is included in the target area, an image 6402 having a hole in the upper right part is obtained by erosion processing using structural element 2201, and watermark symbol 1502 Is included, an image 6403 with a hole in the lower left is obtained.

[0124] 次に、得られた画像における穴の位置から、埋め込まれた透力しシンボルの値を識別する（ステップ 6502および 6503)。シンボルの右上部に穴があれば、そのシンポルを透かしシンボル 1501とみなして、メモリ内のシンボル位置に対応するアドレスに符号" 0"を書き込む (ステップ 6504)。一方、シンボルの左下部に穴があれば、そのシンボルを透かしシンボル 1502とみなして、メモリ内の対応するアドレスに符号" 1" を書き込む (ステップ 6505)。そして、着目画素の位置を 1シンボル分シフトして、ステップ 6501以降の処理を繰り返す。 Next, the value of the embedded transparent symbol is identified from the position of the hole in the obtained image (steps 6502 and 6503). If there is a hole in the upper right part of the symbol, the symbol is regarded as a watermark symbol 1501, and a code "0" is written at an address corresponding to the symbol position in the memory (step 6504). On the other hand, if there is a hole in the lower left part of the symbol, the symbol is regarded as a watermark symbol 1502, and a code "1" is written at the corresponding address in the memory (step 6505). Then, the position of the target pixel is shifted by one symbol, and the processing from step 6501 is repeated.

[0125] ステップ 6502および 6503において、シンボルの右上部および左下部のいずれにも穴がない場合、すなわち、先頭シンボル 1901のように別の位置に穴がある力、あるいは、シンボル内に穴がない場合は、メモリ内の対応するアドレスにエラー情報を記録する (ステップ 6506)。そして、パケット切り出しが終了したことを、先頭シンボル探索部 6202に通知する（ステップ 6507)。 [0125] In steps 6502 and 6503, either in the upper right or lower left of the symbol If there is no hole, that is, if there is a hole in another position, such as the first symbol 1901, or if there is no hole in the symbol, error information is recorded at the corresponding address in the memory ( Step 6506). Then, it notifies the head symbol search unit 6202 that the packet cut-out has been completed (step 6507).

[0126] こうして、文書画像中の 2つの先頭シンボルで挟まれた部分から、最大（S— 1)個の値が検出される。 In this way, the maximum (S-1) values are detected from the portion between the two leading symbols in the document image.

[4]透かし復元部 [4] Watermark restoration part

透かし復元部 6203は、先頭シンボル探索部 6202から探索終了を通知されると、シンボルの識別結果が記録されたメモリを参照して、埋め込まれた情報を復元する。 When the watermark restoration unit 6203 is notified of the end of the search from the head symbol search unit 6202, the watermark restoration unit 6203 refers to the memory in which the symbol identification result is recorded and restores the embedded information.

[0127] 図 66は、透かし復元部 6203が行う復元処理のフローチャートである。透かし復元部 6203は、まず、同じパケット ID毎に多数決をとつて、 P種類の透かしパケットを復元する（ステップ 6601)。 FIG. 66 is a flowchart of the restoration process performed by the watermark restoration unit 6203. The watermark restoration unit 6203 first takes a majority vote for each packet ID and restores P type watermark packets (step 6601).

[0128] このとき、（S— 1)個未満の値力もなるパケット切り出し結果については、透かしパケットの一部として多数決に使用される。ただし、パケット IDが完全に復元されていなければ、そのパケットは多数決の対象から除外される。 [0128] At this time, the packet cut-out result that also has a value less than (S-1) is used in the majority decision as part of the watermark packet. However, if the packet ID is not completely restored, the packet is excluded from the majority vote.

[0129] 次に、復元された P個の透かしパケットからそれぞれ透力しブロックを切り出し (ステップ 6602)、パケット IDにより指定される順序で結合して、透かし情報を復元する (ステツプ 6603)。そして、透かレ f青報を符号列に変換し (ステップ 6604)、埋め込まれた秘密情報を復元する (ステップ 6605)。 [0129] Next, transparent blocks are cut out from the restored P watermark packets (step 6602), combined in the order specified by the packet ID, and watermark information is restored (step 6603). ). Then, the transparent blue information is converted into a code string (step 6604), and the embedded secret information is restored (step 6605).

[0130] 例えば、図 67に示すように、多数決により 3種類の透力しパケット 1507〜1509力 S 復元された場合、それらの透かしパケットから透かしブロック 1504〜1506が切り出される。そして、それらの透力しブロックを結合することで、透力し情報 1503が組み立てられ、符号列" 1, 0, 1, 1, 1, 0, 0, 1, 1"に変換される。こうして、人の名前を示す秘密情報が復元される。 For example, as shown in FIG. 67, when three kinds of transparent packets 1507 to 1509 are restored by majority, watermark blocks 1504 to 1506 are cut out from these watermark packets. Then, by combining these permeable blocks, the permeable information 1503 is assembled and converted into a code string "1, 0, 1, 1, 1, 0, 0, 1, 1". In this way, secret information indicating the person's name is restored.

[0131] なお、図 59に示したように、通常の透かしパケットと反転透かしパケットが併用されている場合は、先頭シンボル探索部 6202力図 63のステップ 6302において、先頭シンボルの穴の位置により両者を判別し、判別結果をパケット切り出し部 6204に通知する。パケット切り出し部 6204は、反転透かしパケットの場合、先頭シンボルを除く (S— l)個の識別値の順序を反転させて、メモリに記録する。これにより、通常の透かしパケットと反転透かしパケットを合わせて、多数決をとることが可能になる。 [0131] As shown in Fig. 59, when the normal watermark packet and the inverted watermark packet are used together, the leading symbol search unit 6202 forces both in the step 6302 of Fig. 63 depending on the position of the hole of the leading symbol. And the discrimination result is notified to the packet cutout unit 6204. The packet cutout unit 6204 excludes the first symbol in the case of a reverse watermark packet. The order of (S-l) identification values is reversed and recorded in the memory. This makes it possible to take a majority vote by combining a normal watermark packet and a reverse watermark packet.

[0132] 図 17および図 37の電子透かし埋め込み装置と、図 62の電子透かし検出装置は、例えば、図 68に示すような情報処理装置 (コンピュータ）を用いて構成される。図 68 の情報処理装置は、 CPU (中央処理装置） 6801、メモリ 6802、入力装置 6803、出力装置 6804、外部記憶装置 6805、媒体駆動装置 6806、およびネットワーク接続装置 6807を備え、それらはノス 6808により互いに接続されている。 The digital watermark embedding device shown in FIGS. 17 and 37 and the digital watermark detection device shown in FIG. 62 are configured using an information processing device (computer) as shown in FIG. 68, for example. 68 includes a CPU (central processing unit) 6801, a memory 6802, an input device 6803, an output device 6804, an external storage device 6805, a medium drive device 6806, and a network connection device 6807, which are noses. 6808 are connected to each other.

[0133] メモリ 6802は、例えば、 ROM (read only memory)、 RAM (random access memory )等を含み、処理に用いられるプログラムおよびデータを格納する。 CPU6801は、メモリ 6802を利用してプログラムを実行することにより、上述した処理を行う。 [0133] The memory 6802 includes, for example, a ROM (read only memory), a RAM (random access memory), and the like, and stores programs and data used for processing. The CPU 6801 performs the above-described processing by executing a program using the memory 6802.

[0134] この場合、図 17の文書画像入力部 1701、透かし変換部 1702、透かしパケットィ匕部 1703、透かし埋め込み可否領域判定部 1704、および透かし埋め込み部 1705と、図 37の文書画像入力部 3701、透かし変換部 3702、透かしパケットィ匕部 3703、透かし埋め込み可否領域判定部 3704、透かしカテゴリー抽出部 3705、および透かし埋め込み部 3706と、透かし入り文書画像入力部 6201、先頭シンボル探索部 6202 、透かし復元部 6203、およびパケット切り出し部 6204は、メモリ 6802に格納されたプログラムに対応する。 In this case, the document image input unit 1701, the watermark conversion unit 1702, the watermark packetization unit 1703, the watermark embedding availability determination unit 1704, and the watermark embedding unit 1705 in FIG. 17, the document image input unit 3701 in FIG. Watermark conversion unit 3702, watermark packet input unit 3703, watermark embedding possibility determination unit 3704, watermark category extraction unit 3705, watermark embedding unit 3706, watermarked document image input unit 6201, head symbol search unit 6202, watermark restoration Unit 6203 and packet cutout unit 6204 correspond to programs stored in memory 6802.

[0135] 入力装置 6803は、例えば、キーボード、ポインティングデバイス等であり、オペレータカもの指示や情報の入力に用いられる。出力装置 6804は、例えば、ディスプレイ、プリンタ、スピーカ等であり、オペレータへの問い合わせや処理結果の出力に用いられる。 [0135] The input device 6803 is, for example, a keyboard, a pointing device, and the like, and is used for inputting instructions and information from the operator. The output device 6804 is, for example, a display, a printer, a speaker, and the like, and is used to output an inquiry to the operator and a processing result.

[0136] 外部記憶装置 6805は、例えば、磁気ディスク装置、光ディスク装置、光磁気デイスク装置、テープ装置等である。情報処理装置は、この外部記憶装置 6805に、プログラムおよびデータを格納しておき、必要に応じて、それらをメモリ 6802にロードして使用する。 [0136] The external storage device 6805 is, for example, a magnetic disk device, an optical disk device, a magneto-optical disk device, a tape device, or the like. The information processing apparatus stores programs and data in the external storage device 6805, and loads them into the memory 6802 and uses them as necessary.

[0137] 媒体駆動装置 6806は、可搬記録媒体 6809を駆動し、その記録内容にアクセスする。可搬記録媒体 6809は、メモリカード、フレキシブルディスク、光ディスク、光磁気ディスク等の任意のコンピュータ読み取り可能な記録媒体である。オペレータは、この可搬記録媒体 6809にプログラムおよびデータを格納しておき、必要に応じて、それらをメモリ 6802にロードして使用する。 [0137] The medium driving device 6806 drives the portable recording medium 6809 and accesses the recorded contents. The portable recording medium 6809 is an arbitrary computer-readable recording medium such as a memory card, a flexible disk, an optical disk, and a magneto-optical disk. The operator A program and data are stored in the portable recording medium 6809, and are loaded into the memory 6802 and used as necessary.

[0138] ネットワーク接続装置 6807は、 LAN (local area network)等の通信ネットワークに接続され、通信に伴うデータ変換を行う。また、情報処理装置は、必要に応じて、プログラムおよびデータを外部の装置力ネットワーク接続装置 6807を介して受け取り、それらをメモリ 6802にロードして使用する。 [0138] The network connection device 6807 is connected to a communication network such as a LAN (local area network) and performs data conversion accompanying communication. Further, the information processing apparatus receives programs and data as needed via the external apparatus power network connection apparatus 6807 and loads them into the memory 6802 for use.

[0139] 図 69は、図 68の情報処理装置にプログラムおよびデータを提供する方法を示して V、る。可搬記録媒体 6809やサーバ 6901のデータベース 6911に格納されたプログラムおよびデータは、情報処理装置 6902のメモリ 6802にロードされる。サーバ 690 1は、そのプログラムおよびデータを搬送する搬送信号を生成し、通信ネットワーク上の任意の伝送媒体を介して情報処理装置 6902に送信する。 CPU6801は、そのデータを用いてそのプログラムを実行し、上述した処理を行う。 FIG. 69 shows a method for providing a program and data to the information processing apparatus of FIG. Programs and data stored in the portable recording medium 6809 and the database 6911 of the server 6901 are loaded into the memory 6802 of the information processing apparatus 6902. The server 6901 generates a carrier signal for carrying the program and data, and transmits the carrier signal to the information processing device 6902 via an arbitrary transmission medium on the communication network. The CPU 6801 executes the program using the data and performs the above-described processing.

産業上の利用可能性 Industrial applicability

[0140] 本発明によれば、文字、図、写真等が多く含まれる文書に対しても、地紋透かしを埋め込むことが可能になる。また、透力を埋め込まれた印刷文書を数世代コピーしても、あるいは、紙片し力残っていない印刷物であっても、そこ力も埋め込まれた透かしを安定して検出することができる。 [0140] According to the present invention, it is possible to embed a copy-forgery-inhibited pattern watermark even in a document including a large number of characters, figures, photographs, and the like. In addition, even if a printed document with embedded transparency is copied for several generations, or even a printed matter that does not have the paper remaining force, the watermark with embedded force can be detected stably.

Claims

The scope of the claims

[1] A computer program for embedding information in a background area of a document image using a dot pattern,

A plurality of watermark symbols expressed by different dot patterns are assigned to the embedded information, and the embedded information is converted into a watermark symbol sequence,

Embedding a watermark symbol selected from the column of transparency symbols in an embedding target area in the document image and checking whether the embedded watermark symbol is detected;

When the embedded watermark symbol is detected, it is determined that the embedding target area can be embedded, and when the embedded transparency symbol is not detected, it is determined that the embedding target area cannot be embedded,

A watermarked document image is generated by embedding the selected watermark symbol in an embedding target area determined to be embedding.

A program for causing a computer to execute processing.

[2] In the embedding target area determined as non-embeddable, a symbol represented by a dot pattern different from the plurality of transparent symbols, or a transparent card selected at random from the plurality of watermark symbols. 2. The program according to claim 1, which causes the computer to execute a process of embedding a fraud symbol.

[3] When the selected permeability symbol is embedded in the embedding target area, it is checked whether or not the position of the blank area included in the embedded permeability symbol is detected. 3. The program according to claim 1, wherein the computer is caused to execute a process of checking whether or not an embedded transparency symbol is detected.

[4] The watermark symbol sequence is divided into a plurality of watermark blocks, and a head symbol indicating the head and the order of each transparent block in the plurality of transparent blocks are indicated at the beginning of each watermark block. A watermark packet is generated by adding one or more symbols, and the symbol selected from the watermark packet is embedded in the embedding target area determined to be embedding, and generated from the plurality of watermark blocks. Before multiple watermark packets The program according to claim 1 or 2, wherein the computer is caused to execute processing for embedding in a document image.

[5] Classifying a plurality of embedding target areas determined to be embedding into a plurality of categories, and embedding a symbol selected from the watermark packet in the embedding target area by a different embedding method for each category, 5. The program according to claim 4, characterized in that the program is executed.

[6] Cause the computer to execute a process of embedding the plurality of watermark packets in an area composed of a plurality of embedding target areas classified into a category indicating that all of the plurality of watermark packets can be inserted. The program according to claim 5, wherein:

[7] Processing to embed one watermark packet selected from the plurality of watermark packets in an area composed of a plurality of embedding target areas classified into a category indicating that one watermark packet can be inserted 6. The program according to claim 5, wherein the program is executed by the computer.

[8] A part of one watermark packet selected from the plurality of watermark packets in an area composed of a plurality of embedding target areas classified into a category indicating that a part of one watermark packet can be inserted. 6. The program according to claim 5, wherein the computer is caused to execute a process of embedding.

[9] A process of reversing the order of symbols included in the selected one watermark packet with a predetermined probability and embedding a part of the obtained watermark packet in an area having the plurality of embedding target area powers, The program according to claim 8, wherein the program is executed by the computer.

[10] The plurality of categories one includes a first category indicating that all of the plurality of watermark packets can be inserted, and a second force category indicating that one watermark packet can be inserted, and A third category indicating that a part of one watermark packet can be inserted, and a fourth category other than the first, second, and third categories. Processing to embed a symbol represented by a dot pattern different from the plurality of watermark symbols or a watermark symbol randomly selected from the plurality of watermark symbols into the embedding target area classified into the four categories The above 6. The program according to claim 5, wherein the program is executed by a computer.

[11] A transparency management packet including a plurality of symbols indicating the number of the plurality of watermark blocks and the size of each watermark block is generated, and the watermark management packet is selected from the watermark management packets determined to be embedded. 5. The program according to claim 4, wherein the computer is caused to execute a process of embedding a symbol that has been processed.

[12] A computer program for embedding information in a background area of a document image using a dot pattern,

The watermark symbol sequence is divided into a plurality of watermark blocks, and at the beginning of each watermark block, a leading symbol indicating the head and one or more indicating the order of each transparent block in the plurality of transparent blocks To generate a watermark packet,

Embeds multiple watermark packets with multiple transparency and block power in the document image to generate a watermarked document image

A program for causing a computer to execute processing.

[13] A transparency management packet including a plurality of symbols indicating the number of the plurality of watermark blocks and the size of each watermark block is generated, and is selected from the watermark management packet as an embedding target area determined to be embeddable. 13. The program according to claim 12, which causes the computer to execute a process of embedding a symbol that has been processed.

[14] A digital watermark embedding device that embeds information in a background area of a document image using a dot pattern,

A document image input unit for inputting the document image;

A watermark converting unit that assigns a plurality of watermark symbols each represented by a different dot pattern to the embedded information, and converts the embedded information into a watermark symbol sequence;

A watermark symbol is embedded in an area in the document image, and it is checked whether or not the embedded watermark symbol is detected. If the transparent symbol is detected, it is determined that the area can be embedded. If the transparent symbol is not detected, the region is embedded A watermark embedding availability determining unit that determines that it is impossible;

The position information of the embedding target area in the document image and the information of the transparency symbol selected from the watermark symbol column are passed to the watermark embedding possibility determination section, the determination result is received, and it is determined that embedding is possible An electronic transparency embedding device comprising: a transparency embedding unit that embeds the selected transparency symbol in an embedding target region to generate a watermarked document image.

[15] The watermark embedding unit may embed a symbol represented by a dot pattern different from the plurality of transparent symbols in the embedding target area determined to be non-embeddable, or the medium power of the plurality of watermark symbols. 15. The digital watermark embedding apparatus according to claim 14, wherein the selected watermark symbol is embedded in.

[16] The watermark embedding possibility determination unit determines whether or not the position of the blank area included in the embedded watermark symbol is detected when the selected transparent symbol is embedded in the embedding target area. 16. The digital watermark embedding apparatus according to claim 14, wherein it is checked whether or not the embedded transparency symbol is detected by checking.

[17] The watermark symbol sequence is divided into a plurality of watermark blocks, and a head symbol indicating the head and the order of each transparent block in the plurality of transparent blocks are indicated at the beginning of each watermark block. A watermark packetizing unit that adds one or more symbols to generate a watermark packet is further included, and the transparency embedding unit adds the watermark packet to the embedding target area determined to be embedding from the watermark packet. 16. The digital watermark embedding apparatus according to claim 14 or 15, wherein a plurality of watermark packets in which the plurality of permeability block forces are also embedded are embedded in the document image by embedding the selected symbol.

[18] The method further comprises a watermark category extraction unit that classifies a plurality of embedding target areas determined to be embedding into a plurality of categories, and the watermark embedding unit is selected from the watermark packet by an embedding method different for each category. 18. The electronic permeability embedding apparatus according to claim 17, wherein the embedded symbol is embedded in the embedding target area.