
WO2023281995A1 - Personal information masking method, and personal information masking device - Google Patents

Personal information masking method, and personal information masking device

Info

Publication number
WO2023281995A1
WO2023281995A1 (PCT/JP2022/023832; JP2022023832W)
Authority
WO
WIPO (PCT)
Prior art keywords
personal information
area
color image
image
exist
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2022/023832
Other languages
French (fr)
Japanese (ja)
Inventor
久美生 大橋
義樹 山田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BITS Co Ltd
Original Assignee
BITS Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BITS Co Ltd filed Critical BITS Co Ltd
Publication of WO2023281995A1
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/18Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Definitions

  • the present invention relates to a personal information masking method and a personal information masking apparatus for masking the portions of an image in which personal information that identifies an individual is captured.
  • in this specification, personal information means information consisting of letters and numbers, such as a name, address, date of birth, gender, or car registration number.
  • such personal information is often captured in images. It may infringe on the privacy of individuals and poses the problem of being used for crimes.
  • Patent Document 1 discloses a technical idea in which, after an image is displayed on a display, the user's determination of whether the image contains personal information that the user does not want to share is accepted from the viewpoint of protecting personal information, and, based on that determination, a partial area of the image is masked so that the personal information cannot be recognized. As a method for detecting personal information consisting of letters or numbers in an image, OCR (Optical Character Recognition) using pattern matching technology is used, as disclosed in Patent Document 2.
  • in Patent Document 1, however, it is left to the user to decide whether personal information exists, which places a significant burden on the user when the image contains a large amount of personal information.
  • with the OCR described in Patent Document 2, recognition of characters or numbers is difficult depending on resolution, vertical and horizontal arrangement, handwriting, and broken characters; in particular, small characters or numbers included in the image background are almost unrecognizable.
  • the present invention provides a personal information masking apparatus and method that erase personal information that appears to be characters or numbers from a color image, and that output an image that looks natural as a color image even after the personal information has been erased.
  • a personal information masking method comprises a step of obtaining a color image, a size conversion step of converting the obtained color image into a first color image of a predetermined size, and a step of converting the first color image into a binary image. The masking method further comprises an area detection step of detecting, in the binary image, areas in which personal information may exist, a determination step of determining whether personal information exists in an area in which personal information is likely to exist, an erasing step of drawing the pixels surrounding the personal information into the personal information of the first color image, based on the personal information determined in the determination step and the first color image, thereby erasing the personal information, and a step of outputting the color image after the erasing step.
  • the size conversion step may also convert the obtained color image into a first grayscale image of a predetermined size, and the area detection step may detect areas in which personal information may exist in the first grayscale image.
  • a personal information masking method according to another embodiment includes a step of obtaining a color image, a size conversion step of converting the obtained color image into a first color image and a first grayscale image of a predetermined size, and an area detection step of detecting, in the first grayscale image, areas in which personal information may exist. The masking method further comprises a determination step of determining whether personal information exists in an area in which personal information is likely to exist, an erasing step of drawing the pixels surrounding the personal information into the personal information of the first color image, based on the personal information determined in the determination step and the first color image, thereby erasing the personal information, and a step of outputting the color image after the erasing step.
  • the masking method may further include a step of detecting a protected area of a person in the first color image, a step of storing the protected area when the protected area is detected, and an overwriting step of overwriting the protected area onto the first color image after the erasing step.
  • the masking method may further comprise a step of removing noise from the first color image, a step of applying a color space conversion to the first color image, and a conversion step of converting the first color image into a second grayscale image after the noise removal step and the color space conversion step.
  • the masking method may further comprise a step of applying local histogram equalization to the second grayscale image and converting it into a third grayscale image, and the area detection step may detect areas in which personal information may exist in the second grayscale image or the third grayscale image.
  • the determination step preferably applies at least one of the following filters:
    (a) a filter that determines the presence or absence of personal information based on whether the logical product (overlap) of an area where personal information is likely to exist and the same area rotated by a predetermined angle is greater than a first threshold;
    (b) a filter that determines the presence or absence of personal information based on whether the standard deviation of the distance from the thin line forming the framework of an area where personal information is likely to exist to the boundary line of the area is greater than a second threshold;
    (c) a filter that determines the presence or absence of personal information based on whether the vertical or horizontal length of an area where personal information is likely to exist is greater than a third threshold relative to the predetermined size;
    (d) a filter that determines the presence or absence of personal information based on whether the ratio of the area of the thin line forming the framework of an area where personal information is likely to exist to the area of that area is greater than a fourth threshold.
  • a personal information detection step of detecting personal information in the first color image may also be provided.
  • after the overwriting step, a finishing step of applying an edge-preserving filter may also be provided.
  • the personal information masking apparatus includes an acquisition unit that acquires a first color image of a predetermined size, a binary conversion unit that converts the first color image into a binary image, an area detection unit that detects, in the binary image, areas in which personal information may exist, a determination unit that determines whether personal information exists in an area in which personal information is likely to exist, an erasing unit that draws the pixels surrounding the personal information into the personal information of the first color image, based on the personal information determined by the determination unit and the first color image, thereby erasing the personal information, and an output unit that outputs the color image after erasure.
  • the personal information masking device may further include a protected area detection unit that detects a protected area of a person in the first color image, and the protected area detection unit preferably has at least one of the following functions:
    i) a function of detecting a face area using deep learning;
    ii) a function of detecting a specific object using deep learning;
    iii) a function of detecting a skin area from the skin color in the first color image;
    iv) a function of setting a protection frame to be protected on the first color image and detecting the area within the protection frame.
  • personal information means characters (including multilingual characters and numbers, hereinafter referred to as "characters") captured in images (still images/videos).
  • the personal information is erased by drawing the peripheral pixels of the personal information in the personal information of the color image, so that the personal information is protected.
  • FIG. 1 is a block diagram of one embodiment of a personal information masking device.
  • FIG. 2 is a flowchart of a method using the personal information masking device.
  • FIG. 3(A) is a diagram explaining color space conversion by the color space conversion unit; FIG. 3(B) is an image diagram of contour detection and outer-frame detection by the area detection unit; FIG. 3(C) is a diagram explaining the mechanism for erasing personal information.
  • FIG. 4 is a flowchart showing the function of detecting a protected area.
  • FIG. 5 is a conceptual diagram of a user setting a protected area on the personal computer 23 or the portable information terminal 25.
  • FIG. 6 is a flowchart showing a method of determining personal information (whether or not it is a character).
  • FIG. 7(A) is a diagram showing a character or figure rotated and superimposed on the original character or figure; FIG. 7(B) is a diagram showing a character or figure shifted by a fixed distance, rotated, and superimposed on the original character or figure.
  • FIG. 8(A) is a diagram explaining the distance from the center line of a character or figure to its boundary line; FIG. 8(B) is an image diagram of the center line of a character or figure and of contour and outer-frame detection by the area detection unit.
  • FIG. 9 is an example of a color image acquired by the personal information masking device 100.
  • FIG. 10 is an example of an output color image (Example 1); FIG. 11 is an example of an output color image (Example 2); FIG. 12 is an example of an output color image (Example 3); FIG. 13 is an example of an output color image (Example 4).
  • a personal information masking apparatus detects characters, which are personal information, from color images such as everyday scenery captured by a camera/video camera.
  • the personal information masking apparatus does not use OCR, it is not necessary to register all characters to be recognized in advance, and multilingual personal information can be deleted.
  • the personal information masking device can delete fine characters or corrupted characters that cannot be detected by an OCR (optical character reader), such as fine personal information contained in the background.
  • FIG. 1 is a diagram showing an outline of a system in which a user H uploads images (still images and moving images) taken at a school, for example, to a personal information masking device 100 and downloads or receives, from the personal information masking device 100, images from which personal information has been deleted. Even if the user H uploads the image from which the personal information has been deleted to an SNS or the like, the personal information within the school included in the image has already been deleted, so the personal information is not leaked.
  • User H captures a still image of a predetermined place in the school with the digital camera 21, for example.
  • a still image captured by the camera 21 is recorded in the personal computer 23 .
  • User H can use the personal computer 23 to upload a still image to the personal information masking device 100 via a communication network NET such as the Internet.
  • the user H can take a moving image using the portable information terminal 25 with a camera function such as a tablet PC or a smartphone, and upload the moving image to the personal information masking device 100 via the communication network NET.
  • the user H may register a user name, password, credit card number, etc. in advance.
  • the personal information masking device 100 depicted in FIG. 1 shows a block configuration of one embodiment.
  • a cloud server or the like is suitable for the hardware configuration of the personal information masking device 100. Specifically, it has one or more processors, one or more memories, a communication interface, and the like.
  • a color image acquisition unit, an image size conversion unit, and the like, which will be described below, are composed of a processor, a memory, a communication interface, programs executed by the processor, and the like.
  • the personal information masking device 100 includes a color image acquisition unit 101 that acquires a color image (still image or moving image) from an external device such as the personal computer 23 or the mobile information terminal 25, and an image size converter that converts the size of the acquired image into a predetermined size.
  • the image size conversion unit 102 also has a function of converting a color image before removing color noise into a gray scale.
  • the personal information masking apparatus 100 further includes a grayscale conversion unit 106 that converts the noise-removed color image into a grayscale image, a local histogram flattening unit 107 that flattens the brightness of the grayscale image,
  • a binary conversion unit 108 that converts the brightness-flattened grayscale image into a binary image,
  • a noise removal unit (binary) 109 that removes black and white noise included in the binary image,
  • and an area detection unit 110 that detects areas in which personal information is likely to exist in the binary image or the grayscale image.
  • the area detection unit 110 also detects boundaries and outlines produced by line drawings, shadows of objects, noise, and the like.
  • the area detection unit 110 can detect fine characters that are difficult for the human eye to notice, and can be applied to characters of any language, not limited to a specific language. On the other hand, the area detection unit 110 also detects shadows of objects other than characters or noise as areas in which personal information is likely to exist.
  • the personal information masking apparatus 100 includes a personal information determination unit 111 that determines whether personal information exists in an area where personal information is likely to exist, a personal information erasing unit 112 that erases the determined personal information from the color image converted to the predetermined size,
  • a protected area overwriting unit 113 that overwrites the protected area, saved when a person's face, skin, a clock, or the like is detected by the protected area detection unit 103, onto the color image from which the personal information has been erased,
  • a finishing unit 114 that finishes the color image, and a color image output unit 115 that outputs the color image to an external device such as the personal computer 23 or the portable information terminal 25.
  • the personal information masking apparatus 100 may also include a personal information detection unit 116 such as an in-scene character detection program (EAST: An Efficient and Accurate Scene Text Detector) using a neural network.
  • the personal information masking device 100 may not only have the plurality of components described above physically at the same location, but may also be a collection of a plurality of servers. Also, a part of the plurality of configurations described above may be performed by the personal computer 23 or the portable information terminal 25 to which the application software is downloaded.
  • FIG. 2 is a flow chart of the personal information masking method from acquisition of a color image to output of a color image from which personal information has been erased.
  • the color image acquisition unit 101 acquires a color image from the personal computer 23 or the mobile information terminal 25 (step S21).
  • this color image includes HD (1280*720), FHD (1920*1088) and 6M wide (3264*1836) for still images, and HD (1280*720) and FHD (1920) for moving images. *1088) and 4K (3840*2160) images.
  • the image size conversion unit 102 converts color images of various sizes into images of a predetermined size (S22). For example, in this embodiment, the image size conversion unit 102 converts the image into an FHD size color image. The color image converted to the predetermined size is stored in a memory (not shown) or the like. The image size conversion unit 102 also has a function of converting a color image into grayscale.
  • the protected area detection unit 103 detects a protected area included in the color image of a predetermined size (S23). Also, when no protected area is detected, it is preferable to add a flag indicating that there is no protected area to the color image.
  • the protection area detection function will be described later.
  • the color space conversion unit 105 converts the color space so that the personal information buried in colors can be easily seen (S24).
  • Color image expression methods include the RGB method, which expresses color images using the three primary colors of light, and the HSV method, which expresses color images using hue, saturation, and value.
  • a color space conversion unit 105 converts a color image (RGB) of a predetermined size into a color image (HSV) and erases a dark background color to highlight personal information.
  • FIG. 3(A) shows a color photograph (A-1) with a name written in black on a red cap, a grayscale image of the color photograph (A-2), and an image with the saturation (S) set to zero (A-3).
  • when the color photograph (A-1), in which the personal information is drawn in black against a dark background color (especially red, blue, gray, etc.), is converted to grayscale or binary, the black personal information blends into the dark background color, which makes it difficult to detect (see A-2).
  • in the image (A-3) with the saturation (S) set to zero, by contrast, the personal information stands out clearly.
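As a concrete illustration of this step (not taken from the patent), the following minimal Python/OpenCV sketch converts a color image to HSV, zeroes the saturation channel, and converts back, so that dark characters on a strongly colored background remain visible; file names and parameter values are assumptions for illustration only.

```python
import cv2

# Load the color image (OpenCV uses BGR channel order).
img_bgr = cv2.imread("input.jpg")

# BGR -> HSV, zero the saturation channel, then back to BGR.
hsv = cv2.cvtColor(img_bgr, cv2.COLOR_BGR2HSV)
hsv[:, :, 1] = 0                      # S = 0: colored backgrounds become neutral gray
desaturated = cv2.cvtColor(hsv, cv2.COLOR_HSV2BGR)

cv2.imwrite("desaturated.jpg", desaturated)
```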
  • noise is removed from the color image by the noise removal unit (color) 104 (S25). Since the area detection unit 110 detects a line drawing, a shadow of an object, noise, or the like as an area in which personal information is likely to exist and the amount of calculation increases, it is preferable to remove noise contained in the color image as much as possible.
  • as an example of noise included in the color image, if the subject is a leather surface or the like, the area detection unit 110 detects the texture of the leather surface as the shadow of an object; roughness on the surface of an object is likewise processed as noise contained in the color image.
  • the noise removal unit (color) 104 of the present embodiment preferably applies edge-preserving smoothing filter processing (bilateral filter, mean shift filter, adaptive bilateral filter).
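For reference, a hedged sketch of such edge-preserving smoothing with OpenCV; the filter parameters are illustrative assumptions, not values given in the patent.

```python
import cv2

img = cv2.imread("resized_color.jpg")

# Bilateral filter: smooths texture and noise while preserving character edges.
smoothed = cv2.bilateralFilter(img, d=9, sigmaColor=75, sigmaSpace=75)

# Alternative: mean-shift filtering (spatial radius sp, color radius sr).
smoothed_ms = cv2.pyrMeanShiftFiltering(img, sp=10, sr=30)
```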
  • the grayscale conversion unit 106 converts the color space-converted image into a grayscale image (S26).
  • the local histogram flattening unit 107 flattens the grayscale image so that personal information buried in the dark or bright areas of the grayscale image can be detected more easily (S27). For example, if a grayscale image has a bright object on an overall dark background and the contrast of the whole image is flattened, the bright object may become pure white and the personal information present in the bright object may be lost. Therefore, applying local histogram equalization to the grayscale image makes it easier to detect personal information buried in dark and bright areas.
  • Local histogram equalization specifically includes adaptive histogram equalization, contrast-limited adaptive histogram equalization (CLAHE), multi-peak histogram equalization (MPHE), multi-objective beta-optimized bi-histogram equalization (MBOBHE), and the like.
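A minimal sketch of CLAHE with OpenCV, as one example of the local histogram equalization mentioned above; the clip limit and tile size are illustrative assumptions.

```python
import cv2

gray = cv2.imread("gray.png", cv2.IMREAD_GRAYSCALE)

# CLAHE equalizes contrast tile by tile, so text hidden in very dark or very
# bright regions becomes easier to detect than with global equalization.
clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
gray_eq = clahe.apply(gray)
```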
  • the binary conversion unit 108 converts the grayscale image whose brightness has been flattened into a binary image (S28). If the background of the image is dark and the personal information is drawn in black, binarization with a single fixed threshold gives the background and the personal information the same value, so the area of the personal information cannot be detected. Therefore, the binary conversion unit 108 preferably binarizes the grayscale image using the average value of the entire grayscale image (Otsu's binarization, the discriminant analysis method), or applies adaptive Gaussian thresholding or adaptive mean thresholding, which compute a threshold for each part of the grayscale image, so that dark black characters stand out.
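The following sketch shows Otsu's binarization and adaptive Gaussian thresholding in OpenCV; the block size and constant are illustrative assumptions.

```python
import cv2

gray = cv2.imread("gray_eq.png", cv2.IMREAD_GRAYSCALE)

# Otsu's method: one global threshold chosen by discriminant analysis.
_, bw_otsu = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)

# Adaptive thresholding: a per-neighborhood threshold, useful when the
# background brightness varies and dark characters sit on a dark background.
bw_adapt = cv2.adaptiveThreshold(gray, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
                                 cv2.THRESH_BINARY, blockSize=11, C=2)
```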
  • a noise removal unit (binary) 109 removes noise remaining in the binary image. Specifically, the noise removal unit (binary) 109 removes binary noise by repeating erosion, which removes pixels on the boundary of an object in the binary image, and dilation, which adds pixels to the boundary.
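A short sketch of this erosion/dilation noise removal using OpenCV morphology; the kernel size is an illustrative assumption.

```python
import cv2
import numpy as np

bw = cv2.imread("binary.png", cv2.IMREAD_GRAYSCALE)

kernel = np.ones((3, 3), np.uint8)
# Opening (erosion then dilation) removes small white specks;
# closing (dilation then erosion) fills small black holes.
bw = cv2.morphologyEx(bw, cv2.MORPH_OPEN, kernel)
bw = cv2.morphologyEx(bw, cv2.MORPH_CLOSE, kernel)
```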
  • the area detection unit 110 detects contours in which personal information is likely to exist from the binary image (S30). Such contours are found by detecting outlines such as lines in the binary image. In general, the area detection unit 110 scans the binary image for pixels of an object, determines whether each border is an outer boundary or a hole boundary, and by repeating the scan and the determination detects the outermost boundaries or contours.
  • FIG. 3B-1 is an image diagram in which a dashed-dotted line contour 50 is detected in a line drawing included in a binary image. More details are disclosed in the paper Topological structural analysis of digitized binary images by border following; by Satoshi Suzuki (Computer Vision, Graphics, and Image Processing Volume 30, Issue 1, April 1985, Pages 32-46).
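OpenCV's findContours implements the Suzuki border-following algorithm cited above; a minimal sketch of contour-based candidate detection follows (variable names are illustrative).

```python
import cv2

bw = cv2.imread("binary_clean.png", cv2.IMREAD_GRAYSCALE)

# Border following over the binary image yields the outermost boundaries.
contours, _ = cv2.findContours(bw, cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)

# Bounding boxes of the contours serve as candidate personal-information areas.
candidate_boxes = [cv2.boundingRect(c) for c in contours]
```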
  • the area detection unit 110 also detects outer frames in which personal information is likely to exist from the grayscale image (S31). Any of the grayscale images generated in steps S22, S26, or S27 may be used. Schematically, the area detection unit 110 performs a region division of the grayscale image in which areas having similar luminance values are grouped into one area; a group of areas with similar luminance values and a stable distribution is interpreted as an outer frame, and its representative points are derived as feature quantities.
  • FIG. 3B-2 is an image diagram in which the outer frame 51 of the two-dot chain line is detected in the line drawing included in the grayscale image.
  • 3B-2 is particularly different from FIG. 3B-1 in that the inside of "0" is also detected as an outer frame. More details are disclosed in the paper Efficient Maximally Stable Extremal Region (MSER) Tracking; by M. Donoser et al (2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)).
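As a hedged sketch, the outer-frame detection described here corresponds to MSER region extraction, which is available in OpenCV as follows (variable names are illustrative).

```python
import cv2

gray = cv2.imread("gray.png", cv2.IMREAD_GRAYSCALE)

# MSER groups pixels whose luminance values form stably distributed regions;
# such regions often enclose character-like outer frames.
mser = cv2.MSER_create()
regions, bboxes = mser.detectRegions(gray)   # bboxes: one (x, y, w, h) per region
```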
  • the personal information detection unit 116 detects personal information (characters) by applying an in-scene character detection program (EAST: An Efficient and Accurate Scene Text Detector) or the like to the color image of a predetermined size generated in step S22.
  • the personal information detection unit 116 can detect relatively large characters and the like included in the color screen, but it is difficult to detect small characters and the like included in the background.
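A hedged sketch of running the EAST detector via OpenCV's DNN module; the frozen model file and output layer names below are the commonly published ones for the pretrained EAST network and must be obtained separately, as they are not provided by the patent.

```python
import cv2

net = cv2.dnn.readNet("frozen_east_text_detection.pb")   # downloaded separately

img = cv2.imread("resized_color.jpg")
# EAST expects an input whose width and height are multiples of 32.
blob = cv2.dnn.blobFromImage(img, 1.0, (1280, 704),
                             (123.68, 116.78, 103.94), swapRB=True, crop=False)
net.setInput(blob)
scores, geometry = net.forward(["feature_fusion/Conv_7/Sigmoid",
                                "feature_fusion/concat_3"])
# `scores` gives a text confidence per grid cell and `geometry` encodes the
# corresponding rotated box; boxes are then decoded and filtered with NMS.
```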
  • the personal information judging unit 111 judges whether personal information, that is, letters or numbers, exists in the contour or outer-frame areas that were detected in steps S30 and S31 as likely to contain personal information (S33). The personal information judging unit 111 likewise judges whether the characters detected by the personal information detection unit 116 are personal information (S33). A method for determining personal information will be described later with reference to FIG. 6.
  • in this embodiment there are a route that determines whether personal information exists from the binary image (S29, S30, S33), a route that determines whether personal information exists from a grayscale image (S22 (or S26, S27), S31, S33), and a route that determines whether personal information exists from the color image (S22, S32, S33), and the personal information (characters) found in each route is stored in a memory (not shown). The personal information (characters) stored for each route is then integrated to obtain the personal information contained in the color image.
  • the present invention is not limited to this, and at least one route out of these three routes may be applied, or two routes out of the three routes may be selected and integrated.
  • FIG. 3(C) shows a color photograph (C-1) in which personal information (shown as a linear flaw in FIG. 3(C)) appears on a color image of a predetermined size,
  • data (C-2) in which the personal information determined in step S33 is drawn in white on a black background, and a color image (C-3) in which the personal information has been inpainted with surrounding pixels.
  • the personal information erasing unit 112 prepares the color image (C-1) of the predetermined size and an image (C-2) of the same size in which the personal information is drawn in white against a black background, and erases the personal information.
  • the personal information is erased (C-3) by gradually drawing in the personal information with surrounding pixels from the boundary toward the inside (Inpainting).
  • in step S34, it is preferable to devise ways to reduce the amount of calculation.
  • for example, the personal information erasing unit 112 converts the image of the predetermined size (e.g., FHD) into a reduced-size image (e.g., HD) and draws in (inpaints) the pixels surrounding the personal information based on the reduced-size color photograph (C-1) and the white mask data (C-2). The personal information erasing unit 112 thereby creates a reduced-size image (C-3) from which the personal information has been erased.
  • the reduced-size image (C-3) from which the personal information has been erased is then returned to the predetermined size, and in the color image (C-1) of the predetermined size, the regions at the positions of the white mask (C-2) are overwritten with the image (C-3) restored to the predetermined size.
  • inpainting methods include the FMM (Fast Marching Method), which fills the personal-information region with surrounding pixel values starting from its boundary and working inward.
  • another method draws in pixels by searching along edges from the pixel values in a known area around the personal information into the region of the personal information. More details are disclosed in the paper Navier-Stokes, fluid dynamics, and image and video inpainting; by Marcelo Bertalmio et al (Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition).
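Both inpainting variants are exposed by OpenCV; a minimal sketch (file names illustrative) is shown below.

```python
import cv2

img = cv2.imread("color_fhd.jpg")                          # (C-1)
mask = cv2.imread("text_mask.png", cv2.IMREAD_GRAYSCALE)   # (C-2): text pixels white

# Fast Marching Method (Telea): fill the masked text from the boundary inward.
restored_fmm = cv2.inpaint(img, mask, inpaintRadius=3, flags=cv2.INPAINT_TELEA)

# Navier-Stokes based method (Bertalmio et al.): propagate along edges/isophotes.
restored_ns = cv2.inpaint(img, mask, inpaintRadius=3, flags=cv2.INPAINT_NS)
```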
  • the protected area overwrite unit 113 overwrites the saved protected area in the color image from which the personal information has been deleted (S35).
  • the area detection unit 110 may detect, as areas in which personal information is likely to exist, such things as the temples of eyeglasses or the shadow of a nose, and the face in the output image would be disturbed if these were erased. Therefore, the face and other protected areas are protected by overwriting the saved protected area onto the color image from which the personal information has been deleted. If no protected area was detected in step S23, step S35 may be skipped based on the flag.
  • the finishing section 114 performs finishing processing on the color photograph in which the protected area has been overwritten (S36). Specifically, a pattern-preserving filter that suppresses smoothing of luminance values across boundaries (a non-local means filter) or Gaussian blurring is applied to the color image.
  • this makes the boundary with the overwritten protected area inconspicuous and renders illegible any minute personal information (characters) that could not be erased. There is no problem even if the finishing treatment is omitted.
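As an illustration of the finishing step, the sketch below applies OpenCV's non-local means filter, with a light Gaussian blur as the simpler alternative; the filter strengths are assumptions.

```python
import cv2

img = cv2.imread("after_overwrite.jpg")

# Non-local means: smooths residual artifacts while preserving edges and patterns,
# which makes the boundary of the overwritten protected area less conspicuous.
finished = cv2.fastNlMeansDenoisingColored(img, None, h=5, hColor=5,
                                            templateWindowSize=7,
                                            searchWindowSize=21)

# Simpler alternative: light Gaussian blurring.
blurred = cv2.GaussianBlur(img, (3, 3), 0)
```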
  • the color image output unit 115 outputs the finished color image to the personal computer 23 or the mobile information terminal 25 (S37).
  • the protected area detection unit 103 detects the face area from features of the human face, such as the positions of the eyes, nose, and mouth, based on the color image converted to the predetermined size, using an AI capable of deep learning or the like.
  • the face area is a protected area that does not contain personal information (characters).
  • the face area is saved in step S235 and overwritten in the color image from which the personal information has been erased as described in S35.
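One possible face-area detector is OpenCV's commonly distributed SSD face model; the sketch below is only an illustration of the idea, the two model files must be downloaded separately, and nothing here is prescribed by the patent.

```python
import cv2

img = cv2.imread("resized_color.jpg")
h, w = img.shape[:2]

net = cv2.dnn.readNetFromCaffe("deploy.prototxt",
                               "res10_300x300_ssd_iter_140000.caffemodel")
blob = cv2.dnn.blobFromImage(cv2.resize(img, (300, 300)), 1.0, (300, 300),
                             (104.0, 177.0, 123.0))
net.setInput(blob)
detections = net.forward()                 # shape: (1, 1, N, 7)

protected_faces = []
for i in range(detections.shape[2]):
    if detections[0, 0, i, 2] > 0.5:       # detection confidence
        x1, y1, x2, y2 = (detections[0, 0, i, 3:7] * [w, h, w, h]).astype(int)
        protected_faces.append((x1, y1, x2, y2))   # saved as protected face areas
```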
  • the protected area detection unit 103 uses AI or the like to label the objects included in the color image. For example, when a color image includes a clock, a calendar, or a table, the protected area detection unit 103 outputs a label such as "clock", "wall calendar", or "table". Clocks and calendars have time and date numbers (characters) drawn on them, but objects that contain characters yet do not contain personal information, such as clocks, are registered in advance. A clock or the like is therefore detected as a protected area, and the protected area of the clock or the like is saved in step S235.
  • in step S233, the protected area detection unit 103 detects whether the color image resized to the predetermined size contains skin color. This prevents, for example, the spaces (shadows) between fingers from being recognized as a single-line character (for example, the letter "I").
  • the skin area is a protected area that does not contain personal information, and the skin area is saved in step S235.
  • the protected area detection unit 103 may detect the color of the skin using, for example, color image display methods RGB, HSV, and YCbCr for a color image (RGB) of a predetermined size. More details are disclosed in the paper Human Skin Detection Using RGB, HSV and YCbCr Color Models; by S. Kolkur et al (ICCASP/ICMMD-2016. Published by Atlantic Press).
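A hedged sketch of skin-color detection by combining HSV and YCrCb thresholds; the numeric ranges are common heuristics chosen for illustration, not values taken from the patent or the cited paper.

```python
import cv2
import numpy as np

img = cv2.imread("resized_color.jpg")

hsv = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)
ycrcb = cv2.cvtColor(img, cv2.COLOR_BGR2YCrCb)

mask_hsv = cv2.inRange(hsv, np.array([0, 40, 60]), np.array([25, 180, 255]))
mask_ycrcb = cv2.inRange(ycrcb, np.array([0, 135, 85]), np.array([255, 180, 135]))

# Pixels accepted by both color models are treated as the protected skin area.
skin_mask = cv2.bitwise_and(mask_hsv, mask_ycrcb)
```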
  • in step S234, the protected area detection unit 103 displays a protection frame so that the user H can set a protected area on the screen of the personal computer 23 or the mobile information terminal 25.
  • FIG. 5 is an example of an uploaded color image 51 displayed on the screen of the mobile information terminal 25, and the color image 51 is part of a school classroom.
  • the classroom there are two desks and one shelf, the desk has a personal computer on which character information is displayed, and the shelf has a plurality of books and files.
  • the clock has a dial and the placard has the words "Congratulations on winning".
  • the protected area detection unit 103 causes a frame 57 to be displayed on the screen of the portable information terminal 25 of FIG. 5.
  • the user H changes the size and position of the frame 57, for example, sets a frame 57a around the placard and sets a frame 57b at the bottom of the shelf.
  • User H clicks the completion button 54 when the setting of the frame 57 is completed.
  • the area set by the frame 57a or 57b in this manner is detected as a protected area, and the protected area is saved in step S235.
  • the protected area is overwritten onto the color image from which the personal information has been erased. Note that even if the user H does not set a frame around the clock on the wall of the classroom in FIG. 5,
  • the clock is made a protected area when step S232 is executed.
  • although FIG. 5 has been described on the premise of a still image, the protection area can also be set for a moving image. For example, the frame area designated by the user H is held as a template image, and template matching processing is performed to find areas in the moving image that match the template image.
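A minimal sketch of this template-matching idea for video frames, with an illustrative matching threshold; file names are assumptions.

```python
import cv2

template = cv2.imread("protected_frame.png")   # frame area designated by user H
th, tw = template.shape[:2]
cap = cv2.VideoCapture("input.mp4")

while True:
    ok, frame = cap.read()
    if not ok:
        break
    # Find the region of this frame that best matches the template.
    result = cv2.matchTemplate(frame, template, cv2.TM_CCOEFF_NORMED)
    _, max_val, _, max_loc = cv2.minMaxLoc(result)
    if max_val > 0.8:                          # illustrative threshold
        x, y = max_loc
        protected_region = (x, y, tw, th)      # treat this region as protected
cap.release()
```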
  • the order of steps S231 to S234 may be changed, all of the protected areas may be detected, or only one of the steps may be executed, such as detecting only the face area.
  • alternatively, after the area detection unit 110 detects the areas of personal information, it may be checked whether the detected areas include a protected area such as a face area or a skin area.
  • the personal information determination unit 111 determines whether or not the area ratio between the area of the estimated character rotated and superimposed on the original estimated character and the area of the original estimated character is greater than a certain threshold k1 ( S331). If the area ratio is larger than the threshold k1, it is determined to be a figure (other than a character) and the process proceeds to step S336.
  • Fig. 7(A) shows an example of rotating the estimated characters.
  • alphabetic characters "C", “O” and “Z” and figures of perfect circles, squares and equilateral triangles are used as examples of estimated characters.
  • FIG. 7A shows, from the left, 0° (not rotated), 90° rotation, the superposition of the 0° estimated character and the 90°-rotated estimated character, 180° rotation, the superposition of the 0° estimated character and the 180°-rotated estimated character, 270° rotation, and the superposition of the 0° estimated character and the 270°-rotated estimated character. Note that the rotation is performed around the position of the center of gravity of the character area or figure area.
  • the personal information determination unit 111 supports not only the alphabet but also multiple languages. It has a threshold k11 for the superposition of 0° and 90°, a threshold k12 for the superposition of 0° and 180°, and a threshold k13 for the superposition of 0° and 270°, and determines whether the estimated character is a character by combining the superpositions at the three rotation angles. When the personal information determination unit 111 performs character determination for an estimated character of a certain specific language, only the threshold k11 for the superposition of 0° and 90° may be used, for example. Moreover, it is preferable to make the threshold k1 variable, and preferable to vary the threshold k1 for each language.
  • in FIG. 7(A), the overlapping area of "C" is about 80% and the overlapping area of the equilateral triangle is about 50%, so the difference between characters and figures becomes small. When it is thus difficult to determine whether the estimated character is a character, the estimated character may be shifted and rotated as shown in FIG. 7(B).
  • an alphabet "C”, a perfect circle, and an equilateral triangle are examples of estimated characters.
  • the rotation is performed around the position of the center of gravity of the character area or figure area.
  • the overlap area of the "C" is about 10%
  • the overlap area of the new circle is about 70%
  • the overlap area of the equilateral triangle is about 50%.
  • Whether or not the estimated character is a character can be determined by checking whether or not the overlap region of the estimated character at 0 degrees and the estimated character shifted by the distance S and rotated by 180 degrees is larger than the threshold value k12.
  • a technique of shifting and rotating 180°, rotating 90°, and rotating 270° may be combined.
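A hedged sketch of filter (a), the rotation-overlap test: rotate the candidate-character mask about its centroid and compare the overlap with the original area. Function and threshold names are illustrative, and a non-empty mask is assumed.

```python
import cv2

def rotation_overlap_ratio(char_mask, angle):
    """Overlap of a candidate-character mask with itself rotated about its
    centroid, as a fraction of the original area."""
    m = cv2.moments(char_mask, binaryImage=True)
    cx, cy = m["m10"] / m["m00"], m["m01"] / m["m00"]     # centroid
    rot = cv2.getRotationMatrix2D((cx, cy), angle, 1.0)
    rotated = cv2.warpAffine(char_mask, rot, char_mask.shape[::-1],
                             flags=cv2.INTER_NEAREST)
    overlap = cv2.bitwise_and(char_mask, rotated)
    return cv2.countNonZero(overlap) / max(cv2.countNonZero(char_mask), 1)

# Symmetric figures (circles, squares) keep a large overlap when rotated,
# while most characters do not, so a large ratio suggests "not a character".
# is_figure = rotation_overlap_ratio(mask, 90) > k1      # k1: illustrative threshold
```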
  • in step S332, the personal information determination unit 111 calculates the standard deviation of the distance from the center line of the estimated character to the boundary line of the estimated character and compares the standard deviation with a threshold k2; if the standard deviation is smaller than the threshold k2, the estimated character is determined to be a character, and if it is larger than the threshold k2, it is determined to be a figure or the like and the process proceeds to step S336. Although it has already been determined in step S331 whether the estimated character is a character, another method may be used in step S332 to determine whether the estimated character is a character.
  • FIG. 8(A) is an example showing the alphabet "J” and a trapezoidal figure similar to J.
  • the distance L1 from the center line (framework) 61 of "J" to the boundary line 63 is almost constant and its standard deviation is small.
  • J is Gothic, but the standard deviation is small even in other typefaces such as New Century.
  • the distance L2 from the center line (framework) 61 of the trapezoidal figure to the boundary line 63 varies greatly, and its standard deviation increases.
  • the threshold k2 may be set to 0.6.
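A hedged sketch of filter (b): sample the distance transform along the character skeleton (center line) and take the standard deviation; cv2.ximgproc.thinning requires opencv-contrib-python, and the threshold comparison is illustrative.

```python
import cv2
import numpy as np

def centerline_distance_std(char_mask):
    """Standard deviation of the distance from the center line (skeleton) of a
    candidate character to its boundary."""
    # Distance from every foreground pixel to the nearest background pixel.
    dist = cv2.distanceTransform(char_mask, cv2.DIST_L2, 3)
    skeleton = cv2.ximgproc.thinning(char_mask)
    return float(np.std(dist[skeleton > 0]))   # distances sampled on the center line

# A stroke of roughly constant width (a character such as "J") gives a small
# deviation; a filled figure such as a trapezoid gives a large one.
# is_character = centerline_distance_std(mask) < k2      # k2: illustrative threshold
```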
  • in step S333, the personal information determination unit 111 calculates the ratio of the area of the estimated character itself to the area of the entire region surrounding the estimated character. If the ratio is smaller than the threshold k3, the estimated character is determined to be a character and the process proceeds to step S334; if it is larger than the threshold k3, it is determined to be a figure or the like and the process proceeds to step S336. Although it has already been determined in steps S331 and S332 whether the estimated character is a character, another method may be used in step S333 to determine whether the estimated character is a character.
  • FIG. 8(B) is a specific example of step S333, showing an alphabet "J" and a spoon-shaped figure similar to J.
  • the area ratio of the area 66 of the "J" character itself to the area 65 of the entire region surrounding "J" is about 60%, whereas for the spoon-shaped figure the corresponding area ratio to area 65 is about 80%.
  • the threshold k3 may be set to 70%.
  • the order of steps S331 to S333 may be changed, or it may be determined whether the estimated character is a character in only one of the steps.
  • in steps S334 and S335, it is determined whether the character is personal information, because it was determined in step S331, S332, or S333 that the estimated character is a character.
  • in step S334, it is determined whether the estimated character is larger than a certain threshold k4 relative to the image of the predetermined size. If it is larger than the threshold k4, it is determined to be a large character and the process proceeds to step S338; if it is smaller than the threshold k4, it is determined to be a small character and the process proceeds to step S335. More specifically, when the image size is FHD (1920*1088), it is determined whether at least one of the vertical pixel count and the horizontal pixel count of the estimated character is, for example, 5 percent (96*54) or more of the image size; if it is larger, the process proceeds to step S338.
  • the percentage of the image size is preferably variable.
  • a function that can change the threshold value k4 may be displayed on a web screen or the like for uploading a color image so that the user can determine the size of characters that the user wants to leave.
  • in step S335, it is determined whether the character is drawn with a line width thicker than a certain threshold k5. If it is larger than the threshold k5, it is determined not to be personal information and the process proceeds to step S338; if it is smaller, the process proceeds to step S337.
  • the threshold k5 is also variable.
  • a function that can change the threshold value k5 may be displayed on a web screen or the like for uploading a color image so that the user can determine the characters with a thick line width that the user wants to leave.
  • FIG. 8(C) is a specific example of step S335, showing a thick line width "I” and a thin line width "I” of the alphabet.
  • the area of the "I” character itself 68 and the area of the three centerlines (framework) 67 of the "I” are calculated, and it is determined whether the ratio of the areas is greater than a threshold k5, for example 10.
  • in step S336 of FIG. 6, the estimated characters determined not to be characters in step S331, S332, or S333 are considered to be figures or the like and are therefore left as part of the color image.
  • in step S337, the estimated characters were determined to be characters in step S331, S332, or S333 and determined to be neither large characters nor characters with a thick line width in steps S334 and S335; these characters are therefore deleted from the color image as personal information.
  • in step S338, the estimated characters were determined to be characters in step S331, S332, or S333 and determined to be large characters or characters with a thick line width in step S334 or S335.
  • the "Graduation Ceremony" signboard hanging at the entrance of the school and the letters drawn in thick lines on the T-shirt are preserved in the color image.
  • large characters or characters with a thick line width are not treated as personal information, even if they are names, addresses, dates of birth, sex, numbers on license plates, and the like.
  • FIG. 9 is a color image acquired by the personal information masking device 100 from the personal computer 23.
  • FIGS. 10, 11, 12, and 13 are color images output to the personal computer 23 by the personal information masking device 100.
  • the color image shown in FIG. 9 includes the license plates of more than 10 cars, characters in 8 languages (English, Thai, Arabic, Cyrillic, Japanese, Georgian, Korean, Burmese) drawn on the cars, English characters such as news titles, and the faces of reporters.
  • FIG. 10 is an output example of Example 1; Example 1 is an example of processing the color image without using steps S31 and S32 of FIG. 2. That is, FIG. 10 is a color image obtained by executing steps S21-S30 and S33-S37 in FIG. 2. In step S23, only S231 (face area protection) in FIG. 4 is executed.
  • FIG. 11 is an output example of Example 2, in which steps S24-S30 and S32 of FIG. 2 are not executed. That is, FIG. 11 is a color image obtained by performing step S31 on the grayscale image converted in step S22. In step S23, only S231 (face area protection) in FIG. 4 is executed.
  • FIG. 12 is an output example of Example 3.
  • in Example 3, all steps of the flowchart of FIG. 2 are executed except for step S32. Therefore, FIG. 12 is equivalent to a photograph in which FIGS. 10 and 11 are superimposed.
  • in step S23, only S231 (face area protection) in FIG. 4 is executed.
  • FIG. 13 is an output example of Example 4.
  • in Example 4, steps S24-S30 and S32 are not executed, as in Example 2, and FIG. 13 is a color image obtained by performing step S31 on the grayscale image converted in step S22.
  • in step S23, only S234 (setting of the protected area) in FIG. 4 is executed, and the protected area set by user H is the frame in which the reporter is captured.
  • the personal information masking device of this embodiment processes everything from acquiring a color image to outputting a color image from which personal information has been erased.
  • the personal computer 23 or the mobile information terminal 25 may be responsible for part of the processing.
  • for example, the personal computer 23 or the mobile information terminal 25 may download an application from the cloud server in advance, and the personal computer 23 or the mobile information terminal 25 may convert the color image to the predetermined size and may detect and store protected areas such as the face area before uploading to the personal information masking device 100.
  • a color image may be output from the personal information masking apparatus 100 and the protected area may be overwritten by the personal computer 23 or the portable information terminal 25 .
  • the personal computer 23 or the portable information terminal 25 may be responsible for all software processing of the personal information masking device 100 .
  • 23 personal computer, 25 portable information terminal, 100 personal information masking device, 101 color image acquisition unit, 102 image size conversion unit, 103 protected area detection unit, 104 noise removal unit (color), 105 color space conversion unit, 106 grayscale conversion unit, 107 local histogram equalization unit, 108 binary conversion unit, 109 noise removal unit (binary), 110 area detection unit, 111 personal information determination unit, 112 personal information erasing unit, 113 protected area overwriting unit, 114 finishing unit, 115 color image output unit, 116 personal information detection unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Closed-Circuit Television Systems (AREA)
  • Studio Devices (AREA)
  • Image Analysis (AREA)

Abstract

[problem] To provide a personal information masking method which erases personal information of characters or numbers from a color image. [Solution] This personal information masking method comprises: a step (S21) for acquiring a color image; a size change step (S22) for changing the acquired color image to a first color image of a prescribed size; a step (S28) for changing the first color image to a binary value image; an area detection step (S30) for detecting, from the binary image, an area in which personal information is possibly present; a decision step (S33) for deciding whether the personal information is present with respect to the area in which the personal information is likely to be present; and an erasing step (S34) for drawing surrounding pixels of the personal information on the personal information of the first color image and erasing the personal information.

Description

個人情報マスキング方法、及び個人情報マスキング装置Personal information masking method and personal information masking device

  本発明は、画像に写し込まれた個人を特定する個人情報の箇所にマスキングを付す個人情報マスキング方法、及び個人情報マスキング装置に係る。 The present invention relates to a personal information masking method and a personal information masking apparatus for masking a portion of personal information that identifies an individual captured in an image.

  近年、ユーザが、企業もしくは学校等での活動報告をクラウド上のSNS(ソーシャルネットワークサービス)に報告したり、人物を含む風景をSNSに投稿したりすることが多い。その活動報告や投稿には、テキストデータの文章だけでなく、静止画もしくは動画の画像情報もSNSにアップロードされることも多い。   In recent years, users often report activity reports at companies, schools, etc. to SNSs (social network services) on the cloud, and post scenery including people to SNSs. In the activity reports and posts, not only sentences of text data but also image information of still images or moving images are often uploaded to the SNS.

  しかしながら、その画像(静止画・動画)の中に、個人情報(名前、住所、生年月日、性別、自動車登録番号等の文字及び数字からなる情報、以下、本明細書では文字及び数字からなる情報を個人情報という。)が写し込まれることも多い。このような個人情報は、個人のプライバシーを侵害する恐れがあり、また犯罪に利用されてしまう問題を生じている。 However, in the image (still image / video), personal information (information consisting of letters and numbers such as name, address, date of birth, gender, car registration number, etc.) information is called personal information.) is often imprinted. Such personal information may infringe on the privacy of individuals, and pose a problem of being used for crimes.

  このような問題を生じないようにするために、特許文献1には、ディスプレイに画像が表示された後、個人情報の保護の観点から画像に共有したくない個人情報が存在するか否かをユーザの判定結果を受け付け、その判定結果に基づいて、個人情報を認知できないように、画像の一部領域をマスキングするように構成する技術思想が開示されている。また文字もしくは数字からなる個人情報を画像から検出する手法は、特許文献2に開示されるように、パターンマッチング技術を用いたOCR(Optical Character Recognition)が使用されている。 In order to avoid such a problem, Patent Document 1 discloses that after an image is displayed on a display, it is determined whether or not there is personal information that the user does not want to share in the image from the viewpoint of protecting personal information. A technical idea is disclosed in which a user's determination result is accepted, and based on the determination result, a partial area of an image is masked so that personal information cannot be recognized. As a method for detecting personal information consisting of letters or numbers from an image, OCR (Optical Character Recognition) using pattern matching technology is used, as disclosed in Patent Document 2.

特開2020-156033号公報JP 2020-156033 A 特開2015-028735号公報JP 2015-028735 A

  しかしながら、特許文献1の個人情報等が存在するか否かをユーザの判断に委ねられており、個人情報等が多く含まれる場合には、ユーザに著しい負担が生じる。またユーザが精査しないと気が付かない個人情報等もある可能性があり、気が付かない場合には個人情報等についてはマスキングされず、個人情報等が保護されないという問題点があった。また、特許文献2に記載されているOCRでは解像度、縦横の配列、手書き文字、くずし文字などは文字もしくは数字を認識することが困難であり、特に画像背景に含まれている小さな文字もしくは数字は、ほとんど認識できなかった。 However, it is up to the user to decide whether or not the personal information, etc. of Patent Document 1 exists. In addition, there is a possibility that there may be personal information, etc., that the user does not notice unless the user carefully examines it. If the user does not notice, the personal information, etc. is not masked, and there is a problem that the personal information, etc. is not protected. In addition, in the OCR described in Patent Document 2, it is difficult to recognize characters or numbers such as resolution, vertical and horizontal arrangement, handwritten characters, and broken characters, especially small characters or numbers included in the image background. , was almost unrecognizable.

 また欧州では、個人情報やプライバシーの保護に関して、GDPR(General Data Protection Regulation)が施行され、欧州以外でも個人情報のさらなる保護規制が進められているため、個人情報を保護する要望が高くなっている。 In addition, in Europe, the GDPR (General Data Protection Regulation) has been enforced to protect personal information and privacy, and further regulations for the protection of personal information are being promoted outside of Europe, so there is a growing demand for the protection of personal information. .

  本発明は、上記の問題点を踏まえ、カラー画像から文字もしくは数字と思われる個人情報を消去するとともに、個人情報を消去してもカラー画像として違和感がない画像を出力する個人情報マスキング装置及び方法を提供する。 In view of the above problems, the present invention provides a personal information masking apparatus and method for erasing personal information that appears to be characters or numbers from a color image, and for outputting an image that does not give a sense of incongruity as a color image even if the personal information is erased. I will provide a.

  本実施形態に係る個人情報マスキング方法は、カラー画像を取得する工程と、取得したカラー画像を所定サイズの第1カラー画像に変換するサイズ変換工程と、第1カラー画像を二値画像に変換する工程とを備える。さらにマスキング方法は、二値画像に対して個人情報が存在する可能性がある領域を検出する領域検出工程と、個人情報が存在しそうな領域に対して個人情報の有無を判定する判定工程と、判定工程で判定された個人情報と第1カラー画像とに基づいて、個人情報の周辺画素を第1カラー画像の個人情報に描き入れ個人情報を消去する消去工程と、消去工程後のカラー画像を出力する工程と、を備える。
 このサイズ変換工程は、取得したカラー画像を所定サイズの第1グレースケール画像に変換し、領域検出工程は、第1グレースケール画像に対して個人情報が存在する可能性のある領域を検出してもよい。
A personal information masking method according to the present embodiment comprises a step of obtaining a color image, a size conversion step of converting the obtained color image into a first color image of a predetermined size, and a step of converting the first color image into a binary image. and a step. Furthermore, the masking method includes an area detection step of detecting an area in which personal information may exist in a binary image, a determination step of determining whether or not personal information exists in an area in which personal information is likely to exist, and Based on the personal information determined in the determining step and the first color image, an erasing step of drawing peripheral pixels of the personal information into the personal information of the first color image to erase the personal information, and a color image after the erasing step. and outputting.
The size conversion step converts the obtained color image into a first grayscale image of a predetermined size, and the region detection step detects a region in which personal information may exist in the first grayscale image. good too.

  別の実施形態に係る個人情報マスキング方法は、カラー画像を取得する工程と、取得したカラー画像を所定サイズの第1カラー画像及び第1グレースケール画像に変換するサイズ変換工程と、第1グレースケール画像に対して個人情報が存在する可能性のある領域を検出する領域検出工程と、を備える。さらにマスキング方法は、個人情報が存在しそうな領域に対して個人情報の有無を判定する判定工程と、判定工程で判定された個人情報と第1カラー画像とに基づいて個人情報の周辺画素を第1カラー画像の個人情報に描き入れ個人情報を消去する消去工程と、消去工程後のカラー画像を出力する工程と、を備える。 A personal information masking method according to another embodiment includes steps of obtaining a color image, a size conversion step of converting the obtained color image into a first color image and a first grayscale image of a predetermined size, and a first grayscale image. and an area detection step of detecting areas in the image where personal information may exist. Further, the masking method comprises a determination step of determining the presence or absence of personal information in an area in which personal information is likely to exist, and a pixel surrounding the personal information based on the personal information determined in the determination step and the first color image. An erasing step of drawing in personal information of one color image and erasing the personal information, and a step of outputting the color image after the erasing step are provided.

 さらにマスキング方法は、第1カラー画像に対して人物の保護領域を検出する工程と、保護領域を検出した場合に保護領域を保存する工程と、消去工程後の第1カラー画像に、保護領域を上書きする上書き工程と、を備えてもよい。
 さらにマスキング方法は、第1カラー画像に対してノイズ除去する工程と、第1カラー画像に対して色空間変換を処理する工程と、ノイズ除去する工程及び色空間変換を処理する工程の後に第1カラー画像を第2グレースケール画像に変換する変換工程と、を備えても良い。
 さらにマスキング方法は、第2グレースケール画像に対して局所的ヒストグラム平坦化を処理し、第3グレースケール画像に変換する工程、を備え、領域検出工程は、第2グレースケール画像又は第3グレースケール画像に対して個人情報が存在する可能性のある領域を検出しても良い。
Further, the masking method includes the steps of detecting a protected area of a person in the first color image, storing the protected area when the protected area is detected, and removing the protected area from the first color image after the erasing step. and an overwriting step of overwriting.
Further, the masking method comprises the steps of denoising the first color image, processing a color space transformation on the first color image, and after the steps of denoising and processing the color space transformation, the first masking method. and a converting step of converting the color image into a second grayscale image.
The masking method further comprises processing a local histogram equalization on the second grayscale image and converting to a third grayscale image, wherein the region detection step comprises the second grayscale image or the third grayscale image. A region in which personal information may exist in the image may be detected.

The determination step preferably applies at least one of the following filters:
(a) a filter that determines the presence or absence of personal information based on whether the logical AND of a region in which personal information is likely to exist and that region rotated by a predetermined angle is greater than a first threshold;
(b) a filter that determines the presence or absence of personal information based on whether the standard deviation of the distance from the thin line forming the skeleton of a region in which personal information is likely to exist to the boundary of that region is greater than a second threshold;
(c) a filter that determines the presence or absence of personal information based on whether the height or width of a region in which personal information is likely to exist is greater than a third threshold relative to the predetermined size;
(d) a filter that determines the presence or absence of personal information based on whether the ratio of the area of the thin line forming the skeleton of a region in which personal information is likely to exist to the area of that region is greater than a fourth threshold.

A personal information detection step of detecting personal information in the first color image may also be provided.
A finishing step of applying an edge-preserving filter after the overwriting step may also be provided.

A personal information masking apparatus according to the present embodiment includes an acquisition unit that acquires a first color image of a predetermined size, a binary conversion unit that converts the first color image into a binary image, a region detection unit that detects, in the binary image, regions in which personal information may exist, a determination unit that determines whether personal information is present in each region in which personal information is likely to exist, an erasing unit that erases the personal information by painting pixels surrounding the personal information into the personal information of the first color image, based on the personal information determined by the determination unit and on the first color image, and an output unit that outputs the color image after erasure.

The personal information masking apparatus may further include a protected area detection unit that detects a protected area of a person in the first color image, and the protected area detection unit preferably has at least one of the following functions:
i) detecting a face region using deep learning;
ii) detecting a specific object using deep learning;
iii) detecting a skin region from the skin color in the first color image;
iv) allowing a protection frame to be set on the first color image and detecting the area within that frame.

In this specification, personal information means characters captured in an image (still image or video); "characters" hereinafter includes letters and numerals of any language.

According to the personal information masking apparatus and personal information masking method of the present invention, personal information is erased by painting the pixels surrounding the personal information into the personal information in the color image, so that the personal information is protected.

FIG. 1 is a block diagram of one embodiment of the personal information masking apparatus.
FIG. 2 is a flowchart of processing using the personal information masking apparatus.
FIG. 3(A) illustrates the color space conversion performed by the color space conversion unit, FIG. 3(B) illustrates contour detection and outer-frame detection by the region detection unit, and FIG. 3(C) illustrates the mechanism for erasing personal information.
FIG. 4 is a flowchart showing the protected area detection function.
FIG. 5 is a conceptual diagram of a user setting a protected area on the personal computer 23 or the portable information terminal 25.
FIG. 6 is a flowchart showing the method of determining whether a candidate is personal information (characters).
FIG. 7(A) shows a character or figure rotated and superimposed on the original character or figure, and FIG. 7(B) shows a character or figure shifted by a fixed distance, rotated, and superimposed on the original character or figure.
FIG. 8(A) illustrates the distance from the center line of a character or figure to its boundary, and FIG. 8(B) is a conceptual diagram relating to the center line of a character or figure and to contour and outer-frame detection by the region detection unit.
FIG. 9 is an example of a color image acquired by the personal information masking apparatus 100.
FIGS. 10, 11, 12, and 13 are examples of output color images (Examples 1 to 4).

The personal information masking apparatus of the embodiments will now be described in detail with reference to the drawings. In this specification and the drawings, components having substantially the same functional configuration are given the same reference numerals, and redundant description is omitted.

<<Overview of the personal information masking apparatus>>
The personal information masking apparatus according to the present embodiment detects characters, which constitute personal information, in color images such as everyday scenes captured with a camera or video camera. OCR (Optical Character Recognition) is widely used to recognize characters in an image. The personal information masking apparatus according to the present embodiment, however, does not use OCR, so there is no need to register in advance all of the characters to be recognized, and personal information in multiple languages can be deleted. The apparatus can also delete fine or broken characters that OCR cannot detect, such as small personal information contained in the background.

FIG. 1 shows an overview of a system in which a user H uploads images (still images or video) taken at, for example, a school to the personal information masking apparatus 100 and downloads or receives images from which the personal information has been erased. Even if user H uploads such an image to an SNS or the like, the personal information inside the school contained in the image has already been erased, so no personal information is leaked.

User H captures a still image of a certain location in the school with, for example, a digital camera 21. The still image captured by the camera 21 is recorded on a personal computer 23. Using the personal computer 23, user H can upload the still image to the personal information masking apparatus 100 via a communication network NET such as the Internet. User H can also shoot video with a camera-equipped portable information terminal 25 such as a tablet PC or smartphone and upload the video to the personal information masking apparatus 100 via the communication network NET. Although not described in detail, user H may be required to register a user name, password, credit card number, and the like in advance in order to access the personal information masking apparatus 100.

<<Overview of the configuration of the personal information masking apparatus 100>>
FIG. 1 shows a block configuration of one embodiment of the personal information masking apparatus 100. As hardware, a cloud server or the like is suitable; specifically, it has one or more processors, one or more memories, a communication interface, and so on. The color image acquisition unit, image size conversion unit, and other units described below are implemented by the processors, memories, and communication interface together with programs executed on the processors.

The personal information masking apparatus 100 includes a color image acquisition unit 101 that acquires a color image (still image or video) from an external device such as the personal computer 23 or the portable information terminal 25, an image size conversion unit 102 that converts the acquired image to a predetermined size, a protected area detection unit 103 that detects whether the resized color image contains a person's face, a clock, or the like (a protected area), a noise removal unit (color) 104 that removes noise from the color image, and a color space conversion unit 105 that makes personal information buried in the background color easier to see. The image size conversion unit 102 also has a function of converting the color image, before color noise removal, into grayscale.

The personal information masking apparatus 100 further includes a grayscale conversion unit 106 that converts the denoised color image into a grayscale image, a local histogram equalization unit 107 that equalizes the brightness of the grayscale image, a binary conversion unit 108 that converts the brightness-equalized grayscale image into a binary image, a noise removal unit (binary) 109 that removes black-and-white noise from the binary image, and a region detection unit 110 that detects, from the binary image or the grayscale image, regions in which personal information is likely to exist. The region detection unit 110 detects boundaries and outlines formed by line drawings, shadows of objects, noise, and the like in the binary or grayscale image as regions in which personal information is likely to exist. It can therefore detect fine characters that are hard for the human eye to notice, and it works for characters of any language rather than a specific one. On the other hand, the region detection unit 110 also picks up shadows of objects and noise that are not characters as regions in which personal information is likely to exist.

The personal information masking apparatus 100 further includes a personal information determination unit 111 that determines whether personal information is present in a region in which personal information is likely to exist, a personal information erasing unit 112 that erases the determined personal information from the color image converted to the predetermined size, a protected area overwriting unit 113 that overwrites the protected area, saved when the protected area detection unit 103 detected a person's face, skin, a clock, or the like, onto the color image from which the personal information has been erased, a finishing unit 114 that applies finishing processing to the color image, and a color image output unit 115 that outputs the color image to an external device such as the personal computer 23 or the portable information terminal 25. The personal information masking apparatus 100 may also include a personal information detection unit 116 such as a neural-network-based scene text detector (EAST: An Efficient and Accurate Scene Text Detector).

The personal information masking apparatus 100 does not have to house all of the above components in the same physical location; it may be a collection of servers. Some of the components may also be implemented on the personal computer 23 or the portable information terminal 25 by downloading application software.

<<Flowchart of the personal information masking method>>
Next, the operation of the personal information masking method of this embodiment will be described. FIG. 2 is a flowchart of the personal information masking method from the acquisition of a color image to the output of a color image from which personal information has been erased.

First, the color image acquisition unit 101 acquires a color image from the personal computer 23 or the portable information terminal 25 (step S21). Examples of acquired images are HD (1280*720), FHD (1920*1088), and 6M-wide (3264*1836) still images, and HD (1280*720), FHD (1920*1088), and 4K (3840*2160) video.

Next, the image size conversion unit 102 converts color images of various sizes into an image of a predetermined size (S22). In this embodiment, for example, the image size conversion unit 102 converts them to FHD-size color images. The color image converted to the predetermined size is stored in a memory or the like (not shown). The image size conversion unit 102 also has a function of converting the color image into grayscale.

Next, the protected area detection unit 103 detects protected areas contained in the color image of the predetermined size (S23). When no protected area is detected, it is preferable to add a flag to the color image indicating that it contains no protected area. The protected area detection function is described later.

Next, the color space conversion unit 105 performs a color space conversion so that personal information buried in colors becomes easier to see (S24). Color images can be represented in the RGB format, which expresses a color image with the three primary colors of light, or in the HSV format, which expresses a color image with hue, saturation, and value. The color space conversion unit 105 converts the color image (RGB) of the predetermined size into a color image (HSV) and erases dark background colors so that the personal information stands out. FIG. 3(A) shows a color photograph (A-1) of a red cap with a name written on it in black, a grayscale image of that photograph (A-2), and an image with the saturation (S) set to zero (A-3). When the color photograph (A-1), in which personal information is written in black on a dark background color (in particular red, blue, gray, or the like), is converted to grayscale or to a binary image, the black personal information becomes buried in the dark background and actually harder to detect (see A-2). In the image with the saturation (S) set to zero (A-3), on the other hand, the personal information stands out clearly.
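For reference, this conversion can be sketched with OpenCV's Python bindings roughly as follows; the function name flatten_saturation and the use of cv2 are illustrative assumptions, not part of the specification.

```python
import cv2

def flatten_saturation(bgr):
    # Convert to HSV, zero the saturation channel, and convert back so that
    # dark text on a strongly colored background is no longer buried in it.
    hsv = cv2.cvtColor(bgr, cv2.COLOR_BGR2HSV)
    hsv[:, :, 1] = 0
    return cv2.cvtColor(hsv, cv2.COLOR_HSV2BGR)
```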

Next, the noise removal unit (color) 104 removes noise from the color image (S25). Because the region detection unit 110 treats line drawings, shadows of objects, noise, and the like as regions in which personal information is likely to exist, noise increases the amount of computation, so it is preferable to remove as much noise from the color image as possible. Noise here includes the roughness of object surfaces captured in the color image: if the subject is, for example, a leather surface, the region detection unit 110 would also detect the texture of the leather as shadows of objects, so such surface texture is treated as noise contained in the color image.

One way to remove noise from a color image is Gaussian blur, which blurs the entire image using a Gaussian function. However, Gaussian blur also smooths the edges of the personal information contained in the color image, which may make the personal information undetectable. The noise removal unit (color) 104 of this embodiment therefore preferably applies an edge-preserving smoothing filter (bilateral filter, mean shift filter, or adaptive bilateral filter).
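A minimal sketch of such an edge-preserving smoothing step, assuming OpenCV's bilateral filter; the parameter values are illustrative only.

```python
import cv2

def denoise_color(bgr):
    # Bilateral filtering smooths texture and noise while keeping the edges
    # of characters; diameter 9 and sigmas of 75 are illustrative values.
    return cv2.bilateralFilter(bgr, 9, 75, 75)
```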

Next, the grayscale conversion unit 106 converts the color-space-converted image into a grayscale image (S26).
The local histogram equalization unit 107 then equalizes the grayscale image so that personal information buried in dark or bright areas can be detected more easily (S27). For example, if a grayscale image contains a bright object against an overall dark background, equalizing the contrast globally can turn the bright object pure white, and any personal information on the bright object disappears. Applying local histogram equalization to the grayscale image therefore makes it easier to detect personal information buried in dark or bright areas. Specific local histogram equalization techniques include adaptive histogram equalization, contrast-limited adaptive histogram equalization (CLAHE), multi-peak histogram equalization (MPHE), and multipurpose beta optimized bi-histogram equalization (MBOBHE).
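For reference, CLAHE, one of the local histogram equalization techniques listed above, could be applied as in the following sketch (OpenCV assumed; the clip limit and tile size are illustrative).

```python
import cv2

def local_equalize(gray):
    # Contrast-limited adaptive histogram equalization applied per tile, so
    # bright objects on dark backgrounds are not blown out globally.
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))
    return clahe.apply(gray)
```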

Next, the binary conversion unit 108 converts the brightness-equalized grayscale image into a binary image (S28). If the background of the binary image is dark and the personal information is written in black, binarizing with a single fixed threshold makes the background and the personal information take the same value, and the region detection unit 110 can no longer detect the personal information region. The binary conversion unit 108 therefore preferably makes black characters in dark areas stand out by binarizing with the average value of the entire grayscale image (Otsu's binarization, a discriminant analysis method), or by adaptive thresholding, which binarizes each part of the grayscale image (a relatively small neighborhood) using its local average (Adaptive Gaussian Thresholding or Adaptive Mean Thresholding).
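A sketch of both binarization options mentioned above, assuming OpenCV; the block size and offset of the adaptive variant are illustrative values.

```python
import cv2

def binarize(gray):
    # Global threshold chosen by Otsu's discriminant analysis.
    _, otsu = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    # Local threshold from a Gaussian-weighted neighborhood mean, which keeps
    # black characters visible even in dark parts of the image.
    adaptive = cv2.adaptiveThreshold(gray, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C,
                                     cv2.THRESH_BINARY, 31, 10)
    return otsu, adaptive
```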

Next, the noise removal unit (binary) 109 removes the noise remaining in the binary image (S29). Specifically, the noise removal unit (binary) 109 removes binary noise by applying morphological transformations that repeat erosion, which removes pixels at the boundaries of objects in the binary image, and dilation, which adds pixels at those boundaries.
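A sketch of this morphological cleanup, assuming OpenCV; the 3x3 kernel is an illustrative choice.

```python
import cv2
import numpy as np

def remove_binary_noise(binary):
    kernel = np.ones((3, 3), np.uint8)
    # Opening (erosion then dilation) removes isolated specks; closing
    # (dilation then erosion) fills small holes inside strokes.
    opened = cv2.morphologyEx(binary, cv2.MORPH_OPEN, kernel)
    return cv2.morphologyEx(opened, cv2.MORPH_CLOSE, kernel)
```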

Next, the region detection unit 110 detects, from the binary image, contours where personal information is likely to exist (S30). This means detecting contours around lines and similar features in the binary image. Roughly speaking, the region detection unit 110 scans the binary image to find object pixels and determines whether each is an outer boundary or a hole boundary; by repeating this decision while scanning, it detects the outermost boundaries, that is, the contours. For example, FIG. 3(B-1) is an illustration in which the contour 50 (dash-dot line) has been detected around a line drawing contained in the binary image. Details are disclosed in the paper "Topological structural analysis of digitized binary images by border following" by Satoshi Suzuki (Computer Vision, Graphics, and Image Processing, Volume 30, Issue 1, April 1985, Pages 32-46).
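OpenCV's findContours implements the cited border-following algorithm; a minimal sketch of contour-based candidate detection follows (the OpenCV 4.x return signature is assumed).

```python
import cv2

def detect_contour_candidates(binary):
    # Suzuki border following over the binary image; each contour's bounding
    # box becomes a candidate region in which personal information may exist.
    contours, _ = cv2.findContours(binary, cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)
    return [cv2.boundingRect(c) for c in contours]
```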

In parallel with step S30, the region detection unit 110 detects, from the grayscale image, outer frames where personal information is likely to exist (S31). Any of the grayscale images generated in steps S22, S26, or S27 may be used. Roughly speaking, the region detection unit 110 groups nearby pixels with similar luminance values into single regions, which is used for segmenting the grayscale image. Building on the idea of grouping areas with similar luminance, a region with a stable distribution is interpreted as an outer frame, and its representative points are derived as features. For example, FIG. 3(B-2) is an illustration in which the outer frame 51 (two-dot chain line) has been detected around a line drawing contained in the grayscale image. FIG. 3(B-2) differs from FIG. 3(B-1) in particular in that the inside of the "0" is also detected as an outer frame. Details are disclosed in the paper "Efficient Maximally Stable Extremal Region (MSER) Tracking" by M. Donoser et al. (2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)).
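The MSER detection can be sketched with OpenCV as follows; this is only an illustration of the cited technique, with default detector parameters assumed.

```python
import cv2

def detect_mser_candidates(gray):
    # Maximally Stable Extremal Regions group nearby pixels of similar
    # luminance; each stable region's bounding box is a candidate outer frame.
    mser = cv2.MSER_create()
    regions, bboxes = mser.detectRegions(gray)
    return bboxes  # array of (x, y, w, h)
```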

The personal information detection unit 116 detects personal information from the color image (S32). The personal information detection unit 116 detects personal information (characters) by applying a scene text detector such as EAST (An Efficient and Accurate Scene Text Detector) to the color image of the predetermined size generated in step S22. The personal information detection unit 116 can detect relatively large characters in the color image, but has difficulty detecting small characters contained in the background and the like.

Next, for the contour or outer-frame regions detected in steps S30 and S31 as likely to contain personal information, the personal information determination unit 111 determines whether personal information, that is, characters or numerals, is present (S33). The personal information determination unit 111 may likewise determine whether the characters detected by the personal information detection unit 116 constitute personal information (S33). The method of determining personal information is described later with reference to FIG. 6.

In this embodiment there are a route that determines whether personal information is present from the binary image (S29, S30, S33), a route that determines whether personal information is present from the grayscale image (S22 (or S26, S27), S31, S33), and a route that determines whether personal information is present from the color image (S22, S32, S33), and the personal information (characters) found by each route is stored in a memory (not shown). The personal information (characters) stored by each route is then merged to obtain the personal information contained in the color image. The method is not limited to this; only one of these three routes may be applied, or two of the three routes may be selected and merged.

When the personal information has been identified in step S33, the personal information erasing unit 112 erases the determined personal information from the color image of the predetermined size so that the result blends naturally with its surroundings (S34). FIG. 3(C) shows a color photograph (C-1) in which personal information (a linear scratch in FIG. 3(C)) appears in the color image of the predetermined size, data (C-2) in which the personal information determined in step S33 is rendered in white on a black background, and a color image (C-3) in which the personal information has been inpainted with the surrounding pixels. In other words, the personal information erasing unit 112 prepares the color image of the predetermined size (C-1) and an image of the same size in which the personal information is rendered in white on a black background (C-2), and by gradually painting the surrounding pixels into the personal information from its boundary inward (inpainting), the personal information is erased (C-3).

Inpainting the personal information with its surrounding pixels involves a large amount of computation and therefore burdens the processor or takes time, so in step S34 it is preferable to reduce the amount of computation. Specifically, the personal information erasing unit 112 converts the image from the predetermined size (for example FHD) to a reduced size (for example HD). At this reduced size it inpaints the pixels surrounding the personal information based on the color photograph (C-1) and the white mask data (C-2), producing a reduced-size image from which the personal information has been erased (C-3). The reduced-size erased image (C-3) is then scaled back to the predetermined size, and the pixels at the positions marked in white (C-2) are overwritten from the rescaled image (C-3) onto the color image of the predetermined size (C-1). This processing greatly reduces the amount of computation.

In more detail, there are two techniques for gradually painting the surrounding pixels into the personal information from its boundary inward (inpainting). In one, the value of a pixel in the neighborhood to be erased is replaced by a weighted sum of the values of surrounding pixels whose values are already known; after a pixel is painted in, the Fast Marching Method (FMM) moves to the next nearest point, so pixels are painted in order starting from those closest to pixels that contain no personal information or have already been erased. Details are disclosed in the paper "An image inpainting technique based on the fast marching method" by Alexandru Telea (Journal of Graphics Tools, January 2004). In the other technique, pixels are painted in by searching along edges from the pixel values of the known region around the personal information into the personal information region. Details are disclosed in the paper "Navier-Stokes, fluid dynamics, and image and video inpainting" by Marcelo Bertalmio et al. (Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition).
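A compact sketch combining the size-reduction idea of S34 with OpenCV's two inpainting implementations (cv2.INPAINT_TELEA for the FMM method, cv2.INPAINT_NS for the Navier-Stokes method); the mask format, scale factor, and radius are illustrative assumptions.

```python
import cv2

def erase_personal_info(color_fhd, mask_fhd, scale=2):
    # mask_fhd: 8-bit single-channel image, 255 where characters were judged
    # to be personal information, 0 elsewhere (the white-on-black data C-2).
    h, w = color_fhd.shape[:2]
    small = cv2.resize(color_fhd, (w // scale, h // scale))
    small_mask = cv2.resize(mask_fhd, (w // scale, h // scale),
                            interpolation=cv2.INTER_NEAREST)
    # Inpaint at the reduced size; cv2.INPAINT_NS would select the other method.
    filled = cv2.inpaint(small, small_mask, 3, cv2.INPAINT_TELEA)
    # Scale back up and copy the filled pixels only where the mask was set.
    filled = cv2.resize(filled, (w, h))
    out = color_fhd.copy()
    out[mask_fhd > 0] = filled[mask_fhd > 0]
    return out
```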

Next, when the protected area detection unit 103 has detected a protected area, the protected area overwriting unit 113 overwrites the saved protected area onto the color image from which the personal information has been erased (S35). The regions detected by the region detection unit 110 as likely to contain personal information may include, for example, the temples of eyeglasses or the shadow of a nose, and if the personal information erasing unit 112 erased such features the protected area would be corrupted. The protected area is therefore protected by overwriting the saved protected area onto the color image from which the personal information has been erased. If no protected area was detected in step S23, step S35 may be skipped based on the flag.

The finishing unit 114 then preferably applies finishing processing to the color photograph onto which the protected area has been overwritten (S36). Specifically, a pattern-preserving filter (non-local means filter), which suppresses smoothing of luminance values across boundaries, or Gaussian blur is applied to the color image. This makes the boundary with the overwritten protected area less noticeable and renders illegible any minute personal information (characters) that could not be erased. There is no problem even if the finishing processing is omitted.
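For reference, the non-local means variant of the finishing step could look like the following sketch (OpenCV assumed; the filter strengths and window sizes are illustrative).

```python
import cv2

def finish(color):
    # Non-local means denoising preserves repeating patterns and edges while
    # smoothing residual artifacts and tiny unreadable characters.
    return cv2.fastNlMeansDenoisingColored(color, None, 10, 10, 7, 21)
```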

Finally, the color image output unit 115 outputs the finished color image to the personal computer 23 or the portable information terminal 25 (S37).

<Detection of protected areas>
The function (S23) by which the protected area detection unit 103 detects protected areas will now be described with reference to FIGS. 4 and 5.

First, in step S231, the protected area detection unit 103 uses AI capable of deep learning or the like to detect face regions in the color image resized to the predetermined size, based on features of the human face such as the positions of the eyes, nose, and mouth. A face region is a protected area that contains no personal information (characters); the face region is saved in step S235 and, as explained for S35, is overwritten onto the color image from which the personal information has been erased.

Next, in step S232, the protected area detection unit 103 uses AI or the like to label the objects contained in the color image. If the color image contains a clock, a calendar, or a table, for example, the protected area detection unit 103 outputs a label such as "clock", "wall calendar", or "table". Clocks and calendars have time and date numerals (characters) drawn on them; objects such as clocks, which contain characters but no personal information, are registered in advance. A clock or the like is thus detected as a protected area, and its protected area is saved in step S235.

Next, in step S233, the protected area detection unit 103 detects whether the color image resized to the predetermined size contains skin color. This prevents, for example, the space (shadow) between two fingers from being recognized as a single-stroke character (for example, the letter "I"). A skin region is a protected area that contains no personal information, and the skin region is saved in step S235.

The protected area detection unit 103 may detect skin color in the color image (RGB) of the predetermined size using, for example, the RGB, HSV, and YCbCr color representations. Details are disclosed in the paper "Human Skin Detection Using RGB, HSV and YCbCr Color Models" by S. Kolkur et al. (ICCASP/ICMMD-2016, published by Atlantis Press). In this embodiment, the color image (RGB) is converted into a color image (HSV), and the hue, saturation, and value are adjusted to detect skin-colored regions. For example, when adjusting on an 8-bit (0-255) scale, setting the ranges to H = 16-40 (reddish to yellowish), S = 20-200, and V = 128-255 makes it possible to detect the regions of the color image where skin color is present.
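A sketch of the HSV skin mask using the ranges quoted above; note that OpenCV stores hue in 0-179 for 8-bit images, so the hue bounds may need rescaling depending on the convention assumed by the text.

```python
import cv2
import numpy as np

def detect_skin(bgr):
    hsv = cv2.cvtColor(bgr, cv2.COLOR_BGR2HSV)
    lower = np.array([16, 20, 128], dtype=np.uint8)   # H, S, V lower bounds
    upper = np.array([40, 200, 255], dtype=np.uint8)  # H, S, V upper bounds
    return cv2.inRange(hsv, lower, upper)  # 255 where skin-colored
```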

Next, in step S234, the protected area detection unit 103 displays a protection frame so that user H can set a protected area on the screen of the personal computer 23 or the portable information terminal 25. FIG. 5 is an example of an uploaded color image 51 displayed on the screen of the portable information terminal 25; the color image 51 shows part of a school classroom. The classroom contains two desks and one shelf; on a desk there is a personal computer with text visible on its screen, and on the shelf there are several books and files. On the classroom wall there are a clock, a placard, and a window. The clock has a dial, and the placard reads "Congratulations on winning".

The protected area detection unit 103 causes the portable information terminal 25 in FIG. 5 to display the message 53, "Enclose in a frame the text you want to keep in the image." When user H clicks the frame 57, the frame 57 appears on the screen. User H changes the size and position of the frame 57, for example setting a frame 57a around the placard and a frame 57b around the bottom shelf. When the frames 57 have been set, user H clicks the Done button 54. The areas set by the frames 57a and 57b are detected as protected areas, and the protected areas are saved in step S235. As explained for S35, the protected areas are overwritten onto the color image from which the personal information has been erased. Even if the clock on the classroom wall in FIG. 5 is not enclosed as a protected area in step S234, it becomes a protected area when step S232 is executed. Although FIG. 5 was described for a still image, protected areas can also be set for video: for example, the frame area specified by user H is held as a template image, and template matching is used to check for areas in each video frame that match the template image.
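The template matching mentioned for video could be sketched as follows; the acceptance threshold of 0.8 and the function name are assumptions made for illustration.

```python
import cv2

def find_protected_frame(frame, template, threshold=0.8):
    # Normalized cross-correlation between the user-specified frame (held as
    # a template) and the current video frame.
    result = cv2.matchTemplate(frame, template, cv2.TM_CCOEFF_NORMED)
    _, max_val, _, max_loc = cv2.minMaxLoc(result)
    if max_val < threshold:
        return None
    h, w = template.shape[:2]
    return (max_loc[0], max_loc[1], w, h)  # matched area as (x, y, w, h)
```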

Steps S231 to S234 may be performed in a different order, and only one of the steps may be executed, for example detecting only face regions. In addition, in step S32 and the like, the region detection unit 110 may detect personal information regions while also checking whether a protected area such as a face region or skin region exists within the detected regions.

<Determination of personal information (characters and numerals)>
The method (S33) by which the personal information determination unit 111 determines whether a region in which personal information is likely to exist actually contains personal information will now be described with reference to FIGS. 6 to 8. In this determination method, the case where the personal information consists of characters (including numerals) is described, so the personal information in a region in which personal information is likely to exist is called an estimated character.

First, the personal information determination unit 111 determines whether the ratio of the area where the rotated estimated character overlaps the original estimated character to the area of the original estimated character is greater than a threshold k1 (S331). If the area ratio is greater than the threshold k1, the candidate is judged to be a figure or the like (not a character) and the process proceeds to step S336; if the area ratio is smaller than the threshold k1, it is judged to be a character and the process proceeds to step S332.

FIG. 7(A) shows examples of rotating estimated characters. In FIG. 7(A), the letters "C", "O", and "Z" and the figures of a perfect circle, a square, and an equilateral triangle are used as example estimated characters. The table in FIG. 7(A) shows, from the left, the character at 0° (not rotated), rotated 90°, the superposition of the 0° and 90° versions, rotated 180°, the superposition of the 0° and 180° versions, rotated 270°, and the superposition of the 0° and 270° versions. The rotation is performed about the centroid of the character or figure region.

In the superposition of 0° and 90°, the overlapping area of "C", "O", and "Z" is 50% or less, whereas the perfect circle, square, and equilateral triangle overlap by 50% or more. The same holds for these characters and figures in the superposition of 0° and 270°.

In the superposition of 0° and 180°, the overlapping area of "O" and "Z" is 100%, and that of "C" is about 80%. The perfect circle and the square overlap by 100%, and the equilateral triangle by about 50%.

In this embodiment, to handle not only the alphabet but multiple languages, the personal information determination unit 111 has a threshold k11 for the 0°/90° superposition, a threshold k12 for the 0°/180° superposition, and a threshold k13 for the 0°/270° superposition, and combines the superpositions at these three rotation angles to determine whether the estimated character is a character. If the personal information determination unit 111 only needs to judge estimated characters of one specific language, it may use, for example, only the 0°/90° threshold k11. The threshold k1 is preferably variable, and preferably variable for each language.

As described above, in the 0°/180° superposition the overlapping area of "C" is about 80% while that of the equilateral triangle is about 50%, so the figure overlaps less than the character, which makes it difficult to judge whether the estimated character is a character. The estimated character may therefore be shifted as well as rotated, as shown in FIG. 7(B). In FIG. 7(B), the letter "C", a perfect circle, and an equilateral triangle are used as example estimated characters. The table in FIG. 7(B) shows, from the left, the character at 0° (not rotated), shifted by a distance S, rotated 180° in the shifted state, and the superposition of the 0° estimated character with the shifted and 180°-rotated estimated character. The rotation is performed about the centroid of the character or figure region. The overlapping area of "C" is about 10%, that of the perfect circle about 70%, and that of the equilateral triangle about 50%. Whether the estimated character is a character can thus be determined by whether the overlap between the 0° estimated character and the estimated character shifted by the distance S and rotated 180° exceeds the threshold k12. The shift-and-rotate-180° technique may also be combined with the 90° and 270° rotations. Although FIG. 7 is explained using the alphabet, the same applies to Arabic numerals and kanji.
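A sketch of the rotation-overlap measurement used in S331, assuming a binary image of the estimated character and OpenCV; the optional shift corresponds to FIG. 7(B), and the concrete thresholds k11-k13 are not specified here.

```python
import cv2

def rotation_overlap_ratio(region, angle, shift=(0, 0)):
    # region: binary image of the estimated character (255 on strokes).
    m = cv2.moments(region, binaryImage=True)
    cx, cy = m["m10"] / m["m00"], m["m01"] / m["m00"]
    rot = cv2.getRotationMatrix2D((cx, cy), angle, 1.0)  # rotate about centroid
    rot[0, 2] += shift[0]
    rot[1, 2] += shift[1]
    h, w = region.shape
    rotated = cv2.warpAffine(region, rot, (w, h), flags=cv2.INTER_NEAREST)
    overlap = cv2.bitwise_and(region, rotated)            # logical AND
    return cv2.countNonZero(overlap) / max(cv2.countNonZero(region), 1)
```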

In step S332, the personal information determination unit 111 calculates the standard deviation of the distance from the center line of the estimated character to its boundary and compares the standard deviation with a threshold k2. If the standard deviation is greater than the threshold k2, the candidate is judged to be a figure or the like and the process proceeds to step S336; if it is smaller than the threshold k2, it is judged to be a character and the process proceeds to step S333. Although the candidate has already been judged to be a character in step S331, step S332 may judge again, by this different technique, whether the estimated character is a character.

Characters have the property that the distances (measured at many points) from the character's center line (skeleton) to its boundary vary little. The standard deviation of the distance from the center line of the estimated character to its boundary is therefore calculated. However, because the standard deviation grows in proportion to the character size, it is preferable to divide by the average distance from the center line to the boundary, that is, to compare against k2 the value standard deviation of the distance / average distance. Dividing by the average distance makes the comparison with the threshold k2 less sensitive to character size. Instead of dividing by the average distance from the center line to the boundary, the threshold k2 may also be applied to the standard deviation of (distance / average distance). It is also preferable that the threshold k2 be adjustable so as to tune the criterion that separates figures from characters.

FIG. 8(A) shows the letter "J" and a trapezoid-like figure resembling J. The distance L1 from the center line (skeleton) 61 of "J" to the boundary 63 is nearly constant, so its standard deviation is small. In FIG. 8(A) the J is in a Gothic typeface, but the standard deviation is also small in other typefaces such as New Century. The distance L2 from the center line (skeleton) 61 of the trapezoidal figure to the boundary 63 varies greatly, so its standard deviation is large. The threshold may be set, for example, to k2 = 0.6.
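A sketch of the S332 measure using a distance transform sampled along the skeleton; cv2.ximgproc.thinning requires the opencv-contrib package, which is an assumption, and the function name is illustrative.

```python
import cv2
import numpy as np

def centerline_distance_score(region):
    # region: binary image of the estimated character (255 on strokes).
    # The distance transform gives each stroke pixel's distance to the nearest
    # boundary; sampling it on the skeleton approximates the center-line-to-
    # boundary distances, and std/mean is the size-normalized spread vs. k2.
    dist = cv2.distanceTransform(region, cv2.DIST_L2, 5)
    skeleton = cv2.ximgproc.thinning(region)
    d = dist[skeleton > 0]
    if d.size == 0:
        return float("inf")
    return float(np.std(d) / np.mean(d))
```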

In step S333, the personal information determination unit 111 calculates the ratio of the area of the estimated character itself to the area of the entire region enclosing the estimated character, and judges it to be a character if the ratio is smaller than a threshold k3. If the ratio is greater than the threshold k3, the candidate is judged to be a figure or the like and the process proceeds to step S336; if it is smaller, it is judged to be a character and the process proceeds to step S334. Although the candidate has already been judged to be a character in steps S331 and S332, step S333 may judge again, by this different technique, whether the estimated character is a character.

FIG. 8(B) is a specific example for step S333, showing the letter "J" and a spoon-shaped figure resembling J. In FIG. 8(B), the ratio of the area 66 of the J character itself to the area 65 of the entire region enclosing J is about 60%, while the ratio of the area 66 of the spoon-shaped figure to the area 65 of the entire region enclosing it is about 80%. The threshold may be set, for example, to k3 = 70%.
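A sketch of the S333 fill-ratio check; approximating the enclosing region by the bounding rectangle is an assumption about how FIG. 8(B) is computed, not something stated in the text.

```python
import cv2

def fill_ratio(contour):
    # Area of the character candidate itself divided by the area of the
    # region that encloses it (bounding rectangle used here as a stand-in).
    area = cv2.contourArea(contour)
    x, y, w, h = cv2.boundingRect(contour)
    return area / float(w * h)   # kept as a character when below k3 (e.g. 0.7)
```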

Steps S331 to S333 may be performed in a different order, and the estimated character may be judged to be a character by only one of these steps. In steps S334 and S335, since the candidate was judged to be a character in step S331, S332, or S333, it is then determined whether that character is personal information.

Returning to FIG. 6, in step S334 it is determined whether the estimated character is larger than a threshold k4 relative to the image of the predetermined size. If it is larger than the threshold k4, it is judged to be a large character and the process proceeds to step S338; if it is smaller, it is judged to be a small character and the process proceeds to step S335. More specifically, when the image size is FHD (1920*1088), it is determined whether the number of pixels of the estimated character in at least the vertical or the horizontal direction is, for example, 5 percent (96*54) of the image size or more; if so, the character is treated as large. The percentage of the image size (threshold k4) is preferably variable. A control for varying the threshold k4 may be displayed on the web screen or the like used to upload the color image so that the user can decide how large a character must be to be kept.

In step S335, it is determined whether the character is drawn with a line width thicker than a threshold k5. If it is thicker than the threshold k5, the character is judged not to be personal information and the process proceeds to step S338; if it is thinner than the threshold k5, it is judged to be a thin-stroke character and the process proceeds to step S337. The threshold k5 is also preferably variable. A control for varying the threshold k5 may be displayed on the web screen or the like used to upload the color image so that the user can decide how thick a character's strokes must be for it to be kept.

FIG. 8(C) is a specific example for step S335, showing the letter "I" with a thick line width and with a thin line width. The area making up the "I" character itself 68 and the area of the three center lines (skeleton) 67 of the "I" are calculated, and it is determined whether their ratio is greater than the threshold k5, for example greater than 10.
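A sketch of the stroke-width check of S335 as an area ratio between the strokes and their skeleton; cv2.ximgproc.thinning requires the opencv-contrib package, which is an assumption, and the function name is illustrative.

```python
import cv2

def stroke_width_ratio(region):
    # region: binary image of the character (255 on strokes). Thick lettering
    # yields a large stroke-area to skeleton-area ratio (compared against k5).
    skeleton = cv2.ximgproc.thinning(region)
    return cv2.countNonZero(region) / max(cv2.countNonZero(skeleton), 1)
```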

In step S336 of FIG. 6, estimated characters judged in step S331, S332, or S333 not to be characters are regarded as figures or the like and are therefore left as part of the color image.

In step S337, the estimated characters were judged to be characters in step S331 or S332 and judged in step S334 or S335 to be neither large characters nor thick-stroke characters, so these characters become targets to be erased from the color image as personal information.

In step S338, the estimated characters were judged to be characters in step S332 or S333 and judged in step S334 or S335 to be large characters or thick-stroke characters, so they are not personal information; for example, a "Graduation Ceremony" banner hung at the school entrance that appears large in the color image, or lettering drawn in thick strokes on a T-shirt, remains in the color image. In this embodiment, large characters or thick-stroke characters are not treated as personal information even if they represent a name, an address, a date of birth, a gender, license plate numerals, or the like.

 FIG. 9 is a color image acquired by the personal information masking device 100 from the personal computer 23. FIGS. 10, 11, 12 and 13 are color images output by the personal information masking device 100 to the personal computer 23.

 The color image shown in FIG. 9 includes the license plates of more than ten cars, characters in eight languages (English, Thai, Arabic, Cyrillic, Japanese, Georgian, Korean and Burmese) drawn on the cars, English characters such as news titles, and the face of a reporter.

 FIG. 10 is an output example of Example 1. Example 1 processes the color image without using steps S31 and S32 of FIG. 2; that is, FIG. 10 is the color image obtained by executing steps S21-S30 and S33-S37 of FIG. 2. In step S23, only S231 (protection of the face area) of FIG. 3 is executed.

 In the color image shown in FIG. 10, the Thai, Japanese, Korean and Cyrillic characters have been erased. However, some of the English (alphabet), Burmese and Arabic characters and some numerals remain. Many white lines on the road also remain, because they were judged to be figures. The name on the reporter's name tag, the characters on the microphone held by the reporter and the broadcasting station name have been erased. The "LIVE NEWS" characters and the news title remain intact. The reporter's face is clear owing to the execution of steps S231 and S36.

 FIG. 11 is an output example of Example 2. In Example 2, steps S24-S30 and S32 of FIG. 2 are not executed; that is, FIG. 11 is the color image obtained by applying step S31 to the grayscale image converted in step S22. In step S23, only S231 (protection of the face area) of FIG. 3 is executed.

 In the color image shown in FIG. 11, the characters and numerals of all license plates, including those of distant cars, have been erased. All eight languages drawn on the cars have also been erased. The name on the reporter's name tag, the characters on the microphone held by the reporter and the broadcasting station name have been erased. The white lines on the road have also been judged to be characters and erased. The "LIVE NEWS" characters and the news title are partly blurred but have not been erased. Although it is hard to see by comparing FIG. 10 and FIG. 11, in the color image obtained by applying step S31 to the grayscale image, some characters, in particular black characters on a red background, may be difficult to erase. The reporter's face is clear owing to the execution of steps S231 and S36.

 FIG. 12 is an output example of Example 3. In Example 3, the entire flowchart of FIG. 2 is executed except for step S32. FIG. 12 is therefore equivalent to a superposition of FIGS. 10 and 11. In step S23, only S231 (protection of the face area) of FIG. 3 is executed.

 In the color image shown in FIG. 12, the characters and numerals of all license plates, including those of distant cars, have been erased, while some of the white lines and parts of the car bodies that were erased in FIG. 11 are preserved. The reporter's face is clear owing to the execution of steps S231 and S36.

 FIG. 13 is an output example of Example 4. In Example 4, as in Example 2, steps S24-S30 and S32 are not executed, and FIG. 13 is the color image obtained by applying step S31 to the grayscale image converted in step S22. In step S23, however, only S234 (setting of a protected area) of FIG. 3 is executed, and the protected area set by user H is the reporter frame in which the reporter appears.

 In the color image shown in FIG. 13, the characters and numerals of all license plates, including those of distant cars, have been erased. All eight languages drawn on the cars have also been erased. On the other hand, the name on the reporter's name tag, the characters on the microphone held by the reporter, and the broadcasting station name and date at the upper left of the reporter frame are kept exactly as in the original color image. Naturally, the reporter's face and fingertips within the reporter frame are also clear in the color image shown in FIG. 13.

 The personal information masking device of the present embodiment handles everything from acquiring the color image to outputting the color image from which the personal information has been erased. However, part of the processing may be carried out by the personal computer 23 or the portable information terminal 25. For example, the personal computer 23 or the portable information terminal 25 may download an application from the cloud server in advance, convert the color image to the predetermined size, detect and save a protected area such as a face area, and then upload the image to the personal information masking device 100. After the personal information has been erased, the color image may be output from the personal information masking device 100, and the protected area may be overwritten by the personal computer 23 or the portable information terminal 25.
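 The following is a rough sketch of that client-side split, assuming OpenCV is available on the personal computer 23 or the portable information terminal 25; the Haar-cascade detector merely stands in for the face-area detection (the embodiment may use deep learning instead), the resize target follows the FHD size used earlier, and none of the names below come from the embodiment.

```python
import cv2

def prepare_upload(path: str, target=(1920, 1088)):
    """Resize the image to the working size and save the face (protected) regions before upload."""
    img = cv2.resize(cv2.imread(path), target)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    faces = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
    ).detectMultiScale(gray, 1.1, 5)
    # Keep copies of the protected regions so they can be written back after masking.
    protected = [(x, y, img[y:y + h, x:x + w].copy()) for (x, y, w, h) in faces]
    return img, protected   # upload img; retain the protected patches locally
```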

 Alternatively, the personal computer 23 or the portable information terminal 25 may carry out all of the software processing of the personal information masking device 100.

23 personal computer
25 portable information terminal
100 personal information masking device
101 color image acquisition unit
102 image size conversion unit
103 protected area detection unit
104 noise removal unit (color)
105 color space conversion unit
106 grayscale conversion unit
107 local histogram equalization unit
108 binary conversion unit
109 noise removal unit (binary)
110 area detection unit
111 personal information determination unit
112 personal information erasing unit
113 protected area overwriting unit
114 finishing unit
115 color image output unit
116 personal information detection unit

Claims (12)

A personal information masking method comprising:
a step of acquiring a color image;
a size conversion step of converting the acquired color image into a first color image of a predetermined size;
a step of converting the first color image into a binary image;
an area detection step of detecting, in the binary image, an area in which personal information may exist;
a determination step of determining the presence or absence of personal information in the area in which the personal information is likely to exist;
an erasing step of, based on the personal information determined in the determination step and the first color image, drawing peripheral pixels of the personal information into the personal information of the first color image to erase the personal information; and
a step of outputting the color image after the erasing step.

The personal information masking method according to claim 1, wherein
the size conversion step converts the acquired color image into a first grayscale image of the predetermined size, and
the area detection step detects, in the first grayscale image, an area in which personal information may exist.

A personal information masking method comprising:
a step of acquiring a color image;
a size conversion step of converting the acquired color image into a first color image and a first grayscale image of a predetermined size;
an area detection step of detecting, in the first grayscale image, an area in which personal information may exist;
a determination step of determining the presence or absence of personal information in the area in which the personal information is likely to exist;
an erasing step of, based on the personal information determined in the determination step and the first color image, drawing peripheral pixels of the personal information into the personal information of the first color image to erase the personal information; and
a step of outputting the color image after the erasing step.

The personal information masking method according to any one of claims 1 to 3, comprising:
a step of detecting a protected area of a person in the first color image;
a step of saving the protected area when the protected area is detected; and
an overwriting step of overwriting the protected area onto the first color image after the erasing step.

The personal information masking method according to any one of claims 1 to 4, comprising:
a step of removing noise from the first color image;
a step of applying a color space conversion to the first color image; and
a conversion step of converting the first color image into a second grayscale image after the noise removal step and the color space conversion step.

The personal information masking method according to claim 5, comprising a step of applying local histogram equalization to the second grayscale image to convert it into a third grayscale image, wherein the area detection step detects, in the second grayscale image or the third grayscale image, an area in which personal information may exist.

The personal information masking method according to any one of claims 1 to 6, wherein the determination step applies at least one of the following filter processes:
(a) a filter that determines the presence or absence of personal information based on whether the logical product of the area in which the personal information is likely to exist and that area rotated by a predetermined angle is larger than a first threshold;
(b) a filter that determines the presence or absence of personal information based on whether the standard deviation of the distance from the thin lines forming the skeleton of the area in which the personal information is likely to exist to the boundary of that area is larger than a second threshold;
(c) a filter that determines the presence or absence of personal information based on whether the area ratio between the area in which the personal information is likely to exist and the area surrounding it is larger than a third threshold;
(d) a filter that determines the presence or absence of personal information based on whether the vertical or horizontal length of the area in which the personal information is likely to exist is larger than a fourth threshold relative to the predetermined size; and
(e) a filter that determines the presence or absence of personal information based on whether the ratio of the area of the thin lines forming the skeleton of the area in which the personal information is likely to exist to the area of that region is larger than a fifth threshold.

The personal information masking method according to any one of claims 1 to 7, comprising a personal information detection step of detecting personal information in the first color image.

The personal information masking method according to claim 4, comprising a finishing step of applying finishing processing with an edge-preserving filter after the overwriting step.

A personal information masking device comprising:
an acquisition unit that acquires a first color image of a predetermined size;
a binary conversion unit that converts the first color image into a binary image;
an area detection step of detecting, in the binary image, an area in which personal information may exist;
a determination unit that determines the presence or absence of personal information in the area in which the personal information is likely to exist;
an erasing unit that, based on the personal information determined by the determination unit and the first color image, draws peripheral pixels of the personal information into the personal information of the first color image to erase the personal information; and
an output unit that outputs the color image after the erasure.

The personal information masking device according to claim 9, wherein the determination unit applies at least one of the following filters:
(a) a filter that determines the presence or absence of personal information based on whether the logical product of the area in which the personal information is likely to exist and that area rotated by a predetermined angle is larger than a first threshold;
(b) a filter that determines the presence or absence of personal information based on whether the standard deviation of the distance from the thin lines forming the skeleton of the area in which the personal information is likely to exist to the boundary of that area is larger than a second threshold;
(c) a filter that determines the presence or absence of personal information based on whether the area ratio between the area in which the personal information is likely to exist and the area surrounding it is larger than a third threshold;
(d) a filter that determines the presence or absence of personal information based on whether the vertical or horizontal length of the area in which the personal information is likely to exist is larger than a fourth threshold relative to the predetermined size; and
(e) a filter that determines the presence or absence of personal information based on whether the ratio of the area of the thin lines forming the skeleton of the area in which the personal information is likely to exist to the area of that region is larger than a fifth threshold.

The personal information masking device according to claim 10 or claim 11, further comprising a protected area detection unit that detects a protected area of a person in the first color image, the protected area detection unit applying at least one of the following functions:
i) a function of detecting a face area using deep learning;
ii) a function of detecting a specific object using deep learning;
iii) a function of detecting a skin area based on skin color in the first color image; and
iv) a function of allowing a protection frame to be set on the first color image and detecting the area within the protection frame.


PCT/JP2022/023832 2021-07-06 2022-06-14 Personal information masking method, and personal information masking device Ceased WO2023281995A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2021112125A JP7156771B1 (en) 2021-07-06 2021-07-06 Personal information masking method and personal information masking device
JP2021-112125 2021-07-06

Publications (1)

Publication Number Publication Date
WO2023281995A1 true WO2023281995A1 (en) 2023-01-12

Family

ID=83688424

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2022/023832 Ceased WO2023281995A1 (en) 2021-07-06 2022-06-14 Personal information masking method, and personal information masking device

Country Status (2)

Country Link
JP (1) JP7156771B1 (en)
WO (1) WO2023281995A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2023128983A (en) * 2022-03-04 2023-09-14 パイオニア株式会社 Image processing device, image processing method, image processing program, and recording medium
KR102737592B1 (en) * 2023-03-27 2024-12-03 홍익대학교 산학협력단 System and method for masking personal information in real estate property image

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000324445A (en) * 1999-05-10 2000-11-24 Nri & Ncc Co Ltd Customer behavior image data recording device and customer behavior image data recording method
US20140369567A1 (en) * 2006-04-04 2014-12-18 Cyclops Technologies, Inc. Authorized Access Using Image Capture and Recognition System
JP2009015478A (en) * 2007-07-03 2009-01-22 Omron Corp License plate reader
JP2018061214A (en) * 2016-10-07 2018-04-12 パナソニックIpマネジメント株式会社 Surveillance video analysis system and surveillance video analysis method

Also Published As

Publication number Publication date
JP2023008507A (en) 2023-01-19
JP7156771B1 (en) 2022-10-19


Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22837413

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22837413

Country of ref document: EP

Kind code of ref document: A1