JPH05143651A

JPH05143651A - Face picture retrieving device

Info

Publication number: JPH05143651A
Application number: JP3300488A
Authority: JP
Inventors: Toru Kameda; 通亀田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1991-11-15
Filing date: 1991-11-15
Publication date: 1993-06-11

Abstract

(57)【要約】【目的】ニューラルネット情報処理機構により入力顔
画像と教師用顔画像との類似度を測定し、電話機の自分
仕様化、入室管理、銀行での容疑者チェック、留守番Ｔ
Ｖインターホン等のＩＤチェックに適用する。【構成】顔画像を入力する為のカメラ又は画像入力装
置と、デジタルイメージに変換する装置と、このデジタ
ルイメージをトリミングする機構と、ニューラルネット
により検索する機能と、ニューラルネットに顔画像を埋
め込む機能と、カメラからの映像を、絶えずニューラル
ネットに送る機構とを有する。【効果】短時間で画像検索が可能である事、検索結果
の非線形の類似度が、アナログ的デジタル値に出力され
るので、計量的評価が可能である。 (57) [Abstract] [Purpose] Neural network information processing mechanism measures the similarity between the input face image and the teacher's face image, and makes the phone a personal specification, entrance control, suspect check at bank, answering machine T
It is applied to the ID check of V intercoms. [Structure] A camera or an image input device for inputting a face image, a device for converting into a digital image, a mechanism for trimming this digital image, a function for searching by a neural network, and a function for embedding a face image in the neural network. And a mechanism for constantly sending the image from the camera to the neural network. [Effect] It is possible to perform an image search in a short time, and the nonlinear similarity of the search result is output to an analog digital value, which enables quantitative evaluation.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、ＩＤチェック、安全
監視、光デスク画像ファイリングシステムなどに適用さ
れる、ニューラルネット情報処理機構を用いて顔画像を
検索する装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a device for retrieving a face image using a neural network information processing mechanism, which is applied to ID check, safety monitoring, optical desk image filing system and the like.

【０００２】[0002]

【従来の技術】従来、公共もしくは不特定、特定多数の
人々が利用する施設、装置において、特定人物の同定チ
ェック、ＩＤチェック等は、磁気カード、ＩＣカード等
の所持でき、移動の可能なＩＤカード形式によって行な
われていた。2. Description of the Related Art Conventionally, in facilities and devices used by a large number of people who are public or unspecified, a person who has a magnetic card, an IC card, or the like can carry an identification check, an ID check, etc. It was done in card format.

【０００３】図４は、従来の電話機による相手先登録の
例である。この例では、登録した相手は、その電話番号
や、名前を、画面２０に文字で表示する事によって、そ
のボタン２１の内容を、列挙し、目的の相手に電話をす
る事が可能となっている。ここに示すものは、各個人
が、その手元においている電話機である。しかし、その
使用者が構内で移動した場合、その移動先の場所で、そ
こにある電話機を、自分用に設定したい場合、通常は出
先場所にて、電話機もしくはＴＶ電話機を、自分仕様に
設定する為、磁気又はＩＣカード化されたＩＤカード２
２を差し替えて、電話機に挿入し、設定していた。FIG. 4 is an example of registration of a destination by a conventional telephone. In this example, the registered person can enumerate the contents of the button 21 by displaying the telephone number or name of the person on the screen 20 and call the intended person. There is. Shown here is a telephone that each individual has at hand. However, when the user moves on the premises, if the user wants to set his / her own telephone at the place where the user moves, normally, at the place where he / she goes, set the telephone or TV phone to his own specifications. Therefore, the magnetic or IC card ID card 2
I replaced 2 and inserted it into the phone and set it.

【０００４】図７（ａ）は、従来の勤務時間管理をする
為のタイムカード機の例である。従来は、各自が持って
いる磁気カードやＩＣカード等のカード２２を、カード
機２８に挿入して、機械的、もしくは、電気的刻印を行
なっていた。FIG. 7A shows an example of a conventional time card machine for managing working hours. Conventionally, a card 22 such as a magnetic card or an IC card owned by each person is inserted into a card machine 28 to perform mechanical or electrical marking.

【０００５】図８（ａ）は、従来の極秘データ管理室へ
の特定人物の入出室時のＩＤチェックの例である。各自
が磁気又はＩＣカード化されたＩＤカード２２を持ち、
これを検査機２９に挿入し、電気的チェックを行なう事
によって判定し入室許可を与えられていた。FIG. 8 (a) shows an example of a conventional ID check when a specific person enters or leaves the confidential data management room. Each person has a magnetic or IC card ID card 22,
This was inserted into the inspection machine 29, and an electrical check was performed to make a determination and permission to enter the room was given.

【０００６】図９（ａ）は門塀３０に設けられたインタ
ーホンの例で、従来は、防犯の為の遠隔ＴＶ監視装置２
１だけであった。また、従来の銀行等の金融機関での監
視業務では、一般的に、警備室等に、各所に配置され
た、監視カメラからの映像出力が、集中的にモニタ画面
に映し出されて、何か、不正が行なわれているか、もし
くは、その危険性があるか絶えず見ている。FIG. 9 (a) shows an example of an intercom installed in a gate fence 30, which is conventionally a remote TV monitoring device 2 for crime prevention.
It was only 1. Further, in the conventional surveillance work in a financial institution such as a bank, in general, video output from surveillance cameras arranged in various places such as a security room is intensively displayed on a monitor screen, and , Is constantly looking at fraud and its potential risks.

【０００７】一方、顔画像の識別には、その顔自体が、
もともと頭を含む３次元的な要素である為、非常に識別
が困難である。そのため、従来の顔画像検索装置は、角
「並行トップダウン処理方式による顔画像の解析」電子
情報通信学会コンピュータビジョン６１ー３、永嶋
「顔の３次元モデルの自動生成法に関する検討」電子情
報通信学会ＯＳ９０−３１，ＩＥ９０−４６，沼崎
「モデルに基ずく顔画像からの人物識別」電子情報通信
学会ＩＥ８７−１３６に示す様に３次元データの処理
となってしまう。On the other hand, to identify a face image, the face itself is
Since it is a three-dimensional element including the head, it is very difficult to identify it. For this reason, the conventional face image retrieval device is a corner "analysis of face image by parallel top-down processing method" The Institute of Electronics, Information and Communication Engineers Computer Vision 61-3, Nagashima "Study on automatic generation method of three-dimensional model of face" Electronic information and communication Society OS 90-31, IE 90-46, Numazaki “Person identification from face image based on model” IEICE IE 87-136 results in processing of three-dimensional data.

【０００８】図１１（ａ）は、従来の３次元モデルに基
ずく顔画像からの人物識別の方法で、予め頭部の３次元
基本モデルを数値データとして作成して置き、入力され
た顔画像に対し、モデルを合わせる様に変形し、さらに
顔の正面方向に対し、方向変形を行ない、これに、さら
に、この変形したモデルと、すでに登録しておいた２次
元画像から、比較を行ない、照合結果で、識別する。FIG. 11 (a) shows a conventional method for identifying a person from a face image based on a three-dimensional model, in which a three-dimensional basic model of the head is created in advance as numerical data and placed, and the input face image is input. On the other hand, the model is deformed so as to match, and further the direction is deformed in the frontal direction of the face. Further, the deformed model and the already registered two-dimensional image are compared, Identify by the matching result.

【０００９】図１１（ｂ）は、従来の臨場感会議通信を
行なう為の顔の３次元モデルの自動生成の検討段階での
顔の特徴点の図で、この場合も同様に頭部の標準３次元
モデルデータを、石膏のレーザスキャンで得て、横顔の
入力画像では曲率で、正面顔では輪郭線から、誘導され
た各特徴点と、モデルの特徴点を合致する様に変形を
し、これで得られた３次元データを用いて、仮想会議空
間に転写している。FIG. 11 (b) is a diagram of facial feature points at the stage of studying automatic generation of a three-dimensional model of a face for performing conventional live-action conference communication. 3D model data was obtained by laser scanning of plaster, and the input image of the side face was curved, and the contour of the front face was deformed so as to match each of the derived characteristic points with the characteristic points of the model. Using the three-dimensional data obtained in this way, it is transferred to the virtual conference space.

【００１０】図１１（ｃ）は、従来の並行述語トップダ
ウン処理による顔画像の解析法の例で、入力した顔画像
に対し、画像信号処理にて抽出された輪郭線を、述語論
理的手段で、予め作っておいた、知識ルールベースに対
し、プロダクション型実行で検索し、パターン照合を行
なう。この処理を、予め登録しているそれぞれの顔知識
データベースに対して、並列処理、並行実行の形式で処
理する。ゴールが得られない各実行に対しては、全く無
駄となるが、本方式では、アルゴリズム切り替えの複雑
な制御が不要である。それ故、高速サーチが、達成され
る。FIG. 11 (c) shows an example of a face image analysis method by the conventional parallel predicate top-down processing. The contour line extracted by the image signal processing is applied to the input face image by the predicate logical means. Then, the knowledge rule base created in advance is searched by the production type execution and the pattern matching is performed. This processing is performed in parallel processing and parallel execution format for each face knowledge database registered in advance. It is completely useless for each execution in which no goal is obtained, but this method does not require complicated control of algorithm switching. Therefore, a fast search is achieved.

【００１１】[0011]

【発明が解決しようとする課題】従来の電話機や、各種
のＩＤチェック機器は、以上の様に、各自の設定仕様に
する為や、ＩＣカード等を用いていた。しかし、移動す
る度にＩＣカードを、持ち歩く必要があり、紛失等の危
険性があり、もし、紛失してしまった場合、その再発行
処理が即時に行なえなく、また、設定仕様の再生も不可
能であるなど、様々な制約があった。これらの危険性
は、何も電話機のみならず、入室ＩＤカード等にもその
危険性があり、種々の問題点があった。As described above, the conventional telephones and various ID checking devices use the IC card or the like in order to have their own setting specifications. However, it is necessary to carry an IC card with you every time you move, and there is a risk of loss. If you lose it, the reissuing process cannot be performed immediately, and the setting specifications cannot be reproduced. There were various restrictions such as possible. These dangers are not only in the telephones but also in the entrance ID cards and the like, and there are various problems.

【００１２】また、顔画像の検索処理は３次元のデータ
処理となるので、複雑であり、識別が困難であるという
問題点があった。この発明はニューラルネット型の情報
処理機能を利用して顔画像検索をし、顔画像の合致によ
るＩＤチェックの機能により、入室管理、電話機の自分
仕様化、容疑者チェック、訪問者識別を可能とする事を
目的とする。Further, since the face image search process is a three-dimensional data process, there is a problem that it is complicated and difficult to identify. The present invention uses a neural network type information processing function to search for a face image, and an ID check function based on matching of face images enables entry control, personalization of a telephone, suspect check, and visitor identification. The purpose is to do.

【００１３】[0013]

【課題を解決するための手段】この発明における顔画像
検索装置は、カメラ及び密着型画像入力装置により入力
した画像をデジタルイメージに変換する機構と、顔画像
部分を抜き出しする機構と、該抜き出した顔画像をニュ
ーラルネット機構の入力層に入力する機構と、顔判定用
の教師顔画像信号をニューラルネットに埋め込む手段
と、誤差逆伝搬型系アルゴリズムにより処理を行なうニ
ューラルネット情報処理機構とを備えたものである。A face image retrieval apparatus according to the present invention is a mechanism for converting an image input by a camera and a contact type image input device into a digital image, a mechanism for extracting a face image portion, and a mechanism for extracting the face image portion. A mechanism for inputting a face image into the input layer of the neural network mechanism, means for embedding a teacher face image signal for face determination in the neural network, and a neural network information processing mechanism for processing by the error backpropagation system algorithm It is a thing.

【００１４】また他の発明における顔画像検索装置は、
請求項１記載の該検索装置において、抜き出し顔画像の
ニューラルネット機構入力手段に、画像デジタルイメー
ジを軸変換する座標軸変換装置と、軸変換された該デジ
タルイメージを情報をそのまま圧縮、変換処理を施して
ニューラルネットに入力する圧縮、変換処理手段とを有
するものである。A face image retrieval apparatus according to another invention is
2. The retrieval apparatus according to claim 1, wherein the neural network mechanism inputting means for the extracted face image has a coordinate axis transformation device for axially transforming the digital image of the image, and the axially transformed digital image is directly compressed and subjected to conversion processing. And a compression / conversion processing means for inputting to the neural network.

【００１５】[0015]

【作用】この発明における、画像検索装置のニューラル
ネット機構入力手段はカメラもしくはイメージスキャナ
装置等から取り込んだ顔画像信号をニューラルネットに
登録、埋め込みをする。またＢＰ型ニューラルネット機
構は、情報比較機能を用いて、ニューラルネット処理の
即時性、非線形性を利用し、アナログ量的なデジタル値
で出力する非線形な類似度測度により、入力信号に多少
の変形を含んでいても、当事者の識別動作が可能にな
る。The neural network mechanism input means of the image retrieval apparatus according to the present invention registers and embeds a face image signal taken in from a camera or an image scanner into a neural network. Further, the BP type neural network mechanism uses the information comparison function to utilize the immediacy and non-linearity of the neural network processing, and slightly deforms the input signal by the non-linear similarity measure output as an analog quantitative digital value. Even if it includes, the identification operation of the party becomes possible.

【００１６】また他の発明における画像検索装置の座標
軸変換手段は、画像デジタルイメージの画像信号を全包
囲角度の座標軸で軸変換し、圧縮、変換処理手段は軸変
換された該デジタルイメージを情報をそのまま圧縮、変
換処理を施してニューラルネット機構に入力する。In another aspect of the invention, the coordinate axis conversion means of the image retrieval apparatus performs the axis conversion of the image signal of the image digital image with the coordinate axes of the entire enclosing angle, and the compression and conversion processing means converts the information of the axially converted digital image. It is directly compressed and converted and input to the neural network mechanism.

【００１７】[0017]

【Example】

実施例１、図１〜図３に、それぞれ、この発明の実施例
を示す。図１は、この発明に用いられる方式の基本的概
念を示すもので、ニューラルネットの処理形態の概念図
である。この図１において、１は入力層のニューロン機
能素に相当し、これはデータ蓄積機能も持ち、これに
は、入力画像のアナログ量に相当するデジタル値を入力
する。この１の入力層からの信号は、シナプスと呼ばれ
る可変の重み係数を含む入力側シナプス伝達系４に、送
られる。そして、この入力側シナプス伝達系４は、それ
ぞれの入力層ニューロン素１から、中間層ニューロン素
２に対し、総当たり的に接続されており、中間層ニュー
ロン素２は、さらにその次の中間層ニューロン素、もし
くは、最終出力層ニューロン素３に接続されている。こ
の中間層ニューロン素２はその層の個数に制限はなく、
要求される入力カテゴリーと、要求される教師要素によ
って、適宜、決定されるものである。Example 1 and FIGS. 1 to 3 show examples of the present invention. FIG. 1 shows the basic concept of the system used in the present invention, and is a conceptual diagram of the processing form of a neural network. In FIG. 1, 1 corresponds to a neuron functional element of the input layer, which also has a data storage function, to which a digital value corresponding to the analog amount of the input image is input. The signal from the 1 input layer is sent to the input side synapse transmission system 4 including a variable weighting coefficient called a synapse. The input-side synapse transmission system 4 is connected from each input-layer neuron element 1 to the intermediate-layer neuron element 2 in a brute force manner, and the intermediate-layer neuron element 2 is further connected to the next intermediate layer. It is connected to the neuron element or the final output layer neuron element 3. This intermediate layer neuron 2 has no limitation on the number of layers,
It is appropriately determined depending on the required input category and the required teacher element.

【００１８】この全体の伝達系４、５は、それぞれ、入
力層ニューロン素１から、出力層ニューロン素３に向け
て、全多重並列的に信号が伝達される。そして、その最
終結果が、埋め込み後の結果として現れる。一方、この
信号伝達系は、教育モードの時は、その模範教師信号１
８と、入力カテゴリー１４からの出力値との誤差で、間
接的に反映して、帰還１５し、重み係数を変化させる。
それゆえ系全体が最急降下的に任意の安定状態に、漸近
的に向かう事となり、最終安定状態で保持される事とな
る。In the entire transmission systems 4 and 5, signals are transmitted from the input layer neuron element 1 to the output layer neuron element 3 in full multiplex parallel fashion. Then, the final result appears as a result after the embedding. On the other hand, this signal transmission system has the model teacher signal 1 in the education mode.
The error between 8 and the output value from the input category 14 is indirectly reflected, and feedback 15 is performed to change the weighting coefficient.
Therefore, the whole system goes asymptotically to an arbitrary stable state, and is held in the final stable state.

【００１９】図２（ａ）は本発明の画像処理の為の概念
図である。６は入力画像信号系を、トランスペアレンシ
ーとして、形式的に表わしている。このトランスペアレ
ンシー６を透過した信号は、９のニューラルネット処理
系で処理される。FIG. 2A is a conceptual diagram for the image processing of the present invention. Reference numeral 6 formally represents the input image signal system as transparency. The signal that has passed through the transparency 6 is processed by the neural network processing system 9.

【００２０】図２（ｂ）は上記の関係を、側面図として
示したもので、トランスペアレンシー６の前から、コヒ
ーレントな照明光８を与え、そのままの画像信号をニュ
ーラルネット９に与える。FIG. 2B shows the above relationship as a side view, in which coherent illumination light 8 is applied from the front of the transparency 6 and the image signal as it is is applied to the neural network 9.

【００２１】図２（ｃ）は、同様に側面図で示したもの
である。先と同様な配置であるが、これで異なるのは、
１０で示された座標軸変換機能装置であって、これは、
便宜的に光学系の凸レンズで表わしている。そして、そ
の配置は、コヒーレント光８でトランスペアレンシー６
を透過した画像信号入力は、該レンズ機能１０の焦点位
置で、第２の散乱型トランスペアレンシー７に入射し、
このトランスペアレンシー７を通した出力は、第１の画
像信号を、全包囲角度の座標軸で、フーリエ変換され
た、信号となっており、これが、入力層ニューロン素１
に入力される。このような変換処理は、入力画像で、目
的の画像要素が、その位置に係わる不適合性がある為、
これを防ぐ為に行なわれるが、これは、何もフーリエ変
換に限定されるべきものでも無く、他の軸変換手法でも
良い。Similarly, FIG. 2C is a side view. The arrangement is similar to the previous one, but the difference is that
A coordinate axis conversion function device shown by 10, which is
For convenience, it is represented by a convex lens of the optical system. The arrangement is such that the coherent light 8 and the transparency 6
The image signal input transmitted through is incident on the second scattering type transparency 7 at the focal position of the lens function 10.
The output through the transparency 7 is a signal obtained by Fourier-transforming the first image signal with the coordinate axes of the entire envelopment angle, which is the input layer neuron element 1
Entered in. In such a conversion process, since the target image element in the input image has incompatibility related to its position,
This is performed to prevent this, but this should not be limited to the Fourier transform, and other axis transform methods may be used.

【００２２】図３は上記の基本原理をもとに、実際のシ
ステムを構築する際の、機能ブロックの構成図である。
１１はＴＶカメラ、１２はイメージリーダ装置、１３は
画像アナログ信号をデジタルイメージに変換する為のＡ
／Ｄ変換器であり、１４はそのイメージデータから顔部
分のみを取り出す画像切り出し装置である。画像切り出
し装置１４によって取り出された顔画像データは、ニュ
ーラルネット９により処理される。信号端子１８は教師
信号用であり、動作モード切り替えＳＷ１９が、教育モ
ード側に接続されていると、教師信号１８とのニューラ
ルネット出力結果との誤差を、画像切り出し装置１４の
逆方向に伝達素４の重み係数を変化させるという動作
で、状態帰還をさせている。１７は学習の達成度をチェ
ックする為の監視用誤差信号、１５は誤差逆伝搬部分、
１６は検索モード時の結果の出力端子である。FIG. 3 is a block diagram of functional blocks when constructing an actual system based on the above basic principle.
11 is a TV camera, 12 is an image reader device, and 13 is A for converting an image analog signal into a digital image.
A D / D converter, and 14 is an image clipping device that extracts only the face portion from the image data. The face image data extracted by the image clipping device 14 is processed by the neural network 9. The signal terminal 18 is for a teacher signal, and when the operation mode switching SW 19 is connected to the education mode side, an error between the teacher signal 18 and the neural network output result is transmitted in the reverse direction of the image cropping device 14. State feedback is performed by the operation of changing the weighting factor of 4. 17 is a monitoring error signal for checking the degree of achievement of learning, 15 is an error back propagation part,
Reference numeral 16 is an output terminal of the result in the search mode.

【００２３】以下、６つの適用例を示す。適用例１、図５（ａ）はＩＣカードを使わずに、設定す
る事が可能な本発明の適用実施例で、まずあらかじめ登
録している相手先は、可能な限り、顔画像を入手し、こ
れを画面上に提示する事によって、ビジュアルに対応可
能としている。この事によって、もっと親密性のあるＴ
Ｖ電話機が実現できる。Six application examples will be shown below. Application example 1 and FIG. 5 (a) are application examples of the present invention that can be set without using an IC card. First, the destination registered in advance obtains face images as much as possible. By presenting this on the screen, it is possible to respond visually. This makes T more intimate
V telephone can be realized.

【００２４】図５（ｂ）は、本発明のＴＶカメラからの
画像によって、自分専用の電話機に、設定する過程を示
した図である。付属のＴＶカメラ２１によって、設定し
ようとする本人２５の顔画像を入れ、この顔画像は、デ
ジタルイメージに変換され、そのイメージ２３は、交換
器２２に送られる。この交換器２２側では、電話機から
送られて来た画像と似たものを検索し、その候補２４
を、電話機側に送り返す。もし、候補が唯一個であれ
ば、返送せずに、そのまま、その候補に相当する一連の
関連する登録されている顔画像データを連続して送り返
してくる。これによって設定され、自分用の設定とな
る。これによって、従来の様にＩＣカードを持参して、
挿入する事なく、出先で自由に自分用に設定出来る。FIG. 5 (b) is a diagram showing a process of setting an image from the TV camera of the present invention in a telephone set for one's own use. The attached TV camera 21 puts a face image of the person 25 to be set, the face image is converted into a digital image, and the image 23 is sent to the exchange 22. On the exchange 22 side, a search is made for a similar image to the image sent from the telephone, and the candidate 24
To the telephone side. If there is only one candidate, the series of related face image data corresponding to the candidate is continuously sent back without being returned. It is set by this, and becomes a setting for oneself. With this, bring your IC card as before,
You can freely set it for yourself on the go without inserting it.

【００２５】適用例２、図６は、銀行等の金融機関での
監視装置に適用している例である。まず、カメラ２１
で、銀行の窓口の画像が、モニタＣＲＴ２６に映される
と集中監視室で監視している。この映像信号は分岐して
取り出され、Ａ／Ｄ変換器１３でＡ／Ｄ変換しデジタル
イメージにされる。このデジタルイメージの中から、画
像切り出し装置１４で必要な顔画像部分を、デジタル的
に抜き出す。この抜き出された顔画像部分をニューラル
ネット情報処理機構９に入力する。２７はその出力を、
ＬＥＤで表示したもので、実際は、ＣＲＴ上で警告表現
すれば良い。１１、１２はニューラルネットに容疑者の
顔を、埋め込む為の顔画像の入力装置である。画像切り
出し装置１４での画像データと、ニューラルネット情報
処理機構９での画像データを比較する事によって、同様
な顔があれば、警告を発するのである。本実施例では、
従来の銀行等の金融機関での監視業務の一部を代行する
機能を持つ。Application example 2, FIG. 6 is an example of application to a monitoring device in a financial institution such as a bank. First, the camera 21
Then, when the image of the window of the bank is displayed on the monitor CRT 26, it is monitored in the central monitoring room. This video signal is branched and taken out, and is A / D converted by the A / D converter 13 to be a digital image. From this digital image, a face image portion required by the image clipping device 14 is digitally extracted. The extracted face image portion is input to the neural network information processing mechanism 9. 27 outputs the output
It is displayed by an LED, and in actuality, a warning may be displayed on the CRT. Reference numerals 11 and 12 denote face image input devices for embedding the suspect's face in the neural network. By comparing the image data in the image clipping device 14 and the image data in the neural network information processing mechanism 9, if there is a similar face, a warning is issued. In this example,
It has the function of acting as a proxy for part of the monitoring work of conventional financial institutions such as banks.

【００２６】まず複数の銀行強盗等の容疑者の顔写真
を、入力装置１１又は１２から入力し、ニューラルネ
ット情報処理機構９に学習させ埋め込ませる。その後、
チェックモードに戻し、監視業務に導入する。この状態
にして置くと、突然、容疑者が銀行を訪れた時、ただち
に、警報をだす。これによって、警備陣はただちに警戒
態勢を敷く事が可能となり、容疑者にその気配があれ
ば、ただちに逮捕可能となる。この早期警戒、監視の可
能であるか否かは、非常に重要で、これによっては、非
常に多くの犯罪を未然に、もしくは、甚大な被害を出さ
ずに防ぐ事が可能となる。First, facial photographs of a plurality of suspects such as bank robbers are input from the input device 11 or 12, and the neural network information processing mechanism 9 learns and embeds them. afterwards,
Return to check mode and introduce it to the monitoring work. If left in this state, when a suspect suddenly visits a bank, he will immediately issue an alarm. This would allow the guards to immediately put themselves in a vigilant position and, if the suspect had the sign, could immediately arrest them. Whether or not this early warning and monitoring is possible is very important, and it is possible to prevent a large number of crimes before they occur or without causing serious damage.

【００２７】適用例３、図９（ａ）は、一般家庭での応
用例である。門３０において、ＴＶカメラ２１付きの門
塀インターホンが設置されている。ここでは、このＴＶ
カメラ付きのインターホンに、留守番機能を持たせる事
を特徴とする。この機能を持つことによって、たとえ、
留守中に人が訪ねて来ても、その人に相当した対応をす
る事が可能となる。例えば、いつもの酒屋さんが、注文
をとりにきても、それと分かれば、予め録音しておいた
伝言を流す事によって、日常雑事が円滑に済み、全く見
知らぬ訪問者が来ても、それに対応する留守番電話的
な、相手の伝言を記録するような、機能を持たせる事が
可能となる。Application example 3, FIG. 9A is an application example in a general household. At the gate 30, a gate fence intercom with a TV camera 21 is installed. Here, this TV
The feature is that the intercom with a camera has an answering machine function. By having this feature, even
Even if a person visits while he / she is away, he / she can respond to the person. For example, even if an ordinary liquor store picks up an order, if you know it, by sending a pre-recorded message, your daily chores will be smooth and you will be able to respond to strangers even if you do not know them. It becomes possible to have a function such as recording an answering machine's message like an answering machine.

【００２８】図９（ｂ）は、カメラ２１とモニタ２６
は、ＴＶインターホンの構成要素で、これに、画像切り
出し装置１４、ニューラルネット情報処理機構９の、デ
ジタルイメージ化装置や、ニューラルネット型、顔画像
検索機能要素を付ける事によって、訪問者識別を行な
う。この事に依って、マイクロＣＰＵによって、録再の
箇所が自由に制御できる録音機を設置し、訪問者個別
の、対応の録再対応が出来る。なお、３１はＣＰＵで制
御可能なテープレコーダ、３２は門塀ＴＶインターホン
のスピーカである。FIG. 9B shows the camera 21 and the monitor 26.
Is a component of the TV intercom, and a visitor identification is performed by attaching a digital image forming device of the image clipping device 14, the neural network information processing mechanism 9, a neural network type, and a face image search function element to the component. .. According to this, a recording device that can freely control the recording / reproducing location by the micro CPU is installed, and the recording / reproducing can be handled individually for each visitor. Incidentally, 31 is a tape recorder controllable by a CPU, and 32 is a speaker of a gate fence TV intercom.

【００２９】適用例４、第７（ｂ）図は、カメラ２１
と、ニューラルネットを用いた勤務時間管理をする為の
タイムレコーダ２８の例である。カメラ２１に映る顔画
像を、ニューラルネットで確認し、打刻する。Application example 4, FIG. 7 (b) shows a camera 21.
And an example of a time recorder 28 for managing working hours using a neural network. The face image displayed on the camera 21 is confirmed by a neural network and stamped.

【００３０】適用例５、図８（ｂ）は、本発明を用いて
入室管理への適用を行なったもので、カメラ２１とニュ
ーラルネットを用いてＩＤチェック機能のみを代行し、
入室許可を与えているものである。この場合も、ＩＤカ
ード等を持ち歩く必要が無い。Application example 5, FIG. 8 (b) is an application to entrance control using the present invention, in which only the ID check function is performed by using the camera 21 and the neural network.
It gives permission to enter the room. Also in this case, it is not necessary to carry around an ID card or the like.

【００３１】適用例６、図１０は、光ディスク画像ファ
イリングシステムで適用した例である。これらは、例え
ば会社においては、社員の個人情報を顔画像とともに蓄
積しており、ＩＤカードの再発行、新入作成、検索した
り等の、社員の人事情報の処理を行なう。構成を詳述す
ると、ＣＲＴ２６は顔画像や種々の情報の表示器で、カ
メラ１１や、イメージリーダ１２を介して、顔画像を光
ディスクドライブ３４に、録画、再生させる。３３はそ
の制御部である。また３５はＬＡＮ回線接続である。そ
して、これらの画像データは、ＬＡＮ回線を通して、ア
クセスする事が可能で、社員が、出張等で、社内関連場
所に出かけたとしても、その遠隔地から、社内コンピュ
ータ網を、介して、本画像システムをアクセスする、こ
の事によって、遠隔地のＴＶ電話型コンピュータ端末３
６から、通常のＩＤカードを使用しなくても、個人同定
が可能となり、またその個人に応じたデータ接触クラス
も、設定されており、アクセス記録とともに、極秘デー
タの漏れ対策も、可能となる。このシステムでの第２の
応用例は、警察等での、容疑者を洗い出しに利用する例
である。ある犯行の現場写真、もしくは、監視用ビデオ
画像を使用して、顔画像を入力する事によって、ただち
に、類似度の高い疑わしい容疑者候補を、取り出す事が
可能となる。Application example 6, FIG. 10 is an example of application in an optical disk image filing system. For example, in a company, personal information of an employee is stored together with a face image, and the personnel information of the employee is processed such as reissuing an ID card, creating a new member, and searching. More specifically, the CRT 26 is a display for displaying a face image and various information, and causes the optical disc drive 34 to record and reproduce the face image via the camera 11 and the image reader 12. Reference numeral 33 is its control unit. Reference numeral 35 is a LAN line connection. Then, these image data can be accessed through a LAN line, and even if an employee goes to a place related to the company on a business trip or the like, the main image can be obtained from the remote place through the company computer network. The system is accessed, and by this, the remote TV-phone type computer terminal 3
From 6, it is possible to identify an individual without using an ordinary ID card, and a data contact class is set according to the individual, and it is possible to prevent leakage of confidential data along with access records. .. A second application example of this system is an example in which a suspect is used to identify a suspect in police or the like. By inputting a face image using a scene photograph of a certain crime or a video image for surveillance, it is possible to immediately extract a suspicious suspect candidate with high similarity.

【００３２】[0032]

【発明の効果】以上のように、この発明によれば、ニュ
ーラルネット情報処理機構によって、短時間で顔画像の
判定が可能となり、また、提示された画像が、教育モー
ドで使用された画像と、全く同一でなくても、その類似
度比較を非線形的に行ない、アナログ量的デジタル値で
報告してくる。また、上記非線形の類似度比較をする
為、顔の一部をマスクした画像であっても、判定が可能
となる柔軟性がある。As described above, according to the present invention, the facial image can be determined in a short time by the neural network information processing mechanism, and the presented image is the same as the image used in the education mode. Even if they are not exactly the same, the similarity comparison is performed non-linearly, and the analog quantitative digital value is reported. In addition, since the above-described nonlinear similarity comparison is performed, there is flexibility that it is possible to make a determination even for an image in which a part of the face is masked.

【００３３】さらに、別な発明においては、画像デジタ
ルイメージを軸変換し、軸変換された該デジタルイメー
ジを情報をそのまま圧縮、変換処理を施してニューラル
ネットに入力しているので、入力画像で、目的の画像要
素が、その位置に係わる不適合性があっても当事者の識
別動作が可能になる。Further, in another invention, the image digital image is axially converted, and the axially converted digital image is directly compressed and converted to be inputted to the neural network. Even if the target image element has incompatibility related to its position, the identification operation of the parties becomes possible.

【００３４】[0034]

[Brief description of drawings]

【図１】ニューラルネットの基本的構成図。FIG. 1 is a basic configuration diagram of a neural network.

【図２】（ａ）は画像処理の為の画像のニューラルネッ
トへの入力形態の３次元表現図。（ｂ）は画像のピクセ
ルそのままの入力形態図。（ｃ）は画像の入力の座標軸
変換した入力形態図。FIG. 2A is a three-dimensional representation of an input form of an image for image processing into a neural network. (B) is an input form diagram in which pixels of an image are directly input. (C) is an input form diagram in which the coordinate axes of the image input are converted.

【図３】ニューラルネットを用いた顔画像の検索の為の
機能ブロック図FIG. 3 is a functional block diagram for searching a face image using a neural network.

【図４】従来の電話機の機能の自分仕様化に設定する従
来の方法図。FIG. 4 is a diagram of a conventional method of setting a function of a conventional telephone set to a personalized specification.

【図５】（ａ）は本発明を用いた顔画像制御によるＴＶ
電話機の自分仕様化の方法図。（ｂ）は本発明を用いた
顔画像制御によるＴＶ電話機の自分仕様化の方法図。FIG. 5A is a TV by face image control using the present invention.
Diagram of how to make your own phone. FIG. 6B is a diagram of a method for personalizing a TV phone by controlling a face image using the present invention.

【図６】銀行の防犯監視に応用した場合の機能回路ブロ
ック図。FIG. 6 is a functional circuit block diagram when applied to crime prevention monitoring of a bank.

【図７】（ａ）は勤務管理の為の従来のタイムレコーダ
斜視図。（ｂ）は勤務管理の為の本発明のタイムレコー
ダ斜視図。FIG. 7A is a perspective view of a conventional time recorder for work management. (B) is a perspective view of a time recorder of the present invention for work management.

【図８】（ａ）は極秘データ室への従来の入出管理装置
の斜視図。（ｂ）は極秘データ室への本発明の入出管理
装置の斜視図。FIG. 8A is a perspective view of a conventional entry / exit management device for a confidential data room. (B) is a perspective view of the access control apparatus of this invention to a confidential data room.

【図９】（ａ）は家庭用の門塀インターホンに応用した
斜視図。（ｂ）は家庭用の玄関インターホンに応用する
例の内部ブロック図FIG. 9A is a perspective view applied to a home gate intercom. (B) is an internal block diagram of an example applied to a home entrance intercom

【図１０】光ディスク画像ファイリングシステムを用い
た、社員ＩＤカード作成、人事データ処理システムブロ
ック図。FIG. 10 is a block diagram of an employee ID card creation and personnel data processing system using an optical disk image filing system.

【図１１】（ａ）は従来のモデルに基ずく顔画像からの
人物識別の方法を示す図。（ｂ）は従来の顔の３次元モ
デルの自動生成法での顔の特徴点部分を示す図。（ｃ）
は従来の並行トップダウン処理による顔画像の解析法を
示す図。FIG. 11A is a diagram showing a conventional method for identifying a person from a face image based on a model. (B) is a figure which shows the feature point part of the face by the conventional automatic generation method of the three-dimensional model of the face. (C)
Is a diagram showing a conventional facial image analysis method by parallel top-down processing.

[Explanation of symbols]

１ニューラルネットの入力層のニューロン素２ニューラルネットの中間層のニューロン素３ニューラルネットの出力層のニューロン素４ニューラルネットの入力側シナプス伝達系５ニューラルネットの出力側シナプス伝達系６直接入力画像のトランスペアレンシー７軸変換画像入力のトランスペアレンシー８コヒーレント照明光９ニューラルネットの３次元表示１０凸レンズもしくは、フーリエ変換器１１ＴＶカメラ１２イメージリーダ装置１３ビデオ信号のＡ／Ｄ変換器１４画像の切り出し装置１５誤差逆伝搬部分１６結果送出部１７学習度評価の為の誤差出力１８教師信号入力１９ニューラルネット処理結果２０ＴＶ電話機の画像表示器２１ＴＶ電話機の付属のカメラ部２２ＴＶ電話機用のＩＣカード２３自分仕様化の為の交換機へ送るＴＶ電話機付属の
カメラからの画像２４交換機から送られて来る目的の仕様に即した設定
用の画像群２５新しい使用者２６防犯監視の為のモニタＴＶＣＲＴ群２７ニューラルネットからの警報出力２８勤務管理の為のタイムレコーダ２９極秘室へのＩＤチェック機３０家庭の玄関３１ＣＰＵで制御可能な録再テープレコーダ３２玄関ＴＶインターホンのスピーカ３３光ディスク画像ファイリングシステムの制御機３４光ディスク画像ファイリングドライブ３５コンピュ−タＬＡＮ回線３６遠隔地のＴＶ電話型コンピュータ端末器1 Neuron element of input layer of neural network 2 Neuron element of intermediate layer of neural network 3 Neuron element of output layer of neural network 4 Synapse transmission system of input side of neural network 5 Synapse transmission system of output side of neural network 6 Direct input image Transparency 7 Transparency of axis conversion image input 8 Coherent illumination light 9 Three-dimensional display of neural network 10 Convex lens or Fourier transformer 11 TV camera 12 Image reader device 13 A / D converter of video signal 14 Image clipping device 15 Error back propagation part 16 Result sending part 17 Error output for learning degree evaluation 18 Teacher signal input 19 Neural net processing result 20 Image display of TV phone 21 Camera part attached to TV phone 22 IC card for TV phone 2 Image sent from the camera attached to the TV phone for personalization of the switch 24 Image of settings sent from the switch according to the intended specifications 25 New user 26 Monitor TVCRT group for crime prevention monitoring 27 Neural Alarm output from the net 28 Time recorder for work management 29 ID check machine to the secret room 30 Home entrance 31 Recording / playback tape recorder 32 controllable by CPU 32 Entrance TV intercom speaker 33 Optical disk image filing system controller 34 Optical disk Image filing drive 35 Computer LAN line 36 TV telephone type computer terminal at remote location

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁵ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０４Ｌ 9/00 9/10 9/12 ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁵ Identification code Internal reference number FI technical display location H04L 9/00 9/10 9/12

Claims

[Claims]

1. A means for converting an image input by a camera or a contact image input device into a digital image, a means for extracting a face image portion, and a means for inputting the extracted face image to an input layer of a neural network mechanism. And a neural network information processing means for performing processing by an error back-propagation type algorithm, and a face image retrieval apparatus.

2. A neural network mechanism inputting means for the extracted face image, a coordinate axis converting device for axially converting the digital image of the image, and the digitally converted image is compressed and converted as it is and input to the neural network. 2. The face image search device according to claim 1, further comprising compression and conversion processing means.