CN201084168Y - A face-identifying device - Google Patents
A face-identifying device Download PDFInfo
- Publication number
- CN201084168Y CN201084168Y CNU2007201733791U CN200720173379U CN201084168Y CN 201084168 Y CN201084168 Y CN 201084168Y CN U2007201733791 U CNU2007201733791 U CN U2007201733791U CN 200720173379 U CN200720173379 U CN 200720173379U CN 201084168 Y CN201084168 Y CN 201084168Y
- Authority
- CN
- China
- Prior art keywords
- ultrasonic
- face
- echo
- face recognition
- recognition device
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 241000282414 Homo sapiens Species 0.000 claims abstract description 19
- 238000012545 processing Methods 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 12
- 238000013528 artificial neural network Methods 0.000 claims description 7
- 238000012549 training Methods 0.000 claims description 6
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 claims description 4
- 238000004148 unit process Methods 0.000 claims description 3
- 230000001815 facial effect Effects 0.000 abstract description 4
- 238000000034 method Methods 0.000 description 25
- 238000003909 pattern recognition Methods 0.000 description 11
- 230000006835 compression Effects 0.000 description 9
- 238000007906 compression Methods 0.000 description 9
- 238000005516 engineering process Methods 0.000 description 8
- 239000000463 material Substances 0.000 description 8
- 239000013598 vector Substances 0.000 description 7
- 238000010586 diagram Methods 0.000 description 5
- 238000010408 sweeping Methods 0.000 description 5
- 238000002592 echocardiography Methods 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 238000005259 measurement Methods 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 230000036544 posture Effects 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 2
- 239000000919 ceramic Substances 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 239000010409 thin film Substances 0.000 description 2
- 238000002604 ultrasonography Methods 0.000 description 2
- 238000013398 bayesian method Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 230000008921 facial expression Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002969 morbid Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 238000012567 pattern recognition method Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Landscapes
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Abstract
本实用新型涉及一种人脸识别装置,包括超声波发声器,用于发射超声扫频信号;超声波接收器,用于接收回波信号;控制处理器,与所述超声波发生器的输入端、所述超声波接收器的输出端和识别结果输出装置的输入端相连接,并与存储超声波人脸信息数据库的存储器互联;所述存储器用来存储超声波人脸信息数据库,并供控制处理器访问、控制。本实用新型的优点包括:可以得到很高的空间分辨率,能够提取丰富的面部信息;能够减小背景对人脸识别的影响;可以将3D模型与人脸区分开;可以克服照片、视频的欺骗;数据量减小,能够提高识别速度;具有较高的识别率;所需的超声波人脸数据库具有数据量小的特点,方便建立大规模超声波人脸数据库。
The utility model relates to a face recognition device, which comprises an ultrasonic sounder for emitting ultrasonic frequency sweep signals; an ultrasonic receiver for receiving echo signals; a control processor connected with the input end of the ultrasonic generator, the The output end of the ultrasonic receiver is connected to the input end of the recognition result output device, and is interconnected with the memory storing the ultrasonic face information database; the memory is used to store the ultrasonic face information database, and is accessed and controlled by the control processor . The utility model has the advantages of: high spatial resolution can be obtained, rich facial information can be extracted; the influence of background on face recognition can be reduced; 3D models can be distinguished from human faces; Deception; the data volume is reduced, which can improve the recognition speed; it has a high recognition rate; the required ultrasonic face database has the characteristics of small data volume, which is convenient for establishing a large-scale ultrasonic face database.
Description
技术领域technical field
本实用新型涉及模式识别技术领域,具体地说,本实用新型特别涉及一种基于模式识别的人脸识别装置。The utility model relates to the technical field of pattern recognition, in particular, the utility model particularly relates to a face recognition device based on pattern recognition.
背景技术Background technique
人脸识别装置是利用人脸的特征信息进行身份鉴别的电子及机械部件所组成的装置。目前常用的人脸识别技术是基于人脸的面部特征,对输入的人脸图像或者视频流,首先判断是否存在人脸,如果存在,则进一步给出每个脸的位置、大小和各个主要面部器官的位置信息。并根据这些信息,进一步提取每个人脸中所蕴含的身份特征,并将其与已知的人脸进行对比,从而识别每个人脸的身份。The face recognition device is a device composed of electronic and mechanical components that use the feature information of the face for identity verification. At present, the commonly used face recognition technology is based on the facial features of the face. For the input face image or video stream, it is first judged whether there is a face. If it exists, the position, size and main faces of each face are further given. The location information of the organ. And based on this information, further extract the identity features contained in each face, and compare them with known faces, so as to identify the identity of each face.
然而有多个因素影响常用的人脸识别技术的性能,例如:背景和头发;人脸在图象平面内的平移、缩放、旋转;人脸在图象平面外的偏转和俯仰;光源位置和强度的变化;年龄的变化;表情的变化;附着物(眼镜、胡须)的影响;照相机的变化。此外,通过照片、视频、3D模型等手段欺骗人脸识别系统引擎的企图,一直是人脸识别系统的弱点。However, there are many factors that affect the performance of commonly used face recognition technology, such as: background and hair; translation, scaling, and rotation of the face in the image plane; deflection and pitch of the face outside the image plane; light source position and Changes in intensity; changes in age; changes in expression; influence of attachments (glasses, beard); changes in camera. In addition, attempts to deceive the face recognition system engine through photos, videos, 3D models, etc. have always been the weakness of the face recognition system.
随着科技的发展,一些新的方法在一定程度上对上面的问题进行了弥补,例如基于近红外的技术在一定程度上弥补了光源位置和强度的变化,然而这种系统需要特殊的红外传感器,进而增加了成本。这种方法采集到的是近红外人脸图片或视频流,同样也存在常用人脸识别技术所面临的其他问题。With the development of science and technology, some new methods have made up for the above problems to a certain extent. For example, the technology based on near-infrared can make up for the changes in the position and intensity of the light source to a certain extent. However, this system requires a special infrared sensor. , thereby increasing the cost. This method collects near-infrared face pictures or video streams, and there are also other problems faced by common face recognition technologies.
为了克服姿态变化、位置变化、表情变化的影响,提出了三维人脸识别系统,这种方法在一定程度上能缓解上述问题带来的压力,然而从单一未知光源条件的图像中恢复3D形状信息和表面反射率是经典的视觉难题,本质上是一个病态的问题,建立三维人脸模型需要采用特殊的装置,例如激光扫描仪等,成本高,在很大程度上限制了这种方法的推广。In order to overcome the influence of posture changes, position changes, and expression changes, a 3D face recognition system is proposed. This method can relieve the pressure caused by the above problems to a certain extent, but recovers 3D shape information from images with a single unknown light source condition. And surface reflectance is a classic visual problem, which is essentially a morbid problem. The establishment of a 3D face model requires the use of special devices, such as laser scanners, etc., and the high cost limits the promotion of this method to a large extent. .
综上所述,现在的人脸识别装置存在着很多困难,因此人们期待一种新的人脸识别方法或手段来解决给目前的难题,为人脸识别注入活力。To sum up, there are many difficulties in current face recognition devices, so people expect a new face recognition method or means to solve the current problems and inject vitality into face recognition.
发明内容Contents of the invention
本实用新型的目的是克服现有技术的不足,将超声波技术与模式识别技术相结合,提出一种正确率高、与背景独立,不会被照片或视频欺骗、能较好的辨别3D模型且数据量小的基于模式识别的超声波人脸识别装置。The purpose of this utility model is to overcome the deficiencies of the prior art, combine the ultrasonic technology with the pattern recognition technology, propose a high correct rate, independent from the background, not be deceived by photos or videos, and can better distinguish 3D models and Ultrasonic face recognition device based on pattern recognition with small amount of data.
为实现上述发明目的,本实用新型提供的人脸识别装置包括:In order to achieve the purpose of the above invention, the face recognition device provided by the utility model includes:
一种超声波人脸识别装置,包括:An ultrasonic face recognition device, comprising:
超声波发声器,用于发射超声扫频信号;Ultrasonic sounder, used for transmitting ultrasonic frequency sweep signal;
超声波接收器,用于接收回波信号;Ultrasonic receiver for receiving echo signals;
控制处理器,与所述超声波发生器的输入端、所述超声波接收器的输出端和识别结果输出装置的输入端相连接,并与存储超声波人脸信息数据库的存储器互联;The control processor is connected to the input end of the ultrasonic generator, the output end of the ultrasonic receiver and the input end of the recognition result output device, and is interconnected with the memory storing the ultrasonic face information database;
所述存储器用来存储超声波人脸信息数据库,并供控制处理器访问、控制。The memory is used to store the ultrasonic face information database, and is accessed and controlled by the control processor.
上述技术方案中,所述控制处理器包括A/D转换器,该A/D转换器与所述超声波接收器的输出端连接,用于对回波信号进行数字采样。In the above technical solution, the control processor includes an A/D converter connected to the output terminal of the ultrasonic receiver for digital sampling of the echo signal.
上述技术方案中,所述控制处理器包括:扫频信号产生单元、超声波人脸信息数据库控制单元和回波处理单元;所述扫频信号生成单元产生驱动所述超声波发声器的扫频信号;所述超声波人脸数据库控制单元包括对存储数据库的存储器的读、写、修改操作;所述回波处理单元对回波信号进行处理。In the above technical solution, the control processor includes: a frequency sweep signal generating unit, an ultrasonic face information database control unit, and an echo processing unit; the frequency sweep signal generating unit generates a frequency sweep signal to drive the ultrasonic sounder; The ultrasonic face database control unit includes operations of reading, writing and modifying the memory storing the database; the echo processing unit processes echo signals.
上述技术方案中,所述回波处理单元包括顺序连接的频域滤波电路、时域截取电路、信号解调电路、特征提取电路和样本训练分类器。In the above technical solution, the echo processing unit includes a frequency domain filter circuit, a time domain intercept circuit, a signal demodulation circuit, a feature extraction circuit and a sample training classifier connected in sequence.
上述技术方案中,所述样本训练分类器是模式分类器或神经网络分类器。In the above technical solution, the sample training classifier is a pattern classifier or a neural network classifier.
上述技术方案中,所述模式分类器可以采用Bias、GMM、HMM或SVM模式分类器。In the above technical solution, the pattern classifier may adopt a Bias, GMM, HMM or SVM pattern classifier.
上述技术方案中,所述神经网络分类器可以采用BP,RBF,SOM网络等神经网络分类器替换。In the above technical solution, the neural network classifier may be replaced by neural network classifiers such as BP, RBF, and SOM networks.
上述技术方案中,所述超声波发声器至少为两个,该至少两个超声波发声器安装在不同位置,使得所述人脸识别装置能够从不同方向向待识别目标发射超声波。In the above technical solution, there are at least two ultrasonic sounders, and the at least two ultrasonic sounders are installed in different positions, so that the face recognition device can emit ultrasonic waves to the target to be recognized from different directions.
与现有技术相比,本实用新型的优点在于:Compared with the prior art, the utility model has the advantages of:
A.本实用新型采用宽带的线性扫频信号,通过信号压缩,可以得到很高的空间分辨率,能够提取丰富的面部信息;A. The utility model adopts a wide-band linear sweep signal, and through signal compression, a very high spatial resolution can be obtained, and rich facial information can be extracted;
B.利用周围反射物与传声器之间的距离一般不同于人脸与传声器之间的距离的特点,运用数字信号处理的方法可以容易的将背景回波同人脸回波分离开,从而减小背景对人脸识别的影响;B. Utilizing the fact that the distance between the surrounding reflector and the microphone is generally different from the distance between the face and the microphone, the background echo can be easily separated from the face echo by using digital signal processing, thereby reducing the background Impact on face recognition;
C.利用超声波对不同材质的反射效应不同特点(因3D模型的材质与人脸的材质不同,使得回波的频率成分不同),可以将3D模型与人脸区分开;C. Using the different characteristics of the reflection effect of ultrasound on different materials (because the material of the 3D model is different from that of the face, the frequency components of the echo are different), the 3D model can be distinguished from the face;
D.利用超声回波是由人脸不同部位反射回波共同组成的特点,可以克服照片、视频的欺骗,从而克服了目前人脸识别技术中存在的一大难题;D. Utilizing the characteristic that the ultrasonic echo is composed of echoes reflected from different parts of the face, it can overcome the deception of photos and videos, thus overcoming a major problem in the current face recognition technology;
E.超声回波被采集之后,经过一系列的简单处理,得到一维数据,从中选取有效部分作为最终识别的特征。与传统的人脸识别中的二维图像数据相比,具有非常明显的数据量小,特征容易提取的优势,因此使的识别速度的提高成为可能(普通120*120的二维图像的数据量为14400bit;本专利中若采用单发单收,数据量约为160bit,若采用单发双收约为240bit,若采用双发双收为约480bit);E. After the ultrasonic echo is collected, one-dimensional data is obtained through a series of simple processing, and the effective part is selected as the final recognition feature. Compared with the two-dimensional image data in traditional face recognition, it has the obvious advantages of small data volume and easy feature extraction, so it is possible to improve the recognition speed (the data volume of a normal 120*120 two-dimensional image It is 14400bit; if single transmission and single reception is adopted in this patent, the amount of data is about 160bit; if single transmission and double reception are adopted, it is about 240bit;
F.本实用新型采用多个超声波发声器和多个超声波接收器从不同角度发射和接收超声波的方式,可以在一定程度上解决人脸表情、姿态等的变化对人脸识别影响的问题,具有较高的识别率;F. The utility model adopts a plurality of ultrasonic sounders and a plurality of ultrasonic receivers to transmit and receive ultrasonic waves from different angles, which can solve the problem of the influence of changes in facial expressions and postures on face recognition to a certain extent, and has the advantages of High recognition rate;
G.实用新型所建立的超声波人脸数据库具有数据量小的特点(如采用单发单收方式,采集100个不同姿态的数据,约为16k bit/人),方便建立大规模超声波人脸数据库。G. The ultrasonic face database established by the utility model has the characteristics of small amount of data (such as adopting a single-shot and single-receive method, collecting data of 100 different postures, about 16k bit/person), which facilitates the establishment of a large-scale ultrasonic face database .
附图说明Description of drawings
图1表示基于模式识别的超声波人脸识别装置的结构框图;Fig. 1 represents the structural block diagram of the ultrasonic face recognition device based on pattern recognition;
图2表示回波处理单元结构示意图;FIG. 2 shows a schematic structural diagram of an echo processing unit;
图3表示本实用新型一实施例的超声波发声器及传感器位置分布示意图;Fig. 3 shows the ultrasonic sounder of an embodiment of the present invention and the schematic diagram of sensor position distribution;
图4表示本实用新型一实施例中经过滤波和截取的回声信号示意图;Fig. 4 shows a schematic diagram of the echo signal filtered and intercepted in an embodiment of the present invention;
图5表示本实用新型一实施例中经信号压缩得到的回波强度与距离关系图;Fig. 5 shows the echo intensity and the distance relationship figure obtained through signal compression in an embodiment of the utility model;
图6表示本实用新型一实施例中经信号压缩得到的回波强度与距离关系图的局部示意图。FIG. 6 shows a partial schematic diagram of the relationship between echo intensity and distance obtained through signal compression in an embodiment of the present invention.
具体实施方式Detailed ways
本实用新型的基本构思是利用超声波对人脸进行识别。这是由于超声波具有识别3D物体的性能,同时,超声波对不同材质的反射效应不同。而人脸有凸凹面,类似于3D,且人脸的皮肤有通性也有异性,通性在于均为人类的皮肤,异性在于不同人之间皮肤的细腻程度、弹性等不同。因此可以利用人脸的类3D性及人脸之间皮肤的不同来区分不同的人脸。同时也可以用人脸的通性将人脸与3D模型区分开来。The basic idea of the utility model is to use ultrasonic waves to recognize human faces. This is because ultrasonic waves have the ability to identify 3D objects, and at the same time, ultrasonic waves have different reflection effects on different materials. The human face has convex and concave surfaces, similar to 3D, and the skin of the human face has both commonality and heterosexuality. The commonality lies in the skin of all human beings, and the heterosexuality lies in the difference in the fineness and elasticity of the skin between different people. Therefore, the 3D-like nature of human faces and the difference in skin between human faces can be used to distinguish different human faces. At the same time, the commonality of the face can also be used to distinguish the face from the 3D model.
超声波传播过程中遇到人脸反射产生回波。此回波携带着人脸的特征信息,其中回波的强度反映反射面的大小,回波相对入射波的时延反映反射面与声源之间的距离。同时回波中还包含人脸皮肤的信息。During the propagation of the ultrasonic wave, it encounters the reflection of the human face to generate echoes. The echo carries the feature information of the human face, where the intensity of the echo reflects the size of the reflecting surface, and the time delay of the echo relative to the incident wave reflects the distance between the reflecting surface and the sound source. At the same time, the echo also contains the information of human face skin.
超声波人脸识别方法与普通超声波材质识别方法的区别是:在超声波材质识别中,主要用一个反射面(材质表面)的反射回波信息,根据材质对不同频率成分的吸收程度区分材质;在超声波人脸识别中,充分利用多个反射面(人脸的不同位置)的反射回波信息,利用各回波的强度、回波之间的先后关系及人脸对不同频率成分的吸收程度区分人脸。The difference between the ultrasonic face recognition method and the ordinary ultrasonic material recognition method is: in the ultrasonic material recognition, the reflected echo information of a reflective surface (material surface) is mainly used to distinguish the material according to the degree of absorption of different frequency components by the material; In face recognition, the reflected echo information of multiple reflective surfaces (different positions of the face) is fully utilized, and the strength of each echo, the sequence relationship between echoes and the degree of absorption of different frequency components by the face are used to distinguish faces. .
下面结合附图和具体实施方式对本实用新型作进一步详细描述:Below in conjunction with accompanying drawing and specific embodiment the utility model is described in further detail:
实施例1Example 1
本实用新型利用超声波遇人脸反射的上述特点及连续扫频信号空间分辨率与频带带宽成反比的关系,借助模式识别的方法,有效的克服了传统人脸识别的诸多问题,实现了人脸识别。本实施例的具体的实施过程包括以下几步:The utility model utilizes the above-mentioned characteristics of the reflection of the ultrasonic wave on the human face and the inverse relationship between the spatial resolution of the continuous frequency sweep signal and the frequency band bandwidth, and by means of the pattern recognition method, effectively overcomes many problems of the traditional face recognition, and realizes the recognition of the human face. identify. The specific implementation process of this embodiment includes the following steps:
如图1所示,制作一个依据本实用新型的基于模式识别的超声波人脸识别装置,包括:超声波发声器1,发射连续扫频超声波信号,超声信号遇障碍物反射,产生回波信号;超声波接收器2,接收回波信号;A/D转换器3,与所述超声波接收器2的输出端连接,对回波信号进行数字采样;控制处理器4,与所述超声波发生器1的输入端、所述A/D转换器3的输出端和一显示器或一发声器9的输入端相连接;所述显示器或发声器9接收到所述控制处理器4的识别结果后通过图像或语音的方式显示出来。所述控制处理器4由扫频信号产生单元、超声波人脸信息数据库控制单元及回波处理单元组成;所述扫频信号产生单元产生驱动所述超声波发声器1的线性扫频信号;所述超声波人脸数据库控制单元包括对所述存储超声波人脸信息数据库的存储器5的读、写、修改操作;所述回波处理单元,对回波信号进行处理,如图2所示,主要包括频域滤波电路、时域截取电路、信号压缩电路、特征提取电路和模式识别电路顺序连接。当所述模式识别电路的结果判断受测对象为一人脸且在超声波人脸数据库中时,所述回波处理单元将识别结果传给显示器或发声器9,所述显示器或发声器9将结果以图像或声音的方式给出。所述模式识别电路使用贝叶斯分类器。所述基本信息7包括姓名、性别、出生年月及面部图像,通过键盘及普通相机录入。As shown in Figure 1, make an ultrasonic face recognition device based on pattern recognition according to the utility model, comprising: an
本实施例中的超声波发声器可以是压电薄膜换能器或由压电薄膜换能器组成阵列发声器,或者是压电陶瓷换能器或压电陶瓷换能器组成的阵列发声器,或者是静电换能器或由静电换能器组成的阵列发声器,或者是超声波发音器。如图3所示,在本实施例中采用两个超声波发声器和两个接收器,并将其分为两组,分别为A组(包括静电换能器1A,传感器2A)和B组(包括静电换能器1B,传感器2B),每一组装置与人脸之间的距离均为1m,其中B组在人脸的正前方,A组在鼻尖的斜上方45度处。考虑到线性扫频信号的空间分辨率、回波的强度及对回波信号的实时处理速度、装置的频响曲线及能耗等因素,超声波发射器1的发射频率为25KHz--100KHz,长度为100ms的扫频信号,发射间隔为150ms。A/D转换器3对回波信号的采样频率是500KHz。The ultrasonic sounder in this embodiment can be a piezoelectric thin film transducer or an array sounder composed of piezoelectric thin film transducers, or an array sounder composed of piezoelectric ceramic transducers or piezoelectric ceramic transducers, Either an electrostatic transducer or an array sounder composed of electrostatic transducers, or an ultrasonic sounder. As shown in Figure 3, in this embodiment, two ultrasonic sounders and two receivers are used, and they are divided into two groups, which are Group A (including electrostatic transducer 1A, sensor 2A) and Group B ( Including electrostatic transducer 1B, sensor 2B), the distance between each group of devices and the face is 1m, where B group is in front of the face, and A group is 45 degrees above the tip of the nose. Considering the spatial resolution of the linear sweep signal, the intensity of the echo, the real-time processing speed of the echo signal, the frequency response curve of the device and the energy consumption, etc., the transmitting frequency of the
以上的各部分电路,若没有特别注明的,均采用本领域技术人员熟知的常规产品或常规电路并采用常规方式连接。All parts of the above circuits, unless otherwise specified, are conventional products or conventional circuits well known to those skilled in the art and are connected in a conventional manner.
由于需要建立超声波数据库并进行模式识别,本实施例的基于模式识别的超声波人脸识别方法,包括如下步骤:Due to the need to set up an ultrasonic database and perform pattern recognition, the ultrasonic face recognition method based on pattern recognition of the present embodiment includes the following steps:
1)采集欲加入到人脸识别数据库中对象的基本信息,包括姓名、性别、出生年月及相应的面部正面图像。1) Collect the basic information of the object to be added to the face recognition database, including name, gender, date of birth and the corresponding frontal face image.
2)对每个测量对象分别测量多组数据,提取特征,建立超声波人脸数据库;2) Measure multiple sets of data for each measurement object, extract features, and establish an ultrasonic face database;
考虑到超声波发声器的指向性及保证回波对同一对象的相对稳定性,首先设定一参考点,此参考点距离传声器的距离均为1m,并且超声波发声器和传声器均指向该参考点。同时为尽可能的减小周围物体对回波的影响,此装置需置于一空旷的房间中,参考点与地面距离1.14m。样本采集的过程中受测者的鼻尖紧贴参考点且目视前方,在整个测量过程中可以处于静止或有细微运动。发射脉冲信号,并记录下回波信号。对每一个测量对象,用水平方向的超声波发声器1B发射超声波和并用水平方向传声器2B记录下100个回波信号,即100个样本。随后用斜方向45度处的超声波发生器1A发射超声波并用斜方向45度处的传声器2A接收记录下100个回波信号。对每一个测量对象共计采集200个样本。对每一个欲加入此超声波人脸识别数据库的对象均用上述方法采集样本。分别对每个样本进行处理,处理的过程如图2所示。Considering the directivity of the ultrasonic sounder and ensuring the relative stability of the echo to the same object, first set a reference point, which is 1m away from the microphone, and both the ultrasonic sounder and the microphone point to the reference point. At the same time, in order to reduce the influence of the surrounding objects on the echo as much as possible, the device should be placed in an open room, and the distance between the reference point and the ground is 1.14m. During the sample collection process, the test subject's nose tip is close to the reference point and looks forward, and may be stationary or have slight movements during the entire measurement process. Send the pulse signal and record the echo signal. For each measurement object, ultrasonic waves are emitted by the ultrasonic generator 1B in the horizontal direction and 100 echo signals, ie, 100 samples, are recorded by the microphone 2B in the horizontal direction. Then, the ultrasonic generator 1A at an oblique direction of 45 degrees was used to emit ultrasonic waves, and the microphone 2A at an oblique direction of 45 degrees was used to receive and
首先,因回波信号的频率范围为25kHz-100kHz,故用20k-110kHz的带通滤波器对接收到的回波信号滤波,随后截取每个回波信号的前100ms的数据,截取后的回波信号如图4所示。First, because the frequency range of the echo signal is 25kHz-100kHz, a 20k-110kHz band-pass filter is used to filter the received echo signal, and then the data of the first 100ms of each echo signal is intercepted, and the intercepted echo signal The wave signal is shown in Figure 4.
其次为信号压缩。因为扫频信号的时长为100ms,使得不同位置处的反射回波混叠在一起,因此从截取后的回波信号中难以直接提取人脸的回波信息,故需要经过信号压缩获得高分辨率的回波距离信息。调频信号压缩可以用匹配滤波器实现,也可以通过将发射信号与回波信号混频并作傅立叶变换实现。本实例选用后一种方法:即先将发射信号与原信号混频,后做快速付利叶变换。图5表示经信号压缩得到的回波强度与距离关系图。图6为图5的局部放大,根据人脸与传声器之间的距离为1m这个已知条件,可以明显的看到人脸的回波出现在1m-1.5m的区域内,并且具有很高的信噪比。The second is signal compression. Because the duration of the frequency sweep signal is 100ms, the reflected echoes at different positions are mixed together, so it is difficult to directly extract the echo information of the face from the intercepted echo signal, so it is necessary to obtain high resolution through signal compression echo distance information. FM signal compression can be realized with a matched filter, or by mixing the transmitted signal with the echo signal and doing Fourier transform. This example chooses the latter method: first mix the transmitted signal with the original signal, and then do fast Fourier transform. Fig. 5 shows the relationship between echo strength and distance obtained through signal compression. Figure 6 is a partial enlargement of Figure 5. According to the known condition that the distance between the face and the microphone is 1m, it can be clearly seen that the echo of the face appears in the area of 1m-1.5m, and has a high SNR.
接下来的过程就是提取特征向量。特征提取方法包括选取时域包络、频域能量、信号经过如离散余弦变换或小波变换后的系数、信号经解调后的时域包络或频域能量或倒谱系数或用现代谱估计得到的系数或小波变换后的系数、压缩后的信号,从上述一系列特征中提取部分或全部特征作为特征向量。在实施例中运用了两方法提取特征。第一种是根据McKerrow(McKerrow,P.J.and Yoong,K.K.Face classification with ultrasonic sensing,Proceedings TOWARDS Autonomous Robotic Systems TAROS-06,September4-6,pp 111-117.)提到的提取15个特征值作为特征向量的方法;第二种为从压缩的回波信号中直接截取人脸的回波数据,将此作为特征向量。The next step is to extract the feature vectors. Feature extraction methods include selecting time-domain envelope, frequency-domain energy, coefficients of signals such as discrete cosine transform or wavelet transform, time-domain envelope or frequency-domain energy or cepstral coefficients after signal demodulation, or using modern spectrum estimation The obtained coefficients or wavelet-transformed coefficients and the compressed signal are extracted from the above-mentioned series of features as part or all of the features as feature vectors. In the embodiment, two methods are used to extract features. The first is to extract 15 eigenvalues as feature vectors according to McKerrow (McKerrow, P.J.and Yoong, K.K.Face classification with ultrasonic sensing, Proceedings TOWARDS Autonomous Robotic Systems TAROS-06, September4-6, pp 111-117.) The second method is to directly intercept the echo data of the face from the compressed echo signal, and use this as a feature vector.
第二种方法的细节如下:首先将噪声1(如图6所示,由串扰噪声及直达声组成)去掉,其次利用门限法查找到人脸回波的起始点,最后根据人脸的平均几何尺寸截取固定长度的数据。(该门限法为通用的门限法。从图六中可以看出,除去噪声1后,剩下的较强的信号为人脸回波信号,故可用门限法查找到人脸回波的起点。本实施例中从起始点开始截取80个点的数据作为特征向量)The details of the second method are as follows: First, the noise 1 (as shown in Figure 6, composed of crosstalk noise and direct sound) is removed; secondly, the threshold method is used to find the starting point of the face echo; finally, according to the average geometry of the face, Dimensions intercept fixed-length data. (This threshold method is a general threshold method. As can be seen from Figure 6, after the
特征向量提取之后的就是选择分类器,本实例选用Bias分类器。为验证其性能,选取数据库中每类样本中的60%来组成训练样本集;剩余的40%作为测试样本集。表1列出了Bias分类器的分类结果情况,并将两种特征提取的方法进行了比较。识别结果如表1所示。After the feature vector is extracted, the classifier is selected. In this example, the Bias classifier is selected. In order to verify its performance, 60% of each type of samples in the database are selected to form a training sample set; the remaining 40% are used as a test sample set. Table 1 lists the classification results of the Bias classifier, and compares the two feature extraction methods. The recognition results are shown in Table 1.
表1Table 1
该表格中“正前方+斜上方45度”指的是先对两组超声波传声器的接收数据进行综合处理再得出识别结果。In this form, "directly in front + 45 degrees obliquely above" refers to comprehensively processing the received data of two sets of ultrasonic microphones before obtaining the recognition result.
从表1中可以看出,利用第二种特征提取方法明显优于McKerrow提出的特征提取方法,并且可以明显看到:将多个传声器提取到的特征加权结合,识别率高于单个传声器的识别率。It can be seen from Table 1 that the second feature extraction method is significantly better than the feature extraction method proposed by McKerrow, and it can be clearly seen that the recognition rate is higher than that of a single microphone by combining the features extracted by multiple microphones. Rate.
3)选取分类器后,进行实际探测;超声波发声器1发射连续扫频超声波信号,并用超声波接收器2接收回波信号;3) After the classifier is selected, the actual detection is carried out; the
4)控制处理器4对回波信号进行预处理(包括对回波信号的频域滤波、时域截取和信号压缩);4) The control processor 4 performs preprocessing on the echo signal (including frequency domain filtering, time domain interception and signal compression on the echo signal);
5)控制处理器4对回波信号提取特征向量;5) the control processor 4 extracts a feature vector to the echo signal;
6)利用模式分类器判断该回波信号是否为人脸回波信号(本实施例中设定一阈值,若超过该阈值则认为是人脸回波信号),若是,则执行7),否则返回步骤3);6) Use the pattern classifier to judge whether the echo signal is a human face echo signal (a threshold is set in this embodiment, if it exceeds the threshold, it is considered to be a human face echo signal), if so, then perform 7), otherwise return step 3);
7)利用Bias模式分类器在已建成的超声人脸信息数据库中进行身份识别对比,若待识别目标存在于数据库中,则执行步骤8),否则返回步骤3);7) Utilize the Bias pattern classifier to carry out identification comparison in the established ultrasonic face information database, if the target to be identified exists in the database, then perform step 8), otherwise return to step 3);
8)以图像或声音的方式在显示器或发声器上给出识别结果。8) Give the recognition result on the display or sounder in the form of image or sound.
9)完成相关处理后返回上述步骤3)开始顺序向下执行。9) After completing the relevant processing, return to the above step 3) and start executing sequentially downwards.
本实施例中虽然采用了Bias模式分类器作为样本训练分类器,但该分类器可以用GMM,HMM,SVM等其它类型的模式分类器替换,也可以用BP,RBF,SOM网络等神经网络分类器替换。另外,本实用新型也可根据需求提供不同类型的识别结果,如鉴别(identification),即判断给定的人脸超声回波属于哪一张人脸;或者验证(verification),即判断给定的人脸超声回波是否属于某个人。Although the Bias pattern classifier is used as a sample training classifier in this embodiment, the classifier can be replaced by other types of pattern classifiers such as GMM, HMM, SVM, etc., or can be classified by neural networks such as BP, RBF, and SOM networks. device replacement. In addition, the utility model can also provide different types of recognition results according to requirements, such as identification (identification), that is, to judge which face a given face ultrasonic echo belongs to; or verification (verification), that is, to judge a given face. Whether the face ultrasound echo belongs to a certain person.
另外,本实施例中的识别方法仅仅是一个举例说明,该方法可以通过不同的硬件或软件实现,并不限于本实施例中的识别装置。如本实用新型也可采用更广义的超声波人脸识别装置,如一种超声波人脸识别装置,该装置包括:超声波发声器,用于发射超声波信号;超声波接收器,用于接收回波信号;特征提取模块,用于对回波信号提取特征向量;超声人脸信息数据库,用于存储注册会员的基本信息以及预先获得的超声回波信号的特征向量;身份识别模块,用于对超声回波信号的特征向量进行比对分析,得出识别结果。In addition, the identification method in this embodiment is only an example, and the method can be implemented by different hardware or software, and is not limited to the identification device in this embodiment. As the utility model also can adopt ultrasonic face recognition device in a broader sense, as a kind of ultrasonic face recognition device, this device comprises: ultrasonic sounder, is used for transmitting ultrasonic signal; Ultrasonic receiver, is used for receiving echo signal; The extraction module is used to extract the eigenvector of the echo signal; the ultrasonic face information database is used to store the basic information of registered members and the eigenvector of the pre-acquired ultrasonic echo signal; the identity recognition module is used to extract the ultrasonic echo signal The eigenvectors are compared and analyzed to obtain the recognition results.
最后所应说明的是,以上实施例仅用以说明本实用新型的技术方案而非限制。尽管参照实施例对本实用新型进行了详细说明,本领域的普通技术人员应当理解,对本实用新型的技术方案进行修改或者等同替换,都不脱离本实用新型技术方案的精神和范围,其均应涵盖在本实用新型的权利要求范围当中。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present utility model rather than limit them. Although the present utility model has been described in detail with reference to the embodiments, those skilled in the art should understand that any modification or equivalent replacement of the technical solution of the present utility model does not depart from the spirit and scope of the technical solution of the present utility model, and all of them should cover In the scope of the claims of the present utility model.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNU2007201733791U CN201084168Y (en) | 2007-09-26 | 2007-09-26 | A face-identifying device |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNU2007201733791U CN201084168Y (en) | 2007-09-26 | 2007-09-26 | A face-identifying device |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN201084168Y true CN201084168Y (en) | 2008-07-09 |
Family
ID=39626640
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNU2007201733791U Expired - Lifetime CN201084168Y (en) | 2007-09-26 | 2007-09-26 | A face-identifying device |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN201084168Y (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107844763A (en) * | 2017-10-27 | 2018-03-27 | 努比亚技术有限公司 | A kind of face identification method, mobile terminal and computer-readable recording medium |
| CN111492373A (en) * | 2017-10-30 | 2020-08-04 | 纽约州州立大学研究基金会 | Systems and methods associated with acoustic echo signature-based user authentication |
| CN111860514A (en) * | 2020-05-21 | 2020-10-30 | 江苏大学 | A multi-category real-time segmentation method of orchard scene based on improved DeepLab |
| WO2022156562A1 (en) * | 2021-01-19 | 2022-07-28 | 腾讯科技(深圳)有限公司 | Object recognition method and apparatus based on ultrasonic echo, and storage medium |
| CN120279951A (en) * | 2025-06-09 | 2025-07-08 | 山东大学 | Facial expression recognition method and system based on sound perception |
-
2007
- 2007-09-26 CN CNU2007201733791U patent/CN201084168Y/en not_active Expired - Lifetime
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107844763A (en) * | 2017-10-27 | 2018-03-27 | 努比亚技术有限公司 | A kind of face identification method, mobile terminal and computer-readable recording medium |
| CN111492373A (en) * | 2017-10-30 | 2020-08-04 | 纽约州州立大学研究基金会 | Systems and methods associated with acoustic echo signature-based user authentication |
| CN111860514A (en) * | 2020-05-21 | 2020-10-30 | 江苏大学 | A multi-category real-time segmentation method of orchard scene based on improved DeepLab |
| WO2022156562A1 (en) * | 2021-01-19 | 2022-07-28 | 腾讯科技(深圳)有限公司 | Object recognition method and apparatus based on ultrasonic echo, and storage medium |
| US12449524B2 (en) | 2021-01-19 | 2025-10-21 | Tencent Technology (Shenzhen) Company Limited | Object recognition method and apparatus based on ultrasonic echoes and storage medium |
| CN120279951A (en) * | 2025-06-09 | 2025-07-08 | 山东大学 | Facial expression recognition method and system based on sound perception |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101396277A (en) | Ultrasonics face recognition method and device | |
| Hu et al. | Milliear: Millimeter-wave acoustic eavesdropping with unconstrained vocabulary | |
| US12298425B2 (en) | Sound source positioning method and apparatus | |
| US8275622B2 (en) | Ultrasonic doppler sensor for speaker recognition | |
| TWI695182B (en) | System and method for counterfeit detection and activity analysis | |
| CN109581385B (en) | Target localization device and method based on bi-auricular bionic sonar of big-eared bat | |
| Zhang et al. | AmbiEar: mmWave based voice recognition in NLoS scenarios | |
| Roitblat et al. | Sonar recognition of targets embedded in sediment | |
| CN201084168Y (en) | A face-identifying device | |
| US20040165478A1 (en) | Biomimetic sonar system and method | |
| Hu et al. | Towards unconstrained vocabulary eavesdropping with mmwave radar using gan | |
| CN106265004A (en) | Multi-sensor intelligent blind person's guiding method and device | |
| CN101571919A (en) | Face recognition method based on optics nonsubsampled Contourlet conversion | |
| CN114203196B (en) | Audio reconstruction method based on millimeter wave signals | |
| US7821870B2 (en) | Method and system for identifying moving objects using acoustic signals | |
| Zhao et al. | Reconstruction of Lamb wave dispersion curves by sparse representation with continuity constraints | |
| CN104714237A (en) | A multi-feature and multi-directional data fusion method for fish identification | |
| CN114998776A (en) | A statistical device and method for fish target recognition combined with acoustics and images | |
| Wu et al. | A novel multiperson activity recognition algorithm based on point clouds measured by millimeter-wave MIMO radar | |
| Sabatini | A digital-signal-processing technique for ultrasonic signal modeling and classification | |
| CN110444220A (en) | A multi-modal remote speech perception method and device | |
| Radermacher et al. | Humpback whale (Megaptera novaeangliae) breathing sound characteristics from simultaneous above and underwater measurements | |
| CN110661973B (en) | Control method and electronic equipment | |
| Hwang et al. | Bat-g net: Bat-inspired high-resolution 3D image reconstruction using ultrasonic echoes | |
| Saeed et al. | Extracting visual micro-Doppler signatures from human lips motion using UoG radar sensing data for hearing aid applications |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CX01 | Expiry of patent term |
Granted publication date: 20080709 |
|
| CX01 | Expiry of patent term |