[go: up one dir, main page]

WO2016150240A1 - Identity authentication method and apparatus - Google Patents

Identity authentication method and apparatus Download PDF

Info

Publication number
WO2016150240A1
WO2016150240A1 PCT/CN2016/070830 CN2016070830W WO2016150240A1 WO 2016150240 A1 WO2016150240 A1 WO 2016150240A1 CN 2016070830 W CN2016070830 W CN 2016070830W WO 2016150240 A1 WO2016150240 A1 WO 2016150240A1
Authority
WO
WIPO (PCT)
Prior art keywords
face image
image sample
pyramid
feature
pair
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2016/070830
Other languages
French (fr)
Chinese (zh)
Inventor
毛秀萍
朱和贵
张祥德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Techshino Technology Co Ltd
Original Assignee
Beijing Techshino Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Techshino Technology Co Ltd filed Critical Beijing Techshino Technology Co Ltd
Publication of WO2016150240A1 publication Critical patent/WO2016150240A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries

Definitions

  • the present invention relates to digital image processing and pattern recognition, and more particularly to an identity authentication method and apparatus.
  • the second-generation ID card is a valid certificate for the residents of China, and records the basic information of the holder. In most cases, ID card registration is still manual, which is very inconvenient. With the maturity of computer technology, image processing and character recognition algorithms, it has become possible to use computers to automatically identify and enter ID cards. The identity of the person can be determined by comparing the photo on the ID card with the video photo taken on site. However, due to the low pixel and large age span of the second generation photo, the video photos collected on site are subject to interference caused by illumination, posture, expression, glasses, etc., as well as the single sample problem of the ID card, which makes the current based on The face authentication system for ID cards faces many challenges.
  • China's second-generation resident ID card is made by non-contact IC card technology.
  • the machine ID can be obtained by machine reading, and compared with the face photo collected by the on-site camera imaging device to determine whether it is the same person and belongs to face authentication. category.
  • the ID authentication system based on ID card adopts the existing face recognition algorithm. From the perspective of face feature extraction, there are mainly feature subspace method, local feature based method and machine learning based method. All three face authentication methods have the disadvantages of poor anti-interference and low accuracy.
  • the technical problem to be solved by the present invention is to provide an identity authentication method and apparatus with strong anti-interference and high accuracy.
  • the present invention provides the following technical solutions:
  • An authentication method includes:
  • the face image pyramid pair is processed by the face image pyramid algorithm to obtain a pyramid structure of the face image
  • An identity authentication device comprising:
  • the acquisition module is configured to obtain a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image acquired in the field;
  • Sampling module used to process the acquired face image sample pairs by using the face image pyramid algorithm to obtain a pyramid structure of the face image
  • the calculation module is configured to calculate a gray gradient direction of the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature;
  • the judging module is configured to calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belong to the same person.
  • the identity authentication method of the present invention first acquires a pair of face image samples, wherein one face image is an ID card photo, and the other face image is an image acquired in the field; then the face image pyramid algorithm is used.
  • the obtained face image sample pairs are processed to obtain a pyramid structure of the face image.
  • the algorithm uses a multi-resolution method to represent the face image, which can comprehensively describe the face image feature, and avoids adopting only one in the prior art.
  • the resolution of the image indicates that the image features are not comprehensively expressed, which effectively improves the accuracy of the identity authentication of the present invention, and the number of pixels of each layer of the pyramid structure of the face image is continuously reduced from bottom to top.
  • the calculation amount of the invention in the data processing process is reduced, and the identity authentication process of the present invention is made easier to implement; then the grayscale is calculated for the face image of each scale in the face image pyramid structure of the obtained face image sample pair.
  • Gradient direction get the gradient direction pyramid GOP feature, Since the GOP feature of the face image is robust to external factors such as illumination, age and expression, the anti-interference of the invention is enhanced, and the accuracy of the face authentication result is further improved; finally, the obtained face image is obtained.
  • the GOP feature of the sample pair is calculated to determine whether the face image sample pair belongs to the same person.
  • the identity authentication method of the invention effectively avoids interference of factors such as illumination, age and expression, and also significantly improves the accuracy of identity authentication.
  • FIG. 1 is a schematic flowchart 1 of an identity authentication method according to the present invention.
  • FIG. 2 is a schematic flow chart of extracting a face gradient direction pyramid GOP feature according to the present invention.
  • FIG. 3 is a schematic diagram of a hyper-classification interval hyperplane of the SVM classification algorithm of the present invention.
  • FIG. 4 is a schematic flowchart 2 of an identity authentication method according to the present invention.
  • FIG. 5 is a schematic structural diagram 1 of the identity authentication apparatus of the present invention.
  • FIG. 6 is a schematic flowchart 2 of an identity authentication method according to the present invention.
  • FIG. 7 is a schematic diagram of a GOP+SVM identity authentication process according to the present invention.
  • the present invention provides an identity authentication method, as shown in FIG. 1, including:
  • Step S101 Acquire a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image collected in the field;
  • the images collected on the spot can be obtained by intercepting the collected video samples, or by taking pictures on the spot.
  • Step S102 processing the acquired face image sample pair by using a face image pyramid algorithm to obtain a face image pyramid structure
  • the face image pyramid structure is used to represent the face image in multi-scale and multi-resolution.
  • the algorithm can comprehensively describe the face image feature, and the face image pyramid structure is from bottom to top.
  • the number is continuously reduced, reducing the invention in the data processing process The amount of calculation.
  • Step S103 Calculating a grayscale gradient direction on the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature;
  • the present invention preferably uses the GOP feature of the face image sample pair to enhance the anti-interference of the present invention. It also enhances the accuracy of identity authentication.
  • Step S104 Calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belongs to the same person.
  • the identity authentication method of the present invention first acquires a pair of face image samples, wherein one face image is an ID card photo, and the other face image is an image acquired in the field; then the face image pyramid algorithm is used to acquire the face image sample pair.
  • the processing is performed to obtain a pyramid structure of the face image.
  • the method uses a multi-resolution method to represent the face image, and can comprehensively describe the face image feature, thereby avoiding the prior art that only one resolution is used to represent the image.
  • the phenomenon that the image features are not comprehensively expressed effectively improves the accuracy of the identity authentication of the present invention, and the number of pixels of each layer of the face image pyramid structure is reduced from bottom to top, which reduces the data processing of the present invention.
  • the amount of calculation in the process makes the identity authentication process of the present invention easier to implement; nextly, the grayscale gradient direction is calculated for the face image of each scale in the face image pyramid structure of the obtained face image sample pair, and the gradient direction pyramid is obtained.
  • GOP features due to external factors such as illumination, age, and expression of GOP features of face images
  • the robustness enhances the anti-interference of the present invention, and further improves the accuracy of the face authentication result.
  • it is calculated whether the face image sample pair is the same. personal.
  • the identity authentication method of the invention effectively avoids interference of factors such as illumination, age and expression, and also significantly improves the accuracy of identity authentication.
  • the algorithm in the above step S103 may specifically be: a gray gradient direction pyramid GOP feature for calculating a pair of face image samples, and the specific calculation process of the GOP feature may be referred to as follows:
  • is an artificially set threshold such that the gradient direction at which the gradation is almost constant is zero, so that the gradient direction pyramid of the original two-dimensional image I(x, y) is defined as all pixel points at each scale Cascading of gradient directions, expressed as
  • G(I) is represented as a GOP feature of a face image, which is a d ⁇ 2 matrix, where d represents the sum of the number of images on a face pyramid.
  • the GOP feature extraction process in the present invention may be as shown in FIG. 2, wherein (a) is the original face image I(x, y), and (b) is the face image pyramid P(x, y), (c) represents a gradient direction pyramid, and (d) represents a GOP feature G(I).
  • step S102 may further be: obtaining a face image pyramid structure by using Gaussian kernel convolution smoothing and downsampling on the acquired face image sample pairs.
  • the face image pyramid is a structure for interpreting a face image with multiple resolutions.
  • the pyramid structure of an image is a collection of images whose resolution is gradually reduced in a pyramid shape.
  • the pyramid is an image representation that combines Gaussian kernel convolution smoothing and downsampling operations.
  • the bottom of the pyramid is the original high resolution representation of the image to be processed, while the top is a low resolution approximation. As you move toward the upper level of the pyramid, the size and resolution are reduced, resulting in images at different scales.
  • the specific calculation process of the face image gold tower is as follows:
  • Equation (2) * represents the convolution operation, s is the index of the scale, ⁇ 2 represents the downsampling, and ⁇ (x, y) represents the Gaussian kernel function of the same size as the image I(x, y), which can be expressed as
  • the image I(x, y) at the scale s-1 is convolved with the Gaussian kernel function to obtain a smoothed image, and then the image under the next scale s is obtained by the downsampling operation, and the image size is at the previous scale.
  • a Gaussian kernel function convolution smoothing and a downsampling operation are preferably used to obtain a face image pyramid.
  • the face image pyramid structure is a multi-scale, multi-resolution representation of the face image, the method can more fully describe the face image feature and enhance the accuracy of the identity authentication in the present invention.
  • step S104 may further be: calculating a feature vector for the GOP feature of the obtained face image sample pair, and using SVM binary classification processing to calculate whether the face image sample pair belongs to the same person.
  • the SVM classification algorithm is preferably used to first calculate the probability that the output label corresponding to the input feature is 1, and compare with a preset threshold.
  • the threshold is greater than the threshold, and the two faces are from the same person. Conversely, assigning to -1 means that it is from a different person.
  • the SVM needs to find a hyperplane with the largest interval to separate the two types of samples. As shown in Figure 3, the maximum spacing ensures that the hyperplane has the best generalization ability.
  • the SVM feature classification algorithm used in the present invention converts the actual problem into a high-dimensional feature space through a nonlinear transformation, and constructs a linear discriminant function in a high-dimensional space.
  • the nonlinear discriminant function in the original space is realized, and the SVM algorithm can obtain the optimal solution for a limited sample without multi-sample acquisition, which is suitable for the single photo case of the ID card in the invention, further improving the identity authentication of the present invention.
  • the specific algorithm of SVM classification can refer to the following process:
  • SVC Support Vector Classifier
  • the interval is two classes. The size of the space between them or the degree of separation determined by the hyperplane. Geometrically, the interval corresponds to the shortest distance from the data point to the hyperplane, and zeros w and b represent the weight vector and the optimum, respectively.
  • Hyperplane offset the corresponding hyperplane can be defined as
  • SVC is to find the optimal hyperplane parameter values w and b to maximize the classification interval between the two classes. Therefore, SVC is also called the maximum interval classifier. Fix the interval to 1, then for a given training set Have
  • the classification interval can be expressed as
  • the optimal weight vector w * in (13) can be calculated.
  • the soft interval is to extend the SVC algorithm so that the hyperplane allows a small amount of noise to exist.
  • Parameter C is used to balance the complexity and fault tolerance of the classifier. It is a regularization parameter selected by the user. Using the same Lagrangian multiplier method as before, the optimal weight w * and the optimal offset b * can be determined.
  • the kernel function transforms data from a low-dimensional nonlinear space to a high-dimensional space, thereby making the data
  • the high dimensional feature space is linearly separable, and the kernel function is defined as follows:
  • the RBF core is preferred.
  • Training sample set The model parameters A, B are determined by maximum likelihood estimation. To test sample x, first obtain f from the trained SVM model, and then obtain the probability that the category label is 1 by using the probability model, compare the probability value with a preset threshold, and finally determine the category label. The selection of the threshold needs to be determined by the verification set. In a specific method, the class probability of each pair of verification samples in the verification set can be sorted from small to large, and sequentially used as a threshold, and the false positive rate and the authentication rate corresponding to each threshold are obtained, according to the prior The required false positive rate determines the optimal threshold.
  • a random forest method in addition to the SVM classification, a random forest method can also be adopted.
  • the so-called random forest is a classifier that contains multiple decision trees.
  • the so-called randomness means that the generation of each decision tree is random (the training samples of each tree are randomly selected from the original samples).
  • the set of attributes used in the decision tree construction process is also selected by random selection of equal probability, that is, random selection of training samples and random selection of splitting attributes.
  • each decision tree in the forest judges the input samples separately, obtains the respective decision results, and then summarizes the decision results to the forest output. For classification problems, simply count all tree-to-category votes, select the one with the highest number of votes, and assign the input samples to that category.
  • step S104 may further be: performing a point multiplication operation on the GOP features of the obtained face image sample pairs, obtaining a pair of cosine similarity vectors as feature vectors, and using SVM two-class processing to calculate and judge Whether the face image sample pair belongs to the same person.
  • Face authentication can be regarded as a two-category problem. Given a pair of face images I 1 (x, y) and I 2 (x, y), which represent ID photos and live video photos, our purpose is It is judged whether I 1 and I 2 are from the same person, and if so, the judgment result is 1, and if not, it is judged as -1, therefore, it is necessary to map I 1 and I 2 to the feature space.
  • I ⁇ I ⁇ R d is defined as follows:
  • . ⁇ denotes the point multiplication operation of the matrix, and the cosine similarity between the GOP features of the pair of face images calculated by the equation is point by point, and the result is a d-dimensional column vector as the feature classification process. input of.
  • the cosine similarity vector of the gradient direction pyramid is used as the feature vector, which can comprehensively realize the feature extraction of the face image, and the cosine similarity vector is robust to external factors such as illumination, expression and age, and further strengthens The accuracy and anti-interference of the present invention.
  • the method may include:
  • Step S1021 performing face detection processing on the face image sample pair
  • Step S1022 Perform feature point setting on the face image sample pair after the face detection processing Bit processing
  • Step S1023 Normalize the face image sample pairs subjected to the feature point positioning process.
  • face detection is the primary part of face analysis, and the problem is to confirm whether there is a face image in the image.
  • Adaboost face detection algorithm is preferably adopted, and Adaboost algorithm is a classifier algorithm.
  • the basic idea is to use a large number of simple classifiers with general classification ability to superimpose them by a certain method to form a strong classifier with strong classification ability, and then connect several strong classifiers into a classifier to complete image search and detection.
  • the detector has a fast detection speed and a short development cycle, which is feasible.
  • the feature point location may adopt various methods known to those skilled in the art, such as a method based on geometric features, based on template matching. Methods, model-based methods, etc.
  • a method based on gradation characteristics and geometric features is preferably adopted, which can realize feature point localization of a face image simply and effectively.
  • the eyes are aligned to the same position set in advance by image registration, and normalization is achieved.
  • the face image is normalized.
  • the processing may also employ various methods known to those skilled in the art, such as maximum-minimum normalization, Fourier transform-based face image normalization, and the like.
  • the present invention further provides an identity authentication device, as shown in FIG. 5, including:
  • the obtaining module 11 is configured to obtain a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image collected in the field;
  • the sampling module 12 is configured to process the acquired face image sample pair by using a face image pyramid algorithm to obtain a face image pyramid structure
  • the calculating module 13 is configured to calculate a grayscale gradient direction for the face image of each scale in the pyramid image structure of the obtained face image sample pair, to obtain a gradient direction pyramid GOP feature;
  • the determining module 14 is configured to calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belongs to the same person.
  • the identity authentication apparatus of the present invention first acquires a pair of face image samples by the module 11;
  • the post-sampling module 12 processes the acquired face image sample pairs by using the face image pyramid algorithm to obtain a pyramid structure of the face image.
  • the method uses a multi-resolution method to represent the face image, and can comprehensively describe the face image.
  • the feature effectively improves the accuracy of the identity authentication of the present invention; then the calculation module 13 calculates the grayscale gradient direction of the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain the gradient direction.
  • the pyramid GOP feature enhances the anti-interference of the present invention, and further improves the accuracy of the face authentication result.
  • the determining module 14 calculates, according to the obtained GOP feature of the face image sample pair, whether the face image sample pair is the same. personal.
  • the identity authentication method of the invention effectively avoids interference of factors such as illumination, age and expression, and also significantly improves the accuracy of identity authentication.
  • the sampling module 12 can be further configured to obtain a face image pyramid structure by using Gaussian kernel convolution smoothing and downsampling on the acquired face image sample pairs.
  • the present invention it is preferable to use a Gaussian kernel function convolution smoothing and downsampling operation to the face image pyramid.
  • the structure performs multi-scale and multi-resolution representation of the face image, and the method can describe the face image feature more comprehensively, and enhances the accuracy of the identity authentication in the present invention. It is preferably employed in the present invention.
  • the determining module 14 is further configured to calculate a feature vector for the GOP feature of the obtained face image sample pair, and use the SVM binary classification process to calculate whether the face image sample pair belongs to the same person.
  • the SVM feature classification algorithm adopted by the judging module 14 converts the actual problem into a high-dimensional feature space by nonlinear transformation, and constructs a linear discriminant function in a high-dimensional space to realize The nonlinear discriminant function in the original space, and the SVM algorithm can obtain the optimal solution for a limited sample, without multi-sample acquisition, and is suitable for the single photo case of the ID card in the present invention, further improving the identity authentication of the present invention. accuracy.
  • the determining module 14 may be further configured to perform a point multiplication operation on the GOP features of the obtained face image sample pairs, and obtain a pair of cosine similarity vectors as feature vectors, and use SVM two-class processing to calculate and judge Whether the face image sample pair is the same personal.
  • the cosine similarity vector of the gradient direction pyramid is used as the feature vector, which can comprehensively realize the feature extraction of the face image, and the cosine similarity vector is robust to external factors such as illumination, expression and age, and further strengthens.
  • the accuracy and anti-interference of the present invention is used as the feature vector, which can comprehensively realize the feature extraction of the face image, and the cosine similarity vector is robust to external factors such as illumination, expression and age, and further strengthens.
  • the acquisition module 11 and the sampling module 12 are further connected with:
  • the detecting module 121 is configured to perform face detection processing on the face image sample pair;
  • the positioning module 122 is configured to perform feature point positioning processing on the face image sample pair subjected to the face detection processing;
  • the normalization module 123 is configured to perform normalization processing on the face image sample pairs subjected to the feature point positioning processing.
  • the problem that the detection module 121 processes is to confirm whether there is a face image in the image, so as to prevent the collected non-face image from entering the device, resulting in a waste of device computing resources.
  • the positioning module 122 adopts feature point location processing on the acquired face image sample pair, thereby avoiding the influence of posture and location information on identity authentication.
  • the normalization module 123 normalizes the face image sample pairs processed by the feature point positioning to enhance the consistency of the face image sample pairs, thereby avoiding the influence of factors such as illumination, direction and noise, and improving the invention. Anti-interference.
  • Step 1 Obtain a pair of face image samples, wherein one face image sample is taken from the ID database, and another face image sample is taken from the live collection video library;
  • Step 2 adopting feature point location processing on the acquired face image sample pair
  • Step 3 normalizing the face image sample pairs after the feature point positioning processing
  • Step 4 obtaining a face image pyramid structure by normalizing the face image sample pair by Gaussian kernel convolution smoothing and downsampling;
  • Step 5 Calculate the gradient direction of the face image at each scale in the pyramid structure of the face image, and obtain the gradient direction pyramid GOP feature of the face image.
  • Step 6 Calculate the cosine similarity vector of the gradient direction pyramid for the face image sample pair to obtain the face feature expression.
  • Step 7 Using the SVM classification algorithm to perform class probability prediction on the obtained face image representation
  • Step 8 The obtained class probability prediction is compared with a given threshold (Th);
  • Step 9 Obtain the authentication result, wherein the probability value is greater than Th, then the value is 1, indicating that the face graph sample pair is from the same person, and the probability value is less than Th, then the value is -1, indicating that the face graph sample pair is different. People.
  • the face gradient direction pyramid is used to obtain the feature expression of the face, and the gradient information of the face at different scales is utilized, and only the gradient direction is discarded, and the gradient amplitude is discarded.
  • the method can catch people more.
  • the essential information of the face is robust to factors such as illumination, expression and age.
  • the algorithm directly solves the ID card authentication problem from the perspective of face authentication, avoids the problem caused by the single sample of the ID card photo, and the test of the second generation ID card photo of 229 volunteers and a video photo taken on site. In the library test, when the system's error acceptance rate is 1%, 10%, the corresponding correct acceptance rates are 82.97% and 99.13%, respectively.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Collating Specific Patterns (AREA)

Abstract

The present invention relates to the field of digital image processing and mode recognition. Disclosed are an identity authentication method and apparatus. The method comprises: acquiring a face image sample pair, wherein one face image is an identity card picture and the other face image is an image collected on site; processing the acquired face image sample pair by using a face image pyramid algorithm, so as to obtain a face image pyramid structure; calculating the gray scale and the gradient orientation of a face image of each size in the obtained face image pyramid structure of the face image sample pair, so as to obtain a gradient orientation pyramid (GOP) characteristic; and performing calculation according to the obtained GOP characteristic of the face image sample pair, so as to determine whether the face image sample pair belongs to a same person. The present invention effectively avoids interferences from factors such as illumination, an age and an expression, and also remarkably improves the accuracy of identity authentication.

Description

身份认证方法和装置Identity authentication method and device 技术领域Technical field

本发明涉及数字图像处理与模式识别,特别是指一种身份认证方法和装置。The present invention relates to digital image processing and pattern recognition, and more particularly to an identity authentication method and apparatus.

背景技术Background technique

二代身份证是我国居民的有效证件,记载了持证人的基本信息。在目前大多数情况下,身份证登记仍然采用人工方式,很不方便。随着计算机技术、图像处理和字符识别算法的成熟,利用计算机进行身份证自动识别录入已经成为可能。通过将身份证上的照片和现场采集的视频照片进行比对,可以判定人员身份。但由于二代证相片本身存在的低像素、年龄跨度大等问题,现场采集的视频照片受到光照、姿态、表情、眼镜等遮挡物的干扰问题,以及身份证相片单样本问题,使得目前的基于身份证的人脸认证系统面临诸多挑战。The second-generation ID card is a valid certificate for the residents of China, and records the basic information of the holder. In most cases, ID card registration is still manual, which is very inconvenient. With the maturity of computer technology, image processing and character recognition algorithms, it has become possible to use computers to automatically identify and enter ID cards. The identity of the person can be determined by comparing the photo on the ID card with the video photo taken on site. However, due to the low pixel and large age span of the second generation photo, the video photos collected on site are subject to interference caused by illumination, posture, expression, glasses, etc., as well as the single sample problem of the ID card, which makes the current based on The face authentication system for ID cards faces many challenges.

我国第二代居民身份证采用非接触式IC卡技术制作,通过机读可以获取身份证照片,与现场的摄像机成像设备采集的人脸照片进行比对,判断是否是同一个人,属于人脸认证范畴。目前,基于身份证的人脸认证系统采用的都是已有的人脸识别算法,从人脸特征提取的角度,主要有特征子空间法、基于局部特征的方法和基于机器学习的方法。这三种人脸认证方法都存在抗干扰性差、准确性低的缺点。China's second-generation resident ID card is made by non-contact IC card technology. The machine ID can be obtained by machine reading, and compared with the face photo collected by the on-site camera imaging device to determine whether it is the same person and belongs to face authentication. category. At present, the ID authentication system based on ID card adopts the existing face recognition algorithm. From the perspective of face feature extraction, there are mainly feature subspace method, local feature based method and machine learning based method. All three face authentication methods have the disadvantages of poor anti-interference and low accuracy.

发明内容Summary of the invention

本发明要解决的技术问题是提供一种抗干扰性强、准确性高的身份认证方法和装置。The technical problem to be solved by the present invention is to provide an identity authentication method and apparatus with strong anti-interference and high accuracy.

为解决上述技术问题,本发明提供技术方案如下:In order to solve the above technical problem, the present invention provides the following technical solutions:

一种身份认证方法,包括: An authentication method includes:

获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;Obtaining a pair of face image samples, wherein one of the face images is an ID card photo, and the other face image is an image acquired in the field;

采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构;The face image pyramid pair is processed by the face image pyramid algorithm to obtain a pyramid structure of the face image;

对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征;Calculating a grayscale gradient direction on the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature;

根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。According to the obtained GOP feature of the face image sample pair, it is calculated whether the face image sample pair belongs to the same person.

一种身份认证装置,包括:An identity authentication device comprising:

获取模块:用于获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;The acquisition module is configured to obtain a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image acquired in the field;

采样模块:用于采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构;Sampling module: used to process the acquired face image sample pairs by using the face image pyramid algorithm to obtain a pyramid structure of the face image;

计算模块:用于对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征;The calculation module is configured to calculate a gray gradient direction of the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature;

判断模块:用于根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。The judging module is configured to calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belong to the same person.

本发明具有以下有益效果:The invention has the following beneficial effects:

与现有技术相比,本发明的身份认证方法首先获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;然后采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构,该算法采用多分辨率的方式对人脸图像进行表示,能够全面地描述人脸图像特征,避免了现有技术中只采用一种分辨率对图像进行表示而导致图像特征表达不全面的现象,有效的提高了本发明进行身份认证的准确性,同时人脸图像金字塔结构自下而上每一层的像素数都不断减少,降低了本发明在数据处理过程中的计算量,使本发明的身份认证过程更容易实现;接下来对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征, 由于人脸图像的GOP特征对光照、年龄和表情等外界因素具有鲁棒性,使本发明的抗干扰性增强,同时也进一步提高了人脸认证结果的准确性;最后根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。本发明的身份认证方法有效地避免了光照、年龄和表情等因素的干扰,也显著地提高了身份认证的准确性。Compared with the prior art, the identity authentication method of the present invention first acquires a pair of face image samples, wherein one face image is an ID card photo, and the other face image is an image acquired in the field; then the face image pyramid algorithm is used. The obtained face image sample pairs are processed to obtain a pyramid structure of the face image. The algorithm uses a multi-resolution method to represent the face image, which can comprehensively describe the face image feature, and avoids adopting only one in the prior art. The resolution of the image indicates that the image features are not comprehensively expressed, which effectively improves the accuracy of the identity authentication of the present invention, and the number of pixels of each layer of the pyramid structure of the face image is continuously reduced from bottom to top. The calculation amount of the invention in the data processing process is reduced, and the identity authentication process of the present invention is made easier to implement; then the grayscale is calculated for the face image of each scale in the face image pyramid structure of the obtained face image sample pair. Gradient direction, get the gradient direction pyramid GOP feature, Since the GOP feature of the face image is robust to external factors such as illumination, age and expression, the anti-interference of the invention is enhanced, and the accuracy of the face authentication result is further improved; finally, the obtained face image is obtained. The GOP feature of the sample pair is calculated to determine whether the face image sample pair belongs to the same person. The identity authentication method of the invention effectively avoids interference of factors such as illumination, age and expression, and also significantly improves the accuracy of identity authentication.

附图说明DRAWINGS

图1为本发明的身份认证方法的流程示意图一;1 is a schematic flowchart 1 of an identity authentication method according to the present invention;

图2为本发明的提取人脸梯度方向金字塔GOP特征的流程示意图;2 is a schematic flow chart of extracting a face gradient direction pyramid GOP feature according to the present invention;

图3为本发明的SVM分类算法的最大化分类间隔超平面示意图;3 is a schematic diagram of a hyper-classification interval hyperplane of the SVM classification algorithm of the present invention;

图4为本发明的身份认证方法的流程示意图二;4 is a schematic flowchart 2 of an identity authentication method according to the present invention;

图5为本发明的身份认证装置的结构示意图一;Figure 5 is a schematic structural diagram 1 of the identity authentication apparatus of the present invention;

图6为本发明的身份认证方法的流程示意图二;6 is a schematic flowchart 2 of an identity authentication method according to the present invention;

图7为本发明的GOP+SVM身份认证流程示意图。FIG. 7 is a schematic diagram of a GOP+SVM identity authentication process according to the present invention.

具体实施方式detailed description

为使本发明要解决的技术问题、技术方案和优点更加清楚,下面将结合附图及具体实施例进行详细描述。The technical problems, the technical solutions, and the advantages of the present invention will be more clearly described in the following description.

一方面,本发明提供一种身份认证方法,如图1所示,包括:In one aspect, the present invention provides an identity authentication method, as shown in FIG. 1, including:

步骤S101:获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;Step S101: Acquire a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image collected in the field;

本步骤中,现场采集的图像可以通过对采集的视频样本进行截取获得,也可以通过现场拍照获得。In this step, the images collected on the spot can be obtained by intercepting the collected video samples, or by taking pictures on the spot.

步骤S102:采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构;Step S102: processing the acquired face image sample pair by using a face image pyramid algorithm to obtain a face image pyramid structure;

本步骤中,采用人脸图像金字塔结构对人脸图像进行多尺度、多分辨率的表示,该算法能够全面地描述人脸图像特征,同时人脸图像金字塔结构自下而上每一层的像素数都不断减少,降低了本发明在数据处理过程中 的计算量。In this step, the face image pyramid structure is used to represent the face image in multi-scale and multi-resolution. The algorithm can comprehensively describe the face image feature, and the face image pyramid structure is from bottom to top. The number is continuously reduced, reducing the invention in the data processing process The amount of calculation.

步骤S103:对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征;Step S103: Calculating a grayscale gradient direction on the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature;

本步骤中,由于人脸梯度方向金字塔GOP特征对光照、表情和年龄等因素具有鲁棒性的特性,本发明优选采用计算人脸图像样本对的GOP特征,使本发明的抗干扰性增强,也加强了身份认证的准确性。In this step, since the face gradient direction pyramid GOP feature is robust to factors such as illumination, expression and age, the present invention preferably uses the GOP feature of the face image sample pair to enhance the anti-interference of the present invention. It also enhances the accuracy of identity authentication.

步骤S104:根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。Step S104: Calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belongs to the same person.

本发明的身份认证方法首先获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;然后采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构,该方法采用多分辨率的方式对人脸图像进行表示,能够全面地描述人脸图像特征,避免了现有技术中只采用一种分辨率对图像进行表示而导致图像特征表达不全面的现象,有效的提高了本发明进行身份认证的准确性,同时人脸图像金字塔结构自下而上每一层的像素数都不断减少,降低了本发明在数据处理过程中的计算量,使本发明的身份认证过程更容易实现;接下来对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征,由于人脸图像的GOP特征对光照、年龄和表情等外界因素具有鲁棒性,使本发明的抗干扰性增强,同时也进一步提高了人脸认证结果的准确性;最后根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。本发明的身份认证方法有效地避免了光照、年龄和表情等因素的干扰,同时也显著地提高了身份认证的准确性。The identity authentication method of the present invention first acquires a pair of face image samples, wherein one face image is an ID card photo, and the other face image is an image acquired in the field; then the face image pyramid algorithm is used to acquire the face image sample pair. The processing is performed to obtain a pyramid structure of the face image. The method uses a multi-resolution method to represent the face image, and can comprehensively describe the face image feature, thereby avoiding the prior art that only one resolution is used to represent the image. The phenomenon that the image features are not comprehensively expressed effectively improves the accuracy of the identity authentication of the present invention, and the number of pixels of each layer of the face image pyramid structure is reduced from bottom to top, which reduces the data processing of the present invention. The amount of calculation in the process makes the identity authentication process of the present invention easier to implement; nextly, the grayscale gradient direction is calculated for the face image of each scale in the face image pyramid structure of the obtained face image sample pair, and the gradient direction pyramid is obtained. GOP features, due to external factors such as illumination, age, and expression of GOP features of face images The robustness enhances the anti-interference of the present invention, and further improves the accuracy of the face authentication result. Finally, according to the GOP feature of the obtained face image sample pair, it is calculated whether the face image sample pair is the same. personal. The identity authentication method of the invention effectively avoids interference of factors such as illumination, age and expression, and also significantly improves the accuracy of identity authentication.

上述步骤S103中的算法具体可以是:采用计算人脸图像样本对的灰度梯度方向金字塔GOP特征,GOP特征具体计算过程可以参考如下:The algorithm in the above step S103 may specifically be: a gray gradient direction pyramid GOP feature for calculating a pair of face image samples, and the specific calculation process of the GOP feature may be referred to as follows:

对于一幅二维人脸图像I(x,y),在尺度s下每个像素点(x,y)处的梯度方向定义为 For a two-dimensional face image I(x,y), the gradient direction at each pixel (x,y) at scale s is defined as

Figure PCTCN2016070830-appb-000001
Figure PCTCN2016070830-appb-000001

其中,τ为人为设定的阈值,以使灰度几乎不变处的梯度方向为零,于是,原始二维图像I(x,y)的梯度方向金字塔定义为各个尺度下所有像素点处的梯度方向的级联,表示为Where τ is an artificially set threshold such that the gradient direction at which the gradation is almost constant is zero, so that the gradient direction pyramid of the original two-dimensional image I(x, y) is defined as all pixel points at each scale Cascading of gradient directions, expressed as

Figure PCTCN2016070830-appb-000002
Figure PCTCN2016070830-appb-000002

G(I)就表示为一幅人脸图像的GOP特征,它是一个d×2的矩阵,其中d表示一幅人脸金字塔上像数数目之和。G(I) is represented as a GOP feature of a face image, which is a d×2 matrix, where d represents the sum of the number of images on a face pyramid.

本发明中的GOP特征提取过程可以如图2所示,其中,图(a)为原始人脸图像I(x,y),图(b)表示人脸图像金字塔P(x,y),图(c)表示梯度方向金字塔,图(d)表示GOP特征G(I)。The GOP feature extraction process in the present invention may be as shown in FIG. 2, wherein (a) is the original face image I(x, y), and (b) is the face image pyramid P(x, y), (c) represents a gradient direction pyramid, and (d) represents a GOP feature G(I).

作为本发明的一种改进,步骤S102可以进一步为:对获取的人脸图像样本对采用高斯核卷积平滑和下采样得到人脸图像金字塔结构。As an improvement of the present invention, step S102 may further be: obtaining a face image pyramid structure by using Gaussian kernel convolution smoothing and downsampling on the acquired face image sample pairs.

本步骤中,人脸图像金字塔是以多分辨率来解释人脸图像的一种结构。一幅图像的金字塔结构是一系列以金字塔形状排列的分辨率逐步降低的图像集合。金字塔是结合高斯核卷积平滑和下采样操作的一种图像表示方式。金字塔的底部是待处理图像的原始高分辨率表示,而顶部是低分辨率的近似。当向金字塔的上层移动时,尺寸和分辨率就降低,这样就得到不同尺度下的图像。人脸图像金子塔的具体计算过程如下:In this step, the face image pyramid is a structure for interpreting a face image with multiple resolutions. The pyramid structure of an image is a collection of images whose resolution is gradually reduced in a pyramid shape. The pyramid is an image representation that combines Gaussian kernel convolution smoothing and downsampling operations. The bottom of the pyramid is the original high resolution representation of the image to be processed, while the top is a low resolution approximation. As you move toward the upper level of the pyramid, the size and resolution are reduced, resulting in images at different scales. The specific calculation process of the face image gold tower is as follows:

给定一幅人脸图像I(x,y),定义其人脸图像金字塔为P(x,y),表示如下:Given a face image I(x,y), define its face image pyramid as P(x,y), which is as follows:

Figure PCTCN2016070830-appb-000003
Figure PCTCN2016070830-appb-000003

其中,among them,

I(x,y;0)=I(x,y)I(x,y;0)=I(x,y)

I(x,y;s)=[I(x,y;s-1)*Φ(x,y)]↓2,s=1,2,...SI(x,y;s)=[I(x,y;s-1)*Φ(x,y)]↓2,s=1,2,...S

公式(2)中,*代表卷积操作,s是尺度的索引,↓2表示下采样,Φ(x,y)表示与图像I(x,y)有相同尺寸的高斯核函数,可以表示为In equation (2), * represents the convolution operation, s is the index of the scale, ↓ 2 represents the downsampling, and Φ(x, y) represents the Gaussian kernel function of the same size as the image I(x, y), which can be expressed as

Figure PCTCN2016070830-appb-000004
Figure PCTCN2016070830-appb-000004

在尺度s-1下的图像I(x,y)与高斯核函数卷积,得到平滑后的图像,再通过下采样操作,就得到下一个尺度s下的图像,图像大小为上一个尺度下图像的四分之一,图像分辨率也逐步降低,逐层卷积下采样,就得到了人脸图像金字塔。The image I(x, y) at the scale s-1 is convolved with the Gaussian kernel function to obtain a smoothed image, and then the image under the next scale s is obtained by the downsampling operation, and the image size is at the previous scale. A quarter of the image, the image resolution is also gradually reduced, and the layer-by-layer convolution downsampling, the pyramid of the face image is obtained.

在多数情况下,人脸图像在单一尺度下不容易被察觉的特征在多尺度下很容易被捕捉到,因此,本发明中优选采用高斯核函数卷积平滑和下采样操作得到人脸图像金字塔结构,由于人脸图像金字塔结构是对人脸图像的多尺度、多分辨率的表示,所以该方法能够更全面地描述人脸图像特征,加强了本发明中身份认证的准确性。In most cases, features that are not easily perceived by a face image at a single scale are easily captured at multiple scales. Therefore, in the present invention, a Gaussian kernel function convolution smoothing and a downsampling operation are preferably used to obtain a face image pyramid. Structure, since the face image pyramid structure is a multi-scale, multi-resolution representation of the face image, the method can more fully describe the face image feature and enhance the accuracy of the identity authentication in the present invention.

作为本发明的一种改进,步骤S104可以进一步为:对得到的人脸图像样本对的GOP特征计算特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。As an improvement of the present invention, step S104 may further be: calculating a feature vector for the GOP feature of the obtained face image sample pair, and using SVM binary classification processing to calculate whether the face image sample pair belongs to the same person.

本步骤中,优选采用SVM分类算法,首先计算得到输入特征对应的输出标签为1的概率,跟预先设定的阈值比较,大于该阈值赋标签为1,代表两张人脸图像来自于同一个人,反之,赋为-1,则表示来自于不同人。对于两类线性学习任务,SVM要找到一个间隔最大的超平面将两类样本分开,如图3所示,最大间隔能确保该超平面具有最好的泛化能力。而对于本发明的身份认证过程中存在非线性的学习任务,本发明中采用的SVM特征分类算法将实际问题通过非线性变换转换到高维的特征空间,在高维空间中构造线性判别函数来实现原空间中的非线性判别函数,同时SVM算法能够针对有限的样本得到最优解,无需多样本采集,适用于本发明中的身份证照片样本单一的情况,进一步提高了本发明的身份认证的准确性。SVM分类的具体算法可以参考如下过程:In this step, the SVM classification algorithm is preferably used to first calculate the probability that the output label corresponding to the input feature is 1, and compare with a preset threshold. The threshold is greater than the threshold, and the two faces are from the same person. Conversely, assigning to -1 means that it is from a different person. For two types of linear learning tasks, the SVM needs to find a hyperplane with the largest interval to separate the two types of samples. As shown in Figure 3, the maximum spacing ensures that the hyperplane has the best generalization ability. For the non-linear learning task in the identity authentication process of the present invention, the SVM feature classification algorithm used in the present invention converts the actual problem into a high-dimensional feature space through a nonlinear transformation, and constructs a linear discriminant function in a high-dimensional space. The nonlinear discriminant function in the original space is realized, and the SVM algorithm can obtain the optimal solution for a limited sample without multi-sample acquisition, which is suitable for the single photo case of the ID card in the invention, further improving the identity authentication of the present invention. The accuracy. The specific algorithm of SVM classification can refer to the following process:

(1)支持向量分类器(SVC)(1) Support Vector Classifier (SVC)

对于两类线性可分任务,SVC要找到一个间隔最大的超平面将两类样本分开,对大间隔能确保该超平面有最好的泛化能力,从直观上看,间隔就是两个类之间的空间大小或者超平面所确定的分离程度。从几何上讲,间隔对应数据点到超平面的最短距离,零w和b分别表示权重向量和最优 超平面偏移,则相应的超平面可以被定义为For two types of linear separable tasks, SVC needs to find a hyperplane with the largest interval to separate the two types of samples. The large interval can ensure the best generalization ability of the hyperplane. From the visual point of view, the interval is two classes. The size of the space between them or the degree of separation determined by the hyperplane. Geometrically, the interval corresponds to the shortest distance from the data point to the hyperplane, and zeros w and b represent the weight vector and the optimum, respectively. Hyperplane offset, the corresponding hyperplane can be defined as

wTx+b=0  (5)w T x+b=0 (5)

样本数据x到超平面的几何距离是The geometric distance from the sample data x to the hyperplane is

Figure PCTCN2016070830-appb-000005
Figure PCTCN2016070830-appb-000005

其中,g(x)=wT+b是超平面所确定的判别函数,也成为给定w和b的x的泛函间隔。Where g(x)=w T +b is the discriminant function determined by the hyperplane and also becomes the functional interval of x for a given w and b.

SVC就是要寻找最优超平面的参数值w和b,以最大化两类间的分类间隔,因此,SVC也被称为最大间隔分类器。将间隔固定为1,那么对于给定训练集

Figure PCTCN2016070830-appb-000006
有SVC is to find the optimal hyperplane parameter values w and b to maximize the classification interval between the two classes. Therefore, SVC is also called the maximum interval classifier. Fix the interval to 1, then for a given training set
Figure PCTCN2016070830-appb-000006
Have

Figure PCTCN2016070830-appb-000007
Figure PCTCN2016070830-appb-000007

有一些数据点满足公式(7),称为支持向量,支持向量到最优超平面的距离就是There are some data points that satisfy the formula (7), called the support vector, and the distance from the support vector to the optimal hyperplane is

Figure PCTCN2016070830-appb-000008
Figure PCTCN2016070830-appb-000008

分类间隔可以表示成The classification interval can be expressed as

Figure PCTCN2016070830-appb-000009
Figure PCTCN2016070830-appb-000009

为了找到间隔最大的超平面,SVC要用w和b最大化ρ:In order to find the hyperplane with the largest interval, SVC should maximize ρ with w and b:

Figure PCTCN2016070830-appb-000010
Figure PCTCN2016070830-appb-000010

等价于Equivalent to

Figure PCTCN2016070830-appb-000011
Figure PCTCN2016070830-appb-000011

一般情况下,用拉格朗日对偶求解式(11)的最优化问题, In general, the Lagrangian dual solution (11) is optimized.

Figure PCTCN2016070830-appb-000012
Figure PCTCN2016070830-appb-000012

对L(w,b,α)的w和b求偏导,并令之为零,得到最优化条件For partial deviations of w and b of L(w, b, α), and let it be zero, get the optimization condition

Figure PCTCN2016070830-appb-000013
Figure PCTCN2016070830-appb-000013

带入式(12)可得对偶问题Bring in (12) to get the dual problem

Figure PCTCN2016070830-appb-000014
Figure PCTCN2016070830-appb-000014

在确定最优拉格朗日乘子后,可以计算(13)中的最优权重向量w*After determining the optimal Lagrangian multiplier, the optimal weight vector w * in (13) can be calculated.

Figure PCTCN2016070830-appb-000015
Figure PCTCN2016070830-appb-000015

从而确定最优偏移b*=1-wTxs,当ys=+1Thus determining the optimal offset b * =1-w T x s when y s = +1

(2)支持向量的软间隔优化(2) Soft interval optimization of support vectors

对于现实中的数据,往往存在一些异常点,这些点对于最大间隔超平面而言是训练错误,软间隔就是要扩展SVC算法,以使超平面允许少量的噪声存在,通过引入松弛变量ζi来量化分类器的错误For real-world data, there are often some anomalies. These points are training errors for the maximum interval hyperplane. The soft interval is to extend the SVC algorithm so that the hyperplane allows a small amount of noise to exist. By introducing the slack variable ζ i Quantifier classifier error

Figure PCTCN2016070830-appb-000016
Figure PCTCN2016070830-appb-000016

参数C用来平衡分类器的复杂度和容错能力,是一个由用户选择的正则化参数。使用与前面相同的拉格朗日乘子法,可确定最优权重w*和最优偏移b*Parameter C is used to balance the complexity and fault tolerance of the classifier. It is a regularization parameter selected by the user. Using the same Lagrangian multiplier method as before, the optimal weight w * and the optimal offset b * can be determined.

(3)核函数(3) Kernel function

核函数可将数据从低维非线性空间变换到高维空间,从而使得数据在 高维特征空间是线性可分的,核函数定义如下:The kernel function transforms data from a low-dimensional nonlinear space to a high-dimensional space, thereby making the data The high dimensional feature space is linearly separable, and the kernel function is defined as follows:

K(x,x')=φ(x)Tφ(x')  (17)K(x,x')=φ(x) T φ(x') (17)

其中

Figure PCTCN2016070830-appb-000017
表示从低维输入空间到高维特征空间的映射,带入公式(15)中的最优权重向量表达式中,可得最优分类器among them
Figure PCTCN2016070830-appb-000017
The mapping from the low-dimensional input space to the high-dimensional feature space is introduced into the optimal weight vector expression in equation (15), and the optimal classifier can be obtained.

Figure PCTCN2016070830-appb-000018
Figure PCTCN2016070830-appb-000018

一般情况下,优先选择RBF核。In general, the RBF core is preferred.

(4)类概率预测(4) Probability prediction

给定待测样例x,我们希望得到将其归属为某一类的概率,以便和预先设定的阈值比较,从而确定分类标签。采用sigmod函数拟合,条件概率:Given the sample x to be tested, we want to get the probability of categorizing it as a class to compare with a preset threshold to determine the classification label. Fit with sigmod function, conditional probability:

Figure PCTCN2016070830-appb-000019
Figure PCTCN2016070830-appb-000019

对训练样本集

Figure PCTCN2016070830-appb-000020
通过极大似然估计确定模型参数A、B。对待测样例x,首先由训练好的SVM模型得到f,再通过此概率模型便可得到类别标签为1的概率,将此概率值与预先设定好的阈值相比较,最终确定类别标签。阈值的选取需要通过验证集确定,在具体的做法中,可将验证集中每对验证样本的类别概率从小到大排序,依次作为阈值,得到每个阈值对应的误判率和认证率,根据事先要求的误判率确定最佳阈值。Training sample set
Figure PCTCN2016070830-appb-000020
The model parameters A, B are determined by maximum likelihood estimation. To test sample x, first obtain f from the trained SVM model, and then obtain the probability that the category label is 1 by using the probability model, compare the probability value with a preset threshold, and finally determine the category label. The selection of the threshold needs to be determined by the verification set. In a specific method, the class probability of each pair of verification samples in the verification set can be sorted from small to large, and sequentially used as a threshold, and the false positive rate and the authentication rate corresponding to each threshold are obtained, according to the prior The required false positive rate determines the optimal threshold.

通过训练样本确定分类器模型后,对于一对待测试的身份证照片和现场采集的人脸图像,归一化后首先提取各自的GOP特征,并计算分类特征x,经训练过的SVM分类器,预测标签

Figure PCTCN2016070830-appb-000021
若为1,则判为同一个人,为-1则判为不同人。After determining the classifier model through the training sample, for a ID image to be tested and a face image collected in the field, after normalization, first extract the respective GOP features, and calculate the classification feature x, the trained SVM classifier, Forecast label
Figure PCTCN2016070830-appb-000021
If it is 1, it is judged to be the same person, and if it is -1, it is judged as a different person.

本发明中的特征分类过程,除采用SVM分类外,还可以采用随机森林方法。所谓随机森林,是一个包含多个决策树的分类器。所谓随机,指的便是每棵决策树的生成具有随机性(每棵树的训练样本从原始样本中随机选择而来)。决策树构建过程中用到的各个属性集合,也是通过等概率的随机选择而来,即训练样本的随机选择和分裂属性的随机选择。当有一 个新的输入样本进入的时候,森林中的每棵决策树分别对该输入样本进行判断,得到各自的决策结果,再将决策结果汇总到森林的输出。对于分类问题,简单的统计所有树对类别的投票,选择投票数最多的一类,将输入样本归属为该类。In the feature classification process of the present invention, in addition to the SVM classification, a random forest method can also be adopted. The so-called random forest is a classifier that contains multiple decision trees. The so-called randomness means that the generation of each decision tree is random (the training samples of each tree are randomly selected from the original samples). The set of attributes used in the decision tree construction process is also selected by random selection of equal probability, that is, random selection of training samples and random selection of splitting attributes. When there is one When a new input sample enters, each decision tree in the forest judges the input samples separately, obtains the respective decision results, and then summarizes the decision results to the forest output. For classification problems, simply count all tree-to-category votes, select the one with the highest number of votes, and assign the input samples to that category.

作为本发明的进一步改进,步骤S104可以进一步为:用于对得到的人脸图像样本对的GOP特征采用点乘操作,得到一对余弦相似度向量作为特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。As a further improvement of the present invention, step S104 may further be: performing a point multiplication operation on the GOP features of the obtained face image sample pairs, obtaining a pair of cosine similarity vectors as feature vectors, and using SVM two-class processing to calculate and judge Whether the face image sample pair belongs to the same person.

本步骤中,余弦相似度向量的具体计算过程如下:In this step, the specific calculation process of the cosine similarity vector is as follows:

人脸认证可看做是二分类问题,给定一对人脸图像I1(x,y)、I2(x,y),分别代表身份证照片和现场采集的视频照片,我们的目的是判断I1、I2是否来自于同一个人,若是,判定结果为1,若否,判为-1,因此,需要把I1、I2映射到特征空间Face authentication can be regarded as a two-category problem. Given a pair of face images I 1 (x, y) and I 2 (x, y), which represent ID photos and live video photos, our purpose is It is judged whether I 1 and I 2 are from the same person, and if so, the judgment result is 1, and if not, it is judged as -1, therefore, it is necessary to map I 1 and I 2 to the feature space.

x=F(I1,I2)  (20)x=F(I 1 ,I 2 ) (20)

其中,x∈Rd,表示从一对人脸中提取的特征向量,特征提取函数F:I×I→Rd,定义如下:Where x∈R d represents the feature vector extracted from a pair of faces, and the feature extraction function F: I×I→R d is defined as follows:

Figure PCTCN2016070830-appb-000022
Figure PCTCN2016070830-appb-000022

公式(21)中,.×表示矩阵的点乘操作,该式计算的一对人脸图像各自GOP特征逐点之间的余弦相似度,结果为一个d维的列向量,作为特征分类过程中的输入。In formula (21), .× denotes the point multiplication operation of the matrix, and the cosine similarity between the GOP features of the pair of face images calculated by the equation is point by point, and the result is a d-dimensional column vector as the feature classification process. input of.

本步骤中,采用梯度方向金字塔的余弦相似度向量作为特征向量,能够全面的实现人脸图像特征的提取,同时余弦相似度向量对光照、表情和年龄等外界因素还具有鲁棒性,进一步加强了本发明的准确性和抗干扰性。In this step, the cosine similarity vector of the gradient direction pyramid is used as the feature vector, which can comprehensively realize the feature extraction of the face image, and the cosine similarity vector is robust to external factors such as illumination, expression and age, and further strengthens The accuracy and anti-interference of the present invention.

本发明中,如图4所示,步骤S102之前可以包括:In the present invention, as shown in FIG. 4, before step S102, the method may include:

步骤S1021:对人脸图像样本对进行人脸检测处理;Step S1021: performing face detection processing on the face image sample pair;

步骤S1022:对经过人脸检测处理后的人脸图像样本对进行特征点定 位处理;Step S1022: Perform feature point setting on the face image sample pair after the face detection processing Bit processing

步骤S1023:对经过特征点定位处理后的人脸图像样本对进行归一化处理。Step S1023: Normalize the face image sample pairs subjected to the feature point positioning process.

本步骤中,人脸检测是人脸分析的首要环节,其处理的问题是确认图像中是否存在人脸图像,本发明中,优选采用Adaboost人脸检测算法,Adaboost算法是一种分类器算法,其基本思想是利用大量的分类能力一般的简单分类器通过一定的方法叠加起来,构成一个分类能力很强的强分类器,再将若干个强分类器串联成为分级分类器完成图像搜索检测,该检测器检测速度快,开发周期短,具有可行性。人脸检测完成后,如果存在人脸图像则对人脸图像进行特征点定位,本发明中,特征点定位可以采用本领域技术人员公知的各种方法,如基于几何特征的方法、基于模板匹配的方法、基于模型的方法等。本发明中优选采用基于灰度特性和几何特征的方法,该方法能够简单、有效的实现人脸图像的特征点定位。特征点定位完成之后,对每幅人脸图像,根据双眼定位坐标,通过图像配准,将双眼对齐到预先设置的相同位置处,实现归一化,本发明中,对人脸图像的归一化处理还可以采用本领域技术人员公知的各种方法,如最大值-最小值归一化,基于傅里叶变换的人脸图像归一化等。In this step, face detection is the primary part of face analysis, and the problem is to confirm whether there is a face image in the image. In the present invention, Adaboost face detection algorithm is preferably adopted, and Adaboost algorithm is a classifier algorithm. The basic idea is to use a large number of simple classifiers with general classification ability to superimpose them by a certain method to form a strong classifier with strong classification ability, and then connect several strong classifiers into a classifier to complete image search and detection. The detector has a fast detection speed and a short development cycle, which is feasible. After the face detection is completed, if there is a face image, the feature image is positioned on the face image. In the present invention, the feature point location may adopt various methods known to those skilled in the art, such as a method based on geometric features, based on template matching. Methods, model-based methods, etc. In the present invention, a method based on gradation characteristics and geometric features is preferably adopted, which can realize feature point localization of a face image simply and effectively. After the feature point positioning is completed, for each face image, according to the binocular positioning coordinates, the eyes are aligned to the same position set in advance by image registration, and normalization is achieved. In the present invention, the face image is normalized. The processing may also employ various methods known to those skilled in the art, such as maximum-minimum normalization, Fourier transform-based face image normalization, and the like.

另一方面,与上述的方法相对应,本发明还提供了一种身份认证装置,如图5所示,包括:On the other hand, corresponding to the above method, the present invention further provides an identity authentication device, as shown in FIG. 5, including:

获取模块11:用于获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;The obtaining module 11 is configured to obtain a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image collected in the field;

采样模块12:用于采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构;The sampling module 12 is configured to process the acquired face image sample pair by using a face image pyramid algorithm to obtain a face image pyramid structure;

计算模块13:用于对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征;The calculating module 13 is configured to calculate a grayscale gradient direction for the face image of each scale in the pyramid image structure of the obtained face image sample pair, to obtain a gradient direction pyramid GOP feature;

判断模块14:用于根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。The determining module 14 is configured to calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belongs to the same person.

本发明的身份认证装置,首先获取模块11获取人脸图像样本对;然 后采样模块12采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构,该方法采用多分辨率的方式对人脸图像进行表示,能够全面地描述人脸图像特征,有效的提高了本发明进行身份认证的准确性;接下来计算模块13对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征,使本发明的抗干扰性增强,进一步提高了人脸认证结果的准确性;最后判断模块14根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。本发明的身份认证方法有效地避免光照、年龄和表情等因素的干扰,同时也显著地提高了身份认证的准确性。The identity authentication apparatus of the present invention first acquires a pair of face image samples by the module 11; The post-sampling module 12 processes the acquired face image sample pairs by using the face image pyramid algorithm to obtain a pyramid structure of the face image. The method uses a multi-resolution method to represent the face image, and can comprehensively describe the face image. The feature effectively improves the accuracy of the identity authentication of the present invention; then the calculation module 13 calculates the grayscale gradient direction of the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain the gradient direction. The pyramid GOP feature enhances the anti-interference of the present invention, and further improves the accuracy of the face authentication result. Finally, the determining module 14 calculates, according to the obtained GOP feature of the face image sample pair, whether the face image sample pair is the same. personal. The identity authentication method of the invention effectively avoids interference of factors such as illumination, age and expression, and also significantly improves the accuracy of identity authentication.

作为本发明的一种改进,采样模块12可以进一步用于对获取的人脸图像样本对采用高斯核卷积平滑和下采样得到人脸图像金字塔结构。As an improvement of the present invention, the sampling module 12 can be further configured to obtain a face image pyramid structure by using Gaussian kernel convolution smoothing and downsampling on the acquired face image sample pairs.

多数情况下,人脸图像在单一尺度下不容易被察觉的特征在多尺度下很容易被捕捉到,因此,本发明中优选采用高斯核函数卷积平滑和下采样操作的到人脸图像金字塔结构对人脸图像进行多尺度、多分辨率的表示,该方法能够更全面地描述人脸图像特征,加强了本发明中身份认证的准确性。本发明中优选采用。In most cases, features that are not easily perceived by a face image at a single scale are easily captured at multiple scales. Therefore, in the present invention, it is preferable to use a Gaussian kernel function convolution smoothing and downsampling operation to the face image pyramid. The structure performs multi-scale and multi-resolution representation of the face image, and the method can describe the face image feature more comprehensively, and enhances the accuracy of the identity authentication in the present invention. It is preferably employed in the present invention.

优选的,判断模块14,可以进一步用于对得到的人脸图像样本对的GOP特征计算特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。Preferably, the determining module 14 is further configured to calculate a feature vector for the GOP feature of the obtained face image sample pair, and use the SVM binary classification process to calculate whether the face image sample pair belongs to the same person.

对于本发明的身份认证过程中存在非线性的学习任务,判断模块14采用的SVM特征分类算法将实际问题通过非线性变换转换到高维的特征空间,在高维空间中构造线性判别函数来实现原空间中的非线性判别函数,同时SVM算法能够针对有限的样本得到最优解,无需多样本采集,适用于本发明中的身份证照片样本单一的情况,进一步提高了本发明的身份认证的准确性。For the non-linear learning task in the identity authentication process of the present invention, the SVM feature classification algorithm adopted by the judging module 14 converts the actual problem into a high-dimensional feature space by nonlinear transformation, and constructs a linear discriminant function in a high-dimensional space to realize The nonlinear discriminant function in the original space, and the SVM algorithm can obtain the optimal solution for a limited sample, without multi-sample acquisition, and is suitable for the single photo case of the ID card in the present invention, further improving the identity authentication of the present invention. accuracy.

作为本发明的一种改进,判断模块14可以进一步用于对得到的人脸图像样本对的GOP特征采用点乘操作,得到一对余弦相似度向量作为特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一 个人。As an improvement of the present invention, the determining module 14 may be further configured to perform a point multiplication operation on the GOP features of the obtained face image sample pairs, and obtain a pair of cosine similarity vectors as feature vectors, and use SVM two-class processing to calculate and judge Whether the face image sample pair is the same personal.

本发明中采用梯度方向金字塔的余弦相似度向量作为特征向量,能够全面的实现人脸图像特征的提取,同时余弦相似度向量对光照、表情和年龄等外界因素还具有鲁棒性,进一步加强了本发明的准确性和抗干扰性。In the present invention, the cosine similarity vector of the gradient direction pyramid is used as the feature vector, which can comprehensively realize the feature extraction of the face image, and the cosine similarity vector is robust to external factors such as illumination, expression and age, and further strengthens. The accuracy and anti-interference of the present invention.

作为本发明的进一步改进,如图6所示,获取模块11和采样模块12之间还连接有:As a further improvement of the present invention, as shown in FIG. 6, the acquisition module 11 and the sampling module 12 are further connected with:

检测模块121,用于对人脸图像样本对进行人脸检测处理;The detecting module 121 is configured to perform face detection processing on the face image sample pair;

定位模块122,用于对经过人脸检测处理后的人脸图像样本对进行特征点定位处理;The positioning module 122 is configured to perform feature point positioning processing on the face image sample pair subjected to the face detection processing;

归一化模块123,用于对经过特征点定位处理后的人脸图像样本对进行归一化处理。The normalization module 123 is configured to perform normalization processing on the face image sample pairs subjected to the feature point positioning processing.

本发明中,检测模块121处理的问题是确认图像中是否存在人脸图像,以免采集到的非人脸图像进入装置中,导致浪费装置计算资源的现象。定位模块122对获取的人脸图像样本对采用特征点定位处理,避免了姿态和位置信息对身份认证的影响。归一化模块123对经过特征点定位处理的人脸图像样本对进行归一化处理,加强人脸图像样本对的一致性,从而避免光照、方向和噪声等因素的影响,提高了本发明的抗干扰性。In the present invention, the problem that the detection module 121 processes is to confirm whether there is a face image in the image, so as to prevent the collected non-face image from entering the device, resulting in a waste of device computing resources. The positioning module 122 adopts feature point location processing on the acquired face image sample pair, thereby avoiding the influence of posture and location information on identity authentication. The normalization module 123 normalizes the face image sample pairs processed by the feature point positioning to enhance the consistency of the face image sample pairs, thereby avoiding the influence of factors such as illumination, direction and noise, and improving the invention. Anti-interference.

本发明中,如图7所示,在具体工作时,可以参照如下步骤进行:In the present invention, as shown in FIG. 7, in the specific work, the following steps can be performed:

步骤1:获取人脸图像样本对,其中一个人脸图像样本取自身份证数据库,另一个人脸图像样本取自现场采集视频库;Step 1: Obtain a pair of face image samples, wherein one face image sample is taken from the ID database, and another face image sample is taken from the live collection video library;

步骤2:对获取的人脸图像样本对采用特征点定位处理;Step 2: adopting feature point location processing on the acquired face image sample pair;

步骤3:对经过特征点定位处理后的人脸图像样本对进行归一化处理;Step 3: normalizing the face image sample pairs after the feature point positioning processing;

步骤4:对经过归一化处理后的人脸图像样本对通过高斯核卷积平滑和下采样得到人脸图像金字塔结构;Step 4: obtaining a face image pyramid structure by normalizing the face image sample pair by Gaussian kernel convolution smoothing and downsampling;

步骤5:对人脸图像金字塔结构中各个尺度下的人脸图像计算梯度方向,得到人脸图像的梯度方向金字塔GOP特征。Step 5: Calculate the gradient direction of the face image at each scale in the pyramid structure of the face image, and obtain the gradient direction pyramid GOP feature of the face image.

步骤6:对于人脸图像样本对计算其梯度方向金字塔的余弦相似度向量,得到人脸特征表达。 Step 6: Calculate the cosine similarity vector of the gradient direction pyramid for the face image sample pair to obtain the face feature expression.

步骤7:采用SVM分类算法对得到的人脸图像表达进行类概率预测;Step 7: Using the SVM classification algorithm to perform class probability prediction on the obtained face image representation;

步骤8:得到的类概率预测与给定的阈值(Th)进行比较;Step 8: The obtained class probability prediction is compared with a given threshold (Th);

步骤9:得到认证结果,其中,概率值大于Th,则赋值为1,表示人脸图形样本对来自于同一个人,概率值小于Th,则赋值为-1,表示人脸图形样本对来自于不同的人。Step 9: Obtain the authentication result, wherein the probability value is greater than Th, then the value is 1, indicating that the face graph sample pair is from the same person, and the probability value is less than Th, then the value is -1, indicating that the face graph sample pair is different. People.

上述实施例中,采用人脸梯度方向金字塔,获得人脸的特征表达,利用了人脸在不同尺度下的梯度信息,只取梯度方向而舍弃梯度幅值,实验表明该做法更能抓住人脸的本质信息,且对光照、表情和年龄等因素具有较好的鲁棒性,同时,实验表明,该算法简洁高效,时间复杂度和空间复杂度都比较低。本算法直接从人脸认证的角度解决身份证认证问题,避免了身份证照片的单样本带来的问题,在229名自愿者的二代身份证相片和一张现场采集的视频照组成的测试库上测试,当系统的错误接受率分别为1%,10%时,相对应的正确接受率分别为82.97%和99.13%。In the above embodiment, the face gradient direction pyramid is used to obtain the feature expression of the face, and the gradient information of the face at different scales is utilized, and only the gradient direction is discarded, and the gradient amplitude is discarded. Experiments show that the method can catch people more. The essential information of the face is robust to factors such as illumination, expression and age. At the same time, experiments show that the algorithm is simple and efficient, and the time complexity and space complexity are relatively low. The algorithm directly solves the ID card authentication problem from the perspective of face authentication, avoids the problem caused by the single sample of the ID card photo, and the test of the second generation ID card photo of 229 volunteers and a video photo taken on site. In the library test, when the system's error acceptance rate is 1%, 10%, the corresponding correct acceptance rates are 82.97% and 99.13%, respectively.

以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以作出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。 The above is a preferred embodiment of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should be considered as the scope of protection of the present invention.

Claims (10)

一种身份认证方法,其特征在于,包括:An identity authentication method, comprising: 获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;Obtaining a pair of face image samples, wherein one of the face images is an ID card photo, and the other face image is an image acquired in the field; 采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构;The face image pyramid pair is processed by the face image pyramid algorithm to obtain a pyramid structure of the face image; 对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征;Calculating a grayscale gradient direction on the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature; 根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。According to the obtained GOP feature of the face image sample pair, it is calculated whether the face image sample pair belongs to the same person. 根据权利要求1所述的身份认证方法,其特征在于,所述采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构进一步为:The identity authentication method according to claim 1, wherein the face image pyramid algorithm is used to process the acquired face image sample pair, and the pyramid structure of the face image is further: 对获取的人脸图像样本对采用高斯核卷积平滑和下采样得到人脸图像金字塔结构。A face image pyramid structure is obtained by using Gaussian kernel convolution smoothing and downsampling for the acquired face image sample pairs. 根据权利要求1所述的身份认证方法,其特征在于,所述根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人进一步为:The identity authentication method according to claim 1, wherein the calculating, according to the GOP feature of the obtained face image sample pair, determining whether the face image sample pair belongs to the same person is further: 对得到的人脸图像样本对的GOP特征计算特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。The feature vector is calculated for the GOP feature of the obtained face image sample pair, and the SVM binary classification process is used to calculate whether the face image sample pair belongs to the same person. 根据权利要求1所述的身份认证方法,其特征在于,所述根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人进一步为:The identity authentication method according to claim 1, wherein the calculating, according to the GOP feature of the obtained face image sample pair, determining whether the face image sample pair belongs to the same person is further: 用于对得到的人脸图像样本对的GOP特征采用点乘操作,得到一对余弦相似度向量作为特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。The GOP feature of the obtained face image sample pair is subjected to a point multiplication operation to obtain a pair of cosine similarity vectors as the feature vector, and the SVM binary classification process is used to calculate whether the face image sample pair belongs to the same person. 根据权利要求1至4中任一所述的身份认证方法,其特征在于,所 述采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构之前包括:An identity authentication method according to any one of claims 1 to 4, characterized in that The face image pyramid algorithm is used to process the acquired face image sample pairs, and the face image pyramid structure is obtained before: 对人脸图像样本对进行人脸检测处理;Perform face detection processing on face image sample pairs; 对经过人脸检测处理后的人脸图像样本对进行特征点定位处理;Perform feature point location processing on the face image sample pairs after the face detection processing; 对经过特征点定位处理后的人脸图像样本对进行归一化处理。The face image sample pairs processed by the feature point are normalized. 一种身份认证装置,其特征在于,包括:An identity authentication device, comprising: 获取模块:用于获取人脸图像样本对,其中一个人脸图像为身份证照片,另一个人脸图像为现场采集的图像;The acquisition module is configured to obtain a pair of face image samples, wherein one face image is an ID card photo, and another face image is an image acquired in the field; 采样模块:用于采用人脸图像金字塔算法对获取的人脸图像样本对进行处理,得到人脸图像金字塔结构;Sampling module: used to process the acquired face image sample pairs by using the face image pyramid algorithm to obtain a pyramid structure of the face image; 计算模块:用于对得到的人脸图像样本对的人脸图像金字塔结构中各个尺度的人脸图像计算灰度梯度方向,得到梯度方向金字塔GOP特征;The calculation module is configured to calculate a gray gradient direction of the face image of each scale in the pyramid image structure of the obtained face image sample pair to obtain a gradient direction pyramid GOP feature; 判断模块:用于根据得到的人脸图像样本对的GOP特征,计算判断人脸图像样本对是否属于同一个人。The judging module is configured to calculate, according to the GOP feature of the obtained face image sample pair, whether the pair of face image samples belong to the same person. 根据权利要求6所述的身份认证装置,其特征在于,所述采样模块,进一步用于对获取的人脸图像样本对采用高斯核卷积平滑和下采样得到人脸图像金字塔结构。The identity authentication apparatus according to claim 6, wherein the sampling module is further configured to obtain a face image pyramid structure by using Gaussian kernel convolution smoothing and downsampling on the acquired face image sample pairs. 根据权利要求6所述的身份认证装置,其特征在于,所述判断模块,进一步用于对得到的人脸图像样本对的GOP特征计算特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。The identity authentication device according to claim 6, wherein the determining module is further configured to calculate a feature vector for the GOP feature of the obtained face image sample pair, and use SVM binary classification processing to calculate and determine the face image sample. Whether it belongs to the same person. 根据权利要求6所述的身份认证装置,其特征在于,所述判断模块,进一步用于对得到的人脸图像样本对的GOP特征采用点乘操作,得到一对余弦相似度向量作为特征向量,采用SVM二分类处理,计算判断人脸图像样本对是否属于同一个人。The identity authentication apparatus according to claim 6, wherein the determining module is further configured to perform a point multiplication operation on the GOP features of the obtained face image sample pair to obtain a pair of cosine similarity vectors as feature vectors. The SVM binary classification process is used to calculate whether the face image sample pairs belong to the same person. 根据权利要求6所述的身份认证装置,其特征在于,所述获取模块和采样模块之间连接有:The identity authentication device according to claim 6, wherein the connection between the acquisition module and the sampling module is: 检测模块,用于对人脸图像样本对进行人脸检测处理;a detecting module, configured to perform face detection processing on a face image sample pair; 定位模块,用于对经过人脸检测处理后的人脸图像样本对进行特征点 定位处理;a positioning module for performing feature points on the face image sample pairs after the face detection processing Positioning processing 归一化模块,用于对经过特征点定位处理后的人脸图像样本对进行归一化处理。 The normalization module is configured to normalize the face image sample pairs after the feature point positioning processing.
PCT/CN2016/070830 2015-03-24 2016-01-13 Identity authentication method and apparatus Ceased WO2016150240A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510131563.9A CN105447532A (en) 2015-03-24 2015-03-24 Identity authentication method and device
CN201510131563.9 2015-03-24

Publications (1)

Publication Number Publication Date
WO2016150240A1 true WO2016150240A1 (en) 2016-09-29

Family

ID=55557691

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/070830 Ceased WO2016150240A1 (en) 2015-03-24 2016-01-13 Identity authentication method and apparatus

Country Status (2)

Country Link
CN (1) CN105447532A (en)
WO (1) WO2016150240A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106446855A (en) * 2016-09-30 2017-02-22 深圳市商汤科技有限公司 Real name authentication device
CN109063565A (en) * 2018-06-29 2018-12-21 中国科学院信息工程研究所 A kind of low resolution face identification method and device
CN110210341A (en) * 2019-05-20 2019-09-06 深圳供电局有限公司 Identity card authentication method based on face recognition, system thereof and readable storage medium
US20200074608A1 (en) * 2018-09-05 2020-03-05 Infineon Technologies Ag Time of Flight Camera and Method for Calibrating a Time of Flight Camera
CN111160250A (en) * 2019-12-30 2020-05-15 安徽易刚信息技术有限公司 Blueberry growing period detection method and device based on artificial neural network
CN111414888A (en) * 2020-03-31 2020-07-14 杭州博雅鸿图视频技术有限公司 Low-resolution face recognition method, system, device and storage medium
CN113343955A (en) * 2021-08-06 2021-09-03 北京惠朗时代科技有限公司 Face recognition intelligent tail box application method based on depth pyramid
CN117095444A (en) * 2023-08-29 2023-11-21 广州市星飞达电子科技有限公司 An image recognition method based on artificial intelligence
CN121117809A (en) * 2025-11-14 2025-12-12 中铁十四局集团大盾构工程有限公司 A method and system for intelligent identification of geological formations during tunnel boring machine (TBM) excavation based on rigorous single-sample prototype network and validation set scale calibration.

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106127103B (en) * 2016-06-12 2019-06-25 广州广电运通金融电子股份有限公司 A method and device for offline identity authentication
CN107871100B (en) * 2016-09-23 2021-07-06 北京眼神科技有限公司 Face model training method and device, face authentication method and device
CN106599772B (en) * 2016-10-31 2020-04-28 北京旷视科技有限公司 Living body verification method and device and identity authentication method and device
CN106780906B (en) * 2016-12-28 2019-06-21 北京品恩科技股份有限公司 A kind of testimony of a witness unification recognition methods and system based on depth convolutional neural networks
CN108875478B (en) * 2017-08-14 2022-04-12 北京旷视科技有限公司 People-authentication-integrated verification method, device and system and storage medium
CN108647583B (en) * 2018-04-19 2022-02-22 浙江大承机器人科技有限公司 Face recognition algorithm training method based on multi-target learning
CN109284675B (en) * 2018-08-13 2022-06-07 创新先进技术有限公司 User identification method, device and equipment
CN109635626B (en) * 2018-10-18 2022-11-25 北京和鸿盈科技术有限公司 Single-sample low-resolution single-class face recognition method
US11200333B2 (en) 2019-08-13 2021-12-14 International Business Machines Corporation File exposure to an intended recipient
CN111932800A (en) * 2020-08-19 2020-11-13 中国工商银行股份有限公司 Security verification method and device
CN112668572B (en) * 2020-12-24 2023-01-31 成都新希望金融信息有限公司 Identity card image standardization method and device, electronic equipment and storage medium
CN115019927B (en) * 2022-08-08 2022-10-04 北京国安广传网络科技有限公司 Physical examination data processing method and public health physical examination vehicle for realizing physical examination data processing

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2356509A (en) * 1999-11-16 2001-05-23 Sony Uk Ltd Calculating data quantity per picture according to quantity of metadata in a GOP
CN101510257A (en) * 2009-03-31 2009-08-19 华为技术有限公司 Human face similarity degree matching method and device
CN102663413A (en) * 2012-03-09 2012-09-12 中盾信安科技(江苏)有限公司 Multi-gesture and cross-age oriented face image authentication method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2356509A (en) * 1999-11-16 2001-05-23 Sony Uk Ltd Calculating data quantity per picture according to quantity of metadata in a GOP
CN101510257A (en) * 2009-03-31 2009-08-19 华为技术有限公司 Human face similarity degree matching method and device
CN102663413A (en) * 2012-03-09 2012-09-12 中盾信安科技(江苏)有限公司 Multi-gesture and cross-age oriented face image authentication method

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106446855A (en) * 2016-09-30 2017-02-22 深圳市商汤科技有限公司 Real name authentication device
CN109063565A (en) * 2018-06-29 2018-12-21 中国科学院信息工程研究所 A kind of low resolution face identification method and device
CN109063565B (en) * 2018-06-29 2021-12-10 中国科学院信息工程研究所 Low-resolution face recognition method and device
US20200074608A1 (en) * 2018-09-05 2020-03-05 Infineon Technologies Ag Time of Flight Camera and Method for Calibrating a Time of Flight Camera
CN110210341A (en) * 2019-05-20 2019-09-06 深圳供电局有限公司 Identity card authentication method based on face recognition, system thereof and readable storage medium
CN110210341B (en) * 2019-05-20 2022-12-06 深圳供电局有限公司 ID card authentication method and system based on face recognition, and readable storage medium
CN111160250A (en) * 2019-12-30 2020-05-15 安徽易刚信息技术有限公司 Blueberry growing period detection method and device based on artificial neural network
CN111414888A (en) * 2020-03-31 2020-07-14 杭州博雅鸿图视频技术有限公司 Low-resolution face recognition method, system, device and storage medium
CN113343955A (en) * 2021-08-06 2021-09-03 北京惠朗时代科技有限公司 Face recognition intelligent tail box application method based on depth pyramid
CN117095444A (en) * 2023-08-29 2023-11-21 广州市星飞达电子科技有限公司 An image recognition method based on artificial intelligence
CN117095444B (en) * 2023-08-29 2024-02-13 广州市星飞达电子科技有限公司 Image recognition method based on artificial intelligence
CN121117809A (en) * 2025-11-14 2025-12-12 中铁十四局集团大盾构工程有限公司 A method and system for intelligent identification of geological formations during tunnel boring machine (TBM) excavation based on rigorous single-sample prototype network and validation set scale calibration.

Also Published As

Publication number Publication date
CN105447532A (en) 2016-03-30

Similar Documents

Publication Publication Date Title
WO2016150240A1 (en) Identity authentication method and apparatus
US8805018B2 (en) Method of detecting facial attributes
WO2017024963A1 (en) Image recognition method, measure learning method and image source recognition method and device
WO2019033569A1 (en) Eyeball movement analysis method, device and storage medium
CN104504366A (en) System and method for smiling face recognition based on optical flow features
CN111126240A (en) A three-channel feature fusion face recognition method
JP2017033469A (en) Image identification method, image identification apparatus, and program
WO2016145940A1 (en) Face authentication method and device
Feng et al. Iris R-CNN: Accurate iris segmentation and localization in non-cooperative environment with visible illumination
WO2019033570A1 (en) Lip movement analysis method, apparatus and storage medium
JP6071002B2 (en) Reliability acquisition device, reliability acquisition method, and reliability acquisition program
WO2019033568A1 (en) Lip movement capturing method, apparatus and storage medium
WO2017206914A1 (en) Fingerprint recognition method, fingerprint recognition system, and electronic device
CN107766864B (en) Method and device for extracting features and method and device for object recognition
Deng et al. Attention-aware dual-stream network for multimodal face anti-spoofing
CN105550641B (en) Age estimation method and system based on multi-scale linear differential texture features
JP2013206458A (en) Object classification based on external appearance and context in image
CN111127407A (en) A style transfer forgery image detection device and method based on Fourier transform
CN110163182A (en) A kind of hand back vein identification method based on KAZE feature
CN108154107B (en) Method for determining scene category to which remote sensing image belongs
TWI731919B (en) Image recognition method and device and metric learning method and device
CN110458064B (en) Combining data-driven and knowledge-driven low-altitude target detection and recognition methods
TWI632509B (en) Face recognition apparatus and method thereof, method for increasing image recognition accuracy, and computer-readable storage medium
Dhar et al. Fish image classification by XGBoost based on gist and GLCM features
CN114495198B (en) Image processing method, device, computer equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16767617

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16767617

Country of ref document: EP

Kind code of ref document: A1