Multi-azimuth identity identification authentication system and method
Technical Field
The invention relates to the technical field of identity identification authentication, in particular to a multi-azimuth identity identification authentication system and method.
Background
During the online service transaction, the identity of the operator needs to be confirmed through identity recognition authentication, and the current identity recognition authentication includes short message code authentication, identity card authentication, face recognition verification and the like. The traditional face recognition authentication technology is easy to crack by adopting the modes of photos and the like, and the safety is not high. In order to improve safety, in-vivo detection is carried out by introducing actions such as shaking head, blinking and the like, but at present, the detection has certain loopholes, a human face mask can be manufactured through a 3D printer for cracking, a head model can be manufactured to attach corresponding front face pictures to the front side of the model, side face pictures are attached to two sides of the model, and partial in-vivo detection can be carried out by rotating the model when the in-vivo detection prompts shaking head.
Disclosure of Invention
The invention aims to provide a multi-azimuth identity identification authentication system and method aiming at the defects in the prior art.
In order to achieve the above object, in a first aspect, the present invention provides a multi-directional identity authentication system, including a terminal and a server, where after entering an identity authentication process, the server randomly generates a segment of authentication information and sends the authentication information and preset guidance information to the terminal;
the terminal comprises an information output component, wherein the information output component is controlled to output authentication information and guiding information after receiving the authentication information and the guiding information sent by the server, so that a person to be authenticated can know the content of the authentication information, and the guiding information comprises guiding the person to be authenticated to read the authentication information;
the terminal also comprises a camera used for acquiring video data of the person to be authenticated after receiving the authentication information and the guide information sent by the server;
the terminal also comprises a communication module used for receiving the authentication information and the guide information sent by the server and sending the video data to the server;
the server extracts a plurality of continuous frames of first image data from the video data and carries out face identification authentication based on the first image data;
after the face recognition authentication is passed, the server also extracts a plurality of second image data at set time intervals, compares the second image data with the first image data based on the face recognition, if the second image data is above a set threshold, then compares the image similarity between the second image data, and if the similarity is below the set threshold, outputs the result of passing the identity recognition authentication.
The terminal further comprises a telephone receiver used for acquiring voice data when the person to be authenticated reads the authentication message, the communication module sends the voice data to a server, the server compares the voice data with the authentication information based on voice recognition to judge whether the voice data is consistent with the authentication information, and if so, an identity recognition authentication passing result is output.
Further, the server identifies the mouth shape of the video data based on the mouth shape identification model, and synchronously compares the identification result with the voice data to judge whether the voice data is sent out by the personnel in the current video.
Further, the information output component comprises a display screen, and the authentication information and the guide information are displayed on the display screen in a text mode.
Further, the information output component comprises a display screen and a loudspeaker, the authentication information is displayed on the display screen in a text mode, and the guide information is output by the loudspeaker in a voice mode.
In a second aspect, the present invention provides a multi-directional identity authentication method, applied in the system composed of the terminal and the server, including:
after entering an identity identification authentication process, the server randomly generates a section of authentication information and sends the authentication information and preset guide information to the terminal;
the terminal receives the authentication information and the guide information sent by the server through a communication module, and after receiving the authentication information and the guide information sent by the server, the terminal controls an information output component to output the authentication information and the guide information so that a person to be authenticated can know the content of the authentication information, and the guide information comprises a guide part for guiding the person to be authenticated to read the authentication information;
after receiving authentication information and guide information sent by a server, the terminal controls a camera of the terminal to acquire video data of a person to be authenticated, and sends the video data to the server through a communication module;
the server extracts a plurality of continuous frames of first image data from the video data and carries out face identification authentication based on the first image data;
after the face recognition authentication is passed, the server also extracts a plurality of second image data at set time intervals, compares the second image data with the first image data based on the face recognition, if the second image data is above a set threshold, then compares the image similarity between the second image data, and if the similarity is below the set threshold, outputs the result of passing the identity recognition authentication.
Further, the terminal obtains voice data when a person to be authenticated reads the authentication message through a telephone receiver, the communication module sends the voice data to the server, the server compares the voice data with the authentication information based on voice recognition to judge whether the voice data is consistent with the authentication information, and if so, an identity recognition authentication passing result is output.
Further, the server identifies the mouth shape of the video data based on the mouth shape identification model, and synchronously compares the identification result with the voice data to judge whether the voice data is sent out by the personnel in the current video.
Furthermore, the authentication information and the guiding information are both displayed on a display screen of the terminal in a text mode.
Further, the authentication information is displayed on a display screen of the terminal in a text mode, and the guide information is output by a loudspeaker of the terminal in a voice mode.
Has the advantages that: the invention sends the authentication information to the terminal of the person to be authenticated during the authentication, guides the person to be authenticated to read the authentication information, collects the video data and the voice data of the person to be authenticated through the camera and the receiver, and carries out identity identification authentication based on a plurality of directions of face identification, image comparison, mouth shape identification and voice identification, thereby ensuring the authentication accuracy and preventing illegal persons from easily cracking.
Drawings
Fig. 1 is a schematic flow chart of a multi-azimuth identity authentication method according to an embodiment of the present invention.
Detailed Description
The present invention will be further illustrated with reference to the accompanying drawings and specific examples, which are carried out on the premise of the technical solution of the present invention, and it should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention.
As shown in fig. 1, an embodiment of the present invention provides a multi-directional identity authentication system, which includes a terminal and a server, where after entering an identity authentication process, the server randomly generates a segment of authentication information and sends the authentication information and preset guidance information to the terminal. Specifically, the authentication information may be one or several complete sentences, and the meaning expressed by these sentences may be completely unrelated to the service itself, or may be multiple words without regularity.
The terminal comprises an information output component, after receiving the authentication information and the guide information sent by the server, the terminal controls the information output component to output the authentication information and the guide information so that the person to be authenticated can know the content of the authentication information, and the guide information comprises a guide for the person to be authenticated to read the authentication information. Specifically, the information output unit may employ a display screen on which authentication information and guidance information are displayed in text. The guidance information may be "please read the following" or the like. The information output means may be a display screen and a speaker, the authentication information is preferably displayed on the display screen in a text form, and the guidance information is output by the speaker in a voice form.
The terminal further comprises a camera, and the camera is used for acquiring video data of the person to be authenticated after receiving the authentication information and the guide information sent by the server. The video data can be set time length which can meet the reading speed requirement of most people and ensure that the reading authentication information can be easily finished within the set time length.
The terminal also comprises a communication module which is used for receiving the authentication information and the guide information sent by the server and sending the video data to the server so as to meet the normal interaction function with the server.
After the server receives the video data, the server extracts a plurality of continuous frames of first image data from the video data and carries out face recognition authentication based on the first image data. The first image data can be generally extracted from the front part of the video data, at the moment, the person to be authenticated does not read authentication information, facial features, expressions and the like do not change too much, and the accuracy of face recognition authentication is improved.
After the face recognition authentication is passed, the server also extracts a plurality of second image data at set time intervals, compares the second image data with the first image data based on the face recognition, if the second image data is above a set threshold value, the person to be authenticated is ensured not to be replaced during the authentication, and then compares the image similarity between the second image data. If the similarity is above the set threshold, the person to be authenticated currently appearing in the video is probably a photo or a head model or the like. The second image data may be a plurality of sets extracted at intervals, preferably evenly covering the entire video.
The terminal of the embodiment of the invention also comprises a telephone receiver used for acquiring voice data when the person to be authenticated reads the authentication message, the communication module sends the voice data acquired by the telephone receiver to the server, the server compares the voice data with the authentication information based on voice recognition to judge whether the voice data is consistent with the authentication information, and if so, an identity recognition authentication passing result is output. After the voice recognition judgment is combined, the accuracy of identity recognition and authentication can be further improved.
After receiving the video data and the voice data, the server can identify the mouth shape of the video data based on the mouth shape identification model, and synchronously compare the identification result with the voice data to judge whether the voice data is sent out by the personnel in the current video. Therefore, the accuracy of identity identification authentication can be further improved, and cracking caused by adopting a human head model and the like is avoided.
Based on the above embodiments, the present invention further provides a multi-directional identity authentication method, which is applied in a system composed of a terminal and a server, and includes:
after entering the identity identification authentication process, the server randomly generates a section of authentication information and sends the authentication information and preset guide information to the terminal. Specifically, the authentication information may be one or several complete sentences, and the meaning expressed by these sentences may be completely unrelated to the service itself, or may be multiple words without regularity.
The terminal receives the authentication information and the guide information sent by the server through the communication module, and after receiving the authentication information and the guide information sent by the server, the terminal controls the information output part to output the authentication information and the guide information so that the person to be authenticated can know the content of the authentication information, and the guide information comprises the function of guiding the person to be authenticated to read the authentication information. Specifically, the information output unit may employ a display screen on which authentication information and guidance information are displayed in text. The guidance information may be "please read the following" or the like. The information output means may be a display screen and a speaker, the authentication information is preferably displayed on the display screen in a text form, and the guidance information is output by the speaker in a voice form.
After receiving the authentication information and the guide information sent by the server, the terminal controls the camera of the terminal to acquire the video data of the person to be authenticated, and sends the video data to the server through the communication module. . The video data can be set time length which can meet the reading speed requirement of most people and ensure that the reading authentication information can be easily finished within the set time length.
After the server receives the video data, the server extracts a plurality of continuous frames of first image data from the video data, and carries out face recognition authentication based on the first image data. The first image data can be generally extracted from the front part of the video data, at the moment, the person to be authenticated does not read authentication information, facial features, expressions and the like do not change too much, and the accuracy of face recognition authentication is improved.
After the face recognition authentication is passed, the server also extracts a plurality of second image data at set time intervals, compares the second image data with the first image data based on the face recognition, if the second image data is above a set threshold value, the person to be authenticated is ensured not to be replaced during the authentication, and then compares the image similarity between the second image data. If the similarity is above the set threshold, the person to be authenticated currently appearing in the video is probably a photo or a head model or the like. The second image data may be a plurality of sets extracted at intervals, preferably evenly covering the entire video.
According to the embodiment of the invention, voice data of a person to be authenticated when reading the authentication message is acquired through the telephone receiver of the terminal, the communication module sends the voice data acquired by the telephone receiver to the server, and the server compares the voice data with the authentication information based on voice recognition to judge whether the voice data is consistent with the authentication information or not, and if so, an identity recognition authentication passing result is output. After the voice recognition judgment is combined, the accuracy of identity recognition and authentication can be further improved.
After receiving the video data and the voice data, the server can also recognize the mouth shape of the video data based on the mouth shape recognition model, and synchronously compares the recognition result with the voice data to judge whether the voice data is sent by the personnel in the current video. Therefore, the accuracy of identity identification authentication can be further improved, and cracking caused by adopting a human head model and the like is avoided.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that other parts not specifically described are within the prior art or common general knowledge to those of ordinary skill in the art. Without departing from the principle of the invention, several improvements and modifications can be made, and these improvements and modifications should also be construed as the scope of the invention.