CN110543584B - Method, device, processing server and storage medium for establishing face index - Google Patents
Method, device, processing server and storage medium for establishing face index Download PDFInfo
- Publication number
- CN110543584B CN110543584B CN201810534101.5A CN201810534101A CN110543584B CN 110543584 B CN110543584 B CN 110543584B CN 201810534101 A CN201810534101 A CN 201810534101A CN 110543584 B CN110543584 B CN 110543584B
- Authority
- CN
- China
- Prior art keywords
- video
- face
- cluster
- features
- index
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Abstract
Description
技术领域technical field
本发明涉及数据处理技术领域,具体涉及一种建立人脸索引的方法、装置、处理服务器及存储介质。The invention relates to the technical field of data processing, in particular to a method, device, processing server and storage medium for establishing a face index.
背景技术Background technique
人脸索引表示的是视频中的人脸特征与视频信息的关联,通过为视频建立人脸索引,可为在视频中查询目标人物等场景时,高效的提供视频中与目标人物关联的视频信息(如视频中目标人物出现的视频时间点、视频进度等);基于人脸索引的特性,人脸索引在视频点播、安防等领域得到了广泛的应用。The face index represents the association between the face features in the video and the video information. By establishing a face index for the video, it can efficiently provide the video information associated with the target person in the video when querying scenes such as the target person in the video. (such as the video time point when the target person appears in the video, the video progress, etc.); based on the characteristics of the face index, the face index has been widely used in video on demand, security and other fields.
目前人脸索引的建立过程一般需要用户提前进行人脸信息的标记;然而,视频中的人物往往众多,基于用户标记人脸信息的方式来建立人脸索引,将存在极为繁琐的用户标记人脸信息的工作,并且标记人脸信息的难度也极大,这无疑导致人脸索引的建立效率极低。At present, the establishment process of the face index generally requires the user to mark the face information in advance; however, there are often many people in the video, and the establishment of the face index based on the way the user marks the face information will be extremely cumbersome. The work of information, and the difficulty of marking face information is also extremely difficult, which undoubtedly leads to the extremely low efficiency of face index establishment.
发明内容Contents of the invention
有鉴于此,本发明实施例提供一种建立人脸索引的方法、装置、处理服务器及存储介质,以在无人脸信息标记的情况下,高效的建立人脸索引。In view of this, the embodiments of the present invention provide a method, device, processing server, and storage medium for establishing a face index, so as to efficiently establish a face index when there is no face information tag.
为实现上述目的,本发明实施例提供如下技术方案:In order to achieve the above purpose, embodiments of the present invention provide the following technical solutions:
一种建立人脸索引的方法,包括:A method for establishing a face index, comprising:
获取至少一条视频;Get at least one video;
分别确定各条视频相应的人脸特征,并确定各人脸特征在相应视频的视频信息;Respectively determine the corresponding facial features of each video, and determine the video information of each facial feature in the corresponding video;
将同一人脸的人脸特征进行聚类处理,得到至少一个第一聚类;其中,一个第一聚类所聚集的人脸特征表示,所述至少一条视频中同一人脸的人脸特征;The facial features of the same face are clustered to obtain at least one first cluster; wherein, the facial features gathered by a first cluster represent the facial features of the same face in the at least one video;
针对各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到所述至少一条视频的人脸索引。For each face feature gathered by each first cluster, associate the face feature with the video information of the corresponding video, and obtain the face index of the at least one video.
本发明实施例还提供一种建立人脸索引的装置,包括:The embodiment of the present invention also provides a device for establishing a face index, including:
视频获取模块,用于获取至少一条视频;A video acquisition module, configured to acquire at least one video;
人脸特征及视频信息确定模块,用于分别确定各条视频相应的人脸特征,并确定各人脸特征在相应视频的视频信息;Facial features and video information determining module, used to respectively determine the corresponding facial features of each video, and determine the video information of each facial feature in the corresponding video;
第一聚类得到模块,用于将同一人脸的人脸特征进行聚类处理,得到至少一个第一聚类;其中,一个第一聚类所聚集的人脸特征表示,所述至少一条视频中同一人脸的人脸特征;The first clustering obtaining module is used to cluster the facial features of the same face to obtain at least one first cluster; wherein, the facial features gathered by one first cluster indicate that the at least one video Facial features of the same face in the
人脸索引建立模块,用于针对各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到所述至少一条视频的人脸索引。The face index establishment module is used for associating the video information of the face feature in the corresponding video with respect to each face feature gathered by each first cluster, so as to obtain the face index of the at least one video.
本发明实施例还提供一种处理服务器,包括:至少一个存储器和至少一个处理芯片,所述存储器存储有程序,所述处理芯片调用所述程序,以实现上述所述的建立人脸索引的方法的步骤。An embodiment of the present invention also provides a processing server, including: at least one memory and at least one processing chip, the memory stores a program, and the processing chip calls the program to implement the above-mentioned method for establishing a face index A step of.
本发明实施例还提供一种存储介质,所述存储介质存储有适于处理芯片执行的程序,以实现上述所述的建立人脸索引的方法的步骤。The embodiment of the present invention also provides a storage medium, the storage medium stores a program suitable for execution by the processing chip, so as to realize the steps of the above-mentioned method for establishing a face index.
基于上述技术方案,本发明实施例提供的建立人脸索引的方法,可以对至少一条视频,进行同一人脸的人脸特征的聚类处理,得到适于所述至少一条视频相应的第一聚类;进而将各第一聚类所聚集的各人脸特征,进行人脸特征在相应视频的视频信息的关联,可在无人脸信息标记的情况下,实现高效的建立适于所述至少一条视频的人脸索引。Based on the above technical solution, the method for establishing a face index provided by the embodiment of the present invention can perform clustering processing of the face features of the same face on at least one video, and obtain the corresponding first clustering method suitable for the at least one video. Classes; and then each of the face features gathered by each first cluster is associated with the video information of the corresponding video, so that it can be efficiently established in the case of no face information mark. Face index of a video.
进一步,通过将所述至少一条视频中,同一人脸的人脸特征进行聚类,可以认为一个第一聚类中聚集的人脸特征表示的是同一人脸,可以减少人脸索引库中的人脸数量,可极大的减小人脸索引的数据冗余。Further, by clustering the face features of the same face in the at least one video, it can be considered that the face features gathered in a first cluster represent the same face, which can reduce the number of faces in the face index library. The number of faces can greatly reduce the data redundancy of the face index.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only It is an embodiment of the present invention, and those skilled in the art can also obtain other drawings according to the provided drawings without creative work.
图1为传统的基于用户标记人脸信息,来建立人脸索引的示意图;Fig. 1 is a traditional schematic diagram of establishing a face index based on user-marked face information;
图2为本发明实施例提供的建立人脸索引的系统的架构示意图;FIG. 2 is a schematic structural diagram of a system for establishing a face index provided by an embodiment of the present invention;
图3为本发明实施例提供的建立人脸索引的方法的信令流程图;FIG. 3 is a signaling flow chart of a method for establishing a face index provided by an embodiment of the present invention;
图4为多条视频情况下,建立人脸索引的处理示例图;Fig. 4 is under the situation of a plurality of videos, establishes the processing example diagram of face index;
图5为本发明实施例提供的建立人脸索引的方法的另一信令流程图;Fig. 5 is another signaling flowchart of the method for establishing a face index provided by an embodiment of the present invention;
图6为一条视频情况下,建立人脸索引的处理示例图;Fig. 6 is an example diagram of processing of establishing a face index in the case of a video;
图7为本发明实施例提供的将同一人脸的人脸特征进行聚类处理的方法流程图;7 is a flow chart of a method for clustering facial features of the same face provided by an embodiment of the present invention;
图8为本发明实施例提供的将同一人脸的人脸特征进行聚类处理的另一方法流程图;FIG. 8 is a flow chart of another method for clustering facial features of the same face provided by an embodiment of the present invention;
图9为将人脸特征加入到聚类的示意图;Fig. 9 is a schematic diagram of adding face features to clustering;
图10为删除次要人脸的人脸特征的聚类的示意图;Fig. 10 is a schematic diagram of clustering of facial features for deleting secondary faces;
图11为本发明实施例提供的应用示例图;Fig. 11 is an application example diagram provided by an embodiment of the present invention;
图12为本发明实施例提供的建立人脸索引的装置的结构框图;FIG. 12 is a structural block diagram of a device for establishing a face index provided by an embodiment of the present invention;
图13为本发明实施例提供的处理服务器的结构框图。Fig. 13 is a structural block diagram of a processing server provided by an embodiment of the present invention.
具体实施方式Detailed ways
图1为传统的基于用户标记人脸信息,来建立人脸索引的示意图,如图1所示,在建立人脸索引时,用户需要标记人物的基础信息(如人物名称、性别等)、人脸图像(如人物的正脸图像、侧脸图像等)等人脸信息,从而服务器将标记的人脸信息作为注册的输入,通过注册处理,确定出视频中与该人脸图像的人脸特征相应的视频信息,并关联该视频信息与该人物的基础信息和人脸特征,建立出该人物的人脸索引;Figure 1 is a traditional schematic diagram of building a face index based on the user's marked face information. face image (such as a person's front face image, side face image, etc.), so that the server uses the marked face information as an input for registration, and through the registration process, determines the facial features of the face image in the video Corresponding video information, and associating the video information with the basic information and face features of the person, to establish the face index of the person;
可以看出,传统建立人脸索引的方式,在视频(特别对于大规模视频数据)中的人物较多时,将存在极为繁琐的用户标记人脸信息的工作,并且标记人脸信息的难度也极大,存在人脸索引的建立效率极低的问题;It can be seen that when there are many people in the video (especially for large-scale video data) in the traditional way of building a face index, there will be extremely cumbersome work for users to mark face information, and the difficulty of marking face information is also extremely high. Large, there is a problem that the establishment efficiency of the face index is extremely low;
进一步,对于没有标记人脸信息但存在于视频中的人物,传统的建立人脸索引的方式是,将视频中没有标记人脸信息的人脸图像作为独立的输入来进行注册处理,这使得人脸索引的数据极为冗余。Furthermore, for people who are not marked with face information but exist in the video, the traditional way of building a face index is to use the face image without marked face information in the video as an independent input for registration processing, which makes people The data of the face index is extremely redundant.
为解决上述缺陷,本发明实施例提供一种在无人脸信息标记的情况下,进行高效的人脸索引建立的方案;下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to solve the above-mentioned defects, the embodiment of the present invention provides a scheme for establishing an efficient face index in the case of no face information tag; The technical solution is clearly and completely described, and obviously, the described embodiments are only some embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
图2为本发明实施例提供的建立人脸索引的系统的一种可选架构示意图,如图2所示,该系统可以包括:视频源10,处理服务器20,人脸索引库30。FIG. 2 is a schematic diagram of an optional architecture of a system for establishing a face index provided by an embodiment of the present invention. As shown in FIG. 2 , the system may include: a
其中,视频源10可以认为是视频的源头,视本发明实施例应用场景而定,可选的,视频可以是以流式数据表示,如视频流;Wherein, the
作为一种示例说明,对于直播场景而言,视频源可以是视频直播服务器,本发明实施例可对视频直播服务器输出的直播视频进行人脸索引建立;对于点播场景而言,视频源可以是点播视频库(点播库中记录有多个视频),点播视频库可以在用户点播视频时,提供用户所点播的视频,而在本发明实施例中,也可对点播视频库中记录的视频进行人脸索引的建立;当然,视频源还可以是电影视频库、电视视频库等,具体不再一一说明;显然,视频源可以是上述描述的视频源形式的至少一个(一个或多个)。As an example, for a live broadcast scene, the video source may be a live video server, and the embodiment of the present invention may establish a face index for the live video output by the live video server; for an on-demand scene, the video source may be an on-demand Video library (multiple videos are recorded in the on-demand library), the video-on-demand library can provide the video requested by the user when the user orders the video, and in the embodiment of the present invention, the video recorded in the video-on-demand library can also be processed manually. The establishment of face index; Of course, the video source can also be a movie video library, a TV video library, etc., which will not be explained one by one; obviously, the video source can be at least one (one or more) of the video source forms described above.
处理服务器20为对视频源10提供的视频建立人脸索引的服务设备,是本发明实施例建立人脸索引的主要处理设备;处理服务器可以由单一服务器实现,也可以由多台服务器组成的服务器群组实现。The
人脸索引库30为记录本发明实施例建立的人脸索引的数据库。The
作为一种系统结构变形,人脸索引库也可以由处理服务器中的存储单元实现,如可由处理服务器中具备数据存储能力的存储设备,记录本发明实施例所建立的人脸索引。As a variant of the system structure, the face index database can also be implemented by a storage unit in the processing server, for example, a storage device with data storage capability in the processing server can record the face index established by the embodiment of the present invention.
为实现在无人脸信息标记的情况下,高效的建立人脸索引,本发明实施例建立人脸索引的核心流程可以如下:In order to realize efficient establishment of a face index in the absence of face information marking, the core process of establishing a face index in the embodiment of the present invention may be as follows:
处理服务器获取视频源发送的用于建立人脸索引的至少一条视频;针对所述至少一条视频,处理服务器可分别确定各条视频相应的人脸特征,并确定出各人脸特征在相应视频的视频信息(如各人脸特征在所属视频的视频时间点,和/或视频进度等);从而,处理服务器可将同一人脸的人脸特征进行聚类处理,得到至少一个第一聚类(为便于描述,本发明实施例可将所述至少一条视频中同一人脸的人脸特征的一个聚类,定义为一个第一聚类;即一个第一聚类所聚集的人脸特征可以表示,所述至少一条视频中同一人脸的人脸特征);进而,针对各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到适于所述至少一条视频的人脸索引。The processing server obtains at least one video for establishing a face index sent by the video source; for the at least one video, the processing server can determine the corresponding facial features of each video, and determine that each facial feature is in the corresponding video. Video information (such as each face feature at the video time point of the video to which it belongs, and/or video progress, etc.); thus, the processing server can cluster the face features of the same face to obtain at least one first cluster ( For ease of description, in this embodiment of the present invention, a cluster of facial features of the same face in the at least one video can be defined as a first cluster; that is, the facial features gathered by a first cluster can represent , the face feature of the same face in the at least one video); and then, for each face feature gathered by each first cluster, associate the face feature in the video information of the corresponding video, and obtain the video information suitable for the at least one video. Video face index.
基于上述核心流程,处理服务器可在无人脸信息标记的情况下,对获取的至少一条视频进行同一人脸的人脸特征的聚类处理,得到至少一个第一聚类,从而对各第一聚类聚集的各人脸特征,进行人脸特征在相应视频的视频信息的关联,得到所述至少一条视频的人脸索引;实现在无人脸信息标记的情况下,高效的建立人脸索引。Based on the above-mentioned core process, the processing server can perform clustering processing of facial features of the same face on at least one acquired video without any face information mark, to obtain at least one first cluster, so that each first Each facial feature of clustering gathers, carries out the correlation of facial feature in the video information of corresponding video, obtains the human face index of described at least one video; Realize under the situation of no face information mark, efficiently establish human face index .
基于上述核心流程,本发明实施例可对一条视频进行人脸索引的建立,也可对多条视频进行人脸索引的建立,下面将分别对这两种情况进行说明。Based on the above core process, the embodiment of the present invention can establish a face index for one video, and can also establish a face index for multiple videos. The two cases will be described below.
可选的,图3示出了在对多条视频建立人脸索引的情况下,本发明实施例提供的建立人脸索引的一种可选信令流程;值得注意的是,图3所示流程仅是本发明实施例在多条视频的情况下,建立人脸索引的一种可选流程,基于上述描述的核心流程,本发明实施例也可在多条视频的情况下,以其他方法流程进行人脸索引的建立;Optionally, Figure 3 shows an optional signaling process for establishing a face index provided by an embodiment of the present invention in the case of establishing a face index for multiple videos; it is worth noting that, as shown in Figure 3 The process is only an optional process for establishing a face index in the case of multiple videos in the embodiment of the present invention. Based on the core process described above, the embodiment of the present invention can also use other methods in the case of multiple videos. Process to establish face index;
参照图3,在基于多条视频进行人脸索引建立时,本发明实施例提供的流程可以包括:Referring to Fig. 3, when the face index is established based on multiple videos, the process provided by the embodiment of the present invention may include:
步骤S10、视频源向处理服务器输入多条视频。Step S10, the video source inputs multiple videos to the processing server.
此处所指的多条视频可以是视频源所具有的多条视频,可以认为是本发明实施例用于建立人脸索引的多条视频,即本发明实施例可为该多条视频综合的建立人脸索引;The multiple videos referred to here can be the multiple videos that the video source has, and can be considered as the multiple videos used to establish the face index in the embodiment of the present invention, that is, the embodiment of the present invention can be the integrated video of the multiple videos. Create a face index;
作为一种示例,视频源可以随机的向处理服务器输入多条视频,如视频源随机的将视频源具有的多条视频输入给处理服务器,这个过程中,处理服务器可对视频源输入的多条视频,综合的建立人脸索引;可选的,视频源也可以受指定(如受处理服务器指定,或者工作人员的指定),向处理服务器输入指定的多条视频;As an example, the video source can randomly input multiple videos to the processing server. For example, the video source randomly inputs multiple videos that the video source has to the processing server. Video, comprehensively builds a face index; optionally, the video source can also be specified (such as specified by the processing server, or specified by a staff member), and input multiple specified videos to the processing server;
可选的,在本发明实施例中,视频源所具有的每条视频可以具有唯一的标识(如视频ID);Optionally, in the embodiment of the present invention, each video that the video source has may have a unique identifier (such as a video ID);
可选的,视频源向处理服务器输入的任一条视频可以是流形式的(即视频流),相应的,视频源可以向处理服务器输入多条视频流;显然,视频以流形式存在仅是可选的,本发明实施例也可支持对其他形式的视频进行人脸索引建立,如处理服务器可从视频源获取完整的多条视频后,对该完整的多条视频进行人脸索引建立。Optionally, any video input from the video source to the processing server can be in the form of a stream (i.e. video stream), and correspondingly, the video source can input multiple video streams to the processing server; obviously, the existence of video in the form of a stream is only possible Optionally, the embodiments of the present invention may also support face index building for other forms of videos. For example, after the processing server acquires multiple complete videos from the video source, it may perform face index building on the complete multiple videos.
相应的,处理服务器可以获取到多条视频。Correspondingly, the processing server can obtain multiple videos.
步骤S11、处理服务器分别确定各条视频相应的人脸特征,并确定各人脸特征在相应视频的视频信息。Step S11, the processing server respectively determines the corresponding facial features of each video, and determines the video information of each facial feature in the corresponding video.
处理服务器可对获取的各条视频,分别确定相应的人脸特征,并确定所确定的各人脸特征在相应视频的视频信息(如人脸特征在相应视频的视频时间点,和/或视频进度等);The processing server can respectively determine corresponding facial features for each piece of video obtained, and determine the video information of each determined facial feature in the corresponding video (such as the video time point of the corresponding video, and/or video progress, etc.);
作为一种示例,对于处理服务器获取的任一条视频,处理服务器可对视频中的关键视频帧提取人脸特征,以此对处理服务器获取的各条视频进行处理,确定出各条视频相应的人脸特征;同时,对于所确定的各条视频相应的人脸特征,确定各人脸特征相应的关键视频帧在相应视频的视频信息(如各人脸特征相应的关键视频帧,在所属视频的视频时间点和/或视频进度);As an example, for any video obtained by the processing server, the processing server can extract facial features from key video frames in the video, so as to process each video obtained by the processing server, and determine the corresponding person in each video. Face feature; Simultaneously, for the corresponding human face feature of each determined video, determine the corresponding key video frame of each human face feature in the video information of the corresponding video (as the corresponding key video frame of each human face feature, in the belonging video video timing and/or video progress);
例如,对于处理服务器获取的某一条视频,处理服务器可提取该视频中的关键视频帧的人脸特征,并确定各人脸特征的关键视频帧在该视频的视频信息。For example, for a certain video acquired by the processing server, the processing server may extract the facial features of the key video frames in the video, and determine the video information of the key video frames of each facial feature in the video.
作为另一种示例,对于处理服务器获取的任一条视频,处理服务器可按照设定时间间隔,对视频进行截图,对各截图提取人脸特征,以此对处理服务器获取的各条视频进行处理,确定出各条视频相应的人脸特征;同时,确定各人脸特征相应的截图在相应视频的视频信息(如各人脸特征相应的截图,在相应视频中的视频时间点和/或视频进度);As another example, for any video obtained by the processing server, the processing server may take screenshots of the video according to a set time interval, and extract facial features from each screenshot, so as to process each video obtained by the processing server, Determine the corresponding face features of each video; at the same time, determine the corresponding screenshots of each face feature in the video information of the corresponding video (such as the corresponding screenshot of each face feature, the video time point and/or video progress in the corresponding video );
例如,对于处理服务器获取的某一条视频,处理服务器可按照设定时间间隔,对该视频进行截图,提取各截图的人脸特征,并确定各人脸特征的截图在该视频的视频信息。For example, for a certain video acquired by the processing server, the processing server may take screenshots of the video according to a set time interval, extract the facial features of each screenshot, and determine the video information of the screenshot of each facial feature in the video.
可选的,上述示出的两种确定各条视频相应的人脸特征的方式可以结合使用,也可以择一使用;当然,本发明实施例也可对获取的各条视频的各视频帧均进行人脸特征提取,确定出各条视频相应的人脸特征;同时,确定各人脸特征相应的视频帧在相应视频的视频信息。Optionally, the above two methods of determining the corresponding facial features of each video can be used in combination, or one can be used; of course, the embodiment of the present invention can also be used for each video frame of each video acquired Perform facial feature extraction to determine the corresponding facial features of each video; at the same time, determine the video information of the video frame corresponding to each facial feature in the corresponding video.
可选的,人脸特征在相应视频的视频信息还可以包括:人脸特征相应的视频的标识(如视频ID)。Optionally, the video information of the face feature in the corresponding video may also include: an identifier (such as a video ID) of the video corresponding to the face feature.
步骤S12、处理服务器分别针对各条视频相应的人脸特征,将同一人脸的人脸特征进行聚类处理,以分别得到各条视频相应的至少一个第二聚类。Step S12 , the processing server clusters the facial features of the same face with respect to the corresponding facial features of each video, so as to obtain at least one second cluster corresponding to each video.
在确定各条视频相应的人脸特征后,对于任一条视频,本发明实施例可将该视频中同一人脸的人脸特征聚为一类,得到该视频相应的第二聚类(即一个视频的一个第二聚类可以表示,该视频中同一人脸的人脸特征的一个聚类);以此对各条视频均进行处理,则可得到各条视频相应的第二聚类;After determining the corresponding facial features of each video, for any video, the embodiment of the present invention can cluster the facial features of the same human face in the video into one category to obtain the corresponding second cluster of the video (i.e. a A second clustering of video can represent, a clustering of the face feature of same face in this video); Each video is all processed with this, then can obtain the corresponding second clustering of each video;
可以理解的是,一个第一聚类指的是处理服务器用于创建人脸索引的至少一条视频中,同一人脸的人脸特征的一个聚类;而第二聚类指的是处理服务器在获取多条视频的情况下,一条视频中同一人脸的人脸特征的一个聚类。It can be understood that a first cluster refers to a cluster of facial features of the same face in at least one video used by the processing server to create a face index; In the case of obtaining multiple videos, a cluster of facial features of the same face in one video.
对于任一条视频,本发明实施例可分析该视频的人脸特征间的相似度,将该视频中相似度符合预定相似度要求的人脸特征聚为一类,实现对该视频中同一人脸的人脸特征的聚类处理;For any video, the embodiment of the present invention can analyze the similarity between the facial features of the video, and group the facial features whose similarity meets the predetermined similarity requirements in the video into one category, so as to realize the same human face in the video Clustering processing of facial features;
可选的,预定相似度要求可以根据实际情况而定,本发明实施例并不限制,作为一种可选实现,人脸特征可以使用人脸特征向量表示(如使用高纬人脸特征向量表示人脸特征),人脸特征间的相似度可以使用人脸特征向量间的距离表示,可以设置表示同一人脸的人脸特征相应的预定向量距离,将人脸特征向量间的距离在预定向量距离内的人脸特征聚为一类。Optionally, the predetermined similarity requirement can be determined according to the actual situation, and the embodiment of the present invention is not limited. As an optional implementation, the facial features can be represented by human facial feature vectors (such as using high-dimensional facial feature vector representations face features), the similarity between face features can be represented by the distance between face feature vectors, and the corresponding predetermined vector distance can be set to represent the face features of the same face, and the distance between face feature vectors can be set in the predetermined vector The face features within a distance are clustered into one class.
进一步,一条视频中往往存在主要人脸和次要人脸,对于任一条视频,本发明实施例可确定该视频中各主要人脸的人脸特征的聚类(一条视频中主要人脸的数量可能是一个或多个),得到该视频相应的第二聚类;Further, there are often main human faces and secondary human faces in a video, and for any video, the embodiment of the present invention can determine the clustering of the facial features of each main human face in the video (the number of main human faces in a video may be one or more), to obtain the corresponding second cluster of the video;
相应的,在分别针对各条视频相应的人脸特征,将同一人脸的人脸特征进行聚类处理时,对于任一条视频,本发明实施例可在将该视频中同一人脸的人脸特征聚类后,将该视频的聚类中次要人脸相应的聚类进行删除,仅保留该视频的聚类中主要人脸相应的聚类,得到该视频相应的第二聚类;Correspondingly, when the face features of the same face are clustered for the corresponding face features of each video, for any video, the embodiment of the present invention can cluster the faces of the same face in the video. After feature clustering, delete the cluster corresponding to the secondary face in the cluster of the video, and only keep the cluster corresponding to the main face in the cluster of the video to obtain the corresponding second cluster of the video;
作为一种可选实现,对于任一条视频,本发明实施例可将该视频中同一人脸的人脸特征聚类后,进一步将该视频的聚类中人脸特征出现次数小于次数阈值的聚类进行删除(即删除该视频的聚类中次要人脸的聚类),得到该视频相应的第二聚类。As an optional implementation, for any video, the embodiment of the present invention can cluster the facial features of the same face in the video, and then further cluster the video with the number of occurrences of the facial features in the cluster less than the number threshold. Class is deleted (that is, the cluster of the secondary face in the cluster of the video is deleted), and the corresponding second cluster of the video is obtained.
步骤S13、处理服务器分别针对各条视频相应的各第二聚类所聚集的人脸特征,关联人脸特征在相应视频的视频信息,得到各条视频的聚类结果。Step S13 , the processing server correlates the facial features gathered by the second clusters corresponding to each video with the video information of the corresponding video, and obtains the clustering results of each video.
在分别针对各条视频,确定相应的第二聚类,得到各条视频相应的至少一个第二聚类后,本发明实施例可分别针对各条视频相应的各第二聚类所聚集的人脸特征,进行人脸特征在相应视频的视频信息的关联;如对于任一个第二聚类(一个第二聚类表示,一条视频中同一人脸的人脸特征的聚类),本发明实施例可对该第二聚类所聚集的各人脸特征,关联人脸特征在所属视频的视频时间点和/或视频进度;以此对每一条视频相应的第二聚类进行处理,可得到各条视频的聚类结果;After determining the corresponding second clusters for each video and obtaining at least one second cluster corresponding to each video, the embodiment of the present invention can respectively focus on the people gathered by the second clusters corresponding to each video. Face feature, carries out the association of face feature in the video information of corresponding video; As for any second clustering (a second clustering represents, the clustering of the face feature of same face in a video), the present invention implements For example, each face feature gathered by the second cluster can be associated with the video time point and/or video progress of the video to which the face feature belongs; in this way, the corresponding second cluster of each video can be processed to obtain The clustering results of each video;
可以看出,一条视频的聚类结果可以至少包括:聚集该视频中同一人脸的人脸特征的至少一个第二聚类,及各第二聚类所聚集的各人脸特征在该视频的视频信息。It can be seen that the clustering result of a video may at least include: at least one second cluster that gathers the facial features of the same face in the video, and each facial feature gathered by each second cluster in the video. video information.
需要说明的是,步骤S12和步骤S13仅是本发明实施例获取各条视频的聚类结果的一种可选方式,本发明实施例也可使用其他方式获取各条视频的聚类结果。It should be noted that step S12 and step S13 are only an optional method for obtaining the clustering result of each video in the embodiment of the present invention, and the embodiment of the present invention may also use other methods to obtain the clustering result of each video.
步骤S14、处理服务器针对各条视频的聚类结果,将同一人脸的人脸特征进行聚类处理,得到所述多条视频相应的至少一个第一聚类。Step S14 , the processing server clusters the facial features of the same face for the clustering results of each video, and obtains at least one first cluster corresponding to the plurality of videos.
在得到各条视频的聚类结果后,可以针对各条视频,分别明确同一人脸的人脸特征的第二聚类;在此基础上,可针对各条视频的第二聚类,将同一人脸的人脸特征进行聚类处理,确定出适于所述多条视频的同一人脸的人脸特征的聚类,得到所述多条视频相应的第一聚类(即所述至少一条视频为多条视频的情况下,该多条视频中同一人脸的人脸特征的聚类)。After obtaining the clustering results of each video, the second clustering of the facial features of the same face can be clearly defined for each video; The facial features of the human face are clustered, and the clustering of the facial features of the same human face suitable for the multiple videos is determined to obtain the corresponding first cluster of the multiple videos (that is, the at least one When the video is multiple videos, the clustering of the facial features of the same face in the multiple videos).
步骤S15、针对各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到所述至少一条视频的人脸索引。Step S15 , for each facial feature gathered by each first cluster, associate the facial feature with the video information of the corresponding video, and obtain the face index of the at least one video.
在得到所述多条视频相应的第一聚类后,针对各第一聚类所聚集的各人脸特征,可将人脸特征在相应视频的视频信息进行关联,得到所述多条视频的人脸索引。After obtaining the corresponding first clusters of the multiple videos, for each facial feature gathered by each first cluster, the facial features can be associated with the video information of the corresponding video to obtain the facial features of the multiple videos. face index.
步骤S16、处理服务器将所述人脸索引写入人脸索引库。Step S16, the processing server writes the face index into the face index library.
本发明实施例提供的建立人脸索引的方法,可以在多条视频的情况下,分别针对各条视频,进行同一人脸的人脸特征的聚类处理,得到各条视频相应的第二聚类,通过将各条视频相应的各第二聚类所聚集的人脸特征,进行人脸特征在相应视频的视频信息的关联,可得到各条视频的聚类结果;从而可进一步将各条视频的聚类结果中,同一人脸的人脸特征再次进行聚类处理,得到适于所述多条视频相应的第一聚类;进而将各第一聚类所聚集的各人脸特征,进行人脸特征在相应视频的视频信息的关联,可在无人脸信息标记的情况下,实现高效的建立适于所述至少一条视频的人脸索引。The method for establishing a face index provided by the embodiment of the present invention can perform clustering processing of the facial features of the same face for each video in the case of multiple videos, and obtain the corresponding second clustering of each video. Class, by the face feature that each video corresponding second cluster gathers, carries out the correlation of face feature in the video information of corresponding video, can obtain the clustering result of each video; Thereby each can be further In the clustering result of the video, the face features of the same face are clustered again to obtain the corresponding first clusters suitable for the plurality of videos; and then the face features gathered by each first cluster, By associating the face features with the video information of the corresponding video, an efficient establishment of a face index suitable for the at least one video can be realized in the absence of a face information tag.
为便于理解,以多条视频为两条视频,且分为第一视频和第二视频为例进行说明,假设第一视频和第二视频存在相同的人物A,第一视频存在与第二视频不同的人物B,第二视频存在与第一视频不同的人物C(可选的,人物A和B可以是第一视频中主要人脸的人物,人物A和C可以是第二视频中主要人脸的人物),相应的处理示例可如图4所示:For ease of understanding, multiple videos are divided into two videos and divided into the first video and the second video as an example. Assume that the first video and the second video have the same person A, and the first video has the same person A as the second video. Different characters B, there is a character C different from the first video in the second video (optional, characters A and B can be the characters of the main faces in the first video, and characters A and C can be the main people in the second video face), the corresponding processing example can be shown in Figure 4:
处理服务器获取第一视频后,可对第一视频中的关键视频帧和/或,间隔设定时间间隔的截图提取人脸特征,并对同一人脸的人脸特征进行聚类处理,得到第一视频中人物A的人脸特征的第二聚类FA,使用[FA1,FA2,…,FAn]表示,及人物A的各人脸特征在第一视频相应的时间点TA,使用[TA1,TA2,…,TAn]表示;得到第一视频中人物B的人脸特征的第二聚类FB,使用[FB1,FB2,…,FBn]表示,及人物B的各人脸特征在第一视频相应的时间点TB,使用[TB1,TB2,…,TBn]表示;After the processing server acquires the first video, it can extract facial features from key video frames in the first video and/or screenshots with a set time interval, and perform clustering processing on the facial features of the same face to obtain the first The second cluster FA of the face features of person A in a video is represented by [FA1, FA2, ..., FAn], and each face feature of person A is at the corresponding time point TA of the first video, using [TA1, TA2,...,TAn] represent; obtain the second clustering FB of the face feature of character B in the first video, use [FB1, FB2,...,FBn] to represent, and each face feature of character B in the first video The corresponding time point TB is represented by [TB1, TB2, ..., TBn];
并且,处理服务器获取第二视频后,可对第二视频中的关键视频帧和/或,间隔设定时间间隔的截图提取人脸特征,并对同一人脸的人脸特征进行聚类处理,得到第二视频中人物A的人脸特征的第二聚类FA’,使用[FA1’,FA2’,…,FAn’]表示,及人物A的各人脸特征在第二视频相应的时间点TA’,使用[TA1’,TA2’,…,TAn’]表示;得到第二视频中人物C的各人脸特征的第二聚类FC,使用[FC1,FC2,…,FCn]表示,及人物C的各人脸特征在第二视频相应的各时间点TC,使用[TC1,TC2,…,TCn]表示;And, after the processing server acquires the second video, it can extract facial features from key video frames in the second video and/or screenshots with a set time interval, and perform clustering processing on the facial features of the same human face, Obtain the second cluster FA' of the face features of person A in the second video, represented by [FA1', FA2',..., FAn'], and each face feature of person A at the corresponding time point of the second video TA', represented by [TA1', TA2',..., TAn']; the second clustering FC of each face feature of character C in the second video is obtained, represented by [FC1, FC2,..., FCn], and Each face feature of character C is represented by [TC1, TC2, ..., TCn] at each time point TC corresponding to the second video;
处理服务器将第一视频中人物A的各人脸特征聚为一类,人物B的各人脸特征聚为一类后,可针对第一视频中人物A的人脸特征的第二聚类的各人脸特征,关联在第一视频相应的各时间点,及所对应的第一视频的ID(第一视频的ID可使用VID1表示),得到第一视频中人物A的聚类结果,可使用[<FA1,VID1,TA1>,<FA2,VID1,TA2>…,<FAn,VID1,TAn>]表示;针对第一视频中人物B的人脸特征的第二聚类的各人脸特征,关联在第一视频相应的各时间点,及所对应的第一视频的ID,得到第一视频中人物B的聚类结果,可使用[<FB1,VID1,TB1>,<FB2,VID1,TB2>…,<FBn,VID1,TBn>]表示;After the processing server clusters the facial features of person A in the first video into one category, and the facial features of person B into one category, it can target the second clustering of the facial features of person A in the first video. Each face feature is associated with each time point corresponding to the first video, and the ID of the corresponding first video (the ID of the first video can be represented by VID1), and the clustering result of character A in the first video is obtained, which can be Use [<FA1, VID1, TA1>, <FA2, VID1, TA2>..., <FAn, VID1, TAn>] to represent; face features of the second cluster for the face features of person B in the first video , associated with each time point corresponding to the first video, and the corresponding ID of the first video, to obtain the clustering result of character B in the first video, you can use [<FB1, VID1, TB1>, <FB2, VID1, TB2>…, <FBn, VID1, TBn>] means;
并且,处理服务器将第二视频中人物A的各人脸特征聚为一类,人物C的各人脸特征聚为一类后,可针对第二视频中人物A的人脸特征的第二聚类的各人脸特征,关联在第二视频相应的各时间点,及所对应的第二视频的ID(第二视频的ID可使用VID2表示),得到第二视频中人物A的聚类结果,可使用[<FA1’,VID2,TA1’>,<FA2’,VID2,TA2’>…,<FAn’,VID2,TAn’>]表示;针对第二视频中人物C的人脸特征的第二聚类的各人脸特征,关联在第二视频相应的各时间点,及所对应的第二视频的ID,得到第二视频中人物C的聚类结果,可使用[<FC1,VID2,TC1>,<FC2,VID2,TC2>…,<FCn,VID2,TCn>]表示;And, after the processing server clusters the facial features of person A in the second video into one group, and after the facial features of character C are grouped into one group, the second clustering of the facial features of character A in the second video can be Each face feature of the class is associated with each time point corresponding to the second video, and the ID of the corresponding second video (the ID of the second video can be represented by VID2), and the clustering result of person A in the second video is obtained , can be represented by [<FA1', VID2, TA1'>, <FA2', VID2, TA2'>..., <FAn', VID2, TAn'>]; The face features of the two clusters are associated with the corresponding time points of the second video and the ID of the corresponding second video to obtain the clustering result of the character C in the second video, which can be used [<FC1, VID2, TC1>, <FC2, VID2, TC2>…, <FCn, VID2, TCn>] means;
处理服务器在得到第一视频中人物A的聚类结果、人物B的聚类结果,并且得到第二视频中人物A的聚类结果、人物C的聚类结果后,可将第一视频和第二视频中同一人脸的人脸特征进行聚类处理,即将第一视频中人物A的聚类结果中的人脸特征,和第二视频中人物A的聚类结果中的人脸特征进行聚类处理,从而得到适于第一视频和第二视频的人物A的第一聚类,使用[<FA1,FA2,…,FAn>,<FA1’,FA2’,…,FAn’>]表示;由于人物B和C为第一视频和第二视频中的不同人物,因此第一视频中人物B的第二聚类可以作为适于第一视频和第二视频的人物B的第一聚类,第二视频中人物C的第二聚类可以作为适于第一视频和第二视频的人物C的第一聚类;After the processing server obtains the clustering results of character A and character B in the first video, and obtains the clustering results of character A and character C in the second video, the processing server can combine the first video and the second video The face features of the same face in the second video are clustered, that is, the face features in the clustering result of person A in the first video are clustered with the face features in the clustering result of person A in the second video Class processing, thereby obtaining the first clustering of the person A suitable for the first video and the second video, using [<FA1, FA2, ..., FAn>, <FA1', FA2', ..., FAn'>] to represent; Since characters B and C are different characters in the first video and the second video, the second clustering of character B in the first video can be used as the first clustering of character B suitable for the first video and the second video, The second cluster of person C in the second video may serve as the first cluster of person C applicable to the first video and the second video;
进而,可将各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到适于第一视频和第二视频的人脸索引;即得到的人脸索引可以表示为:[<FA1,VID1,TA1>,<FA2,VID1,TA2>…,<FAn,VID1,TAn>,<FA1’,VID2,TA1’>,<FA2’,VID2,TA2’>…,<FAn’,VID2,TAn’>],[<FB1,VID1,TB1>,<FB2,VID1,TB2>…,<FBn,VID1,TBn>],[<FC1,VID2,TC1>,<FC2,VID2,TC2>…,<FCn,VID2,TCn>]。Furthermore, each face feature gathered by each first cluster can be associated with the video information of the corresponding video to obtain a face index suitable for the first video and the second video; that is, the obtained face index can be Expressed as: [<FA1, VID1, TA1>, <FA2, VID1, TA2>…, <FAn, VID1, TAn>, <FA1’, VID2, TA1’>, <FA2’, VID2, TA2’>…, <FAn', VID2, TAn'>], [<FB1, VID1, TB1>, <FB2, VID1, TB2>…, <FBn, VID1, TBn>], [<FC1, VID2, TC1>, <FC2, VID2, TC2>..., <FCn, VID2, TCn>].
可选的,上述第一视频中人物A的人脸特征的第二聚类FA,也可以使用[FA1,FA2,…,FAn]的均值表示,上述第一视频中人物B的人脸特征的第二聚类FB,也可以使用[FB1,FB2,…,FBn]的均值表示,上述第二视频中人物A的人脸特征的第二聚类FA’,也可以使用[FA1’,FA2’,…,FAn’]的均值表示,上述第二视频中人物C的人脸特征的第二聚类FC,也可以使用[FC1,FC2,…,FCn]的均值表示。Optionally, the second cluster FA of the face features of person A in the above-mentioned first video can also be represented by the mean value of [FA1, FA2, ..., FAn], and the face features of person B in the above-mentioned first video The second cluster FB can also be represented by the mean value of [FB1, FB2, ..., FBn], and the second cluster FA' of the facial features of person A in the second video can also be represented by [FA1', FA2' , ..., FAn'] means that the second cluster FC of the face features of person C in the second video above can also be represented by the mean of [FC1, FC2, ..., FCn].
上文示出了在多条视频情况下,建立人脸索引的方案;本发明实施例也可在一条视频的情况下,建立人脸索引;可选的,图5示出了本发明实施例提供的建立人脸索引的另一种可选信令流程;参照图5,在基于一条视频进行人脸索引建立时,本发明实施例提供的流程可以包括:The above shows the scheme of establishing a face index in the case of multiple videos; the embodiment of the present invention can also establish a face index in the case of a video; optionally, Figure 5 shows the embodiment of the present invention Another optional signaling process for establishing a face index is provided; referring to FIG. 5, when establishing a face index based on a video, the process provided by the embodiment of the present invention may include:
步骤S20、视频源向处理服务器输入视频。Step S20, the video source inputs video to the processing server.
此处所指的视频可以是视频源所具有的任意一条视频,可以认为是本发明实施例用于建立人脸索引的任意一条视频;The video referred to here can be any video that the video source has, and can be considered as any video used to establish a face index in the embodiment of the present invention;
作为一种示例,视频源可以逐一或随机的向处理服务器输入视频,如视频源逐一或随机的将视频源具有的视频输入给处理服务器,则这个过程中,处理服务器可对视频源输入的任意一条视频,逐一的建立人脸索引;可选的,视频源也可以受指定(如受处理服务器指定,或者工作人员的指定),向处理服务器输入指定的视频;As an example, the video source can input video to the processing server one by one or randomly, such as the video source can input the video that the video source has to the processing server one by one or randomly, then in this process, the processing server can input any video source. One video, build face index one by one; Optionally, the video source can also be specified (such as specified by the processing server, or specified by the staff), and input the specified video to the processing server;
可选的,在本发明实施例中,视频源所具有的每条视频可以具有唯一的标识(如视频ID);Optionally, in the embodiment of the present invention, each video that the video source has may have a unique identifier (such as a video ID);
可选的,视频也可以是以视频流形式存在。Optionally, the video may also exist in the form of a video stream.
相应的,处理服务器可以获取到视频。Correspondingly, the processing server can obtain the video.
步骤S21、处理服务器确定所述视频相应的人脸特征,并确定各人脸特征在所述视频相应的视频信息。Step S21, the processing server determines the facial features corresponding to the video, and determines the video information corresponding to each facial feature in the video.
可选的,对于一条视频,确定视频相应的人脸特征的方式可以参照图3步骤S11所示。Optionally, for a video, the manner of determining the corresponding facial features of the video may refer to step S11 in FIG. 3 .
步骤S22、处理服务器针对所述视频相应的各人脸特征,将同一人脸的人脸特征进行聚类处理,得到所述视频相应的第一聚类。Step S22 , the processing server clusters the facial features of the same face for each facial feature corresponding to the video to obtain a first cluster corresponding to the video.
可选的,对于一条视频,将视频相应的人脸特征中,同一人脸的人脸特征进行聚类处理的过程可以参照图3步骤S12所示;Optionally, for a video, the process of clustering the face features of the same face among the corresponding face features of the video can be shown in step S12 in FIG. 3 ;
这里需要注意的是,在多条视频的情况下,每条视频的同一人脸的人脸特征的聚类称为第二聚类,而该多条视频中各第二聚类的同一人脸的人脸特征聚类可称为适于该多条视频的第一聚类(本发明实施例所指的第一聚类在多条视频情况下的一种形式);It should be noted here that in the case of multiple videos, the clustering of the face features of the same face in each video is called the second cluster, and the same face of each second cluster in the multiple videos The face feature clustering can be referred to as the first clustering suitable for the multiple videos (a form of the first clustering referred to in the embodiment of the present invention in the case of multiple videos);
而在一条视频的情况下,视频中同一人脸的人脸特征的聚类可认为是适于该条视频的第一聚类(本发明实施例所指的第一聚类在一条视频情况下的一种形式)。And in the case of a video, the clustering of the facial features of the same face in the video can be considered as the first cluster suitable for the video (the first cluster referred to in the embodiment of the present invention is in the case of a video) a form of).
步骤S23、处理服务器针对所述视频相应的各聚类的各人脸特征,关联人脸特征在所述视频的视频信息,建立出所述视频的人脸索引。Step S23 , the processing server correlates the face features of each cluster corresponding to the video with the video information of the video, and establishes a face index of the video.
在将所述视频中同一人脸的人脸特征进行聚类处理,得到所述视频相应的各聚类后,本发明实施例可以对所述视频相应的各聚类的各人脸特征,关联人脸特征在所述视频的视频信息,得到所述视频相应的聚类结果;从而可在对一条视频建立人脸索引的情况下,将所述视频相应的聚类结果,作为所述视频的人脸特征。After clustering the facial features of the same face in the video to obtain the corresponding clusters of the video, the embodiment of the present invention can associate the facial features of the corresponding clusters of the video with The face feature is in the video information of the video, and the corresponding clustering result of the video is obtained; thus, when a face index is established for a video, the corresponding clustering result of the video can be used as the clustering result of the video facial features.
可见,在本发明实施例中,一条视频相应的聚类结果可以包括:该视频中同一人脸的人脸特征的各聚类,及各聚类的各人脸特征在该视频相应的视频信息。It can be seen that, in the embodiment of the present invention, the corresponding clustering result of a video may include: each cluster of the facial features of the same face in the video, and the corresponding video information of each facial feature of each cluster in the video .
可选的,一个聚类的人脸特征在相应视频的视频信息可以至少包括:该聚类的各人脸特征在相应视频的时间点;作为一种替代方式,一个聚类的人脸特征在相应视频的视频信息可以包括:该聚类的各人脸特征在相应视频的视频进度;显然,一个聚类的人脸特征在相应视频的视频信息也可以包括:该聚类的各人脸特征在相应视频的时间点,和/或视频进度;Optionally, the video information of a clustered face feature in the corresponding video may at least include: each face feature of the cluster at the time point of the corresponding video; as an alternative, a clustered face feature in The video information of the corresponding video may include: the video progress of each face feature of the cluster in the corresponding video; obviously, the video information of a cluster of face features in the corresponding video may also include: each face feature of the cluster at the point in time of the corresponding video, and/or the progress of the video;
进一步,视频信息也可以包括:视频的标识,如视频的ID等。Further, the video information may also include: a video identifier, such as a video ID.
作为一种假设示例,在一条视频的情况下,若以F表示人脸特征(如使用高纬人脸特征向量进行表示),T表示各人脸特征在视频中相应的时间点,若该视频存在一个主要人脸,则可得到该视频中主要人脸的人脸特征的第一聚类,使用[F11,F12,…,,F1n]表示;同时,可确定出该第一聚类的各人脸特征在视频相应的各时间点[T11,T12,…,T1n];对该第一聚类的各人脸特征,关联人脸特征在该视频相应的时间点后,在一条视频下,人脸索引的一种可选表示形式可以为:As a hypothetical example, in the case of a video, if F represents the facial feature (such as using a high-dimensional facial feature vector), T represents the corresponding time point of each facial feature in the video, if the video There is a main face, then the first clustering of the facial features of the main face in the video can be obtained, expressed by [F11, F12, ...,, F1n]; at the same time, each of the first clustering can be determined The face features are at the corresponding time points [T11, T12, ..., T1n] of the video; for each face feature of the first cluster, after the associated face features are at the corresponding time points of the video, under a video, An alternative representation of a face index could be:
[<F11,F12,…,,F1n>,<T11,T12,…,T1n>];其中,<F11,F12,…,,F1n>也可以使用人脸特征的均值表示;可选的,该人脸索引中还可以包括:该视频的标识;[<F11, F12, ...,, F1n>, <T11, T12, ..., T1n>]; where, <F11, F12, ...,, F1n> can also be represented by the mean value of face features; optional, the The face index may also include: the identification of the video;
当然,在一条视频中存在多个人脸(如多个主要人脸)时,上述人脸索引中,可以存在多个属于不同人脸的人脸特征的第一聚类,并在各第一聚类的各人脸特征关联在该视频相应的时间点。Of course, when there are multiple faces (such as multiple main faces) in a video, in the above-mentioned face index, there may be multiple first clusters of face features belonging to different faces, and each first cluster Each face feature of the class is associated with the corresponding time point in the video.
步骤S24、处理服务器将所述人脸索引,写入人脸索引库。Step S24, the processing server writes the face index into the face index database.
在对一条视频进行人脸索引建立的情况下,在得到该视频相应的聚类结果后,本发明实施例可通过该视频相应的聚类结果,实现该视频中同一人脸的人脸特征的聚类,并关联出各聚类的人脸特征在该视频相应的视频信息,实现无人脸信息标记情况下,对该一条视频中至少一个相同人物的人脸特征与视频信息进行关联,达成该视频的人脸索引的建立。In the case of building a face index for a video, after obtaining the corresponding clustering result of the video, the embodiment of the present invention can use the corresponding clustering result of the video to realize the facial features of the same face in the video. Clustering, and associating the face features of each cluster in the corresponding video information of the video, when no face information is marked, the face features of at least one identical person in the video are associated with the video information to achieve The establishment of the face index of the video.
并且通过将同一人脸的人脸特征进行聚类处理,可以认为同一聚类的人脸特征表示的是同一人脸,可以减少人脸索引库中的人脸数量,可极大的减小人脸索引的数据冗余。And by clustering the face features of the same face, it can be considered that the face features of the same cluster represent the same face, which can reduce the number of faces in the face index database, which can greatly reduce the number of faces. Data redundancy for face indexing.
为便于理解,在一条视频的情况下,相应处理过程可如图6所示,以人脸特征在该视频相应的视频点表示视频信息,参照图6,则将该视频中同一人脸的人脸特征进行聚类处理后,可得到多个聚类(即该视频相应的多个第一聚类);其中,聚类1可以是同一人脸的人脸特征<F11,F12,…,F1n>的聚类,并且将聚类1中各人脸特征在视频相应的视频时间点进行关联,可得到[<F11,T11>,<F12,T12>…,<F1n,T1n>],其中,<T 11,T12,…,T 1n>表示[F11,F12,…,F1n]在该视频分别相应的时间点;For ease of understanding, in the case of a video, the corresponding processing process can be shown in Figure 6. The video information is represented by the face feature at the corresponding video point of the video. Referring to Figure 6, the person with the same face in the video is After the face features are clustered, a plurality of clusters (that is, multiple first clusters corresponding to the video) can be obtained; wherein,
聚类2可以是另一个人脸的人脸特征[F21,F22,…,,F2n]的聚类,将聚类2中各人脸特征在视频相应的视频时间点进行关联,可得到[<F21,T21>,<F22,T22>…,<F2n,T2n>],其中,<T21,T22,…,T2n>表示[F21,F22,…,F2n]在该视频分别相应的时间点。
上述描述了基于多条视频情况下的人脸索引建立,及一条视频情况下的人脸索引建立;综合来说,本发明实施例是针对需要建立人脸索引的至少一条视频,将同一人脸的人脸特征进行聚类处理,得到适于所述至少一条视频的第一聚类,并关联各第一聚类的人脸特征在相应视频的视频信息,建立出适于所述至少一条视频的人脸索引;指的注意的是,所述至少一条视频可以是多条视频(如上述图3所示的处理情况),也可以是一条视频(如上述图5所示的处理情况);The above describes the establishment of face index based on multiple videos, and the establishment of face index in the case of one video; generally speaking, the embodiment of the present invention aims at at least one video for which a face index needs to be established, and the same face The facial features of each first cluster are clustered to obtain the first cluster suitable for the at least one video, and the facial features of the first clusters are associated with the video information of the corresponding video to establish a cluster suitable for the at least one video. The face index; Refer to note that said at least one video can be a plurality of videos (the processing situation shown in Figure 3 above), or a video (the processing situation shown in Figure 5 above);
当然,除上述图3和图5所示处理情况外,本发明实施例也可在多条视频的情况下,将该多条视频中同一人脸的人脸特征进行聚类处理,而并不分别针对各条视频,将各条视频中同一人脸的人脸特征进行聚类处理。Of course, in addition to the above-mentioned processing situations shown in FIG. 3 and FIG. 5 , in the case of multiple videos, the embodiment of the present invention can also cluster the facial features of the same face in the multiple videos without For each video, the facial features of the same face in each video are clustered.
可选的,在将视频的人脸特征中,相同的人脸特征进行聚类处理的实现过程中,本发明实施例可设置预定相似度要求,通过分析人脸特征间的相似度,将相似度符合预定相似度要求的人脸特征聚为一类,实现对视频的人脸特征中同一人脸的人脸特征的聚类处理。Optionally, in the implementation process of clustering the same facial features among the facial features of the video, the embodiment of the present invention can set a predetermined similarity requirement, and by analyzing the similarity between facial features, similar The face features whose degrees meet the predetermined similarity requirements are clustered into one class, and the clustering processing of the face features of the same face in the face features of the video is realized.
可选的,本发明实施例也可按照视频的播放时间顺序(特别是视频为视频流的情况),从视频中提取人脸特征;此时,视频的人脸特征是按照播放时间顺序先后提取的,因此本发明实施例也可按照人脸特征的提取先后顺序,进行同一人脸的人脸特征的聚类处理,即对于先提取的人脸特征,本发明实施例可先执行聚类处理,对于后提取的人脸特征,本发明实施例可后执行聚类处理;Optionally, the embodiment of the present invention can also extract facial features from the video according to the playing time sequence of the video (especially when the video is a video stream); at this time, the facial features of the video are extracted successively according to the playing time order Therefore, in the embodiment of the present invention, the clustering processing of the facial features of the same face can also be performed according to the sequence of extraction of facial features, that is, for the facial features extracted first, the embodiment of the present invention can perform clustering processing first , for the later extracted facial features, the embodiment of the present invention can perform clustering processing later;
可选的,图7示出了本发明实施例提供的将同一人脸的人脸特征进行聚类处理的方法流程图,该方法可以适用于前文所述的任一将同一人脸的人脸特征进行聚类处理的阶段,如图3步骤S12、图3步骤S14、图5步骤S22等阶段;参照图7,该方法可以包括:Optionally, FIG. 7 shows a flow chart of a method for clustering facial features of the same face provided by an embodiment of the present invention. The stage of clustering processing of features, such as steps S12 in Figure 3, step S14 in Figure 3, and step S22 in Figure 5; with reference to Figure 7, the method may include:
步骤S30、对于任一待聚类人脸特征,检测已得到的聚类中,是否存在与所述待聚类人脸特征的相似度符合预定相似度要求的目标聚类,若是,执行步骤S31,若否,执行步骤S32。Step S30, for any face feature to be clustered, detect whether there is a target cluster whose similarity with the face feature to be clustered meets the predetermined similarity requirement among the obtained clusters, if so, execute step S31 , if not, go to step S32.
在本发明实施例中,人脸特征可以使用人脸特征向量表示;由于人脸特征是按照播放时间顺序从视频中先后提取的,因此对于提取到的一人脸特征,处理服务器可能在之前已将与该人脸特征相似的其他人脸特征进行过聚类,也可能没有进行过聚类;In the embodiment of the present invention, the face features can be represented by face feature vectors; since the face features are extracted from the video in sequence according to the playing time, the processing server may have previously Other facial features similar to this facial feature have been clustered, or may not have been clustered;
可选的,本发明实施例可对于任一待聚类人脸特征(如当前从任一条视频中提取的人脸特征,或者,在多条视频情况下,任一条视频的第二聚类中的任一人脸特征),检测已得到的聚类中,是否存在与该待聚类人脸特征的相似度符合预定相似度要求的目标聚类,以判断与该待聚类人脸特征属于同一人脸的其他人脸特征是否已被聚类。Optionally, in this embodiment of the present invention, for any face feature to be clustered (such as the face feature currently extracted from any video, or, in the case of multiple videos, any video in the second cluster Any face feature of the face feature), detect whether there is a target cluster whose similarity with the face feature to be clustered meets the predetermined similarity requirements in the obtained clusters, so as to judge whether the face feature to be clustered belongs to the same Whether other facial features of the face have been clustered.
步骤S31、将所述待聚类人脸特征聚集到所述目标聚类。Step S31. Gather the face features to be clustered into the target clusters.
在检测到已得到的聚类中,存在与该待聚类人脸特征的相似度符合预定相似度要求的目标聚类时,可认为与该待聚类人脸特征属于同一人脸的其他人脸特征已被聚类,可将该待聚类人脸特征聚集到该目标聚类。When it is detected that among the obtained clusters, there is a target cluster whose similarity with the face feature to be clustered meets the predetermined similarity requirements, it can be considered that other people belonging to the same face as the face feature to be clustered The face features have been clustered, and the face features to be clustered can be aggregated into the target cluster.
步骤S32、设置新的聚类,将所述待聚类人脸特征聚集到该新的聚类。Step S32, setting a new cluster, and gathering the face features to be clustered into the new cluster.
在检测到已得到的聚类中,不存在与该待聚类人脸特征的相似度符合预定相似度要求的目标聚类时,可认为该待聚类人脸特征属于新的人脸,可将该待聚类人脸特征独立为一个聚类,即通过设置新的聚类,将该待聚类人脸特征聚集到该新的聚类中。When it is detected that among the obtained clusters, there is no target cluster whose similarity with the face feature to be clustered meets the predetermined similarity requirements, it can be considered that the face feature to be clustered belongs to a new face. Separate the face features to be clustered into a cluster, that is, set a new cluster and gather the face features to be clustered into the new cluster.
本发明实施例可通过人脸特征的人脸特征向量的距离,来表示人脸特征间的相似度;可选的,图8示出了本发明实施例提供的将同一人脸的人脸特征进行聚类处理的另一方法流程,图8所示流程可以认为是图7所示流程的一种细化,参照图8,该流程可以包括:In this embodiment of the present invention, the similarity between human face features can be represented by the distance of the face feature vectors of human face features; optionally, FIG. Another method flow for clustering processing, the flow shown in Figure 8 can be considered as a refinement of the flow shown in Figure 7, referring to Figure 8, the flow can include:
步骤S40、对于任一待聚类人脸特征,检测在所述待聚类人脸特征的人脸特征向量的预定向量距离内,是否存在已得到的聚类,若是,执行步骤S41,若否,执行步骤S43。Step S40, for any face feature to be clustered, detect whether there is a cluster obtained within the predetermined vector distance of the face feature vector of the face feature to be clustered, if yes, execute step S41, if not , execute step S43.
步骤S41、判断在该已得到的聚类中加入所述待聚类人脸特征后,该已得到的聚类相应的半径是否大于半径阈值,若否,执行步骤S42,若是,执行步骤S43。Step S41. After adding the face feature to be clustered into the obtained cluster, whether the corresponding radius of the obtained cluster is greater than the radius threshold, if not, execute step S42, and if yes, execute step S43.
步骤S42、确定所述已得到的聚类为目标聚类,将所述待聚类人脸特征聚集到所述目标聚类。Step S42. Determine the obtained cluster as the target cluster, and gather the face features to be clustered into the target cluster.
步骤S43、设置新的聚类,将所述待聚类人脸特征聚集到该新的聚类。Step S43, setting a new cluster, and gathering the face features to be clustered into the new cluster.
可以看出,若在待聚类人脸特征相应的人脸特征向量的预定向量距离内,存在已得到的聚类,并且在该已得到的聚类中加入所述待聚类人脸特征后,所述已得到的聚类的半径不大于半径阈值,则可确定所述已得到聚类为所述目标聚类,可将所述待聚类人脸特征聚集到所述目标聚类;It can be seen that if within the predetermined vector distance of the face feature vector corresponding to the face feature to be clustered, there is an obtained cluster, and after adding the face feature to be clustered in the obtained cluster, , the radius of the obtained cluster is not greater than the radius threshold, then it can be determined that the obtained cluster is the target cluster, and the facial features to be clustered can be gathered into the target cluster;
而若在待聚类人脸特征相应的人脸特征向量的预定向量距离内,不存在已得到的聚类,则可确定所述待聚类人脸特征属于新的人脸,可设置新的聚类,将所述待聚类人脸特征聚集到该新的聚类;And if within the predetermined vector distance of the corresponding face feature vector of the face feature to be clustered, there is no clustering obtained, then it can be determined that the face feature to be clustered belongs to a new face, and a new Clustering, gathering the face features to be clustered into this new cluster;
若在待聚类人脸特征相应的人脸特征向量的预定向量距离内,存在已得到的聚类,但在已得到的聚类中加入所述待聚类人脸特征后,所述已得到的聚类的半径大于半径阈值,则确定所述已得到的聚类所聚类的人脸特征,与所述待聚类人脸特征不属于同一人脸,可设置新的聚类,将所述待聚类人脸特征聚集到该新的聚类。If within the predetermined vector distance of the face feature vector corresponding to the face features to be clustered, there are already obtained clusters, but after adding the face features to be clustered in the obtained clusters, the obtained If the radius of the clustering is greater than the radius threshold, then it is determined that the face features clustered by the obtained clusters do not belong to the same face as the face features to be clustered, a new cluster can be set, and all The face features to be clustered are gathered into this new cluster.
进一步,在将待聚类人脸特征聚集到一个聚类后(可能是已得到的目标聚类,也可能是新的聚类),可以更新所述待聚类人脸特征聚集到的聚类相应的半径及质心;进一步,还可对于每一聚类设置相应的视频信息列表(如视频时间点列表、视频进度列表等),从而在所述待聚类人脸特征聚类到某一聚类后,在所述待聚类人脸特征聚集到的聚类相应的视频信息列表中插入,所述待聚类人脸特征在相应视频的视频信息。Further, after the face features to be clustered are gathered into a cluster (may be the obtained target cluster, or a new cluster), the cluster to which the face features to be clustered are gathered can be updated Corresponding radius and centroid; Further, can also set corresponding video information list (as video time point list, video progress list etc.) After the clustering, it is inserted into the video information list corresponding to the cluster where the face features to be clustered are gathered, and the face features to be clustered are included in the video information of the corresponding video.
相应的,在对于任一待聚类人脸特征,检测在所述待聚类人脸特征的人脸特征向量的预定向量距离内,是否存在已得到的聚类时,可判断距所述待聚类人脸特征的人脸特征向量的距离为ε的近邻里,是否存在已得到的聚类;Correspondingly, for any face feature to be clustered, when detecting whether there is an obtained cluster within the predetermined vector distance of the face feature vector of the face feature to be clustered, it can be judged that the distance from the face feature vector to be clustered is Whether the face feature vector distance of the clustered face feature is the nearest neighbor of ε, whether there is a cluster obtained;
作为一种示例,以对一条视频中同一人脸的人脸特征进行聚类为例,则可按照该视频的视频帧顺序提取人脸特征,所提取的人脸特征成为待聚类人脸特征,可按照该视频的视频时间点<T1,T2,…,Tn>,提取到同一人脸的人脸特征<F1,F2,…,,Fn>;As an example, taking the clustering of the facial features of the same face in a video as an example, the facial features can be extracted according to the video frame sequence of the video, and the extracted facial features become the facial features to be clustered , according to the video time point <T1, T2, ..., Tn> of the video, the facial features <F1, F2, ...,, Fn> of the same face can be extracted;
对于提取到的任一人脸特征,可判断距该人脸特征的人脸特征向量的距离为ε的近邻里是否存在聚类;若存在聚类,则可尝试调整聚类,确认加入该人脸特征后,聚类的半径更新是否超过最大半径阈值:若超过最大半径阈值,则将此人脸特征置为新的聚类,在该聚类相应的视频时间点列表中插入人脸特征在视频的视频时间点;否则,可在聚类中加入该人脸特征后,更新聚类的质心和半径,同时在该聚类相应的视频时间点列表,更新入该人脸特征在视频的视频时间点,相应的示意可如图9所示。For any extracted face feature, it can be judged whether there is a cluster in the neighborhood whose distance from the face feature vector of the face feature is ε; if there is a cluster, you can try to adjust the cluster and confirm to add the face After the feature, whether the update of the radius of the cluster exceeds the maximum radius threshold: if it exceeds the maximum radius threshold, set this face feature as a new cluster, and insert the face feature in the video time point list corresponding to the cluster. Otherwise, after adding the face feature to the cluster, update the centroid and radius of the cluster, and at the same time list the corresponding video time points of the cluster, update the video time of the face feature in the video point, the corresponding diagram can be shown in Figure 9.
这里需要说明的是,聚类其实就是按照某个特定标准(如距离准则)把一个数据集分割成不同的类或簇,使得同一个簇内的数据的相似性尽可能大,同时不在同一个簇中的数据对象的差异性也尽可能地大。即聚类后同一类的数据尽可能聚集到一起,不同数据尽量分离;在上述描述的图9相应的示意中,可设置经验半径数值ε,以ε为半径形成一个圆,在圆内的数据就认为是相似的,而聚类相应的质心就是圆心,当某一聚类中聚集的数据发生改变时,该聚类相应的质心和半径则相应更新。What needs to be explained here is that clustering is actually to divide a data set into different classes or clusters according to a certain standard (such as distance criterion), so that the similarity of the data in the same cluster is as large as possible, while not in the same cluster. The diversity of data objects in the cluster is also as large as possible. That is, after clustering, the data of the same class are gathered together as much as possible, and the different data are separated as much as possible; in the corresponding diagram of Figure 9 described above, the empirical radius value ε can be set, and a circle is formed with ε as the radius, and the data in the circle It is considered to be similar, and the corresponding centroid of the cluster is the center of the circle. When the data gathered in a cluster changes, the corresponding centroid and radius of the cluster will be updated accordingly.
进一步需要说明的是,如果视频是以视频流形式输入到处理服务器,则处理服务器可通过建立视频索引链的方式,区分多条视频的视频流;可选的,处理服务器在获取到一个视频流时,可通过视频流的标识,判断该视频流的视频索引链是否存在,一个视频索引链可记录该视频中聚集的不同人脸的人脸特征的聚类(若在多条视频下,则是第二聚类的形式);若是,则可针对该视频流,按照视频帧顺序(视频播放时间顺序的一种形式)提取该视频流的人脸特征,并确定提取的人脸特征在该视频的视频索引链中相应的聚类(如图7或图8方式实现);It should be further noted that if the video is input to the processing server in the form of a video stream, the processing server can distinguish the video streams of multiple videos by establishing a video index chain; At this time, it can be judged whether the video index chain of the video stream exists through the identification of the video stream, and a video index chain can record the clustering of the facial features of different faces gathered in the video (if under multiple videos, then is the form of the second clustering); if so, then for the video stream, the facial features of the video stream can be extracted according to the sequence of video frames (a form of video playback time order), and it is determined that the extracted facial features are in the Corresponding clustering in the video index chain of video (as shown in Fig. 7 or Fig. 8 mode realization);
若否,则可以创建该视频流的视频索引链,按照视频帧顺序提取该视频流的人脸特征,并确定提取的人脸特征在所创建的该视频的视频索引链中相应的聚类(如图7或图8方式实现)。If not, then the video index chain of this video stream can be created, extract the face features of this video stream according to the video frame sequence, and determine the corresponding clustering ( as shown in Figure 7 or Figure 8).
可选的,视频索引链可以键值形式存在,如一个视频的视频索引链可以该视频的标识为主键,以该视频中同一人脸的人脸特征的各聚类为主键关联的值。Optionally, the video index chain can exist in key-value form. For example, the video index chain of a video can be the identification of the video as the primary key, and the values associated with each cluster of facial features of the same face in the video as the primary key.
可选的,对于上述所述的将任一条视频中,同一人脸的人脸特征进行聚类处理,得到同一人脸特征的聚类的阶段(如图3所示步骤S12、图5所示步骤S22等阶段),本发明实施例可设置任一条视频相应的人脸特征的聚类,为视频中主要人脸的人脸特征的聚类;Optionally, for any of the above-mentioned videos, the face features of the same face are clustered to obtain the clustering stage of the same face feature (step S12 shown in Figure 3, shown in Figure 5 Step S22 and other stages), the embodiment of the present invention can set the clustering of the corresponding facial features of any video, which is the clustering of the facial features of the main faces in the video;
可选的,对于任一条视频,在得到该视频中同一人脸的人脸特征的至少一个聚类后,可将该至少一个聚类中人脸特征出现次数小于次数阈值的聚类进行删除;Optionally, for any video, after at least one cluster of facial features of the same face in the video is obtained, the clusters whose appearance times of facial features in the at least one cluster are less than the number threshold can be deleted;
如在多条视频的情况下,得到各条视频相应的第二聚类的处理时,对于任一条视频,在得到该视频中同一人脸的人脸特征的至少一个聚类后,可将该至少一个聚类中人脸特征出现次数小于次数阈值的聚类进行删除,得到该视频相应的至少一个第二聚类;As in the case of multiple videos, when obtaining the corresponding second clustering process of each video, for any video, after obtaining at least one cluster of the facial features of the same face in the video, the Deleting at least one cluster whose number of occurrences of face features in at least one cluster is less than the number threshold, to obtain at least one second cluster corresponding to the video;
如在一条视频的情况下,得到该视频相应的第一聚类的处理时,在得到该视频中同一人脸的人脸特征的至少一个聚类后,可将人脸特征出现次数小于次数阈值的聚类进行删除,得到该视频相应的至少一个第一聚类。For example, in the case of a video, when obtaining the processing of the corresponding first cluster of the video, after obtaining at least one cluster of the facial features of the same face in the video, the number of occurrences of the facial features can be less than the number of times threshold The clusters are deleted to obtain at least one first cluster corresponding to the video.
可选的,人脸特征在相应视频的视频信息可以包括:人脸特征在相应视频的视频时间点;可选的,对于任一条视频,本发明实施例可通过图10所示方法删除该视频的人脸特征的聚类中,次要人脸的人脸特征的聚类;参照图10,该方法可以包括:Optionally, the video information of the face feature in the corresponding video may include: the video time point of the face feature in the corresponding video; optionally, for any video, the embodiment of the present invention can delete the video by the method shown in FIG. 10 In the clustering of facial features, the clustering of the facial features of secondary faces; with reference to Figure 10, the method may include:
步骤S50、对于任一条视频,在得到该视频中同一人脸的人脸特征的至少一个聚类后,确定该视频的各聚类的人脸特征相应的视频时间点分布。Step S50, for any video, after obtaining at least one cluster of facial features of the same face in the video, determine the distribution of video time points corresponding to the facial features of each cluster in the video.
为了区分主要人脸和次要人脸,在对任一条视频,将视频中同一人脸的人脸特征进行聚类后,可对每一聚类所聚集的人脸特征在相应视频的视频时间点,确定视频时间点分布,即确定出一条视频中,各聚类的人脸特征在该视频相应的视频时间点的分布。In order to distinguish between the main face and the secondary face, after clustering the face features of the same face in the video for any video, the face features gathered by each cluster at the video time of the corresponding video can be Points to determine the distribution of video time points, that is, to determine the distribution of face features of each cluster at the corresponding video time points in a video.
步骤S51、删除视频时间点分布不处于至少一个设定时长间隔的聚类。Step S51 , deleting the clusters whose distribution of video time points does not fall within at least one set duration interval.
可选的,本发明实施例可设置至少一个设定时长间隔,如n个时长为t的时间间隔;从而可对任一条视频,判断该视频的各聚类的人脸特征相应的视频时间点分布,是否在多于该n个时长为t的时间间隔中,若否,则可确定该聚类的人脸特征属于视频中的次要人脸,可进行删除,若是,则可确定该聚类的人脸特征属于视频中的主要人脸,可进行保留。Optionally, the embodiment of the present invention can set at least one set time interval, such as n time intervals whose duration is t; thus, for any video, the video time point corresponding to the facial features of each cluster of the video can be judged Distribution, whether it is in more than the n time intervals of t, if not, it can be determined that the face feature of the cluster belongs to the secondary face in the video, and can be deleted, if so, it can be determined that the cluster The face features of the class belong to the main faces in the video and can be preserved.
上文从不同的角度描述了在无人脸信息标记的情况下,对一条或多条视频建立人脸索引的方案,通过将一条或多条视频中,同一人脸的人脸特征进行聚类处理,得到适于该一条或多条视频的第一聚类;并对各第一聚类聚集的各人脸特征,进行人脸特征在相应视频的视频信息的关联,可实现高效的人脸索引建立;进一步,通过各第一聚类,可对该一条或多条视频中同一人脸的人脸特征进行聚集,可以减少人脸索引库中的人脸数量,可极大的减小人脸索引的数据冗余。The above describes the scheme of building a face index for one or more videos without face information marking from different angles, by clustering the face features of the same face in one or more videos Processing to obtain the first cluster suitable for the one or more videos; Index establishment; further, through each first clustering, can gather the face features of the same face in this one or more videos, can reduce the number of faces in the face index storehouse, can greatly reduce the number of faces Data redundancy for face indexing.
在建立出人脸索引后,本发明实施例可支持用户在检索视频时,通过输入目标人物的人脸图像,来检索出具有该目标人物的人脸的视频;作为一种应用示例,本发明实施例的应用过程可如图11所示,参照图11,该过程可以包括:After the face index is established, the embodiment of the present invention can support the user to retrieve the video with the face of the target person by inputting the face image of the target person when retrieving the video; as an application example, the present invention The application process of the embodiment can be shown in Figure 11, referring to Figure 11, the process can include:
步骤S60、终端向处理服务器发送视频检索请求,所述视频检索请求携带有目标人脸图像。Step S60, the terminal sends a video retrieval request to the processing server, and the video retrieval request carries a target face image.
用户在需要搜索具有目标人物的视频时,可向终端输入目标人物的目标人脸图像,终端可相应的向处理服务器发送视频检索请求,并在视频检索请求中携带目标人脸图像;When the user needs to search for a video with a target person, he can input the target face image of the target person to the terminal, and the terminal can correspondingly send a video retrieval request to the processing server, and carry the target face image in the video retrieval request;
作为一种示例,用户可在终端装载的视频客户端的页面,导入目标人脸图像,用户点击页面的搜索按钮后,终端可通过视频客户端向处理服务器发送携带目标人脸图像的视频检索请求。As an example, the user can import the target face image on the page of the video client loaded on the terminal, and after the user clicks the search button on the page, the terminal can send a video retrieval request carrying the target face image to the processing server through the video client.
步骤S61、处理服务器提取所述目标人脸图像的目标人脸特征。Step S61, the processing server extracts the target face features of the target face image.
处理服务器获取所述视频检索请求后,可解析得到所述视频检索请求中携带的目标人脸图像,并提取所述目标人脸图像的人脸特征(称为目标人脸特征)。After the processing server obtains the video retrieval request, it can parse to obtain the target facial image carried in the video retrieval request, and extract facial features of the target facial image (referred to as target facial features).
步骤S62、处理服务器从人脸索引中,确定所述目标人脸特征关联的相应视频的视频信息。Step S62, the processing server determines the video information of the corresponding video associated with the target face feature from the face index.
处理服务器在建立人脸索引后,基于人脸索引中各第一聚类所聚集的人脸特征,关联的人脸特征在相应视频的视频信息,处理服务器可确定所述目标人脸特征关联的相应视频的视频信息(包括目标人脸特征相应的各视频,及目标人脸特征在相应的各视频中的视频时间点等)。After the processing server establishes the face index, based on the face features gathered by each first cluster in the face index, the associated face features are in the video information of the corresponding video, and the processing server can determine the associated face features of the target. The video information of the corresponding video (including each video corresponding to the target face feature, and the video time point of the target face feature in each corresponding video, etc.).
可选的,作为一种实现,处理服务器可基于目标人脸特征,对人脸索引中的各第一聚类的质心进行k近邻查询,得到目标人脸特征所在的第一聚类,并确定目标人脸特征所在的第一聚类中聚集的各人脸特征相应的各视频的视频ID,及在各视频中的视频时间点。Optionally, as an implementation, the processing server may perform a k-nearest neighbor query on the centroids of each first cluster in the face index based on the target face feature, obtain the first cluster where the target face feature is located, and determine The video IDs of the videos corresponding to the facial features gathered in the first cluster where the target facial feature is located, and the video time points in each video.
步骤S63、处理服务器向终端发送所述视频信息。Step S63, the processing server sends the video information to the terminal.
处理服务器在确定所述目标人脸特征关联的相应视频的视频信息后,可输出所确定的视频信息,即向终端发送所述视频信息。After determining the video information of the corresponding video associated with the target face feature, the processing server may output the determined video information, that is, send the video information to the terminal.
步骤S64、终端展示所述视频信息。Step S64, the terminal displays the video information.
可选的,终端可在视频客户端的搜索结果页面,展示所述视频信息,如展示目标人脸特征所相应的视频,及在所相应的视频的视频时间点等。Optionally, the terminal may display the video information on the search result page of the video client, such as displaying the video corresponding to the target face feature, and the video time point of the corresponding video.
上述给出的应用示例是以用户使用视频客户端进行视频检索进行说明,当然用户也可使用浏览器等浏览组件登录视频网站,通过视频网站进行视频检索。The application example given above is explained by using a video client to perform video retrieval by the user. Of course, the user can also use a browsing component such as a browser to log in to a video website and perform video retrieval through the video website.
下面对本发明实施例提供的建立人脸索引的装置进行介绍;下文描述的建立人脸索引的装置可以认为是,处理服务器为实现本发明实施例提供的建立人脸索引的方法所需设置的程序模块。下文描述的建立人脸索引的装置的内容,可与上文描述的建立人脸索引的方法的内容相互对应参照。The device for establishing the face index provided by the embodiment of the present invention is introduced below; the device for establishing the face index described below can be considered as the program that the processing server needs to set for realizing the method for establishing the face index provided by the embodiment of the present invention module. The content of the apparatus for establishing a face index described below may be referred to in correspondence with the content of the method for establishing a face index described above.
图12为本发明实施例提供的建立人脸索引的装置的结构框图,该装置可应用于处理服务器,参照图12,该装置可以包括:Fig. 12 is a structural block diagram of a device for establishing a face index provided by an embodiment of the present invention. The device can be applied to a processing server. Referring to Fig. 12, the device may include:
视频获取模块100,用于获取至少一条视频;A
人脸特征及视频信息确定模块200,用于分别确定各条视频相应的人脸特征,并确定各人脸特征在相应视频的视频信息;Facial features and video
第一聚类得到模块300,用于将同一人脸的人脸特征进行聚类处理,得到至少一个第一聚类;其中,一个第一聚类所聚集的人脸特征表示,所述至少一条视频中同一人脸的人脸特征;The first
人脸索引建立模块400,用于针对各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到所述至少一条视频的人脸索引。The face
可选的,在多条视频的情况下,第一聚类得到模块300,用于将同一人脸的人脸特征进行聚类处理,得到至少一个第一聚类,具体包括:Optionally, in the case of multiple videos, the first
获取各条视频的聚类结果,其中,一条视频的聚类结果至少包括:聚集该视频中同一人脸的人脸特征的至少一个第二聚类,及各第二聚类所聚集的各人脸特征在该视频的视频信息;Obtain the clustering results of each video, wherein the clustering results of a video at least include: at least one second cluster that gathers the facial features of the same face in the video, and each person gathered by each second cluster The video information of the face feature in the video;
针对各条视频的聚类结果,将同一人脸的人脸特征进行聚类处理,得到所述多条视频相应的至少一个第一聚类。For the clustering results of each video, the facial features of the same face are clustered to obtain at least one first cluster corresponding to the plurality of videos.
可选的,在多条视频的情况下,第一聚类得到模块300,用于获取各条视频的聚类结果,具体包括:Optionally, in the case of multiple videos, the
分别针对各条视频相应的人脸特征,将同一人脸的人脸特征进行聚类处理,以分别得到各条视频相应的至少一个第二聚类;For the corresponding facial features of each video, the facial features of the same face are clustered to obtain at least one second cluster corresponding to each video;
分别针对各条视频相应的各第二聚类所聚集的人脸特征,关联人脸特征在相应视频的视频信息,得到各条视频的聚类结果。For the face features collected by the second clusters corresponding to each video, correlate the face features with the video information of the corresponding video, and obtain the clustering results of each video.
可选的,在多条视频的情况下,第一聚类得到模块300,用于分别得到各条视频相应的至少一个第二聚类,具体包括:Optionally, in the case of multiple videos, the first
对于任一条视频,在得到该视频中同一人脸的人脸特征的至少一个聚类后,将该至少一个聚类中人脸特征出现次数小于次数阈值的聚类进行删除,得到该视频相应的至少一个第二聚类。For any video, after obtaining at least one cluster of the facial features of the same face in the video, delete the clusters whose facial features in the at least one cluster appear less than the number of times threshold, and obtain the corresponding video At least one second cluster.
可选的,所述人脸特征在相应视频的视频信息至少包括:人脸特征在相应视频的视频时间点;在多条视频的情况下,第一聚类得到模块300,用于对于任一条视频,在得到该视频中同一人脸的人脸特征的至少一个聚类后,将该至少一个聚类中人脸特征出现次数小于次数阈值的聚类进行删除,得到该视频相应的至少一个第二聚类,具体包括:Optionally, the video information of the corresponding video of the facial feature at least includes: the video time point of the corresponding video of the facial feature; in the case of multiple videos, the
对于任一条视频,在得到该视频中同一人脸的人脸特征的至少一个聚类后,确定该视频的各聚类的人脸特征相应的视频时间点分布;For any piece of video, after obtaining at least one cluster of the facial features of the same face in the video, determine the corresponding video time point distribution of the facial features of each cluster of the video;
删除视频时间点分布不处于至少一个设定时长间隔的聚类,得到该视频相应的至少一个第二聚类。Deleting the clusters whose video time point distribution is not at least one set duration interval, and obtaining at least one second cluster corresponding to the video.
可选的,在多条视频的情况下,第一聚类得到模块300,用于分别针对各条视频相应的人脸特征,将同一人脸的人脸特征进行聚类处理,具体包括:Optionally, in the case of multiple videos, the
若获取的任一条视频存在视频索引链,对于从该视频中提取的人脸特征,确定该人脸特征在该视频的视频索引链中相应的第二聚类;其中,一个视频索引链记录该视频中聚集的不同人脸的人脸特征的各第二聚类;If there is a video index chain in any video obtained, for the face feature extracted from the video, determine the corresponding second clustering of the face feature in the video index chain of the video; wherein, a video index chain records the each second clustering of facial features of different faces clustered in the video;
若获取的任一条视频不存在视频索引链,创建该视频相应的视频索引链,对于从该视频中提取的人脸特征,确定提取的人脸特征在所创建的该视频的视频索引链中相应的第二聚类。If there is no video index chain for any video obtained, create a video index chain corresponding to the video, and for the facial features extracted from the video, make sure that the extracted facial features correspond to the created video index chain the second cluster of .
可选的,对于任一待聚类人脸特征,执行所述将同一人脸的人脸特征进行聚类处理的过程,具体包括:Optionally, for any face feature to be clustered, perform the process of clustering the face features of the same face, specifically including:
检测已得到的聚类中,是否存在与所述待聚类人脸特征的相似度符合预定相似度要求的目标聚类;Detecting whether there is a target cluster whose similarity with the face feature to be clustered meets the predetermined similarity requirement among the obtained clusters;
若存在所述目标聚类,将所述待聚类人脸特征聚集到所述目标聚类;If the target cluster exists, gathering the face features to be clustered into the target cluster;
若不存在所述目标聚类,设置新的聚类,将所述待聚类人脸特征聚集到该新的聚类。If the target cluster does not exist, a new cluster is set, and the face features to be clustered are gathered into the new cluster.
可选的,上述执行检测已得到的聚类中,是否存在与所述待聚类人脸特征的相似度符合预定相似度要求的目标聚类的过程,可以具体包括:Optionally, the above-mentioned process of detecting whether there is a target cluster whose similarity with the face feature to be clustered meets the predetermined similarity requirement among the obtained clusters may specifically include:
检测在所述待聚类人脸特征的人脸特征向量的预定向量距离内,是否存在已得到的聚类;Detecting whether there is an obtained cluster within a predetermined vector distance of the face feature vector of the face features to be clustered;
若存在已得到的聚类,且在已得到的聚类中加入所述待聚类人脸特征后,所述已得到的聚类相应的半径不大于半径阈值,则确定所述已得到聚类为所述目标聚类;If there are clusters that have been obtained, and after the face features to be clustered are added to the clusters that have been obtained, the corresponding radius of the clusters that have been obtained is not greater than the radius threshold, then determine that the clusters that have been obtained clustering for said target;
若不存在已得到的聚类,或,存在已得到的聚类,但在已得到的聚类中加入所述待聚类人脸特征后,所述已得到的聚类的半径大于半径阈值,则确定不存在所述目标聚类。If there is no obtained cluster, or, there is an obtained cluster, but after adding the face features to be clustered in the obtained cluster, the radius of the obtained cluster is greater than the radius threshold, Then it is determined that the target cluster does not exist.
可选的,对于所述待聚类人脸特征,执行所述关联人脸特征在相应视频的视频信息的过程,可以具体包括:Optionally, for the facial features to be clustered, performing the process of associating the video information of the corresponding video with the facial features may specifically include:
在所述待聚类人脸特征聚集到的聚类相应的视频信息列表中插入,所述待聚类人脸特征在相应视频的视频信息。Insert in the video information list corresponding to the cluster where the face features to be clustered are gathered, and the face features to be clustered are in the video information of the corresponding video.
可选的,本发明实施例提供的建立人脸索引的装置还可以用于:更新所述待聚类人脸特征聚集到的聚类相应的半径及质心。Optionally, the apparatus for establishing a face index provided in the embodiment of the present invention may also be used to: update the corresponding radius and centroid of the clusters into which the face features to be clustered are gathered.
可选的,在本发明实施例中,人脸特征在相应视频的视频信息可以具体包括:Optionally, in the embodiment of the present invention, the video information of the face feature in the corresponding video may specifically include:
人脸特征在相应视频的视频时间点,和/或,视频进度,和/或,视频的标识。Facial features at the video time point of the corresponding video, and/or, the video progress, and/or, the identification of the video.
可选的,本发明实施例提供的建立人脸索引的装置还可以用于:Optionally, the device for establishing a face index provided in the embodiment of the present invention can also be used for:
获取视频检索请求,所述视频检索请求携带有目标人脸图像;Obtaining a video retrieval request, the video retrieval request carrying a target face image;
提取所述目标人脸图像的目标人脸特征;Extracting the target face feature of the target face image;
从所述人脸索引中确定,所述目标人脸特征关联的相应视频的视频信息;Determine from the face index, the video information of the corresponding video associated with the target face feature;
输出所确定的视频信息。The determined video information is output.
本发明实施例还提供一种处理服务器,该处理服务器可以装载上述的程序模块,以执行本发明实施例提供的建立人脸索引的方法;上述程序模块可通过程序形式实现,图13示出了本发明实施例提供的处理服务器的结构框图,参照图13,该处理服务器可以包括:The embodiment of the present invention also provides a processing server, which can be loaded with the above-mentioned program module to execute the method for establishing a face index provided by the embodiment of the present invention; the above-mentioned program module can be implemented in the form of a program, as shown in FIG. 13 Referring to FIG. 13 for a structural block diagram of a processing server provided in an embodiment of the present invention, the processing server may include:
至少一个处理芯片1,至少一个通信接口2,至少一个存储器3和至少一个通信总线4;At least one
在本发明实施例中,处理芯片1、通信接口2、存储器3、通信总线4的数量为至少一个,且处理芯片1、通信接口2、存储器3通过通信总线4完成相互间的通信;In the embodiment of the present invention, the number of the
处理芯片1可能是一个中央处理器CPU,或者是特定集成电路ASIC(ApplicationSpecific Integrated Circuit),或者是被配置成实施本发明实施例的一个或多个集成电路。The
存储器3可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatilememory),例如至少一个磁盘存储器。The
其中,存储器3存储有程序,处理芯片1调用存储器3所存储的程序,以实现上述所述的由处理服务器执行的建立人脸索引的方法的步骤。Wherein, the
该程序具体用于:The program is specifically intended for:
获取至少一条视频;Get at least one video;
分别确定各条视频相应的人脸特征,并确定各人脸特征在相应视频的视频信息;Respectively determine the corresponding facial features of each video, and determine the video information of each facial feature in the corresponding video;
将同一人脸的人脸特征进行聚类处理,得到至少一个第一聚类;其中,一个第一聚类所聚集的人脸特征表示,所述至少一条视频中同一人脸的人脸特征;The facial features of the same face are clustered to obtain at least one first cluster; wherein, the facial features gathered by a first cluster represent the facial features of the same face in the at least one video;
针对各第一聚类所聚集的各人脸特征,关联人脸特征在相应视频的视频信息,得到所述至少一条视频的人脸索引。For each face feature gathered by each first cluster, associate the face feature with the video information of the corresponding video, and obtain the face index of the at least one video.
可选的,本发明实施例还提供一种存储介质,该存储介质可存储有适于处理芯片执行的程序,以实现上述所述的由处理服务器执行的建立人脸索引的方法的步骤。Optionally, the embodiment of the present invention further provides a storage medium, which can store a program suitable for execution by the processing chip, so as to implement the above-mentioned steps of the method for establishing a face index executed by the processing server.
本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。对于实施例公开的装置而言,由于其与实施例公开的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。Each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same and similar parts of each embodiment can be referred to each other. As for the device disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and for relevant details, please refer to the description of the method part.
专业人员还可以进一步意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Professionals can further realize that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software or a combination of the two. In order to clearly illustrate the possible For interchangeability, in the above description, the composition and steps of each example have been generally described according to their functions. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present invention.
结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理芯片执行的软件模块,或者二者的结合来实施。软件模块可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。The steps of the methods or algorithms described in conjunction with the embodiments disclosed herein may be directly implemented by hardware, software modules executed by a processing chip, or a combination of both. Software modules can be placed in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other Any other known storage medium.
对所公开的实施例的上述说明,使本领域专业技术人员能够实现或使用本发明。对这些实施例的多种修改对本领域的专业技术人员来说将是显而易见的,本文中所定义的一般原理可以在不脱离本发明的核心思想或范围的情况下,在其它实施例中实现。因此,本发明将不会被限制于本文所示的这些实施例,而是要符合与本文所公开的原理和新颖特点相一致的最宽的范围。The above description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the invention. Therefore, the present invention will not be limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
Claims (12)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810534101.5A CN110543584B (en) | 2018-05-29 | 2018-05-29 | Method, device, processing server and storage medium for establishing face index |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810534101.5A CN110543584B (en) | 2018-05-29 | 2018-05-29 | Method, device, processing server and storage medium for establishing face index |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN110543584A CN110543584A (en) | 2019-12-06 |
| CN110543584B true CN110543584B (en) | 2023-05-05 |
Family
ID=68701656
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201810534101.5A Active CN110543584B (en) | 2018-05-29 | 2018-05-29 | Method, device, processing server and storage medium for establishing face index |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN110543584B (en) |
Families Citing this family (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113449543B (en) * | 2020-03-24 | 2022-09-27 | 百度在线网络技术(北京)有限公司 | Video detection method, device, equipment and storage medium |
| CN111414517B (en) * | 2020-03-26 | 2023-05-19 | 成都市喜爱科技有限公司 | Video face analysis method, device and server |
| CN113965772B (en) * | 2021-10-29 | 2024-05-10 | 北京百度网讯科技有限公司 | Live video processing method, device, electronic device and storage medium |
| CN116168316A (en) * | 2022-12-09 | 2023-05-26 | 北京百度网讯科技有限公司 | Video processing and video content searching method and device, electronic equipment and medium |
| CN116127133B (en) * | 2023-04-17 | 2023-08-08 | 湖南柚子树文化传媒有限公司 | File searching method, system, equipment and medium based on artificial intelligence |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101799827A (en) * | 2010-03-11 | 2010-08-11 | 浙江大学 | Video database management method based on layering structure |
| CN101833569A (en) * | 2010-04-08 | 2010-09-15 | 中国科学院自动化研究所 | A method for automatic identification of movie face images |
| CN103336795A (en) * | 2013-06-09 | 2013-10-02 | 华中科技大学 | Video indexing method based on multiple features |
| CN103631786A (en) * | 2012-08-22 | 2014-03-12 | 腾讯科技(深圳)有限公司 | Clustering method and device for video files |
| CN105915930A (en) * | 2016-04-15 | 2016-08-31 | 北京网康科技有限公司 | Video file sending method and apparatus |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8351661B2 (en) * | 2009-12-02 | 2013-01-08 | At&T Intellectual Property I, L.P. | System and method to assign a digital image to a face cluster |
| CN103530652B (en) * | 2013-10-23 | 2016-09-14 | 北京中视广信科技有限公司 | A kind of video categorization based on face cluster, search method and system thereof |
| CN104133875B (en) * | 2014-07-24 | 2017-03-22 | 北京中视广信科技有限公司 | Face-based video labeling method and face-based video retrieving method |
-
2018
- 2018-05-29 CN CN201810534101.5A patent/CN110543584B/en active Active
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101799827A (en) * | 2010-03-11 | 2010-08-11 | 浙江大学 | Video database management method based on layering structure |
| CN101833569A (en) * | 2010-04-08 | 2010-09-15 | 中国科学院自动化研究所 | A method for automatic identification of movie face images |
| CN103631786A (en) * | 2012-08-22 | 2014-03-12 | 腾讯科技(深圳)有限公司 | Clustering method and device for video files |
| CN103336795A (en) * | 2013-06-09 | 2013-10-02 | 华中科技大学 | Video indexing method based on multiple features |
| CN105915930A (en) * | 2016-04-15 | 2016-08-31 | 北京网康科技有限公司 | Video file sending method and apparatus |
Non-Patent Citations (2)
| Title |
|---|
| Xiaochun Cao等.Constrained multi-view video face clustering .《IEEE Transactions on Image Processing 》.2015,第24卷(第11期),4381-4393. * |
| 张博.基于移动视频的动态人脸识别技术的研究与实现.《中国优秀硕士学位论文全文数据库 信息科技辑》.2016,I138-1030. * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN110543584A (en) | 2019-12-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110543584B (en) | Method, device, processing server and storage medium for establishing face index | |
| TWI740537B (en) | Information processing method, device and storage medium thereof | |
| US8270684B2 (en) | Automatic media sharing via shutter click | |
| CN110147722A (en) | A kind of method for processing video frequency, video process apparatus and terminal device | |
| CN109635146B (en) | Target query method and system based on image characteristics | |
| US20210357624A1 (en) | Information processing method and device, and storage medium | |
| CN105574506A (en) | Intelligent face tracking system and method based on depth learning and large-scale clustering | |
| CN107870959A (en) | Providing relevant video scenes in response to a video search query | |
| CN102650993A (en) | Index establishing and searching methods, devices and systems for audio-video file | |
| CN205263812U (en) | Distributing type face identification orbit searching system | |
| CN105095211A (en) | Acquisition method and device for multimedia data | |
| KR20110132884A (en) | Intelligent video information retrieval device and method capable of multiple video indexing and retrieval | |
| US20150254342A1 (en) | Video dna (vdna) method and system for multi-dimensional content matching | |
| WO2013097101A1 (en) | Method and device for analysing video file | |
| US20130006951A1 (en) | Video dna (vdna) method and system for multi-dimensional content matching | |
| CN108563651A (en) | A kind of Target Searching Method, device and the equipment of more videos | |
| CN110602059B (en) | A method for accurately restoring the fingerprint of the plaintext length of the encrypted transmission data of the TLS protocol | |
| CN102932421A (en) | Cloud back-up method and device | |
| CN110876090B (en) | Video abstract playback method and device, electronic equipment and readable storage medium | |
| CN107241618B (en) | Recording method and recording apparatus | |
| US20230177083A1 (en) | Method and apparatus for simultaneous video retrieval and alignment | |
| CN108228101B (en) | Method and system for managing data | |
| CN108520047B (en) | Video characteristic information retrieval method | |
| CN114610711A (en) | Big data storage method and device, electronic equipment and storage medium | |
| CN106156024A (en) | A kind of information processing method and server |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| TG01 | Patent term adjustment | ||
| TG01 | Patent term adjustment |