
CN103440269B - A Video Data Retrieval Method Based on Mutual Learning - Google Patents

Info

Publication number
CN103440269B
Authority
CN
China
Prior art keywords
video data
matrix
elements
calculate
similarity matrix
Prior art date
Legal status
Expired - Fee Related
Application number
CN201310332612.6A
Other languages
Chinese (zh)
Other versions
CN103440269A (en)
Inventor
Han Junwei (韩军伟)
Ji Xiang (吉祥)
Guo Lei (郭雷)
Hu Xintao (胡新韬)
Current Assignee
Northwestern Polytechnical University
Original Assignee
Northwestern Polytechnical University
Priority date
Filing date
Publication date
Application filed by Northwestern Polytechnical University filed Critical Northwestern Polytechnical University
Priority to CN201310332612.6A priority Critical patent/CN103440269B/en
Publication of CN103440269A publication Critical patent/CN103440269A/en
Application granted granted Critical
Publication of CN103440269B publication Critical patent/CN103440269B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical


Landscapes

  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a video data retrieval method based on mutual learning, characterized as follows: compute the similarity matrix for each kind of video-data feature and use it to obtain the corresponding Laplacian matrix; compute the eigenvalues and eigenvectors of each Laplacian matrix and take the eigenvectors corresponding to its M largest eigenvalues; compute the similarity matrices of these eigenvector embeddings and multiply them elementwise to obtain a learning matrix; multiply the learning matrix elementwise with each feature's similarity matrix to obtain the learned similarity matrices; use the learned similarity matrices to rank the video data, and count, among the top-ranked videos, how many belong to the same category as the query target video, giving the corresponding retrieval accuracy. With the method of the invention, retrieval accuracy is greatly improved over that before learning.

Description

A Video Data Retrieval Method Based on Mutual Learning

Technical Field

The invention relates to a video data retrieval method based on mutual learning, which can be applied to the retrieval of different kinds of video data.

Background Art

The rapid development of Internet and digital-photography technology has put ever more video data on the network, and video data retrieval has become a hot and difficult problem in multimedia technology. Scholars at home and abroad have proposed many features for video retrieval; most are based on the color, texture, and shape of the video data and are collectively called low-level features. In recent years, some scholars have proposed brain-function features based on human brain cognition, whose retrieval accuracy is higher than that of the low-level features. We observe, however, that different kinds of features reflect different characteristics of the video data; if the strengths of these features could be fused, retrieval accuracy would improve even further.

Summary of the Invention

Technical Problem to Be Solved

To overcome the shortcomings of the prior art, the invention proposes a video data retrieval method based on mutual learning: the low-level features and the brain-function features learn each other's strengths and are then used for video retrieval. The results show that features refined by mutual learning greatly improve retrieval accuracy.

Technical Solution

A video data retrieval method based on mutual learning, characterized by the following steps:

Step 1: Compute the similarity matrix W1 of the features X1, X2, ..., XN of the N video data and the similarity matrix W2 of the features Y1, Y2, ..., YN:

The similarity matrix W1 is computed from

$$w^{1}_{i,j} = \exp\!\left(-\frac{(X_i - X_j)^{T}(X_i - X_j)}{\sigma^{2}}\right);$$

The similarity matrix W2 is computed from

$$w^{2}_{i,j} = \exp\!\left(-\frac{(Y_i - Y_j)^{T}(Y_i - Y_j)}{\sigma^{2}}\right);$$

where X1, X2, ..., XN are the first features of the 1st, 2nd, ..., Nth video data; Y1, Y2, ..., YN are the second features of the 1st, 2nd, ..., Nth video data; $w^1_{i,j}$ is the element in row i, column j of W1; $w^2_{i,j}$ is the element in row i, column j of W2; Xi, Xj are the first features of the ith and jth video data; Yi, Yj are the second features of the ith and jth video data; exp is the exponential; i, j = 1, 2, ..., N; N > 0; σ > 0 is a constant; the superscript T denotes vector transposition;
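As an illustration, Step 1's Gaussian similarity can be sketched in Python/NumPy (the embodiment itself uses Matlab; the function and variable names here are ours, and the negative sign in the exponent is assumed so that identical features receive similarity 1):

```python
import numpy as np

def similarity_matrix(features, sigma):
    """Gaussian similarity between feature vectors (one row per video).

    w[i, j] = exp(-(x_i - x_j)^T (x_i - x_j) / sigma^2), as in Step 1.
    """
    diff = features[:, None, :] - features[None, :, :]   # shape (N, N, d)
    sq_dist = np.einsum("ijk,ijk->ij", diff, diff)       # (x_i - x_j)^T (x_i - x_j)
    return np.exp(-sq_dist / sigma**2)
```

Called as `W1 = similarity_matrix(X, sigma)` and `W2 = similarity_matrix(Y, sigma)` with X, Y the stacked feature rows, it returns symmetric N×N matrices with unit diagonal.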

Step 2: Compute the Laplacian matrix L1 = D1 - W1 of W1 and, likewise, the Laplacian matrix L2 = D2 - W2 of W2;

where D1 is the diagonal matrix with elements

$$d^{1}_{i,j} = \begin{cases} \sum_{t=1}^{N} w^{1}_{i,t} & i = j \\ 0 & i \neq j \end{cases}, \qquad t = 1, 2, \ldots, N,$$

with $w^{1}_{i,t}$ the element in row i, column t of W1; and D2 is the diagonal matrix with elements

$$d^{2}_{i,j} = \begin{cases} \sum_{t=1}^{N} w^{2}_{i,t} & i = j \\ 0 & i \neq j \end{cases}, \qquad t = 1, 2, \ldots, N,$$

with $w^{2}_{i,t}$ the element in row i, column t of W2;
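A minimal sketch of Step 2, assuming the unnormalized graph Laplacian L = D - W (the text defines the degree matrix D explicitly; the subtraction form is the standard one and is assumed here):

```python
import numpy as np

def laplacian(W):
    """Graph Laplacian L = D - W, where D is the diagonal degree matrix
    with d_ii = sum over t of w_it, as defined in Step 2."""
    D = np.diag(W.sum(axis=1))
    return D - W
```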

Step 3: Compute the eigenvalues and eigenvectors of the Laplacian matrices L1 and L2, then select the eigenvectors U1, U2, ..., UM and V1, V2, ..., VM corresponding to their M largest eigenvalues, respectively; here M ≥ 1 is a constant, U1, U2, ..., UM are N×1 eigenvectors of L1, and V1, V2, ..., VM are N×1 eigenvectors of L2;
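Step 3 can be sketched with `numpy.linalg.eigh`, which applies because the similarity matrices, and hence the Laplacians, are symmetric (names are illustrative):

```python
import numpy as np

def top_eigenvectors(L, M):
    """Eigenvectors of the symmetric Laplacian L for its M largest
    eigenvalues, returned as the columns of an N x M matrix (Step 3)."""
    vals, vecs = np.linalg.eigh(L)        # eigh returns ascending eigenvalues
    order = np.argsort(vals)[::-1][:M]    # indices of the M largest
    return vecs[:, order]
```

The matrices of Step 4 are then `P = top_eigenvectors(L1, M)` and `Q = top_eigenvectors(L2, M)`.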

Step 4: Use the eigenvectors U1, U2, ..., UM and V1, V2, ..., VM to build the matrices P = [U1 U2 ... UM] and Q = [V1 V2 ... VM]; compute the similarity matrix S1 of [K1 K2 ... KN]^T and the similarity matrix S2 of [L1 L2 ... LN]^T,

where the elements of S1 are computed as

$$s^{1}_{i,j} = \exp\!\left(-\frac{(K_i - K_j)^{T}(K_i - K_j)}{\sigma^{2}}\right);$$

and the elements of S2 are computed as

$$s^{2}_{i,j} = \exp\!\left(-\frac{(L_i - L_j)^{T}(L_i - L_j)}{\sigma^{2}}\right);$$

where K1, K2, ..., KN are rows 1, 2, ..., N of the matrix P, and L1, L2, ..., LN are rows 1, 2, ..., N of the matrix Q;

Step 5: Multiply the corresponding elements of the similarity matrices S1 and S2 to obtain the learning matrix S;

Step 6: Multiply the corresponding elements of the similarity matrix W1 and the learning matrix S to obtain the learned similarity matrix E1; multiply the corresponding elements of the similarity matrix W2 and the learning matrix S to obtain the learned similarity matrix E2;
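Steps 4 to 6 combine the two spectral embeddings into the learning matrix and then refine both original similarity matrices; a sketch under the same Gaussian-kernel assumption as Step 1 (the helper `gaussian` is our name, not the patent's):

```python
import numpy as np

def mutual_learning(W1, W2, P, Q, sigma):
    """Steps 4-6: rows K_i of P and L_i of Q are per-video spectral
    embeddings; S1, S2 are their Gaussian similarities, S = S1 * S2 is
    the learning matrix, and E_k = W_k * S (elementwise) are the
    learned similarity matrices."""
    def gaussian(F):
        d = F[:, None, :] - F[None, :, :]
        return np.exp(-np.einsum("ijk,ijk->ij", d, d) / sigma**2)

    S1, S2 = gaussian(P), gaussian(Q)
    S = S1 * S2              # Step 5: elementwise (Hadamard) product
    E1 = W1 * S              # Step 6: learned similarity matrices
    E2 = W2 * S
    return E1, E2
```

Note the products are elementwise, not matrix products; that is what "multiply the corresponding elements" specifies.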

Step 7: Use the formulas r = β(I - λE1)^{-1}T and f = β(I - λE2)^{-1}T to compute the score vectors r and f of the N video data after learning with the two kinds of features, then arrange the N video data by score from high to low to obtain the ranked video data;

where r = (r1, r2, ..., rN) is the score vector of the N video data retrieved with the first feature, with r1, r2, ..., rN the scores of the 1st, 2nd, ..., Nth video data; f = (f1, f2, ..., fN) is the score vector retrieved with the second feature, with f1, f2, ..., fN the corresponding scores; β = 1 - λ is a constant; λ > 0 is a constant; T = [t1, ..., tN]^T is the query vector, with ti = 1 if the ith video datum is the query target video and ti = 0 otherwise.
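Step 7's closed form matches the familiar manifold-ranking solution; a sketch, assuming I - λE is invertible for the chosen λ (and using `numpy.linalg.solve` rather than an explicit inverse):

```python
import numpy as np

def ranking_scores(E, T, lam):
    """Step 7: r = beta * (I - lambda E)^{-1} T with beta = 1 - lambda.
    T is the 0/1 query indicator vector."""
    N = E.shape[0]
    beta = 1.0 - lam
    return beta * np.linalg.solve(np.eye(N) - lam * E, T)

def rank_videos(scores):
    """Indices of the videos sorted by score, highest first."""
    return np.argsort(scores)[::-1]
```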

After Step 7, count the number C of videos among the top Q ranked video data that belong to the same category as the query target video, and compute the retrieval accuracy A = C/Q.
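The accuracy measure A = C/Q can be computed directly from the class labels of the ranked list (a hypothetical helper; the label values are illustrative):

```python
def retrieval_accuracy(ranked_labels, query_label, Q):
    """A = C / Q: the fraction of the top-Q ranked videos whose class
    label matches the query video's label."""
    top = ranked_labels[:Q]
    C = sum(1 for lab in top if lab == query_label)
    return C / Q
```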

Beneficial Effects

In the video data retrieval method based on mutual learning proposed by the invention: first, the similarity matrix of each kind of video-data feature is computed and used to obtain the corresponding Laplacian matrix; second, the eigenvalues and eigenvectors of the Laplacian matrices are computed, and the eigenvectors corresponding to the M largest eigenvalues of each are selected; third, the similarity matrices of these eigenvector embeddings are computed and multiplied elementwise to give the learning matrix; fourth, the learning matrix is multiplied elementwise with each feature's similarity matrix from the first step to give the learned similarity matrices; fifth, for each query target video, the learned similarity matrices are used to compute a score for every video, the videos are arranged by score from high to low, and the number of top-ranked videos consistent with the query target video is counted to compute the retrieval accuracy.

The proposed method lets different kinds of video-data features learn each other's strengths; compared with the unlearned features, it greatly improves the accuracy of video data retrieval.

Brief Description of the Drawings

Figure 1: Basic flowchart of the method of the invention

Figure 2: Retrieval accuracy of the method of the invention

Detailed Description

The invention is further described with reference to the embodiment and the drawings:

The hardware environment for implementation is an AMD Athlon 64 X2 5000+ computer with 2 GB of memory and a 256 MB graphics card; the software environment is Matlab 2009a on Windows XP. We implemented the proposed method in Matlab.

The invention is implemented as follows:

The flowchart of the invention is shown in Figure 1. The 1256 video data used for retrieval fall into three categories: 561 sports videos, 364 weather-forecast videos, and 331 advertisement videos. The two kinds of features are the brain-function features and the low-level features. The specific steps are as follows:

1. Compute the similarity matrix W1 of the features X1, X2, ..., XN of the N video data and the similarity matrix W2 of the features Y1, Y2, ..., YN. The elements of W1 are computed as

$$w^{1}_{i,j} = \exp\!\left(-\frac{(X_i - X_j)^{T}(X_i - X_j)}{\sigma^{2}}\right).$$

The matrix W2 is computed in the same way, with elements

$$w^{2}_{i,j} = \exp\!\left(-\frac{(Y_i - Y_j)^{T}(Y_i - Y_j)}{\sigma^{2}}\right);$$

where X1, X2, ..., XN are the first features of the 1st, 2nd, ..., Nth video data; Y1, Y2, ..., YN are the second features of the 1st, 2nd, ..., Nth video data; $w^1_{i,j}$ is the element in row i, column j of W1; $w^2_{i,j}$ is the element in row i, column j of W2; Xi, Xj are the first features of the ith and jth video data; Yi, Yj are the second features of the ith and jth video data; exp is the exponential; i, j = 1, 2, ..., N; N = 1256; the superscript T denotes vector transposition; σ = 8×10^-6;

2. Compute the Laplacian matrix L1 = D1 - W1 of W1 and, in the same way, L2 = D2 - W2; here D1 is the diagonal matrix with elements

$$d^{1}_{i,j} = \begin{cases} \sum_{t=1}^{N} w^{1}_{i,t} & i = j \\ 0 & i \neq j \end{cases}, \qquad t = 1, 2, \ldots, N,$$

where $w^{1}_{i,t}$ is the element in row i, column t of W1; D2 is the diagonal matrix with elements

$$d^{2}_{i,j} = \begin{cases} \sum_{t=1}^{N} w^{2}_{i,t} & i = j \\ 0 & i \neq j \end{cases}, \qquad t = 1, 2, \ldots, N,$$

where $w^{2}_{i,t}$ is the element in row i, column t of W2;

3. Compute the eigenvalues and eigenvectors of the Laplacian matrices L1 and L2, and select the eigenvectors U1, U2, ..., UM and V1, V2, ..., VM corresponding to their M largest eigenvalues;

where M ≥ 1 is a constant; U1, U2, ..., UM are N×1 eigenvectors of L1; V1, V2, ..., VM are N×1 eigenvectors of L2;

4. Use the eigenvectors U1, U2, ..., UM and V1, V2, ..., VM to build the matrices P = [U1 U2 ... UM] and Q = [V1 V2 ... VM]; compute the similarity matrix S1 of [K1 K2 ... KN]^T and the similarity matrix S2 of [L1 L2 ... LN]^T. The elements of S1 are computed as

$$s^{1}_{i,j} = \exp\!\left(-\frac{(K_i - K_j)^{T}(K_i - K_j)}{\sigma^{2}}\right).$$

S2 is computed in the same way, with elements

$$s^{2}_{i,j} = \exp\!\left(-\frac{(L_i - L_j)^{T}(L_i - L_j)}{\sigma^{2}}\right);$$

where K1, K2, ..., KN are rows 1, 2, ..., N of the matrix P, and L1, L2, ..., LN are rows 1, 2, ..., N of the matrix Q;

5. Multiply the corresponding elements of the similarity matrices S1 and S2 to obtain the learning matrix S.

6. Multiply the corresponding elements of the similarity matrix W1 and the learning matrix S to obtain the learned similarity matrix E1; multiply the corresponding elements of the similarity matrix W2 and the learning matrix S to obtain the learned similarity matrix E2.

7. Use the formulas r = β(I - λE1)^{-1}T and f = β(I - λE2)^{-1}T to compute the score vectors r and f of the N video data after learning with the two kinds of features, and arrange the N video data by score from high to low to obtain the ranked video data.

Here r = (r1, r2, ..., rN) is the score vector of the N video data retrieved with the first feature, with r1, r2, ..., rN the scores of the 1st, 2nd, ..., Nth video data; f = (f1, f2, ..., fN) is the score vector retrieved with the second feature, with f1, f2, ..., fN the corresponding scores; β = 1 - λ is a constant; λ = 0.99; T = [t1, ..., tN]^T is the query vector, with ti = 1 if the ith video datum is the query target video and ti = 0 otherwise.

8. Count the number C of videos among the top Q ranked video data that belong to the same category as the query target video, and compute the retrieval accuracy A = C/Q.

Using this algorithm for video retrieval, each of the 1256 videos was used once as the query target. Within the top 5, 10, 15, and 20 ranked videos, the percentage of videos belonging to the same category as the query target was recorded, and the percentages over the 1256 queries were averaged to obtain the mean retrieval accuracy of the 1256 videos, shown in Figure 2. For comparison, we also retrieved with the brain-function features and the low-level features separately, without mutual learning; the resulting accuracies are also shown in Figure 2. As the figure shows, the retrieval accuracy of both the brain-function features and the low-level features is much higher after learning than before: the brain-function features improve by 19.8% and the low-level features by 27.5% over their unlearned counterparts.
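Tying the steps together, an end-to-end sketch of the pipeline on synthetic data (all parameter values and names here are illustrative rather than the patent's; the unnormalized Laplacian L = D - W and the negative Gaussian exponent are assumed as above):

```python
import numpy as np

def gaussian(F, sigma):
    """Gaussian similarity between the rows of F (Steps 1 and 4)."""
    d = F[:, None, :] - F[None, :, :]
    return np.exp(-np.einsum("ijk,ijk->ij", d, d) / sigma**2)

def top_eigvecs(L, M):
    """Eigenvectors for the M largest eigenvalues of symmetric L (Step 3)."""
    vals, vecs = np.linalg.eigh(L)
    return vecs[:, np.argsort(vals)[::-1][:M]]

def mutual_learning_retrieval(X, Y, query, sigma=1.0, M=2, lam=0.99):
    """Steps 1-7 end to end. X, Y: feature matrices, one row per video;
    `query` is the index of the query target video. Returns the video
    indices ranked by the first feature's learned scores."""
    W1, W2 = gaussian(X, sigma), gaussian(Y, sigma)        # Step 1
    L1 = np.diag(W1.sum(axis=1)) - W1                      # Step 2 (L = D - W)
    L2 = np.diag(W2.sum(axis=1)) - W2
    P, Qm = top_eigvecs(L1, M), top_eigvecs(L2, M)         # Steps 3-4
    S = gaussian(P, sigma) * gaussian(Qm, sigma)           # Step 5
    E1 = W1 * S                                            # Step 6
    N = X.shape[0]
    T = np.zeros(N); T[query] = 1.0
    r = (1 - lam) * np.linalg.solve(np.eye(N) - lam * E1, T)  # Step 7
    return np.argsort(r)[::-1]
```

With two well-separated clusters of synthetic features, the top-ranked videos come from the query's own cluster, which is the effect the embodiment measures on the 1256-video set.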

Claims (2)

1. A video data retrieval method based on mutual learning, characterized by the following steps:

Step 1: Compute the similarity matrix W1 of the features X1, X2, ..., XN of N video data and the similarity matrix W2 of the features Y1, Y2, ..., YN, using

$$w^{1}_{i,j} = \exp\!\left(-\frac{(X_i - X_j)^{T}(X_i - X_j)}{\sigma^{2}}\right) \quad\text{and}\quad w^{2}_{i,j} = \exp\!\left(-\frac{(Y_i - Y_j)^{T}(Y_i - Y_j)}{\sigma^{2}}\right),$$

where X1, X2, ..., XN are the first features of the 1st, 2nd, ..., Nth video data and Y1, Y2, ..., YN are the second features; the first and second features are brain-function features and low-level features, respectively; $w^1_{i,j}$ and $w^2_{i,j}$ are the elements in row i, column j of W1 and W2; Xi, Xj are the first features of the ith and jth video data, and Yi, Yj the corresponding second features; exp is the exponential; i, j = 1, 2, ..., N; N > 0; σ > 0 is a constant; the superscript T denotes vector transposition.

Step 2: Compute the Laplacian matrices L1 = D1 - W1 of W1 and L2 = D2 - W2 of W2, where D1 and D2 are the diagonal matrices with elements

$$d^{1}_{i,j} = \begin{cases} \sum_{t=1}^{N} w^{1}_{i,t} & i = j \\ 0 & i \neq j \end{cases}, \qquad d^{2}_{i,j} = \begin{cases} \sum_{t=1}^{N} w^{2}_{i,t} & i = j \\ 0 & i \neq j \end{cases}, \qquad t = 1, 2, \ldots, N,$$

and $w^{1}_{i,t}$, $w^{2}_{i,t}$ are the elements in row i, column t of W1 and W2.

Step 3: Compute the eigenvalues and eigenvectors of L1 and L2, then select the eigenvectors U1, U2, ..., UM and V1, V2, ..., VM corresponding to their M largest eigenvalues, respectively; here M ≥ 1 is a constant, U1, ..., UM are N×1 eigenvectors of L1, and V1, ..., VM are N×1 eigenvectors of L2.

Step 4: Use the eigenvectors to build P = [U1 U2 ... UM] and Q = [V1 V2 ... VM]; compute the similarity matrix S1 of [K1 K2 ... KN]^T and the similarity matrix S2 of [L1 L2 ... LN]^T, with elements

$$s^{1}_{i,j} = \exp\!\left(-\frac{(K_i - K_j)^{T}(K_i - K_j)}{\sigma^{2}}\right), \qquad s^{2}_{i,j} = \exp\!\left(-\frac{(L_i - L_j)^{T}(L_i - L_j)}{\sigma^{2}}\right),$$

where K1, K2, ..., KN are rows 1, 2, ..., N of P and L1, L2, ..., LN are rows 1, 2, ..., N of Q.

Step 5: Multiply the corresponding elements of S1 and S2 to obtain the learning matrix S.

Step 6: Multiply the corresponding elements of W1 and S to obtain the learned similarity matrix E1, and the corresponding elements of W2 and S to obtain the learned similarity matrix E2.

Step 7: Use the formulas r = β(I - λE1)^{-1}T and f = β(I - λE2)^{-1}T to compute the score vectors r and f of the N video data after learning with the two kinds of features, and arrange the N video data by score from high to low to obtain the ranked video data; here r = (r1, r2, ..., rN) and f = (f1, f2, ..., fN) are the score vectors for retrieval with the first and second features, with ri and fi the scores of the ith video datum; β = 1 - λ and λ > 0 are constants; T = [t1, ..., tN]^T is the query vector, with ti = 1 if the ith video datum is the query target video and ti = 0 otherwise.

2. The video data retrieval method based on mutual learning according to claim 1, characterized in that after Step 7 the number C of videos among the top Q ranked video data belonging to the same category as the query target video is counted, and the retrieval accuracy A = C/Q is computed.
CN201310332612.6A 2013-08-01 2013-08-01 A video data retrieval method based on mutual learning Expired - Fee Related CN103440269B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310332612.6A CN103440269B (en) 2013-08-01 2013-08-01 A video data retrieval method based on mutual learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310332612.6A CN103440269B (en) 2013-08-01 2013-08-01 A video data retrieval method based on mutual learning

Publications (2)

Publication Number Publication Date
CN103440269A CN103440269A (en) 2013-12-11
CN103440269B true CN103440269B (en) 2016-06-08

Family

ID=49693961

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310332612.6A Expired - Fee Related CN103440269B (en) 2013-08-01 2013-08-01 A video data retrieval method based on mutual learning

Country Status (1)

Country Link
CN (1) CN103440269B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104111969B (en) * 2014-06-04 2017-06-20 百度移信网络技术(北京)有限公司 The method and system of a kind of similarity measurement

Citations (3)

Publication number Priority date Publication date Assignee Title
CN101071439A (en) * 2007-05-24 2007-11-14 北京交通大学 Interactive video searching method based on multi-view angle
CN101650728A (en) * 2009-08-26 2010-02-17 北京邮电大学 Video high-level characteristic retrieval system and realization thereof
CN102142037A (en) * 2011-05-05 2011-08-03 西北工业大学 Video data search method based on functional magnetic resonance imaging

Family Cites Families (1)

Publication number Priority date Publication date Assignee Title
US7076485B2 (en) * 2001-03-07 2006-07-11 The Mitre Corporation Method and system for finding similar records in mixed free-text and structured data

Patent Citations (3)

Publication number Priority date Publication date Assignee Title
CN101071439A (en) * 2007-05-24 2007-11-14 北京交通大学 Interactive video searching method based on multi-view angle
CN101650728A (en) * 2009-08-26 2010-02-17 北京邮电大学 Video high-level characteristic retrieval system and realization thereof
CN102142037A (en) * 2011-05-05 2011-08-03 西北工业大学 Video data search method based on functional magnetic resonance imaging

Non-Patent Citations (1)

Title
"Video Classification Based on Multi-Feature Distance Learning"; Li Zhenchao et al.; Computer Applications and Software (《计算机应用与软件》); 2012-12-31; Vol. 29, No. 12, pp. 10-12 *

Also Published As

Publication number Publication date
CN103440269A (en) 2013-12-11

Similar Documents

Publication Publication Date Title
CN109685819B (en) A 3D Medical Image Segmentation Method Based on Feature Enhancement
CN114169442B (en) Remote sensing image small sample scene classification method based on double prototype network
CN109063719B (en) Image classification method combining structure similarity and class information
CN112836646B (en) Video pedestrian re-identification method based on channel attention mechanism and application
CN110046575A (en) Based on the remote sensing images scene classification method for improving residual error network
CN103559196A (en) Video retrieval method based on multi-core canonical correlation analysis
CN107239565A (en) A kind of image search method based on salient region
CN101989302A (en) Multilayer bitmap color feature-based image retrieval method
CN110297931A (en) A kind of image search method
CN108154094A (en) Hyperspectral image unsupervised waveband selection method based on subinterval division
CN103336835B (en) Image retrieval method based on weight color-sift characteristic dictionary
CN106203483A (en) A kind of zero sample image sorting technique of multi-modal mapping method of being correlated with based on semanteme
CN107133640A (en) Image classification method based on topography's block description and Fei Sheer vectors
CN105320764A (en) 3D model retrieval method and 3D model retrieval apparatus based on slow increment features
CN102117337B (en) Space information fused Bag of Words method for retrieving image
CN106034264A (en) Method for acquiring video abstract based on collaborative model
CN103473308B (en) High-dimensional multimedia data classifying method based on maximum margin tensor study
CN102831161B (en) For the semi-supervised sequence learning method based on manifold regularization of image retrieval
CN114419361A (en) Neural network image classification method based on gated local channel attention
CN106780422A (en) A kind of notable figure fusion method based on Choquet integrations
CN106709566A (en) Deep learning-based data missing value refilling method
CN105740917A (en) High-resolution remote sensing image semi-supervised multi-view feature selection method with tag learning
CN102142037B (en) Video data search method based on functional magnetic resonance imaging
CN103440269B (en) A kind of video data search method based on study mutually
CN105975940A (en) Palm print image identification method based on sparse directional two-dimensional local discriminant projection

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160608

Termination date: 20190801