CN103440332A

CN103440332A - Image searching method based on relation matrix regularization enhancement representation

Info

Publication number: CN103440332A
Application number: CN2013103997347A
Authority: CN
Inventors: 杨育彬; 李亚楠
Original assignee: Nanjing University
Current assignee: Nanjing University
Priority date: 2013-09-05
Filing date: 2013-09-05
Publication date: 2013-12-11
Anticipated expiration: 2033-09-05
Also published as: CN103440332B

Abstract

The invention discloses an image retrieval method based on relationship-matrix regularization enhancement, which retrieves images from an image instance database and comprises the following steps: step 1, input the image to be retrieved; step 2, extract the features of the image to be retrieved and of the images in the image instance database; step 3, select P image classes from the image instance feature library and n images from each image class to form the sample data X; step 4, construct three relationship matrices using a manifold learning algorithm based on spectral graph theory; step 5, build the initial enhanced relationship matrix W′; step 6, compute the regularized enhanced relationship matrix W^*; step 7, compute the generalized feature matrix A; step 8, compute the final image representation of the database images; step 9, compute the image representation of the image to be retrieved; step 10, use the Euclidean distance to compute the similarity between the image to be retrieved and every image in the image instance database, and output the most similar database images in descending order of similarity.

Description

An Image Retrieval Method Based on Regularized Enhanced Representation of Relation Matrix

Technical Field

The invention belongs to the field of image retrieval, and in particular relates to an image retrieval method based on relationship-matrix regularization enhancement.

Background Art

With the rapid development and widespread adoption of image acquisition and processing equipment and of Internet technology, the new generation of information resources represented by images has become a strategic resource as important as materials and energy. Its data volume has reached a massive scale, and it has become the main body of current information processing and information resource construction. Because images carry a large amount of information, are rich in content, and are highly expressive, effective processing and application of massive image collections has become a core problem in many practical application fields.

Because image data is already massive and still growing, traditional technical means can no longer keep up, which poses new challenges for image organization, analysis, retrieval, and management. Content-based image retrieval research has made great progress and has effectively overcome the limitations of retrieval based on manually annotated text, but it is still some distance from practical use, especially in understanding the high-level semantics of images. Most methods remain at the level of semantic description and learning around low-level image features. Compared with the rich semantic concepts that humans can understand and use, the expressive power of low-level data features is very limited, so there is a large gap between low-level features and high-level semantics, the so-called "semantic gap". As a result, the accuracy and efficiency of image retrieval fall far short of practical needs, especially for accurately and effectively understanding and retrieving the many rich semantics of images.

To this day, the semantic gap in image retrieval has not been well resolved and remains one of the key problems troubling researchers. Among the many techniques for addressing it, image retrieval based on relevance feedback offers a feasible solution. Early relevance feedback techniques mainly used the feedback information to modify the query vector, i.e. the image feature, for example by reassigning weights to each dimension of the query vector or by adjusting the position of the query vector. In recent years, with the rise of manifold learning, many researchers have turned to manifold learning techniques that reduce the dimensionality of the high-dimensional image data space in order to explore the intrinsic structure of the image feature space. The main theoretical assumption is that the images form a manifold, and the goal is to discover its intrinsic structural information. Discovering low-dimensional subspaces embedded in high-dimensional data is an important means of learning the latent manifold of the data, and the subspace learning methods in manifold learning are all based on local analysis. Learning the corresponding low-dimensional semantic subspace by manifold learning agrees with the manifold learning assumption that the data set satisfies the Euclidean distance only locally. Therefore, analyzing the local information of image data and uncovering the local semantic manifold structure is more meaningful for image retrieval.

Summary of the Invention

Purpose of the invention: To solve the problems in the prior art, the present invention proposes an image retrieval method based on a regularized enhanced representation of the relationship matrix, which effectively addresses fast and accurate image retrieval over large-scale data.

Summary: The present invention discloses an image retrieval method based on relationship-matrix regularization enhancement. The method retrieves images from an image instance database and comprises the following steps:

Step 1, input the image to be retrieved;

Step 2, extract the features of the image to be retrieved and of the images in the image instance database, describing each image with an N-dimensional vector, N = 112. This gives the image instance feature library U = (u_1, …, u_M), where u_i is the feature of the i-th image in the image instance database, i = 1, …, M, and M is the number of images in the database, together with the feature v of the image to be retrieved. The image instance database contains more than 50 image classes, each image class represents one semantic class, and each image class contains more than 600 images;

Step 3, select P image classes from the image instance feature library, with P in the range 20 to 50, and select n images from each image class, with n in the range 100 to 500; the n×P images of the P classes form the sample data X. For example, in one embodiment of the invention, 30 image classes are selected, each representing a different semantic class, with 100 images per class, so that 3000 images in total form the sample data X. X = (x_1, …, x_q), q = n×P, where x_i is the feature of the i-th image in the sample data, q is the sample size, and X is a 112×q matrix;

Step 4, using a manifold learning algorithm based on spectral graph theory, construct the relationship matrix W, the positive-example relationship matrix W^P, and the negative-example relationship matrix W^N for the sample data X;

Step 5, enhance the constructed relationship matrix W to build the initial enhanced relationship matrix W′;

Step 6, regularize the enhanced relationship matrix W′ with the help of a probability transition matrix to obtain the regularized enhanced relationship matrix W^*;

Step 7, construct the objective equation from the regularized enhanced relationship matrix W^* and compute the generalized feature matrix A;

Step 8, use the generalized feature matrix A to reduce the dimensionality of all images in the image instance feature library, i.e. AU = A*(u_1, …, u_M) = (A*u_1, …, A*u_M). Writing y_i = A*u_i, i = 1, …, M, gives the final image representation Y = (y_1, …, y_M), where y_i is the dimensionality-reduced feature of the i-th image in the image instance database;

Step 9, use the generalized feature matrix A to reduce the dimensionality of the feature v of the image to be retrieved, obtaining its image representation f = A*v;

Step 10, compute the similarity between the image to be retrieved and every image in the image instance database from the Euclidean distance between the final image representation of step 8 and the image representation of step 9, i.e. compute the distance ||f - y_i||², i = 1, …, M, between the reduced feature f of the image to be retrieved and the reduced feature y_i of the i-th image in the image instance database; a smaller distance means a higher similarity. Output the images in the image instance database that are most similar to the image to be retrieved, in descending order of similarity.
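Steps 8 to 10 can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the matrix A is taken to be a list of d rows of length N that is applied as A*u, the features are lists of numbers, and the function names `project` and `rank_by_distance` are hypothetical and do not come from the patent.

```python
def project(A, u):
    """Apply a d x N projection matrix A (list of rows) to an N-dim feature u."""
    return [sum(a_k * u_k for a_k, u_k in zip(row, u)) for row in A]


def rank_by_distance(A, U, v):
    """Return (order, dist): database indices, most similar first, and distances.

    U is a list of N-dim database features and v is the query feature.
    Similarity is ordered by squared Euclidean distance between the reduced
    features, so the smallest distance (highest similarity) comes first.
    """
    f = project(A, v)                     # step 9: f = A*v
    Y = [project(A, u) for u in U]        # step 8: y_i = A*u_i
    dist = [sum((fk - yk) ** 2 for fk, yk in zip(f, y)) for y in Y]  # step 10
    order = sorted(range(len(U)), key=lambda i: dist[i])
    return order, dist


# Toy example: a 2 x 3 projection that keeps the first two coordinates.
A = [[1, 0, 0], [0, 1, 0]]
U = [[0, 0, 5], [1, 1, 0], [3, 0, 0]]
order, dist = rank_by_distance(A, U, [1, 1, 9])
```

In the toy example the third coordinate is discarded by the projection, so the second database image, which matches the query in the first two coordinates, is ranked first.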

The image features in step 2 include color moments, Tamura texture features, Gabor texture features, and a color histogram.

Step 4 specifically comprises the following: randomly select an image from the sample data X and compute the Euclidean distance between it and the other images in X; using relevance feedback retrieval, form a positive-example set and a negative-example set from the same-class and different-class images among the returned results; and build the relationship matrices with a simple k-nearest-neighbor method, i.e. the weight between two images that are k-nearest neighbors and belong to the same image class is 1, and 0 otherwise.

In step 4, the feedback-based Augmented Relation Embedding (ARE) method is used as the manifold learning algorithm based on spectral graph theory, and comprises the following steps:

(1) First construct the relationship matrix W for the sample data X. Randomly draw an image I from X, with feature x_i, and use the k-nearest-neighbor method to compute the Euclidean distance between x_i and the features of the other images in X, obtaining the k images most similar to image I, where k ranges from 5 to 10;

Take any image T from these k images, with feature x_t. The weight W_it between image I and image T is 1, and the weight between image I and any image outside the k images is 0; that is, W_it = 1 if x_i ∈ N^k(x_t) or x_t ∈ N^k(x_i), where N^k(x_i) denotes the k-nearest-neighbor set of image x_i and N^k(x_t) denotes the k-nearest-neighbor set of image x_t. This gives the relationship matrix W, whose entry in row i, column t is W_it;

The formula is:

$W_{it} = \begin{cases} 1, & x_i \in N^k(x_t) \ \text{or} \ x_t \in N^k(x_i) \\ 0, & \text{otherwise} \end{cases}$

Among the k images, those belonging to the same image class as image I form the positive-example set Pos, and those belonging to a different image class form the negative-example set Neg;
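The construction in sub-step (1) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the samples are stored as a list of feature vectors (one per image, i.e. the transpose of the 112×q matrix X), class labels are assumed to be available for the sample data, and the helper names `sq_dist`, `knn_sets`, `relationship_matrix`, and `pos_neg_sets` are hypothetical.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def knn_sets(X, k):
    """N^k(x_i) for every sample: indices of the k nearest other samples."""
    q = len(X)
    neighbors = []
    for i in range(q):
        others = sorted((j for j in range(q) if j != i),
                        key=lambda j: sq_dist(X[i], X[j]))
        neighbors.append(set(others[:k]))
    return neighbors


def relationship_matrix(X, k):
    """W_it = 1 if x_i is in N^k(x_t) or x_t is in N^k(x_i), else 0."""
    nk = knn_sets(X, k)
    q = len(X)
    return [[1 if (i in nk[t] or t in nk[i]) else 0 for t in range(q)]
            for i in range(q)]


def pos_neg_sets(X, labels, i, k):
    """Split the k nearest neighbors of image i into Pos and Neg by class."""
    near = knn_sets(X, k)[i]
    pos = {j for j in near if labels[j] == labels[i]}
    return pos, near - pos


# Four one-dimensional samples; with k = 1 the neighbor graph is a chain.
X = [[0.0], [1.0], [2.5], [10.0]]
W = relationship_matrix(X, 1)
pos, neg = pos_neg_sets(X, ["a", "a", "b", "b"], 1, 2)
```

Because of the "or" in the definition, W is symmetric even though the k-nearest-neighbor relation itself is not.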

(2) Construct the positive-example relationship matrix W^P. If image R belongs to the same image class as image I and is among the k images, with feature x_r, then the weight between image I and image R is 1, and the weight between image I and any image outside the k images is 0; that is, W^P_ir = 1 if x_i, x_r ∈ Pos, where W^P_ir is the weight between image I and image R, and x_i, x_r ∈ Pos means that the features x_i and x_r belong to the positive-example set Pos. The entry in row i, column r of the positive-example relationship matrix W^P is W^P_ir.

The formula is:

$W^P_{ir} = \begin{cases} 1, & x_i, x_r \in Pos \\ 0, & \text{otherwise} \end{cases}$

(3) Construct the negative-example relationship matrix W^N. If image H belongs to a different image class from image I and is among the k images, with feature x_h, then the weight between image I and image H is 1, and the weight between image I and any image outside the k images is 0; that is, W^N_ih = 1 if x_i ∈ Pos and x_h ∈ Neg, or x_h ∈ Pos and x_i ∈ Neg. Here x_i ∈ Pos means that the feature x_i belongs to the positive-example set Pos, x_h ∈ Neg means that x_h belongs to the negative-example set Neg, x_h ∈ Pos means that x_h belongs to Pos, and x_i ∈ Neg means that x_i belongs to Neg. W^N_ih is the weight between image I and image H, and the entry in row i, column h of the negative-example relationship matrix W^N is W^N_ih.

The formula is:

$W^N_{ih} = \begin{cases} 1, & (x_i \in Pos \ \text{and} \ x_h \in Neg) \ \text{or} \ (x_h \in Pos \ \text{and} \ x_i \in Neg) \\ 0, & \text{otherwise} \end{cases}$

Finally, the three relationship matrices W, W^P, and W^N are obtained, of which W^P and W^N are the relationship matrices needed to compute the generalized feature matrix.
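The positive-example and negative-example matrices of sub-steps (2) and (3) can be sketched from the sets Pos and Neg of one feedback round. The sketch makes two assumptions that the text does not state explicitly: the query image I itself is counted as a member of Pos (the condition x_i ∈ Pos suggests this), and the diagonal is left at 0. The function name `positive_negative_matrices` is hypothetical.

```python
def positive_negative_matrices(q, i, pos, neg):
    """Build the q x q matrices W^P and W^N for one feedback round on image i."""
    pos_set = set(pos) | {i}   # assumption: the query image belongs to Pos
    neg_set = set(neg)
    WP = [[0] * q for _ in range(q)]
    WN = [[0] * q for _ in range(q)]
    for a in range(q):
        for b in range(q):
            if a == b:
                continue       # assumption: no self-relations on the diagonal
            if a in pos_set and b in pos_set:
                WP[a][b] = 1   # both images are positive examples
            if (a in pos_set and b in neg_set) or (b in pos_set and a in neg_set):
                WN[a][b] = 1   # one positive example, one negative example
    return WP, WN


# Query image 1 with Pos = {0} and Neg = {2} among four samples.
WP, WN = positive_negative_matrices(4, 1, {0}, {2})
```

Image 3 is outside the k nearest neighbors in this toy example, so its rows and columns stay at 0 in both matrices.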

Step 5 specifically comprises the following: starting from the relationship matrix W, if image z is a neighbor of image i and also a neighbor of image j, the enhanced weight W′_ij between image i and image j is computed as:

$W'_{ij} = \sum_z W_{iz} W_{jz}$

where W_iz is the weight between image i and image z, W_jz is the weight between image j and image z, and W′_ij is the entry in row i, column j of the enhanced relationship matrix W′.
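The enhancement formula counts the neighbors that two images have in common, which in matrix form is W′ = W W^T. A small sketch, with the hypothetical function name `enhance` and a toy chain-shaped W chosen only for illustration:

```python
def enhance(W):
    """W'_ij = sum_z W_iz * W_jz, i.e. W' = W W^T (shared-neighbor counts)."""
    q = len(W)
    return [[sum(W[i][z] * W[j][z] for z in range(q)) for j in range(q)]
            for i in range(q)]


# Chain 0 - 1 - 2 - 3: images 0 and 2 are not neighbors but share neighbor 1.
W = [[0, 1, 0, 0],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [0, 0, 1, 0]]
W1 = enhance(W)
```

In the result, images 0 and 2 obtain a weight of 1 although W_02 = 0, because image 1 is a neighbor of both; the diagonal entries equal the number of neighbors of each image.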

Step 6 specifically comprises the following:

Propagating the neighbor relations between images repeatedly gives a new enhanced relationship matrix W″, by the formula W″ = W′*W′;

The transitions between images are represented by a transition probability matrix P = [P_ij]_{n×n}, where P_ij = p(j|i) is the probability of moving from any image i to any image j in the sample data X. The n images most similar to image i are selected by Euclidean distance. With x_j the feature of image j, the transition probability P(j|i) is calculated by a formula in terms of d_ij (the formula itself is missing from this text), where d_ij = ||x_i - x_j||² denotes the Euclidean distance between the features of image i and image j.

The regularization model W^R for the relationship matrix enhancement is computed as:

$W^R = \eta P + (1 - \eta) g e^T$

where η is the probability that the transition from image i to image j occurs, (1-η) is the probability that image i jumps at random, g = (1/n)e is a uniform distribution vector, e = (1, 1, …)^T is the n-dimensional column vector of ones, n is the number of images in each image class, and the entry in row i, column j of the matrix P is P(j|i);

The new relationship weight w^*_ij between image i and image j is computed as:

$w^*_{ij} = w''_{ij} \cdot w^R_{ij}$

where w″_ij, the weight between image i and image j, is the entry in row i, column j of W″, and w^R_ij, the probability weight of image i jumping to image j, is the entry in row i, column j of W^R;

Finally, the regularized enhanced relationship matrix W^* is obtained, whose entry in row i, column j is w^*_ij.
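Step 6 can be sketched as below, under clearly stated assumptions. The transition matrix P is passed in as an argument because its formula is not available in this text; `*` in W″ = W′*W′ is read as the matrix product; and W′, P, and W^R are assumed to have the same size n×n so that the entrywise product is defined. The names `matmul` and `regularize` are hypothetical.

```python
def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    inner, cols = len(B), len(B[0])
    return [[sum(row[k] * B[k][j] for k in range(inner)) for j in range(cols)]
            for row in A]


def regularize(W1, P, eta):
    """Return W* with entries w*_ij = w''_ij * w^R_ij.

    W1 is the enhanced matrix W', P a row-stochastic transition matrix
    (supplied by the caller), and eta the transition probability.
    """
    n = len(W1)
    W2 = matmul(W1, W1)                        # W'' = W' * W'
    # g e^T with g = (1/n) e is the n x n matrix whose entries are all 1/n.
    WR = [[eta * P[i][j] + (1 - eta) / n for j in range(n)] for i in range(n)]
    return [[W2[i][j] * WR[i][j] for j in range(n)] for i in range(n)]


# Toy 2 x 2 example with a hand-written transition matrix.
W_star = regularize([[1, 1], [1, 2]], [[0.5, 0.5], [0.25, 0.75]], 0.5)
```

The uniform term (1-η)/n keeps every entry of W^R positive, so a nonzero propagated weight w″_ij is never cancelled by a zero transition probability.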

Step 7 comprises the following:

First take the features x_i and x_j of any two images from the sample data X; the relationship weight of the two images is W_ij, their positive-example relationship weight is W^P_ij, and their negative-example relationship weight is W^N_ij.

The generalized feature matrix A is computed from the following objective equation:

$X (L^N - \gamma L^P) X^T A = \lambda X L X^T A$

where L is the Laplacian matrix of the relationship matrix W, L^N is the Laplacian matrix of the negative-example relationship matrix W^N, L^P is the Laplacian matrix of the positive-example relationship matrix W^P, γ is a constant proportional to the ratio of the number of negative-example images to the number of positive-example images, X^T is the transpose of the sample data X, and λ is an eigenvalue of the equation.

In the present invention, ARE stands for Augmented Relation Embedding, a manifold learning dimensionality reduction algorithm based on augmented relation graph embedding. ARE embeds the positive-example and negative-example relationship matrices into the global relationship matrix and finds a projection matrix, i.e. the generalized feature matrix, thereby reducing the dimensionality of the data features.

The principle of the invention is as follows. Let the sample data be X = (x_1, …, x_N), x_i ∈ R^m, and let the relationships between data points be represented by the matrix W ∈ R^{N×N}, whose entries measure the similarity between each pair of data points. The diagonal matrix D and the corresponding Laplacian matrix L are defined by:

$D_{ii} = \sum_j W_{ij}, \quad \forall i,$

$L = D - W$

D_ii is the entry in row i, column i of the diagonal matrix D. Let the generalized feature matrix be A. The low-dimensional embedding of the original data space is obtained by projection, and A is found by minimizing:

$\sum_{ij} (A^T x_i - A^T x_j)^2 W_{ij}$

Each column a_j of the matrix A acts independently, so the above can be written as arg min_a Σ_ij (a^T x_i - a^T x_j)² W_ij, where a is the eigenvector to be found. Let y_i = a^T x_i; then:

$\sum_{ij} (y_i - y_j)^2 W_{ij} = \sum_{ij} y_i^2 W_{ij} - 2 \sum_{ij} y_i y_j W_{ij} + \sum_{ij} y_j^2 W_{ij}$

$= 2 \sum_i y_i^2 D_{ii} - 2 \sum_{ij} y_i y_j W_{ij}$

$= 2 y^T (D - W) y = 2 y^T L y$
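The identity above, which relies on D_ii = Σ_j W_ij and on W being symmetric, can be checked numerically on a small example. The sketch below uses hypothetical helper names (`laplacian`, `pairwise_cost`, `quadratic_form`) and an arbitrary toy W and y.

```python
def laplacian(W):
    """L = D - W, where D is diagonal with D_ii = sum_j W_ij."""
    n = len(W)
    return [[(sum(W[i]) if i == j else 0) - W[i][j] for j in range(n)]
            for i in range(n)]


def pairwise_cost(W, y):
    """Left-hand side: sum over all i, j of (y_i - y_j)^2 * W_ij."""
    n = len(W)
    return sum((y[i] - y[j]) ** 2 * W[i][j] for i in range(n) for j in range(n))


def quadratic_form(L, y):
    """y^T L y for a matrix L given as a list of rows."""
    n = len(L)
    return sum(y[i] * L[i][j] * y[j] for i in range(n) for j in range(n))


W = [[0, 1, 0, 0],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [0, 0, 1, 0]]
y = [1, 3, 0, 2]
lhs = pairwise_cost(W, y)                   # sum_ij (y_i - y_j)^2 W_ij
rhs = 2 * quadratic_form(laplacian(W), y)   # 2 y^T L y
```

For this W and y both sides evaluate to 34: the three edges contribute 4, 9, and 4, and each is counted twice in the double sum.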

Here y collects the projections of all data points onto the projection vector a, i.e. y^T = a^T X. To constrain the transformed coordinates, note that D_ii is the number of points connected to the i-th point, which to some extent indicates the importance of that point, so the constraint y^T D y = 1 can be added. Under this constraint, points of high importance are mapped to coordinates closer to the origin, so that the densest region lies at the origin. The objective to be solved finally becomes:

$a^* = \arg\min_a a^T X L X^T a, \quad \text{s.t.} \quad a^T X D X^T a = 1$
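As a worked step that the text leaves implicit, the constrained minimization can be turned into a generalized eigenvalue problem with a Lagrange multiplier λ. The symbol J for the Lagrangian is introduced here only for illustration, and the derivation assumes that W, and hence L and D, are symmetric.

```latex
\begin{aligned}
\mathcal{J}(a, \lambda) &= a^T X L X^T a - \lambda \left( a^T X D X^T a - 1 \right) \\
\frac{\partial \mathcal{J}}{\partial a} &= 2 X L X^T a - 2 \lambda X D X^T a = 0 \\
\Longrightarrow \quad X L X^T a &= \lambda X D X^T a
\end{aligned}
```

Left-multiplying the last line by a^T and using the constraint a^T X D X^T a = 1 gives a^T X L X^T a = λ, so the minimizer is the eigenvector belonging to the smallest eigenvalue. The objective equation of step 7 has the same generalized eigenproblem form, with different matrices on each side.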

As the derivation shows, the relationship matrix W plays the leading role in the whole process, and the projected data points y are closely tied to W. When W_ij is large, x_i and x_j are highly similar, and the distance between y_i and y_j after dimensionality reduction should be as small as possible; when W_ij is small, x_i and x_j are less similar, and the distance between y_i and y_j after dimensionality reduction should be as large as possible. The similarity here can express whether data points belong to the same class, in which case same-class data points are naturally highly similar. For data without class information, similarity is measured by the neighbor relation, and neighboring data points should have high similarity. Data points that neither belong to the same class nor are neighbors have low similarity, and W_ij = 0 is generally used.

Beneficial effects: The invention uses the regularized enhanced representation of the relationship matrix to reduce the dimensionality of image instance features. The method effectively strengthens the relationships between images of the same class. Because class information is fused into the construction of the relationship matrix, the method extends easily to a semi-supervised learning framework, making full use of both labeled and unlabeled data, effectively improving the stability of the algorithm and reducing its computational complexity, while giving image queries a high accuracy. The image retrieval method based on the regularized enhanced representation of the relationship matrix therefore has high practical value.

Description of the Drawings

Fig. 1 is a flowchart of the present invention.

Fig. 2 is a flowchart of the dimensionality reduction of the image instance database features.

Fig. 3 is a flowchart of the dimensionality reduction of the features of the image to be retrieved.

Fig. 4 is a schematic diagram of image relationship enhancement.

Fig. 5 is a schematic diagram of the image random walk model.

Fig. 6 is a schematic diagram of the regularized enhanced relationship.

Fig. 7 is a schematic diagram of image retrieval results.

Detailed Description of the Embodiments

As shown in Fig. 1, the present invention discloses an image retrieval method based on a regularized enhanced relationship matrix representation, comprising the following steps:

Step 1, input the image to be retrieved;

As shown in Figs. 2 and 3, the regularized enhanced relationship matrix of the images is constructed mainly in steps 2 to 6, the image instance feature library is reduced in dimensionality in step 8, and the feature of the image to be retrieved is reduced in dimensionality in step 9:

Step 2, extract the image features of the image to be retrieved and of the images in the image instance database. The features include color moments, Tamura texture features, Gabor texture features, and a color histogram. Each image is described by an N-dimensional vector, N = 112; the feature of the image to be retrieved is v, the image instance feature library is U = (u_1, …, u_M), M is the total number of images in the image instance database, and U is an N×M matrix;

Step 3, with each image represented by its extracted features, select 30 image classes from the image instance database, each representing one semantic class and containing 100 images, 3000 images in total, and use them as the sample data X, X = (x_1, …, x_3000), a 112×3000 matrix;

Step 5, enhance the relationship matrix W to build the initial enhanced relationship matrix W′;

Step 7, construct the objective function from the regularized enhanced relationship matrix W^*, the positive-example relationship matrix W^P, and the negative-example relationship matrix W^N, and solve for the generalized feature matrix A;

Step 8, use the generalized feature matrix A to reduce the dimensionality of all images in the image instance database, i.e. AU = A*(u_1, …, u_M) = (A*u_1, …, A*u_M). Writing y_i = A*u_i, i = 1, …, M, gives the final image representation Y = (y_1, …, y_M);

Step 9, as shown in Fig. 3, use the generalized feature matrix A to reduce the dimensionality of the feature v of the image to be retrieved, obtaining its image representation f = A*v;

Step 10, use the Euclidean distance to compute the similarity between the image to be retrieved and every image in the image instance database, i.e. compute ||f - y_i||², i = 1, …, M, and output the images in the image instance database that are most similar to the image to be retrieved, in descending order of similarity.

Step 2 specifically comprises the following:

Extract the features of each image. The image description consists of color moments (RGB color space): 9 dimensions; color moments (LUV color space): 9 dimensions; Tamura texture features: 6 dimensions; Gabor texture features: 24 dimensions; and a color histogram (HSV color space): 64 dimensions.
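The five feature groups add up to the 112 dimensions used throughout: 9 + 9 + 6 + 24 + 64 = 112. The sketch below only shows how the per-feature vectors might be concatenated into one descriptor. The extractors themselves are not shown, and the dictionary keys and the function name `concat_features` are hypothetical.

```python
# Dimensions of each feature group as listed in the text.
FEATURE_DIMS = {
    "color_moments_rgb": 9,
    "color_moments_luv": 9,
    "tamura_texture": 6,
    "gabor_texture": 24,
    "color_histogram_hsv": 64,
}


def concat_features(parts):
    """Concatenate per-feature vectors in a fixed order into one descriptor."""
    vec = []
    for name, dim in FEATURE_DIMS.items():
        block = list(parts[name])
        if len(block) != dim:
            raise ValueError(f"{name}: expected {dim} values, got {len(block)}")
        vec.extend(block)
    return vec


# Placeholder values stand in for real extractor outputs.
descriptor = concat_features({name: [0.0] * dim for name, dim in FEATURE_DIMS.items()})
```

Checking each block length before concatenation catches a misconfigured extractor early, rather than producing a descriptor of the wrong size.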

Example 1

This embodiment includes the following parts:

1. Input an image I to be retrieved;

2. Extract the image features of the image instance database and of the image to be retrieved. The features and their dimensions are as follows:

Color moments (RGB color space): 9 dimensions; color moments (LUV color space): 9 dimensions; Tamura texture features: 6 dimensions; Gabor texture features: 24 dimensions; color histogram (HSV color space): 64 dimensions. Each image is thus described by a 112-dimensional vector. The feature of the image to be retrieved is v, and the image instance feature library is U = (u_1, …, u_M), where M is the total number of images in the image instance database and U is an N×M matrix with N = 112;
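The layout of the 112-dimensional vector can be sketched as below. This is an illustrative sketch only: the dictionary keys and the function name `assemble_feature_vector` are hypothetical, and zero arrays stand in for the output of real feature extractors, which the text does not specify.

```python
import numpy as np

# Descriptor lengths taken from the text; the keys are illustrative labels.
FEATURE_DIMS = {
    "color_moments_rgb": 9,
    "color_moments_luv": 9,
    "tamura_texture": 6,
    "gabor_texture": 24,
    "color_histogram_hsv": 64,
}

def assemble_feature_vector(parts):
    """Concatenate per-descriptor arrays into one vector, checking each length."""
    chunks = []
    for name, dim in FEATURE_DIMS.items():
        chunk = np.asarray(parts[name], dtype=float).ravel()
        if chunk.size != dim:
            raise ValueError(f"{name}: expected {dim} values, got {chunk.size}")
        chunks.append(chunk)
    return np.concatenate(chunks)

# Placeholder descriptors (zeros) stand in for real extractor output.
dummy_parts = {name: np.zeros(dim) for name, dim in FEATURE_DIMS.items()}
v = assemble_feature_vector(dummy_parts)
```

Stacking one such vector per image as a column gives the N×M library matrix U described above.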

3. Select training sample data from the image feature library U, with each image represented by its extracted features. Select 30 image classes, each representing one semantic class and each containing 100 images, for 3000 images in total, and take them as the sample data X = (x_1, …, x_3000); the matrix X is 112×3000;
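A minimal sketch of this sampling step follows. It assumes that class labels are available for every library image and that classes and images are drawn uniformly at random; the text does not say how they are chosen. The function name `select_samples` and the toy library are hypothetical.

```python
import numpy as np

def select_samples(U, labels, num_classes=30, per_class=100, seed=0):
    """Pick `num_classes` classes and `per_class` images from each.

    U is the N x M feature library (one column per image) and `labels`
    holds one class id per column. Returns X and the labels of its columns.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    classes = rng.choice(np.unique(labels), size=num_classes, replace=False)
    cols = []
    for c in classes:
        members = np.flatnonzero(labels == c)
        cols.append(rng.choice(members, size=per_class, replace=False))
    cols = np.concatenate(cols)
    return U[:, cols], labels[cols]

# Toy library: 112-dimensional features, 50 classes of 120 images each.
rng = np.random.default_rng(1)
labels_all = np.repeat(np.arange(50), 120)
U = rng.random((112, labels_all.size))
X, y = select_samples(U, labels_all)
```

With the default arguments X has 30 × 100 = 3000 columns, matching the 112×3000 matrix in the text.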

4. Randomly select an image from the sample data X and compute the Euclidean distance between it and the other images in X. Using relevance feedback retrieval, set up the positive set and the negative set from the same-class and different-class images in the returned results, and build the relationship matrices with the simple k-nearest-neighbor method: the weight between two images that are k-nearest neighbors and belong to the same image class is 1, and otherwise 0.

(1) First construct the relationship matrix W for the sample data X. Randomly draw an image I with feature x_i from X and use the k-nearest-neighbor method to compute the Euclidean distance between x_i and the features of the other images in X, obtaining the k images most similar to image I, where k = 5;

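Take any image T from the k images, with feature x_t: the weight W_it between image I and image T is 1, and the weight between image I and any image outside the k images is 0. N^k(x_i) denotes the k-nearest-neighbor set of x_i, and the entry in row i, column t of the relationship matrix W is W_it. The formula is:

$W_{it} = \begin{cases} 1, & x_i \in N^{k}(x_t) \text{ or } x_t \in N^{k}(x_i) \\ 0, & \text{otherwise} \end{cases}$

Among the k images, those in the same image class as image I form the positive set Pos, and those in a different image class form the negative set Neg;

(2) Construct the positive relationship matrix W^P: if image R, with feature x_r, belongs to the same image class as image I and is among the k images, the weight W^P_ir between image I and image R is 1, and the weight between image I and any image outside the k images is 0. The formula is:

$W^{P}_{ir} = \begin{cases} 1, & x_i, x_r \in Pos \\ 0, & \text{otherwise} \end{cases}$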

(3) Construct the negative relationship matrix W^N: if image H, with feature x_h, belongs to a different image class from image I and is among the k images, the weight W^N_ih between image I and image H is 1, and the weight between image I and any image outside the k images is 0; that is, x_i ∈ Pos and x_h ∈ Neg, or x_h ∈ Pos and x_i ∈ Neg. Here x_i ∈ Pos means that feature x_i belongs to the positive set Pos, x_h ∈ Neg means that feature x_h belongs to the negative set Neg, x_h ∈ Pos means that feature x_h belongs to the positive set Pos, and x_i ∈ Neg means that feature x_i belongs to the negative set Neg. The entry in row i, column h of W^N is W^N_ih. The formula is:

$W^{N}_{ih} = \begin{cases} 1, & x_i \in Pos \text{ and } x_h \in Neg, \text{ or } x_h \in Pos \text{ and } x_i \in Neg \\ 0, & \text{otherwise} \end{cases}$

Finally, the three relationship matrices W, W^P and W^N are obtained; these are the relationship matrices needed to compute the generalized feature matrix.
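The construction of the three matrices can be sketched as follows. This is a simplified sketch, not the embodiment itself: it assumes class labels are known for every sample and applies the k-nearest-neighbor rule to all images at once, whereas the text builds the positive and negative sets through relevance feedback on a randomly chosen image. The function name `build_relation_matrices` and the toy data are hypothetical.

```python
import numpy as np

def build_relation_matrices(X, labels, k=5):
    """Return (W, W_P, W_N) for samples stored as columns of X."""
    labels = np.asarray(labels)
    m = X.shape[1]
    # Squared Euclidean distances between all pairs of columns.
    sq = np.sum(X * X, axis=0)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X.T @ X)
    np.fill_diagonal(d2, np.inf)          # an image is not its own neighbor
    nn = np.argsort(d2, axis=1)[:, :k]    # indices of the k nearest neighbors

    knn = np.zeros((m, m), dtype=bool)
    knn[np.repeat(np.arange(m), k), nn.ravel()] = True
    W = (knn | knn.T).astype(float)       # x_i in N^k(x_t) or x_t in N^k(x_i)

    same = labels[:, None] == labels[None, :]
    W_P = W * same                        # neighbors from the same class
    W_N = W * ~same                       # neighbors from a different class
    return W, W_P, W_N

# Toy data: two well-separated classes of 10 samples each.
rng = np.random.default_rng(0)
X = np.hstack([rng.normal(0, 1, (4, 10)), rng.normal(5, 1, (4, 10))])
labels = np.array([0] * 10 + [1] * 10)
W, W_P, W_N = build_relation_matrices(X, labels, k=3)
```

In this sketch every neighbor pair falls into exactly one of W^P or W^N, so W = W^P + W^N.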

5. Establish the initial enhanced relationship matrix W′. Starting from the relationship matrix W, if image z is a neighbor image of image i and also a neighbor image of image j, the enhanced weight W′_ij between image i and image j is calculated with the following formula:

$W'_{ij} = \sum_{z} W_{iz} W_{jz}$

where W_iz is the weight between image i and image z, W_jz is the weight between image j and image z, and W′_ij is the entry in row i, column j of the enhanced relationship matrix W′. A specific example is shown in Figure 4: image 3 is a neighbor image of image 1 and also of image 2. Solid lines with arrows between images represent neighbor relationships, and the dashed line between image 1 and image 2 indicates that the relationship between them needs to be enhanced.
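The sum over z is the matrix product of W with its own transpose, which the following sketch applies to the three-image situation described for Figure 4. The function name `enhance` and the 0-based indexing are choices made for illustration.

```python
import numpy as np

def enhance(W):
    """W'_ij = sum_z W_iz * W_jz, i.e. the matrix product W @ W.T."""
    return W @ W.T

# Figure 4 as a toy graph: image 3 neighbors images 1 and 2,
# while images 1 and 2 are not direct neighbors (0-based indices 0, 1, 2).
W = np.array([[0., 0., 1.],
              [0., 0., 1.],
              [1., 1., 0.]])
W_prime = enhance(W)
print(W_prime[0, 1])  # 1.0: images 1 and 2 share one neighbor (image 3)
```

For a 0/1 matrix W, each entry of W′ counts the neighbors that two images have in common.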

6. Construct the probability transition matrix W^R and regularize the enhanced relationship matrix W′.

Propagating the neighbor relationships between images multiple times gives a new enhanced relationship matrix W″, with the formula W″ = W′ * W′;

A transition probability matrix represents the transition relationships between images. The corresponding transition matrix is $P = [P_{ij}]_{n \times n}$, where P_ij = P(j|i) is the transition probability from any image i to any image j in the sample data X. The n images most similar to image i are selected by Euclidean distance, the feature of image j is x_j, and the transition probability P(j|i) is computed from the Euclidean distance $d_{ij} = \|x_i - x_j\|^{2}$ between the features of image i and image j.

The regularization model W^R of the relationship matrix is computed as:

$W^{R} = \eta P + (1-\eta)\, g e^{T}$

where η is the probability that the transition from image i to image j takes place, with η set to 0.85, (1-η) is the probability that image i jumps randomly, g = (1/n)e is a uniform distribution vector, e = (1,1,…)^T is the n-dimensional all-ones column vector, n is the number of images in each image class, and the entry in row i, column j of the matrix P is P(j|i);
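A minimal sketch of the W^R computation follows. The exact form of P(j|i) is not reproduced in this text, so `transition_matrix` below uses an assumed stand-in (a row-normalized exponential kernel on squared distances) purely to produce valid probabilities; only `regularization_matrix` follows the stated formula. Both function names are hypothetical.

```python
import numpy as np

def transition_matrix(X):
    """ASSUMED form of P(j|i): softmax over negative squared distances.

    The source does not give its formula for P(j|i); this row-normalized
    kernel is only a stand-in that yields valid transition probabilities.
    """
    sq = np.sum(X * X, axis=0)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X.T @ X), 0.0)
    K = np.exp(-(d2 - d2.min(axis=1, keepdims=True)))  # shift for stability
    return K / K.sum(axis=1, keepdims=True)

def regularization_matrix(P, eta=0.85):
    """W^R = eta * P + (1 - eta) * g e^T with g = (1/n) e."""
    n = P.shape[0]
    e = np.ones((n, 1))
    g = e / n
    return eta * P + (1.0 - eta) * (g @ e.T)

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 6))       # 6 toy images with 4-dimensional features
P = transition_matrix(X)
W_R = regularization_matrix(P)    # eta = 0.85 as in the embodiment
```

Because g e^T has every entry equal to 1/n, the second term adds a uniform floor of (1-η)/n to every transition weight.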

The new relationship weight w^*_ij between image i and image j is calculated as:

$w_{ij}^{*} = w''_{ij} \cdot w_{ij}^{R}$

where w″_ij is the weight between image i and image j, i.e. the entry in row i, column j of W″, and w^R_ij is the probability weight of image i jumping to image j, i.e. the entry in row i, column j of W^R;

This finally gives the regularized enhanced relationship matrix W^*, whose entry in row i, column j is w^*_ij.

Specific examples are shown in Figures 5 and 6. Figure 5 shows the probability transition weights between images. In Figure 6, the upper-left panel 1 shows the enhanced relationship matrix W″ between images, where solid lines connect neighbor images and dashed lines represent enhanced relationships between two images; the upper-right panel 2 shows the transition probability matrix W^R, where solid lines indicate that a transition relationship exists between images; and the lower panel 3 shows the regularized enhanced relationship matrix W^* between images, obtained by multiplying W″ and W^R entry by entry;
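The two operations of this step can be sketched as below. One point is an assumption: the text writes W″ = W′ * W′ without saying whether this is a matrix product or an entry-wise product, and the sketch reads it as a matrix product because the step is described as propagating neighbor relationships. The final product with W^R is entry-wise, as the formula for w^*_ij states. The function name `regularize` and the toy matrices are hypothetical.

```python
import numpy as np

def regularize(W_prime, W_R):
    """Return W* with w*_ij = w''_ij * w^R_ij, where W'' = W' W'."""
    W_double = W_prime @ W_prime   # assumed reading of "W'' = W' * W'"
    return W_double * W_R          # entry-wise product, as in the formula

W_prime = np.array([[1., 1., 0.],
                    [1., 1., 0.],
                    [0., 0., 2.]])
W_R = np.full((3, 3), 1.0 / 3.0)   # uniform jump probabilities for illustration
W_star = regularize(W_prime, W_R)
```

An entry of W^* is nonzero only where the propagated relationship W″ is nonzero, with W^R rescaling it by the jump probability.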

7. Construct the objective function from the regularized enhanced relationship matrix W^* and solve for the generalized feature matrix A.

First select the features x_i and x_j of any two images from the sample data X; the relationship weight of the two images is W_ij, their positive relationship weight is W^P_ij, and their negative relationship weight is W^N_ij. The generalized feature matrix A is computed from the following objective equation:

$X (L^{N} - \gamma L^{P}) X^{T} A = \lambda X L X^{T} A$
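One way to solve this generalized eigenproblem numerically is sketched below. Several points are assumptions rather than statements from the text: the unnormalized Laplacian L = D - W is used, a small ridge term keeps X L X^T positive definite, the eigenvectors with the largest eigenvalues are kept, and the reduced dimension `dim` is a free parameter. The function names and toy data are hypothetical.

```python
import numpy as np

def laplacian(W):
    """Unnormalized graph Laplacian L = D - W, D = diag(row sums of W)."""
    return np.diag(W.sum(axis=1)) - W

def solve_projection(X, W, W_P, W_N, gamma=1.0, dim=2, ridge=1e-6):
    """Solve X (L^N - gamma L^P) X^T a = lambda X L X^T a; keep `dim` vectors."""
    M = X @ (laplacian(W_N) - gamma * laplacian(W_P)) @ X.T
    B = X @ laplacian(W) @ X.T + ridge * np.eye(X.shape[0])  # assumed ridge
    C = np.linalg.cholesky(B)                  # B = C C^T
    T = np.linalg.solve(C, M)                  # C^{-1} M
    S = np.linalg.solve(C, T.T)                # C^{-1} M C^{-T}
    S = (S + S.T) / 2.0                        # remove rounding asymmetry
    vals, V = np.linalg.eigh(S)                # symmetric standard problem
    order = np.argsort(-vals)[:dim]            # assumed: largest eigenvalues
    A = np.linalg.solve(C.T, V[:, order])      # map back: a = C^{-T} v
    return A, vals[order]

# Toy data: 12 samples, 4 features, a random symmetric neighbor graph.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 12))
labels = np.repeat([0, 1], 6)
W = np.triu((rng.random((12, 12)) < 0.4).astype(float), 1)
W = W + W.T
same = labels[:, None] == labels[None, :]
W_P, W_N = W * same, W * ~same
A, lam = solve_projection(X, W, W_P, W_N, gamma=1.0, dim=2)
```

The Cholesky reduction turns the generalized problem into a symmetric standard one, so the eigenvalues come out real; each column of A is one projection direction.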

8. Use the generalized feature matrix A to reduce the dimensionality of the image data in the image instance feature library and obtain the final image representation, i.e. AU = A*(u_1, …, u_M) = (A*u_1, …, A*u_M). Writing y_i = A*u_i, i = 1, …, M, the final image representation is Y = (y_1, …, y_M);

9. Use the generalized feature matrix A to reduce the dimensionality of the feature v of the image to be retrieved, obtaining the image representation f of the image to be retrieved, f = A*v;

10. Compute the similarity between the image to be retrieved and the images in the image instance database:

Euclidean distance is used to compute the similarity between the image to be retrieved and all images in the image instance database, i.e. compute ||f - y_i||², i = 1, …, M; the smaller ||f - y_i||² is, the greater the similarity. The images in the image instance database most similar to the image to be retrieved are output in descending order of similarity. As shown in Figure 7, the similarity between the image to be retrieved and all images in the image instance database is computed from the Euclidean distance, and the 4 most similar images are output in descending order of similarity.
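Steps 8 to 10 can be sketched together as follows. The sketch assumes that A stores one projection vector per column, so that the product written A*u in the text is computed as A^T u; the function name `retrieve`, the random stand-in for A and the toy library are hypothetical.

```python
import numpy as np

def retrieve(A, U, v, top=4):
    """Rank library images by squared Euclidean distance in the reduced space.

    A holds one projection vector per column (an assumed layout), so the
    reduced representations are A^T u_i for the library and A^T v for the query.
    """
    Y = A.T @ U                                    # library representation Y
    f = A.T @ v                                    # query representation f
    dist = np.sum((Y - f[:, None]) ** 2, axis=0)   # ||f - y_i||^2 for each i
    order = np.argsort(dist)[:top]                 # smallest distance first
    return order, dist[order]

rng = np.random.default_rng(0)
A = rng.normal(size=(112, 10))   # stand-in for the learned matrix A
U = rng.random((112, 50))        # toy library of 50 images
v = U[:, 7].copy()               # query identical to library image 7
idx, d = retrieve(A, U, v)
```

Sorting by increasing distance is the same as sorting by decreasing similarity, so `idx` lists the 4 most similar library images in output order.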

Example 2

Figure 1 is the retrieval flowchart of Example 2; the images in the figure come from the public Corel5k database. Item 2 in the figure preprocesses the original image, representing an image with color moments, Tamura texture features, Gabor texture features and a color histogram. Item 3 in the figure selects feature samples: 30 image classes are selected from the image instance database, each representing one semantic class and each containing 100 images, for 3000 images in total. To speed up computation, only the first 400 images of the result set are used as the global data set for building the relationship matrix W, the positive relationship matrix W^P and the negative relationship matrix W^N. The relationship matrix W is then enhanced to obtain W′, and the probability transition matrix W^R is used to regularize the enhanced relationship matrix, giving W^*. The generalized feature matrix A of the objective function is then solved from the regularized enhanced relationship matrix W^*. Finally, A is used to reduce the dimensionality of the image features in the image instance database and of the feature of the image to be retrieved, the Euclidean distance is used to compute the similarity between the image to be retrieved and the images in the image instance database, and the images most similar to the image to be retrieved are output in descending order of similarity.

The present invention provides an image retrieval method based on a regularized enhanced relationship matrix representation. There are many methods and approaches for implementing this technical solution, and the above is only a preferred embodiment of the invention. It should be noted that those of ordinary skill in the art can make improvements and refinements without departing from the principle of the invention, and these improvements and refinements should also be regarded as falling within the protection scope of the invention. All components not specified in this embodiment can be implemented with existing technology.

Claims

1. An image retrieval method based on relationship matrix regularization enhancement, characterized in that the method retrieves images from an image instance database and comprises the following steps:

Step 1, input the image to be retrieved;

Step 2, extract the features of the image to be retrieved and of the images in the image instance database, describing each image with an N-dimensional vector, N = 112, to obtain the image instance feature library and the feature of the image to be retrieved; the image instance database comprises more than 50 image classes, each image class represents one semantic class, and each image class comprises more than 600 images;

Step 3, select P image classes from the image instance feature library, with P in the range 20 to 50, and select n images from each image class, with n in the range 100 to 500; the n × P images of the P image classes form the sample data X;

Step 4, based on a manifold learning algorithm from spectral graph theory, construct for the sample data X the relationship matrix W, the positive relationship matrix W^P and the negative relationship matrix W^N;

Step 5, enhance the constructed relationship matrix W to initially establish the enhanced relationship matrix W′;

Step 6, regularize the enhanced relationship matrix W′ with the probability transition matrix to obtain the regularized enhanced relationship matrix W^*;

Step 7, construct the objective equation from the regularized enhanced relationship matrix W^* and compute the generalized feature matrix A;

Step 8, use the generalized feature matrix A to reduce the dimensionality of all images in the image instance feature library, obtaining the final image representation;

Step 9, use the generalized feature matrix A to reduce the dimensionality of the image to be retrieved, obtaining the image representation of the image to be retrieved;

Step 10, compute the similarity between the image to be retrieved and all images in the image instance database from the Euclidean distance between the final image representation of step 8 and the image representation of the image to be retrieved of step 9, and output the images in the image instance database most similar to the image to be retrieved in descending order of similarity.

2. The image retrieval method based on relationship matrix regularization enhanced representation according to claim 1, characterized in that, in step 2, the image features comprise color moments, Tamura texture features, Gabor texture features and a color histogram.

3. The image retrieval method based on relationship matrix regularization enhanced representation according to claim 2, characterized in that step 4 specifically comprises the following steps: randomly select an image from the sample data X and compute the Euclidean distance between it and the other images in X; using relevance feedback retrieval, set up the positive set and the negative set from the same-class and different-class images in the returned results; and build the relationship matrices with the simple k-nearest-neighbor method, in which the weight between two images that are k-nearest neighbors and belong to the same image class is 1, and otherwise 0.

4. The image retrieval method based on relationship matrix regularization enhanced representation according to claim 3, characterized in that, in step 4, the feedback-based augmented relation embedding (ARE) method is adopted as the manifold learning algorithm based on spectral graph theory, comprising the following steps:

(1) First construct the relationship matrix W for the sample data X: randomly draw an image I with feature x_i from X and use the k-nearest-neighbor method to compute the Euclidean distance between x_i and the features of the other images in X, obtaining the k images most similar to image I, with k in the range 5 to 10;

Take any image T from the k images, with feature x_t: the weight W_it between image I and image T is 1, and the weight between image I and any image outside the k images is 0; that is, W_it = 1 if x_i ∈ N^k(x_t) or x_t ∈ N^k(x_i), where N^k(x_i) denotes the k-nearest-neighbor set of image x_i and N^k(x_t) denotes the k-nearest-neighbor set of image x_t. This gives the relationship matrix W, whose entry in row i, column t is W_it;

Among the k images, those in the same image class as image I are recorded as the positive set Pos, and those in a different image class are recorded as the negative set Neg;

(2) Construct the positive relationship matrix W^P: if image R, with feature x_r, belongs to the same image class as image I and is among the k images, the weight between image I and image R is 1, and the weight between image I and any image outside the k images is 0. W^P_ir is the weight between image I and image R, x_i, x_r ∈ Pos means that the features x_i and x_r belong to the positive set Pos, and the entry in row i, column r of W^P is W^P_ir. The formula is:

$W^{P}_{ir} = \begin{cases} 1, & x_i, x_r \in Pos \\ 0, & \text{otherwise} \end{cases}$

(3) Construct the negative relationship matrix W^N: if image H, with feature x_h, belongs to a different image class from image I and is among the k images, the weight between image I and image H is 1, and the weight between image I and any image outside the k images is 0; that is, x_i ∈ Pos and x_h ∈ Neg, or x_h ∈ Pos and x_i ∈ Neg. Here x_i ∈ Pos means that feature x_i belongs to the positive set Pos, x_h ∈ Neg means that feature x_h belongs to the negative set Neg, x_h ∈ Pos means that feature x_h belongs to the positive set Pos, and x_i ∈ Neg means that feature x_i belongs to the negative set Neg. W^N_ih is the weight between image I and image H, and the entry in row i, column h of W^N is W^N_ih. The formula is:

$W^{N}_{ih} = \begin{cases} 1, & x_i \in Pos \text{ and } x_h \in Neg, \text{ or } x_h \in Pos \text{ and } x_i \in Neg \\ 0, & \text{otherwise} \end{cases}$

Finally, the three relationship matrices W, W^P and W^N are obtained; these are the relationship matrices needed to compute the generalized feature matrix.

5. The image retrieval method based on relationship matrix regularization enhanced representation according to claim 4, characterized in that step 5 specifically comprises the following steps: starting from the relationship matrix W, if image z is a neighbor image of image i and image z is also a neighbor image of image j, the enhanced weight W′_ij between image i and image j is calculated as $W'_{ij} = \sum_{z} W_{iz} W_{jz}$, where W_iz is the weight between image i and image z, W_jz is the weight between image j and image z, and W′_ij is the entry in row i, column j of the enhanced relationship matrix W′.

6. The image retrieval method based on relationship matrix regularization enhanced representation according to claim 5, characterized in that step 6 specifically comprises the following steps:

Propagating the neighbor relationships between images multiple times gives a new enhanced relationship matrix W″, with the formula W″ = W′ * W′;

A transition probability matrix represents the transition relationships between images. The corresponding transition matrix is $P = [P_{ij}]_{n \times n}$, where P_ij = P(j|i) is the transition probability from any image i to any image j in the sample data X. The n images most similar to image i are selected by Euclidean distance, the feature of image j is x_j, and the transition probability P(j|i) is computed from d_ij, where $d_{ij} = \|x_i - x_j\|^{2}$ denotes the Euclidean distance between the features of image i and image j;

The regularization model W^R of the relationship matrix is computed with the following formula:

$W^{R} = \eta P + (1-\eta)\, g e^{T}$

where η is the probability that the transition from image i to image j takes place, (1-η) is the probability that image i jumps randomly, g = (1/n)e is a uniform distribution vector, e = (1,1,…)^T is the n-dimensional all-ones column vector, n is the number of images in each image class, and the entry in row i, column j of the matrix P is P(j|i);

The new relationship weight w^*_ij between image i and image j is calculated as:

$w_{ij}^{*} = w''_{ij} \cdot w_{ij}^{R}$

where w″_ij is the weight between image i and image j, i.e. the entry in row i, column j of W″, and w^R_ij is the probability weight of image i jumping to image j, i.e. the entry in row i, column j of W^R;

This finally gives the regularized enhanced relationship matrix W^*, whose entry in row i, column j is w^*_ij.

7. The image retrieval method based on relationship matrix regularization enhanced representation according to claim 6, characterized in that step 7 comprises the following steps:

First select the features x_i and x_j of any two images from the sample data X; the relationship weight of the two images is W_ij, their positive relationship weight is W^P_ij, and their negative relationship weight is W^N_ij. The generalized feature matrix A is computed from the following objective equation:

$X (L^{N} - \gamma L^{P}) X^{T} A = \lambda X L X^{T} A$

where L is the Laplacian matrix of the relationship matrix W, L^N is the Laplacian matrix of the negative relationship matrix W^N, L^P is the Laplacian matrix of the positive relationship matrix W^P, γ is a constant proportional to the ratio of the number of negative images to the number of positive images, X^T denotes the transpose of the sample data X, and λ denotes an eigenvalue of the equation.
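For reference, a sketch of the standard construction behind the equation in this claim, assuming the unnormalized graph Laplacian (the claim does not say which Laplacian is meant) and a symmetric weight matrix W. The diagonal degree matrix D and the single projection vector a, one column of A, are symbols introduced here for illustration:

```latex
\[
L = D - W, \qquad D_{ii} = \sum_{j} W_{ij}, \qquad
\frac{1}{2}\sum_{i,j} W_{ij}\,\bigl(a^{T}x_i - a^{T}x_j\bigr)^{2} = a^{T} X L X^{T} a .
\]
```

L^P and L^N follow in the same way from W^P and W^N. Under this reading, the left-hand matrix X(L^N - γL^P)X^T measures how far apart the projection keeps negative pairs minus γ times how far apart it keeps positive pairs.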