CN117409217A

CN117409217A - An inter-domain difference measurement method for image data sets

Info

Publication number: CN117409217A
Application number: CN202311013494.2A
Authority: CN
Inventors: 欧中洪; 朱子谦; 郭翼天; 林炎龙; 宋美娜; 尧思远
Original assignee: Beijing University of Posts and Telecommunications
Current assignee: Beijing University of Posts and Telecommunications
Priority date: 2023-08-11
Filing date: 2023-08-11
Publication date: 2024-01-16
Anticipated expiration: 2043-08-11
Also published as: CN117409217B

Abstract

The present invention proposes an inter-domain difference measurement method for image data sets, which includes: performing mathematical feature analysis on the image data set to realize mathematical abstraction of the difference measurement of the image data set; and establishing a mathematical method with targeted image features based on mathematical abstraction. Measurement method; optimize the mathematical measurement method; obtain the image data set to be processed, and perform difference measurement on the image data set to be processed according to the optimized mathematical measurement method. The present invention can calculate the difference between image data sets in an objective and quantitative manner, improves the generalization of image task domain adaptation, and also saves computing resources and running time.

Description

Inter-domain difference measurement method for image dataset

Technical Field

The invention belongs to the field of computer vision.

Background

In computer vision, domain (domain) generally refers to the source or distribution of a data set. In a real scene there are typically a plurality of different domains, each domain having a different data distribution, i.e. statistics of different data samples. Because of the different data distributions of the different domains, a model trained on one domain cannot simply be applied directly to another domain. Domain adaptation problems refer to the fact that a model is able to adaptively solve the same task for a corresponding domain from data of different domains. The domain adaptation problem essentially is to migrate knowledge of the source domain to the target domain, so that the distance between two domains, i.e. the distance between two domain data distributions, needs to be measured in order to find its commonalities and differences, thereby realizing that a model is trained in the source domain and can be effectively applied to the target domain.

At present, a common calculation method of the measurement distribution difference in the neural network mainly comprises KL divergence, JS divergence and Hellinger measurement. The KL divergence and JS divergence are simple in calculation mode and low in calculation cost, but the KL divergence and JS divergence do not meet symmetry and triangle inequality in measurement definition, so that the method has no universality; the Hellinger metric, while satisfying the metric definition, does not have generalization of the topology space of different attributes. Furthermore, none of the above methods for calculating the metric differences can calculate the distance between non-overlapping distributions, i.e. the magnitude of the difference between the hardly related image datasets can not be given.

As described above, the current domain metric method mainly has the following drawbacks:

the current image similarity measurement method is mainly to directly compare the similarity between two images or use a feature classifier to compare the similarity between two image sets. The use of feature classifiers necessarily has a bias in the features of a particular dataset or datasets, and the determination of complex or extreme environmental image sets is subject to large deviations.

Most of the existing methods cannot meet the requirements of non-negativity, symmetry, non-degeneracy and triangle inequality at the same time, cannot be popularized to the field with strict definition requirements on distance, and therefore application scenes of the methods are limited.

The existing set measurement mode needs to calculate the single element distance of each element in the set to be measured one by one, and the method has high calculation cost in the image set domain difference measurement task due to higher image dimension.

Disclosure of Invention

The present invention aims to solve at least one of the technical problems in the related art to some extent.

Therefore, the invention aims to provide an inter-domain difference measurement method for an image data set, which is used for realizing inter-domain difference measurement of a complete image data set.

To achieve the above object, an embodiment of a first aspect of the present invention provides an inter-domain difference measurement method for an image dataset, including:

performing mathematical feature analysis on an image dataset to realize mathematical abstraction of difference measurement of the image dataset;

establishing a mathematical measurement method with image characteristic pertinence based on the mathematical abstraction;

optimizing the mathematical measure;

and acquiring an image data set to be processed, and carrying out difference measurement on the image data set to be processed according to an optimized mathematical measurement method.

In addition, an inter-domain difference measurement method for an image dataset according to the above embodiment of the present invention may further have the following additional technical features:

further, in one embodiment of the present invention, the performing mathematical feature analysis on the image dataset includes:

regarding each image in the image dataset, regarding each pixel therein as a dimension;

define a dimension as p ² The standard metric space (N, d) defines a mapping f of arbitrary space a to N a→n such that all images can be mapped into the standard metric space, whereby the differences of the image dataset are abstracted to the distances of discrete distribution in the standard metric space.

Further, in an embodiment of the present invention, the establishing a mathematical metric method with image feature pertinence based on the mathematical abstraction includes:

the Wasserstein metric is used for the metric of the high-dimensional discrete distribution:

s is the source domain image distribution, T is the target domain image distribution, and pi [ S, T ] is the set of all the joint distributions of the source domain image distribution and the target domain image distribution;

the metric d in the standard metric space is constructed as follows:

wherein the method comprises the steps ofIs an adjustable parameter and is related to the dimension p of the standard measurement space ² Related to; wherein the method comprises the steps of

Wherein x and y are images of the source domain image set and the target domain image set, respectively, μ _x Representing the mean value, sigma, of the pixels of the image _x Representing the standard deviation, sigma, of the pixels of the image _xy Representing the covariance of two image pixels.

Further, in one embodiment of the present invention, after establishing a mathematical measurement method with image feature pertinence based on the mathematical abstraction, the method further includes calculating a high-dimensional image measurement according to the mathematical measurement method, which specifically includes:

equivalent transformation of Wasserstein metric:

W[S,T]＝min _P <C,P>

C _ij ＝d(x _i ,y _j )，

wherein C is a distance matrix in which the elements consist of the distances of individual images between different data sets, i.e. the metric values in a standard metric space, x _i Is the ith image, y in the source domain image dataset _j Is the j-th image in the target domain image dataset; p is a coupling matrix, representing an optimal transmission scheme; the inner product of the distance matrix C and the coupling matrix P of optimal transmission is the obtained Wasserstein metric.

Further, in one embodiment of the present invention, the method further includes:

introducing matrix entropy to perform iterative approximate solution on the Wasserstein metric:

H(P)＝-∑ _ij P _ij logP _ij

where epsilon is an adjustable regularization constant,is a regularized wasperstein metric; the regularized Wasserstein measurement is approximately solved by using a Sinkhorn iteration method:

u ⁽⁰⁾ ＝[0] _|S| ,v ⁽⁰⁾ ＝[0] _|T| ^T

u ⁽ⁿ⁺¹⁾ ＝u ⁽ⁿ⁾ +ε(logμ-LSE _J K(u ⁽ⁿ⁾ ,v ⁽ⁿ⁾ ))

v ⁽ⁿ⁺¹⁾ ＝v ⁽ⁿ⁾ +ε(logv-LSE _I K(u ⁽ⁿ⁺¹⁾ ,v ⁽ⁿ⁾ ))

where |s| is the cardinal number of S, i.e., the number of pictures of the source domain image dataset, and |t| is the number of pictures of the target domain image dataset; mu being an elementColumn vectors of total S elements, v being an element +.>Row vectors of total |T| elements, which respectively represent the weights of each image of two image datasets; u (u) ⁽⁰⁾ Is a column vector with 0 elements and total I S elements, v ⁽⁰⁾ Is an element ∈>Performing iteration in the formula (6) by taking a row vector of total I and T as an iteration starting point, wherein LSE is a logsumexp function, and the LSE is the maximum value of each row or each column;

obtaining a final vector after carrying out set times of iteration, obtaining a regularized coupling matrix, and finally obtaining regularized Wasserstein measurement:

where m is the maximum number of iterations,is a regularized coupling matrix.

Further, in an embodiment of the present invention, the optimizing the mathematical metric method includes:

solving regularized Wasserstein metrics according to random filling method, comprising setting a positive integer random threshold r, each time without undue experimentation from a source domain image dataset and a targetDomain image dataset random selectionAnd->The distance matrix is constructed by the sheet image, regularized Wasserstein metrics are calculated, and the sum of the regularized Wasserstein metrics of each batch is obtained from forward approximation to the original regularized Wasserstein metrics:

wherein S is _r And T _r Respectively representing image sets randomly selected from a source domain image dataset and a target domain image dataset in a single batch;

the approximate regularized Wasserstein metric calculated by the random filling method is used for replacing the original regularized Wasserstein metric, so that the time required for calculation is reduced.

To achieve the above object, an embodiment of the second aspect of the present invention provides an inter-domain difference measurement device for an image dataset, including:

the analysis module is used for carrying out mathematical characteristic analysis on the image data set and realizing mathematical abstraction of difference measurement of the image data set;

the construction module is used for establishing a mathematical measurement method with image characteristic pertinence based on the mathematical abstraction;

the optimizing module is used for optimizing the mathematical measurement method;

and the measurement module is used for acquiring an image data set to be measured and carrying out difference measurement on the image data set to be processed according to the optimized mathematical measurement method.

To achieve the above object, an embodiment of the present invention provides a computer device, which is characterized by comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements an inter-domain difference measurement method for image data sets as described above when executing the computer program.

To achieve the above object, a fourth aspect of the present invention provides a computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements an inter-domain difference measurement method for image data sets as described above.

According to the inter-domain difference measurement method for the image data set, mathematical abstraction of the difference measurement of the image data set is completed through mathematical feature analysis of the image data set; a set of mathematical measurement method with image characteristic pertinence is established based on the mathematical abstraction, and an optimization method is provided on the calculation mode, so that the optimization effect is remarkably improved; a complete set of inter-domain difference measurement methods for the image dataset is formed.

The invention has universality for various image data sets, can calculate the difference between the image data sets in an objective quantitative mode, improves the generalization of image task domain self-adaption, and saves calculation resources and running time.

Drawings

The foregoing and/or additional aspects and advantages of the invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings, in which:

fig. 1 is a flow chart of an inter-domain difference measurement method for an image dataset according to an embodiment of the present invention.

Fig. 2 is a schematic diagram of an inter-domain difference measurement device for an image dataset according to an embodiment of the present invention.

Detailed Description

Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative and intended to explain the present invention and should not be construed as limiting the invention.

An inter-domain difference measurement method for an image dataset according to an embodiment of the present invention is described below with reference to the accompanying drawings.

As shown in fig. 1, the inter-domain difference measurement method for an image dataset includes the following steps:

s101: performing mathematical characteristic analysis on the image data set to realize mathematical abstraction of difference measurement of the image data set;

two different sets of image data may be considered two sets, with each image being an element of a set; the comparison of the differences between the two sets is based on the comparison of the elements therein, requiring that all elements in the set be comparable, i.e. that the elements must be structurally identical to each other. For each image, each pixel is regarded as a dimension, and the value of the dimension is the corresponding pixel value, so that each image can be regarded as a point in the high-dimensional space.

Further, in one embodiment of the invention, performing mathematical feature analysis on an image dataset includes:

define a dimension as p ² The standard metric space (N, d) defines a mapping f of arbitrary space a to N, a→n, such that all images can be mapped into the standard metric space, whereby differences in the image dataset are abstracted to discrete distributed distances in the standard metric space.

S102: establishing a mathematical measurement method with image characteristic pertinence based on mathematical abstraction;

further, in one embodiment of the present invention, a mathematical metric method with image feature pertinence is established based on mathematical abstraction, including:

s is the source domain image distribution, T is the target domain image distribution, and pi [ S, T ] is the set of all joint distributions of the source domain image distribution and the target domain image distribution;

the metric d in the standard metric space is constructed as follows:

The meaning of the formula (1) is to find a joint distribution with the minimum transformation cost, namely the Wasserstein measurement.

Since the II x-y II in the formula (1) is defined by the metric d in the standard metric space, W [ S, T ] can be derived to prove that W [ S, T ] meets the non-negativity, non-degeneracy, symmetry and triangle inequality, namely W [ S, T ] also meets the metric definition, and the metric mode has good universality and generalization.

The single-element similarity kernel corresponds to a metric d in the standard metric space, representing the distance between two images mapped into the standard metric space. Unlike a general image similarity comparison task, the image domain comparison task does not require precise alignment between images, but rather it is desirable that images with similar semantic information have a higher degree of alignment.

Based on the above, the single-element similarity kernel function adopts an SSIM index, as shown in formula (3), and the index simultaneously considers the brightness, contrast and structural information of the image, so that compared with Euclidean distance used in the traditional work, the single-element similarity kernel function is more suitable for measuring the similarity of intra-domain and inter-domain images. Furthermore, SSIM index has insensitivity to image translation, scaling, rotation, which is a drawback in the task of image accurate alignment, but is instead an advantage in the task of intra-domain, inter-domain image similarity measurement.

In order to construct the measurement d in the standard measurement space, the experimental result shows that the SSIM index of the same-domain image is lower than 0.3 and the SSIM index of the different-domain image is lower than 0.1, so that a downward convex mapping is required to be constructed, the absolute value of the measurement of the same-domain image is lower than the difference between the measurement values of two different-domain images, and the effects of small intra-domain distance and large inter-domain distance are achieved.

Further, in one embodiment of the present invention, after establishing the mathematical measurement method with image feature pertinence based on the mathematical abstraction, the method further includes calculating the high-dimensional image measurement according to the mathematical measurement method, which specifically includes:

equivalent transformation of Wasserstein metric:

W[S,T]＝min _P <C,P>

C _ij ＝d(x _i ,y _j )，

wherein C is a distance matrix, wherein the elements consist of the distances of individual images between different data sets, i.e. the metric values in a standard metric space, x _i Is the ith image, y in the source domain image dataset _j Is the j-th image in the target domain image dataset; p is a coupling matrix, representing an optimal transmission scheme; the inner product of the distance matrix C and the coupling matrix P of optimal transmission is the obtained Wasserstein metric.

introducing matrix entropy to perform iterative approximate solution on Wasserstein measurement:

H(P)＝-∑ _ij P _ij logP _ij

u ⁽⁰⁾ ＝[0] _|S| ,v ⁽⁰⁾ ＝[0] _|T| ^T

u ⁽ⁿ⁺¹⁾ ＝u ⁽ⁿ⁾ +ε(logμ-LSE _J K(u ⁽ⁿ⁾ ,v ⁽ⁿ⁾ ))

where |s| is the cardinal number of S, i.e., the number of pictures of the source domain image dataset, and |t| is the number of pictures of the target domain image dataset; mu being an elementColumn vectors of total S elements, v being an element +.>Row vectors of total |t| elements representing each image of two image datasetsWeights of (2); u (u) ⁽⁰⁾ Is a column vector with 0 elements and total I S elements, v ⁽⁰⁾ Is an element ∈>Performing iteration in the formula (6) by taking a row vector of total I and T as an iteration starting point, wherein LSE is a logsumexp function, and the LSE is the maximum value of each row or each column;

where m is the maximum number of iterations,is a regularized coupling matrix.

S103: optimizing a logarithmic metric method;

the overall calculation process of the metrics is divided into three steps. The first step is to complete the mapping of all images to the standard metric space, and uniformly scale all images to p×p size, i.e. the dimension of the standard metric space N. And secondly, completing construction of a distance matrix, and carrying out matching measurement on single images of the source image dataset and the target image dataset to obtain each element in the distance matrix C. The third step is to complete the approximate iteration solution of Wasserstein measurement, and iterate to obtain a regularized matrix by taking C as a base pointAnd obtaining the inner product of the two matrixes to obtain a metric value.

The most consumed computing resource and computing time in the overall step is step two. If a pairing metric is performed for each image of the source domain image dataset and the target domain image dataset, the metric for the two common datasets takes a very long time to complete. The proposal proposes an optimization method using distance matrix random filling to reduce the computational overhead.

Further, in one embodiment of the present invention, optimizing the digital metric method includes:

solving regularized Wasserstein metrics according to a random fill method includes setting a positive integer random threshold r, randomly selecting from a source domain image dataset and a target domain image dataset each time without undue experimentationAnd->The distance matrix is constructed by the sheet image, regularized Wasserstein metrics are calculated, and the sum of the regularized Wasserstein metrics of each batch is obtained from forward approximation to the original regularized Wasserstein metrics:

The approximate regularized Wasserstein metric calculated by random filling method can be slightly larger than the accurate value of the regularized Wasserstein metric within the acceptable error range, namelyMust be at->So the approximation solution is in the right neighborhood of (3) and the approximation solution is in factThe method can replace accurate solution for analysis and use when in use. The method greatly reduces calculation cost, and changes calculation time into original +.>

S104: and acquiring an image data set to be processed, and carrying out difference measurement on the image data set to be processed according to the optimized mathematical measurement method.

Compared with the prior art, the invention has the advantages that:

1) The current image similarity measurement method is mainly used for comparing the similarity between two images, and large deviation is easy to occur in the judgment of a complex or extreme environment image set; the mathematical abstraction of the image dataset difference measurement task is completed through the mathematical feature analysis of the image dataset, so that the mathematical abstraction can accurately describe the inter-domain measurement distance.

2) Most of the existing methods cannot meet the requirements of non-negativity, symmetry, non-degeneracy and triangular inequality at the same time. The proposal provides a method for calculating the similarity among single elements by using an SSIM kernel function and solving the similarity by using a sink horn high-dimensionality method, and optimizing the calculation process by using a block matrix operation mode, so as to finally obtain a strict domain measurement mode conforming to non-negativity, symmetry, non-degeneracy and triangle inequality, and the measurement mode can be popularized to a scene with strict definition requirements on distance.

In order to implement the above embodiment, the present invention also proposes an inter-domain difference measurement device for an image dataset.

Fig. 2 is a schematic structural diagram of an inter-domain difference measurement device for an image dataset according to an embodiment of the present invention.

As shown in fig. 2, the inter-domain difference measurement device for an image dataset includes: an analysis module 100, a construction module 200, an optimization module 300, a metrics module 400, wherein,

the construction module is used for establishing a mathematical measurement method with image characteristic pertinence based on mathematical abstraction;

the optimization module is used for optimizing the digital measurement method;

and the measurement module is used for acquiring the image data set to be measured and carrying out difference measurement on the image data set to be processed according to the optimized mathematical measurement method.

To achieve the above object, an embodiment of a third aspect of the present invention provides a computer device, which is characterized by comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the inter-domain difference measurement method for image dataset as described above when executing the computer program.

To achieve the above object, a fourth aspect of the present invention provides a computer-readable storage medium having stored thereon a computer program, wherein the computer program, when executed by a processor, implements an inter-domain difference measurement method for image dataset as described above.

In the description of the present specification, a description referring to terms "one embodiment," "some embodiments," "examples," "specific examples," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, the different embodiments or examples described in this specification and the features of the different embodiments or examples may be combined and combined by those skilled in the art without contradiction.

Furthermore, the terms "first," "second," and the like, are used for descriptive purposes only and are not to be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include at least one such feature. In the description of the present invention, the meaning of "plurality" means at least two, for example, two, three, etc., unless specifically defined otherwise.

While embodiments of the present invention have been shown and described above, it will be understood that the above embodiments are illustrative and not to be construed as limiting the invention, and that variations, modifications, alternatives and variations may be made to the above embodiments by one of ordinary skill in the art within the scope of the invention.

Claims

1. An inter-domain difference measurement method for an image dataset, comprising the steps of:

optimizing the mathematical measure;

2. The method of claim 1, wherein the performing mathematical feature analysis on the image dataset comprises:

define a dimension as p ² The standard metric space (N, d) defines a mapping f of arbitrary space A to N, A→N, such that all images can be mapped toIn the standard metric space, whereby the differences of the image dataset are abstracted to the distances of the discrete distributions in the standard metric space.

3. The method of claim 1, wherein the establishing a mathematical metric method having image feature pertinence based on the mathematical abstraction comprises:

the metric d in the standard metric space is constructed as follows:

4. The method according to claim 1, further comprising calculating a high-dimensional image metric according to the mathematical metric method after establishing the mathematical metric method having image feature pertinence based on the mathematical abstraction, specifically comprising:

equivalent transformation of Wasserstein metric:

W[S,T]＝min _P <C,P>

C _ij ＝d(x _i ,y _j )，

5. The method as recited in claim 4, further comprising:

H(P)＝-∑ _ij P _ij logP _ij

u ⁽⁰⁾ ＝[0] _|S| ,v ⁽⁰⁾ ＝[0] _|T| ^T

u ⁽ⁿ⁺¹⁾ ＝u ⁽ⁿ⁾ +ε(logμ-LSE _J K(u ⁽ⁿ⁾ ,v ⁽ⁿ⁾ ))

where m is the maximum stackThe number of times of substitution is counted,is a regularized coupling matrix.

6. The method of claim 1, wherein said optimizing said mathematical metric method comprises:

7. An inter-domain difference measurement device for an image dataset, comprising the following modules:

8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the image dataset oriented inter-domain difference measurement method of any of claims 1-6 when the computer program is executed.

9. A computer readable storage medium, on which a computer program is stored, which computer program, when being executed by a processor, implements the inter-domain difference measurement method for image dataset according to any of claims 1-6.