CN118365523A - Method, system, electronic device and storage medium for representing images of any scale - Google Patents
Method, system, electronic device and storage medium for representing images of any scale Download PDFInfo
- Publication number
- CN118365523A CN118365523A CN202410483340.8A CN202410483340A CN118365523A CN 118365523 A CN118365523 A CN 118365523A CN 202410483340 A CN202410483340 A CN 202410483340A CN 118365523 A CN118365523 A CN 118365523A
- Authority
- CN
- China
- Prior art keywords
- modulation
- resolution
- rendering
- low
- hidden
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4053—Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformations in the plane of the image
- G06T3/40—Scaling of whole images or parts thereof, e.g. expanding or contracting
- G06T3/4046—Scaling of whole images or parts thereof, e.g. expanding or contracting using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Image Processing (AREA)
Abstract
Description
技术领域Technical Field
本申请涉及图像处理技术领域,尤其涉及任意尺度图像的表示方法、系统、电子设备及存储介质。The present application relates to the field of image processing technology, and in particular to a method, system, electronic device and storage medium for representing images of any scale.
背景技术Background technique
隐式神经表示(INR)可以用于信号重建、图像和视频压缩,以及图像生成。INR基于神经辐射场(NeRF),学习多层感知机(MLP)来推断3D场景中任何给定坐标和视角的像素值(RGB)和密度。然而INR面临一个主要挑战是需要进行广泛的训练过程,以从头开始拟合信号。为了解决这个问题,一些人使用多个小型MLP独立学习信号的不同区域,另一种解决方案是超网络-基网络设计。在测试期间,预训练好的超网络能直接为任何输入信号生成INR。INR面临的另一个挑战是在渲染时MLP需要独立预测每一个像素点,这将会耗费大量计算成本,并需要漫长的运行时间。Implicit neural representations (INR) can be used for signal reconstruction, image and video compression, and image generation. INR is based on neural radiance fields (NeRFs) and learns a multi-layer perceptron (MLP) to infer the pixel value (RGB) and density of any given coordinates and viewing angles in a 3D scene. However, a major challenge facing INR is the need for an extensive training process to fit the signal from scratch. To address this issue, some people use multiple small MLPs to independently learn different areas of the signal. Another solution is the hypernetwork-base network design. During testing, the pre-trained hypernetwork can directly generate INR for any input signal. Another challenge facing INR is that the MLP needs to predict each pixel independently during rendering, which will cost a lot of computation and require a long running time.
任意尺度的图像超分辨方法旨在使用单一网络在任何放大倍数下进行分辨率重建,具有实用性和便利性,在INR的基础上,有人提出了一种局部隐式图像函数(LIIF),其使用MLP接收坐标和邻近的二维特征输入来推断该坐标上的像素值。尽管基于INR的任意尺度超分辨率方法可以在高达30倍的超分辨率中提供稳定的表现,但它们遵循在高分辨率-高位空间上解码的范式,导致了随着放大倍数二次增加的高昂计算成本和很长的运行时间,其计算成本也随着缩放倍数的增加而迅速增加。Arbitrary-scale image super-resolution methods aim to use a single network to reconstruct resolution at any magnification, which is practical and convenient. Based on INR, a local implicit image function (LIIF) is proposed, which uses MLP to receive coordinates and adjacent two-dimensional feature inputs to infer the pixel value at the coordinate. Although arbitrary-scale super-resolution methods based on INR can provide stable performance in super-resolution up to 30 times, they follow the paradigm of decoding in high-resolution-high-bit space, resulting in high computational cost and long running time with quadratic increase in magnification, and their computational cost also increases rapidly with the increase in scaling.
发明内容Summary of the invention
本申请实施例的主要目的在于提出任意尺度图像的表示方法、系统、电子设备及存储介质,旨在降低任意尺度图像表示的计算成本,并且提高图像处理速度。The main purpose of the embodiments of the present application is to propose a method, system, electronic device and storage medium for representing images of any scale, aiming to reduce the computational cost of representing images of any scale and improve the image processing speed.
为实现上述目的,本申请实施例的一方面提出了任意尺度图像的表示方法,所述方法包括:To achieve the above purpose, an embodiment of the present application provides a method for representing an image of any scale, the method comprising:
获取低分辨率图像、倍数与调制查找表和超分辨率倍数参数;Obtain low-resolution images, magnification and modulation lookup tables, and super-resolution magnification parameters;
通过编码器对所述低分辨率图像进行编码处理,得到低分辨率高维特征图;The low-resolution image is encoded by an encoder to obtain a low-resolution high-dimensional feature map;
将所述低分辨率高维特征图输入隐式多层感知机,得到每个特征向量的隐调制和压缩隐码;Inputting the low-resolution high-dimensional feature map into an implicit multi-layer perceptron to obtain an implicit modulation and compression code for each feature vector;
根据所述超分辨率倍数参数、所述压缩隐码、所述隐调制和所述倍数与调制查找表,采用目标渲染多层感知机对所述低分辨率图像进行渲染处理,得到超分辨率图像。According to the super-resolution multiple parameter, the compression hidden code, the hidden modulation and the multiple and modulation lookup table, a target rendering multi-layer perceptron is used to render the low-resolution image to obtain a super-resolution image.
在一些实施例中,所述获取低分辨率图像、倍数与调制查找表和超分辨率倍数参数的步骤中,获取倍数与调制查找表的步骤,包括:In some embodiments, in the step of acquiring the low-resolution image, the magnification and modulation lookup table and the super-resolution magnification parameter, the step of acquiring the magnification and modulation lookup table includes:
对常见的放大倍数进行采样,得到所述放大倍数的最小调制均值和最大调制均值;Sampling common magnifications to obtain a minimum modulation mean value and a maximum modulation mean value of the magnifications;
根据所述最小调制均值和所述最大调制均值生成倍数与调制查找表。A multiplier and modulation lookup table is generated according to the minimum modulation mean value and the maximum modulation mean value.
在一些实施例中,所述将所述低分辨率高维特征图输入隐式多层感知机,得到每个特征向量的隐调制和压缩隐码,包括:In some embodiments, the step of inputting the low-resolution high-dimensional feature map into an implicit multi-layer perceptron to obtain an implicit modulation and compression code of each feature vector includes:
对所述特征图进行特征向量提取处理,得到高维隐码;Performing feature vector extraction processing on the feature map to obtain a high-dimensional hidden code;
通过隐式多层感知机对所述高维隐码进行感知处理,得到第一处理结果;Performing perceptual processing on the high-dimensional latent code through an implicit multi-layer perceptron to obtain a first processing result;
对所述第一处理结果进行通道方向分割处理,得到压缩隐码;Performing channel-wise segmentation processing on the first processing result to obtain a compressed hidden code;
根据所述高维隐码生成隐调制。Hidden modulation is generated according to the high-dimensional hidden code.
在一些实施例中,所述隐调制由若干个特征线性调制层组成;其中,每层所述特征线性调制层包括一个缩放调制和一个偏移调制。In some embodiments, the implicit modulation is composed of several characteristic linear modulation layers; wherein each characteristic linear modulation layer includes a scaling modulation and an offset modulation.
在一些实施例中,所述根据所述超分辨率倍数参数、所述压缩隐码、所述隐调制和所述倍数与调制查找表,采用目标渲染多层感知机对所述低分辨率图像进行渲染处理,得到超分辨率图像,包括:In some embodiments, the method of rendering the low-resolution image using a target rendering multi-layer perceptron according to the super-resolution multiple parameter, the compression hidden code, the hidden modulation, and the multiple and modulation lookup table to obtain a super-resolution image includes:
初始化第一渲染倍数和第一渲染图像,并计算每个所述压缩隐码的偏移调制均值;Initializing a first rendering multiple and a first rendering image, and calculating an offset modulation mean value of each of the compression hidden codes;
根据所述倍数与调制查找表的下一个倍数,更新所述第一渲染倍数;Update the first rendering multiple according to the multiple and the next multiple in the modulation lookup table;
采用双线性插值将所述第一渲染图像采样到与所述第一渲染倍数对应的分辨率;Using bilinear interpolation to sample the first rendered image to a resolution corresponding to the first rendering multiple;
查询所述倍数与调制查找表,确定所述第一渲染倍数的调制均值范围;Querying the multiple and modulation lookup table to determine the modulation mean range of the first rendering multiple;
将所述偏移调制均值落入所述调制均值范围内的所述压缩隐码输入目标渲染多层感知机中进行渲染,得到所述第一渲染倍数的渲染像素值;Inputting the compressed hidden code whose offset modulation mean value falls within the modulation mean value range into a target rendering multi-layer perceptron for rendering, to obtain a rendering pixel value of the first rendering multiple;
返回执行根据所述倍数与调制查找表的下一个倍数,更新所述第一渲染倍数的步骤,直至所有的所述压缩隐码已被解码或者所述第一渲染倍数为最终需要的超分辨率倍数,得到目标超分辨率图像。Return to execute the step of updating the first rendering multiple according to the multiple and the next multiple of the modulation lookup table, until all the compression codes have been decoded or the first rendering multiple is the final required super-resolution multiple, to obtain the target super-resolution image.
在一些实施例中,所述方法还包括:In some embodiments, the method further comprises:
采用所述隐调制对初始渲染多层感知机进行调制,得到目标渲染多层感知机。The implicit modulation is used to modulate the initial rendering multi-layer perceptron to obtain a target rendering multi-layer perceptron.
在一些实施例中,所述目标渲染多层感知机在每一层进行渲染的表达式为:In some embodiments, the target rendering multi-layer perceptron performs rendering at each layer as follows:
其中,表示第k层隐藏特征;表示第k+1层隐藏特征;⊙表示通道方向点乘;σ表示激活函数;表示目标渲染多层感知机第k+1层的渲染处理。in, represents the hidden features of the kth layer; represents the hidden features of the k+1th layer; ⊙ represents the channel direction dot product; σ represents the activation function; Represents the rendering process of the k+1th layer of the target rendering multi-layer perceptron.
为实现上目的,本申请实施例的另一方面提出了任意尺度图像的表示系统,所述系统包括:To achieve the above purpose, another aspect of the present application provides a representation system for an image of any scale, the system comprising:
第一模块,用于获取低分辨率图像、倍数与调制查找表和超分辨率倍数参数;The first module is used to obtain low-resolution images, magnification and modulation lookup tables, and super-resolution magnification parameters;
第二模块,用于通过编码器对所述低分辨率图像进行编码处理,得到低分辨率高维特征图;The second module is used to encode the low-resolution image through an encoder to obtain a low-resolution high-dimensional feature map;
第三模块,用于将所述低分辨率高维特征图输入隐式多层感知机,得到每个特征向量的隐调制和压缩隐码;The third module is used to input the low-resolution high-dimensional feature map into an implicit multi-layer perceptron to obtain an implicit modulation and compression code of each feature vector;
第四模块,用于根据所述超分辨率倍数参数、所述压缩隐码、所述隐调制和所述倍数与调制查找表,采用目标渲染多层感知机对所述低分辨率图像进行渲染处理,得到超分辨率图像。The fourth module is used to render the low-resolution image using a target rendering multi-layer perceptron according to the super-resolution multiple parameter, the compression hidden code, the hidden modulation and the multiple and modulation lookup table to obtain a super-resolution image.
为实现上述目的,本申请实施例的另一方面提出了一种电子设备,所述电子设备包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现前面所述的方法。To achieve the above objective, another aspect of an embodiment of the present application provides an electronic device, the electronic device comprising a memory and a processor, the memory storing a computer program, and the processor implementing the above-mentioned method when executing the computer program.
为实现上述目的,本申请实施例的另一方面提出了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现前面所述的方法。To achieve the above objective, another aspect of an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the method described above is implemented.
本申请实施例至少包括以下有益效果:通过将低分辨率高维特征图输入隐式多层感知机,得到每个特征向量的隐调制和压缩隐码,可以将解码过程从高分辨率高维空间解耦到低分辨率高维的隐空间和高分辨率低维的渲染空间,有利于降低计算成本。结合超分辨率倍数参数、压缩隐码、隐调制和倍数与调制查找表,采用目标渲染多层感知机对低分辨率图像进行渲染,可以实现以不同的精确度和速度进行渲染,同时自然适应图像内容。本申请的实施例具有计算成本低、图像处理速度快的优点。The embodiments of the present application include at least the following beneficial effects: by inputting a low-resolution high-dimensional feature map into an implicit multi-layer perceptron, the implicit modulation and compression hidden code of each feature vector are obtained, and the decoding process can be decoupled from the high-resolution high-dimensional space to the low-resolution high-dimensional hidden space and the high-resolution low-dimensional rendering space, which is beneficial to reducing the computational cost. In combination with super-resolution multiple parameters, compression hidden codes, hidden modulation, and multiple and modulation lookup tables, a target rendering multi-layer perceptron is used to render low-resolution images, which can achieve rendering with different accuracy and speed, while naturally adapting to the image content. The embodiments of the present application have the advantages of low computational cost and fast image processing speed.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
附图用来提供对本申请技术方案的进一步理解,并且构成说明书的一部分,与本申请的实施例一起用于解释本申请的技术方案,并不构成对本申请技术方案的限制。The accompanying drawings are used to provide further understanding of the technical solution of the present application and constitute a part of the specification. Together with the embodiments of the present application, they are used to explain the technical solution of the present application and do not constitute a limitation on the technical solution of the present application.
图1是本申请实施例提供的任意尺度图像的表示方法的步骤图;FIG1 is a step diagram of a method for representing an image of any scale provided in an embodiment of the present application;
图2是本申请实施例提供的任意尺度图像表示方法的整体框架图;FIG2 is an overall framework diagram of an arbitrary scale image representation method provided by an embodiment of the present application;
图3是本申请实施例提供的任意初度图像的表示方法的数据处理流程图;FIG3 is a data processing flow chart of a method for representing an arbitrary primary image provided in an embodiment of the present application;
图4是本申请实施例提供的4超分辨率下的偏移调制归一化均值示意图;FIG4 is a schematic diagram of a normalized mean value of offset modulation under 4 super-resolutions provided in an embodiment of the present application;
图5是本申请实施例提供的LM-LIIF预测图像与双线性上采样图像之间的归一化残差示意图;FIG5 is a schematic diagram of the normalized residual between the LM-LIIF predicted image and the bilinear upsampled image provided in an embodiment of the present application;
图6是本申请实施例提供的任意尺度图像的表示系统的结构示意图;FIG6 is a schematic diagram of the structure of a system for representing images of any scale provided in an embodiment of the present application;
图7是本申请实施例提供的电子设备的硬件结构示意图。FIG. 7 is a schematic diagram of the hardware structure of an electronic device provided in an embodiment of the present application.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请实施例相一致的所有实施方式,它们仅是与如所附权利要求书中所详述的、本申请实施例的一些方面相一致的装置和方法的例子。In order to make the purpose, technical solutions and advantages of the present application clearer, the present application is further described in detail below in conjunction with the accompanying drawings and examples. It should be understood that the specific embodiments described herein are only used to explain the present application and are not intended to limit the present application. When the following description refers to the drawings, unless otherwise indicated, the same numbers in different drawings represent the same or similar elements. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the embodiments of the present application. They are only examples of devices and methods consistent with some aspects of the embodiments of the present application as detailed in the attached claims.
虽然在系统示意图中进行了功能模块划分,在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于系统中的模块划分,或流程图中的顺序执行所示出或描述的步骤。说明书和权利要求书及上述附图中的术语“第一/S100”、“第二/S200”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。Although the functional modules are divided in the system schematic diagram and the logical order is shown in the flowchart, in some cases, the steps shown or described may be performed in a different order than the module division in the system or the order in the flowchart. The terms "first/S100", "second/S200", etc. in the specification, claims and the above drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence.
可以理解,本申请所使用的术语“第一”、“第二”等可在本文中用于描述各种概念,但除非特别说明,这些概念不受这些术语限制。这些术语仅用于将一个概念与另一个概念区分。例如,在不脱离本申请实施例范围的情况下,第一信息也可以被称为第二信息,类似地,第二信息也可以被称为第一信息。取决于语境,如在此所使用的词语“若”、“如果”可以被解释成为“在……时”或“当……时”或“响应于确定”。It is understood that the terms "first", "second", etc. used in this application can be used to describe various concepts in this article, but unless otherwise specified, these concepts are not limited by these terms. These terms are only used to distinguish one concept from another concept. For example, without departing from the scope of the embodiment of the present application, the first information may also be referred to as the second information, and similarly, the second information may also be referred to as the first information. Depending on the context, the words "if" and "if" as used herein can be interpreted as "at the time of" or "when" or "in response to determination".
本申请所使用的术语“至少一个”、“多个”、“每个”、“任一”等,至少一个包括一个、两个或两个以上,多个包括两个或两个以上,每个是指对应的多个中的每一个,任一是指多个中的任意一个。The terms "at least one", "multiple", "each", "any", etc. used in this application, at least one includes one, two or more, multiple includes two or more, each refers to each of the corresponding multiple, and any refers to any one of the multiple.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本申请的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。Reference to "embodiments" herein means that a particular feature, structure, or characteristic described in conjunction with the embodiments may be included in at least one embodiment of the present application. The appearance of the phrase in various locations in the specification does not necessarily refer to the same embodiment, nor is it an independent or alternative embodiment that is mutually exclusive with other embodiments. It is explicitly and implicitly understood by those skilled in the art that the embodiments described herein may be combined with other embodiments.
除非另有定义,本文所使用的所有的技术和科学术语与属于本申请的技术领域的技术人员通常理解的含义相同。本文中所使用的术语只是为了描述本申请实施例的目的,不是旨在限制本申请。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as those commonly understood by those skilled in the art to which this application belongs. The terms used herein are only for the purpose of describing the embodiments of this application and are not intended to limit this application.
相关技术中,基于隐式神经表示(INR)的任意尺度图像表示研究通过使用多层感知机(MLP)解码低分辨率特征,用于进行任意尺度超分辨率重建,然而,这些连续的图像表示通常是在高分辨率-高维空间中完成解码的,这导致计算成本随着图像放大倍数的增加而呈二次增加,不利于任意尺度超分的实际应用。In the related art, the research on arbitrary-scale image representation based on implicit neural representation (INR) uses a multi-layer perceptron (MLP) to decode low-resolution features for arbitrary-scale super-resolution reconstruction. However, these continuous image representations are usually decoded in a high-resolution-high-dimensional space, which causes the computational cost to increase quadratically with the increase in the image magnification, which is not conducive to the practical application of arbitrary-scale super-resolution.
有鉴于此,本申请实施例中提供任意尺度图像的表示方法、系统、电子设备及存储介质,该方案提出一种隐调制函数(Latent Modulated Function,LMF),结合该函数进行任意尺度图像表示,可以将高分辨率-高维空间解码过程解耦为低分辨率-高维空间中的共享隐解码过程,以及高分辨率-低维空间中的独立渲染过程,从而实现了连续图像表示的首个计算上的最优范式。进一步地,本申请实施例提出利用隐空间中的高维MLP为每个低分辨率特征向量生成对应的隐空间调制,这使得在渲染空间中的倍调制的低维MLP能够迅速适应任何输入特征向量,并以任意分辨率进行渲染。本申请实施例进一步利用调制强度与输入图像复杂度之间的正相关关系提出了可控多尺度渲染方法(Controllable Multi-ScaleRendering,CMSR),结合该方法进行图像渲染,可以基于渲染精度提供灵活的解码效率调整。In view of this, the present application provides a method, system, electronic device and storage medium for representing images of any scale. The scheme proposes a latent modulation function (LMF). By combining the latent modulation function with the LMF for representing images of any scale, the high-resolution-high-dimensional space decoding process can be decoupled into a shared latent decoding process in a low-resolution-high-dimensional space and an independent rendering process in a high-resolution-low-dimensional space, thereby realizing the first computationally optimal paradigm for continuous image representation. Furthermore, the present application proposes to use a high-dimensional MLP in the latent space to generate a corresponding latent space modulation for each low-resolution feature vector, which enables the low-dimensional MLP with multiple modulation in the rendering space to quickly adapt to any input feature vector and render at any resolution. The present application further proposes a controllable multi-scale rendering method (CMSR) using the positive correlation between the modulation intensity and the complexity of the input image. By combining this method for image rendering, flexible decoding efficiency adjustment can be provided based on rendering accuracy.
本申请实施例提供的任意尺度图像的表示方法,涉及图像处理技术领域。本申请实施例提供的任意尺度图像的表示方法可应用于终端中,也可应用于服务器中,还可以是运行于终端或服务器中的软件。在一些实施例中,终端可以是智能手机、平板电脑、笔记本电脑、台式计算机、智能音箱、智能手表以及车载终端等,但并不局限于此;服务器端可以配置成独立的物理服务器,也可以配置成多个物理服务器构成的服务器集群或者分布式系统,还可以配置成提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN以及大数据和人工智能平台等基础云计算服务的云服务器,服务器还可以是区块链网络中的一个节点服务器;软件可以是实现任意尺度图像的表示方法的应用等,但并不局限于以上形式。The representation method of an image of any scale provided in the embodiment of the present application relates to the field of image processing technology. The representation method of an image of any scale provided in the embodiment of the present application can be applied to a terminal, can also be applied to a server, and can also be software running in a terminal or a server. In some embodiments, the terminal can be a smart phone, a tablet computer, a laptop computer, a desktop computer, a smart speaker, a smart watch, and a car terminal, etc., but is not limited to this; the server side can be configured as an independent physical server, or it can be configured as a server cluster or a distributed system composed of multiple physical servers, and can also be configured to provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms and other basic cloud computing services. The cloud server, the server can also be a node server in a blockchain network; the software can be an application that implements the representation method of an image of any scale, etc., but is not limited to the above forms.
本申请可用于众多通用或专用的计算机系统环境或配置中。例如:个人计算机、服务器计算机、手持设备或便携式设备、平板型设备、多处理器系统、基于微处理器的系统、置顶盒、可编程的消费电子设备、网络PC、小型计算机、大型计算机、包括以上任何系统或设备的分布式计算环境等等。本申请可以在由计算机执行的计算机可执行指令的一般上下文中描述,例如程序模块。一般地,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、对象、组件、数据结构等等。也可以在分布式计算环境中实践本申请,在这些分布式计算环境中,由通过通信网络而被连接的远程处理设备来执行任务。在分布式计算环境中,程序模块可以位于包括存储设备在内的本地和远程计算机存储介质中。The present application can be used in many general or special computer system environments or configurations. For example: personal computers, server computers, handheld or portable devices, tablet devices, multiprocessor systems, microprocessor-based systems, set-top boxes, programmable consumer electronics, network PCs, minicomputers, mainframe computers, distributed computing environments including any of the above systems or devices, etc. The present application can be described in the general context of computer executable instructions executed by a computer, such as program modules. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform specific tasks or implement specific abstract data types. The present application can also be practiced in distributed computing environments, in which tasks are performed by remote processing devices connected through a communication network. In a distributed computing environment, program modules can be located in local and remote computer storage media including storage devices.
需要说明的是,在本申请的各个具体实施方式中,当涉及到需要根据用户信息、用户行为数据,用户历史数据以及用户位置信息等与用户身份或特性相关的数据进行相关处理时,都会先获得用户的许可或者同意,而且,对这些数据的收集、使用和处理等,都会遵守相关法律法规和标准。此外,当本申请实施例需要获取用户的敏感个人信息时,会通过弹窗或者跳转到确认页面等方式获得用户的单独许可或者单独同意,在明确获得用户的单独许可或者单独同意之后,再获取用于使本申请实施例能够正常运行的必要的用户相关数据。It should be noted that in each specific implementation of the present application, when it comes to the need to perform relevant processing based on data related to user identity or characteristics such as user information, user behavior data, user historical data, and user location information, the user's permission or consent will be obtained first, and the collection, use, and processing of these data will comply with relevant laws, regulations, and standards. In addition, when the embodiment of the present application needs to obtain the user's sensitive personal information, the user's separate permission or consent will be obtained through a pop-up window or by jumping to a confirmation page. After clearly obtaining the user's separate permission or consent, the necessary user-related data for the normal operation of the embodiment of the present application will be obtained.
图1是本申请实施例提供的任意尺度图像的表示方法的一个可选的步骤图,图1中的方法可以包括但不限于包括步骤S100~S400。FIG. 1 is an optional step diagram of a method for representing an image of any scale provided in an embodiment of the present application. The method in FIG. 1 may include but is not limited to steps S100 to S400 .
步骤S100,获取低分辨率图像、倍数与调制查找表和超分辨率倍数参数。Step S100, obtaining a low-resolution image, a magnification and modulation lookup table, and a super-resolution magnification parameter.
步骤S200,通过编码器对所述低分辨率图像进行编码处理,得到低分辨率高维特征图。Step S200: encoding the low-resolution image through an encoder to obtain a low-resolution high-dimensional feature map.
步骤S300,将所述低分辨率高维特征图输入隐式多层感知机,得到每个特征向量的隐调制和压缩隐码。Step S300, inputting the low-resolution high-dimensional feature map into an implicit multi-layer perceptron to obtain an implicit modulation and compression code for each feature vector.
步骤S400,根据所述超分辨率倍数参数、所述压缩隐码、所述隐调制和所述倍数与调制查找表,采用目标渲染多层感知机对所述低分辨率图像进行渲染处理,得到超分辨率图像。Step S400, according to the super-resolution multiple parameter, the compression hidden code, the hidden modulation and the multiple and modulation lookup table, a target rendering multi-layer perceptron is used to render the low-resolution image to obtain a super-resolution image.
本申请实施例所示意的步骤S100至步骤S400,通过将从低分辨率高维特征图中得到的高维隐码压缩得到低维的压缩隐码,并且计算出隐调制,能够使得低维渲染多层感知机快速适应任何输入隐码,并且实现在任意分辨率下的高效渲染。In steps S100 to S400 shown in the embodiment of the present application, by compressing the high-dimensional hidden code obtained from the low-resolution high-dimensional feature map to obtain a low-dimensional compressed hidden code, and calculating the hidden modulation, the low-dimensional rendering multi-layer perceptron can quickly adapt to any input hidden code and achieve efficient rendering at any resolution.
在一些实施例的步骤S100中,低分辨率图像是和超分辨率倍数参数可以由输入获取,获取倍数与调制查找表的步骤可以包括但不限于包括步骤S110~S120:In step S100 of some embodiments, the low-resolution image and the super-resolution multiple parameter may be obtained by input, and the step of obtaining the multiple and modulation lookup table may include but is not limited to steps S110 to S120:
步骤S110,对常见的放大倍数进行采样,得到所述放大倍数的最小调制均值和最大调制均值;Step S110, sampling common amplification factors to obtain the minimum modulation mean value and the maximum modulation mean value of the amplification factors;
步骤S120,根据所述最小调制均值和所述最大调制均值生成倍数与调制查找表。Step S120: Generate a multiple and modulation lookup table according to the minimum modulation mean value and the maximum modulation mean value.
在一些实施例的步骤S200中,编码器可以是现有超分辨率方法中的任何特征提取器,连续图像表示的本质是解码函数,该函数可以任意地将低分辨率高维特征上采样为高分辨率低维像素值。In step S200 of some embodiments, the encoder may be any feature extractor in existing super-resolution methods, and the essence of continuous image representation is a decoding function that can arbitrarily upsample low-resolution high-dimensional features to high-resolution low-dimensional pixel values.
在一些实施例中,步骤S300可以包括但不限于包括以下步骤S310~S340:In some embodiments, step S300 may include but is not limited to the following steps S310 to S340:
步骤S310,对所述特征图进行特征向量提取处理,得到高维隐码;Step S310, extracting feature vectors from the feature graph to obtain a high-dimensional hidden code;
步骤S320,通过隐式多层感知机对所述高维隐码进行感知处理,得到第一处理结果;Step S320, performing perception processing on the high-dimensional latent code by using an implicit multi-layer perceptron to obtain a first processing result;
步骤S330,对所述第一处理结果进行通道方向分割处理,得到压缩隐码;Step S330, performing channel direction segmentation processing on the first processing result to obtain a compressed hidden code;
步骤S340,根据所述高维隐码生成隐调制。Step S340: generating hidden modulation according to the high-dimensional hidden code.
在一些实施例中,所述隐调制由若干个特征线性调制层组成;其中,每层所述特征线性调制层包括一个缩放调制和一个偏移调制。In some embodiments, the implicit modulation is composed of several characteristic linear modulation layers; wherein each characteristic linear modulation layer includes a scaling modulation and an offset modulation.
在一些实施例中,步骤S400包括但不限于包括以下步骤S410~S460:In some embodiments, step S400 includes but is not limited to the following steps S410 to S460:
步骤S410,初始化第一渲染倍数和第一渲染图像,并计算每个所述压缩隐码的偏移调制均值;Step S410, initializing a first rendering multiple and a first rendering image, and calculating an offset modulation mean value of each of the compression hidden codes;
步骤S420,根据所述倍数与调制查找表的下一个倍数,更新所述第一渲染倍数;Step S420, updating the first rendering multiple according to the multiple and the next multiple in the modulation lookup table;
步骤S430,采用双线性插值将所述第一渲染图像采样到与所述第一渲染倍数对应的分辨率;Step S430, sampling the first rendered image to a resolution corresponding to the first rendering multiple by using bilinear interpolation;
步骤S440,查询所述倍数与调制查找表,确定所述第一渲染倍数的调制均值范围;Step S440, querying the multiple and modulation lookup table to determine the modulation mean range of the first rendering multiple;
步骤S450,将所述偏移调制均值落入所述调制均值范围内的所述压缩隐码输入目标渲染多层感知机中进行渲染,得到所述第一渲染倍数的渲染像素值;Step S450, inputting the compressed hidden code whose offset modulation mean value falls within the modulation mean value range into a target rendering multi-layer perceptron for rendering, to obtain a rendering pixel value of the first rendering multiple;
步骤S460,返回执行根据所述倍数与调制查找表的下一个倍数,更新所述第一渲染倍数的步骤,直至所有的所述压缩隐码已被解码或者所述第一渲染倍数为最终需要的超分辨率倍数,得到目标超分辨率图像。Step S460, returns to execute the step of updating the first rendering multiple according to the multiple and the next multiple of the modulation lookup table, until all the compressed hidden codes have been decoded or the first rendering multiple is the final required super-resolution multiple, to obtain the target super-resolution image.
在一些实施例中,所述任意尺度图像的表示方法还包括以下步骤S500:In some embodiments, the method for representing an image of any scale further includes the following steps S500:
步骤S500,采用所述隐调制对初始渲染多层感知机进行调制,得到目标渲染多层感知机。Step S500, using the implicit modulation to modulate the initial rendering multi-layer perceptron to obtain a target rendering multi-layer perceptron.
在一些实施例中,所述目标渲染多层感知机在每一层进行渲染的表达式为:In some embodiments, the target rendering multi-layer perceptron performs rendering at each layer as follows:
其中,表示第k层隐藏特征;表示第k+1层隐藏特征;⊙表示通道方向点乘;σ表示激活函数;表示目标渲染多层感知机第k+1层的渲染处理。in, represents the hidden features of the kth layer; represents the hidden features of the k+1th layer; ⊙ represents the channel direction dot product; σ represents the activation function; Represents the rendering process of the k+1th layer of the target rendering multi-layer perceptron.
下面,结合具体图像渲染表示场景的应用例子,对本申请实施例的方案作详细介绍和说明:Below, the solution of the embodiment of the present application is described in detail and explained in conjunction with an application example of a specific image rendering representation scene:
本申请实施例中,提供任意尺度图像的表示方法,该方法可以应用于对任意尺度图像进行任意分辨率的渲染,在连续图像表示中,输入图像被表示为低分辨率高维的隐空间中的特征图。一个隐码代表从特征图中提取的一个特征向量,用于解码在连续空间中离其最近的所有坐标。由于编码器可以是现有超分辨率方法中的任何特征提取器,连续图像表示的本质是解码函数,该函数可以任意地将低分辨率高维特征上采样为高分辨率低维度像素值。一个简单直观的范式是在高分辨率高维空间中学习一个被参数化为MLP的解码函数,独立地解码每个高分辨率坐标的像素值。然而这种基本范式存在大量的计算冗余。基于此,本申请实施例提出一种任意尺度图像的表示方法,通过隐调制函数,将高分辨率高维解码过程分解为隐空间中的共享解码和渲染空间中的独立渲染,还可以进行适应输入图像内容的可控多尺度渲染,最终得到任意尺度的图像。In an embodiment of the present application, a representation method for an image of any scale is provided, which can be applied to rendering an image of any scale at any resolution. In the continuous image representation, the input image is represented as a feature map in a low-resolution high-dimensional latent space. A hidden code represents a feature vector extracted from the feature map, which is used to decode all coordinates closest to it in the continuous space. Since the encoder can be any feature extractor in the existing super-resolution method, the essence of the continuous image representation is a decoding function that can arbitrarily upsample low-resolution high-dimensional features to high-resolution low-dimensional pixel values. A simple and intuitive paradigm is to learn a decoding function parameterized as an MLP in a high-resolution high-dimensional space, and independently decode the pixel value of each high-resolution coordinate. However, this basic paradigm has a lot of computational redundancy. Based on this, an embodiment of the present application proposes a representation method for an image of any scale, which decomposes the high-resolution high-dimensional decoding process into shared decoding in the latent space and independent rendering in the rendering space through an implicit modulation function, and can also perform controllable multi-scale rendering adapted to the content of the input image, and finally obtain an image of any scale.
如图2所示,图2示例了本申请实施例的任意尺度图像表示方法的框架结构。编码器将输入图像映射为隐空间的特征图,作为其隐式表示,然后由一个高维MLP在隐空间上为这些隐码生成隐调制。每个隐调制会用作一个低维MLP参数的一部分,大幅提高低维MLP在渲染对应区域的高分辨率像素值时的精度。给定输出图像的分辨率,本申请实施例进一步使用提出的可控多尺度渲染方法,使用隐调制的均值计算出在给定渲染精度下每个隐码实际需要的渲染倍数,从而使得计算成本降低至输入图像的内容实际需要的量级,并实现计算效率和渲染精度的灵活调整。在训练阶段,可以使用预测像素值和真实图像的像素值计算像素级损失进行优化。编码器和解码函数在自监督超分辨率任务中联合训练,将学习到的网络参数对所有图像共享。As shown in Figure 2, Figure 2 illustrates the framework structure of the arbitrary scale image representation method of the embodiment of the present application. The encoder maps the input image to a feature map of the latent space as its implicit representation, and then a high-dimensional MLP generates hidden modulations for these hidden codes in the latent space. Each hidden modulation will be used as part of a low-dimensional MLP parameter, greatly improving the accuracy of the low-dimensional MLP in rendering high-resolution pixel values in the corresponding area. Given the resolution of the output image, the embodiment of the present application further uses the proposed controllable multi-scale rendering method, and uses the mean of the hidden modulation to calculate the actual rendering multiple required for each hidden code under a given rendering accuracy, thereby reducing the computational cost to the actual level required by the content of the input image, and realizing flexible adjustment of computational efficiency and rendering accuracy. In the training stage, the pixel-level loss can be calculated using the predicted pixel values and the pixel values of the real image for optimization. The encoder and decoding functions are jointly trained in the self-supervised super-resolution task, and the learned network parameters are shared for all images.
下面介绍本申请实施例提供的隐调制函数:The implicit modulation function provided by the embodiment of the present application is introduced below:
为了消除高分辨率高维解码中的计算冗余,首先提供了一个包含隐式MLP(θl)和渲染MLP(θr)的两阶段解码过程。隐式MLP处理每个隐码z*对应的高分辨率区域内部的共享解码过程,而渲染MLP独立预测每个高分辨率坐标的像素值。来自隐式MLP的低分辨率高维输出则被用来辅助渲染MLP进行渲染,其渲染表达式为:In order to eliminate computational redundancy in high-resolution high-dimensional decoding, a two-stage decoding process consisting of an implicit MLP (θ l ) and a rendering MLP (θ r ) is first provided. The implicit MLP handles the shared decoding process inside the high-resolution region corresponding to each hidden code z * , while the rendering MLP independently predicts each high-resolution coordinate The low-resolution, high-dimensional output from the implicit MLP is used to assist the rendering MLP in rendering, and its rendering expression is:
公式(1)中,表示坐标的渲染像素值,表示隐式MLP处理,表示渲染MLP处理。In formula (1), Representing coordinates The rendered pixel value, represents implicit MLP processing, Represents the rendering MLP process.
然而,如果直接将隐式MLP的输出作为渲染MLP的输入类似于直接使用隐码本身作为输入,并不会带来计算效率的提升。此外,为了精确渲染任意分辨率的像素,渲染MLP需要在高分辨率空间中进行计算。因此,最可行的来最小化渲染计算量的方法是减少渲染MLP的维度或深度。然而,直接将隐式MLP的输出输入到渲染MLP意味着需要一个高维的渲染MLP来完全接收和处理输入,但本申请使用低维渲染MLP以减少复杂度。However, if the output of the implicit MLP is directly used as the input of the rendering MLP, it is similar to directly using the hidden code itself as input, and it will not bring about an improvement in computational efficiency. In addition, in order to accurately render pixels of arbitrary resolution, the rendering MLP needs to be calculated in a high-resolution space. Therefore, the most feasible way to minimize the amount of rendering calculations is to reduce the dimension or depth of the rendering MLP. However, directly inputting the output of the implicit MLP into the rendering MLP means that a high-dimensional rendering MLP is required to fully receive and process the input, but this application uses a low-dimensional rendering MLP to reduce complexity.
为了克服这些限制,本申请实施例采用调制来连接隐空间和渲染空间。目前的调制始终由输入图像共享,并在图像级别工作,导致了欠佳的性能和困难的训练。因此,本申请提出了一个由隐式MLP使用隐码z*生成的隐调制m*。隐调制由K个FiLM层(特征线性调制层)组成,其中每层都包括一个缩放调制和一个偏移调制。隐调制的表达式为:To overcome these limitations, the embodiments of the present application use modulation to connect the latent space and the rendering space. The current modulation is always shared by the input image and works at the image level, resulting in poor performance and difficult training. Therefore, the present application proposes an implicit modulation m * generated by an implicit MLP using a hidden code z * . The implicit modulation consists of K FiLM layers (feature linear modulation layers), each of which includes a scaling modulation and an offset modulation. The expression of the implicit modulation is:
公式(2)中,表示第一层的缩放调制,表示第一层的偏移调制,表示最后一层的缩放调制,表示最后一层的偏移调制。In formula (2), represents the scaling modulation of the first layer, represents the offset modulation of the first layer, represents the scaling modulation of the last layer, represents the offset modulation of the last layer.
隐调制m*仅在渲染最接近其隐码z*的高分辨率坐标时对渲染MLP进行调制,并且在每个隐藏线性层之后应用于隐藏特征上:The latent modulation m * modulates the rendering MLP only when rendering the high-resolution coordinates closest to its latent code z * , and is applied to the hidden features after each hidden linear layer:
公式(3)中,表示第k层的隐藏特征,表示第k+1层的隐藏特征,⊙表示通道方向点乘,σ表示激活函数,表示渲染MLP第k+1层的处理。In formula (3), represents the hidden features of the kth layer, represents the hidden features of the k+1th layer, ⊙ represents the channel direction dot product, σ represents the activation function, Represents the process of rendering the k+1th layer of the MLP.
重要的是,隐调制有效地解决了低分辨率高维隐码与高分辨率低维渲染之间的不兼容性,并显著地从参数中解耦了计算。例如,对于具有192维的潜在调制,本申请实施例可以使用仅具有16维的7层渲染MLP准确地渲染图像。这个极小的渲染MLP只学习自然图像的一般渲染规则,而隐调制负责根据每个输入隐码的特征优化渲染MLP的渲染过程。因此,隐调制使低维渲染MLP能够快速适应任何输入隐码,并实现在任意分辨率下的高效渲染。然而,一个剩余的问题是,现有基于INR的任意尺度超分辨率方法中的隐码通常具有高维度,在直接输入到渲染MLP时会导致不必要的计算。因此,本申请进一步使用隐式MLP将高维隐码z*压缩成低维隐码总的来说,基于隐调制,本申请提出了连续图像表示的隐调制函数:Importantly, implicit modulation effectively resolves the incompatibility between low-resolution high-dimensional hidden codes and high-resolution low-dimensional rendering, and significantly decouples computation from parameters. For example, for a potential modulation with 192 dimensions, an embodiment of the present application can accurately render an image using a 7-layer rendering MLP with only 16 dimensions. This extremely small rendering MLP only learns general rendering rules for natural images, while the implicit modulation is responsible for optimizing the rendering process of the rendering MLP based on the characteristics of each input hidden code. Therefore, implicit modulation enables the low-dimensional rendering MLP to quickly adapt to any input hidden code and achieve efficient rendering at arbitrary resolution. However, a remaining problem is that the hidden codes in existing INR-based arbitrary-scale super-resolution methods are typically high-dimensional, which can result in unnecessary computation when directly input into the rendering MLP. Therefore, the present application further uses an implicit MLP to compress the high-dimensional hidden code z * into a low-dimensional hidden code In general, based on implicit modulation, this application proposes an implicit modulation function for continuous image representation:
公式(4)中,Split表示通道方向分割,表示由隐调制m*调制的渲染MLP。In formula (4), Split represents channel direction segmentation. Represents the rendered MLP modulated by the implicit modulation m * .
本申请提出的LMF是高效任意尺度超分辨率的通用方法,与现有基于INR的方法完全兼容。通过轻松地将耗时的操作和大型MLP转移到隐空间,LMF允许一个最小的渲染MLP执行与原始任意尺度超分辨率方法相同级别的超分辨率。此外,LMF还有有效地解耦了特征展开和局部集成等计算昂贵的技术,具有积极的有益效果。The LMF proposed in this application is a general method for efficient arbitrary-scale super-resolution, which is fully compatible with existing INR-based methods. By easily transferring time-consuming operations and large MLPs to the latent space, LMF allows a minimal rendering MLP to perform the same level of super-resolution as the original arbitrary-scale super-resolution method. In addition, LMF also effectively decouples computationally expensive techniques such as feature expansion and local integration, which has positive beneficial effects.
本申请的隐调制函数的处理流程如下:The processing flow of the implicit modulation function of this application is as follows:
1、输入:低分辨率图像经过编码器编码后的特征图,以及超分辨率倍数参数。1. Input: feature map of the low-resolution image after being encoded by the encoder, and super-resolution multiple parameters.
2、隐解码:将低分辨率特征图输入隐式MLP,得到每个特征向量的隐调制和压缩隐码。2. Hidden decoding: Input the low-resolution feature map into the implicit MLP to obtain the implicit modulation and compression hidden code of each feature vector.
3、生成高分辨率坐标:根据超分倍数,生成高分辨率坐标网格。3. Generate high-resolution coordinates: Generate a high-resolution coordinate grid based on the super-resolution multiples.
4、渲染图像:依次将每个高分辨率坐标和其压缩隐码输入到由隐调制进行调制的渲染MLP,计算得到该坐标的像素值。4. Rendering the image: Each high-resolution coordinate and its compressed hidden code are input into the rendering MLP modulated by hidden modulation in turn to calculate the pixel value of the coordinate.
5、输出:超分辨率图像。5. Output: super-resolution image.
下面介绍本申请实施例的可控多尺度渲染方法:The controllable multi-scale rendering method of the embodiment of the present application is introduced below:
两个隐码的高分辨率区域通常具有不同的信号复杂性,允许使用不同的解码函数。然而,挑战在于获取每个隐码的信号复杂性。公式(3)中的调制策略表明,隐调制的强度与其隐码的信号复杂性成正比。较低的强度意味着信号可以被渲染MLP本身充分渲染。图4展示了偏移调制的归一化均值。图5展示了基于LMF的LIIF的超分辨率结果与双线性上采样结果的归一化残差,很明显,调制的均值与预测图像的信号复杂性密切相关。调制均值较高的区域始终对应于更复杂的纹理和边缘。基于此发现,本申请实施例使用偏移调制的均值指示隐调制的强度。The high-resolution regions of the two hidden codes typically have different signal complexities, allowing different decoding functions to be used. However, the challenge lies in obtaining the signal complexity of each hidden code. The modulation strategy in formula (3) shows that the strength of the hidden modulation is proportional to the signal complexity of its hidden code. A lower intensity means that the signal can be adequately rendered by the rendering MLP itself. Figure 4 shows the normalized mean of the offset modulation. Figure 5 shows the normalized residual of the super-resolution result of the LMF-based LIIF and the bilinear upsampling result. It is obvious that the mean of the modulation is closely related to the signal complexity of the predicted image. Regions with higher modulation means always correspond to more complex textures and edges. Based on this finding, the embodiment of the present application uses the mean of the offset modulation to indicate the strength of the hidden modulation.
在LMF中,隐码z*的信号复杂性定义为完全表示其信号所需的最小放大倍数s*。简单来说,任何放大倍数为s,s>s*的渲染将产生与使用s*倍数进行渲染后再用倍插值(例如双线性差值)相同的结果。因此,当对图像进行s倍的上采样时,只需要用min(s,s*)倍渲染隐码z*。In LMF, the signal complexity of a hidden code z * is defined as the minimum magnification s * required to fully represent its signal. In simple terms, any rendering with a magnification s, s>s * will produce the same signal complexity as rendering with a magnification of s * and then using Therefore, when upsampling the image by a factor of s, it is only necessary to render the hidden z * by min(s,s * ) times.
为了获取每个调制均值对应的最小渲染倍数,本申请实施例通过采样常见的放大倍数并记录它们的最小和最大调制均值来创建一个倍数-调制查找表。基于倍数-调制查找表,本申请提出了一种可控多尺度渲染(Controllable Multi-Scale Rendering,CMSR)方法来解码具有最小渲染倍数的隐码。对于每个采样的倍数si∈[s1,s2,…,s],首先使用双线性插值将si-1的结果上采样到与si对应的分辨率;然后查询倍数-调制查找表以获取si的调制均值范围对于si,渲染MLP只渲染调制均值落在内的隐码。重复上述过程,直到获得最终的超分辨率结果或所有隐码已被解码。对于精度可控渲染,当创建倍数-调制查找表时,设置一个MSE阈值t并定义隐码的信号复杂性为渲染其信号且在此误差t内所需的最小放大倍数。通过在测试过程中轻松地调整MSE阈值,CMSR能够以不同的精确度和速度进行渲染,同时自然适应图像的内容。In order to obtain the minimum rendering multiple corresponding to each modulation mean, the embodiment of the present application creates a multiple-modulation lookup table by sampling common magnifications and recording their minimum and maximum modulation means. Based on the multiple-modulation lookup table, the present application proposes a controllable multi-scale rendering (CMSR) method to decode hidden codes with minimum rendering multiples. For each sampled multiple s i ∈ [s 1 ,s 2 ,…,s], first use bilinear interpolation to upsample the result of s i-1 to a resolution corresponding to s i ; then query the multiple-modulation lookup table to obtain the modulation mean range of s i For s i , the rendering MLP only renders the modulation mean falling within The above process is repeated until the final super-resolution result is obtained or all hidden codes have been decoded. For controllable precision rendering, when creating the multiplication-modulation lookup table, an MSE threshold t is set and the signal complexity of the hidden code is defined as the minimum magnification required to render its signal within this error t. By easily adjusting the MSE threshold during testing, CMSR can render with different accuracy and speed while naturally adapting to the content of the image.
本申请的可控多尺度渲染的处理流程如下:The processing flow of controllable multi-scale rendering of this application is as follows:
1、输入:低分辨率图像,压缩隐码,隐调制,倍数-调制查找表,以及超分辨率倍数参数。1. Input: low-resolution image, compression code, hidden modulation, multiplier-modulation lookup table, and super-resolution multiplier parameters.
2、初始化:初始渲染倍数为1,初始渲染图像为低分辨率图像。计算每个隐调制的偏移调制均值。2. Initialization: The initial rendering factor is 1, and the initial rendering image is a low-resolution image. Calculate the offset modulation mean of each implicit modulation.
3、更新渲染倍数:渲染倍数更新为倍数-调制查找表的下一个倍数。3. Update rendering multiple: The rendering multiple is updated to multiple - the next multiple of the modulation lookup table.
4、插值上采样:对于当前渲染倍数,首先使用双线性插值将上一倍数的图像上采样到与当前倍数对应的分辨率。4. Interpolation upsampling: For the current rendering multiple, first use bilinear interpolation to upsample the image of the previous multiple to the resolution corresponding to the current multiple.
5、查表:查询倍数-调制查找表以获取其调制均值范围。5. Lookup table: Query the multiple-modulation lookup table to obtain its modulation mean range.
6、渲染:对于当前倍数,将调制均值落在查询范围内的隐码输入到渲染MLP进行渲染,得到当前倍数的渲染像素值。6. Rendering: For the current multiple, the hidden code whose modulation mean falls within the query range is input into the rendering MLP for rendering to obtain the rendered pixel value of the current multiple.
7、迭代:重复(2)-(5)的过程,直到获得最终的超分辨率结果或所有隐码已被解码。7. Iteration: Repeat the process of (2)-(5) until the final super-resolution result is obtained or all hidden codes have been decoded.
8、输出:超分辨率图像。8. Output: super-resolution image.
基于上述的隐调制函数和可控多尺度渲染,参照图5,本申请实施例的其中一种任意尺度图像的表示方法的处理流程为:Based on the above implicit modulation function and controllable multi-scale rendering, referring to FIG. 5 , the processing flow of a method for representing an image of any scale in an embodiment of the present application is as follows:
1、输入:低分辨率图像,倍数-调制查找表,以及超分辨率倍数参数。1. Input: low-resolution image, multiplication-modulation lookup table, and super-resolution multiplication parameters.
2、编码:将低分辨率图像输入编码器,得到低分辨率高维度特征图。2. Encoding: Input the low-resolution image into the encoder to obtain a low-resolution high-dimensional feature map.
3、隐解码:将低分辨率高维度特征图输入隐式MLP,得到每个特征向量的隐调制和压缩隐码。3. Hidden decoding: Input the low-resolution high-dimensional feature map into the implicit MLP to obtain the implicit modulation and compression hidden code of each feature vector.
4、可控多尺度渲染:4. Controllable multi-scale rendering:
(1)初始化:初始渲染倍数为1,初始渲染图像为低分辨率图像。计算每个隐调制的偏移调制均值。(1) Initialization: The initial rendering factor is 1, and the initial rendered image is a low-resolution image. Calculate the offset modulation mean of each implicit modulation.
(2)更新渲染倍数:渲染倍数更新为倍数-调制查找表的下一个倍数。(2) Update the rendering multiple: The rendering multiple is updated to multiple - the next multiple in the modulation lookup table.
(3)插值上采样:对于当前渲染倍数,首先使用双线性插值将上一倍数的图像上采样到与当前倍数对应的分辨率。(3) Interpolation upsampling: For the current rendering multiple, bilinear interpolation is first used to upsample the image of the previous multiple to the resolution corresponding to the current multiple.
(4)查表:查询倍数-调制查找表以获取其调制均值范围。(4) Lookup: Look up the multiplier-modulation lookup table to obtain its modulation mean range.
(5)渲染:对于当前倍数,将调制均值落在查询范围内的隐码输入到渲染MLP进行渲染,得到当前倍数的渲染像素值。(5) Rendering: For the current multiple, the hidden code whose modulation mean falls within the query range is input into the rendering MLP for rendering to obtain the rendered pixel value of the current multiple.
(6)迭代:重复(3)-(6)的过程,直到获得最终的超分辨率结果(即当前渲染倍数就是最终的超分辨率倍数结果需要的倍数时)或所有隐码已被解码。(6) Iteration: Repeat the process of (3)-(6) until the final super-resolution result is obtained (that is, the current rendering magnification is the magnification required by the final super-resolution result) or all hidden codes have been decoded.
5、输出:超分辨率图像。5. Output: super-resolution image.
综上所述,本申请实施例至少具有如下有益效果:In summary, the embodiments of the present application have at least the following beneficial effects:
本申请提出了一种新颖的基于隐调制函数的任意尺度图像表示方法及系统,以实现连续图像表示的最优计算。通过利用隐调制,LMF成功地将解码过程从高分辨率高维空间解耦到隐空间(低分辨率高维)和渲染空间(高分辨率低维)。LMF首先使用隐式MLP生成给定隐码的隐调制。在查询隐码附近的坐标时,隐调制被应用于渲染MLP中的每个隐藏线性层。这种调制使LMF能够在不牺牲性能的情况下最小化渲染MLP的维度和参数。此外,本申请基于隐调制强度与输入图像复杂性之间存在正相关提供了一种可控多尺度渲染算法。CMSR不仅允许渲染的计算成本与输入图像的复杂性而不是输出分辨率成正比,而且还提供了在测试期间平衡精度和效率的灵活性。结合隐调制函数和可控多尺度渲染,本申请可以计算最优且快速的生成任意尺度图像超分辨率表示。The present application proposes a novel arbitrary-scale image representation method and system based on implicit modulation function to achieve optimal calculation of continuous image representation. By utilizing implicit modulation, LMF successfully decouples the decoding process from high-resolution high-dimensional space to latent space (low-resolution high-dimensional) and rendering space (high-resolution low-dimensional). LMF first uses implicit MLP to generate implicit modulation for a given hidden code. When querying coordinates near the hidden code, implicit modulation is applied to each hidden linear layer in the rendering MLP. This modulation enables LMF to minimize the dimensions and parameters of the rendering MLP without sacrificing performance. In addition, the present application provides a controllable multi-scale rendering algorithm based on the positive correlation between the implicit modulation intensity and the complexity of the input image. CMSR not only allows the computational cost of rendering to be proportional to the complexity of the input image rather than the output resolution, but also provides the flexibility to balance accuracy and efficiency during testing. Combined with implicit modulation functions and controllable multi-scale rendering, the present application can calculate the optimal and fast generation of arbitrary-scale image super-resolution representations.
实验表明,将现有基于INR的任意尺度超分辨率方法转换为基于LMF的方法可将计算成本降低90.4%至99.9%,将推理速度提高2.4倍至56.9倍,并节省45.1%至76.0%的参数,同时保持具有竞争力的PSNR性能。Experiments show that converting existing INR-based arbitrary-scale super-resolution methods to LMF-based methods can reduce the computational cost by 90.4% to 99.9%, increase the inference speed by 2.4 times to 56.9 times, and save 45.1% to 76.0% of parameters, while maintaining competitive PSNR performance.
请参阅图6,本申请实施例还提供任意尺度图像的表示系统100,可以实现上述任意尺度图像的表示方法,该系统包括:Referring to FIG. 6 , the embodiment of the present application further provides a system 100 for representing an image of any scale, which can implement the above-mentioned method for representing an image of any scale. The system includes:
第一模块101,用于获取低分辨率图像、倍数与调制查找表和超分辨率倍数参数;The first module 101 is used to obtain a low-resolution image, a magnification and modulation lookup table, and a super-resolution magnification parameter;
第二模块102,用于通过编码器对所述低分辨率图像进行编码处理,得到低分辨率高维特征图;The second module 102 is used to encode the low-resolution image through an encoder to obtain a low-resolution high-dimensional feature map;
第三模块103,用于将所述低分辨率高维特征图输入隐式多层感知机,得到每个特征向量的隐调制和压缩隐码;The third module 103 is used to input the low-resolution high-dimensional feature map into an implicit multi-layer perceptron to obtain an implicit modulation and compression code of each feature vector;
第四模块104,用于根据所述超分辨率倍数参数、所述压缩隐码、所述隐调制和所述倍数与调制查找表,采用目标渲染多层感知机对所述低分辨率图像进行渲染处理,得到超分辨率图像。The fourth module 104 is used to render the low-resolution image using a target rendering multi-layer perceptron according to the super-resolution multiple parameter, the compression hidden code, the hidden modulation and the multiple and modulation lookup table to obtain a super-resolution image.
可以理解的是,上述方法实施例中的内容均适用于本系统实施例中,本系统实施例所具体实现的功能与上述方法实施例相同,并且达到的有益效果与上述方法实施例所达到的有益效果也相同。It can be understood that the contents of the above method embodiments are all applicable to the present system embodiments, the functions specifically implemented by the present system embodiments are the same as those of the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.
本申请实施例还提供了一种电子设备,电子设备包括存储器和处理器,存储器存储有计算机程序,处理器执行计算机程序时实现上述任意尺度图像的表示方法。该电子设备可以为包括平板电脑、车载电脑等任意智能终端。The embodiment of the present application also provides an electronic device, the electronic device includes a memory and a processor, the memory stores a computer program, and the processor implements the above-mentioned method for representing an image of any scale when executing the computer program. The electronic device can be any intelligent terminal including a tablet computer, a car computer, etc.
可以理解的是,上述方法实施例中的内容均适用于本设备实施例中,本设备实施例所具体实现的功能与上述方法实施例相同,并且达到的有益效果与上述方法实施例所达到的有益效果也相同。It can be understood that the contents of the above method embodiments are all applicable to the present device embodiments, the functions specifically implemented by the present device embodiments are the same as those of the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.
请参阅图7,图7示意了另一实施例的电子设备的硬件结构,电子设备包括:Please refer to FIG. 7 , which schematically shows the hardware structure of an electronic device according to another embodiment. The electronic device includes:
处理器201,可以采用通用的CPU(Central Processing Unit,中央处理器)、微处理器、应用专用集成电路(Application Specific Integrated Circuit,ASIC)、或者一个或多个集成电路等方式实现,用于执行相关程序,以实现本申请实施例所提供的技术方案;The processor 201 may be implemented by a general-purpose CPU (Central Processing Unit), a microprocessor, an application-specific integrated circuit (ASIC), or one or more integrated circuits, and is used to execute relevant programs to implement the technical solutions provided in the embodiments of the present application;
存储器202,可以采用只读存储器(Read Only Memory,ROM)、静态存储设备、动态存储设备或者随机存取存储器(Random Access Memory,RAM)等形式实现。存储器202可以存储操作系统和其他应用程序,在通过软件或者固件来实现本说明书实施例所提供的技术方案时,相关的程序代码保存在存储器202中,并由处理器201来调用执行本申请实施例的任意尺度图像的表示方法;The memory 202 can be implemented in the form of a read-only memory (ROM), a static storage device, a dynamic storage device, or a random access memory (RAM). The memory 202 can store an operating system and other application programs. When the technical solution provided in the embodiment of this specification is implemented by software or firmware, the relevant program code is stored in the memory 202, and the processor 201 calls and executes the method for representing an image of any scale in the embodiment of this application;
输入/输出接口203,用于实现信息输入及输出;Input/output interface 203, used to implement information input and output;
通信接口204,用于实现本设备与其他设备的通信交互,可以通过有线方式(例如USB、网线等)实现通信,也可以通过无线方式(例如移动网络、WIFI、蓝牙等)实现通信;The communication interface 204 is used to realize the communication interaction between the device and other devices. The communication can be realized through a wired manner (such as USB, network cable, etc.) or a wireless manner (such as mobile network, WIFI, Bluetooth, etc.);
总线205,在设备的各个组件(例如处理器201、存储器202、输入/输出接口203和通信接口204)之间传输信息;Bus 205 , which transmits information between various components of the device (e.g., processor 201 , memory 202 , input/output interface 203 , and communication interface 204 );
其中处理器201、存储器202、输入/输出接口203和通信接口204通过总线205实现彼此之间在设备内部的通信连接。The processor 201 , the memory 202 , the input/output interface 203 and the communication interface 204 are connected to each other in communication within the device via the bus 205 .
本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质存储有计算机程序,该计算机程序被处理器执行时实现上述任意尺度图像的表示方法。An embodiment of the present application further provides a computer-readable storage medium, which stores a computer program. When the computer program is executed by a processor, the above-mentioned method for representing an image of any scale is implemented.
可以理解的是,上述方法实施例中的内容均适用于本存储介质实施例中,本存储介质实施例所具体实现的功能与上述方法实施例相同,并且达到的有益效果与上述方法实施例所达到的有益效果也相同。It can be understood that the contents of the above method embodiments are all applicable to the present storage medium embodiments, the functions specifically implemented by the present storage medium embodiments are the same as those of the above method embodiments, and the beneficial effects achieved are also the same as those achieved by the above method embodiments.
存储器作为一种非暂态计算机可读存储介质,可用于存储非暂态软件程序以及非暂态性计算机可执行程序。此外,存储器可以包括高速随机存取存储器,还可以包括非暂态存储器,例如至少一个磁盘存储器件、闪存器件、或其他非暂态固态存储器件。在一些实施方式中,存储器可选包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络连接至该处理器。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory, as a non-transient computer-readable storage medium, can be used to store non-transient software programs and non-transient computer executable programs. In addition, the memory may include a high-speed random access memory, and may also include a non-transient memory, such as at least one disk storage device, a flash memory device, or other non-transient solid-state storage device. In some embodiments, the memory may optionally include a memory remotely disposed relative to the processor, and these remote memories may be connected to the processor via a network. Examples of the above-mentioned network include, but are not limited to, the Internet, an intranet, a local area network, a mobile communication network, and combinations thereof.
本申请实施例描述的实施例是为了更加清楚的说明本申请实施例的技术方案,并不构成对于本申请实施例提供的技术方案的限定,本领域技术人员可知,随着技术的演变和新应用场景的出现,本申请实施例提供的技术方案对于类似的技术问题,同样适用。The embodiments described in the embodiments of the present application are intended to more clearly illustrate the technical solutions of the embodiments of the present application and do not constitute a limitation on the technical solutions provided in the embodiments of the present application. Those skilled in the art will appreciate that with the evolution of technology and the emergence of new application scenarios, the technical solutions provided in the embodiments of the present application are also applicable to similar technical problems.
本领域技术人员可以理解的是,图中示出的技术方案并不构成对本申请实施例的限定,可以包括比图示更多或更少的步骤,或者组合某些步骤,或者不同的步骤。Those skilled in the art will appreciate that the technical solutions shown in the figures do not constitute a limitation on the embodiments of the present application, and may include more or fewer steps than shown in the figures, or a combination of certain steps, or different steps.
以上所描述的系统实施例仅仅是示意性的,其中作为分离部件说明的单元可以是或者也可以不是物理上分开的,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The system embodiments described above are merely illustrative, and the units described as separate components may or may not be physically separated, that is, they may be located in one place or distributed on multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the present embodiment.
本领域普通技术人员可以理解,上文中所公开方法中的全部或某些步骤、系统、设备中的功能模块/单元可以被实施为软件、固件、硬件及其适当的组合。Those skilled in the art will appreciate that all or some of the steps in the methods disclosed above, and the functional modules/units in the systems and devices may be implemented as software, firmware, hardware, or a suitable combination thereof.
本申请的说明书及上述附图中的术语“第一”、“第二”、“第三”、“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本申请的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", "third", "fourth", etc. (if any) in the specification of the present application and the above-mentioned drawings are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence. It should be understood that the data used in this way can be interchangeable where appropriate, so that the embodiments of the present application described herein can be implemented in an order other than those illustrated or described herein. In addition, the terms "including" and "having" and any of their variations are intended to cover non-exclusive inclusions, for example, a process, method, system, product or device comprising a series of steps or units is not necessarily limited to those steps or units clearly listed, but may include other steps or units that are not clearly listed or inherent to these processes, methods, products or devices.
应当理解,在本申请中,“至少一个(项)”是指一个或者多个,“多个”是指两个或两个以上。“和/或”,用于描述关联对象的关联关系,表示可以存在三种关系,例如,“A和/或B”可以表示:只存在A,只存在B以及同时存在A和B三种情况,其中A,B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。“以下至少一项(个)”或其类似表达,是指这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b或c中的至少一项(个),可以表示:a,b,c,“a和b”,“a和c”,“b和c”,或“a和b和c”,其中a,b,c可以是单个,也可以是多个。It should be understood that in the present application, "at least one (item)" means one or more, and "plurality" means two or more. "And/or" is used to describe the association relationship of associated objects, indicating that three relationships may exist. For example, "A and/or B" can mean: only A exists, only B exists, and A and B exist at the same time, where A and B can be singular or plural. The character "/" generally indicates that the objects associated before and after are in an "or" relationship. "At least one of the following" or similar expressions refers to any combination of these items, including any combination of single or plural items. For example, at least one of a, b or c can mean: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", where a, b, c can be single or multiple.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的系统实施例仅仅是示意性的,例如,上述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in the present application, it should be understood that the disclosed devices and methods can be implemented in other ways. For example, the system embodiments described above are only schematic. For example, the division of the above units is only a logical function division. There may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be an indirect coupling or communication connection through some interfaces, devices or units, which can be electrical, mechanical or other forms.
上述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described above as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above-mentioned integrated unit may be implemented in the form of hardware or in the form of software functional units.
集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括多指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例的方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,简称ROM)、随机存取存储器(Random Access Memory,简称RAM)、磁碟或者光盘等各种可以存储程序的介质。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application is essentially or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium, including multiple instructions to enable a computer device (which can be a personal computer, server, or network device, etc.) to execute all or part of the steps of the methods of various embodiments of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM), disk or optical disk and other media that can store programs.
以上参照附图说明了本申请实施例的优选实施例,并非因此局限本申请实施例的权利范围。本领域技术人员不脱离本申请实施例的范围和实质内所作的任何修改、等同替换和改进,均应在本申请实施例的权利范围之内。The preferred embodiments of the present application are described above with reference to the accompanying drawings, but the scope of the rights of the present application is not limited thereto. Any modification, equivalent substitution and improvement made by a person skilled in the art without departing from the scope and essence of the present application should be within the scope of the rights of the present application.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410483340.8A CN118365523A (en) | 2024-04-22 | 2024-04-22 | Method, system, electronic device and storage medium for representing images of any scale |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202410483340.8A CN118365523A (en) | 2024-04-22 | 2024-04-22 | Method, system, electronic device and storage medium for representing images of any scale |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN118365523A true CN118365523A (en) | 2024-07-19 |
Family
ID=91887221
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202410483340.8A Pending CN118365523A (en) | 2024-04-22 | 2024-04-22 | Method, system, electronic device and storage medium for representing images of any scale |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN118365523A (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119313621A (en) * | 2024-09-26 | 2025-01-14 | 杭州电子科技大学 | Robotic loop closure detection method based on multi-task low-dimensional latent code and neural radiance field |
| CN119648526A (en) * | 2024-11-14 | 2025-03-18 | 华东交通大学 | A microscopic electron microscope image processing method and system based on multi-scale zooming |
-
2024
- 2024-04-22 CN CN202410483340.8A patent/CN118365523A/en active Pending
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN119313621A (en) * | 2024-09-26 | 2025-01-14 | 杭州电子科技大学 | Robotic loop closure detection method based on multi-task low-dimensional latent code and neural radiance field |
| CN119648526A (en) * | 2024-11-14 | 2025-03-18 | 华东交通大学 | A microscopic electron microscope image processing method and system based on multi-scale zooming |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11501415B2 (en) | Method and system for high-resolution image inpainting | |
| EP3937124B1 (en) | Image processing method, device and apparatus, and storage medium | |
| WO2020019761A1 (en) | Monocular image depth estimation method and apparatus, device, program and storage medium | |
| CN114612289A (en) | Stylized image generation method and device and image processing equipment | |
| CN118365523A (en) | Method, system, electronic device and storage medium for representing images of any scale | |
| CN118570054B (en) | Training method, related device and medium for image generation model | |
| CN112927200B (en) | Intrinsic image decomposition method and device, readable storage medium and electronic equipment | |
| CN117975484B (en) | Training method of change detection model, change detection method, device and equipment | |
| CN114978189A (en) | A data encoding method and related equipment | |
| CN117894038A (en) | Method and device for generating object gesture in image | |
| CN117975211A (en) | Image processing method and device based on multi-mode information | |
| US20190340728A1 (en) | Superpixel Sampling Networks | |
| CN115049717A (en) | Depth estimation method and device | |
| CN115731446B (en) | Image processing method, system, device and computer-readable storage medium | |
| CN115861091B (en) | Image inpainting method based on multi-fusion attention module and deep image prior | |
| CN118691818A (en) | Image segmentation method, device, electronic device and storage medium | |
| CN114821068B (en) | Panorama segmentation and depth determination processing method, device, equipment and medium | |
| CN116129228A (en) | Training method of image matching model, image matching method and device thereof | |
| CN119131434B (en) | Feature matching method, device, equipment and medium | |
| CN111860764A (en) | Visual task processing method and model training method, device and electronic system | |
| CN119496914B (en) | Decompressed point cloud quality enhancement method based on attention fusion of multi-scale features | |
| CN116310660B (en) | Enhanced sample generation method and device | |
| US20250069321A1 (en) | Reconstructive latent-space neural radiance fields (rels-nerf) for 3d scene representations | |
| CN118381936B (en) | Video data processing method, device, equipment and storage medium | |
| Kim et al. | Image enhancement with intensity transformation on embedding space |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |