TWI659396B

TWI659396B - Method and image processing device for image reconstruction in multimodal noise suppression

Info

Publication number: TWI659396B
Application number: TW106129837A
Authority: TW
Inventors: Bo-Hao Chen; 陳柏豪; Jia-li YIN; 印佳麗
Original assignee: Yuan Ze University; 元智大學
Priority date: 2017-08-31
Filing date: 2017-08-31
Publication date: 2019-05-11
Also published as: TW201913563A

Abstract

一種用以抑制多模雜訊的影像重建方法及其影像處理裝置，此方法包括下列步驟。首先，接收包括多個畫素的輸入影像，此些畫素包括多個損壞畫素以及多個乾淨畫素。接著，將輸入影像輸入至影像重建模型，以取得重建影像，其中影像重建模型關聯於各個畫素的全域環境、區域環境、相關環境以及三個係數，全域環境關聯於所有乾淨畫素，區域環境關聯於各個畫素的鄰近畫素所構成的區塊，相關環境關聯於各個畫素的相似畫素所構成的群聚，此些係數分別關聯於全域環境、區域環境以及相關環境並且是於影像重建模型的訓練階段時以半監督式學習法所取得。An image reconstruction method and an image processing device for suppressing multi-mode noise. The method includes the following steps. First, an input image including a plurality of pixels is received, and the pixels include a plurality of damaged pixels and a plurality of clean pixels. The input image is then input to an image reconstruction model to obtain a reconstructed image. The image reconstruction model is associated with the global environment, regional environment, related environment, and three coefficients of each pixel. The global environment is associated with all clean pixels and regional environment. Blocks made up of neighboring pixels that are related to each pixel. The related environment is related to the clusters of similar pixels of each pixel. These coefficients are related to the global environment, regional environment, and related environment and are related to the image. The training phase of the reconstructed model is obtained by a semi-supervised learning method.

Description

Image reconstruction method and image processing device for suppressing multi-mode noise

本發明是有關於一種影像重建方法及其影像處理裝置，且特別是有關於一種用以抑制多模雜訊的影像重建方法及其影像處理裝置。The invention relates to an image reconstruction method and an image processing device thereof, and in particular to an image reconstruction method and an image processing device for suppressing multi-mode noise.

傳輸誤差、位元錯誤以及雜訊感測等影像取得過程中所出現的干擾往往會使得所取得的影像中存在脈衝雜訊（impulse noise，IN）而造成不好的視覺效果。假設一個無雜訊的乾淨影像可表示成，當影像中的第個位置的畫素因脈衝雜訊而損壞，則可以方程式(1)來表示：其中，為第個位置的脈衝雜訊，為雜訊損壞的機率。一般來說，以8位元的灰階影像為例，因椒鹽雜訊（salt and pepper noise，SPN）而損壞的畫素的像素值等於最大值（即，255）或是最小值（即，0）。 Interferences in image acquisition such as transmission errors, bit errors, and noise sensing tend to cause impulse noise (IN) in the acquired images and cause poor visual effects. Suppose a clean image without noise can be represented as , When the first Pixels Damage due to impulse noise can be expressed by equation (1): among them , First Pulse noise at several locations, The probability of noise damage. Generally speaking, taking an 8-bit grayscale image as an example, the pixel value of a pixel damaged by salt and pepper noise (SPN) is equal to the maximum value (ie, 255) or the minimum value (ie, 0).

一種自影像中移除椒鹽雜訊的習知方式是先利用例如是方程式(2)所表示的二值化遮罩（binary mask）來偵測：其中所為0時代表位於的畫素因椒鹽雜訊而損壞，而所為1時代表位於的畫素為無雜訊。然而，此種方式在對於高度損壞的影像（即，雜訊的比例高於80%）並無法有效地移除雜訊。 A known way to remove salt and pepper noise from an image is to first detect it using a binary mask represented by equation (2), for example: among them When it is 0, it means Of pixels are damaged by salt and pepper noise, and When it is 1, the representative is located The pixels are no noise. However, this method cannot effectively remove noise in highly damaged images (that is, the proportion of noise is higher than 80%).

另一種習知方式是利用矩陣分解（matrix factorization）的方式，以將影像分解為內容成份以及雜訊成份，從而消除影像中的雜訊。然而，此種方式並未考慮到影像區塊之間的相關性而造成誤差。Another conventional method is to use matrix factorization to decompose the image into content components and noise components, thereby eliminating noise in the image. However, this method does not take into account the correlation between image blocks and causes errors.

有鑑於此，本發明提供一種用以抑制多模雜訊的影像重建方法及其影像處理裝置，其可針對具有高密度雜訊的損壞影像進行有效的修復，以重建出具有良好視覺效果的影像。In view of this, the present invention provides an image reconstruction method and an image processing device for suppressing multi-mode noise, which can effectively repair a damaged image with high-density noise to reconstruct an image with good visual effects. .

在本發明的一實施例中，上述的影像重建方法適用於影像處理裝置並且包括下列步驟。首先，接收輸入影像，其中輸入影像包括多個畫素，此些畫素包括具有雜訊的多個損壞畫素以及無雜訊的多個乾淨畫素。接著，將輸入影像輸入至影像重建模型，以取得影像重建模型所輸出的重建影像，其中影像重建模型關聯於各個畫素的全域環境、區域環境、相關環境、第一係數、第二係數以及第三係數，全域環境關聯於所有乾淨畫素，區域環境關聯於各個畫素的鄰近畫素所構成的區塊，相關環境關聯於各個畫素的相似畫素所構成的群聚，第一係數、第二係數以及第三係數分別關聯於全域環境、區域環境以及相關環境並且是於影像重建模型的訓練階段時以半監督式學習法所取得。In an embodiment of the present invention, the image reconstruction method is suitable for an image processing apparatus and includes the following steps. First, an input image is received, where the input image includes a plurality of pixels, and the pixels include a plurality of damaged pixels with noise and a plurality of clean pixels without noise. Then, the input image is input to the image reconstruction model to obtain the reconstructed image output by the image reconstruction model. The image reconstruction model is associated with the global environment, regional environment, related environment, first coefficient, second coefficient, and Three coefficients, the global environment is related to all clean pixels, the regional environment is related to the block made up of neighboring pixels of each pixel, and the related environment is related to the cluster of similar pixels of each pixel. The first coefficient, The second coefficient and the third coefficient are respectively related to the global environment, the regional environment, and the related environment and are obtained by the semi-supervised learning method during the training phase of the image reconstruction model.

在本發明的一實施例中，上述的影像處理裝置包括記憶體以及處理器，其中處理器耦接記憶體。記憶體用以儲存資料以及影像。處理器用以接收輸入影像，將輸入影像輸入至影像重建模型，以及取得影像重建模型所輸出的重建影像，其中輸入影像包括多個畫素，此些畫素包括具有雜訊的多個損壞畫素以及無雜訊的多個乾淨畫素，影像重建模型關聯於各個畫素的全域環境、區域環境、相關環境、第一係數、第二係數以及第三係數，全域環境關聯於所有乾淨畫素，區域環境關聯於各個畫素的鄰近畫素所構成的區塊，相關環境關聯於各個畫素的相似畫素所構成的群聚，第一係數、第二係數以及第三係數分別關聯於全域環境、區域環境以及相關環境並且是於影像重建模型的訓練階段時以半監督式學習法所取得。In an embodiment of the present invention, the image processing apparatus includes a memory and a processor, wherein the processor is coupled to the memory. The memory is used to store data and images. The processor is configured to receive an input image, input the input image to an image reconstruction model, and obtain a reconstructed image output from the image reconstruction model. The input image includes multiple pixels, and the pixels include multiple damaged pixels with noise. And multiple clean pixels without noise, the image reconstruction model is related to the global environment, regional environment, related environment, first coefficient, second coefficient, and third coefficient of each pixel, and the global environment is related to all clean pixels. The regional environment is related to the block formed by the neighboring pixels of each pixel. The related environment is related to the cluster of similar pixels of each pixel. The first coefficient, the second coefficient, and the third coefficient are related to the global environment. , Regional environment and related environment and obtained during the training phase of the image reconstruction model by a semi-supervised learning method.

為讓本發明的上述特徵和優點能更明顯易懂，下文特舉實施例，並配合所附圖式作詳細說明如下。In order to make the above features and advantages of the present invention more comprehensible, embodiments are hereinafter described in detail with reference to the accompanying drawings.

圖1是根據本發明一實施例所繪示的影像處理裝置的方塊圖，但此僅是為了方便說明，並不用以限制本發明。首先圖1先介紹影像處理裝置之所有構件以及配置關係，詳細功能將配合圖2一併揭露。FIG. 1 is a block diagram of an image processing apparatus according to an embodiment of the present invention, but this is only for convenience of description and is not intended to limit the present invention. First, FIG. 1 first introduces all components and configuration relationships of the image processing device, and detailed functions will be disclosed together with FIG. 2.

請參照圖1，影像處理裝置100至少包括記憶體110以及處理器120，其中處理器120耦接於記憶體110。影像處理裝置100可以是外接或是內建於例如是個人電腦、筆記型電腦、數位相機、數位攝影機、網路攝影機、智慧型手機、平板電腦、行車紀錄器、汽車影音系統等電子裝置。Referring to FIG. 1, the image processing apparatus 100 includes at least a memory 110 and a processor 120. The processor 120 is coupled to the memory 110. The image processing device 100 may be an external or built-in electronic device such as a personal computer, a notebook computer, a digital camera, a digital camera, a web camera, a smart phone, a tablet computer, a driving recorder, a car audio and video system, and the like.

記憶體110用以儲存視訊影像、資料，其可以例如是任意型式的固定式或可移動式隨機存取記憶體（random access memory，RAM）、唯讀記憶體（read-only memory，ROM）、快閃記憶體（flash memory）、硬碟或其他類似裝置或這些裝置的組合。The memory 110 is used to store video images and data, and may be, for example, any type of fixed or removable random access memory (RAM), read-only memory (ROM), Flash memory, hard disk, or other similar devices or a combination of these devices.

處理器120用以執行所提出的影像增強方法，其可以例如是中央處理單元（central processing unit，CPU），或是其他可程式化之一般用途或特殊用途的微處理器（microprocessor）、數位訊號處理器（digital signal processor，DSP）、可程式化控制器、特殊應用積體電路（application specific integrated circuits，ASIC）、可程式化邏輯裝置（programmable logic device，PLD）或其他類似裝置或這些裝置的組合。The processor 120 is configured to execute the proposed image enhancement method, which may be, for example, a central processing unit (CPU), or other programmable general-purpose or special-purpose microprocessors, digital signals, Processor (digital signal processor (DSP), programmable controller, application specific integrated circuits (ASIC), programmable logic device (programmable logic device (PLD) or other similar device or these devices) combination.

圖2是根據本發明之一實施例所繪示的影像重建方法的流程圖，而圖2的方法流程可以圖1的影像處理裝置100的各元件實現。FIG. 2 is a flowchart of an image reconstruction method according to an embodiment of the present invention, and the method flow of FIG. 2 may be implemented by each element of the image processing apparatus 100 of FIG. 1.

請同時參照圖1以及圖2，首先，影像處理裝置100的處理器120將接收輸入影像（步驟S202），並且將輸入影像輸入至影像重建模型（步驟S204）。在此的影像重建模型是基於影像中的畫素與其它畫素存在特定權重的正相關關聯性的假設下所建立，其可以全域環境（global context）、區域環境（local context）以及相關環境（social context）等三個屬性來建構，以下將分敘說明。Please refer to FIG. 1 and FIG. 2 at the same time. First, the processor 120 of the image processing apparatus 100 receives the input image (step S202), and inputs the input image to the image reconstruction model (step S204). The image reconstruction model here is based on the assumption that there is a positive correlation between pixels in the image and other pixels with specific weights. It can be used in global context, local context, and related environments ( social context) and other attributes.

以全域環境來說，其是利用距離反比權重法（inverse-distance weighting）的多變量插值（multivariate interpolation）全域性地自損壞影像中的所有乾淨畫素（clean pixels）所獲得，並且用以重建各個畫素以恢復整張損壞影像。當給定已知無雜訊的乾淨畫素時，第個位置上具有雜訊的損壞畫素可以方程式(3)來進行重建：其中為第個位置的乾淨畫素，為距離反比權重並且可以方程式(4)來計算出：其中為冪參數（power parameter）並且可以是設定為2.75，然而本發明並不限於此。 Global environment In other words, it is obtained from all clean pixels in the damaged image globally using multivariate interpolation of inverse-distance weighting, and is used to reconstruct each pixel To restore the entire damaged image. When given a known clean pixel with no noise When Corrupted pixels with noise at each location It can be reconstructed by equation (3): among them First Clean pixels at locations, Is the inverse distance weight and can be calculated by equation (4): among them Is a power parameter and can be set to 2.75, however the invention is not limited to this.

以區域環境來說，其可在基於一個畫素有很高的機率與其鄰近畫素的平均亮度值接近的假設下計算權重。因此，在一個區塊中的鄰近畫素的平均插值可用以產生一個雜訊畫素的區域環境，並且可以方程式(5)來表示：其中為以畫素為中心的區塊中的畫素，為區塊中的畫素的數量。 Regional environment In other words, it can calculate weights on the assumption that a pixel has a high probability of being close to the average brightness value of its neighboring pixels. Therefore, the average interpolation of neighboring pixels in a block can be used to generate the regional environment of a noisy pixel, and can be expressed by equation (5): among them Pixels Centered block Pixels in Block The number of pixels in the.

以相關環境來說，由於一般影像往往具有重覆的組成結構，因此其用以描述非區域性的重覆內容。以圖3A根據本發明之一實施例所繪示的輸入影像Img的示意圖為例，區塊311～314、區塊321～322、區塊331～334以及區塊341～343分別具有相似的組成結構並且具有相關性。基於此，具有相似的組成結構的區塊可以分群成一個群聚（cluster）。以圖3B根據本發明之一實施例所繪示的群聚的示意圖為例，區塊311～314、區塊321～322、區塊331～334以及區塊341～343將分別組合成群聚C1、C2、C3以及C4。在此可透過相似的非區域性的區塊，以雜訊畫素的相關環境的群聚插值取得類別內的散佈（within-class scatter）的距離權重，如方程式(6)所示：其中為類別內的散佈中的距離權重，並且可以方程式(7)來計算出：其中為以畫素為中心的區塊所屬的第個群聚，而為第個群聚中的區塊的數量。在本實施例中，類別內的散佈是利用K-means分群法（K-means clustering）而取得。 Relevant context In general, because general images often have repeated composition structures, they are used to describe non-regional repeated content. Taking the schematic diagram of the input image Img shown in FIG. 3A according to an embodiment of the present invention as an example, blocks 311-314, blocks 321-322, blocks 331-334, and blocks 341-343 have similar compositions, respectively. Structure and relevance. Based on this, blocks with similar composition structure can be grouped into a cluster. Taking the grouping diagram shown in FIG. 3B according to an embodiment of the present invention as an example, blocks 311 to 314, blocks 321 to 322, blocks 331 to 334, and blocks 341 to 343 will be combined into groups C1, C2, C3, and C4. Here, a similar non-regional block can be used to obtain the weight of the distance within the class (within-class scatter) by clustering interpolation of the relevant environment of the noise pixels, as shown in equation (6): among them Is the distance weight in the scatter within the category, and can be calculated by equation (7): among them Pixels Centered block Belongs to Groups, and First The number of blocks in a cluster. In this embodiment, the distribution within a category is obtained by using K-means clustering.

在本實施例中，第個位置的畫素的影像重建模型可以方程式(8)來表示：其中、、分別為線性係數，為模型的隨機誤差並且為高斯分布（即，）。方程式(8)所提出的影像重建模型可以將因椒鹽雜訊而損壞的雜訊影像進行有效地修復，而以下將利用半監督式學習方式（semi-supervised learning）以大量的雜訊影像來取得最佳化的三個係數、、以及變數。 In this embodiment, the first Pixels The image reconstruction model can be expressed by equation (8): among them , , Are linear coefficients, Is the random error of the model and is Gaussian (i.e., ). The image reconstruction model proposed by equation (8) can effectively repair the noisy images damaged by salt and pepper noise, and the following will use semi-supervised learning to obtain a large number of noisy images Three optimized coefficients , , And variables .

由於影像重建牽涉到高密度雜訊（high-density noise）、稀疏訓練樣本（sparse training samples）以及多模密度雜訊（multimodal density noise）等三個議題，為了有效地自影像中移除雜訊，在此將以方程式(9)的成本函數的最小化問題來描述：以下將以三個階段來說明此成本函數。 As image reconstruction involves three topics, high-density noise, sparse training samples, and multimodal density noise, in order to effectively remove noise from the image , Here we will use the cost function of equation (9) The minimization problem is described: The cost function will be explained in three stages below .

在第一階段中，影像重建模型的成本函數可有效地管理因高密度雜訊而損壞的影像。在此將以兩個原則來規範學習模型：1) 對於各個乾淨畫素，的估測值需接近於原始數值；以及2) 對於各個雜訊畫素與其相鄰畫素所估測的像素值之間的差值（即，）需和與之間的時空距離（spatiotemporal distance）成正比。根據此些原則，成本函數的第一項以及第二項可分別以方程式(10)以及方程式(11)來定義：其中為空間因子並且可以利用方程式(12)而計算出：其中可以是設定為1.25。 Cost function of image reconstruction model in the first stage Effectively manages images damaged by high-density noise. Here we will standardize the learning model with two principles: 1) For each clean pixel , The estimated value of X is close to the original value; and 2) for each noise pixel Adjacent pixels The difference between the estimated pixel values (i.e., ) Need and versus The spatiotemporal distance between them is directly proportional. According to these principles, the first term of the cost function And the second Can be defined by equation (10) and equation (11) respectively: among them Is a space factor and can be calculated using equation (12): among them Can be set to 1.25.

此外，由於損壞影像中大部份的雜訊畫素呈隨機分布，損壞影像中的乾淨畫素的平均亮度值將會接近於乾淨影像的平均亮度值。因此，成本函數的第三項可以方程式(13)來定義：其中以及分別為損壞影像中的乾淨畫素的數量以及全部畫素的數量。結合上述三項，成本函數可以方程式(14)來表示：其中在第一階段中，為正則化參數（regularization parameter）並且可以設定為0.3。 In addition, since most of the noise pixels in the damaged image are randomly distributed, the average brightness value of the clean pixels in the damaged image will be close to the average brightness value of the clean image. Therefore, the third term of the cost function It can be defined by equation (13): among them as well as The number of clean pixels and the total number of pixels in the damaged image. Combining the above three terms, the cost function can be expressed by equation (14): Among them in the first stage, Is a regularization parameter and can be set to 0.3.

在第二階段中，基於具有大量影像的影像資料庫中不具有雜訊的影像的數量有限，因此訓練集合中標記為乾淨影像者將極為稀疏。在此的影像資料庫可以分成損壞影像集合以及標記的乾淨影像集合。 In the second phase, based on an image database with a large number of images The number of images without noise in the image is limited, so those marked clean in the training set will be extremely sparse. Image library here Can be divided into a collection of damaged images And labeled clean image collection .

因此，在此的成本函數可以針對乾淨影像集合利用半監督式學習的方式來重建出乾淨影像。因此，成本函數可以進一步地以方程式(15)來表示：其中並且將會使得重建的畫素接近於標記的版本，並且可以方程式(17)來表示：因此，即便是影像集合中不完全是無損壞影像，影像重建模型中的參數仍可藉由重建出的影像與標記的乾淨影像具有高相似度的特性而學習出。 So the cost function here Can target clean image collections Use semi-supervised learning to reconstruct clean images. Therefore, the cost function can be further expressed by equation (15): among them and Will make the reconstructed pixels close to the labeled version, and can be represented by equation (17): Therefore, even if the image collection is not completely non-destructive, the parameters in the image reconstruction model can still be learned by the high similarity between the reconstructed image and the labeled clean image.

在第三階段中，由於影像資料庫中的損壞影像的雜訊密度不同，基於重建影像的平均亮度值將會接近於損壞影像集合中的乾淨畫素的平均亮度值，成本函數更可進一步地新增第五項，並且可以方程式(18)來表示：其中以及分別為影像資料庫中的乾淨畫素的數量以及全部畫素的數量。 In the third stage, due to the image database The noise density of the damaged image is different. Based on the average brightness value of the reconstructed image will be close to the average brightness value of the clean pixels in the damaged image collection. The cost function can further add a fifth term , And can be expressed by equation (18): among them as well as Image database The number of clean pixels in and the number of all pixels.

結合上述五項，成本函數可以方程式(19)來表示：其中如前述，此成本函數將會使得影像重建模型以此三個議題為基礎下來重建雜訊影像，而處理器120將會取得由影像重建模型所輸出的重建影像（步驟S206），而重建的影像達到良好的視覺效果。 Combining the above five terms, the cost function It can be expressed by equation (19): among them As mentioned earlier, this cost function Image reconstruction model Based on these three topics, the noise image is reconstructed, and the processor 120 will obtain the reconstructed image output by the image reconstruction model (step S206), and the reconstructed image achieves good visual effects.

基於影像資料庫中稀疏的標記的乾淨影像，影像重建模型可將每個輸入畫素修復成乾淨畫素。在此可將最大似然估計（maximum likelihood estimation，MLE）應用於成本函數以透過半監督式學習的方式來學習出係數、、以及變數。 Image database Sparsely labeled clean images, image reconstruction models Each input pixel can be Fixed into clean pixels. Here, maximum likelihood estimation (MLE) can be applied to the cost function Learn coefficients through semi-supervised learning , , And variables .

詳細來說，在此可隨機地以0%到90%的比例來將椒鹽雜訊效果合成於做為訓練的畫素中。在一範例中，做為訓練的畫素可以是自LabelMe資料庫所取得的109,379,580+個畫素，其包括52,324,300+個雜訊畫素以及34,055,270+無雜訊畫素。在此的目標是取得可以構成如方程式(21)的似然函數的聯合條件：其中為機率密度函數，為損壞畫素的重建畫素，為影像資料庫中畫素的數量。假設每一重建畫素的隨機誤差為獨立，則方程式(21)可以寫成方程式(22)： In detail, the salt and pepper noise effect can be randomly combined into the pixels used as training at a ratio of 0% to 90%. In an example, the pixels used for training may be 109,379,580+ pixels obtained from the LabelMe database, including 52,324,300+ noise pixels and 34,055,270+ noise-free pixels. The goal here is to obtain the joint conditions that can constitute a likelihood function such as equation (21): among them Is the probability density function, For damaged pixels Reconstructed pixels, Is the number of pixels in the image database. Assuming that the random error of each reconstructed pixel is independent, equation (21) can be written as equation (22):

在此可以似然函數的對數來取得最佳化的係數、、以及變數，以使似然函數可達到最大值，其可以方程式(23)來表示：接著，可將相對於係數、、進行偏微分計算，並且比方程式(24)來迭代更新：其中，為可將左邊項次的值設定為右邊項次的值的運算子。在此，以變數可分別對進行偏微分計算將可得到方程式(25)：接著，若是將方程式(25)設定為零，則可利用最大似然估計方式取得變數的估計值，如方程式(26)：另一方面，方程式(24)中的（其中）可以方程式(27)、方程式(28)以及方程式(29)表示： Likelihood function here Logarithm to get the optimal coefficient , , And variables To make the likelihood function It can reach the maximum value, which can be expressed by equation (23): Then, you can add Relative coefficient , , Perform partial differential calculations and update iteratively than equation (24): among them , An operator that can set the value of the left term to the value of the right term. Here, with the variable Separately Performing partial differential calculations will give equation (25): Next, if equation (25) is set to zero, the variable can be obtained using the maximum likelihood estimation method. The estimated value, as in equation (26): On the other hand, in equation (24), (among them ) Can be expressed by equation (27), equation (28), and equation (29):

在前述範例中，在經過20次迭代後，可得到最佳化的係數、、以及變數。 In the previous example, after 20 iterations, an optimized coefficient can be obtained , , And variables .

簡單來說，方程式(8)的影像重建模型中的線性係數、、以及變數可以圖4根據本發明一實施例所繪示的參數估算方法的流程圖來取得。 In simple terms, the linear coefficients in the image reconstruction model of equation (8) , , And variables It can be obtained by using the flowchart of the parameter estimation method shown in FIG. 4 according to an embodiment of the present invention.

請參照圖4，首先，處理器120將接收訓練影像集合的輸入影像（步驟S402）。接著，處理器120將開始進行參數的初始化設定：係數，係數，係數，迭代次數（步驟S404）。之後，處理器120將根據方程式(26)計算變數（步驟S406）。接著，在迭代的過程中，處理器120將根據方程式(27)更新各個影像的第個畫素的（步驟S408），根據方程式(28)更新各個影像的第個畫素的（步驟S410）以及根據方程式(29)更新各個影像的第個畫素的（步驟S412）。之後，處理器120將判斷迭代次數是否已達到最大迭代次數，即（步驟S414）。若否，則處理器120將重新回到步驟S408，以再次進行迭代的流程。若是，則處理器120將會輸出最新更新的係數、、以及變數（步驟S416），而完成參數估算方法的流程步驟。 Please refer to FIG. 4. First, the processor 120 receives a training image set. Input image (Step S402). Next, the processor 120 will start to initialize the parameters: coefficients ,coefficient ,coefficient , Number of iterations (Step S404). After that, the processor 120 will calculate the variables according to equation (26) (Step S406). Then, during the iteration, the processor 120 will update each image according to equation (27) First Pixels (Step S408), updating each image according to equation (28) First Pixels (Step S410) and updating each image according to equation (29) First Pixels (Step S412). After that, the processor 120 will determine whether the number of iterations has reached the maximum number of iterations. , which is (Step S414). If not, the processor 120 will return to step S408 to perform the iterative process again. If yes, the processor 120 will output the newly updated coefficients , , And variables (Step S416), and the process steps of the parameter estimation method are completed.

圖5是根據本發明之一實施例所繪示的影像重建方法的功能方塊圖。FIG. 5 is a functional block diagram of an image reconstruction method according to an embodiment of the present invention.

請參照圖5，I _IN為具有80%雜訊密度的輸入影像，而I’為其無雜訊版本的影像以做為對照參考之用。在處理器120會將輸入影像I _IN輸入至前述所提出的影像重建模型IRM後，將會取得修復後的重建影像I _OUT。 Please refer to FIG. 5, I _IN is an input image with 80% noise density, and I ′ is an image without noise for reference. After the processor 120 inputs the input image I _IN to the aforementioned image reconstruction model IRM, a restored reconstructed image I _OUT will be obtained.

綜上所述，本發明所提出用以抑制多模雜訊的影像重建方法及其影像處理裝置，其在考量到畫素的全域環境、局部環境以及相關環境的前提下，可利用根據半監督式學習所訓練出的影像重建模型有效地針對具有高密度雜訊的損壞影像進行有效的修復，以重建出具有良好視覺效果的影像。In summary, the image reconstruction method and image processing device for suppressing multi-mode noise proposed by the present invention can utilize the semi-supervised method on the premise of considering the global environment, local environment and related environment of pixels. The image reconstruction model trained by the learning method can effectively repair the damaged image with high-density noise to reconstruct an image with good visual effects.

雖然本發明已以實施例揭露如上，然其並非用以限定本發明，任何所屬技術領域中具有通常知識者，在不脫離本發明的精神和範圍內，當可作些許的更動與潤飾，故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed as above with the examples, it is not intended to limit the present invention. Any person with ordinary knowledge in the technical field can make some modifications and retouching without departing from the spirit and scope of the present invention. The protection scope of the present invention shall be determined by the scope of the attached patent application.

100：影像處理裝置 110：記憶體 120：處理器 S202～S206、S402～S416：步驟 311～314、321～323、331～334、341～343：區塊 C1、C2、C3、C4：群聚 I _IN：輸入影像 I _OUT：輸出影像 I’：原始影像 IRM：影像重建模型 100: Image processing device 110: Memory 120: Processors S202-S206, S402-S416: Steps 311-314, 321-323, 331-334, 341-343: Blocks C1, C2, C3, C4: Grouping I _IN : input image I _OUT : output image I ': original image IRM: image reconstruction model

圖1是根據本發明一實施例所繪示的影像處理裝置的方塊圖。圖2是根據本發明之一實施例所繪示的影像重建方法的流程圖。圖3A是根據本發明一實施例所繪示的輸入影像的示意圖。圖3B是根據本發明一實施例所繪示的群聚的示意圖。圖4是根據本發明之一實施例所繪示的參數估算方法的流程圖。圖5是根據本發明之一實施例所繪示的影像重建方法的功能方塊圖FIG. 1 is a block diagram of an image processing apparatus according to an embodiment of the present invention. FIG. 2 is a flowchart of an image reconstruction method according to an embodiment of the present invention. FIG. 3A is a schematic diagram of an input image according to an embodiment of the present invention. FIG. 3B is a schematic diagram of clustering according to an embodiment of the present invention. FIG. 4 is a flowchart of a parameter estimation method according to an embodiment of the present invention. 5 is a functional block diagram of an image reconstruction method according to an embodiment of the present invention;

Claims

An image reconstruction method for suppressing multi-mode noise is applicable to an image processing device. The method includes the following steps: receiving an input image, wherein the input image includes a plurality of pixels, and the pixels include a plurality of pixels with noise. Damaged pixels and multiple clean pixels without noise; input the input image to the image reconstruction model during the prediction stage of the image reconstruction model, where the image reconstruction model is associated with the global environment, regional environment, A related environment, a first coefficient, a second coefficient, and a third coefficient, the global environment is associated with all the clean pixels, and the regional environment is associated with a block formed by a plurality of adjacent pixels of each of the pixels, the The related environment is related to a cluster formed by a plurality of similar pixels of each of the pixels, the first coefficient, the second coefficient, and the third coefficient are respectively related to the global environment, the regional environment, and the related environment, and Obtained during the training phase of the image reconstruction model by a semi-supervised learning method; and obtaining a reconstructed image output by the image reconstruction model, wherein each of the images is This image reconstruction model _{is: f (x i) = θ} 0 g (x i) + θ 1 l (x i) + θ 2 s (x i) + ε (x i) where x _i is located i th Position pixel, f ( x _i ) is the image reconstruction model of the pixel x _i , g ( x _i ) is the global environment of the pixel x _i , and l ( x _i ) is the regional environment of the pixel x _i , S ( x _i ) is the relevant environment of the pixel x _i , θ _{0 is} the first coefficient, θ _{1 is} the second coefficient, θ _{2 is} the third coefficient, and ε ( x _i ) is the pixel x The random error of _i is also Gaussian, where ε ( x _i ) ~ G (0, σ ² ).

The method according to item 1 of the scope of the patent application, wherein the global environment g ( x _i ) of the pixel x _i is obtained from all the clean pixels by multivariate interpolation using an inverse distance weighting method.

The method according to item 2 of the scope of patent application, wherein the global environment g ( x _i ) of the pixel x _i is: Where x ^nf is the clean pixel, x _j is the clean pixel at the j- th position, λ _J is the inverse distance weight and is: Where β is a power parameter.

The method as defined in claim 3 of the scope of the item, wherein the pixel x _i in the local environment L (x _i) is calculated based on the average of the pixels adjacent to the interpolation plurality of pixels of the x _i.

The method according to item 4 of the scope of patent application, wherein the regional environment l ( x _i ) of the pixel x _i is: Where x _k is a pixel in a block Ω centered on the pixel x _i , and ω is the number of multiple pixels in the block Ω.

The method according to item 5 of the scope of patent application, wherein the relevant environment s ( x _i ) of the pixel x _i is obtained by using the clustering difference value to obtain the scattered distance weights within the category.

The method according to item 6 of the scope of patent application, wherein the relevant environment s ( x _i ) of the pixel x _i is: Where ω _k is the distance weight that is scattered within the category and is: Where Ω ( x _k ) is the c- th cluster to which the block Ω centered on the pixel x _k is, and ω is the number of all blocks in the c- th cluster.

The method according to item 1 of the scope of patent application, wherein the training phase is a cost function of the image reconstruction model based on high-density noise, sparse training samples, and multi-mode density noise, and uses a clean The image and the multiple noise images are trained using the semi-supervised learning method to optimize the cost function of the first coefficient, the second coefficient, and the third coefficient, wherein the number of the labeled clean images is large The number of noise images.

An image processing device includes: a memory for storing images and data; and a processor coupled to the memory for: receiving an input image, wherein the input image includes a plurality of pixels, and the pixels include a plurality of pixels Noisy pixels and multiple clean pixels without noise; input the input image to the image reconstruction model during the prediction phase of the image reconstruction model, where the image reconstruction model is related to the global environment, regional environment, A related environment, a first coefficient, a second coefficient, and a third coefficient, the global environment is associated with all the clean pixels in the input image, and the regional environment is associated with a plurality of neighboring pixels of each of the pixels Block, the related environment is associated with a cluster formed by a plurality of similar pixels of each of the pixels, the first coefficient, the second coefficient, and the third coefficient are associated with the global environment, the region, respectively The environment and the related environment are obtained by a semi-supervised learning method during the training phase of the image reconstruction model; and obtaining the input image output by the image reconstruction model Reconstructed image, wherein the image reconstruction of the model of each pixel _{is: f (x i) = θ} 0 g (x i) + θ 1 l (x i) + θ 2 s (x i) + ε (x _i ) where x _i is the pixel at the i- th position, f ( x _i ) is the image reconstruction model of the pixel x _i , g ( x _i ) is the global environment of the pixel x _i , and l ( x _i ) Is the regional environment of the pixel x _i , s ( x _i ) is the relevant environment of the pixel x _i , θ _{0 is} the first coefficient, θ _{1 is} the second coefficient, and θ _{2 is} the third coefficient, ε ( x _i ) is a random error of the pixel x _i and is a Gaussian distribution, where ε ( x _i ) ~ G (0, σ ² ).