CN102256126A

CN102256126A - Method for coding mixed image

Info

Publication number: CN102256126A
Application number: CN2011102001787A
Authority: CN
Inventors: 丁文鹏; 朱维佳; 施云慧; 尹宝才
Original assignee: Beijing University of Technology
Current assignee: Beijing University of Technology
Priority date: 2011-07-14
Filing date: 2011-07-14
Publication date: 2011-11-23

Abstract

The invention discloses a method for coding a mixed image with higher coding efficiency. In the method, a residual scalar quantization (RSQ) mode is used in the interframe coding of a mixed image, according to a principle that rate distortion cost is minimum, each transformation unit (TU) adaptively selects a discrete cosine transform (DCT) mode or the RSQ mode, and a bit is used for expressing the selection; and residual errors obtained after interframe prediction are directly quantized in the RSQ mode, and a step of transforming codes is omitted.

Description

Coding Method for Mixed Images

技术领域 technical field

本发明属于图像处理的技术领域，具体地涉及一种混合图像的编码方法。The invention belongs to the technical field of image processing, and in particular relates to a method for encoding mixed images.

背景技术 Background technique

在已有的技术中，混合图像的编码大致可以分为以下三类。In existing technologies, coding of mixed images can be roughly classified into the following three categories.

1.基于图像编码的方法：1. Method based on image coding:

这种方法还是采用传统的编码框架来对混合图像进行编码，但是针对不同的区域采用不同的量化步长，对于包含文字和图表内容的块，编码后重建效果会非常模糊，所以量化这些块时减少量化步长，为其多分配比特，让其重建的质量尽可能好。但是对于固定的比特数，如果为文字和图表块多分配了比特，那么自然图像块所分配的比特数就会相应地减少，所以总体来说，这种方法仍不能达到很好的效果。This method still uses the traditional encoding framework to encode the mixed image, but uses different quantization steps for different regions. For blocks containing text and graphics, the reconstruction effect after encoding will be very blurred, so when quantizing these blocks Reduce the quantization step size, allocate more bits to it, and make its reconstruction quality as good as possible. But for a fixed number of bits, if more bits are allocated for text and graphic blocks, then the number of bits allocated for natural image blocks will be correspondingly reduced, so generally speaking, this method still cannot achieve good results.

2.基于分层编码的方法：2. Method based on layered coding:

这种方法采用混合光栅模型来进行压缩。它将一幅图像或者一个图像块分为前景层、背景层和一个二进制算子的平面层。这个二进制算子的平面层表明图像或图像块的每个像素属于前景层还是背景层，并且这个二进制算子平面可以用JBIG标准或JBIG2标准进行压缩。而经过数据填充算法模糊后的前景层和背景层可以用传统的图像编码框架来进行压缩。论文“Compressing compound documents”(The Document and ImageCompression Handbook，M.Barni，Ed.New York：Marcel-Dekker，2005.)提出了这种方法，并通过实验证明这种方法与传统的编解码框架相比，可以取得很大的增益。但是这种方法有一些缺点：首先增益在很大程度上取决于分割的算法，即如何将图像分为前景和背景。第二，分割出来的洞如果不经过任何处理会损害编码的效果。第三，分别编码背景层的文本颜色和二进制算子的文本形状也会降低编码的效率。所以这种方法仍不能很好地满足应用的需求This method uses a hybrid raster model for compression. It divides an image or an image block into a foreground layer, a background layer and a plane layer of a binary operator. The plane layer of this binary operator indicates that each pixel of the image or image block belongs to the foreground layer or the background layer, and this binary operator plane can be compressed with JBIG standard or JBIG2 standard. The foreground and background layers blurred by the data filling algorithm can be compressed using the traditional image coding framework. The paper "Compressing compound documents" (The Document and ImageCompression Handbook, M.Barni, Ed.New York: Marcel-Dekker, 2005.) proposed this method, and it was proved by experiments that this method is compared with the traditional codec framework , a large gain can be obtained. But this method has some disadvantages: First, the gain depends largely on the segmentation algorithm, that is, how to divide the image into foreground and background. Second, the segmented holes will damage the encoding effect if they are not processed. Third, separately encoding the text color of the background layer and the text shape of the binary operator also reduces the efficiency of encoding. Therefore, this method still cannot meet the needs of the application well.

3.基于块的编码方法：3. Block-based encoding method:

这种方法可以分为两类：This approach can be divided into two categories:

一种是利用空域的性质，如直方图统计，梯度，颜色的数目等，可以将当前编码块分为几类中的一种，如文本，图表，自然图像等。然后不同的类别用不同的编码框架来进行压缩以适用于它们的统计特性，如图1所示。One is to use the properties of the airspace, such as histogram statistics, gradients, the number of colors, etc., to classify the current coding block into one of several categories, such as text, charts, natural images, etc. Different categories are then compressed with different encoding frameworks to suit their statistical properties, as shown in Figure 1.

另一种是考虑到混合图像中的文字和图表的直方图较稀疏，将其在时域分解为基础颜色和索引表的形式，即Base color and Index map(BCIM)模式。The other is to consider that the histogram of the text and graphics in the mixed image is relatively sparse, and decompose it into the form of the basic color and index table in the time domain, that is, the Base color and Index map (BCIM) mode.

尽管基于块的编码方法相比于前面提到的两种技术已经取得了很好的增益，但是目前所有这种编码方法都有一个共同点，就是只考虑了空间的相关性，即只对帧内部分进行了改进，而忽略了时间的相关性，没有将帧间编码的特点考虑进去，并没有设计出针对帧间编码的混合图像的编码方案。Although the block-based coding method has achieved good gains compared to the two aforementioned techniques, all current coding methods have one thing in common, that is, only the spatial correlation is considered, that is, only the frame The internal part is improved, but the time correlation is ignored, the characteristics of inter-frame coding are not taken into account, and the coding scheme for inter-frame coding mixed images is not designed.

发明内容 Contents of the invention

本发明的技术解决问题是：克服现有技术的不足，提供一种编码效率更高的混合图像的编码方法。The problem solved by the technology of the present invention is to overcome the shortcomings of the prior art and provide a method for encoding mixed images with higher encoding efficiency.

本发明的技术解决方案是：这种混合图像的编码方法，将RSQ(Residualscalar quantization，残差标量量化)模式用于混合图像的帧间编码：根据率失真代价最小的原则，每个TU(Transform unit，变换单元)自适应地选择DCT(Discrete cosine transform，离散余弦变换)模式或RSQ模式，并用一个比特表示该选择；RSQ模式是将帧间预测后得到的残差进行量化。The technical solution of the present invention is: this mixed image coding method uses RSQ (Residual scalar quantization, residual scalar quantization) mode for interframe coding of mixed images: according to the principle of minimum rate-distortion cost, each TU (Transform unit, transformation unit) adaptively selects DCT (Discrete cosine transform, discrete cosine transform) mode or RSQ mode, and uses one bit to represent the selection; RSQ mode quantizes the residual obtained after inter-frame prediction.

由于本发明解决了混合图像不适用于变换编码的问题，且由于经过帧间预测后所得到的残差会更小，所以对于混合图像的帧间编码效率会更高。Since the present invention solves the problem that the mixed image is not suitable for transform coding, and since the residual error obtained after the inter-frame prediction will be smaller, the inter-frame coding efficiency for the mixed image will be higher.

附图说明 Description of drawings

图1示出了现有的RSQ模式，即利用空域性质的基于块的编码方法；Figure 1 shows an existing RSQ mode, a block-based coding method that exploits spatial properties;

图2示出了根据本发明的混合图像的编码方法的流程图。Fig. 2 shows a flow chart of a method for encoding mixed images according to the present invention.

具体实施方式 Detailed ways

这种混合图像的编码方法，将RSQ(Residual scalar quantization，残差标量量化)模式用于混合图像的帧间编码：根据率失真代价最小的原则，每个TU(Transform unit，变换单元)自适应地选择DCT(Discretecosine transform，离散余弦变换)模式或RSQ模式，并用一个比特表示该选择；RSQ模式是将帧间预测后得到的残差直接进行量化，跳过变换编码的步骤。This mixed image coding method uses RSQ (Residual scalar quantization) mode for interframe coding of mixed images: according to the principle of the least rate-distortion cost, each TU (Transform unit, transformation unit) adaptive Select DCT (Discretecosine transform, discrete cosine transform) mode or RSQ mode freely, and use one bit to represent the choice; RSQ mode is to directly quantize the residual obtained after inter-frame prediction, and skip the step of transform coding.

当传输RSQ编码时，采用行顺序扫描，并为其分配上下文。When transmitting RSQ encoding, scan in row order and assign a context to it.

如图2所示，这种混合图像的编码方法，包括以下步骤：As shown in Figure 2, the encoding method of this mixed image includes the following steps:

(1)对于PU中的亮度分量，得到其预测块，进而与原始值相减得到残差，对残差块进行不同尺寸的划分，获得TU(Transform Unit，变换单元)；(1) For the luminance component in the PU, obtain its prediction block, and then subtract it from the original value to obtain the residual, divide the residual block into different sizes, and obtain TU (Transform Unit, transformation unit);

(2)将当前TU中的残差进行量化；(2) quantify the residual in the current TU;

(3)将量化后的TU传送至解码端；(3) Send the quantized TU to the decoding end;

(4)解码端的每个TU进行反量化；(4) Inverse quantization is performed on each TU at the decoding end;

(5)TU组合为PU，与经过运动补偿的得到的当前PU的预测值进行相加，得到PU的重构值，进而重建当前的CU(Coding Unit，编码单元)。(5) The TU is combined into a PU, which is added to the predicted value of the current PU after motion compensation to obtain the reconstruction value of the PU, and then the current CU (Coding Unit, coding unit) is reconstructed.

优选地，步骤(2)中的量化采用公式：C(S)＝sign(S)×max(0，floor(|S|/Q+1-Z)，其中S为输入信号，floor为向下取整函数，Q为量化步长，Z是预设的调节参数。Preferably, the quantization in step (2) adopts the formula: C(S)=sign(S)×max(0, floor(|S|/Q+1-Z), wherein S is the input signal, and floor is downward Rounding function, Q is the quantization step size, and Z is the preset adjustment parameter.

优选地，步骤(3)中的传送与HM标准不同的是不用zig-zag顺序扫描残差，而是按光栅扫描顺序依次扫描每个残差系数，然后依次传输。Preferably, the transmission in step (3) is different from the HM standard in that the residuals are not scanned in zig-zag order, but each residual coefficient is scanned sequentially in raster scanning order, and then transmitted sequentially.

优选地，步骤(4)中的反量化采用公式：R(x)＝sign(x)×round(q×|x|+p)，其中round为四舍五入取整函数，q，p是随着量化参数而变换的控制参数。Preferably, the dequantization in step (4) adopts the formula: R(x)=sign(x)×round(q×|x|+p), wherein round is a rounding function, and q and p are The control parameter transformed by the parameter.

以上所述，仅是本发明的较佳实施例，并非对本发明作任何形式上的限制，凡是依据本发明的技术实质对以上实施例所作的任何简单修改、等同变化与修饰，均仍属本发明技术方案的保护范围。The above are only preferred embodiments of the present invention, and are not intended to limit the present invention in any form. Any simple modifications, equivalent changes and modifications made to the above embodiments according to the technical essence of the present invention are still within the scope of this invention. The protection scope of the technical solution of the invention.

Claims

1. The coding method of the mixed image is characterized in that: the residual scalar quantization RSQ mode is used for the interframe coding of the mixed image: according to the principle of the minimum rate-distortion cost, each transformation unit TU adaptively selects the discrete cosine transform DCT mode Or RSQ mode, and use one bit to represent the selection; RSQ mode is to quantize the residual obtained after inter-frame prediction.

2. The method according to claim 1, characterized in that: when transmitting RSQ codes, row sequential scanning is used and an independent context is assigned to it.

3. the coding method of mixed image, it is characterized in that: comprise the following steps:

(1) For the luminance component in the prediction unit PU, the prediction block is obtained, and then subtracted from the original value to obtain the residual, and the residual block is divided into different sizes to obtain the transformation unit TU;

(2) quantify the residual in the current TU;

(3) Send the quantized TU to the decoding end;

(4) Inverse quantization is performed on each TU at the decoding end;

(5) The TUs are combined into PUs, which are added to the predicted value of the current PU after motion compensation to obtain the reconstruction value of the PU, and then the coding unit CU of the current coding unit is reconstructed.

4. The method according to claim 3, characterized in that: the quantization in the step (2) adopts formula: C(S)=sign(S)×max(0, floor(|S|/Q+1-Z ), where S is the input signal, floor is the rounding down function, Q is the quantization step size, and Z is the preset adjustment parameter.

5. The method according to claim 3, characterized in that: the transmission in step (3) is to scan each residual coefficient sequentially in a raster scanning order, and then transmit them sequentially.

6. The method according to claim 4, characterized in that: the inverse quantization in the step (4) adopts formula: R(x)=sign(x)×round(q×|x|+p), wherein round is The rounding and rounding function, q, p are the control parameters that change with the quantization parameters.