[go: up one dir, main page]

CN102256126A - Method for coding mixed image - Google Patents

Method for coding mixed image Download PDF

Info

Publication number
CN102256126A
CN102256126A CN2011102001787A CN201110200178A CN102256126A CN 102256126 A CN102256126 A CN 102256126A CN 2011102001787 A CN2011102001787 A CN 2011102001787A CN 201110200178 A CN201110200178 A CN 201110200178A CN 102256126 A CN102256126 A CN 102256126A
Authority
CN
China
Prior art keywords
residual
coding
mode
rsq
quantization
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011102001787A
Other languages
Chinese (zh)
Inventor
丁文鹏
朱维佳
施云慧
尹宝才
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN2011102001787A priority Critical patent/CN102256126A/en
Publication of CN102256126A publication Critical patent/CN102256126A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The invention discloses a method for coding a mixed image with higher coding efficiency. In the method, a residual scalar quantization (RSQ) mode is used in the interframe coding of a mixed image, according to a principle that rate distortion cost is minimum, each transformation unit (TU) adaptively selects a discrete cosine transform (DCT) mode or the RSQ mode, and a bit is used for expressing the selection; and residual errors obtained after interframe prediction are directly quantized in the RSQ mode, and a step of transforming codes is omitted.

Description

混合图像的编码方法Coding Method for Mixed Images

技术领域 technical field

本发明属于图像处理的技术领域,具体地涉及一种混合图像的编码方法。The invention belongs to the technical field of image processing, and in particular relates to a method for encoding mixed images.

背景技术 Background technique

在已有的技术中,混合图像的编码大致可以分为以下三类。In existing technologies, coding of mixed images can be roughly classified into the following three categories.

1.基于图像编码的方法:1. Method based on image coding:

这种方法还是采用传统的编码框架来对混合图像进行编码,但是针对不同的区域采用不同的量化步长,对于包含文字和图表内容的块,编码后重建效果会非常模糊,所以量化这些块时减少量化步长,为其多分配比特,让其重建的质量尽可能好。但是对于固定的比特数,如果为文字和图表块多分配了比特,那么自然图像块所分配的比特数就会相应地减少,所以总体来说,这种方法仍不能达到很好的效果。This method still uses the traditional encoding framework to encode the mixed image, but uses different quantization steps for different regions. For blocks containing text and graphics, the reconstruction effect after encoding will be very blurred, so when quantizing these blocks Reduce the quantization step size, allocate more bits to it, and make its reconstruction quality as good as possible. But for a fixed number of bits, if more bits are allocated for text and graphic blocks, then the number of bits allocated for natural image blocks will be correspondingly reduced, so generally speaking, this method still cannot achieve good results.

2.基于分层编码的方法:2. Method based on layered coding:

这种方法采用混合光栅模型来进行压缩。它将一幅图像或者一个图像块分为前景层、背景层和一个二进制算子的平面层。这个二进制算子的平面层表明图像或图像块的每个像素属于前景层还是背景层,并且这个二进制算子平面可以用JBIG标准或JBIG2标准进行压缩。而经过数据填充算法模糊后的前景层和背景层可以用传统的图像编码框架来进行压缩。论文“Compressing compound documents”(The Document and ImageCompression Handbook,M.Barni,Ed.New York:Marcel-Dekker,2005.)提出了这种方法,并通过实验证明这种方法与传统的编解码框架相比,可以取得很大的增益。但是这种方法有一些缺点:首先增益在很大程度上取决于分割的算法,即如何将图像分为前景和背景。第二,分割出来的洞如果不经过任何处理会损害编码的效果。第三,分别编码背景层的文本颜色和二进制算子的文本形状也会降低编码的效率。所以这种方法仍不能很好地满足应用的需求This method uses a hybrid raster model for compression. It divides an image or an image block into a foreground layer, a background layer and a plane layer of a binary operator. The plane layer of this binary operator indicates that each pixel of the image or image block belongs to the foreground layer or the background layer, and this binary operator plane can be compressed with JBIG standard or JBIG2 standard. The foreground and background layers blurred by the data filling algorithm can be compressed using the traditional image coding framework. The paper "Compressing compound documents" (The Document and ImageCompression Handbook, M.Barni, Ed.New York: Marcel-Dekker, 2005.) proposed this method, and it was proved by experiments that this method is compared with the traditional codec framework , a large gain can be obtained. But this method has some disadvantages: First, the gain depends largely on the segmentation algorithm, that is, how to divide the image into foreground and background. Second, the segmented holes will damage the encoding effect if they are not processed. Third, separately encoding the text color of the background layer and the text shape of the binary operator also reduces the efficiency of encoding. Therefore, this method still cannot meet the needs of the application well.

3.基于块的编码方法:3. Block-based encoding method:

这种方法可以分为两类:This approach can be divided into two categories:

一种是利用空域的性质,如直方图统计,梯度,颜色的数目等,可以将当前编码块分为几类中的一种,如文本,图表,自然图像等。然后不同的类别用不同的编码框架来进行压缩以适用于它们的统计特性,如图1所示。One is to use the properties of the airspace, such as histogram statistics, gradients, the number of colors, etc., to classify the current coding block into one of several categories, such as text, charts, natural images, etc. Different categories are then compressed with different encoding frameworks to suit their statistical properties, as shown in Figure 1.

另一种是考虑到混合图像中的文字和图表的直方图较稀疏,将其在时域分解为基础颜色和索引表的形式,即Base color and Index map(BCIM)模式。The other is to consider that the histogram of the text and graphics in the mixed image is relatively sparse, and decompose it into the form of the basic color and index table in the time domain, that is, the Base color and Index map (BCIM) mode.

尽管基于块的编码方法相比于前面提到的两种技术已经取得了很好的增益,但是目前所有这种编码方法都有一个共同点,就是只考虑了空间的相关性,即只对帧内部分进行了改进,而忽略了时间的相关性,没有将帧间编码的特点考虑进去,并没有设计出针对帧间编码的混合图像的编码方案。Although the block-based coding method has achieved good gains compared to the two aforementioned techniques, all current coding methods have one thing in common, that is, only the spatial correlation is considered, that is, only the frame The internal part is improved, but the time correlation is ignored, the characteristics of inter-frame coding are not taken into account, and the coding scheme for inter-frame coding mixed images is not designed.

发明内容 Contents of the invention

本发明的技术解决问题是:克服现有技术的不足,提供一种编码效率更高的混合图像的编码方法。The problem solved by the technology of the present invention is to overcome the shortcomings of the prior art and provide a method for encoding mixed images with higher encoding efficiency.

本发明的技术解决方案是:这种混合图像的编码方法,将RSQ(Residualscalar quantization,残差标量量化)模式用于混合图像的帧间编码:根据率失真代价最小的原则,每个TU(Transform unit,变换单元)自适应地选择DCT(Discrete cosine transform,离散余弦变换)模式或RSQ模式,并用一个比特表示该选择;RSQ模式是将帧间预测后得到的残差进行量化。The technical solution of the present invention is: this mixed image coding method uses RSQ (Residual scalar quantization, residual scalar quantization) mode for interframe coding of mixed images: according to the principle of minimum rate-distortion cost, each TU (Transform unit, transformation unit) adaptively selects DCT (Discrete cosine transform, discrete cosine transform) mode or RSQ mode, and uses one bit to represent the selection; RSQ mode quantizes the residual obtained after inter-frame prediction.

由于本发明解决了混合图像不适用于变换编码的问题,且由于经过帧间预测后所得到的残差会更小,所以对于混合图像的帧间编码效率会更高。Since the present invention solves the problem that the mixed image is not suitable for transform coding, and since the residual error obtained after the inter-frame prediction will be smaller, the inter-frame coding efficiency for the mixed image will be higher.

附图说明 Description of drawings

图1示出了现有的RSQ模式,即利用空域性质的基于块的编码方法;Figure 1 shows an existing RSQ mode, a block-based coding method that exploits spatial properties;

图2示出了根据本发明的混合图像的编码方法的流程图。Fig. 2 shows a flow chart of a method for encoding mixed images according to the present invention.

具体实施方式 Detailed ways

这种混合图像的编码方法,将RSQ(Residual scalar quantization,残差标量量化)模式用于混合图像的帧间编码:根据率失真代价最小的原则,每个TU(Transform unit,变换单元)自适应地选择DCT(Discretecosine transform,离散余弦变换)模式或RSQ模式,并用一个比特表示该选择;RSQ模式是将帧间预测后得到的残差直接进行量化,跳过变换编码的步骤。This mixed image coding method uses RSQ (Residual scalar quantization) mode for interframe coding of mixed images: according to the principle of the least rate-distortion cost, each TU (Transform unit, transformation unit) adaptive Select DCT (Discretecosine transform, discrete cosine transform) mode or RSQ mode freely, and use one bit to represent the choice; RSQ mode is to directly quantize the residual obtained after inter-frame prediction, and skip the step of transform coding.

当传输RSQ编码时,采用行顺序扫描,并为其分配上下文。When transmitting RSQ encoding, scan in row order and assign a context to it.

如图2所示,这种混合图像的编码方法,包括以下步骤:As shown in Figure 2, the encoding method of this mixed image includes the following steps:

(1)对于PU中的亮度分量,得到其预测块,进而与原始值相减得到残差,对残差块进行不同尺寸的划分,获得TU(Transform Unit,变换单元);(1) For the luminance component in the PU, obtain its prediction block, and then subtract it from the original value to obtain the residual, divide the residual block into different sizes, and obtain TU (Transform Unit, transformation unit);

(2)将当前TU中的残差进行量化;(2) quantify the residual in the current TU;

(3)将量化后的TU传送至解码端;(3) Send the quantized TU to the decoding end;

(4)解码端的每个TU进行反量化;(4) Inverse quantization is performed on each TU at the decoding end;

(5)TU组合为PU,与经过运动补偿的得到的当前PU的预测值进行相加,得到PU的重构值,进而重建当前的CU(Coding Unit,编码单元)。(5) The TU is combined into a PU, which is added to the predicted value of the current PU after motion compensation to obtain the reconstruction value of the PU, and then the current CU (Coding Unit, coding unit) is reconstructed.

优选地,步骤(2)中的量化采用公式:C(S)=sign(S)×max(0,floor(|S|/Q+1-Z),其中S为输入信号,floor为向下取整函数,Q为量化步长,Z是预设的调节参数。Preferably, the quantization in step (2) adopts the formula: C(S)=sign(S)×max(0, floor(|S|/Q+1-Z), wherein S is the input signal, and floor is downward Rounding function, Q is the quantization step size, and Z is the preset adjustment parameter.

优选地,步骤(3)中的传送与HM标准不同的是不用zig-zag顺序扫描残差,而是按光栅扫描顺序依次扫描每个残差系数,然后依次传输。Preferably, the transmission in step (3) is different from the HM standard in that the residuals are not scanned in zig-zag order, but each residual coefficient is scanned sequentially in raster scanning order, and then transmitted sequentially.

优选地,步骤(4)中的反量化采用公式:R(x)=sign(x)×round(q×|x|+p),其中round为四舍五入取整函数,q,p是随着量化参数而变换的控制参数。Preferably, the dequantization in step (4) adopts the formula: R(x)=sign(x)×round(q×|x|+p), wherein round is a rounding function, and q and p are The control parameter transformed by the parameter.

以上所述,仅是本发明的较佳实施例,并非对本发明作任何形式上的限制,凡是依据本发明的技术实质对以上实施例所作的任何简单修改、等同变化与修饰,均仍属本发明技术方案的保护范围。The above are only preferred embodiments of the present invention, and are not intended to limit the present invention in any form. Any simple modifications, equivalent changes and modifications made to the above embodiments according to the technical essence of the present invention are still within the scope of this invention. The protection scope of the technical solution of the invention.

Claims (6)

1.混合图像的编码方法,其特征在于:将残差标量量化RSQ模式用于混合图像的帧间编码:根据率失真代价最小的原则,每个变换单元TU自适应地选择离散余弦变换DCT模式或RSQ模式,并用一个比特表示该选择;RSQ模式是将帧间预测后得到的残差进行量化。1. The coding method of the mixed image is characterized in that: the residual scalar quantization RSQ mode is used for the interframe coding of the mixed image: according to the principle of the minimum rate-distortion cost, each transformation unit TU adaptively selects the discrete cosine transform DCT mode Or RSQ mode, and use one bit to represent the selection; RSQ mode is to quantize the residual obtained after inter-frame prediction. 2.根据权利要求1所述的方法,其特征在于:当传输RSQ编码时,采用行顺序扫描,并为其分配独立的上下文。2. The method according to claim 1, characterized in that: when transmitting RSQ codes, row sequential scanning is used and an independent context is assigned to it. 3.混合图像的编码方法,其特征在于:包括以下步骤:3. the coding method of mixed image, it is characterized in that: comprise the following steps: (1)对于预测单元PU中的亮度分量,得到其预测块,进而与原始值相减得到残差,对残差块进行不同尺寸的划分,获得变换单元TU;(1) For the luminance component in the prediction unit PU, the prediction block is obtained, and then subtracted from the original value to obtain the residual, and the residual block is divided into different sizes to obtain the transformation unit TU; (2)将当前TU中的残差进行量化;(2) quantify the residual in the current TU; (3)将量化后的TU传送至解码端;(3) Send the quantized TU to the decoding end; (4)解码端的每个TU进行反量化;(4) Inverse quantization is performed on each TU at the decoding end; (5)TU组合为PU,与经过运动补偿的得到的当前PU的预测值进行相加,得到PU的重构值,进而重建当前编码单元的编码单元CU。(5) The TUs are combined into PUs, which are added to the predicted value of the current PU after motion compensation to obtain the reconstruction value of the PU, and then the coding unit CU of the current coding unit is reconstructed. 4.根据权利要求3所述的方法,其特征在于:步骤(2)中的量化采用公式:C(S)=sign(S)×max(0,floor(|S|/Q+1-Z),其中S为输入信号,floor为向下取整函数,Q为量化步长,Z是预设的调节参数。4. The method according to claim 3, characterized in that: the quantization in the step (2) adopts formula: C(S)=sign(S)×max(0, floor(|S|/Q+1-Z ), where S is the input signal, floor is the rounding down function, Q is the quantization step size, and Z is the preset adjustment parameter. 5.根据权利要求3所述的方法,其特征在于:步骤(3)中的传送是按光栅扫描顺序依次扫描每个残差系数,然后依次传输。5. The method according to claim 3, characterized in that: the transmission in step (3) is to scan each residual coefficient sequentially in a raster scanning order, and then transmit them sequentially. 6.根据权利要求4所述的方法,其特征在于:步骤(4)中的反量化采用公式:R(x)=sign(x)×round(q×|x|+p),其中round为四舍五入取整函数,q,p是随着量化参数而变换的控制参数。6. The method according to claim 4, characterized in that: the inverse quantization in the step (4) adopts formula: R(x)=sign(x)×round(q×|x|+p), wherein round is The rounding and rounding function, q, p are the control parameters that change with the quantization parameters.
CN2011102001787A 2011-07-14 2011-07-14 Method for coding mixed image Pending CN102256126A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011102001787A CN102256126A (en) 2011-07-14 2011-07-14 Method for coding mixed image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011102001787A CN102256126A (en) 2011-07-14 2011-07-14 Method for coding mixed image

Publications (1)

Publication Number Publication Date
CN102256126A true CN102256126A (en) 2011-11-23

Family

ID=44983073

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011102001787A Pending CN102256126A (en) 2011-07-14 2011-07-14 Method for coding mixed image

Country Status (1)

Country Link
CN (1) CN102256126A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102447907A (en) * 2012-01-31 2012-05-09 北京工业大学 A method for encoding video sequences aimed at HEVC
CN104521232A (en) * 2012-07-02 2015-04-15 韩国电子通信研究院 Method and device for encoding/decoding images
CN107465923A (en) * 2016-06-06 2017-12-12 谷歌公司 Adaptive overlapping block prediction in the video coding of variable block length
CN109324778A (en) * 2018-12-04 2019-02-12 深圳市华星光电半导体显示技术有限公司 Compensate table compression method
US10298955B2 (en) 2012-08-23 2019-05-21 Microsoft Technology Licensing, Llc Non-transform coding

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1754389A (en) * 2003-01-08 2006-03-29 苹果电脑有限公司 Method and apparatus for improved coding mode selection
CN1846446A (en) * 2003-09-03 2006-10-11 汤姆森许可贸易公司 Method and arrangement for encoding video images
CN101009839A (en) * 2006-01-23 2007-08-01 三星电子株式会社 Method for video encoding or decoding based on orthogonal transform and vector quantization, and apparatus thereof
CN101305610A (en) * 2005-10-19 2008-11-12 飞思卡尔半导体公司 System and method for encoding mode determination for video encoding

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1754389A (en) * 2003-01-08 2006-03-29 苹果电脑有限公司 Method and apparatus for improved coding mode selection
CN1846446A (en) * 2003-09-03 2006-10-11 汤姆森许可贸易公司 Method and arrangement for encoding video images
CN101305610A (en) * 2005-10-19 2008-11-12 飞思卡尔半导体公司 System and method for encoding mode determination for video encoding
CN101009839A (en) * 2006-01-23 2007-08-01 三星电子株式会社 Method for video encoding or decoding based on orthogonal transform and vector quantization, and apparatus thereof

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102447907A (en) * 2012-01-31 2012-05-09 北京工业大学 A method for encoding video sequences aimed at HEVC
US10419765B2 (en) 2012-07-02 2019-09-17 Electronics And Telecommunications Research Institute Method and apparatus for coding/decoding image
US9843809B2 (en) 2012-07-02 2017-12-12 Electronics And Telecommunications Research Method and apparatus for coding/decoding image
US10045031B2 (en) 2012-07-02 2018-08-07 Electronic And Telecommunications Research Method and apparatus for coding/decoding image
US10187643B2 (en) 2012-07-02 2019-01-22 Electronics And Telecommunications Research Method and apparatus for encoding and decoding image
US10187644B2 (en) 2012-07-02 2019-01-22 Electronics And Telecommunications Research Method and apparatus for coding/decoding image
CN104521232A (en) * 2012-07-02 2015-04-15 韩国电子通信研究院 Method and device for encoding/decoding images
US10554982B2 (en) 2012-07-02 2020-02-04 Electronics And Telecommunications Research Institute Method and apparatus for coding/decoding image
US10554983B2 (en) 2012-07-02 2020-02-04 Electronics And Telecommunications Research Institute Method and apparatus for coding/decoding image
US12273538B2 (en) 2012-07-02 2025-04-08 Electronics And Telecommunications Research Institute Method and apparatus for coding/decoding image
US10298955B2 (en) 2012-08-23 2019-05-21 Microsoft Technology Licensing, Llc Non-transform coding
CN107465923A (en) * 2016-06-06 2017-12-12 谷歌公司 Adaptive overlapping block prediction in the video coding of variable block length
CN107465923B (en) * 2016-06-06 2020-09-25 谷歌有限责任公司 Adaptive overlapped block prediction in variable block size video coding
CN109324778A (en) * 2018-12-04 2019-02-12 深圳市华星光电半导体显示技术有限公司 Compensate table compression method

Similar Documents

Publication Publication Date Title
CN113411577B (en) Coding methods and devices
CN111819853B (en) Image block encoding device and image block encoding method
JP6749922B2 (en) Improved Palette Mode in High Efficiency Video Coding (HEVC) Screen Content Coding (SCC)
CN102223541B (en) Method for coding hybrid image
CN101356818B (en) Image encoding/decoding method and apparatus
US11418786B2 (en) Image encoding and decoding apparatus, and image encoding and decoding method
CN101938657B (en) Self-adaptively dividing method for code units in high-efficiency video coding
JP6568078B2 (en) Method and apparatus for encoding image data and method and apparatus for decoding image data
JP2018160932A5 (en) Palette coding for screen content coding
CN106170092A (en) Fast encoding method for lossless coding
CN104041035A (en) Lossless Coding and Associated Signaling Methods for Compound Video
CN107231557B (en) Encoding, decoding methods and apparatus for intelligent reordering in recursive block partitioning for advanced intra prediction in video coding
CN105659606A (en) Features of base color index map mode for video and image coding and decoding
CN109644277A (en) Video coding tool for sample process in circuit
US12075048B2 (en) Adaptive coding of prediction modes using probability distributions
CN107018416B (en) Adaptive tile data size coding for video and image compression
US12231639B2 (en) Image encoding and decoding apparatus, and image encoding and decoding method using contour mode based intra prediction
CN108174214A (en) A Remote Desktop Sharing Method Based on Screen Content Video Coding
CN102256126A (en) Method for coding mixed image
Naccari et al. Intensity dependent spatial quantization with application in HEVC
US20160029034A1 (en) Edge-selective intra coding
CN102845061A (en) Codeword restriction for high performance video coding
KR20180032570A (en) Modification to block size for transform mode in display stream compression
Giusto et al. Data Compression for Digital Photography: Performance comparison between proprietary solutions and standards
Liu et al. A novel bit rate allocation algorithm for motion JPEG2000

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20111123