CN102256126A - Method for coding mixed image - Google Patents
Method for coding mixed image Download PDFInfo
- Publication number
- CN102256126A CN102256126A CN2011102001787A CN201110200178A CN102256126A CN 102256126 A CN102256126 A CN 102256126A CN 2011102001787 A CN2011102001787 A CN 2011102001787A CN 201110200178 A CN201110200178 A CN 201110200178A CN 102256126 A CN102256126 A CN 102256126A
- Authority
- CN
- China
- Prior art keywords
- residual
- coding
- mode
- rsq
- quantization
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
技术领域 technical field
本发明属于图像处理的技术领域,具体地涉及一种混合图像的编码方法。The invention belongs to the technical field of image processing, and in particular relates to a method for encoding mixed images.
背景技术 Background technique
在已有的技术中,混合图像的编码大致可以分为以下三类。In existing technologies, coding of mixed images can be roughly classified into the following three categories.
1.基于图像编码的方法:1. Method based on image coding:
这种方法还是采用传统的编码框架来对混合图像进行编码,但是针对不同的区域采用不同的量化步长,对于包含文字和图表内容的块,编码后重建效果会非常模糊,所以量化这些块时减少量化步长,为其多分配比特,让其重建的质量尽可能好。但是对于固定的比特数,如果为文字和图表块多分配了比特,那么自然图像块所分配的比特数就会相应地减少,所以总体来说,这种方法仍不能达到很好的效果。This method still uses the traditional encoding framework to encode the mixed image, but uses different quantization steps for different regions. For blocks containing text and graphics, the reconstruction effect after encoding will be very blurred, so when quantizing these blocks Reduce the quantization step size, allocate more bits to it, and make its reconstruction quality as good as possible. But for a fixed number of bits, if more bits are allocated for text and graphic blocks, then the number of bits allocated for natural image blocks will be correspondingly reduced, so generally speaking, this method still cannot achieve good results.
2.基于分层编码的方法:2. Method based on layered coding:
这种方法采用混合光栅模型来进行压缩。它将一幅图像或者一个图像块分为前景层、背景层和一个二进制算子的平面层。这个二进制算子的平面层表明图像或图像块的每个像素属于前景层还是背景层,并且这个二进制算子平面可以用JBIG标准或JBIG2标准进行压缩。而经过数据填充算法模糊后的前景层和背景层可以用传统的图像编码框架来进行压缩。论文“Compressing compound documents”(The Document and ImageCompression Handbook,M.Barni,Ed.New York:Marcel-Dekker,2005.)提出了这种方法,并通过实验证明这种方法与传统的编解码框架相比,可以取得很大的增益。但是这种方法有一些缺点:首先增益在很大程度上取决于分割的算法,即如何将图像分为前景和背景。第二,分割出来的洞如果不经过任何处理会损害编码的效果。第三,分别编码背景层的文本颜色和二进制算子的文本形状也会降低编码的效率。所以这种方法仍不能很好地满足应用的需求This method uses a hybrid raster model for compression. It divides an image or an image block into a foreground layer, a background layer and a plane layer of a binary operator. The plane layer of this binary operator indicates that each pixel of the image or image block belongs to the foreground layer or the background layer, and this binary operator plane can be compressed with JBIG standard or JBIG2 standard. The foreground and background layers blurred by the data filling algorithm can be compressed using the traditional image coding framework. The paper "Compressing compound documents" (The Document and ImageCompression Handbook, M.Barni, Ed.New York: Marcel-Dekker, 2005.) proposed this method, and it was proved by experiments that this method is compared with the traditional codec framework , a large gain can be obtained. But this method has some disadvantages: First, the gain depends largely on the segmentation algorithm, that is, how to divide the image into foreground and background. Second, the segmented holes will damage the encoding effect if they are not processed. Third, separately encoding the text color of the background layer and the text shape of the binary operator also reduces the efficiency of encoding. Therefore, this method still cannot meet the needs of the application well.
3.基于块的编码方法:3. Block-based encoding method:
这种方法可以分为两类:This approach can be divided into two categories:
一种是利用空域的性质,如直方图统计,梯度,颜色的数目等,可以将当前编码块分为几类中的一种,如文本,图表,自然图像等。然后不同的类别用不同的编码框架来进行压缩以适用于它们的统计特性,如图1所示。One is to use the properties of the airspace, such as histogram statistics, gradients, the number of colors, etc., to classify the current coding block into one of several categories, such as text, charts, natural images, etc. Different categories are then compressed with different encoding frameworks to suit their statistical properties, as shown in Figure 1.
另一种是考虑到混合图像中的文字和图表的直方图较稀疏,将其在时域分解为基础颜色和索引表的形式,即Base color and Index map(BCIM)模式。The other is to consider that the histogram of the text and graphics in the mixed image is relatively sparse, and decompose it into the form of the basic color and index table in the time domain, that is, the Base color and Index map (BCIM) mode.
尽管基于块的编码方法相比于前面提到的两种技术已经取得了很好的增益,但是目前所有这种编码方法都有一个共同点,就是只考虑了空间的相关性,即只对帧内部分进行了改进,而忽略了时间的相关性,没有将帧间编码的特点考虑进去,并没有设计出针对帧间编码的混合图像的编码方案。Although the block-based coding method has achieved good gains compared to the two aforementioned techniques, all current coding methods have one thing in common, that is, only the spatial correlation is considered, that is, only the frame The internal part is improved, but the time correlation is ignored, the characteristics of inter-frame coding are not taken into account, and the coding scheme for inter-frame coding mixed images is not designed.
发明内容 Contents of the invention
本发明的技术解决问题是:克服现有技术的不足,提供一种编码效率更高的混合图像的编码方法。The problem solved by the technology of the present invention is to overcome the shortcomings of the prior art and provide a method for encoding mixed images with higher encoding efficiency.
本发明的技术解决方案是:这种混合图像的编码方法,将RSQ(Residualscalar quantization,残差标量量化)模式用于混合图像的帧间编码:根据率失真代价最小的原则,每个TU(Transform unit,变换单元)自适应地选择DCT(Discrete cosine transform,离散余弦变换)模式或RSQ模式,并用一个比特表示该选择;RSQ模式是将帧间预测后得到的残差进行量化。The technical solution of the present invention is: this mixed image coding method uses RSQ (Residual scalar quantization, residual scalar quantization) mode for interframe coding of mixed images: according to the principle of minimum rate-distortion cost, each TU (Transform unit, transformation unit) adaptively selects DCT (Discrete cosine transform, discrete cosine transform) mode or RSQ mode, and uses one bit to represent the selection; RSQ mode quantizes the residual obtained after inter-frame prediction.
由于本发明解决了混合图像不适用于变换编码的问题,且由于经过帧间预测后所得到的残差会更小,所以对于混合图像的帧间编码效率会更高。Since the present invention solves the problem that the mixed image is not suitable for transform coding, and since the residual error obtained after the inter-frame prediction will be smaller, the inter-frame coding efficiency for the mixed image will be higher.
附图说明 Description of drawings
图1示出了现有的RSQ模式,即利用空域性质的基于块的编码方法;Figure 1 shows an existing RSQ mode, a block-based coding method that exploits spatial properties;
图2示出了根据本发明的混合图像的编码方法的流程图。Fig. 2 shows a flow chart of a method for encoding mixed images according to the present invention.
具体实施方式 Detailed ways
这种混合图像的编码方法,将RSQ(Residual scalar quantization,残差标量量化)模式用于混合图像的帧间编码:根据率失真代价最小的原则,每个TU(Transform unit,变换单元)自适应地选择DCT(Discretecosine transform,离散余弦变换)模式或RSQ模式,并用一个比特表示该选择;RSQ模式是将帧间预测后得到的残差直接进行量化,跳过变换编码的步骤。This mixed image coding method uses RSQ (Residual scalar quantization) mode for interframe coding of mixed images: according to the principle of the least rate-distortion cost, each TU (Transform unit, transformation unit) adaptive Select DCT (Discretecosine transform, discrete cosine transform) mode or RSQ mode freely, and use one bit to represent the choice; RSQ mode is to directly quantize the residual obtained after inter-frame prediction, and skip the step of transform coding.
当传输RSQ编码时,采用行顺序扫描,并为其分配上下文。When transmitting RSQ encoding, scan in row order and assign a context to it.
如图2所示,这种混合图像的编码方法,包括以下步骤:As shown in Figure 2, the encoding method of this mixed image includes the following steps:
(1)对于PU中的亮度分量,得到其预测块,进而与原始值相减得到残差,对残差块进行不同尺寸的划分,获得TU(Transform Unit,变换单元);(1) For the luminance component in the PU, obtain its prediction block, and then subtract it from the original value to obtain the residual, divide the residual block into different sizes, and obtain TU (Transform Unit, transformation unit);
(2)将当前TU中的残差进行量化;(2) quantify the residual in the current TU;
(3)将量化后的TU传送至解码端;(3) Send the quantized TU to the decoding end;
(4)解码端的每个TU进行反量化;(4) Inverse quantization is performed on each TU at the decoding end;
(5)TU组合为PU,与经过运动补偿的得到的当前PU的预测值进行相加,得到PU的重构值,进而重建当前的CU(Coding Unit,编码单元)。(5) The TU is combined into a PU, which is added to the predicted value of the current PU after motion compensation to obtain the reconstruction value of the PU, and then the current CU (Coding Unit, coding unit) is reconstructed.
优选地,步骤(2)中的量化采用公式:C(S)=sign(S)×max(0,floor(|S|/Q+1-Z),其中S为输入信号,floor为向下取整函数,Q为量化步长,Z是预设的调节参数。Preferably, the quantization in step (2) adopts the formula: C(S)=sign(S)×max(0, floor(|S|/Q+1-Z), wherein S is the input signal, and floor is downward Rounding function, Q is the quantization step size, and Z is the preset adjustment parameter.
优选地,步骤(3)中的传送与HM标准不同的是不用zig-zag顺序扫描残差,而是按光栅扫描顺序依次扫描每个残差系数,然后依次传输。Preferably, the transmission in step (3) is different from the HM standard in that the residuals are not scanned in zig-zag order, but each residual coefficient is scanned sequentially in raster scanning order, and then transmitted sequentially.
优选地,步骤(4)中的反量化采用公式:R(x)=sign(x)×round(q×|x|+p),其中round为四舍五入取整函数,q,p是随着量化参数而变换的控制参数。Preferably, the dequantization in step (4) adopts the formula: R(x)=sign(x)×round(q×|x|+p), wherein round is a rounding function, and q and p are The control parameter transformed by the parameter.
以上所述,仅是本发明的较佳实施例,并非对本发明作任何形式上的限制,凡是依据本发明的技术实质对以上实施例所作的任何简单修改、等同变化与修饰,均仍属本发明技术方案的保护范围。The above are only preferred embodiments of the present invention, and are not intended to limit the present invention in any form. Any simple modifications, equivalent changes and modifications made to the above embodiments according to the technical essence of the present invention are still within the scope of this invention. The protection scope of the technical solution of the invention.
Claims (6)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2011102001787A CN102256126A (en) | 2011-07-14 | 2011-07-14 | Method for coding mixed image |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2011102001787A CN102256126A (en) | 2011-07-14 | 2011-07-14 | Method for coding mixed image |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN102256126A true CN102256126A (en) | 2011-11-23 |
Family
ID=44983073
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2011102001787A Pending CN102256126A (en) | 2011-07-14 | 2011-07-14 | Method for coding mixed image |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN102256126A (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102447907A (en) * | 2012-01-31 | 2012-05-09 | 北京工业大学 | A method for encoding video sequences aimed at HEVC |
| CN104521232A (en) * | 2012-07-02 | 2015-04-15 | 韩国电子通信研究院 | Method and device for encoding/decoding images |
| CN107465923A (en) * | 2016-06-06 | 2017-12-12 | 谷歌公司 | Adaptive overlapping block prediction in the video coding of variable block length |
| CN109324778A (en) * | 2018-12-04 | 2019-02-12 | 深圳市华星光电半导体显示技术有限公司 | Compensate table compression method |
| US10298955B2 (en) | 2012-08-23 | 2019-05-21 | Microsoft Technology Licensing, Llc | Non-transform coding |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1754389A (en) * | 2003-01-08 | 2006-03-29 | 苹果电脑有限公司 | Method and apparatus for improved coding mode selection |
| CN1846446A (en) * | 2003-09-03 | 2006-10-11 | 汤姆森许可贸易公司 | Method and arrangement for encoding video images |
| CN101009839A (en) * | 2006-01-23 | 2007-08-01 | 三星电子株式会社 | Method for video encoding or decoding based on orthogonal transform and vector quantization, and apparatus thereof |
| CN101305610A (en) * | 2005-10-19 | 2008-11-12 | 飞思卡尔半导体公司 | System and method for encoding mode determination for video encoding |
-
2011
- 2011-07-14 CN CN2011102001787A patent/CN102256126A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1754389A (en) * | 2003-01-08 | 2006-03-29 | 苹果电脑有限公司 | Method and apparatus for improved coding mode selection |
| CN1846446A (en) * | 2003-09-03 | 2006-10-11 | 汤姆森许可贸易公司 | Method and arrangement for encoding video images |
| CN101305610A (en) * | 2005-10-19 | 2008-11-12 | 飞思卡尔半导体公司 | System and method for encoding mode determination for video encoding |
| CN101009839A (en) * | 2006-01-23 | 2007-08-01 | 三星电子株式会社 | Method for video encoding or decoding based on orthogonal transform and vector quantization, and apparatus thereof |
Cited By (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102447907A (en) * | 2012-01-31 | 2012-05-09 | 北京工业大学 | A method for encoding video sequences aimed at HEVC |
| US10419765B2 (en) | 2012-07-02 | 2019-09-17 | Electronics And Telecommunications Research Institute | Method and apparatus for coding/decoding image |
| US9843809B2 (en) | 2012-07-02 | 2017-12-12 | Electronics And Telecommunications Research | Method and apparatus for coding/decoding image |
| US10045031B2 (en) | 2012-07-02 | 2018-08-07 | Electronic And Telecommunications Research | Method and apparatus for coding/decoding image |
| US10187643B2 (en) | 2012-07-02 | 2019-01-22 | Electronics And Telecommunications Research | Method and apparatus for encoding and decoding image |
| US10187644B2 (en) | 2012-07-02 | 2019-01-22 | Electronics And Telecommunications Research | Method and apparatus for coding/decoding image |
| CN104521232A (en) * | 2012-07-02 | 2015-04-15 | 韩国电子通信研究院 | Method and device for encoding/decoding images |
| US10554982B2 (en) | 2012-07-02 | 2020-02-04 | Electronics And Telecommunications Research Institute | Method and apparatus for coding/decoding image |
| US10554983B2 (en) | 2012-07-02 | 2020-02-04 | Electronics And Telecommunications Research Institute | Method and apparatus for coding/decoding image |
| US12273538B2 (en) | 2012-07-02 | 2025-04-08 | Electronics And Telecommunications Research Institute | Method and apparatus for coding/decoding image |
| US10298955B2 (en) | 2012-08-23 | 2019-05-21 | Microsoft Technology Licensing, Llc | Non-transform coding |
| CN107465923A (en) * | 2016-06-06 | 2017-12-12 | 谷歌公司 | Adaptive overlapping block prediction in the video coding of variable block length |
| CN107465923B (en) * | 2016-06-06 | 2020-09-25 | 谷歌有限责任公司 | Adaptive overlapped block prediction in variable block size video coding |
| CN109324778A (en) * | 2018-12-04 | 2019-02-12 | 深圳市华星光电半导体显示技术有限公司 | Compensate table compression method |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113411577B (en) | Coding methods and devices | |
| CN111819853B (en) | Image block encoding device and image block encoding method | |
| JP6749922B2 (en) | Improved Palette Mode in High Efficiency Video Coding (HEVC) Screen Content Coding (SCC) | |
| CN102223541B (en) | Method for coding hybrid image | |
| CN101356818B (en) | Image encoding/decoding method and apparatus | |
| US11418786B2 (en) | Image encoding and decoding apparatus, and image encoding and decoding method | |
| CN101938657B (en) | Self-adaptively dividing method for code units in high-efficiency video coding | |
| JP6568078B2 (en) | Method and apparatus for encoding image data and method and apparatus for decoding image data | |
| JP2018160932A5 (en) | Palette coding for screen content coding | |
| CN106170092A (en) | Fast encoding method for lossless coding | |
| CN104041035A (en) | Lossless Coding and Associated Signaling Methods for Compound Video | |
| CN107231557B (en) | Encoding, decoding methods and apparatus for intelligent reordering in recursive block partitioning for advanced intra prediction in video coding | |
| CN105659606A (en) | Features of base color index map mode for video and image coding and decoding | |
| CN109644277A (en) | Video coding tool for sample process in circuit | |
| US12075048B2 (en) | Adaptive coding of prediction modes using probability distributions | |
| CN107018416B (en) | Adaptive tile data size coding for video and image compression | |
| US12231639B2 (en) | Image encoding and decoding apparatus, and image encoding and decoding method using contour mode based intra prediction | |
| CN108174214A (en) | A Remote Desktop Sharing Method Based on Screen Content Video Coding | |
| CN102256126A (en) | Method for coding mixed image | |
| Naccari et al. | Intensity dependent spatial quantization with application in HEVC | |
| US20160029034A1 (en) | Edge-selective intra coding | |
| CN102845061A (en) | Codeword restriction for high performance video coding | |
| KR20180032570A (en) | Modification to block size for transform mode in display stream compression | |
| Giusto et al. | Data Compression for Digital Photography: Performance comparison between proprietary solutions and standards | |
| Liu et al. | A novel bit rate allocation algorithm for motion JPEG2000 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20111123 |