CN1564604A - Gradation tree set partitioning image compression method based on tree shaped structure - Google Patents
Gradation tree set partitioning image compression method based on tree shaped structure Download PDFInfo
- Publication number
- CN1564604A CN1564604A CN 200410017558 CN200410017558A CN1564604A CN 1564604 A CN1564604 A CN 1564604A CN 200410017558 CN200410017558 CN 200410017558 CN 200410017558 A CN200410017558 A CN 200410017558A CN 1564604 A CN1564604 A CN 1564604A
- Authority
- CN
- China
- Prior art keywords
- tree
- encoding
- threshold
- wavelet coefficients
- code stream
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
本发明为一种基于树状结构的等级树集合划分(SPIHT)视频图像压缩方法。编码端首先通过离散小波变换得到图像能量在时频率域上的分布,根据小波系数之间的相关性,将各级的小波系数按照树状结构进行划分;然后对每棵树的小波系数分别进行SPIHT编码,编码结果分别暂时存放在编码端;最后将每棵树的编码结果合成为一个码流用于存储或者传输。解码过程为编码过程的逆过程。本发明在不消耗多余计算量的前提下,大大节省计算过程中的内存使用,从而适应视频流实时高效的压缩,特别适用于硬件实现的专用系统,用较少的存储空间,就能实现高压缩比和低失真度的视频压缩。The invention is a method for compressing video images based on hierarchical tree set partitioning (SPIHT) based on tree structure. At the encoding end, the distribution of image energy in the time-frequency domain is firstly obtained through discrete wavelet transform. According to the correlation between wavelet coefficients, the wavelet coefficients of each level are divided according to the tree structure; then the wavelet coefficients of each tree are respectively SPIHT encoding, the encoding results are temporarily stored in the encoding end; finally, the encoding results of each tree are synthesized into a code stream for storage or transmission. The decoding process is the reverse process of the encoding process. The present invention greatly saves the use of memory in the calculation process without consuming redundant calculations, thereby adapting to real-time and efficient compression of video streams, and is especially suitable for special systems implemented by hardware, and can achieve high Compression ratio and low distortion video compression.
Description
技术领域technical field
本发明属于视频图像压缩技术领域,具体涉及一种基于树状结构的等级树集合划分视频图像压缩方法。The invention belongs to the technical field of video image compression, and in particular relates to a video image compression method based on hierarchical tree set division of a tree structure.
背景技术Background technique
等级树集合划分(SPIHT)算法充分考虑了数据之间的相关性,并且在编码时还考虑了同一数据中高比特数据重要性高于低比特数据的特性。所以使用SPIHT方法来压缩、解压缩视频图像可以得到比较高的压缩比而不增加解压缩结果的失真度,所以该方法受到了日益广泛的关注。在具体实现的过程中,编码系统需要建立三个链表,即重要像素链表(LSP)、不重要像素链表(LIP)和不重要像素集合链表(LIS),这三个链表用来记录树状结构分裂的中间数据。通过链表的使用,编码码流可以按照阈值(重要性)下降的顺序排列,从而保证重要信息的传输而可以截断非重要信息,得到任意截断码流的高压缩比的压缩效果。为了提高压缩效果,要求树状结构包括更多的数据。但是随着树状结构中数据量的增加,三个链表的长度就越来越长,在实际应用中就要求有巨大的内存空间,这就增加了系统的成本和复杂度。所以在不降低压缩效果的前提下,缩短链表长度的方法正成为研究的热点。The Hierarchical Tree Set Partitioning (SPIHT) algorithm fully considers the correlation between data, and also considers the characteristic that high-bit data is more important than low-bit data in the same data when encoding. So using the SPIHT method to compress and decompress video images can get a relatively high compression ratio without increasing the distortion of the decompressed results, so this method has received increasing attention. In the specific implementation process, the encoding system needs to establish three linked lists, namely the important pixel linked list (LSP), the unimportant pixel linked list (LIP) and the unimportant pixel set linked list (LIS). These three linked lists are used to record the tree structure. Split intermediate data. Through the use of the linked list, the coded streams can be arranged in the descending order of the threshold (importance), so as to ensure the transmission of important information and truncate non-important information, and obtain the compression effect of arbitrarily truncated code streams with high compression ratio. In order to improve the compression effect, the tree structure is required to include more data. However, as the amount of data in the tree structure increases, the length of the three linked lists becomes longer and longer, which requires a huge memory space in practical applications, which increases the cost and complexity of the system. Therefore, under the premise of not reducing the compression effect, the method of shortening the length of the linked list is becoming a research hotspot.
发明内容Contents of the invention
本发明的目的是提出一种基于树状结构的等级树集合划分(SPIHT)视频图像压缩方法,以保证压缩效果不下降的前提下,大大缩短链表的长度,节省系统的内存空间开销。The purpose of this invention is to propose a kind of hierarchical tree set partitioning (SPIHT) video image compression method based on tree structure, under the premise that guarantees that compression effect does not descend, shorten the length of link list greatly, save the memory space overhead of system.
本发明提出的基于树状结构的等级树集合划分(SPIHT)视频图像压缩方法,编码的具体步骤如下:首先通过离散小波变换得到图像能量在时频率域上的分布,由于图像的平滑性,图像能量集中在低频部分;根据小波系数之间的相关性,将各级的小波系数按照树状结构进行划分;然后对每棵树的小波系数分别进行SPIHT编码,编码结果分别暂时存放在编码端;最后将每棵树的编码结果合成为一个码流用于存储或者传输。According to the hierarchical tree set partitioning (SPIHT) video image compression method based on the tree structure proposed by the present invention, the specific steps of encoding are as follows: firstly, the distribution of image energy in the time-frequency domain is obtained by discrete wavelet transform, and due to the smoothness of the image, the image The energy is concentrated in the low frequency part; according to the correlation between the wavelet coefficients, the wavelet coefficients of each level are divided according to the tree structure; then the wavelet coefficients of each tree are separately encoded by SPIHT, and the encoding results are temporarily stored in the encoding end; Finally, the encoding results of each tree are synthesized into a code stream for storage or transmission.
根据小波系数之间的数据相关性,将各级的小波系数按照树状结构进行划分是指以最低频子带的每个系数为树根,按照不同级别之间小波系数位置的数据相关性得到树状结构中每个点的数据。树状结构中,上一级小波系数和下一级小波系数之间的关系称为父母和子女或后代的关系。在小波系数中,不同子带相同位置的系数,往往在数值上有相似性,根据这样的关系,将最低频子带的每个系数作为树的根节点,高一级的子代中相同位置的系数作为树状结构的第一级子女,更高一级的子代中与每个第一级的子女相同位置的系数作为第一级子女的子女,也是树状结构的第二级子女,……直到最高频子带的系数作为最后一级的子女。According to the data correlation between the wavelet coefficients, dividing the wavelet coefficients of each level according to the tree structure refers to taking each coefficient of the lowest frequency sub-band as the root of the tree, and according to the data correlation of the positions of the wavelet coefficients between different levels, we can get Data for each point in the tree structure. In the tree structure, the relationship between the upper-level wavelet coefficients and the lower-level wavelet coefficients is called the relationship between parents and children or offspring. In the wavelet coefficients, the coefficients in the same position of different sub-bands are often similar in value. According to this relationship, each coefficient of the lowest frequency sub-band is used as the root node of the tree, and the same position in the higher-level children The coefficients of are taken as the first-level children of the tree structure, and the coefficients of the higher-level children in the same position as each first-level child are the children of the first-level children, which are also the second-level children of the tree structure, ...up to the coefficients of the highest frequency subband as children of the last level.
对每棵树的小波系数分别进行SPIHT编码,可以减少同时处理的小波系数,产生的中间结果较少,缩短了重要像素链表(LSP)、不重要像素链表(LIP)和不重要像素集合链表(LIS)的长度。其方法就是将每棵树的编码结果都按照阈值下降的顺序依次得到,直到阈值下降到可以满足压缩要求为止。阈值下降极限可以由前一帧组的最小阈值或者经验阈值得到的预测阈值决定。每棵树的小波系数进行SPIHT编码的结果不予直接传输,而是暂存在编码端,存放时将各阈值情况下的编码码流依次存放,并且记录各阈值情况下的编码码流长度。SPIHT encoding is performed on the wavelet coefficients of each tree separately, which can reduce the wavelet coefficients processed at the same time, produce less intermediate results, and shorten the important pixel linked list (LSP), unimportant pixel linked list (LIP) and unimportant pixel set linked list ( LIS) length. The method is to obtain the encoding results of each tree in the descending order of the threshold until the threshold drops enough to meet the compression requirements. The threshold drop limit can be determined by the minimum threshold of the previous frame group or the predicted threshold obtained from the empirical threshold. The result of SPIHT encoding of the wavelet coefficients of each tree is not directly transmitted, but is temporarily stored at the encoding end. When storing, the coded streams under each threshold are stored in sequence, and the length of the coded stream under each threshold is recorded.
在所有的树状结构的小波系数编码结束后,为了得到符合压缩比要求的目标码流,需要将每棵树的编码结果合成为目标码流。合成码流的方法是确定最小的阈值,称为截断阈值,使得每棵树编码码流中不小于该阈值的码流之和不大于目标码流长度,将这些编码码流和码流长度合成为目标码流,剩余的目标码流再由每棵树的其余编码码流平均分配。就是将每棵树编码结果中阈值不小于截断阈值的码流和这些码流的长度直接作为目标码流,目标码流不足的部分由每棵树编码结果中阈值小于截断阈值的码流平均分配。After the coding of all tree-structured wavelet coefficients is completed, in order to obtain the target code stream that meets the compression ratio requirements, it is necessary to synthesize the coding results of each tree into the target code stream. The method of synthesizing the code stream is to determine the minimum threshold, which is called the truncation threshold, so that the sum of the code streams not less than the threshold in the code stream of each tree is not greater than the target code stream length, and these code streams and code stream lengths are synthesized is the target code stream, and the remaining target code streams are evenly distributed by the remaining code streams of each tree. That is, the code streams whose threshold value is not less than the truncation threshold in the coding result of each tree and the length of these code streams are directly used as the target code stream, and the insufficient part of the target code stream is evenly distributed by the code streams whose threshold is smaller than the truncation threshold in the coding result of each tree .
编码过程的重点在于树状结构的划分、树状结构小波系数编码结果的存放和目标码流的合成。The key points of the coding process are the division of the tree structure, the storage of the coding result of the tree structure wavelet coefficients and the synthesis of the target code stream.
在解码端,解码过程是编码过程的逆过程:首先将待解码的码流分配给每棵树的缓存,再对每棵树分配到的码流依次进行SPIHT解码,得到树状结构的小波系数,再将树状结构的小波系数还原为按子带排布的小波系数,通过小波逆变换得到解码图像。At the decoding end, the decoding process is the inverse process of the encoding process: firstly, the code stream to be decoded is allocated to the cache of each tree, and then the code stream allocated to each tree is sequentially decoded by SPIHT to obtain the wavelet coefficients of the tree structure , and then restore the wavelet coefficients in the tree structure to wavelet coefficients arranged in sub-bands, and obtain the decoded image through inverse wavelet transform.
本发明所提出的基于树状结构的等级树集合划分(SPIHT)视频图像压缩方法,有效的解决了图像数据量和链表长度之间的矛盾。为了提高压缩效果,可以将多帧的图像(帧组)一起进行离散小波变换,使得每棵树可以包括足够多的小波系数;由于每棵树分别编码,并不会导致重要像素链表(LSP)、不重要像素链表(LIP)和不重要像素集合链表(LIS)长度的过度加长。The hierarchical tree set partitioning (SPIHT) video image compression method based on the tree structure proposed by the present invention effectively solves the contradiction between the amount of image data and the length of the linked list. In order to improve the compression effect, multiple frames of images (frame groups) can be subjected to discrete wavelet transformation together, so that each tree can include enough wavelet coefficients; since each tree is encoded separately, it will not lead to a linked list of important pixels (LSP) , The length of the unimportant pixel linked list (LIP) and the unimportant pixel set linked list (LIS) is excessively lengthened.
具体实施方式Detailed ways
一、编码端的具体实施方法1. The specific implementation method of the encoding end
1、离散小波变换结果的树状结构划分1. Tree structure division of discrete wavelet transform results
离散小波变换可以使用三维的离散小波变换,即在行方向、列方向和时间方向分别进行离散小波变换。变换结果的最低频每个系数作为一棵树的根节点,并且按照下面的关系,构成树状结构。假设最低频系数的大小为Wmin×Hmin,其中Wmin和Hmin分别是最低频帧的最低频子带的宽度和高度。Discrete wavelet transform can use three-dimensional discrete wavelet transform, that is, perform discrete wavelet transform in row direction, column direction and time direction respectively. Each coefficient of the lowest frequency of the transformation result is used as the root node of a tree, and a tree structure is formed according to the following relationship. It is assumed that the size of the lowest frequency coefficient is W min ×H min , where W min and H min are respectively the width and height of the lowest frequency sub-band of the lowest frequency frame.
1)根节点子女寻找方法 其子女为:1) The method of finding the children of the root node The children are:
2)二维子女寻找方法 其子女为:2) Two-dimensional child search method whose children are:
{(2x,2y,z),(2x+1,2y,z),(2x,2y+1,z),(2x+1,2y+1,z)} (2){(2x, 2y, z), (2x+1, 2y, z), (2x, 2y+1, z), (2x+1, 2y+1, z)} (2)
3)三维子女寻找方法 其子女为:3) The method of finding three-dimensional children whose children are:
2、每棵树状结构小波系数SPIHT编码结果的存放2. The storage of the SPIHT encoding result of each tree structure wavelet coefficient
DM,N表示阈值从2N+1下降到2N时第M棵树阈值为N的编码数据。LM,N表示阈值从2N+1下降到2N时第M棵树阈值为N的编码数据长度。所有树的编码结果存放的格式如下:
3、目标码流的合成3. Synthesis of the target code stream
如果有M棵树,要求的目标码流长度为Q。在阈值降到2P时,所有树的总码流长度为
第1棵树
第2棵树
……...
第M棵树
D1,P-1中的第1个比特;D2,P-1中的第1个比特;……DM,P-1中的第1个比特;D 1, the first bit in P-1 ; D 2, the first bit in P-1 ; ... D M, the first bit in P-1 ;
D1,P-1中的第2个比特;D2,P-1中的第2个比特;……DM,P-1中的第2个比特;D 1, the 2nd bit in P-1 ; D 2, the 2nd bit in P-1 ; ... D M, the 2nd bit in P-1 ;
……...
直到目标码流长度达到要求。Until the target stream length reaches the requirement.
二、解码端的具体实施方法2. The specific implementation method of the decoder
解码的过程完全为编码的逆过程。首先将待解码的码流分配给每棵树的缓存,再对每棵树分配到的码流依次进行SPIHT解码,得到树状结构的小波系数,再将树状结构的小波系数还原为按子带排布的小波系数,通过小波逆变换得到解码图像。The decoding process is completely the reverse process of encoding. First, the code stream to be decoded is allocated to the cache of each tree, and then the code stream allocated to each tree is sequentially decoded by SPIHT to obtain the wavelet coefficient of the tree structure, and then the wavelet coefficient of the tree structure is restored to the The wavelet coefficients with arrangement are obtained by inverse wavelet transform to obtain the decoded image.
三、仿真的结果3. Simulation results
具体的仿真条件如下:The specific simulation conditions are as follows:
Miss American视频图像组1-8帧图像的Y值数据,每帧图像大小为352×288。进行三级三维离散小波变换,再对低频帧进行两级二维离散小波变换,小波基选用Daubechies9/7双正交小波(行方向和列方向)和Haar小波(时间方向)。共有99棵树。The Y value data of frames 1-8 of the Miss American video image group, and the image size of each frame is 352×288. Three-level three-dimensional discrete wavelet transform is performed, and then two-level two-dimensional discrete wavelet transform is performed on low-frequency frames. The wavelet base uses Daubechies9/7 biorthogonal wavelet (row direction and column direction) and Haar wavelet (time direction). There are 99 trees in total.
实验结果如下:
*优化后的LIP、LIP、LIS是99棵树中最大的长度,并且每棵树编码都进行到阈值降为8为止,试验证明阈值降到8,一般就能满足压缩比的要求。 * The optimized LIP, LIP, and LIS are the largest lengths among the 99 trees, and each tree is coded until the threshold is reduced to 8. The test proves that the threshold is reduced to 8, which can generally meet the requirements of the compression ratio.
通过上面的实验结果我们可以发现,本SPIHT编码方法的结果虽然降低了PSNR(降低得非常小),但是用于存储链表的空间可以大大的减小。Through the above experimental results, we can find that although the result of the SPIHT encoding method reduces the PSNR (reduces very little), the space for storing the linked list can be greatly reduced.
Claims (5)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN 200410017558 CN1564604A (en) | 2004-04-08 | 2004-04-08 | Gradation tree set partitioning image compression method based on tree shaped structure |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN 200410017558 CN1564604A (en) | 2004-04-08 | 2004-04-08 | Gradation tree set partitioning image compression method based on tree shaped structure |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN1564604A true CN1564604A (en) | 2005-01-12 |
Family
ID=34479040
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN 200410017558 Pending CN1564604A (en) | 2004-04-08 | 2004-04-08 | Gradation tree set partitioning image compression method based on tree shaped structure |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN1564604A (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101682765B (en) * | 2007-05-30 | 2012-04-11 | 三叉微系统(远东)有限公司 | Method of determining an image distribution for a light field data structure |
| CN102473315A (en) * | 2009-08-20 | 2012-05-23 | 汤姆森特许公司 | Method and apparatus for reusing tree structures to encode and decode binary sets |
| CN103024399A (en) * | 2013-01-18 | 2013-04-03 | 北京航空航天大学 | Wavelet transform based extreme-low bit-rate video compressing and coding method |
| CN102187656B (en) * | 2008-10-27 | 2014-10-01 | 日本电信电话株式会社 | Method for automatically generating pixel prediction value generation sequence, image encoding method, image decoding method, and device thereof |
| CN106612437A (en) * | 2016-01-11 | 2017-05-03 | 四川用联信息技术有限公司 | Graphic image compression method |
| CN112995637A (en) * | 2021-03-10 | 2021-06-18 | 湘潭大学 | Multi-section medical image compression method based on three-dimensional discrete wavelet transform |
-
2004
- 2004-04-08 CN CN 200410017558 patent/CN1564604A/en active Pending
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101682765B (en) * | 2007-05-30 | 2012-04-11 | 三叉微系统(远东)有限公司 | Method of determining an image distribution for a light field data structure |
| CN102187656B (en) * | 2008-10-27 | 2014-10-01 | 日本电信电话株式会社 | Method for automatically generating pixel prediction value generation sequence, image encoding method, image decoding method, and device thereof |
| US9124289B2 (en) | 2008-10-27 | 2015-09-01 | Nippon Telegraph And Telephone Corporation | Predicted pixel value generation procedure automatic producing method, image encoding method, image decoding method, apparatus therefor, programs therefor, and storage media which store the programs |
| CN102473315A (en) * | 2009-08-20 | 2012-05-23 | 汤姆森特许公司 | Method and apparatus for reusing tree structures to encode and decode binary sets |
| CN103024399A (en) * | 2013-01-18 | 2013-04-03 | 北京航空航天大学 | Wavelet transform based extreme-low bit-rate video compressing and coding method |
| CN106612437A (en) * | 2016-01-11 | 2017-05-03 | 四川用联信息技术有限公司 | Graphic image compression method |
| CN112995637A (en) * | 2021-03-10 | 2021-06-18 | 湘潭大学 | Multi-section medical image compression method based on three-dimensional discrete wavelet transform |
| CN112995637B (en) * | 2021-03-10 | 2023-02-28 | 湘潭大学 | A Multi-section Medical Image Compression Method Based on 3D Discrete Wavelet Transform |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN1181690C (en) | Coding method for compressing video sequences | |
| CN1798341B (en) | Adaptive coefficient scan order | |
| CN107481295B (en) | Image Compression System Based on Dynamic Byte Length Allocation with Convolutional Neural Network | |
| CN101984666A (en) | Image lossless compression and decompression method based on lifting wavelet transform | |
| CN101631243B (en) | Image encoding/decoding method based on wavelet transformation | |
| CN111669588B (en) | Ultra-high definition video compression coding and decoding method with ultra-low time delay | |
| CN1802667A (en) | Overcomplete basis transform-based motion residual frame coding method and apparatus for video compression | |
| CN102014283A (en) | First-order difference prefix notation coding method for lossless compression of image data | |
| CN110572682A (en) | An Embedded Zerotree Wavelet Image Coding and Compression Method | |
| CN1581977A (en) | Tree-structure-based grade tree aggregation-divided video image compression method | |
| CN103581691A (en) | Efficient and parallelable image coding method oriented to sparse coefficients | |
| CN1685731A (en) | Scalable Video Coding | |
| CN1564604A (en) | Gradation tree set partitioning image compression method based on tree shaped structure | |
| CN102790882A (en) | Encoding method of remote sensing image | |
| CN1284120C (en) | Synthetic aperture radar complex numeric image data real time automatic compression method | |
| CN1255770C (en) | Hierarchy tree set partition image coding decoding method based of digital signal processor | |
| CN1322472C (en) | Quad tree image compressing and decompressing method based on wavelet conversion prediction | |
| CN1179486C (en) | Variable length coding method and variable length decoding method | |
| CN111131834A (en) | Reversible self-encoder, encoding and decoding method, image compression method and device | |
| CN1571514A (en) | An embedded image compression technique based on wavelet transformation | |
| Zhu et al. | An improved SPIHT algorithm based on wavelet coefficient blocks for image coding | |
| CN1666530A (en) | Sub-band video decoding method and device | |
| CN1860793A (en) | 3-d morphological operations with adaptive structuring elements for clustering of significant coefficients within an overcomplete wavelet video coding framework | |
| JP2001298738A (en) | Image encoding method, image encoding device, and medium storing image encoding program | |
| CN1234247C (en) | Image compression coding method using rectangle block filling code word to reduce space redundancy |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
| WD01 | Invention patent application deemed withdrawn after publication |