WO2022109916A1

WO2022109916A1 - Image encoding method and device, image decoding method and device, image processing system, mobile platform, image transmission system and storage medium

Info

Publication number: WO2022109916A1
Application number: PCT/CN2020/131718
Authority: WO
Inventors: 赵文军; 邱孟品
Original assignee: 深圳市大疆创新科技有限公司
Priority date: 2020-11-26
Filing date: 2020-11-26
Publication date: 2022-06-02

Abstract

An image encoding method and device, an image decoding method and device, an image processing system, a mobile platform, an image transmission system and a storage medium. The encoding method comprises: acquiring a plurality of frequency coefficients of an image block to be encoded (S101); obtaining one or more coefficient blocks according to bit planes related to the plurality of frequency coefficients, the coefficient blocks comprising binary bits of at least one frequency coefficient (S102); determining position information of a first non-zero bit plane in each of the coefficient blocks (S103); and encoding the position information of the first non-zero bit plane of each of the coefficient blocks, and encoding bits on each of the bit planes starting from the first non-zero bit plane in each of the coefficient blocks (S104). The encoding method does not need to encode all zero-bit planes in each coefficient block, thereby achieving efficient compression of an image block to be encoded.

Description

Image coding method, decoding method, device, image processing system, movable platform, image transmission system and storage medium

technical field

The present application relates to the technical field of image processing, and in particular, to an image encoding method, a decoding method, an apparatus, an image processing system, a movable platform, an image transmission system, and a storage medium.

Background technique

With the development of technology, high-definition imaging technologies characterized by high resolution, high frame rate, and high bit depth are applied in imaging devices, such as high-definition video cameras or single-lens reflex cameras. However, these high-definition image data will cause a problem of excessive bandwidth loss during storage or transmission. For example, a large amount of high-definition image data generated by these imaging devices is cached in the memory. When these high-definition images need to be read from the memory or written into the memory, a large amount of read and write bandwidth is required, resulting in limited device performance. and increased power consumption. Therefore, it is necessary to provide an image coding method to compress images.

SUMMARY OF THE INVENTION

In view of this, one of the objectives of the present application is to provide an image encoding method, decoding method, apparatus, image processing system, movable platform, image transmission system and storage medium.

In a first aspect, an embodiment of the present application provides an image encoding method, including:

Obtain multiple frequency coefficients of the image block to be encoded;

obtaining one or more blocks of coefficients from the bit planes for the plurality of frequency coefficients; the blocks of coefficients comprising binary bits of at least one frequency coefficient;

determining the position information of the first non-zero bit plane in each of the coefficient blocks;

encoding the position information of the first non-zero bit plane of each of the coefficient blocks, to generate first encoding information; encoding to generate second encoding information;

According to the first encoding information and the second encoding information, an encoded code stream for the to-be-encoded image block is generated.

In a second aspect, an embodiment of the present application provides an image decoding method, including:

obtaining an encoded code stream, where the encoded code stream includes first encoding information and second encoding information;

Decoding the first encoding information to obtain the position information of the first non-zero bit plane of each coefficient block; and decoding the second encoding information to obtain the first non-zero bit plane from the coefficient block in each coefficient block Bits on each bit plane at the beginning of the plane;

According to the position information of the first non-zero bit plane of each coefficient block and the bits on each bit plane starting from the first non-zero bit plane in each of the coefficient blocks, obtain a plurality of frequency coefficients on the bit plane;

The decoded image block is obtained by using the plurality of frequency coefficients.

In a third aspect, an embodiment of the present application provides an image encoding apparatus, including:

processor;

memory for storing processor-executable instructions;

Wherein, the processor invokes the executable instruction, and when the executable instruction is executed, is used to execute:

Obtain multiple frequency coefficients of the image block to be encoded;

According to the first encoding information and the second encoding information, an encoded code stream of the image block to be encoded is generated.

In a fourth aspect, an embodiment of the present application provides an image decoding apparatus, characterized in that it includes:

processor;

memory for storing processor-executable instructions;

In a fifth aspect, an embodiment of the present application provides an image processing system, including an image processing module, an image encoding device, an image decoding device, and a memory;

The image processing module is used to process the image, and transmit the processed image to the image encoding device;

The image encoding device is configured to, after dividing the processed image into a plurality of image blocks to be encoded, perform compression encoding processing on the image blocks to be encoded to generate an encoded code stream;

the memory is used for storing the encoded code stream;

The image decoding device is configured to decode the encoded code stream, and transmit the decoded result to the image processing module.

In a sixth aspect, an embodiment of the present application provides a movable platform, including the above-mentioned image encoding apparatus; or, including the above-mentioned image processing system.

In a seventh aspect, an embodiment of the present application provides an image transmission system, including an image transmitter and an image receiver; wherein the image transmitter includes the above-mentioned image encoding apparatus, and the image receiver includes the above-mentioned image decoding apparatus ;

The image transmitter is configured to send the encoded code stream generated after encoding by the image encoding device to the image receiver;

The image receiver is configured to use the image decoding device to decode the encoded code stream after receiving the encoded code stream.

In an eighth aspect, an embodiment of the present application provides a computer-readable storage medium on which computer instructions are stored, and when the instructions are executed by a processor, implement the method described in the first aspect or the second aspect.

An image encoding method, decoding method, device, image processing system, movable platform, image transmission system, and storage medium provided by the embodiments of the present application convert the encoding of the zero-bit plane into the encoding of the first non-zero-bit plane The encoding of position information eliminates the need to encode all zero bit planes in each coefficient block, realizes efficient compression of the image block to be encoded, improves the compression rate, and also improves the compression efficiency of the image block to be encoded.

Description of drawings

In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative labor.

1 is a schematic diagram of an image processing system provided by an embodiment of the present application;

2 is a schematic structural diagram of an image transmission system provided by an embodiment of the present application;

3 is a schematic flowchart of an image encoding method provided by an embodiment of the present application;

4 is a schematic diagram of different subbands provided by an embodiment of the present application;

5 is a schematic diagram of a mirror-symmetric filling provided by an embodiment of the present application;

6 is a schematic diagram of a front-to-back inversion provided by an embodiment of the present application;

7 is a schematic diagram of a bit plane and a coefficient block provided by an embodiment of the present application;

8 is a schematic diagram of the position of the first non-zero bit plane in the bit plane provided by an embodiment of the present application;

9 is a schematic diagram of a coding unit in a coefficient block provided by an embodiment of the present application;

10 is a schematic diagram of a scanning sequence provided by an embodiment of the present application;

11 is a schematic diagram of a mixed scanning of coding units of YUV components provided by an embodiment of the present application;

12 is a schematic diagram of a second coding mode corresponding to a target coding unit provided by an embodiment of the present application;

13 is a schematic flowchart of an image decoding method provided by an embodiment of the present application;

FIG. 14 is a schematic structural diagram of an image encoding apparatus provided by an embodiment of the present application.

Detailed ways

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of this application.

In view of the problem that high-definition image data may cause excessive bandwidth loss during storage or transmission, the embodiments of the present application provide an image encoding method and an image decoding method. Before storing or transmitting high-definition image data, this The applied image encoding method encodes these high-definition image data to obtain an encoded code stream, and then caches the encoded code stream in the memory or transmits the encoded code stream. Compared with high-definition image data, the amount of data is smaller, which can effectively reduce the loss of read and write bandwidth or transmission bandwidth, and also help to improve the read-write efficiency or transmission efficiency of high-definition image data; When the data is processed, the encoded code stream corresponding to the high-definition image data can be read from the memory or the encoded code stream transmitted by the external device can be received, and then the image decoding method provided by the embodiment of the present application can be used to decode the encoded code stream. The encoded code stream is decoded to obtain a decoded image, and then the decoded image can be processed according to actual needs.

When encoding an image by using the image encoding method provided in this embodiment of the present application, the image may be firstly divided into multiple image blocks to be encoded, and for each image block to be encoded, multiple frequency coefficients of the image block to be encoded are obtained; Regarding the bit planes of the plurality of frequency coefficients, obtain one or more coefficient blocks; the coefficient blocks include binary bits of at least one frequency coefficient, and then determine the position information of the first non-zero bit plane in each of the coefficient blocks , and then encode the position information of the first non-zero bit plane of each of the coefficient blocks to generate the first encoding information; encoding the bits to generate second encoding information; finally, according to the first encoding information and the second encoding information, generate an encoded code stream about the to-be-encoded image block; finally, based on a plurality of to-be-encoded image blocks The encoded code stream of the image can be obtained. In this embodiment, the coding of the zero bit plane is converted into the coding of the position information of the first non-zero bit plane, so that it is not necessary to encode all the zero bit planes in each coefficient block, and efficient compression of the image block to be coded is realized. , and the compression efficiency of the to-be-coded image block is also improved.

The image encoding method provided by the embodiments of the present application can be applied to an image encoding apparatus. In an implementation manner, the image encoding apparatus may be an electronic device with data processing capability, such as a computer, a server, a cloud server or terminal, a movable platform (such as an unmanned aerial vehicle, an unmanned vehicle, or a mobile robot, etc.) Wait. In another implementation manner, the image encoding device may also be a computer chip or integrated circuit with data processing capability, such as a central processing unit (Central Processing Unit, CPU), a digital signal processor (Digital Signal Processor, DSP) , Application Specific Integrated Circuit (ASIC) or off-the-shelf Programmable Gate Array (Field-Programmable Gate Array, FPGA), etc. Wherein, the image coding device as a computer chip or an integrated circuit can be installed on an electronic device, such as a movable platform.

The image decoding method provided by the embodiments of the present application can be applied to an image decoding apparatus. In an implementation manner, the image decoding apparatus may be an electronic device with data processing capability, such as a computer, a server, a cloud server or a terminal, a movable platform (such as an unmanned aerial vehicle, an unmanned vehicle, or a mobile robot, etc.) Wait. In another implementation manner, the image decoding apparatus may also be a computer chip or integrated circuit with data processing capability, such as a central processing unit (Central Processing Unit, CPU), a digital signal processor (Digital Signal Processor, DSP) , Application Specific Integrated Circuit (ASIC) or off-the-shelf Programmable Gate Array (Field-Programmable Gate Array, FPGA), etc. Wherein, the image decoding device as a computer chip or an integrated circuit can be installed on an electronic device, such as a movable platform.

It can be understood that when used as electronic equipment, the image decoding apparatus and the image coding apparatus may be the same electronic equipment or different electronic equipment; when used as computer chips or integrated circuits, the image decoding apparatus and all The image encoding apparatus may be set in the same device, or may be set in different devices, and specific settings may be made according to actual application scenarios.

In an exemplary embodiment, when the image decoding device and the image encoding device are computer chips or integrated circuits, the image decoding device and the image encoding device are set in the same device as an example for description: The image decoding device and the image encoding device can be applied to the image processing system as shown in FIG. 1 , and the image processing system can be installed on a movable platform (such as an unmanned aerial vehicle), a terminal device or a server equipped with images. processing functions on electronic devices.

The image processing system includes an image processing module 11 , the image decoding device 13 , the image encoding device 12 and a memory 14 , and the image processing module 11 passes the image encoding device 12 before writing the image into the memory 14 . The image is encoded and compressed, and the encoded code stream of the image is obtained and stored in the memory 14, so that the bandwidth required for writing to the memory 14 can be reduced; when the image processing module 11 needs to When the image is processed, the coded code stream of the image is read from the memory 14, which can also effectively reduce the reading bandwidth, and the coded code stream of the image is further processed by the image decoding device 13. The image is obtained by decoding processing, and then the image processing module 11 can process the image. In this embodiment, the encoded code stream of the image is stored in the memory 14 instead of the image. , since the data amount of the encoded code stream of the image is smaller than that of the image, the loss of the read and write bandwidth can be reduced, and the read and write efficiency can be improved at the same time.

Wherein, the image processing module 11 includes but is not limited to a graphics processing unit (GPU) or an image signal processor (ISP), etc., and the memory 14 includes but is not limited to DDR memory, static random access memory (SRAM), magnetic memory, Disk or CD etc.

In another exemplary embodiment, when the image decoding device and the image encoding device are computer chips or integrated circuits, the image decoding device and the image encoding device are set in different devices as an example for description : The image decoding device 13 and the image encoding device 12 can be applied to the image transmission system as shown in FIG. 2 , and the image transmission system includes an image transmitter 21 and an image receiver 31 ; wherein, the image transmission system The image encoder 21 is installed with the image encoding device 12, the image receiver 31 is installed with the image decoding device 13; the image transmitter 21 can be installed on a movable platform 20 (such as an unmanned aerial vehicle), so The image receiver 31 can be installed on the terminal device 30 (eg, a remote controller). Before sending the image, the image transmitter 21 can use the image encoding device 12 to compress and encode the image, generate an encoded code stream of the image, and send it to the image receiver 31. After receiving the encoded code stream of the image, the receiver 31 can use the image encoding device 12 to decode the encoded code stream to obtain the image, and then the terminal device can decode the image. The image is further processed or displayed on its own display. In this embodiment, for the encoded code stream of the image to be transmitted, since the data amount of the encoded code stream of the image is smaller than that of the image, the loss of transmission bandwidth can be reduced. , but also help to improve the transmission efficiency.

Next, the image encoding method provided by the embodiment of the present application will be described: please refer to FIG. 3 , which is a schematic flowchart of an image encoding method provided by the embodiment of the present application. The image encoding method is applied to an image encoding device. The methods described include:

In step S101, a plurality of frequency coefficients of the image block to be encoded are acquired.

In step S102, one or more coefficient blocks are obtained according to the bit planes about the plurality of frequency coefficients; the coefficient blocks include binary bits of at least one frequency coefficient.

In step S103, the position information of the first non-zero bit plane in each of the coefficient blocks is determined.

In step S104, the position information of the first non-zero bit plane of each of the coefficient blocks is encoded to generate first encoding information; and, for each of the coefficient blocks starting from the first non-zero bit plane The bits on the plane are encoded to generate second encoded information.

In step S105, an encoded code stream for the to-be-encoded image block is generated according to the first encoding information and the second encoding information.

The to-be-coded image block may be part or all of the to-be-coded image; for example, taking the to-be-coded image block as a part of the to-be-coded image as an example, the image coding apparatus obtains the to-be-coded image After the image is generated, in order to reduce the overhead of a line buffer, the image to be encoded may be divided into a plurality of image blocks to be encoded, and then each image block to be encoded is encoded separately. In addition, after encoding a plurality of image blocks to be encoded to obtain an encoded code stream of the image block to be encoded, in the actual use process, the encoded code stream of one of the image blocks may be obtained according to the actual situation to perform encoding. It is not necessary to decode all encoded code streams corresponding to all image blocks of the entire image, so that the granularity of random access can be maintained.

It can be understood that the present application does not impose any restrictions on the size and quantity of the image blocks to be encoded, and specific settings may be made according to actual application scenarios. In an example, an image may be divided into image blocks with sizes such as 64x4, 64x2, or 32x1, but not limited thereto.

When encoding the image block to be encoded, the image encoding apparatus first transforms the image block to be encoded from the time domain to the frequency domain, so as to obtain a plurality of frequency coefficients of the image block to be encoded.

In some embodiments, the image to be encoded may be wavelet transformed to generate the plurality of frequency coefficients. It can be understood that this embodiment does not impose any restrictions on the wavelet transform algorithm, and can be specifically selected according to actual application scenarios. For example, the wavelet transform algorithm may be 5/3 wavelet transform or Haar wavelet transform, etc., but not limited to. this.

Wavelet transform is briefly introduced here. In the wavelet transform of the image block to be encoded, as shown in FIG. 4 , the image block to be encoded is divided into a high frequency region with high spectral frequency and a low frequency region with low spectral frequency. Efficient compression coding is performed by dividing the data of bands with low spectral frequencies into smaller regions. "L" and "H" represent the low frequency region and the high frequency region, respectively. The numbers before "L" and "H" indicate the division level of each area. For example, the division level of the to-be-coded image block shown in FIG. 4 is 4, that is, it has undergone division and transformation processing four times.

As shown in FIG. 4 , for an image block to be encoded with a size of 64×2, in the first transformation and segmentation process, transformation and segmentation processing is performed on the image block to be encoded in the vertical direction, and the image to be encoded The block partition is transformed into 2 regions (that is, 2 sub-bands (sub-bands for short)) with a size of 32x2, which are the "1L" sub-band (with the most low-frequency components) and the "1H" sub-band (with the most high-frequency components) ), and then perform division processing in the horizontal direction to divide the to-be-encoded image into 4 regions with a size of 32×1, which are “1LL, 1LH, 1HL and 1HH” respectively, wherein the “1HH” subband has the most of high frequency components, the "1LL" subband has the most low frequency components.

Because the image information of the to-be-coded image block is mainly concentrated on the low-frequency components, the low-frequency components are continuously transformed and segmented, and efficient compression coding is performed. As shown in Figure 4, in the second transformation and segmentation process, the "1LL" subband is continuously transformed and segmented in the vertical direction to obtain the "2L" subband and the "2H" subband, with a size of 16x1, Among them, the "2L" subband has the most low frequency components after the second transformation and division. In the third transformation and division process, the "2L" sub-band is continuously transformed and divided in the vertical direction, and the "3L" sub-band and the "3H" sub-band are obtained, and the size is 8x1. After transformation and segmentation, the "3L" subband has the most low frequency components. In the fourth transformation and division process, the "3L" sub-band is continuously transformed and divided in the vertical direction, and the "4L" sub-band and the "4H" sub-band are obtained, and the size is 4x1. After transformation and segmentation, the "4L" subband has the most low frequency components. After 4 transformations and divisions, a total of 7 subbands are generated.

It can be understood that, in other implementation manners, the present application does not impose any restrictions on the division level and division direction, and in each transformation and division process, the transformation and division can be selected in the vertical direction and/or the horizontal direction. , which can be set according to the actual application scenario.

The image block to be processed may be wavelet transformed using a filter bank including high frequency filters and low frequency filters, for example, the image block to be encoded is temporarily stored in a buffer, including high frequency filters and low frequency filters. The filter bank of the filter reads the image block to be encoded from the buffer, performs filtering processing on the read image block to be encoded, so as to generate frequency coefficients in the low-frequency region and high-frequency region, and then stores the frequency coefficients in the buffer. . A filter bank including a high-frequency filter and a low-frequency filter reads the frequency coefficients from the buffer, and uses an analysis filter to filter the read frequency coefficients, thereby generating further The resulting frequency coefficients are then stored on the buffer again. The above process is repeated, and when the division level reaches a predetermined level, a plurality of frequency coefficients on the buffer are acquired for subsequent processing.

In an implementation manner, the 5/3 wavelet transform algorithm or the Haar wavelet transform algorithm may be used to transform the image block to be encoded. During the transformation process of each pixel of the image block to be encoded, each pixel It needs to rely on the neighboring pixels on the left and right sides, and the pixels at the border will lack a part of the neighboring pixels, so that the pixels on the border cannot meet the width requirements of the filter when the wavelet transform is performed. Therefore, in order to process the border pixels, it is necessary to The border is filled with pixels. In one implementation, as shown in Figure 5, each grid represents a pixel. Assuming that there are n pixels, n is a natural number not less than 0, counting from 0, and the number of the first pixel is 0, the number of the last pixel is n-1. When processing the boundary pixel A and the boundary pixel U, a mirror-symmetric filling method can be used to achieve accurate processing of the boundary pixel A and the boundary pixel U, as shown in Figure 5 The gray part of is the filled pixel.

In one embodiment, considering that a part of the coefficients will be lost during the wavelet transform process, as shown in FIG. 4 , the low-frequency coefficients are usually located on the left side, and the high-frequency coefficients are located on the right side, and the high-frequency coefficients on the right side are generally discarded. As a result, there will be errors in the pixels obtained after inverse transformation. Further, considering that in some compression coding processes, in order to reduce the blocking effect, an adaptive filter will be used to perform deblocking filtering, and the deblocking filtering process will use the rear pixels of the previous image block to correct the current image block. The image block is filtered, and the rear pixels of the previous image block have errors because some coefficients are lost in the wavelet transform process. The process of de-blocking filtering makes this error used and amplified, resulting in poor image processing. Therefore, in this embodiment, before the wavelet transform is performed, the to-be-coded image block is flipped back and forth. As shown in FIG. 6 , in the order from left to right, the to-be-coded image block includes pixels A, B, C, After D and E are flipped back and forth, pixels E, D, C, B and A are obtained; then the wavelet transform is performed on the image to be encoded after flipping back and forth, and then in the subsequent deblocking filtering process, due to the The front and back are reversed, so that the pixels of the lost part of the coefficients are on the front side instead of the back side. During the de-blocking filtering process, the acquired pixels on the back side are no longer the pixels whose coefficients have been lost, making the pixels with small error loss. It can be referenced by de-blocking, which is beneficial to improve the quality of image processing.

In order to implement wavelet transform on the image to be encoded after the front and rear flips, please refer to FIG. 5 , suppose that the image block to be encoded has n pixels, n is a natural number not less than 0, counting from 0, the number of the first pixel is 0, the number of the last pixel is n-1, when the image coding apparatus performs wavelet transform on the image block to be coded, it does not start from the 0th image, but starts from the n-1th pixel. Wavelet transform is used to implement wavelet transform on the image to be encoded after being flipped back and forth.

After transforming the image block to be encoded from the time domain to the frequency to generate the plurality of frequency coefficients, the image encoding device performs compression encoding according to the plurality of frequency coefficients. In step S102, the image encoding device First determine the bit plane of the plurality of frequency coefficients, for example, the plurality of frequency coefficients can be expanded on the bit plane, for example, please refer to Fig. 7, Fig. 7 is to expand the 10 frequency coefficients expressed in hexadecimal into place The result of the plane, after expanding multiple frequency coefficients to the bit plane, the ordinate represents the bit depth (where MSB represents the most significant bit, LSB represents the least significant bit), and each column represents the binary bits of one of the frequency coefficients. Next, the image encoding apparatus may obtain one or more coefficient blocks including binary bits of at least one frequency coefficient according to the bit planes with respect to the plurality of frequency coefficients; that is, according to every n( n is an integer greater than 0) the binary bits of the frequency coefficients divide the bit plane, so as to obtain one or more coefficient blocks on the bit plane, for example, please refer to FIG. 7, the binary bits of every 8 frequency coefficients can be divided into bits as a block of coefficients.

Then, in step S103, the image coding apparatus determines the position information of the first non-zero bit plane in each of the coefficient blocks, wherein the zero bit plane refers to the bit planes that are all 0, then the non-zero bit plane Refers to the bit plane that is not all 0 (that is, some bits are 1); the image encoding device can be along the direction of gradually increasing bit depth, please refer to FIG. 7, that is, the direction from bit depth 0 to bit depth 12 (or Say the direction from the bit plane of the most significant bit to the bit plane of the least significant bit), determine the position information of the first non-zero bit plane in each of the coefficient blocks, for example, in Figure 8, Figure 8 shows the size of 61x1 The image block to be encoded is obtained after 4 wavelet transforms, and it is divided into 4L and 4H regions of size 4x1, 3H region of size 8x1, 2H region of size 16x1 and 1H region of size 31x1, assuming that in Figure 8 On the bit plane shown, each coefficient block includes 8 coefficients, and the area of 4L and 4H is 4x1, so the 4L and 4H areas are used as 1 coefficient block, the 3H area is 1 coefficient block, and the 2H area includes 2 coefficient blocks. , the 1H region includes 4 coefficient blocks, where the ordinate represents the bit depth, and the abscissa represents the position information of the first non-zero bit plane in each coefficient block. It can be seen that in the coefficient blocks including the 4L and 4H regions, the The location of the first non-zero bit plane is bit depth 4.

After determining the position information of the first non-zero bit plane in each of the coefficient blocks, the image encoding apparatus may not encode the zero bit plane in each coefficient block, but use the position information of the first non-zero bit plane The position information is used to replace the zero bit plane, and the position information of the first non-zero bit plane of each of the coefficient blocks is encoded to generate the first encoding information; The bits on each bit plane starting from the plane are encoded to generate second encoding information; finally, an encoded code stream about the to-be-encoded image block is generated according to the first encoding information and the second encoding information. In this embodiment, the encoding of the zero bit plane is converted into the encoding of the position information of the first non-zero bit plane, so that it is not necessary to encode all the zero bit planes in each coefficient block. The data amount of the position information of the first non-zero bit plane of each coefficient block is usually less than the data amount of all the zero bit planes of each coefficient block, then the first code generated after encoding the position information of the first non-zero bit plane of each coefficient block The data amount of the information is less, thereby realizing efficient compression of the image block to be encoded, and also improving the compression efficiency of the image block to be encoded.

In some embodiments, the frequency coefficients include both signed and absolute value components. In an implementation manner, the bit plane only includes a bit plane related to the absolute value of the frequency coefficients, for example, after acquiring the plurality of frequency coefficients, the image coding apparatus converts the plurality of frequency coefficients It is decomposed into a sign with respect to the plurality of frequency coefficients and an absolute value with respect to the plurality of frequency coefficients, and then a bit plane with respect to the plurality of frequency coefficients is determined according to the absolute values of the plurality of frequency coefficients. On the other hand, regarding the signs of the plurality of frequency coefficients, the image encoding apparatus encodes the signs of the plurality of frequency coefficients in units of the coefficient blocks, and generates third encoding information. In an example, a coefficient block includes binary bits of absolute values of 8 frequency coefficients, then the coefficient block can be used as a single block, and the symbols of the 8 frequency coefficients included in the coefficient block can be encoded as a whole, The third encoding information is generated, and then an encoded code stream of the image block to be encoded may be generated according to the first encoding information, the second encoding information and the third encoding information.

Further, for the part with the frequency coefficient of 0, considering that the positive and negative conditions of the symbol do not affect the meaning of its representation, in order to improve the compression effect, the symbol of the part with the frequency coefficient of 0 may not be encoded, but only The signs of the frequency coefficients that are not 0 in each coefficient block are encoded to generate the third encoding information, so as to realize efficient compression of the signs of the frequency coefficients.

When encoding the sign of the frequency coefficient, the negative sign may be encoded as 1, and the positive sign may be encoded as 0; or, the negative sign may be encoded as 0, and the positive sign may be encoded as 0. The encoding is 1; but not limited to the above encoding.

In some embodiments, when encoding the position information of the first non-zero bit plane of each of the coefficient blocks, the image encoding device selects a target encoding mode from at least two first encoding modes, and uses the The target encoding mode encodes the position information of the first non-zero bit plane of each of the coefficient blocks. Wherein, when selecting the target encoding mode, the image encoding apparatus may, based on each first encoding mode in the at least two first encoding modes, respectively adjust the first non-zero bit plane of each of the coefficient blocks. The location information is encoded to obtain at least two encoding results, and then the first encoding mode corresponding to the optimal encoding result among the at least two encoding results is determined as the target encoding mode; the optimal encoding result may be If the encoding length of the at least two encoding results is the smallest, the target encoding mode is the optimal one of the at least two first encoding modes, so that the image encoding device uses the target encoding mode to When the position information of the first non-zero bit plane of the coefficient block is encoded, the generated first encoded information has a better compression effect.

After the target encoding mode is determined, in order to ensure that the image decoding apparatus can accurately decode the first encoding information, the target encoding mode needs to be encoded, that is, the encoded code stream of the to-be-encoded image block includes There is encoding information of the target encoding mode, so that the image decoding apparatus can correctly decode the first encoding information according to the target encoding mode.

The at least two first encoding modes include at least one encoding mode using a code table and/or at least one predictive encoding mode. It can be understood that this embodiment does not impose any limitations on the specific implementation of the predictive coding mode, which may be specifically selected according to actual application scenarios. For example, the predictive coding mode may be DCMP (Differential Predictive Coding Modulation).

Wherein, the at least two first encoding modes may be variable length coding (VLC, Variable Length Coding) modes; in the mode of encoding using a code table, the code table may be a VLC code table; in the In the predictive coding mode, the obtained residual data can be subjected to VLC coding. We know that the purpose of encoding is to reduce the amount of data of the image block to be encoded. In the process of variable-length encoding, in order to achieve the optimal encoding effect, the idea of encoding is usually to encode information symbols with high probability into short code words , while the information symbols with small probability are encoded with long code words, so that the average code word length is the shortest, that is, the encoded length of the finally obtained first encoded information is the shortest.

Considering that the image information (the image information includes but not limited to image gradient information, signal-to-noise ratio, local variance or mean square error, etc.) contained in images or image blocks obtained under different acquisition scenarios is not the same, that is It is said that in different acquisition scenarios, the occurrence probability of various image information in the image or image block is not the same. If a unified code table is used to encode it, it may not get a better compression encoding effect, and the compression rate is not high. . Therefore, in order to obtain better compression coding results, it is possible to pre-estimate the possible acquisition scenarios of the image or image block to be coded, and then determine a code table suitable for the scenario according to the acquisition scenario; for example, some reference data in the capture scenario can be obtained Images or reference image blocks, the distribution characteristics of the image information of these reference images or reference image blocks are counted, and then a code table suitable for the collection scene can be determined according to the distribution characteristics of the image information of these reference images or reference image blocks; or, The frequency distribution features can be obtained in the frequency domain according to these reference images or reference image blocks. For example, the frequency distribution features can be obtained according to these reference images or reference image blocks. The position information of the first non-zero bit plane in each coefficient block. distribution characteristics, and then determine a code table suitable for the acquisition scene based on the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block obtained from these reference images or reference image blocks.

That is, the code table may be determined according to the estimated acquisition scene of the image block to be encoded. Further, a reference image block related to the estimated acquisition scene can be obtained, for example, the reference image block is acquired under the estimated acquisition scene, so that the code table can be based on the image of the reference image block Or, the position information of the first non-zero bit plane in each coefficient block can be obtained according to the frequency coefficient of the reference image block, so that the first non-zero bit plane in each coefficient block corresponding to the reference image block can be obtained according to the reference image block. The distribution feature of the position information of the non-zero bit plane determines the code table in the acquisition scene. Wherein, the distribution characteristics of the image information of the reference image block are also different depending on the acquisition scene. Further, the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block are also different. Then the corresponding code tables are also different, so that the position information of the first non-zero bit plane in each coefficient block corresponding to the image block to be encoded can be encoded based on a suitable code table to obtain a first code with a shorter encoding length. information to improve the compression ratio. In this embodiment, the possible acquisition scenarios of the image block to be encoded may be estimated in advance, and then the position of the first non-zero bit plane in each coefficient block corresponding to the image block to be encoded by an appropriate code table may be determined according to the estimated acquisition scenario. The encoding of the information is beneficial to obtain the first encoded information with a shorter encoding length, to realize an efficient compression process, and to improve the compression rate.

In an exemplary embodiment, a simple scene, a general scene, and a complex scene may be determined according to the complexity of the acquisition scene. Exemplarily, the complexity may be determined by the number of types of acquisition objects included in the acquisition scene, such as In a sky image, if it only contains blue sky and white clouds, it can be determined to be a simple scene; if the sky image contains blue sky, white clouds, the sun and birds, it can be determined to belong to a general scene; if the sky image contains blue sky , white clouds, sun, birds, airplanes, mountains, etc., can be determined to belong to complex scenes. Of course, the degree of complexity can also be measured by other factors, for example, the degree of complexity can also be determined according to the amount of image information, the more image information, the more complex the image. In this embodiment, all possible acquisition scenarios of the image or image block to be encoded can be estimated in advance, and then these acquisition scenarios can be divided into simple scenarios, general scenarios and complex scenarios. The suitable code table can be determined according to the characteristics of the simple scene, the general scene and the complex scene respectively; further, it can be determined according to the distribution characteristics of the image information of the reference image obtained under the simple scene, the general scene and the complex scene. The code table of the scene; or, according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the image information of the reference image obtained under the simple scene, the general scene and the complex scene respectively, determine the corresponding scene. stopwatch. Then the mode of using the code table encoding in the first encoding mode can include the mode of using the code table under the simple scene to encode, the mode of using the code table under the general scene to encode, and the code table under the complex scene. encoding mode, so that when encoding the position information of the first non-zero bit plane in each coefficient block corresponding to the image block to be encoded, encoding can be performed based on a suitable code table to obtain first encoding information with a shorter encoding length, To achieve an efficient compression process, it is beneficial to improve the compression rate.

Considering that the distribution characteristics of the image information of different color components of the same image or image block are also different, that is to say, under different color components, the appearance probabilities of various image information in the image or image block are not the same. If the code table is used to encode it, it may not get a better compression encoding effect. Therefore, in order to further improve the compression coding effect, for the position information of the first non-zero bit plane in each of the coefficient blocks corresponding to each color component obtained according to the frequency coefficients of each color component, this embodiment independently performs coding according to different color components , which helps to improve the compression ratio. Specifically, the image encoding device obtains the frequency coefficients corresponding to different color components of the to-be-coded image block, and then decomposes the corresponding frequency coefficients into bit planes according to different color components. Each of the plurality of bit planes includes a plurality of bit planes corresponding to different color components; wherein, at least one of the plurality of bit planes corresponding to the different color components corresponds to the at least two first encoding modes.

It can be understood that the color components can be three color components of YUV, or three color components of RGB, that is, the bit planes corresponding to the plurality of different color components include the bit plane of the Y component, the bit plane of the U component, and the bit plane of the U component. A bit plane of the V component; or, the plurality of bit planes corresponding to the different color components include a bit plane of the R component, a bit plane of the G component, and a bit plane of the B component.

Taking three color components of YUV as an example here, the image encoding device can obtain multiple frequency coefficients of the Y component, multiple frequency coefficients of the U component and multiple frequency coefficients of the V component of the to-be-coded image. a plurality of frequency coefficients of a color component, obtain one or more coefficient blocks of the color component according to the bit planes of the frequency coefficients of the color component, and determine the first non-zero bit plane in each of the coefficient blocks. Position information; wherein, the Y component, the U component and the V component respectively correspond to at least two first encoding modes, and the at least two first encoding modes include at least one mode encoded using a code table and/or at least one In the predictive coding mode, the code table is determined according to the estimated acquisition scene of the image block to be encoded. In the same acquisition scene or in different acquisition scenes, the code table corresponding to different color components may also be different. Then, the image encoding device selects the target encoding mode of this color component from the at least two first encoding modes corresponding to each color component, and uses the target encoding mode of this color component to describe each of the color components The position information of the first non-zero bit plane of the coefficient block is encoded, thereby generating the first encoding information of the Y component, the first encoding information of the U component, and the first encoding information of the V component; this embodiment determines that it is suitable for different color components. to improve the compression rate of the image block to be encoded.

Further, the plurality of frequency coefficients are generated by transforming the image block to be encoded from the time domain to the frequency domain, and during the transformation process, the image block to be encoded will be divided into a plurality of different subbands , in different subbands, the distribution characteristics of its frequency coefficients are also different, or in other words, there may be one or more coefficient blocks in different subbands, and in different subbands, the first non-linear coefficient of the coefficient block The distribution characteristics of the position information of the zero plane are also different. Therefore, in order to obtain a better encoding and compression effect, each subband may be set to correspond to the at least two different first encoding modes, so as to be suitable for different subbands. In other words, the plurality of frequency coefficients include a first frequency coefficient and a second frequency coefficient, and the first frequency coefficient and the second frequency coefficient respectively belong to different subbands in the frequency domain; for example, the The first frequency coefficient belongs to the coefficient of the first frequency band, the second frequency coefficient belongs to the coefficient of the second frequency band, and the first frequency band and the second frequency band are one of high frequency or low frequency; the first frequency coefficient and the Each of the second frequency coefficients corresponds to the at least two first coding modes. In the encoding mode using the code table, different sub-bands correspond to different code tables. This embodiment improves the compression rate of the image block to be encoded by setting code tables suitable for different subbands.

The image encoding device transforms the to-be-encoded image from the time domain to the frequency domain, and obtains multiple frequency coefficients belonging to different subbands, wherein the different subbands all correspond to the at least two first encoding modes, so the The at least two first encoding modes include at least one encoding mode using a code table and/or at least one predictive encoding mode, the code table is determined according to the estimated acquisition scene of the image block to be encoded, and in the same acquisition scene Or in different acquisition scenarios, the code tables corresponding to different sub-bands may also be different; then the image encoding device obtains one or more coefficient blocks according to the bit planes related to multiple frequency coefficients, and determines the first non-linear coefficient in each of the coefficient blocks. The position information of the zero bit plane; wherein, the coefficient block includes binary bits of at least one frequency coefficient, and the image coding device can determine the corresponding at least two corresponding subbands according to the frequency coefficient included in the coefficient block. A first encoding mode, a target encoding mode is selected from the at least two first encoding modes, and the position information of the first non-zero bit plane of the coefficient block is encoded using the target encoding mode. In this embodiment, considering that the distribution characteristics of frequency coefficients in different subbands are different, determining a code table suitable for different subbands is conducive to optimizing the compression coding effect and further improving the compression rate.

In an exemplary embodiment, it is assumed that three types of code tables are set according to a simple scene, a general scene and a complex scene. There are three types of code tables for scene and complex scene. Further, for the multiple frequency coefficients of each color component, they belong to different subbands, and each subband corresponds to three types of codes: simple scene, general scene and complex scene. surface. It is described in YUV444 format (each Y component corresponds to a set of UV components). The frequency coefficients of the Y component belong to 4 different subbands, and each subband has 3 code tables (corresponding to simple scenes, general scenes and complex scenes respectively) , then there are 12 code tables. Similarly, the frequency coefficients of the U component belong to 4 different sub-bands. Each sub-band has 3 code tables and 12 code tables. Similarly, the V component also has 12 code tables. There are 36 code tables, and a suitable code table can be selected for encoding according to the actual application scenario to obtain the first encoding information with the shortest encoding length, so as to achieve the optimal encoding effect and achieve the maximum compression rate.

In addition to acquiring the first encoding information, the image encoding apparatus further encodes the bits on the respective bit planes starting from the first non-zero bit plane in the respective coefficient blocks to generate the second encoding information, where During encoding, the image encoding apparatus encodes all bits of the same bit plane starting from a non-zero bit plane in the coefficient block as one encoding unit. As an example, please refer to FIG. 9, the non-zero bit plane in the coefficient block is the plane where the bit depth 0 is located, then all the bits of the bit plane where the bit depth 0 is located in the coefficient block is used as a coding unit, the coefficient All bits of the bit plane where the bit depth 1 is located in the block is regarded as a coding unit, and so on, the coefficient block has a total of 13 coding units.

The size of the encoded code stream of the image block to be encoded is determined by the code rate, and the size of the code stream is proportional to the code rate. Therefore, the code rate corresponding to the image block to be encoded can be determined according to actual needs, For example, you can choose to configure the bit rate according to the lossless encoding method, or you can configure the bit rate according to the lossy encoding method, and perform specific configuration according to the actual application scenario. When the image coding apparatus encodes the coding units in the coefficient block, firstly, according to the target coding length indicated by the predetermined code rate, the target coding unit may be determined from the coding units according to a predetermined scanning order; The encoding unit indicates the object to be encoded in the encoding unit, and then the image encoding apparatus encodes the target encoding unit to generate the second encoding information.

Wherein, when determining the target coding unit, the image coding apparatus may sequentially determine the coding length of the coding unit according to a predetermined scanning order, and then according to the target coding length and the coding length of the coding unit, from the The number of target coding units is determined in the coding unit, that is, the image coding device can scan each coding unit in sequence according to a predetermined scan and accumulate the total coding length of all coding units currently scanned. When the total coding length reaches During the target coding length indicated by the predetermined code rate, stop scanning the remaining coding units, determine that the scanned coding unit is the target coding unit, and record the position of the last target coding unit, the last target coding unit The position is the truncation position, and the image encoding apparatus also needs to encode the position information of the last target coding unit, so that when the image decoding apparatus performs decoding, the image decoding apparatus can The second encoding information is obtained from the encoded code stream according to the location information, so as to correctly decode the second encoding information.

Among them, it can be understood that this embodiment does not impose any restrictions on the specific scanning order, and specific settings can be made according to actual application scenarios, please refer to FIG. 10 , please refer to the direction of the arrow in FIG. The predetermined scanning order includes at least the order from low frequency to high frequency and/or the order from the most significant bit bit plane to the least significant bit bit plane.

In one embodiment, the image encoding device acquires the frequency coefficients of different color components of the to-be-coded image block, and then expands the frequency coefficients of each color component to a bit plane, that is, the number of the bit planes is large. A plurality of the bit planes include a plurality of bit planes corresponding to different color components, and then one or more coefficient blocks of each color component are obtained on the bit planes corresponding to each different color component, and then the corresponding coefficient blocks of the different color components are obtained. Coding units in each coefficient block; in order to implement fine rate control on the to-be-coded image block based on a predetermined code rate, the predetermined scan order instructs to mix a plurality of coding units corresponding to the different color components The order of scanning, so that different color components can be combined for fine rate control.

In an example, taking YUV as an example to illustrate, for example, the Y component corresponds to 2 coding units a1, a2, the U component corresponds to 2 coding units b1, b2, and the V component corresponds to 2 coding units c1, c2; The predetermined scanning sequence may be a1→b1→c1→a2→b2→c2, which realizes the mixed scanning of YUV components, thereby ensuring the uniformity of the obtained YUV coding units, and making the image quality of the decompressed image blocks clear and natural. In another example, please refer to FIG. 11, taking YUV422 (meaning that every 2 Y components corresponds to 1 set of UV components) as an example, in the bit plane (or called bit plane) of the YUV components in the figure, each The space represents a coding unit, and the number on each coding unit represents its scanning order. In this embodiment, the coding units of the YUV components are mixed and scanned, so as to realize the joint and fine rate control of different color components, so as to ensure the obtained The uniformity of the coding unit of YUV makes the image quality of the decompressed image block clear and natural. It can be understood that the scanning sequence shown in FIG. 11 is only an example, but not limited to this. The gray part in the figure does not require coding. part.

When coding the target coding unit to generate the second coding information, the image coding apparatus may code the target coding unit according to a second coding mode corresponding to the target coding unit. The second encoding mode includes at least a fixed-length encoding mode and/or at least two variable-length encoding modes. The variable-length coding mode includes, but is not limited to, Huffman coding mode or Shannon coding mode.

The second coding mode corresponding to the target coding unit may be determined according to the distribution characteristics of the target coding unit; or, the second coding mode corresponding to the target coding unit may be determined according to the location of the target coding unit. Sure. The position where the target coding unit is located includes at least one of the following: the bit depth of the bit plane corresponding to the target coding unit or the subband to which the target coding unit belongs. In this embodiment, determining a suitable second encoding mode according to the distribution characteristics of the target unit or its location helps to optimize the encoding effect of the second encoding information and improve the compression rate.

In an implementation manner, the different distribution characteristics of the target coding indicate that the probabilities of various information symbols appearing in different target coding units may be different. Therefore, in order to achieve a better coding effect, the image coding apparatus may estimate in advance A possible collection scene of the image block to be encoded, then obtain a reference image related to the collection scene and the second encoding mode used in the compression encoding process, determine the target encoding unit of the reference image, and then according to the The distribution feature of the target coding unit of the image and the second coding mode used by it are referred to, and the mapping relationship between the distribution feature and the second coding mode is obtained. Then, in the process of encoding the graphic block to be encoded, the second encoding mode corresponding to the target encoding unit can be obtained from the mapping relationship between the pre-stored distribution characteristic and the second encoding mode according to the distribution characteristic of the target encoding unit.

Further, in order to facilitate the decoding process of the image decoding apparatus, the second encoding mode corresponding to the target coding unit may also be encoded to generate the encoded code stream; or the image decoding apparatus has pre-determined the the second coding mode corresponding to the target coding unit, it is not necessary to encode the second coding mode corresponding to the target coding unit.

As an example, the mapping relationship between the distribution feature and the second encoding mode can be obtained through machine learning, the distribution feature of the target encoding unit of the reference image and the second encoding mode used in the reference image can be used as training samples, and the The distribution feature of the target coding unit of the reference image is input into a preset model to obtain a prediction result, and then the parameters of the preset model are adjusted according to the difference between the prediction result and the second encoding mode used by the reference image, Further, the mapping relationship between the distribution feature and the second encoding mode is obtained through the adjusted model.

As an example, the distribution characteristics of target coding units of a large number of reference images and the second coding mode used therefor may be obtained, and then statistical analysis is performed in a clustering manner to obtain the mapping relationship between the distribution characteristics and the second coding mode.

In another implementation manner, considering that the target coding units are located at different positions, the distribution characteristics of the target coding are different, so that the probabilities of various information symbols in different target coding units may be different. Therefore, in order to achieve better It is possible to pre-estimate the possible acquisition scene of the image block to be encoded, and then acquire the reference image related to the acquisition scene and the second encoding mode used in the compression encoding process, and determine the target of the reference image. The coding unit then obtains the mapping relationship between the position of the target coding unit and the second coding mode according to the position of the target coding unit of the reference image and the second coding mode used. Then in the process of encoding the to-be-encoded graphic block, the second encoding mode corresponding to the target encoding unit may be based on the position of the target encoding unit from the position of the pre-stored target encoding unit and the second encoding mode. Find it in the mapping relationship. Wherein, the position where the target coding unit is located includes at least any one of the following: the bit depth of the bit plane corresponding to the target coding unit or the subband to which the target coding unit belongs.

As an example, the mapping relationship between the position of the target coding unit and the second coding mode may be obtained through machine learning, and the position of the target coding unit of the reference image and the second coding mode used may be used as training samples, input the position of the target coding unit of the reference image into a preset model, obtain a prediction result, and then adjust the parameters of the preset model, and then obtain the mapping relationship between the position of the target coding unit and the second coding mode through the adjusted model.

As an example, the position of the target coding unit in a large number of reference images and the second coding mode used by it may be obtained, and then statistical analysis is performed through a clustering method, so as to obtain the position of the target coding unit and the second coding mode. Schema mapping relationship.

Exemplarily, referring to FIG. 12 , for example, a possible acquisition scene of the image block to be encoded may be estimated in advance, and then the target coding units of multiple reference images related to the acquisition scene may be acquired, and then according to the target coding unit of the target coding unit The distribution characteristics determine at least three second encoding modes, and the second encoding modes include at least fixed-length encoding modes and/or at least two variable-length encoding modes, that is, different distribution characteristics of the target coding unit correspond to different second encoding modes mode, for example, the distribution characteristics of the target coding unit can be divided into three types, for example, the distribution characteristics of the target coding unit are divided into the first type of distribution characteristics, the second type of distribution characteristics and the third type of distribution characteristics, respectively in For the first area, second area and third area of the bit plane, it is assumed that the second encoding mode corresponding to the first area (ie, the first type of distribution feature) is preset as the first variable-length encoding mode, and the second area (ie, the second The second encoding mode corresponding to the class distribution feature) is the second variable-length encoding mode, and the second encoding mode corresponding to the third region (the third class distribution) is the fixed-length encoding mode; when encoding the target coding unit, Determine the position where the target coding unit is located, determine the second coding mode corresponding to the target coding unit based on the region to which the target coding unit is located, and determine the second coding mode corresponding to the target coding unit according to the second coding unit corresponding to the target coding unit. mode encodes the target coding unit. In this embodiment, determining a suitable code table according to the distribution characteristics of the target coding unit is beneficial to obtain the second coding information with a shorter coding length and improve the compression rate.

Each of the at least two variable-length encoding modes includes a code table. Considering that the data volume of the coding unit is larger, when a code table is used to encode it, the more the table entries of the code table to be searched are. many, the coding efficiency is low, in order to further optimize the coding effect and improve the coding efficiency, each of the at least two variable-length coding modes includes at least two code tables, when the second coding mode corresponding to the target coding unit In the case of the variable-length encoding mode, the at least two code tables are used to encode different bits of the target encoding unit respectively, thereby helping to speed up the encoding speed and encoding efficiency, as well as optimizing the encoding effect and improving the compression rate. . For example, the target coding unit is an 8-bit character string. If it is set as a code table to encode it, the code table may be set with 2 ⁸ =256 entries, and it takes a long time to search the code table. , so in this embodiment, it can be divided into two 4-bit character strings, correspondingly set with 2 code tables, use one of the code tables to encode the 4-bit characters, and use the other code table to encode the other 4-bit characters, then Each code table may only need to set 2 ⁴ =16 entries, and the time for searching the code table is significantly reduced, which is conducive to speeding up the encoding speed and encoding efficiency, and is also conducive to optimizing the encoding effect and improving the compression rate.

It can be understood that this application does not impose any restrictions on the number of code tables in the variable-length encoding mode, which can be determined according to the data volume of the encoding unit, such as setting a code table corresponding to every 4 bits, and the code The table may be determined according to distribution characteristics or locations of the coding units.

In one embodiment, if the encoded length of the remaining bits of the target coding unit cannot meet the coding length indicated by the predetermined code rate, then preset data is added according to the coding length of the target coding unit, thereby ensuring that Finally, the encoding length of the encoded code stream of the image block to be encoded can conform to the encoding length indicated by the predetermined code rate, thereby enabling the image decoding apparatus to perform correct decoding. In an example, for example, the length of the remaining bits of the target coding unit after being coded is 5 bits, and currently 10 bits may be required to meet the coding length indicated by the predetermined code rate, then the target coding unit needs to be coded according to the target coding length. The encoded length of the remaining bits of the unit needs to be determined by adding 5 (5=10-5) bits of preset data to satisfy the encoding length requirement indicated by the predetermined code rate, thereby enabling the image decoding apparatus to perform correct decoding.

Wherein, if the data length (or the data amount) of the preset data to be added is greater than the preset length (or the preset data amount), the last target coding unit for encoding using the at least two variable-length encoding methods The fixed-length encoding method is used instead for encoding, and after the encoding is completed, the data length of the preset data to be added is re-determined, thereby helping to reduce the complexity of hardware implementation. The preset length may be the length of one coding unit.

In order to speed up the decoding speed, the image decoding apparatus can usually perform decoding from two ends synchronously. In order to support the decoding process of the image decoding apparatus, in the encoded code stream about the image block to be encoded, all The first encoding information is located at a first position in the encoded code stream, the second encoding information is located at a second position in the encoded code stream, the first position and the second position for enabling the decoding end to synchronously decode the encoded code stream from the first position and the second position.

As mentioned above, the plurality of frequency coefficients are decomposed into symbols and absolute values for encoding respectively, in addition to the first encoding information and the second encoding information, the third encoding information obtained by encoding the symbols is also included. In the encoded code stream about the image block to be encoded, the first encoding information is located in a first position in the encoded code stream, and the second encoding information and the third encoding information are located in the second position in the encoded code stream, the first position and the second position are used to enable the decoding end to synchronously decode the encoded code stream from the first position and the second position .

Wherein, one of the first position and the second position is the head position of the encoded code stream, and the other is the tail position of the encoded code stream.

In addition to the first encoding information, the second encoding information and the third encoding information, the target encoding mode and the position information of the last target encoding unit are also encoded, and these encoded information can be placed in the encoded The third position of the code stream, for example, the third position is the head position of the encoded code stream, and one of the first position and the second position is the position after the third position , and the other is the tail position of the encoded code stream.

Correspondingly, referring to FIG. 13 , an embodiment of the present application further provides an image decoding method, and the method includes:

In step S201, an encoded code stream is obtained, where the encoded code stream includes first encoding information and second encoding information.

In step S202, the first encoding information is decoded to obtain the position information of the first non-zero bit plane of each coefficient block; and the second encoding information is decoded to obtain the Bits on each bit plane starting with the first non-zero bit plane.

In step S203, according to the position information of the first non-zero bit plane of each coefficient block and the bits on each bit plane starting from the first non-zero bit plane in each coefficient block, obtain multiple frequency factor.

In step S204, a decoded image block is obtained through the plurality of frequency coefficients.

In an embodiment, the encoded code stream includes a target encoding mode corresponding to the generation of the first encoding information.

The decoding the first encoding information includes: decoding the first encoding information according to the target encoding mode.

In an embodiment, the target encoding mode is one of the following: at least one mode using code table encoding and/or at least one predictive encoding mode.

In an embodiment, in the mode of encoding using a code table, the code table is determined according to an estimated acquisition scene of the decoded image block; or the code table is determined according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block.

In one embodiment, the code tables corresponding to different acquisition scenarios are different, or, the code tables corresponding to different distribution characteristics of the image information of the reference image block are different; or, the first one of the coefficient blocks corresponding to the reference image block is The code tables corresponding to different distribution characteristics of the position information of the non-zero bit plane are different.

In an embodiment, the plurality of frequency coefficients are generated after transforming the image block to be encoded from the time domain to the frequency domain; wherein the plurality of frequency coefficients include a first frequency coefficient and a second frequency coefficient, And the first frequency coefficient and the second frequency coefficient respectively belong to different subbands in the frequency domain.

Both the first frequency coefficient and the second frequency coefficient correspond to the target coding mode; in other words, each of the different subbands corresponds to the target coding mode.

In one embodiment, if the target encoding mode is the mode using the code table encoding, different subbands correspond to different code tables.

In an embodiment, the number of the bit planes is multiple, and the multiple bit planes include multiple bit planes corresponding to different color components; wherein, at least one of the multiple bit planes corresponding to the different color components Corresponding to the target encoding mode.

In an embodiment, a plurality of bit planes corresponding to the different color components include a bit plane of the Y component, a bit plane of the U component, and a bit plane of the V component; or, the plurality of bit planes corresponding to the different color components include: The bit plane of the R component, the bit plane of the G component, and the bit plane of the B component.

In an embodiment, in the encoding process, in the coefficient block, starting from the first non-zero bit plane, all bits of the same bit plane in the coefficient block are taken as one coding unit.

The encoded code stream further includes position information of the last target coding unit; the target coding unit indicates the object to be encoded in the coding unit.

Before the decoding the second encoding information, the method includes: determining and acquiring the second encoding information from the encoded code stream according to the position information of the last target coding unit. It can be said that the position information of the last target coding unit plays a role in positioning, so that the second coding information can be determined from the coded code stream.

In an embodiment, the target coding unit is determined by the coding end after scanning the coding unit according to a predetermined scanning order.

In an embodiment, the number of the bit planes is multiple, and the multiple bit planes include a plurality of bit planes corresponding to different color components; The order in which coding units are mixed-encoded.

In one embodiment, the predetermined scanning order includes at least an order from low frequency to high frequency and/or an order from the most significant bit-plane to the least significant bit-plane.

In an embodiment, the decoding the second encoding information further includes: decoding the second encoding information according to a second encoding mode corresponding to the second encoding information to obtain a target encoding unit.

Wherein, the encoded code stream includes the second encoding mode corresponding to the second encoding information; or, the image decoding apparatus obtains the second encoding mode corresponding to the second encoding information in advance.

In an embodiment, during the encoding process, the second encoding mode is determined according to the distribution characteristics of the target coding unit; or, the second encoding mode is determined according to the position where the target coding unit is located.

In one embodiment, during the encoding process, the second encoding mode is determined according to the distribution characteristics of the target coding unit and the mapping relationship between the pre-stored distribution characteristics and the second encoding mode.

Alternatively, the second coding mode is determined according to the location of the target coding unit and a pre-stored mapping relationship between the location of the target coding unit and the second coding mode.

In an embodiment, the location where the target coding unit is located includes at least one of the following: a bit depth of a bit plane corresponding to the target coding unit or a subband to which the target coding unit belongs.

In an embodiment, the pre-stored mapping relationship between the distribution feature and the second encoding mode or the pre-stored mapping relationship between the location of the target coding unit and the second encoding mode is obtained based on machine learning.

In an embodiment, the second encoding mode includes at least a fixed-length encoding mode and/or at least two variable-length encoding modes.

In one embodiment, the variable length coding mode includes at least two code tables.

The decoding of the second encoding information further includes: when the second encoding mode is a variable-length encoding mode, using the at least two code tables to respectively decode different code words in the second encoding information to decode.

In an embodiment, the encoded code stream further includes third encoding information.

The method further includes: decoding the third encoded information to obtain the sign of the frequency coefficient.

In an embodiment, the method further includes: acquiring the first encoding information from a first position of the encoded code stream, and acquiring the second encoding information from a second position of the encoded code stream encoding information.

The decoding of the first encoded information and the decoding of the second encoded information include: synchronously decoding the first encoded information and the second encoded information.

In an embodiment, the method further includes: acquiring the first encoding information from a first position of the encoded code stream, and acquiring the second encoding information from a second position of the encoded code stream Encoded information or third stream.

The decoding of the first encoded information and the decoding of the second encoded information include: synchronously decoding the first encoded information and the second encoded information; or,

The decoding of the first encoded information and the decoding of the third encoded information include: synchronous decoding of the first encoded information and the third encoded information.

In one embodiment, one of the first position and the second position is the head position of the encoded code stream, and the other is the tail position of the encoded code stream.

In one embodiment, the decoded image is obtained by performing inverse wavelet transform on the plurality of frequency coefficients.

In one embodiment, the decoded image is obtained by performing inverse wavelet transform on the plurality of frequency coefficients, and then performing front-to-back inversion.

Correspondingly, referring to FIG. 14 , an embodiment of the present application further provides an image encoding apparatus 12, including:

processor 121;

memory 122 for storing instructions executable by processor 121;

Obtain multiple frequency coefficients of the image block to be encoded;

The various embodiments described herein can be implemented using computer readable media such as computer software, hardware, or any combination thereof. For hardware implementation, the embodiments described herein can be implemented using application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays ( FPGA), processors, controllers, microcontrollers, microprocessors, electronic units designed to perform the functions described herein are implemented. For software implementation, embodiments such as procedures or functions may be implemented with separate software modules that allow the performance of at least one function or operation. The software codes may be implemented by a software application (or program) written in any suitable programming language, which may be stored in memory and executed by a controller.

Those skilled in the art can understand that FIG. 14 is only an example of the image coding apparatus 12, and does not constitute a limitation to the image coding apparatus 12, and may include more or less components than the one shown, or combine some components, or different Components, such as devices, may also include input and output devices, network access devices, buses, and the like.

In one embodiment, the processor 121 is further configured to: select a target encoding mode from at least two first encoding modes, and use the target encoding mode to perform a calculation on the first non-zero bit plane of each of the coefficient blocks. encoding the position information; and, the generating an encoded code stream for the image block to be encoded further includes: encoding the target encoding mode.

In one embodiment, the at least two first encoding modes include at least one encoding mode using a code table and/or at least one predictive encoding mode.

In one embodiment, the code table is determined according to the estimated acquisition scene of the image block to be encoded; or, the code table is determined according to the distribution characteristics of the image information of the reference image block related to the estimated acquisition scene or, the code table is determined according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block.

In an embodiment, the plurality of frequency coefficients are generated after transforming the image block to be encoded from the time domain to the frequency domain; wherein the plurality of frequency coefficients include a first frequency coefficient and a second frequency coefficient, and the first frequency coefficient and the second frequency coefficient belong to different subbands in the frequency domain respectively; the first frequency coefficient and the second frequency coefficient both correspond to the at least two first coding modes .

In one embodiment, in the encoding mode using the code table, different sub-bands correspond to different code tables.

In an embodiment, the number of the bit planes is multiple, and the multiple bit planes include multiple bit planes corresponding to different color components; wherein, at least one of the multiple bit planes corresponding to the different color components The at least two first encoding modes correspond to each other.

In an embodiment, the processor 121 is further configured to: based on each of the at least two first encoding modes, respectively, for the position information of the first non-zero bit plane of each of the coefficient blocks Encoding is performed to obtain at least two encoding results; the first encoding mode corresponding to the optimal encoding result in the at least two encoding results is determined as the target encoding mode

In an embodiment, in the coefficient block, starting from the first non-zero bit plane, all bits of the same bit plane in the coefficient block are regarded as one coding unit.

The processor 121 is further configured to: determine a target coding unit from the coding units according to a predetermined scanning order according to a target coding length indicated by a predetermined code rate; the target coding unit indicates that the coding unit needs to be coded encoding the target coding unit to generate the second coding information; and coding the position information of the last target coding unit.

In one embodiment, the processor 121 is further configured to: sequentially determine the encoding length of the encoding unit according to a predetermined scanning order; Determines the number of target coding units in .

In an embodiment, the number of the bit planes is multiple, and the multiple bit planes include a plurality of bit planes corresponding to different color components; The order in which the coding units are scanned for blending.

In an embodiment, the processor 121 is further configured to: encode the target coding unit according to the second coding mode corresponding to the target coding unit.

In one embodiment, the second coding mode corresponding to the target coding unit is determined according to the distribution characteristics of the target coding unit; or, the second coding mode corresponding to the target coding unit is determined according to the location of the target coding unit. determined by the location.

In one embodiment, the second coding mode corresponding to the target coding unit is determined according to the distribution feature of the target coding unit and the mapping relationship between the pre-stored distribution feature and the second coding mode; The second encoding mode is determined according to the location of the target encoding unit and the pre-stored mapping relationship between the location of the target encoding unit and the second encoding mode.

In an embodiment, each of the at least two variable-length encoding modes includes at least two code tables; the processor 121 is further configured to: when the second encoding mode corresponding to the target encoding unit is: In the variable-length coding mode, the at least two code tables are used to encode different bits of the target coding unit respectively.

In an embodiment, the processor 121 is further configured to: if the encoded length of the remaining bits of the target coding unit cannot meet the coding length indicated by the predetermined code rate, then according to the encoding length of the target coding unit Preset data is added to the encoding length.

In an embodiment, the processor 121 is further configured to: decompose the plurality of frequency coefficients into signs of the plurality of frequency coefficients and absolute values of the plurality of frequency coefficients;

determining a bit plane with respect to the plurality of frequency coefficients according to the absolute values of the plurality of frequency coefficients;

Using the coefficient block as a unit, the symbols of the plurality of frequency coefficients are encoded to generate third encoded information.

In an embodiment, the processor 121 is further configured to: encode the sign of the frequency coefficient that is not 0.

In an embodiment, in the encoded code stream about the image block to be encoded, the first encoding information is located at a first position in the encoded code stream, and the second encoding information is located at a first position in the encoded code stream. at a second position in the encoded codestream.

The first position and the second position are used for the decoding end to synchronously decode the encoded code stream from the first position and the second position.

In an embodiment, in the encoded code stream about the image block to be encoded, the first encoding information is located at a first position in the encoded code stream, and the second encoding information is located at a first position in the encoded code stream. and the third encoding information is located at a second position in the encoded code stream.

In an embodiment, the plurality of frequency coefficients are generated based on wavelet transform of the image block to be encoded.

In an embodiment, the plurality of frequency coefficients are generated based on wavelet transform of the image to be encoded after being flipped back and forth.

Correspondingly, the embodiments of the present application also provide an image decoding apparatus, including:

processor;

memory for storing processor-executable instructions;

The processor is further configured to: decode the first encoding information according to the target encoding mode.

In an embodiment, the plurality of frequency coefficients are generated after transforming the image block to be encoded from the time domain to the frequency domain; wherein the plurality of frequency coefficients include a first frequency coefficient and a second frequency coefficient, In addition, the first frequency coefficient and the second frequency coefficient belong to different subbands in the frequency domain, respectively; the first frequency coefficient and the second frequency coefficient both correspond to the target coding mode.

The processor is further configured to: obtain the second encoding information from the encoded code stream according to the position information of the last target encoding unit.

In an embodiment, the processor is further configured to: decode the second encoding information according to a second encoding mode corresponding to the second encoding information to obtain a target encoding unit.

In one embodiment, during the encoding process, the second encoding mode is determined according to the distribution characteristics of the target coding unit and the mapping relationship between the pre-stored distribution characteristics and the second encoding mode; or, the second encoding mode It is determined according to the position of the target coding unit and the pre-stored mapping relationship between the position of the target coding unit and the second coding mode.

In an embodiment, the position where the target coding unit is located includes at least any one of the following: the bit depth of the bit plane corresponding to the target coding unit or the subband to which the target coding unit belongs.

The processor is further configured to: when the second encoding mode is a variable-length encoding mode, use the at least two code tables to respectively decode different code words in the second encoding information.

The processor is further configured to: decode the third encoded information to obtain the sign of the frequency coefficient.

In an embodiment, the processor is further configured to: obtain the first encoding information from a first position of the encoded code stream, and obtain the first encoding information from a second position of the encoded code stream second encoding information; synchronously decoding the first encoding information and the second encoding information.

In an embodiment, the processor is further configured to: obtain the first encoding information from a first position of the encoded code stream, and obtain the first encoding information from a second position of the encoded code stream the second encoded information or the third code stream;

Perform synchronous decoding on the first encoded information and the second encoded information; or perform synchronous decoding on the first encoded information and the third encoded information.

Correspondingly, referring to FIG. 1 , an embodiment of the present application further provides an image processing system, including an image processing module, the above-mentioned image encoding apparatus, the above-mentioned image decoding apparatus, and a memory;

the memory is used for storing the encoded code stream;

Correspondingly, an embodiment of the present application further provides a movable platform, including the above-mentioned image encoding apparatus; or, including the above-mentioned image processing system.

Wherein, the movable platform includes at least an unmanned aerial vehicle, an unmanned vehicle or a mobile robot.

Correspondingly, referring to FIG. 2 , an embodiment of the present application further provides an image transmission system, including an image transmitter and an image receiver; wherein the image transmitter includes the above-mentioned image encoding apparatus, and the image receiver includes The above-mentioned image decoding device;

For details of the implementation process of the functions and functions of the above-mentioned modules, please refer to the implementation process of the corresponding steps in the above method, which will not be repeated here.

In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium, such as a memory including instructions, executable by a processor of an apparatus to perform the above-described method. For example, the non-transitory computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.

A non-transitory computer-readable storage medium on which computer instructions are stored, when the instructions in the storage medium are executed by a processor of an image encoding device, the above-mentioned image encoding method can be executed; or, when the instructions in the storage medium are executed by When executed by the processor of the image decoding apparatus, the above-described image decoding method can be executed.

It should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any relationship between these entities or operations. any such actual relationship or sequence exists. The terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion such that a process, method, article or device comprising a list of elements includes not only those elements, but also other not expressly listed elements, or also include elements inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.

The methods and devices provided by the embodiments of the present application have been introduced in detail above, and specific examples are used to illustrate the principles and implementations of the present application. At the same time, for those of ordinary skill in the art, according to the idea of the application, there will be changes in the specific implementation and application scope. In summary, the content of this specification should not be construed as a limitation to the application. .

Claims

An image coding method, comprising:

Obtain multiple frequency coefficients of the image block to be encoded;

obtaining one or more blocks of coefficients from the bit planes for the plurality of frequency coefficients; the blocks of coefficients comprising binary bits of at least one frequency coefficient;

determining the position information of the first non-zero bit plane in each of the coefficient blocks;

encoding the position information of the first non-zero bit plane of each of the coefficient blocks, to generate first encoding information; encoding to generate second encoding information;

According to the first encoding information and the second encoding information, an encoded code stream of the image block to be encoded is generated.
The method according to claim 1, wherein the encoding the position information of the first non-zero bit plane of each of the coefficient blocks comprises:

selecting a target encoding mode from at least two first encoding modes, and encoding the position information of the first non-zero bit plane of each of the coefficient blocks using the target encoding mode; and,

The generating an encoded code stream for the image block to be encoded further includes: encoding the target encoding mode.
The method according to claim 2, wherein the at least two first encoding modes include at least one mode using code table encoding and/or at least one predictive encoding mode.
The method according to claim 3, wherein the code table is determined according to an estimated acquisition scene of the image block to be encoded; or, the code table is determined according to a reference image related to the estimated acquisition scene or the code table is determined according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block.
The method according to claim 4, wherein the code tables corresponding to different collection scenarios are different, or the code tables corresponding to different distribution characteristics of the image information of the reference image blocks are different; or, the reference image blocks correspond to The code tables corresponding to the different distribution characteristics of the position information of the first non-zero bit plane in each coefficient block of , are different.
The method according to claim 3, wherein the plurality of frequency coefficients are generated after transforming the to-be-coded image block from the time domain to the frequency domain; wherein the plurality of frequency coefficients comprise a first frequency coefficient and a second frequency coefficient, and the first frequency coefficient and the second frequency coefficient belong to different subbands in the frequency domain respectively;

Both the first frequency coefficient and the second frequency coefficient correspond to the at least two first coding modes.
The method according to claim 6, characterized in that, in the encoding mode using a code table, different sub-bands correspond to different code tables.
The method according to claim 2, wherein the number of the bit planes is multiple, and the multiple bit planes include multiple bit planes corresponding to different color components; wherein, the multiple different color components correspond to At least one of the bit planes corresponds to the at least two first coding modes.
The method according to claim 8, wherein the plurality of bit planes corresponding to the different color components comprise a bit plane of the Y component, a bit plane of the U component, and a bit plane of the V component;

Alternatively, the plurality of bit planes corresponding to the different color components include a bit plane of the R component, a bit plane of the G component, and a bit plane of the B component.
The method according to claim 2, wherein the selecting a target encoding mode from at least two first encoding modes comprises:

encoding the position information of the first non-zero bit plane of each of the coefficient blocks based on each of the at least two first encoding modes, to obtain at least two encoding results;

A first encoding mode corresponding to an optimal encoding result among the at least two encoding results is determined as the target encoding mode.
The method according to claim 1, wherein, in the coefficient block, starting from the first non-zero bit plane, all bits of the same bit plane in the coefficient block are used as a coding unit;

The encoding of each bit plane starting from the first non-zero bit plane in each of the coefficient blocks includes:

According to the target coding length indicated by the predetermined code rate, a target coding unit is determined from the coding units according to a predetermined scanning order; the target coding unit indicates the object to be coded in the coding unit;

encoding the target encoding unit to generate the second encoding information;

The generating an encoded code stream about the to-be-encoded image block further includes: encoding the position information of the last target coding unit.
The method according to claim 11, wherein the determining a target coding unit from the coding units according to a predetermined scanning order according to a target coding length indicated by a predetermined code rate comprises:

Determine the coding length of the coding unit sequentially according to a predetermined scanning order;

The number of target coding units is determined from the coding units according to the target coding length and the coding length of the coding units.
The method according to claim 11, wherein the number of the bit planes is multiple, and the multiple bit planes include a plurality of bit planes corresponding to different color components;

The predetermined scanning order indicates an order in which the plurality of coding units corresponding to the different color components are mixed and scanned.
The method of claim 11, wherein the predetermined scanning order includes at least an order from low frequency to high frequency and/or an order from the most significant bit-plane to the least significant bit-plane.
The method according to claim 11, wherein the encoding the target coding unit comprises:

The target coding unit is coded according to the second coding mode corresponding to the target coding unit.
The method according to claim 15, wherein the second coding mode corresponding to the target coding unit is determined according to distribution characteristics of the target coding unit;

Or, the second coding mode corresponding to the target coding unit is determined according to the position where the target coding unit is located.
The method according to claim 16, wherein the second coding mode corresponding to the target coding unit is determined according to the distribution feature of the target coding unit and the mapping relationship between the pre-stored distribution feature and the second coding mode;

Alternatively, the second encoding mode corresponding to the target coding unit is determined according to the position of the target coding unit and a pre-stored mapping relationship between the position of the target coding unit and the second encoding mode.
The method according to claim 16 or 17, wherein the position where the target coding unit is located includes at least any one of the following: a bit depth of a bit plane corresponding to the target coding unit or a subclass to which the target coding unit belongs. bring.
The method according to claim 17, wherein the pre-stored mapping relationship between the distribution feature and the second coding mode or the pre-stored mapping relationship between the location of the target coding unit and the second coding mode based on machine learning.
The method according to claim 15, wherein the second encoding mode includes at least a fixed-length encoding mode and/or at least two variable-length encoding modes.
The method of claim 20, wherein each of the at least two variable-length encoding modes includes at least two code tables;

The encoding of the target encoding unit according to the second encoding mode corresponding to the target encoding unit includes:

When the second coding mode corresponding to the target coding unit is a variable-length coding mode, the at least two code tables are used to respectively encode different bits of the target coding unit.
The method of claim 11, further comprising:

If the coded length of the remaining bits of the target coding unit cannot meet the coding length indicated by the predetermined code rate, preset data is added according to the coding length of the target coding unit.
The method according to claim 1, wherein the bit plane with respect to the plurality of frequency coefficients is determined in the following manner:

decomposing the plurality of frequency coefficients into signs for the plurality of frequency coefficients and absolute values for the plurality of frequency coefficients;

determining a bit plane with respect to the plurality of frequency coefficients according to the absolute values of the plurality of frequency coefficients;

The method further includes: using the coefficient block as a unit, encoding the symbols of the plurality of frequency coefficients to generate third encoding information.
The method according to claim 23, wherein the encoding the symbols of the plurality of frequency coefficients further comprises:

The signs of frequency coefficients that are not zero are encoded.
The method according to claim 1, wherein, in the encoded code stream about the image block to be encoded, the first encoding information is located at a first position in the encoded code stream , the second encoding information is located at a second position in the encoded code stream;

The first position and the second position are used for the decoding end to synchronously decode the encoded code stream from the first position and the second position.
The method according to claim 23, wherein, in the encoded code stream about the image block to be encoded, the first encoding information is located at a first position in the encoded code stream , the second encoding information and the third encoding information are located at the second position in the encoded code stream;

The first position and the second position are used for the decoding end to synchronously decode the encoded code stream from the first position and the second position.
The method according to claim 25 or 26, wherein one of the first position and the second position is a header position of the encoded code stream, and the other is the encoded code stream. The tail position of the encoded bitstream.
The method according to claim 1, wherein the plurality of frequency coefficients are generated after wavelet transform is performed on the image block to be encoded.
The method according to claim 28, wherein the plurality of frequency coefficients are generated based on wavelet transform of the image to be encoded after the front and rear flips.
An image decoding method, comprising:

obtaining an encoded code stream, where the encoded code stream includes first encoding information and second encoding information;

Decoding the first encoding information to obtain the position information of the first non-zero bit plane of each coefficient block; and decoding the second encoding information to obtain the first non-zero bit plane from the coefficient block in each coefficient block Bits on each bit plane at the beginning of the plane;

According to the position information of the first non-zero bit plane of each coefficient block and the bits on each bit plane starting from the first non-zero bit plane in each of the coefficient blocks, obtain a plurality of frequency coefficients on the bit plane;

The decoded image block is obtained by using the plurality of frequency coefficients.
The method according to claim 30, wherein the encoded code stream includes a target encoding mode corresponding to the generation of the first encoding information;

The decoding the first encoding information includes: decoding the first encoding information according to the target encoding mode.
The method according to claim 31, wherein the target encoding mode is one of the following: at least one encoding mode using a code table and/or at least one predictive encoding mode.
The method according to claim 32, wherein, in the mode of encoding using a code table, the code table is determined according to an estimated acquisition scene of the decoded image block; or,

The stop table is determined according to the distribution characteristics of the image information of the reference image blocks related to the estimated acquisition scene; or,

The code table is determined according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block.
The method according to claim 33, wherein the code tables corresponding to different collection scenarios are different, or the code tables corresponding to different distribution characteristics of the image information of the reference image blocks are different; or, the reference image blocks correspond to The code tables corresponding to the different distribution characteristics of the position information of the first non-zero bit plane in each coefficient block of , are different.
The method according to claim 31, wherein the plurality of frequency coefficients are generated after transforming the to-be-coded image block from a time domain to a frequency domain; wherein the plurality of frequency coefficients comprise a first frequency coefficient and a second frequency coefficient, and the first frequency coefficient and the second frequency coefficient belong to different subbands in the frequency domain respectively;

Both the first frequency coefficient and the second frequency coefficient correspond to the target coding mode.
The method according to claim 35, characterized in that, if the target encoding mode is the encoding mode using a code table, different sub-bands correspond to different code tables.
The method according to claim 31, wherein the number of the bit planes is multiple, and the multiple bit planes include multiple bit planes corresponding to different color components; wherein, the multiple different color components correspond to At least one of the bit planes corresponds to the target coding mode.
method according to claim 37, is characterized in that, the bit plane corresponding to a plurality of described different color components comprises the bit plane of Y component, the bit plane of U component and the bit plane of V component;

Alternatively, the plurality of bit planes corresponding to the different color components include a bit plane of the R component, a bit plane of the G component, and a bit plane of the B component.
The method according to claim 30, wherein in the coding process, in the coefficient block, starting from the first non-zero bit plane, all bits of the same bit plane in the coefficient block are taken as one coding unit;

The coded code stream also includes the position information of the last target coding unit; the target coding unit indicates the object to be coded in the coding unit;

Before the decoding of the second encoded information, the method includes:

The second encoding information is acquired from the encoded code stream according to the position information of the last target encoding unit.
The method according to claim 39, wherein the target coding unit is determined by the coding end after scanning the coding unit according to a predetermined scanning order.
The method of claim 40, wherein the number of the bit planes is multiple, and the multiple bit planes include a plurality of bit planes corresponding to different color components;

The predetermined scanning order indicates an order in which a plurality of coding units corresponding to the different color components are mixedly scanned.
The method according to claim 40, wherein the predetermined scanning order comprises at least an order from low frequency to high frequency and/or an order from the most significant bit bit plane to the least significant bit bit plane.
The method according to claim 39, wherein the decoding the second encoded information further comprises:

Decode the second encoding information according to the second encoding mode corresponding to the second encoding information to obtain a target coding unit.
The method according to claim 43, wherein, in the encoding process, the second encoding mode is determined according to the distribution characteristics of the target encoding unit; or, the second encoding mode is determined according to the target encoding unit. determined by the location.
The method according to claim 44, wherein, in the encoding process, the second encoding mode is determined according to the distribution characteristics of the target coding unit and the mapping relationship between the pre-stored distribution characteristics and the second encoding mode;

Alternatively, the second coding mode is determined according to the location of the target coding unit and a pre-stored mapping relationship between the location of the target coding unit and the second coding mode.
The method according to claim 44 or 45, wherein the position where the target coding unit is located includes at least any one of the following: a bit depth of a bit plane corresponding to the target coding unit or a subclass to which the target coding unit belongs. bring.
The method according to claim 45, wherein the pre-stored mapping relationship between the distribution feature and the second coding mode or the pre-stored mapping relationship between the location of the target coding unit and the second coding mode based on machine learning.
The method according to claim 43, wherein the second encoding mode includes at least a fixed-length encoding mode and/or at least two variable-length encoding modes.
The method of claim 48, wherein the variable-length coding mode includes at least two code tables;

The decoding of the second encoded information further includes:

When the second encoding mode is a variable-length encoding mode, the at least two code tables are used to decode different code words in the second encoding information respectively.
The method according to claim 30, wherein the encoded code stream further includes third encoding information;

The method further includes: decoding the third encoded information to obtain the sign of the frequency coefficient.
The method of claim 30, wherein the method further comprises:

Obtaining the first encoding information from a first position of the encoded code stream, and obtaining the second encoding information from a second position of the encoded code stream;

The decoding of the first encoded information and the decoding of the second encoded information include: synchronously decoding the first encoded information and the second encoded information.
The method of claim 50, wherein the method further comprises:

Obtain the first encoding information from the first position of the encoded code stream, and obtain the second encoding information or the third code stream from the second position of the encoded code stream;

The decoding of the first encoded information and the decoding of the second encoded information include: synchronously decoding the first encoded information and the second encoded information; or,

The decoding of the first encoded information and the decoding of the third encoded information include: synchronous decoding of the first encoded information and the third encoded information.
The method according to claim 51 or 52, wherein one of the first position and the second position is a header position of the encoded code stream, and the other is the encoded code stream. The tail position of the encoded bitstream.
The method according to claim 30, wherein the decoded image is obtained by performing inverse wavelet transform on the plurality of frequency coefficients.
The method according to claim 54, wherein the decoded image is obtained by performing inverse wavelet transform on the plurality of frequency coefficients, and then performing front and rear inversion.
An image encoding device, comprising:

processor;

memory for storing processor-executable instructions;

Wherein, the processor invokes the executable instruction, and when the executable instruction is executed, is used to execute:

Obtain multiple frequency coefficients of the image block to be encoded;

obtaining one or more blocks of coefficients from the bit planes for the plurality of frequency coefficients; the blocks of coefficients comprising binary bits of at least one frequency coefficient;

determining the position information of the first non-zero bit plane in each of the coefficient blocks;

encoding the position information of the first non-zero bit plane of each of the coefficient blocks, to generate first encoding information; encoding to generate second encoding information;

According to the first encoding information and the second encoding information, an encoded code stream of the image block to be encoded is generated.
The apparatus according to claim 56, wherein the processor is further configured to: select a target encoding mode from at least two first encoding modes, and use the target encoding mode to perform a encoding the position information of the non-zero bit planes; and,

The generating an encoded code stream for the image block to be encoded further includes: encoding the target encoding mode.
The apparatus of claim 57, wherein the at least two first encoding modes include at least one mode using code table encoding and/or at least one predictive encoding mode.
The apparatus according to claim 58, wherein the code table is determined according to an estimated acquisition scene of the image block to be encoded; or, the code table is determined according to a reference image related to the estimated acquisition scene or the code table is determined according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block.
The device according to claim 59, wherein the code tables corresponding to different collection scenarios are different, or the code tables corresponding to different distribution characteristics of the image information of the reference image blocks are different; or, the reference image blocks correspond to The code tables corresponding to the different distribution characteristics of the position information of the first non-zero bit plane in each coefficient block of , are different.
The apparatus according to claim 58, wherein the plurality of frequency coefficients are generated after transforming the to-be-coded image block from the time domain to the frequency domain; wherein the plurality of frequency coefficients comprise a first frequency coefficient and a second frequency coefficient, and the first frequency coefficient and the second frequency coefficient belong to different subbands in the frequency domain respectively;

Both the first frequency coefficient and the second frequency coefficient correspond to the at least two first coding modes.
The apparatus according to claim 61, wherein in the mode of encoding using a code table, different sub-bands correspond to different code tables.
The apparatus according to claim 57, wherein the number of the bit planes is multiple, and the multiple bit planes include multiple bit planes corresponding to different color components; wherein, the multiple different color components correspond to At least one of the bit planes corresponds to the at least two first coding modes.
The device according to claim 63, wherein the plurality of bit planes corresponding to the different color components comprise a bit plane of the Y component, a bit plane of the U component, and a bit plane of the V component;

Alternatively, the plurality of bit planes corresponding to the different color components include a bit plane of the R component, a bit plane of the G component, and a bit plane of the B component.
The apparatus of claim 57, wherein the processor is further configured to:

encoding the position information of the first non-zero bit plane of each of the coefficient blocks based on each of the at least two first encoding modes, to obtain at least two encoding results;

A first encoding mode corresponding to an optimal encoding result among the at least two encoding results is determined as the target encoding mode.
The apparatus according to claim 56, wherein, in the coefficient block, starting from the first non-zero bit plane, all bits of the same bit plane in the coefficient block are used as a coding unit;

The processor is further configured to: determine a target encoding unit from the encoding units according to a predetermined scanning order according to a target encoding length indicated by a predetermined code rate; the target encoding unit indicates the encoding unit that needs to be encoded. object;

encoding the target encoding unit to generate the second encoding information;

Encode the location information of the last target coding unit.
The apparatus of claim 66, wherein the processor is further configured to:

Determine the coding length of the coding unit sequentially according to a predetermined scanning order;

The number of target coding units is determined from the coding units according to the target coding length and the coding length of the coding units.
The device according to claim 66, wherein the number of the bit planes is multiple, and the multiple bit planes include a plurality of bit planes corresponding to different color components;

The predetermined scanning order indicates an order in which a plurality of coding units corresponding to the different color components are mixedly scanned.
66. The apparatus of claim 66, wherein the predetermined scanning order includes at least an order from low frequency to high frequency and/or an order from the most significant bit bit plane to the least significant bit bit plane.
The apparatus according to claim 66, wherein the processor is further configured to: encode the target coding unit according to a second coding mode corresponding to the target coding unit.
The device according to claim 70, wherein the second coding mode corresponding to the target coding unit is determined according to the distribution characteristics of the target coding unit;

Or, the second coding mode corresponding to the target coding unit is determined according to the position where the target coding unit is located.
The device according to claim 71, wherein the second coding mode corresponding to the target coding unit is determined according to the distribution feature of the target coding unit and the mapping relationship between the pre-stored distribution feature and the second coding mode;

Alternatively, the second encoding mode corresponding to the target coding unit is determined according to the position of the target coding unit and a pre-stored mapping relationship between the position of the target coding unit and the second encoding mode.
The apparatus according to claim 71 or 72, wherein the position where the target coding unit is located includes at least any one of the following: a bit depth of a bit plane corresponding to the target coding unit or a subclass to which the target coding unit belongs. bring.
The apparatus according to claim 72, wherein the pre-stored mapping relationship between the distribution feature and the second coding mode or the pre-stored mapping relationship between the location of the target coding unit and the second coding mode based on machine learning.
The apparatus according to claim 70, wherein the second encoding mode includes at least a fixed-length encoding mode and/or at least two variable-length encoding modes.
The apparatus of claim 75, wherein each of the at least two variable-length encoding modes includes at least two code tables;

The processor is further configured to use the at least two code tables to encode different bits of the target coding unit respectively when the second coding mode corresponding to the target coding unit is a variable-length coding mode.
The apparatus according to claim 65, wherein the processor is further configured to: if the encoded length of the remaining bits of the target coding unit cannot meet the encoding length indicated by the predetermined code rate, according to Preset data is added to the coding length of the target coding unit.
The apparatus of claim 56, wherein the processor is further configured to: decompose the plurality of frequency coefficients into signs with respect to the plurality of frequency coefficients and absolute values with respect to the plurality of frequency coefficients ;

determining a bit plane with respect to the plurality of frequency coefficients according to the absolute values of the plurality of frequency coefficients;

Using the coefficient block as a unit, the symbols of the plurality of frequency coefficients are encoded to generate third encoded information.
The apparatus of claim 78, wherein the processor is further configured to: encode the sign of the frequency coefficient that is not zero.
The apparatus according to claim 56, wherein, in the encoded code stream about the image block to be encoded, the first encoding information is located at a first position in the encoded code stream , the second encoding information is located at a second position in the encoded code stream;

The first position and the second position are used for the decoding end to synchronously decode the encoded code stream from the first position and the second position.
The apparatus according to claim 78, wherein, in the encoded code stream about the image block to be encoded, the first encoding information is located at a first position in the encoded code stream , the second encoding information and the third encoding information are located at the second position in the encoded code stream;

The first position and the second position are used for the decoding end to synchronously decode the encoded code stream from the first position and the second position.
The apparatus according to claim 56, wherein the plurality of frequency coefficients are generated after wavelet transform is performed on the image block to be encoded.
The apparatus according to claim 82, wherein the plurality of frequency coefficients are generated based on wavelet transform of the image to be encoded after being flipped back and forth.
An image decoding device, comprising:

processor;

memory for storing processor-executable instructions;

Wherein, the processor invokes the executable instruction, and when the executable instruction is executed, is used to execute:

obtaining an encoded code stream, where the encoded code stream includes first encoding information and second encoding information;

Decoding the first encoding information to obtain the position information of the first non-zero bit plane of each coefficient block; and decoding the second encoding information to obtain the first non-zero bit plane from the coefficient block in each coefficient block Bits on each bit plane at the beginning of the plane;

According to the position information of the first non-zero bit plane of each coefficient block and the bits on each bit plane starting from the first non-zero bit plane in each of the coefficient blocks, obtain a plurality of frequency coefficients on the bit plane;

The decoded image block is obtained by using the plurality of frequency coefficients.
The apparatus according to claim 84, wherein the encoded code stream includes a target encoding mode corresponding to generating the first encoding information;

The processor is further configured to: decode the first encoding information according to the target encoding mode.
The apparatus according to claim 85, wherein the target encoding mode is one of the following: at least one encoding mode using a code table and/or at least one predictive encoding mode.
The apparatus according to claim 86, wherein, in the mode of encoding using a code table, the code table is determined according to an estimated acquisition scene of the decoded image block; or,

The stop table is determined according to the distribution characteristics of the image information of the reference image blocks related to the estimated acquisition scene; or,

The code table is determined according to the distribution characteristics of the position information of the first non-zero bit plane in each coefficient block corresponding to the reference image block.
The apparatus according to claim 87, wherein the code tables corresponding to different collection scenarios are different, or the code tables corresponding to different distribution characteristics of the image information of the reference image blocks are different; or, the reference image blocks correspond to The code tables corresponding to the different distribution characteristics of the position information of the first non-zero bit plane in each coefficient block of , are different.
The apparatus according to claim 85, wherein the plurality of frequency coefficients are generated after transforming the to-be-coded image block from the time domain to the frequency domain; wherein the plurality of frequency coefficients comprise a first frequency coefficient and a second frequency coefficient, and the first frequency coefficient and the second frequency coefficient belong to different subbands in the frequency domain respectively;

Both the first frequency coefficient and the second frequency coefficient correspond to the target coding mode.
The apparatus according to claim 89, wherein, if the target encoding mode is the mode using the code table encoding, different subbands correspond to different code tables.
The apparatus according to claim 85, wherein the number of the bit planes is multiple, and the multiple bit planes include multiple bit planes corresponding to different color components; wherein, the multiple different color components correspond to At least one of the bit planes corresponds to the target coding mode.
The device according to claim 91, wherein the plurality of bit planes corresponding to the different color components comprise a bit plane of the Y component, a bit plane of the U component, and a bit plane of the V component;

Alternatively, the plurality of bit planes corresponding to the different color components include a bit plane of the R component, a bit plane of the G component, and a bit plane of the B component.
The apparatus according to claim 84, wherein in the coding process, in the coefficient block, starting from the first non-zero bit plane, all bits of the same bit plane in the coefficient block are taken as one coding unit;

The coded code stream also includes the position information of the last target coding unit; the target coding unit indicates the object to be coded in the coding unit;

The processor is further configured to: obtain the second encoding information from the encoded code stream according to the position information of the last target encoding unit.
The apparatus according to claim 93, wherein the target coding unit is determined by the coding end after scanning the coding unit according to a predetermined scanning order.
The device according to claim 94, wherein the number of the bit planes is multiple, and the multiple bit planes include a plurality of bit planes corresponding to different color components;

The predetermined scanning order indicates an order in which a plurality of coding units corresponding to the different color components are mixedly scanned.
The apparatus of claim 94, wherein the predetermined scanning order includes at least an order from low frequency to high frequency and/or an order from the most significant bit bit plane to the least significant bit bit plane.
The apparatus according to claim 93, wherein the processor is further configured to: decode the second encoding information according to the second encoding mode corresponding to the second encoding information to obtain the target encoding unit.
The apparatus according to claim 97, wherein in the encoding process, the second encoding mode is determined according to the distribution characteristics of the target encoding unit; or, the second encoding mode is determined according to the target encoding unit. determined by the location.
The device according to claim 98, wherein in the encoding process, the second encoding mode is determined according to the distribution characteristics of the target coding unit and the mapping relationship between the pre-stored distribution characteristics and the second encoding mode;

Alternatively, the second coding mode is determined according to the location of the target coding unit and a pre-stored mapping relationship between the location of the target coding unit and the second coding mode.
The apparatus according to claim 98 or 99, wherein the position where the target coding unit is located includes at least any one of the following: a bit depth of a bit plane corresponding to the target coding unit or a subclass to which the target coding unit belongs. bring.
The apparatus according to claim 99, wherein the pre-stored mapping relationship between the distribution feature and the second coding mode or the pre-stored mapping relationship between the location of the target coding unit and the second coding mode based on machine learning.
The apparatus according to claim 97, wherein the second encoding mode includes at least a fixed-length encoding mode and/or at least two variable-length encoding modes.
The apparatus of claim 102, wherein the variable-length coding mode includes at least two code tables;

The processor is further configured to: when the second encoding mode is a variable-length encoding mode, use the at least two code tables to respectively decode different code words in the second encoding information.
The apparatus according to claim 84, wherein the encoded code stream further includes third encoding information;

The processor is further configured to: decode the third encoded information to obtain the sign of the frequency coefficient.
The apparatus according to claim 84, wherein the processor is further configured to: obtain the first encoding information from a first position of the encoded code stream, and obtain the first encoding information from the encoded code stream The second encoding information is obtained from the second location of the device; and the first encoding information and the second encoding information are synchronously decoded.
The apparatus according to claim 103, wherein the processor is further configured to: obtain the first encoding information from a first position of the encoded code stream, and obtain the first encoding information from the encoded code stream obtain the second encoding information or the third code stream from the second position;

Perform synchronous decoding on the first encoded information and the second encoded information; or perform synchronous decoding on the first encoded information and the third encoded information.
The device according to claim 105 or 106, wherein one of the first position and the second position is a header position of the encoded code stream, and the other is the encoded code stream. The tail position of the encoded bitstream.
An image processing system, characterized in that it comprises an image processing module, the image encoding device as claimed in any one of claims 56 to 83, the image decoding device as claimed in any one of claims 84 to 107, and a memory;

The image processing module is used to process the image, and transmit the processed image to the image encoding device;

The image encoding device is configured to, after dividing the processed image into a plurality of image blocks to be encoded, perform compression encoding processing on the image blocks to be encoded to generate an encoded code stream;

the memory is used for storing the encoded code stream;

The image decoding device is configured to decode the encoded code stream, and transmit the decoded result to the image processing module.
A movable platform, characterized in that it includes the image encoding device as claimed in any one of claims 56 to 83 ; or, includes the image processing system as claimed in claim 108 .
The movable platform of claim 109, wherein the movable platform comprises an unmanned aerial vehicle, an unmanned vehicle, or a mobile robot.
An image transmission system, characterized in that it includes an image transmitter and an image receiver; wherein the image transmitter includes the image encoding device according to any one of claims 56 to 83, and the image receiver includes The image decoding apparatus according to any one of claims 84 to 107;

The image transmitter is configured to send the encoded code stream generated after encoding by the image encoding device to the image receiver;

The image receiver is configured to use the image decoding device to decode the encoded code stream after receiving the encoded code stream.
A computer-readable storage medium, characterized in that computer instructions are stored thereon, and when the instructions are executed by a processor, implement the method described in any one of claims 1 to 29 or any one of claims 30 to 55.