[go: up one dir, main page]

WO2011057557A1 - Method, apparatus and system for video image encoding and decoding - Google Patents

Method, apparatus and system for video image encoding and decoding Download PDF

Info

Publication number
WO2011057557A1
WO2011057557A1 PCT/CN2010/078564 CN2010078564W WO2011057557A1 WO 2011057557 A1 WO2011057557 A1 WO 2011057557A1 CN 2010078564 W CN2010078564 W CN 2010078564W WO 2011057557 A1 WO2011057557 A1 WO 2011057557A1
Authority
WO
WIPO (PCT)
Prior art keywords
block
sub
blocks
value
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2010/078564
Other languages
French (fr)
Chinese (zh)
Inventor
林永兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of WO2011057557A1 publication Critical patent/WO2011057557A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/59Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques

Definitions

  • the present invention relates to the field of video image processing technologies, and in particular, to a video image encoding process, a decoding processing method and apparatus, and a codec system. Background technique
  • Block-based hybrid video coding framework is one of the general video compression coding processing schemes, mainly including intra coding technology and inter coding technology.
  • the intra-frame coding technique utilizes spatial correlation, such as the use of intra prediction (Intra prediction) technology to remove spatial redundancy information.
  • the intra-frame coding can be completed by the current frame information itself, without reference to other frame information, ie No reference frames are required.
  • the interframe coding technique utilizes time domain correlation, such as the use of motion compensated prediction techniques to remove time domain redundancy information, which requires predictive coding based on information of the reference frame.
  • the above coding technique can achieve higher compression efficiency of video image coding.
  • one prior art performs predictive coding by using correlation between coded neighboring blocks, and writes a prediction mode of the image component and its corresponding prediction residual information into the code. Streaming, so that the decoding end decodes according to the preset mode and its corresponding prediction residual information.
  • Another prior art is: intra prediction coding using correlation between adjacent pixels, such as: downsampling a block of size 8 X 8 (in pixels) to obtain 4 sizes of 4 X The block of 4, intra-coded a block of size 4 X 4 to obtain a reconstructed image of the block, and predictively encodes other blocks of size 4 ⁇ 4 with reference to the reconstructed image of the block.
  • the invention provides a video image encoding process, a decoding processing method and device, and a codec system to improve the compression efficiency of video image coding.
  • the present invention provides a video image encoding processing method, including:
  • the third block is subjected to predictive coding processing with reference to the reconstructed image value of the second block.
  • the present invention also provides a video image decoding processing method, including: Obtaining coded bit information of a macroblock of a video image frame, the macroblock including at least: a first block, a second block, and a third block;
  • the decoded image values of the blocks in the macroblock are subjected to upsampling synthesis processing to obtain decoded image values of the macroblocks.
  • the present invention also provides a video image encoding processing apparatus, including:
  • a downsampling and decoding module configured to perform downsampling on a macroblock of a video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block;
  • An intra prediction coding module configured to perform intra prediction coding processing on the first block to obtain a reconstructed image value of the first block
  • a first inter-pixel predictive coding module configured to perform predictive coding processing on the second block with reference to a reconstructed image value of the first block, to obtain a reconstructed image value of the second block;
  • the second inter-pixel predictive coding module is configured to perform predictive coding processing on the third block with reference to the reconstructed image value of the second block.
  • the present invention also provides a video image decoding processing apparatus, including:
  • a bit information acquiring module configured to acquire coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block;
  • An intra prediction decoding module configured to perform intra prediction decoding on the coded bit information of the first block Processing, obtaining a decoded image value of the first block;
  • the first inter-pixel predictive decoding module is configured to perform predictive decoding processing on the encoded bit information of the second block with reference to the decoded image value of the first block to obtain a decoded image value of the second block;
  • a second inter-pixel prediction decoding module configured to perform predictive decoding processing on the encoded bit information of the third block with reference to the decoded image value of the second block, to obtain a decoded image value of the third block;
  • a decoded image generating module configured to perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock.
  • the invention also provides a video image coding and decoding system, comprising:
  • a video image encoding processing apparatus configured to downsample a macroblock of a video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block; Performing intra prediction coding processing to obtain a reconstructed image value of the first block; performing prediction encoding processing on the second block with reference to the reconstructed image value of the first block to obtain the second block a reconstructed image value; performing predictive encoding processing on the third block with reference to the reconstructed image value of the second block;
  • a video image decoding processing apparatus configured to acquire coded bit information of the macroblock, where the macroblock includes at least: the first block, the second block, and the third block; and the first block
  • the coded bit information is subjected to intra prediction decoding processing to obtain a decoded image value of the first block; and the coded bit information of the second block is subjected to predictive decoding processing with reference to the decoded image value of the first block.
  • decoded image value of the second block performing predictive decoding processing on the encoded bit information of the third block with reference to the decoded image value of the second block to obtain a decoded image value of the third block;
  • the decoded image values of the respective blocks are subjected to upsampling synthesis processing to obtain decoded image values of the macroblocks.
  • the macroblock is downsampled and decomposed to obtain a plurality of blocks, and the intra prediction technique and the interpixel prediction technique are combined in the process of predictive coding or decoding of the plurality of blocks, and at least two levels of cascade prediction are formed.
  • Reference relationship so you can make full use of the correlation between pixels, improve prediction accuracy and video
  • the coding compression efficiency of the image DRAWINGS
  • FIG. 1 is a flowchart of a video image encoding processing method according to a first embodiment of the present invention
  • FIG. 2 is a flowchart of a video image decoding processing method according to a second embodiment of the present invention.
  • FIG. 3 is a flowchart of a video image encoding processing method according to a third embodiment of the present invention
  • FIG. 3b is a block diagram of a logical implementation of a video image encoding processing method according to a third embodiment of the present invention
  • Schematic diagram of macroblock downsampling
  • FIG. 3 is a schematic diagram of an inter-pixel prediction coding interpolation process according to a third embodiment of the present invention
  • FIG. 4 is a flowchart of a video image decoding processing method according to a fourth embodiment of the present invention.
  • FIG. 5a is a flowchart of a video image encoding processing method according to a fifth embodiment of the present invention
  • FIG. 5b is a schematic diagram of a macroblock decomposition according to a fifth embodiment of the present invention
  • FIG. 6 is a flowchart of a video image decoding processing method according to a sixth embodiment of the present invention.
  • FIG. 7 is a flowchart of a video image encoding processing method according to a seventh embodiment of the present invention
  • FIG. 7b is a schematic diagram of a macroblock down sampling decomposition according to a seventh embodiment of the present invention.
  • FIG. 8 is a flowchart of a video image decoding processing method according to an eighth embodiment of the present invention.
  • FIG. 9 is a schematic structural diagram of a video image encoding processing apparatus according to a ninth embodiment of the present invention.
  • FIG. 10 is a schematic structural diagram of a video image decoding processing apparatus according to a tenth embodiment of the present invention; Schematic diagram of video image codec system structure. detailed description
  • FIG. 1 is a flowchart of a video image encoding processing method according to a first embodiment of the present invention. As shown in FIG. 1, the video image encoding processing method in this embodiment includes:
  • Step 11 Down-sampling the macroblock of the video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block.
  • the pixels in the first block include at least pixels in the lower right corner of the macroblock.
  • the pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block.
  • the pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
  • the pixels in the first block may include at least pixels in the upper left corner of the macroblock.
  • the pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block.
  • the pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.
  • the downsampling coefficient can be determined according to the actual coding requirement of the video image frame, and the macroblock is downsampled and decomposed according to the downsampling coefficient to obtain a plurality of blocks. If the downsampling coefficient is ⁇ , the macroblock is downsampled and decomposed.
  • Grouping in A 2 blocks for example, dividing the blocks into N groups according to the positional relationship between the pixels constituting each block extracted in the downsampling process, each group consisting of one or more blocks, and N is greater than or equal to An integer of 3.
  • the first group includes at least one first block
  • the second group includes at least one second block
  • the third group includes at least one third block, and so on, until packet processing of the blocks is completed.
  • grouping may be performed according to whether the positional relationship of the extracted pixels is adjacent, for example, dividing different blocks composed of pixels adjacent to the same pixel into the same group.
  • Step 12 Perform intra prediction coding processing on the first block to obtain a reconstructed image value of the first block.
  • the first block may be subjected to intra prediction encoding processing by using pixel values of other macroblocks encoded in the frame to obtain a reconstructed image value of the first block.
  • the reconstructed image value consists of the predicted value of the first block and the predicted residual after the encoding process.
  • Step 13 Perform a predictive coding process on the second block with reference to the reconstructed image value of the first block, and obtain a reconstructed image value of the second block.
  • the reconstructed image value of the first block may be interpolated, and the pixel value of the corresponding position after interpolation is used as the predicted value of the second block, according to the second block.
  • the predicted value and its prediction residual construct the reconstructed image value of the second block.
  • Step 14 Perform prediction encoding processing on the third block with reference to the reconstructed image value of the second block.
  • the reconstructed image value of the second block may be interpolated, and the pixel value of the corresponding position after interpolation is used as the predicted value of the third block, according to the third block.
  • the predicted value and its prediction residual construct the reconstructed image value of the third block.
  • the first block, the second block, and the third block are respectively composed of pixels extracted by using a preset extraction rule for all pixels of the macro block, such as every other or more in the horizontal and vertical directions or other directions.
  • the pixels extracted by the pixels are respectively composed, and thus the pixels of the respective blocks are interlaced with each other in the spatial position.
  • the coded block (the 2/3th block) and the pixel of the reference block (the first block) are spatially interlaced with each other, the prediction between them can be fully utilized. Correlation between pixels, thereby improving prediction accuracy. It can be seen that this is an inter-pixel prediction method, which is different from the intra prediction method of step 12 (which is actually an inter-block prediction method).
  • the inter-pixel predictive coding method is used to predictively encode other blocks than the first block, and the second block is the reconstructed image value of the first block.
  • the third block is predictive coding based on the reconstructed image value of the second block.
  • a similar prediction reference relationship may be established between the plurality of blocks obtained by the macroblock downsampling decomposition.
  • at least two levels of cascaded prediction reference relationships are formed.
  • coded bit information of each block obtained by encoding the prediction residual of each block in the macroblock may be written into the code stream.
  • the coded stream is written to the prediction residual information of each block will be described as an example.
  • This embodiment is exemplified by encoding in the order of the first block, the second block, and the third block. Further, based on the same principle, encoding may be performed in the order of the third block, the second block, the first block, or the first block, the third block, and the second block. The encoding process will not be described again.
  • the macroblock is downsampled and decomposed to obtain a plurality of blocks, and a cascaded reference relationship is formed between the plurality of blocks, so that correlation between pixels can be fully utilized to improve prediction accuracy, and at the same time, the implementation
  • macroblock layer MB level
  • the texture characteristics of different regions in the image can be better adapted on the macroblock layer, thereby improving coding compression efficiency.
  • the size of the macro block in this embodiment may be 16 X 16, 32 X 32, 64 X 64 or 128 X 128, or the like. Macroblocks mentioned in the subsequent embodiments refer to this meaning unless otherwise stated.
  • FIG. 2 is a flowchart of a video image decoding processing method according to a second embodiment of the present invention.
  • This embodiment is a decoding end processing flow corresponding to the encoding processing method shown in Fig. 1.
  • the video image decoding processing method in this embodiment includes:
  • Step 21 Acquire coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block.
  • the pixels in the first block include at least pixels in the lower right corner of the macroblock.
  • the pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block.
  • the pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
  • the pixels in the first block may include at least pixels in the upper left corner of the macroblock.
  • the pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block.
  • the pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.
  • the macroblock downsampling decomposition mode is shown in step 11 of the corresponding embodiment, and is not described here.
  • the code side carries the coded bit information of each block of the macro block in the code stream sent to the decoding end, and the decoding end decodes the bit information of each block to obtain the residual decoding coefficient of each block.
  • Step 22 Perform intra prediction decoding processing on the coded bit information of the first block to obtain a decoded image value of the first block.
  • the intra-predictive decoding process of the coded bit information constituting the first block can be performed by using the pixel values of other macroblocks decoded in the frame, and the predicted value of the first block and its residual are obtained, and the predicted value of the first block is compared with The residuals are added to obtain the decoded image value of the first block.
  • This is a traditional intra prediction decoding method.
  • Step 23 Perform prediction decoding processing on the coded bit information of the second block with reference to the decoded image value of the first block to obtain a decoded image value of the second block.
  • the method may include performing interpolation processing on the reconstructed image value of the first block, and the pixel value of the corresponding position after the interpolation is used as the predicted value of the second block.
  • the decoded image value of the second block is constructed based on the predicted value of the second block and its residual decoding coefficient.
  • Step 24 Perform prediction decoding processing on the coded bit information of the third block with reference to the decoded image value of the second block to obtain a decoded image value of the third block.
  • the method may include performing interpolation processing on the reconstructed image value of the second block, and the pixel value of the corresponding position after the interpolation is used as the predicted value of the third block.
  • the decoded image value of the third block is constructed based on the predicted value of the third block and its residual decoding coefficient.
  • the decoded block (the 2/3th block) and the pixel of the reference block (the first block) are spatially interlaced with each other, the prediction between them can be fully utilized. Correlation between each pixel, thereby improving prediction accuracy. It can be seen that this is an inter-pixel prediction method, which is different from the intra prediction method of step 22 (which is actually an inter-block prediction method).
  • the inter-pixel prediction decoding method is used for predictive decoding on other blocks except the first block, the second block is predictive decoding based on the decoded image value of the first block, and the third block is based on the decoded image value of the second block. Perform predictive decoding.
  • a similar decoding prediction relationship may be established between multiple blocks included in the macroblock, so that at least two stages are formed in the prediction decoding process of the plurality of blocks obtained by downsampling decomposition. Cascaded predictive reference relationships.
  • Step 25 Perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock.
  • the upsampling synthesis process performed by the decoding end is an inverse process of the downsampling decomposition performed by the encoding end, that is, this step is the inverse process of the step 11 in the encoding process of the first embodiment of the present invention.
  • each block is described by taking the case where the prediction residual coefficient exists in each block as an example. Actually, it is also possible to allow the case where the prediction residual coefficient is not present for each block, or the case where the prediction coefficient is present in the first block and the prediction residual coefficient is not present in the second block and the third block, or other combinations.
  • This embodiment is exemplified by decoding in the order of the first block, the second block, and the third block. Further, based on the same principle, decoding may be performed in the order of the third block, the second block, the first block, or the first block, the third block, and the second block.
  • the video image decoding processing method performs downsampling decomposition on the macroblock to obtain a plurality of blocks for pre-preprocessing.
  • the intra-prediction decoding technology is combined with the inter-pixel prediction decoding technology, and at least two levels of cascaded prediction reference relationships are formed in the inter-pixel prediction decoding process, so that the pixel correlation can be fully utilized for encoding compression. Decoding of the video image afterwards.
  • FIG. 3 is a flowchart of a video image encoding processing method according to a third embodiment of the present invention
  • FIG. 3b is a logic block diagram of a video image encoding processing method according to a third embodiment of the present invention.
  • a block-based hybrid video coding framework is used to perform a downsampling decomposition of a 16 ⁇ 16 macroblock as an example.
  • the video image encoding processing method of this embodiment includes: Step 31: Input a video image frame to be encoded. Step 32: Taking ⁇ as a downsampling coefficient, performing a 16 ⁇ 16 macroblock in the video image frame
  • FIG. 3c is a schematic diagram of a macroblock downsampling decomposition according to a third embodiment of the present invention. As shown in Fig. 3c, using ⁇ as the downsampling coefficient, downsampling the 16 ⁇ 16 macroblocks in the video image frame is: 2
  • the blocks composed of the positional relationships of the pixels extracted from left to right and from top to bottom may be labeled as: blocks 0, 1, 2, and 3.
  • the blocks are divided into multiple groups, for example: block 3 is a group, blocks 1 and 2 are a group, and block 0 is a group.
  • the predictive coding type of block 3 is intra prediction coding
  • the prediction coding type of other blocks (such as blocks 0, 1, and 2) is inter-pixel predictive coding.
  • the "I” and “P” in the figure respectively indicate that the prediction coding type of the corresponding block is: intra prediction coding or inter-pixel prediction type. Therefore, in this embodiment, the block 3 is the "first block” according to the embodiment of the present invention, and the blocks 1 and 2 are the “second block” according to the embodiment of the present invention, and the block 0 is the embodiment of the present invention. Said "third block".
  • the pixel in the first block includes at least a pixel in a lower right corner of the macroblock.
  • the pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block.
  • the pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
  • the pixels in the first block may include at least pixels in the upper left corner of the macroblock.
  • the pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block.
  • the pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.
  • an aliasing effect may result during downsampling decomposition.
  • the embodiment may perform anti-aliasing filtering on some or all of the pixels of the macroblock in the downsampling decomposition process:
  • anti-aliasing filtering can be performed on all pixels of a macroblock, such as anti-aliasing filtering using a low-pass filter to filter out high-frequency components, thereby reducing aliasing effects on encoding during downsampling decomposition The impact.
  • Anti-aliasing filtering can be used in the downsampling decomposition of blocks 0, 1 and 2, and anti-aliasing filtering is performed in the downsampling process of block 3, such as using a low-pass filter.
  • the anti-aliasing filtering process is performed as a downsampling filter to filter out high frequency components in the block 3. Since block 3 has a large influence on the coding quality of other blocks (blocks 0, 1, and 2), only block 3 is subjected to anti-aliasing filtering processing, and the aliasing effect in the process of reducing the downsampling decomposition is caused by coding. Based on the technical effects of the impact, it also helps to reduce the complexity of the implementation.
  • Step 33 Perform intra prediction coding processing on the block 3 to obtain the reconstructed image value of the block 3 and its prediction residual.
  • This step can perform intra prediction encoding on block 3 by using the pixel values of other macroblocks encoded in the frame.
  • a predicted value is obtained, and the predicted value is compared with the original pixel of the macroblock to obtain a prediction residual of the block 3, and the reconstructed image value of the block 3 is constructed based on the predicted value of the block 3 and the encoded residual after the encoding.
  • Step 34 Perform prediction encoding processing on Block 1 and Block 2 with reference to the reconstructed image values of the block 3, and obtain reconstructed image values of the blocks 1 and 2 and their prediction residuals, respectively.
  • Step 35 Perform prediction encoding processing on block 0 with reference to the reconstructed image values of block 1 and/or block 2, to obtain reconstructed image values of block 0 and prediction residuals thereof.
  • FIG. 3 is a schematic exploded view of inter-pixel predictive coding interpolation processing according to a third embodiment of the present invention. As shown in Fig.
  • a 4-tap (4-tap) interpolation filter can be used to interpolate the referenced reconstructed image values, where: Interpolation processing for block 1 and block 2 is as follows: 4-tap interpolation The filter performs interpolation of the reconstructed image values of the block 3 in the horizontal direction or the vertical direction, and interpolates the vertical direction of the reconstructed image values of the block 3.
  • the pixel value corresponding to the position is a predicted value of the block 1 pixel
  • the pixel value corresponding to the interpolation position in the horizontal direction of the reconstructed image value of the block 3 is the predicted value of the block 2 pixel.
  • the interpolation processing for block 0 is, for example, finding an average value of pixel values of a plurality of pixels around the pixel constituting the block 0, which is a predicted value of the pixel of the block 0.
  • the average of the pixel values of the plurality of pixels around the pixel of the block 0 can be averaged, and FIG. 3d only illustrates the case where the four pixels are from the block 1 and the block 2, respectively.
  • the four pixels may come from block 1 or from blocks 1 and 3, if block 0 is referenced to the reconstructed image value of block 2;
  • the number of pixels may be from block 2 or from blocks 2 and 3; the number of pixels for averaging is not limited to four, for example, the average of the pixel values of 8 pixels around the pixel of block 0 may be averaged. , these 8 pixels can come from blocks 1, 2 And 3, etc.
  • the reconstructed image values of the macroblocks may be constructed according to the reconstructed image values of the blocks obtained in the above steps 33-35, and the reconstructed image values of the macroblocks are stored.
  • Step 36 Write coded bit information of each block obtained by encoding the prediction residual of each block in the macroblock into the code stream.
  • the present embodiment will be described by taking an example in which the prediction residual information of each block is written into the code stream. Actually, it is also possible to allow the prediction residual information of each block not to be written into the code stream, or the prediction residual information of the first block is written into the code stream and the prediction residual information of the second block and the third block is not The case of writing a stream, or other combination.
  • FIG. 3b is a logic block diagram of a video image encoding processing method according to a third embodiment of the present invention. The following describes the processing procedure of the blocks included in the macroblock in the application scenario of the block-based hybrid video coding framework in this embodiment, which may specifically include:
  • the reconstructed image value of block 0 is constructed from the predicted value of block 0 and its prediction residual.
  • the other blocks are separately predicted, and the prediction residuals of the other blocks are sequentially subjected to transform processing, quantization processing, and entropy encoding processing.
  • the prediction residuals of the blocks included in the macroblock are written into the code stream by the entropy coding process, and the reconstructed image values of the respective blocks are combined and processed to obtain reconstructed image values of the macroblocks.
  • the above-mentioned block can have two blocks of different predictive coding types, such as intra prediction and inter-pixel prediction, so that the block can be optimized in the process of quantization processing, as described below. :
  • a block-level quantization mapping mechanism can be established within the macroblock, that is, according to the importance of different blocks within the macroblock, divided into multiple groups, and different quantization parameters (Quinization Parameter, QP for short) are assigned to each block.
  • Block 0 refers to block 1 and block 2
  • block 1 and block 2 refer to block 3.
  • the QP value of the block being referenced is given a smaller QP value.
  • the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the QP value of each block:
  • QP (block 3) QP (Slice) -6
  • QP (Slice) represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance.
  • the prediction residual of each block is subjected to corresponding quantization processing according to the QP value of each block.
  • different quantization and optimization processes can be performed on different blocks according to the different QP values of different blocks in the quantization process as follows:
  • Example 1 In the implementation of Rate-Distortion Optimization (RDO), it is necessary to consider the different QP values of different blocks within a macroblock. When calculating the rate distortion overhead (rdcost) for different blocks, determine according to the QP value. The lambda coefficient corresponding to different QP values is different from the corresponding lambda coefficient.
  • RDO Rate-Distortion Optimization
  • Example 2 In the process of quantization rounding, consider different prediction coding types of each block in the macroblock, and use different dead zones for different blocks to match intra prediction and inter-pixel prediction.
  • the coefficients of the block prediction residual of the prediction coding type are transformed to have different distribution laws.
  • the selection principle of the dead zone used by each block is: a smaller dead zone is used for the intra prediction coding type block, and a larger dead zone is used for the inter-pixel prediction coding type block. That is, a smaller dead zone is used for block 3, and a larger dead zone is used for blocks 2, 1 and 0, for example, the dead zone used for block 3 is A, and the dead zone used for block 2, 1 or 0 is B, Then A is less than B.
  • coded bit information may be carried in the code stream outputted by the coding end, such as entropy coding information of a syntax element "Coded Block Pattern” (CBP) in the video coding standard.
  • CBP Coded Block Pattern
  • the entropy coding process is performed on each block according to the distribution rule of the quantized coefficients of the prediction residuals of each block to obtain better performance, and the results of these entropy coding processes can be written. Code stream.
  • CAVLC Context-based Adaptive Variable Length Coding
  • CAB AC Context-based Adaptive Binary Arithmetic Coding
  • block 0 is predicted by using block 1 and/or block 2 as a reference. Coding, and block 1 and block 2 are both predictively coded with reference to block 3, which is equivalent to forming a cascaded prediction reference relationship between blocks; and since this embodiment encodes at the macroblock layer (MB level) Processing can thus be referred to as "MB-level cascade predictive coding.”
  • this embodiment is used as a macroblock layer coding processing technology, and can be combined with other existing macroblock types. Participate in RDO decision coding together, so as to better adapt to the texture characteristics of different regions in the image on the macroblock layer.
  • the predictive coding of the macroblock of the size of X 16 in this embodiment is described, it can be understood that other sizes of blocks, such as 32 X 32, 64 X 64 or The macroblocks of size 128 X 128 are used for predictive coding.
  • the downsampling coefficients in the predictive coding process can be divided into four groups or more than four groups according to actual needs, thereby forming three or more cascades.
  • the implementation mechanism of the prediction reference relationship is similar to that of the embodiment, and details are not described herein again.
  • FIG. 4 is a flowchart of a video image decoding processing method according to a fourth embodiment of the present invention.
  • This embodiment is a decoding end processing flow corresponding to the encoding processing method provided by the third embodiment of the present invention.
  • the video image decoding processing method in this embodiment includes:
  • Step 41 Acquire a code stream of coded bit information of blocks 0, 1, 2, and 3 included in the macro block of the video image frame.
  • Step 42 Decode the coded bit information of blocks 0, 1, 2, and 3, respectively, to obtain a residual decoding coefficient of each block.
  • the decoding process of the encoded bit information of any block includes processes such as entropy decoding, inverse quantization, inverse transform, and prediction to implement decoding of the encoded compressed video image.
  • step 42 may include step 421 - step 423.
  • Step 421 Perform entropy decoding processing on the coded bit information of each block.
  • the encoding bit information of each block is subjected to corresponding entropy decoding processing, and the block using the intra prediction coding mode for predictive coding has non-zero coefficients.
  • the probability of occurrence is greater than the probability that a block that is predictively coded using inter-pixel predictive coding mode exhibits non-zero coefficients, such as: the probability that block 3 exhibits a non-zero coefficient, which is greater than the probability that blocks 0, 1, and 2 have non-zero coefficients.
  • Step 422 Perform inverse quantization processing on the coefficients after entropy decoding processing of each block.
  • the QP value assignment principle between blocks is: Determine the QP value based on the reference relationship.
  • the reference relationship existing in the subsequent decoding process can be determined in advance, for example, there is a reference relationship as follows: block 0 reference block 1/2, block 1/2 reference sub-block 3.
  • the QP value of the block to be referenced is given a smaller QP value.
  • the QP value of the available block satisfies the following formula:
  • the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the QP value of each block:
  • QP represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance.
  • the entropy-decoded coefficients of the coded bit information of each block are subjected to corresponding inverse quantization processing according to the QP value of each block.
  • Step 423 Perform inverse transform processing on the coefficients of each block subjected to the inverse quantization process to obtain a residual.
  • Step 43 Perform intra prediction decoding processing on the block 3, and add the predicted value to the residual of the block 3 to obtain the decoded image value of the block 3.
  • Step 44 Perform prediction decoding processing on Block 1 and Block 2 with reference to the decoded image value of the block 3, and add the predicted values of the block 1 and the block 2 to their respective residuals to obtain the blocks 1 and 2 respectively. Decode image values.
  • Step 45 Perform prediction decoding processing on block 0 with reference to the decoded image values of block 1 and/or block 2, and add the predicted value of block 0 to its residual to obtain a decoded image value of block 0.
  • step 43 to step 45 are equivalent to establishing a cascading prediction reference relationship in each block prediction decoding process, and the implementation manner thereof is similar to the method in which the coding end establishes a cascading prediction reference relationship in each block predictive coding process.
  • step 32 the description of step 34
  • the method for performing interpolation processing on blocks 0, 1 and 2 in steps 33 and 34 is also applicable to the decoding steps 44 and 45 to improve the accuracy of the prediction, and the specific implementation is here. No longer.
  • Step 46 Perform up-sampling synthesis processing on the decoded image values of the blocks to obtain decoded image values of the macroblocks.
  • each block is described by taking the case where the prediction residual coefficient exists in each block as an example. Actually, it is also possible to allow the case where the prediction residual coefficient is not present for each block, or the case where the prediction coefficient is present in the first block and the prediction residual coefficient is not present in the second block and the third block, or other combinations.
  • both block 1 and block 2 perform prediction decoding with reference to block 3
  • block 0 performs prediction decoding with reference to block 1 and/or block 2, which is equivalent to forming between blocks.
  • the cascaded prediction reference relationship enables decoding of a video image that is encoded and compressed to fully utilize pixel correlation.
  • FIG. 5a is a flowchart of a video image encoding processing method according to a fifth embodiment of the present invention.
  • the difference between this embodiment and the third embodiment of the present invention is that the block 3 is further divided into a plurality of sub-blocks, and the embodiment can be applied to the encoding process of a video image with rich texture details.
  • this is The embodiment video image encoding processing method includes:
  • Step 51 Input a video image frame to be encoded.
  • Step 52 Perform downsampling on the 16 ⁇ 16 macroblock in the video image frame to obtain four 8 ⁇ 8 size blocks, which are respectively labeled as: blocks 0, 1, 2, and 3, and divide the block 3 into 4 blocks.
  • 4 X 4 size sub-blocks labeled as: Sub-blocks 30, 31, 32, and 33, respectively.
  • the method of decomposing the 16 ⁇ 16 macroblocks into four 8 ⁇ 8-sized blocks is similar to the step 32, and details are not described herein again.
  • FIG. 5b is a schematic diagram of macroblock decomposition according to a fifth embodiment of the present invention. As shown in Fig. 5b, this embodiment divides the block 3 into four sub-blocks of size 4 x 4, i.e., sub-blocks 30, 31, 32, and 33. The decomposition process of block 3 in this embodiment does not involve downsampling.
  • the prediction coding types indicated by "I" in each sub-block are: intra prediction coding.
  • Step 53 Perform intra prediction encoding processing on the sub-blocks 30, 31, 32, and 33, respectively, obtain reconstructed image values of the respective sub-blocks, perform synthesis processing on the reconstructed image values of the respective sub-blocks, and obtain reconstruction of the block 3. Image value.
  • the prediction value may be obtained by performing intra prediction on the constituent sub-block 30 by using the pixel values of other macroblocks encoded in the frame, and comparing the predicted value with the original pixel of the macroblock to obtain a prediction residual of the sub-block 30, according to The predicted value of the sub-block 30 and the reconstructed image value of the prediction residual constructed sub-block 30 after the encoding process.
  • the predicted values of the sub-blocks 31, 32, and 33 and their corresponding prediction residuals are obtained, respectively, and reconstructed image values of the sub-blocks 31, 32, and 33 are constructed, respectively.
  • the reconstructed image values of the sub-blocks 30, 31, 32, and 33 are subjected to synthesis processing to obtain reconstructed image values of the block 3.
  • Steps 54-55 are the same as steps 34-35, and are not described here.
  • Step 56 Write coded bit information of each sub-block and each block obtained by encoding the prediction residual of each sub-block and each block in the macro block into the code stream.
  • the encoding processing of the prediction residuals of the sub-blocks 30, 31, 32, and 33 in this step is the same as the encoding processing of the prediction residual of the block (such as block 3) in which the prediction encoding type is intra-predictive encoding in step 36.
  • the prediction residuals for blocks 0, 1, and 2 are the same as the encoding process for the prediction residuals of the blocks whose prediction coding type is inter-pixel prediction coding (such as blocks 0, 1, and 2), and are not described herein again.
  • the video image encoding processing method of this embodiment implements the technical effects of the third embodiment of the present invention, and divides the block 3 into a plurality of sub-blocks (ie, sub-blocks 30, 31, 32, and 33), and performs sub-blocks on each sub-block.
  • intra prediction coding the reconstructed image values of the block 3 obtained by combining the reconstructed image values of the respective sub-blocks are used as references of the blocks 1 and 2.
  • the difference between this embodiment and the third embodiment of the present invention is that one 8 ⁇ 8 intra prediction code in the third embodiment of the present invention is replaced by four 4 ⁇ 4 intra prediction codes. 4 X 4 intra prediction coding is more suitable for encoding video images with rich texture details.
  • FIG. 6 is a flowchart of a video image decoding processing method according to a sixth embodiment of the present invention.
  • This embodiment is a decoding end processing flow corresponding to the encoding processing method provided by the fifth embodiment of the present invention.
  • the video image decoding processing method in this embodiment includes:
  • Step 61 The code stream of the encoded bit information of the sub-blocks 30, 31, 32 and 33 and the blocks 0, 1 and 2 of the acquired macroblock of the video image frame.
  • Step 62 Decode the coded bit information of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2, respectively, to obtain the residual of each block.
  • the decoding process of the coded bit information of the sub-blocks 30, 31, 32 and 33 in this step is the same as the decoding process of the coded bit information of the block (such as block 3) of the predictive coding type being intra-predictive coded in step 42.
  • the coded bit information for the blocks 0, 1 and 2 is the same as the method for decoding the bit information of the block (for example, blocks 0, 1 and 2) whose prediction coding type is inter-pixel predictive coding, and will not be described herein.
  • Step 63 Perform intra prediction decoding processing on the sub-blocks 30, 31, 32, and 33, and add the predicted value to the residual corresponding to each sub-block to obtain a decoded image value of each sub-block, and decode each sub-block.
  • Image value The line synthesis processing obtains the decoded image value of the block 3.
  • the prediction value may be obtained by performing intra prediction on the constituent sub-block 30 by using the pixel values of other macroblocks decoded in the frame, and the decoded image values of the sub-block 30 are constructed according to the prediction value of the sub-block 30 and its residual.
  • the decoded image values of the sub-blocks 31, 32, and 33 are obtained, respectively.
  • FIG. 7a is a flowchart of a video image encoding processing method according to a seventh embodiment of the present invention.
  • the difference between this embodiment and the third embodiment of the present invention is that the present embodiment performs the downsampling decomposition again in block 3 to obtain a plurality of sub-blocks.
  • the difference between this embodiment and the fifth embodiment of the present invention is that the block is in this embodiment. 3
  • the downsampling decomposition is performed again to obtain a plurality of sub-blocks, and the decomposition process in the fifth embodiment does not involve downsampling.
  • This embodiment can be applied to the encoding process of a video image with less texture details.
  • the video image encoding processing method of this embodiment includes: Step 71: Input a video image frame to be encoded. Step 72: Taking ⁇ as a downsampling coefficient, performing a 16 ⁇ 16 macroblock in the video image frame
  • blocks 0, 1, 2, and 3 Decomposed into four blocks of 8 x 8 size, labeled as: blocks 0, 1, 2, and 3;
  • the block of the intra prediction coding type of the prediction coding type may be downsampled again according to the actual coding requirement of the video image frame to obtain multiple sub-blocks, and the prediction coding type of the plurality of sub-blocks is intra prediction.
  • the sub-block of the code is subjected to downsampling decomposition again... and so on, multi-level downsampling decomposition is performed, and the decomposed block or sub-block is encoded. This idea can be called Hierarchical coding method. .
  • the type of predictive coding that can be obtained by downsampling the macroblock is intraframe.
  • the block of the predictive coding is subjected to downsampling decomposition again, and two sub-blocks are obtained.
  • Grouping in 2 blocks for example, according to the positional relationship between the sub-block pixels extracted in the downsampling process, the m 2 sub-blocks are divided into groups, each group consisting of one or more sub-blocks, each group consisting of One or more blocks consisting of "integers greater than or equal to 3.
  • the first group includes at least one first sub-block
  • the second group includes at least one second sub-block
  • the third group includes at least one third sub-block, and so on, until packet processing of 2 sub-blocks is completed .
  • grouping may be performed according to whether the positional relationship of the extracted pixels is adjacent, for example, dividing different sub-blocks composed of pixels adjacent to the same pixel into the same group.
  • FIG. 7b is a schematic diagram of a macroblock downsampling decomposition according to a seventh embodiment of the present invention. As shown in Fig. 7b, this embodiment is equivalent to dividing the block 3 into four sub-blocks of 4 x 4 size, i.e., sub-blocks 30, 31, 32 and 33, on the basis of that shown in Fig. 3c.
  • the 8 ⁇ 8 size block 3 is downsampled to obtain 4 4 ⁇ 4 size sub-block implementation methods, and the 16 ⁇ 16 size macroblock is downsampled into 4 8 ⁇ 8 size blocks.
  • the method is the same. For details, refer to the description in step 32, and details are not described herein again.
  • the sub-blocks may be divided into groups according to the positional relationship of the extracted pixels in the macroblock downsampling process, for example, the sub-blocks 33 are a group, and the sub-blocks 31 and 32 are one. Group, sub-block 30 is a group.
  • sub-block 33 encodes the predictive coding type as intra-prediction, and the predictive coding types for sub-blocks 30, 31 and 32 and blocks 0, 1 and 2 are between pixels. Predictive coding.
  • the "I” and "P” in the figure respectively indicate that the prediction coding type of the corresponding block is: intra prediction coding or inter-pixel prediction coding type. Therefore, in this embodiment, the sub-block 33 is the first embodiment of the present invention. 1 sub-blocks, sub-blocks 31 and 32 are "second sub-blocks" according to an embodiment of the present invention, and sub-block 30 is a "third sub-block” according to an embodiment of the present invention.
  • the pixels in the first block include at least pixels in the lower right corner of the macroblock.
  • the pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block.
  • the pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
  • the pixels in the first block may include at least pixels in the upper left corner of the macroblock.
  • the pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block.
  • the pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.
  • Step 73 Perform intra prediction encoding processing on the sub-block 33 to obtain a reconstructed image value of the sub-block 33.
  • the intra prediction encoding processing method of the sub-block 33 in this step is similar to the method for performing intra prediction encoding processing on the block 3 in step 33, and details are not described herein again.
  • Step 74 Perform prediction coding processing on the sub-blocks 31 and 32 with reference to the reconstructed image values of the sub-blocks 33, and obtain reconstructed image values of the sub-blocks 31 and 32, respectively.
  • the determination of the inter-pixel predictive coding and the prediction reference relationship of the sub-blocks 31 and 32 in this step is similar to the method for predictive coding processing of the blocks 1 and 2 in step 34, and details are not described herein again.
  • the reconstructed image value of the sub-block 33 may be interpolated, and the pixel value of the corresponding position after interpolation is used as the predicted value of the sub-block 31 or 32.
  • the reconstructed image value of the sub-block 31 or 32 is constructed according to the predicted value of the sub-block 31 or 32 and its prediction residual, and the specific implementation thereof is similar to the method for the interpolation processing of the blocks 1 and 2 in step 34, and is no longer Narration.
  • Step 75 Perform prediction encoding processing on the sub-block 30 with reference to the reconstructed image value of the sub-block 31 and/or the sub-block 32 to obtain a reconstructed image value of the sub-block 30.
  • the determination of the inter-pixel predictive coding and the prediction reference relationship of the sub-block 30 in this step is similar to the method of predictive coding processing for the block 0 in the step 35, and details are not described herein again.
  • the reconstructed image values of the sub-blocks 31 and/or 32 may be subjected to interpolation processing, and the pixel values of the corresponding positions after interpolation are used as the predicted values of the sub-block 30,
  • the method for constructing the reconstructed image of the sub-block 30 according to the predicted value of the sub-block 30 and its prediction residual is similar to the method for the interpolation processing of the block 0 in step 35, and details are not described herein again.
  • Step 76 Synthesize the reconstructed image values of the sub-blocks 30, 31, 32, and 33 to obtain the reconstructed image values of the block 3.
  • Step 77-Step 78 is the same as Step 34-Step 35, and details are not described herein again.
  • Step 79 Write coded bit information of each block obtained by encoding the prediction residuals of the sub-blocks 30, 31, 32, and 33, and blocks 0, 1, and 2 into the code stream.
  • the predictive coding process for each sub-block or each block includes performing prediction, transform, quantization, and entropy coding, etc., according to steps 71-79 above. It can be seen that each sub-block and each block form a cascade prediction reference relationship. Due to the introduction of cascading predictive coding, there may be two different predictive coding types in the macroblock, such as intra prediction and inter-pixel prediction. There may be two different pre-J codes in the intra-pre and inter-pixel prediction. The sub-blocks of the type can be optimized during the quantization process of the block.
  • the basic principle of assigning QP values to each sub-block or block is: Determine the QP value according to the reference relationship.
  • block 0 refers to block 1/2
  • block 1/2 refers to block 3
  • sub-block 30 refers to sub-block 31/32
  • sub-block 31/32 refers to sub-block 33.
  • step 362 that is: the QP value of the block being referenced is given a smaller QP value.
  • the QP values of the sub-blocks and the blocks satisfy the following formula: QP (sub-block 33) ⁇ QP (sub-block 31/32) ⁇ QP (sub-block 30) ⁇ QP (block 1/2) ⁇ QP (block 0)
  • the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the sub-block and the QP value of the block:
  • QP (subblock 33) QP (Slice) -10
  • QP represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance.
  • the respective prediction residuals are subjected to corresponding quantization processing based on the QP values of the sub-blocks or blocks.
  • a method for performing a corresponding quantization optimization process on different sub-blocks or blocks is similar to the description of the third embodiment of the present invention, and details are not described herein again.
  • the entropy coding of the macroblocks may be optimized.
  • the entropy coding mode of the syntax element CBP is optimized.
  • the syntax element "cbp_block_flag" in the video coding standard indicates whether four 4x4 sub-blocks in one 8x8 block contain non-zero coefficients. Due to the introduction of cascading predictive coding, a block of 8 ⁇ 8 size may have two sub-blocks of different predictive coding types within and between pixels, and the prediction residuals of sub-blocks of different predictive coding types are quantized. The coefficients have different distribution rules. In view of this, the entropy coding mode of the syntax element "cbp_block_flag" needs to be designed accordingly. The design method is similar to the optimization design method of the entropy coding mode of the syntax element CBP in step 363. This will not be repeated here.
  • a plurality of sub-blocks are obtained by sub-sampling the block 3 again, and the sub-blocks are obtained.
  • Different layers such as layers and block layers respectively establish respective corresponding cascading prediction reference relationships, which is equivalent to forming two levels of cascading coding processing.
  • the embodiment can more fully utilize the correlation between pixels, thereby improving prediction accuracy and encoding compression efficiency of the video image.
  • the macroblock of the size of 16 16 in this embodiment performs two levels of downsampling decomposition as an example, it can be understood that it can be based on the actual needs of video image encoding, such as less rich in texture details.
  • three or more levels of downsampling decomposition may be performed on the basis of the following, for example: sub-blocking decomposition in the sub-block 33 of 4 ⁇ 4 size, thereby forming three levels of The cascading coding process is similar to the embodiment, and will not be described again.
  • FIG. 8 is a flowchart of a video image decoding processing method according to an eighth embodiment of the present invention.
  • This embodiment is a decoding end processing flow corresponding to the encoding processing method provided by the seventh embodiment of the present invention.
  • the video image decoding processing method in this embodiment includes:
  • Step 81 Acquire a code stream of the sub-blocks 30, 31, 32, and 33 included in the macro block of the video image frame and the coded bit information of the blocks 0, 1, and 2.
  • Step 82 Decode the coded bit information of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2, respectively, to obtain residual decoding coefficients of each sub-block or block.
  • the decoding process of the encoded bit information of any sub-block or block may include: entropy decoding, inverse quantization, and inverse transform, etc., which is equivalent to the inverse processing of step 79.
  • step 82 may include step 821 - step 823.
  • Step 821 performing entropy decoding processing on the coded bit information of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2.
  • the encoding bit information of each block is subjected to corresponding entropy decoding processing, and the block using the intra prediction coding mode for predictive coding has non-zero coefficients.
  • the probability of the block larger than the predictive coding using the inter-plot predictive coding mode such as: the probability that the sub-block 33 residual decoding coefficient has a non-zero coefficient is greater than other sub-blocks 30, 31, and 32 or block 0.
  • the residual decoding coefficients of 1 and 2 have a probability of non-zero coefficients.
  • the entropy decoding implementation mechanism is similar to the steps 421 to 423, and details are not described herein.
  • Step 822 Perform inverse quantization processing on the coefficients of the sub-blocks 30, 31, 32, and 33 and the coded bit information of the blocks 0, 1, and 2 subjected to entropy decoding processing.
  • the QP value assignment principle for each sub-block and each block is: Determine the QP value based on the reference relationship.
  • the reference relationship existing in the subsequent decoding process may be determined in advance, for example, there is a following reference relationship: block 0 refers to block 1/2, block 1/2 refers to block 3, and in block 3, subblock 0 refers to sub-block 31/32, sub-block 31/32 refers to sub-block 33.
  • the QP value of the block to be referenced is given a smaller QP value.
  • the QP value of the available block satisfies the following formula:
  • the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the QP value of each block:
  • QP (subblock 33) QP (Slice) -10
  • QP represents the QP value corresponding to the slice of the video image
  • the QP value corresponding to the slice can be obtained in advance.
  • Step 823 Perform inverse transform processing on the coefficients of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2 after the inverse quantization to obtain a residual.
  • Step 83 Perform intra prediction decoding processing on the sub-block 33, and add the predicted value to the residual of the sub-block 33 to obtain a decoded image value of the sub-block 33.
  • Step 84 Perform prediction decoding processing on the sub-blocks 31 and 32 respectively with reference to the decoded image values of the sub-blocks 33, and obtain predicted values of the sub-blocks 31 and 32, and predict the values of the sub-blocks 31 and 32 and their respective residuals. The difference is added to obtain decoded image values of the sub-blocks 31 and 32.
  • Step 85 Perform prediction decoding processing on the sub-block 30 with reference to the decoded image values of the sub-blocks 31 and/or 32, obtain a prediction value of the sub-block 30, and add the predicted value of the sub-block 30 and its residual to obtain a sub- The decoded image value of block 30.
  • Step 86 Perform upsampling synthesis processing on the decoded image values of the respective sub-blocks 30, 31, 32, and 33 to obtain decoded image values of the block 3.
  • Steps 87 to 89 are the same as steps 44 to 46, and are not described here.
  • the video image encoding processing method of this embodiment implements the technical effects of the fourth embodiment of the present invention, and the encoding end desamples the block 3 again to obtain a plurality of sub-blocks (ie, sub-blocks 30, 31, 32, and 33), and the decoding end.
  • the respective cascading prediction reference relationships are established for different layers such as the sub-block layer and the block layer, which is equivalent to forming two levels of cascading decoding processing.
  • this embodiment can implement decoding of a video image after encoding and compressing fully utilizing pixel correlation.
  • FIG. 9 is a schematic structural diagram of a video image encoding processing apparatus according to a ninth embodiment of the present invention.
  • the video image encoding processing apparatus of this embodiment includes: a downsampling decomposition module 91, an intra prediction encoding module 92, a first inter-pixel predictive encoding module 93, and a second inter-pixel predictive encoding module 94.
  • the downsampling decomposition module 91 is configured to downsample the macroblock of the video image frame to obtain a plurality of blocks, and the plurality of blocks include at least: a first block, a second block, and a third block.
  • the pixel in the first block contains at least the pixel in the lowermost corner of the macroblock.
  • the pixel in the lowermost corner of the second block is in the horizontal or vertical direction from the pixel in the lowermost corner of the first block.
  • the pixel in the lowermost corner of the third block is diagonal to the pixel in the lowermost corner of the first block.
  • the intra prediction encoding module 92 is configured to perform intra prediction encoding processing on the first block to obtain a reconstructed image value of the first block.
  • the first inter-pixel predictive coding module 93 is configured to perform predictive coding on the second block with reference to the reconstructed image value of the first block to obtain a reconstructed image value of the second block.
  • the second inter-pixel predictive coding module 94 is configured to perform predictive coding processing on the third block with reference to the reconstructed image value of the second block.
  • the first inter-pixel predictive coding module 93 may be further configured to perform interpolation processing on the reconstructed image values of the first block, such as: performing interpolation processing on the reconstructed image values of the first block, and interpolating the pixel values of the corresponding positions as the second block.
  • the predicted value constructs the reconstructed image value of the second block from the predicted value of the second block and its prediction residual.
  • the second inter-pixel predictive coding module 94 may be further configured to perform interpolation processing on the reconstructed image values of the second block, such as performing interpolation processing on the reconstructed image values of the second block, and interpolating the pixel values of the corresponding positions as the third block.
  • the predicted value constructs the reconstructed image value of the third block from the predicted value of the third block and its prediction residual.
  • the video image coding processing apparatus performs down-sampling decomposition on the macroblock to obtain a plurality of blocks, and combines the intra-prediction coding technology and the inter-pixel prediction coding technology in performing predictive coding on the plurality of blocks. At least two levels of cascading prediction reference relationships are formed in the inter-pixel predictive coding process, so that correlation between pixels can be fully utilized, and prediction accuracy and coding compression efficiency of video images are improved.
  • the video image encoding processing apparatus of this embodiment refer to the description of the foregoing video image encoding processing method embodiment, and details are not described herein again.
  • FIG. 10 is a schematic structural diagram of a video image decoding processing apparatus according to a tenth embodiment of the present invention. As shown
  • the video image decoding processing apparatus of this embodiment includes: a bit information acquiring module 101, an intra prediction decoding module 102, a first inter-pixel predictive decoding module 103, a second inter-pixel predictive decoding module 104, and a decoded image generating module 105. .
  • the bit information obtaining module 101 is configured to obtain coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block.
  • the pixels in the first block include at least pixels in the lower right corner of the macroblock.
  • the pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block.
  • the pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
  • the pixels in the first block may include at least pixels in the upper left corner of the macroblock.
  • the pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block.
  • the pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.
  • the intra prediction decoding module 102 is configured to perform intra prediction decoding processing on the coded bit information of the first block to obtain a decoded image value of the first block.
  • the first inter-pixel predictive decoding module 103 is configured to perform predictive decoding processing on the coded bit information of the second block with reference to the decoded picture value of the first block to obtain a decoded picture value of the second block.
  • the second inter-pixel prediction decoding module 104 is configured to use the decoded image value of the second block as a reference, for the third
  • the coded bit information of the block is subjected to predictive decoding processing to obtain a decoded image value of the third block.
  • the decoded image generation module 105 is configured to perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock.
  • the first inter-pixel prediction decoding module 103 is further configured to perform interpolation processing on the decoded image values of the first block, for example, performing interpolation processing on the reconstructed image values of the first block, and the pixel values of the corresponding positions after interpolation are used as the second block.
  • the predicted value constructs the decoded image value of the second block from the predicted value of the second block and its residual decoding coefficient.
  • the second inter-pixel predictive decoding module 104 can also be used to interpolate the decoded image values of the second block. For example, the reconstructed image value of the second block is subjected to interpolation processing, and the pixel value of the corresponding position after interpolation is used as the predicted value of the third block, and the decoded image of the third block is constructed based on the predicted value of the third block and its residual decoding coefficient. value.
  • the video image decoding processing apparatus combines the intra prediction decoding technology with the inter-pixel prediction decoding technology by performing downsampling decomposition on the macroblock to obtain a plurality of blocks, and forms at least two in the inter-pixel prediction decoding process.
  • the hierarchical cascading predicts the reference relationship, thus facilitating the full utilization of pixel correlation between blocks to achieve reconstruction of the encoded compressed video image.
  • FIG. 11 is a schematic structural diagram of a video image codec system according to an eleventh embodiment of the present invention.
  • the video image encoding and decoding system of this embodiment includes: a video image encoding processing device 111 and a video image decoding processing device 112.
  • the video image encoding processing device 111 is configured to downsample the macroblock of the video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block; performing intraframe on the first block Predicting the encoding process to obtain the reconstructed image value of the first block; performing the predictive encoding process on the second block with reference to the reconstructed image value of the first block to obtain the reconstructed image value of the second block;
  • the reconstructed image value is a reference, and the third block is subjected to predictive coding processing.
  • the video image decoding processing device 112 is configured to acquire coded bit information of the macro block, wherein the macro block includes at least: the first block, the second block, and the third block; and performing intra prediction decoding on the coded bit information of the first block. Processing, obtaining a decoded image value of the first block; performing predictive decoding processing on the encoded bit information of the second block with reference to the decoded image value of the first block to obtain a decoded image value of the second block; decoding by the second block The image value is a reference, and the coded bit information of the third block is subjected to predictive decoding processing to obtain a decoded image value of the third block; and the decoded image value of each block in the macro block is subjected to upsampling synthesis processing to obtain a decoded image of the macroblock. value.
  • the pixels in the first block include at least pixels in the lower right corner of the macroblock.
  • the pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block.
  • the pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
  • the pixels in the first block may include at least pixels in the upper left corner of the macroblock.
  • the pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block.
  • the pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.
  • the macroblock is downsampled and decomposed to obtain a plurality of blocks, and the intra prediction technique and the interpixel prediction technology are combined in the process of predictive coding or decoding of the plurality of blocks to form at least
  • the second-level cascading prediction reference relationship can make full use of the correlation between pixels, improve the prediction accuracy and the coding compression efficiency of video images.
  • the video image decoding processing apparatus refines the structure.
  • the mechanism for implementing video image coding and decoding can be referred to the description of the corresponding embodiment in FIG. 1 to FIG. 8 , and details are not described herein again.
  • modules in the apparatus in the embodiments may be distributed in the apparatus of the embodiment according to the embodiment, or may be correspondingly changed in one or more apparatuses different from the embodiment.
  • the modules of the above embodiments may be combined into one module, or may be further split into a plurality of sub-modules.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method, apparatus and encoding-decoding system for video image encoding processing and decoding processing are provided by the present invention. The encoding processing method includes: performing down-sampling disassembly on the macro block of a video image frame to obtain a plurality of blocks which at least include a first block, a second block and a third block; performing intra-frame predictive encoding processing on the first block to obtain the reconstructed image value of the first block; by using the reconstructed image value of the first block as a reference, performing predictive encoding processing on the second block to obtain the reconstructed image value of the second block; by using the reconstructed image value of the second block as a reference, performing predictive encoding processing on the third block. The embodiments of the present invention can sufficiently utilize the correlation among pixels, and improve prediction precision and the encoding compression efficiency of the video image.

Description

视频图像编码和解码的方法、 装置及系统 本申请要求于 2009 年 11 月 10 日提交中国专利局、 申请号为 200910211704. 2 , 发明名称为 "视频图像编码处理、 解码处理方法和装置及 编解码系统" 的中国专利申请的优先权,本申请还要求于 2010年 4月 9 日提交 中国专利局、 申请号为 201010147573. 9、 发明名称为 "视频图像编码处理、 解码处理方法和装置及编解码系统" 的中国专利申请的优先权, 以上两个在 先专利申请的内容通过引用结合在本申请中。 技术领域  The present invention claims to be submitted to the Chinese Patent Office on November 10, 2009, and the application number is 200910211704. 2 , and the invention name is "video image encoding processing, decoding processing method and device, and codec. The priority of the Chinese patent application of the system is also required to be submitted to the Chinese Patent Office on April 9, 2010, and the application number is 201010147573. 9. The invention name is "video image encoding processing, decoding processing method and device, and codec. The priority of the Chinese Patent Application, the entire disclosure of which is hereby incorporated by reference. Technical field

本发明涉及视频图像处理技术领域, 特别是涉及一种视频图像编码处理、 解码处理方法和装置及编解码系统。 背景技术  The present invention relates to the field of video image processing technologies, and in particular, to a video image encoding process, a decoding processing method and apparatus, and a codec system. Background technique

未来人们对高清视频图像的传送和存储的需求呈现爆炸式增长, 然而因特 网、 通信网、 广播网等信息网络基础设施的发展却相对滞后, 网络带宽仍然成 为一个瓶颈。 基于这一事实, 下一代视频编码标准将重点面向高清、 超高清视 频, 大幅度提高视频图像编码压缩效率, 以緩解高清视频图像传输的带宽需求 增长与信息网络基础设施的发展之间的矛盾。 如何提高视频图像编码压缩效率, 是新一代视频编码标准急需解决的核心问题。 基于块( Block )的混合视频编码框架是通用的视频压缩编码处理方案之一, 主要包括帧内编码( Intra coding )技术和帧间编码 ( Inter coding )技术。 帧内 编码技术利用空域相关性, 如使用帧内预测 (Intra prediction )技术去除空域冗 余信息, 帧内编码由当前帧自身信息即可完成, 不需要参考其他帧的信息, 即 不需要参考帧。 帧间编码技术利用时域相关性, 如利用运动补偿预测技术去除 时域冗余信息, 该技术需要根据参考帧的信息进行预测编码。 采用上述编码技 术可达到较高的视频图像编码的压缩效率。 In the future, the demand for the transmission and storage of high-definition video images will explode. However, the development of information network infrastructure such as the Internet, communication networks, and broadcast networks has lagged behind, and network bandwidth has become a bottleneck. Based on this fact, the next-generation video coding standard will focus on high-definition, ultra-high-definition video, and greatly improve the compression efficiency of video image coding to alleviate the contradiction between the growth of bandwidth demand for high-definition video image transmission and the development of information network infrastructure. How to improve the compression efficiency of video image coding is a core problem that needs to be solved in the new generation of video coding standards. Block-based hybrid video coding framework is one of the general video compression coding processing schemes, mainly including intra coding technology and inter coding technology. The intra-frame coding technique utilizes spatial correlation, such as the use of intra prediction (Intra prediction) technology to remove spatial redundancy information. The intra-frame coding can be completed by the current frame information itself, without reference to other frame information, ie No reference frames are required. The interframe coding technique utilizes time domain correlation, such as the use of motion compensated prediction techniques to remove time domain redundancy information, which requires predictive coding based on information of the reference frame. The above coding technique can achieve higher compression efficiency of video image coding.

为了获得较高的视频编码压缩效率, 一种现有技术是利用已编码的相邻块 之间的相关性进行预测编码, 将对图像分量的预测模式及其相应的预测残差信 息写入码流, 以便解码端根据该预设模式及其相应的预测残差信息进行解码。 另一种现有技术是: 利用相邻像素之间的相关性进行帧内预测编码, 如: 对尺 寸为 8 X 8 (单位为像素)的块进行下采样分解, 得到 4个尺寸为 4 X 4的块, 对 其中一个尺寸为 4 X 4的块进行帧内编码得到该块的重构图像, 以该块的重构图 像为参考, 对其他尺寸为 4 X 4的块进行预测编码。  In order to obtain higher video coding compression efficiency, one prior art performs predictive coding by using correlation between coded neighboring blocks, and writes a prediction mode of the image component and its corresponding prediction residual information into the code. Streaming, so that the decoding end decodes according to the preset mode and its corresponding prediction residual information. Another prior art is: intra prediction coding using correlation between adjacent pixels, such as: downsampling a block of size 8 X 8 (in pixels) to obtain 4 sizes of 4 X The block of 4, intra-coded a block of size 4 X 4 to obtain a reconstructed image of the block, and predictively encodes other blocks of size 4×4 with reference to the reconstructed image of the block.

发明人在实现本发明实施例过程中发现, 现有技术并没有充分利用像素间 的相关性, 视频编码压缩效率比较低。 发明内容  The inventors found in the process of implementing the embodiments of the present invention that the prior art does not fully utilize the correlation between pixels, and the video encoding compression efficiency is relatively low. Summary of the invention

本发明提供一种视频图像编码处理、 解码处理方法和装置及编解码系统, 以提高视频图像编码的压缩效率。  The invention provides a video image encoding process, a decoding processing method and device, and a codec system to improve the compression efficiency of video image coding.

本发明提供了一种视频图像编码处理方法, 包括:  The present invention provides a video image encoding processing method, including:

对视频图像帧的宏块进行下采样分解得到多个块, 所述多个块至少包括: 第 1块、 第 2块和第 3块;  Downsampling the macroblock of the video image frame to obtain a plurality of blocks, the plurality of blocks including at least: a first block, a second block, and a third block;

对所述第 1块进行帧内预测编码处理, 得到所述第 1块的重构图像值; 以所述第 1块的重构图像值为参考, 对所述第 2块进行预测编码处理, 得 到所述第 2块的重构图像值;  Performing intra prediction encoding processing on the first block to obtain a reconstructed image value of the first block; performing prediction encoding processing on the second block with reference to the reconstructed image value of the first block; Obtaining a reconstructed image value of the second block;

以所述第 2块的重构图像值为参考, 对所述第 3块进行预测编码处理。 本发明还提供了一种视频图像解码处理方法, 包括: 获取视频图像帧的宏块的编码比特信息, 所述宏块至少包括: 第 1块, 第 2 块和第 3块; The third block is subjected to predictive coding processing with reference to the reconstructed image value of the second block. The present invention also provides a video image decoding processing method, including: Obtaining coded bit information of a macroblock of a video image frame, the macroblock including at least: a first block, a second block, and a third block;

对所述第 1块的编码比特信息进行帧内预测解码处理, 得到所述第 1块的 解码图像值;  Performing intra prediction decoding processing on the coded bit information of the first block to obtain a decoded image value of the first block;

以所述第 1块的解码图像值为参考, 对所述第 2块的编码比特信息进行预 测解码处理, 得到第 2块的解码图像值;  Referring to the decoded image value of the first block, performing prediction decoding processing on the encoded bit information of the second block to obtain a decoded image value of the second block;

以所述第 2块的解码图像值为参考, 对所述第 3块的编码比特信息进行预 测解码处理, 得到第 3块的解码图像值;  Deriving and decoding the coded bit information of the third block with reference to the decoded picture value of the second block to obtain a decoded picture value of the third block;

将所述宏块中各块的解码图像值进行上采样合成处理, 得到所述宏块的解 码图像值。  The decoded image values of the blocks in the macroblock are subjected to upsampling synthesis processing to obtain decoded image values of the macroblocks.

本发明还提供了一种视频图像编码处理装置, 包括:  The present invention also provides a video image encoding processing apparatus, including:

下采样分解模块, 用于对视频图像帧的宏块进行下采样分解得到多个块, 所述多个块至少包括: 第 1块、 第 2块和第 3块;  a downsampling and decoding module, configured to perform downsampling on a macroblock of a video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block;

帧内预测编码模块, 用于对所述第 1 块进行帧内预测编码处理, 得到所述 第 1块的重构图像值;  An intra prediction coding module, configured to perform intra prediction coding processing on the first block to obtain a reconstructed image value of the first block;

第一像素间预测编码模块, 用于以所述第 1 块的重构图像值为参考, 对所 述第 2块进行预测编码处理, 得到所述第 2块的重构图像值;  a first inter-pixel predictive coding module, configured to perform predictive coding processing on the second block with reference to a reconstructed image value of the first block, to obtain a reconstructed image value of the second block;

第二像素间预测编码模块, 用于以所述第 2块的重构图像值为参考, 对所 述第 3块进行预测编码处理。  The second inter-pixel predictive coding module is configured to perform predictive coding processing on the third block with reference to the reconstructed image value of the second block.

本发明还提供了一种视频图像解码处理装置, 包括:  The present invention also provides a video image decoding processing apparatus, including:

比特信息获取模块, 用于获取视频图像帧的宏块的编码比特信息, 所述宏 块至少包括: 第 1块, 第 2块和第 3块;  a bit information acquiring module, configured to acquire coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block;

帧内预测解码模块, 用于对所述第 1 块的编码比特信息进行帧内预测解码 处理, 得到所述第 1块的解码图像值; An intra prediction decoding module, configured to perform intra prediction decoding on the coded bit information of the first block Processing, obtaining a decoded image value of the first block;

第一像素间预测解码模块, 用于以所述第 1 块的解码图像值为参考, 对所 述第 2块的编码比特信息进行预测解码处理, 得到第 2块的解码图像值;  The first inter-pixel predictive decoding module is configured to perform predictive decoding processing on the encoded bit information of the second block with reference to the decoded image value of the first block to obtain a decoded image value of the second block;

第二像素间预测解码模块, 用于以所述第 2块的解码图像值为参考, 对所 述第 3块的编码比特信息进行预测解码处理, 得到第 3块的解码图像值;  a second inter-pixel prediction decoding module, configured to perform predictive decoding processing on the encoded bit information of the third block with reference to the decoded image value of the second block, to obtain a decoded image value of the third block;

解码图像生成模块, 用于将所述宏块中各块的解码图像值进行上采样合成 处理, 得到所述宏块的解码图像值。  And a decoded image generating module, configured to perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock.

本发明还提供了一种视频图像编解码系统, 包括:  The invention also provides a video image coding and decoding system, comprising:

视频图像编码处理装置, 用于对视频图像帧的宏块进行下采样分解得到多 个块, 所述多个块至少包括: 第 1块、 第 2块和第 3块; 对所述第 1块进行帧 内预测编码处理, 得到所述第 1块的重构图像值; 以所述第 1块的重构图像值 为参考, 对所述第 2块进行预测编码处理, 得到所述第 2块的重构图像值; 以 所述第 2块的重构图像值为参考, 对所述第 3块进行预测编码处理;  a video image encoding processing apparatus, configured to downsample a macroblock of a video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block; Performing intra prediction coding processing to obtain a reconstructed image value of the first block; performing prediction encoding processing on the second block with reference to the reconstructed image value of the first block to obtain the second block a reconstructed image value; performing predictive encoding processing on the third block with reference to the reconstructed image value of the second block;

视频图像解码处理装置, 用于获取所述宏块的编码比特信息, 所述宏块至 少包括: 所述第 1块, 所述第 2块和所述第 3块; 对所述第 1块的编码比特信 息进行帧内预测解码处理, 得到所述第 1块的解码图像值; 以所述第 1块的解 码图像值为参考, 对所述第 2 块的编码比特信息进行预测解码处理, 得到第 2 块的解码图像值; 以所述第 2块的解码图像值为参考, 对所述第 3块的编码比 特信息进行预测解码处理, 得到第 3 块的解码图像值; 将所述宏块中各块的解 码图像值进行上采样合成处理, 得到所述宏块的解码图像值。  a video image decoding processing apparatus, configured to acquire coded bit information of the macroblock, where the macroblock includes at least: the first block, the second block, and the third block; and the first block The coded bit information is subjected to intra prediction decoding processing to obtain a decoded image value of the first block; and the coded bit information of the second block is subjected to predictive decoding processing with reference to the decoded image value of the first block. a decoded image value of the second block; performing predictive decoding processing on the encoded bit information of the third block with reference to the decoded image value of the second block to obtain a decoded image value of the third block; The decoded image values of the respective blocks are subjected to upsampling synthesis processing to obtain decoded image values of the macroblocks.

本发明实施例对宏块进行下采样分解得到多个块, 对多个块进行预测编码 或解码过程中将帧内预测技术与像素间预测技术相结合, 形成了至少二级的级 联式预测参考关系, 因此可充分利用像素间的相关性, 提高预测精度以及视频 图像的编码压缩效率。 附图说明 In the embodiment of the present invention, the macroblock is downsampled and decomposed to obtain a plurality of blocks, and the intra prediction technique and the interpixel prediction technique are combined in the process of predictive coding or decoding of the plurality of blocks, and at least two levels of cascade prediction are formed. Reference relationship, so you can make full use of the correlation between pixels, improve prediction accuracy and video The coding compression efficiency of the image. DRAWINGS

为了更清楚地说明本发明实施例或现有技术中的技术方案, 下面将对实施 例或现有技术描述中所需要使用的附图作简单地介绍, 显而易见地, 下面描述 中的附图仅仅是本发明的一些实施例, 对于本领域普通技术人员来讲, 在不付 出创造性劳动性的前提下, 还可以根据这些附图获得其他的附图。  In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any inventive labor.

图 1为本发明第一实施例提供的视频图像编码处理方法流程图;  1 is a flowchart of a video image encoding processing method according to a first embodiment of the present invention;

图 2为本发明第二实施例提供的视频图像解码处理方法流程图;  2 is a flowchart of a video image decoding processing method according to a second embodiment of the present invention;

图 3a为本发明第三实施例提供的视频图像编码处理方法流程图; 图 3b为本发明第三实施例提供的视频图像编码处理方法逻辑实现框图; 图 3c为本发明第三实施例提供的宏块下采样分解示意图;  FIG. 3 is a flowchart of a video image encoding processing method according to a third embodiment of the present invention; FIG. 3b is a block diagram of a logical implementation of a video image encoding processing method according to a third embodiment of the present invention; Schematic diagram of macroblock downsampling;

图 3d为本发明第三实施例提供的像素间预测编码插值处理分解示意图; 图 4为本发明第四实施例提供的视频图像解码处理方法流程图;  FIG. 3 is a schematic diagram of an inter-pixel prediction coding interpolation process according to a third embodiment of the present invention; FIG. 4 is a flowchart of a video image decoding processing method according to a fourth embodiment of the present invention;

图 5a为本发明第五实施例提供的视频图像编码处理方法流程图; 图 5b为本发明第五实施例提供的宏块分解示意图;  5a is a flowchart of a video image encoding processing method according to a fifth embodiment of the present invention; FIG. 5b is a schematic diagram of a macroblock decomposition according to a fifth embodiment of the present invention;

图 6为本发明第六实施例提供的视频图像解码处理方法流程图;  FIG. 6 is a flowchart of a video image decoding processing method according to a sixth embodiment of the present invention;

图 7a为本发明第七实施例提供的视频图像编码处理方法流程图; 图 7b为本发明第七实施例提供的宏块下采样分解示意图;  FIG. 7 is a flowchart of a video image encoding processing method according to a seventh embodiment of the present invention; FIG. 7b is a schematic diagram of a macroblock down sampling decomposition according to a seventh embodiment of the present invention;

图 8为本发明第八实施例提供的视频图像解码处理方法流程图;  8 is a flowchart of a video image decoding processing method according to an eighth embodiment of the present invention;

图 9为本发明第九实施例提供的视频图像编码处理装置结构示意图; 图 10为本发明第十实施例提供的视频图像解码处理装置结构示意图; 图 11为本发明第十一实施例提供的视频图像编解码系统结构示意图。 具体实施方式 9 is a schematic structural diagram of a video image encoding processing apparatus according to a ninth embodiment of the present invention; FIG. 10 is a schematic structural diagram of a video image decoding processing apparatus according to a tenth embodiment of the present invention; Schematic diagram of video image codec system structure. detailed description

下面将结合本发明实施例中的附图, 对本发明实施例中的技术方案进行清 楚、 完整地描述, 显然, 所描述的实施例仅仅是本发明一部分实施例, 而不是 全部的实施例。 基于本发明中的实施例, 本领域普通技术人员在没有付出创造 性劳动前提下所获得的所有其他实施例, 都属于本发明保护的范围。  BRIEF DESCRIPTION OF THE DRAWINGS The technical solutions in the embodiments of the present invention will be described in detail with reference to the accompanying drawings. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without any creative work are within the scope of the present invention.

图 1为本发明第一实施例提供的视频图像编码处理方法流程图。 如图 1所 示, 本实施例视频图像编码处理方法包括:  FIG. 1 is a flowchart of a video image encoding processing method according to a first embodiment of the present invention. As shown in FIG. 1, the video image encoding processing method in this embodiment includes:

步骤 11、 对视频图像帧的宏块进行下采样分解得到多个块, 上述多个块至 少包括: 第 1块、 第 2块和第 3块。  Step 11. Down-sampling the macroblock of the video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block.

在本发明实施例中,所述第 1块中的像素至少包含该宏块中右下角的像素。 所述第 2块中的右下角的像素与所述第 1块中的右下角的像素在水平或垂直方 向上。 所述第 3块中的右下角的像素与所述第 1块中的右下角的像素在处于所 述宏块的对角线方向上。  In an embodiment of the invention, the pixels in the first block include at least pixels in the lower right corner of the macroblock. The pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.

当然, 在其他可选择的实施例中, 也可以为, 所述第 1块中的像素至少包 含该宏块中左上角的像素。 所述第 2块中的左上角的像素与所述第 1块中的左 上角的像素在水平或垂直方向上。 所述第 3块中的左上角的像素与所述第 1块 中的左上角的像素处于所述宏块的对角线方向上。  Of course, in other optional embodiments, the pixels in the first block may include at least pixels in the upper left corner of the macroblock. The pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block. The pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.

可根据视频图像帧的实际编码需求确定下采样系数, 根据下采样系数对宏 块进行下采样分解得到多个块, 假设下采样系数为丄, 则对宏块下采样分解后  The downsampling coefficient can be determined according to the actual coding requirement of the video image frame, and the macroblock is downsampled and decomposed according to the downsampling coefficient to obtain a plurality of blocks. If the downsampling coefficient is 丄, the macroblock is downsampled and decomposed.

M  M

可得到 个块。 在 A 2个块进行分组, 例如可根据下采样过程中抽取的、 组成各 块像素之间的位置关系将 个块分为 N组, 每组由一个或多个块组成, N为大 于或等于 3的整数。 第一组包括至少一个第 1块, 第二组包括至少一个第 2块, 第三组包括至少一个第 3块 ......以此类推, 直至完成 个块的分组处理。 可选 的, 在对 A 2个块进行分组过程中, 可根据抽取像素的位置关系是否相邻的原则 进行分组, 如将与同一像素相邻的像素组成的不同块划分为同一组等。 You can get a block. Grouping in A 2 blocks, for example, dividing the blocks into N groups according to the positional relationship between the pixels constituting each block extracted in the downsampling process, each group consisting of one or more blocks, and N is greater than or equal to An integer of 3. The first group includes at least one first block, the second group includes at least one second block, the third group includes at least one third block, and so on, until packet processing of the blocks is completed. Optional In the process of grouping A 2 blocks, grouping may be performed according to whether the positional relationship of the extracted pixels is adjacent, for example, dividing different blocks composed of pixels adjacent to the same pixel into the same group.

步骤 12、 对第 1块进行帧内预测编码处理, 得到第 1块的重构图像值。 本步骤可利用帧内已编码的其他宏块的像素值对第 1块进行帧内预测编码 处理, 得到第 1块的重构图像值。 重构图像值由第 1块的预测值及其编码处理 后的预测残差组成。  Step 12: Perform intra prediction coding processing on the first block to obtain a reconstructed image value of the first block. In this step, the first block may be subjected to intra prediction encoding processing by using pixel values of other macroblocks encoded in the frame to obtain a reconstructed image value of the first block. The reconstructed image value consists of the predicted value of the first block and the predicted residual after the encoding process.

步骤 13、 以第 1块的重构图像值为参考, 对第 2块进行预测编码处理, 得 到第 2块的重构图像值。  Step 13. Perform a predictive coding process on the second block with reference to the reconstructed image value of the first block, and obtain a reconstructed image value of the second block.

可选的, 在对第 2块进行预测编码处理过程中, 可包括对第 1块的重构图 像值进行插值处理, 插值后对应位置的像素值作为第 2 块的预测值, 根据第 2 块的预测值及其预测残差构造第 2块的重构图像值。  Optionally, during the predictive coding process on the second block, the reconstructed image value of the first block may be interpolated, and the pixel value of the corresponding position after interpolation is used as the predicted value of the second block, according to the second block. The predicted value and its prediction residual construct the reconstructed image value of the second block.

步骤 14、 以第 2块的重构图像值为参考, 对第 3块进行预测编码处理。 可选的, 在对第 2块进行预测编码处理过程中, 可包括对第 2块的重构图 像值进行插值处理, 插值后对应位置的像素值作为第 3 块的预测值, 根据第 3 块的预测值及其预测残差构造第 3块的重构图像值。  Step 14. Perform prediction encoding processing on the third block with reference to the reconstructed image value of the second block. Optionally, during the predictive coding process on the second block, the reconstructed image value of the second block may be interpolated, and the pixel value of the corresponding position after interpolation is used as the predicted value of the third block, according to the third block. The predicted value and its prediction residual construct the reconstructed image value of the third block.

本实施例中第 1块、 第 2块和第 3块由对宏块的所有像素以预设的抽取规 则抽取的像素分别组成, 如在水平和垂直方向或其他方向上每隔 1 个或多个像 素抽取的像素分别组成, 因此各块的像素在空间位置上相互交织。 上述步骤 13 和步骤 14中, 由于编码的块(第 2/3块 )与其参考的块(第 1块) 的像素在空 间位置上是互相交织在一起的, 因此他们之间的预测可以充分利用像素之间的 相关性, 从而提高预测精度。 可见, 这是一种像素间预测方法, 不同于步骤 12 的帧内预测方法(其实际上是一种块间的预测方法)。 采用像素间预测编码方式 对除了第 1块之外的其他块进行预测编码, 第 2块是以第 1块的重构图像值为 参考进行预测编码, 第 3块是以第 2块的重构图像值为参考进行预测编码, 可 选的, 宏块下采样分解得到的多组块之间, 也可建立相似的预测参考关系, 这 样, 在对下采样分解得到的多个块进行预测编码过程中, 形成了至少二级的级 联式预测参考关系。 In this embodiment, the first block, the second block, and the third block are respectively composed of pixels extracted by using a preset extraction rule for all pixels of the macro block, such as every other or more in the horizontal and vertical directions or other directions. The pixels extracted by the pixels are respectively composed, and thus the pixels of the respective blocks are interlaced with each other in the spatial position. In the above steps 13 and 14, since the coded block (the 2/3th block) and the pixel of the reference block (the first block) are spatially interlaced with each other, the prediction between them can be fully utilized. Correlation between pixels, thereby improving prediction accuracy. It can be seen that this is an inter-pixel prediction method, which is different from the intra prediction method of step 12 (which is actually an inter-block prediction method). The inter-pixel predictive coding method is used to predictively encode other blocks than the first block, and the second block is the reconstructed image value of the first block. Referring to the prediction coding, the third block is predictive coding based on the reconstructed image value of the second block. Alternatively, a similar prediction reference relationship may be established between the plurality of blocks obtained by the macroblock downsampling decomposition. Thus, in the process of predictive coding of a plurality of blocks obtained by downsampling decomposition, at least two levels of cascaded prediction reference relationships are formed.

在上述技术方案的基础上, 进一步的, 可将对宏块中各块的预测残差进行 编码处理得到的各块的编码比特信息写入码流。 需要说明的是, 这里以对各个 块的预测残差信息写入码流的情况为例进行说明。 实际上, 也可以允许对各个 块的预测残差信息不写入码流的情况, 或者, 第 1 块的预测残差信息写入码流 而第 2块和第 3块的预测残差信息不写入码流的情况, 或者其他组合。  Further, in addition to the above technical solution, further, coded bit information of each block obtained by encoding the prediction residual of each block in the macroblock may be written into the code stream. It should be noted that the case where the coded stream is written to the prediction residual information of each block will be described as an example. Actually, it is also possible to allow the prediction residual information of each block not to be written into the code stream, or the prediction residual information of the first block is written into the code stream and the prediction residual information of the second block and the third block is not. The case of writing a stream, or other combination.

本实施例以按照第 1块、 第 2块、 第 3块的顺序进行编码为例。 进一步的, 基于相同的原理, 还可以按照第 3块、 第 2块、 第 1块的顺序或者第 1块、 第 3 块、 第 2块的顺序进行编码。 编码过程不再赘述。  This embodiment is exemplified by encoding in the order of the first block, the second block, and the third block. Further, based on the same principle, encoding may be performed in the order of the third block, the second block, the first block, or the first block, the third block, and the second block. The encoding process will not be described again.

本实施例视频图像编码处理方法对宏块进行下采样分解得到多个块, 多个 块之间形成级联的参考关系, 因此可充分利用像素间的相关性, 提高预测精度, 同时, 本实施例作为一种宏块层(MB level )的编码技术, 可以在宏块层上更好 的适应图像内不同区域的紋理特性, 从而提高编码压缩效率。  In the video image coding processing method of the embodiment, the macroblock is downsampled and decomposed to obtain a plurality of blocks, and a cascaded reference relationship is formed between the plurality of blocks, so that correlation between pixels can be fully utilized to improve prediction accuracy, and at the same time, the implementation As an encoding technique of macroblock layer (MB level), the texture characteristics of different regions in the image can be better adapted on the macroblock layer, thereby improving coding compression efficiency.

本实施例中宏块的尺寸可以为 16 X 16、 32 X 32、 64 X 64或 128 X 128, 或其 他尺寸等。 除非特别说明, 后续实施例提到的宏块均指此含义。  The size of the macro block in this embodiment may be 16 X 16, 32 X 32, 64 X 64 or 128 X 128, or the like. Macroblocks mentioned in the subsequent embodiments refer to this meaning unless otherwise stated.

图 2为本发明第二实施例提供的视频图像解码处理方法流程图。 本实施例 为与图 1所示的编码处理方法对应的解码端处理流程。 如图 2所示, 本实施例 视频图像解码处理方法包括:  FIG. 2 is a flowchart of a video image decoding processing method according to a second embodiment of the present invention. This embodiment is a decoding end processing flow corresponding to the encoding processing method shown in Fig. 1. As shown in FIG. 2, the video image decoding processing method in this embodiment includes:

步骤 21、 获取视频图像帧的宏块的编码比特信息, 该宏块至少包括: 第 1 块, 第 2块和第 3块。 在本发明实施例中,所述第 1块中的像素至少包含该宏块中右下角的像素。 所述第 2块中的右下角的像素与所述第 1块中的右下角的像素在水平或垂直方 向上。 所述第 3块中的右下角的像素与所述第 1块中的右下角的像素在处于所 述宏块的对角线方向上。 Step 21: Acquire coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block. In an embodiment of the invention, the pixels in the first block include at least pixels in the lower right corner of the macroblock. The pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.

当然, 在其他可选择的实施例中, 也可以为, 所述第 1块中的像素至少包 含该宏块中左上角的像素。 所述第 2块中的左上角的像素与所述第 1块中的左 上角的像素在水平或垂直方向上。 所述第 3块中的左上角的像素与所述第 1块 中的左上角的像素处于所述宏块的对角线方向上。  Of course, in other optional embodiments, the pixels in the first block may include at least pixels in the upper left corner of the macroblock. The pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block. The pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.

宏块下采样分解方式如图 1对应实施例步骤 11所示, 在此不再赘述。 编码 端在向解码端发送的码流中携带有宏块的各块的编码比特信息, 解码端对各块 的比特信息进行解码, 可得各块的残差解码系数。  The macroblock downsampling decomposition mode is shown in step 11 of the corresponding embodiment, and is not described here. The code side carries the coded bit information of each block of the macro block in the code stream sent to the decoding end, and the decoding end decodes the bit information of each block to obtain the residual decoding coefficient of each block.

步骤 22、 对第 1块的编码比特信息进行帧内预测解码处理, 得到第 1块的 解码图像值。  Step 22: Perform intra prediction decoding processing on the coded bit information of the first block to obtain a decoded image value of the first block.

可利用帧内已解码的其他宏块的像素值对组成第 1块的编码比特信息进行 帧内预测解码处理, 得到第 1块的预测值及其残差, 将第 1块的预测值与及其 残差相加, 得到第 1块的解码图像值。 这是传统的帧内预测解码方法。  The intra-predictive decoding process of the coded bit information constituting the first block can be performed by using the pixel values of other macroblocks decoded in the frame, and the predicted value of the first block and its residual are obtained, and the predicted value of the first block is compared with The residuals are added to obtain the decoded image value of the first block. This is a traditional intra prediction decoding method.

步骤 23、 以第 1块的解码图像值为参考, 对第 2块的编码比特信息进行预 测解码处理, 得到第 2块的解码图像值。  Step 23: Perform prediction decoding processing on the coded bit information of the second block with reference to the decoded image value of the first block to obtain a decoded image value of the second block.

可选的, 在对第 2块的编码比特信息进行预测解码处理过程中, 可包括对 第 1块的重构图像值进行插值处理, 插值后对应位置的像素值作为第 2块的预 测值, 根据第 2块的预测值及其残差解码系数构造第 2块的解码图像值。  Optionally, in performing the predictive decoding process on the coded bit information of the second block, the method may include performing interpolation processing on the reconstructed image value of the first block, and the pixel value of the corresponding position after the interpolation is used as the predicted value of the second block. The decoded image value of the second block is constructed based on the predicted value of the second block and its residual decoding coefficient.

步骤 24、 以第 2块的解码图像值为参考, 对第 3块的编码比特信息进行预 测解码处理, 得到第 3块的解码图像值。 可选的, 在对第 3 块的编码比特信息进行预测解码处理过程中, 可包括对 第 2块的重构图像值进行插值处理, 插值后对应位置的像素值作为第 3块的预 测值, 根据第 3块的预测值及其残差解码系数构造第 3块的解码图像值。 Step 24: Perform prediction decoding processing on the coded bit information of the third block with reference to the decoded image value of the second block to obtain a decoded image value of the third block. Optionally, in performing the predictive decoding process on the encoded bit information of the third block, the method may include performing interpolation processing on the reconstructed image value of the second block, and the pixel value of the corresponding position after the interpolation is used as the predicted value of the third block. The decoded image value of the third block is constructed based on the predicted value of the third block and its residual decoding coefficient.

上述步骤 23和步骤 24中, 由于解码的块 (第 2/3块 )与其参考的块(第 1块) 的像素在空间位置上是互相交织在一起的, 他们之间的预测, 可以充分利用每一像素 之间的相关性, 从而提高预测精度。 可见, 这是一种像素间预测方法, 不同于步骤 22 的帧内预测方法(其实际上是一种块间的预测方法)。采用像素间预测解码方式对除了 第 1块 的其他块进行预测解码, 第 2块是以第 1块的解码图像值为参考进行预测 解码, 第 3块是以第 2块的解码图像值为参考进行预测解码, 可选的, 宏块包括的多 组块之间, 也可建立相似的解码预测关系, 这样, 在对下采样分解得到的多个块进行 预测解码过程中, 形成了至少二级的级联式预测参考关系。  In the above steps 23 and 24, since the decoded block (the 2/3th block) and the pixel of the reference block (the first block) are spatially interlaced with each other, the prediction between them can be fully utilized. Correlation between each pixel, thereby improving prediction accuracy. It can be seen that this is an inter-pixel prediction method, which is different from the intra prediction method of step 22 (which is actually an inter-block prediction method). The inter-pixel prediction decoding method is used for predictive decoding on other blocks except the first block, the second block is predictive decoding based on the decoded image value of the first block, and the third block is based on the decoded image value of the second block. Perform predictive decoding. Optionally, a similar decoding prediction relationship may be established between multiple blocks included in the macroblock, so that at least two stages are formed in the prediction decoding process of the plurality of blocks obtained by downsampling decomposition. Cascaded predictive reference relationships.

步骤 25、 将所述宏块中各块的解码图像值进行上采样合成处理, 得到所述 宏块的解码图像值。  Step 25: Perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock.

解码端进行的上采样合成处理过程是编码端进行下采样分解的逆过程, 即 本步骤是本发明第一实施例编码过程中的步骤 11的逆过程。  The upsampling synthesis process performed by the decoding end is an inverse process of the downsampling decomposition performed by the encoding end, that is, this step is the inverse process of the step 11 in the encoding process of the first embodiment of the present invention.

需要说明的是, 本实施例涉及的各个块的解码过程, 均以各个块存在预测 残差系数的情况为例进行说明。 实际上, 还可以允许各个块不存在预测残差系 数的情况, 或者, 第 1块存在预测残差系数而第 2块和第 3块不存在预测残差 系数的情况, 或者其他组合。  It should be noted that the decoding process of each block according to the present embodiment is described by taking the case where the prediction residual coefficient exists in each block as an example. Actually, it is also possible to allow the case where the prediction residual coefficient is not present for each block, or the case where the prediction coefficient is present in the first block and the prediction residual coefficient is not present in the second block and the third block, or other combinations.

本实施例以按照第 1块、 第 2块、 第 3块的顺序进行解码为例。 进一步的, 基于相同的原理, 还可以按照第 3块、 第 2块、 第 1块的顺序或者第 1块、 第 3 块、 第 2块的顺序进行解码。  This embodiment is exemplified by decoding in the order of the first block, the second block, and the third block. Further, based on the same principle, decoding may be performed in the order of the third block, the second block, the first block, or the first block, the third block, and the second block.

本实施例视频图像解码处理方法对宏块进行下采样分解得到多个块进行预 测解码过程中, 将帧内预测解码技术与像素间预测解码技术相结合, 像素间预 测解码过程中形成了至少二级的级联式预测参考关系, 因此可以实现充分利用 像素相关性进行编码压缩后的视频图像的解码。 图 3a为本发明第三实施例提供的视频图像编码处理方法流程图; 图 3b为 本发明第三实施例提供的视频图像编码处理方法逻辑实现框图。 本实施例基于 块的混合视频编码框架中, 对 16 X 16大小的宏块进行丄下采样分解为例进行说 In this embodiment, the video image decoding processing method performs downsampling decomposition on the macroblock to obtain a plurality of blocks for pre-preprocessing. In the process of decoding and decoding, the intra-prediction decoding technology is combined with the inter-pixel prediction decoding technology, and at least two levels of cascaded prediction reference relationships are formed in the inter-pixel prediction decoding process, so that the pixel correlation can be fully utilized for encoding compression. Decoding of the video image afterwards. FIG. 3 is a flowchart of a video image encoding processing method according to a third embodiment of the present invention; FIG. 3b is a logic block diagram of a video image encoding processing method according to a third embodiment of the present invention. In this embodiment, a block-based hybrid video coding framework is used to perform a downsampling decomposition of a 16×16 macroblock as an example.

2  2

明。 如图 3a所示, 本实施例视频图像编码处理方法包括: 步骤 31、 输入待编码的视频图像帧。 步骤 32、 以丄为下采样系数,对视频图像帧中 16 X 16大小的宏块进行下采 Bright. As shown in FIG. 3a, the video image encoding processing method of this embodiment includes: Step 31: Input a video image frame to be encoded. Step 32: Taking 丄 as a downsampling coefficient, performing a 16×16 macroblock in the video image frame

2  2

样分解, 得到 4个 8 X 8大小的块。 图 3c为本发明第三实施例提供的宏块下采样分解示意图。 如图 3c所示, 以丄为下采样系数, 对视频图像帧中 16 X 16大小的宏块进行下采样分解即为: 2 Decomposed into four blocks of 8 x 8 size. FIG. 3c is a schematic diagram of a macroblock downsampling decomposition according to a third embodiment of the present invention. As shown in Fig. 3c, using 丄 as the downsampling coefficient, downsampling the 16×16 macroblocks in the video image frame is: 2

在水平和垂直方向上每隔 1 个像素抽取, 抽取的像素组成新的块, 共可形成 4 个 8 X 8大小的块。 图 3c中以不同的填充方式区分组成不同块的像素。 为便于 描述本实施例的技术方案, 不妨将从左到右、 从上到下抽取像素的位置关系组 成的块, 分别标记为: 块 0、 1、 2和 3。 根据宏块下采样分解过程中抽取像素的位置关系, 将块分为多组, 例如: 块 3为一组, 块 1和 2为一组, 块 0为一组。 在对各块的下述编码处理过程中, 块 3 的预测编码类型为帧内预测编码, 其他块(如块 0、 1和 2 ) 的预测编码类型为像素间预测编码。 图中采用 "I" 和 "P" 分别表示相应块的预测编码类型为: 帧内预测编码或像素间预测类型。 因 此本实施例中, 块 3即为本发明实施例所述的 "第 1块", 块 1和 2即为本发明 实施例所述的 "第 2块", 块 0即为本发明实施例所述的 "第 3块"。 在本发明 实施例中, 所述第 1块中的像素至少包含该宏块中右下角的像素。 所述第 2块 中的右下角的像素与所述第 1 块中的右下角的像素在水平或垂直方向上。 所述 第 3块中的右下角的像素与所述第 1块中的右下角的像素在处于所述宏块的对 角线方向上。 Every 1 pixel is extracted in the horizontal and vertical directions, and the extracted pixels form a new block, which can form 4 blocks of 8×8 size. The pixels that make up the different blocks are distinguished in Figure 3c by different filling methods. To facilitate the description of the technical solution of the present embodiment, the blocks composed of the positional relationships of the pixels extracted from left to right and from top to bottom may be labeled as: blocks 0, 1, 2, and 3. According to the positional relationship of the extracted pixels in the macroblock downsampling process, the blocks are divided into multiple groups, for example: block 3 is a group, blocks 1 and 2 are a group, and block 0 is a group. In the following encoding process for each block, the predictive coding type of block 3 is intra prediction coding, and the prediction coding type of other blocks (such as blocks 0, 1, and 2) is inter-pixel predictive coding. The "I" and "P" in the figure respectively indicate that the prediction coding type of the corresponding block is: intra prediction coding or inter-pixel prediction type. Therefore, in this embodiment, the block 3 is the "first block" according to the embodiment of the present invention, and the blocks 1 and 2 are the "second block" according to the embodiment of the present invention, and the block 0 is the embodiment of the present invention. Said "third block". In the present invention In an embodiment, the pixel in the first block includes at least a pixel in a lower right corner of the macroblock. The pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.

当然, 在其他可选择的实施例中, 也可以为, 所述第 1块中的像素至少包 含该宏块中左上角的像素。 所述第 2块中的左上角的像素与所述第 1块中的左 上角的像素在水平或垂直方向上。 所述第 3块中的左上角的像素与所述第 1块 中的左上角的像素处于所述宏块的对角线方向上。  Of course, in other optional embodiments, the pixels in the first block may include at least pixels in the upper left corner of the macroblock. The pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block. The pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.

在下采样分解过程中可能会导致混叠 (aliasing )效应。 为了进一步提升编 码性能, 可选的, 本实施例可在下采样分解过程中进行对宏块的部分或全部像 素进行抗混叠滤波处理:  An aliasing effect may result during downsampling decomposition. In order to further improve the coding performance, optionally, the embodiment may perform anti-aliasing filtering on some or all of the pixels of the macroblock in the downsampling decomposition process:

举例说明一: 可对宏块的全部像素进行抗混叠滤波处理, 如采用低通滤波 器进行抗混叠滤波处理以滤除高频成分, 从而降低下采样分解过程中的混叠效 应对编码造成的影响。  For example one: anti-aliasing filtering can be performed on all pixels of a macroblock, such as anti-aliasing filtering using a low-pass filter to filter out high-frequency components, thereby reducing aliasing effects on encoding during downsampling decomposition The impact.

举例说明二: 可对块 0、 1和 2进行下采样分解过程中不采用抗混叠滤波处 理, 而在对块 3 进行下采样分解过程中进行抗混叠滤波处理, 如采用低通滤波 器作为下采样滤波器进行抗混叠滤波处理以滤除块 3 中的高频成分。 由于块 3 对其他块(块 0、 1和 2 ) 的编码质量的影响较大, 因此仅对块 3进行抗混叠滤 波处理, 在实现降低下采样分解过程中的混叠效应对编码造成的影响的技术效 果的基础上, 还有利于降低实现的复杂度。  For example two: Anti-aliasing filtering can be used in the downsampling decomposition of blocks 0, 1 and 2, and anti-aliasing filtering is performed in the downsampling process of block 3, such as using a low-pass filter. The anti-aliasing filtering process is performed as a downsampling filter to filter out high frequency components in the block 3. Since block 3 has a large influence on the coding quality of other blocks (blocks 0, 1, and 2), only block 3 is subjected to anti-aliasing filtering processing, and the aliasing effect in the process of reducing the downsampling decomposition is caused by coding. Based on the technical effects of the impact, it also helps to reduce the complexity of the implementation.

步骤 33、 对块 3进行帧内预测编码处理, 得到块 3的重构图像值及其预测 残差。  Step 33: Perform intra prediction coding processing on the block 3 to obtain the reconstructed image value of the block 3 and its prediction residual.

本步骤可利用帧内已编码的其他宏块的像素值, 对块 3进行帧内预测编码 得到预测值, 将该预测值与宏块原始像素进行比较得到块 3 的预测残差, 根据 块 3的预测值及其编码处理后的预测残差构造块 3的重构图像值。 步骤 34、 以块 3的重构图像值为参考, 分别对块 1和块 2进行预测编码处 理, 分别得到块 1和块 2的重构图像值及其预测残差。 步骤 35、以块 1和 /或块 2的重构图像值为参考,对块 0进行预测编码处理, 得到块 0的重构图像值及其预测残差。 可选的, 上述步骤 34和步骤 35对块 0、 1和 2进行预测编码处理过程中, 为了提高预测编码的准确性, 可对被参考的块的重构图像值进行插值处理, 插 值后对应位置的像素值为预测值。 可采用多抽头插值滤波器对被参考的重构图像值进行插值处理。 图 3d为本 发明第三实施例提供的像素间预测编码插值处理分解示意图。 如图 3d所示, 可 采用 4-tap ( 4抽头)插值滤波器, 对被参考的重构图像值进行插值处理, 其中: 对块 1和块 2的插值处理过程例如: 采用 4-tap插值滤波器对块 3的重构图 像值进行水平方向或垂直方向丄精度插值,以块 3的重构图像值垂直方向的插值 This step can perform intra prediction encoding on block 3 by using the pixel values of other macroblocks encoded in the frame. A predicted value is obtained, and the predicted value is compared with the original pixel of the macroblock to obtain a prediction residual of the block 3, and the reconstructed image value of the block 3 is constructed based on the predicted value of the block 3 and the encoded residual after the encoding. Step 34: Perform prediction encoding processing on Block 1 and Block 2 with reference to the reconstructed image values of the block 3, and obtain reconstructed image values of the blocks 1 and 2 and their prediction residuals, respectively. Step 35: Perform prediction encoding processing on block 0 with reference to the reconstructed image values of block 1 and/or block 2, to obtain reconstructed image values of block 0 and prediction residuals thereof. Optionally, in the foregoing step 34 and step 35, during the predictive coding process on the blocks 0, 1 and 2, in order to improve the accuracy of the predictive coding, the reconstructed image values of the referenced block may be interpolated, and the interpolation is performed. The pixel value of the position is the predicted value. The referenced reconstructed image values may be interpolated using a multi-tap interpolation filter. FIG. 3 is a schematic exploded view of inter-pixel predictive coding interpolation processing according to a third embodiment of the present invention. As shown in Fig. 3d, a 4-tap (4-tap) interpolation filter can be used to interpolate the referenced reconstructed image values, where: Interpolation processing for block 1 and block 2 is as follows: 4-tap interpolation The filter performs interpolation of the reconstructed image values of the block 3 in the horizontal direction or the vertical direction, and interpolates the vertical direction of the reconstructed image values of the block 3.

2  2

位置对应的像素值为块 1像素的预测值, 以块 3的重构图像值水平方向的插值 位置对应的像素值为块 2像素的预测值。 对块 0的插值处理过程例如: 求取组成块 0的像素周围的多个像素的像素 值的平均值, 为块 0的像素的预测值。 具体的, 可对块 0的像素周围的多个像 素的像素值求取平均值, 图 3d仅示意出了这 4个像素分别来自块 1和块 2的情 形。 可以理解, 如果块 0是以块 1的重构图像值为参考, 这 4个像素可来自块 1 或者来自块 1和 3, 如果块 0是以块 2的重构图像值为参考; 这 4个像素可来自 块 2或者来自块 2和 3; 对求取平均值的像素的个数也不限于 4个, 例如, 还可 对块 0的像素周围的 8个像素的像素值求取平均值, 这 8个像素可来自块 1、 2 和 3等。 The pixel value corresponding to the position is a predicted value of the block 1 pixel, and the pixel value corresponding to the interpolation position in the horizontal direction of the reconstructed image value of the block 3 is the predicted value of the block 2 pixel. The interpolation processing for block 0 is, for example, finding an average value of pixel values of a plurality of pixels around the pixel constituting the block 0, which is a predicted value of the pixel of the block 0. Specifically, the average of the pixel values of the plurality of pixels around the pixel of the block 0 can be averaged, and FIG. 3d only illustrates the case where the four pixels are from the block 1 and the block 2, respectively. It can be understood that if block 0 is referenced to the reconstructed image value of block 1, the four pixels may come from block 1 or from blocks 1 and 3, if block 0 is referenced to the reconstructed image value of block 2; The number of pixels may be from block 2 or from blocks 2 and 3; the number of pixels for averaging is not limited to four, for example, the average of the pixel values of 8 pixels around the pixel of block 0 may be averaged. , these 8 pixels can come from blocks 1, 2 And 3, etc.

可选的, 可根据上述步骤 33-步骤 35得到的各块的重构图像值, 构造宏块 的重构图像值, 并存储该宏块的重构图像值。  Optionally, the reconstructed image values of the macroblocks may be constructed according to the reconstructed image values of the blocks obtained in the above steps 33-35, and the reconstructed image values of the macroblocks are stored.

步骤 36、 将对宏块中各块的预测残差进行编码处理得到的各块的编码比特 信息写入码流。  Step 36: Write coded bit information of each block obtained by encoding the prediction residual of each block in the macroblock into the code stream.

需要说明的是, 本实施例以对各个块的预测残差信息写入码流的情况为例 进行说明。 实际上, 也可以允许对各个块的预测残差信息不写入码流的情况, 或者, 第 1块的预测残差信息写入码流而第 2块和第 3块的预测残差信息不写 入码流的情况, 或者其他组合。  It should be noted that the present embodiment will be described by taking an example in which the prediction residual information of each block is written into the code stream. Actually, it is also possible to allow the prediction residual information of each block not to be written into the code stream, or the prediction residual information of the first block is written into the code stream and the prediction residual information of the second block and the third block is not The case of writing a stream, or other combination.

在基于块的混合视频编码框架中, 对任一块的预测编码处理包括: 预测、 变换、 量化和熵编码等处理过程, 以实现视频图像的编码压缩。 图 3b为本发明 第三实施例提供的视频图像编码处理方法逻辑实现框图。 下面结合图 3b, 说明 当本实施例在基于块的混合视频编码框架的应用场景中, 对宏块包括的各块的 处理过程, 具体可包括:  In the block-based hybrid video coding framework, the predictive coding process for any block includes processes such as prediction, transform, quantization, and entropy coding to implement coding compression of the video image. FIG. 3b is a logic block diagram of a video image encoding processing method according to a third embodiment of the present invention. The following describes the processing procedure of the blocks included in the macroblock in the application scenario of the block-based hybrid video coding framework in this embodiment, which may specifically include:

( 1 )对块 3进行帧内预测得到预测值及其预测残差, 对块 3的预测残差依 次进行变换处理、 量化处理和熵编码处理, 根据块 3 的预测残差及其预测值构 造块 3的重构图像值。  (1) Performing intra prediction on block 3 to obtain a predicted value and a prediction residual thereof, and performing transform processing, quantization processing, and entropy encoding processing on the prediction residual of block 3, and constructing according to the prediction residual of block 3 and its predicted value. The reconstructed image value of block 3.

( 2 )以块 3的重构图像值为参考, 分别对块 1和 2进行预测得到预测值及 其预测残差, 对块 1和 2的预测残差依次进行变换处理、 量化处理和熵编码处 理, 分别根据块 1和 2各自的预测残差及其预测值, 构造块 1和 2的重构图像 值。  (2) Taking the reconstructed image value of block 3 as a reference, predicting blocks 1 and 2 respectively to obtain predicted values and their prediction residuals, and performing transform processing, quantization processing, and entropy coding on the prediction residuals of blocks 1 and 2, respectively. Processing, reconstructed image values of blocks 1 and 2 are constructed based on respective prediction residuals of blocks 1 and 2 and their predicted values, respectively.

( 3 ) 以块 1和 /或 2块的重构图像值为参考, 对块 0进行预测得到预测值 及其预测残差, 对块 0的预测残差依次进行变换处理、 量化处理和熵编码处理, 根据块 0的预测值及其预测残差构造块 0的重构图像值。 (3) Taking the reconstructed image values of block 1 and/or 2 blocks as reference, predicting block 0 to obtain the predicted value and its prediction residual, and performing transform processing, quantization processing and entropy coding on the prediction residual of block 0 in sequence. deal with, The reconstructed image value of block 0 is constructed from the predicted value of block 0 and its prediction residual.

可选的, 如果宏块还包括有其他块, 分别对其他各块进行预测, 并对其他 各块的预测残差依次进行变换处理、 量化处理和熵编码处理。 宏块包括的各块 的预测残差经熵编码处理后的结果写入码流, 对各块的重构图像值进合成处理, 可得到宏块的重构图像值。  Optionally, if the macroblock further includes other blocks, the other blocks are separately predicted, and the prediction residuals of the other blocks are sequentially subjected to transform processing, quantization processing, and entropy encoding processing. The prediction residuals of the blocks included in the macroblock are written into the code stream by the entropy coding process, and the reconstructed image values of the respective blocks are combined and processed to obtain reconstructed image values of the macroblocks.

上述对块由于级联预测编码的引入, 使得宏块内部可以有帧内预测和像素 间预测两种不同预测编码类型的块, 因此可在对块进行量化处理过程中进行优 化设计, 具体说明如下:  Due to the introduction of cascading predictive coding, the above-mentioned block can have two blocks of different predictive coding types, such as intra prediction and inter-pixel prediction, so that the block can be optimized in the process of quantization processing, as described below. :

可在宏块内部建立块级别的量化映射机制, 即: 根据宏块内部不同块的重 要性不同划分为多组, 给各组块分配不同的量化参数 ( Quantization Parameter, 简称 QP )值。  A block-level quantization mapping mechanism can be established within the macroblock, that is, according to the importance of different blocks within the macroblock, divided into multiple groups, and different quantization parameters (Quinization Parameter, QP for short) are assigned to each block.

由于被参考的块的质量会影响以其为参考的块的质量, 因此, 为了获得较 佳的率失真 (Rate-Distortion, 简称 RD )性能, 各块之间的 QP值分配原则是: 根据参考关系确定 QP值。 在上述编码过程中, 有如下参考关系: 块 0参 考块 1和块 2, 块 1和块 2参考块 3。被作为参考的块的 QP值被赋予更小的 QP 值。  Since the quality of the referenced block affects the quality of the block with which it is referenced, in order to obtain better Rate-Distortion (RD) performance, the principle of QP value allocation between blocks is: The relationship determines the QP value. In the above encoding process, there are the following reference relationships: Block 0 refers to block 1 and block 2, and block 1 and block 2 refer to block 3. The QP value of the block being referenced is given a smaller QP value.

根据上述原则, 如图 3c所示的块之间的参考关系, 可得块的 QP值满足下 式:  According to the above principle, the reference relationship between the blocks shown in Figure 3c, the QP value of the block can be satisfied as follows:

QP (块 3 ) ^QP (块 1/2 ) ^QP (块 0 )  QP (block 3) ^QP (block 1/2) ^QP (block 0)

在实际应用中, 各块的 QP值可根据实验确定, 例如可采样以下关系确定 各块的 QP值:  In practical applications, the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the QP value of each block:

QP (块 1/2 ) = QP (块 0 ) = QP ( Slice )  QP (block 1/2) = QP (block 0) = QP (Slice)

QP (块 3 ) = QP ( Slice ) -6 其中, QP ( Slice )表示视频图像条带 ( Slice )对应的 QP值, Slice对应的 QP值可预先获取。 QP (block 3) = QP (Slice) -6 Where QP (Slice) represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance.

在确定各块的 QP值之后,根据各块的 QP值对各块的预测残差进行相应的 量化处理。 为了获得较佳的量化编码性能, 可在量化处理过程中根据不同块的 QP值不同的特点, 对不同的块进行相应的量化优化处理如下:  After determining the QP value of each block, the prediction residual of each block is subjected to corresponding quantization processing according to the QP value of each block. In order to obtain better quantization and coding performance, different quantization and optimization processes can be performed on different blocks according to the different QP values of different blocks in the quantization process as follows:

举例说明 1 : 在率失真优化 ( Rate-Distortion Optimization, 简称 RDO ) 实 现过程中, 需要考虑宏块内部不同块 QP值不同的特点,对不同块计算率失真开 销( rdcost )时,根据 QP值确定与其对应的 lambda系数,不同 QP值对应的 lambda 系数不同。  Example 1: In the implementation of Rate-Distortion Optimization (RDO), it is necessary to consider the different QP values of different blocks within a macroblock. When calculating the rate distortion overhead (rdcost) for different blocks, determine according to the QP value. The lambda coefficient corresponding to different QP values is different from the corresponding lambda coefficient.

举例说明 2: 在量化取整(rounding )过程中, 考虑宏块内部各块的不同预 测编码类型, 对不同块使用不同的死区 (deadzone ), 以匹配帧内预测和像素间 预测两种不同预测编码类型的块预测残差经变换处理后的系数具有不同的分布 规律。 在进行量化取整处理过程中, 各块使用的死区的选择原则是: 对帧内预 测编码类型的块使用较小的死区, 对像素间预测编码类型的块使用较大的死区, 即对块 3使用较小的死区, 对块 2、 1和 0使用较大的死区, 例如对块 3使用的 死区为 A, 对块 2、 1或 0使用的死区为 B, 则 A小于 B。  Example 2: In the process of quantization rounding, consider different prediction coding types of each block in the macroblock, and use different dead zones for different blocks to match intra prediction and inter-pixel prediction. The coefficients of the block prediction residual of the prediction coding type are transformed to have different distribution laws. In the process of performing quantization and rounding, the selection principle of the dead zone used by each block is: a smaller dead zone is used for the intra prediction coding type block, and a larger dead zone is used for the inter-pixel prediction coding type block. That is, a smaller dead zone is used for block 3, and a larger dead zone is used for blocks 2, 1 and 0, for example, the dead zone used for block 3 is A, and the dead zone used for block 2, 1 or 0 is B, Then A is less than B.

在编码端输出的码流中可携带其他编码比特信息, 如对视频编码标准中语 法元素 "编码块模版" ( Coded Block Pattern, 简称 CBP ) 的熵编码信息。 由于 级联预测编码的引入, 使得宏块内部可以有帧内预测和像素间预测两种不同预 测编码类型的块, 不同预测编码类型的块的预测残差经量化处理后的系数具有 不同的分布规律, 考虑到这一点, 可对语法元素 CBP的熵编码方式进行相应的 优化设计, 具体的, CBP用于表示每个比特(Bit )对应的块是否存在非零系数。 如对于亮度(Luma )分量, CBP中有 4个比特分别指示宏块中的 4个 8 X 8大 小的块是否存在非零系数。 在进行熵编码处理过程中, 根据各块的预测残差经 量化处理后的系数的分布规律对各块进行相应的熵编码处理, 以获得较佳的性 能, 这些熵编码处理的结果可写入码流。 Other coded bit information may be carried in the code stream outputted by the coding end, such as entropy coding information of a syntax element "Coded Block Pattern" (CBP) in the video coding standard. Due to the introduction of cascading predictive coding, there may be two blocks of different predictive coding types, intra prediction and inter-pixel prediction, and the prediction residuals of different prediction coding types have different distributions after quantization. Regularity, taking this into consideration, the entropy coding mode of the syntax element CBP can be optimized accordingly. Specifically, CBP is used to indicate whether there is a non-zero coefficient for the block corresponding to each bit (Bit). For the Luma component, there are 4 bits in the CBP indicating 4 8 X 8 large in the macroblock. Whether small blocks have non-zero coefficients. In the entropy coding process, the entropy coding process is performed on each block according to the distribution rule of the quantized coefficients of the prediction residuals of each block to obtain better performance, and the results of these entropy coding processes can be written. Code stream.

举例说明 1 : 在自适应变长编码 (Context-based Adaptive Variable Length Coding, 简称 CAVLC )方式中, 需要才艮据宏块内部不同块具有不同的预测编码 类型, 重新设计结合帧内预测和像素间预测两种不同预测编码类型的块特性的 CBP的码表。 帧内预测编码类型的块的 CBP比特位出现 " 1" 的概率较大, 像 素间预测编码类型的块的 CBP比特位出现 "0" 的 ¾¾率较大, 支设: CBP 中 用于指示块中是否存在非零系数的 4个比特位, 依次用于指示块 3、 块 2、 块 1 和块 0中是否存在非零系数, 则宏块的 CBP中 4个比特位的值依次为 " 1000" 时可看作大概率事件, 对各块分配较短的码字; 而宏块的 CBP中 4个比特位的 值依次为 "0111 " 时可看作小概率事件, 对各块分配较长的码字。 根据各块的 CBP的码表对应的码字, 对相应的块的预测残差进行熵编码处理。  For example: In the Context-based Adaptive Variable Length Coding (CAVLC) method, it is necessary to re-design the combination of intra prediction and inter-pixel according to different prediction coding types in different macroblocks. A code table of CBPs that predict block characteristics for two different predictive coding types. The probability that the CBP bit of the intra prediction coding type block has a "1" is large, and the CBP bit of the inter-preferential coding type block has a large "0" rate, and the support: CBP is used to indicate the block. Whether there are 4 bits of non-zero coefficient in sequence, which are used to indicate whether there are non-zero coefficients in block 3, block 2, block 1 and block 0, then the value of 4 bits in the CBP of the macro block is "1000". " can be regarded as a large probability event, assigning shorter codewords to each block; and the value of 4 bits in the CBP of the macroblock can be regarded as a small probability event in the order of "0111", and each block is allocated longer. Codeword. The prediction residual of the corresponding block is subjected to entropy coding processing according to the codeword corresponding to the code table of the CBP of each block.

举例说明 2 : 在自适应二进制算术编码 (Context-based Adaptive Binary Arithmetic Coding, 简称 CAB AC ) 方式中, 需要根据宏块内部不同块具有不同 的预测编码类型, 重新设计概率模型初始化以及模型更新方法。 具体的, 宏块 内部的帧内和像素间两种不同预测编码类型的块, 分别适用不同的初始化概率 模型, 且各块分别适用不同的更新方法。 例如: 帧内预测编码类型的块的 CBP 中上述比特位值为 " 1" 表示大概率符号 (More Probable Symbol, 简称 MPS ), 像素间预测编码类型的块的 CBP中上述比特位值为 " 1 "表示小概率符号(Less Probable Symbol, 简称 LPS ), 在进行始化概率模型的选择和模型更新时, 可以 不使用上下文信息, 仅根据 CBP的各个比特位的位置进行模型选择和更新。  For example 2: In the Context-based Adaptive Binary Arithmetic Coding (CAB AC) method, it is necessary to redesign the probabilistic model initialization and the model update method according to different prediction coding types of different blocks within the macroblock. Specifically, different intra-prediction coding types of intra- and inter-pixel blocks within a macroblock are respectively applied with different initialization probability models, and different update methods are applied to each block. For example: in the CBP of the block of the intra prediction coding type, the above bit value is "1", which means a More Probable Symbol (MPS), and the bit value of the block of the inter-pixel predictive coding type is "1". "Less Probable Symbol (LPS)", when performing the selection of the probabilistic model and the model update, the model selection and update can be performed based only on the position of each bit of the CBP without using the context information.

本实施例视频图像编码处理方法中, 块 0以块 1和 /或块 2为参考进行预测 编码, 而块 1和块 2均以块 3为参考进行预测编码, 相当于在各块之间形成了 级联的预测参考关系; 又由于本实施例是在宏块层(MB level )进行编码处理, 因而可称为 "MB-level级联预测编码"。 与帧层 (frame level ) 下采样分解编码 方法(如 Intra帧分割为两场进行场编码的方法) 不同, 本实施例作为一种宏块 层的编码处理技术, 可与其他现有宏块类型一起参与 RDO决策编码, 从而实现 在宏块层上更好的适应图像内不同区域的紋理特性。 In the video image encoding processing method of this embodiment, block 0 is predicted by using block 1 and/or block 2 as a reference. Coding, and block 1 and block 2 are both predictively coded with reference to block 3, which is equivalent to forming a cascaded prediction reference relationship between blocks; and since this embodiment encodes at the macroblock layer (MB level) Processing can thus be referred to as "MB-level cascade predictive coding." Different from the frame level downsampling coding method (such as the method of dividing the Intra frame into two fields for field coding), this embodiment is used as a macroblock layer coding processing technology, and can be combined with other existing macroblock types. Participate in RDO decision coding together, so as to better adapt to the texture characteristics of different regions in the image on the macroblock layer.

需要说明的是, 虽然本实施例 16 X 16大小的宏块的预测编码进行说明, 但 可以理解, 可根据视频图像编码的实际需要, 对其他尺寸的块, 如 32 X 32、 64 X 64或 128 X 128等大小的宏块进行预测编码,预测编码过程中下采样系数可根 据实际需要, 得到的多个块的分组可为四组或四组以上, 从而形成三级联或三 级联以上的预测参考关系, 其实现机理与本实施例相似, 在此不再赘述。  It should be noted that although the predictive coding of the macroblock of the size of X 16 in this embodiment is described, it can be understood that other sizes of blocks, such as 32 X 32, 64 X 64 or The macroblocks of size 128 X 128 are used for predictive coding. The downsampling coefficients in the predictive coding process can be divided into four groups or more than four groups according to actual needs, thereby forming three or more cascades. The implementation mechanism of the prediction reference relationship is similar to that of the embodiment, and details are not described herein again.

图 4为本发明第四实施例提供的视频图像解码处理方法流程图。 本实施例 为与本发明第三实施例提供的编码处理方法对应的解码端处理流程。 如图 4所 示, 本实施例视频图像解码处理方法包括:  FIG. 4 is a flowchart of a video image decoding processing method according to a fourth embodiment of the present invention. This embodiment is a decoding end processing flow corresponding to the encoding processing method provided by the third embodiment of the present invention. As shown in FIG. 4, the video image decoding processing method in this embodiment includes:

步骤 41、 获取视频图像帧的宏块包含的块 0、 1、 2和 3的编码比特信息的 码流。  Step 41: Acquire a code stream of coded bit information of blocks 0, 1, 2, and 3 included in the macro block of the video image frame.

步骤 42、 分别对块 0、 1、 2和 3的编码比特信息进行解码处理, 得到各块 的残差解码系数。  Step 42: Decode the coded bit information of blocks 0, 1, 2, and 3, respectively, to obtain a residual decoding coefficient of each block.

在基于块的混合视频编码框架中, 对任一块的编码比特信息的解码处理包 括: 熵解码、 反量化、 反变换和预测等处理过程, 以实现编码压缩后的视频图 像的解码。 可选的, 当本实施例在基于块的混合视频编码框架的应用场景中, 步骤 42可包括步骤 421-步骤 423。  In the block-based hybrid video coding framework, the decoding process of the encoded bit information of any block includes processes such as entropy decoding, inverse quantization, inverse transform, and prediction to implement decoding of the encoded compressed video image. Optionally, in the application scenario of the block-based hybrid video coding framework, step 42 may include step 421 - step 423.

步骤 421、 对各块的编码比特信息进行熵解码处理。 在熵解码处理过程中, 可根据各块的残差解码系数不同的分布规律, 对各 块的编码比特信息进行相应的熵解码处理, 采用帧内预测编码模式进行预测编 码的块出现非零系数的概率, 大于采用像素间预测编码模式进行预测编码的块 出现非零系数的概率, 如: 块 3出现非零系数的概率, 大于块 0、 1和 2出现非 零系数的概率。 Step 421: Perform entropy decoding processing on the coded bit information of each block. In the entropy decoding process, according to the distribution rule of the residual decoding coefficients of each block, the encoding bit information of each block is subjected to corresponding entropy decoding processing, and the block using the intra prediction coding mode for predictive coding has non-zero coefficients. The probability of occurrence is greater than the probability that a block that is predictively coded using inter-pixel predictive coding mode exhibits non-zero coefficients, such as: the probability that block 3 exhibits a non-zero coefficient, which is greater than the probability that blocks 0, 1, and 2 have non-zero coefficients.

步骤 422、 对各块经熵解码处理后的系数进行反量化处理。  Step 422: Perform inverse quantization processing on the coefficients after entropy decoding processing of each block.

由于被参考的块的质量会影响以其为参考的块的质量, 因此, 为了获得较 佳的 RD性能, 各块之间的 QP值分配原则是: 根据参考关系确定 QP值。 可预 先确定后续解码过程存在的参考关系,例如存在如下参考关系:块 0参考块 1/2, 块 1/2参考子块 3。 则被作为参考的块的 QP值被赋予更小的 QP值。  Since the quality of the referenced block affects the quality of the block with which it is referenced, in order to obtain better RD performance, the QP value assignment principle between blocks is: Determine the QP value based on the reference relationship. The reference relationship existing in the subsequent decoding process can be determined in advance, for example, there is a reference relationship as follows: block 0 reference block 1/2, block 1/2 reference sub-block 3. The QP value of the block to be referenced is given a smaller QP value.

根据上述原则, 可得块的 QP值满足下式:  According to the above principle, the QP value of the available block satisfies the following formula:

QP (块 3 ) ^QP (块 1/2 ) ^QP (块 0 )  QP (block 3) ^QP (block 1/2) ^QP (block 0)

在实际应用中, 各块的 QP值可根据实验确定, 例如可采样以下关系确定 各块的 QP值:  In practical applications, the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the QP value of each block:

QP (块 1/2 ) = QP (块 0 ) = QP ( Slice )  QP (block 1/2) = QP (block 0) = QP (Slice)

QP (块 3 ) = QP ( Slice ) -6  QP (block 3) = QP (Slice) -6

其中, QP ( Slice )表示视频图像条带 ( Slice )对应的 QP值, Slice对应的 QP值可预先获取。  Among them, QP (Slice) represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance.

在确定各块的 QP值之后, 根据各块的 QP值, 对各块的编码比特信息经熵 解码处理后的系数进行相应的反量化处理。  After determining the QP value of each block, the entropy-decoded coefficients of the coded bit information of each block are subjected to corresponding inverse quantization processing according to the QP value of each block.

步骤 423、 对各块经反量化处理后的系数进行反变换处理, 得到残差。  Step 423: Perform inverse transform processing on the coefficients of each block subjected to the inverse quantization process to obtain a residual.

步骤 43、 对块 3进行帧内预测解码处理, 将预测值与块 3的残差相加, 得 到块 3的解码图像值。 步骤 44、 以块 3的解码图像值为参考, 分别对块 1和块 2进行预测解码处 理, 将块 1和块 2的预测值与其各自的残差相加, 分别得到块 1和块 2的解码 图像值。 Step 43: Perform intra prediction decoding processing on the block 3, and add the predicted value to the residual of the block 3 to obtain the decoded image value of the block 3. Step 44: Perform prediction decoding processing on Block 1 and Block 2 with reference to the decoded image value of the block 3, and add the predicted values of the block 1 and the block 2 to their respective residuals to obtain the blocks 1 and 2 respectively. Decode image values.

步骤 45、以块 1和 /或块 2的解码图像值为参考,对块 0进行预测解码处理, 将块 0的预测值与其残差相加, 得到块 0的解码图像值。  Step 45: Perform prediction decoding processing on block 0 with reference to the decoded image values of block 1 and/or block 2, and add the predicted value of block 0 to its residual to obtain a decoded image value of block 0.

上述步骤 43-步骤 45相当于在各块预测解码过程中建立级联预测参考关系, 其实现方式与编码端在各块预测编码过程中建立级联预测参考关系的方法相 似, 具体详见步骤 32-步骤 34的记载, 步骤 33和步骤 34中关于对块 0、 1和 2 进行插值处理的方法, 也适用于解码端步骤 44和步骤 45 中, 以提高预测的准 确性, 具体实现方式在此不再赘述。  The foregoing step 43 to step 45 are equivalent to establishing a cascading prediction reference relationship in each block prediction decoding process, and the implementation manner thereof is similar to the method in which the coding end establishes a cascading prediction reference relationship in each block predictive coding process. For details, see step 32. - the description of step 34, the method for performing interpolation processing on blocks 0, 1 and 2 in steps 33 and 34 is also applicable to the decoding steps 44 and 45 to improve the accuracy of the prediction, and the specific implementation is here. No longer.

步骤 46、 将各块的解码图像值进行上采样合成处理, 得到宏块的解码图像 值。  Step 46: Perform up-sampling synthesis processing on the decoded image values of the blocks to obtain decoded image values of the macroblocks.

需要说明的是, 本实施例涉及的各个块的解码过程, 均以各个块存在预测 残差系数的情况为例进行说明。 实际上, 还可以允许各个块不存在预测残差系 数的情况, 或者, 第 1块存在预测残差系数而第 2块和第 3块不存在预测残差 系数的情况, 或者其他组合。  It should be noted that the decoding process of each block according to the present embodiment is described by taking the case where the prediction residual coefficient exists in each block as an example. Actually, it is also possible to allow the case where the prediction residual coefficient is not present for each block, or the case where the prediction coefficient is present in the first block and the prediction residual coefficient is not present in the second block and the third block, or other combinations.

本实施例视频图像解码处理方法中, 块 1和块 2均以块 3为参考进行预测 解码, 而块 0以块 1和 /或块 2为参考进行预测解码, 相当于在各块之间形成了 级联的预测参考关系, 因此可以实现充分利用像素相关性进行编码压缩后的视 频图像的解码。  In the video image decoding processing method of this embodiment, both block 1 and block 2 perform prediction decoding with reference to block 3, and block 0 performs prediction decoding with reference to block 1 and/or block 2, which is equivalent to forming between blocks. The cascaded prediction reference relationship enables decoding of a video image that is encoded and compressed to fully utilize pixel correlation.

图 5a为本发明第五实施例提供的视频图像编码处理方法流程图。 本实施例 与本发明第三实施例的区别在于, 本实施例将块 3 进一步划分为多个子块, 本 实施例可应用于紋理细节较为丰富的视频图像的编码处理。 如图 5a所示, 本实 施例视频图像编码处理方法包括: FIG. 5a is a flowchart of a video image encoding processing method according to a fifth embodiment of the present invention. The difference between this embodiment and the third embodiment of the present invention is that the block 3 is further divided into a plurality of sub-blocks, and the embodiment can be applied to the encoding process of a video image with rich texture details. As shown in Figure 5a, this is The embodiment video image encoding processing method includes:

步骤 51、 输入待编码的视频图像帧。  Step 51: Input a video image frame to be encoded.

步骤 52、 对视频图像帧中 16 X 16大小的宏块进行下采样分解, 得到 4个 8 X 8大小的块, 分别标记为: 块 0、 1、 2和 3, 将块 3划分为 4个 4 X 4大小的 子块, 分别标记为: 子块 30、 31、 32和 33。  Step 52: Perform downsampling on the 16×16 macroblock in the video image frame to obtain four 8×8 size blocks, which are respectively labeled as: blocks 0, 1, 2, and 3, and divide the block 3 into 4 blocks. 4 X 4 size sub-blocks, labeled as: Sub-blocks 30, 31, 32, and 33, respectively.

本步骤将 16 X 16大小的宏块进行下采样分解为 4个 8 X 8大小的块的实现 方法, 与步骤 32相似, 在此不再赘述。  In this step, the method of decomposing the 16×16 macroblocks into four 8×8-sized blocks is similar to the step 32, and details are not described herein again.

图 5b为本发明第五实施例提供的宏块分解示意图。 如图 5b所示, 本实施 例将块 3划分为 4个 4 X 4大小的子块, 即子块 30、 31、 32和 33。 本实施例对 块 3的分解过程不涉及下采样。 图中在各子块中用 "I" 表示其的预测编码类型 均为: 帧内预测编码。  FIG. 5b is a schematic diagram of macroblock decomposition according to a fifth embodiment of the present invention. As shown in Fig. 5b, this embodiment divides the block 3 into four sub-blocks of size 4 x 4, i.e., sub-blocks 30, 31, 32, and 33. The decomposition process of block 3 in this embodiment does not involve downsampling. In the figure, the prediction coding types indicated by "I" in each sub-block are: intra prediction coding.

步骤 53、 对子块 30、 31、 32和 33分别进行帧内预测编码处理, 得到各子 块的重构图像值, 对各子块的重构图像值进行合成处理, 得到块 3 的重构图像 值。  Step 53: Perform intra prediction encoding processing on the sub-blocks 30, 31, 32, and 33, respectively, obtain reconstructed image values of the respective sub-blocks, perform synthesis processing on the reconstructed image values of the respective sub-blocks, and obtain reconstruction of the block 3. Image value.

本步骤可利用帧内已编码的其他宏块的像素值, 对组成子块 30进行帧内预 测得到预测值, 将该预测值与宏块原始像素进行比较得到子块 30的预测残差, 根据子块 30的预测值及其编码处理后的预测残差构造子块 30的重构图像值。 采用类似的方法, 分别得到子块 31、 32和 33的预测值及其相应的预测残差, 并分别构造得到子块 31、 32和 33的重构图像值。 对子块 30、 31、 32和 33的 重构图像值进行合成处理, 得到块 3的重构图像值。  In this step, the prediction value may be obtained by performing intra prediction on the constituent sub-block 30 by using the pixel values of other macroblocks encoded in the frame, and comparing the predicted value with the original pixel of the macroblock to obtain a prediction residual of the sub-block 30, according to The predicted value of the sub-block 30 and the reconstructed image value of the prediction residual constructed sub-block 30 after the encoding process. In a similar manner, the predicted values of the sub-blocks 31, 32, and 33 and their corresponding prediction residuals are obtained, respectively, and reconstructed image values of the sub-blocks 31, 32, and 33 are constructed, respectively. The reconstructed image values of the sub-blocks 30, 31, 32, and 33 are subjected to synthesis processing to obtain reconstructed image values of the block 3.

步骤 54-步骤 55、 与步骤 34-步骤 35相同, 在此不再赘述。  Steps 54-55 are the same as steps 34-35, and are not described here.

步骤 56、 将对宏块中各子块和各块的预测残差进行编码处理得到的各子块 及各块的编码比特信息写入码流。 本步骤对子块 30、 31、 32和 33的预测残差的编码处理, 与步骤 36中对预 测编码类型为帧内预测编码的块(如块 3 ) 的预测残差的编码处理的方法相同, 对块 0、 1和 2的预测残差与对预测编码类型为像素间预测编码的块(如块 0、 1 和 2 ) 的预测残差的编码处理的方法相同, 在此不再赘述。 Step 56: Write coded bit information of each sub-block and each block obtained by encoding the prediction residual of each sub-block and each block in the macro block into the code stream. The encoding processing of the prediction residuals of the sub-blocks 30, 31, 32, and 33 in this step is the same as the encoding processing of the prediction residual of the block (such as block 3) in which the prediction encoding type is intra-predictive encoding in step 36. The prediction residuals for blocks 0, 1, and 2 are the same as the encoding process for the prediction residuals of the blocks whose prediction coding type is inter-pixel prediction coding (such as blocks 0, 1, and 2), and are not described herein again.

本实施例视频图像编码处理方法在实现本发明第三实施例技术效果的基础 上, 通过对块 3细划分为多个子块(即子块 30、 31、 32和 33 ), 对各子块进行 帧内预测编码,将各子块重构图像值合成得到的块 3的重构图像值作为块 1和 2 的参考。 本实施例与本发明第三实施例的区别在于, 利用 4个 4 X 4的帧内预测 编码替换了本发明第三实施例中的 1个 8 X 8帧内预测编码。 4 X 4的帧内预测编 码, 更适合紋理细节较为丰富的视频图像的编码处理。  The video image encoding processing method of this embodiment implements the technical effects of the third embodiment of the present invention, and divides the block 3 into a plurality of sub-blocks (ie, sub-blocks 30, 31, 32, and 33), and performs sub-blocks on each sub-block. In intra prediction coding, the reconstructed image values of the block 3 obtained by combining the reconstructed image values of the respective sub-blocks are used as references of the blocks 1 and 2. The difference between this embodiment and the third embodiment of the present invention is that one 8×8 intra prediction code in the third embodiment of the present invention is replaced by four 4×4 intra prediction codes. 4 X 4 intra prediction coding is more suitable for encoding video images with rich texture details.

图 6为本发明第六实施例提供的视频图像解码处理方法流程图。 本实施例 为与本发明第五实施例提供的编码处理方法对应的解码端处理流程。 如图 6所 示, 本实施例视频图像解码处理方法包括:  FIG. 6 is a flowchart of a video image decoding processing method according to a sixth embodiment of the present invention. This embodiment is a decoding end processing flow corresponding to the encoding processing method provided by the fifth embodiment of the present invention. As shown in FIG. 6, the video image decoding processing method in this embodiment includes:

步骤 61、 获取的视频图像帧的宏块包含的子块 30、 31、 32和 33以及块 0、 1和 2的编码比特信息的码流。  Step 61: The code stream of the encoded bit information of the sub-blocks 30, 31, 32 and 33 and the blocks 0, 1 and 2 of the acquired macroblock of the video image frame.

步骤 62、 分别对子块 30、 31、 32和 33以及块 0、 1和 2的编码比特信息进 行解码处理, 得到各块的残差。  Step 62: Decode the coded bit information of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2, respectively, to obtain the residual of each block.

本步骤对子块 30、 31、 32和 33的编码比特信息的解码处理, 与步骤 42中 对预测编码类型为帧内预测编码的块(如块 3 )的编码比特信息的解码处理的方 法相同, 对块 0、 1和 2的编码比特信息与对预测编码类型为像素间预测编码的 块(如块 0、 1和 2 ) 的比特信息的解码处理的方法相同, 在此不再赘述。  The decoding process of the coded bit information of the sub-blocks 30, 31, 32 and 33 in this step is the same as the decoding process of the coded bit information of the block (such as block 3) of the predictive coding type being intra-predictive coded in step 42. The coded bit information for the blocks 0, 1 and 2 is the same as the method for decoding the bit information of the block (for example, blocks 0, 1 and 2) whose prediction coding type is inter-pixel predictive coding, and will not be described herein.

步骤 63、 对子块 30、 31、 32和 33进行帧内预测解码处理, 并将预测值与 各子块相应的残差相加, 得到各子块的解码图像值, 对各子块的解码图像值进 行合成处理, 得到块 3的解码图像值。 本步骤可利用帧内已解码的其他宏块的像素值,对组成子块 30进行帧内预 测得到预测值, 根据子块 30的预测值及其残差构造子块 30的解码图像值。 采 用相似的方法, 分别得到子块 31、 32和 33的的解码图像值。 对子块 30、 31、 32和 33的解码图像值进行合成处理, 得到块 3的解码图像值。 步骤 64-步骤 66、 与步骤 44-步骤 46相同, 在此不再赘述。 本实施例视频图像解码处理方法, 在实现本发明第四实施例提供的视频图 像解码方法达到技术效果的基础上, 通过将采用帧内预测编码的块划分的多个 子块, 在紋理细节较为丰富的视频图像的编码处理的应用场景中, 可以实现充 分利用像素相关性进行编码压缩后的视频图像的解码。 图 7a为本发明第七实施例提供的视频图像编码处理方法流程图。 本实施例 与本发明第三实施例的区别在于, 本实施例在块 3 再次进行了下采样分解得到 多个子块, 本实施例与本发明第五实施例的区别在于, 本实施例对块 3 再次进 行了下采样分解得到多个子块, 而第五实施例中的分解过程不涉及下采样。 本 实施例可应用于紋理细节较不丰富的视频图像的编码处理。 如图 7a所示, 本实 施例视频图像编码处理方法包括: 步骤 71、 输入待编码的视频图像帧。 步骤 72、 以丄为下采样系数,对视频图像帧中 16 X 16大小的宏块进行下采 Step 63: Perform intra prediction decoding processing on the sub-blocks 30, 31, 32, and 33, and add the predicted value to the residual corresponding to each sub-block to obtain a decoded image value of each sub-block, and decode each sub-block. Image value The line synthesis processing obtains the decoded image value of the block 3. In this step, the prediction value may be obtained by performing intra prediction on the constituent sub-block 30 by using the pixel values of other macroblocks decoded in the frame, and the decoded image values of the sub-block 30 are constructed according to the prediction value of the sub-block 30 and its residual. Using a similar method, the decoded image values of the sub-blocks 31, 32, and 33 are obtained, respectively. The decoded image values of the sub-blocks 30, 31, 32, and 33 are subjected to synthesis processing to obtain decoded image values of the block 3. Step 64-step 66 is the same as step 44-step 46, and details are not described herein again. In the video image decoding processing method of the embodiment, the video image decoding method provided by the fourth embodiment of the present invention achieves the technical effect, and the plurality of sub-blocks divided by the intra-predictive coding block are rich in texture details. In the application scenario of the encoding process of the video image, decoding of the video image after the encoding and compression using the pixel correlation can be realized. FIG. 7a is a flowchart of a video image encoding processing method according to a seventh embodiment of the present invention. The difference between this embodiment and the third embodiment of the present invention is that the present embodiment performs the downsampling decomposition again in block 3 to obtain a plurality of sub-blocks. The difference between this embodiment and the fifth embodiment of the present invention is that the block is in this embodiment. 3 The downsampling decomposition is performed again to obtain a plurality of sub-blocks, and the decomposition process in the fifth embodiment does not involve downsampling. This embodiment can be applied to the encoding process of a video image with less texture details. As shown in FIG. 7a, the video image encoding processing method of this embodiment includes: Step 71: Input a video image frame to be encoded. Step 72: Taking 丄 as a downsampling coefficient, performing a 16×16 macroblock in the video image frame

2  2

样分解, 得到 4个 8 X 8大小的块, 分别标记为: 块 0、 1、 2和 3 ; 以丄为下采 Decomposed into four blocks of 8 x 8 size, labeled as: blocks 0, 1, 2, and 3;

2 样系数, 对 8 X 8大小的块 3进行下采样分解, 得到 4个 4 X 4大小的子块, 分 别标记为: 子块 30、 31、 32和 33。 可根据视频图像帧的实际编码需求对预测编码类型为帧内预测编码的块再 次进行下采样分解, 得到多个子块, 对多个子块中预测编码类型为帧内预测编 码的子块再次进行下采样分解 ......以此类推, 进行多级下采样分解并对分解得 到的块或子块进行编码处理, 该思想可称为分等级(Hierarchical )编码方法。 2 sample coefficients, downsampling decomposition of block 8 of 8×8 size, and obtaining 4 4×4 size sub-blocks, respectively labeled as: sub-blocks 30, 31, 32 and 33. The block of the intra prediction coding type of the prediction coding type may be downsampled again according to the actual coding requirement of the video image frame to obtain multiple sub-blocks, and the prediction coding type of the plurality of sub-blocks is intra prediction. The sub-block of the code is subjected to downsampling decomposition again... and so on, multi-level downsampling decomposition is performed, and the decomposed block or sub-block is encoded. This idea can be called Hierarchical coding method. .

假设下采样系数为丄, 可对宏块下采样分解后得到的预测编码类型为帧内 m  Assuming that the downsampling coefficient is 丄, the type of predictive coding that can be obtained by downsampling the macroblock is intraframe.

预测编码的块再次进行下采样分解, 可得到 2个子块。 在 2个块进行分组, 例 如可根据下采样过程中抽取的、 组成各子块像素之间的位置关系将 m2个子块分 为《组, 每组由一个或多个子块组成, 每组由一个或多个块组成, 《为大于或等 于 3的整数。 第一组包括至少一个第 1子块, 第二组包括至少一个第 2子块, 第三组包括至少一个第 3子块 ......以此类推, 直至完成 2个子块的分组处理。 可选的, 在对 m2个子块进行分组过程中, 可根据抽取像素的位置关系是否相邻 的原则进行分组, 如将与同一像素相邻的像素组成的不同子块划分为同一组等。 The block of the predictive coding is subjected to downsampling decomposition again, and two sub-blocks are obtained. Grouping in 2 blocks, for example, according to the positional relationship between the sub-block pixels extracted in the downsampling process, the m 2 sub-blocks are divided into groups, each group consisting of one or more sub-blocks, each group consisting of One or more blocks consisting of "integers greater than or equal to 3. The first group includes at least one first sub-block, the second group includes at least one second sub-block, the third group includes at least one third sub-block, and so on, until packet processing of 2 sub-blocks is completed . Optionally, in the process of grouping m 2 sub-blocks, grouping may be performed according to whether the positional relationship of the extracted pixels is adjacent, for example, dividing different sub-blocks composed of pixels adjacent to the same pixel into the same group.

下面结合实例进行说明。 图 7b为本发明第七实施例提供的宏块下采样分解 示意图。 如图 7b所示, 本实施例相当于在图 3c所示的基础上, 将块 3划分为 4 个 4 X 4大小的子块, 即子块 30、 31、 32和 33。 将 8 X 8大小的块 3进行下采样 分解得到 4个 4 X 4大小的子块的实现方法, 与将 16 X 16大小的宏块进行下采 样分解为 4个 8 X 8大小的块的实现方法相同, 具体详见步骤 32的记载, 在此 不再赘述。  The following description will be made with reference to examples. FIG. 7b is a schematic diagram of a macroblock downsampling decomposition according to a seventh embodiment of the present invention. As shown in Fig. 7b, this embodiment is equivalent to dividing the block 3 into four sub-blocks of 4 x 4 size, i.e., sub-blocks 30, 31, 32 and 33, on the basis of that shown in Fig. 3c. The 8×8 size block 3 is downsampled to obtain 4 4×4 size sub-block implementation methods, and the 16×16 size macroblock is downsampled into 4 8×8 size blocks. The method is the same. For details, refer to the description in step 32, and details are not described herein again.

在子块 30、 31、 32和 33中, 可根据宏块下采样分解过程中抽取像素的位 置关系将子块分为多组, 例如: 子块 33为一组, 子块 31和 32为一组, 子块 30 为一组。  In the sub-blocks 30, 31, 32, and 33, the sub-blocks may be divided into groups according to the positional relationship of the extracted pixels in the macroblock downsampling process, for example, the sub-blocks 33 are a group, and the sub-blocks 31 and 32 are one. Group, sub-block 30 is a group.

在对各子块或块的下述编码处理过程中,子块 33对预测编码类型为帧内预 测编码, 对子块 30、 31和 32以及块 0、 1和 2的预测编码类型为像素间预测编 码。 图中采用 "I" 和 "P"分别表示相应块的预测编码类型为: 帧内预测编码或 像素间预测编码类型。 因此本实施例中, 子块 33即为本发明实施例所述的 "第 1子块", 子块 31和 32即为本发明实施例所述的 "第 2子块", 子块 30即为本 发明实施例所述的 "第 3子块"。 In the following encoding process for each sub-block or block, sub-block 33 encodes the predictive coding type as intra-prediction, and the predictive coding types for sub-blocks 30, 31 and 32 and blocks 0, 1 and 2 are between pixels. Predictive coding. The "I" and "P" in the figure respectively indicate that the prediction coding type of the corresponding block is: intra prediction coding or inter-pixel prediction coding type. Therefore, in this embodiment, the sub-block 33 is the first embodiment of the present invention. 1 sub-blocks, sub-blocks 31 and 32 are "second sub-blocks" according to an embodiment of the present invention, and sub-block 30 is a "third sub-block" according to an embodiment of the present invention.

在本发明实施例中,所述第 1块中的像素至少包含该宏块中右下角的像素。 所述第 2块中的右下角的像素与所述第 1块中的右下角的像素在水平或垂直方 向上。 所述第 3块中的右下角的像素与所述第 1块中的右下角的像素在处于所 述宏块的对角线方向上。  In an embodiment of the invention, the pixels in the first block include at least pixels in the lower right corner of the macroblock. The pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.

当然, 在其他可选择的实施例中, 也可以为, 所述第 1块中的像素至少包 含该宏块中左上角的像素。 所述第 2块中的左上角的像素与所述第 1块中的左 上角的像素在水平或垂直方向上。 所述第 3块中的左上角的像素与所述第 1块 中的左上角的像素处于所述宏块的对角线方向上。  Of course, in other optional embodiments, the pixels in the first block may include at least pixels in the upper left corner of the macroblock. The pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block. The pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.

步骤 73、 对子块 33进行帧内预测编码处理, 得到子块 33的重构图像值。 本步骤对子块 33的帧内预测编码处理方法, 与步骤 33中对块 3进行帧内 预测编码处理的方法相似, 在此不再赘述。  Step 73: Perform intra prediction encoding processing on the sub-block 33 to obtain a reconstructed image value of the sub-block 33. The intra prediction encoding processing method of the sub-block 33 in this step is similar to the method for performing intra prediction encoding processing on the block 3 in step 33, and details are not described herein again.

步骤 74、以子块 33的重构图像值为参考,分别对子块 31和 32进行预测编 码处理, 分别得到子块 31和 32的重构图像值。  Step 74: Perform prediction coding processing on the sub-blocks 31 and 32 with reference to the reconstructed image values of the sub-blocks 33, and obtain reconstructed image values of the sub-blocks 31 and 32, respectively.

本步骤对子块 31和 32的像素间预测编码及预测参考关系的确定, 与步骤 34中对块 1和 2的预测编码处理的方法相似, 在此不再赘述。 可选的, 在对子 块 31或 32进行预测编码处理过程中, 可包括对子块 33的重构图像值进行插值 处理, 插值后对应位置的像素值作为子块 31或 32的预测值,根据子块 31或 32 的预测值及其预测残差构造子块 31或 32的重构图像值, 其具体实现方式, 与 步骤 34对块 1和 2的插值处理的方法相似, 在此不再赘述。  The determination of the inter-pixel predictive coding and the prediction reference relationship of the sub-blocks 31 and 32 in this step is similar to the method for predictive coding processing of the blocks 1 and 2 in step 34, and details are not described herein again. Optionally, in performing the predictive coding process on the sub-block 31 or 32, the reconstructed image value of the sub-block 33 may be interpolated, and the pixel value of the corresponding position after interpolation is used as the predicted value of the sub-block 31 or 32. The reconstructed image value of the sub-block 31 or 32 is constructed according to the predicted value of the sub-block 31 or 32 and its prediction residual, and the specific implementation thereof is similar to the method for the interpolation processing of the blocks 1 and 2 in step 34, and is no longer Narration.

步骤 75、 以子块 31和 /或子块 32的重构图像值为参考, 对子块 30进行预 测编码处理, 得到子块 30的重构图像值。 本步骤对子块 30的像素间预测编码及预测参考关系的确定, 与步骤 35中 对块 0的预测编码处理的方法相似, 在此不再赘述。 可选的, 在对子块 30进行 预测编码处理过程中, 可包括对子块 31和 /或 32的重构图像值进行插值处理, 插值后对应位置的像素值作为子块 30的预测值, 根据子块 30的预测值及其预 测残差构造子块 30的重构图像值, 其具体实现方式, 与步骤 35对块 0的插值 处理的方法相似, 在此不再赘述。 Step 75: Perform prediction encoding processing on the sub-block 30 with reference to the reconstructed image value of the sub-block 31 and/or the sub-block 32 to obtain a reconstructed image value of the sub-block 30. The determination of the inter-pixel predictive coding and the prediction reference relationship of the sub-block 30 in this step is similar to the method of predictive coding processing for the block 0 in the step 35, and details are not described herein again. Optionally, in performing the predictive coding process on the sub-block 30, the reconstructed image values of the sub-blocks 31 and/or 32 may be subjected to interpolation processing, and the pixel values of the corresponding positions after interpolation are used as the predicted values of the sub-block 30, The method for constructing the reconstructed image of the sub-block 30 according to the predicted value of the sub-block 30 and its prediction residual is similar to the method for the interpolation processing of the block 0 in step 35, and details are not described herein again.

步骤 76、 对子块 30、 31、 32和 33的重构图像值进行合成处理, 得到块 3 的重构图像值。  Step 76: Synthesize the reconstructed image values of the sub-blocks 30, 31, 32, and 33 to obtain the reconstructed image values of the block 3.

步骤 77-步骤 78、 与步骤 34-步骤 35相同, 在此不再赘述。  Step 77-Step 78 is the same as Step 34-Step 35, and details are not described herein again.

步骤 79、 将对子块 30、 31、 32和 33、 以及块 0、 1和 2的预测残差进行编 码处理得到的各块的编码比特信息写入码流。  Step 79: Write coded bit information of each block obtained by encoding the prediction residuals of the sub-blocks 30, 31, 32, and 33, and blocks 0, 1, and 2 into the code stream.

当本实施例应用在基于块的混合视频编码框架的场景中, 对每个子块或每 个块的预测编码处理都包括依次进行预测、 变换、 量化和熵编码等处理, 根据 上述步骤 71-79可知,各子块及各块之间的形成级联式预测参考关系。 由于级联 预测编码的引入, 使得宏块内部可以有帧内预测和像素间预测两种不同预测编 码类型的块、 块内部可以有帧内预 'J和像素间预测两种不同预 'J编码类型的子 块, 因此可在对块进行量化处理过程中进行优化设计, 如为各子块或块分配 QP 值的基本原则为: 根据参考关系确定 QP值。 在上述编码过程中, 有如下参考关 系: 块 0参考块 1/2, 块 1/2参考块 3, 在块 3中, 子块 30参考子块 31/32, 子 块 31/32参考子块 33。 与步骤 362相似, 即: 被作为参考的块的 QP值被赋予更 小的 QP值。  When the present embodiment is applied to a scenario of a block-based hybrid video coding framework, the predictive coding process for each sub-block or each block includes performing prediction, transform, quantization, and entropy coding, etc., according to steps 71-79 above. It can be seen that each sub-block and each block form a cascade prediction reference relationship. Due to the introduction of cascading predictive coding, there may be two different predictive coding types in the macroblock, such as intra prediction and inter-pixel prediction. There may be two different pre-J codes in the intra-pre and inter-pixel prediction. The sub-blocks of the type can be optimized during the quantization process of the block. For example, the basic principle of assigning QP values to each sub-block or block is: Determine the QP value according to the reference relationship. In the above encoding process, there are the following reference relationships: block 0 refers to block 1/2, block 1/2 refers to block 3, in block 3, sub-block 30 refers to sub-block 31/32, sub-block 31/32 refers to sub-block 33. Similar to step 362, that is: the QP value of the block being referenced is given a smaller QP value.

根据上述原则, 如图 7b所示的子块以及块之间的参考关系, 可得子块以及 块的 QP值满足下式: QP (子块 33 ) ^QP (子块 31/32 ) ^QP (子块 30 ) ^QP (块 1/2 ) ^QP (块 0 ) According to the above principle, as shown in the sub-block and the reference relationship between the blocks as shown in FIG. 7b, the QP values of the sub-blocks and the blocks satisfy the following formula: QP (sub-block 33) ^QP (sub-block 31/32) ^QP (sub-block 30) ^QP (block 1/2) ^QP (block 0)

在实际应用中, 各块的 QP值可根据实验确定, 例如可采样以下关系确定 子块以及块的 QP值:  In practical applications, the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the sub-block and the QP value of the block:

QP (块 1/2 ) = QP (块 0 ) = QP ( Slice )  QP (block 1/2) = QP (block 0) = QP (Slice)

QP (子块 31/32 ) = QP (子块 30 ) = QP ( Slice ) -6  QP (subblock 31/32) = QP (subblock 30) = QP (Slice) -6

QP (子块 33 ) = QP ( Slice ) -10  QP (subblock 33) = QP (Slice) -10

其中, QP ( Slice )表示视频图像条带 ( Slice )对应的 QP值, Slice对应的 QP值可预先获取。  Among them, QP (Slice) represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance.

在确定子块 30、 31、 32和 33以及块 0、 1和 2的 QP值之后, 根据子块或 块的 QP值对各自预测残差进行相应的量化处理。对不同子块或块进行相应的量 化优化处理的方法, 与本发明第三实施例的记载相似, 在此不再赘述。  After determining the QP values of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2, the respective prediction residuals are subjected to corresponding quantization processing based on the QP values of the sub-blocks or blocks. A method for performing a corresponding quantization optimization process on different sub-blocks or blocks is similar to the description of the third embodiment of the present invention, and details are not described herein again.

由于级联预测编码的引入, 使得宏块内部可以有帧内预测和像素间预测两 种不同预测编码类型的块、 块内部可以有帧内预测和像素间预测两种不同预测 编码类型的子块, 因此, 可对宏块各块进行的熵编码进行优化设计, 其中, 对 语法元素 CBP的熵编码方式进行相应的优化设计, 可参见本发明第三实施例的 记载, 在此不再赘述。  Due to the introduction of cascading predictive coding, there may be two blocks of different predictive coding types, such as intra prediction and inter-pixel prediction, within the macroblock, and there may be two sub-blocks of different prediction coding types, such as intra prediction and inter-pixel prediction. Therefore, the entropy coding of the macroblocks may be optimized. The entropy coding mode of the syntax element CBP is optimized. For details, refer to the description of the third embodiment of the present invention, and details are not described herein.

视频编码标准中语法元素 "cbp_block_flag" 表示 1个 8x8块中的 4个 4x4 子块是否含有非零系数。 由于级联预测编码的引入, 使得 1个 8 X 8大小的块内 部可以有帧内和像素间两种不同预测编码类型的子块, 不同预测编码类型的子 块的预测残差经量化处理后的系数具有不同的分布规律, 考虑到这一点, 需要 对语法元素 "cbp_block_flag" 的熵编码方式进行相应的设计, 其设计方法与步 骤 363中对语法元素 CBP的熵编码方式优化设计方法相似, 在此不再赘述。 本实施例视频图像编码处理方法在实现本发明第三实施例技术效果的基础 上, 通过对块 3再次下采样分解得到多个子块(即子块 30、 31、 32和 33 ), 对 子块层以及块层等不同层分别建立各自相应的级联预测参考关系, 相当于形成 二个等级的级联式编码处理。 在紋理细节较不丰富的视频图像的编码处理的应 用场景中, 本实施例能更充分的利用像素之间的相关性, 从而提高预测精度及 视频图像的编码压缩效率。 The syntax element "cbp_block_flag" in the video coding standard indicates whether four 4x4 sub-blocks in one 8x8 block contain non-zero coefficients. Due to the introduction of cascading predictive coding, a block of 8×8 size may have two sub-blocks of different predictive coding types within and between pixels, and the prediction residuals of sub-blocks of different predictive coding types are quantized. The coefficients have different distribution rules. In view of this, the entropy coding mode of the syntax element "cbp_block_flag" needs to be designed accordingly. The design method is similar to the optimization design method of the entropy coding mode of the syntax element CBP in step 363. This will not be repeated here. In the video image encoding processing method of this embodiment, on the basis of implementing the technical effects of the third embodiment of the present invention, a plurality of sub-blocks (ie, sub-blocks 30, 31, 32, and 33) are obtained by sub-sampling the block 3 again, and the sub-blocks are obtained. Different layers such as layers and block layers respectively establish respective corresponding cascading prediction reference relationships, which is equivalent to forming two levels of cascading coding processing. In an application scenario of encoding processing of a video image with less texture details, the embodiment can more fully utilize the correlation between pixels, thereby improving prediction accuracy and encoding compression efficiency of the video image.

需要说明的是, 虽然本实施例 16 X 16大小的宏块进行二个等级的下采样分 解为例进行说明, 但可以理解, 可根据视频图像编码的实际需要, 如在紋理细 节较不丰富的视频图像的编码处理的应用场景中, 可在此基础上进行三个或三 个以上等级的下采样分解, 如: 在 4 X 4大小的子块 33进行下采样分解, 从而 形成三个等级的级联式编码处理, 其实现机理与本实施例相似, 在此不再赘述。  It should be noted that, although the macroblock of the size of 16 16 in this embodiment performs two levels of downsampling decomposition as an example, it can be understood that it can be based on the actual needs of video image encoding, such as less rich in texture details. In the application scenario of the encoding process of the video image, three or more levels of downsampling decomposition may be performed on the basis of the following, for example: sub-blocking decomposition in the sub-block 33 of 4×4 size, thereby forming three levels of The cascading coding process is similar to the embodiment, and will not be described again.

图 8 为本发明第八实施例提供的视频图像解码处理方法流程图。 本实施例 为与本发明第七实施例提供的编码处理方法对应的解码端处理流程。 如图 8所 示, 本实施例视频图像解码处理方法包括:  FIG. 8 is a flowchart of a video image decoding processing method according to an eighth embodiment of the present invention. This embodiment is a decoding end processing flow corresponding to the encoding processing method provided by the seventh embodiment of the present invention. As shown in FIG. 8, the video image decoding processing method in this embodiment includes:

步骤 81、 获取视频图像帧的宏块包含的子块 30、 31、 32和 33以及块 0、 1 和 2的编码比特信息的码流。  Step 81: Acquire a code stream of the sub-blocks 30, 31, 32, and 33 included in the macro block of the video image frame and the coded bit information of the blocks 0, 1, and 2.

步骤 82、 分别对子块 30、 31、 32和 33以及块 0、 1和 2的编码比特信息进 行解码处理, 得到各子块或块的残差解码系数。  Step 82: Decode the coded bit information of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2, respectively, to obtain residual decoding coefficients of each sub-block or block.

对任一子块或块的编码比特信息的解码处理可包括: 熵解码、 反量化和反 变换等处理, 相当于步骤 79的反处理过程。 可选的, 步骤 82可包括步骤 821- 步骤 823。  The decoding process of the encoded bit information of any sub-block or block may include: entropy decoding, inverse quantization, and inverse transform, etc., which is equivalent to the inverse processing of step 79. Optionally, step 82 may include step 821 - step 823.

步骤 821、 对子块 30、 31、 32和 33以及块 0、 1和 2的编码比特信息进行 熵解码处理。 在熵解码处理过程中, 可根据各块的残差解码系数不同的分布规律, 对各 块的编码比特信息进行相应的熵解码处理, 采用帧内预测编码模式进行预测编 码的块出现非零系数的概率, 大于采用像素间预测编码模式进行预测编码的块 出现非零系数的概率, 如: 子块 33残差解码系数出现非零系数的概率, 大于其 他子块 30、 31和 32或块 0、 1和 2的残差解码系数出现非零系数的概率。 其熵 解码实现机理与步骤 421-步骤 423相似, 在此不再赘述。 Step 821, performing entropy decoding processing on the coded bit information of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2. In the entropy decoding process, according to the distribution rule of the residual decoding coefficients of each block, the encoding bit information of each block is subjected to corresponding entropy decoding processing, and the block using the intra prediction coding mode for predictive coding has non-zero coefficients. The probability of the block larger than the predictive coding using the inter-plot predictive coding mode, such as: the probability that the sub-block 33 residual decoding coefficient has a non-zero coefficient is greater than other sub-blocks 30, 31, and 32 or block 0. The residual decoding coefficients of 1 and 2 have a probability of non-zero coefficients. The entropy decoding implementation mechanism is similar to the steps 421 to 423, and details are not described herein.

步骤 822、 对子块 30、 31、 32和 33以及块 0、 1和 2的编码比特信息经熵 解码处理后的系数, 进行反量化处理。  Step 822: Perform inverse quantization processing on the coefficients of the sub-blocks 30, 31, 32, and 33 and the coded bit information of the blocks 0, 1, and 2 subjected to entropy decoding processing.

由于被参考的块的质量会影响以其为参考的块的质量, 因此, 为了获得较 佳的 RD性能, 各子块和各块的 QP值分配原则是: 根据参考关系确定 QP值。 可预先确定后续解码过程存在的参考关系, 例如存在如下参考关系: 块 0参考 块 1/2, 块 1/2参考块 3, 在块 3中, 子块 0参考子块 31/32, 子块 31/32参考子 块 33。 则被作为参考的块的 QP值被赋予更小的 QP值。  Since the quality of the referenced block affects the quality of the block with which it is referenced, in order to obtain better RD performance, the QP value assignment principle for each sub-block and each block is: Determine the QP value based on the reference relationship. The reference relationship existing in the subsequent decoding process may be determined in advance, for example, there is a following reference relationship: block 0 refers to block 1/2, block 1/2 refers to block 3, and in block 3, subblock 0 refers to sub-block 31/32, sub-block 31/32 refers to sub-block 33. The QP value of the block to be referenced is given a smaller QP value.

根据上述原则, 可得块的 QP值满足下式:  According to the above principle, the QP value of the available block satisfies the following formula:

QP (子块 33 ) ^QP (子块 31/32 ) ^QP (子块 30 ) ^QP (块 1/2 ) ^QP QP (sub-block 33) ^QP (sub-block 31/32) ^QP (sub-block 30) ^QP (block 1/2) ^QP

(块 0 ) (block 0)

在实际应用中, 各块的 QP值可根据实验确定, 例如可采样以下关系确定 各块的 QP值:  In practical applications, the QP value of each block can be determined experimentally. For example, the following relationship can be sampled to determine the QP value of each block:

QP (块 1/2 ) = QP (块 0 ) = QP ( Slice )  QP (block 1/2) = QP (block 0) = QP (Slice)

QP (子块 31/32 ) = QP (子块 30 ) = QP ( Slice ) -6  QP (subblock 31/32) = QP (subblock 30) = QP (Slice) -6

QP (子块 33 ) = QP ( Slice ) -10  QP (subblock 33) = QP (Slice) -10

其中, QP ( Slice )表示视频图像条带 ( Slice )对应的 QP值, Slice对应的 QP值可预先获取。 在确定各块的 QP值之后, 根据各块的 QP值, 对各块的编码比特信息经熵 解码处理后的系数进行相应的反量化处理。 Where QP (Slice) represents the QP value corresponding to the slice of the video image, and the QP value corresponding to the slice can be obtained in advance. After determining the QP value of each block, according to the QP value of each block, the coded bit information of each block is subjected to corresponding inverse quantization processing by the entropy-decoded coefficients.

步骤 823、 对子块 30、 31、 32和 33以及块 0、 1和 2经反量化处理后的系 数, 进行反变换处理, 得到残差。  Step 823: Perform inverse transform processing on the coefficients of the sub-blocks 30, 31, 32, and 33 and the blocks 0, 1, and 2 after the inverse quantization to obtain a residual.

步骤 83、 对子块 33进行帧内预测解码处理, 并将预测值与子块 33残差相 加, 得到子块 33的解码图像值。  Step 83: Perform intra prediction decoding processing on the sub-block 33, and add the predicted value to the residual of the sub-block 33 to obtain a decoded image value of the sub-block 33.

步骤 84、以子块 33的解码图像值为参考,对子块 31和 32分别进行预测解 码处理, 得到子块 31和 32的预测值, 将子块 31和 32的预测值与其各自对应 的残差相加, 得到子块 31和 32的解码图像值。  Step 84: Perform prediction decoding processing on the sub-blocks 31 and 32 respectively with reference to the decoded image values of the sub-blocks 33, and obtain predicted values of the sub-blocks 31 and 32, and predict the values of the sub-blocks 31 and 32 and their respective residuals. The difference is added to obtain decoded image values of the sub-blocks 31 and 32.

步骤 85、 以子块 31和 /或 32的解码图像值为参考, 对子块 30进行预测解 码处理, 得到子块 30的预测值, 将子块 30的预测值与其残差相加, 得到子块 30的解码图像值。  Step 85: Perform prediction decoding processing on the sub-block 30 with reference to the decoded image values of the sub-blocks 31 and/or 32, obtain a prediction value of the sub-block 30, and add the predicted value of the sub-block 30 and its residual to obtain a sub- The decoded image value of block 30.

步骤 86、 将各子块 30、 31、 32和 33的解码图像值进行上采样合成处理, 得到块 3的解码图像值。  Step 86: Perform upsampling synthesis processing on the decoded image values of the respective sub-blocks 30, 31, 32, and 33 to obtain decoded image values of the block 3.

步骤 87-步骤 89、 与步骤 44-步骤 46相同, 在此不再赘述。  Steps 87 to 89 are the same as steps 44 to 46, and are not described here.

本实施例视频图像编码处理方法在实现本发明第四实施例技术效果的基础 上, 编码端对块 3再次下采样分解得到多个子块(即子块 30、 31、 32和 33 ), 解码端对子块层以及块层等不同层分别建立各自相应的级联预测参考关系, 相 当于形成二个等级的级联式解码处理。 在紋理细节较不丰富的视频图像处理的 应用场景中, 本实施例可以实现充分利用像素相关性进行编码压缩后的视频图 像的解码。 如果编码端对宏块进行三个或三个以上等级的下采样分解, 则解码 端进行等三个或三个以上级的级联式解码处理的实现方式, 与本实施例相似, 在此不再赘述。 图 9为本发明第九实施例提供的视频图像编码处理装置结构示意图。如图 9 所示, 本实施例视频图像编码处理装置包括: 下采样分解模块 91、 帧内预测编 码模块 92、 第一像素间预测编码模块 93和第二像素间预测编码模块 94。 The video image encoding processing method of this embodiment implements the technical effects of the fourth embodiment of the present invention, and the encoding end desamples the block 3 again to obtain a plurality of sub-blocks (ie, sub-blocks 30, 31, 32, and 33), and the decoding end. The respective cascading prediction reference relationships are established for different layers such as the sub-block layer and the block layer, which is equivalent to forming two levels of cascading decoding processing. In an application scenario of video image processing with less texture details, this embodiment can implement decoding of a video image after encoding and compressing fully utilizing pixel correlation. If the encoding end performs three or more levels of downsampling decomposition on the macroblock, the decoding end performs the implementation of the cascading decoding processing of three or more levels, which is similar to the embodiment, and is not Let me repeat. FIG. 9 is a schematic structural diagram of a video image encoding processing apparatus according to a ninth embodiment of the present invention. As shown in FIG. 9, the video image encoding processing apparatus of this embodiment includes: a downsampling decomposition module 91, an intra prediction encoding module 92, a first inter-pixel predictive encoding module 93, and a second inter-pixel predictive encoding module 94.

下采样分解模块 91用于对视频图像帧的宏块进行下采样分解得到多个块, 上述多个块至少包括: 第 1块、 第 2块和第 3块。 所述第 1块中的像素至少包 含该宏块中最右下角的像素。 所述第 2块中的最右下角的像素与所述第 1块中 的最右下角的像素在水平或垂直方向上。 所述第 3 块中的最右下角的像素与所 述第 1块中的最右下角的像素在对角线方向上。  The downsampling decomposition module 91 is configured to downsample the macroblock of the video image frame to obtain a plurality of blocks, and the plurality of blocks include at least: a first block, a second block, and a third block. The pixel in the first block contains at least the pixel in the lowermost corner of the macroblock. The pixel in the lowermost corner of the second block is in the horizontal or vertical direction from the pixel in the lowermost corner of the first block. The pixel in the lowermost corner of the third block is diagonal to the pixel in the lowermost corner of the first block.

帧内预测编码模块 92用于对第 1块进行帧内预测编码处理, 得到第 1块的 重构图像值。  The intra prediction encoding module 92 is configured to perform intra prediction encoding processing on the first block to obtain a reconstructed image value of the first block.

第一像素间预测编码模块 93用于以第 1块的重构图像值为参考, 对第 2块 进行预测编码处理, 得到第 2块的重构图像值。  The first inter-pixel predictive coding module 93 is configured to perform predictive coding on the second block with reference to the reconstructed image value of the first block to obtain a reconstructed image value of the second block.

第二像素间预测编码模块 94用于以第 2块的重构图像值为参考, 对第 3块 进行预测编码处理。  The second inter-pixel predictive coding module 94 is configured to perform predictive coding processing on the third block with reference to the reconstructed image value of the second block.

在上述技术方案的基础上, 为了提高预测准确性, 可选的:  Based on the above technical solutions, in order to improve the prediction accuracy, an optional:

第一像素间预测编码模块 93 还可用于对第 1 块的重构图像值进行插值处 理, 如: 对第 1 块的重构图像值进行插值处理, 插值后对应位置的像素值作为 第 2块的预测值, 根据第 2块的预测值及其预测残差构造第 2块的重构图像值。  The first inter-pixel predictive coding module 93 may be further configured to perform interpolation processing on the reconstructed image values of the first block, such as: performing interpolation processing on the reconstructed image values of the first block, and interpolating the pixel values of the corresponding positions as the second block. The predicted value constructs the reconstructed image value of the second block from the predicted value of the second block and its prediction residual.

第二像素间预测编码模块 94还可用于对第 2 块的重构图像值进行插值处 理, 如对第 2块的重构图像值进行插值处理, 插值后对应位置的像素值作为第 3 块的预测值, 根据第 3块的预测值及其预测残差构造第 3块的重构图像值。  The second inter-pixel predictive coding module 94 may be further configured to perform interpolation processing on the reconstructed image values of the second block, such as performing interpolation processing on the reconstructed image values of the second block, and interpolating the pixel values of the corresponding positions as the third block. The predicted value constructs the reconstructed image value of the third block from the predicted value of the third block and its prediction residual.

本实施例视频图像编码处理装置对宏块进行下采样分解得到多个块, 对多 个块进行预测编码过程中将帧内预测编码技术与像素间预测编码技术相结合, 像素间预测编码过程中形成了至少二级的级联式预测参考关系, 因此可充分利 用像素间的相关性, 提高预测精度以及视频图像的编码压缩效率。 关于本实施 例视频图像编码处理装置的实现机理, 可参见上述有关视频图像编码处理方法 实施例的记载, 在此不再赘述。 In this embodiment, the video image coding processing apparatus performs down-sampling decomposition on the macroblock to obtain a plurality of blocks, and combines the intra-prediction coding technology and the inter-pixel prediction coding technology in performing predictive coding on the plurality of blocks. At least two levels of cascading prediction reference relationships are formed in the inter-pixel predictive coding process, so that correlation between pixels can be fully utilized, and prediction accuracy and coding compression efficiency of video images are improved. For the implementation mechanism of the video image encoding processing apparatus of this embodiment, refer to the description of the foregoing video image encoding processing method embodiment, and details are not described herein again.

图 10为本发明第十实施例提供的视频图像解码处理装置结构示意图。如图 FIG. 10 is a schematic structural diagram of a video image decoding processing apparatus according to a tenth embodiment of the present invention. As shown

10所示, 本实施例视频图像解码处理装置包括: 比特信息获取模块 101、 帧内 预测解码模块 102、第一像素间预测解码模块 103、第二像素间预测解码模块 104 和解码图像生成模块 105。 As shown in FIG. 10, the video image decoding processing apparatus of this embodiment includes: a bit information acquiring module 101, an intra prediction decoding module 102, a first inter-pixel predictive decoding module 103, a second inter-pixel predictive decoding module 104, and a decoded image generating module 105. .

比特信息获取模块 101 用于获取视频图像帧的宏块的编码比特信息, 上述 宏块至少包括: 第 1块, 第 2块和第 3块。  The bit information obtaining module 101 is configured to obtain coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block.

在本发明实施例中,所述第 1块中的像素至少包含该宏块中右下角的像素。 所述第 2块中的右下角的像素与所述第 1块中的右下角的像素在水平或垂直方 向上。 所述第 3块中的右下角的像素与所述第 1块中的右下角的像素在处于所 述宏块的对角线方向上。  In an embodiment of the invention, the pixels in the first block include at least pixels in the lower right corner of the macroblock. The pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.

当然, 在其他可选择的实施例中, 也可以为, 所述第 1块中的像素至少包 含该宏块中左上角的像素。 所述第 2块中的左上角的像素与所述第 1块中的左 上角的像素在水平或垂直方向上。 所述第 3块中的左上角的像素与所述第 1块 中的左上角的像素处于所述宏块的对角线方向上。  Of course, in other optional embodiments, the pixels in the first block may include at least pixels in the upper left corner of the macroblock. The pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block. The pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.

帧内预测解码模块 102用于对第 1块的编码比特信息进行帧内预测解码处 理, 得到第 1块的解码图像值。  The intra prediction decoding module 102 is configured to perform intra prediction decoding processing on the coded bit information of the first block to obtain a decoded image value of the first block.

第一像素间预测解码模块 103用于以第 1块的解码图像值为参考, 对第 2 块的编码比特信息进行预测解码处理, 得到第 2块的解码图像值。  The first inter-pixel predictive decoding module 103 is configured to perform predictive decoding processing on the coded bit information of the second block with reference to the decoded picture value of the first block to obtain a decoded picture value of the second block.

第二像素间预测解码模块 104用于以第 2块的解码图像值为参考, 对第 3 块的编码比特信息进行预测解码处理, 得到第 3块的解码图像值。 The second inter-pixel prediction decoding module 104 is configured to use the decoded image value of the second block as a reference, for the third The coded bit information of the block is subjected to predictive decoding processing to obtain a decoded image value of the third block.

解码图像生成模块 105 用于将宏块中各块的解码图像值进行上采样合成处 理, 得到宏块的解码图像值。  The decoded image generation module 105 is configured to perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock.

在上述技术方案的基础上, 为了提高预测准确性, 可选的:  Based on the above technical solutions, in order to improve the prediction accuracy, an optional:

第一像素间预测解码模块 103还可用于对第 1块的解码图像值进行插值处 理, 如: 对第 1 块的重构图像值进行插值处理, 插值后对应位置的像素值作为 第 2块的预测值, 根据第 2块的预测值及其残差解码系数构造第 2块的解码图 像值。  The first inter-pixel prediction decoding module 103 is further configured to perform interpolation processing on the decoded image values of the first block, for example, performing interpolation processing on the reconstructed image values of the first block, and the pixel values of the corresponding positions after interpolation are used as the second block. The predicted value constructs the decoded image value of the second block from the predicted value of the second block and its residual decoding coefficient.

第二像素间预测解码模块 104还可用于对第 2块的解码图像值进行插值处 理。 如: 对第 2块的重构图像值进行插值处理, 插值后对应位置的像素值作为 第 3块的预测值, 根据第 3块的预测值及其残差解码系数构造第 3块的解码图 像值。  The second inter-pixel predictive decoding module 104 can also be used to interpolate the decoded image values of the second block. For example, the reconstructed image value of the second block is subjected to interpolation processing, and the pixel value of the corresponding position after interpolation is used as the predicted value of the third block, and the decoded image of the third block is constructed based on the predicted value of the third block and its residual decoding coefficient. value.

本实施例视频图像解码处理装置对宏块进行下采样分解得到多个块进行预 测解码过程中, 将帧内预测解码技术与像素间预测解码技术相结合, 像素间预 测解码过程中形成了至少二级的级联式预测参考关系, 因此有利于充分利用块 之间的像素相关性以实现编码压缩后的视频图像的重构。 关于本实施例视频图 像解码处理装置的实现机理, 可参见上述有关视频图像解码处理方法实施例的 记载, 在此不再赘述。  In this embodiment, the video image decoding processing apparatus combines the intra prediction decoding technology with the inter-pixel prediction decoding technology by performing downsampling decomposition on the macroblock to obtain a plurality of blocks, and forms at least two in the inter-pixel prediction decoding process. The hierarchical cascading predicts the reference relationship, thus facilitating the full utilization of pixel correlation between blocks to achieve reconstruction of the encoded compressed video image. For the implementation mechanism of the video image decoding processing apparatus of this embodiment, reference may be made to the above description of the video image decoding processing method embodiment, and details are not described herein again.

图 11为本发明第十一实施例提供的视频图像编解码系统结构示意图。 如图 11所示, 本实施例视频图像编解码系统包括: 视频图像编码处理装置 111和视 频图像解码处理装置 112。  FIG. 11 is a schematic structural diagram of a video image codec system according to an eleventh embodiment of the present invention. As shown in FIG. 11, the video image encoding and decoding system of this embodiment includes: a video image encoding processing device 111 and a video image decoding processing device 112.

视频图像编码处理装置 111 用于对视频图像帧的宏块进行下采样分解得到 多个块, 上述多个块至少包括: 第 1块、 第 2块和第 3块; 对第 1块进行帧内 预测编码处理, 得到第 1块的重构图像值; 以第 1块的重构图像值为参考, 对 第 2块进行预测编码处理, 得到第 2块的重构图像值; 以第 2块的重构图像值 为参考, 对第 3块进行预测编码处理。 The video image encoding processing device 111 is configured to downsample the macroblock of the video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block; performing intraframe on the first block Predicting the encoding process to obtain the reconstructed image value of the first block; performing the predictive encoding process on the second block with reference to the reconstructed image value of the first block to obtain the reconstructed image value of the second block; The reconstructed image value is a reference, and the third block is subjected to predictive coding processing.

视频图像解码处理装置 112 用于获取宏块的编码比特信息, 上述宏块至少 包括: 上述第 1块, 上述第 2块和上述第 3块; 对第 1块的编码比特信息进行 帧内预测解码处理, 得到第 1块的解码图像值; 以第 1块的解码图像值为参考, 对第 2块的编码比特信息进行预测解码处理, 得到第 2块的解码图像值; 以第 2 块的解码图像值为参考,对第 3块的编码比特信息进行预测解码处理,得到第 3 块的解码图像值; 将宏块中各块的解码图像值进行上采样合成处理, 得到该宏 块的解码图像值。  The video image decoding processing device 112 is configured to acquire coded bit information of the macro block, wherein the macro block includes at least: the first block, the second block, and the third block; and performing intra prediction decoding on the coded bit information of the first block. Processing, obtaining a decoded image value of the first block; performing predictive decoding processing on the encoded bit information of the second block with reference to the decoded image value of the first block to obtain a decoded image value of the second block; decoding by the second block The image value is a reference, and the coded bit information of the third block is subjected to predictive decoding processing to obtain a decoded image value of the third block; and the decoded image value of each block in the macro block is subjected to upsampling synthesis processing to obtain a decoded image of the macroblock. value.

在本发明实施例中,所述第 1块中的像素至少包含该宏块中右下角的像素。 所述第 2块中的右下角的像素与所述第 1块中的右下角的像素在水平或垂直方 向上。 所述第 3块中的右下角的像素与所述第 1块中的右下角的像素在处于所 述宏块的对角线方向上。  In an embodiment of the invention, the pixels in the first block include at least pixels in the lower right corner of the macroblock. The pixel in the lower right corner of the second block is in the horizontal or vertical direction from the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.

当然, 在其他可选择的实施例中, 也可以为, 所述第 1块中的像素至少包 含该宏块中左上角的像素。 所述第 2块中的左上角的像素与所述第 1块中的左 上角的像素在水平或垂直方向上。 所述第 3块中的左上角的像素与所述第 1块 中的左上角的像素处于所述宏块的对角线方向上。  Of course, in other optional embodiments, the pixels in the first block may include at least pixels in the upper left corner of the macroblock. The pixel in the upper left corner of the second block is in the horizontal or vertical direction from the pixel in the upper left corner of the first block. The pixel in the upper left corner of the third block and the pixel in the upper left corner of the first block are in the diagonal direction of the macro block.

本发明实施例视频图像编解码系统中,对宏块进行下采样分解得到多个块, 对多个块进行预测编码或解码过程中将帧内预测技术与像素间预测技术相结 合, 形成了至少二级的级联式预测参考关系, 因此可充分利用像素间的相关性, 提高预测精度以及视频图像的编码压缩效率。 关于本实施例视频图像编码处理 装置细化结构可参见图 9对应实施例的记载, 视频图像解码处理装置细化结构 可参见图 10对应实施例的记载, 其实现视频图像编解码的机理可参见图 1-图 8 对应实施例的记载, 在此不再赘述。 In the video image coding and decoding system of the embodiment of the present invention, the macroblock is downsampled and decomposed to obtain a plurality of blocks, and the intra prediction technique and the interpixel prediction technology are combined in the process of predictive coding or decoding of the plurality of blocks to form at least The second-level cascading prediction reference relationship can make full use of the correlation between pixels, improve the prediction accuracy and the coding compression efficiency of video images. For the detailed structure of the video image encoding processing apparatus of this embodiment, refer to the description of the corresponding embodiment of FIG. 9, the video image decoding processing apparatus refines the structure. For the description of the corresponding embodiment of FIG. 10, the mechanism for implementing video image coding and decoding can be referred to the description of the corresponding embodiment in FIG. 1 to FIG. 8 , and details are not described herein again.

本领域普通技术人员可以理解: 附图只是一个实施例的示意图, 附图中的 模块或流程并不一定是实施本发明所必须的。  It will be understood by those of ordinary skill in the art that the drawings are only a schematic representation of one embodiment, and the modules or processes in the drawings are not necessarily required to practice the invention.

本领域普通技术人员可以理解: 实施例中的装置中的模块可以按照实施例 描述分布于实施例的装置中, 也可以进行相应变化位于不同于本实施例的一个 或多个装置中。 上述实施例的模块可以合并为一个模块, 也可以进一步拆分成 多个子模块。  It will be understood by those skilled in the art that the modules in the apparatus in the embodiments may be distributed in the apparatus of the embodiment according to the embodiment, or may be correspondingly changed in one or more apparatuses different from the embodiment. The modules of the above embodiments may be combined into one module, or may be further split into a plurality of sub-modules.

上述本发明实施例序号仅仅为了描述, 不代表实施例的优劣。  The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.

本领域普通技术人员可以理解: 实现上述方法实施例的全部或部分步骤可 以通过程序指令相关的硬件来完成, 前述的程序可以存储于一计算机可读取存 储介质中, 该程序在执行时, 执行包括上述方法实施例的步骤; 而前述的存储 介质包括: ROM、 RAM, 磁碟或者光盘等各种可以存储程序代码的介质。  A person skilled in the art can understand that all or part of the steps of implementing the above method embodiments may be completed by using hardware related to program instructions, and the foregoing program may be stored in a computer readable storage medium, and the program is executed when executed. The foregoing steps include the steps of the foregoing method embodiments; and the foregoing storage medium includes: a medium that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.

最后应说明的是: 以上实施例仅用以说明本发明的技术方案, 而非对其限 制; 尽管参照前述实施例对本发明进行了详细的说明, 本领域的普通技术人员 应当理解: 其依然可以对前述实施例所记载的技术方案进行修改, 或者对其中 部分技术特征进行等同替换; 而这些修改或者替换, 并不使相应技术方案的本 质脱离本发明实施例技术方案的精神和范围。  It should be noted that the above embodiments are only for explaining the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those skilled in the art that: The technical solutions described in the foregoing embodiments are modified, or the equivalents of the technical features are replaced by the equivalents of the technical solutions of the embodiments of the present invention.

Claims

权 利 要 求 Rights request 1、 一种视频图像编码处理方法, 其特征在于, 包括:  A video image encoding processing method, comprising: 对视频图像帧的宏块进行下采样分解得到多个块, 所述多个块至少包括: 第 1块、 第 2块和第 3块;  Downsampling the macroblock of the video image frame to obtain a plurality of blocks, the plurality of blocks including at least: a first block, a second block, and a third block; 对所述第 1块进行帧内预测编码处理, 得到所述第 1块的重构图像值; 以所述第 1块的重构图像值为参考, 对所述第 2块进行预测编码处理, 得 到所述第 2块的重构图像值;  Performing intra prediction encoding processing on the first block to obtain a reconstructed image value of the first block; performing prediction encoding processing on the second block with reference to the reconstructed image value of the first block; Obtaining a reconstructed image value of the second block; 以所述第 2块的重构图像值为参考, 对所述第 3块进行预测编码处理。 The third block is subjected to predictive coding processing with reference to the reconstructed image value of the second block. 2、 根据权利要求 1所述的视频图像编码处理方法, 其特征在于, 所述以第 1块的重构图像值为参考, 对所述第 2块进行预测编码处理, 包 括: 对所述第 1块的重构图像值进行插值处理; 2. The video image encoding processing method according to claim 1, wherein the predictive encoding processing on the second block is performed by using the reconstructed image value of the first block, and the method includes: 1 block of reconstructed image values are interpolated; 所述以第 2块的重构图像值为参考, 对所述第 3块进行预测编码处理, 包 括: 对所述第 2块的重构图像值进行插值处理。  The predictive coding process is performed on the third block with reference to the reconstructed image value of the second block, and includes: performing interpolation processing on the reconstructed image value of the second block. 3、 根据权利要求 1所述的视频图像编码处理方法, 其特征在于, 对所述第 1块、 第 2块和第 3块中的任一块进行预测编码处理, 包括:  The video image encoding processing method according to claim 1, wherein the predictive encoding process is performed on any one of the first block, the second block, and the third block, including: 对各块的预测残差进行变换处理;  Transforming the prediction residuals of each block; 根据各块的量化参数值, 对各块的预测残差经变换处理后的系数进行量化 处理, 所述第 1块的量化参数值小于或等于所述第 2块的量化参数值、 且所述 第 2块的量化参数值小于或等于所述第 3块的量化参数值;  Encoding the transformed residuals of the prediction residuals of the respective blocks according to the quantization parameter values of the blocks, wherein the quantization parameter values of the first block are less than or equal to the quantization parameter values of the second block, and the The quantization parameter value of the second block is less than or equal to the quantization parameter value of the third block; 对各块经量化处理后的系数进行熵编码处理。  Entropy coding processing is performed on the quantized coefficients of each block. 4、 根据权利要求 1所述的视频图像编码处理方法, 其特征在于, 还包括: 根据所述多个块中各块的预测残差系数的不同分布规律, 对所述多个块中 各块是否包含非零系数的指示信息进行熵编码处理, 所述第 1 块包含非零系数 的概率, 大于所述第 2块或第 3块包含非零系数的概率。 The video image encoding processing method according to claim 1, further comprising: performing, according to different distribution rules of prediction residual coefficients of each of the plurality of blocks, each of the plurality of blocks Whether or not the indication information including the non-zero coefficient is subjected to entropy coding processing, and the first block includes non-zero coefficients The probability of being greater than the probability that the second or third block contains non-zero coefficients. 5、 根据权利要求 1所述的视频图像编码处理方法, 其特征在于, 对所述第 1块进行帧内预测编码处理, 得到所述第 1块的重构图像值, 包括:  The video image encoding processing method according to claim 1, wherein the performing the intra prediction encoding process on the first block to obtain the reconstructed image value of the first block comprises: 将所述第 1 块的像素划分成多个子块, 对各子块进行帧内预测编码处理, 得到各子块的重构图像值, 对各子块的重构图像值进行合成处理, 得到所述第 1 块的重构图像值。  Dividing the pixels of the first block into a plurality of sub-blocks, performing intra prediction encoding processing on each sub-block, obtaining reconstructed image values of the respective sub-blocks, and synthesizing the reconstructed image values of the respective sub-blocks to obtain a The reconstructed image value of the first block. 6、 根据权利要求 1所述的视频图像编码处理方法, 其特征在于, 对所述第 1块进行帧内预测编码处理, 得到所述第 1块的重构图像值, 包括:  The video image encoding processing method according to claim 1, wherein the performing the intra prediction encoding process on the first block to obtain the reconstructed image value of the first block comprises: 对所述第 1块进行下采样分解得到多个子块, 所述多个子块至少包括: 第 1 子块、 第 2子块和第 3子块;  Downsampling the first block to obtain a plurality of sub-blocks, where the plurality of sub-blocks at least include: a first sub-block, a second sub-block, and a third sub-block; 对所述第 1子块进行帧内预测编码处理, 得到所述第 1子块的重构图像值; 以所述第 1子块的重构图像值为参考, 对所述第 2子块进行预测编码处理, 得到所述第 2子块的重构图像值;  Performing intra prediction encoding processing on the first sub-block to obtain a reconstructed image value of the first sub-block; and performing the second sub-block on the reference to the reconstructed image value of the first sub-block Predicting an encoding process to obtain a reconstructed image value of the second sub-block; 以所述第 2子块的重构图像值为参考, 对所述第 3子块进行预测编码处理, 得到所述第 3子块的重构图像值;  And performing predictive coding processing on the third sub-block with reference to the reconstructed image value of the second sub-block to obtain a reconstructed image value of the third sub-block; 对所述多个子块中各子块的重构图像值进行上采样合成处理,得到所述第 1 块的重构图像值。  Performing upsampling synthesis processing on the reconstructed image values of the sub-blocks of the plurality of sub-blocks to obtain reconstructed image values of the first block. 7、 根据权利要求 6所述的视频图像编码处理方法, 其特征在于, 对所述第 1子块、 第 2子块和第 3子块中的任一子块进行预测编码处理, 包括:  The video image encoding processing method according to claim 6, wherein performing predictive encoding processing on any one of the first sub-block, the second sub-block, and the third sub-block includes: 对各子块的预测残差进行变换处理;  Performing transformation processing on prediction residuals of each sub-block; 根据各子块的量化参数值, 对各子块的预测残差经变换处理后的系数进行 量化处理, 所述第 1子块的量化参数值小于或等于所述第 2子块的量化参数值、 且所述第 2子块的量化参数值小于或等于所述第 3子块的量化参数值; 对各子块经量化处理后的系数进行熵编码处理。 Encoding the transformed residuals of the prediction sub-blocks according to the quantization parameter values of the sub-blocks, and the quantization parameter values of the first sub-block are less than or equal to the quantization parameter values of the second sub-block And the quantization parameter value of the second sub-block is less than or equal to the quantization parameter value of the third sub-block; Entropy coding processing is performed on the quantized coefficients of each sub-block. 8、 根据权利要求 6所述的视频图像编码处理方法, 其特征在于, 还包括: 根据各子块的预测残差经量化处理后的系数不同的分布规律, 对各子块的 预测残差经量化处理后的进行相应的熵编码处理, 所述第 1 子块的出现非零系 数的概率, 大于所述第 2子块或第 3子块出现非零系数的概率。  The video image encoding processing method according to claim 6, further comprising: predicting a residual residual for each sub-block according to a distribution rule in which the prediction residuals of the respective sub-blocks are quantized by different coefficients After the quantization process, the corresponding entropy coding process is performed, and the probability of occurrence of the non-zero coefficient of the first sub-block is greater than the probability of occurrence of the non-zero coefficient of the second sub-block or the third sub-block. 9、 根据权利要求 1所述的视频图像编码处理方法, 其特征在于, 在对所述 宏块进行下采样分解的过程中, 还包括: 对所述宏块的部分或全部像素进行抗 混叠滤波处理。  The video image encoding processing method according to claim 1, wherein, in the process of downsampling the macroblock, the method further comprises: anti-aliasing part or all of the pixels of the macroblock Filter processing. 10、 如权利要求 1 中所述的视频图像编码方法, 其特征在于, 所述第 1块 中的像素至少包含该宏块中右下角的像素。 所述第 2 块中的右下角的像素与所 述第 1块中的右下角的像素在水平或垂直方向上。 所述第 3块中的右下角的像 素与所述第 1块中的右下角的像素处于所述宏块的对角线方向上。  10. The video image encoding method according to claim 1, wherein the pixels in the first block include at least pixels in a lower right corner of the macroblock. The pixel in the lower right corner of the second block is in a horizontal or vertical direction with the pixel in the lower right corner of the first block. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block. 11、 一种视频图像解码处理方法, 其特征在于, 包括:  A video image decoding processing method, comprising: 获取视频图像帧的宏块的编码比特信息, 所述宏块至少包括: 第 1块, 第 2 块和第 3块;  Obtaining coded bit information of a macroblock of a video image frame, the macroblock including at least: a first block, a second block, and a third block; 对所述第 1块的编码比特信息进行帧内预测解码处理, 得到所述第 1块的 解码图像值;  Performing intra prediction decoding processing on the coded bit information of the first block to obtain a decoded image value of the first block; 以所述第 1块的解码图像值为参考, 对所述第 2块的编码比特信息进行预 测解码处理, 得到第 2块的解码图像值;  Referring to the decoded image value of the first block, performing prediction decoding processing on the encoded bit information of the second block to obtain a decoded image value of the second block; 以所述第 2块的解码图像值为参考, 对所述第 3块的编码比特信息进行预 测解码处理, 得到第 3块的解码图像值;  Deriving and decoding the coded bit information of the third block with reference to the decoded picture value of the second block to obtain a decoded picture value of the third block; 将所述宏块中各块的解码图像值进行上采样合成处理, 得到所述宏块的解 码图像值。 The decoded image values of the blocks in the macroblock are subjected to upsampling synthesis processing to obtain decoded image values of the macroblocks. 12、 根据权利要求 11所述的视频图像解码处理方法, 其特征在于, 所述以第 1块的解码图像值为参考, 对所述第 2块进行预测解码处理, 包 括: 对所述第 1块的解码图像值进行插值处理; The video image decoding processing method according to claim 11, wherein the predicting and decoding the second block by referring to the decoded image value of the first block includes: The decoded image values of the block are subjected to interpolation processing; 所述以第 2块的解码图像值为参考, 对所述第 3块进行预测解码处理, 包 括: 对所述第 2块的解码图像值进行插值处理。  The predictive decoding process is performed on the third block with reference to the decoded image value of the second block, and includes: performing interpolation processing on the decoded image value of the second block. 13、 根据权利要求 11所述的视频图像解码处理方法, 其特征在于, 对所述 第 1块、 第 2块和第 3块中的任一块进行预测解码处理, 包括:  The video image decoding processing method according to claim 11, wherein the predictive decoding process is performed on any one of the first block, the second block, and the third block, including: 对各块的编码比特信息进行熵解码处理;  Entropy decoding processing on coded bit information of each block; 根据各块的量化参数值,对熵解码处理后的系数进行反量化处理, 所述第 1 块的量化参数值小于或等于所述第 2块的量化参数值、 且所述第 2块的量化参 数值小于或等于所述第 3块的量化参数值;  Performing inverse quantization processing on the coefficients after the entropy decoding process according to the quantization parameter values of the blocks, wherein the quantization parameter value of the first block is less than or equal to the quantization parameter value of the second block, and the quantization of the second block The parameter value is less than or equal to the quantization parameter value of the third block; 对反量化处理后系数进行反变换处理。  The inverse transform processing is performed on the inverse quantization process. 14、根据权利要求 11所述的视频图像解码处理方法, 其特征在于,还包括: 根据各块的残差系数的不同分布规律, 对各块是否包含非零系数的指示信 息进行相应的熵解码处理, 所述第 1块出现非零系数的概率, 大于所述第 2块 或第 3块出现非零系数的概率。  The video image decoding processing method according to claim 11, further comprising: performing corresponding entropy decoding on whether each block includes non-zero coefficient indication information according to different distribution rules of residual coefficients of each block Processing, the probability that the first block has a non-zero coefficient is greater than the probability that the second block or the third block has a non-zero coefficient. 15、 根据权利要求 11所述的视频图像解码处理方法, 其特征在于, 所述第 1块包括多个子块;对所述第 1块的编码比特信息进行帧内预测解码处理,包括: 对各子块的编码比特信息进行解码处理, 得到各子块的解码图像值; 对各子块的解码图像值进行合成处理, 得到所述第 1块的解码图像值。 The video image decoding processing method according to claim 11, wherein the first block includes a plurality of sub-blocks, and the intra-prediction decoding process is performed on the coded bit information of the first block, including: The coded bit information of the sub-block is decoded to obtain a decoded image value of each sub-block; and the decoded image values of the respective sub-blocks are combined to obtain a decoded image value of the first block. 16、 根据权利要求 11所述的视频图像解码处理方法, 其特征在于, 所述第 1块至少包括: 第 1子块、 第 2子块和第 3子块; 所述对第 1块的编码比特信息 进行帧内预测解码处理, 得到所述第 1块的解码图像值, 包括: 对所述第 1子块进行帧内预测解码处理, 得到所述第 1子块的解码图像值; 以所述第 1子块的解码图像值为参考进行帧内预测解码处理, 得到第 2子 块的解码图像值; The video image decoding processing method according to claim 11, wherein the first block includes at least: a first sub-block, a second sub-block, and a third sub-block; and the encoding of the first block The bit information is subjected to an intra prediction decoding process, and the decoded image value of the first block is obtained, including: Performing intra prediction decoding processing on the first sub-block to obtain a decoded image value of the first sub-block; performing intra-prediction decoding processing on the decoded image value of the first sub-block to obtain a second sub-block The decoded image value of the block; 以所述第 2子块的解码图像值为参考进行帧内预测解码处理, 得到第 3子 块的解码图像值;  Performing intra prediction decoding processing with reference to the decoded image value of the second sub-block to obtain a decoded image value of the third sub-block; 将各子块的解码图像值进行上采样合成处理, 得到所述第 1 块的解码图像 值。  The decoded image values of the respective sub-blocks are subjected to up-sampling synthesis processing to obtain decoded image values of the first block. 17、 根据权利要求 16所述的视频图像解码处理方法, 其特征在于, 对所述 第 1子块、 第 2子块和第 3子块中的任一子块进行预测解码处理, 包括:  The video image decoding processing method according to claim 16, wherein performing predictive decoding processing on any one of the first sub-block, the second sub-block, and the third sub-block includes: 对各子块的编码比特信息进行熵解码处理;  Entropy decoding processing on coded bit information of each sub-block; 根据各子块的量化参数值, 对熵解码处理后的系数进行反量化处理, 所述 第 1子块的量化参数值小于或等于所述第 2子块的量化参数值、 且所述第 2子 块的量化参数值小于或等于所述第 3子块的量化参数值;  Performing inverse quantization processing on the coefficients after the entropy decoding process according to the quantization parameter values of the respective sub-blocks, wherein the quantization parameter value of the first sub-block is less than or equal to the quantization parameter value of the second sub-block, and the second parameter The quantization parameter value of the sub-block is less than or equal to the quantization parameter value of the third sub-block; 对反量化处理后系数进行反变换处理。  The inverse transform processing is performed on the inverse quantization process. 18、根据权利要求 16所述的视频图像解码处理方法, 其特征在于, 还包括: 根据各子块的残差系数的不同分布规律, 对各子块是否包含非零系数的指 示信息进行相应的熵解码处理, 所述第 1 子块出现非零系数的概率, 大于所述 第 2子块或第 3子块出现非零系数的概率。  The video image decoding processing method according to claim 16, further comprising: performing corresponding information on whether each sub-block includes non-zero coefficients according to different distribution rules of residual coefficients of each sub-block In the entropy decoding process, a probability that the first sub-block exhibits a non-zero coefficient is greater than a probability that the second sub-block or the third sub-block exhibits a non-zero coefficient. 19、 根据权利要求 11所述的视频图像解码处理方法, 其特征在于, 所述第 1块中的像素至少包含该宏块中右下角的像素。所述第 2块中的右下角的像素与 所述第 1块中的右下角的像素在水平或垂直方向上。 所述第 3块中的右下角的 像素与所述第 1块中的右下角的像素处于所述宏块的对角线方向上。  The video image decoding processing method according to claim 11, wherein the pixels in the first block include at least pixels in a lower right corner of the macroblock. The pixel in the lower right corner of the second block and the pixel in the lower right corner of the first block are in a horizontal or vertical direction. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block. 20、 一种视频图像编码处理装置, 其特征在于, 包括: 下采样分解模块, 用于对视频图像帧的宏块进行下采样分解得到多个块, 所述多个块至少包括: 第 1块、 第 2块和第 3块; 20. A video image encoding processing apparatus, comprising: a downsampling decomposition module, configured to perform downsampling on a macroblock of a video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block; 帧内预测编码模块, 用于对所述第 1 块进行帧内预测编码处理, 得到所述 第 1块的重构图像值;  An intra prediction coding module, configured to perform intra prediction coding processing on the first block to obtain a reconstructed image value of the first block; 第一像素间预测编码模块, 用于以所述第 1 块的重构图像值为参考, 对所 述第 2块进行预测编码处理, 得到所述第 2块的重构图像值;  a first inter-pixel predictive coding module, configured to perform predictive coding processing on the second block with reference to a reconstructed image value of the first block, to obtain a reconstructed image value of the second block; 第二像素间预测编码模块, 用于以所述第 2块的重构图像值为参考, 对所 述第 3块进行预测编码处理。  The second inter-pixel predictive coding module is configured to perform predictive coding processing on the third block with reference to the reconstructed image value of the second block. 21、 根据权利要求 20所述的视频图像编码处理装置, 其特征在于, 所述第一像素间预测编码模块, 还用于对所述第 1 块的重构图像值进行插 值处理;  The video image encoding processing device according to claim 20, wherein the first inter-pixel predictive encoding module is further configured to perform interpolation processing on the reconstructed image values of the first block; 所述第二像素间预测编码模块, 还用于对所述第 2块的重构图像值进行插 值处理。  The second inter-pixel predictive coding module is further configured to perform interpolation processing on the reconstructed image values of the second block. 22、 根据权利要求 20所述的视频图像编码处理装置, 其特征在于, 所述第 1块中的像素至少包含该宏块中右下角的像素。所述第 2块中的右下角的像素与 所述第 1块中的右下角的像素在水平或垂直方向上。 所述第 3块中的右下角的 像素与所述第 1块中的右下角的像素处于所述宏块的对角线方向上。  The video image encoding processing device according to claim 20, wherein the pixels in the first block include at least pixels in a lower right corner of the macroblock. The pixel in the lower right corner of the second block and the pixel in the lower right corner of the first block are in a horizontal or vertical direction. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block. 23、 一种视频图像解码处理装置, 其特征在于, 包括:  A video image decoding processing apparatus, comprising: 比特信息获取模块, 用于获取视频图像帧的宏块的编码比特信息, 所述宏 块至少包括: 第 1块, 第 2块和第 3块;  a bit information acquiring module, configured to acquire coded bit information of a macroblock of a video image frame, where the macroblock includes at least: a first block, a second block, and a third block; 帧内预测解码模块, 用于对所述第 1 块的编码比特信息进行帧内预测解码 处理, 得到所述第 1块的解码图像值;  An intra prediction decoding module, configured to perform intra prediction decoding processing on the encoded bit information of the first block to obtain a decoded image value of the first block; 第一像素间预测解码模块, 用于以所述第 1 块的解码图像值为参考, 对所 述第 2块的编码比特信息进行预测解码处理, 得到第 2块的解码图像值; 第二像素间预测解码模块, 用于以所述第 2块的解码图像值为参考, 对所 述第 3块的编码比特信息进行预测解码处理, 得到第 3块的解码图像值; a first inter-pixel prediction decoding module, configured to use the decoded image value of the first block as a reference, The coded bit information of the second block is subjected to predictive decoding processing to obtain a decoded image value of the second block; and the second inter-pixel predictive decoding module is configured to reference the decoded image value of the second block to the third The coded bit information of the block is subjected to predictive decoding processing to obtain a decoded image value of the third block; 解码图像生成模块, 用于将所述宏块中各块的解码图像值进行上采样合成 处理, 得到所述宏块的解码图像值。  And a decoded image generating module, configured to perform upsampling synthesis processing on the decoded image values of the blocks in the macroblock to obtain decoded image values of the macroblock. 24、 根据权利要求 23所述的视频图像解码处理装置, 其特征在于, 所述第一像素间预测解码模块, 还用于对所述第 1 块的解码图像值进行插 值处理;  The video image decoding processing device according to claim 23, wherein the first inter-pixel predictive decoding module is further configured to perform interpolation processing on the decoded image values of the first block; 所述第二像素间预测解码模块, 还用于对所述第 2块的解码图像值进行插 值处理。  The second inter-pixel predictive decoding module is further configured to perform interpolation processing on the decoded image values of the second block. 25、 根据权利要求 23所述的视频图像解码处理装置, 其特征在于, 所述第 1块中的像素至少包含该宏块中右下角的像素。所述第 2块中的右下角的像素与 所述第 1块中的右下角的像素在水平或垂直方向上。 所述第 3块中的右下角的 像素与所述第 1块中的右下角的像素处于所述宏块的对角线方向上。  The video image decoding processing device according to claim 23, wherein the pixels in the first block include at least pixels in a lower right corner of the macroblock. The pixel in the lower right corner of the second block and the pixel in the lower right corner of the first block are in a horizontal or vertical direction. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block. 26、 一种视频图像编解码系统, 其特征在于, 包括:  26. A video image encoding and decoding system, comprising: 视频图像编码处理装置, 用于对视频图像帧的宏块进行下采样分解得到多 个块, 所述多个块至少包括: 第 1块、 第 2块和第 3块; 对所述第 1块进行帧 内预测编码处理, 得到所述第 1块的重构图像值; 以所述第 1块的重构图像值 为参考, 对所述第 2块进行预测编码处理, 得到所述第 2块的重构图像值; 以 所述第 2块的重构图像值为参考, 对所述第 3块进行预测编码处理;  a video image encoding processing apparatus, configured to downsample a macroblock of a video image frame to obtain a plurality of blocks, where the plurality of blocks include at least: a first block, a second block, and a third block; Performing intra prediction coding processing to obtain a reconstructed image value of the first block; performing prediction encoding processing on the second block with reference to the reconstructed image value of the first block to obtain the second block a reconstructed image value; performing predictive encoding processing on the third block with reference to the reconstructed image value of the second block; 视频图像解码处理装置, 用于获取所述宏块的编码比特信息, 所述宏块至 少包括: 所述第 1块, 所述第 2块和所述第 3块; 对所述第 1块的编码比特信 息进行帧内预测解码处理, 得到所述第 1块的解码图像值; 以所述第 1块的解 码图像值为参考, 对所述第 2 块的编码比特信息进行预测解码处理, 得到第 2 块的解码图像值; 以所述第 2块的解码图像值为参考, 对所述第 3块的编码比 特信息进行预测解码处理, 得到第 3 块的解码图像值; 将所述宏块中各块的解 码图像值进行上采样合成处理, 得到所述宏块的解码图像值。 a video image decoding processing apparatus, configured to acquire coded bit information of the macroblock, where the macroblock includes at least: the first block, the second block, and the third block; and the first block Encoding bit information is subjected to intra prediction decoding processing to obtain a decoded image value of the first block; and a solution to the first block The code image value is a reference, and the coded bit information of the second block is subjected to predictive decoding processing to obtain a decoded image value of the second block; and the decoded image value of the second block is referred to, and the third block is referred to The coded bit information is subjected to predictive decoding processing to obtain a decoded image value of the third block; and the decoded image value of each block in the macro block is subjected to upsampling synthesis processing to obtain a decoded image value of the macroblock. 27、 如权利要求 26中所述的视频图像编解码系统, 其特征在于, 所述第 1 块中的像素至少包含该宏块中右下角的像素。 所述第 2块中的右下角的像素与 所述第 1块中的右下角的像素在水平或垂直方向上。 所述第 3块中的右下角的 像素与所述第 1块中的最右下角的像素处于所述宏块的对角线方向上。  27. The video image encoding and decoding system according to claim 26, wherein the pixels in the first block include at least pixels in a lower right corner of the macroblock. The pixel in the lower right corner of the second block and the pixel in the lower right corner of the first block are in a horizontal or vertical direction. The pixel in the lower right corner of the third block and the pixel in the lower right corner of the first block are in the diagonal direction of the macro block.
PCT/CN2010/078564 2009-11-10 2010-11-09 Method, apparatus and system for video image encoding and decoding Ceased WO2011057557A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN200910211704.2 2009-11-10
CN 200910211704 CN101710990A (en) 2009-11-10 2009-11-10 Video image encoding and decoding method, device and encoding and decoding system
CN201010147573.9 2010-04-09
CN 201010147573 CN102055972B (en) 2009-11-10 2010-04-09 Video image coding/decoding methods and devices as well as video image coding and decoding system

Publications (1)

Publication Number Publication Date
WO2011057557A1 true WO2011057557A1 (en) 2011-05-19

Family

ID=42403747

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/078564 Ceased WO2011057557A1 (en) 2009-11-10 2010-11-09 Method, apparatus and system for video image encoding and decoding

Country Status (2)

Country Link
CN (2) CN101710990A (en)
WO (1) WO2011057557A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10205964B2 (en) 2011-06-20 2019-02-12 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
EP4633164A3 (en) * 2018-03-29 2026-01-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Intra-prediction mode concept for block-wise picture coding

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101710990A (en) * 2009-11-10 2010-05-19 华为技术有限公司 Video image encoding and decoding method, device and encoding and decoding system
CN102595117B (en) * 2011-01-14 2014-03-12 清华大学 Method and device for coding and decoding
CN103533331B (en) * 2013-10-18 2017-01-11 华为技术有限公司 Encoding and decoding method and device for image
US10104397B2 (en) * 2014-05-28 2018-10-16 Mediatek Inc. Video processing apparatus for storing partial reconstructed pixel data in storage device for use in intra prediction and related video processing method
EP2980793A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder, system and methods for encoding and decoding
CN106254719B (en) * 2016-07-25 2018-11-30 清华大学深圳研究生院 A kind of light field image compression method based on linear transformation and image interpolation
CN106851288B (en) * 2017-02-27 2020-09-15 北京奇艺世纪科技有限公司 Intra-frame predictive coding method and device
CN111836043B (en) * 2019-04-15 2025-10-17 中兴通讯股份有限公司 Method and device for predicting and decoding code block
CN112235568B (en) * 2019-07-15 2024-03-26 华为技术有限公司 Image encoding method, decoding method, device and storage medium
CN117768655A (en) * 2021-02-25 2024-03-26 华为技术有限公司 Audio, video or image layered compression method and device
CN114629962B (en) * 2021-08-05 2024-11-15 中国石油天然气集团有限公司 Method and device for compressing, decompressing and transmitting downhole drilling data

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006004331A1 (en) * 2004-07-07 2006-01-12 Samsung Electronics Co., Ltd. Video encoding and decoding methods and video encoder and decoder
US20070071087A1 (en) * 2005-08-30 2007-03-29 Samsung Electronics Co., Ltd. Apparatus and method for video encoding and decoding and recording medium having recorded theron program for the method
CN101389028A (en) * 2008-10-15 2009-03-18 北京航空航天大学 A Video Intra-Frame Coding Method Based on Spatial Domain Decomposition
CN101459840A (en) * 2007-12-13 2009-06-17 华为技术有限公司 Encoding and decoding method, apparatus and system for video image
CN101710990A (en) * 2009-11-10 2010-05-19 华为技术有限公司 Video image encoding and decoding method, device and encoding and decoding system

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20050078709A (en) * 2004-01-31 2005-08-08 삼성전자주식회사 Method for scalable video coding and decoding, and apparatus for the same
WO2005093661A2 (en) * 2004-03-09 2005-10-06 Thomson Research Funding Corporation Reduced resolution update mode for advanced video coding
US8432975B2 (en) * 2008-01-18 2013-04-30 Mediatek Inc. Apparatus and method for processing a picture frame
CN101252686B (en) * 2008-03-20 2010-04-14 上海交通大学 Method and system for lossless encoding and decoding in video frames based on interleaved prediction

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006004331A1 (en) * 2004-07-07 2006-01-12 Samsung Electronics Co., Ltd. Video encoding and decoding methods and video encoder and decoder
US20070071087A1 (en) * 2005-08-30 2007-03-29 Samsung Electronics Co., Ltd. Apparatus and method for video encoding and decoding and recording medium having recorded theron program for the method
CN101459840A (en) * 2007-12-13 2009-06-17 华为技术有限公司 Encoding and decoding method, apparatus and system for video image
CN101389028A (en) * 2008-10-15 2009-03-18 北京航空航天大学 A Video Intra-Frame Coding Method Based on Spatial Domain Decomposition
CN101710990A (en) * 2009-11-10 2010-05-19 华为技术有限公司 Video image encoding and decoding method, device and encoding and decoding system

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10205964B2 (en) 2011-06-20 2019-02-12 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US10516897B2 (en) 2011-06-20 2019-12-24 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US10536717B2 (en) 2011-06-20 2020-01-14 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US10904569B2 (en) 2011-06-20 2021-01-26 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US10979734B2 (en) 2011-06-20 2021-04-13 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US10979735B2 (en) 2011-06-20 2021-04-13 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US10986368B2 (en) 2011-06-20 2021-04-20 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US11689742B2 (en) 2011-06-20 2023-06-27 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US11711541B2 (en) 2011-06-20 2023-07-25 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US12149737B2 (en) 2011-06-20 2024-11-19 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
US12244855B2 (en) 2011-06-20 2025-03-04 Electronics And Telecommunications Research Institute Image encoding/decoding method using prediction block and apparatus for same
EP4633164A3 (en) * 2018-03-29 2026-01-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Intra-prediction mode concept for block-wise picture coding

Also Published As

Publication number Publication date
CN102055972A (en) 2011-05-11
CN102055972B (en) 2013-08-07
CN101710990A (en) 2010-05-19

Similar Documents

Publication Publication Date Title
WO2011057557A1 (en) Method, apparatus and system for video image encoding and decoding
CN102771125B (en) Encoding/decoding method and device using tree structure
JP5727114B1 (en) Method and apparatus for encoding / decoding picture boundary coding unit
CN104094597B (en) Adaptive transformation method based on intra-frame prediction and device using the method
JP5650742B2 (en) Video encoding method and apparatus, and video decoding method and apparatus
CN116915984A (en) Image encoding/decoding method and transmitting method
KR101691978B1 (en) Method and apparatus for image encoding/decoding
CN116366843A (en) Image encoding/decoding method and apparatus using sample filtering
CN117354501A (en) Image encoding/decoding method, transmitting method, and digital storage medium
CN115550651A (en) Method for encoding and decoding images and computer-readable medium storing bitstream
CN116896633A (en) Video encoding/decoding method and device and recording medium for storing bit stream
CN104811718A (en) Adaptive intra-prediction encoding and decoding method
CN103250412A (en) Image encoding/decoding method for rate-distortion optimization and device for performing the method
CN102740079B (en) Quality scalable coding based on HEVC standard rate-distortion optimization
JP2020145626A (en) Image encoder, image encoding method, image decoder, and image decoding method
WO2012008180A1 (en) Image encoding device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10829521

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10829521

Country of ref document: EP

Kind code of ref document: A1