WO2018169267A1 - Dispositif et procédé de codage ou de décodage d'image - Google Patents
Dispositif et procédé de codage ou de décodage d'image Download PDFInfo
- Publication number
- WO2018169267A1 WO2018169267A1 PCT/KR2018/002891 KR2018002891W WO2018169267A1 WO 2018169267 A1 WO2018169267 A1 WO 2018169267A1 KR 2018002891 W KR2018002891 W KR 2018002891W WO 2018169267 A1 WO2018169267 A1 WO 2018169267A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- block
- qtbt
- depth
- node
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/119—Adaptive subdivision aspects, e.g. subdivision of a picture into rectangular or non-rectangular coding blocks
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/186—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
- H04N19/423—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention relates to image encoding or decoding for efficiently encoding an image. More specifically, the present invention relates to a technique for signaling partition information about Quadtree plus Binary tree (QTBT) based block partitioning.
- QTBT Quadtree plus Binary tree
- a coding tree unit is a coding tree for reflecting various local characteristics in an image, and is divided into a coding unit (CU) using a quadtree structure.
- CU coding unit
- intra and inter coding prediction is also performed in the CU unit.
- Each CU is again divided into a PU (Prediction Unit).
- PU Prediction Unit
- TU Transformation Unit
- the QTBT structure removes existing CU, PU, and TU concepts, and proposes various CU partition shapes to match various local features of video data. That is, in QTBT, a CU may be defined to have a square or rectangular shape.
- the CTU size was 64 in HEVC
- the necessity of larger block sizes such as 128 and 256 has been discussed as the resolution of the image increases, and it is necessary to signal such segmentation information when the block is divided into at least 4x4 in the CTU.
- the amount of data that is added increases.
- An object of the present invention is to efficiently signal partitioning information about a quadtree plus binary tree (QTBT) based block partitioning.
- QTBT quadtree plus binary tree
- the QTBT block partitioning structure is a structure in which a binary tree is rooted from a leaf node of a quadtree.
- the encoding of the split information may include determining, from the determined block partitioning structure, the highest depth among depths in which at least one leaf node of the quadtree exists (hereinafter, referred to as “highest QT leaf node depth”). ; And encoding depth information indicating the highest QT leaf node depth as quadtree split information from a root node to parent nodes of nodes belonging to the highest QT leaf node depth in the determined block partitioning structure.
- the block of the encoded image data is divided into a plurality of divided blocks according to a quadtree plus binarytree (QTBT) block partitioning structure, and the QTBT block partitioning structure is formed from a binary tree (leaf node) of a quadtree (quadtree).
- QTBT quadtree plus binarytree
- the splitting information includes depth information indicating the highest depth among the depths in which at least one leaf node of the quadtree exists in the QTBT block partitioning structure (hereinafter, referred to as “highest QT leaf node depth”).
- the determining of the QTBT includes quadtree splitting for all nodes from a root node of the QTBT to a parent node of nodes belonging to the highest QT leaf node depth using depth information indicating the highest QT leaf node depth. It includes the step of performing.
- an apparatus for decoding image data includes a memory and one or more processors, wherein the one or more processors are configured to parse the block of encoded image data and the partition information associated with the block of the image data and determine the QTBT using the partition information. And decoding the block of the encoded image data for each leaf node of the QTBT.
- FIG. 1 is a diagram illustrating an example of a quad tree splitting structure for a CTU and a tree structure thereof.
- FIG. 2 is a block diagram of an image encoding apparatus according to an embodiment of the present invention.
- FIG. 3 shows an example of a plurality of intra prediction modes.
- FIG. 4 is an exemplary diagram of neighboring blocks of a current block.
- FIG 5 illustrates an image decoding apparatus according to an embodiment of the present invention.
- FIG. 6 is a flowchart illustrating an exemplary operation of encoding an image by the image encoding apparatus.
- FIG. 7 is a flowchart illustrating an exemplary operation of decoding an image by the image decoding device.
- FIG. 8 is a diagram illustrating an example of a QTBT splitting structure of a CTU and a tree structure thereof.
- FIG. 9 is a diagram illustrating the number of bits that can be represented as BT in one CU.
- FIG. 10 is a diagram illustrating an exemplary QTBT splitting structure of a CTU in a tree structure.
- FIG. 2 is a block diagram of an image encoding apparatus according to an embodiment of the present invention.
- the image encoding apparatus includes a block divider 210, a predictor 220, a subtractor 230, a transformer 240, a quantizer 245, an encoder 250, an inverse quantizer 260, and an inverse transform unit ( 265, an adder 270, a filter unit 280, and a memory 290.
- each component may be implemented as a hardware chip, or may be implemented in software and implemented so that the microprocessor executes a function of software corresponding to each component.
- the block dividing unit 210 After dividing each picture constituting the image into a plurality of coding tree units (CTUs), the block dividing unit 210 recursively divides the CTUs using a tree structure.
- a leaf node in the tree structure becomes a CU (coding unit) which is a basic unit of coding.
- the tree structure is a quadtree (QT) in which a parent node (or parent node) is divided into four child nodes (or child nodes) of the same size, or such a QT structure and a parent node are divided into two child nodes.
- QTBT QuadTree plus BinaryTree
- the CTU may be first divided into a QT structure.
- Quadtree splitting may be repeated until the size of the splitting block reaches the minimum block size (MinQTSize) of the leaf nodes allowed in QT. If the leaf node of the quadtree is not larger than the maximum block size (MaxBTSize) of the root node allowed in BT, it may be further partitioned into the BT structure.
- MaxBTSize maximum block size of the root node allowed in BT
- BT there may be a plurality of partition types. For example, in some examples, there may be two types of horizontally dividing a block of a node into two blocks of the same size (ie, symmetric horizontal splitting) and vertically dividing (ie, symmetric vertical splitting).
- the asymmetrical form may include dividing a block of the node into two rectangular blocks having a size ratio of 1: 3, or dividing the block of the node in a diagonal direction.
- the partition information generated by the block divider 210 by dividing the CTU by the QTBT structure is encoded by the encoder 250 and transmitted to the image decoding apparatus.
- Examples of the present disclosure generally relate to a technique for signaling QTBT block partition information and a technique for determining block partitioning from QTBT block partition information.
- certain techniques of the present disclosure may be performed by the encoder 250. That is, for example, the encoder 250 may perform the techniques of the present disclosure described with reference to FIGS. 8 to 10 below.
- one or more other units of the encoding apparatus may be responsible for additionally or alternatively performing the techniques of this disclosure.
- a block corresponding to a CU that is, a leaf node of QTBT
- a 'current block' a block corresponding to a CU (that is, a leaf node of QTBT) to be encoded or decoded
- the prediction unit 220 generates a prediction block by predicting the current block.
- the predictor 220 includes an intra predictor 222 and an inter predictor 224.
- the intra predictor 222 predicts pixels in the current block by using pixels (reference pixels) positioned around the current block in the current picture including the current block. There are a plurality of intra prediction modes according to the prediction direction, and the peripheral pixels to be used and the equations are defined differently according to each prediction mode. In particular, the intra predictor 222 may determine an intra prediction mode to be used to encode the current block. In some examples, intra prediction unit 222 may encode the current block using several intra prediction modes and select an appropriate intra prediction mode to use from the tested modes. For example, intra predictor 222 calculates rate distortion values using rate-distortion analysis for several tested intra prediction modes, and has the best rate distortion characteristics among the tested modes. Intra prediction mode may be selected.
- FIG. 3 shows an example of a plurality of intra prediction modes.
- the plurality of intra prediction modes may include two non-directional modes (planar mode and DC mode) and 65 directional modes.
- the intra predictor 222 selects one intra prediction mode from among the plurality of intra prediction modes, and predicts the current block by using a peripheral pixel (reference pixel) and an operation formula determined according to the selected intra prediction mode.
- Information on the selected intra prediction mode is encoded by the encoder 250 and transmitted to the image decoding apparatus.
- the intra prediction unit 222 may efficiently encode intra prediction mode information indicating which mode of the plurality of intra prediction modes is used as the intra prediction mode of the current block. Some of the most probable modes as the intra prediction mode of may be determined as the most probable mode (MPM). In addition, mode information indicating whether the intra prediction mode of the current block is selected from the MPM is generated and transmitted to the encoder 250. In general, when the intra prediction mode of the current block is selected from the MPMs, the first intra identification information for indicating which mode of the MPMs is selected as the intra prediction mode of the current block is transmitted to the encoder.
- MPM most probable mode
- the intra prediction unit 222 instead of explicitly signaling which mode among the MPMs and / or non-MPMs is selected as the intra prediction mode for predicting the current block. Can group the MPMs and / or non-MPMs and signal the index of the group to which the intra mode for predicting the current block belongs.
- the present invention is not limited thereto, and the number of MPMs included in the MPM list may be selected within a range of 3 to 10.
- the MPM list is constructed using the intra prediction mode of neighboring blocks of the current block.
- the neighboring block may be, for example, all or some of the left block L, the upper block A, the lower left block BL, the upper right block AR, and the upper left block AL of the current block. It may include.
- the intra prediction mode of these neighboring blocks is included in the MPM list.
- the intra prediction mode of the valid blocks in the order of the left block (L), the top block (A), the bottom left block (BL), the top right block (AR), and the top left block (AL) is included in the MPM list.
- the candidate is configured by adding a planar mode and a DC mode to the intra prediction modes of the blocks.
- valid modes in the order of the left block (L), the top block (A), the planar mode, the DC mode, the bottom left block (BL), the top right block (AR), and the top left block (AL) may be added to the MPM list. have.
- the MPM list includes only different intra prediction modes. That is, when a duplicated mode is present, only one of them is included in the MPM list.
- the MPM may be derived by adding -1 or +1 to the directional modes in the list.
- the number of insufficient modes is added to the MPM list in the order of vertical mode, horizontal mode, diagonal mode, and the like. You may.
- the inter prediction unit 224 searches for the block most similar to the current block in the coded and decoded reference picture before the current picture, and generates a prediction block for the current block using the searched block. A motion vector corresponding to a displacement between the current block in the current picture and the prediction block in the reference picture is generated.
- the motion information including the information about the reference picture and the motion vector used to predict the current block is encoded by the encoder 250 and transmitted to the image decoding apparatus.
- the subtractor 230 subtracts the prediction block generated by the intra predictor 222 or the inter predictor 224 from the current block to generate a residual block.
- the converter 240 converts the residual signal in the residual block having pixel values of the spatial domain into a transform coefficient of the frequency domain.
- the transform unit 240 may convert the residual signals in the residual block using the size of the current block as a conversion unit, or divide the residual block into a plurality of smaller subblocks and convert the residual signals in a subblock-sized transform unit. You can also convert. There may be various ways of dividing the residual block into smaller subblocks. For example, it may be divided into sub-blocks of a predetermined same size, or a quadtree (QT) scheme may be used in which the residual block is a root node.
- QT quadtree
- the quantization unit 245 quantizes the transform coefficients output from the transform unit 240, and outputs the quantized transform coefficients to the encoder 250.
- the encoder 250 generates a bitstream by encoding the quantized transform coefficients by using an encoding method such as CABAC.
- the encoder 250 encodes information such as CTU size, MinQTSize, MaxBTSize, MaxBTDepth, MinBTSize, QT split flag, BT split flag, split type, etc. related to block division, so that the image decoding apparatus is the same as the image encoding apparatus. Allows you to split blocks.
- the encoder 250 encodes information about a prediction type indicating whether a current block is encoded by intra prediction or inter prediction, and encodes intra prediction information or inter prediction information according to the prediction type.
- the inverse quantizer 260 inversely quantizes the quantized transform coefficients output from the quantizer 245 to generate transform coefficients.
- the inverse transformer 265 restores the residual block by converting the transform coefficients output from the inverse quantizer 260 from the frequency domain to the spatial domain.
- the adder 270 reconstructs the current block by adding the reconstructed residual block and the predicted block generated by the predictor 220.
- the pixels in the reconstructed current block are used as reference pixels when intra prediction of the next order of blocks.
- the filter unit 280 deblocks and filters the boundary between the reconstructed blocks in order to remove blocking artifacts that occur due to encoding / decoding of blocks. When all the blocks in a picture are reconstructed, the reconstructed picture is used as a reference picture for inter prediction of a block in a picture to be encoded later.
- FIG 5 illustrates an image decoding apparatus according to an embodiment of the present invention.
- the image decoding apparatus includes a decoder 510, an inverse quantizer 520, an inverse transformer 530, a predictor 540, an adder 550, a filter 560, and a memory 570.
- the image decoding apparatus may be implemented by each component as a hardware chip, or may be implemented by software and a microprocessor to execute a function of software corresponding to each component.
- the decoder 510 decodes the bitstream received from the image encoding apparatus, extracts information related to block division, determines a current block to be decoded, and includes prediction information and residual signal information necessary for reconstructing the current block. Extract
- the decoder 510 extracts information about the CTU size from a high level syntax such as a Sequence Parameter Set (SPS) or a Picture Parameter Set (PPS) to determine the size of the CTU and determine a picture of the determined size.
- a high level syntax such as a Sequence Parameter Set (SPS) or a Picture Parameter Set (PPS) to determine the size of the CTU and determine a picture of the determined size.
- SPS Sequence Parameter Set
- PPS Picture Parameter Set
- Split into CTUs The CTU is determined as the highest layer of the tree structure, that is, the root node, and the CTU is partitioned using a tree structure (eg, a QTBT structure) by extracting partition information about the CTU.
- a tree structure eg, a QTBT structure
- Examples of the present disclosure generally relate to a technique for signaling QTBT block partition information and a technique for determining block partitioning from QTBT block partition information.
- certain techniques of the present disclosure may be performed by the decoder 510. That is, for example, the decoder 510 may perform the techniques of this disclosure described with respect to FIGS. 8 through 10 below.
- one or more other units of the decryption apparatus may be responsible for additionally or alternatively performing the techniques of this disclosure.
- the decoder 510 determines the current block (current block) to be decoded by splitting the tree structure, the decoder 510 extracts information about a prediction type indicating whether the current block is intra predicted or inter predicted.
- the decoder 510 extracts a syntax element for intra prediction information (intra prediction mode) of the current block. First, the decoder 510 extracts mode information (ie, an MPM flag) indicating whether an intra prediction mode of a current block is selected from the MPMs. Also, in general, when the intra mode encoding information indicates that the intra prediction mode of the current block is selected from the MPMs, the first intra identification information for indicating which mode of the MPMs is selected as the intra prediction mode of the current block is extracted. And if the intra mode encoding information indicates that the intra prediction mode of the current block is not selected among the MPMs, a second intra for indicating which mode other than the MPM is selected as the intra prediction mode of the current block.
- mode information ie, an MPM flag
- the intra prediction unit 222 may indicate an intra identification information indicating which mode among MPMs and / or non-MPMs is selected as an intra prediction mode for predicting a current block.
- group MPMs and / or non-MPMs and extract intra identification information eg, group index, etc. indicating whether an intra mode for predicting the current block belongs.
- the decoder 510 extracts information on the quantized transform coefficients of the current block as information on the residual signal.
- the inverse quantizer 520 inversely quantizes the quantized transform coefficients, and the inverse transformer 530 inversely transforms the inverse quantized transform coefficients from the frequency domain to the spatial domain to generate a residual block for the current block.
- the predictor 540 includes an intra predictor 542 and an inter predictor 544.
- the intra predictor 542 is activated when the intra prediction is the prediction type of the current block
- the inter predictor 544 is activated when the intra prediction is the prediction type of the current block.
- the intra predictor 542 determines the intra prediction mode of the current block among the plurality of intra prediction modes from the syntax elements for the intra prediction mode extracted from the decoder 510, and references pixels around the current block according to the intra prediction mode. Predict the current block using
- the intra predictor 542 constructs an MPM list including a predetermined number of MPMs from neighboring blocks of the current block.
- the method of constructing the MPM list is the same as that of the intra predictor 222 of FIG. 2.
- the intra prediction unit 542 may assign the first intra identification information among the MPMs in the MPM list. Select the MPM indicated by the intra prediction mode of the current block. On the other hand, if the mode information indicates that the intra prediction mode of the current block is not selected from the MPM, the intra prediction mode of the current block is determined among the remaining intra prediction modes except the MPMs in the MPM list using the second intra identification information. do.
- the intra prediction unit 222 of the image encoding apparatus may determine the final intra mode (that is, the intra mode for predicting the current block) by evaluating the intra modes belonging to the corresponding group. For example, in some examples, the intra predictor 542 may generate a reconstructed block for a plurality of intra modes belonging to a group, and evaluate the reconstructed blocks to determine a final intra mode.
- the inter predictor 544 determines motion information of the current block by using a syntax element for the intra prediction mode extracted from the decoder 510, and predicts the current block by using the determined motion information.
- the adder 550 reconstructs the current block by adding the residual block output from the inverse transformer and the prediction block output from the inter predictor or the intra predictor.
- the pixels in the reconstructed current block are utilized as reference pixels in intra prediction of the block to be subsequently decoded.
- the filter unit 560 deblocks and filters the boundary between the reconstructed blocks in order to remove blocking artifacts that occur due to block-by-block decoding and stores them in the memory 570.
- the reconstructed picture is used as a reference picture for inter prediction of a block in a picture to be decoded later.
- the image encoding apparatus determines a QTBT block partitioning structure for encoding a block of image data (S610).
- the QTBT block partitioning structure is a structure in which a binary tree is rooted from a leaf node of a quadtree.
- the image encoding apparatus generates an encoded bitstream including the block of the image data and the segmentation information representing the determined block partitioning structure based on the determined block partitioning structure (S620).
- the image decoding apparatus parses a block of image data encoded from a bitstream and segmentation information associated with the block of the image data (S710).
- the block of the encoded image data is divided into a plurality of partitioned blocks according to a QTBT block partitioning structure, and the QTBT block partitioning structure is a structure in which a binary tree is routed from a leaf node of a quadtree.
- the image decoding apparatus determines the leaf nodes of the QTBT using the split information, and decodes the block of the encoded image data for each leaf node of the QTBT (S720).
- the techniques disclosed below relate to a technique for signaling QTBT block partition information applied to step S620 of FIG. 6 and a technique for determining block partitioning from QTBT block partition information applied to step S720 of FIG. 7.
- the techniques of this disclosure may be performed by, for example, the image encoding apparatus and the image decoding apparatus shown and described with respect to FIGS. 2 and 5. That is, in one example, the encoder described with reference to FIG. 2 may perform certain techniques described below when determining block partitioning to encode a block of image data. In another example, the decoder described with respect to FIG. 5 may perform certain techniques described below when determining block partitioning to decode a block of image data.
- the QTBT block partitioning can be used to partition a CTU into CUs. That is, the root node of the QTBT may be a CTU, and the leaf node of the QTBT may be a CU.
- the encoding apparatus signals the split information obtained by dividing the CTU into a plurality of CUs, and correspondingly, the decoding apparatus uses the signaled split information to convert the CTU into the plurality of CUs.
- Embodiments for determining partitioning to be divided are described.
- the CTU that is the root node may be first divided into a QT structure.
- the encoding apparatus uses a first flag (QT split flag, QT_split_flag) to indicate whether the root node of the QTBT is split into a QT structure.
- QT split flag QT_split_flag
- the QT division of each node may be repeated recursively.
- BT partitioning may be performed on blocks that are not QT partitioned.
- the encoding apparatus further uses a second flag (BT split flag, BT_split_flag) to indicate whether a block is split into a BT structure.
- the BT split may have a plurality of split types. In this case, when the block is divided into BT structures, the encoding apparatus further uses split type information indicating the BT split type
- FIG. 8 is a diagram illustrating an example of a QTBT splitting structure of a CTU and a tree structure thereof.
- the solid line indicates the division by the QT structure
- the dotted line indicates the division by the BT structure.
- BT splitting has two splitting types, that is, a splitting of a block of a node into two blocks of the same size and a splitting type vertically.
- QT_split_flag 1
- the block corresponding to the first node of the depth 1 is no longer divided by the QT, but is BT divided, and the BT division type is a vertical type.
- the block corresponding to the first node of depth 2 is a child node of the BT partitioned parent node (that is, the first node of depth 1), and is again BT split.
- the BT split may be recursively repeated, but child nodes of the BT split node are not QT split again. Therefore, in generating split information for child nodes of the BT split node, there is no need to explicitly signal that the block is not QT split. For example, in FIG.
- the BT partition type information 1 indicating the type is encoded.
- a decoding apparatus determines a partitioning for dividing a CTU into a plurality of CUs using split information signaled from the encoding apparatus in the present embodiment.
- the decoding apparatus first extracts a first flag (QT_split_flag) related to the division of the QT.
- QT_split_flag a first flag related to the division of the QT.
- the decoding apparatus divides the corresponding node into four nodes of lower depths.
- the second flag BT_split_flag and the split type information related to the splitting of the BT are extracted to split the corresponding leaf node into the BT structure.
- the demodulation device extracts a QT splitting flag QT_split_flag corresponding to the node of the highest depth of the QTBT structure. Since the value of the extracted QT split flag QT_split_flag is 1, the node of the highest depth is divided into four nodes of the lower depth (depth 1 of QT). Then, the QT splitting flag QT_split_flag of the first node of depth 1 is extracted. Since the extracted QT split flag QT_split_flag value is 0, the first node of depth 1 is no longer split into a QT structure.
- the process proceeds to the BT having the first node of the depth 1 of the QT as the root node of the BT.
- the BT split flag BT_split_flag corresponding to the root node of the BT ie, depth 0
- the root node of BT is split into two nodes of depth 1.
- split type information indicating whether a block corresponding to the root node of the BT is split vertically or horizontally is extracted. Since the split type information is 1, the block corresponding to the root node of BT is split vertically.
- the BT split flag (BT_split_flag) for the first node of the depth 1 split from the root node of BT is extracted. Since the BT split flag BT_split_flag is 1, split type information of a block of the first node of depth 1 is extracted. Since the partition type information of the block of the first node of depth 1 is 1, the block of the first node of depth 1 is vertically divided. Then, the BT split flag BT_split_flag of the second node of the depth 1 split from the root node of BT is extracted. Since the BT split flag BT_split_flag is 0, it is no longer split by BT.
- the decoder 510 first extracts the QT split flag QT_split_flag repeatedly to split the CTU into a QT structure.
- the BT split flag (BT_split_flag) is extracted for the leaf node of the QT, and when the BT split flag (BT_split_flag) indicates the split, split type information is extracted.
- the decoding apparatus may confirm that the CTU is divided by the encoding apparatus into a structure as shown in FIG.
- the QTBT splitting structure As the CTU size grows larger, in the QTBT splitting structure, it is highly likely that all nodes will be QT split again until the size of the split blocks decreases below a certain level.
- the number of times the QT division is repeated increases, the number of bits required to express these QT division information using the QT division flag is increased by n-th power of four.
- This embodiment aims to reduce the QT partitioning information of these nodes when all nodes from the highest depth (that is, depth 0) to a certain lower depth are commonly QT partitioned.
- the encoding apparatus uses the QT splitting flag to express these depths instead of expressing these QT split information. Depth information that specifies can be signaled.
- the encoding apparatus may signal depth information indicating the highest depth in which the leaf nodes of the QT exist among the depths of the QTBT partition structure.
- the highest depth in which the leaf nodes of the QT exist may be referred to as the 'highest QT leaf node depth'.
- 'top QT leaf node depth 2' means that the root node of the QTBT belonging to the highest depth (0) of the QTBT is QT splitted, and each of the four child nodes (which belong to depth 1) of the root node are also QTBT.
- QT split, and further, at least some of the nodes belonging to depth 2 are leafs of QT that are not QT split.
- the depth information may function as QT partitioning information from a root node to parent nodes of nodes belonging to the highest QT leaf node depth in the QTBT partitioning structure.
- the depth information may be expressed, for example, in a truncated unary manner.
- Depth information may be expressed in a CTU header.
- An example codeword for each depth is as follows.
- the QT structure repeated from the root node can be represented by a breadth first scan method, and the remaining lower QTs or BTs are similar to those of the first embodiment.
- the QT split flag, the BT split flag, and the BT split type information may be used to express the depth first scan method.
- the example QTBT splitting structure of FIG. 8 is the leaf nodes of QT where the root node is QT split and the first and second and fourth nodes of depth 1 are no longer QT split.
- the highest QT leaf node depth is 1 in the example QTBT splitting structure of FIG. 8.
- the QTBT splitting structure of FIG. 8 if the highest QT leaf node depth is separately indicated using the codeword of Table 1, a total of 35 bits are required as follows.
- the underlined bits relate to the BT division and the remaining bits relate to the QT division.
- a signaling method with depth information may be extended to a slice, a picture, or a sequence level.
- the depth can be expressed in the Slice Header.
- the depth expressed in the slice header may function as QT split information for all nodes existing between the highest depth and the depth expressed in the slice header, for all CTUs included in the slice.
- the remaining lower QT or BT may be expressed using the QT split flag, the BT split flag, and the BT split type information similarly to the first embodiment.
- a slice consisting of four CTUs is exemplarily described.
- the highest QT leaf node depth of CTU No. 1 the highest QT leaf node depth of CTU No. 2 is 2, and the highest CTU No. 3 CTU.
- the QT leaf node depth is 4 and the highest QT leaf node depth of CTU # 4 is 3.
- all four CTUs are divided into the same shape up to depth 2. That is, all the CTUs are recursively QT-divided until all the CTU blocks reach depth 2.
- the encoding apparatus may express information indicating that all CTUs of the corresponding slice are QT-split up to depth 2 in the slice header.
- the depth is added to the picture-level header information (e.g., picture parameter sets). (in a picture parameter set (PPS)).
- the depth is determined in sequence-level header information (e.g., a sequence parameter set parameter set (SPS)).
- QT common depth The depth expressed in the picture-level header information, the sequence-level header information, and the sequence-level header information in this manner is referred to as "QT common depth" below.
- the QT common depth is used as part of the QTBT segmentation information for all CTUs included in that level.
- Table 2 shows an example syntax element for indicating the QT common depth in a Sequence Parameter Set (SPS). As shown in Table 2, in order to signal the QT common depth, a flag indicating that there is a QT common depth (QT_depth_signal_flag) and a parameter (QT_comm_depth_minus3) that specifies the QT common depth may be used. Similar syntax elements may be included in a Picture Parameter Set (PPS) or Slice Header.
- PPS Picture Parameter Set
- Slice Header Similar syntax elements may be included in a Picture Parameter Set (PPS) or Slice Header.
- the QT common depth represented in picture-level header information, sequence-level header information, or sequence-level header information is determined whether all CTUs of that level are actually divided into the same form up to that QT common depth. Regardless, it may be a predetermined default value. In this case, the encoding apparatus may additionally signal a difference value between the QT common depth defaulted to the SPS, PPS, or Slice and the highest QT leaf node depth of each CTU. The default value may be predetermined based on statistics related to the likelihood of matching the highest QT leafnode depth of the CTUs included in that level.
- the decoding apparatus may determine the CTU. A method of determining partitioning to be divided into a plurality of CUs will be described.
- the decoding apparatus first extracts depth information indicating the highest depth (that is, the highest QT leaf node depth) among depths in which at least one leaf node of the quadtree exists in the QTBT structure from the encoded bitstream. Using depth information indicating the highest QT leaf node depth, the decoding apparatus performs QT splitting for all nodes from a root node of the QTBT to a parent node of nodes belonging to the highest QT leaf node depth. For example, when the highest QT leaf node depth is 2, the root node is QT splitted, and the four child nodes (which belong to depth 1) of the root node are each QT splitted.
- the highest QT leaf node depth at least some of the nodes belonging to the highest QT leaf node depth are not QT split. In addition, some of the nodes belonging to the highest QT leaf node depth are neither BT split nor QT split or BT split. In other words, some of the nodes belonging to the highest QT leaf node depth may be further QT split, BT split, or leaf nodes of QTBT (either QT split or BT split). Therefore, nodes belonging to the highest QT leaf node depth are determined in a partition structure in substantially the same manner as in the first embodiment.
- the decoding apparatus parses the bitstream to process a sequence parameter set (SPS), a picture parameter set (PPS), or a slice. Extract depth information expressed in Header. Using the extracted depth information, the decoding apparatus may recursively split all nodes until the depth indicated by the depth information is reached.
- SPS sequence parameter set
- PPS picture parameter set
- Slice Slice
- the leaf nodes of the quadtree in the QTBT partition structure may be further partitioned into the BT structure.
- BT there may be a plurality of split types, for example, a type of dividing a block of a node horizontally into two blocks of the same size (ie, symmetric horizontal splitting) and a type of splitting vertically (ie, symmetrical). There may be two kinds of symmetric vertical splitting.
- a technique for reducing the amount of bits representing such BT is proposed.
- the BT partition information representation scheme proposed in this embodiment may be used together with the QT partition information representation scheme illustrated in the first or second embodiment.
- FIG. 9 is a diagram illustrating the number of bits that can be represented as BT in one CU.
- the parent node is divided horizontally (up and down) and the child node is divided vertically (left and right), or the parent node is divided vertically (left and right) and the child node is horizontal.
- the division into (up and down) is not considered.
- 16 expressions are possible. In FIG. 9, these combinations are represented by 4 bits to 6 bits using the BT partition flag and the BT partition type flag.
- the present invention represents a method of reducing the amount of bits representing the combination of these parent and child nodes by representing the parent node in 2 bits and the child node in a fixed length codeword of 3 bits.
- Adopt Example codewords are listed in Table 3. In the case of “10 10 10” and “11 11 11”, where the parent node and the child node are divided in the same form, the leaf node is different, but when the child node is marked with a specific bit, the first two bits (2 bits of the parent node) are divided. Prediction of the method is possible.
- Child Node (First embodiment, 2 to 4 bits) Child Node (3rd Embodiment, 3 bits) 10/11 0 0 000 0 10 001 to 110 0 11 10 0 10 11 11 0 11 10 10 10 10 111 11 11 11 11
- 1 0 11 11 0 0 0 (No. 1 BT) and 0 10 0 0 (No. 2 BT) are used as the depth priority method.
- 1 0 11 10 11 11 0 0 10 0 0 0 0 (BT 3, 4) 0 0 0 0 0 0 10 10 0 0 10 0 0 (BT 5) 0 0 (52 bits total) Can be.
- the following information may be additionally used to efficiently signal information about block division by the QTBT structure.
- the information is encoded as header information of an image, for example, may be encoded by a sequence parameter set (SPS) or a picture parameter set (PPS).
- SPS sequence parameter set
- PPS picture parameter set
- CTU size the top layer of QTBT, that is, the block size of the root node
- MinQTSize the minimum block size of leaf nodes allowed in QT
- MaxBTSize the maximum block size of the root node allowed by BT
- MaxBTDepth the maximum depth allowed by BT
- MinBTSize the minimum block size of leaf nodes allowed in BT
- a block having the same size as MinBTSize in BT is no longer split and no split information (second flag, split type information) regarding BT is also encoded.
- second flag, split type information the maximum or minimum block size that a loop or leaf node of the QT and BT can have at a high level such as a sequence parameter set (SPS) or a picture parameter set (PPS) can be defined to determine whether the CTU is divided or not.
- SPS sequence parameter set
- PPS picture parameter set
- the amount of coding for the information indicating the partition type can be reduced.
- the decoding apparatus extracts the information and reflects this information when extracting the split information for QT and BT. Can be.
- the decoding apparatus does not extract the split information (QT split flag) about the QT of the block from the bitstream (that is, the QT split flag of the block does not exist in the bitstream), and sets the value to 0 automatically. do.
- the decoding apparatus does not extract the BT partition flag for the leaf node having a block larger than MaxBTSize in QT, and automatically sets the BT partition flag to 0.
- the depth of the node of the BT reaches MaxBTDepth, the block of the node is no longer split.
- the BT partition flag of the corresponding node is not extracted from the bitstream, and its value is automatically set to zero. Also, in BT, blocks having the same size as MinBTSize are no longer split. Therefore, the decoding apparatus does not extract the BT partition flag of the block having the same size as MinBTSize from the bitstream, and automatically sets the value to zero.
- the luma component and the chroma component of the CTU may be divided into the same QTBT structure.
- the luminance component and the chrominance component may each be divided into separate QTBT structures.
- the luminance component and the chrominance component may be divided into different QTBT structures. That is, CTU may be divided into luminance coding blocks (CBs) and color difference CBs to perform block partitioning, and block block information may be generated for each CB and transmitted to a decoding apparatus.
- CBs luminance coding blocks
- CBs luminance difference CBs
- the color difference information may not require a detailed block partition.
- only QT partition information among all block partition information of luminance may be used as block partition information of the color difference CB.
- a separate syntax split_cb_flag_for_QT_in_CTU
- the decoding apparatus converts the QT partition information from the entire block partition information of luminance to the QT partition information of the luminance CB.
- the CB is used as QT block partition information, and this value is 1, different block partition information may be sent to the decoding apparatus as before.
- the QT common depth of the QT when expressed in any one of a Sequence Parameter Set (SPS), a Picture Parameter Set (PPS), and a Slice Header, in some examples, the luminance in the CTU unit
- the QT common depth for the color difference may refer to the QT common depth set higher than the CTU (that is, set in the SPS, PPS, or Slice Header).
- the QT common depth for luminance and color difference in the CTU unit may be expressed as a difference value from the QT common depth set above the CTU.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
La présente invention concerne la signalisation efficace d'informations de division relatives à un partitionnement de blocs basé QTBT (arbre quaternaire plus arbre binaire). Un dispositif de codage d'image selon un aspect de la présente invention détermine la plus grande profondeur de nœud de feuille QT pour indiquer la profondeur la plus élevée parmi des profondeurs dans lesquelles au moins un nœud feuille d'un arbre quaternaire est présent dans une structure de partitionnement de blocs déterminée, et des informations de profondeur de signal pour indiquer la plus grande profondeur de nœud de feuille QT en tant que des informations de division d'arbre quaternaire sur tous les nœuds, depuis un nœud racine jusqu'à un nœud parent, de nœuds appartenant à la plus grande profondeur de nœud feuille QT dans la structure de partitionnement de blocs déterminée.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR10-2017-0031234 | 2017-03-13 | ||
| KR1020170031234A KR102390384B1 (ko) | 2017-03-13 | 2017-03-13 | 영상 부호화 또는 복호화하기 위한 장치 및 방법 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2018169267A1 true WO2018169267A1 (fr) | 2018-09-20 |
Family
ID=63522489
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/KR2018/002891 Ceased WO2018169267A1 (fr) | 2017-03-13 | 2018-03-12 | Dispositif et procédé de codage ou de décodage d'image |
Country Status (2)
| Country | Link |
|---|---|
| KR (1) | KR102390384B1 (fr) |
| WO (1) | WO2018169267A1 (fr) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020060259A1 (fr) * | 2018-09-21 | 2020-03-26 | 엘지전자 주식회사 | Procédé de décodage d'image à l'aide d'un partitionnement de blocs dans un système de codage d'images, et dispositif associé |
| CN111107378A (zh) * | 2018-10-26 | 2020-05-05 | 北京字节跳动网络技术有限公司 | 分割信息的信令通知 |
| CN113396586A (zh) * | 2019-02-11 | 2021-09-14 | 北京字节跳动网络技术有限公司 | 条件依赖的视频块分割 |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MX2021016157A (es) | 2019-06-19 | 2022-03-11 | Lg Electronics Inc | Metodo y dispositivo de codificacion/decodificacion de imagen usando tama?o adaptivo limitante del bloque de croma y metodo de transformacion del flujo de bits. |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20130030240A (ko) * | 2011-09-16 | 2013-03-26 | 한국항공대학교산학협력단 | 영상 부호화/복호화 방법 및 그 장치 |
| WO2016091161A1 (fr) * | 2014-12-10 | 2016-06-16 | Mediatek Singapore Pte. Ltd. | Procédé de codage vidéo utilisant un partitionnement de blocs en arbre binaire |
| KR20160077166A (ko) * | 2013-12-27 | 2016-07-01 | 인텔 코포레이션 | 차세대 비디오 코딩을 위한 콘텐츠 적응적 이득 보상된 예측 |
| KR20170019344A (ko) * | 2014-05-01 | 2017-02-21 | 퀄컴 인코포레이티드 | 하나 이상의 계층들을 포함하는 파티션과 연관된 프로파일, 티어 및 레벨에 기초하여 디코더의 능력을 결정하는 것에 의해 멀티-계층 비디오 데이터를 디코딩하기 위한 방법 및 디바이스 |
| KR20170023893A (ko) * | 2017-02-07 | 2017-03-06 | 인텔렉추얼디스커버리 주식회사 | 쿼드트리 구조 기반의 적응적 양자화 파라미터 부호화 및 복호화 방법 및 장치 |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9462275B2 (en) * | 2012-01-30 | 2016-10-04 | Qualcomm Incorporated | Residual quad tree (RQT) coding for video coding |
-
2017
- 2017-03-13 KR KR1020170031234A patent/KR102390384B1/ko active Active
-
2018
- 2018-03-12 WO PCT/KR2018/002891 patent/WO2018169267A1/fr not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20130030240A (ko) * | 2011-09-16 | 2013-03-26 | 한국항공대학교산학협력단 | 영상 부호화/복호화 방법 및 그 장치 |
| KR20160077166A (ko) * | 2013-12-27 | 2016-07-01 | 인텔 코포레이션 | 차세대 비디오 코딩을 위한 콘텐츠 적응적 이득 보상된 예측 |
| KR20170019344A (ko) * | 2014-05-01 | 2017-02-21 | 퀄컴 인코포레이티드 | 하나 이상의 계층들을 포함하는 파티션과 연관된 프로파일, 티어 및 레벨에 기초하여 디코더의 능력을 결정하는 것에 의해 멀티-계층 비디오 데이터를 디코딩하기 위한 방법 및 디바이스 |
| WO2016091161A1 (fr) * | 2014-12-10 | 2016-06-16 | Mediatek Singapore Pte. Ltd. | Procédé de codage vidéo utilisant un partitionnement de blocs en arbre binaire |
| KR20170023893A (ko) * | 2017-02-07 | 2017-03-06 | 인텔렉추얼디스커버리 주식회사 | 쿼드트리 구조 기반의 적응적 양자화 파라미터 부호화 및 복호화 방법 및 장치 |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020060259A1 (fr) * | 2018-09-21 | 2020-03-26 | 엘지전자 주식회사 | Procédé de décodage d'image à l'aide d'un partitionnement de blocs dans un système de codage d'images, et dispositif associé |
| US11368687B2 (en) | 2018-09-21 | 2022-06-21 | Lg Electronics Inc. | Method for decoding image by using block partitioning in image coding system, and device therefor |
| US11882280B2 (en) | 2018-09-21 | 2024-01-23 | Lg Electronics Inc. | Method for decoding image by using block partitioning in image coding system, and device therefor |
| CN111107378A (zh) * | 2018-10-26 | 2020-05-05 | 北京字节跳动网络技术有限公司 | 分割信息的信令通知 |
| CN111107378B (zh) * | 2018-10-26 | 2024-05-14 | 北京字节跳动网络技术有限公司 | 分割信息的信令通知 |
| CN113396586A (zh) * | 2019-02-11 | 2021-09-14 | 北京字节跳动网络技术有限公司 | 条件依赖的视频块分割 |
| US12244826B2 (en) | 2019-02-11 | 2025-03-04 | Beijing Bytedance Network Technology Co., Ltd. | Video block partition based on quinary-tree |
| US12395647B2 (en) | 2019-02-11 | 2025-08-19 | Beijing Bytedance Network Technology Co. Ltd. | Condition dependent video block partition |
Also Published As
| Publication number | Publication date |
|---|---|
| KR20180104451A (ko) | 2018-09-21 |
| KR102390384B1 (ko) | 2022-04-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2018128466A1 (fr) | Dispositif et procédé de codage ou de décodage d'image | |
| EP3183881B1 (fr) | Procédés comportant des extensions à un mode de copie au-dessus pour le codage en mode palette | |
| WO2019135628A1 (fr) | Procédé et dispositif de codage ou de décodage d'image | |
| WO2012057528A2 (fr) | Procédé de codage et de décodage à prédiction intra adaptative | |
| WO2013069932A1 (fr) | Procédé et appareil de codage d'image, et procédé et appareil de décodage d'image | |
| KR20150139902A (ko) | 팔레트-기반 비디오 코딩에서 팔레트들의 결정 | |
| WO2020185009A1 (fr) | Procédé et appareil de codage efficace de blocs résiduels | |
| WO2011133002A2 (fr) | Dispositif et procédé de codage d'image | |
| WO2012058909A1 (fr) | Appareil et procédé pour codage vidéo à haute efficacité utilisant une structure de tranche flexible | |
| WO2011087271A2 (fr) | Procédé et dispositif de traitement de signaux vidéo | |
| WO2020251260A1 (fr) | Procédé et dispositif de traitement de signal vidéo utilisant un procédé de prédiction dpcm de blocs | |
| JP2022521911A (ja) | イントラ予測ベースのビデオ信号処理方法及び装置 | |
| US20210360244A1 (en) | Signaling for video block partition mode | |
| JP2023181460A (ja) | ビデオ信号処理方法及び装置 | |
| KR20170028905A (ko) | 팔레트 모드 코딩을 위한 방법 | |
| WO2019194500A1 (fr) | Procédé de codage d'images basé sur une prédication intra et dispositif associé | |
| WO2018169267A1 (fr) | Dispositif et procédé de codage ou de décodage d'image | |
| WO2019059721A1 (fr) | Codage et décodage d'image à l'aide d'une technique d'amélioration de résolution | |
| WO2018079873A1 (fr) | Procédé et appareil de codage vidéo utilisant des types quelconques de partitionnement de blocs | |
| WO2018074812A1 (fr) | Dispositif et procédé de codage ou de décodage d'image | |
| WO2018070555A1 (fr) | Procédé et appareil de codage vidéo utilisant une combinaison de diverses structures de partitionnement de blocs | |
| WO2019132567A1 (fr) | Procédé et dispositif de codage vidéo utilisant une prédiction intra d'unité de sous-bloc | |
| WO2018070556A1 (fr) | Procédé et appareil pour extraire des données de mode de prédiction intra d'un bloc carré ou rectangulaire | |
| KR20200136300A (ko) | 인트라 모드 후보 구성 방법 및 영상 복호화 장치 | |
| WO2021137445A1 (fr) | Procédé de détermination de noyaux de transformée de traitement de signal vidéo et appareil associé |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18768374 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 18768374 Country of ref document: EP Kind code of ref document: A1 |