[go: up one dir, main page]

WO2014088316A2 - Procédé de codage et de décodage vidéo, et appareil utilisant celui-ci - Google Patents

Procédé de codage et de décodage vidéo, et appareil utilisant celui-ci Download PDF

Info

Publication number
WO2014088316A2
WO2014088316A2 PCT/KR2013/011165 KR2013011165W WO2014088316A2 WO 2014088316 A2 WO2014088316 A2 WO 2014088316A2 KR 2013011165 W KR2013011165 W KR 2013011165W WO 2014088316 A2 WO2014088316 A2 WO 2014088316A2
Authority
WO
WIPO (PCT)
Prior art keywords
picture
layer
enhancement layer
interpolation
block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/KR2013/011165
Other languages
English (en)
Korean (ko)
Other versions
WO2014088316A3 (fr
Inventor
심동규
조현호
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intellectual Discovery Co Ltd
Original Assignee
Intellectual Discovery Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intellectual Discovery Co Ltd filed Critical Intellectual Discovery Co Ltd
Publication of WO2014088316A2 publication Critical patent/WO2014088316A2/fr
Publication of WO2014088316A3 publication Critical patent/WO2014088316A3/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/463Embedding additional information in the video signal during the compression process by compressing encoding parameters before transmission

Definitions

  • the present invention relates to an image processing technique, and more particularly, to a method and apparatus for encoding / decoding an enhancement layer by interpolating a picture of a reference layer in a scalable video codec and using the same as a reference picture of the enhancement layer.
  • JCT-VC Joint Collaborative Team on Video Coding
  • HEVC High Efficiency Video Coding
  • This HEVC has a compression efficiency of about 50% or more when compared in terms of subjective picture quality compared to the H.264 / AVC High profile, which is a video standard known to have the highest compression efficiency.
  • HEVC can effectively support 4K-UHD and 8K-UHD resolution video, and since the basic block of encoding is variable, it can support more various resolution video than conventional video compression standard technology.
  • the HEVC standardization for the base layer was established in January 2013 under the name HEVC version 1, and by 2014, the HEVC-based scalable video compression standard technology and the HEVC-based multiview video compression standard technology are planned to be developed. .
  • HEVC is similar in structure to conventional video codecs such as H.264 / AVC, but since new coding techniques are additionally used, the interlayer prediction technique considering the new additions to HEVC in the HEVC-based scalable video compression technique need.
  • the present invention provides a method and apparatus for improving encoding performance of an enhancement layer by interpolating a reconstructed picture of a reference layer according to the resolution of an enhancement layer in an HEVC-based scalable video compression codec, and adding it to a reference picture list of the enhancement layer. It aims to provide.
  • An inter-layer reconstructed picture reference method for solving the above problems includes selecting a picture to be interpolated in a reference layer; Performing interpolation through a plurality of interpolators on the selected picture; Adding the interpolated picture to the reference picture list of the enhancement layer.
  • An inter-layer reconstructed picture reference apparatus includes an apparatus for selecting a picture to be interpolated in a reference layer; An apparatus for performing interpolation through a plurality of interpolation apparatuses on a selected picture; And an apparatus for adding the interpolated picture to the reference picture list of the enhancement layer.
  • the inter-layer reconstructed picture reference method according to the second embodiment of the present invention for solving the above problems comprises the steps of selecting a plurality of interpolation points in the picture of the reference layer; Performing interpolation through an interpolation unit on the input picture of the selected point; Adding the interpolated picture to the reference picture list of the enhancement layer.
  • An inter-layer reconstructed picture reference apparatus includes an apparatus for selecting a plurality of interpolation points in a picture of a reference layer; An apparatus for performing interpolation through an interpolation unit on an input picture of a selected point; And an apparatus for adding the interpolated picture to the reference picture list of the enhancement layer.
  • the inter-layer reconstructed picture reference method according to the third embodiment of the present invention for solving the above problems comprises the steps of selecting a plurality of interpolation points in a picture of a reference layer; Performing interpolation through different interpolators according to the input picture of the selected point; Adding the interpolated picture to the reference picture list of the enhancement layer.
  • An inter-layer reconstructed picture reference apparatus includes an apparatus for selecting a plurality of interpolation points in a picture of a reference layer; An apparatus for performing interpolation through different interpolators in accordance with an input picture of a selected point; And an apparatus for adding the interpolated picture to the reference picture list of the enhancement layer.
  • the inter-layer reconstruction picture reference encoding / decoding apparatus interpolates a plurality of reference pictures having different characteristics by performing interpolation through a plurality of interpolators on an image to which sample adaptive offset is applied in a reference layer.
  • Create A plurality of reference pictures generated through the reference layer may be inserted into a reference picture list of the enhancement layer and used for inter prediction to improve encoding performance of the enhancement layer.
  • the inter-layer reconstructed picture reference encoding / decoding apparatus generates interpolation images having different characteristics by interpolating a reference picture in a reference layer at a plurality of interpolation points, and in the enhancement layer
  • the reference picture lister By using the reference picture lister, encoding performance of an enhancement layer can be improved through various combinations.
  • the inter-layer reconstructed picture reference encoding / decoding apparatus selects a plurality of interpolation points with respect to a reference picture in a reference layer and has a plurality of characteristics having different characteristics in consideration of characteristics of each interpolation point.
  • the interpolation is performed to generate a plurality of reference pictures having different characteristics.
  • the generated plurality of reference pictures may be inserted into the reference picture list of the enhancement layer and used for inter prediction to improve encoding performance of the enhancement layer.
  • FIG. 1 is a block diagram illustrating a configuration of a scalable video encoder.
  • FIG. 2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
  • FIG. 3 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
  • FIG. 4 is a block diagram illustrating a configuration of a video decoding apparatus according to an embodiment of the present invention.
  • FIG. 5 is a conceptual diagram for describing adding pictures interpolated in a reference layer to a reference picture list of an enhancement layer in an image encoding / decoding apparatus according to the present invention.
  • FIG. 6 is a conceptual diagram for describing adding pictures interpolated in a reference layer to a reference picture list of an enhancement layer when decoding an arbitrary access picture in the image encoding / decoding apparatus according to the present invention.
  • FIG. 7 is a conceptual diagram illustrating an example of performing inter-layer intra prediction in an enhancement layer, deriving a difference coefficient from an enhancement layer, and encoding the same by using pictures interpolated in a reference layer in an image encoding apparatus to which the present invention is applied.
  • FIG. 8 is a conceptual diagram illustrating an example of performing inter-layer intra prediction in an enhancement layer using pictures interpolated in a reference layer and decoding a block using reconstructed difference coefficients of an enhancement layer in an image decoding apparatus to which the present invention is applied. to be.
  • FIG. 9 is a flowchart schematically illustrating a method of constructing a reference picture list of an enhancement layer and performing inter-layer prediction in the enhancement layer according to an embodiment of the present invention.
  • first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
  • the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component.
  • each component shown in the embodiments of the present invention are shown independently to represent different characteristic functions, and do not mean that each component is made of separate hardware or one software component unit.
  • each component is included in each component for convenience of description, and at least two of the components may be combined into one component, or one component may be divided into a plurality of components to perform a function.
  • Integrated and separate embodiments of the components are also included within the scope of the present invention without departing from the spirit of the invention.
  • the components may not be essential components for performing essential functions in the present invention, but may be optional components for improving performance.
  • the present invention can be implemented including only the components essential for implementing the essentials of the present invention except for the components used for improving performance, and the structure including only the essential components except for the optional components used for improving performance. Also included in the scope of the present invention.
  • FIG. 1 is a block diagram illustrating a configuration of a scalable video encoder.
  • a scalable video encoder provides spatial scalability, temporal scalability, and SNR scalability.
  • spatial scalability multi-layers using upsampling are used, and temporal scalability uses Hierarchical B picture structure.
  • quality scalability only the quantization coefficient is changed or a gradual encoding method for quantization error is used in the same manner as the technique for spatial scalability.
  • Input video 110 is down sampled through spatial decimation 115.
  • the down-sampled image 120 is used as an input of the reference layer, and the intra-prediction technique through the intra predictor 135 or the inter-screen through the motion compensator 130 for effectively coding the coding blocks in the picture of the reference layer.
  • the difference coefficient which is a difference value between the original block to be encoded and the prediction block generated by the motion compensation unit 130 or the intra prediction unit 135, is discrete cosine transformed or integer transformed through the transform unit 140.
  • the transform difference coefficient is quantized while passing through the quantization unit 145, and the transform difference coefficient is entropy coded by the entropy encoder 150.
  • the quantized transform difference coefficients are reconstructed back into differential coefficients through the inverse quantizer 152 and the inverse transform unit 154 to generate predicted values for use in adjacent blocks or adjacent pictures.
  • the difference coefficient value restored due to an error occurring in the quantization unit 145 may not be the same as the difference coefficient value used as an input of the converter 140.
  • the reconstructed difference coefficient value is added to a prediction block previously generated by the motion compensator 130 or the intra predictor 135 to reconstruct the pixel value of the block currently encoded.
  • the reconstructed block passes through the in-loop filter 156. When all blocks in the picture are reconstructed, the reconstructed picture is input to the reconstructed picture buffer 158 and used for inter prediction in the reference layer.
  • the input video 110 is used as an input value and encoded.
  • the interlayer prediction is performed by the motion compensator 172 or the intra predictor 170 in order to effectively encode the coding block in the picture as in the reference layer.
  • an intra prediction is performed and an optimal prediction block is generated.
  • the block to be encoded in the enhancement layer is predicted in the prediction block generated by the motion compensator 172 or the intra predictor 170, and as a result, a difference coefficient is generated in the enhancement layer.
  • the difference coefficients of the enhancement layer are encoded through the transform unit, the quantization unit, and the entropy encoding unit similarly to the reference layer.
  • encoded bits are generated in each layer.
  • the multiplexer 192 serves to configure one single bitstream 194.
  • each of the multiple layers may be independently encoded in FIG. 1, since the input video of the lower layer is down-sampled from the video of the upper layer, it has very similar characteristics. Therefore, when the reconstructed pixel values, motion vectors, and residual signals of the lower layer video are used in the enhancement layer, encoding efficiency may be increased.
  • the inter-layer intra prediction 162 reconstructs an image of a reference layer and interpolates the reconstructed image 180 according to an image size of an enhancement layer and uses the image as a reference image.
  • a method of decoding the reference image in units of frames and a method of decoding in units of blocks may be used in consideration of complexity reduction.
  • inter-layer prediction is allowed only when the reference layer is encoded in the intra prediction mode.
  • the image 180 reconstructed in the reference layer is input to the intra prediction unit 170 of the enhancement layer, thereby improving coding efficiency than using neighboring pixel values in the picture in the enhancement layer.
  • inter-layer motion prediction 160 refers to motion information 185 such as a motion vector or a reference frame index in the reference layer in the enhancement layer.
  • motion information 185 such as a motion vector or a reference frame index in the reference layer in the enhancement layer.
  • the inter-layer difference coefficient prediction 164 predicts the difference coefficient of the enhancement layer as the value of the difference coefficient 190 decoded in the reference layer.
  • the difference coefficient value of the enhancement layer can be encoded more effectively.
  • the difference coefficient 190 decoded in the reference layer is input to the motion compensation unit 172 of the enhancement layer to predict the motion of the enhancement layer. From the process, an optimal motion vector may be derived by considering the decoded difference coefficient value 190 of the reference layer.
  • FIG. 2 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
  • the apparatus for decoding an image includes a portion for decoding a reference layer and a portion for decoding an enhancement layer.
  • the portion 205 for decoding the reference layer includes an entropy decoder 210, an inverse quantizer 211, an inverse transformer 212, a motion compensator 213, an intra predictor 214, and a deblocking filter 215. ), A sample adaptive offset unit 216, a reconstructed picture buffer 217, and the like.
  • a bitstream of a reference layer and an enhancement layer is composed of a single bitstream.
  • the demultiplexer 200 extracts only the bitstream of the reference layer from the received bitstream and inputs it to the video decoder of the reference layer.
  • the input bitstream is decoded by the syntax elements encoded by CABAC or VLC through the entropy decoding unit 210.
  • the entropy decoded coefficients are inversely quantized while passing through the inverse quantization unit 211, and the inverse quantized coefficients are inversely transformed through the inverse transformer 212 to restore a differential coefficient value.
  • the prediction block is generated from the reference picture stored in the reconstructed picture buffer 217 through the motion compensation unit 213.
  • the intra prediction unit 214 When the block to be decoded is encoded in the intra prediction mode, the intra prediction unit 214 generates a prediction block according to the intra prediction mode using pixel values decoded around the block to be currently decoded. The prediction block generated through inter-screen or intra-screen prediction is added to the difference coefficient value of the pixel domain to decode the block.
  • Deblocking filtering is performed on the decoded block, slice, or pictures through the deblocking filter 215.
  • the deblocking filter of HEVC is basically performed on a prediction block (PB) and a transform block (TB) boundary.
  • the decoded block to which deblocking filtering is applied is selected as one of a band offset or an edge offset through the sample adaptive offset unit 216, and a process of compensating an offset value in units of pixels according to each class is performed.
  • the reconstructed picture that has passed through the deblocking filter unit 215 and the sample adaptive offset unit 216 is stored in the reconstructed picture buffer 217 for the inter prediction mode of the next picture.
  • the picture 220 that has passed through the sample adaptive offset unit 216 is copied to the video output buffer and then output as video.
  • the decoding apparatus interpolates the reconstructed picture 220 of the reference layer and then uses it as a reference picture of the enhancement layer in order to effectively encode the picture of the enhancement layer.
  • An image decoding apparatus for an enhancement layer is similar to an image decoding apparatus of a reference layer, such as an entropy decoding unit 250, an inverse quantization unit 251, an inverse transform unit 252, a motion compensator 253, and an intra predictor 254. , A deblocking filter unit 255, a sample adaptive offset unit 256, and a reconstructed picture buffer 257.
  • the image decoding apparatus further includes interpolation unit A 240 and interpolation unit B 230 having different characteristics in order to interpolate the reconstructed picture of the reference layer.
  • the enhancement layer refers to the reference layer
  • decoding on the reference layer must be preceded.
  • the syntax elements of the enhancement layer encoded by CABAC or VLC are decoded through the entropy decoder 250 through the bitstream of the enhancement layer extracted through the demultiplexer 200.
  • the entropy decoded coefficients are inversely quantized while passing through the inverse quantization unit 251, and the inverse quantized coefficients are inversely changed through the inverse transformation unit 252 to restore a differential coefficient value.
  • the prediction block is generated from the picture stored in the reconstructed picture buffer 257 through the motion compensation unit 253.
  • the intra prediction unit 254 When the block of the enhancement layer to be decoded is encoded in the intra prediction mode, the intra prediction unit 254 generates a prediction block according to the intra prediction mode using pixel values decoded around the block to be currently decoded. The prediction block generated through inter-screen or intra-screen prediction is added to the difference coefficient value of the pixel domain to decode the block.
  • Deblocking filtering is performed on the decoded block, slice, or pictures through the deblocking filter 255.
  • the deblocking filter of HEVC is basically performed on a prediction block (PB) and a transform block (TB) boundary.
  • the decoded block to which deblocking filtering is applied is selected as one of a band offset or an edge offset through the sample adaptive offset unit 256, and a process of compensating an offset value in units of pixels according to each class is performed.
  • the image decoding apparatus decodes a picture of a reference layer and interpolates it according to the resolution of the enhancement layer, and uses the interpolated picture as a prediction value in inter-screen or intra-picture prediction of the enhancement layer.
  • a plurality of interpolation filters are used by the image decoding apparatus to interpolate the decoded picture 220 in the reference layer.
  • the interpolator A 240 performs interpolation on the picture to which the in-loop filter such as the deblocking filter 215 and the sample adaptive offset 216 is applied after being decoded in the reference layer to be equal to the resolution of the enhancement layer.
  • the filter coefficient of the interpolator A 240 may be a DCT-IF based interpolation filter, an adaptive filter coefficient based interpolation filter, or a fixed coefficient based interpolation filter.
  • the interpolator B 230 also receives the same picture as the input of the interpolator A 240 and performs interpolation in consideration of the resolution of the enhancement layer.
  • the interpolator B 230 is a DCT-IF based interpolation filter using fewer filter tap coefficients than the interpolator A 240, an adaptive filter coefficient based interpolation filter, and a fixed coefficient based interpolation filter. Can be used.
  • the image decoding apparatus uses the interpolation unit A 240 having high efficiency and the interpolation unit B 230 having low complexity to perform the reference layer with respect to the same picture of the reference layer. Characterized by generating different prediction images for. In particular, in this figure, only two interpolation units represented by high efficiency and low complexity are described, but three or more interpolation units having different characteristics can be applied, and in this case, information about which interpolation unit to select in the enhancement layer is explicitly specified. Signaling or may be derived from context information of an enhancement layer or a reference layer.
  • the pictures 260 and 270 interpolated by the interpolator A 240 and the interpolator B 230 are input to the reconstructed picture buffer of the enhancement layer 257 to compensate for motion in the inter prediction mode of the enhancement layer. Is used).
  • the picture 260 interpolated by the interpolator A 240 may further be used as an inter-layer intra prediction value in the intra predictor of the enhancement layer.
  • FIG. 3 is a block diagram illustrating a configuration of an image decoding apparatus according to an embodiment of the present invention.
  • the apparatus for decoding an image includes a portion for decoding a reference layer and a portion for decoding an enhancement layer.
  • the portion 305 for decoding the reference layer may include an entropy decoder 310, an inverse quantizer 311, an inverse transformer 312, a motion compensator 313, an intra predictor 314, and a deblocking filter 315. ), A sample adaptive offset unit 316, a reconstructed picture buffer 317, and the like.
  • a bitstream of a reference layer and an enhancement layer is composed of a single bitstream.
  • the demultiplexer 300 extracts only the bitstream of the reference layer from the received bitstream and inputs it to the video decoder of the reference layer.
  • the input bitstream is decoded by the syntax elements encoded by CABAC or VLC through the entropy decoding unit 310.
  • the entropy decoded coefficients are inversely quantized while passing through the inverse quantization unit 311, and the inverse quantized coefficients are inversely transformed through the inverse transformer 312 to restore a differential coefficient value.
  • the prediction block is generated from the reconstructed picture stored in the reconstructed picture buffer 317 through the motion compensation unit 313.
  • the intra predictor 314 When the block to be decoded is encoded in the intra prediction mode, the intra predictor 314 generates a prediction block according to the intra prediction mode using pixel values decoded around the block to be currently decoded. The prediction block generated through inter-screen or intra-screen prediction is added to the difference coefficient value of the pixel domain to decode the block.
  • Deblocking filtering is performed on the decoded block, slice, or pictures through the deblocking filter 315.
  • the deblocking filter of HEVC is basically performed on a prediction block (PB) and a transform block (TB) boundary.
  • the decoded block to which deblocking filtering is applied is selected as one of a band offset or an edge offset through the sample adaptive offset unit 316, and a process of compensating an offset value in units of pixels according to each class is performed.
  • the reconstructed picture that has passed through the deblocking filter unit 315 and the sample adaptive offset unit 316 is stored in the reconstructed picture buffer 317 for the inter prediction mode of the next picture.
  • the picture 320 that has passed through the sample adaptive offset unit 316 is copied to the video output buffer and then output as video.
  • the decoding apparatus interpolates a reconstructed picture of the reference layer and then uses it as a reference picture of the enhancement layer.
  • An image decoding apparatus for an enhancement layer is similar to an image decoding apparatus of a reference layer, such as an entropy decoding unit 350, an inverse quantizer 351, an inverse transform unit 352, a motion compensator 353, and an intra predictor 354. , A deblocking filter unit 355, a sample adaptive offset unit 356, and a reconstructed picture buffer 357.
  • the apparatus for decoding an image additionally includes an interpolation unit 330 for interpolating a reconstructed picture of a reference layer.
  • the enhancement layer refers to the reference layer
  • decoding on the reference layer must be preceded.
  • the syntax elements of the enhancement layer encoded by CABAC or VLC are decoded through the entropy decoder 350 through the bitstream of the enhancement layer extracted through the demultiplexer 300.
  • the entropy decoded coefficients are inversely quantized while passing through the inverse quantization unit 351, and the inverse quantized coefficients are inversely changed by the inverse transformation unit 352 to restore the difference coefficient values.
  • a motion block 353 When the block of the enhancement layer to be decoded is in the inter prediction mode, a motion block 353 generates a prediction block from a picture stored in the reconstructed picture buffer 357.
  • the intra prediction unit 354 When the block of the enhancement layer to be decoded is encoded in the intra prediction mode, the intra prediction unit 354 generates a prediction block according to the intra prediction mode using pixel values decoded around the block to be currently decoded. The prediction block generated through inter-screen or intra-screen prediction is added to the difference coefficient value of the pixel domain to decode the block.
  • Deblocking filtering is performed on the decoded block, slice, or pictures through the deblocking filter 355.
  • the deblocking filter of HEVC is basically performed on a prediction block (PB) and a transform block (TB) boundary.
  • the decoded block to which deblocking filtering is applied is selected as one of a band offset or an edge offset through the sample adaptive offset unit 356, and a process of compensating an offset value
  • the image decoding apparatus decodes a picture of a reference layer and interpolates it according to the resolution of the enhancement layer, and then uses the interpolated picture as a prediction value in inter-screen or intra-picture prediction of the enhancement layer.
  • the image decoding apparatus uses a plurality of interpolation points in interpolating a picture decoded in a reference layer.
  • the image decoding apparatus may include points before applying the deblocking filter 340, points after applying the deblocking filter 342, and points after applying the sample adaptive offset 345. It is used as an input point of the executive 330.
  • the point 340 before the deblocking filter is applied has a feature that edges are relatively well preserved since the deblocking filter is not applied.
  • the point 342 may have a small deblocking phenomenon, but edges may not be well preserved due to low-pass filtering.
  • the point 345 may be similar in terms of the PSNR and the original image of the reconstructed point because the band offset and the edge offset may be applied in units of coding tree units (CTUs).
  • the interpolator 330 selects two points from the picture 340 before applying the deblocking filter, the picture 342 after applying the picture, and the picture 345 applied up to the sample adaptive offset in the reference layer and interpolates the image according to the resolution of the enhancement layer. Do this.
  • Information about selecting two interpolation points in the interpolator 330 may be explicitly informed by the enhancement layer.
  • Two pictures interpolated at two points selected by the interpolator 330 are added to the reconstructed picture buffer 357 of the enhancement layer.
  • the interpolation pictures added to the reconstruction picture buffer 357 of the enhancement layer are used for constructing a reference picture list such as L0, L1, or BI direction prediction in the inter prediction process in the enhancement layer.
  • One of the two interpolation points selected by the interpolator 330 may be additionally used in the intra prediction unit of the enhancement layer, and the selection information about the interpolation points may be explicitly informed by the enhancement layer.
  • FIG. 4 is a block diagram illustrating a configuration of a video decoding apparatus according to an embodiment of the present invention.
  • the apparatus for decoding an image includes a portion for decoding a reference layer and a portion for decoding an enhancement layer.
  • a bitstream of a reference layer and an enhancement layer is composed of a single bitstream.
  • the demultiplexer 400 extracts only the bitstream of the reference layer from the received bitstream and inputs it to the video decoder of the reference layer.
  • the input bitstream is decoded by the syntax elements encoded by CABAC or VLC through the entropy decoding unit 410.
  • the entropy decoded coefficients are inversely quantized while passing through the inverse quantization unit 411, and the inverse quantized coefficients are inversely transformed through the inverse transformer 412 to be restored to differential coefficient values.
  • a prediction block is generated from a picture stored in the reconstructed picture buffer 417 through the motion compensation unit 413.
  • the intra predictor 414 When the block to be decoded is encoded in the intra prediction mode, the intra predictor 414 generates a prediction block according to the intra prediction mode using pixel values decoded around the block to be currently decoded. The prediction block generated through inter-screen or intra-screen prediction is added to the difference coefficient value of the pixel domain to decode the block.
  • Deblocking filtering is performed on the decoded block, slice, or pictures through the deblocking filter 415.
  • the deblocking filter of HEVC is basically performed on a prediction block (PB) and a transform block (TB) boundary.
  • the decoded block to which deblocking filtering is applied is selected as one of a band offset or an edge offset through the sample adaptive offset unit 416, and a process of compensating an offset value in units of pixels according to each class is performed.
  • the reconstructed picture that has passed through the deblocking filter unit 415 and the sample adaptive offset unit 416 is stored in the reconstructed picture buffer 417 for the inter prediction mode of the next picture.
  • the picture 420 passed through the sample adaptive offset unit 416 is copied to the video output buffer and then output as video.
  • the decoding apparatus interpolates a reconstructed picture of the reference layer and then uses it as a reference picture of the enhancement layer.
  • An image decoding apparatus for an enhancement layer is similar to an image decoding apparatus of a reference layer, such as an entropy decoding unit 450, an inverse quantization unit 451, an inverse transform unit 452, a motion compensator 453, and an intra predictor 454. , A deblocking filter 455, a sample adaptive offset 456, and a reconstructed picture buffer 457.
  • the image decoding apparatus further includes interpolation unit A 440 and interpolation unit B 430 having different characteristics in order to interpolate the reconstructed picture of the reference layer.
  • the enhancement layer when decoding the enhancement layer in the scalable video codec, since the enhancement layer refers to the reference layer, decoding on the reference layer must be preceded.
  • the syntax elements of the enhancement layer encoded by CABAC or VLC are decoded through the entropy decoding unit 450 in the bitstream of the enhancement layer extracted through the demultiplexer 400.
  • the entropy decoded coefficients are inversely quantized while passing through the inverse quantization unit 451, and the inverse quantized coefficients are inversely changed by the inverse transformation unit 452, thereby restoring the differential coefficient values.
  • the motion compensation unit 453 When the block of the enhancement layer to be decoded is the inter prediction mode, the motion compensation unit 453 generates a prediction block from a picture stored in the reconstructed picture buffer 457.
  • the intra prediction unit 454 When the block of the enhancement layer to be decoded is encoded in the intra prediction mode, the intra prediction unit 454 generates a prediction block according to the intra prediction mode using pixel values decoded around the block to be currently decoded. The prediction block generated through inter-screen or intra-screen prediction is added to the difference coefficient value of the pixel domain to decode the block.
  • Deblocking filtering is performed on the decoded block, slice, or pictures through the deblocking filter 455.
  • the deblocking filter of HEVC is basically performed on a prediction block (PB) and a transform block (TB) boundary.
  • the decoded block to which deblocking filtering is applied is selected as one of a band offset or an edge offset through the sample adaptive offset unit 456, and a process of compensating an offset value in
  • the image decoding apparatus decodes a picture of a reference layer and interpolates it according to the resolution of the enhancement layer, and then uses the interpolated picture as a prediction value in inter-screen or intra-picture prediction of the enhancement layer.
  • the image decoding apparatus uses a plurality of interpolation units considering interpolation points and characteristics of each interpolation point in interpolating a picture decoded in a reference layer.
  • the image decoding apparatus uses points 425 after the deblocking filter and points 420 after the sample adaptive offset are used as interpolation points. Since the two interpolation points have different characteristics depending on whether the sample adaptive offset unit is applied or not, interpolation unit A 440 and interpolation unit B 430 having different interpolation characteristics are used.
  • the pictures interpolated through the interpolator A 440 and the interpolator B 430 are used for constructing a reference picture list such as L0, L1, or BI direction prediction in the inter prediction process in the enhancement layer.
  • the picture 460 interpolated through the interpolator A 440 may be further used in the intra prediction unit of the enhancement layer.
  • FIG. 5 is a conceptual diagram for explaining adding pictures interpolated in a reference layer to a reference picture list of an enhancement layer in an image encoding / decoding apparatus according to the present invention.
  • an interpolation picture A having different characteristics by interpolating a decoded picture 510 at the same time in a reference layer when encoding / decoding a picture of an enhancement layer is performed.
  • 530 and interpolation picture B 520 are generated.
  • the generation of interpolation pictures having different characteristics may vary depending on the embodiment as in the first, second, and third embodiments of the present invention.
  • the generated interpolation picture A 530 and the interpolation picture B 520 are added to the L0 list 540 and the L1 list 550 in the enhancement layer to enable prediction through various picture combinations in the prediction mode of the enhancement layer.
  • FIG. 6 is a conceptual diagram for describing adding pictures interpolated in a reference layer to a reference picture list of an enhancement layer when encoding / decoding a random access picture is performed in an image encoding / decoding apparatus according to the present invention.
  • an interpolation picture A having different characteristics by interpolating decoded pictures 610 at the same time in a reference layer when encoding / decoding a picture of an enhancement layer is performed.
  • 630 and interpolation picture B 620 are generated.
  • the generated interpolation picture A 630 and interpolation picture B 620 may be randomly accessible pictures such as a clean random access (CRA) or instantaneous decoding refresh (IDR) picture of HEVC.
  • CRA clean random access
  • IDR instantaneous decoding refresh
  • FIG. 7 is a conceptual view illustrating encoding a block by applying an inter-layer intra prediction technique to a coding block of an enhancement layer in an image encoding apparatus to which the present invention is applied.
  • the image encoding apparatus to which the present invention is applied may apply an inter-layer intra prediction technique using a picture interpolated in a reference layer when encoding a coding block of an enhancement layer.
  • interpolation frame A 710 and interpolation frame B 720 having different characteristics may be generated using a plurality of interpolation points or a plurality of interpolation filters in the picture 700 of the reference layer.
  • two prediction blocks 730 and 732 are generated at positions corresponding to coding blocks encoded in an enhancement layer in a plurality of interpolation pictures.
  • the two prediction blocks are finally generated as the prediction block 735 through operations such as an average value and a weight-based average value.
  • the coding block 740 of the enhancement layer and the generated prediction block 735 are input to the difference 750 module and then calculate the difference coefficients.
  • the calculated difference coefficients are encoded 760 through a transform, quantization, and entropy coding process in the enhancement layer.
  • FIG. 8 is a conceptual diagram illustrating decoding of a block by applying an inter-layer intra prediction technique to a coding block of an enhancement layer in an image decoding apparatus according to the present invention.
  • an image decoding apparatus to which the present invention is applied may apply an inter-layer intra prediction technique using a picture interpolated in a reference layer when decoding a coding block of an enhancement layer.
  • the interpolation frame A 810 and the interpolation frame B 820 having different characteristics may be generated using a plurality of interpolation points or a plurality of interpolation filters of the picture 800 of the reference layer.
  • two prediction blocks 830 and 832 are generated at positions corresponding to coding blocks encoded in an enhancement layer in a plurality of interpolation pictures.
  • the two prediction blocks are finally generated as the prediction block 835 through calculation of an average value and a weight-based average value.
  • the difference coefficient reconstructed through the entry-decode decoding, inverse quantization, and the soft transform unit in the enhancement layer and the prediction block generated in the reference layer are input to the reconstruction unit 850.
  • the reconstruction unit 850 finally decodes the block 860 by adding the reconstruction difference coefficient of the enhancement layer and the prediction block generated in the reference layer in units of pixels.
  • FIG. 9 is a flowchart schematically illustrating a method of constructing a reference picture list of an enhancement layer and performing inter-layer prediction in the enhancement layer according to an embodiment of the present invention.
  • the method of FIG. 9 may be performed by the encoding apparatus and the decoding apparatus of FIGS. 1 to 4 described above. In FIG. 9, it is described as being performed by the decoding apparatus for convenience of description, but may also be performed by the encoding apparatus.
  • the decoding apparatus may generate a reference picture used for interlayer prediction (S900).
  • the reference picture may be a picture decoded in the reference layer at the same time as the current picture of the enhancement layer.
  • the picture decoded in the reference layer may be used as a reference picture after resampling according to the resolution of the enhancement layer through interpolation or the like.
  • the decoding apparatus may generate the reference picture A and the reference picture B by interpolating the decoded picture of the reference layer according to the resolution of the enhancement layer.
  • the reference picture A and the reference picture B may be interpolation pictures generated according to interpolation characteristics of the interpolation unit as in the first, second, and third embodiments of the present invention.
  • the decoding apparatus may construct a reference picture list of the enhancement layer by using the reference picture (S910).
  • reference pictures in the reference picture list may be specified by a reference picture index (refIdx) value.
  • the first reference picture in the reference picture list may have a reference picture index (refIdx) having a value of 0, and the second reference picture in the reference picture list may have a reference picture index (refIdx) having a value of 1.
  • the reference picture lists L0 and L1 may be configured as shown in FIGS. 5 and 6 described above.
  • the reference picture list L0 is a reference picture having a POC smaller than the picture order count (POC) of the current picture at the reference picture index (refIdx) 0 and a reference picture at the reference picture index (refIdx) 1.
  • A a reference picture having a larger PCO than the POC of the current picture may be added to the reference picture index refIdx 2.
  • the reference picture list L1 is a reference picture having a larger PCO than the POC of the current picture at reference picture index (refIdx) 0, a reference picture B at reference picture index (refIdx) 1, and a POC at the reference picture index (refIdx) 2 than the POC of the current picture.
  • a small reference picture may be added.
  • the reference picture list L0 may add a long-term reference picture having a large POC difference from the current picture to the reference picture index (refIdx) 3, and add the reference picture B to the reference picture index (refIdx) 4.
  • it can be configured.
  • Reference picture list L1 may add a long-term reference picture having a large POC difference from the current picture to reference picture index (refIdx) 3, and add reference picture A to reference picture index (refIdx) 4 Can be configured.
  • the decoding apparatus may perform inter-layer prediction on the current picture of the enhancement layer using the reference picture lists L0 and L1 (S920).
  • the decoding apparatus may receive information (reference picture index information) about a reference picture used for inter-layer prediction from the encoding apparatus.
  • a reference picture of the current picture can be obtained from the reference picture lists L0 and L1 based on the information about the reference picture (reference picture index information), and a prediction value of the prediction block in the current picture is generated using the reference picture. can do.
  • the method according to the present invention described above may be stored in a computer-readable recording medium that is produced as a program for execution on a computer, and examples of the computer-readable recording medium include ROM, RAM, CD-ROM, magnetic tape , Floppy disks, optical data storage devices, and the like, and also include those implemented in the form of carrier waves (eg, transmission over the Internet).
  • the computer readable recording medium can be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
  • functional programs, codes, and code segments for implementing the method can be easily inferred by programmers in the art to which the present invention belongs.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

La présente invention concerne un procédé et un appareil pour une référence d'images reconstruites dans des codecs vidéo échelonnables. Le procédé de décodage comprend les étapes suivantes : sélection de point de référence sur une couche de référence ; sélection de filtre d'interpolation ; mise en œuvre d'interpolation ; et reconstitution de liste de référence de couche d'amélioration par l'intermédiaire de l'image interpolée.
PCT/KR2013/011165 2012-12-04 2013-12-04 Procédé de codage et de décodage vidéo, et appareil utilisant celui-ci Ceased WO2014088316A2 (fr)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
KR10-2012-0139394 2012-12-04
KR20120139394 2012-12-04
KR20130045277 2013-04-24
KR10-2013-0045277 2013-04-24
KR20130045290 2013-04-24
KR10-2013-0045284 2013-04-24
KR20130045284 2013-04-24
KR10-2013-0045290 2013-04-24

Publications (2)

Publication Number Publication Date
WO2014088316A2 true WO2014088316A2 (fr) 2014-06-12
WO2014088316A3 WO2014088316A3 (fr) 2014-10-23

Family

ID=50884107

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2013/011165 Ceased WO2014088316A2 (fr) 2012-12-04 2013-12-04 Procédé de codage et de décodage vidéo, et appareil utilisant celui-ci

Country Status (1)

Country Link
WO (1) WO2014088316A2 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10567761B2 (en) 2016-03-14 2020-02-18 Interdigital Vc Holdings, Inc. Method and device for encoding at least one image unit, and method and device for decoding a stream representative of at least one image unit
CN115134591A (zh) * 2015-06-05 2022-09-30 杜比实验室特许公司 图像编解码方法、比特流存储方法
US12537941B2 (en) 2015-06-05 2026-01-27 Dolby Laboratories Licensing Corporation Method and device for encoding and decoding intra-frame prediction

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9102220D0 (en) * 1991-02-01 1991-03-20 British Telecomm Method and apparatus for decoding video signals
JP4295236B2 (ja) * 2005-03-29 2009-07-15 日本電信電話株式会社 階層間予測符号化方法,装置,階層間予測復号方法,装置,階層間予測符号化プログラム,階層間予測復号プログラムおよびそれらのプログラム記録媒体
JP5170786B2 (ja) * 2006-10-25 2013-03-27 韓國電子通信研究院 マルチビュービデオのスケーラブルコーディングおよびデコーディング方法、ならびにコーディングおよびデコーディング装置
KR101377530B1 (ko) * 2009-08-21 2014-03-27 에스케이텔레콤 주식회사 적응적 움직임 벡터 해상도를 이용한 영상 부호화/복호화 방법 및 장치
KR20120045369A (ko) * 2010-10-29 2012-05-09 에스케이 텔레콤주식회사 쿼드 트리 기반으로 블록 필터 정보를 부호화/복호화하는 영상 부호화/복호화 장치 및 방법

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115134591A (zh) * 2015-06-05 2022-09-30 杜比实验室特许公司 图像编解码方法、比特流存储方法
US12088788B2 (en) 2015-06-05 2024-09-10 Dolby Laboratories Licensing Corporation Method and device for encoding and decoding intra-frame prediction
US12537941B2 (en) 2015-06-05 2026-01-27 Dolby Laboratories Licensing Corporation Method and device for encoding and decoding intra-frame prediction
US10567761B2 (en) 2016-03-14 2020-02-18 Interdigital Vc Holdings, Inc. Method and device for encoding at least one image unit, and method and device for decoding a stream representative of at least one image unit

Also Published As

Publication number Publication date
WO2014088316A3 (fr) 2014-10-23

Similar Documents

Publication Publication Date Title
JP7171654B2 (ja) 時間的サブレイヤー情報に基づいたインターレイヤー予測方法及び装置
KR101345090B1 (ko) 적응성 인핸스먼트층 예측을 사용하여 비트 심도 스케일러블 비디오 데이터를 인코딩 및/또는 디코딩하는 방법 및 장치
CN104396249B (zh) 可伸缩视频编码的层间预测的方法和设备
TW202037163A (zh) 改善的子劃分內預測
WO2014171768A1 (fr) Procédé et appareil de traitement de signal vidéo
TW202042552A (zh) 用於照明補償之區塊大小限制
WO2015005622A9 (fr) Procédé et appareil pour le traitement d'un signal vidéo
JP2024539787A (ja) 動きベクトル差分のジョイント・シグナリング方法
TW202236852A (zh) 高效視訊編碼器架構
KR20140122189A (ko) 계층 간 결합된 화면 내 예측을 이용한 영상 부호화/복호화 방법 및 그 장치
KR20230124072A (ko) 단일 참조 시그널링에 의한 도출된 모션 벡터
JP2022526437A (ja) 参照ピクチャー再サンプリングがある場合のラップアラウンド動き補償に関する方法、装置、コンピュータ・プログラム
CN117356098A (zh) 在跨分量变换系数级别重构中导出偏移
WO2011052990A2 (fr) Procédé et appareil de codage/décodage d'images sur la base d'une résolution adaptative
JP2026004573A (ja) ピクチャヘッダを含む映像情報をコーディングする映像デコーディング方法及びその装置
KR20230135670A (ko) 조인트 모션 벡터 차이 코딩
US20210329271A1 (en) Method and apparatus for video coding
TW202236848A (zh) 使用增強內插濾波器的訊框內預測
WO2014088316A2 (fr) Procédé de codage et de décodage vidéo, et appareil utilisant celui-ci
JP2024174009A (ja) 参照フレームの適応的な並べ替えのための方法および装置
WO2015072626A1 (fr) Procédé de génération d'images de référence inter-couche et appareil pour codage vidéo multi-couche
CN116368801B (zh) 用于视频解码的方法和相关装置
CN112118452B (zh) 视频解码的方法和装置、计算机设备
WO2014171771A1 (fr) Procédé et appareil de traitement de signal vidéo
JP7758420B2 (ja) 単一参照動きベクトル差分のための適応解像度

Legal Events

Date Code Title Description
122 Ep: pct application non-entry in european phase

Ref document number: 13860615

Country of ref document: EP

Kind code of ref document: A2