HK1218196B

HK1218196B - Coding and decoding methods of a picture block, corresponding devices and data stream

Info

Publication number: HK1218196B
Application number: HK16106092.6A
Authority: HK
Inventors: Philippe Bordes; Franck Hiron; Pierre Andrivon; Patrick Lopez; Philippe Salmon
Original assignee: 交互数字Vc控股公司
Priority date: 2013-02-22
Filing date: 2014-02-17
Publication date: 2020-06-26

Description

Picture block encoding and decoding method, corresponding equipment and data flow

技术领域Technical Field

公开用于从特殊重构的参考画面解码画面块的方法。还公开对应的编码方法和对应的编码和解码设备。A method for decoding a picture block from a specially reconstructed reference picture is disclosed. Also disclosed are a corresponding encoding method and corresponding encoding and decoding devices.

背景技术Background Art

在视频流传输期间，可用带宽可能随时间改变。因此，流传输应用的输出比特率需要调整，以实时地适合可用带宽，以便避免拥挤。使得能够进行实时比流率调整的一种方式是使用实时编码器，但是需要针对每个客户端分配一个编码系统，例如在关于VOD服务有很多客户端的情况下可能是不可接受的。使得能够进行实时比特率调整的另外的方式是使用可伸缩视频编码。在可伸缩编码中，视频源被编码成若干层。在传送期间，为了调整输出比特率，服务器选择要发送的层(模式“推送”)或者解码器请求要发送的层(模式“回拉”)。该方法适于通过异质信道进行流传输，但是与单一层视频编码相比，可伸缩视频编码降低总的压缩效率并且增加编码器和解码器两者的计算复杂性。实现比特率调整的一种简单方法是对相同的视频序列的多个版本进行编码。这些版本具有不同的分辨率和/或质量级别，并且因此具有不同的比特率。在流传输期间，当需要调整输出比特率时，要传送的流可以动态地从一个版本切换到其他版本，以便适合带宽需求或用户的能力，如图1所示。该方案被称为“流切换”。然而，在帧间编码的画面(P或B画面)处的流之间的直接切换可能造成重构的参考画面的失配，并且导致不正确的画面重构。重构的视频的质量可能显著降低。解决该问题的一种方法是使用使用比特流(典型地，I画面或IDR画面或CRA画面)中的随机访问点(RAP)。IDR是“瞬间解码器刷新(Instantaneous Decoder Refresh)”的英文首字母缩写，CRA是“清除随机访问(Clean Random Access)”的英文首字母缩写。因为切换只能在这些RAP处进行，所以需要频繁地在比特流中分配RAP，以便实现提示流切换。然而，对这样的I/IDR画面进行编码引入大量的比特率开销。另外，使用位于RAP之前的重构的参考画面的RAP之后的画面或者被跳过，或者未被正确地解码，因为它们使用的重构的参考画面与如图2所示的编码中所使用的不同。在图2上，Ic从重构的参考画面I1和I2重构，同时根据重构的参考图参考画面i1和i2对其进行编码。During video streaming, available bandwidth may change over time. Therefore, the output bit rate of the streaming application needs to be adjusted to fit the available bandwidth in real time in order to avoid congestion. One way to enable real-time bit rate adjustment is to use a real-time encoder, but this requires allocating a coding system to each client, which may be unacceptable when there are many clients for a VOD service. Another way to enable real-time bit rate adjustment is to use scalable video coding. In scalable coding, the video source is encoded into several layers. During transmission, in order to adjust the output bit rate, the server selects the layer to be sent (mode "push") or the decoder requests the layer to be sent (mode "pull back"). This method is suitable for streaming over heterogeneous channels, but compared to single-layer video coding, scalable video coding reduces the overall compression efficiency and increases the computational complexity of both the encoder and decoder. A simple way to achieve bit rate adjustment is to encode multiple versions of the same video sequence. These versions have different resolutions and/or quality levels and therefore have different bit rates. During streaming, when the output bit rate needs to be adjusted, the stream to be transmitted can be dynamically switched from one version to the other to suit bandwidth requirements or user capabilities, as shown in Figure 1. This scheme is called "stream switching". However, direct switching between streams at inter-frame coded pictures (P or B pictures) may cause mismatches in the reconstructed reference pictures and lead to incorrect picture reconstruction. The quality of the reconstructed video may be significantly reduced. One way to solve this problem is to use random access points (RAPs) in the bitstream (typically, I pictures, IDR pictures, or CRA pictures). IDR is the acronym for "Instantaneous Decoder Refresh" and CRA is the acronym for "Clean Random Access". Because switching can only be performed at these RAPs, RAPs need to be frequently allocated in the bitstream to achieve prompt stream switching. However, encoding such I/IDR pictures introduces a large bitrate overhead. In addition, pictures after the RAP that use the reconstructed reference picture located before the RAP are either skipped or not decoded correctly because they use a different reconstructed reference picture than that used in the encoding as shown in Figure 2. In Figure 2, Ic is reconstructed from the reconstructed reference pictures I1 and I2, and is encoded based on the reconstructed reference pictures i1 and i2.

在AVC中，设计特殊画面类型(SI/SP)以允许对来自另外的流的画面的相同的重构，并且因此便于流切换。于是，视频画面被编码成切换点处的SP画面而不是如图3所示的帧内编码的画面。SP画面的编码效率高于帧内编码的画面，但是它们与正常的P画面相比仍然效率更低。因此，在分配很多切换点的情况下，总的编码效率仍然降低。In AVC, special picture types (SI/SP) are designed to allow identical reconstruction of pictures from other streams, thus facilitating stream switching. Consequently, video pictures are encoded as SP pictures at switching points rather than intra-coded pictures, as shown in Figure 3. SP pictures are more efficiently coded than intra-coded pictures, but they are still less efficient than normal P pictures. Therefore, even with many switching points, overall coding efficiency is still reduced.

在Zhou等人的标题为“Efficient bit stream switching of H.264 codedvideo”并于发表于SPIE vol.5909(2005)会议的文献中，公开了一种方案，该方案使得能够在任何时间进行切换而不需要大量的比特率开销。该方案仅用于IPPP GOP结构。除了不同比特率的相同视频序列的多个版本之外，还对DIFF画面进行编码，用于切换发生的当前画面的重构的参考画面，如图4所示。DIFF画面是当前画面的重构的参考画面和其他流中的时间对应的画面之间的差异。差异画面传送给解码器以补偿失配。因为如在该文献的第5页所提及的那样，仅在切换发生时才传送DIFF画面，所以以上方案所引入的比特率开销很小。另一方面，该方案仅对从单个重构的参考画面的预测的P画面起作用。另外，该方案需要编码次序和显示次序是相同的。In a paper titled "Efficient bit stream switching of H.264 coded video" by Zhou et al. and presented at the SPIE conference vol. 5909 (2005), a scheme is disclosed that enables switching at any time without requiring a large bitrate overhead. This scheme is only used for the IPPP GOP structure. In addition to multiple versions of the same video sequence at different bitrates, a DIFF picture is encoded, which is used as a reconstructed reference picture for the current picture when switching occurs, as shown in Figure 4. The DIFF picture is the difference between the reconstructed reference picture of the current picture and the temporally corresponding picture in the other stream. The difference picture is transmitted to the decoder to compensate for the mismatch. Because, as mentioned on page 5 of the paper, the DIFF picture is only transmitted when switching occurs, the bitrate overhead introduced by the above scheme is very small. On the other hand, this scheme only works for P pictures predicted from a single reconstructed reference picture. In addition, this scheme requires that the encoding order and the display order are the same.

发明内容Summary of the Invention

公开一种用于对画面块进行解码的方法。该方法包含：A method for decoding a picture block is disclosed. The method comprises:

-将至少一个流S_diff解码成解码数据以及用于标识解码器中的重构的参考画面的一个信息；- decoding at least one stream S_diff into decoded data and an information item for identifying a reconstructed reference picture in the decoder;

-从至少所标识的重构的参考画面并且从解码数据重构特殊基准画面；- reconstructing a special reference picture from at least the identified reconstructed reference picture and from the decoded data;

-从至少特殊参考画面重构画面块，其中不显示在被重构时的特殊参考画面。- reconstructing a picture block from at least a special reference picture, wherein the special reference picture is not displayed when being reconstructed.

有利地，从第一层解码出所标识的重构的参考画面，并且其中，从依赖于第一层的第二层解码出解码数据和标识解友器画面缓冲器中的重构的参考画面的信息。Advantageously, the identified reconstructed reference picture is decoded from a first layer, and wherein decoded data and information identifying the reconstructed reference picture in a decoder picture buffer are decoded from a second layer dependent on the first layer.

根据特定特征，第一层是基础层。According to certain characteristics, the first layer is the base layer.

根据特定实施例，该解码方法还包含对指示第二层的随后解码的画面未使用任何层间预测的标志进行解码。According to a particular embodiment, the decoding method further comprises decoding a flag indicating that a subsequently decoded picture of the second layer does not use any inter-layer prediction.

还公开一种用于对画面块进行编码的方法。该编码方法还包含：A method for encoding a picture block is also disclosed. The encoding method further comprises:

-根据至少一个重构的参考画面对画面块进行编码；以及- encoding a picture block based on at least one reconstructed reference picture; and

-根据另外的重构的参考画面和用于指示解码器画面缓冲器中的另外重构的参考画面将至少一个重构的参考画面编码为特殊参考画面，其中，不显示在被重构时的特殊参考画面。- encoding at least one reconstructed reference picture as a special reference picture based on the further reconstructed reference picture and for indicating the further reconstructed reference picture in a decoder picture buffer, wherein the special reference picture is not displayed when being reconstructed.

有利地，在第一层中对所标识的重构的参考画面进行编码，并且在依赖于第一层的第二层中对至少一个重构的参考画面和用于标识解码器缓冲器中的另外的重构的参考画面进行编码。Advantageously, the identified reconstructed reference picture is encoded in a first layer, and at least one reconstructed reference picture and identification of a further reconstructed reference picture in a decoder buffer are encoded in a second layer dependent on the first layer.

根据特定实施例，还包含对指示第二层的随后解码的画面未使用任何层间预测的标志进行编码。According to a specific embodiment, encoding a flag indicating that a subsequently decoded picture of the second layer does not use any inter-layer prediction is further included.

公开一种用于对画面块进行解码的解码设备。该解码设备包含：A decoding device for decoding a picture block is disclosed. The decoding device comprises:

-用于将至少一个流S_diff解码成解码数据以及用于标识解码器中的重构的参考画面的一个信息的部件；- means for decoding at least one stream S_diff into decoded data and one piece of information for identifying a reconstructed reference picture in the decoder;

-用于从至少所标识的重构的参考画面并且从解码数据重构特殊基准画面的部件；- means for reconstructing a special reference picture from at least the identified reconstructed reference picture and from the decoded data;

-用于从至少特殊参考画面重构画面块的部件，其中不显示在被重构时的特殊参考画面。- means for reconstructing a picture block from at least a special reference picture, wherein the special reference picture is not displayed when being reconstructed.

该解码设备适于执行所述解码方法的步骤。The decoding device is adapted to perform the steps of the decoding method.

公开一种用于对画面块进行编码的编码方法。该编码设备包含：A coding method for coding a picture block is disclosed. The coding device comprises:

该编码设备适于所述编码方法的步骤。The encoding device is adapted to the steps of the encoding method.

公开一种数据流。该数据流包含在其中编码的用于标识解码器画面缓冲器中的重构的参考画面的一个信息以及允许从所标识的重构的参考画面重构特殊参考画面的数据，所述特殊参考画面是不被显示的参考画面。A data stream is disclosed, which contains information encoded therein for identifying a reconstructed reference picture in a decoder picture buffer and data allowing reconstruction of a special reference picture from the identified reconstructed reference picture, the special reference picture being a reference picture that is not displayed.

公开一种解码方法。该解码方法包含：A decoding method is disclosed. The decoding method includes:

根据多层流的第一层的解码器画面缓冲器的另外的参考画面和从所述多层流的第二层解码的数据来重构参考画面，并且将所重构的参考画面存储在所述第二层的解码器画面缓冲器中，其中，所述参考画面被指示为不被显示；reconstructing a reference picture from a further reference picture of a decoder picture buffer of a first layer of a multi-layer stream and data decoded from a second layer of the multi-layer stream, and storing the reconstructed reference picture in a decoder picture buffer of the second layer, wherein the reference picture is indicated not to be displayed;

对指示第二层的随后解码的画面未使用任何层间预测的标志进行解码；以及decoding a flag indicating that a subsequently decoded picture of the second layer does not use any inter-layer prediction; and

至少根据所重构的参考图像来重构所述随后解码的画面的画面块。Picture blocks of the subsequently decoded picture are reconstructed at least from the reconstructed reference image.

有利地，该解码方法还包含从所述第二层解码用于标识所述另外的参考画面的信息。Advantageously, the decoding method further comprises decoding information identifying said further reference picture from said second layer.

有利地，在该解码方法中，从所述第二层解码的数据表示所述参考画面和所述另外的参考画面之间的逐个像素差异。Advantageously, in the decoding method, data decoded from the second layer represent pixel-by-pixel differences between the reference picture and the further reference picture.

有利地，在该解码方法中，第一层是基础层。Advantageously, in the decoding method, the first layer is a base layer.

有利地，在该解码方法中，所述参考画面和所述另外的参考画面是时间对齐的。Advantageously, in the decoding method, said reference picture and said further reference picture are time aligned.

公开一种编码方法。该编码方法包含：A coding method is disclosed. The coding method includes:

根据多层流的第一层的解码器画面缓冲器的另外的参考画面，在所述多层流的第二层中对所述第二层的解码器画面缓冲器的参考画面进行编码，其中，所述参考画面被指示为将不被显示；encoding, in a second layer of a multi-layer stream, a reference picture of a decoder picture buffer of a first layer of the multi-layer stream based on a further reference picture of a decoder picture buffer of the second layer, wherein the reference picture is indicated as not to be displayed;

对指示所述第二层的随后编码的画面未使用任何层间预测的标志进行编码；以及encoding a flag indicating that a subsequently coded picture of the second layer does not use any inter-layer prediction; and

至少根据所述参考画面对所述随后编码的画面的画面块进行编码。Picture blocks of the subsequently coded picture are encoded at least based on the reference picture.

有利地，该编码方法还包含在所述第二层中对用于标识所述另外的参考画面的信息进行编码。Advantageously, the encoding method further comprises encoding in said second layer information for identifying said further reference picture.

有利地，在该编码方法中，在所述第二层中对所述参考画面进行编码包含确定所述参考画面和所述另外的参考画面之间的逐个像素差异，以及对所述逐个像素差异进行编码。Advantageously, in the encoding method, encoding the reference picture in the second layer comprises determining pixel-by-pixel differences between the reference picture and the further reference picture, and encoding the pixel-by-pixel differences.

有利地，在该编码方法中，第一层是基础层。Advantageously, in the encoding method, the first layer is a base layer.

有利地，在该编码方法中，所述参考画面和所述另外的参考画面是时间对齐的。Advantageously, in the encoding method, said reference picture and said further reference picture are time aligned.

公开一种解码器。该解码器包含：A decoder is disclosed. The decoder includes:

用于根据多层流的第一层的解码器画面缓冲器的另外的参考画面和从所述多层流的第二层解码的数据来重构参考画面以及将所重构的参考画面存储在所述第二层的解码器画面缓冲器中的部件，其中，所述参考画面被指示为不被显示；means for reconstructing a reference picture from a further reference picture of a decoder picture buffer of a first layer of a multi-layer stream and data decoded from a second layer of the multi-layer stream, and storing the reconstructed reference picture in a decoder picture buffer of the second layer, wherein the reference picture is indicated not to be displayed;

用于对指示第二层的随后解码的画面未使用任何层间预测的标志进行解码的部件；以及means for decoding a flag indicating that a subsequently decoded picture of the second layer does not use any inter-layer prediction; and

用于至少根据所重构的参考图像来重构所述随后解码的画面的画面块的部件。Means for reconstructing picture blocks of said subsequently decoded picture at least from the reconstructed reference picture.

有利地，该解码器还包含从所述第二层解码用于标识所述另外的参考画面的信息的部件。Advantageously, the decoder further comprises means for decoding from said second layer information identifying said further reference picture.

有利地，在该解码器中，从所述第二层解码的数据表示所述参考画面和所述另外的参考画面之间的逐个像素差异。Advantageously, in the decoder, data decoded from the second layer represents pixel-by-pixel differences between the reference picture and the further reference picture.

有利地，在该解码器中，第一层是基础层。Advantageously, in the decoder, the first layer is a base layer.

有利地，在该解码器中，所述参考画面和所述另外的参考画面是时间对齐的。Advantageously, in the decoder, said reference picture and said further reference picture are time aligned.

公开一种编码器。该编码器包含：An encoder is disclosed. The encoder includes:

用于根据多层流的第一层的解码器画面缓冲器的另外的参考画面，在所述多层流的第二层中对所述第二层的解码器画面缓冲器的参考画面进行编码的部件，其中，所述参考画面被指示为将不被显示；means for encoding, in a second layer of a multi-layer stream, a reference picture of a decoder picture buffer of a first layer of the multi-layer stream based on a further reference picture of a decoder picture buffer of the second layer, wherein the reference picture is indicated as not to be displayed;

用于对指示所述第二层的随后编码的画面未使用任何层间预测的标志进行编码的部件；以及means for encoding a flag indicating that a subsequently coded picture of the second layer does not use any inter-layer prediction; and

用于至少根据所述参考画面对所述随后编码的画面的画面块进行编码的部件。Means for encoding picture blocks of said subsequently coded picture at least based on said reference picture.

有利地，该编码器还包含在所述第二层中对用于标识所述另外的参考画面的信息进行编码的部件。Advantageously, the encoder further comprises means for encoding, in said second layer, information identifying said further reference picture.

有利地，在该编码器中，在所述第二层中对所述参考画面进行编码包含确定所述参考画面和所述另外的参考画面之间的逐个像素差异，以及对所述逐个像素差异进行编码。Advantageously, in the encoder, encoding the reference picture in the second layer comprises determining pixel-by-pixel differences between the reference picture and the further reference picture, and encoding the pixel-by-pixel differences.

有利地，在该编码器中，第一层是基础层。Advantageously, in the encoder, the first layer is a base layer.

有利地，在该编码器中，所述参考画面和所述另外的参考画面是时间对齐的。Advantageously, in the encoder, said reference picture and said further reference picture are time aligned.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

本发明的其他特征和优点将对其一些实施例的描述而呈现，该描述结合以下附图进行：Other features and advantages of the present invention will emerge from the description of some embodiments thereof, which is made in conjunction with the following drawings:

图1和图2例示流切换的一般原理；Figures 1 and 2 illustrate the general principles of stream switching;

图3例示根据技术现状的使用SI/SP画面的流切换的原理；FIG3 illustrates the principle of stream switching using SI/SP pictures according to the state of the art;

图4例示根据技术现状的使用DIFF画面的流切换的原理；FIG4 illustrates the principle of stream switching using DIFF pictures according to the state of the art;

图5例示根据本发明的解码方法的流程图；FIG5 illustrates a flowchart of a decoding method according to the present invention;

图6图示根据本发明的编码方法的流程图；FIG6 illustrates a flow chart of an encoding method according to the present invention;

图7例示根据本发明的使用SRP画面的流切换的原理；FIG7 illustrates the principle of stream switching using SRP pictures according to the present invention;

图8例示根据本发明的解码方法的另一个实施例；FIG8 illustrates another embodiment of a decoding method according to the present invention;

图9图示根据本发明的多层视频解码器；FIG9 illustrates a multi-layer video decoder according to the present invention;

图10图示根据本发明的多层视频编码器；以及FIG10 illustrates a multi-layer video encoder according to the present invention; and

图11表示根据本发明的多层流。FIG. 11 shows a multi-layer flow according to the present invention.

具体实施方式DETAILED DESCRIPTION

本发明涉及用于对像素的画面块进行解码的方法以及对这样的画面块进行编码的方法。画面块属于画面序列的画面。每个画面包含每个均与至少一项画面数据相关联的像素或画面点。一项画面数据例如是一项亮度数据或者一项色度数据。以下参照画面块来描述编码和解码方法。显然，这些方法可以应用于画面的若干画面块以及序列的若干画面，着眼于对一个或多个画面分别进行编码和解码。画面块是任何形式的像素集合。其可以是方形、矩形。但是本发明不限于这样的形式。在下面的部分中，字块(word block)被用于画面块。在HEVC中，块指编码单元(CU)。The present invention relates to a method for decoding a picture block of pixels and a method for encoding such a picture block. A picture block belongs to a picture of a picture sequence. Each picture contains pixels or picture points, each of which is associated with at least one item of picture data. An item of picture data is, for example, a luminance data item or a chrominance data item. The encoding and decoding methods are described below with reference to picture blocks. Obviously, these methods can be applied to several picture blocks of a picture and several pictures of a sequence, with a view to encoding and decoding one or more pictures respectively. A picture block is a collection of pixels in any form. It can be square or rectangular. However, the present invention is not limited to such a form. In the following, the word block is used for a picture block. In HEVC, a block refers to a coding unit (CU).

“预测器(predictor)”项指定用于预测其他数据的数据。预测器用于预测画面块。预测器或预测块从属于其预测(空间预测或画面内预测)的块的与画面相同的画面的一个或若干重构的参考样本或者重构的参考画面(时间预测或画面间预测)的一个(单向预测)或若干(双向预测或双预测)参考块来获得。通过运动向量在重构的参考画面中标识参考块。预测还可以被加权，以考虑照明变化模型(也被称为加权预测)。The "predictor" item specifies data used to predict other data. A predictor is used to predict a picture block. The predictor, or prediction block, is obtained from one or more reconstructed reference samples of the same picture as the block being predicted (spatial prediction or intra-picture prediction) or one (unidirectional prediction) or more (bidirectional prediction or bi-prediction) reference blocks of a reconstructed reference picture (temporal prediction or inter-picture prediction). The reference block is identified in the reconstructed reference picture by a motion vector. The prediction can also be weighted to account for illumination variation models (also known as weighted prediction).

术语“残差”表示从源数据中减去预测器之后所获得的数据。The term "residual" refers to the data obtained after subtracting the predictor from the source data.

术语“重构”表示在将残差与预测器融合之后所获得的数据(例如像素、块)。融合通常是预测器与残差的求和。然而，融合更一般地并且更特别地包含重构的样本的另外的后滤波阶段和/或将偏移添加到重构的样本的另外的步骤。在重构参考画面时，将其存储在DPB(“编码器画面缓冲器”的英文首字母缩写)中作为新重构的参考画面。The term "reconstruction" refers to the data (e.g., pixels, blocks) obtained after fusing the residual with the predictor. Fusion is typically the summation of the predictor and the residual. However, fusion more generally and more particularly includes an additional post-filtering stage of the reconstructed samples and/or an additional step of adding offsets to the reconstructed samples. When a reference picture is reconstructed, it is stored in the DPB (an acronym for "encoder picture buffer") as a newly reconstructed reference picture.

在提及对画面解码时，术语“重构”和“解码”经常被用作同义语。因此，“重构的块”也被叫作术语“解码的块”。When referring to decoding a picture, the terms "reconstruction" and "decoding" are often used as synonyms. Therefore, a "reconstructed block" is also referred to as a "decoded block."

术语编码要采取最宽泛的含义。编码可能包含应用变换和/或量化数据。其还可能仅仅指熵编码。DCT(“离散余弦变换(Discrete Cosine Transform)”)是这样的变换的示例。以相同的方式，术语解码可能包含熵解码以及应用变换和/或逆量化。在解码器侧应用的变换是在编码器侧应用的变换的逆变换。The term encoding is used in the broadest sense. Encoding may include applying a transform and/or quantizing the data. It may also simply refer to entropy coding. The DCT ("Discrete Cosine Transform") is an example of such a transform. In the same way, the term decoding may include entropy decoding as well as applying a transform and/or inverse quantization. The transform applied at the decoder is the inverse of the transform applied at the encoder.

流是形成编码画面的表示的比特的序列以及形成一个或多个编码视频序列的相关联的数据。流是用于指代NAL单元流或字节流的总称。A stream is a sequence of bits forming a representation of a coded picture and associated data forming one or more coded video sequences. Stream is a general term used to refer to a stream of NAL units or a stream of bytes.

NAL(“网络抽象层(Network Abstraction Layer)”的英文首字母缩写)单元是包含要遵照的数据的类型的指示和包含该数据的字节的语法结构。NAL被指定为以适合在各种通信信道或存储介质上运送的方式对该数据进行格式化并且提供头部信息。在NAL单元中包含所有数据，其每一个都包含整数数量的字节。NAL单元指定用于在面向包的系统和流系统两者中使用的通用格式。除了在字节流格式中每个NAL单元之前可以有开始码前缀和额外填充的字节之外，用于面向包的传输和字节流的NAL单元的格式是相同的。A NAL (an acronym for "Network Abstraction Layer") unit is a syntactic structure containing an indication of the type of data to be followed and the bytes containing the data. The NAL is specified to format the data and provide header information in a manner suitable for transport over various communication channels or storage media. All data is contained in NAL units, each of which contains an integer number of bytes. The NAL unit specifies a common format for use in both packet-oriented systems and streaming systems. The format of the NAL unit for packet-oriented transport and byte streams is the same, except that in the byte stream format each NAL unit may be preceded by a start code prefix and additional padding bytes.

AU(“访问单元(Access Unit)”的英文首字母缩写)是根据指定的分类规则彼此相关联、以解码次序连续并且包含恰好一个编码的画面的NAL单元的集合。访问单元的解码总是得到解码画面。An AU (the acronym for “Access Unit”) is a set of NAL units that are associated with each other according to a specified classification rule, are consecutive in decoding order, and contain exactly one coded picture. Decoding an access unit always results in a decoded picture.

在图5和图6中，所表示的框仅是功能实体，其未必对应于物理分开的实体。本领域的技术人员将意识到的是，所述原理的方面可以实施为系统、方法或计算机可读取介质。因此，所述原理的方面可以采取完全硬件实施方式的形式、完全软件实施方式(包括固件、常驻软件、微代码等)的形式或者在本文中一般全部称为“电路”、“模块”或“系统”的组合软件和硬件方面的实施方式。另外，所述原理的方面可以采取计算机可读取存储介质的形式。可以利用一个或多个计算机可读取存储介质的任何组合。In Figures 5 and 6, the represented boxes are only functional entities, which do not necessarily correspond to physically separated entities. It will be appreciated by those skilled in the art that the aspects of the principles can be implemented as systems, methods or computer-readable media. Therefore, the aspects of the principles can take the form of complete hardware implementation, the form of complete software implementation (including firmware, resident software, microcode, etc.) or the implementation of the combined software and hardware aspects generally all referred to as "circuit", "module" or "system" in this article. In addition, the aspects of the principles can take the form of computer-readable storage media. Any combination of one or more computer-readable storage media can be utilized.

图中的流程图和/或框图例示根据本发明的各个实施例的系统、方法和计算机程序产品的可能的实现方式的配置、操作和功能。在这点上，流程图或框图中的每个框可以表示模块、片段或代码的一部分，其包含用于实现所指定的逻辑功能的一个或多个可执行的指令。还应当注意到，在一些可替换的实现方式中，在框中所记录的功能可以不以图中所记录的次序来进行。例如，以相继的方式示出的两个框实际上可以基本并行地执行，或者框有时可以以相反的次序来执行，或者框可以以交替的次序来执行，取决于所涉及的功能。还应当注意的是，框图和/或流程图示例的每个框以及框图和/或流程图示例中的框的组合可以通过执行所指定的或能或去拿的基于专用硬件的系统或者专肜硬件和计算机指令的组合来实现。虽然而未明确描述，但是所述实施例可以以任何组合或子组合的方式利用。The flow chart and/or block diagram in the figure illustrate the configuration, operation and function of the possible implementation of the system, method and computer program product according to various embodiments of the present invention. In this regard, each box in the flow chart or block diagram can represent a part of a module, fragment or code, which contains one or more executable instructions for realizing the specified logical function. It should also be noted that in some alternative implementations, the functions recorded in the box may not be performed in the order recorded in the figure. For example, the two boxes shown in a sequential manner can actually be performed substantially in parallel, or the boxes can sometimes be performed in the opposite order, or the boxes can be performed in an alternating order, depending on the functions involved. It should also be noted that each box of the block diagram and/or flow chart example and the combination of the boxes in the block diagram and/or flow chart example can be implemented by executing a specified or capable or available system based on dedicated hardware or a combination of dedicated hardware and computer instructions. Although not explicitly described, the embodiments can be utilized in any combination or sub-combination.

图5图示根据特定且非限制性的实施例的解码方法的流程图。该方法用于对在流S中编码的当前画面块Bc进行解码。画面块Bc属于当前画面Ic的切片Sc。切片是诸如画面块的集合这样的画面的一部分。5 shows a flow chart of a decoding method according to a specific and non-limiting embodiment. The method is used to decode a current picture block Bc encoded in a stream S. The picture block Bc belongs to a slice Sc of the current picture Ic. A slice is a part of a picture, such as a collection of picture blocks.

在步骤10中，将至少一个流S_diff解码成解码数据(例如，残差和编码模式)以及用于标识存储在DPB中的重构的参考画面R2的信息INFO。In step 10 , at least one stream S_diff is decoded into decoded data (eg, residual and coding mode) and information INFO for identifying the reconstructed reference picture R2 stored in the DPB.

在步骤S12中，从所标识的重构的参考图R2和解码画面重构特殊参考画面(其英文首字母缩写是SRP)R1’。然后将特殊参考画面放置在DPB中。该参考画面R1’是特殊的，因为其从不被显示，而是仅用于重构其他画面中的块。重构SRP R1’包含针对R1’的每个画面块确定预测器以及添加残差。预测器可以根据所标识的重构的参考画面R2(作为与Bc同位置的R2中的块或者运动向量所标识的R2中的经运动补偿的块)或者如传统帧内预测那样地根据R1’的相邻的重构的样本来确定。如果R2中的块在R2中的空间位于与Ic中的Bc的空间位置相同的，则R2中的块与Bc是同位置的。根据变形，如果重构的参考画面R2的大小不同于当前画面Ic的大小，则重新调节R2以便重构特殊参考画面，使得重新调节的R2画面(可能具有适当的填充)与Ic具有相同的大小。在该情况下，从F(R2)重构R1’，其中F是重新调节滤波器。流S_diff可以是流S的一部分，或者可以独立于流S。In step S12, a special reference picture (SRP) R1′ is reconstructed from the identified reconstructed reference picture R2 and the decoded picture. The special reference picture is then placed in the DPB. This reference picture R1′ is special in that it is never displayed but is only used to reconstruct blocks in other pictures. Reconstructing the SRP R1′ involves determining a predictor for each picture block of R1′ and adding a residual. The predictor can be determined based on the identified reconstructed reference picture R2 (as a block in R2 co-located with Bc or a motion-compensated block in R2 identified by a motion vector) or based on neighboring reconstructed samples of R1′ as in conventional intra prediction. A block in R2 is co-located with Bc if its spatial location in R2 is the same as that of Bc in Ic. According to a variant, if the size of the reconstructed reference picture R2 is different from that of the current picture Ic, R2 is rescaled to reconstruct the special reference picture so that the rescaled R2 picture (possibly with appropriate padding) has the same size as Ic. In this case, R1' is reconstructed from F(R2), where F is the rescaling filter. The stream S_diff may be part of the stream S or may be independent of it.

作为示例，流S_diff对不同于R2另外的重构的参考画面R1和重构的参考画面R2之间的逐个像素差异进行编码。R1例如是重构的参考画面，当前画面块Bc根据它进行编码。在该情况下，对流S_diff进行解码包含对差异画面进行解码，其通常通过熵解码、逆量化和变换。变换例如是逆DCT。差异画面通常是重构的参考画面R1和重构的参考画面R2之间的差异的近似。近似是由于在编码期间的损失(例如，由于量化)。如果差异画面DIFF被无损编码，则解码的差异画面DIFF等于重构的参考画面R1和重构的参考画面R2之间的差异。根据变型，如果R1和R2大小不同，则差异画面是重构的参考画面R1和重新调节的重构的参考画面R2之间的差异。作为示例，如果R2大于R1，则下调(downscale)R2，而如果R2小于R1，则上调(upscale)R2。在该情况下，特殊参考画面R1’等于F(R2)+DIFF，如果R2和Ic大小相同的，那么F是单位矩阵，或者在其他情况下，F是重新调节函数。As an example, the stream S_diff encodes the pixel-by-pixel difference between a reconstructed reference picture R1 that is different from R2 and the reconstructed reference picture R2. R1 is, for example, the reconstructed reference picture based on which the current picture block Bc is encoded. In this case, decoding the stream S_diff involves decoding the difference picture, which is typically performed by entropy decoding, inverse quantization, and a transform. The transform is, for example, an inverse DCT. The difference picture is typically an approximation of the difference between the reconstructed reference picture R1 and the reconstructed reference picture R2. The approximation is due to losses during encoding (e.g., due to quantization). If the difference picture DIFF is losslessly coded, the decoded difference picture DIFF is equal to the difference between the reconstructed reference picture R1 and the reconstructed reference picture R2. According to a variant, if R1 and R2 are of different sizes, the difference picture is the difference between the reconstructed reference picture R1 and the rescaled reconstructed reference picture R2. As an example, if R2 is larger than R1, R2 is downscaled, and if R2 is smaller than R1, R2 is upscaled. In this case, the special reference picture R1' is equal to F(R2)+DIFF, where F is the identity matrix if R2 and Ic are of the same size, or a rescaling function in other cases.

根据变型，解码方法还包含可选的对与差异画面DIFF相关联的符号的解码。如果对这样的符号进行解码，则特殊参考画面R1’在符号为正时等于F(R2)+DIFF，并且在符号为负时等于F(R2)-DIFF。According to a variant, the decoding method also comprises the optional decoding of a sign associated with the difference picture DIFF. If such a sign is decoded, the special reference picture R1' is equal to F(R2)+DIFF when the sign is positive and equal to F(R2)-DIFF when the sign is negative.

根据另外的变型，流S_diff针对R1的一些块对这些块和R2中的同位置的块之间的差异进行编码。R1的其他块使用传统的帧内预测(即根据相邻的重构的样本)在S_diff中编码。According to another variant, the stream S_diff encodes, for some blocks of R1, the differences between these blocks and the co-located blocks in R2. Other blocks of R1 are encoded in S_diff using conventional intra prediction (ie from neighbouring reconstructed samples).

根据另外的变型，流S_diff针对R1的一些块对这些块和R2中的对应块之间的差异进行编码。R2中的对应块是同位置的块或者经运动补偿的块。R1中的其他块使用传统的帧内预测(即根据相邻的重构的样本)在S_diff中编码。According to another variant, the stream S_diff encodes the differences between some blocks of R1 and the corresponding blocks in R2. The corresponding blocks in R2 are either co-located blocks or motion-compensated blocks. The other blocks in R1 are encoded in S_diff using conventional intra prediction (i.e., based on neighboring reconstructed samples).

对信息INFO进行解码使得能够处理不同的使用情况。作为示例，如果当前画面块Bc根据两个重构的参考画面R1和r1进行编码，则两个特殊参考画面R1’和r1’以及两个信息INFO和info在步骤10中被解码。特殊参考画面R1’和r1’分别对应于R2和r2，其中R2和r2是存储在DPB中的两个重构的参考画面，Bc根据它们进行重构。因此，INFO向解码器指示R1’将从R2进行重构，而info指示r1’从r2进行重构。Decoding the information INFO enables handling of different use cases. As an example, if the current picture block Bc is encoded based on two reconstructed reference pictures R1 and r1, two special reference pictures R1' and r1' and two pieces of information INFO and info are decoded in step 10. The special reference pictures R1' and r1' correspond to R2 and r2, respectively, where R2 and r2 are the two reconstructed reference pictures stored in the DPB from which Bc is reconstructed. Therefore, INFO indicates to the decoder that R1' will be reconstructed from R2, while info indicates that r1' will be reconstructed from r2.

每个特殊画面例如在流S_diff用指示不同于传统的I、P、B画面/切片类型的画面/切片类型的专用标志来标识。该画面/切片类型指示当前AU包含不被显示的特殊参考画面。根据变型，每个特殊画面用切片头部中的专用标志来标识。Each special picture is identified, for example, in the stream S_diff, by a dedicated flag indicating a picture/slice type that is different from the conventional I, P, or B picture/slice types. This picture/slice type indicates that the current AU contains a special reference picture that is not displayed. According to a variant, each special picture is identified by a dedicated flag in the slice header.

根据变型，画面切片类型是I、P或B，但是切片头部的特殊标志指示重构的画面不被显示但是作为参考存储在DPB中。According to a variant, the picture slice type is I, P or B, but a special flag in the slice header indicates that the reconstructed picture is not displayed but stored as a reference in the DPB.

用于在DPB中标识重构的参考画面R2的信息INFO例如是在文献ISO/IEC 14496-10(第3.104节)中定义的POC(“画面次序计数”的英文首字母缩写)。根据变型，用于标识重构的参考画面的信息是重构的参考画面索引。The information INFO for identifying the reconstructed reference picture R2 in the DPB is, for example, POC (Picture Order Count) defined in ISO/IEC 14496-10 (Section 3.104). According to a variant, the information for identifying the reconstructed reference picture is a reconstructed reference picture index.

在步骤16中，从特殊参考画面R1’重构当前画面块Bc。通常，因为就内容而言，特殊参考画面更接近于R1而不是R2，漂移因此而减小。通常，重构画面块包含从流S中解码残差以及将残差添加到预测器。在跳过模式下，残差可以是0。对残差进行解码包含熵解码、逆量化以及应用在编码器侧应用的变换的逆变换。这些步骤对于视频压缩/编码领域的技术人员是公知的，并且不做进一步的公开。用从流S中解码出的运动向量来标识特殊参考画面R1’中的参考块。参考块被用作预测器。在双向预测的情况下，以两个重构的参考画面(可能是同样的重构的参考画面)来标识两个参考块。预测器是这两个参考块的加权和。如果Bc是从属于可能不同于在编码中所使用的重构的参考画面R1和r1的两个重构的参考画面R2和r2的两个参考块双向预测的，则两个SRP R1’和r1’可能被重构。特殊参考画面R1’和r1’从而被用作Bc的参考画面。如果在重构Bc时在DPB中r1可用，则Bc还可以从一个特殊参考画面R1’和r1来重构。INFO和符号可以针对每个特殊参考画面(在切片头部中或者在切片片段头部中)进行解码或者可以针对若干特殊参考画面来分组在一个单一头部中。INFO和符号例如可以从SEI消息、VPS(视频参数集HEVC)中或者从Sc的切片头部中被解码。In step 16, the current picture block Bc is reconstructed from the special reference picture R1′. Typically, drift is reduced because the special reference picture is closer in content to R1 than to R2. Typically, reconstructing the picture block involves decoding the residual from stream S and adding the residual to the predictor. In skip mode, the residual can be zero. Decoding the residual involves entropy decoding, inverse quantization, and applying the inverse of the transform applied at the encoder. These steps are well known to those skilled in the art of video compression/coding and are not further disclosed. A reference block in the special reference picture R1′ is identified by a motion vector decoded from stream S. The reference block is used as the predictor. In the case of bidirectional prediction, the two reference blocks are identified by two reconstructed reference pictures (possibly the same reconstructed reference picture). The predictor is the weighted sum of these two reference blocks. If Bc is bidirectionally predicted from two reference blocks belonging to two reconstructed reference pictures R2 and r2, which may be different from the reconstructed reference pictures R1 and r1 used in encoding, then two SRPs R1′ and r1′ may be reconstructed. Special reference pictures R1' and r1' are thus used as reference pictures for Bc. If r1 is available in the DPB when reconstructing Bc, Bc can also be reconstructed from a special reference picture R1' and r1. INFO and symbols can be decoded for each special reference picture (in the slice header or in the slice segment header) or can be grouped in a single header for several special reference pictures. INFO and symbols can be decoded, for example, from SEI messages, VPS (Video Parameter Set HEVC), or from the slice header of Sc.

图6图示根据特定且非限制性实施例的编码方法的流程图。该方法用于对流S中的当前画面块Bc进行编码。Figure 6 shows a flow chart of a coding method according to a specific and non-limiting embodiment.The method is used to code a current picture block Bc in a stream S.

在步骤S20中，从流S1中的至少一个第一重构的参考画面R1对当前画面块Bc进行编码。通常，对当前画面块进行编码包含：确定残差；变换残差；以及将所变换的残差量化成量化数据。将量化数据进一步熵编码在流S中。残差通过从当前画面块Bc中减去预测器来获得。预测器从第一重构的参考画面R1确定。更精确地，通过运动向量在重构的参考画面R1中确定预测器。如果当前块是从两个参考块双向预测出的，则预测器通过平均这两个参考块来获得。两个参考块属于两个不同的重构的参考画面R1和r1，或者属于同一重构的参考画面。运动向量也被编码在流S中。这些步骤对于视频压缩领域的技术人员是公知的，不做进一步的公开。In step S20, the current picture block Bc is encoded from at least one first reconstructed reference picture R1 in the stream S1. Typically, encoding the current picture block comprises determining a residual; transforming the residual; and quantizing the transformed residual into quantized data. The quantized data is further entropy encoded in the stream S. The residual is obtained by subtracting a predictor from the current picture block Bc. The predictor is determined from the first reconstructed reference picture R1. More precisely, the predictor is determined in the reconstructed reference picture R1 via a motion vector. If the current block is bidirectionally predicted from two reference blocks, the predictor is obtained by averaging the two reference blocks. The two reference blocks may belong to two different reconstructed reference pictures R1 and r1, or to the same reconstructed reference picture. The motion vector is also encoded in the stream S. These steps are well known to those skilled in the art of video compression and are not disclosed further.

在步骤S24中，将重构的参考画面R1和信息INFO编码到流S_diff中。对S_diff的解码是SRP。流S_diff可以是流S的一部分，或者可以独立于流S。根据不同于用INFO标识的R1的第二重构的参考画面R2将重构的参考画面R1编码在S_diff中。根据变型，如果重构的参考画面R2的大小不同于当前画面Ic的大小，并且因此不同于R1的大小，则对R2重新调节以便对重构的参考画面R1进行编码，使得重新调节的R2画面(可能通过适当的填充)与Ic具有相同的大小。在该情况下，根据F(R2)对R1进行编码，其中F是重新调节滤波器。In step S24, the reconstructed reference picture R1 and the information INFO are encoded into the stream S_diff. The decoding of S_diff is SRP. The stream S_diff can be part of the stream S or can be independent of the stream S. The reconstructed reference picture R1 is encoded in S_diff based on a second reconstructed reference picture R2, which is different from R1, identified by INFO. According to a variant, if the size of the reconstructed reference picture R2 is different from the size of the current picture Ic, and therefore different from the size of R1, R2 is rescaled so that the reconstructed reference picture R1 is encoded so that the rescaled R2 picture (possibly with appropriate padding) has the same size as Ic. In this case, R1 is encoded according to F(R2), where F is the rescaling filter.

作为示例，流S_diff对R1和R2之间的逐个像素差异DIFF进行编码。DIFF画面通过变换(例如，使用DCT)、量化和熵编码进行编码。根据变型，如果R1和R2的大小不同，则差异画面是重构的参考画面R1和重新调节的第二重构的参考画面R2之间的差异。作为示例，如果R2大于R1，则下调R2，而如果R2小于R1，则上调R2。在该情况下，DIFF＝R1-F(R2)，当R2和Ic大小相同时，F是单位矩阵函数，而在其他情况下是重新调节函数。As an example, the stream S_diff encodes the pixel-by-pixel difference DIFF between R1 and R2. The DIFF picture is encoded by transformation (e.g., using DCT), quantization, and entropy coding. According to a variant, if R1 and R2 are of different sizes, the difference picture is the difference between the reconstructed reference picture R1 and the rescaled second reconstructed reference picture R2. As an example, if R2 is greater than R1, R2 is scaled down, while if R2 is less than R1, R2 is scaled up. In this case, DIFF = R1 - F(R2), where F is the identity matrix function when R2 and Ic are of the same size and a rescaling function otherwise.

根据变型，解码方法还包含可选的对与差异画面相关联的符号进行的解码。如果这样的符号被解码，则特殊参考画面R1’在符号为正时等于F(R2)+DIFF，而在符号为负时等于F(R2)-DIFF。According to a variant, the decoding method also comprises the optional decoding of a sign associated with the difference picture. If such a sign is decoded, the special reference picture R1' is equal to F(R2)+DIFF when the sign is positive and to F(R2)-DIFF when the sign is negative.

根据另外的变型，流S_diff针对R1的一些块对这些块和R2中的块(即，与Bc同位置的块或者经运动补偿的块)之间的差异进行编码。使用传统的帧内预测(即，根据相邻的重构的样本)将R1的其他块编码到S_diff中。According to another variant, the stream S_diff encodes the differences between some blocks of R1 and blocks in R2 (i.e. blocks co-located with Bc or motion compensated blocks). Other blocks of R1 are encoded in S_diff using traditional intra prediction (i.e. from neighboring reconstructed samples).

对信息INFO进行编码使得能够处理不同的使用情况。例如，如果从两个重构的参考画面R1和r1对当前画面块Bc进行编码，则两个重构的参考画面根据两个其他重构的参考画面R2和r2进行编码。INFO向解码器指示将从R2重构特殊参考画面R1’，而info指示将从r2重构另外的特殊参考画面r1’。每个特殊参考画面例如在流S_diff中用指示不同于传统的I、P、B画面/切片类型的画面/切片类型的专用标志来标识。该画面/切片类型指示当前AU是将被用于替换DPB中的画面的特殊参考画面。根据变型，每个特殊画面用切片头部中的专用标志来标识。Encoding the information INFO enables handling of different use cases. For example, if the current picture block Bc is encoded from two reconstructed reference pictures R1 and r1, the two reconstructed reference pictures are encoded based on two other reconstructed reference pictures R2 and r2. INFO indicates to the decoder that a special reference picture R1' will be reconstructed from R2, while info indicates that another special reference picture r1' will be reconstructed from r2. Each special reference picture is identified, for example, in the stream S_diff with a dedicated flag indicating a picture/slice type that is different from the traditional I, P, and B picture/slice types. This picture/slice type indicates that the current AU is a special reference picture that will be used to replace a picture in the DPB. According to a variant, each special picture is identified with a dedicated flag in the slice header.

在特定实施例中，针对DPB的重构的参考画面中的若干或每个可能的对来编码一个特殊参考画面和信息INFO。因此，在任何时候都可以从DPB的任何画面重构块Bc，即使其并非来自在限制漂移的同时根据其进行编码的那个，在重构Bc时，如果在DPB中R1不可用，则可以从特殊参考画面R1’而不是R2来重构Bc。漂移因此而被限制，因为就内容而言，R1’更接近于R1而不是R2。In a particular embodiment, a special reference picture and information INFO are encoded for several or each possible pair of reconstructed reference pictures of the DPB. Thus, at any time, block Bc can be reconstructed from any picture of the DPB, even if it is not from the one from which it was coded while limiting drift. When reconstructing Bc, if R1 is not available in the DPB, Bc can be reconstructed from special reference picture R1' instead of R2. Drift is thus limited because, in terms of content, R1' is closer to R1 than to R2.

标识第二重构的参考画面的信息例如是POC。根据变型，标识第二重构的画面的信息是重构的参考画面索引。The information identifying the second reconstructed reference picture is, for example, a POC.According to a variant, the information identifying the second reconstructed picture is a reconstructed reference picture index.

公开的该解码方法的所有变型和选择都可应用于编码方法。具体地，编码方法包含可选的对与差异画面相关联的符号的编码。All variants and options of the disclosed decoding method are applicable to the encoding method. In particular, the encoding method comprises the optional encoding of symbols associated with the difference picture.

INFO和符号例如从SEI消息、VPS(视频参数集HEVC)解码或者来自Sc的切片头部。INFO and SYMBOL are decoded from, for example, SEI messages, VPS (Video Parameter Set HEVC) or from the slice header of Sc.

根据变型，在流切换的背景下使用编码和解码方法，如图7示。在该情况下，画面的第一序列被编码在流S0中。画面的第二序列被编码在流S1中。通常，画面的第二序列与第一序列相同，但是以不同的比特率，即通过使用不同的量化步长进行编码。根据变型，画面的第二序列是第一序列的重新调节的版本，即上调或下调的版本。根据特定实施例，S0和S1具有相同的GOP结构(即，相同的解码次序和相同的参考画面列表，如在HEVC标准的8.3.1节和8.3.2节中所定义的那样)。According to a variant, the encoding and decoding method is used in the context of stream switching, as shown in FIG7 . In this case, a first sequence of pictures is encoded in stream S0 . A second sequence of pictures is encoded in stream S1 . Typically, the second sequence of pictures is identical to the first sequence, but is encoded at a different bit rate, i.e. by using a different quantization step size. According to a variant, the second sequence of pictures is a rescaled version of the first sequence, i.e. an upscaled or downscaled version. According to a particular embodiment, S0 and S1 have the same GOP structure (i.e. the same decoding order and the same reference picture list, as defined in sections 8.3.1 and 8.3.2 of the HEVC standard).

除了流S0和S1之外，在每个时刻tn，S1的重构的参考画面被进一步地编码在流S_diff中作为来自如图7所示的S0的在时间上相对应的，即时间对齐的(例如，相同的画面次序计数)重构的参考画面的SRP。重构的参考画面与用于标识对应的重构的参考画面的信息info_tn一起编码在S_diff中。注意，对应于的源画面被编码在S1中，并且对应于的源画面被编码在S0中。In addition to streams S0 and S1, at each time instant tn, the reconstructed reference picture of S1 is further encoded in the stream S_diff as the SRP of the temporally corresponding, i.e., time-aligned (e.g., same picture order count) reconstructed reference picture from S0 as shown in FIG7 . The reconstructed reference picture is encoded in S_diff together with information info_tn for identifying the corresponding reconstructed reference picture. Note that the source picture corresponding to is encoded in S1, and the source picture corresponding to is encoded in S0.

参考图5公开的解码方法用于对从第一流S0切换到第二流S1之后的画面块Bc进行解码。参照图7，从流S0中解码画面并且显示直至时间t2为止。在t2和t3之间发生切换。在切换之后，从流S1中解码并显示画面。在切换时，DBP0包含从S0解码的若干重构的参考画面。DPB0与S0有关。参照图7，PB0包含在切换时间处的三个重构的参考画面和The decoding method disclosed with reference to FIG5 is used to decode a picture block Bc after switching from a first stream S0 to a second stream S1. Referring to FIG7, a picture is decoded from stream S0 and displayed until time t2. A switch occurs between t2 and t3. After the switch, a picture is decoded and displayed from stream S1. At the time of the switch, DBP0 contains several reconstructed reference pictures decoded from S0. DPB0 is associated with S0. Referring to FIG7, PB0 contains three reconstructed reference pictures at the switching time and

在步骤10中，S_diff1、S_diff2和S_diff3被解码成解码数据(例如，残差或编码模式)以及标识存储在DPB0中的重构的参考画面和的信息info_t0、info_t1和info_t2。In step 10 , S_diff1 , S_diff2 , and S_diff3 are decoded into decoded data (eg, residual or coding mode) and information info_t0 , info_t1 , and info_t2 identifying the reconstructed reference pictures stored in DPB0 .

在步骤12中，从对应的解码数据和对应的重构的参考画面和重构三个特殊参考画面SRP_t0、SRP_t1和SRP_t2。然后，将重构的SRP存储在(可能)不同于DPB0的 DPB1中。DPB1与S1有关。根据第一特定实施例，S_diff对和可能被重新调节的在时间上相对应的画面之间的逐个像素差异进行编码。在该情况下，重构的SRP是其中diff_t0、diff_t1、 diff_t2是从S_diff解码的。如有必要，用F重新调节使得其大小与当前画面Ic的大小相同。如果没有进行重新调节，则F是单位矩阵函数。根据第二特定实施例，S_diff使用可能由F重新调节的对进行编码。在该情况下，中的块的预测器是画面中的空间同位置的块或者是中的经运动补偿的块或者从中的空间相邻块(空间帧内预测)得出。在第一特定实施例的情况下，当不需要重新调节时，即当第一和第二流的画面的大小相同时，可以使用相同的差异画面diff_t0、diff_t1和diff_t2从S0切换到S1或者从S1切换到 S0。在先前的示例中，如果diff_t0对和流S1中的在时间上相对应的画面之间差异，而不是相反的情况进行编码，则从中减去diff_t0而不是添加，以便重构SRP_t0。从而对符号解码以指示是通过添加差异画面还是通过减去差异画面来修改重构的参考画面。 In step 12, three special reference pictures SRP_t0, SRP_t1, and SRP_t2 are reconstructed from the corresponding decoded data and the corresponding reconstructed reference pictures. The reconstructed SRPs are then stored in DPB1, which may be different from DPB0. DPB1 is related to S1. According to a first specific embodiment, S_diff encodes the pixel-by-pixel differences between temporally corresponding pictures, which may be rescaled. In this case, the reconstructed SRPs are where diff_t0, diff_t1, and diff_t2 are decoded from S_diff. If necessary, they are rescaled using F to have the same size as the current picture Ic. If no rescaling is performed, F is the identity matrix function. According to a second specific embodiment, S_diff is encoded using the pair, which may be rescaled by F. In this case, the predictor for the block in is either a spatially co-located block in the picture, or a motion-compensated block in or derived from a spatially neighboring block in (spatial intra prediction). In the case of the first specific embodiment, when rescaling is not required, that is, when the picture sizes of the first and second streams are the same, the same difference pictures diff_t0, diff_t1, and diff_t2 can be used to switch from S0 to S1 or vice versa. In the previous example, if diff_t0 encodes the difference between temporally corresponding pictures in stream S1, rather than the reverse, diff_t0 is subtracted from it to reconstruct SRP_t0. Thus, a symbol is decoded to indicate whether the reconstructed reference picture is modified by adding or subtracting the difference picture.

在步骤S16中，从DPB1中的重构的参考画面来重构Bc。就在切换之后，DPB1包含三个SRP。In step S16, Bc is reconstructed from the reconstructed reference pictures in DPB 1. Just after switching, DPB 1 contains three SRPs.

本发明显然不限制于3个重构的参考画面的情况。根据本发明的特定实施例，对于DPB0中的所有重构的参考画面，在步骤12重构特殊参考画面并且存储在DPB1中。根据变型，仅对将被用作切换之后的参考画面的DPB0中的每个重构的参考画面重构SRP。The present invention is obviously not limited to the case of three reconstructed reference pictures. According to a specific embodiment of the present invention, for all reconstructed reference pictures in DPB0, a special reference picture is reconstructed in step 12 and stored in DPB1. According to a variant, an SRP is reconstructed only for each reconstructed reference picture in DPB0 that will be used as a reference picture after switching.

根据变型，将标志f13编码(相应地，解码)例如在VPS或SEI中，指示具有给定的layer_id的随后编码(相应地，解码)的画面未使用任何层间预测。更精确地，在标志之后编码(相应地，解码)的画面未使用任何层间预测。According to a variant, the flag f13 is encoded (respectively, decoded), for example in a VPS or SEI, indicating that subsequently encoded (respectively, decoded) pictures with a given layer_id do not use any inter-layer prediction. More precisely, pictures encoded (respectively, decoded) after the flag do not use any inter-layer prediction.

图8例示根据特定且非限制性实施例的解码方法的另一实施例。解码器接收不同的访问单元。首先接收并解码访问单元AU1。从解码的AU1重构第一画面I1。然后，接收并解码第二访问单元AU2。从解码的AU2重构第二画面I2。画面I1和I2在它们被信号通知为用作参考画面的情况下属于相同的流S0并且存储在DPB0中。然后，切换发生。可以由向编码器发送接收S_diff流的请求的解码器请求该切换。根据变型，切换由编码器启动。在该切换之后，解码器接收两个AU单元S_diff1和S_diff2。S_diff1和S_diff2(步骤10)被编码以便分别使用画面I1和I2来重构(步骤12)SRP1和SRP2。SRP1和SRP2是两个特殊的参考画面。然后将SRP1和SRP2放在与S1相关的DPB1中。然后，解码器接收AU3，并对其进行解码。从解码的AU3并且可能从DPB1的至少一个画面(时间预测)，即SRP1或SRP2重构画面I3。I3属于第二流S1并且可能被存储在DPB1中以便将来用作重构的参考画面。然后，解码器接收AU4并且对其进行解码。从解码的AU4并且可能从DPB1的至少一个画面(时间预测)重构画面I4。显示画面I1、I2、I3和I4，但是不显示SRP1、SRP2。实际上，仅显示两个时间对齐的画面中的一个。SRP1与I1是时间对齐的，而SRP2与I2是时间对齐的。Figure 8 illustrates another embodiment of a decoding method according to a specific and non-limiting embodiment. The decoder receives different access units. First, access unit AU1 is received and decoded. A first picture I1 is reconstructed from the decoded AU1. Then, a second access unit AU2 is received and decoded. A second picture I2 is reconstructed from the decoded AU2. Pictures I1 and I2 belong to the same stream S0 and are stored in DPB0 when they are signaled to be used as reference pictures. Then, a switch occurs. This switch can be requested by the decoder sending a request to the encoder to receive the S_diff stream. According to a variant, the switch is initiated by the encoder. After this switch, the decoder receives two AU units S_diff1 and S_diff2. S_diff1 and S_diff2 (step 10) are encoded so that SRP1 and SRP2 are reconstructed (step 12) using pictures I1 and I2, respectively. SRP1 and SRP2 are two special reference pictures. SRP1 and SRP2 are then placed in DPB1 associated with S1. The decoder then receives AU3 and decodes it. Picture I3 is reconstructed from the decoded AU3 and possibly from at least one picture of DPB1 (temporal prediction), i.e., SRP1 or SRP2. I3 belongs to the second stream S1 and may be stored in DPB1 for future use as a reference picture for reconstruction. The decoder then receives AU4 and decodes it. Picture I4 is reconstructed from the decoded AU4 and possibly from at least one picture of DPB1 (temporal prediction). Pictures I1, I2, I3, and I4 are displayed, but SRP1 and SRP2 are not. In fact, only one of the two time-aligned pictures is displayed. SRP1 is time-aligned with I1, while SRP2 is time-aligned with I2.

根据本发明的特定实施例，第一和第二序列的画面以及特殊的参考画面被编码成多层流。作为特定示例，被标识为特殊的参考画面的画面被编码为依赖于例如其中第一序列的画面被编码的基础层这样的另外的层(流S0)的可伸缩流的增强层。如果第一层需要来自第二层的信息用于解码，则第一层依赖于该第二层。增强层允许从S0的重构的参考画面重构将被用作用于重构从S0切换到S1之后的S1的画面的参考画面的特殊参考画面。该增强层例如与SVC或SHVC编码标准兼容。根据本发明的特定实施例，特殊参考画面通过用于对增强层进行编码的SVC或SHVC所提供的编码工具/模式的子集来编码。根据本发明的另外的实施例，在SVC或SHVC编码标准中禁用层内运动向量预测(时间预测)。相反地，激活来自S0层的帧内预测。也可以激活画面内预测。根据另外的实施例，例如通过将HEVC标志slice_temporal_mvp_enable_flag设置为假来禁用时间mv预测以对S0和S1编码。这意味着运动向量预测(MV预测)使用来自重构的相邻的编码单元的MV而不是使用先前重构的参考画面的MV来建立。According to a specific embodiment of the present invention, the pictures of the first and second sequences and the special reference pictures are encoded into a multi-layer stream. As a specific example, the pictures identified as special reference pictures are encoded as enhancement layers of a scalable stream that depends on another layer (stream S0), such as the base layer in which the pictures of the first sequence are encoded. If the first layer needs information from the second layer for decoding, the first layer depends on the second layer. The enhancement layer allows the reconstruction of special reference pictures from the reconstructed reference pictures of S0 to be used as reference pictures for reconstructing pictures of S1 after switching from S0 to S1. The enhancement layer is compatible with the SVC or SHVC coding standards, for example. According to a specific embodiment of the present invention, the special reference pictures are encoded using a subset of the coding tools/modes provided by SVC or SHVC for encoding the enhancement layer. According to another embodiment of the present invention, intra-layer motion vector prediction (temporal prediction) is disabled in the SVC or SHVC coding standards. Instead, intra-frame prediction from the S0 layer is activated. Intra-picture prediction can also be activated. According to another embodiment, temporal MV prediction is disabled for encoding S0 and S1, for example by setting the HEVC flag slice_temporal_mvp_enable_flag to false. This means that motion vector prediction (MV prediction) is built using MVs from reconstructed neighboring coding units instead of using MVs of previously reconstructed reference pictures.

在下面的图9和10中，编码和解码模块被称为编码器和解码器。In the following Figures 9 and 10, the encoding and decoding modules are referred to as encoder and decoder.

图9图示根据特定且非限制性实施例的多层编码器。使用作为单层编码器的第一编码器ENC0，例如MPEG2、H.264或HEVC兼容的编码器将第一序列的画面编码在S0中。本发明不限于所使用的单层编码器。用ENC0编码的参考画面被重构为R2，并且被提供为给第三编码器ENC2的输入。第二编码器ENC1被用于对S1中的第二序列的画面编码。本发明不限于所使用的编码器。用ENC1编码的在时间上对应于重构的参考画面R2的参考画面被重构为R1并且提供为给第三编码器ENC2的输入。因此，对于ENC0的DPB中的每个重构的参考画面R2，重构在时间上对应的参考画面R1。从而，编码器ENC2根据可能重新调节为流S_diff的在时间上对应的重构的参考画面R2对重构的参考画面R1编码。根据特定实施例，编码器ENC2包含用于从R1中减去R2(可能经过重新调节)的减法器，并且还包含用于对这样获得的可能经过变换和量化的差异画面进行编码的熵编码器。根据变型，从R1的每个块中减去预测器，其中预测器是画面R2中的空间同位置的块(可能经过重新调节)或者R2中的运动补偿的块(可能经过重新调节)或者从R1中的空间相邻的块得出(空间帧内预测)。从而获得残差，并且在可能的变换和量化之后对残差进行进一步的熵编码。在该情况下，编码在S_diff中的不是R1和R2之间的简单的逐个像素差异。标识被用于对重构的参考画面R1进行编码的重构的参考画面R2的信息INFO也被编码在S_diff中。编码器ENC2例如与诸如SVC或SHVC这样的可伸缩视频编码器兼容。本发明不限于所使用的可伸缩编码器。可伸缩视频编解码标准定义将属于一个层(BL)的AU与属于另外的增强层AU分离/区分的layer_id指示器。根据特定实施例，来自ENC0的AU使用不同于被用于对来自ENC2的AU进行编码的layer_id的给定的layer_id进行编码。来自ENC1的AU和来自ENC2的AU具有相同的layer_id。根据有利的实施例，ENC1和ENC2可以是同一编码模块。FIG9 illustrates a multi-layer encoder according to a specific and non-limiting embodiment. A first sequence of pictures is encoded in S0 using a first encoder ENC0, such as an MPEG2, H.264, or HEVC-compatible encoder, as a single-layer encoder. The present invention is not limited to the single-layer encoder used. Reference pictures encoded with ENC0 are reconstructed as R2 and provided as input to a third encoder ENC2. A second encoder ENC1 is used to encode a second sequence of pictures in S1. The present invention is not limited to the encoder used. Reference pictures encoded with ENC1 that temporally correspond to the reconstructed reference pictures R2 are reconstructed as R1 and provided as input to the third encoder ENC2. Thus, for each reconstructed reference picture R2 in the DPB of ENC0, a temporally corresponding reference picture R1 is reconstructed. Consequently, encoder ENC2 encodes the reconstructed reference picture R1 based on the temporally corresponding reconstructed reference picture R2, which may be rescaled to the stream S_diff. According to a particular embodiment, the encoder ENC2 comprises a subtractor for subtracting R2 (possibly rescaled) from R1, and further comprises an entropy encoder for encoding the difference picture thus obtained, possibly transformed and quantized. According to a variant, a predictor is subtracted from each block of R1, where the predictor is a spatially co-located block in picture R2 (possibly rescaled) or a motion-compensated block in R2 (possibly rescaled) or is derived from a spatially adjacent block in R1 (spatial intra prediction). A residual is thus obtained, which is further entropy-coded after possible transformation and quantization. In this case, what is encoded in S_diff is not a simple pixel-by-pixel difference between R1 and R2. Information INFO identifying the reconstructed reference picture R2 used to encode the reconstructed reference picture R1 is also encoded in S_diff. The encoder ENC2 is compatible with scalable video coders such as SVC or SHVC, for example. The invention is not limited to the scalable coder used. The Scalable Video Codec standard defines a layer_id indicator that separates/distinguishes AUs belonging to one layer (BL) from AUs belonging to another enhancement layer. According to a specific embodiment, AUs from ENC0 are encoded using a given layer_id that is different from the layer_id used to encode AUs from ENC2. AUs from ENC1 and AUs from ENC2 have the same layer_id. According to an advantageous embodiment, ENC1 and ENC2 can be the same coding module.

图10图示根据特定且非限制性实施例的多层解码器。第一流S0使用作为例如是MPEG2、H.264或HEVC兼容的解码器的单层解码器的第一解码器DEC0来解码。本发明不限制于所使用的单层解码器。解码器DEC0从第一流S0，具体地从存储在DPB0中的参考画面R2重构画面。第二解码器DEC1用于从第二流S1重构画面。本发明不限于所使用的解码器。解码器DEC2从流S_diff解码(步骤10)用于标识在DPB0中的重构的参考画面R2的信息INFO。解码器DEC2例如与诸如SVC或SHVC这样的可伸缩视频解码器兼容。本发明不限于所使用的可伸缩解码器。解码器DEC2进一步从可能经过重新调节的时间对齐的重构的参考画面R2以及从S_diff解码的数据(例如，残差、编码模式)重构(步骤S12)特殊参考画面R1’。根据特定实施例，解码器DEC2包含用于从S_diff解码残差的熵解码器以及用于将残差添加到预测器的加法器，其中，预测器从可能经过重新调节的R2中的同位置或经运动补偿的块或者从R1’中的重构的样本(画面内预测)得出。然后将特殊参考画面R1’放在DPB1中。Figure 10 illustrates a multi-layer decoder according to a specific and non-limiting embodiment. The first stream S0 is decoded using a first decoder DEC0 that is a single-layer decoder, for example, a decoder compatible with MPEG2, H.264 or HEVC. The present invention is not limited to the single-layer decoder used. Decoder DEC0 reconstructs the picture from the first stream S0, specifically from the reference picture R2 stored in DPB0. The second decoder DEC1 is used to reconstruct the picture from the second stream S1. The present invention is not limited to the decoder used. Decoder DEC2 decodes (step 10) information INFO for identifying the reconstructed reference picture R2 in DPB0 from the stream S_diff. Decoder DEC2 is compatible with scalable video decoders such as SVC or SHVC, for example. The present invention is not limited to the scalable decoder used. Decoder DEC2 further reconstructs (step S12) the special reference picture R1' from the reconstructed reference picture R2, which may have been rescaled in time alignment, and from the data decoded from S_diff (e.g., residual, coding mode). According to a particular embodiment, the decoder DEC2 comprises an entropy decoder for decoding the residual from S_diff and an adder for adding the residual to a predictor derived from a possibly rescaled co-located or motion compensated block in R2 or from reconstructed samples in R1 ' (intra-picture prediction). The special reference picture R1 ' is then placed in DPB1.

根据有利的实施例，DEC1和DEC2可以是同一解码模块。According to an advantageous embodiment, DEC1 and DEC2 may be the same decoding module.

图11例示根据特定且非限制性实施例的多层流。在该图中，虚线表示画面依赖性。具有layer_id＝Layer_A的AU1和AU2被接收和解码。参考画面b1和b2根据解码的AU进行重构并且存储在Layer_A的DPB_A中。在切换时，具有layer_id＝Layer_B的AU S_diff1和S_diff2被接收和解码。然后，解码器DEC2根据从S_diff1和S_diff2解码的数据并且还根据分别从S_diff1和S_diff2解码的信息info_1和info_2所标识的b1和b2来重构特殊参考画面e’1和e’2。分别与b1和b2时间对齐的特殊参考画面e’1和e’2存储在Layer_B的DPB_B中。然后，接收并解码AU3。从该解码的AU3并且还从特殊参考画面e’1和e’2重构画面e3。重构的画面e3存储在DPB_B中，因为e3用作e4的重构的参考画面。接收并解码AU4。从解码的AU4并且还从特殊参考画面e’2以及重构的参考画面e3来重构画面e4。接收并解码随后的AU5和AU6。从解码出的AU5和AU6重构对应的画面e5和e6。如果重构的画面被用作参考画面，则DPB_B可能通过添加e5和e6来更新。e’1优选为在对e3进行编码时所使用的重构的参考画面之一的e1的近似。e’2优选为在对e3和e4进行编码时所使用的重构的参考画面之一的e2的近似。有利地，标志f13被编码(相应地，被解码)在例如VPS或SEI中，指示具有给定layer_id的随后编码(相应地，解码)的解码画面未使用任何层间预测。更准确地，在标志之后编码(相应地，解码)的画面未使用层间预测。Figure 11 illustrates a multi-layer stream according to a specific and non-limiting embodiment. In the figure, dotted lines represent picture dependencies. AU1 and AU2 with layer_id = Layer_A are received and decoded. Reference pictures b1 and b2 are reconstructed based on the decoded AUs and stored in DPB_A of Layer_A. Upon switching, AUs S_diff1 and S_diff2 with layer_id = Layer_B are received and decoded. Decoder DEC2 then reconstructs special reference pictures e'1 and e'2 based on the data decoded from S_diff1 and S_diff2 and also based on b1 and b2 identified by information info_1 and info_2 decoded from S_diff1 and S_diff2, respectively. Special reference pictures e'1 and e'2, which are time-aligned with b1 and b2, respectively, are stored in DPB_B of Layer_B. AU3 is then received and decoded. Picture e3 is reconstructed from this decoded AU3 and also from the special reference pictures e'1 and e'2. The reconstructed picture e3 is stored in DPB_B because e3 is used as a reference picture for the reconstruction of e4. AU4 is received and decoded. Picture e4 is reconstructed from the decoded AU4 and also from the special reference picture e'2 and the reconstructed reference picture e3. The subsequent AU5 and AU6 are received and decoded. Corresponding pictures e5 and e6 are reconstructed from the decoded AU5 and AU6. If the reconstructed pictures are used as reference pictures, DPB_B may be updated by adding e5 and e6. e'1 is preferably an approximation of e1, one of the reconstructed reference pictures used when encoding e3. e'2 is preferably an approximation of e2, one of the reconstructed reference pictures used when encoding e3 and e4. Advantageously, a flag f13 is encoded (respectively, decoded) in, for example, a VPS or SEI, indicating that subsequently encoded (respectively, decoded) decoded pictures with a given layer_id do not use any inter-layer prediction. More precisely, pictures encoded (respectively, decoded) after the flag do not use inter-layer prediction.

将第一和第二序列的画面以及特殊参考画面编码成多层流使得能够重构例如具有相同POC这样的时间对齐的两个参考画面(b1和e’1，或者b2和e’2)。实际上，在多层方法中使用不同的DPB。特别地，对每个层使用一个DPB。因此，时间对齐的重构的参考画面存储在不同的DPB中。由于层的依赖性，对多层流进行解码通常需要在对级别N+1的层进行解码之前对级别N的层进行解码，其中N为整数。层之间的这种依赖性不与流切换应用兼容。有利地，对标志f13进行编码引入层之间的独立性，并且因此呈现适合于流切换应用的可伸缩编码/解码。Encoding the pictures of the first and second sequences and the special reference pictures into a multi-layer stream makes it possible to reconstruct two reference pictures (b1 and e'1, or b2 and e'2) that are time-aligned, for example, with the same POC. In practice, different DPBs are used in the multi-layer approach. In particular, one DPB is used for each layer. Therefore, the time-aligned reconstructed reference pictures are stored in different DPBs. Due to the dependency of the layers, decoding a multi-layer stream usually requires decoding the layers of level N before decoding the layers of level N+1, where N is an integer. This dependency between layers is not compatible with stream switching applications. Advantageously, encoding the flag f13 introduces independence between the layers and thus presents scalable coding/decoding suitable for stream switching applications.

根据本发明的编码和解码方法使得能够实现灵活的流切换，同时具有仅在切换发生时的较小的比特率开销。这些方法适合于任何GOP结构、任何数量的重构的参考画面，甚至在解码次序不同于显示次序时也适合。The encoding and decoding methods according to the present invention enable flexible stream switching with a small bitrate overhead only when switching occurs. These methods are suitable for any GOP structure, any number of reconstructed reference pictures, and even when the decoding order is different from the display order.

下面提供在S_diff流的SHVC编码标准框架内的语法的示例。An example of the syntax within the framework of the SHVC coding standard for an S_diff stream is provided below.

slice_typeslice_type slice_type的名称The name of the slice_type 00 B(B切片)B (B slice) 11 P(P切片)P (P slice) 22 I(I切片)I(I slice) 33 SRP(SRP切片)SRP (SRP slice)

添加slice_type以标识特殊参考画面的切片。Add slice_type to identify the slice of a special reference picture.

sign_diff_pic等于1指示应当对预测添加残差，否则指示应当对预测减去残差。sign_diff_pic equal to 1 indicates that the residual should be added to the prediction, otherwise it indicates that the residual should be subtracted from the prediction.

pic_order_cnt_diffpic_lsb指定该特殊参考画面的画面次序计数模数MaxPicOrderCntLsb。然后，帧内BL预测将使用具有相同pic_order_cnt的DPB中的参考画面。pic_order_cnt_lsb语法元素的长度是log2_max_pic_order_cnt_lsb_minus4+4比特。pic_order_cnt_diffpic_lsb的值应当是在0到MaxPicOrderCntLsb–1的范围内，并且包含0和MaxPicOrderCntLsb–1。当不存在pic_order_cnt_diffpic_lsb时，pic_order_cnt_diffpic_lsb被推断为等于0。pic_order_cnt_diffpic_lsb specifies the picture order count modulo MaxPicOrderCntLsb for this special reference picture. Intra BL prediction will then use reference pictures in the DPB with the same pic_order_cnt. The length of the pic_order_cnt_lsb syntax element is log2_max_pic_order_cnt_lsb_minus4 + 4 bits. The value of pic_order_cnt_diffpic_lsb shall be in the range of 0 to MaxPicOrderCntLsb–1, inclusive. When pic_order_cnt_diffpic_lsb is not present, pic_order_cnt_diffpic_lsb is inferred to be equal to 0.

delta_poc_msb_diffpic_cycle_lt用于确定被用于重构该特殊参考画面的DPB中的长期重构的参考画面的画面序列计数值的最高有效位的值。当不存在delta_poc_msb_cycle_lt时，其被推断为等于0。delta_poc_msb_diffpic_cycle_lt is used to determine the value of the most significant bit of the picture sequence count value of the long-term reconstructed reference picture in the DPB used to reconstruct this special reference picture. When delta_poc_msb_cycle_lt is not present, it is inferred to be equal to 0.

num_layer_id_diffpic_apply指示被用于对该特殊参考画面进行解码的重构的参考画面的num_layer_id。num_layer_id_diffpic_apply indicates the num_layer_id of the reconstructed reference pictures used for decoding this special reference picture.

语法的示例(vps_extension)：Example of syntax (vps_extension):

diff_pic_flag_enabled等于1指示对inter_layer_pred_for_non_diff_picture_flag编码。diff_pic_flag_enabled equal to 1 indicates that inter_layer_pred_for_non_diff_picture_flag is coded.

inter_layer_pred_for_non_diff_picture_flag等于1指示类型I、P或B的任何随后画面不使用层间预测，但是类型SRP的画面可能使用层间预测，但不是时间层内预测。inter_layer_pred_for_non_diff_picture_flag equal to 1 indicates that any subsequent pictures of type I, P, or B do not use inter-layer prediction, but pictures of type SRP may use inter-layer prediction, but not temporal intra-layer prediction.

根据本发明并且在图9和10上所示的视频编码器和解码器例如实现为各种形式的硬件、软件、固件、专用处理器或者其组合。优选地，所述原理可以实现为硬件和软件的组合。而且，软件优选地实现为有形地实施在程序存储设备上的应用程序。应用程序可以上载到包含任何适合架构的机器，并且由该机器执行。优选地，该机器实现在具有诸如一个或多个中央处理单元(CPU)、随机存取存储器(RAM)和输入/输出(I/O)接口这样的硬件的计算机平台上。计算机平台还包括操作系统和微指令码。在本文中描述的各种处理和功能可以是微指令码的一部分或经由操作系统妊的应用程序的一部分(或其组合)。另外，各种其他外围设备可以连接到该计算机平台，诸如另外的数据存储设备和打印设备。According to the present invention and the video encoder and decoder shown in Figures 9 and 10, for example, are implemented as various forms of hardware, software, firmware, special processors or combinations thereof. Preferably, the principle can be implemented as a combination of hardware and software. Moreover, the software is preferably implemented as an application program tangibly implemented on a program storage device. The application program can be uploaded to a machine comprising any suitable architecture and executed by the machine. Preferably, the machine is implemented on a computer platform with hardware such as one or more central processing units (CPUs), random access memories (RAMs) and input/output (I/O) interfaces. The computer platform also includes an operating system and microinstruction codes. The various processes and functions described herein can be a part of the microinstruction code or a part (or a combination thereof) of the application program via the operating system. In addition, various other peripheral devices can be connected to the computer platform, such as other data storage devices and printing equipment.

根据变型，根据本发明的编码和解码设备根据纯粹的硬件实现方式来实现，例如实现为专用组件(例如ASIC(专用集成电路)或FPGA(现场可编程门阵列)或VLSI(超大规模集成电路))的形式，或者实现为集成在设备中的若干电子组件的形式，或者甚至是以硬件元件和软件元件混合的形式。According to a variant, the encoding and decoding device according to the invention is implemented according to a purely hardware implementation, for example in the form of a dedicated component such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array) or a VLSI (Very Large Scale Integrated Circuit), or in the form of several electronic components integrated in the device, or even in the form of a mixture of hardware and software elements.

Claims

1. A decoding method, comprising:

A reference frame is reconstructed based on an additional reference frame from the decoder frame buffer of the first layer of the scalable multi-layer stream and data decoded from the second layer of the scalable multi-layer stream, and the reconstructed reference frame is stored in the decoder frame buffer of the second layer, wherein the reconstructed reference frame is indicated not to be displayed.

The subsequent decoding of the image indicating the second layer was performed without using any inter-layer prediction flags; and

At least the frame blocks of the subsequently decoded frame are reconstructed based on the reconstructed reference frame.

2. The decoding method according to claim 1 further comprises:

Information used to identify the additional reference image is decoded from the second layer.

3. The decoding method according to any one of claims 1 to 2, wherein the reconstructed reference frame and the other reference frame are time-aligned.

4. An encoding method, comprising:

Based on an additional reference frame of the decoder frame buffer of the first layer of the scalable multi-layer stream, the reference frame of the decoder frame buffer of the second layer of the scalable multi-layer stream is encoded in the second layer, wherein the reference frame is indicated to not be displayed.

The subsequent encoding of the second layer was not coded using any inter-layer prediction flags; and

At least the frame blocks of the subsequently encoded frame are encoded based on the reference frame.

5. The encoding method according to claim 4, further comprising:

In the second layer, information used to identify the additional reference screen is encoded.

6. The encoding method according to any one of claims 4 to 5, wherein the reference frame and the other reference frame are time-aligned.

7. A decoder, comprising:

Components for reconstructing a reference frame based on an additional reference frame from a decoder frame buffer of the first layer of a scalable multi-layer stream and data decoded from the second layer of the scalable multi-layer stream, and storing the reconstructed reference frame in a decoder frame buffer of the second layer, wherein the reconstructed reference frame is indicated not to be displayed.

A component for decoding images indicating subsequent decoding of the second layer that do not use any inter-layer prediction flags; and

A component for reconstructing, at least based on, the frame blocks of the subsequently decoded frame, based on at least the reconstructed reference frame.

8. The decoder according to claim 7, further comprising:

A component used to decode information from the second layer for identifying the additional reference image.

9. The decoder according to any one of claims 7 to 8, wherein the reconstructed reference frame and the other reference frame are time-aligned.

10. An encoder comprising:

A component for encoding a reference screen of the decoder screen buffer of the second layer in the scalable multi-layer stream, based on an additional reference screen of the decoder screen buffer of the first layer, wherein the reference screen is indicated to not be displayed.

A component for encoding a frame indicating subsequent encoding of the second layer that does not use any inter-layer prediction flags; and

A component for encoding at least a block of the subsequently encoded frame based on the reference frame.

11. The encoder of claim 10, further comprising:

A component for encoding information used to identify the additional reference screen in the second layer.

12. The encoder according to any one of claims 10 to 11, wherein the reference frame and the additional reference frame are time-aligned.

13. A method for transmitting a stream, comprising:

Encoded data of a reference frame of the decoder frame buffer of the second layer of the scalable multi-layer stream is transmitted in the second layer. The reference frame is encoded based on an additional reference frame of the decoder frame buffer of the first layer of the scalable multi-layer stream, wherein the reference frame is indicated to not be displayed.

The transmission indicates that the subsequent encoded image of the second layer does not use any inter-layer prediction flags; and

Encoded data of frame blocks of the subsequently encoded frame are transmitted, the frame blocks being encoded at least according to the reference frame.

14. The method of transporting a stream according to claim 13, further comprising transmitting information for identifying the additional reference frame in the second layer.

15. The method of transporting a stream according to any one of claims 13 to 14, wherein the reference frame and the additional reference frame are time-aligned.