HK1229105B - Methods and apparatus for incorporating video usability information within a multi-view video coding system - Google Patents
Methods and apparatus for incorporating video usability information within a multi-view video coding system Download PDFInfo
- Publication number
- HK1229105B HK1229105B HK17102533.1A HK17102533A HK1229105B HK 1229105 B HK1229105 B HK 1229105B HK 17102533 A HK17102533 A HK 17102533A HK 1229105 B HK1229105 B HK 1229105B
- Authority
- HK
- Hong Kong
- Prior art keywords
- view
- bitstream
- equal
- information
- max
- Prior art date
Links
Description
本申请是2008年9月16日提交的申请号为200880119540.4的发明专利申请“将视频可用性信息(VUI)并入多视图视频(MVC)编码系统的方法和装置”的分案申请。This application is a divisional application of the invention patent application No. 200880119540.4 filed on September 16, 2008, entitled "Method and Apparatus for Incorporating Video Usability Information (VUI) into a Multi-View Video (MVC) Coding System."
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
该申请要求2007年10月5日提交的美国临时申请序列号60/977,709的权益,通过引用将其全部内容并入于此。此外,该申请与共同受让的、通过引用并入于此、并且与该申请同时提交的、题为“METHODS AND APPARATUS FOR INCORPORATIONG VIDEO USABILITY(VUI)WITHIN A MULTI-VIEW VIDEO(MVC)CODING SYSTEM”代理人案号PU080155的非临时申请有关,其也要求2007年10月5日提交的美国临时申请序列号60/977,709的权益。This application claims the benefit of U.S. Provisional Application Serial No. 60/977,709, filed October 5, 2007, which is hereby incorporated by reference in its entirety. In addition, this application is related to a commonly assigned non-provisional application, attorney docket No. PU080155, entitled “METHODS AND APPARATUS FOR INCORPORATIONG VIDEO USABILITY (VUI) WITHIN A MULTI-VIEW VIDEO (MVC) CODING SYSTEM,” filed concurrently with this application, which also claims the benefit of U.S. Provisional Application Serial No. 60/977,709, filed October 5, 2007.
技术领域Technical Field
本原理一般涉及视频编码和解码,并且更具体地涉及用于将视频可用性信息(VUI)并入多视图视频编码(MVC)的方法和装置。The present principles relate generally to video encoding and decoding, and more particularly to methods and apparatus for incorporating video usability information (VUI) into multi-view video coding (MVC).
背景技术Background Art
国际标准化组织/国际电工委员会(ISO/IEC)运动画面专家组-4(MPEG-4)第10部分高级视频编码(AVC)标准/国际电信联盟电信分部(ITU-T)H.264建议(下文中称为“MPEG-4AVC标准”)规定了序列参数集的视频可用性信息(VUI)参数的语法和语义。视频可用性信息包括以下信息:高宽比、过扫描(over-scanning)、视频信号类型、色度位置、定时、网络抽象层(NAL)假定参考解码器(HRD)参数、视频编码层(VCL)假定参考解码器参数、比特流限制等等。视频可用性信息提供对应的比特流的额外信息,以允许用户的更宽范围的应用。例如,在比特流限制信息中,视频可用性信息规定:(1)运动是否超过画面边界;(2)每个画面的最大字节; (3)每个宏块的最大比特;(4)最大运动向量长度(水平的和垂直的);(5)重新排序帧的数目;以及(6)最大的解码帧缓冲器大小。当解码器看到该信息时,替代使用“级”信息来设置解码要求(其通常比比特流实际要求的解码要求要高),解码器可以基于更紧的界限(tighter limits)来定制其解码操作。The International Organization for Standardization/International Electrotechnical Commission (ISO/IEC) Moving Picture Experts Group-4 (MPEG-4) Part 10 Advanced Video Coding (AVC) standard/International Telecommunication Union-Telecommunication Sector (ITU-T) H.264 Recommendation (hereinafter referred to as the "MPEG-4 AVC standard") specifies the syntax and semantics of the Video Usability Information (VUI) parameter of the sequence parameter set. The VUI includes the following information: aspect ratio, over-scanning, video signal type, chroma position, timing, Network Abstraction Layer (NAL) Hypothetical Reference Decoder (HRD) parameters, Video Coding Layer (VCL) Hypothetical Reference Decoder parameters, bitstream constraints, etc. The VUI provides additional information of the corresponding bitstream to allow a wider range of user applications. For example, in the bitstream constraint information, the VUI specifies: (1) whether motion exceeds the picture boundary; (2) the maximum bytes per picture; (3) the maximum bits per macroblock; (4) the maximum motion vector length (horizontal and vertical); (5) the number of reordered frames; and (6) the maximum decoded frame buffer size. When a decoder sees this information, instead of using the "level" information to set decoding requirements (which are typically higher than what the bitstream actually requires), the decoder can tailor its decoding operations based on tighter limits.
多视图视频编码(MVC)是对MPEG-4AVC标准的扩展。在多视图视频编码中,可以通过使用各视图之间的相关性来编码多视图的视频图像。在所有视图中,一个视图是基本视图,其是兼容MPEG-4AVC标准的,并且不能从其它视图预测。其它视图被称为非基本视图。可以从基本视图和其它非基本视图中预测性地编码非标准视图。可以在时间上对每个视图进行子采样。可以通过temporal_id语法元素来标识视图的时间子集。视图的时间级是视频信号的一个表示。在多视图视频的编码的比特流中,存在视图和时间级的不同组合。每个组合被称为操作点。可以从比特流中提取与各操作点相对应的子比特流。Multi-view video coding (MVC) is an extension of the MPEG-4 AVC standard. In multi-view video coding, multi-view video images can be encoded by using the correlation between the views. Among all views, one view is a base view, which is compatible with the MPEG-4 AVC standard and cannot be predicted from other views. Other views are called non-base views. Non-standard views can be predictively encoded from the base view and other non-base views. Each view can be subsampled in time. The temporal subset of the view can be identified by the temporal_id syntax element. The temporal level of the view is a representation of the video signal. In the encoded bitstream of the multi-view video, there are different combinations of views and temporal levels. Each combination is called an operation point. The sub-bitstream corresponding to each operation point can be extracted from the bitstream.
发明内容Summary of the Invention
通过本原理解决现有技术的这些和其它缺陷和缺点,本原理针对用于将视频可用性信息(VUI)并入多视图视频编码(MVC)的方法和装置。These and other deficiencies and shortcomings of the prior art are addressed by the present principles, which are directed to methods and apparatus for incorporating video usability information (VUI) into multi-view video coding (MVC).
根据本原理的一方面,提供了一种装置。该装置包括编码器,其用于通过对于各个视图、视图中的各个时间级、以及各个操作点中的至少一项指定视频可用性信息来编码多视图视频内容。According to an aspect of the present principles, there is provided an apparatus comprising an encoder for encoding multi-view video content by specifying video usability information for at least one of respective views, respective temporal levels in a view, and respective operating points.
根据本原理的另一方面,提供了一种方法。该方法包括通过对于各个视图、视图中的各个时间级、以及各个操作点中的至少一项指定视频可用性信息来编码多视图视频内容。According to another aspect of the present principles, there is provided a method comprising encoding multi-view video content by specifying video usability information for at least one of respective views, respective temporal levels within a view, and respective operating points.
根据本原理的另一方面,提供了一种装置。该装置包括解码器,其用于通过对于各个视图、视图中的各个时间级、以及各个操作点中的至少一项指定视频可用性信息来解码多视图视频内容。According to another aspect of the present principles, there is provided an apparatus comprising a decoder for decoding multi-view video content by specifying video usability information for at least one of respective views, respective temporal levels in a view, and respective operating points.
根据本原理的另一方面,提供了一种方法。该方法包括通过对于各个视图、视图中的各个时间级、以及各个操作点中的至少一项指定视频可用性信息来解码多视图视频内容。According to another aspect of the present principles, there is provided a method comprising decoding multi-view video content by specifying video usability information for at least one of respective views, respective temporal levels within a view, and respective operating points.
根据本原理的另一方面,提供一种编码方法,包括:通过为多视图视频编码中的各个操作点指定比特流限制信息,来编码多视图视频内容,其中操 作点对应于视图和时间级的不同组合中的一个组合。According to another aspect of the present principles, there is provided a coding method comprising: encoding multi-view video content by specifying bitstream limitation information for respective operation points in multi-view video coding, wherein the operation point corresponds to one of different combinations of view and temporal level.
根据本原理的另一方面,提供一种解码方法,包括:通过解码为多视图视频编码中的各个操作点指定的比特流限制信息,来解码多视图视频内容,其中操作点对应于视图和时间级的不同组合中的一个组合。According to another aspect of the present principles, there is provided a decoding method, comprising decoding multi-view video content by decoding bitstream restriction information specified for respective operation points in multi-view video encoding, wherein the operation point corresponds to one of different combinations of view and temporal level.
根据本原理的另一方面,提供一种解码装置,包括:解码器,用于通过解码为多视图视频编码中的各个操作点指定的比特流限制信息,来解码多视图视频内容,其中操作点对应于视图和时间级的不同组合中的一个组合。According to another aspect of the present principles, there is provided a decoding apparatus comprising: a decoder for decoding multi-view video content by decoding bitstream restriction information specified for respective operation points in multi-view video coding, wherein the operation point corresponds to one of different combinations of view and temporal level.
根据本原理的另一方面,提供一种传输视频信号的装置,包括:用于通过为多视图视频编码中的各个操作点指定比特流限制信息,来传输多视图视频内容的部件,其中操作点对应于视图和时间级的不同组合中的一个组合。According to another aspect of the present principles, there is provided an apparatus for transmitting a video signal, comprising: means for transmitting multi-view video content by specifying bitstream limitation information for respective operation points in multi-view video coding, wherein the operation point corresponds to one of different combinations of view and temporal level.
根据本原理的另一方面,提供一种解码方法,包括:通过解码为多视图视频编码的视图中的每个时间级别指定的比特流限制信息,来解码多视图视频内容,其中所述比特流限制信息指示以下中的一个或多个:According to another aspect of the present principles, there is provided a decoding method, comprising: decoding multi-view video content by decoding bitstream restriction information specified for each temporal level in a view of multi-view video encoding, wherein the bitstream restriction information indicates one or more of the following:
(1)运动向量是否超过画面边界;(1) Whether the motion vector exceeds the frame boundary;
(2)每个画面的最大字节;(2) Maximum bytes per frame;
(3)每个宏块的最大比特;(3) Maximum bits per macroblock;
(4)最大水平和垂直运动向量长度;(4) Maximum horizontal and vertical motion vector lengths;
(5)重新排序帧的数目;以及(5) the number of reordered frames; and
(6)最大的解码帧缓冲器大小。(6) Maximum decoded frame buffer size.
本原理的这些和其它方面、特征和优点将从要结合附图阅读的示例实施例的以下详细描述中变得明显。These and other aspects, features and advantages of the present principles will become apparent from the following detailed description of example embodiments, which is to be read in connection with the accompanying drawings.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
依据以下示例图,可以更好地理解本原理,其中:This principle can be better understood with reference to the following example diagram, where:
图1是依据本原理的实施例的、可以应用本原理的示例的多视图视频编码(MVC)编码器的框图;1 is a block diagram of an example multi-view video coding (MVC) encoder to which the present principles may be applied, in accordance with an embodiment of the present principles;
图2是依据本原理的实施例的、可以应用本原理的示例的多视图视频编码(MVC)解码器的框图;FIG2 is a block diagram of an example Multi-View Video Coding (MVC) decoder to which the present principles may be applied, in accordance with an embodiment of the present principles;
图3是依据本原理的实施例的、使用mvc_vui_parameters_extension()语法元素来编码每个视图的比特流限制参数的示例方法的流程图;3 is a flowchart of an example method for encoding per-view bitstream restriction parameters using the mvc_vui_parameters_extension() syntax element, in accordance with an embodiment of the present principles;
图4是依据本原理的实施例的、使用mvc_vui_parameters_extension()语法 元素来解码每个视图的比特流限制参数的示例方法的流程图;FIG4 is a flowchart of an example method for decoding per-view bitstream restriction parameters using the mvc_vui_parameters_extension() syntax element, in accordance with an embodiment of the present principles;
图5是依据本原理的实施例的、使用mvc_vui_parameters_extension()语法元素来编码每个视图中的每个时间级的比特流限制参数的示例方法的流程图;5 is a flowchart of an example method for encoding bitstream restriction parameters for each temporal level in each view using the mvc_vui_parameters_extension() syntax element, in accordance with an embodiment of the present principles;
图6是依据本原理的实施例的、使用mvc_vui_parameters_extension()语法元素来解码每个视图中的每个时间级的比特流限制参数的示例方法的流程图;6 is a flowchart of an example method for decoding bitstream restriction parameters for each temporal level in each view using the mvc_vui_parameters_extension() syntax element, in accordance with an embodiment of the present principles;
图7是依据本原理的实施例的、使用view_scalability_parameters_extension()语法元素来编码每个操作点的比特流限制参数的示例方法的流程图;以及FIG7 is a flowchart of an example method for encoding bitstream restriction parameters for each operation point using the view_scalability_parameters_extension() syntax element, in accordance with an embodiment of the present principles; and
图8是依据本原理的实施例的、使用view_scalability_parameters_extension()语法元素来解码每个操作点的比特流限制参数的示例方法的流程图。8 is a flowchart of an example method for decoding bitstream restriction parameters for each operation point using the view_scalability_parameters_extension() syntax element, in accordance with an embodiment of the present principles.
具体实施方式DETAILED DESCRIPTION
本原理针对用于将视频可用性信息(VUI)并入多视图视频编码(MVC)的方法和装置。The present principles are directed to methods and apparatus for incorporating video usability information (VUI) into multi-view video coding (MVC).
本说明书例示了本原理。因此,将理解:本领域技术人员将能够设计尽管在此没有清楚地描述或示出、但是实现本原理并包括在其精神和范围之内的各种布置。This description illustrates the present principles. Thus, it will be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the present principles and are included within its spirit and scope.
在此叙述的所有示例和条件性语言意欲用于教导的目的以便帮助读者理解由本发明人贡献以促进本领域技术的本原理以及构思,并且应当被解释为不限制这种具体叙述的示例和条件。All examples and conditional language recited herein are intended for teaching purposes to aid the reader in understanding the present principles and concepts contributed by the inventors to advance the art, and should not be construed as limiting such specifically recited examples and conditions.
另外,在这里叙述本原理的原理、方面和实施例及其特定示例的所有陈述意欲包括其结构上的和功能上的等效物。另外,意图是:这样的等效物包括当前已知的等效物以及将来开发的等效物二者,即所开发的执行相同功能的任何元件,而不论其结构如何。In addition, all statements herein reciting principles, aspects, and embodiments of the present principles, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents and equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
因此,例如,本领域技术人员将认识到:在此呈现的框图表示体现本原理的说明性电路的概念性视图。类似地,将认识到:任何流程图示(flow chart)、流程图(flowdiagram)、状态转换图、伪代码等表示实质上可以表示在计算机可读介质中并因此由计算机或处理器执行的各种处理,而不管是否明确地示 出这样的计算机或处理器。Thus, for example, those skilled in the art will recognize that the block diagrams presented herein represent conceptual views of illustrative circuitry embodying the present principles. Similarly, it will be recognized that any flow charts, flow diagrams, state transition diagrams, pseudo-code, and the like may substantially represent various processes that are embodied in a computer-readable medium and thereby executed by a computer or processor, whether or not such computer or processor is explicitly shown.
可以通过使用专用硬件以及能够与适当的软件相关联地执行软件的硬件来提供图中示出的各种元件的功能。当利用处理器来提供所述功能时,可以利用单个专用处理器、利用单个共享处理器、或者利用其中一些可被共享的多个独立处理器来提供所述功能。另外,术语“处理器”或“控制器”的明确使用不应当被解释为排他性地指代能够执行软件的硬件,而是可以隐含地无限制地包括数字信号处理器(“DSP”)硬件、用于存储软件的只读存储器(“ROM”)、随机存取存储器(“RAM”)、和非易失性存储器。The functions of the various elements shown in the figures can be provided by using dedicated hardware and hardware that can execute software in association with appropriate software. When a processor is utilized to provide the functions, the functions can be provided by a single dedicated processor, by a single shared processor, or by a plurality of independent processors some of which can be shared. In addition, the explicit use of the terms "processor" or "controller" should not be interpreted as referring exclusively to hardware that can execute software, but rather can implicitly include, without limitation, digital signal processor ("DSP") hardware, read-only memory ("ROM") for storing software, random access memory ("RAM"), and non-volatile memory.
还可以包括其它传统的和/或定制的硬件。类似地,图中示出的任何开关只是概念性的。它们的功能可以通过程序逻辑的操作、通过专用逻辑、通过程序控制和专用逻辑的交互、或者甚至手动地来执行,如从上下文更具体地理解的,可以由实施者选择具体技术。Other conventional and/or custom hardware may also be included. Similarly, any switches shown in the figures are conceptual only. Their functions may be performed through the operation of program logic, through dedicated logic, through the interaction of program control and dedicated logic, or even manually, as will be more specifically understood from the context, and the specific technique may be selected by the implementer.
在其权利要求中,被表示为用于执行指定功能的部件的任何元件意欲包含执行那个功能的任何方式,例如包括:a)执行那个功能的电路元件的组合或者b)与适当电路相组合的任何形式的软件,所述软件因此包括固件或微代码等,所述适当电路用于执行该软件以执行所述功能。由这种权利要求限定的本原理在于如下事实,即,以权利要求所要求的方式将由各种所叙述的部件提供的功能组合和集合到一起。因此,认为可以提供那些功能的任何部件与在此示出的那些部件等效。In its claims, any element represented as a means for performing a specified function is intended to encompass any means of performing that function, including, for example, a) a combination of circuit elements that perform that function or b) any form of software, such as firmware or microcode, combined with appropriate circuitry for executing the software to perform the function. The present principle defined by such claims resides in the fact that the functionality provided by the various recited components is combined and brought together in the manner required by the claims. Therefore, any component that can provide those functions is considered equivalent to those shown herein.
在本说明书中提到的本原理的“一个实施例”或“实施例”是指结合所述实施例描述的特定特征、结构、特性等被包括在本原理的至少一个实施例中。因此,在说明书各处出现的短语“在一个实施例中”和“在实施例中”的出现不必都指代相同的实施例。Reference throughout this specification to "one embodiment" or "an embodiment" of the present principles means that a particular feature, structure, characteristic, etc. described in connection with the embodiment is included in at least one embodiment of the present principles. Thus, the appearances of the phrases "in one embodiment" and "in an embodiment" in various places in the specification are not necessarily all referring to the same embodiment.
应当认识到,术语“和/或”和“至少一个”的使用(例如在“A和/或B”和“A和B中的至少一个”的情况下)意欲包括仅选择第一个列出的选项(A)、仅选择第二个列出的选项(B)、或者选择两个选项(A和B)。作为另一示例,在“A、B和/或C”和“A、B和C中的至少一个”的情况下,这种措辞意欲包括仅选择第一个列出的选项(A)、或者仅选择第二个列出的选项(B)、或者仅选择第三个列出的选项(C)、或者仅选择第一个和第二个列出的选项(A和B)、或者仅选择第一个和第三个列出的选项(A和C)、或者仅选择第二个和第三个列出的选项(B和C)、或者选择全部三个选项(A和B和C)。如本领域和相关领域普 通技术人员容易认识到的,这可以被扩展用于很多列出的项目。It should be appreciated that the use of the terms "and/or" and "at least one of" (e.g., in the context of "A and/or B" and "at least one of A and B") is intended to include selecting only the first listed option (A), only the second listed option (B), or both options (A and B). As another example, in the context of "A, B, and/or C" and "at least one of A, B, and C," such wording is intended to include selecting only the first listed option (A), or only the second listed option (B), or only the third listed option (C), or only the first and second listed options (A and B), or only the first and third listed options (A and C), or only the second and third listed options (B and C), or all three options (A, B, and C). As will be readily appreciated by those of ordinary skill in this and related arts, this can be extended to many listed items.
多视图视频编码(MVC)是用于编码多视图序列的压缩架构。多视图视频编码(MVC)序列是从不同视点捕获同一场景的一组两个或更多个视频序列。Multi-view video coding (MVC) is a compression framework for encoding multi-view sequences. A multi-view video coding (MVC) sequence is a set of two or more video sequences that capture the same scene from different viewpoints.
如在这里可互换地使用的,“交叉视图(cross-view)”和“视图间(inter-view)”两者都指代属于除了当前视图之外的视图的画面。As used interchangeably herein, "cross-view" and "inter-view" both refer to pictures belonging to a view other than the current view.
此外,如在此使用的,“高级语法”指代在分层级地驻留在宏块层上的比特流中出现的语法。例如,高级语法(如在此使用的)可以指代但不限于码片报头级、补充增强信息(SEI)级、画面参数集(PPS)级、序列参数集(SPS)级和网络抽象层(NAL)单元报头级处的语法。Furthermore, as used herein, "high-level syntax" refers to syntax that appears in a bitstream that resides hierarchically at the macroblock layer. For example, high-level syntax (as used herein) may refer to, but is not limited to, syntax at the slice header level, the supplemental enhancement information (SEI) level, the picture parameter set (PPS) level, the sequence parameter set (SPS) level, and the network abstraction layer (NAL) unit header level.
而且,应当认识到,尽管在此关于MPEG-4AVC标准的多视图视频编码扩展、为了例示性目的描述了本原理的一个或多个实施例,但是本原理不仅限于该扩展和/或该标准,并且因此可以关于其它视频编码标准、建议和其扩展而利用本原理,同时维持本原理的精神。Furthermore, it should be appreciated that although one or more embodiments of the present principles are described herein with respect to the multi-view video coding extension of the MPEG-4 AVC standard for illustrative purposes, the present principles are not limited to that extension and/or that standard and, thus, the present principles may be utilized with respect to other video coding standards, proposals, and extensions thereof while maintaining the spirit of the present principles.
另外,应当认识到,尽管在此关于比特流限制信息、为了例示性目的描述了本原理的一个或多个实施例,但是本原理不仅限于使用作为一种类型的视频可用性信息的比特流限制信息,并且因此根据本原理还可以使用关于多视图视频编码可以扩展使用的其它类型的视频可用性信息,同时维持本原理的精神。Additionally, it should be appreciated that although one or more embodiments of the present principles are described herein with respect to bitstream restriction information for illustrative purposes, the present principles are not limited to the use of bitstream restriction information as a type of video availability information, and thus other types of video availability information that may be extended for use with respect to multi-view video coding may also be used in accordance with the present principles while maintaining the spirit of the present principles.
转到图1,通过参考标号100总地指示示例的多视图视频编码(MVC)编码器。编码器100包括具有与变换器110的输入以信号通信地连接的输出的组合器105。变换器110的输出与量化器115的输入以信号通信地连接。量化器115的输出与熵编码器120的输入以及逆量化器125的输入以信号通信地连接。逆量化器125的输出与逆变换器130的输入以信号通信地连接。逆变换器130的输出与组合器135的第一非反相输入以信号通信地连接。组合器135的输出与帧内预测器145的输入和去块(deblocking)滤波器150的输入以信号通信地连接。去块滤波器150的输出与(视图i的)参考画面存储部件155的输入以信号通信地连接。参考画面存储部件155的输出与运动补偿器175的第一输入以及运动估计器180的第一输入以信号通信地连接。运动估计器180的输出与运动补偿器175的第二输入以信号通信地连接。1 , an exemplary multi-view video coding (MVC) encoder is generally indicated by the reference numeral 100. Encoder 100 includes a combiner 105 having an output connected in signal communication with an input of a transformer 110. The output of transformer 110 is connected in signal communication with an input of a quantizer 115. The output of quantizer 115 is connected in signal communication with an input of an entropy encoder 120 and an input of an inverse quantizer 125. The output of inverse quantizer 125 is connected in signal communication with an input of an inverse transformer 130. The output of inverse transformer 130 is connected in signal communication with a first non-inverting input of combiner 135. The output of combiner 135 is connected in signal communication with an input of an intra predictor 145 and an input of a deblocking filter 150. The output of deblocking filter 150 is connected in signal communication with an input of a reference picture storage (of view i) 155. An output of the reference picture storage component 155 is connected in signal communication with a first input of a motion compensator 175 and a first input of a motion estimator 180. An output of the motion estimator 180 is connected in signal communication with a second input of the motion compensator 175.
(其它视图的)参考画面存储部件160的输出与视差/照度估计器170的 第一输入以及视差/照度补偿器165的第一输入以信号通信地连接。视差/照度估计器170的输出与视差/照度补偿器165的第二输入以信号通信地连接。An output of the reference picture storage (of other views) 160 is connected in signal communication with a first input of a disparity/illumination estimator 170 and a first input of a disparity/illumination compensator 165. An output of the disparity/illumination estimator 170 is connected in signal communication with a second input of the disparity/illumination compensator 165.
熵解码器120的输出可用作编码器100的输出。组合器105的非反相输入可用作编码器100的输入,并且与视差/照度估计器170的第二输入以及运动估计器180的第二输入以信号通信地连接。开关185的输出与组合器135的第二非反相输入以及组合器105的反相输入以信号通信地连接。开关185包括与运动补偿器175的输出以信号通信地连接的第一输入、与视差/照度补偿器165的输出以信号通信地连接的第二输入、以及与帧内预测器145的输出以信号通信地连接的第三输入。The output of the entropy decoder 120 is available as an output of the encoder 100. A non-inverting input of the combiner 105 is available as an input of the encoder 100 and is connected in signal communication with a second input of the disparity/illumination estimator 170 and a second input of the motion estimator 180. An output of the switch 185 is connected in signal communication with a second non-inverting input of the combiner 135 and an inverting input of the combiner 105. The switch 185 includes a first input connected in signal communication with an output of the motion compensator 175, a second input connected in signal communication with an output of the disparity/illumination compensator 165, and a third input connected in signal communication with an output of the intra predictor 145.
模式判定模块140具有连接到开关185的输出,用于控制通过开关185选择哪个输入。The mode decision module 140 has an output connected to the switch 185 for controlling which input is selected by the switch 185 .
转到图2,通过附图标记200总地指示示例的多视图视频编码(MVC)解码器。解码器200包括熵解码器205,其具有与逆量化器210的输入以信号通信地连接的输出。逆量化器的输出与逆变换器215的输入以信号通信地连接。逆变换器215的输出与组合器220的第一非反相输入以信号通信地连接。组合器220的输出与去块滤波器225的输入和帧内预测器230的输入以信号通信地连接。去块滤波器225的输出与(视图i的)参考画面存储部件240的输入以信号通信地连接。参考画面存储部件240的输出与运动补偿器235的第一输入以信号通信地连接。2 , an exemplary multi-view video coding (MVC) decoder is generally indicated by the reference numeral 200. The decoder 200 includes an entropy decoder 205 having an output connected in signal communication with an input of an inverse quantizer 210. The output of the inverse quantizer is connected in signal communication with an input of an inverse transformer 215. The output of the inverse transformer 215 is connected in signal communication with a first non-inverting input of a combiner 220. The output of the combiner 220 is connected in signal communication with an input of a deblocking filter 225 and an input of an intra-frame predictor 230. The output of the deblocking filter 225 is connected in signal communication with an input of a reference picture storage component 240 (for view i). The output of the reference picture storage component 240 is connected in signal communication with a first input of a motion compensator 235.
(其它视图的)参考画面存储部件245的输出与视差/照度补偿器250的第一输入以信号通信地连接。An output of the reference picture storage (of other views) 245 is connected in signal communication with a first input of the disparity/illumination compensator 250 .
熵解码器205的输入可用作对解码器200的输入,用于接收残余比特流。此外,模式模块260的输入也可用作对解码器200的输入,用于接收控制语法以控制通过开关255选择哪个输入。此外,运动补偿器235的第二输入可用作解码器200的输入,用于接收运动向量。此外,视差/照度补偿器250的第二输入可用作对解码器200的输入,用于接收视差向量和照度补偿语法。An input to the entropy decoder 205 is available as an input to the decoder 200 for receiving the residual bitstream. Furthermore, an input to the mode module 260 is also available as an input to the decoder 200 for receiving control syntax to control which input is selected via the switch 255. Furthermore, a second input to the motion compensator 235 is available as an input to the decoder 200 for receiving motion vectors. Furthermore, a second input to the disparity/illumination compensator 250 is available as an input to the decoder 200 for receiving disparity vectors and illumination compensation syntax.
开关255的输出与组合器220的第二非反相输入以信号通信地连接。开关255的第一输入与视差/照度补偿器250的输出以信号通信地连接。开关255的第二输入与运动补偿器235的输出以信号通信地连接。开关255的第三输入与帧内预测器230的输出以信号通信地连接。模式模块260的输出与开关255以信号通信地连接,以控制通过开关255选择哪个输入。去块滤波器225 的输出可用作解码器的输出。An output of the switch 255 is connected in signal communication with a second non-inverting input of the combiner 220. A first input of the switch 255 is connected in signal communication with an output of the disparity/illumination compensator 250. A second input of the switch 255 is connected in signal communication with an output of the motion compensator 235. A third input of the switch 255 is connected in signal communication with an output of the intra predictor 230. An output of the mode module 260 is connected in signal communication with the switch 255 to control which input is selected by the switch 255. The output of the deblocking filter 225 is available as an output of the decoder.
在MPEG-4AVC标准中,序列参数集的语法和语义参数被指定用于视频可用性信息(VUI)。这表示可以被插入到比特流中以增强视频对于多种目的的可用性的附加信息。视频可用性信息包括以下信息:高宽比、过扫描、视频信号类型、色度位置、定时、网络抽象层(NAL)假定参考解码器(HRD)参数、视频编码层(VCL)假定参考解码器参数、比特流限制等等。In the MPEG-4 AVC standard, the syntax and semantic parameters of the sequence parameter set are specified for video usability information (VUI). This represents additional information that can be inserted into the bitstream to enhance the usability of the video for various purposes. VUI includes information such as aspect ratio, overscan, video signal type, chroma position, timing, network abstraction layer (NAL) hypothetical reference decoder (HRD) parameters, video coding layer (VCL) hypothetical reference decoder parameters, bitstream constraints, and more.
依据本原理的一个或多个实施例,我们将现有的视频可用性信息字段用于与现有技术相比新的和不同的目的,并且进一步将其使用扩展到多视图视频编码(MVC)。在我们的多视图视频编码机制中,扩展视频可用性信息,使得其可以在例如不同视图、视图中的不同时间级、或不同操作点之间是不同的。因此,依据实施例,我们根据以下项中的一项或多项(但不限于)来指定视频可用性信息:分别指定各个视图的视频可用性信息;分别指定视图中的各个时间级的视频可用性信息;以及分别指定各个操作点的视频可用性信息。According to one or more embodiments of the present principles, we use the existing video availability information field for a new and different purpose compared to the prior art, and further extend its use to multi-view video coding (MVC). In our multi-view video coding mechanism, the video availability information is extended so that it can be different, for example, between different views, different temporal levels within a view, or different operating points. Thus, according to an embodiment, we specify the video availability information according to one or more of the following (but not limited to): specifying the video availability information for each view separately; specifying the video availability information for each temporal level within a view separately; and specifying the video availability information for each operating point separately.
在MPEG-4AVC标准中,可以在序列参数集(SPS)中传送包括视频可用性信息(VUI)的集。依据实施例,我们将视频可用性信息的概念扩展到用于多视图视频编码(MVC)背景中。有利地,这允许为多视图视频编码中的不同的视图、视图中的不同时间级、或者不同操作点指定不同的视频可用性信息。在实施例中,我们提供新颖的方式来考虑、修改和使用多视图视频编码的视频可用性信息中的比特流限制信息。In the MPEG-4 AVC standard, a set including video usability information (VUI) can be transmitted in a sequence parameter set (SPS). According to an embodiment, we extend the concept of video usability information to be used in the context of multi-view video coding (MVC). Advantageously, this allows different video usability information to be specified for different views, different temporal levels within a view, or different operating points in multi-view video coding. In an embodiment, we provide novel ways to consider, modify, and use bitstream constraint information in the video usability information for multi-view video coding.
在作为sequence_parameter_set()的一部分的vui_parameters()语法元素中指定了MPEG-4AVC标准中的比特流限制信息。表1例示了vui_parameters()的MPEG-4AVC标准语法。The bitstream restriction information in the MPEG-4 AVC standard is specified in the vui_parameters() syntax element which is part of the sequence_parameter_set(). Table 1 illustrates the MPEG-4 AVC standard syntax of vui_parameters().
表1Table 1
比特流限制信息的语法元素的语义如下:The semantics of the syntax elements of bitstream constraint information are as follows:
bitstream_restriction_flag等于1指定:以下的编码后的视频序列比特流限制参数存在。bitstream_restriction_flag equal to 1 specifies that the following coded video sequence bitstream restriction parameters are present.
bitstream_restriction_flag等于0指定:以下的编码后的视频序列比特流限制参数不存在。bitstream_restriction_flag equal to 0 specifies that the following coded video sequence bitstream restriction parameters are not present.
motion_vectors_over_pic_boundaries_flag等于0指示:没有使用画面边界之外的样本以及部分样本位置处的样本(使用画面边界之外的一个或多个样本导出该样本的值)来对任何样本进行帧间预测。motion_vectors_over_pic_boundaries_flag equal to 0 indicates that samples outside the picture boundaries and samples at partial sample positions whose values are derived using one or more samples outside the picture boundaries are not used for inter prediction of any sample.
motion_vectors_over_pic_boundaries_flag等于1指示:可以在帧间预测中使用画面边界之外的一个或多个样本。motion_vectors_over_pic_boundaries_flag equal to 1 indicates that one or more samples outside picture boundaries may be used in inter prediction.
当motion_vectors_over_pic_boundaries_flag语法元素不存在时,motion_vectors_over_pic_boundaries_flag的值应当被推断为等于1。When the motion_vectors_over_pic_boundaries_flag syntax element is not present, the value of motion_vectors_over_pic_boundaries_flag shall be inferred to be equal to 1.
max_bytes_per_pic_denom指示与编码后的视频序列中的任何编码画面相关联的虚拟编码层(VCL)网络抽象层(NAL)单元的尺寸之和不超过的字节数目。max_bytes_per_pic_denom indicates the number of bytes that the sum of the sizes of Virtual Coding Layer (VCL) Network Abstraction Layer (NAL) units associated with any coded picture in the coded video sequence does not exceed.
为此目的,将表示网络抽象层单元流中的一画面的字节数目指定为该画面的虚拟编码层网络抽象层单元数据的总的字节数目,(即,虚拟编码层网络抽象层单元的NumBytesInNALunit变量的总数)。max_bytes_per_pic_denom的值应当在包括0和16的、0到16的范围内。For this purpose, the number of bytes representing a picture in the NAL unit stream is specified as the total number of bytes of the virtual coding layer NAL unit data for that picture (i.e., the sum of the NumBytesInNALunit variables of the virtual coding layer NAL unit). The value of max_bytes_per_pic_denom should be in the range of 0 to 16, inclusive.
取决于max_bytes_per_pic_denom,以下适用:Depending on max_bytes_per_pic_denom, the following applies:
-如果max_bytes_per_pic_denom等于0,则不指示界限。- If max_bytes_per_pic_denom is equal to 0, no limit is indicated.
-否则(max_bytes_per_pic_denom不等于0),则通过多于以下比特数目来在编码后的视频序列中表示未编码画面:Otherwise (max_bytes_per_pic_denom is not equal to 0), the uncoded picture is represented in the coded video sequence by more than the following number of bits:
(PicSizelnMbs*RawMbBits)÷(8*max_bytes_per_pic_denom)(PicSizelnMbs*RawMbBits)÷(8*max_bytes_per_pic_denom)
当max_bytes_per_pic_denom语法元素不存在时,max_bytes_per_pic_denom的值应当被推断为等于2。变量PicSizeInMbs是画面中宏块的数目。如在MPEG-4AVC标准的子款7.4.2.1中,导出了变量RawMbBits。When the max_bytes_per_pic_denom syntax element is not present, the value of max_bytes_per_pic_denom shall be inferred to be equal to 2. The variable PicSizeInMbs is the number of macroblocks in the picture. The variable RawMbBits is derived as in subclause 7.4.2.1 of the MPEG-4 AVC standard.
max_bits_per_mb_denom指示编码后的视频序列的任何画面中任何宏块的macroblock_layer()数据的编码比特的最大数目。max_bits_per_mb_denom的值应当在包括0和16的、0到16的范围中。max_bits_per_mb_denom indicates the maximum number of coded bits of macroblock_layer() data for any macroblock in any picture of the coded video sequence. The value of max_bits_per_mb_denom shall be in the range of 0 to 16, inclusive.
取决于max_bits_per_mb_denom,以下适用:Depending on max_bits_per_mb_denom, the following applies:
-如果max_bits_per_mb_denom等于0,则不规定界限。- If max_bits_per_mb_denom is equal to 0, no limit is specified.
-否则(max_bits_per_mb_denom不等于0),应当通过多于以下的比特数目来在比特流中表示未编码的macroblock_layer()。Otherwise (max_bits_per_mb_denom is not equal to 0), uncoded macroblock_layer() shall be represented in the bitstream by more than the following number of bits.
(128+RawMbBits)÷max_bits_per_mb_denom(128+RawMbBits)÷max_bits_per_mb_denom
取决于entropy_coding_mode_flag,如下地计数macroblock_layer()数据的比特:Depending on entropy_coding_mode_flag, the bits of macroblock_layer() data are counted as follows:
-如果entropy_coding_mode_flag等于0,则通过宏块的macroblock_layer()语法结构中的比特数目来给出macroblock_layer()数据的比特数目。- If entropy_coding_mode_flag is equal to 0, the number of bits of the macroblock_layer() data is given by the number of bits in the macroblock_layer() syntax structure of the macroblock.
-否则(entropy_coding_mode_flag等于1),则当解析与宏块相关联的macroblock_layer()时,通过在MPEG-4AVC标准的子款9.3.3.2.2和9.3.3.2.3中调用read_bits(1)的次数来给出该宏块的macroblock_layer()数据的比特数目。- Otherwise (entropy_coding_mode_flag is equal to 1), when parsing the macroblock_layer() associated with the macroblock, the number of bits of the macroblock_layer() data for the macroblock is given by the number of calls to read_bits(1) in subclauses 9.3.3.2.2 and 9.3.3.2.3 of the MPEG-4 AVC standard.
当max_bits_per_mb_denom不存在时,max_bits_per_mb_denom的值应当被推断为等于1。When max_bits_per_mb_denom is not present, the value of max_bits_per_mb_denom should be inferred to be equal to 1.
log2_max_mv_length_horizontal和log2_max_mv_length_vertical分别指示编码后的视频序列中所有画面的以1/4亮度样本为单位(1/4luma sample units)的解码后的水平和垂直运动向量分量的最大绝对值。值n宣告没有运动向量分量的值将超过以1/4亮度样本的位移为单位的从包括-2n和2n-1的、-2n到2n-1的范围。log2_max_mv_length_horizontal的值应当在包括0和16 的、0到16的范围中。log2_max_mv_length_vertical的值应当在包括0和16的、0到16的范围中。当log2_max_mv_length_horizontal不存在时,log2_max_mv_length_horizontal和log2_max_mv_length_vertical的值应当被推断为等于16。应注意,解码后的垂直或水平运动向量分量的最大绝对值也被如在MPEG-4AVC标准的附录A中规定的简档和级界限所限制。log2_max_mv_length_horizontal and log2_max_mv_length_vertical indicate the maximum absolute values of the decoded horizontal and vertical motion vector components, respectively, in 1/4 luma sample units, for all pictures in the coded video sequence. The value n declares that no motion vector component will exceed the range -2n to 2n - 1 , inclusive, in units of 1/4 luma sample displacement. The value of log2_max_mv_length_horizontal shall be in the range 0 to 16, inclusive. The value of log2_max_mv_length_vertical shall be in the range 0 to 16, inclusive. When log2_max_mv_length_horizontal is not present, the values of log2_max_mv_length_horizontal and log2_max_mv_length_vertical shall be inferred to be equal to 16. It should be noted that the maximum absolute value of a decoded vertical or horizontal motion vector component is also limited by the profile and level limits as specified in Annex A of the MPEG-4 AVC standard.
num_reorder_frames指示分别在解码顺序上领先于编码后的视频序列中的任何帧、补充字段对、或非成对的字段并且在输出顺序上跟在其后的帧、补充字段对、或非成对的字段的最大数目。num_reorder_frames的值应当在包括0和max_dec_fram_buffering的、0到max_dec_fram_buffering的范围中。当num_reorder_frames语法元素不存在时,num_reorder_frames的值应当被如下地推断:num_reorder_frames indicates the maximum number of frames, supplementary field pairs, or unpaired fields that precede in decoding order and follow in output order any frame, supplementary field pair, or unpaired field in the coded video sequence, respectively. The value of num_reorder_frames shall be in the range of 0 to max_dec_fram_buffering, inclusive. When the num_reorder_frames syntax element is not present, the value of num_reorder_frames shall be inferred as follows:
-如果profile_idc等于44、100、110、122或244,并且constraint_set3_flag等于1,则num_reorder_frames的值应当被推断为等于0。If profile_idc is equal to 44, 100, 110, 122, or 244, and constraint_set3_flag is equal to 1, then the value of num_reorder_frames shall be inferred to be equal to 0.
-否则(profile_idc不等于44、100、110、122或244,或者constraint_set3_flag等于0),num_reorder_frames的值应当被推断为等于max_dec_fram_bufferingMaxDpbSize。Otherwise (profile_idc is not equal to 44, 100, 110, 122 or 244, or constraint_set3_flag is equal to 0), the value of num_reorder_frames shall be inferred to be equal to max_dec_fram_bufferingMaxDpbSize.
max_dec_fram_buffering指定以帧缓冲器为单位的、假定参考解码器解码后的画面缓冲器(DPB)的所需尺寸。编码后的视频序列不应要求具有大于Max(1,max_dec_fram_buffering)个帧缓冲器的尺寸的解码后的画面缓冲器,以便使得解码画面的输出处于通过画面定时补充增强信息(SEI)消息的dpb_output_delay指定的输出时间。max_dec_fram_buffering的值应当在包括num_ref_frames和MaxDpbSize(如在MPEG-4AVC标准的子款A.3.1或A.3.2中所规定的)的、num_ref_frames到MaxDpbSize的范围中。当max_dec_fram_buffering语法元素不存在时,max_dec_fram_buffering的值应当被如下地推断:max_dec_fram_buffering specifies the required size of the decoded picture buffer (DPB) of the assumed reference decoder in units of frame buffers. The encoded video sequence should not require a decoded picture buffer with a size larger than Max(1, max_dec_fram_buffering) frame buffers so that the output of the decoded pictures is at the output time specified by the dpb_output_delay of the picture timing supplementary enhancement information (SEI) message. The value of max_dec_fram_buffering should be in the range of num_ref_frames to MaxDpbSize, including num_ref_frames and MaxDpbSize (as specified in subclause A.3.1 or A.3.2 of the MPEG-4 AVC standard). When the max_dec_fram_buffering syntax element is not present, the value of max_dec_fram_buffering should be inferred as follows:
-如果profile_idc等于44或244,并且constraint_set3_flag等于1,则max_dec_fram_buffering的值应当被推断为等于0。If profile_idc is equal to 44 or 244, and constraint_set3_flag is equal to 1, then the value of max_dec_fram_buffering shall be inferred to be equal to 0.
-否则(profile_idc不等于44或244,或者constraint_set3_flag等于0),max_dec_frame_buffering的值应当被推断为等于MaxDpbSize。Otherwise (profile_idc is not equal to 44 or 244, or constraint_set3_flag is equal to 0), the value of max_dec_frame_buffering shall be inferred to be equal to MaxDpbSize.
在多视图视频编码中,比特流限制参数基于更紧的界限定制子流的解码操作。因此,应允许对于多视图视频编码比特流的每个可提取的子流指定比 特流限制参数。依据实施例,我们提出对于每个视图、视图中每个时间级、以及/或者每个操作点指定比特流限制信息。In multi-view video coding, bitstream constraint parameters customize the decoding operation of substreams based on tighter bounds. Therefore, it should be possible to specify bitstream constraint parameters for each extractable substream of a multi-view video coding bitstream. According to an embodiment, we propose to specify bitstream constraint information per view, per temporal level within a view, and/or per operation point.
对于每个视图指定比特流限制参数Specify bitstream restriction parameters for each view
可以对于每个视图指定比特流限制参数。我们提出mvc_vui_parameters_extension语法,其是subset_sequence_parameter_set的一部分。表2例示了mvc_vui_parameters_extension语法。Bitstream restriction parameters can be specified for each view. We propose the mvc_vui_parameters_extension syntax, which is part of subset_sequence_parameter_set. Table 2 illustrates the mvc_vui_parameters_extension syntax.
mvc_vui_parameters_extension()在与该subset_sequence_parameter_set相关联的所有视图上循环。在该循环内指定每个视图的view_id和每个视图的比特流限制参数。mvc_vui_parameters_extension() loops over all views associated with this subset_sequence_parameter_set, specifying the view_id of each view and the bitstream restriction parameters of each view within the loop.
表2Table 2
比特流限制语法元素的语义如下:The semantics of the bitstream constraint syntax element are as follows:
bitstream_restriction_flag[i]指定具有等于view_id的view_id[i]的视图的bitstream_restriction_flag的值。bitstream_restriction_flag[i] specifies the value of bitstream_restriction_flag for the view with view_id[i] equal to view_id.
motion_vectors_over_pic_boundaries_flag[i]指定具有等于view_id的view_id[i]的视图的motion_vectors_over_pic_boundaries_flag的值。当motion_vectors_over_pic_boundaries_flag[i]语法元素不存在时,具有等于view_id的view_id[i]的视图的motion_vectors_over_pic_boundaries_flag的值应当被推断为等于1。motion_vectors_over_pic_boundaries_flag[i] specifies the value of motion_vectors_over_pic_boundaries_flag for the view with view_id[i] equal to view_id. When the motion_vectors_over_pic_boundaries_flag[i] syntax element is not present, the value of motion_vectors_over_pic_boundaries_flag for the view with view_id[i] equal to view_id shall be inferred to be equal to 1.
max_bytes_per_pic_denom[i]指定具有等于view_id的view_id[i]的视图的max_bytes_per_pic_denom值。当max_bytes_per_pic_denom[i]语法元素不存在时,具有等于view_id的view_id[i]的视图的max_bytes_per_pic_denom的值应当被推断为等于2。max_bytes_per_pic_denom[i] specifies the value of max_bytes_per_pic_denom for the view with view_id[i] equal to view_id. When the max_bytes_per_pic_denom[i] syntax element is not present, the value of max_bytes_per_pic_denom for the view with view_id[i] equal to view_id shall be inferred to be equal to 2.
max_bits_per_mb_denom[i]指定具有等于view_id的view_id[i]的视图的max_bits_per_mb_denom值。当max_bits_per_mb_denom[i]不存在时,具有等于view_id的view_id[i]的视图的max_bits_per_mb_denom的值应当被推断为等于1。max_bits_per_mb_denom[i] specifies the max_bits_per_mb_denom value for the view with view_id[i] equal to view_id. When max_bits_per_mb_denom[i] is not present, the max_bits_per_mb_denom value for the view with view_id[i] equal to view_id shall be inferred to be equal to 1.
log2_max_mv_length_horizontal[i]和log2_max_mv_length_vertical[i]分别指定具有等于view_id的view_id[i]的视图的log2_max_mv_length_horizontal和log2_max_mv_length_vertical的值。当log2_max_mv_length_horizontal[i]不存在时,具有等于view_id的view_id[i]的视图的log2_max_mv_length_horizontal和log2_max_mv_length_vertical的值应当被推断为等于16。log2_max_mv_length_horizontal[i] and log2_max_mv_length_vertical[i] specify the values of log2_max_mv_length_horizontal and log2_max_mv_length_vertical, respectively, for the view with view_id[i] equal to view_id. When log2_max_mv_length_horizontal[i] is not present, the values of log2_max_mv_length_horizontal and log2_max_mv_length_vertical for the view with view_id[i] equal to view_id shall be inferred to be equal to 16.
num_reorder_frames[i]指定具有等于view_id的view_id[i]的视图的num_reorder_frames的值。num_reorder_frames[i]的值应当在包括0和max_dec_frame_buffering的、0到max_dec_frame_buffering的范围中。当num_reorder_frames[i]语法元素不存在时,具有等于view_id的view_id[i]的视图的num_reorder_frames的值应当被推断为等于max_dec_frame_buffering。num_reorder_frames[i] specifies the value of num_reorder_frames for the view with view_id[i] equal to view_id. The value of num_reorder_frames[i] should be in the range of 0 to max_dec_frame_buffering, inclusive. When the num_reorder_frames[i] syntax element is not present, the value of num_reorder_frames for the view with view_id[i] equal to view_id should be inferred to be equal to max_dec_frame_buffering.
max_dec_frame_buffering[i]指定具有等于view_id的view_id[i]的视图的max_dec_frame_buffering的值。max_dec_frame_buffering[i]的值应当在包括num_ref_frames[i]和MaxDpbSize(如在MPEG-4AVC标准中的子款A.3.1或A.3.2中所指定的)的、num_ref_frames[i]到MaxDpbSize的范围中。当max_dec_frame_buffering[i]语法元素不存在时,具有等于view_id的view_id[i]的视图的max_dec_frame_buffering的值应当被推断为等于MaxDpbSize。max_dec_frame_buffering[i] specifies the value of max_dec_frame_buffering for the view with view_id[i] equal to view_id. The value of max_dec_frame_buffering[i] shall be in the range of num_ref_frames[i] to MaxDpbSize, inclusive, of num_ref_frames[i] and MaxDpbSize (as specified in subclauses A.3.1 or A.3.2 in the MPEG-4 AVC Standard). When the max_dec_frame_buffering[i] syntax element is not present, the value of max_dec_frame_buffering for the view with view_id[i] equal to view_id shall be inferred to be equal to MaxDpbSize.
转到图3,通过附图标记300总地指示用于使用mvc_vui_parameters_extension()语法元素来编码每个视图的比特流限制参数的示例方法。3 , an example method for encoding per-view bitstream restriction parameters using the mvc_vui_parameters_extension() syntax element is indicated generally by the reference numeral 300 .
方法300包括开始块305,其将控制传递到功能块310。功能块310将变量M设置为等于视图的数目减一,并将控制传递到功能块315。功能块315将变量M写入到比特流,并将控制传递到功能块320。功能块320将变量i 设置为等于0,并将控制传递到功能块325。功能块325写入view_id[i]语法元素,并将控制传递到功能块330。功能块330写入bitstream_restriction_flag[i]语法元素,并将控制传递到判定块335。判定块335确定bitstream_restriction_flag[i]语法元素是否等于0。如果等于0,则将控制传递到判定块345。否则,将控制传递到功能块340。Method 300 includes a start block 305, which passes control to a function block 310. Function block 310 sets a variable M equal to the number of views minus one and passes control to a function block 315. Function block 315 writes the variable M to the bitstream and passes control to a function block 320. Function block 320 sets a variable i equal to 0 and passes control to a function block 325. Function block 325 writes the view_id[i] syntax element and passes control to a function block 330. Function block 330 writes the bitstream_restriction_flag[i] syntax element and passes control to a decision block 335. Decision block 335 determines whether the bitstream_restriction_flag[i] syntax element is equal to 0. If so, control is passed to a decision block 345. Otherwise, control is passed to a function block 340.
功能块340写入视图i的比特流限制参数,并将控制传递到判定块345。判定块345确定变量i是否等于变量M。如果等于,则将控制传递到结束块399。否则,将控制传递到功能块350。Function block 340 writes the bitstream restriction parameters for view i and passes control to decision block 345. Decision block 345 determines whether variable i is equal to variable M. If so, control is passed to end block 399. Otherwise, control is passed to function block 350.
功能块350将变量i设置为等于i加一,并且将控制返回到功能块325。Function block 350 sets the variable i equal to i plus one and returns control to function block 325 .
转到图4,通过参考标号400总地指示用于使用mvc_vui_parameters_extension()语法元素来解码每个视图的比特流限制参数的示例方法。4 , an example method for decoding per-view bitstream restriction parameters using the mvc_vui_parameters_extension() syntax element is indicated generally by the reference numeral 400 .
方法400包括开始块405,其将控制传递到功能块407。功能块407从比特流中读取变量M,并将控制传递到功能块410。功能块410将视图的数目设置为等于变量M加一,并将控制传递到功能块420。功能块420将变量i设置为等于0,并将控制传递到功能块425。功能块425读取view_id[i]语法元素,并将控制传递到功能块430。功能块430读取bitstream_restriction_flag[i]语法元素,并将控制传递到判定块435。判定块435确定bitstream_restriction_flag[i]语法元素是否等于0。如果等于0,则将控制传递到判定块445。否则,将控制传递到功能块440。Method 400 includes a start block 405, which passes control to a function block 407. Function block 407 reads a variable M from the bitstream and passes control to a function block 410. Function block 410 sets the number of views equal to the variable M plus one and passes control to a function block 420. Function block 420 sets a variable i equal to 0 and passes control to a function block 425. Function block 425 reads the view_id[i] syntax element and passes control to a function block 430. Function block 430 reads the bitstream_restriction_flag[i] syntax element and passes control to a decision block 435. Decision block 435 determines whether the bitstream_restriction_flag[i] syntax element is equal to 0. If so, control is passed to a decision block 445. Otherwise, control is passed to a function block 440.
功能块440读取视图i的比特流限制参数,并将控制传递到判定块445。判定块445确定变量i是否等于变量M。如果等于,则将控制传递到结束块499。否则,将控制传递到功能块450。Function block 440 reads the bitstream restriction parameters for view i and passes control to decision block 445. Decision block 445 determines whether variable i is equal to variable M. If so, control is passed to end block 499. Otherwise, control is passed to function block 450.
功能块450将变量i设置为等于i加一,并且将控制返回到功能块425。Function block 450 sets the variable i equal to i plus one and returns control to function block 425 .
对于每个视图的每个时间级指定比特流限制参数Specify bitstream constraint parameters for each temporal level of each view
可以对于每个视图的每个时间级指定比特流限制参数。我们提出作为subset_sequence_parameter_set的一部分的mvc_vui_parameters_extention语法。表3例示了mvc_vui_parameters_extention语法。The bitstream restriction parameters may be specified for each temporal level of each view. We propose the mvc_vui_parameters_extention syntax as part of subset_sequence_parameter_set. Table 3 illustrates the mvc_vui_parameters_extention syntax.
表3Table 3
比特流限制语法元素的语义如下:The semantics of the bitstream constraint syntax element are as follows:
bitstream_restriction_flag[i][j]指定具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的bitstream_restriction_flag的值。bitstream_restriction_flag[i][j] specifies the value of bitstream_restriction_flag for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id.
motion_vectors_over_pic_boundaries_flag[i][j]指定具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的motion_vectors_over_pic_boundaries_flag的值。当motion_vectors_over_pic_boundaries_flag[i]语法元素不存在时,具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的motion_vectors_over_pic_boundaries_flag的值应当被推断为等于1。motion_vectors_over_pic_boundaries_flag[i][j] specifies the value of motion_vectors_over_pic_boundaries_flag for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id. When the motion_vectors_over_pic_boundaries_flag[i] syntax element is not present, the value of motion_vectors_over_pic_boundaries_flag for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id shall be inferred to be equal to 1.
max_bytes_per_pic_denom[i][j]指定具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的max_bytes_per_pic_denom的值。当max_bytes_per_pic_denom[i]语法元素不存在时,具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的max_bytes_per_pic_denom的值应当被推断为等于2。max_bytes_per_pic_denom[i][j] specifies the value of max_bytes_per_pic_denom for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id. When the max_bytes_per_pic_denom[i] syntax element is not present, the value of max_bytes_per_pic_denom for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id shall be inferred to be equal to 2.
max_bits_per_mb_denom[i][j]指定具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的max_bits_per_mb_denom的值。当max_bits_per_mb_denom[i]不存在时,具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的max_bits_per_mb_denom的值应当被推断为等于1。max_bits_per_mb_denom[i][j] specifies the value of max_bits_per_mb_denom for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id. When max_bits_per_mb_denom[i] is not present, the value of max_bits_per_mb_denom for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id shall be inferred to be equal to 1.
log2_max_mv_length_horizontal[i][j]和log2_max_mv_length_vertical[i][j]分别指定具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的log2_max_mv_length_horizontal和log2_max_mv_length_vertical的值。当log2_max_mv_length_horizontal[i]不存在时,具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的log2_max_mv_length_horizontal和log2_max_mv_length_vertical的值应当被推断为等于16。log2_max_mv_length_horizontal[i][j] and log2_max_mv_length_vertical[i][j] specify the values of log2_max_mv_length_horizontal and log2_max_mv_length_vertical, respectively, for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id. When log2_max_mv_length_horizontal[i][j] is not present, the values of log2_max_mv_length_horizontal and log2_max_mv_length_vertical for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id shall be inferred to be equal to 16.
num_reorder_frames[i][j]指定具有等于view_id的view_id[i]的视图中的、 具有等于temporal_id的temporal_id[i][j]的时间级的num_reorder_frames的值。num_reorder_frames[i]的值应当在包括0和max_dec_frame_buffering的、0到max_dec_frame_buffering的范围中。当num_reorder_frames[i]语法元素不存在时,具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的num_reorder_frames的值应当被推断为等于max_dec_frame_buffering。num_reorder_frames[i][j] specifies the value of num_reorder_frames for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id. The value of num_reorder_frames[i] shall be in the range of 0 to max_dec_frame_buffering, inclusive. When the num_reorder_frames[i] syntax element is not present, the value of num_reorder_frames for the temporal level with temporal_id[i][j] equal to temporal_id in the view with view_id[i] equal to view_id shall be inferred to be equal to max_dec_frame_buffering.
max_dec_frame_buffering[i][j]指定具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的max_dec_frame_buffering的值。max_dec_frame_buffering[i]的值应当在包括num_ref_frames[i]和MaxDpbSize(如在MPEG-4AVC标准中的子款A.3.1或A.3.2中所指定的)的、num_ref_frames[i]到MaxDpbSize的范围中。当max_dec_frame_buffering[i]语法元素不存在时,具有等于view_id的view_id[i]的视图中的、具有等于temporal_id的temporal_id[i][j]的时间级的max_dec_frame_buffering的值应当被推断为等于MaxDpbSize。max_dec_frame_buffering[i][j] specifies the value of max_dec_frame_buffering for the temporal level with temporal_id[i][j] equal to temporal_id, in the view with view_id[i] equal to view_id. The value of max_dec_frame_buffering[i] shall be in the range of num_ref_frames[i] to MaxDpbSize, inclusive, of num_ref_frames[i] and MaxDpbSize (as specified in subclauses A.3.1 or A.3.2 in the MPEG-4 AVC Standard). When the max_dec_frame_buffering[i] syntax element is not present, the value of max_dec_frame_buffering for the temporal level with temporal_id[i][j] equal to temporal_id, in the view with view_id[i] equal to view_id, shall be inferred to be equal to MaxDpbSize.
在mvc_vui_parameters_extension()中,执行两个循环。外循环在与subset_sequence_parameter_set相关联的所有视图上循环。在外循环中指定用于每个视图的时间级数目的view_id。内循环在视图的所有时间级上循环。在内循环中指定比特流限制信息。In mvc_vui_parameters_extension(), two loops are executed. The outer loop loops over all views associated with subset_sequence_parameter_set. The view_id, which specifies the number of temporal levels for each view, is specified in the outer loop. The inner loop loops over all temporal levels of the view. The bitstream restriction information is specified in the inner loop.
转到图5,通过附图标记500总地指示用于使用mvc_vui_parameters_extension()语法元素来编码每个视图中的每个时间级的比特流限制参数的示例方法。5 , an example method for encoding bitstream restriction parameters for each temporal level in each view using the mvc_vui_parameters_extension() syntax element is indicated generally by the reference numeral 500 .
方法500包括开始块505,其将控制传递到功能块510。功能块510将变量M设置为等于视图的数目减一,并将控制传递到功能块515。功能块515将变量M写入到比特流,并将控制传递到功能块520。功能块520将变量i设置为等于0,并将控制传递到功能块525。功能块525写入view_id[i]语法元素,并将控制传递到功能块530。功能块530将变量N设置为等于视图i中时间级的数目减一,并将控制传递到功能块535。功能块535将变量N写入到比特流,并将控制传递到功能块540。功能块540将变量j设置为等于0,并将控制传递到功能块545。功能块545写入temporal_id[i][j]语法元素,并将控制传递到功能块550。功能块550写入bitstream_restriction_flag[i][j]语法元素,并将控制传递到判定块555。判定块555确定 bitstream_restriction_flag[i][j]语法元素是否等于0。如果等于0,则将控制传递到判定块565。否则,将控制传递到功能块560。Method 500 includes a start block 505, which passes control to a function block 510. Function block 510 sets a variable M equal to the number of views minus one and passes control to a function block 515. Function block 515 writes the variable M to the bitstream and passes control to a function block 520. Function block 520 sets a variable i equal to 0 and passes control to a function block 525. Function block 525 writes the view_id[i] syntax element and passes control to a function block 530. Function block 530 sets a variable N equal to the number of temporal levels in view i minus one and passes control to a function block 535. Function block 535 writes the variable N to the bitstream and passes control to a function block 540. Function block 540 sets a variable j equal to 0 and passes control to a function block 545. Function block 545 writes the temporal_id[i][j] syntax element and passes control to a function block 550. Function block 550 writes the bitstream_restriction_flag[i][j] syntax element and passes control to decision block 555. Decision block 555 determines whether the bitstream_restriction_flag[i][j] syntax element is equal to 0. If so, control is passed to decision block 565. Otherwise, control is passed to function block 560.
功能块560写入视图i中的时间级j的比特流限制参数,并将控制传递到判定块565。判定块565确定变量j是否等于变量N。如果等于,则将控制传递到判定块570。否则,将控制传递到功能块575。Function block 560 writes the bitstream restriction parameters for temporal level j in view i and passes control to decision block 565. Decision block 565 determines whether variable j is equal to variable N. If so, control is passed to decision block 570. Otherwise, control is passed to function block 575.
判定块570确定变量i是否等于变量M。如果等于,则将控制传递到结束块599。否则,将控制传递到功能块580。Decision block 570 determines whether the variable i is equal to the variable M. If so, control is passed to end block 599. Otherwise, control is passed to function block 580.
功能块580将变量i设置为等于i加一,并且将控制返回到功能块525。Function block 580 sets the variable i equal to i plus one and returns control to function block 525 .
功能块575将变量j设置为等于j加一,并且将控制返回到功能块545。Function block 575 sets the variable j equal to j plus one and returns control to function block 545 .
转到图6,通过参考标号600总地指示用于使用mvc_vui_parameters_extension()语法元素来解码每个视图中的每个时间级的比特流限制参数的示例方法。6 , an example method for decoding bitstream restriction parameters for each temporal level in each view using the mvc_vui_parameters_extension() syntax element is indicated generally by the reference numeral 600 .
方法600包括开始块605,其将控制传递到功能块607。功能块607从比特流中读取变量M,并将控制传递到功能块610。功能块610将视图的数目设置为等于M加一,并将控制传递到功能块620。功能块620将变量i设置为等于0,并将控制传递到功能块625。功能块625读取view_id[i]语法元素,并将控制传递到功能块627。功能块627从比特流中读取变量N,并将控制传递到功能块630。功能块630将视图i中时间级的数目设置为等于N加一,并将控制传递到功能块640。功能块640将变量j设置为等于0,并将控制传递到功能块645。功能块645读取temporal_id[i][j]语法元素,并将控制传递到功能块650。功能块650读取bitstream_restriction_flag[i][j]语法元素,并将控制传递到判定块655。判定块655确定bitstream_restriction_flag[i][j]语法元素是否等于0。如果等于0,则将控制传递到判定块665。否则,将控制传递到功能块660。Method 600 includes a start block 605, which passes control to a function block 607. Function block 607 reads a variable M from the bitstream and passes control to a function block 610. Function block 610 sets the number of views to M plus one and passes control to a function block 620. Function block 620 sets a variable i to 0 and passes control to a function block 625. Function block 625 reads a view_id[i] syntax element and passes control to a function block 627. Function block 627 reads a variable N from the bitstream and passes control to a function block 630. Function block 630 sets the number of temporal levels in view i to N plus one and passes control to a function block 640. Function block 640 sets a variable j to 0 and passes control to a function block 645. Function block 645 reads a temporal_id[i][j] syntax element and passes control to a function block 650. Function block 650 reads the bitstream_restriction_flag[i][j] syntax element and passes control to decision block 655. Decision block 655 determines whether the bitstream_restriction_flag[i][j] syntax element is equal to 0. If so, control is passed to decision block 665. Otherwise, control is passed to function block 660.
功能块660读取视图i中的时间级j的比特流限制参数,并将控制传递到判定块665。判定块665确定变量j是否等于变量N。如果等于,则将控制传递到判定块670。否则,将控制传递到功能块675。Function block 660 reads the bitstream restriction parameters for temporal level j in view i and passes control to decision block 665. Decision block 665 determines whether variable j is equal to variable N. If so, control is passed to decision block 670. Otherwise, control is passed to function block 675.
判定块670确定变量i是否等于变量M。如果等于,则将控制传递到结束块699。否则,将控制传递到功能块680。Decision block 670 determines whether the variable i is equal to the variable M. If so, control is passed to end block 699. Otherwise, control is passed to function block 680.
功能块680将变量i设置为等于i加一,并且将控制返回到功能块625。Function block 680 sets the variable i equal to i plus one and returns control to function block 625 .
功能块675将变量j设置为等于j加一,并且将控制返回到功能块645。Function block 675 sets the variable j equal to j plus one and returns control to function block 645 .
对于每个操作点指定比特流限制信息Specify bitstream constraint information for each operation point
可以对于每个操作点指定比特流限制参数。我们提出在视图可缩放性信息SEI消息中传递每个操作点的比特流限制参数。可以如表4中那样修改视图可缩放性信息SEI消息的语法。将比特流限制信息的语法插入在所有操作点上循环的循环中。Bitstream restriction parameters can be specified for each operation point. We propose to convey the bitstream restriction parameters for each operation point in the view scalability information SEI message. The syntax of the view scalability information SEI message can be modified as shown in Table 4. The syntax of the bitstream restriction information is inserted in a loop that loops over all operation points.
表4Table 4
比特流限制语法元素的语义如下:The semantics of the bitstream constraint syntax element are as follows:
bitstream_restriction_flag[i]指定具有等于operation_point_id的operation_point_id[i]的操作点的bitstream_restriction_flag的值。bitstream_restriction_flag[i] specifies the value of bitstream_restriction_flag for the operation point having operation_point_id[i] equal to operation_point_id.
motion_vectors_over_pic_boundaries_flag[i]指定具有等于operation_point_id的operation_point_id[i]的操作点的motion_vectors_over_pic_boundaries_flag的值。当motion_vectors_over_pic_boundaries_flag[i]语法元素不存在时,具有等于operation_point_id的operation_point_id[i]的操作点的motion_vectors_over_pic_boundaries_flag值应当被推断为等于1。motion_vectors_over_pic_boundaries_flag[i] specifies the value of motion_vectors_over_pic_boundaries_flag for the operation point with operation_point_id[i] equal to operation_point_id. When the motion_vectors_over_pic_boundaries_flag[i] syntax element is not present, the value of motion_vectors_over_pic_boundaries_flag for the operation point with operation_point_id[i] equal to operation_point_id shall be inferred to be equal to 1.
max_bytes_per_pic_denom[i]指定具有等于等于operation_point_id的operation_point_id[i]的操作点的max_bytes_per_pic_denom值。当max_bytes_per_pic_denom[i]语法元素不存在时,具有等于operation_point_id的operation_point_id[i]的操作点的max_bytes_per_pic_denom的值应当被推断为等于2。max_bytes_per_pic_denom[i] specifies the max_bytes_per_pic_denom value for the operation point with operation_point_id[i] equal to operation_point_id. When the max_bytes_per_pic_denom[i] syntax element is not present, the max_bytes_per_pic_denom value for the operation point with operation_point_id[i] equal to operation_point_id shall be inferred to be equal to 2.
max_bits_per_mb_denom[i]指定具有等于operation_point_id的operation_point_id[i]的操作点的max_bits_per_mb_denom值。当max_bits_per_mb_ denom[i]不存在时,具有等于operation_point_id的operation_point_id[i]的操作点的max_bits_per_mb_denom的值应当被推断为等于1。max_bits_per_mb_denom[i] specifies the max_bits_per_mb_denom value for the operation point with operation_point_id[i] equal to operation_point_id. When max_bits_per_mb_denom[i] is not present, the max_bits_per_mb_denom value for the operation point with operation_point_id[i] equal to operation_point_id shall be inferred to be equal to 1.
log2_max_mv_length_horizontal[i]和log2_max_mv_length_vertical[i]分别指定具有等于operation_point_id的operation_point_id[i]的操作点的log2_max_mv_length_horizontal的值和log2_max_mv_length_vertical的值。当log2_max_mv_length_horizontal[i]不存在时,具有等于operation_point_id的operation_point_id[i]的操作点的log2_max_mv_length_horizontal和log2_max_mv_length_vertical的值应当被推断为等于16。log2_max_mv_length_horizontal[i] and log2_max_mv_length_vertical[i] specify the value of log2_max_mv_length_horizontal and the value of log2_max_mv_length_vertical, respectively, for the operation point with operation_point_id[i] equal to operation_point_id. When log2_max_mv_length_horizontal[i] is not present, the values of log2_max_mv_length_horizontal and log2_max_mv_length_vertical for the operation point with operation_point_id[i] equal to operation_point_id shall be inferred to be equal to 16.
num_reorder_frames[i]指定具有等于operation_point_id的operation_point_id[i]的操作点的num_reorder_frames的值。num_reorder_frames[i]的值应当在包括0和max_dec_frame_buffering的、0到max_dec_frame_buffering的范围中。当num_reorder_frames[i]语法元素不存在时,具有等于operation_point_id的operation_point_id[i]的操作点的num_reorder_frames的值应当被推断为等于max_dec_frame_buffering。num_reorder_frames[i] specifies the value of num_reorder_frames for the operation point with operation_point_id[i] equal to operation_point_id. The value of num_reorder_frames[i] should be in the range of 0 to max_dec_frame_buffering, inclusive. When the num_reorder_frames[i] syntax element is not present, the value of num_reorder_frames for the operation point with operation_point_id[i] equal to operation_point_id should be inferred to be equal to max_dec_frame_buffering.
max_dec_frame_buffering[i]指定具有等于operation_point_id的operation_point_id[i]的操作点的max_dec_frame_buffering的值。max_dec_frame_buffering[i]的值应当在包括num_ref_frames[i]和MaxDpbSize(如在MPEG-4AVC标准中的子款A.3.1或A.3.2中所指定的)的、num_ref_frames[i]到MaxDpbSize的范围中。当max_dec_frame_buffering[i]语法元素不存在时,具有等于operation_point_id的operation_point_id[i]的操作点的max_dec_frame_buffering的值应当被推断为等于MaxDpbSize。max_dec_frame_buffering[i] specifies the value of max_dec_frame_buffering for the operation point with operation_point_id[i] equal to operation_point_id. The value of max_dec_frame_buffering[i] shall be in the range of num_ref_frames[i] to MaxDpbSize, inclusive, of num_ref_frames[i] and MaxDpbSize (as specified in subclauses A.3.1 or A.3.2 in the MPEG-4 AVC Standard). When the max_dec_frame_buffering[i] syntax element is not present, the value of max_dec_frame_buffering for the operation point with operation_point_id[i] equal to operation_point_id shall be inferred to be equal to MaxDpbSize.
转到图7,通过附图标记700总地指示用于使用view_scalability_parameters_extension()语法元素来编码每个操作点的比特流限制参数的示例方法。Turning to FIG. 7 , an example method for encoding bitstream restriction parameters per operation point using the view_scalability_parameters_extension() syntax element is indicated generally by the reference numeral 700 .
方法700包括开始块705,其将控制传递到功能块710。功能块710将变量M设置为等于操作点的数目减一,并将控制传递到功能块715。功能块715将变量M写入到比特流,并将控制传递到功能块720。功能块720将变量i设置为等于0,并将控制传递到功能块725。功能块725写入operation_point_id[i]语法元素,并将控制传递到功能块730。功能块730写入bitstream_restriction_flag[i]语法元素,并将控制传递到判定块735。判定块735确定bitstream_restriction_flag[i]语法元素是否等于0。如果等于0,则将 控制传递到判定块745。否则,将控制传递到功能块740。Method 700 includes a start block 705, which passes control to a function block 710. Function block 710 sets a variable M equal to the number of operation points minus one and passes control to a function block 715. Function block 715 writes the variable M to the bitstream and passes control to a function block 720. Function block 720 sets a variable i equal to 0 and passes control to a function block 725. Function block 725 writes the operation_point_id[i] syntax element and passes control to a function block 730. Function block 730 writes the bitstream_restriction_flag[i] syntax element and passes control to a decision block 735. Decision block 735 determines whether the bitstream_restriction_flag[i] syntax element is equal to 0. If so, control is passed to a decision block 745. Otherwise, control is passed to a function block 740.
功能块740写入操作点i的比特流限制参数,并将控制传递到判定块745。判定块745确定变量i是否等于变量M。如果等于,则将控制传递到结束块799。否则,将控制传递到功能块750。Function block 740 writes the bitstream constraint parameters for operation point i and passes control to decision block 745. Decision block 745 determines whether variable i is equal to variable M. If so, control is passed to end block 799. Otherwise, control is passed to function block 750.
功能块750将变量i设置为等于i加一,并且将控制返回到功能块725。Function block 750 sets the variable i equal to i plus one and returns control to function block 725 .
转到图8,通过参考标号800总地指示用于使用view_scalability_parameters_extension()语法元素来解码每个操作点的比特流限制参数的示例方法。Turning to FIG. 8 , an example method for decoding bitstream restriction parameters per operation point using the view_scalability_parameters_extension() syntax element is indicated generally by the reference numeral 800 .
方法800包括开始块805,其将控制传递到功能块807。功能块807从比特流中读取变量M,并将控制传递到功能块810。功能块810将操作点的数目设置为等于变量M加一,并将控制传递到功能块820。功能块820将变量i设置为等于0,并将控制传递到功能块825。功能块825读取operation_point_id[i]语法元素,并将控制传递到功能块830。功能块830读取bitstream_restriction_flag[i]语法元素,并将控制传递到判定块835。判定块835确定bitstream_restriction_flag[i]语法元素是否等于0。如果等于0,则将控制传递到判定块845。否则,将控制传递到功能块840。Method 800 includes a start block 805, which passes control to a function block 807. Function block 807 reads a variable M from the bitstream and passes control to a function block 810. Function block 810 sets the number of operation points equal to the variable M plus one and passes control to a function block 820. Function block 820 sets a variable i equal to 0 and passes control to a function block 825. Function block 825 reads the operation_point_id[i] syntax element and passes control to a function block 830. Function block 830 reads the bitstream_restriction_flag[i] syntax element and passes control to a decision block 835. Decision block 835 determines whether the bitstream_restriction_flag[i] syntax element is equal to 0. If so, control is passed to a decision block 845. Otherwise, control is passed to a function block 840.
功能块840读取操作点i的比特流限制参数,并将控制传递到判定块845。判定块445确定变量i是否等于变量M。如果等于,则将控制传递到结束块899。否则,将控制传递到功能块850。Function block 840 reads the bitstream constraint parameters for operation point i and passes control to decision block 845. Decision block 845 determines whether variable i is equal to variable M. If so, control is passed to end block 899. Otherwise, control is passed to function block 850.
功能块850将变量i设置为等于i加一,并且将控制返回到功能块825。Function block 850 sets the variable i equal to i plus one and returns control to function block 825 .
现在将给出本发明的许多伴随优点/特征中的一些的描述,已经在上面提及了所述许多伴随优点/特征中的一些。例如,一个优点/特征是包括用于通过对于各个视图、视图中的各个时间级、以及各个操作点中的至少一项指定视频可用性信息来编码多视图视频内容的编码器的装置。A description will now be given of some of the many accompanying advantages/features of the present invention, some of which have been mentioned above. For example, one advantage/feature is an apparatus comprising an encoder for encoding multi-view video content by specifying video usability information for at least one of individual views, individual temporal levels within a view, and individual operating points.
另一优点/特征是具有如上所述的编码器的装置,其中,在至少一个高级语法元素中指定参数。Another advantage/feature is the apparatus having the encoder as described above, wherein the parameters are specified in at least one high-level syntax element.
此外,另一优点/特征是具有如上所述的编码器的装置,其中,所述至少一个高级语法元素包括以下中的至少一项:mvc_vui_parameters_extension()语法元素、mvc_scalability_info补充增强信息语法消息、序列参数集的至少一部分、画面参数集、以及补充增强信息。Furthermore, another advantage/feature is the apparatus having the encoder as described above, wherein the at least one high-level syntax element includes at least one of: an mvc_vui_parameters_extension() syntax element, an mvc_scalability_info supplemental enhancement information syntax message, at least a portion of a sequence parameter set, a picture parameter set, and supplemental enhancement information.
此外,另一优点/特征是具有如上所述的编码器的装置,其中,视频可用 性信息的至少一部分包括比特流限制参数。Additionally, another advantage/feature is the apparatus having the encoder as described above, wherein at least a portion of the video availability information includes a bitstream restriction parameter.
基于这里的教导,相关领域普通技术人员可以容易确定本原理的这些和其它特征和优点。应理解本原理的教导可以以硬件、软件、固件、特殊用途处理器、或其组合的各种形式来实现。Based on the teachings herein, those skilled in the relevant art can easily determine these and other features and advantages of the present principles.It should be understood that the teachings of the present principles can be implemented in various forms of hardware, software, firmware, special purpose processors, or a combination thereof.
最优选地,本原理的教导被实现为硬件与软件的组合。此外,软件可以实现为有形地体现在程序存储单元上的应用程序。应用程序可以被上载到包括任何适当结构的机器并由该机器执行。优选地,在具有诸如一个或多个中央处理单元(“CPU”)、随机存取存储器(“RAM”)、以及输入/输出(“I/O”)接口等的硬件的计算机平台上实现该机器。计算机平台还可以包括操作系统和微指令代码。这里描述的各种处理与功能可以是可以由CPU执行的微指令代码的一部分或是应用程序的一部分、或者是其任何组合。另外,各种其它外设单元可以连接到计算机平台,如附加数据存储单元以及打印单元。Most preferably, the teachings of the present principles are implemented as a combination of hardware and software. In addition, the software can be implemented as an application program tangibly embodied on a program storage unit. The application program can be uploaded to a machine comprising any suitable structure and executed by the machine. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units ("CPU"), random access memory ("RAM"), and input/output ("I/O") interfaces. The computer platform can also include an operating system and microinstruction code. The various processes and functions described herein can be part of the microinstruction code that can be executed by the CPU or part of the application program, or any combination thereof. In addition, various other peripheral units can be connected to the computer platform, such as additional data storage units and printing units.
还应理解,由于在附图中示出的一些系统组成组件和方法优选地以软件实现,因此这些系统组件或处理功能块之间的实际连接可能取决于本原理被编程的方式而有所不同。给出这里的教导,相关领域普通技术人员将能够预期本原理的这些和类似的实现方式或配置。It should also be understood that since some of the system components and methods shown in the drawings are preferably implemented in software, the actual connections between these system components or processing function blocks may vary depending on how the present principles are programmed. Given the teachings herein, one of ordinary skill in the relevant art will be able to contemplate these and similar implementations or configurations of the present principles.
尽管这里已经参考附图描述了示例实施例,但是应理解本原理不限于那些确切的实施例,并且相关领域普通技术人员可以在其中进行各种改变和修改,而不偏离本原理的范围和精神。所有这些改变和修改意在被包括在所附权利要求中所提出的本原理的范围之内。Although example embodiments have been described herein with reference to the accompanying drawings, it should be understood that the present principles are not limited to those precise embodiments, and that various changes and modifications may be made therein by one of ordinary skill in the relevant art without departing from the scope and spirit of the present principles. All such changes and modifications are intended to be included within the scope of the present principles as set forth in the appended claims.
Claims (11)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US60/977,709 | 2007-10-05 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1229105A1 HK1229105A1 (en) | 2017-11-10 |
| HK1229105B true HK1229105B (en) | 2020-07-17 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101889448B (en) | Method and apparatus for incorporating video usability information (VUI) into a multiview video (MVC) coding system | |
| EP2137973B1 (en) | Methods and apparatus for video usability information (vui) for scalable video coding (svc) | |
| EP3179725B1 (en) | Method and apparatus for signaling view scalability in multi-view video coding | |
| CN118646871A (en) | Method and apparatus for high-level syntax in video coding and decoding | |
| CN115552911A (en) | Generic constraint information and signaling for syntax elements in video codecs | |
| HK1229105B (en) | Methods and apparatus for incorporating video usability information within a multi-view video coding system | |
| HK1229105A1 (en) | Methods and apparatus for incorporating video usability information within a multi-view video coding system | |
| HK1226885A1 (en) | Methods and apparatus for incorporating video usability information within a multi-view video coding system | |
| HK40065952A (en) | Methods and apparatus for video usability information (vui) for scalable video coding (svc) | |
| HK40012187A (en) | Methods and apparatus for video usability information (vui) for scalable video coding (svc) | |
| HK40012187B (en) | Methods and apparatus for video usability information (vui) for scalable video coding (svc) |