[go: up one dir, main page]

HK1250109B - Image processing device and image processing method - Google Patents

Image processing device and image processing method Download PDF

Info

Publication number
HK1250109B
HK1250109B HK18109496.0A HK18109496A HK1250109B HK 1250109 B HK1250109 B HK 1250109B HK 18109496 A HK18109496 A HK 18109496A HK 1250109 B HK1250109 B HK 1250109B
Authority
HK
Hong Kong
Prior art keywords
quantization matrix
identification information
transform coefficient
quantization
coefficient data
Prior art date
Application number
HK18109496.0A
Other languages
Chinese (zh)
Other versions
HK1250109A1 (en
Inventor
田中润一
Original Assignee
Sony Group Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Group Corporation filed Critical Sony Group Corporation
Publication of HK1250109A1 publication Critical patent/HK1250109A1/en
Publication of HK1250109B publication Critical patent/HK1250109B/en

Links

Description

图像处理设备和图像处理方法Image processing device and image processing method

本申请是申请号为201280007291.6、PCT国际申请日为2012年1月18日、发明名称为“图像处理装置和图像处理方法”的发明专利申请的分案申请。This application is a divisional application of the invention patent application with application number 201280007291.6, PCT international application date January 18, 2012, and invention name “Image processing device and image processing method”.

技术领域Technical Field

本发明涉及图像处理设备和图像处理方法。The present invention relates to an image processing device and an image processing method.

背景技术Background Art

在H.264/AVC(一种用于图像编码方案的标准规格)中,当以高级规范(highprofile)或更高级规范量化图像数据时,可以对正交变换系数的每个分量使用不同的量化步长。可基于以与正交变换的单位相同大小定义的量化矩阵(也称为缩放列表)和标准步长值设置正交变换系数的每个分量的量化步长(或量化尺度)。In H.264/AVC (a standard specification for image coding schemes), when quantizing image data in the High Profile or higher profile, a different quantization step size can be used for each component of the orthogonal transform coefficients. The quantization step size (or quantization scale) for each component of the orthogonal transform coefficients can be set based on a quantization matrix (also called a scaling list) defined with the same size as the unit of the orthogonal transform and a standard step size value.

图38示出在H.264/AVC中预定义的四种类别的默认量化矩阵。矩阵SL1是用于帧内预测模式的默认4×4量化矩阵。矩阵SL2是用于帧间预测模式的默认4×4量化矩阵。矩阵SL3是用于帧内预测模式的默认8×8量化矩阵。矩阵SL4是用于帧间预测模式的默认8×8量化矩阵。用户还可以在序列参数集或图片参数集中定义与图38所示的默认矩阵不同的其自己的量化矩阵。注意,在未指定量化矩阵的情况中,可使用全部分量具有相等量化步长的扁平量化矩阵。Figure 38 shows the four categories of default quantization matrices predefined in H.264/AVC. Matrix SL1 is the default 4×4 quantization matrix used for intra prediction mode. Matrix SL2 is the default 4×4 quantization matrix used for inter prediction mode. Matrix SL3 is the default 8×8 quantization matrix used for intra prediction mode. Matrix SL4 is the default 8×8 quantization matrix used for inter prediction mode. Users can also define their own quantization matrices that are different from the default matrices shown in Figure 38 in the sequence parameter set or picture parameter set. Note that if a quantization matrix is not specified, a flat quantization matrix with equal quantization step sizes for all components can be used.

在高效率视频编码(HEVC)中(其标准化被提升为下一代图像编码方案以作为H.264/AVC的继任),引入编码单位(CU)的概念,其对应于过去的宏块(参考下面的非专利文献1)。编码单位尺寸的范围在序列参数集中被指定为称为最大编码单位(LCU)和最小编码单位(SCU)的一对2的幂值。另外,使用SPLIT_FLAG在由LCU和SCU规定的范围中指定具体编码单位尺寸。High Efficiency Video Coding (HEVC), whose standardization is being promoted as the next-generation image coding scheme to succeed H.264/AVC, introduces the concept of coding units (CUs), which correspond to conventional macroblocks (see Non-Patent Document 1 below). The range of coding unit sizes is specified in the sequence parameter set as a pair of power-of-two values called the largest coding unit (LCU) and the smallest coding unit (SCU). Furthermore, the SPLIT_FLAG flag is used to specify a specific coding unit size within the range specified by the LCU and SCU.

在HEVC中,可以将一个编码单位分为一个或更多个正交变换的单位,或者换句话说,一个或多个变换单位(TU)。可以使用4×4、8×8、16×16和32×32中的任一个作为变换单元尺寸。从而,还可以对这些候选变换单位尺寸的每个指定量化矩阵。下面的非专利文献2提出对一个图片中的一个变换单元尺寸规定多个量化矩阵的候选,并从率失真(RD)优化的角度对每个块适应地选择量化矩阵。In HEVC, a coding unit can be divided into one or more orthogonal transform units, or in other words, one or more transform units (TUs). Any of 4×4, 8×8, 16×16, and 32×32 transform unit sizes can be used. Therefore, a quantization matrix can also be specified for each of these candidate transform unit sizes. The following non-patent document 2 proposes specifying multiple quantization matrix candidates for one transform unit size in a picture, and adaptively selecting a quantization matrix for each block from the perspective of rate-distortion (RD) optimization.

引用列表Reference List

非专利文献Non-patent literature

非专利文献1: JCTVC-B205, "Test Model under Consideration", ITU-T SG16WP3和ISO/IEC JTC1/SC29/WG11的视频编码联合组 (JCT-VC)第二次会议:瑞士,日内瓦,2010年7月21-28日Non-Patent Document 1: JCTVC-B205, "Test Model under Consideration", Second Meeting of the Joint Video Coding Group of ITU-T SG16WP3 and ISO/IEC JTC1/SC29/WG11 (JCT-VC): Geneva, Switzerland, July 21-28, 2010

非专利文献2:VCEG-AD06, "Adaptive Quantization Matrix Selection on KTASoftware", ITU - 电信标准化部门研究组16问题6视频编码专家组(VCEG)第30次会议:中国,杭州,2006年10月23-24日。Non-Patent Document 2: VCEG-AD06, "Adaptive Quantization Matrix Selection on KTA Software", ITU - Telecommunication Standardization Sector Study Group 16 Question 6 Video Coding Experts Group (VCEG) 30th Meeting: Hangzhou, China, October 23-24, 2006.

发明内容Summary of the Invention

技术问题Technical issues

然而,根据视频中包括的每个图像的特征,适应于量化和逆量化的量化矩阵不同。为此,如果试图以优化量化矩阵编码其图像特征时时变化的视频,量化矩阵更新的频率将上升。在H.264/AVC中,以图片参数集(PPS)或序列参数集(SPS)定义量化矩阵。从而,如果量化矩阵更新的频率上升,那么由SPS或PPS占用的编码流的比例将增大。这意味着,由于增大的开销(overhead),编码效率将下降。对于HEVC,其中量化矩阵尺寸进一步增大,并且其中可以对每个图片定义若干不同的量化矩阵,存在这样的风险,即,该随着量化矩阵的更新的编码效率的下降可能变得更加显著。However, depending on the characteristics of each image included in the video, the quantization matrix adapted for quantization and inverse quantization is different. For this reason, if an attempt is made to encode a video whose image characteristics change from time to time with an optimized quantization matrix, the frequency of quantization matrix updates will increase. In H.264/AVC, quantization matrices are defined in picture parameter sets (PPS) or sequence parameter sets (SPS). Therefore, if the frequency of quantization matrix updates increases, the proportion of the coded stream occupied by the SPS or PPS will increase. This means that due to the increased overhead, coding efficiency will decrease. For HEVC, where the quantization matrix size is further increased and where several different quantization matrices can be defined for each picture, there is a risk that this decrease in coding efficiency with the update of the quantization matrix may become more significant.

因此,需要提供一种机制,使得可以减轻随着量化矩阵的更新的编码效率的下降。Therefore, it is necessary to provide a mechanism that can alleviate the decrease in encoding efficiency accompanying the update of the quantization matrix.

解决问题的技术方案Technical solutions to the problem

根据本发明的实施例,提供一种图像处理装置,包括:获取部,被配置为从编码流获取量化矩阵参数,该编码流是在不同于序列参数集和图片参数集的参数集中设置定义量化矩阵的所述量化矩阵参数的编码流;设置部,被配置为基于通过所述获取部获取的量化矩阵参数设置当逆量化从编码流解码的数据时使用的量化矩阵;以及逆量化部,被配置为使用通过所述设置部设置的量化矩阵逆量化从编码流解码的数据。According to an embodiment of the present invention, there is provided an image processing device, comprising: an acquisition unit configured to acquire quantization matrix parameters from a coding stream, the coding stream being a coding stream in which the quantization matrix parameters defining the quantization matrix are set in a parameter set different from a sequence parameter set and a picture parameter set; a setting unit configured to set a quantization matrix used when inverse quantizing data decoded from the coding stream based on the quantization matrix parameters acquired by the acquisition unit; and an inverse quantization unit configured to inverse quantize the data decoded from the coding stream using the quantization matrix set by the setting unit.

图像处理装置可以典型地实现为用于解码图像的图像解码装置。The image processing device can typically be implemented as an image decoding device for decoding an image.

根据本发明的实施例,提供一种图像处理方法,包括:从编码流获取量化矩阵参数,所述编码流是在不同于序列参数集和图片参数集的参数集中设置定义量化矩阵的所述量化矩阵参数的编码流;基于获取的量化矩阵参数设置当逆量化从编码流解码的数据时使用的量化矩阵;以及使用所设置的量化矩阵逆量化从所述编码流解码的数据。According to an embodiment of the present invention, there is provided an image processing method, comprising: acquiring quantization matrix parameters from a coding stream, wherein the coding stream is a coding stream in which the quantization matrix parameters defining the quantization matrix are set in a parameter set different from a sequence parameter set and a picture parameter set; setting a quantization matrix used when inverse quantizing data decoded from the coding stream based on the acquired quantization matrix parameters; and inverse quantizing the data decoded from the coding stream using the set quantization matrix.

根据本发明的实施例,提供一种图像处理装置,包括:量化部,被配置为使用量化矩阵量化数据;设置部,被配置为设置量化矩阵参数,该量化矩阵参数定义在所述量化部量化所述数据时使用的量化矩阵;以及编码部,被配置为在不同于序列参数集和图片参数集的参数集中编码通过设置部设置的量化矩阵参数。According to an embodiment of the present invention, an image processing device is provided, comprising: a quantization unit configured to quantize data using a quantization matrix; a setting unit configured to set quantization matrix parameters, which define the quantization matrix used when the quantization unit quantizes the data; and an encoding unit configured to encode the quantization matrix parameters set by the setting unit in a parameter set different from a sequence parameter set and a picture parameter set.

图像处理装置可以典型地实现为用于编码图像的图像编码装置。The image processing apparatus may typically be implemented as an image encoding apparatus for encoding an image.

根据本发明的实施例,提供一种图像处理方法,包括:使用量化矩阵量化数据;设置量化矩阵参数,该量化矩阵参数定义在量化所述数据时使用的量化矩阵;以及在不同于序列参数集和图片参数集的参数集中编码所设置的量化矩阵参数。According to an embodiment of the present invention, there is provided an image processing method, comprising: quantizing data using a quantization matrix; setting quantization matrix parameters, which define the quantization matrix used when quantizing the data; and encoding the set quantization matrix parameters in a parameter set different from a sequence parameter set and a picture parameter set.

发明的有益效果Advantageous Effects of the Invention

根据本发明的图像处理装置和图像处理方法,可以缓解随着量化矩阵的更新的编码效率的下降。According to the image processing apparatus and the image processing method of the present invention, it is possible to alleviate the decrease in encoding efficiency accompanying the updating of the quantization matrix.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为示出根据实施例的图像编码装置的示例结构的框图;1 is a block diagram showing an example structure of an image encoding device according to an embodiment;

图2为示出图1所示的语法处理部的具体配置的实例的框图;FIG2 is a block diagram showing an example of a specific configuration of a grammar processing section shown in FIG1 ;

图3为示出在实施例中的量化矩阵参数集中包括的示例参数的示例图;FIG3 is an exemplary diagram showing example parameters included in a quantization matrix parameter set in an embodiment;

图4为示出在实施例中的片段头中包括的示例参数的说明图;FIG4 is an explanatory diagram showing example parameters included in a slice header in the embodiment;

图5为示出根据实施例的参数集插入处理的示例流程的流程图;5 is a flow chart illustrating an example process of parameter set insertion processing according to an embodiment;

图6为用于说明在根据实施例的技术与现有技术之间的流结构中的差异的说明图;FIG6 is an explanatory diagram for explaining a difference in a stream structure between the technology according to the embodiment and the related art;

图7为示出根据实施例的图像解码装置的示例结构的框图;7 is a block diagram showing an example structure of an image decoding device according to an embodiment;

图8为示出图7所示的语法处理部的具体配置的实例的框图;FIG8 is a block diagram showing an example of a specific configuration of a grammar processing section shown in FIG7;

图9为示出根据实施例的量化矩阵生成处理的示例流程的流程图;9 is a flowchart illustrating an example flow of a quantization matrix generation process according to an embodiment;

图10为示出根据实施例的复制模式中的处理的具体流程的实例的流程图;10 is a flowchart showing an example of a specific flow of processing in the copy mode according to the embodiment;

图11为示出根据实施例的轴指定模式中的处理的具体流程的实例的流程图;11 is a flowchart showing an example of a specific flow of processing in the axis designation mode according to the embodiment;

图12为示出根据实施例的用于对片段设置量化矩阵的处理的示例流程的流程图;12 is a flowchart illustrating an example flow of a process for setting a quantization matrix for a segment according to an embodiment;

图13为示出表达量化矩阵参数集的语法的说明性伪码的第一实例的第一说明图;13 is a first explanatory diagram showing a first example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图14为示出表达量化矩阵参数集的语法的说明性伪码的第一实例的第二说明图;14 is a second explanatory diagram showing a first example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图15为示出表达量化矩阵参数集的语法的说明性伪码的第一实例的第三说明图;15 is a third explanatory diagram showing a first example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图16为示出表达量化矩阵参数集的语法的说明性伪码的第二实例的第一说明图;16 is a first explanatory diagram showing a second example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图17为示出表达量化矩阵参数集的语法的说明性伪码的第二实例的第二说明图;17 is a second explanatory diagram showing a second example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图18为示出表达量化矩阵参数集的语法的说明性伪码的第二实例的第三说明图;18 is a third explanatory diagram showing a second example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图19为示出表达量化矩阵参数集的语法的说明性伪码的第二实例的第四说明图;19 is a fourth explanatory diagram showing a second example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图20为示出表达量化矩阵参数集的语法的说明性伪码的第二实例的第五说明图;20 is a fifth explanatory diagram showing a second example of an illustrative pseudo code expressing the syntax of a quantization matrix parameter set;

图21为示出为量化量化矩阵而定义的量化尺度设置区域的实例的说明图;FIG21 is an explanatory diagram showing an example of a quantization scale setting area defined for quantizing a quantization matrix;

图22为示出在以图21所示的各个量化尺度设置区域中设置的量化尺度的实例的说明图;FIG22 is an explanatory diagram showing an example of quantization scales set in the respective quantization scale setting areas shown in FIG21;

图23为用于说明在LCEC中准备的11类VLC表的说明图;FIG23 is an explanatory diagram for explaining 11 types of VLC tables prepared in LCEC;

图24为示出根据使用APS的第一技术构造的编码流的实例的说明图;FIG24 is an explanatory diagram showing an example of an encoded stream constructed according to the first technique using APS;

图25为示出根据使用APS的第一技术定义的APS语法的实例的说明图;FIG25 is an explanatory diagram showing an example of an APS syntax defined according to the first technique using APS;

图26为示出根据使用APS的第一技术定义的片段头语法的实例的说明图;FIG26 is an explanatory diagram showing an example of a slice header syntax defined according to the first technique using APS;

图27为示出根据使用APS的第一技术的示例修改定义的APS语法的实例的说明图;27 is an explanatory diagram showing an example of an APS syntax defined according to an example modification of the first technique using APS;

图28为示出根据使用APS的第二技术构造的编码流的实例的说明图;FIG28 is an explanatory diagram showing an example of an encoded stream constructed according to the second technique using APS;

图29为示出根据使用APS的第三技术构造的编码流的实例的说明图;FIG29 is an explanatory diagram showing an example of an encoded stream constructed according to the third technique using APS;

图30为示出根据使用APS的第三技术定义的APS语法的实例的说明图;FIG30 is an explanatory diagram showing an example of an APS syntax defined according to the third technique using APS;

图31为示出根据使用APS的第三技术定义的片段头语法的实例的说明图;FIG31 is an explanatory diagram showing an example of a slice header syntax defined according to the third technique using APS;

图32为列出用于若干典型编码工具的每个的参数特征的表;FIG32 is a table listing parameter characteristics for each of several typical encoding tools;

图33为用于说明根据使用APS的第三技术的示例修改构造的编码流的实例的说明图;FIG33 is an explanatory diagram for explaining an example of an encoded stream constructed according to an example modification of the third technique using APS;

图34是示出电视的示意配置的实例的框图;FIG34 is a block diagram showing an example of a schematic configuration of a television;

图35是示出移动电话的示意配置的实例的框图;FIG35 is a block diagram showing an example of a schematic configuration of a mobile phone;

图36是示出记录和回放装置的示意配置的实例的框图;FIG36 is a block diagram showing an example of a schematic configuration of a recording and playback device;

图37是示出成像装置的示意配置的实例的框图;以及FIG37 is a block diagram showing an example of a schematic configuration of an imaging device; and

图38示出在H.264/AVC中预定义的默认量化矩阵的说明图。FIG. 38 is an explanatory diagram showing a default quantization matrix predefined in H.264/AVC.

具体实施方式DETAILED DESCRIPTION

下文中,将参照附图详细描述本发明的优选实施例。注意,在该说明书和附图中,以相同的标号标注具有基本相同的功能和结构的元件,并省略重复的说明。Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. Note that, in this specification and the drawings, elements that have substantially the same function and structure are denoted by the same reference numerals, and repeated explanation is omitted.

另外,将以下面的顺序进行描述。Additionally, the description will be made in the following order.

1.根据实施例的图像编码装置的示例配置1. Example Configuration of Image Encoding Device According to Embodiment

1-1.示例总体配置1-1. Example overall configuration

1-2.语法处理部的示例配置1-2. Example Configuration of Syntax Processing Unit

1-3.示例参数结构1-3. Example parameter structure

2.在根据实施例的编码期间的处理流程2. Processing Flow During Encoding According to the Embodiment

3.根据实施例的图像解码装置的示例配置3. Example Configuration of Image Decoding Device According to Embodiment

3-1.示例总体配置3-1. Example overall configuration

3-2.语法处理部的示例配置3-2. Example Configuration of Syntax Processing Unit

4.在根据实施例的解码期间的处理流程4. Processing Flow During Decoding According to an Embodiment

4-1.生成量化矩阵4-1. Generating Quantization Matrix

4-2.对片段设置量化矩阵4-2. Setting the quantization matrix for the segment

5.语法实例5. Syntax Examples

5-1.第一实例5-1. First Example

5-2.第二实例5-2. Second Example

6.参数集的各种示例配置6. Various example configurations of parameter sets

6-1.第一技术6-1. First Technology

6-2.第一技术的示例修改6-2. Example Modification of First Technology

6-3.第二技术6-3. Second technology

6-4.第三技术6-4. Third Technology

6-5.第三技术的示例修改6-5. Example Modification of the Third Technology

7.应用7. Application

8.结论8. Conclusion

<1.根据实施例的图像编码装置的示例配置><1. Example Configuration of Image Encoding Device According to Embodiment>

这部分描述根据实施例的图像编码装置的示例配置。This section describes an example configuration of an image encoding device according to an embodiment.

[1-1.示例总体配置][1-1. Example overall configuration]

图1为示出根据实施例的图像编码装置的示例结构的框图。参考图1,图像编码装置10被配置有:模拟至数字(A/D)转换部11、记录缓存器12、语法处理部13、减法部14、正交变换部15、量化部16、无损编码部17、累积缓存器18、速率控制部19、逆量化部21、逆正交变换部22、加法部23、解块滤波器24、帧存储器25、选择器26、帧内预测部30、运动估计部40、以及模式选择部50。FIG1 is a block diagram illustrating an example structure of an image encoding device according to an embodiment. Referring to FIG1 , the image encoding device 10 includes an analog-to-digital (A/D) conversion section 11, a recording buffer 12, a syntax processing section 13, a subtraction section 14, an orthogonal transform section 15, a quantization section 16, a lossless encoding section 17, an accumulation buffer 18, a rate control section 19, an inverse quantization section 21, an inverse orthogonal transform section 22, an addition section 23, a deblocking filter 24, a frame memory 25, a selector 26, an intra-frame prediction section 30, a motion estimation section 40, and a mode selection section 50.

A/D转换部11将以模拟格式输入的图像信号转换为数字格式的图像数据,并将数字图像数据的序列输出到记录缓存器12。The A/D conversion section 11 converts an image signal input in an analog format into image data in a digital format, and outputs a sequence of digital image data to the recording buffer 12 .

记录缓存器12记录在从A/D转换部11输入的图像数据的序列中包括的图像。在记录根据编码处理的图片组(GOP)结构的图像之后,记录缓存器12将记录的图像数据输出到语法处理部13。The recording buffer 12 records images included in the sequence of image data input from the A/D conversion section 11. After recording images in a group of pictures (GOP) structure according to encoding processing, the recording buffer 12 outputs the recorded image data to the syntax processing section 13.

从记录缓存器12输出到语法处理部13的图像数据被映射到以所谓网络提取层(NAL)单元为单位的比特流。图像数据流包括一个或多个序列。序列中的先头图片称为即时解码刷新(IDR)图片。每个序列包括一个或多个图片,每个图片还包括一个或多个片段。在H.264/AVC和HEVC中,这些片段是视频编码和解码的基本单位。用于每个片段的数据被识别为视频编码层(VCL)NAL单元。The image data output from the recording buffer 12 to the syntax processing unit 13 is mapped into a bitstream in units of so-called Network Abstraction Layer (NAL) units. The image data stream consists of one or more sequences. The leading picture in a sequence is called an Instantaneous Decoding Refresh (IDR) picture. Each sequence consists of one or more pictures, and each picture also consists of one or more slices. In H.264/AVC and HEVC, these slices are the basic units of video encoding and decoding. The data for each slice is identified as a Video Coding Layer (VCL) NAL unit.

语法处理部13顺序识别从记录缓存器12输入的图像数据流的NAL单元,并将存储头信息的非VCL NAL单元插入到流中。语法处理部13插入到流中的非VCL NAL单元包括序列参数集(SPS)和图片参数集(PPS)。另外,在该实施例中,语法处理部13在流中插入量化矩阵参数集(QMPS)、不同于SPS和PPS的非VCL NAL单元。语法处理部13还在片段的开始添加片段头(SH)。语法处理部13然后将包括VCL NAL单元和非VCL NAL单元的图像数据流输出到减法部14、帧内预测部30和运动估计部40。下面将进一步描述语法处理部13的具体配置。The syntax processing unit 13 sequentially identifies NAL units in the image data stream input from the recording buffer 12 and inserts non-VCL NAL units storing header information into the stream. The non-VCL NAL units inserted into the stream by the syntax processing unit 13 include a sequence parameter set (SPS) and a picture parameter set (PPS). Furthermore, in this embodiment, the syntax processing unit 13 inserts a quantization matrix parameter set (QMPS), a non-VCL NAL unit other than the SPS and PPS, into the stream. The syntax processing unit 13 also adds a slice header (SH) to the beginning of a slice. The syntax processing unit 13 then outputs the image data stream, including VCL NAL units and non-VCL NAL units, to the subtraction unit 14, the intra prediction unit 30, and the motion estimation unit 40. The specific configuration of the syntax processing unit 13 will be further described below.

对减法部14提供从语法处理部13输出的图像数据和通过下述模式选择部50选择的预测图像数据。减法部14计算作为从语法处理部13输出的图像数据与从模式选择部50输入的预测图像数据之差的预测误差数据,并将计算的预测误差数据输出到正交变换部15。The subtraction unit 14 is supplied with the image data output from the syntax processing unit 13 and the predicted image data selected by the mode selection unit 50 described below. The subtraction unit 14 calculates prediction error data, which is the difference between the image data output from the syntax processing unit 13 and the predicted image data input from the mode selection unit 50, and outputs the calculated prediction error data to the orthogonal transformation unit 15.

正交变换部15对从减法部13输入的预测误差数据执行正交变换。通过正交变换部15执行的正交变换可以为例如离散余弦变换(DCT)或Karhunen-Loeve变换。正交变换部15将通过正交变换处理获得的变换系数数据输出到量化部16。The orthogonal transform section 15 performs an orthogonal transform on the prediction error data input from the subtraction section 13. The orthogonal transform performed by the orthogonal transform section 15 may be, for example, a discrete cosine transform (DCT) or a Karhunen-Loeve transform. The orthogonal transform section 15 outputs the transform coefficient data obtained by the orthogonal transform process to the quantization section 16.

量化部16使用量化矩阵量化从正交变换部15输入的变换系数数据,并将量化的变换系数数据(下文称为量化数据)输出到无损编码部17和逆量化部21。基于来自的速率控制部19的速率控制信号控制量化数据的比特率。由量化部16使用的量化矩阵在量化矩阵参数集中定义,并可以对于每个片段在片段头中指定。在该情况中,在未指定量化矩阵的情况中,使用对于全部分量具有相等量化步长的扁平量化矩阵。The quantization unit 16 quantizes the transform coefficient data input from the orthogonal transform unit 15 using a quantization matrix and outputs the quantized transform coefficient data (hereinafter referred to as quantized data) to the lossless encoding unit 17 and the inverse quantization unit 21. The bit rate of the quantized data is controlled based on a rate control signal from the rate control unit 19. The quantization matrix used by the quantization unit 16 is defined in a quantization matrix parameter set and can be specified in the slice header for each slice. In this case, if no quantization matrix is specified, a flat quantization matrix with an equal quantization step size for all components is used.

无损编码部17通过在从量化部16输入的量化数据上执行无损编码处理生成编码流。通过无损编码部17进行的无损编码可以是例如可变长度编码或算术编码。另外,无损编码部17将从模式选择部50输入的关于帧内预测的信息或关于帧间预测的信息多路复用到编码流的头中。无损编码部17然后将这样生成的编码流输出到累积缓存器18。The lossless encoding section 17 generates an encoded stream by performing a lossless encoding process on the quantized data input from the quantization section 16. The lossless encoding performed by the lossless encoding section 17 may be, for example, variable length encoding or arithmetic coding. Furthermore, the lossless encoding section 17 multiplexes information on intra-frame prediction or information on inter-frame prediction input from the mode selection section 50 into the header of the encoded stream. The lossless encoding section 17 then outputs the encoded stream generated in this manner to the accumulation buffer 18.

累积缓存器18使用诸如半导体存储器的存储介质临时缓存从无损编码部17输入的编码流。累积缓存器18然后以根据传输信道的带宽的速率将这样缓存的编码流输出到未示出的传输部(诸如与外围设备的通信接口或与连接接口)。The accumulation buffer 18 temporarily buffers the encoded stream input from the lossless encoding section 17 using a storage medium such as a semiconductor memory. The accumulation buffer 18 then outputs the encoded stream thus buffered to a transmission section (not shown) (such as a communication interface with a peripheral device or a connection interface) at a rate according to the bandwidth of the transmission channel.

速率控制部19监视累积缓存器18中的空闲空间。然后,速率控制部19根据累积缓存器18中的空闲空间生成速率控制信号,并将生成的速率控制信号输出到量化部16。例如,当在累积缓存器18中没有太多空闲空间时,速率控制部19生成用于降低量化数据的比特率的速率控制信号。另外,当在累积缓存器18中存在足够的空闲空间时,速率控制部19生成用于增大量化数据的比特率的速率控制信号。The rate control section 19 monitors the free space in the accumulation buffer 18. The rate control section 19 then generates a rate control signal based on the free space in the accumulation buffer 18 and outputs the generated rate control signal to the quantization section 16. For example, when there is not much free space in the accumulation buffer 18, the rate control section 19 generates a rate control signal for reducing the bit rate of the quantized data. Alternatively, when there is sufficient free space in the accumulation buffer 18, the rate control section 19 generates a rate control signal for increasing the bit rate of the quantized data.

逆量化部21使用量化矩阵对从量化部16输入的量化数据执行逆量化处理。逆量化部21然后将通过逆量化处理获得的变换系数数据输出到逆正交变换部22。The inverse quantization section 21 performs an inverse quantization process using a quantization matrix on the quantized data input from the quantization section 16. The inverse quantization section 21 then outputs transform coefficient data obtained by the inverse quantization process to the inverse orthogonal transform section 22.

逆正交变换部22对从逆量化部21输入的变换系数数据执行逆正交变换处理,从而恢复预测误差数据。然后,逆正交变换部22将恢复的预测误差数据输出到加法部23。The inverse orthogonal transform section 22 performs an inverse orthogonal transform process on the transform coefficient data input from the inverse quantization section 21 , thereby restoring the prediction error data.

加法部23将从逆正交变换部22输入的恢复的预测误差数据与从模式选择部50输入的预测图像数据相加从而生成解码图像数据。然后,加法部23将这样生成的解码图像数据输出到解块滤波器24和帧存储器25。The addition unit 23 generates decoded image data by adding the restored prediction error data input from the inverse orthogonal transform unit 22 to the predicted image data input from the mode selection unit 50. The addition unit 23 then outputs the decoded image data generated in this way to the deblocking filter 24 and the frame memory 25.

解块滤波器24施加滤波以减少在图像编码时生成的块效应(blockingartifact)。解块滤波器24通过对从加法部23输入的解码图像数据滤波而除去块效应,并将经过滤波的解码图像数据输出到帧存储器25。The deblocking filter 24 applies filtering to reduce blocking artifacts generated during image encoding. The deblocking filter 24 removes blocking artifacts by filtering the decoded image data input from the adding unit 23 and outputs the filtered decoded image data to the frame memory 25 .

帧存储器25使用存储介质存储从加法部23输入的解码图像数据和从解块滤波器24输入的滤波之后的解码图像数据。The frame memory 25 stores the decoded image data input from the addition section 23 and the decoded image data after filtering input from the deblocking filter 24 using a storage medium.

选择器26从帧存储器25读取将用于帧内预测的未过滤的解码图像数据,并将所读取的解码图像数据提供到帧内预测部30作为参考图像数据。另外,选择器26从帧存储器25读取将用于帧间预测的经过滤的解码图像数据,并将所读取的解码图像数据提供到运动估计部40作为参考图像数据。The selector 26 reads unfiltered decoded image data to be used for intra prediction from the frame memory 25, and supplies the read decoded image data as reference image data to the intra prediction section 30. In addition, the selector 26 reads filtered decoded image data to be used for inter prediction from the frame memory 25, and supplies the read decoded image data to the motion estimation section 40 as reference image data.

帧内预测部30基于从语法处理部13输入的将被编码的图像数据和通过选择器26提供的解码图像数据,在每个帧内预测模式中执行帧内预测处理。例如,帧内预测部30使用预定成本函数评估每个帧内预测模式的预测结果。然后,帧内预测部30选择产生最小成本函数值的帧内预测模式,即,产生最高压缩比的帧内预测模式,作为最优帧内预测模式。另外,帧内预测部30将关于帧内预测的信息,诸如指示最优帧内预测模式、预测图像数据和成本函数值的预测模式信息,输出到模式选择部50。The intra-frame prediction section 30 performs intra-frame prediction processing in each intra-frame prediction mode based on the image data to be encoded input from the syntax processing section 13 and the decoded image data provided by the selector 26. For example, the intra-frame prediction section 30 evaluates the prediction results of each intra-frame prediction mode using a predetermined cost function. The intra-frame prediction section 30 then selects the intra-frame prediction mode that produces the smallest cost function value, that is, the intra-frame prediction mode that produces the highest compression ratio, as the optimal intra-frame prediction mode. In addition, the intra-frame prediction section 30 outputs information about intra-frame prediction, such as prediction mode information indicating the optimal intra-frame prediction mode, predicted image data, and cost function value, to the mode selection section 50.

运动估计部40基于从语法处理部13输入的将被编码的图像数据和通过选择器26提供的解码图像数据执行帧间预测处理(在各帧之间的预测处理)。例如,运动估计部40使用预定成本函数评估每个预测模式的预测结果。然后,运动估计部40选择产生最小成本函数值的预测模式,即,产生最高压缩比的预测模式,作为最优预测模式。运动估计部40根据最优预测模式生成预测图像数据。运动估计部40将关于帧间预测的信息,诸如指示所述选择的最优帧间预测模式、预测图像数据和成本函数值的预测模式信息,输出到模式选择部50。The motion estimation unit 40 performs inter-frame prediction processing (prediction processing between frames) based on the image data to be encoded input from the syntax processing unit 13 and the decoded image data provided by the selector 26. For example, the motion estimation unit 40 evaluates the prediction results of each prediction mode using a predetermined cost function. The motion estimation unit 40 then selects the prediction mode that produces the smallest cost function value, that is, the prediction mode that produces the highest compression ratio, as the optimal prediction mode. The motion estimation unit 40 generates predicted image data based on the optimal prediction mode. The motion estimation unit 40 outputs information regarding inter-frame prediction, such as prediction mode information indicating the selected optimal inter-frame prediction mode, predicted image data, and cost function value, to the mode selection unit 50.

模式选择部50把从帧内预测部30输入关于帧内预测的成本函数值与从运动估计部40输入的关于帧间预测的成本函数值进行比较。然后,模式选择部50在帧内预测与帧间预测之间选择具有较小成本函数值的预测方法。在选择帧内预测的情况中,模式选择部50将关于帧内预测的信息输出到无损编码部17,并且还将预测图像数据输出到减法部14和加法部23。另外,在选择帧间预测的情况中,模式选择部50将关于上述帧间预测的信息输出到无损编码部17,并且还将预测图像数据输出到减法部14和加法部23。The mode selection section 50 compares the cost function value for intra-frame prediction input from the intra-frame prediction section 30 with the cost function value for inter-frame prediction input from the motion estimation section 40. The mode selection section 50 then selects the prediction method with the smaller cost function value between intra-frame prediction and inter-frame prediction. If intra-frame prediction is selected, the mode selection section 50 outputs information about the intra-frame prediction to the lossless encoding section 17 and also outputs predicted image data to the subtraction section 14 and the addition section 23. If inter-frame prediction is selected, the mode selection section 50 outputs information about the inter-frame prediction to the lossless encoding section 17 and also outputs predicted image data to the subtraction section 14 and the addition section 23.

[1-2.语法处理部的示例配置][1-2. Example Configuration of Syntax Processing Unit]

图2为示出图1所示的图像编码装置10的语法处理部13的具体配置的实例的框图。参考图2,语法处理部13包括设置部110、参数生成部120和插入部130。Fig. 2 is a block diagram showing an example of a specific configuration of the syntax processing section 13 of the image encoding device 10 shown in Fig. 1. Referring to Fig. 2 , the syntax processing section 13 includes a setting section 110, a parameter generating section 120, and an inserting section 130.

(1)设置部(1) Setting department

设置部110保存用于通过图像编码装置10进行的编码处理的各个设置。例如,设置部110保存用于图像数据中的每个序列的概要、用于每个图片的编码模式、关于GOP结构的数据等。另外,在该实施例中,设置部110保存关于由量化部16(和逆量化部21)使用的量化矩阵的设置。量化部16应使用哪个量化矩阵的问题对于每个片段可以是预定的,通常基于离线图像分析。The settings section 110 stores various settings for the encoding process performed by the image encoding device 10. For example, the settings section 110 stores information about the profile for each sequence in the image data, the encoding mode for each picture, data regarding the GOP structure, and the like. Furthermore, in this embodiment, the settings section 110 stores settings regarding the quantization matrix used by the quantization section 16 (and the inverse quantization section 21). The quantization matrix used by the quantization section 16 can be predetermined for each slice, typically based on offline image analysis.

例如,在诸如数字视频相机的示例应用中,在输入图像中不存在压缩伪像,从而即使在高范围中,仍可使用具有减小量化步长的量化矩阵。量化矩阵以图片为单位或以帧为单位变化。在具有低复杂性的输入图像的情况中,使用具有较小量化步长的扁平量化矩阵使得可以改善用户主观感知的图像质量。另一方面,在具有高复杂性的输入图像的情况中,期望使用更大量化步长以抑制速率的增大。在该情况中,使用扁平量化矩阵具有将低范围信号中的伪像识别为块噪声的可能。为此,有利地是,通过使用其中量化步长从低范围增大到高范围的量化矩阵而减少噪声。For example, in an example application such as a digital video camera, there are no compression artifacts in the input image, so a quantization matrix with a reduced quantization step size can be used even in the high range. The quantization matrix changes on a picture-by-picture or frame-by-frame basis. In the case of an input image with low complexity, the use of a flat quantization matrix with a smaller quantization step size makes it possible to improve the image quality perceived by the user. On the other hand, in the case of an input image with high complexity, it is desirable to use a larger quantization step size to suppress the increase in rate. In this case, the use of a flat quantization matrix has the potential to identify artifacts in the low range signal as block noise. For this reason, it is advantageous to reduce noise by using a quantization matrix in which the quantization step size increases from the low range to the high range.

在诸如再压缩以MPEG-2编码的广播内容的记录器的示例应用中,在输入图像自身中存在诸如蚊式噪声的MPEG-2压缩伪像。蚊式噪声是以较大量化步长量化高范围信号的结果生成的噪声,该噪声的频率分量自身变为极高频率。当再压缩这些输入图像时,期望地是,使用具有在高范围中的大量化步长的量化矩阵。另外,相比于逐行信号,在隔行信号中,由于隔行扫描的影响,在水平方向中的信号的相关性高于在垂直方向中的信号的相关性。为此,还有利地是,根据图像信号是逐行信号还是隔行信号使用不同的量化矩阵。在两种情况中,根据图像内容,最优量化矩阵可以以图片为单位或以帧为单位变化。In an example application such as a recorder for recompressing broadcast content encoded in MPEG-2, MPEG-2 compression artifacts such as mosquito noise are present in the input image itself. Mosquito noise is noise generated as a result of quantizing a high-range signal with a large quantization step size, and the frequency components of this noise themselves become extremely high frequencies. When recompressing these input images, it is desirable to use a quantization matrix with a large quantization step size in the high range. In addition, compared to progressive signals, in interlaced signals, the correlation of the signals in the horizontal direction is higher than the correlation of the signals in the vertical direction due to the influence of interlaced scanning. For this reason, it is also advantageous to use different quantization matrices depending on whether the image signal is a progressive signal or an interlaced signal. In both cases, the optimal quantization matrix can be varied on a picture-by-picture basis or on a frame-by-frame basis, depending on the image content.

(2)参数生成部(2) Parameter generation unit

参数生成部120生成由设置部110保存的定义用于编码处理的设置的参数,并将生成的参数输出到插入部130。The parameter generation section 120 generates parameters that define settings for the encoding process held by the setting section 110 , and outputs the generated parameters to the insertion section 130 .

例如,在该实施例中,参数生成部120生成用于定义将由量化部16使用的量化矩阵的量化矩阵参数。通过参数生成部120生成的一组量化矩阵参数被包括在量化矩阵参数集(QMPS)中。每个QMPS被分配QMPS ID,该QMPS ID为用于将各个QMPS相互区分的标识符。通常,将多种量化矩阵定义到一个QMPS中。通过矩阵尺寸以及对应的预测方法和信号分量一起相互区分量化矩阵的种类。例如,可以在一个QMPS内对于尺寸4×4、8×8、16×16和32×32中的每个定义最多六种量化矩阵(帧内预测/帧间预测中的Y/Cb/Cr分量)。For example, in this embodiment, the parameter generation unit 120 generates quantization matrix parameters for defining the quantization matrix to be used by the quantization unit 16. A set of quantization matrix parameters generated by the parameter generation unit 120 is included in a quantization matrix parameter set (QMPS). Each QMPS is assigned a QMPS ID, which is an identifier for distinguishing each QMPS from another. Typically, multiple quantization matrices are defined in one QMPS. The types of quantization matrices are distinguished from each other by the matrix size, as well as the corresponding prediction method and signal component. For example, up to six quantization matrices (Y/Cb/Cr components in intra-frame prediction/inter-frame prediction) can be defined within one QMPS for each of the sizes 4×4, 8×8, 16×16, and 32×32.

更具体是,参数生成部120可利用Z形扫描将每个量化矩阵转换为线性阵列(linear array),并与H.264/AVC中的量化矩阵编码处理类似地,以差分脉码调制(DPCM)格式对线性阵列中的每个元素的值进行编码。在该情况中,DPCM差分数据的线性阵列变为量化矩阵参数。在该说明中,将该用于生成量化矩阵参数的模式表示为全扫描模式。More specifically, the parameter generation unit 120 may convert each quantization matrix into a linear array using zigzag scanning, and encode the value of each element in the linear array using differential pulse code modulation (DPCM) format, similar to the quantization matrix encoding process in H.264/AVC. In this case, the linear array of DPCM differential data becomes the quantization matrix parameters. In this description, this mode for generating quantization matrix parameters is referred to as the full scan mode.

另外,参数生成部120还可以在不同于全扫描模式的模式生成量化矩阵参数,以减少量化矩阵参数的代码量。例如,代替全扫描模式,参数生成部120还可以在下述的复制模式或轴指定模式中生成量化矩阵参数。In addition, the parameter generation unit 120 may generate quantization matrix parameters in a mode other than the full scan mode to reduce the amount of code for the quantization matrix parameters. For example, instead of the full scan mode, the parameter generation unit 120 may generate quantization matrix parameters in the copy mode or axis designation mode described below.

复制模式是可以在这样的情况中选择的模式,其中用于给定片段的量化矩阵近似于或等于已经定义的量化矩阵。在复制模式的情况中,参数生成部120包括QMPS的QMPS ID,其中在QMPS中定义复制源量化矩阵以及复制源量化矩阵的尺寸和类别作为量化矩阵参数。注意,在该说明书中,对预测方法与对应于给定量化矩阵的信号分量的组合指定该量化矩阵的类别。在其中在将定义的量化矩阵与复制源量化矩阵之间存在差异的情况中,参数生成部120还可以包括残差数据,用于生成表示QMPS中的每个分量的残差的残差矩阵。The copy mode is a mode that can be selected in situations where the quantization matrix for a given segment is similar to or equal to an already defined quantization matrix. In the copy mode, the parameter generation unit 120 includes the QMPS ID of the QMPS, in which the copy source quantization matrix, as well as the size and type of the copy source quantization matrix, are defined as quantization matrix parameters. Note that in this specification, the type of quantization matrix is specified for the combination of a prediction method and a signal component corresponding to a given quantization matrix. In the case where there is a difference between the defined quantization matrix and the copy source quantization matrix, the parameter generation unit 120 may also include residual data for generating a residual matrix representing the residual error for each component in the QMPS.

用于轴指定模式的处理还根据两种指定方法被进一步划分:差分法和内插法。通过差分法,参数生成部120仅指定量化矩阵中的对应于最左列的垂直轴、最上一行的水平轴、以及沿变换单元的对角线的对角轴的元素的值。通过内插法,参数生成部120仅指定量化矩阵中对应于位于变换单元的左上(DC分量)、右上、左下和右下四个角的的元素的值。剩余元素的值可通过诸如线性内插、三次内插或Lagrange内插的任意技术内插。同样,在轴指定模式中,在其中在将定义的量化矩阵与内插的量化矩阵之间存在差异的情况中,参数生成部120还可以包括残差数据,用于生成表示QMPS中的每个分量的残差的残差矩阵。The processing for the axis designation mode is further divided according to two designation methods: the difference method and the interpolation method. In the difference method, the parameter generation unit 120 only specifies the values of the elements in the quantization matrix corresponding to the vertical axis of the leftmost column, the horizontal axis of the top row, and the diagonal axes along the diagonal lines of the transform unit. In the interpolation method, the parameter generation unit 120 only specifies the values of the elements in the quantization matrix corresponding to the four corners located at the top left (DC component), top right, bottom left, and bottom right of the transform unit. The values of the remaining elements can be interpolated using any technique such as linear interpolation, cubic interpolation, or Lagrange interpolation. Similarly, in the axis designation mode, in the case where there is a difference between the defined quantization matrix and the interpolated quantization matrix, the parameter generation unit 120 may also include residual data for generating a residual matrix representing the residual of each component in the QMPS.

(3)插入部(3) Insertion

插入部130将诸如分别包括通过参数生成部120生成的参数组的SPS、PPS、QMPS的头信息和片段头插入到从记录缓存器12输入的图像数据的流中。如前文所述,QMPS是不同于SPS和PPS的非VCL NAL单元。QMPS包括通过参数生成部120生成的量化矩阵参数。插入部130然后将具有插入的头信息的图像数据流输出到减法部14、帧内预测部30和运动估计部40。The insertion unit 130 inserts header information such as the SPS, PPS, and QMPS, each including a parameter group generated by the parameter generation unit 120, and a slice header into the stream of image data input from the recording buffer 12. As previously described, the QMPS is a non-VCL NAL unit different from the SPS and PPS. The QMPS includes the quantization matrix parameters generated by the parameter generation unit 120. The insertion unit 130 then outputs the image data stream with the inserted header information to the subtraction unit 14, the intra-frame prediction unit 30, and the motion estimation unit 40.

[1-3.示例参数结构][1-3. Example parameter structure]

(1)量化矩阵参数集(1) Quantization matrix parameter set

图3为示出在该实施例中的每个QMPS中包括的示例参数的说明图。参考图3,每个QMPS包括“QMPS ID”、“生成模式存在标记”、“生成模式”、以及用于每个模式的不同量化矩阵参数。Fig. 3 is an explanatory diagram showing example parameters included in each QMPS in this embodiment. Referring to Fig. 3 , each QMPS includes "QMPS ID," "generation mode presence flag," "generation mode," and different quantization matrix parameters for each mode.

“QMPS ID”为用于将各个QMPS相互区分的标识符。QMPS ID可以是在从0到31等范围中的整数。指定未使用的QMPS ID表示将定义新的QMPS。重新指定在序列中已经使用的QMPS ID表示要更新已经定义的QMPS。"QMPS ID" is an identifier used to distinguish each QMPS from another. The QMPS ID can be an integer in the range of 0 to 31. Specifying an unused QMPS ID indicates that a new QMPS will be defined. Re-specifying a QMPS ID already in use in a sequence indicates that the already defined QMPS will be updated.

“生成模式存在标记”为指示“生成模式”(其是表示量化矩阵生成处理的模式的分类)是否存在于QMPS中的标记。在生成模式存在标记指示“0:不存在”的情况中,在QMPS中以全扫描模式定义量化矩阵。同时,在生成模式存在标记指示“1:存在”的情况中,在QMPS中存在“生成模式”。The "generation mode presence flag" is a flag indicating whether the "generation mode" (which is a classification indicating the mode of the quantization matrix generation process) is present in the QMPS. If the generation mode presence flag indicates "0: not present," the quantization matrix is defined in the full scan mode in the QMPS. Meanwhile, if the generation mode presence flag indicates "1: present," the "generation mode" is present in the QMPS.

“生成模式”是可以取例如“0:复制”、“1:轴指定”或“2:全扫描”中的任意值的分类。在下文的语法伪码中,通过称为“pred_mode”的变量表示生成模式。The "generation mode" is a classification that can take any value, for example, "0: copy," "1: axis specification," or "2: full scan." In the following pseudo-grammar, the generation mode is represented by a variable called "pred_mode."

在复制模式(即,pred_mode=0)的情况中,QMPS可包括“源ID”、“复制源尺寸”、“复制源类别”、“残差标记”以及“残差数据”作为量化矩阵参数。“源ID”为指定其中定义复制源量化矩阵的QMPS的QMPS ID。“复制源尺寸”为复制源量化矩阵的尺寸。“复制源类别”为复制源量化矩阵的类别(帧内-Y、帧内-Cb、...、帧内-Cr)。“残差标记”为指示是否存在残差的标记。“残差数据”是用于在存在残差的情况中生成表示残差的残差矩阵的数据。在残差标记指示“0:不存在”的情况中可以省略残差数据。In the case of copy mode (i.e., pred_mode=0), the QMPS may include "Source ID," "Copy Source Size," "Copy Source Category," "Residual Flag," and "Residual Data" as quantization matrix parameters. "Source ID" is the QMPS ID that specifies the QMPS in which the copy source quantization matrix is defined. "Copy Source Size" is the size of the copy source quantization matrix. "Copy Source Category" is the category of the copy source quantization matrix (Intra-Y, Intra-Cb, ..., Intra-Cr). "Residual Flag" is a flag indicating whether a residual exists. "Residual Data" is data used to generate a residual matrix representing the residual when a residual exists. Residual data can be omitted when the residual flag indicates "0: not present."

注意,可以将其中给定QMPS的源ID等于QMPS自身的QMPS ID的情况解释为指定由图38中的实例示出的默认量化矩阵等。通过这样,由于在QMPS中不再需要包括用于指定默认量化矩阵的独立标记,可以减少QMPS的代码量。Note that the case where the source ID of a given QMPS is equal to the QMPS ID of the QMPS itself can be interpreted as specifying the default quantization matrix shown by the example in FIG38 , etc. By doing so, since it is no longer necessary to include an independent flag for specifying the default quantization matrix in the QMPS, the code amount of the QMPS can be reduced.

在轴指定模式(即,pred_mode=1)的情况中,QMPS可包括“指定方法标记”、以及“参考轴数据”或“角数据”、“残差标记”以及“残差数据”作为量化矩阵参数。指定方法标记为这样的标记,其指示如何沿参考轴指定用作用于生成量化矩阵的参考的元素的值,并且该标记可以取例如“0:差分”或“1:内插”的值。在指定方法为“0:差分”的情况中,通过参考轴数据指定对应于量化矩阵的参考轴(这些为垂直轴、水平轴和对角线轴)的元素值。在指定方法为“1:内插”的情况中,由角数据指定对应于位于量化矩阵的左上、右上、左下和右下的四个角的元素的值。可通过从这四个角的值内插生成在三个参考轴上的元素的值。残差标记和残差数据与复制模式的情况相似。In the case of axis designation mode (i.e., pred_mode=1), the QMPS may include a "designation method flag", as well as "reference axis data" or "corner data", a "residual flag", and "residual data" as quantization matrix parameters. The designation method flag is a flag that indicates how the value of the element used as a reference for generating the quantization matrix is specified along the reference axis, and the flag can take a value of, for example, "0: differential" or "1: interpolation". In the case where the designation method is "0: differential", the element values corresponding to the reference axes of the quantization matrix (these are the vertical axis, horizontal axis, and diagonal axis) are specified by the reference axis data. In the case where the designation method is "1: interpolation", the values of the elements corresponding to the four corners located at the upper left, upper right, lower left, and lower right of the quantization matrix are specified by the corner data. The values of the elements on the three reference axes can be generated by interpolating from the values of these four corners. The residual flag and residual data are similar to those in the case of the copy mode.

在全扫描模式的情况中(即,pred_mode=2),QMPS可包括DPCM差分数据的线性阵列作为量化矩阵参数。In the case of full scan mode (ie, pred_mode=2), the QMPS may include a linear array of DPCM difference data as quantization matrix parameters.

注意,每个QMPS可包括对于每类量化矩阵不同的生成模式和对应于每种模式的量化矩阵参数。换句话说,作为实例,在单个QMPS中,可以以全扫描模式定义给定类别的量化矩阵,可以以轴指定模式定义另一类量化矩阵,以及可以以复制模式定义剩余的量化矩阵。Note that each QMPS may include different generation modes for each type of quantization matrix and quantization matrix parameters corresponding to each mode. In other words, as an example, in a single QMPS, a quantization matrix for a given class may be defined in full scan mode, a quantization matrix for another class may be defined in axis designation mode, and the remaining quantization matrices may be defined in copy mode.

(2)片段头(2) Fragment header

图4为部分地示出在该实施例中的每个片段头中包括的示例参数的说明图。参考图4,每个片段头可包括“片段类别”、“PPS ID”、“QMPS ID存在标记”、以及“QMPS ID”。“片段类别”是指示片段的编码类别的分类,并且取对应于P片段、B片段或I片段等的值。“PPS ID”是对所述片段引用的图片参数集(PPS)的ID。“QMPS ID存在标记”是指示在片段头中是否存在QMPS ID的标记。“QMPS ID”是对所述片段引用的量化矩阵参数集(QMPS)的QMPS ID。FIG4 is an explanatory diagram partially illustrating example parameters included in each slice header in this embodiment. Referring to FIG4 , each slice header may include a "slice category," a "PPS ID," a "QMPS ID presence flag," and a "QMPS ID." The "slice category" is a classification indicating the encoding category of the slice and takes values corresponding to, for example, a P-slice, a B-slice, or an I-slice. The "PPS ID" is the ID of the picture parameter set (PPS) referenced for the slice. The "QMPS ID presence flag" is a flag indicating whether a QMPS ID is present in the slice header. The "QMPS ID" is the QMPS ID of the quantization matrix parameter set (QMPS) referenced for the slice.

<2.在根据实施例的编码期间的处理流程><2. Processing Flow During Encoding According to the Embodiment>

(1)参数集插入处理(1) Parameter set insertion processing

图5为示出根据本实施例的通过语法处理部13的插入部130的参数集插入处理的示例流程的流程图。FIG5 is a flowchart showing an example flow of a parameter set insertion process by the inserting section 130 of the syntax processing section 13 according to the present embodiment.

参考图5,插入部130首先连续获取在从记录缓存器12输入的图像数据流中的NAL单元,并识别单个图片(步骤S100)。接着,插入部130确定识别的图片是否是序列的先头图片(步骤S102)。在这里,在识别图片是序列的先头图片的情况中,插入部130将SPS插入到流中(步骤S104)。下一步,插入部130还确定是否存在用于识别的图片的PPS中的变化(步骤S106)。在这里,在PPS中存在变化的情况中、或者在识别图片是序列的先头图片的情况中,插入部130将PPS插入到流中(步骤S108)。下一步,插入部130还确定在QMPS中是否存在变化(步骤S110)。在这里,在QMPS中存在变化的情况中、或者在识别图片是序列的先头图片的情况中,插入部130将QMPS插入到流中(步骤S112)。之后,插入部130在检测到流的结束的情况中结束该处理。另一方面,在流未结束的情况中,插入部130对下一个图片重复上述处理(步骤S114)。Referring to Figure 5 , the insertion unit 130 first continuously retrieves NAL units from the image data stream input from the recording buffer 12 and identifies a single picture (step S100 ). Next, the insertion unit 130 determines whether the identified picture is the leading picture of the sequence (step S102 ). If the identified picture is the leading picture of the sequence, the insertion unit 130 inserts the SPS into the stream (step S104 ). Next, the insertion unit 130 also determines whether there is a change in the PPS for the identified picture (step S106 ). If there is a change in the PPS, or if the identified picture is the leading picture of the sequence, the insertion unit 130 inserts the PPS into the stream (step S108 ). Next, the insertion unit 130 also determines whether there is a change in the QMPS (step S110 ). If there is a change in the QMPS, or if the identified picture is the leading picture of the sequence, the insertion unit 130 inserts the QMPS into the stream (step S112 ). After that, if the insertion unit 130 detects the end of the stream, the insertion unit 130 ends the process. On the other hand, if the stream has not ended, the insertion unit 130 repeats the above process for the next picture (step S114).

注意,尽管该流程图为了简化仅示出了对SPS、PPS、QMPS的插入,插入部130还可以在流中插入其它头信息,诸如补充增强信息(SEI)和片段头。Note that although this flowchart only shows the insertion of SPS, PPS, and QMPS for simplicity, the inserting unit 130 may also insert other header information such as supplemental enhancement information (SEI) and slice headers into the stream.

(2)对流结构的描述(2) Description of convection structure

图6为用于说明在根据该实施例的技术与现有技术之间的流结构中的差异的说明图。FIG. 6 is an explanatory diagram for explaining the difference in flow structure between the technology according to this embodiment and the related art.

图6的左侧示出流ST1作为根据现有技术生成的实例。由于流ST1的开始是序列的开始,在流ST1的开始插入第一SPS(1)和第一PPS(1)。可以在SPS(1)和PPS(1)中可以定义一个或多个量化矩阵。然后,假设在若干后续片段头和片段数据之后变得需要更新量化矩阵。从而,将第二PPS(2)插入到流ST1中。PPS(2)还包括除量化矩阵参数之外的参数。然后,假设在若干后续片段头和片段数据之后变得需要更新PPS。从而,将第三PPS(3)插入到流ST1中。PPS(3)也包括量化矩阵参数。利用在由片段头中的PPS ID指定的PPS中定义的量化矩阵进行用于随后片段的量化处理(和逆量化处理)。The left side of FIG6 shows a stream ST1 as an example generated according to the prior art. Since the start of the stream ST1 is the start of a sequence, the first SPS (1) and the first PPS (1) are inserted at the start of the stream ST1. One or more quantization matrices can be defined in the SPS (1) and the PPS (1). Then, it is assumed that it becomes necessary to update the quantization matrix after several subsequent slice headers and slice data. Thus, the second PPS (2) is inserted into the stream ST1. The PPS (2) also includes parameters other than the quantization matrix parameters. Then, it is assumed that it becomes necessary to update the PPS after several subsequent slice headers and slice data. Thus, the third PPS (3) is inserted into the stream ST1. The PPS (3) also includes quantization matrix parameters. The quantization process (and inverse quantization process) for the subsequent slices is performed using the quantization matrix defined in the PPS specified by the PPS ID in the slice header.

图6的右侧示出流ST2作为根据该实施例的上述技术生成的实例。由于流ST2的开始是序列的开始,在流ST2的开始插入第一SPS(1)、第一PPS(1)和第一QMPS(1)。在流ST2中,可以在QMPS(1)中定义一个或多个量化矩阵。流ST2中的PPS(1)和QMPS(1)的长度之和近似等于流ST1中的PPS(1)的长度。接着,如果在若干后续片段头和片段数据之后变得需要更新量化矩阵,将第二QMPS(2)插入到流ST2中。由于QMPS(2)不包含除量化矩阵参数之外的参数,QMPS(2)的长度短于流ST1中的PPS(2)的长度。接着,如果在若干后续片段头和片段数据之后变得需要更新PPS,将第二PPS(2)插入到流ST2中。由于流ST2中的PPS(2)不包含量化矩阵参数,流ST2中的PPS(2)的长度短于流ST1中的PPS(3)的长度。利用在由片段头中的QMPSID指定的QMPS中定义的量化矩阵进行用于随后片段的量化处理(和逆量化处理)。The right side of FIG6 shows a stream ST2 as an example generated according to the above-described technique of this embodiment. Since the beginning of the stream ST2 is the beginning of the sequence, the first SPS (1), the first PPS (1), and the first QMPS (1) are inserted at the beginning of the stream ST2. In the stream ST2, one or more quantization matrices can be defined in the QMPS (1). The sum of the lengths of the PPS (1) and the QMPS (1) in the stream ST2 is approximately equal to the length of the PPS (1) in the stream ST1. Then, if it becomes necessary to update the quantization matrix after several subsequent slice headers and slice data, a second QMPS (2) is inserted into the stream ST2. Since the QMPS (2) does not contain parameters other than the quantization matrix parameters, the length of the QMPS (2) is shorter than the length of the PPS (2) in the stream ST1. Then, if it becomes necessary to update the PPS after several subsequent slice headers and slice data, a second PPS (2) is inserted into the stream ST2. Since PPS(2) in stream ST2 does not include quantization matrix parameters, the length of PPS(2) in stream ST2 is shorter than the length of PPS(3) in stream ST1. Quantization processing (and inverse quantization processing) for subsequent slices is performed using the quantization matrix defined in the QMPS specified by the QMPSID in the slice header.

图6中对流ST1和ST2的比较示出,通过在本发明实施例中所述的技术可以减少流总体的代码量。特别是,在具有较大尺寸的量化矩阵的情况中,或者在其中对每个图片定义更多量化矩阵的情况中,通过上述技术能更有效地减少代码量。FIG6 shows a comparison of streams ST1 and ST2, showing that the technique described in the embodiment of the present invention can reduce the overall code size of the stream. In particular, in the case of a large-scale quantization matrix or in the case of defining more quantization matrices for each picture, the above technique can more effectively reduce the code size.

<3.根据实施例的图像解码装置的示例配置><3. Example Configuration of Image Decoding Device According to Embodiment>

[3-1.示例总体配置][3-1. Example overall configuration]

这部分描述根据实施例的图像解码装置的示例配置。This section describes an example configuration of an image decoding device according to an embodiment.

[3-1.示例总体配置][3-1. Example overall configuration]

图7为示出根据实施例的图像解码装置60的示例结构的框图。参考图7,图像解码装置60被配置有:语法处理部61、无损解码部62、逆量化部63、逆正交变换部64、加法部65、解块滤波器66、记录缓存器67、数字至模拟(D/A)转换部68、帧存储器69、选择器70和71、帧内预测部80、以及运动补偿部90。FIG7 is a block diagram showing an example structure of an image decoding device 60 according to an embodiment. Referring to FIG7 , the image decoding device 60 is configured with a syntax processing section 61, a lossless decoding section 62, an inverse quantization section 63, an inverse orthogonal transform section 64, an addition section 65, a deblocking filter 66, a recording buffer 67, a digital-to-analog (D/A) conversion section 68, a frame memory 69, selectors 70 and 71, an intra-frame prediction section 80, and a motion compensation section 90.

语法处理部61从经传输信道输入的编码流获取诸如SPS、PPS、QMPS和片段头的头信息,并基于获取的头信息识别用于由图像解码装置60进行解码处理的各个设置。例如,在该实施例中,语法处理部61基于QMPS中包括的量化矩阵参数设置将在逆量化部63的逆量化处理期间使用的量化矩阵。后面将进一步描述语法处理部61的具体配置。The syntax processing section 61 acquires header information such as the SPS, PPS, QMPS, and slice header from the encoded stream input via the transmission channel, and identifies, based on the acquired header information, various settings for the decoding process performed by the image decoding device 60. For example, in this embodiment, the syntax processing section 61 sets the quantization matrix to be used during the inverse quantization process of the inverse quantization section 63 based on the quantization matrix parameters included in the QMPS. The specific configuration of the syntax processing section 61 will be further described later.

无损解码部62根据在编码时使用的编码方法对从语法处理部61输入的编码流进行解码。无损解码部62然后将解码的量化数据输出到逆量化部63。另外,无损解码部62将在头信息中包括的关于帧内预测的信息输出到帧内预测部80,并将关于帧间预测的信息输出到运动补偿部90。The lossless decoding section 62 decodes the encoded stream input from the syntax processing section 61 according to the encoding method used at the time of encoding. The lossless decoding section 62 then outputs the decoded quantized data to the inverse quantization section 63. In addition, the lossless decoding section 62 outputs information on intra-frame prediction included in the header information to the intra-frame prediction section 80, and outputs information on inter-frame prediction to the motion compensation section 90.

逆量化部63使用通过语法处理部61设置的量化矩阵以逆量化通过无损解码部62解码的量化数据(即,量化的变换系数数据)。可以根据在片段头中指定的QMPS ID、每个块(变换单元)的尺寸、用于每个块的预测方法和信号分量确定对于给定片段中的每个块应使用哪个量化矩阵的问题。The inverse quantization section 63 inversely quantizes the quantized data (i.e., quantized transform coefficient data) decoded by the lossless decoding section 62 using the quantization matrix set by the syntax processing section 61. The question of which quantization matrix should be used for each block in a given slice can be determined based on the QMPS ID specified in the slice header, the size of each block (transform unit), the prediction method used for each block, and the signal component.

逆正交变换部64通过根据在编码时使用的正交变换方法对从逆量化部63输入的变换系数数据执行逆正交变换而生成预测误差数据。然后,逆正交变换部64将生成的预测误差数据输出到加法部65。The inverse orthogonal transform section 64 generates prediction error data by performing inverse orthogonal transform on the transform coefficient data input from the inverse quantization section 63 according to the orthogonal transform method used at the time of encoding.

加法部65将从逆正交变换部64输入的预测误差数据与从选择器71输入的预测图像数据相加从而生成解码图像数据。然后,加法部65将所生成的解码图像数据输出到解块滤波器66和帧存储器69。The addition section 65 generates decoded image data by adding the prediction error data input from the inverse orthogonal transform section 64 and the predicted image data input from the selector 71 . The addition section 65 then outputs the generated decoded image data to the deblocking filter 66 and the frame memory 69 .

解块滤波器66通过对从加法部65输入的解码图像数据滤波而除去块效应,并将经过滤波的解码图像数据输出到记录缓存器67和帧存储器69。The deblocking filter 66 removes blocking artifacts by filtering the decoded image data input from the adding section 65 , and outputs the filtered decoded image data to the recording buffer 67 and the frame memory 69 .

记录缓存器67通过记录从解块滤波器66输入的图像生成图像数据的时序序列。然后,记录缓存器67将生成的图像数据输出到D/A转换部68。The recording buffer 67 generates a time-series sequence of image data by recording the image input from the deblocking filter 66. Then, the recording buffer 67 outputs the generated image data to the D/A conversion section 68.

D/A转换部68将从记录缓存器67输入的数字格式的图像数据转换为模拟格式的图像信号。然后,D/A转换部68通过将模拟图像信号输出到例如连接到图像解码装置60的显示器(未示出)而使得图像被显示。The D/A converter 68 converts the digital image data input from the recording buffer 67 into an analog image signal. The D/A converter 68 then displays an image by outputting the analog image signal to a display (not shown) connected to the image decoding device 60, for example.

帧存储器69使用存储介质存储从加法部65输入的未滤波的解码图像数据和从解块滤波器66输入的经滤波的解码图像数据。The frame memory 69 stores the unfiltered decoded image data input from the addition section 65 and the filtered decoded image data input from the deblocking filter 66 using a storage medium.

选择器70根据通过无损解码部62获取的模式信息对图像中的每个块在帧内预测部80与运动补偿部90之间切换来自帧存储器69的图像数据的输出目的地。例如,在其中指定帧内预测模式的情况中,选择器70将从帧存储器69提供的未滤波的解码图像数据输出到帧内预测部80作为参考图像数据。另外,在其中指定帧间预测模式的情况中,选择器70将从帧存储器69提供的滤波的解码图像数据输出到运动补偿部90作为参考图像数据。The selector 70 switches the output destination of the image data from the frame memory 69 between the intra prediction section 80 and the motion compensation section 90 for each block in the image according to the mode information acquired by the lossless decoding section 62. For example, in the case where the intra prediction mode is specified, the selector 70 outputs the unfiltered decoded image data supplied from the frame memory 69 to the intra prediction section 80 as reference image data. In addition, in the case where the inter prediction mode is specified, the selector 70 outputs the filtered decoded image data supplied from the frame memory 69 to the motion compensation section 90 as reference image data.

选择器71根据通过无损解码部62获取的模式信息对图像中的每个块在帧内预测部80与运动补偿部90之间切换将提供给加法部65的预测图像数据的输出源。例如,在其中指定帧内预测模式的情况中,选择器71把从帧内预测部80输出的预测图像数据提供给加法部65。在其中指定帧间预测模式的情况中,选择器71把从运动补偿部90输出的预测图像数据提供给加法部65。The selector 71 switches the output source of the predicted image data to be supplied to the addition section 65 between the intra prediction section 80 and the motion compensation section 90 for each block in the image according to the mode information acquired by the lossless decoding section 62. For example, in the case where the intra prediction mode is specified, the selector 71 supplies the predicted image data output from the intra prediction section 80 to the addition section 65. In the case where the inter prediction mode is specified, the selector 71 supplies the predicted image data output from the motion compensation section 90 to the addition section 65.

帧内预测部80基于从无损解码部62输入的关于帧内预测的信息和来自帧存储器69的参考图像数据执行对像素值的图片内预测,并生成预测图像数据。然后,帧内预测部80将所生成的预测图像数据输出给选择器71。The intra prediction section 80 performs intra-picture prediction on pixel values based on the information on intra prediction input from the lossless decoding section 62 and the reference image data from the frame memory 69 and generates predicted image data. The intra prediction section 80 then outputs the generated predicted image data to the selector 71.

运动补偿部90基于从无损解码部62输入的关于帧间预测的信息和来自帧存储器69的参考图像数据执行运动补偿处理,并生成预测图像数据。然后,运动补偿部90将所生成的预测图像数据输出给选择器71。The motion compensation section 90 performs motion compensation processing based on the information on inter prediction input from the lossless decoding section 62 and the reference image data from the frame memory 69 and generates predicted image data. The motion compensation section 90 then outputs the generated predicted image data to the selector 71.

[3-2.语法处理部的示例配置][3-2. Example Configuration of Syntax Processing Unit]

图8为示出图7所示的图像解码装置60的语法处理部61的具体配置的实例的框图。参考图8,语法处理部61包括参数获取部160和设置部170。Fig. 8 is a block diagram showing an example of a specific configuration of the syntax processing section 61 of the image decoding device 60 shown in Fig. 7. Referring to Fig. 8 , the syntax processing section 61 includes a parameter acquisition section 160 and a setting section 170.

(1)参数获取部(1) Parameter acquisition unit

参数获取部160从图像数据流识别诸如SPS、PPS、QMPS和片段头的头信息,并获取头信息中包括的参数。例如,在该实施例中,参数获取部160从QMPS获取定义量化矩阵的量化矩阵参数。如前文所述,QMPS是不同于SPS和PPS的非VCL NAL单元。参数获取部160然后将获取的参数输出到设置部170。参数获取部160还将图像数据流输出到无损解码部62。The parameter acquisition unit 160 identifies header information such as the SPS, PPS, QMPS, and slice header from the image data stream and acquires parameters included in the header information. For example, in this embodiment, the parameter acquisition unit 160 acquires quantization matrix parameters that define the quantization matrix from the QMPS. As previously mentioned, the QMPS is a non-VCL NAL unit, unlike the SPS and PPS. The parameter acquisition unit 160 then outputs the acquired parameters to the setup unit 170. The parameter acquisition unit 160 also outputs the image data stream to the lossless decoding unit 62.

(2)设置部(2) Setting department

设置部170基于通过参数获取部160获取的参数应用用于图7所示的每个部中的处理的设置。例如,设置部170从LCU和SCU值对识别编码单位尺寸的范围,同时还根据split_flag的值设置编码单位尺寸。通过把这里的编码单位作为处理单位进行图像数据的解码。另外,设置部170还设置变换单位尺寸。通过把这里的变换单位设置作为处理单位,进行上述通过逆量化部63的逆量化和通过逆正交变换部64的逆正交变换。The setting unit 170 applies settings for the processing in each unit shown in FIG. 7 based on the parameters acquired by the parameter acquisition unit 160. For example, the setting unit 170 identifies the range of coding unit sizes from the LCU and SCU value pairs and also sets the coding unit size based on the value of split_flag. Image data is decoded using these coding units as processing units. Furthermore, the setting unit 170 also sets the transform unit size. Using these transform units as processing units, the inverse quantization performed by the inverse quantization unit 63 and the inverse orthogonal transform performed by the inverse orthogonal transform unit 64 are performed.

另外,在该实施例中,设置部170基于通过参数获取部160从QMPS获取的量化矩阵参数设置量化矩阵。更具体是,基于QMPS中包括的量化参数,设置部170生成分别在全扫描模式、复制模式和轴指定模式中尺寸和类别彼此不同的多个量化矩阵。可以在每次在图像数据流中检测到QMPS时进行量化矩阵的生成。In this embodiment, the setting unit 170 sets the quantization matrix based on the quantization matrix parameters acquired from the QMPS by the parameter acquisition unit 160. More specifically, based on the quantization parameters included in the QMPS, the setting unit 170 generates a plurality of quantization matrices having different sizes and types in the full scan mode, the copy mode, and the axis designation mode. Quantization matrix generation can be performed each time a QMPS is detected in the image data stream.

例如,在全扫描模式中,设置部170按照DPCM格式解码在量化矩阵参数中包括的差分数据的线性阵列。设置部170然后根据Z形扫描的扫描图案将解码的线性阵列转换为二维量化矩阵。For example, in full scan mode, the setting section 170 decodes the linear array of differential data included in the quantization matrix parameters in DPCM format and then converts the decoded linear array into a two-dimensional quantization matrix according to a zigzag scan pattern.

另外,在复制模式中,设置部170复制通过在量化矩阵参数中包括的源ID、复制源尺寸和复制源类型指定的(在前生成的)量化矩阵。这里,在新的量化矩阵的尺寸小于复制源量化矩阵的尺寸的情况中,设置部170通过减少复制量化矩阵中的元素生成新的量化矩阵。另外,在新的量化矩阵的尺寸大于复制源量化矩阵的尺寸的情况中,设置部170通过内插复制量化矩阵中的元素生成新的量化矩阵。然后,在存在残差分量的情况中,设置部170将残差分量加到新的量化矩阵。In copy mode, the setup unit 170 copies the (previously generated) quantization matrix specified by the source ID, copy source size, and copy source type included in the quantization matrix parameters. If the size of the new quantization matrix is smaller than the size of the copy source quantization matrix, the setup unit 170 generates the new quantization matrix by reducing the elements in the copy quantization matrix. If the size of the new quantization matrix is larger than the size of the copy source quantization matrix, the setup unit 170 generates the new quantization matrix by interpolating the elements in the copy quantization matrix. If a residual component exists, the setup unit 170 adds the residual component to the new quantization matrix.

另外,在给定QMPS中的量化矩阵参数中包括的源ID等于该QMPS的QMPS ID的情况中,设置部170将新的量化矩阵看作默认量化矩阵。In addition, in the case where the source ID included in the quantization matrix parameters in a given QMPS is equal to the QMPS ID of the QMPS, the setting section 170 regards the new quantization matrix as a default quantization matrix.

另外,在轴指定模式中,设置部170识别在量化矩阵参数中包括的指定方法标记。然后,在差分方法的情况中,设置部170基于在量化矩阵参数中包括的参考轴数据生成对应于垂直轴、水平轴和对角线轴的量化矩阵的元素的值,并通过内插生成其它元素的值。另外,在内插方法的情况中,设置部170基于在量化矩阵参数中包括的角数据生成对应于四个角的量化矩阵的元素的值,并在通过内插生成沿参考轴的元素的值之后,还通过内插生成剩余元素的值。然后,在存在残差分量的情况中,设置部170将残差分量加到新的量化矩阵。In addition, in the axis designation mode, the setup unit 170 identifies the designation method flag included in the quantization matrix parameters. Then, in the case of the differential method, the setup unit 170 generates values for the quantization matrix elements corresponding to the vertical, horizontal, and diagonal axes based on the reference axis data included in the quantization matrix parameters, and generates values for the other elements by interpolation. Furthermore, in the case of the interpolation method, the setup unit 170 generates values for the quantization matrix elements corresponding to the four corners based on the angle data included in the quantization matrix parameters. After generating values for the elements along the reference axes by interpolation, it also generates values for the remaining elements by interpolation. Then, if a residual component exists, the setup unit 170 adds the residual component to the new quantization matrix.

之后,当在片段头中指定QMPS ID时,设置部170将对于通过指定的QMPS ID标识的QMPS生成的量化矩阵设置为将由逆量化部63使用的量化矩阵。Thereafter, when the QMPS ID is specified in the slice header, the setting section 170 sets the quantization matrix generated for the QMPS identified by the specified QMPS ID as the quantization matrix to be used by the inverse quantization section 63 .

<4.在根据实施例的解码期间的处理流程><4. Processing Flow During Decoding According to the Embodiment>

[4-1.生成量化矩阵][4-1. Generating Quantization Matrix]

(1)量化矩阵生成处理(1) Quantization matrix generation processing

图9为示出根据本实施例的通过语法处理部61的量化矩阵生成处理的示例流程的流程图。图9中的量化矩阵生成处理是可以每当在图像数据流中检测到QMPS时进行的处理。Fig. 9 is a flowchart showing an example flow of a quantization matrix generation process by the syntax processing section 61 according to the present embodiment. The quantization matrix generation process in Fig. 9 is a process that can be performed every time a QMPS is detected in an image data stream.

参考图9,参数获取部160首先从QMPS获取QMPS ID(步骤S200)。如果此时获取的QMPS ID是流中的未使用的ID,则设置部170根据下述的处理生成将与该QMPS ID相关联的新的量化矩阵。另一方面,如果QMPS ID是已经使用的ID,设置部170将与该QMPS ID关联存储的量化矩阵更新为根据下述的处理生成的矩阵。然后,参数获取部160从QMPS获取生成模式存在标记(步骤S202)。Referring to Figure 9 , the parameter acquisition unit 160 first acquires a QMPS ID from the QMPS (step S200). If the acquired QMPS ID is an unused ID in the stream, the setup unit 170 generates a new quantization matrix to be associated with the QMPS ID according to the process described below. On the other hand, if the QMPS ID is already in use, the setup unit 170 updates the quantization matrix stored in association with the QMPS ID to a matrix generated according to the process described below. The parameter acquisition unit 160 then acquires a generation mode presence flag from the QMPS (step S202).

随后的从步骤S206到步骤S240的处理对于每种量化矩阵重复(步骤S204)。注意,量化矩阵的类别对应于量化矩阵的尺寸和类型(即,预测方法和信号分量)的组合。The subsequent processing from step S206 to step S240 is repeated for each quantization matrix (step S204 ). Note that the category of the quantization matrix corresponds to a combination of the size and type (ie, prediction method and signal component) of the quantization matrix.

在步骤S206中,设置部170根据生成模式存在标记确定在QMPS中是否存在生成模式(的类别)(步骤S206)。在此时不存在生成模式的情况中,设置部170类似于H.264/AVC中的量化矩阵解码处理地以全扫描方法生成量化矩阵(步骤S208)。另一方面,在存在生成模式的情况中,参数获取部160从QMPS获取生成模式(步骤S210)。设置部170然后根据生成模式进行不同的处理(步骤S212、S214)。In step S206, the setup unit 170 determines whether (the type of) a generation mode exists in the QMPS based on the generation mode presence flag (step S206). If a generation mode does not exist at this time, the setup unit 170 generates a quantization matrix using a full scan method, similar to the quantization matrix decoding process in H.264/AVC (step S208). On the other hand, if a generation mode exists, the parameter acquisition unit 160 acquires the generation mode from the QMPS (step S210). The setup unit 170 then performs different processes depending on the generation mode (steps S212 and S214).

例如,在指示复制模式的情况中,设置部170以图10所例示的复制模式进行处理(步骤S220)。另外,在指示轴指定模式的情况中,设置部170以图11所例示的轴指定模式进行处理(步骤S240)。另外,在指示全扫描模式的情况中,设置部170类似于H.264/AVC中的量化矩阵解码处理地以全扫描方法生成量化矩阵(步骤S208)。For example, if the copy mode is indicated, the setup unit 170 performs processing in the copy mode illustrated in FIG. 10 (step S220). Alternatively, if the axis designation mode is indicated, the setup unit 170 performs processing in the axis designation mode illustrated in FIG. 11 (step S240). Furthermore, if the full scan mode is indicated, the setup unit 170 generates a quantization matrix using a full scan method, similar to the quantization matrix decoding process in H.264/AVC (step S208).

然后,当生成用于全部类别的量化矩阵的量化矩阵时,如图9所示的量化矩阵生成处理结束。Then, when the quantization matrices for the quantization matrices of all the classes are generated, the quantization matrix generation process shown in FIG. 9 ends.

(2)复制模式中的处理(2) Processing in copy mode

图10为示出在图9的步骤S220中的复制模式中的处理的具体流程的实例的流程图。FIG. 10 is a flowchart showing an example of a specific flow of processing in the copy mode in step S220 of FIG. 9 .

参考图10,首先,参数获取部160从QMPS获取源ID(步骤S221)。接着,设置部170确定在图9的步骤S200中获取的QMPS ID(当前QMPS的QMPS ID)与源ID是否相等。此时,在当前QMPS的QMPS ID与源ID相等的情况中,设置部170生成新的量化矩阵作为默认量化矩阵(步骤S223)。另一方面,在其中当前QMPS的QMPS ID与源ID不相等的情况中,处理前进到步骤S224。Referring to FIG10 , the parameter acquisition unit 160 first acquires the source ID from the QMPS (step S221). Next, the setup unit 170 determines whether the QMPS ID acquired in step S200 of FIG9 (the QMPS ID of the current QMPS) is equal to the source ID. If the QMPS ID of the current QMPS is equal to the source ID, the setup unit 170 generates a new quantization matrix as a default quantization matrix (step S223). On the other hand, if the QMPS ID of the current QMPS is not equal to the source ID, the process proceeds to step S224.

在步骤S224,参数获取部160从QMPS获取复制源尺寸和复制源类型(步骤S224)。然后,设置部170复制通过源ID、复制源尺寸和复制源类型指定的量化矩阵(步骤S225)。然后,设置部170比较复制源尺寸与将要生成的量化矩阵的尺寸(步骤S226)。这里,在其中复制源尺寸与将生成的量化矩阵的尺寸不相等的情况中,设置部170通过根据尺寸差内插或减少复制量化矩阵中的元素而生成新的量化矩阵(步骤S227)。In step S224, the parameter acquisition unit 160 acquires the copy source size and copy source type from the QMPS (step S224). The setup unit 170 then copies the quantization matrix specified by the source ID, copy source size, and copy source type (step S225). The setup unit 170 then compares the copy source size with the size of the quantization matrix to be generated (step S226). If the copy source size is not equal to the size of the quantization matrix to be generated, the setup unit 170 generates a new quantization matrix by interpolating or reducing the elements of the copied quantization matrix according to the size difference (step S227).

另外,参数获取部160从QMPS获取残差标记(步骤S228)。接着,设置部170根据残差标记的值确定是否存在残差数据(步骤S229)。这里,在存在残差数据的情况中,设置部170将残差增加到在步骤S223或步骤S225到S227中生成的新的量化矩阵(步骤S230)。In addition, the parameter acquisition unit 160 acquires the residual flag from the QMPS (step S228). Next, the setting unit 170 determines whether residual data exists based on the value of the residual flag (step S229). If residual data exists, the setting unit 170 adds the residual to the new quantization matrix generated in step S223 or steps S225 to S227 (step S230).

(3)轴指定模式中的处理(3) Processing in axis designation mode

图11为示出在图9的步骤S240中的轴指定模式中的处理的具体流程的实例的流程图。FIG. 11 is a flowchart showing an example of a specific flow of processing in the axis designation mode in step S240 of FIG. 9 .

参考图11,首先,参数获取部160从QMPS获取指定方法标记(步骤S241)。然后,设置部170根据指定方法标记的值确定指定方法(步骤S242)。这里,在其中指定差分方法的情况中,处理前进到步骤S243。另一方面,在其中指定内插方法的情况中,处理前进到步骤S246。Referring to FIG. 11 , the parameter acquisition unit 160 first acquires a designated method flag from the QMPS (step S241). The setting unit 170 then determines the designated method based on the value of the designated method flag (step S242). If the difference method is designated, the process proceeds to step S243. On the other hand, if the interpolation method is designated, the process proceeds to step S246.

在差分方法的情况中,设置部170基于在量化矩阵参数中包括的参考轴数据生成对应于垂直轴、水平轴和对角线轴的量化矩阵的元素的值(步骤S243、S244和S245)。同时,在内插方法的情况中,设置部170基于在量化矩阵参数中包括的角数据生成对应于四个角的量化矩阵的元素的值(步骤S246)。接着,设置部170通过内插生成沿连接四个角的参考轴(垂直轴、水平轴和对角线轴)的元素的值(步骤S247)。然后,设置部170基于沿参考轴的元素的值内插剩余元素的值(步骤S248)。In the case of the differential method, the setting unit 170 generates values for the elements of the quantization matrix corresponding to the vertical, horizontal, and diagonal axes based on the reference axis data included in the quantization matrix parameters (steps S243, S244, and S245). Meanwhile, in the case of the interpolation method, the setting unit 170 generates values for the elements of the quantization matrix corresponding to the four corners based on the corner data included in the quantization matrix parameters (step S246). Next, the setting unit 170 generates values for the elements along the reference axes (vertical, horizontal, and diagonal) connecting the four corners through interpolation (step S247). The setting unit 170 then interpolates the values of the remaining elements based on the values of the elements along the reference axes (step S248).

另外,参数获取部160从QMPS获取残差标记(步骤S249)。接着,设置部170根据残差标记的值确定是否存在残差数据(步骤S250)。这里,在存在残差数据的情况中,设置部170将残差加到在步骤S248中生成的新的量化矩阵(步骤S251)。Furthermore, the parameter acquisition unit 160 acquires the residual flag from the QMPS (step S249). Next, the setting unit 170 determines whether residual data exists based on the value of the residual flag (step S250). If residual data exists, the setting unit 170 adds the residual to the new quantization matrix generated in step S248 (step S251).

[4-2.对片段设置量化矩阵][4-2. Setting the quantization matrix for the fragment]

图12为示出根据本实施例的通过语法处理部61对片段设置量化矩阵的处理的示例流程的流程图。图12中的处理可以每当在图像数据流中检测到片段头时进行。Fig. 12 is a flowchart showing an example flow of a process of setting a quantization matrix for a slice by the syntax processing section 61 according to the present embodiment. The process in Fig. 12 can be performed every time a slice header is detected in an image data stream.

首先,参数获取部160从片段头获取QMPS ID存在标记(步骤S261)。接着,参数获取部160根据QMPS ID存在标记的值确定在片段头中是否存在QMPS ID(步骤S262)。这里,在存在QMPS ID的情况中,参数获取部160还从QMPS获取QMPS ID(步骤S263)。设置部170然后对于该片段头之后的片段设置对于通过获取的QMPS ID识别的QMPS生成的量化矩阵(步骤S264)。另一方面,在其中在片段头中不存在QMPS ID的情况中,设置部170对该片段头之后的片段设置扁平量化矩阵(步骤S265)。First, the parameter acquisition unit 160 acquires the QMPS ID presence flag from the slice header (step S261). Next, the parameter acquisition unit 160 determines whether a QMPS ID exists in the slice header based on the value of the QMPS ID presence flag (step S262). If a QMPS ID exists, the parameter acquisition unit 160 also acquires the QMPS ID from the QMPS (step S263). The setting unit 170 then sets the quantization matrix generated for the QMPS identified by the acquired QMPS ID for the slices following the slice header (step S264). On the other hand, if a QMPS ID does not exist in the slice header, the setting unit 170 sets the flattened quantization matrix for the slices following the slice header (step S265).

<5.语法实例><5. Syntax Examples>

[5-1.第一实例][5-1. First Example]

图13到15示出根据本实施例的表达QMPS的语法的说明性伪码的第一实例。在伪码的左边缘给出行号。另外,伪码中的下划线变量表示在QMPS内指定对应于该变量的参数。Figures 13 to 15 show a first example of an illustrative pseudocode expressing the syntax of a QMPS according to this embodiment. Line numbers are given at the left edge of the pseudocode. In addition, underlined variables in the pseudocode indicate that the parameter corresponding to the variable is specified within the QMPS.

图13中行1上的函数QuantizationMatrixParameterset()是表达单个QMPS的语法的函数。在行2和行3上,指定QMPS ID(quantization_matrix_paramter_id)和生成模式存在标记(pred_present_flag)。随后从行6到行56的语法对于每个尺寸和类型的量化矩阵循环。在其中存在生成模式的情况中(pred_present_flag =1),在QMPS中插入循环中从行7到行53的语法。The QuantizationMatrixParameterset() function on line 1 in Figure 13 expresses the syntax of a single QMPS. Lines 2 and 3 specify the QMPS ID (quantization_matrix_paramter_id) and the generation mode presence flag (pred_present_flag). The syntax from lines 6 to 56 then loops for each quantization matrix size and type. If the generation mode exists (pred_present_flag = 1), the syntax from lines 7 to 53 in the loop is inserted into the QMPS.

在其中存在生成模式的情况中的从行9到行16的语法是用于复制模式的语法。从行9到行16,指定源ID、复制源尺寸和复制源类型。行12的函数pred_matrix()表示将复制由源ID、复制源尺寸和复制源类型指定的量化矩阵。在行13上,指定残差标记。在行15上的函数residual_matrix()表示在其中存在残差分量的情况中在QMPS中指定残差数据。In the case of the Generate mode, the syntax from lines 9 to 16 is for the Copy mode. Lines 9 to 16 specify the source ID, copy source size, and copy source type. The pred_matrix() function on line 12 indicates that the quantization matrix specified by the source ID, copy source size, and copy source type will be copied. Line 13 specifies the residual flag. The residual_matrix() function on line 15 indicates that, in the case of the residual component, residual data is specified in the QMPS.

从行18到行50的语法是用于轴指定模式的语法,并在图14中描述。在行18上,规定指定方法标记。在其中指定方法是差分(DPCM)方法的情况中,从行21到行25指定沿垂直轴的元素的值,从行26到行34指定沿水平轴的元素的值,以及从行35到行40指定沿对角线轴的元素的值。在该情况中的参考轴数据为DPCM差分数据的线性阵列。注意,在其中可以复制沿水平轴的元素的值的情况中,可以省略用于沿水平轴的元素的语法(行27,行28)。在其中指定方法是内插方法的情况中,从行42到行45分别指定左上(DC分量)、右上、左下和右下元素的值作为角数据。The syntax from lines 18 to 50 is the syntax for the axis designation mode and is described in FIG14. On line 18, a designation method flag is specified. In the case where the designation method is the differential (DPCM) method, the values of the elements along the vertical axis are specified from lines 21 to 25, the values of the elements along the horizontal axis are specified from lines 26 to 34, and the values of the elements along the diagonal axis are specified from lines 35 to 40. The reference axis data in this case is a linear array of DPCM differential data. Note that in the case where the values of the elements along the horizontal axis can be copied, the syntax for the elements along the horizontal axis can be omitted (line 27, line 28). In the case where the designation method is the interpolation method, the values of the upper left (DC component), upper right, lower left, and lower right elements are specified as angle data from lines 42 to 45, respectively.

行52上的处理是用于全扫描模式的语法。行55上的处理是用于其中不存在生成模式的情况中的语法。在两种情况中,通过表示H.264/AVC中的量化矩阵语法的函数qmatrix()以全扫描方法指定量化矩阵。The process on line 52 is the syntax for full scan mode. The process on line 55 is the syntax for the case where there is no generation mode. In both cases, the quantization matrix is specified in the full scan method using the function qmatrix(), which represents the quantization matrix syntax in H.264/AVC.

图15中行1上的函数residual_matrix()是用于指定在图13中的行15和图14中的行49上使用的残差数据的函数。在图15中的实例中,通过DPCM方法或行程长度(run-length)方法指定残差数据。在DPCM方法的情况中,从行4到行8对线性阵列中的每个元素指定与最后元素的差值(delta_coef)。在行程长度方法的情况中,从行11到行18重复指定其中值连续为零(行程)的部分中的元素组的长度和非零元素的值(数据)。The residual_matrix() function on line 1 in FIG. 15 is used to specify the residual data used on line 15 in FIG. 13 and line 49 in FIG. 14 . In the example in FIG. 15 , the residual data is specified using either the DPCM method or the run-length method. In the case of the DPCM method, the difference (delta_coef) from the last element is specified for each element in the linear array from lines 4 to 8. In the case of the run-length method, the length of the element group in the portion where the values are continuously zero (run) and the values (data) of the non-zero elements are repeatedly specified from lines 11 to 18.

[5-2.第二实例][5-2. Second Example]

图16到20示出根据本实施例的表达QMPS的语法的说明性伪码的第二实例。16 to 20 show a second example of illustrative pseudo code expressing the syntax of QMPS according to the present embodiment.

图16中行1上的函数QuantizationMatrix parameterSet()是表达单个QMPS的语法的函数。在行2上,指定QMPS ID(quantization_matrix_paramter_id)。另外,除了在其中仅指定默认量化矩阵的情况中,在行6指定生成模式存在标记(pred_present_flag)。The QuantizationMatrix parameterSet() function on line 1 in Figure 16 expresses the syntax of a single QMPS. Line 2 specifies the QMPS ID (quantization_matrix_paramter_id). In addition, line 6 specifies the generation mode presence flag (pred_present_flag), except when specifying only the default quantization matrix.

另外,在第二实例中,在函数QuantizationMatrixParameterSet()中从行7到行10指定四种量化尺度(Qscale0到Qscale3)。这些量化尺度是可被采用以量化量化矩阵中的每个元素的值并进一步降低速率的参数。更具体是,在例如8×8量化矩阵中定义如同图21中所示的四个量化尺度设置区域A1到A4。量化尺度设置区域A1是用于对应于低范围信号(包括DC分量)的元素组的区域。量化尺度设置区域A2和A3是用于对应于中等范围中的各个信号的元素组的区域。量化尺度设置区域A4是用于对应于高范围信号的元素组的区域。可以对这些区域的每个设置用于量化量化矩阵中的元素的值的量化尺度。例如,参考图22,对于量化尺度设置区域A1,第一量化尺度(Qscale0)是“1”。这表示,在对应于低范围信号的元素组中未量化量化矩阵中的值。同时,对于量化尺度设置区域A2,第二量化尺度(Qscale1)为“2”。对于量化尺度设置区域A3,第三量化尺度(Qscale2)为“3”。对于量化尺度设置区域A4,第四量化尺度(Qscale3)为“4”。随着量化尺度变得更大,通过量化生成的误差增大。然而,通常,一定程度的误差对于高范围信号是容许的。从而,在期望获得高编码效率的情况中,通过设置这样的用于量化量化矩阵的量化尺度可以有效地减少用于定义量化矩阵所需的代码量,而不会大幅降低图像质量。在其中量化矩阵被量化的情况中,通过在每个元素所属的量化尺度设置区域中设置的量化步骤,可以基本量化或逆量化图3中实例所示的残差数据或差分数据中的每个元素的值。In the second example, lines 7 through 10 of the QuantizationMatrixParameterSet() function specify four quantization scales (Qscale0 through Qscale3). These quantization scales are parameters that can be used to quantize the values of each element in the quantization matrix and further reduce the rate. More specifically, four quantization scale setting areas A1 through A4, as shown in Figure 21, are defined in an 8×8 quantization matrix. Quantization scale setting area A1 is used for groups of elements corresponding to low-range signals (including DC components). Quantization scale setting areas A2 and A3 are used for groups of elements corresponding to signals in the mid-range. Quantization scale setting area A4 is used for groups of elements corresponding to high-range signals. For each of these areas, a quantization scale can be set for quantizing the values of elements in the quantization matrix. For example, referring to Figure 22, for quantization scale setting area A1, the first quantization scale (Qscale0) is "1." This indicates that values in the quantization matrix are not quantized in groups of elements corresponding to low-range signals. Meanwhile, for quantization scale setting area A2, the second quantization scale (Qscale1) is "2." For the quantization scale setting area A3, the third quantization scale (Qscale2) is "3." For the quantization scale setting area A4, the fourth quantization scale (Qscale3) is "4." As the quantization scale becomes larger, the error generated by quantization increases. However, generally, a certain degree of error is acceptable for high-range signals. Thus, in cases where high coding efficiency is desired, by setting such a quantization scale for quantizing the quantization matrix, the amount of code required to define the quantization matrix can be effectively reduced without significantly degrading image quality. In the case where the quantization matrix is quantized, the value of each element in the residual data or differential data shown in the example of FIG. 3 can be substantially quantized or inversely quantized by the quantization step set in the quantization scale setting area to which each element belongs.

注意,图21所示的量化尺度设置区域的布局仅仅是一个实例。例如,还可以对量化矩阵的每个尺寸定义不同数目的量化尺度设置区域(例如,可以对较大尺寸定义更多的量化尺度设置区域)。另外,量化尺度设置区域的边界的位置不限于图21所示的实例。通常,在线性化量化矩阵时的扫描图案是Z形扫描。为此,优选使用从右上方到左下方的对角线区域边界,如图21所示。然而,根据诸如在量化矩阵中的元素间相关性和使用的扫描图案的因素,还可以使用沿垂直方向或水平方向的区域边界。另外,还可以从编码效率的角度适应地选择量化尺度设置区域的布局(区域数、边界位置等)。例如,在定义近似扁平的量化矩阵的情况中,还可以选择更小数目的量化尺度设置区域。Note that the layout of the quantization scale setting areas shown in FIG21 is merely an example. For example, a different number of quantization scale setting areas can be defined for each size of the quantization matrix (e.g., more quantization scale setting areas can be defined for larger sizes). Furthermore, the positions of the quantization scale setting area boundaries are not limited to the example shown in FIG21 . Typically, the scanning pattern when linearizing the quantization matrix is a zigzag scan. For this reason, diagonal area boundaries from the upper right to the lower left are preferably used, as shown in FIG21 . However, depending on factors such as the correlation between elements in the quantization matrix and the scanning pattern used, area boundaries along the vertical or horizontal direction can also be used. Furthermore, the layout of the quantization scale setting areas (number of areas, boundary positions, etc.) can be adaptively selected from the perspective of coding efficiency. For example, when defining a nearly flat quantization matrix, a smaller number of quantization scale setting areas can be selected.

在图16中,后续的从行13到行76的语法对于每个尺寸和类型的量化矩阵循环。在其中存在生成模式的情况中(pred_present_flag=1),在QMPS中插入循环中从行14到行66(参考图18)的语法。In Fig. 16 , the syntax from lines 13 to 76 that follow loops for each quantization matrix size and type. In the case where the generation mode exists (pred_present_flag=1), the syntax from lines 14 to 66 (see Fig. 18 ) in the loop is inserted into the QMPS.

在其中存在生成模式的情况中的从行16到行22的语法是用于复制模式的语法。从行16到行18,指定源ID、复制源尺寸和复制源类型。在行19上,指定残差标记。在行21上的函数residual_matrix()表示在其中存在残差分量的情况中在QMPS中指定残差数据。可根据上述四种量化尺度(Qscale0到Qscale3)的值量化这里的残差数据。从行23到行56的语法是用于轴指定模式的语法,并在图17中描述。可同样根据上述四种量化尺度(Qscale0到Qscale3)的值量化轴指定模式中的残差数据(行55)。The syntax from lines 16 to 22 in the case where there is a generation mode is the syntax for the copy mode. From lines 16 to 18, the source ID, copy source size, and copy source type are specified. On line 19, the residual flag is specified. The function residual_matrix() on line 21 indicates that residual data is specified in the QMPS in the case where there is a residual component. The residual data here can be quantized according to the values of the four quantization scales (Qscale0 to Qscale3) described above. The syntax from lines 23 to 56 is the syntax for the axis designation mode and is described in Figure 17. The residual data in the axis designation mode can also be quantized according to the values of the four quantization scales (Qscale0 to Qscale3) described above (line 55).

图18中的行57到行66的语法是用于全扫描模式的语法。另外,从行68到行75的语法是用于其中不存在生成模式的情况中的语法。在两种情况中,通过函数qmatrix()以全扫描方法指定量化矩阵。然而,在第二实例中,为了进一步提高编码效率,适应地切换用于DPCM方法中的熵编码差分数据(delta_coef)或行程长度方法中的行程值(run)和非零元素值(data)的VLC表。行61和行71上的vlc_table_data指定对于DPCM方法中的差分数据(delta_coef)或行程长度方法中的非零元素值(data)选择的VLC表的表数目。行63和行73上的vlc_table_run指定对于行程长度方法中的行程值(run)选择的VLC表的表数目。The syntax from lines 57 to 66 in Figure 18 is for full scan mode. Additionally, the syntax from lines 68 to 75 is for when generate mode is not present. In both cases, the quantization matrix is specified using the full scan method using the qmatrix() function. However, in the second example, to further improve coding efficiency, the VLC tables used for entropy-encoded differential data (delta_coef) in the DPCM method or run values (run) and non-zero element values (data) in the run-length method are adaptively switched. vlc_table_data on lines 61 and 71 specifies the number of VLC tables selected for differential data (delta_coef) in the DPCM method or non-zero element values (data) in the run-length method. vlc_table_run on lines 63 and 73 specifies the number of VLC tables selected for run values (run) in the run-length method.

图19的行1上的函数qmatrix()是用于以全扫描方法指定量化矩阵的语法。图19中的行3到行8指示用于DPCM方法的语法,并且利用通过上述vlc_table_data指定的VLC表编码行5上的差分数据(delta_coef)。另外,行10到行21指示用于行程长度方法的语法,并且利用通过上述vlc_table_run指定的VLC表编码行12上的行程值(run)。利用通过上述vlc_tabla_data指定的VLC表编码行13上的非零元素值(data)。The function qmatrix() on line 1 of Figure 19 is a syntax for specifying a quantization matrix using the full scan method. Lines 3 to 8 of Figure 19 indicate syntax for the DPCM method, and the differential data (delta_coef) on line 5 is encoded using the VLC table specified by vlc_table_data. Additionally, lines 10 to 21 indicate syntax for the run length method, and the run values (run) on line 12 are encoded using the VLC table specified by vlc_table_run. The non-zero element values (data) on line 13 are encoded using the VLC table specified by vlc_tabla_data.

图23示出在低复杂度熵编码(LCEC)方法中可以选择的11种可变长度编码(VLC)表的代码字列表。图23中每个代码字中的“x”为给定后缀。例如,如果值“15”以指数哥伦布代码编码,获得9位代码字“000010000”,而如果以VLC4编码该值,获得5位代码字“11111”。通过这样,在编码许多较大值的情况中,通过选择在短代码字中具有较多位数的后缀的VLC表,可以提高编码效率。在图23的11种VLC表中,例如VLC4在5比特代码字中具有4位后缀。另外,VLC9在6比特代码字中具有4位后缀。从而,这些VLC表适于编码许多较大值的情况。Figure 23 shows a list of codewords from 11 variable-length coding (VLC) tables that can be selected in the Low Complexity Entropy Coding (LCEC) method. The "x" in each codeword in Figure 23 represents a given suffix. For example, if the value "15" is encoded using the Exponential Golomb code, the 9-bit codeword "000010000" is obtained, while if the same value is encoded using VLC4, the 5-bit codeword "11111" is obtained. This improves coding efficiency when encoding many large values by selecting a VLC table with a larger number of suffixes in short codewords. For example, among the 11 VLC tables in Figure 23, VLC4 has a 4-bit suffix in a 5-bit codeword. Furthermore, VLC9 has a 4-bit suffix in a 6-bit codeword. Therefore, these VLC tables are suitable for encoding many large values.

返回到量化矩阵语法,由于量化矩阵的线性阵列的差分数据具有多个连续零,行程长度方法中的行程值产生多个较大的值,而不是诸如0、1或2的小值。另一方面,行程长度方法中的非零元素值和差分数据值仅不频繁地形成大值。从而,通过利用上述语法切换用于每个差分数据指定方法的VLC表(DPCM/行程长度)和类别值(在行程长度方法的情况中为行程/非零元素),大大减少了限定量化矩阵所需的代码量。Returning to the quantization matrix syntax, since the differential data of the linear array of the quantization matrix has multiple consecutive zeros, the run values in the run-length method generate multiple large values rather than small values such as 0, 1, or 2. On the other hand, the non-zero element values and differential data values in the run-length method only rarely form large values. Therefore, by switching the VLC table (DPCM/run-length) and class value (run/non-zero element in the case of the run-length method) for each differential data specification method using the above syntax, the amount of code required to define the quantization matrix is greatly reduced.

图20的行1上的函数residual_matrix()还实现了对VLC表的适应切换。换句话说,行7上的vlc_table_data指定对于DPCM方法中的差分数据(delta_coef)选择的VLC表的表编号。行10上的差分数据(delta_coef)利用在行7指定的VLC表进行编码。行15上的vlc_table_data指定对于行程长度方法中的非零元素值(data)选择的VLC表的表编号。行16上的vlc_table_run指定对于行程长度方法中的行程值(run)选择的VLC表的表编号。利用通过上述vlc_table_run指定的VLC表编码行19上的行程值(run)。利用通过上述vlc_table_data指定的VLC表编码行20上的非零元素值(data)。The residual_matrix() function on line 1 of Figure 20 also implements adaptive switching of VLC tables. Specifically, vlc_table_data on line 7 specifies the table number of the VLC table selected for the differential data (delta_coef) in the DPCM method. The differential data (delta_coef) on line 10 is encoded using the VLC table specified in line 7. vlc_table_data on line 15 specifies the table number of the VLC table selected for the non-zero element values (data) in the run-length method. vlc_table_run on line 16 specifies the table number of the VLC table selected for the run values (run) in the run-length method. The run values (run) on line 19 are encoded using the VLC table specified by vlc_table_run. The non-zero element values (data) on line 20 are encoded using the VLC table specified by vlc_table_data.

根据诸如QMPS语法的各种特征,有效地减少了用于定义量化矩阵所需的代码量,并且可以提高编码效率。然而,在这里描述的语法仅仅为一个实例。换句话说,可以减少或省略这里示例示出的语法的一部分,可以改变参数的顺序,或者可以将其它参数增加到语法中。另外,当在SPS或PPS而不是QMPS中定义量化矩阵时,也可以实现在这部分描述的语法的若干特征。在该情况中,可以减少用于在SPS或PPS中定义量化矩阵所需的代码量。By using various features of the QMPS syntax, the amount of code required to define the quantization matrix is effectively reduced, and coding efficiency can be improved. However, the syntax described here is merely an example. In other words, a portion of the syntax shown here can be reduced or omitted, the order of parameters can be changed, or other parameters can be added to the syntax. In addition, when the quantization matrix is defined in an SPS or PPS instead of a QMPS, several features of the syntax described in this section can also be implemented. In this case, the amount of code required to define the quantization matrix in the SPS or PPS can be reduced.

<6.参数集的各种示例配置><6. Various example configurations of parameter sets>

上文描述了用于存储量化矩阵参数的量化矩阵参数集(QMPS)的语法的几个具体实例。QMPS基本可以为仅包含量化矩阵参数的专用参数集,但是还可以是还包括与除量化矩阵之外的编码工具相关的其它参数的共用参数集。例如, "适应参数集(APS)"(JCTVC-F747r3, Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3and ISO/IEC JTC1/SC29/WG11 6th Meeting: Torino, IT, 2011年7月14-22日)引入了称为适应参数集(APS)的新的参数集,并提出存储与自适应环路滤波器(ALF)和APS中的样本适应偏置(SAO)相关的参数。通过在该APS中另外包括量化矩阵参数,可以基本配置上述QMPS。从而,在该部分中,将描述通过使用“适应参数集(APS)”(JCTVC-F747r3)提出的APS配置QMPS的若干技术。The above describes several specific examples of the syntax of a Quantization Matrix Parameter Set (QMPS) for storing quantization matrix parameters. A QMPS can essentially be a dedicated parameter set containing only quantization matrix parameters, but it can also be a shared parameter set that also includes parameters related to coding tools other than the quantization matrix. For example, the "Adaptation Parameter Set (APS)" (JCTVC-F747r3, Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 6th Meeting: Torino, IT, July 14-22, 2011) introduces a new parameter set called the Adaptation Parameter Set (APS) and proposes storing parameters related to the Adaptive Loop Filter (ALF) and Sample Adaptive Offset (SAO) in the APS. By additionally including quantization matrix parameters in this APS, the aforementioned QMPS can be essentially configured. Thus, in this section, several techniques for configuring a QMPS by using the Adaptation Parameter Set (APS) proposed by the APS (JCTVC-F747r3) will be described.

[6-1.第一技术][6-1. First technique]

第一技术是在一个APS中列出全部目标参数、并利用APS ID(唯一表示APS的标识符)指示每个参数的技术。图24示出根据第一技术配置的编码流的实例。The first technique is a technique of listing all target parameters in one APS and indicating each parameter using an APS ID (an identifier uniquely representing an APS). Fig. 24 shows an example of a coded stream configured according to the first technique.

参考图24,在位于序列的开始的图片P0的开始插入SPS 801、PPS 802以及APS803。PPS 802由PPS ID“P0”标识。APS 803由APS ID“A0”标识。APS 803包括关于ALF的参数、关于SAO的参数、以及量化矩阵参数(下文表示为关于QM的参数)。附到图片P0中的片段数据的片段头804包括参考PPS ID“P0”,这表示PPS 802中的参数被参考以解码该片段数据。类似地,片段头804包括参考APS ID“A0”,这表示APS 803中的参数被参考以解码该片段数据。24 , SPS 801, PPS 802, and APS 803 are inserted at the beginning of picture P0, which is located at the beginning of the sequence. PPS 802 is identified by PPS ID "P0." APS 803 is identified by APS ID "A0." APS 803 includes parameters related to ALF, parameters related to SAO, and quantization matrix parameters (hereinafter referred to as QM parameters). Slice header 804 attached to slice data in picture P0 includes a reference to PPS ID "P0," indicating that the parameters in PPS 802 are referenced for decoding the slice data. Similarly, slice header 804 includes a reference to APS ID "A0," indicating that the parameters in APS 803 are referenced for decoding the slice data.

APS 805被插入到在图片P0之后的图片P1中。APS 805由APS ID“A1”标识。APS 805包括关于ALF的参数、关于SAO的参数、以及关于QM的参数。APS 805中包括的关于ALF的参数和关于SAO的参数已经从APS 803更新,但是关于QM的参数未更新。附到图片P1中的片段数据的片段头806包括参考APS ID“A1”,这表示APS 805中的参数被参考以解码该片段数据。APS 805 is inserted into picture P1 following picture P0. APS 805 is identified by APS ID "A1." APS 805 includes parameters related to ALF, SAO, and QM. The ALF and SAO parameters included in APS 805 have been updated from APS 803, but the QM parameters have not been updated. The slice header 806 attached to the slice data in picture P1 includes a reference to APS ID "A1," indicating that the parameters in APS 805 are referenced to decode the slice data.

APS 807被插入到在图片P1之后的图片P2中。APS 807由APS ID“A2”标识。APS 807包括关于ALF的参数、关于SAO的参数、以及关于QM的参数。APS 807中包括的关于ALF的参数和关于QM的参数已经从APS 805更新,但是关于SAO的参数未更新。附到图片P2中的片段数据的片段头808包括参考APS ID“A2”,这表示APS 807中的参数被参考以解码该片段数据。APS 807 is inserted into picture P2, which follows picture P1. APS 807 is identified by APS ID "A2." APS 807 includes parameters related to ALF, SAO, and QM. The ALF and QM parameters included in APS 807 have been updated from APS 805, but the SAO parameters have not been updated. Slice header 808 attached to the slice data in picture P2 includes a reference to APS ID "A2," indicating that the parameters in APS 807 are referenced to decode the slice data.

图25示出根据第一技术定义的APS语法的实例。在图25中的行2上,指定用于唯一识别该APS的APS ID。该APS ID为用于代替利用图3描述的QMPS ID的参数集标识符。在行13到行17上指定关于ALF的参数。在行18到行23上指定关于SAO的参数。在行24到行28上指定关于QM的参数。行24上的“aps_qmatrix_flag”是表示指示在APS中是否设置关于QM的参数的量化矩阵存在标记。行27上的“qmatrix_param()”是指定通过图13到20中的实例示出的量化矩阵参数的函数。在行24上的量化矩阵存在标记指示在APS中设置关于QM的参数(aps_qmatrix_flag=1),可使用函数qmatrix_param() 以设置APS中的量化矩阵参数。FIG25 illustrates an example of the APS syntax defined according to the first technique. Line 2 of FIG25 specifies the APS ID, which uniquely identifies the APS. This APS ID is a parameter set identifier used in place of the QMPS ID described in FIG3 . Lines 13 to 17 specify parameters related to the ALF. Lines 18 to 23 specify parameters related to the SAO. Lines 24 to 28 specify parameters related to the QM. "aps_qmatrix_flag" on line 24 is a quantization matrix presence flag indicating whether QM-related parameters are set in the APS. "qmatrix_param()" on line 27 is a function that specifies the quantization matrix parameters shown in the examples of FIG13 to 20 . The quantization matrix presence flag on line 24 indicates that QM-related parameters are set in the APS (aps_qmatrix_flag = 1). The qmatrix_param() function can be used to set the quantization matrix parameters in the APS.

在实现第一技术的情况中,图8所示的参数获取部160通过参考APS中包括的量化矩阵存在标记确定在APS中是否设置量化矩阵参数。然后参数获取部160在APS中设置量化矩阵参数的情况中从APS获取量化矩阵参数。In the case of implementing the first technique, the parameter acquisition unit 160 shown in FIG8 determines whether the quantization matrix parameters are set in the APS by referring to the quantization matrix presence flag included in the APS. Then, if the quantization matrix parameters are set in the APS, the parameter acquisition unit 160 acquires the quantization matrix parameters from the APS.

图26为示出根据第一技术定义的片段头语法的实例的说明图。在图26中的行5上,从将对该片段设置的参数中指定用于参考PPS中包括的参数的参考PPS ID。在行8上,从将对该片段设置的参数中指定用于参考APS中包括的参数的参考APS ID。该参考APS ID为用于代替利用图4描述的(参考)QMPS ID的参考参数。FIG26 is an explanatory diagram showing an example of the slice header syntax defined according to the first technique. In line 5 of FIG26 , a reference PPS ID for referencing parameters included in the PPS is specified from among the parameters to be set for the slice. In line 8, a reference APS ID for referencing parameters included in the APS is specified from among the parameters to be set for the slice. This reference APS ID is a reference parameter used in place of the (reference) QMPS ID described in FIG4 .

根据第一技术,通过扩展通过“适应参数集(APS)”(JCTVC-F747r3)提出的APS,可以低成本实现前述量化矩阵参数集。另外,可以使用量化矩阵存在标记以从与APS中可能包括的各种编码工具相关的参数中仅部分更新量化矩阵参数,或者可选地,仅部分不更新量化矩阵参数。换句话说,由于可能仅在需要更新量化矩阵时在APS中包括量化矩阵参数,从而可以在APS中有效地传输量化矩阵参数。According to the first technique, the aforementioned quantization matrix parameter set can be implemented at low cost by extending the Adaptation Parameter Set (APS) proposed by JCTVC-F747r3. Furthermore, a quantization matrix presence flag can be used to update only some of the parameters associated with various coding tools that may be included in the APS, or alternatively, to exclude some of the quantization matrix parameters from being updated. In other words, since the quantization matrix parameters can be included in the APS only when they are needed, the quantization matrix parameters can be efficiently transmitted in the APS.

[6-2.第一技术的示例修改][6-2. Example modification of the first technique]

还可以实现根据下文的示例修改的技术,以进一步减少APS中的量化矩阵参数的代码量。The modified technique according to the example below can also be implemented to further reduce the code amount of the quantization matrix parameters in the APS.

图27示出根据第一技术的示例修改定义的APS语法的实例。在图27所示的语法中,在行24到行33上指定关于QM的参数。行24上的“aps_qmatrix_flag”是表示在APS中是否设置关于QM的参数的量化矩阵存在标记。行25上的“ref_aps_id_present_flag”是指示在APS中是否将使用过去参考ID作为关于QM的参数的过去参考ID存在标记。在过去参考ID存在标记指示将使用过去参考ID的情况中(ref_aps_id_present_flag=1),在行27设置过去参考ID“ref_aps_id”。过去参考ID是用于参考在当前APS之前编码或解码的APS的APS ID的标识符。在使用过去参考ID的情况中,在参考源(之后的)APS中未设置量化矩阵参数。在该情况中,图8所示的设置部170再使用基于在由过去参考ID指示的参考目标APS中的量化矩阵参数设置的量化矩阵作为对应于参考源APS的量化矩阵。注意,可以禁止参考参考源APS(其被称为自参考)的APS ID的过去参考ID。取代地,设置部170可以设置默认量化矩阵作为对应于自参考APS的量化矩阵。在未使用过去参考ID的情况中(ref_aps_id_present_flag=0),可以使用行31上的函数“qmatrix_param()”以设置APS中的量化矩阵参数。FIG27 illustrates an example of an APS syntax defined according to an example modification of the first technique. In the syntax illustrated in FIG27 , parameters related to the QM are specified on lines 24 to 33. "aps_qmatrix_flag" on line 24 is a quantization matrix presence flag indicating whether parameters related to the QM are set in the APS. "ref_aps_id_present_flag" on line 25 is a past reference ID presence flag indicating whether a past reference ID will be used as a parameter related to the QM in the APS. When the past reference ID presence flag indicates that a past reference ID will be used (ref_aps_id_present_flag = 1), the past reference ID "ref_aps_id" is set on line 27. The past reference ID is an identifier for referencing the APS ID of an APS encoded or decoded before the current APS. When a past reference ID is used, the quantization matrix parameters are not set in the reference source (latter) APS. In this case, the setup unit 170 shown in FIG8 reuses the quantization matrix set based on the quantization matrix parameters in the reference target APS indicated by the past reference ID as the quantization matrix corresponding to the reference source APS. Note that it is possible to prohibit the use of a past reference ID that references the APS ID of the reference source APS (referred to as self-reference). Alternatively, the setup unit 170 may set a default quantization matrix as the quantization matrix corresponding to the self-reference APS. If a past reference ID is not used (ref_aps_id_present_flag = 0), the function "qmatrix_param()" on line 31 may be used to set the quantization matrix parameters in the APS.

这样,通过使用过去参考ID以再使用已经编码或解码的量化矩阵,避免了在APS中重复设置相同的量化矩阵参数。从而,可以减少APS中的量化矩阵参数的代码量。注意,尽管图27示出其中使用APS ID以参考过去APS的实例,然而,参考过去APS的方法不限于该实例。例如,还可以使用在参考源APS与参考目标APS之间的诸如APS数目的其它参数以参考过去APS。另外,代替使用过去参考ID存在标记,可以根据过去参考ID是否指示给定值(-1,例如)切换参考过去APS和设置新的量化矩阵参数。In this way, by using the past reference ID to reuse the quantization matrix that has already been encoded or decoded, it is avoided to repeatedly set the same quantization matrix parameters in the APS. As a result, the code amount of the quantization matrix parameters in the APS can be reduced. Note that although Figure 27 shows an example in which the APS ID is used to reference the past APS, the method of referencing the past APS is not limited to this example. For example, other parameters such as the number of APSs between the reference source APS and the reference target APS can also be used to reference the past APS. In addition, instead of using the past reference ID existence flag, it is possible to switch the reference to the past APS and set new quantization matrix parameters depending on whether the past reference ID indicates a given value (-1, for example).

[6-3.第二技术][6-3. Second technique]

第二技术是对于每种参数在不同APS(不同NAL单元)中存储参数,并利用唯一标识每个APS的APS ID参考每个参数的技术。图28示出根据第二技术配置的编码流的实例。The second technique is a technique of storing parameters in different APSs (different NAL units) for each parameter, and referencing each parameter using an APS ID that uniquely identifies each APS. Fig. 28 shows an example of a coded stream configured according to the second technique.

参考图28,在位于序列的开始的图片P0的开始处插入SPS 811、PPS 812、APS813a、APS 813b以及APS 813c。PPS 812由PPS ID“P0”标识。APS 813a是用于关于ALF的参数的APS,并由APS ID“A00”标识。APS 813b是用于关于SAO的参数的APS,并由APS ID“A10”标识。APS 813c是用于关于QM的参数的APS,并由APS ID“A20”标识。附到图片P0中的片段数据的片段头814包括参考PPS ID“P0”,这表示PPS 812中的参数被参考以解码该片段数据。类似地,片段头814包括参考APS_ALF ID "A00"、参考APS_SAO ID "A10"以及参考APS_QM ID"A20",并且这些表示APS 813a、813b和813c中的参数被参考以解码片段数据。28 , SPS 811, PPS 812, APS 813a, APS 813b, and APS 813c are inserted at the beginning of picture P0 at the beginning of the sequence. PPS 812 is identified by PPS ID "P0." APS 813a is an APS for parameters related to ALF and is identified by APS ID "A00." APS 813b is an APS for parameters related to SAO and is identified by APS ID "A10." APS 813c is an APS for parameters related to QM and is identified by APS ID "A20." The slice header 814 attached to the slice data in picture P0 includes a reference to PPS ID "P0," indicating that the parameters in PPS 812 are referenced to decode the slice data. Similarly, the slice header 814 includes reference APS_ALF ID “A00”, reference APS_SAO ID “A10”, and reference APS_QM ID “A20”, and these indicate that parameters in the APSs 813a, 813b, and 813c are referenced to decode the slice data.

APS 815a和APS 815b被插入到在图片P0之后的图片P1中。APS 815a是用于关于ALF的参数的APS,并由APS ID“A01”标识。APS 815b是用于关于SAO的参数的APS,并由APSID“A11”标识。由于关于QM的参数未从图片P0更新,从而未插入用于关于QM的参数的APS。附到图片P1中的片段数据的片段头816包括参考APS_ALF ID "A01"、参考APS_SAO ID "A11",和参考APS_QM ID "A20"。这些表示APS 815a、815b和813c中的参数被参考以解码片段数据。APS 815a and APS 815b are inserted into picture P1, which follows picture P0. APS 815a is an APS for parameters related to ALF and is identified by APS ID "A01." APS 815b is an APS for parameters related to SAO and is identified by APS ID "A11." Since the parameters related to QM are not updated from picture P0, no APS for QM parameters is inserted. The slice header 816 attached to the slice data in picture P1 includes reference APS_ALF ID "A01," reference APS_SAO ID "A11," and reference APS_QM ID "A20." These indicate that the parameters in APSs 815a, 815b, and 815c are referenced to decode the slice data.

APS 817a和APS 817c被插入到在图片P1之后的图片P2中。APS 817a是用于关于ALF的参数的APS,并由APS ID“A02”标识。APS 817c是用于关于QM的参数的APS,并由APS ID“A21”标识。由于关于SAO的参数未从图片P1更新,从而未插入用于关于SAO的参数的APS。附到图片P2中的片段数据的片段头818包括参考APS_ALF ID "A02"、参考APS_SAO ID "A11",和参考APS_QM ID "A21"。这些表示APS 817a、815b和817c中的参数被参考以解码片段数据。APS 817a and APS 817c are inserted into picture P2, which follows picture P1. APS 817a is an APS for parameters related to ALF and is identified by APS ID "A02." APS 817c is an APS for parameters related to QM and is identified by APS ID "A21." Since SAO parameters are not updated from picture P1, no APS for SAO parameters is inserted. The slice header 818 attached to the slice data in picture P2 includes reference APS_ALF ID "A02," reference APS_SAO ID "A11," and reference APS_QM ID "A21." These indicate that the parameters in APSs 817a, 817b, and 817c are referenced to decode the slice data.

第二技术中的用于关于QM的参数的APS(例如APS 813c和817c)大致等于前述QMPS。用于关于QM的参数的APS的APS ID被用于代替利用图3描述的QMPS ID。根据第二技术,由于对每种参数使用不同的APS,对于不需要更新的参数不进行对冗余参数的传输。从而可以优化编码效率。然而,在第二技术中,由于将包含到APS中的参数类别增多,从而NAL单元类型(nal_unit_type)、用于标识APS类别的标识符的分类增大。在HEVC的标准说明中,保留有限定数目的NAL单元类型(nal_unit_type)用于扩展。从而,有利地是,考虑避免扩展用于APS的多个NAL单元类型的结构。The APS for QM-related parameters in the second technique (e.g., APSs 813c and 817c) is approximately equivalent to the aforementioned QMPS. The APS ID of the APS for QM-related parameters is used instead of the QMPS ID described using FIG. 3 . According to the second technique, since a different APS is used for each parameter, redundant parameters are not transmitted for parameters that do not need to be updated. This allows for optimization of coding efficiency. However, in the second technique, as the number of parameter categories to be included in the APS increases, the number of NAL unit types (nal_unit_type), identifiers used to identify APS categories, increases. The HEVC standard specification reserves a limited number of NAL unit types (nal_unit_type) for extension. Therefore, it is advantageous to consider avoiding the expansion of a structure with multiple NAL unit types used for the APS.

[6-4.第三技术][6-4. Third Technology]

第三技术为这样的技术,其中在APS中包含量化矩阵参数和其它参数,并通过从APS ID单独定义的各个标识符分组这些参数。在该说明书中,将该分配到每个组并从APSID单独定义的标识符称为辅助标识符(SUB ID)。使用片段头中的辅助标识符参考每个参数。图29示出根据第三技术配置的编码流的实例。The third technique includes quantization matrix parameters and other parameters in the APS, and groups these parameters using identifiers defined independently from the APS ID. In this specification, the identifier assigned to each group and defined independently from the APS ID is referred to as a sub-identifier (SUB ID). Each parameter is referenced using the sub-identifier in the slice header. Figure 29 shows an example of a coded stream configured according to the third technique.

参考图29,在位于序列的开始的图片P0的开始处插入SPS 821、PPS 822以及APS823。PPS 822由PPS ID“P0”标识。APS 823包括关于ALF的参数、关于SAO的参数、以及关于QM的参数。关于ALF的参数属于一个组并由用于ALF的辅助标识符SUB_ALF ID "AA0"标识。关于SAO的参数属于一个组并由用于SAO的辅助标识符SUB_SAO ID "AS0"标识。关于QM的参数属于一个组并由用于QM的辅助标识符SUB_QM ID "AQ0"标识。附到图片P0中的片段数据的片段头824包括参考SUB_ALF ID "AA0"、参考SUB_SAO ID "AS0"及参考SUB_QM ID "AQ0"。这表示,属于SUB_ALF ID "AA0"的关于ALF的参数、属于SUB_SAO ID "AS0"的关于SAO的参数、以及属于SUB_QM ID "AQ0" 的关于QM的参数被参考以解码该片段数据。29 , an SPS 821, a PPS 822, and an APS 823 are inserted at the beginning of picture P0 at the beginning of the sequence. PPS 822 is identified by PPS ID "P0." APS 823 includes parameters related to ALF, SAO, and QM. The ALF parameters belong to a single group and are identified by a sub-identifier for ALF, SUB_ALF ID "AA0." The SAO parameters belong to a single group and are identified by a sub-identifier for SAO, SUB_SAO ID "AS0." The QM parameters belong to a single group and are identified by a sub-identifier for QM, SUB_QM ID "AQ0." The slice header 824 attached to the slice data in picture P0 includes references to SUB_ALF ID "AA0," SUB_SAO ID "AS0," and SUB_QM ID "AQ0." This means that parameters regarding ALF belonging to SUB_ALF ID “AA0”, parameters regarding SAO belonging to SUB_SAO ID “AS0”, and parameters regarding QM belonging to SUB_QM ID “AQ0” are referenced to decode the segment data.

APS 825被插入到在图片P0之后的图片P1中。APS 825包括关于ALF的参数和关于SAO的参数。关于ALF的参数由SUB_ALF ID "AA1"标识。关于SAO的参数由SUB_SAO ID "AS1"标识。由于关于QM的参数未从图片P0更新,从而在APS 825中不包括关于QM的参数。附到图片P1中的片段数据的片段头826包括参考SUB_ALF ID "AA1"、参考SUB_SAO ID "AS1"及参考SUB_QM ID "AQ0"。这表示,APS 825中的属于SUB_ALF ID "AA1"的关于ALF的参数、属于SUB_SAO ID "AS1"的关于SAO的参数、以及APS 823中的属于SUB_QM ID "AQ0" 的关于QM的参数被参考以解码该片段数据。APS 825 is inserted into picture P1, which follows picture P0. APS 825 includes parameters related to ALF and SAO. ALF parameters are identified by SUB_ALF ID "AA1." SAO parameters are identified by SUB_SAO ID "AS1." Since QM parameters are not updated from picture P0, QM parameters are not included in APS 825. The slice header 826 attached to the slice data in picture P1 includes a reference to SUB_ALF ID "AA1," a reference to SUB_SAO ID "AS1," and a reference to SUB_QM ID "AQ0." This indicates that the ALF parameters associated with SUB_ALF ID "AA1" in APS 825, the SAO parameters associated with SUB_SAO ID "AS1," and the QM parameters associated with SUB_QM ID "AQ0" in APS 823 are referenced for decoding the slice data.

APS 827被插入到在图片P1之后的图片P2中。APS 827包括关于ALF的参数和关于QM的参数。关于ALF的参数由SUB_ALF ID "AA2"标识。关于QM的参数由SUB_QM ID "AQ1"标识。由于关于SAO的参数未从图片P1更新,从而在APS 827中不包括关于SAO的参数。附到图片P2中的片段数据的片段头828包括参考SUB_ALF ID "AA2"、参考SUB_SAO ID "AS1"及参考SUB_QM ID "AQ1"。这表示,APS 827中的属于SUB_ALF ID "AA2"的关于ALF的参数、属于SUB_QM ID "AQ1"的关于QM的参数、以及APS 825中的属于SUB_SAO ID "AS1" 的关于SAO的参数被参考以解码该片段数据。APS 827 is inserted into picture P2, which follows picture P1. APS 827 includes parameters related to the ALF and QM. The ALF parameters are identified by the SUB_ALF ID "AA2." The QM parameters are identified by the SUB_QM ID "AQ1." Since the SAO parameters are not updated from picture P1, the SAO parameters are not included in APS 827. The slice header 828 attached to the slice data in picture P2 includes a reference to the SUB_ALF ID "AA2," a reference to the SUB_SAO ID "AS1," and a reference to the SUB_QM ID "AQ1." This indicates that the ALF parameters associated with the SUB_ALF ID "AA2," the QM parameters associated with the SUB_QM ID "AQ1," and the SAO parameters associated with the SUB_SAO ID "AS1" in APS 825 are referenced for decoding the slice data.

图30示出根据第三技术定义的APS语法的实例。在图30的行2到行4上,指定三个组存在标记"aps_adaptive_loop_filter_flag"、"aps_sample_adaptive_offset_flag"和"aps_qmatrix_flag"。组存在标记指示在APS中是否包括属于各个组的参数。尽管从图30的实例中的语法中省略APS ID,也可以在语法中增加用于标识APS的APS ID。在行12到行17上指定关于ALF的参数。行13上的"sub_alf_id"是用于ALF的辅助标识符。在行18到行24上指定关于SAO的参数。行19上的"sub_sao_id"是用于SAO的辅助标识符。在行25到行30上指定关于QM的参数。行26上的"sub_qmatrix_id"是用于QM的辅助标识符。行29上的“qmatrix_param()”是指定通过图13到20中的实例示出的量化矩阵参数的函数。FIG30 illustrates an example of the APS syntax defined according to the third technique. Lines 2 through 4 of FIG30 specify three group presence flags: "aps_adaptive_loop_filter_flag," "aps_sample_adaptive_offset_flag," and "aps_qmatrix_flag." The group presence flags indicate whether the APS includes parameters belonging to each group. Although the APS ID is omitted from the syntax in the example of FIG30 , it may be added to the syntax to identify the APS. Lines 12 through 17 specify parameters related to the ALF. "sub_alf_id" on line 13 is a supplementary identifier for the ALF. Lines 18 through 24 specify parameters related to the SAO. "sub_sao_id" on line 19 is a supplementary identifier for the SAO. Lines 25 through 30 specify parameters related to the QM. "sub_qmatrix_id" on line 26 is a supplementary identifier for the QM. “qmatrix_param()” on line 29 is a function that specifies the quantization matrix parameters shown by way of example in FIGS. 13 to 20 .

图31为示出根据第三技术定义的片段头语法的实例的说明图。在图31中的行5上,从将对该片段设置的参数中指定用于参考PPS中包括的参数的参考PPS ID。在行8上,从将对该片段设置的参数中指定用于参考关于ALF的参数的参考SUB_ALF ID。在行9上,从将对该片段设置的参数中指定用于参考关于SAO的参数的参考SUB_SAO ID。在行10上,从将对该片段设置的参数中指定用于参考关于QM的参数的参考SUB_QM ID。FIG31 is an explanatory diagram showing an example of the slice header syntax defined according to the third technique. In line 5 of FIG31 , a reference PPS ID for referencing parameters included in the PPS is specified from the parameters to be set for the slice. In line 8, a reference SUB_ALF ID for referencing parameters related to the ALF is specified from the parameters to be set for the slice. In line 9, a reference SUB_SAO ID for referencing parameters related to the SAO is specified from the parameters to be set for the slice. In line 10, a reference SUB_QM ID for referencing parameters related to the QM is specified from the parameters to be set for the slice.

在实施第三技术的情况中,图像编码装置10的语法处理部13的参数生成部130在每次更新量化矩阵参数时向更新的一组量化矩阵参数的附加新的SUB_QM ID作为辅助标识符。插入部130然后将具有附加的SUB_QM ID的量化矩阵参数与其它参数一起插入到APS中。图像解码装置60的语法处理部61的参数获取部160使用在片段头中指定的参考SUB_QM ID以从APS获取将对每个片段设置的量化矩阵参数。When implementing the third technique, the parameter generation unit 130 of the syntax processing unit 13 of the image encoding device 10 appends a new SUB_QM ID as an auxiliary identifier to the updated set of quantization matrix parameters each time the quantization matrix parameters are updated. The insertion unit 130 then inserts the quantization matrix parameters with the appended SUB_QM ID into the APS along with other parameters. The parameter acquisition unit 160 of the syntax processing unit 61 of the image decoding device 60 uses the reference SUB_QM ID specified in the slice header to acquire the quantization matrix parameters to be set for each slice from the APS.

根据第三技术,通过使用辅助标识符在APS中分组参数,对于不需要更新的组中的参数不进行对冗余参数的传输。从而可以优化编码效率。另外,由于APS的类别即使在参数类别增加时也不增加,从而大数目的NAL单元类型不是像上述第二技术那样消耗。从而,第三技术不会危害未来扩展的灵活性。According to the third technique, by grouping parameters in an APS using auxiliary identifiers, redundant parameters are not transmitted for parameters in a group that do not need to be updated. This optimizes coding efficiency. Furthermore, since the number of APS types does not increase even when the number of parameter types increases, the large number of NAL unit types is not consumed as in the second technique described above. Consequently, the third technique does not compromise flexibility for future expansion.

在图29到31的实例中,APS中包括的参数根据关于ALF、SAO和QM的编码工具被分组。然而,这仅仅是分组参数的一个实例。APS可以包括关于其它编码工具的参数。例如,诸如用于自适应内插滤波(AIF)的滤波系数的关于AIF的参数为可包含到APS中的参数的一个实例。下文中,将参考图32描述用于对将被包含到APS中的参数进行分组的各种标准。In the examples of Figures 29 to 31, the parameters included in the APS are grouped according to the coding tools related to ALF, SAO, and QM. However, this is merely one example of grouping parameters. The APS may include parameters related to other coding tools. For example, parameters related to the adaptive interpolation filter (AIF), such as filter coefficients for the AIF, are one example of parameters that may be included in the APS. Hereinafter, various criteria for grouping parameters to be included in the APS will be described with reference to Figure 32.

图32所示的表列出“参数内容”、“更新频率”和“数据尺寸”作为通常的编码工具中的各个参数的特征。The table shown in FIG32 lists “parameter contents”, “update frequency”, and “data size” as characteristics of each parameter in a general encoding tool.

自适应环路滤波(ALF)是这样的滤波(通常为Wiener滤波),其通过自适应地确定的滤波系数对解码图像进行二维滤波,以最小化解码图像与原始图像之间的误差。关于ALF的参数包括将应用于每个块的滤波系数和用于每个编码单元(CU)的打开/关闭标记。ALF滤波系数的数据尺寸相比于其它类别的参数极大。为此,通常对高速率I图片传输关于ALF的参数,而可以省略对低速率B图片传输关于ALF的参数。这是因为,从增益的角度看,对低速率图片传输具有大数据尺寸的关于ALF的参数是效率低下的。在大多数情况中,ALF滤波系数对于每个图片变化。由于滤波系数取决于图像内容,能够再使用在前设置的滤波系数的可能性较低。Adaptive loop filtering (ALF) is a type of filtering (typically a Wiener filter) that applies two-dimensional filtering to the decoded image using adaptively determined filter coefficients to minimize the error between the decoded image and the original image. ALF parameters include the filter coefficients applied to each block and an on/off flag for each coding unit (CU). The data size of ALF filter coefficients is significantly larger than that of other types of parameters. For this reason, ALF parameters are typically transmitted for high-rate I pictures, while transmission of ALF parameters for low-rate B pictures can be omitted. This is because transmitting large ALF parameters for low-rate pictures is inefficient from a performance perspective. In most cases, the ALF filter coefficients change for each picture. Because the filter coefficients depend on the image content, the likelihood of reusing previously set filter coefficients is low.

样本适应性偏置(SAO)是通过对解码图像中的每个像素值增加适应性确定的偏置值而改善解码图像的图像质量的工具。关于SAO的参数包括偏置图案和偏置值。关于SAO的参数的数据尺寸没有关于ALF的参数大。作为通常的规则,关于SAO的参数同样对于每个图片变化。然而,由于关于SAO的参数具有即使在图像内容略微变化时也不会变化太多的性质,存在能够再使用在前设置的参数值的可能。Sample Adaptive Offset (SAO) is a tool for improving the image quality of decoded images by adding an adaptively determined offset value to each pixel value in the decoded image. SAO parameters include an offset pattern and an offset value. The data size of SAO parameters is smaller than that of ALF parameters. Generally speaking, SAO parameters also change for each picture. However, since SAO parameters do not change significantly even with slight changes in image content, it is possible to reuse previously set parameter values.

量化矩阵(QM)是这样的矩阵,其元素为在量化通过正交变换从图像数据变换的变换系数时使用的量化尺度。关于QM的参数,或者换句话说,量化矩阵参数,如该说明书中详细所述。关于QM的参数的数据尺寸大于关于SAO的参数。作为通常的规则,对于全部图片需要量化矩阵,但是如果图像内容改变不大,不需要对每个图片更新所述量化矩阵。为此,可以对相同的图片类型(诸如I/P/B图片),或对于每个GOP再使用量化矩阵。The quantization matrix (QM) is a matrix whose elements are the quantization scale used when quantizing transform coefficients transformed from image data through an orthogonal transform. The parameters regarding the QM, or in other words, the quantization matrix parameters, are described in detail in this specification. The data size of the parameters regarding the QM is larger than that of the parameters regarding the SAO. As a general rule, the quantization matrix is required for all pictures, but if the image content does not change significantly, it is not necessary to update the quantization matrix for each picture. For this reason, the quantization matrix can be reused for the same picture type (such as I/P/B pictures) or for each GOP.

适应性内插滤波(AIF)是对每个子像素位置适应性地改变在运动补偿期间使用的内插滤波的滤波系数。关于AIF的参数包括用于各个子像素位置的滤波系数。关于AIF的参数的数据尺寸小于上述三种参数。作为通常的规则,关于AIF的参数对于每个图片变化。然而,由于相同的图片类型趋向于具有类似的内插性质,可以对相同的图片类型(诸如I/P/B图片)再使用关于AIF的参数。Adaptive interpolation filtering (AIF) adaptively changes the filter coefficients of the interpolation filter used during motion compensation for each sub-pixel position. AIF parameters include the filter coefficients for each sub-pixel position. The data size of AIF parameters is smaller than the three parameters mentioned above. Generally speaking, AIF parameters vary for each picture. However, since pictures of the same type tend to have similar interpolation properties, AIF parameters can be reused for the same picture type (such as I/P/B pictures).

基于上述参数质量,为了分组APS中包括的参数,可以采用例如下面三种标准:Based on the above parameter qualities, the following three criteria may be used to group the parameters included in the APS:

标准A)根据编码工具分组Criterion A) Grouping by Coding Tool

标准B)根据更新频率分组Criterion B) Grouping by update frequency

标准C)根据参数再使用的可能性分组。Criterion C) Grouping by the likelihood of parameter reuse.

标准A是根据其相关的编码工具对参数分组的标准。图29到31中示例示出的参数集结构基于标准A。由于通常根据参数的相关编码工具确定参数的质量,通过编码工具对参数分组使得可以根据参数的各种质量进行及时、有效的参数更新。Criteria A groups parameters according to their associated coding tools. The parameter set structures illustrated in Figures 29 through 31 are based on Criteria A. Because parameter quality is typically determined based on its associated coding tool, grouping parameters by coding tool allows for timely and efficient parameter updates based on their varying qualities.

标准B是根据其更新频率对参数分组的标准。如图32所示,作为通常的规则,关于ALF的参数、关于SAO的参数以及关于AIF的参数都可以在每个图片更新。从而,可以将这些参数分组到单个组中,而将关于QM的参数分组到另一个组中。在该情况中,相比于标准A,存在较少的组。从而,在片段头存在较少的需要指定的辅助标识符,并且可以减少片段头的代码量。同时,由于属于相同组的参数的更新频率彼此相似,冗余地传输不更新的参数以更新其它参数的可能性较低。Standard B groups parameters according to their update frequency. As shown in Figure 32, as a general rule, parameters related to ALF, SAO, and AIF can all be updated for each picture. Therefore, these parameters can be grouped into a single group, while parameters related to QM can be grouped into another group. In this case, there are fewer groups compared to Standard A. Consequently, there are fewer auxiliary identifiers that need to be specified in the slice header, and the amount of code in the slice header can be reduced. Furthermore, since the update frequencies of parameters belonging to the same group are similar, the likelihood of redundantly transmitting parameters that are not updated in order to update other parameters is low.

标准C是根据参数再使用的可能性对参数分组的标准。尽管关于ALF的参数不大可能被再使用,然而关于SAO的参数和关于AIF的参数一定程度上有可能被再使用。利用关于QM的参数,参数在多个图片上非常可能被再使用。从而,通过这样根据参数的再使用的可能性对参数分组,可以避免对APS中的再使用参数的冗余传输。Criterion C groups parameters based on their potential for reuse. While ALF parameters are unlikely to be reused, SAO and AIF parameters are somewhat likely to be reused. QM parameters are highly likely to be reused across multiple pictures. Therefore, by grouping parameters based on their potential for reuse, redundant transmission of reused parameters in APS can be avoided.

[6-5.第三技术的示例修改][6-5. Example modification of the third technique]

通过上述第三技术,如图31示例示出,参数被分组到APS中的组的数量导致在片段头中指定的相同数目的参考SUB ID。参考SUB ID要求的代码量与片段头数目以及组数的乘积近似成比例。还可以实现根据下文的示例修改的技术,以进一步减少该比率。With the third technique described above, as shown in the example of FIG31 , the number of groups into which parameters are grouped in the APS results in the same number of reference SUB IDs being specified in the slice header. The amount of code required for the reference SUB IDs is approximately proportional to the product of the number of slice headers and the number of groups. A modified technique, as shown in the example below, can also be implemented to further reduce this ratio.

在第三技术的示例修改中,在APS或其它参数集中定义与辅助标识符的组合相关的组合ID。然后可以通过组合ID从片段头参考APS中包括的参数。图33示出根据第三技术的示例修改配置的编码流的实例。In an example modification of the third technique, a combination ID associated with a combination of auxiliary identifiers is defined in an APS or other parameter set. The parameters included in the APS can then be referenced from the slice header using the combination ID. FIG33 shows an example of a coded stream configured according to an example modification of the third technique.

参考图33,在位于序列的开始的图片P0的开始处插入SPS 831、PPS 832以及APS833。PPS 832由PPS ID“P0”标识。APS 833包括关于ALF的参数、关于SAO的参数、以及关于QM的参数。关于ALF的参数由SUB_ALF ID "AA0"标识。关于SAO的参数由SUB_SAO ID "AS0"标识。关于QM的参数由SUB_QM ID "AQ0"标识。另外,APS 833包括组合ID"C00"={AA0, AS0,AQ0}作为对组合的定义。附到图片P0中的片段数据的片段头834包括组合ID“C00”。这表示,分别与组合ID“C00”相关的属于SUB_ALF ID "AA0"的关于ALF的参数、属于SUB_SAO ID "AS0"的关于SAO的参数、以及属于SUB_QM ID "AQ0" 的关于QM的参数被参考以解码该片段数据。33 , SPS 831, PPS 832, and APS 833 are inserted at the beginning of picture P0, which is located at the beginning of the sequence. PPS 832 is identified by PPS ID "P0." APS 833 includes parameters related to ALF, SAO, and QM. Parameters related to ALF are identified by SUB_ALF ID "AA0." Parameters related to SAO are identified by SUB_SAO ID "AS0." Parameters related to QM are identified by SUB_QM ID "AQ0." Furthermore, APS 833 includes a combination ID "C00" = {AA0, AS0, AQ0} as a definition of a combination. The slice header 834 attached to the slice data in picture P0 includes the combination ID "C00." This means that parameters regarding ALF belonging to SUB_ALF ID "AA0", parameters regarding SAO belonging to SUB_SAO ID "AS0", and parameters regarding QM belonging to SUB_QM ID "AQ0" respectively associated with combination ID "C00" are referenced to decode the segment data.

APS 835被插入到在图片P0之后的图片P1中。APS 835包括关于ALF的参数和关于SAO的参数。关于ALF的参数由SUB_ALF ID "AA1"标识。关于SAO的参数由SUB_SAO ID "AS1"标识。由于关于QM的参数未从图片P0更新,从而在APS 835中不包括关于QM的参数。另外,APS 835包括组合ID"C01"={AA1, AS0, AQ0}、组合ID"C02"={AA0, AS1, AQ0}以及组合ID"C03"={AA1, AS1, AQ0}作为对组合的定义。附到图片P1中的片段数据的片段头836包括组合ID“C03”。这表示,分别与组合ID“C03”相关的属于SUB_ALF ID "AA1"的关于ALF的参数、属于SUB_SAO ID "AS1"的关于SAO的参数、以及属于SUB_QM ID "AQ0" 的关于QM的参数被参考以解码该片段数据。APS 835 is inserted into picture P1, which follows picture P0. APS 835 includes parameters related to ALF and parameters related to SAO. Parameters related to ALF are identified by SUB_ALF ID "AA1." Parameters related to SAO are identified by SUB_SAO ID "AS1." Since parameters related to QM are not updated from picture P0, they are not included in APS 835. In addition, APS 835 includes combination IDs "C01" = {AA1, AS0, AQ0}, "C02" = {AA0, AS1, AQ0}, and "C03" = {AA1, AS1, AQ0} as definitions of combinations. The slice header 836 attached to the slice data in picture P1 includes the combination ID "C03." This means that parameters regarding ALF belonging to SUB_ALF ID "AA1", parameters regarding SAO belonging to SUB_SAO ID "AS1", and parameters regarding QM belonging to SUB_QM ID "AQ0", which are respectively associated with combination ID "C03", are referenced to decode the segment data.

APS 837被插入到在图片P1之后的图片P2中。APS 837包括关于ALF的参数。关于ALF的参数由SUB_ALF ID "AA2"标识。由于关于SAQ的参数和关于QM的参数未从图片P1更新,从而在APS 837中不包括关于SAO的参数和关于QM的参数。另外,APS 837包括组合ID "C04"={AA2, AS0, AQ0} 和组合ID"C05"={AA2, AS1, AQ0}作为对组合的定义。附到图片P2中的片段数据的片段头838包括组合ID“C05”。这表示,分别与组合ID“C05”相关的属于SUB_ALF ID "AA2"的关于ALF的参数、属于SUB_SAO ID "AS1"的关于SAO的参数、以及属于SUB_QM ID "AQ0" 的关于QM的参数被参考以解码该片段数据。APS 837 is inserted into picture P2, which follows picture P1. APS 837 includes parameters related to the ALF. These parameters are identified by the SUB_ALF ID "AA2." Since the SAO and QM parameters are not updated from picture P1, APS 837 does not include SAO and QM parameters. Furthermore, APS 837 includes combination IDs "C04" = {AA2, AS0, AQ0} and "C05" = {AA2, AS1, AQ0} as definitions of combinations. The slice header 838 attached to the slice data in picture P2 includes the combination ID "C05." This indicates that the ALF parameters associated with SUB_ALF ID "AA2," the SAO parameters associated with SUB_SAO ID "AS1," and the QM parameters associated with SUB_QM ID "AQ0," respectively associated with combination ID "C05," are referenced for decoding the slice data.

注意,在该示例修改中,可以不对辅助标识符的全部组合定义组合ID,从而仅对在片段头中实际参考的辅助标识符的组合定义组合ID。另外,可以在与存储对应的参数的APS不同的APS中定义辅助标识符的组合。Note that in this example modification, combination IDs may not be defined for all combinations of auxiliary identifiers, and thus may be defined only for combinations of auxiliary identifiers actually referenced in the segment header. In addition, combinations of auxiliary identifiers may be defined in an APS different from the APS storing the corresponding parameters.

在实现该示例修改的情况中,图像编码装置10的语法处理部13的参数生成部130生成作为补充参数的组合ID,其将与对包括量化矩阵参数的各种参数分组的辅助标识符的组合相关联。插入部130然后将通过参数生成部130生成的组合ID插入到APS或其它参数集中。图像解码装置60的语法处理部61的参数获取部160获取在每个片段的片段头中指定的组合ID,并使用与该组合ID关联的辅助标识符以另外获取APS中的量化矩阵参数。In implementing this exemplary modification, the parameter generation unit 130 of the syntax processing unit 13 of the image encoding device 10 generates a combination ID as a supplementary parameter, which is associated with a combination of auxiliary identifiers that group various parameters, including the quantization matrix parameter. The insertion unit 130 then inserts the combination ID generated by the parameter generation unit 130 into the APS or other parameter set. The parameter acquisition unit 160 of the syntax processing unit 61 of the image decoding device 60 acquires the combination ID specified in the slice header of each slice and uses the auxiliary identifier associated with the combination ID to separately acquire the quantization matrix parameter in the APS.

这样,通过使用与辅助标识符的组合关联的组合ID以参考APS中的参数,可以减少从片段头参考每个参数所需的代码量。In this way, by using a combination ID associated with a combination of auxiliary identifiers to reference parameters in the APS, the amount of code required to reference each parameter from the slice header can be reduced.

7.实例应用7. Example Application

根据上述实施例的图像编码装置10和图像解码装置60可以应用到:各种电子装置,诸如卫星广播的发射器或接收器、诸如有线电视的有线广播、互联网分发、通过蜂窝通信向客户设备的分发等;用于将图像记录到介质上的记录装置,所述介质诸如为光盘、磁盘、或闪存;以及用于从这些存储介质回放图像的回放装置。下面将描述四个实例应用。The image encoding device 10 and image decoding device 60 according to the above-described embodiments can be applied to various electronic devices, such as transmitters and receivers for satellite broadcasting, cable broadcasting such as cable television, Internet distribution, and distribution to client devices via cellular communications; recording devices for recording images onto media such as optical disks, magnetic disks, or flash memory; and playback devices for playing back images from these storage media. Four example applications will be described below.

[7-1.第一实例应用][7-1. First Example Application]

图34是示出采用上述实施例的电视的示例示意配置的框图。电视900包括天线901、调谐器902、解复用器903、解码器904、视频信号处理部905、显示部906、音频信号处理部907、扬声器908、外部接口909、控制部910、用户接口911、以及总线912。34 is a block diagram showing an exemplary schematic configuration of a television employing the above-described embodiment. A television 900 includes an antenna 901, a tuner 902, a demultiplexer 903, a decoder 904, a video signal processing unit 905, a display unit 906, an audio signal processing unit 907, a speaker 908, an external interface 909, a control unit 910, a user interface 911, and a bus 912.

调谐器902从经天线901接收的广播信号提取希望信道的信号,并对提取的信号解调制。调谐器902然后将通过解调获得的编码比特流输出到解复用器903。即,调谐器902用作电视900的传输装置,用于接收其中图像被编码的编码流。The tuner 902 extracts a signal of a desired channel from the broadcast signal received via the antenna 901 and demodulates the extracted signal. The tuner 902 then outputs the encoded bit stream obtained by demodulation to the demultiplexer 903. That is, the tuner 902 serves as a transmission device of the television 900 for receiving an encoded stream in which an image is encoded.

解复用器903从编码比特流分离将被观看的节目的视频流和音频流,并将分离的流输出到解码器904。另外,解复用器903从编码比特流提取诸如电子节目指南(EPG)的辅助数据等,并将提取的数据提供到控制部910。另外,当编码比特流被加扰时,解复用器903可以进行解扰。The demultiplexer 903 separates the video stream and audio stream of the program to be viewed from the encoded bit stream, and outputs the separated streams to the decoder 904. In addition, the demultiplexer 903 extracts auxiliary data such as an electronic program guide (EPG) from the encoded bit stream, and supplies the extracted data to the control section 910. In addition, when the encoded bit stream is scrambled, the demultiplexer 903 can perform descrambling.

解码器904对从解复用器903输入的视频流和音频流进行解码。然后,解码器904将通过解码处理生成的视频数据输出到视频信号处理部905。另外,解码器904将通过解码处理生成的音频数据输出到音频信号处理部907。The decoder 904 decodes the video stream and audio stream input from the demultiplexer 903. The decoder 904 then outputs the video data generated by the decoding process to the video signal processing section 905. In addition, the decoder 904 outputs the audio data generated by the decoding process to the audio signal processing section 907.

视频信号处理部905回放从解码器904输入的视频数据,并使得显示部906显示视频。视频信号处理部905还可以使得显示部906显示经网络提供的应用画面。另外,视频信号处理部905可以进行其它处理,诸如根据设置在例如视频数据上的噪声移除。另外,视频信号处理部905可以生成图形用户界面(GUI)图像,诸如菜单、按钮或光标,并将生成的图像叠加在输出图像上。The video signal processing unit 905 plays back the video data input from the decoder 904 and causes the display unit 906 to display the video. The video signal processing unit 905 may also cause the display unit 906 to display an application screen provided via the network. Furthermore, the video signal processing unit 905 may perform other processing, such as noise removal according to settings on the video data. Furthermore, the video signal processing unit 905 may generate a graphical user interface (GUI) image, such as a menu, button, or cursor, and superimpose the generated image on the output image.

显示部906通过由视频信号处理部905提供的驱动信号驱动,并在显示装置(诸如液晶显示器、等离子显示器、或OLED显示器)的显示屏上显示视频或图像。The display section 906 is driven by the driving signal supplied from the video signal processing section 905 , and displays a video or an image on a display screen of a display device such as a liquid crystal display, a plasma display, or an OLED display.

音频信号处理部907在从解码器904输入的音频数据上执行诸如D/A转换和放大的回放处理,并从扬声器908输出音频。另外,音频信号处理部907可以进行其它处理,诸如在音频数据上的噪声移除。The audio signal processing section 907 performs playback processing such as D/A conversion and amplification on the audio data input from the decoder 904, and outputs audio from the speaker 908. In addition, the audio signal processing section 907 may perform other processing such as noise removal on the audio data.

外部接口909是用于将电视900连接到外部装置或网络的接口。例如,可通过解码器904对经外部接口909接收的视频流或音频流进行解码。即,外部接口909还用作电视900的传输装置,用于接收其中图像被编码的编码流。The external interface 909 is an interface for connecting the television 900 to an external device or a network. For example, a video stream or an audio stream received via the external interface 909 can be decoded by the decoder 904. That is, the external interface 909 also serves as a transmission device of the television 900 for receiving an encoded stream in which an image is encoded.

控制部910包括诸如中央处理单元(CPU)的处理器、和诸如随机存取存储器(RAM)和只读存储器(ROM)的存储器。存储器存储将通过CPU执行的程序、程序数据、EPG数据、经网络获得的数据等。当例如启动电视900时,通过CPU读取并执行存储器中存储的程序。通过执行程序,CPU根据从例如用户接口911输入的操作信号控制电视900的操作。The control unit 910 includes a processor such as a central processing unit (CPU), and memory such as random access memory (RAM) and read-only memory (ROM). The memory stores programs to be executed by the CPU, program data, EPG data, data obtained via a network, and the like. When the television 900 is turned on, for example, the CPU reads and executes the programs stored in the memory. By executing the programs, the CPU controls the operation of the television 900 based on operation signals input from, for example, the user interface 911.

用户接口911被连接到控制部910。用户接口911例如包括由用户使用以操作电视900的按钮和开关、以及远程控制信号接收器。用户接口911检测经这些结构元件的用户的操作,生成操作信号,并将生成的操作信号输出到控制部910。The user interface 911 is connected to the control unit 910. The user interface 911 includes, for example, buttons and switches used by the user to operate the television 900, and a remote control signal receiver. The user interface 911 detects user operations via these components, generates operation signals, and outputs the generated operation signals to the control unit 910.

总线912将调谐器902、解复用器903、解码器904、视频信号处理部905、音频信号处理部907、外部接口909以及控制部910相互连接。The bus 912 connects the tuner 902 , the demultiplexer 903 , the decoder 904 , the video signal processing section 905 , the audio signal processing section 907 , the external interface 909 , and the control section 910 to one another.

在这样配置的电视900中,解码器904包括根据上述实施例的图像解码装置60的功能。因此,可以缓解对于通过电视900解码的视频的编码效率的下降,或者改善编码效率。In the television 900 thus configured, the decoder 904 includes the function of the image decoding device 60 according to the above-described embodiment. Therefore, it is possible to alleviate a decrease in encoding efficiency for a video decoded by the television 900 or improve encoding efficiency.

[7-2.第二实例应用][7-2. Second Example Application]

图35是示出采用上述实施例的移动电话的示例示意配置的框图。移动电话920包括天线921、通信部922、音频编解码器923、扬声器924、麦克风925、相机部926、图像处理部927、多路复用/解复用(mux/demux)部928、记录和回放部929、显示部930、控制部931、操作部932以及总线933。FIG35 is a block diagram showing an exemplary schematic configuration of a mobile phone employing the above-described embodiment. The mobile phone 920 includes an antenna 921, a communication section 922, an audio codec 923, a speaker 924, a microphone 925, a camera section 926, an image processing section 927, a multiplexing/demultiplexing (mux/demux) section 928, a recording and playback section 929, a display section 930, a control section 931, an operation section 932, and a bus 933.

天线921被连接到通信部922。扬声器924和麦克风925被连接到音频编解码器923。操作部932被连接到控制部931。总线933将通信部922、音频编解码器923、相机部926、图像处理部927、mux/demux部928、记录和回放部929、显示器930以及控制部931相互连接。The antenna 921 is connected to the communication section 922. The speaker 924 and the microphone 925 are connected to the audio codec 923. The operation section 932 is connected to the control section 931. The bus 933 connects the communication section 922, the audio codec 923, the camera section 926, the image processing section 927, the mux/demux section 928, the recording and playback section 929, the display 930, and the control section 931 to each other.

移动电话920在各个操作模式中进行诸如如下的操作:传输和接收音频信号、传输和接收电子邮件或图像数据、拍摄图像、记录数据等,所述操作模式包括音频通信模式、数据通信模式、成像模式以及视频电话模式。The mobile phone 920 performs operations such as transmitting and receiving audio signals, transmitting and receiving emails or image data, capturing images, recording data, etc. in various operation modes including an audio communication mode, a data communication mode, an imaging mode, and a video phone mode.

在音频通信模式中,将通过麦克风925生成的模拟音频信号提供到音频编解码器923。音频编解码器923将模拟音频信号转换为音频数据,对转换的音频数据进行A/D转换,并压缩该音频数据。然后,音频编解码器923将压缩的音频数据输出给通信部922。通信部922对音频数据进行编码和调制,并生成传输信号。通信部922然后经天线921将生成的传输信号传输到基站(未示出)。另外,通信部922放大经天线921接收的无线信号,并转换无线信号的频率,并获取接收的信号。然后,通信部922对接收信号进行解调和解码并生成音频数据,并将生成的音频数据输出到音频编解码器923。音频编解码器923对所述音频数据进行解压缩和D/A转换,并生成模拟音频信号。然后,音频编解码器923将生成的音频信号提供到扬声器924以使得将音频输出。In audio communication mode, an analog audio signal generated by microphone 925 is provided to audio codec 923. Audio codec 923 converts the analog audio signal into audio data, performs A/D conversion on the converted audio data, and compresses the audio data. Audio codec 923 then outputs the compressed audio data to communication unit 922. Communication unit 922 encodes and modulates the audio data to generate a transmission signal. Communication unit 922 then transmits the generated transmission signal to a base station (not shown) via antenna 921. Furthermore, communication unit 922 amplifies a wireless signal received via antenna 921, converts the frequency of the wireless signal, and acquires the received signal. Communication unit 922 then demodulates and decodes the received signal to generate audio data, which it then outputs to audio codec 923. Audio codec 923 decompresses and D/A converts the audio data to generate an analog audio signal. Audio codec 923 then supplies the generated audio signal to speaker 924 for audio output.

另外,在数据通信模式中,例如,控制部931根据用户经操作部932的操作生成构成电子邮件的文本数据。另外,控制部931使得在显示部930上显示文本。另外,控制部931根据经操作部932的来自用户的传输指令生成电子邮件数据,并将生成的电子邮件数据输出到通信部922。通信部922对电子邮件数据进行编码和调制,并生成传输信号。通信部922然后经天线921将生成的传输信号传输到基站(未示出)。另外,通信部922放大经天线921接收的无线信号,并转换无线信号的频率,并获取接收的信号。然后,通信部922对接收信号进行解调和解码并重构电子邮件数据,并将重构的电子邮件数据输出到控制部931。控制部931使得显示部930显示电子邮件的内容,并还使得将电子邮件数据存储到记录和回放部929的存储介质中。In data communication mode, for example, the control unit 931 generates text data constituting an email based on user operations via the operation unit 932. Furthermore, the control unit 931 causes the text to be displayed on the display unit 930. Furthermore, the control unit 931 generates email data based on a transmission instruction from the user via the operation unit 932 and outputs the generated email data to the communication unit 922. The communication unit 922 encodes and modulates the email data to generate a transmission signal. The communication unit 922 then transmits the generated transmission signal to a base station (not shown) via the antenna 921. Furthermore, the communication unit 922 amplifies the wireless signal received via the antenna 921, converts the frequency of the wireless signal, and acquires the received signal. The communication unit 922 then demodulates and decodes the received signal to reconstruct the email data and outputs the reconstructed email data to the control unit 931. The control unit 931 causes the display unit 930 to display the contents of the email and also causes the email data to be stored in the storage medium of the recording and playback unit 929.

记录和回放部929包括可任意读取和写入的存储介质。例如,存储介质可以是诸如RAM的内置存储介质、或闪存、或外部安装的存储介质,诸如硬盘、磁盘、磁光盘、光盘、USB存储器或存储卡。The recording and playback section 929 includes a storage medium that can be read and written arbitrarily. For example, the storage medium may be a built-in storage medium such as RAM, or a flash memory, or an externally mounted storage medium such as a hard disk, a magnetic disk, a magneto-optical disk, an optical disk, a USB memory, or a memory card.

另外,在成像模式中,相机部926拍摄被摄体的图像,生成图像数据,并将生成的图像数据输出到例如图像处理部927。图像处理部927对从相机部926输入的图像数据编码,并使得在记录和回放部929的存储介质上存储编码流。In addition, in the imaging mode, the camera section 926 captures an image of a subject, generates image data, and outputs the generated image data to, for example, the image processing section 927. The image processing section 927 encodes the image data input from the camera section 926 and causes the encoded stream to be stored on the storage medium of the recording and playback section 929.

另外,在视频电话模式中,例如,mux/demux部928对通过图像处理部927编码的视频流和从音频编解码器923输入的音频流进行多路复用,并将多路复用的流输出到通信部922。通信部922对该流进行编码和调制,并生成传输信号。通信部922然后经天线921将生成的传输信号传输到基站(未示出)。另外,通信部922放大经天线921接收的无线信号,并转换无线信号的频率,并获取接收的信号。所述传输信号和接收信号可包括编码比特流。然后,通信部922对接收信号进行解调和解码并重构所述流,并将重构的流输出到mux/demux部928。mux/demux部928从输入流分离视频流和音频流,并将视频流输出到图像处理部927且将音频流输出到音频编解码器923。图像处理部927解码视频流以生成视频数据。视频数据被提供到显示部930,并通过显示部930显示一系列图像。音频编解码器923对所述音频流进行解压缩和D/A转换,并生成模拟音频信号。然后,音频编解码器923将生成的音频信号提供到扬声器924以使得将音频输出。In videophone mode, for example, the mux/demux unit 928 multiplexes the video stream encoded by the image processing unit 927 and the audio stream input from the audio codec 923, and outputs the multiplexed stream to the communication unit 922. The communication unit 922 encodes and modulates the stream to generate a transmission signal. The communication unit 922 then transmits the generated transmission signal to a base station (not shown) via the antenna 921. Furthermore, the communication unit 922 amplifies the wireless signal received via the antenna 921, converts the frequency of the wireless signal, and obtains the received signal. The transmission signal and the received signal may include an encoded bit stream. The communication unit 922 then demodulates and decodes the received signal to reconstruct the stream and output the reconstructed stream to the mux/demux unit 928. The mux/demux unit 928 separates the video stream and the audio stream from the input stream, outputs the video stream to the image processing unit 927, and the audio stream to the audio codec 923. The image processing unit 927 decodes the video stream to generate video data. The video data is supplied to the display unit 930, and a series of images are displayed on the display unit 930. The audio codec 923 decompresses and D/A converts the audio stream and generates an analog audio signal. The audio codec 923 then supplies the generated audio signal to the speaker 924 so that the audio is output.

在这样配置的移动电话920中,图像处理部927具有根据上述实施例的图像编码装置10和图像解码装置60的功能。因此,可以缓解对于通过移动电话920编码和解码的视频的编码效率的下降,或者改善编码效率。In the mobile phone 920 thus configured, the image processing section 927 has the functions of the image encoding device 10 and the image decoding device 60 according to the above-described embodiment. Therefore, it is possible to alleviate a decrease in encoding efficiency for videos encoded and decoded by the mobile phone 920 or improve encoding efficiency.

[7-3.第三实例应用][7-3. Third Example Application]

图36是示出采用上述实施例的记录和回放装置的示例示意配置的框图。记录和回放装置940编码和在记录介质上记录例如接收的广播节目的音频数据和视频数据。记录和回放装置940还可以编码和在记录介质上记录例如从其它装置获取的音频数据和视频数据。另外,记录和回放装置940例如根据来自用户的指令通过监视器和扬声器回放记录在记录介质上的数据。此时,记录和回放装置940对音频数据和视频数据解码。FIG36 is a block diagram illustrating an exemplary schematic configuration of a recording and playback device employing the above-described embodiment. The recording and playback device 940 encodes and records, for example, audio and video data of a received broadcast program onto a recording medium. The recording and playback device 940 can also encode and record, for example, audio and video data acquired from another device onto a recording medium. Furthermore, the recording and playback device 940 plays back data recorded on the recording medium via a monitor and speakers, for example, in response to user instructions. In this case, the recording and playback device 940 decodes the audio and video data.

记录和回放装置940包括调谐器941、外部接口942、编码器943、硬盘驱动器(HDD)944、盘驱动器945、选择器946、解码器947、屏幕上显示(OSD)948、控制部949、以及用户接口950。The recording and playback device 940 includes a tuner 941 , an external interface 942 , an encoder 943 , a hard disk drive (HDD) 944 , a disk drive 945 , a selector 946 , a decoder 947 , an on-screen display (OSD) 948 , a control section 949 , and a user interface 950 .

调谐器941从经天线(未示出)接收的广播信号提取希望信道的信号,并对提取的信号解调。调谐器941然后将通过解调获得的编码比特流输出到选择器946。即,调谐器941用作记录和回放装置940的传输装置。The tuner 941 extracts a signal of a desired channel from a broadcast signal received via an antenna (not shown) and demodulates the extracted signal. The tuner 941 then outputs a coded bit stream obtained by demodulation to the selector 946. That is, the tuner 941 functions as a transmission device of the recording and playback device 940.

外部接口942是用于将记录和回放装置940连接到外部装置或网络的接口。外部接口942例如可以为IEEE 1394接口、网络接口、USB接口、闪存接口等等。例如,经外部接口942接收的音频数据和视频数据被输入编码器943。即,外部接口942用作记录和回放装置940的传输装置。The external interface 942 is an interface for connecting the recording and playback device 940 to an external device or a network. The external interface 942 may be, for example, an IEEE 1394 interface, a network interface, a USB interface, a flash memory interface, or the like. For example, audio data and video data received via the external interface 942 are input to the encoder 943. That is, the external interface 942 serves as a transmission device of the recording and playback device 940.

在从外部接口942输入的视频数据和音频数据未被编码的情况中,编码器943对该视频数据和音频数据进行编码。编码器943然后将编码的比特流输出到选择器946。In the case where the video data and audio data input from the external interface 942 are not encoded, the encoder 943 encodes the video data and audio data, and then outputs the encoded bit stream to the selector 946 .

HDD 944在内部硬盘上记录编码比特流,其为诸如视频或音频、各种程序及其它数据的压缩内容数据。另外,HDD 944在回放视频和音频时从硬盘读取该数据。The HDD 944 records a coded bit stream, which is compressed content data such as video or audio, various programs and other data, on an internal hard disk. In addition, the HDD 944 reads the data from the hard disk when playing back video and audio.

盘驱动器945相对于插入的记录介质记录或读取数据。插入到盘驱动器945中的记录介质可以是例如DVD盘(诸如 DVD-视频、DVD-RAM、DVD-R、DVD-RW、DVD+或DVD+RW盘)、蓝光盘(注册商标)等。The disk drive 945 records or reads data with respect to a recording medium inserted. The recording medium inserted into the disk drive 945 may be, for example, a DVD disk (such as a DVD-Video, DVD-RAM, DVD-R, DVD-RW, DVD+, or DVD+RW disk), a Blu-ray Disc (registered trademark), or the like.

在记录视频和音频时,选择器946选择从调谐器941或编码器943输入的编码比特流,并将选择的编码比特流输出到HDD 944或盘驱动器945。另外,在回放视频和音频时,选择器946将从HDD 944或盘驱动器945输入的编码比特流输出到解码器947。When recording video and audio, the selector 946 selects an encoded bit stream input from the tuner 941 or the encoder 943 and outputs the selected encoded bit stream to the HDD 944 or the disk drive 945. In addition, when playing back video and audio, the selector 946 outputs the encoded bit stream input from the HDD 944 or the disk drive 945 to the decoder 947.

解码器947对编码比特流解码,并生成音频数据和视频数据。解码器947然后将生成的视频数据输出到OSD 948。另外,解码器904将生成的音频数据输出到外部扬声器。The decoder 947 decodes the encoded bit stream and generates audio data and video data. The decoder 947 then outputs the generated video data to the OSD 948. In addition, the decoder 904 outputs the generated audio data to an external speaker.

OSD 948回放从解码器947输入的视频数据,并显示视频。另外,OSD 948可以在显示视频上叠加GUI图像,诸如菜单、按钮或光标。The OSD 948 plays back the video data input from the decoder 947 and displays the video. In addition, the OSD 948 may superimpose a GUI image such as a menu, a button, or a cursor on the displayed video.

控制部949包括诸如CPU的处理器和诸如RAM或ROM的存储器。存储器存储通过CPU执行的程序、程序数据等。当例如启动记录和回放装置940时,通过CPU读取并执行存储器中存储的程序。通过执行程序,CPU根据从例如用户接口950输入的操作信号控制记录和回放装置940的操作。The control unit 949 includes a processor such as a CPU and a memory such as a RAM or ROM. The memory stores programs executed by the CPU, program data, and the like. When the recording and playback device 940 is activated, for example, the CPU reads and executes the program stored in the memory. By executing the program, the CPU controls the operation of the recording and playback device 940 based on an operation signal input from, for example, the user interface 950.

用户接口950被连接到控制部949。用户接口950例如包括由用户使用以操作记录和回放装置940的按钮和开关、以及远程控制信号接收器。用户接口950检测经这些结构元件的用户的操作,生成操作信号,并将生成的操作信号输出到控制部949。The user interface 950 is connected to the control unit 949. The user interface 950 includes, for example, buttons and switches used by the user to operate the recording and playback device 940, and a remote control signal receiver. The user interface 950 detects the user's operation via these structural elements, generates an operation signal, and outputs the generated operation signal to the control unit 949.

在这样配置的记录和回放装置940中,编码器943包括根据上述实施例的图像编码装置10的功能。另外,解码器947具有根据上述实施例的图像解码装置60的功能。因此,可以缓解对于通过记录和回放装置940编码和解码的视频的编码效率的下降,或者改善编码效率。In the recording and playback device 940 configured in this manner, the encoder 943 includes the functions of the image encoding device 10 according to the above-described embodiment. In addition, the decoder 947 has the functions of the image decoding device 60 according to the above-described embodiment. Therefore, it is possible to alleviate a decrease in the encoding efficiency of a video encoded and decoded by the recording and playback device 940, or to improve the encoding efficiency.

[7-4.第四实例应用][7-4. Fourth Example Application]

图37是示出采用上述实施例的成像装置的示例示意配置的框图。成像装置960拍摄被摄体的图像,生成图像,对图像数据编码,并将图像数据记录在记录介质上。37 is a block diagram showing an exemplary schematic configuration of an imaging device employing the above-described embodiment. The imaging device 960 captures an image of a subject, generates an image, encodes the image data, and records the image data on a recording medium.

成像装置960包括光学块961、成像部962、信号处理部963、图像处理部964、显示部965、外部接口966、存储器967、介质驱动器968、OSD 969、控制部970、用户接口971、以及总线972。The imaging device 960 includes an optical block 961 , an imaging section 962 , a signal processing section 963 , an image processing section 964 , a display section 965 , an external interface 966 , a memory 967 , a media drive 968 , an OSD 969 , a control section 970 , a user interface 971 , and a bus 972 .

光学块961被连接到成像部962。成像部962被连接到信号处理部963。显示部965被连接到图像处理部964。用户接口971被连接到控制部970。总线972将图像处理部964、外部接口966、存储器967、介质驱动器968、OSD 969以及控制部970相互连接。The optical block 961 is connected to the imaging section 962. The imaging section 962 is connected to the signal processing section 963. The display section 965 is connected to the image processing section 964. The user interface 971 is connected to the control section 970. The bus 972 connects the image processing section 964, the external interface 966, the memory 967, the media drive 968, the OSD 969, and the control section 970 to each other.

光学块961具有聚焦透镜、孔径光阑机构等。光学块961在成像部962的成像表面上形成被摄体的光学图像。成像部962包括诸如CCD或CMOS传感器的图像传感器,并将在成像表面上形成的光学图像光电地转换为作为电信号的图像信号。然后,成像部962把图像信号输出到信号处理部963。The optical block 961 includes a focusing lens, an aperture stop mechanism, and the like. The optical block 961 forms an optical image of a subject on the imaging surface of the imaging section 962. The imaging section 962 includes an image sensor such as a CCD or CMOS sensor, and photoelectrically converts the optical image formed on the imaging surface into an image signal, which is an electrical signal. The imaging section 962 then outputs the image signal to the signal processing section 963.

信号处理部963对从成像部962输入的图像信号执行各种相机信号处理,诸如拐点校正(knee correction)、伽马校正、颜色校正等。信号处理部963把经过处理的图像数据输出到图像处理部964。The signal processing section 963 performs various camera signal processes such as knee correction, gamma correction, color correction, etc. on the image signal input from the imaging section 962 . The signal processing section 963 outputs the processed image data to the image processing section 964 .

图像处理部964对从信号处理部963输入的图像数据进行编码,并生成编码数据。然后,图像处理部964将生成的编码数据输出到外部接口966或介质驱动器968。图像处理部964还对从外部接口966或介质驱动器968输入的编码数据进行解码,并生成图像数据。图像处理部964然后将生成的图像数据输出显示部965。另外,图像处理部964还可以将从信号处理部963输入的图像数据输出到显示部965,以使得图像被显示。另外,图像处理部964可以将从OSD 969获取的显示数据叠加到将输出到显示部965的图像上。The image processing unit 964 encodes the image data input from the signal processing unit 963 and generates encoded data. The image processing unit 964 then outputs the generated encoded data to the external interface 966 or the media drive 968. The image processing unit 964 also decodes the encoded data input from the external interface 966 or the media drive 968 and generates image data. The image processing unit 964 then outputs the generated image data to the display unit 965. In addition, the image processing unit 964 can also output the image data input from the signal processing unit 963 to the display unit 965 so that the image is displayed. In addition, the image processing unit 964 can superimpose the display data obtained from the OSD 969 on the image to be output to the display unit 965.

OSD 969生成GUI图像,诸如菜单、按钮或光标,并将生成的图像输出到图像处理部964。The OSD 969 generates a GUI image such as a menu, a button, or a cursor, and outputs the generated image to the image processing section 964 .

外部接口966被配置为例如USB输入/输出端子。外部接口966在例如打印图像时将成像装置960连接到打印机。另外,根据需要将驱动器连接到外部接口966。诸如磁盘或光盘的可移动介质被插入到驱动器中,并且可以在成像装置960中安装从该可移动介质读取的程序。另外,外部接口966可被配置网络接口,其将被连接到诸如LAN或互联网的网络。即,外部接口966用作图像拍摄装置960的传输装置。The external interface 966 is configured as, for example, a USB input/output terminal. The external interface 966 connects the imaging device 960 to a printer, for example, when printing an image. Furthermore, a drive is connected to the external interface 966 as needed. Removable media such as a magnetic disk or optical disk is inserted into the drive, and programs read from the removable media can be installed in the imaging device 960. Furthermore, the external interface 966 may be configured as a network interface for connecting to a network such as a LAN or the Internet. In other words, the external interface 966 functions as a transmission device for the image capturing device 960.

将被插入到介质驱动器968中的记录介质可以是任意可读取和可写入的可移动介质,诸如磁盘、磁光盘、光盘、或半导体存储器。另外,可将记录介质永久地安装到介质驱动器968中以构成诸如内部硬盘驱动器或固态驱动器(SSD)的非便携式存储部。The recording medium to be inserted into the media drive 968 may be any removable medium that can read and write, such as a magnetic disk, a magneto-optical disk, an optical disk, or a semiconductor memory. In addition, the recording medium may be permanently installed in the media drive 968 to constitute a non-portable storage unit such as an internal hard disk drive or a solid-state drive (SSD).

控制部970包括诸如CPU的处理器和诸如RAM或ROM的存储器。存储器存储通过CPU执行的程序、程序数据等。当例如启动成像装置960时,通过CPU读取并执行存储器中存储的程序。通过执行程序,CPU根据从例如用户接口971输入的操作信号控制成像装置960的操作。The control unit 970 includes a processor such as a CPU and a memory such as RAM or ROM. The memory stores programs executed by the CPU, program data, and the like. When the imaging device 960 is activated, for example, the CPU reads and executes the program stored in the memory. By executing the program, the CPU controls the operation of the imaging device 960 based on an operation signal input from, for example, the user interface 971.

用户接口971被连接到控制部970。用户接口971包括由用户使用以操作例如成像装置960的按钮、开关等。用户接口971检测经这些结构元件的用户的操作,生成操作信号,并将生成的操作信号输出到控制部970。The user interface 971 is connected to the control section 970. The user interface 971 includes buttons, switches, and the like used by the user to operate, for example, the imaging device 960. The user interface 971 detects the user's operation via these structural elements, generates an operation signal, and outputs the generated operation signal to the control section 970.

在这样配置的成像装置960中,图像处理部964具有根据上述实施例的图像编码装置10和图像解码装置60的功能。因此,可以缓解对于通过成像装置960编码和解码的视频的编码效率的下降,或者改善编码效率。In the imaging device 960 configured in this manner, the image processing section 964 has the functions of the image encoding device 10 and the image decoding device 60 according to the above-described embodiment. Therefore, it is possible to alleviate a decrease in encoding efficiency for video encoded and decoded by the imaging device 960 or improve encoding efficiency.

<8.结论>8. Conclusion

上文利用图1到37描述了根据实施例的图像编码装置10和图像解码装置60。根据实施例,将定义在量化和逆量化图像时使用的量化矩阵的量化矩阵参数插入不同于序列参数集和图片参数集的量化矩阵参数集(QMPS)中。通过这样,不需要既要在更新量化矩阵时编码除量化矩阵参数之外的参数、又要在更新除量化矩阵参数之外的参数时编码量化矩阵参数。从而,减轻了随着更新量化矩阵的编码效率的下降,或者改善了编码效率。具体地说,在具有较大尺寸的量化矩阵的情况中,或者在其中对每个图片定义较多数目的量化矩阵的情况中,通过本说明书公开的技术对代码量的减少变得更有效。The image encoding device 10 and image decoding device 60 according to the embodiment have been described above using Figures 1 to 37. According to the embodiment, quantization matrix parameters defining the quantization matrix used when quantizing and inverse quantizing an image are inserted into a quantization matrix parameter set (QMPS) that is different from the sequence parameter set and the picture parameter set. This eliminates the need to encode parameters other than the quantization matrix parameters when updating the quantization matrix, and vice versa. Consequently, the decrease in coding efficiency associated with updating the quantization matrix is mitigated, or coding efficiency is improved. Specifically, in the case of a quantization matrix with a larger size, or in the case of a large number of quantization matrices defined for each picture, the techniques disclosed in this specification become more effective in reducing the amount of code.

另外,根据本发明实施例,代替直接定义量化矩阵,可以在QMPS中包括指定复制在前生成的量化矩阵的参数。在该情况中,从QMPS省略指定量化矩阵自身的参数(例如DPCM格式的差分数据阵列),从而可以进一步减少用于定义量化矩阵所需的代码量。Furthermore, according to an embodiment of the present invention, instead of directly defining the quantization matrix, the QMPS may include parameters specifying a copy of a previously generated quantization matrix. In this case, the parameters specifying the quantization matrix itself (e.g., a differential data array in DPCM format) are omitted from the QMPS, further reducing the amount of code required to define the quantization matrix.

另外,根据本发明实施例,对每个QMPS分配QMPS ID。然后,在复制模式中,可以将定义复制源量化矩阵的QMPS的QMPS ID指定为源ID。另外,可以将复制源量化矩阵的尺寸和类型指定为复制源尺寸和复制源类型。从而,可以将来自在前生成的多个QMPS中的量化矩阵的任意QMPS中的量化矩阵灵活指定为复制源量化矩阵。还可以复制和再使用不同尺寸或类型的量化矩阵。Furthermore, according to an embodiment of the present invention, a QMPS ID is assigned to each QMPS. In copy mode, the QMPS ID of the QMPS that defines the copy-source quantization matrix can be specified as the source ID. Furthermore, the size and type of the copy-source quantization matrix can be specified as the copy-source size and copy-source type. This allows the quantization matrix in any QMPS from among the quantization matrices in multiple previously generated QMPSs to be flexibly designated as the copy-source quantization matrix. Quantization matrices of different sizes or types can also be copied and reused.

另外,根据本发明实施例,可在QMPS中包括指定将复制的量化矩阵的残差分量的参数。从而,仍可以以低速率新生成与在前生成的量化矩阵不完全相等的量化矩阵。In addition, according to an embodiment of the present invention, a parameter specifying a residual component of the quantization matrix to be copied may be included in the QMPS. Thus, a quantization matrix that is not completely equal to a previously generated quantization matrix may be newly generated at a low rate.

另外,在轴指定模式中,代替扫描量化矩阵的全部元素,在QMPS中可以仅包括在量化矩阵中对应于量化矩阵的三个参考轴或四个角的元素的值。从而,在该情况中,同样可以小代码量定义量化矩阵。In addition, in axis-specific mode, instead of scanning all elements of the quantization matrix, the QMPS can only include the values of the elements corresponding to the three reference axes or four corners of the quantization matrix. Thus, in this case, the quantization matrix can also be defined with a small amount of code.

另外,根据本发明实施例,基于由在片段头中指定的QMPS ID标识的QMPS中的量化矩阵参数设置对于每个片段使用的量化矩阵。从而,由于可以对每个片段灵活地切换量化矩阵,即使在其中图像特征随时间变化的情况中,仍可以使用在每个时刻点的最优量化矩阵以编码或解码视频。In addition, according to an embodiment of the present invention, the quantization matrix used for each slice is set based on the quantization matrix parameters in the QMPS identified by the QMPS ID specified in the slice header. Therefore, since the quantization matrix can be flexibly switched for each slice, even in the case where the image characteristics change over time, the optimal quantization matrix at each time point can be used to encode or decode the video.

注意,本发明描述了这样的实例,其中将量化矩阵参数集多路复用到编码流的头中,并从编码侧传输到解码侧。然而,传输量化矩阵参数集的技术不限于这样的实例。例如,可以将每个参数集中的信息传输或记录为与编码比特流关联的单独的数据,而不被多路复用到编码比特流中。这里,术语“关联”表示可以在解码时将在比特流中包括的图像(还包括诸如片段或块的局部图像)与对应于这些图像的信息联系起来。换句话说,也可以在与图像(或比特流)分开的传输信道上传输信息。另外,可以将信息记录在与图像(或比特流)分开的记录介质(或相同记录介质中的单独的记录区域)中。另外,可以诸如多个帧、单个帧或者帧内的部分的任意单位将信息和图像(或比特流)彼此关联。Note that this disclosure describes an example in which quantization matrix parameter sets are multiplexed into the header of the coded stream and transmitted from the encoding side to the decoding side. However, the technique for transmitting quantization matrix parameter sets is not limited to this example. For example, the information in each parameter set can be transmitted or recorded as separate data associated with the coded bitstream, rather than multiplexed into the coded bitstream. Here, the term "associated" means that the images (including partial images such as slices or blocks) included in the bitstream can be associated with the information corresponding to these images during decoding. In other words, the information can also be transmitted over a transmission channel separate from the images (or bitstream). Alternatively, the information can be recorded in a separate recording medium (or a separate recording area within the same recording medium) from the images (or bitstream). Furthermore, the information and the images (or bitstream) can be associated with each other in arbitrary units, such as multiple frames, a single frame, or a portion within a frame.

上文从而参考附图详细描述了本发明的优选实施例。然而,本发明的技术范围不限于所述实例。对于本发明所述领域的技术人员显而易见地,可以进行各种修改和替换,只要其落入权利要求所述的技术构思的范围中,并且,可以理解,这样的修改或替换显然属于本发明的技术范围。The preferred embodiments of the present invention have been described in detail with reference to the accompanying drawings. However, the technical scope of the present invention is not limited to the examples described. It will be apparent to those skilled in the art that various modifications and substitutions may be made as long as they fall within the scope of the technical concept described in the claims, and it will be understood that such modifications or substitutions clearly fall within the technical scope of the present invention.

另外,还可以如下配置本发明的技术。Additionally, the technology of the present invention can also be configured as follows.

(1)一种图像处理装置,包括:(1) An image processing device comprising:

获取部,被配置为从编码流获取量化矩阵参数,该编码流是在不同于序列参数集和图片参数集的参数集中设置定义量化矩阵的所述量化矩阵参数的编码流;an acquisition unit configured to acquire a quantization matrix parameter from a coded stream, the quantization matrix parameter defining the quantization matrix being set in a parameter set different from a sequence parameter set and a picture parameter set;

设置部,被配置为基于通过所述获取部获取的量化矩阵参数设置当逆量化从编码流解码的数据时使用的量化矩阵;以及a setting section configured to set a quantization matrix used when inverse quantizing data decoded from an encoded stream, based on the quantization matrix parameters acquired by the acquisition section; and

逆量化部,被配置为使用通过所述设置部设置的量化矩阵逆量化从编码流解码的数据。The inverse quantization section is configured to inversely quantize data decoded from the encoded stream using the quantization matrix set by the setting section.

(2)根据(1)所述的图像处理设备,其中(2) The image processing apparatus according to (1), wherein

所述包含量化矩阵参数的参数集是共用参数集,利用该共用参数集还能够设置与除量化矩阵之外的编码工具相关的其它编码参数,以及The parameter set including the quantization matrix parameters is a common parameter set, which can be used to set other coding parameters related to coding tools other than the quantization matrix, and

当在所述共用参数集中设置量化矩阵参数时,所述获取部获取量化矩阵参数。When a quantization matrix parameter is set in the common parameter set, the acquisition section acquires the quantization matrix parameter.

(3)根据(2)所述的图像处理设备,其中(3) The image processing apparatus according to (2), wherein

所述获取部通过参考在所述共用参数集中包括的标记确定在所述共用参数集中是否设置了量化矩阵参数。The acquisition section determines whether a quantization matrix parameter is set in the common parameter set by referring to a flag included in the common parameter set.

(4)根据(2)或(3)所述的图像处理设备,其中(4) The image processing apparatus according to (2) or (3), wherein

所述共用参数集是适应参数集。The common parameter set is an adaptation parameter set.

(5)根据(4)所述的图像处理设备,其中(5) The image processing apparatus according to (4), wherein

在其中在所述第一适应参数集之后解码的第二适应参数集中包括对第一适应参数集的参考的情况中,所述设置部再使用基于从所述第一适应参数集获取的量化矩阵参数而设置的量化矩阵作为对应于所述第二适应参数集的量化矩阵。In a case where a second adaptation parameter set decoded after the first adaptation parameter set includes a reference to the first adaptation parameter set, the setting unit reuses the quantization matrix set based on the quantization matrix parameters obtained from the first adaptation parameter set as the quantization matrix corresponding to the second adaptation parameter set.

(6)根据(5)所述的图像处理设备,其中(6) The image processing apparatus according to (5), wherein

在其中在第三适应参数集中包括对第三适应参数集的参考的情况中,所述设置部设置默认量化矩阵作为与所述第三适应参数集对应的量化矩阵。In a case where a reference to a third adaptation parameter set is included in a third adaptation parameter set, the setting section sets a default quantization matrix as a quantization matrix corresponding to the third adaptation parameter set.

(7)根据(1)所述的图像处理设备,其中(7) The image processing apparatus according to (1), wherein

在其中在第二参数集中包括用于指示对第一参数集的第一量化矩阵进行复制的复制参数的情况中,所述设置部通过复制所述第一量化矩阵设置第二量化矩阵。In a case where a copy parameter for instructing to copy a first quantization matrix of the first parameter set is included in the second parameter set, the setting section sets the second quantization matrix by copying the first quantization matrix.

(8)根据(7)所述的图像处理设备,其中(8) The image processing apparatus according to (7), wherein

每个包括所述量化矩阵参数的参数集具有用于标识每个参数集的标识符,以及Each parameter set including the quantization matrix parameters has an identifier for identifying each parameter set, and

所述复制参数包括复制源的参数集的标识符。The copy parameters include an identifier of a parameter set of a copy source.

(9)根据(8)所述的图像处理设备,其中(9) The image processing apparatus according to (8), wherein

每个参数集包括分别定义多种类别的量化矩阵的量化矩阵参数,以及Each parameter set includes quantization matrix parameters that define quantization matrices of multiple categories, and

所述复制参数包括指定所述第一量化矩阵的类别的类别参数。The copy parameters include a category parameter specifying a category of the first quantization matrix.

(10)根据(8)所述的图像处理设备,其中(10) The image processing apparatus according to (8), wherein

在第三参数集中包括的复制源的参数集的标识符等于第三参数集的标识符的情况中,所述设置部设置默认量化矩阵作为用于第三参数集的第三量化矩阵。The setting section sets a default quantization matrix as a third quantization matrix for the third parameter set in a case where an identifier of a parameter set of a copy source included in the third parameter set is equal to an identifier of the third parameter set.

(11)根据(7)所述的图像处理设备,其中(11) The image processing apparatus according to (7), wherein

在第二量化矩阵的尺寸大于第一量化矩阵的尺寸的情况中,所述设置部通过对复制的第一量化矩阵的元素进行插值来设置第二量化矩阵。In a case where a size of the second quantization matrix is larger than a size of the first quantization matrix, the setting section sets the second quantization matrix by interpolating elements of the copied first quantization matrix.

(12)根据(7)所述的图像处理设备,其中(12) The image processing apparatus according to (7), wherein

在第二量化矩阵的尺寸小于第一量化矩阵的尺寸的情况中,所述设置部通过减少复制的第一量化矩阵的元素来设置第二量化矩阵。In a case where a size of the second quantization matrix is smaller than a size of the first quantization matrix, the setting section sets the second quantization matrix by reducing elements of the copied first quantization matrix.

(13)根据(7)所述的图像处理设备,其中(13) The image processing apparatus according to (7), wherein

在第二参数集中包括指定所复制的量化矩阵的残差分量的残差指定参数的情况中,所述设置部通过将所述残差分量增加到所复制的第一量化矩阵而设置第二量化矩阵。In a case where a residual designation parameter designating a residual component of the copied quantization matrix is included in the second parameter set, the setting section sets the second quantization matrix by adding the residual component to the copied first quantization matrix.

(14)根据(1)所述的图像处理设备,其中(14) The image processing apparatus according to (1), wherein

每个包括所述量化矩阵参数的参数集具有用于标识每个参数集的参数集标识符,以及Each parameter set including the quantization matrix parameters has a parameter set identifier for identifying each parameter set, and

所述逆量化部对于每个片段使用通过设置部基于在由在片段头中指定的参数集标识符标识的参数集中包括的量化矩阵参数设置的量化矩阵。The inverse quantization section uses, for each slice, a quantization matrix set by the setting section based on quantization matrix parameters included in a parameter set identified by a parameter set identifier specified in a slice header.

(15)根据(7)至(14)中任一项所述的图像处理装置,其中(15) The image processing device according to any one of (7) to (14), wherein

包含量化矩阵参数的参数集还包括与除量化矩阵之外的编码工具相关的其它编码参数。The parameter set containing the quantization matrix parameters may also include other coding parameters related to coding tools other than the quantization matrix.

(16)根据(15)所述的图像处理设备,其中(16) The image processing apparatus according to (15), wherein

所述量化矩阵参数和所述其它编码参数通过与标识每个参数集的参数标识符分开定义的辅助标识符被分组,以及The quantization matrix parameters and the other encoding parameters are grouped by an auxiliary identifier defined separately from a parameter identifier identifying each parameter set, and

所述获取部使用辅助标识符获取量化矩阵参数。The acquisition section acquires a quantization matrix parameter using the auxiliary identifier.

(17)根据(16)所述的图像处理设备,其中(17) The image processing apparatus according to (16), wherein

在参数集或另一参数集中定义与多个辅助标识符的组合关联的组合标识符,以及defining a combined identifier associated with a combination of multiple auxiliary identifiers in a parameter set or in another parameter set, and

所述获取部获取在每个片段的片段头中指定的组合标识符,并使用与所获取的组合标识符关联的辅助标识符获取量化矩阵参数。The acquisition section acquires a combination identifier specified in a slice header of each slice, and acquires a quantization matrix parameter using an auxiliary identifier associated with the acquired combination identifier.

(18)一种图像处理方法,包括:(18) An image processing method comprising:

从编码流获取量化矩阵参数,所述编码流是在不同于序列参数集和图片参数集的参数集中设置定义量化矩阵的所述量化矩阵参数的编码流;Acquire a quantization matrix parameter from a coded stream, wherein the quantization matrix parameter defining the quantization matrix is set in a parameter set different from a sequence parameter set and a picture parameter set;

基于获取的量化矩阵参数设置当逆量化从编码流解码的数据时使用的量化矩阵;以及setting a quantization matrix used when inverse quantizing data decoded from an encoded stream based on the acquired quantization matrix parameter; and

使用所设置的量化矩阵逆量化从所述编码流解码的数据。The data decoded from the encoded stream is inversely quantized using the set quantization matrix.

(19)一种图像处理装置,包括:(19) An image processing device comprising:

量化部,被配置为使用量化矩阵量化数据;a quantization unit configured to quantize data using a quantization matrix;

设置部,被配置为设置量化矩阵参数,该量化矩阵参数定义在所述量化部量化所述数据时使用的量化矩阵;以及a setting section configured to set a quantization matrix parameter that defines a quantization matrix used when the quantization section quantizes the data; and

编码部,被配置为在不同于序列参数集和图片参数集的参数集中编码通过设置部设置的量化矩阵参数。The encoding unit is configured to encode the quantization matrix parameters set by the setting unit in a parameter set different from the sequence parameter set and the picture parameter set.

(20)一种图像处理方法,包括:(20) An image processing method comprising:

使用量化矩阵量化数据;quantize the data using a quantization matrix;

设置量化矩阵参数,该量化矩阵参数定义在量化所述数据时使用的量化矩阵;以及Setting a quantization matrix parameter, the quantization matrix parameter defining a quantization matrix to be used when quantizing the data; and

在不同于序列参数集和图片参数集的参数集中编码所设置的量化矩阵参数。The set quantization matrix parameters are encoded in a parameter set different from the sequence parameter set and the picture parameter set.

附图标记列表Reference Signs List

10 图像处理装置(图像编码装置)10 Image Processing Device (Image Coding Device)

16 量化部16 Quantitative Department

120 参数生成部120 Parameter Generation Unit

130 插入部130 Insertion

60 图像处理装置(图像解码装置)60 Image Processing Device (Image Decoding Device)

63 逆量化部63 Inverse Quantization Unit

160 参数获取部160 Parameter Acquisition Unit

170 设置部170 Settings

Claims (36)

1.一种图像处理设备,包括:1. An image processing apparatus, comprising: 至少一个处理器;At least one processor; 包括计算机程序代码的至少一个存储器,存储器和计算机程序代码被配置为在与处理器一起工作的情况下引起所述设备执行至少如下步骤:At least one memory including computer program code, the memory and computer program code being configured to cause the device to perform at least the following steps when working in conjunction with a processor: 设置第一量化矩阵和第二量化矩阵作为用于对第一图像的第一变换系数数据进行量化的量化矩阵;The first quantization matrix and the second quantization matrix are set as quantization matrices for quantizing the first transform coefficient data of the first image; 通过将所述第一量化矩阵复制到所述第二量化矩阵来设置所述第二量化矩阵;The second quantization matrix is set by copying the first quantization matrix to the second quantization matrix; 使用所述第二量化矩阵量化第一变换系数数据以生成第一被量化的变换系数数据;以及The first transform coefficient data is quantized using the second quantization matrix to generate the first quantized transform coefficient data; and 将第一被量化的变换系数数据编码成第一编码流;Encode the first quantized transform coefficient data into a first encoded stream; 设置第三量化矩阵;Set a third quantization matrix; 从第二编码流解码第二被量化的变换系数数据;Decode the second quantized transform coefficient data from the second encoded stream; 通过将所述第三量化矩阵复制到第四量化矩阵来设置所述第四量化矩阵;以及The fourth quantization matrix is set by copying the third quantization matrix to the fourth quantization matrix; and 使用所述第四量化矩阵对从第二编码流解码的第二被量化的变换系数数据进行逆量化以生成被逆量化的变换系数数据。The fourth quantization matrix is used to inverse quantize the second quantized transform coefficient data decoded from the second encoded stream to generate inverse quantized transform coefficient data. 2.根据权利要求1所述的设备,其中所述第一量化矩阵对应于第一标识信息的值,并且所述第二量化矩阵对应于第二标识信息的值。2. The device according to claim 1, wherein the first quantization matrix corresponds to the value of the first identification information, and the second quantization matrix corresponds to the value of the second identification information. 3.根据权利要求2所述的设备,其中在其中将所述第一量化矩阵复制到所述第二量化矩阵的情况下,存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备生成与第二标识信息的值不同的第一标识信息的值。3. The device of claim 2, wherein when the first quantization matrix is copied to the second quantization matrix, the memory includes computer program code configured to cause the device to generate a value of first identification information that is different from the value of the second identification information when working with the processor. 4.根据权利要求3所述的设备,其中存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备确定第一标识信息的值与第二标识信息的值不同,其中基于第一标识信息的值与第二标识信息的值不同的确定而通过复制所述第一量化矩阵来设置所述第二量化矩阵。4. The device of claim 3, wherein the memory includes computer program code configured to cause the device, when working with the processor, to determine that the value of the first identification information is different from the value of the second identification information, wherein the second quantization matrix is set by copying the first quantization matrix based on the determination that the value of the first identification information is different from the value of the second identification information. 5.根据权利要求1所述的设备,其中存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备从第二编码流解码第三标识信息,第三标识信息识别是否将所述第三量化矩阵复制到所述第四量化矩阵。5. The device of claim 1, wherein the memory includes computer program code configured to cause the device to decode third identification information from a second encoded stream when working with a processor, the third identification information identifying whether the third quantization matrix is copied to the fourth quantization matrix. 6.根据权利要求5所述的设备,其中存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备确定第三标识信息识别到将所述第三量化矩阵复制到所述第四量化矩阵,其中基于第三标识信息识别到将所述第三量化矩阵复制到所述第四量化矩阵的确定而通过将所述第三量化矩阵复制到所述第四量化矩阵来设置所述第四量化矩阵。6. The device of claim 5, wherein the memory includes computer program code configured to: cause the device, when working with a processor, to determine that third identification information identifies that the third quantization matrix is copied to the fourth quantization matrix, wherein the fourth quantization matrix is set by copying the third quantization matrix to the fourth quantization matrix based on the determination that the third identification information identifies that the third quantization matrix is copied to the fourth quantization matrix. 7.根据权利要求1所述的设备,其中存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备将第二默认量化矩阵设置为所述第四量化矩阵。7. The device of claim 1, wherein the memory includes computer program code configured to cause the device to set the second default quantization matrix to the fourth quantization matrix when working with the processor. 8.根据权利要求1所述的设备,其中所述第一量化矩阵、所述第二量化矩阵、所述第三量化矩阵和所述第四量化矩阵是基于预测模式和颜色分量的组合而设置的。8. The device according to claim 1, wherein the first quantization matrix, the second quantization matrix, the third quantization matrix and the fourth quantization matrix are configured based on a combination of prediction modes and color components. 9.根据权利要求8所述的设备,其中预测模式和颜色分量的组合包括如下的组合:9. The device of claim 8, wherein the combination of prediction mode and color components comprises the following combinations: 帧内预测和辉度分量(Y),帧内预测和第一颜色差分量(Cb),帧内预测和第二颜色差分量(Cr),帧间预测和辉度分量(Y),帧间预测和第一颜色差分量(Cb),以及帧间预测和第二颜色差分量(Cr)。Intra-frame prediction and luminance component (Y), intra-frame prediction and first color difference component (Cb), intra-frame prediction and second color difference component (Cr), inter-frame prediction and luminance component (Y), inter-frame prediction and first color difference component (Cb), and inter-frame prediction and second color difference component (Cr). 10.根据权利要求1所述的设备,其中所述第一量化矩阵和所述第二量化矩阵是针对不同大小的第一变换系数数据生成的,并且所述第三量化矩阵和所述第四量化矩阵是针对不同大小的第二被量化的变换系数数据生成的。10. The apparatus of claim 1, wherein the first quantization matrix and the second quantization matrix are generated for first transform coefficient data of different sizes, and the third quantization matrix and the fourth quantization matrix are generated for second quantized transform coefficient data of different sizes. 11.根据权利要求1所述的设备,其中存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备执行:11. The device of claim 1, wherein the memory includes computer program code configured to cause the device to execute when working in conjunction with a processor: 用以生成第一变换系数数据的对图像数据的正交变换;以及An orthogonal transformation of the image data used to generate the first transform coefficient data; and 对被逆量化的变换系数数据的逆正交变换。Inverse orthogonal transform of the inverse quantized transform coefficient data. 12.根据权利要求1所述的设备,其中存储器包括被配置为如下的计算机程序代码:在与处理器一起工作的情况下引起所述设备:12. The device of claim 1, wherein the memory includes computer program code configured to cause the device to: when working in conjunction with the processor: 以从第一最大编码单位分出的第一编码单位对第一被量化的变换系数数据进行编码,以及以从第二最大编码单位分出的第二编码单位对第二编码流进行解码。The first quantized transform coefficient data is encoded using a first coding unit derived from the first maximum coding unit, and the second coded stream is decoded using a second coding unit derived from the second maximum coding unit. 13.一种图像处理方法,包括:13. An image processing method, comprising: 设置第一量化矩阵和第二量化矩阵作为用于对第一图像的第一变换系数数据进行量化的量化矩阵;The first quantization matrix and the second quantization matrix are set as quantization matrices for quantizing the first transform coefficient data of the first image; 通过将所述第一量化矩阵复制到所述第二量化矩阵来设置所述第二量化矩阵;The second quantization matrix is set by copying the first quantization matrix to the second quantization matrix; 使用所述第二量化矩阵量化第一变换系数数据以生成第一被量化的变换系数数据;以及The first transform coefficient data is quantized using the second quantization matrix to generate the first quantized transform coefficient data; and 将第一被量化的变换系数数据编码成第一编码流;Encode the first quantized transform coefficient data into a first encoded stream; 设置第三量化矩阵;Set a third quantization matrix; 从第二编码流解码第二被量化的变换系数数据;Decode the second quantized transform coefficient data from the second encoded stream; 通过将所述第三量化矩阵复制到第四量化矩阵来设置所述第四量化矩阵;以及The fourth quantization matrix is set by copying the third quantization matrix to the fourth quantization matrix; and 使用所述第四量化矩阵对从第二编码流解码的第二被量化的变换系数数据进行逆量化以生成被逆量化的变换系数数据。The fourth quantization matrix is used to inverse quantize the second quantized transform coefficient data decoded from the second encoded stream to generate inverse quantized transform coefficient data. 14.根据权利要求13所述的方法,其中所述第一量化矩阵对应于第一标识信息的值,并且所述第二量化矩阵对应于第二标识信息的值。14. The method of claim 13, wherein the first quantization matrix corresponds to the value of the first identification information, and the second quantization matrix corresponds to the value of the second identification information. 15.根据权利要求13所述的方法,进一步包括在其中将所述第一量化矩阵复制到所述第二量化矩阵的情况下,生成与第二标识信息的值不同的第一标识信息的值。15. The method of claim 13, further comprising, in the case of copying the first quantization matrix to the second quantization matrix, generating a value of first identification information that is different from the value of the second identification information. 16.根据权利要求15所述的方法,进一步包括确定第一标识信息的值与第二标识信息的值不同,其中基于第一标识信息的值与第二标识信息的值不同的确定而通过复制所述第一量化矩阵来设置所述第二量化矩阵。16. The method of claim 15, further comprising determining that the value of the first identification information is different from the value of the second identification information, wherein the second quantization matrix is set by copying the first quantization matrix based on the determination that the value of the first identification information is different from the value of the second identification information. 17.根据权利要求13所述的方法,进一步包括从第二编码流解码第三标识信息,第三标识信息识别是否将所述第三量化矩阵复制到所述第四量化矩阵。17. The method of claim 13, further comprising decoding third identification information from a second encoded stream, wherein the third identification information identifies whether the third quantization matrix is copied to the fourth quantization matrix. 18.根据权利要求17所述的方法,进一步包括确定第三标识信息识别到将所述第三量化矩阵复制到所述第四量化矩阵,其中基于第三标识信息识别到将所述第三量化矩阵复制到所述第四量化矩阵的确定而通过将所述第三量化矩阵复制到所述第四量化矩阵来设置所述第四量化矩阵。18. The method of claim 17, further comprising determining that third identification information identifies that the third quantization matrix is copied to the fourth quantization matrix, wherein the fourth quantization matrix is set by copying the third quantization matrix to the fourth quantization matrix based on the determination that the third identification information identifies that the third quantization matrix is copied to the fourth quantization matrix. 19.根据权利要求18所述的方法,进一步包括将第二默认量化矩阵设置为所述第四量化矩阵。19. The method of claim 18, further comprising setting the second default quantization matrix as the fourth quantization matrix. 20.根据权利要求19所述的方法,其中所述第一量化矩阵、所述第二量化矩阵、所述第三量化矩阵和所述第四量化矩阵是基于预测模式和颜色分量的组合而设置的。20. The method of claim 19, wherein the first quantization matrix, the second quantization matrix, the third quantization matrix, and the fourth quantization matrix are configured based on a combination of prediction modes and color components. 21.根据权利要求20所述的方法,其中预测模式和颜色分量的组合包括如下的组合:21. The method of claim 20, wherein the combination of prediction mode and color components comprises the following combinations: 帧内预测和辉度分量(Y),帧内预测和第一颜色差分量(Cb),帧内预测和第二颜色差分量(Cr),帧间预测和辉度分量(Y),帧间预测和第一颜色差分量(Cb),以及帧间预测和第二颜色差分量(Cr)。Intra-frame prediction and luminance component (Y), intra-frame prediction and first color difference component (Cb), intra-frame prediction and second color difference component (Cr), inter-frame prediction and luminance component (Y), inter-frame prediction and first color difference component (Cb), and inter-frame prediction and second color difference component (Cr). 22.根据权利要求13所述的方法,其中所述第一量化矩阵和所述第二量化矩阵是针对不同大小的第一变换系数数据生成的,并且所述第三量化矩阵和所述第四量化矩阵是针对不同大小的第二被量化的变换系数数据生成的。22. The method of claim 13, wherein the first quantization matrix and the second quantization matrix are generated for first transform coefficient data of different sizes, and the third quantization matrix and the fourth quantization matrix are generated for second quantized transform coefficient data of different sizes. 23.根据权利要求13所述的方法,进一步包括:23. The method of claim 13, further comprising: 对图像数据执行正交变换以生成第一变换系数数据;以及Perform an orthogonal transformation on the image data to generate first transform coefficient data; and 对被逆量化的变换系数数据执行逆正交变换。Perform an inverse orthogonal transform on the dequantized transform coefficients. 24.根据权利要求13所述的方法,进一步包括:24. The method of claim 13, further comprising: 以从第一最大编码单位分出的第一编码单位对第一被量化的变换系数数据进行编码,The first quantized transform coefficient data is encoded using a first coding unit derived from the first largest coding unit. 以从第二最大编码单位分出的第二编码单位对第二编码流进行解码。The second encoded stream is decoded using the second encoded unit derived from the second largest encoded unit. 25.一种存储有计算机程序的计算机可读介质,该程序在由处理器执行时,执行:25. A computer-readable medium storing a computer program, which, when executed by a processor, performs: 设置第一量化矩阵和第二量化矩阵作为用于对第一图像的第一变换系数数据进行量化的量化矩阵;The first quantization matrix and the second quantization matrix are set as quantization matrices for quantizing the first transform coefficient data of the first image; 通过将所述第一量化矩阵复制到所述第二量化矩阵来设置所述第二量化矩阵;The second quantization matrix is set by copying the first quantization matrix to the second quantization matrix; 使用所述第二量化矩阵量化第一变换系数数据以生成第一被量化的变换系数数据;以及The first transform coefficient data is quantized using the second quantization matrix to generate the first quantized transform coefficient data; and 将第一被量化的变换系数数据编码成第一编码流;Encode the first quantized transform coefficient data into a first encoded stream; 设置第三量化矩阵;Set a third quantization matrix; 从第二编码流解码第二被量化的变换系数数据;Decode the second quantized transform coefficient data from the second encoded stream; 通过将所述第三量化矩阵复制到第四量化矩阵来设置所述第四量化矩阵;以及The fourth quantization matrix is set by copying the third quantization matrix to the fourth quantization matrix; and 使用所述第四量化矩阵对从第二编码流解码的第二被量化的变换系数数据进行逆量化以生成被逆量化的变换系数数据。The fourth quantization matrix is used to inverse quantize the second quantized transform coefficient data decoded from the second encoded stream to generate inverse quantized transform coefficient data. 26.根据权利要求25所述的介质,其中所述第一量化矩阵对应于第一标识信息的值,并且所述第二量化矩阵对应于第二标识信息的值。26. The medium of claim 25, wherein the first quantization matrix corresponds to the value of the first identification information, and the second quantization matrix corresponds to the value of the second identification information. 27.根据权利要求26所述的介质,进一步存储有如下的程序:该程序在其中将所述第一量化矩阵复制到所述第二量化矩阵的情况下由处理器执行时,生成与第二标识信息的值不同的第一标识信息的值。27. The medium according to claim 26, further storing a program in which, when executed by a processor in the case of copying the first quantization matrix to the second quantization matrix, a value of first identification information different from the value of the second identification information is generated. 28.根据权利要求27所述的介质,进一步存储有如下的程序:该程序在由处理器执行时,确定第一标识信息的值与第二标识信息的值不同,其中基于第一标识信息的值与第二标识信息的值不同的确定而通过复制所述第一量化矩阵来设置所述第二量化矩阵。28. The medium according to claim 27, further storing a program that, when executed by a processor, determines that the value of the first identification information is different from the value of the second identification information, wherein the second quantization matrix is set by copying the first quantization matrix based on the determination that the values of the first identification information and the second identification information are different. 29.根据权利要求25所述的介质,进一步存储有如下的程序:该程序在由处理器执行时,从第二编码流解码第三标识信息,第三标识信息识别是否将所述第三量化矩阵复制到所述第四量化矩阵。29. The medium according to claim 25 further stores a program that, when executed by a processor, decodes third identification information from a second encoded stream, the third identification information identifying whether the third quantization matrix is copied to the fourth quantization matrix. 30.根据权利要求29所述的介质,进一步存储有如下的程序:该程序在由处理器执行时,确定第三标识信息识别到将所述第三量化矩阵复制到所述第四量化矩阵,其中基于第三标识信息识别到将所述第三量化矩阵复制到所述第四量化矩阵的确定而通过将所述第三量化矩阵复制到所述第四量化矩阵来设置所述第四量化矩阵。30. The medium according to claim 29, further storing a program that, when executed by a processor, determines that third identification information identifies that the third quantization matrix is copied to the fourth quantization matrix, wherein the fourth quantization matrix is set by copying the third quantization matrix to the fourth quantization matrix based on the determination that the third identification information identifies that the third quantization matrix is copied to the fourth quantization matrix. 31.根据权利要求25所述的介质,进一步存储有如下的程序:该程序在由处理器执行时,将第二默认量化矩阵设置为所述第四量化矩阵。31. The medium according to claim 25 further stores a program that, when executed by a processor, sets the second default quantization matrix to the fourth quantization matrix. 32.根据权利要求25所述的介质,其中所述第一量化矩阵、所述第二量化矩阵、所述第三量化矩阵和所述第四量化矩阵是基于预测模式和颜色分量的组合而设置的。32. The medium of claim 25, wherein the first quantization matrix, the second quantization matrix, the third quantization matrix, and the fourth quantization matrix are configured based on a combination of prediction modes and color components. 33.根据权利要求32所述的介质,其中预测模式和颜色分量的组合包括如下的组合:33. The medium of claim 32, wherein the combination of prediction mode and color components comprises the following combinations: 帧内预测和辉度分量(Y),帧内预测和第一颜色差分量(Cb),帧内预测和第二颜色差分量(Cr),帧间预测和辉度分量(Y),帧间预测和第一颜色差分量(Cb),以及帧间预测和第二颜色差分量(Cr)。Intra-frame prediction and luminance component (Y), intra-frame prediction and first color difference component (Cb), intra-frame prediction and second color difference component (Cr), inter-frame prediction and luminance component (Y), inter-frame prediction and first color difference component (Cb), and inter-frame prediction and second color difference component (Cr). 34.根据权利要求25所述的介质,其中所述第一量化矩阵和所述第二量化矩阵是针对不同大小的第一变换系数数据生成的,并且所述第三量化矩阵和所述第四量化矩阵是针对不同大小的第二被量化的变换系数数据生成的。34. The medium of claim 25, wherein the first quantization matrix and the second quantization matrix are generated for first transform coefficient data of different sizes, and the third quantization matrix and the fourth quantization matrix are generated for second quantized transform coefficient data of different sizes. 35.根据权利要求25所述的介质,进一步存储有如下的程序:该程序在由处理器执行时,执行:35. The medium according to claim 25, further storing a program that, when executed by a processor, performs: 用以生成第一变换系数数据的对图像数据的正交变换;以及An orthogonal transformation of the image data used to generate the first transform coefficient data; and 对被逆量化的变换系数数据的逆正交变换。Inverse orthogonal transform of the inverse quantized transform coefficient data. 36.根据权利要求25所述的介质,进一步存储有如下的程序:该程序在由处理器执行时,执行:36. The medium according to claim 25, further storing a program that, when executed by a processor, performs: 以从第一最大编码单位分出的第一编码单位对第一被量化的变换系数数据进行编码;以及Encode the first quantized transform coefficient data using a first coding unit derived from the first largest coding unit; and 以从第二最大编码单位分出的第二编码单位对第二编码流进行解码。The second encoded stream is decoded using the second encoded unit derived from the second largest encoded unit.
HK18109496.0A 2011-02-10 2018-07-23 Image processing device and image processing method HK1250109B (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
JP2011027896 2011-02-10
JP2011-027896 2011-02-10
JP2011047655 2011-03-04
JP2011-047655 2011-03-04
JP2011-187179 2011-08-30
JP2011187179 2011-08-30

Publications (2)

Publication Number Publication Date
HK1250109A1 HK1250109A1 (en) 2018-11-23
HK1250109B true HK1250109B (en) 2021-02-19

Family

ID=

Similar Documents

Publication Publication Date Title
US11825089B2 (en) Image processing device and image processing method
AU2018206825B2 (en) Image processing device and image processing method
AU2015202011B2 (en) Image Processing Device and Image Processing Method
HK1250109B (en) Image processing device and image processing method
HK1243252B (en) Image processing device and image processing method
HK1243252A1 (en) Image processing device and image processing method
HK1193913B (en) Image processing device and image processing method
HK1193913A (en) Image processing device and image processing method