HK1224114B - Image encoding device and method, image decoding device and method - Google Patents
Image encoding device and method, image decoding device and method Download PDFInfo
- Publication number
- HK1224114B HK1224114B HK16112120.0A HK16112120A HK1224114B HK 1224114 B HK1224114 B HK 1224114B HK 16112120 A HK16112120 A HK 16112120A HK 1224114 B HK1224114 B HK 1224114B
- Authority
- HK
- Hong Kong
- Prior art keywords
- prediction
- unit
- intra
- block
- image
- Prior art date
Links
Description
本申请是申请号为201280005061.6(PCT/JP2012/000061)、申请日为2012年1月6日(递交日为2013年7月11日)、发明名称为“运动图像编码装置、运动图像译码装置、运动图像编码方法以及运动图像译码方法”的发明专利申请的分案申请。This application is a divisional application of the invention patent application with application number 201280005061.6 (PCT/JP2012/000061), application date January 6, 2012 (submission date July 11, 2013), and invention name "Motion image encoding device, motion image decoding device, motion image encoding method and motion image decoding method".
技术领域Technical Field
本发明是关于以高效率来进行图像编码的图像编码装置以及图像编码方法、以及以高效率来对被编码的图像进行译码的图像译码装置以及图像译码方法的技术。The present invention relates to a technology of an image coding device and an image coding method for efficiently coding an image, and an image decoding device and an image decoding method for efficiently decoding a coded image.
背景技术Background Art
例如,在MPEG(Moving Picture Experts Group:运动图像专家组)、“ITU-TH.26x”等的国际标准视频编码方式中,使得将输入视频帧分割成矩形的块(编码对象块),对该编码对象块通过实施使用已编码的图像信号的预测处理来生成预测图像,通过对作为该编码对象块和预测图像的差分的预测误差信号以块为单位进行正交变换、量化处理,来进行信息压缩。For example, in international standard video coding methods such as MPEG (Moving Picture Experts Group) and "ITU-TH.26x", the input video frame is divided into rectangular blocks (coding target blocks), and a prediction image is generated by performing prediction processing on the coding target blocks using the encoded image signal. Information compression is performed by performing orthogonal transformation and quantization processing on the prediction error signal, which is the difference between the coding target block and the predicted image, in units of blocks.
例如,在作为国际标准方式的MPEG-4 AVC/H.264(ISO/IEC 14496-10│ITU-TH.264)中,进行来自已编码的附近像素的帧内预测处理或者在邻近帧间的运动补偿预测处理(例如,参照非专利文献1)。For example, in MPEG-4 AVC/H.264 (ISO/IEC 14496-10│ITU-T H.264), which is an international standard, intra-frame prediction processing from encoded nearby pixels or motion compensation prediction processing between adjacent frames is performed (for example, see Non-Patent Document 1).
在MPEG-4 AVC/H.264中,在亮度的帧内预测模式中,可以以块为单位从多个预测模式中选择一个预测模式。In MPEG-4 AVC/H.264, in the intra prediction mode of luma, one prediction mode can be selected from a plurality of prediction modes in units of blocks.
图10是表示亮度的块尺寸是4×4像素的情况下的帧内预测模式的说明图。FIG. 10 is an explanatory diagram showing intra prediction modes when the luma block size is 4×4 pixels.
在图10中,白色圈是编码对象的块内的像素。黑色圈是用于预测的像素,是已编码的相邻块内的像素。In FIG10 , white circles represent pixels in the block to be coded, and black circles represent pixels used for prediction, which are pixels in already coded adjacent blocks.
在图10中,作为帧内预测模式,准备模式0~模式8共9个模式,而模式2是进行平均值预测的模式,利用上边和左边块的相邻像素的平均值来预测编码对象块内的像素。In FIG10 , nine intra-frame prediction modes, Mode 0 to Mode 8, are prepared, and Mode 2 is a mode for performing average value prediction, which predicts pixels within the encoding target block using the average value of adjacent pixels of the upper and left blocks.
模式2以外的模式是进行方向性预测的模式。模式0是垂直方向预测,通过在垂直方向重复上边块的相邻像素而生成预测图像。例如,在是竖条纹花样时选择模式0。Modes other than Mode 2 perform directional prediction. Mode 0 is vertical prediction, generating a predicted image by vertically repeating adjacent pixels in the upper block. For example, when the image is a vertical stripe pattern, select Mode 0.
模式1是水平方向预测,通过在水平方向重复左边块的相邻像素而生成预测图像。例如,在是横条纹花样时选择模式1。Mode 1 is horizontal prediction, which generates a predicted image by repeating adjacent pixels of the left block in the horizontal direction. For example, when the image is a horizontal stripe pattern, select Mode 1.
模式3~模式8使用上边或者左边块的相邻像素,在规定的方向(箭头表示的方向)生成插值像素由此生成预测图像。Modes 3 to 8 generate a predicted image by generating interpolated pixels in a predetermined direction (direction indicated by an arrow) using adjacent pixels of the upper or left block.
应用帧内预测的亮度的块尺寸可以从4×4像素、8×8像素、16×16像素中选择,当是8×8像素的情况下,和4×4像素的情况一样规定9个帧内预测模式。The block size of luminance to which intra prediction is applied can be selected from 4×4 pixels, 8×8 pixels, and 16×16 pixels. In the case of 8×8 pixels, 9 intra prediction modes are specified as in the case of 4×4 pixels.
当是16×16像素的情况下,规定4个帧内预测模式(平均值预测、垂直方向预测、水平方向预测、平面预测)。In the case of 16×16 pixels, four intra prediction modes (average prediction, vertical prediction, horizontal prediction, and planar prediction) are defined.
平面预测是将在斜方向对上边块的相邻像素和左边块的相邻像素进行内插插值而生成的像素作为预测值的模式。Planar prediction is a mode in which pixels generated by interpolating adjacent pixels of an upper block and adjacent pixels of a left block in a diagonal direction are used as prediction values.
进行方向性预测的帧内预测模式因为在例如45度等由模式预先确定的方向生成预测值,所以在块内的对象的边界(边缘)的方向与预测模式所示的方向一致的情况下,预测效率变高而能够减少编码量。The intra-frame prediction mode that performs directional prediction generates prediction values in a direction predetermined by the mode, such as 45 degrees. Therefore, when the direction of the boundary (edge) of the object in the block is consistent with the direction indicated by the prediction mode, the prediction efficiency becomes high and the coding amount can be reduced.
但是,有时仅仅是边缘的方向和预测模式所示的方向之间产生稍许偏差、或者即使方向一致但编码对象块内的边缘稍许变形(摇晃、弯曲等),就会局部性地发生大的预测误差,预测效率急剧下降。However, sometimes there is only a slight deviation between the direction of the edge and the direction indicated by the prediction mode, or even if the direction is the same but the edge in the encoding object block is slightly deformed (shaking, bending, etc.), a large local prediction error will occur, and the prediction efficiency will drop sharply.
为了防止这种预测效率的下降,在8×8像素的方向性预测中,通过使用对已编码的相邻像素实施平滑处理的方式进行预测处理,生成被平滑了的预测图像,使得减少在产生预测方向的稍许偏差、在边缘产生稍许变形的情况下所发生的预测误差。In order to prevent this decline in prediction efficiency, in the directional prediction of 8×8 pixels, prediction processing is performed by smoothing the encoded adjacent pixels to generate a smoothed prediction image, thereby reducing the prediction error caused by slight deviations in the prediction direction and slight deformations at the edges.
非专利文献1:MPEG-4 AVC(ISO/IEC 14496-10)/ITU-TH.264标准Non-Patent Document 1: MPEG-4 AVC (ISO/IEC 14496-10)/ITU-TH.264 Standard
因为以往的图像编码装置如以上那样被构成,所以如果生成被平滑了的预测图像,则即使发生预测方向的稍许偏差或在边缘产生稍许变形,也能够减少所发生的预测误差。但是,在非专利文献1中,在8×8像素的块之外不实施平滑处理,在8×8像素的块中也只进行一般的平滑处理。Since conventional image encoding devices are configured as described above, generating a smoothed prediction image can reduce the resulting prediction error even if there is a slight deviation in the prediction direction or slight distortion at the edge. However, in Non-Patent Document 1, smoothing is not performed on blocks other than 8×8 pixel blocks, and only general smoothing is performed on 8×8 pixel blocks.
实际上,即使在8×8像素以外的尺寸的块中也同样地存在以下问题:即、即使预测图像和编码对象图像的图样相似,由于边缘的微小的不匹配也局部性地发生大的预测误差,产生预测效率的大幅度下降。另外,还存在如下问题:即使在同一尺寸的块中,如果在对预测误差信号进行量化时使用的量化参数、块内像素的位置、预测模式等不同,则适用于减少局部性的预测误差的处理也不同,但只准备有一般的平滑处理而无法充分减少预测误差。In practice, the same problem exists for blocks of sizes other than 8×8 pixels: even if the patterns of the predicted image and the target image are similar, small mismatches at the edges can cause large localized prediction errors, significantly reducing prediction efficiency. Furthermore, even within blocks of the same size, different factors, such as the quantization parameter used to quantize the prediction error signal, the position of pixels within the block, and the prediction mode, lead to different processing for reducing localized prediction errors. Therefore, simply providing general smoothing processing cannot sufficiently reduce prediction errors.
进而,还存在以下问题:在进行了平均值预测时,由于将块内的预测全部作为与块相邻的像素的平均值,所以位于块边界的像素的预测信号相对于周围的已编码的像素容易变成不连续的信号,另一方面因为图像信号一般是空间方向相关性高的信号,所以因上述不连续性而容易在块的边界部分产生预测误差。Furthermore, there is the following problem: when performing average value prediction, since all predictions within the block are taken as the average value of the pixels adjacent to the block, the prediction signal of the pixels located at the block boundary is likely to become a discontinuous signal relative to the surrounding encoded pixels. On the other hand, because image signals are generally signals with high spatial directional correlation, prediction errors are likely to occur at the boundary part of the block due to the above-mentioned discontinuity.
发明内容Summary of the Invention
本发明是为了解决上述那样的问题而提出的,其目的在于得到一种能够减少局部性地发生的预测误差而提高图像品质的图像编码装置、图像译码装置、图像编码方法以及图像译码方法。The present invention is proposed to solve the above-mentioned problems, and its object is to provide an image encoding device, an image decoding device, an image encoding method, and an image decoding method that can reduce locally occurring prediction errors and improve image quality.
关于本发明涉及的图像编码装置,在帧内预测单元通过使用帧内的已编码的图像信号,实施帧内预测处理而生成预测图像时,从预先准备的1个以上的滤波器中,根据滤波处理对象块的编码所涉及的各种参数的状态来选择滤波器,使用该滤波器,实施针对预测图像的滤波处理,将滤波处理后的预测图像输出给差分图像生成单元。Regarding the image encoding device according to the present invention, when the intra-frame prediction unit generates a predicted image by performing intra-frame prediction processing using the encoded image signal within the frame, a filter is selected from one or more pre-prepared filters according to the states of various parameters involved in the encoding of the filtering object block, and filtering processing is performed on the predicted image using the filter, and the predicted image after filtering processing is output to the differential image generation unit.
根据本发明,因为构成为在帧内预测单元通过使用帧内的已编码的图像信号,实施帧内预测处理来生成预测图像时,从预先准备的1个以上的滤波器中,根据滤波处理对象块的编码所涉及的各种参数的状态来选择滤波器,使用该滤波器实施针对预测图像的滤波处理,将滤波处理后的预测图像输出给差分图像生成单元,所以具有能够减少局部性地发生的预测误差而提高图像品质的效果。According to the present invention, when the intra-frame prediction unit generates a predicted image by performing intra-frame prediction processing using an encoded image signal within a frame, a filter is selected from one or more pre-prepared filters according to the states of various parameters involved in the encoding of the filtering object block, and filtering processing is performed on the predicted image using the filter, and the filtered predicted image is output to the differential image generation unit. Therefore, it has the effect of reducing locally occurring prediction errors and improving image quality.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是表示本发明的实施方式1的运动图像编码装置的构成图。FIG1 is a diagram showing the structure of a moving picture encoding apparatus according to Embodiment 1 of the present invention.
图2是表示本发明的实施方式1的运动图像译码装置的构成图。FIG2 is a diagram showing the configuration of the moving picture decoding apparatus according to the first embodiment of the present invention.
图3是表示本发明的实施方式1的运动图像编码装置的处理内容的流程图。FIG3 is a flowchart showing the processing contents of the moving picture encoding apparatus according to Embodiment 1 of the present invention.
图4是表示本发明的实施方式1的运动图像译码装置的处理内容的流程图。FIG4 is a flowchart showing the processing contents of the moving picture decoding apparatus according to the first embodiment of the present invention.
图5是表示将最大尺寸的编码块分层地分割成多个编码块的样子的说明图。FIG5 is an explanatory diagram showing how a coding block of the maximum size is divided hierarchically into a plurality of coding blocks.
图6的(a)表示分割后的分区的分布,(b)是用4叉树图表示将编码模式m(Bn)分配给分层分割后的分区的状况的说明图。FIG6(a) shows the distribution of partitions after division, and FIG6(b) is an explanatory diagram showing, using a quadtree diagram, how coding modes m( Bn ) are assigned to partitions after hierarchical division.
图7是表示在编码块Bn内的各分区Pi n中可以选择的帧内预测参数(帧内预测模式)的一个例子的说明图。FIG7 is an explanatory diagram showing an example of intra-frame prediction parameters (intra-frame prediction modes) that can be selected in each partition Pin within the coding block Bn .
图8是表示在li n=mi n=4时,生成分区Pi n内的像素的预测值时使用的像素的一个例子的说明图。FIG8 is an explanatory diagram showing an example of pixels used when generating predicted values of pixels within a partition Pin when lin = min = 4 .
图9是表示N=5时的参照像素配置的一个例子的说明图。FIG. 9 is an explanatory diagram showing an example of reference pixel arrangement when N=5.
图10是表示亮度块尺寸是4×4像素时的非专利文献1的帧内预测模式的说明图。FIG. 10 is an explanatory diagram showing the intra prediction mode of Non-Patent Document 1 when the luminance block size is 4×4 pixels.
图11是表示在生成预测图像时使用的帧内的已编码的图像和滤波处理对象像素的距离的一个例子的说明图。FIG. 11 is an explanatory diagram showing an example of the distance between an encoded image and a filtering target pixel within a frame used when generating a predicted image.
图12是表示具体的滤波器的参照像素配置的说明图。FIG. 12 is an explanatory diagram showing a specific arrangement of reference pixels of a filter.
图13是表示决定对帧内预测模式索引和分区尺寸的组合的每个组合使用哪个滤波器的表的一个例子的说明图。FIG. 13 is an explanatory diagram showing an example of a table for determining which filter to use for each combination of an intra prediction mode index and a partition size.
图14是表示在进行了平均值预测时进行滤波处理的简化的一个例子的说明图。FIG. 14 is an explanatory diagram showing an example of simplification of filtering processing when average value prediction is performed.
图15是表示将滤波器选择表索引加到序列级标题时的比特流的一个例子的说明图。FIG15 is an explanatory diagram showing an example of a bit stream when a filter selection table index is added to a sequence level header.
图16是表示将滤波器选择表索引加到图片级标题时的比特流的一个例子的说明图。FIG16 is an explanatory diagram showing an example of a bit stream when a filter selection table index is added to a picture-level header.
图17是表示将滤波器选择表索引加到切片标题时的比特流的一个例子的说明图。FIG17 is an explanatory diagram showing an example of a bit stream when a filter selection table index is added to a slice header.
图18是表示将滤波器选择表索引加到基准块标题时的比特流的一个例子的说明图。FIG18 is an explanatory diagram showing an example of a bit stream when a filter selection table index is added to a reference block header.
图19是表示与决定对帧内预测模式索引和分区尺寸的组合的每个组合使用哪个滤波器的表的图13不同的例子的说明图。FIG. 19 is an explanatory diagram showing an example different from FIG. 13 of a table for determining which filter to use for each combination of an intra prediction mode index and a partition size.
图20是表示对帧内预测模式索引和分区尺寸的组合的每个组合决定是否对在生成中间预测图像时的参照像素进行平滑处理的表的例子的说明图。20 is an explanatory diagram showing an example of a table for determining whether or not to perform smoothing processing on reference pixels when generating an inter-prediction image for each combination of an intra prediction mode index and a partition size.
符号说明Explanation of symbols
1:编码控制部(编码控制单元)1: Encoding control unit (encoding control unit)
2:块分割部(块分割单元)2: Block division unit (block division unit)
3:切换开关(帧内预测单元,运动补偿预测单元)3: Switch (intra-frame prediction unit, motion compensation prediction unit)
4:帧内预测部(帧内预测单元)4: Intra-frame prediction unit (intra-frame prediction unit)
5:运动补偿预测部(运动补偿预测单元)5: Motion Compensated Prediction Unit (Motion Compensated Prediction Unit)
6:减法部(差分图像生成单元)6: Subtraction unit (difference image generation unit)
7:变换·量化部(图像压缩单元)7: Transformation and quantization unit (image compression unit)
8:反量化·反变换部8: Inverse quantization and inverse transformation unit
9:加法部9: Addition Department
10:帧内预测用存储器10: Intra-frame prediction memory
11:环路滤波器部11: Loop filter section
12:运动补偿预测帧存储器12: Motion compensation prediction frame memory
13:可变长度编码部(可变长度编码单元)13: Variable length coding unit (variable length coding means)
51:可变长度译码部(可变长度译码单元)51: Variable length decoding unit (variable length decoding means)
52:切换开关(帧内预测单元,运动补偿预测单元)52: Switch (intra-frame prediction unit, motion compensation prediction unit)
53:帧内预测部(帧内预测单元)53: Intra-frame prediction unit (intra-frame prediction unit)
54:运动补偿预测部(运动补偿预测单元)54: Motion Compensation Prediction Unit (Motion Compensation Prediction Unit)
55:反量化·反变换部(差分图像生成单元)55: Inverse quantization and inverse transformation unit (difference image generation unit)
56:加法部(译码图像生成单元)56: Adding unit (decoded image generating unit)
57:帧内预测用存储器57: Intra-frame prediction memory
58:环路滤波器部58: Loop filter section
59:运动补偿预测帧存储器59: Motion Compensated Prediction Frame Memory
100:滤波器选择表索引100: filter selection table index
具体实施方式DETAILED DESCRIPTION
以下,为了更详细地说明本发明,对实施方式按照附图进行说明。Hereinafter, in order to explain the present invention in more detail, embodiments will be described with reference to the accompanying drawings.
实施方式1Implementation Method 1
在本实施方式1中对运动图像编码装置和运动图像译码装置进行说明,该运动图像编码装置输入视频的各帧图像,通过实施来自已编码的附近像素的帧内预测处理或者在邻近帧之间实施运动补偿预测处理而生成预测图像,在对作为该预测图像和帧图像的差分图像的预测误差信号实施了基于正交变换·量化的压缩处理后,进行可变长度编码而生成比特流,该运动图像译码装置对从该运动图像编码装置输出的比特流进行译码。In this first embodiment, a motion image encoding device and a motion image decoding device are described. The motion image encoding device inputs each frame image of a video, generates a predicted image by performing intra-frame prediction processing from encoded nearby pixels or performing motion compensation prediction processing between adjacent frames, performs compression processing based on orthogonal transformation and quantization on a prediction error signal which is a differential image between the predicted image and the frame image, and then performs variable-length coding to generate a bit stream. The motion image decoding device decodes the bit stream output from the motion image encoding device.
本实施方式1的运动图像编码装置其特征在于:适应于视频信号的空间·时间方向的局部性的变化,将视频信号分割成多种尺寸的区域进行帧内·帧间适应性编码。The moving picture coding apparatus of the first embodiment is characterized by dividing the video signal into regions of various sizes to perform intra-frame and inter-frame adaptive coding in response to local changes in the spatial and temporal directions of the video signal.
一般地,视频信号具有信号的复杂程度在空间·时间上局部性地变化的特性。在空间上看,在某一特定的视频帧上,既有在天空、墙壁等那样比较宽的图像区域中具有均匀的信号特性的图样,也混合存在有在具有人物、细致的纹理的绘画等小的图像区域内具有复杂纹理图案的图样。Video signals generally exhibit local variations in complexity across space and time. Spatially, within a given video frame, there are patterns with uniform signal characteristics across relatively wide image regions, such as the sky and walls, as well as patterns with complex textures within smaller image regions, such as figures and finely textured paintings.
即使在时间上看,虽然天空、墙壁的时间方向的图样的变化局部性地小,但运动的人物、物体其轮廓因为在时间上进行刚性和非刚性运动,所以时间上的变化大。Even when viewed temporally, although the changes in the time-direction patterns of the sky and the wall are locally small, the contours of moving people and objects undergo rigid and non-rigid motion in time, so the changes in time are large.
编码处理通过时间·空间上的预测而生成信号电力、熵小的预测误差信号,从而消减整体的编码量,而如果能够将用于预测的参数均匀地应用于尽可能大的图像信号区域,则能够减小该参数的编码量。The encoding process generates a prediction error signal with low signal power and entropy through temporal and spatial prediction, thereby reducing the overall encoding amount. If the parameters used for prediction can be evenly applied to the largest possible image signal area, the encoding amount of the parameter can be reduced.
另一方面,对于时间上·空间上变化大的图像信号图案,如果应用同一预测参数,则因为预测的错误增加,所以无法消减预测误差信号的编码量。On the other hand, if the same prediction parameters are applied to an image signal pattern that varies greatly temporally and spatially, the prediction error increases, and thus the coding amount of the prediction error signal cannot be reduced.
因而,希望对于时间上·空间上变化大的图像信号图案,减小预测对象的区域,即使增加用于预测的参数的数据量,也减少预测误差信号的电力、熵。Therefore, for image signal patterns that vary greatly temporally and spatially, it is desirable to reduce the area to be predicted, thereby reducing the power and entropy of the prediction error signal even if the amount of parameter data used for prediction is increased.
为了进行适应于这样的视频信号的一般性质的编码,在本实施方式1的运动图像编码装置中,从规定的最大块尺寸分层分割视频信号的区域,对每个分割区域实施预测处理、预测误差的编码处理。In order to perform coding adapted to the general properties of such video signals, the moving picture coding apparatus according to the first embodiment hierarchically divides the video signal into regions based on a predetermined maximum block size, and performs prediction processing and prediction error coding processing on each divided region.
本实施方式1的运动图像编码装置作为处理对象的视频信号是视频帧具有水平、垂直2维的数字样本(像素)列的任意的视频信号,除了具有亮度信号和2个色差信号的YUV信号、从数字摄像元件输出的RGB信号等的任意的颜色空间的彩色视频信号以外,还有单色图像信号、红外线图像信号等。The video signal processed by the motion image encoding device of this embodiment 1 is an arbitrary video signal whose video frame has horizontal and vertical two-dimensional columns of digital samples (pixels). In addition to color video signals of arbitrary color spaces such as YUV signals having a brightness signal and two color difference signals, RGB signals output from a digital camera element, there are also monochrome image signals, infrared image signals, etc.
各像素的灰度既可以是8位,也可以是10位、12位等的灰度。The grayscale of each pixel may be 8 bits, 10 bits, 12 bits, or the like.
但是,在以下的说明中,除非另有规定,假设输入的视频信号是YUV信号。另外,假设2个色差成分U、V相对于亮度成分Y是子采样的4:2:0格式的信号。However, in the following description, unless otherwise specified, it is assumed that the input video signal is a YUV signal. In addition, it is assumed that the two color difference components U and V are subsampled with respect to the luminance component Y and are in a 4:2:0 format.
而且,把与视频的各帧对应的处理数据单位称为“图片”,在本实施方式1中,“图片”作为进行顺序扫描(progressive scanning:逐行扫描)的视频帧的信号进行说明。但是,当视频信号是交错信号的情况下,“图片”也可以是作为构成视频帧的单位的场图像信号。Furthermore, the unit of processed data corresponding to each video frame is called a "picture." In this first embodiment, a "picture" is described as a signal of a video frame that undergoes progressive scanning. However, if the video signal is an interlaced signal, a "picture" may also be a field image signal that constitutes a video frame.
图1是表示本发明的实施方式1的运动图像编码装置的构成图。FIG1 is a diagram showing the structure of a moving picture encoding apparatus according to Embodiment 1 of the present invention.
在图1中,编码控制部1实施如下处理:决定成为实施帧内预测处理(帧内预测处理)或者运动补偿预测处理(帧间预测处理)时的处理单位的编码块的最大尺寸,并且决定将最大尺寸的编码块分层分割时的上限层数。In Figure 1, the encoding control unit 1 implements the following processing: determines the maximum size of the coding block that serves as the processing unit when implementing intra-frame prediction processing (intra-frame prediction processing) or motion compensation prediction processing (inter-frame prediction processing), and determines the upper limit number of layers when the maximum-sized coding block is divided into layers.
另外,编码控制部1实施如下处理:从可以利用的1个以上的编码模式(1个以上的帧内编码模式、1个以上的帧间编码模式)中,选择适于分层分割的各个编码块的编码模式。Furthermore, the encoding control unit 1 performs processing for selecting an encoding mode suitable for each hierarchically divided encoding block from one or more available encoding modes (one or more intra-frame encoding modes and one or more inter-frame encoding modes).
另外,编码控制部1实施如下处理:对各个编码块的每个,决定在压缩差分图像时使用的量化参数以及变换块大小,并且决定在实施预测处理时使用的帧内预测参数或者帧间预测参数。量化参数以及变换块尺寸包含于预测误差编码参数,被输出给变换·量化部7、反量化·反变换部8以及可变长度编码部13等。Furthermore, the encoding control unit 1 performs processing to determine, for each coding block, the quantization parameter and transform block size used when compressing the differential image, and the intra-frame prediction parameters or inter-frame prediction parameters used when performing prediction processing. The quantization parameter and transform block size are included in the prediction error coding parameters and are output to the transform/quantization unit 7, the inverse quantization/inverse transform unit 8, and the variable length coding unit 13.
而且,编码控制部1构成编码控制单元。Furthermore, the encoding control unit 1 constitutes an encoding control unit.
块分割部2实施如下处理:当输入表示输入图像的视频信号时,将该视频信号表示的输入图像分割成由编码控制部1所决定的最大尺寸的编码块,并且直到达到由编码控制部1决定的上限层数为止,分层分割该编码块。而且,块分割部2构成块分割单元。When a video signal representing an input image is input, the block division unit 2 performs processing such that the input image represented by the video signal is divided into coding blocks of the maximum size determined by the coding control unit 1, and the coding blocks are divided hierarchically until the upper limit number of layers determined by the coding control unit 1 is reached. Furthermore, the block division unit 2 constitutes a block division unit.
切换开关3实施如下处理:如果由编码控制部1选择的编码模式是帧内编码模式,则将用块分割部2分割了的编码块输出到帧内预测部4,如果由编码控制部1选择的编码模式是帧间编码模式,则将由块分割部2分割了的编码块输出到运动补偿预测部5。The switching switch 3 implements the following processing: if the coding mode selected by the coding control unit 1 is the intra-frame coding mode, the coding block divided by the block division unit 2 is output to the intra-frame prediction unit 4; if the coding mode selected by the coding control unit 1 is the inter-frame coding mode, the coding block divided by the block division unit 2 is output to the motion compensation prediction unit 5.
帧内预测部4实施如下处理:当从切换开关3接收到由块分割部2分割了的编码块时,使用帧内的已编码的图像信号,根据从编码控制部1输出的帧内预测参数,通过实施针对该编码块的帧内预测处理来生成预测图像。The intra-frame prediction unit 4 performs the following processing: when receiving the coding block divided by the block division unit 2 from the switching switch 3, it uses the encoded image signal in the frame and generates a predicted image by performing intra-frame prediction processing on the coding block based on the intra-frame prediction parameters output from the encoding control unit 1.
但是,帧内预测部4在生成上述预测图像后,在从预先准备的1个以上的滤波器中,根据在运动图像译码装置中生成和上述预测图像相同的预测图像的时刻点已知的各种参数的状态来选择滤波器,使用该滤波器来实施针对上述预测图像的滤波处理,将滤波处理后的预测图像输出到减法部6以及加法部9。However, after generating the above-mentioned predicted image, the intra-frame prediction unit 4 selects a filter from one or more pre-prepared filters based on the states of various parameters known at the time when a predicted image identical to the above-mentioned predicted image is generated in the motion image decoding device, uses the filter to perform filtering processing on the above-mentioned predicted image, and outputs the filtered predicted image to the subtraction unit 6 and the addition unit 9.
具体地说,作为上述各参数根据以下4个参数中的至少1个以上的参数的状态来唯一地决定滤波器。Specifically, the filter is uniquely determined based on the state of at least one of the following four parameters as the above-mentioned parameters.
·参数(1)Parameters (1)
上述的预测图像的块尺寸The block size of the predicted image above
·参数(2)Parameters (2)
由编码控制部1决定的量化参数The quantization parameter determined by the encoding control unit 1
·参数(3)Parameters (3)
在生成预测图像时使用的帧内的已编码的图像信号和滤波处理对象像素的距离The distance between the coded image signal in the frame used to generate the predicted image and the pixel to be filtered
·参数(4)Parameters (4)
由编码控制部1决定的帧内预测参数The intra-frame prediction parameters determined by the encoding control unit 1
而且,由切换开关3以及帧内预测部4来构成帧内预测单元。Furthermore, the switch 3 and the intra prediction unit 4 constitute an intra prediction unit.
运动补偿预测部5实施如下处理:作为适于由块分割部2分割了的编码块的编码模式,当由编码控制部1选择了帧间编码模式的情况下,使用由运动补偿预测帧存储器12保持的1帧以上的参照图像,根据从编码控制部1输出的帧间预测参数,通过实施针对该编码块的运动补偿预测处理来生成预测图像。而且,由切换开关3以及运动补偿预测部5构成运动补偿预测单元。When the coding control unit 1 has selected the inter-frame coding mode as the coding mode appropriate for the coding block divided by the block division unit 2, the motion-compensated prediction unit 5 generates a predicted image by performing motion-compensated prediction processing on the coding block based on the inter-frame prediction parameters output from the coding control unit 1, using one or more reference images stored in the motion-compensated prediction frame memory 12. The switch 3 and the motion-compensated prediction unit 5 constitute a motion-compensated prediction unit.
减法部6实施如下处理:通过从由块分割部2分割了的编码块中减去由帧内预测部4或者运动补偿预测部5生成的预测图像,生成差分图像(=编码块-预测图像)。而且,减法部6构成差分图像生成单元。The subtraction unit 6 generates a difference image (= coded block - predicted image) by subtracting the predicted image generated by the intra prediction unit 4 or the motion compensation prediction unit 5 from the coded block divided by the block division unit 2. The subtraction unit 6 constitutes a difference image generation unit.
变换·量化部7以从编码控制部1输出的预测误差编码参数所包含的变换块尺寸为单位来实施由减法部6生成的差分图像的变换处理(例如,DCT(离散余弦变换)、预先针对特定的学习系列进行底层设计的KL变换等的正交变换处理),并且实施如下处理:使用该预测误差编码参数所包含的量化参数,对该差分图像的变换系数进行量化,由此将量化后的变换系数作为差分图像的压缩数据来输出。而且,变换·量化部7构成图像压缩单元。The transform/quantization unit 7 performs a transform process (e.g., a DCT (discrete cosine transform) or an orthogonal transform such as a KL transform pre-designed for a specific learning sequence) on the difference image generated by the subtraction unit 6, using the transform block size included in the prediction error coding parameters output from the encoding control unit 1 as a unit. Furthermore, the transform coefficients of the difference image are quantized using the quantization parameters included in the prediction error coding parameters, thereby outputting the quantized transform coefficients as compressed data for the difference image. Furthermore, the transform/quantization unit 7 constitutes image compression means.
反量化·反变换部8实施如下处理:通过使用从编码控制部1输出的预测误差编码参数所包含的量化参数,对从变换·量化部7输出的压缩数据进行反量化,以该预测误差编码参数所包含的变换块尺寸为单位,实施反量化的压缩数据的反变换处理(例如,反DCT(反离散余弦变换)、反KL变换等的反变换处理),从而将反变换处理后的压缩数据作为局部译码预测误差信号进行输出。The inverse quantization and inverse transformation unit 8 performs the following processing: by using the quantization parameters included in the prediction error coding parameters output from the encoding control unit 1, the compressed data output from the transformation and quantization unit 7 is inverse quantized, and the inverse transformation processing (for example, inverse DCT (inverse discrete cosine transform), inverse KL transform, etc.) of the inverse quantized compressed data is performed in units of the transformation block size included in the prediction error coding parameters, thereby outputting the compressed data after the inverse transformation processing as a local decoding prediction error signal.
加法部9实施如下处理:通过将从反量化·反变换部8输出的局部译码预测误差信号和表示由帧内预测部4或者运动补偿预测部5生成的预测图像的预测信号相加,生成表示局部译码图像的局部译码图像信号。The adding unit 9 performs the following processing: by adding the local decoded prediction error signal output from the inverse quantization/inverse transform unit 8 and the prediction signal representing the predicted image generated by the intra-frame prediction unit 4 or the motion compensation prediction unit 5, a local decoded image signal representing the local decoded image is generated.
帧内预测用存储器10是将由加法部9生成的局部译码图像信号表示的局部译码图像作为用帧内预测部4在下次的帧内预测处理中使用的图像保存的RAM等的存储介质。The intra prediction memory 10 is a storage medium such as RAM that stores the local decoded image represented by the local decoded image signal generated by the adder 9 as an image to be used in the next intra prediction process by the intra prediction unit 4 .
环路滤波器部11实施如下处理:对由加法器9生成的局部译码图像信号所包含的编码失真进行补偿,将编码失真补偿后的局部译码图像信号表示的局部译码图像作为参照图像输出到运动补偿预测帧存储器12。The loop filter unit 11 performs processing to compensate for coding distortion included in the local decoded image signal generated by the adder 9 and outputs the local decoded image represented by the local decoded image signal after coding distortion compensation to the motion compensation prediction frame memory 12 as a reference image.
运动补偿预测帧存储器12是将由环路滤波器部11进行的滤波处理后的局部译码图像作为由运动补偿预测部5在下次的运动补偿预测处理中使用的参照图像而保存的RAM等的记录介质。The motion-compensated prediction frame memory 12 is a recording medium such as RAM that stores the local decoded image filtered by the loop filter unit 11 as a reference image to be used by the motion-compensated prediction unit 5 in the next motion-compensated prediction process.
可变长度编码部13实施如下处理:对从变换·量化部7输出的压缩数据、从编码控制部1输出的编码模式以及预测误差编码参数、从帧内预测部4输出的帧内预测参数或者从运动补偿预测部5输出的帧间预测参数进行可变长度编码,生成复用了该压缩数据、编码模式、预测误差编码参数、帧内预测参数/帧间预测参数的编码数据的比特流。而且,可变长度编码部13构成可变长度编码单元。The variable length coding unit 13 performs variable length coding on the compressed data output from the transform/quantization unit 7, the coding mode and prediction error coding parameters output from the coding control unit 1, and the intra-frame prediction parameters output from the intra-frame prediction unit 4 or the inter-frame prediction parameters output from the motion compensation prediction unit 5, thereby generating a bit stream of coded data that multiplexes the compressed data, coding mode, prediction error coding parameters, and intra-frame prediction parameters/inter-frame prediction parameters. Furthermore, the variable length coding unit 13 constitutes a variable length coding unit.
图2是表示本发明的实施方式1的运动图像译码装置的构成图。FIG2 is a diagram showing the configuration of the moving picture decoding apparatus according to the first embodiment of the present invention.
在图2中,可变长度译码部51实施如下处理:从在比特流中复用的编码数据中对分层分割的各个编码块所涉及的压缩数据、编码模式、预测误差编码参数、帧内预测参数/帧间预测参数进行可变长度译码,将其压缩数据以及预测误差编码参数输出到反量化·反变换部55,并且将其编码模式以及帧内预测参数/帧间预测参数输出到切换开关52。而且,可变长度译码部51构成可变长度译码单元。In FIG2 , the variable-length decoding unit 51 performs variable-length decoding of compressed data, coding modes, prediction error coding parameters, and intra-frame prediction parameters/inter-frame prediction parameters for each hierarchically divided coding block from the coded data multiplexed in the bitstream, outputs the compressed data and prediction error coding parameters to the inverse quantization/inverse transform unit 55, and outputs the coding mode and intra-frame prediction parameters/inter-frame prediction parameters to the switch 52. Furthermore, the variable-length decoding unit 51 constitutes a variable-length decoding unit.
切换开关52实施如下处理:在从可变长度译码部51输出的编码块所涉及的编码模式是帧内编码模式的情况下,将从可变长度译码部51输出的帧内预测参数输出到帧内预测部53,当其编码模式是帧间编码模式的情况下,将从可变长度译码部51输出的帧间预测参数输出到运动补偿预测部54。The switching switch 52 performs the following processing: when the coding mode involved in the coding block output from the variable length decoding unit 51 is the intra-frame coding mode, the intra-frame prediction parameters output from the variable length decoding unit 51 are output to the intra-frame prediction unit 53; when its coding mode is the inter-frame coding mode, the inter-frame prediction parameters output from the variable length decoding unit 51 are output to the motion compensation prediction unit 54.
帧内预测部53实施如下处理:使用帧内的已译码的图像信号,根据从切换开关52输出的帧内预测参数,通过针对编码块实施帧内预测处理来生成预测图像。The intra prediction unit 53 performs a process of generating a predicted image by performing an intra prediction process on the coding block based on the intra prediction parameters output from the switch 52 using the decoded image signal in the frame.
其中,帧内预测内部53在生成上述的预测图像后,从预先准备的1个以上的滤波器中,根据在生成上述预测图像的时刻点已知的各种参数的状态来选择滤波器,使用该滤波器对上述预测图像实施滤波处理,将滤波处理后的预测图像输出到加法部56。Among them, after generating the above-mentioned prediction image, the intra-frame prediction internal 53 selects a filter from one or more pre-prepared filters according to the states of various parameters known at the time of generating the above-mentioned prediction image, uses the filter to perform filtering processing on the above-mentioned prediction image, and outputs the filtered prediction image to the addition unit 56.
具体地说,作为上述各种参数,根据以下的4个参数中的至少1个以上的参数的状态唯一地决定滤波器。其中,所使用的参数预先决定为与前面说明的运动图像编码装置相同的参数。即,统一在运动图像编码装置以及运动图像译码装置中使用的参数,以使得当在运动图像编码装置一侧,帧内预测部4使用参数(1)以及(4)进行滤波处理的情况下,运动图像译码装置一侧也同样地,帧内预测部53使用参数(1)以及(4)进行滤波处理。Specifically, as the various parameters described above, the filter is uniquely determined based on the state of at least one of the following four parameters. The parameters used are predetermined to be the same as those used in the previously described moving picture encoding device. That is, the parameters used in the moving picture encoding device and the moving picture decoding device are unified so that when the intra-frame prediction unit 4 on the moving picture encoding device performs filtering using parameters (1) and (4), the intra-frame prediction unit 53 on the moving picture decoding device similarly performs filtering using parameters (1) and (4).
·参数(1)Parameters (1)
上述预测图像的块尺寸The block size of the above predicted image
·参数(2)Parameters (2)
用可变长度译码部51进行可变长度译码的量化参数Quantization parameter of variable length decoding performed by the variable length decoding unit 51
·参数(3)Parameters (3)
在生成预测图像时使用的帧内的已译码的图像信号和滤波处理对象像素的距离The distance between the decoded image signal in the frame used to generate the predicted image and the pixel to be filtered
·参数(4)Parameters (4)
用可变长度译码部51进行可变长度译码的帧内预测参数Intra-frame prediction parameters that are variable-length decoded by the variable-length decoding unit 51
而且,由切换开关52以及帧内预测部53构成帧内预测单元。The switch 52 and the intra prediction unit 53 constitute an intra prediction unit.
运动补偿预测部54实施如下处理:使用由运动补偿预测帧存储器59保存的1帧以上的参照图像,根据从切换开关52输出的帧内预测参数,通过对编码块实施运动补偿预测处理来生成预测图像。而且,由切换开关52以及运动补偿预测部54构成运动补偿预测单元。The motion-compensated prediction unit 54 generates a predicted image by performing motion-compensated prediction on the coding block based on the intra-frame prediction parameters output from the switch 52, using one or more reference images stored in the motion-compensated prediction frame memory 59. The switch 52 and the motion-compensated prediction unit 54 constitute a motion-compensated prediction unit.
反量化·反变换部55实施如下处理:使用从可变长度译码部51输出的预测误差编码参数所包含的量化参数,对从可变长度译码部51输出的编码块所涉及的压缩数据进行反量化,通过以该预测误差编码参数所包含的变换块尺寸为单位,实施反量化的压缩数据的反变换处理(例如,反DCT(反离散余弦变换)、反KL变换等的反变换处理),从而将反变换处理后的压缩数据作为译码预测误差信号(表示压缩前的差分图像的信号)进行输出。而且,反量化·反变换部55构成差分图像生成单元。The inverse quantization/inverse transformation unit 55 performs the following processing: using the quantization parameter included in the prediction error coding parameter output from the variable length decoding unit 51, it inversely quantizes the compressed data associated with the coded block output from the variable length decoding unit 51. By performing inverse transformation processing (e.g., inverse DCT (inverse discrete cosine transform) or inverse KL transform) on the inversely quantized compressed data in units of the transform block size included in the prediction error coding parameter, the inversely transformed compressed data is output as a decoded prediction error signal (a signal representing the difference image before compression). Furthermore, the inverse quantization/inverse transformation unit 55 constitutes a difference image generation unit.
加法部56实施如下处理:通过将从反量化·反变换部55输出的译码预测误差信号和表示由帧内预测部53或者运动补偿预测部54生成的预测图像的预测信号相加,生成表示译码图像的译码图像信号。而且,加法部56构成译码图像生成单元。The addition unit 56 performs processing to generate a decoded image signal representing a decoded image by adding the decoded prediction error signal output from the inverse quantization/inverse transform unit 55 and a prediction signal representing a predicted image generated by the intra prediction unit 53 or the motion compensation prediction unit 54. Furthermore, the addition unit 56 constitutes a decoded image generation unit.
帧内预测用存储器57是将由加法部56生成的译码图像信号表示的译码图像作为由帧内预测部53在下次的帧内预测处理中使用的图像而保存的RAM等的记录介质。The intra prediction memory 57 is a recording medium such as RAM that stores the decoded image represented by the decoded image signal generated by the adder 56 as an image used by the intra prediction unit 53 in the next intra prediction process.
环路滤波器部58实施如下处理:对由加法器56生成的译码图像信号所包含的编码失真进行补偿,将编码失真补偿后的译码图像信号表示的译码图像作为参照图像而输出到运动补偿预测帧存储器59。The loop filter unit 58 performs processing to compensate for coding distortion included in the decoded image signal generated by the adder 56 , and outputs a decoded image represented by the decoded image signal after coding distortion compensation as a reference image to the motion compensation prediction frame memory 59 .
运动补偿预测帧存储器59是将用环路滤波器部58进行环路滤波处理后的译码图像作为用运动补偿预测部54在下次的运动补偿预测处理中使用的参照图像而保存的RAM等的记录介质。The motion compensation prediction frame memory 59 is a recording medium such as RAM that stores the decoded image subjected to the loop filtering process by the loop filter unit 58 as a reference image to be used in the next motion compensation prediction process by the motion compensation prediction unit 54 .
在图1中,假设作为运动图像编码装置的构成要素的编码控制部1、块分割部2、切换开关3、帧内预测部4、运动补偿预测部5、减法部6、变换·量化部7、反量化·反变换部8、加法部9、环路滤波器部11以及可变长度编码部13分别用专用的硬件(例如,安装有CPU的半导体集成电路或者单片机等)来构成,但在运动图像编码装置用计算机构成的情况下,也可以将记录有编码控制部1、块分割部2、切换开关3、帧内预测部4、运动补偿预测部5、减法部6、变换·量化部7、反量化·反变换部8、加法部9、环路滤波器部11以及可变长度编码部13的处理内容的程序保存于该计算机的存储器,该计算机的CPU执行保存于该存储器的程序。In FIG. 1 , it is assumed that the components of the moving picture coding apparatus, namely, the coding control unit 1, the block division unit 2, the switch 3, the intra-frame prediction unit 4, the motion-compensated prediction unit 5, the subtraction unit 6, the transform/quantization unit 7, the inverse quantization/inverse transform unit 8, the addition unit 9, the loop filter unit 11, and the variable-length coding unit 13, are each configured using dedicated hardware (e.g., a semiconductor integrated circuit or a single-chip microcomputer having a CPU mounted thereon). However, if the moving picture coding apparatus is configured using a computer, a program recording the processing details of the coding control unit 1, the block division unit 2, the switch 3, the intra-frame prediction unit 4, the motion-compensated prediction unit 5, the subtraction unit 6, the transform/quantization unit 7, the inverse quantization/inverse transform unit 8, the addition unit 9, the loop filter unit 11, and the variable-length coding unit 13 may be stored in a memory of the computer, and the CPU of the computer may execute the program stored in the memory.
图3是表示本发明的实施方式1的运动图像编码装置的处理内容的流程图。FIG3 is a flowchart showing the processing contents of the moving picture encoding apparatus according to Embodiment 1 of the present invention.
在图2中,假设作为运动图像译码装置的构成要素的可变长度译码部51、切换开关52、帧内预测部53、运动补偿预测部54、反量化·反变换部55、加法部56以及环路滤波器部58分别用专用的硬件(例如,安装有CPU的半导体集成电路或者单片机等)构成,但在运动图像译码装置用计算机构成的情况下,也可以将记录有可变长度译码部51、切换开关52、帧内预测部53、运动补偿预测部54、反量化·反变换部55、加法部56以及环路滤波器部58的处理内容的程序保存于该计算机的存储器,该计算机的CPU执行保存于该存储器的程序。In FIG. 2 , it is assumed that the variable-length decoding unit 51, the switch 52, the intra-frame prediction unit 53, the motion-compensated prediction unit 54, the inverse quantization/inverse transform unit 55, the addition unit 56, and the loop filter unit 58, which are components of the moving picture decoding apparatus, are each configured using dedicated hardware (e.g., a semiconductor integrated circuit or a single-chip microcomputer equipped with a CPU). However, if the moving picture decoding apparatus is configured using a computer, a program recording the processing contents of the variable-length decoding unit 51, the switch 52, the intra-frame prediction unit 53, the motion-compensated prediction unit 54, the inverse quantization/inverse transform unit 55, the addition unit 56, and the loop filter unit 58 may be stored in a memory of the computer, and the CPU of the computer may execute the program stored in the memory.
图4是表示本发明的实施方式1的运动图像译码装置的处理内容的流程图。FIG4 is a flowchart showing the processing contents of the moving picture decoding apparatus according to the first embodiment of the present invention.
以下说明动作。The following describes the operation.
最初,说明图1的运动图像编码装置的处理内容。First, the processing contents of the moving picture encoding apparatus in FIG. 1 will be described.
首先,编码控制部1决定实施帧内预测处理(帧内预测处理)或者运动补偿预测处理(帧间预测处理)时的成为处理单位的编码块的最大尺寸,并且决定分层分割最大尺寸的编码块时的上限层数(图3的步骤ST1)。First, the encoding control unit 1 determines the maximum size of the coding block that serves as the processing unit when performing intra-frame prediction processing (intra-frame prediction processing) or motion compensation prediction processing (inter-frame prediction processing), and determines the upper limit number of layers when hierarchically dividing the coding block of the maximum size (step ST1 of Figure 3).
作为编码块的最大尺寸的决定方法,例如考虑针对全部的图片,决定出与输入图像的分辨率相应的尺寸的方法。As a method for determining the maximum size of a coding block, for example, a method of determining a size corresponding to the resolution of an input image for all pictures may be considered.
另外,考虑将输入图像的局部性的运动的复杂程度的不同作为参数进行定量化,在运动剧烈的图片中将最大尺寸决定为小的值,在运动少的图片中将最大尺寸决定为大的值的方法等。Another possible method is to quantify the difference in complexity of local motion in the input image as a parameter, and determine the maximum size to be a small value in images with intense motion and a large value in images with less motion.
对于上限的层数的设定例如考虑以下的方法,即、设定为输入图像的运动越是剧烈,越加深层数,以能够检测更细微的运动,如果输入图像的运动少,则抑制层数。For example, the following method can be considered to set the upper limit of the number of layers: the more intense the motion of the input image, the deeper the number of layers to detect finer motions; and if the motion of the input image is small, the number of layers is reduced.
另外,编码控制部1从可以利用的1个以上的编码模式(M种帧内编码模式,N种帧间编码模式)中选择适合于分层分割的各个编码块的编码模式(步骤ST2)。Furthermore, the encoding control unit 1 selects an encoding mode suitable for each hierarchically divided encoding block from one or more available encoding modes (M intra-frame encoding modes and N inter-frame encoding modes) (step ST2).
基于编码控制部1的编码模式的选择方法因为是公知的技术,所以省略详细的说明,例如有:使用可以利用的任意的编码模式,对编码块实施编码处理,验证编码效率,在可以利用的多个编码模式中,选择编码效率最好的编码模式的方法等。The method for selecting the coding mode based on the coding control unit 1 is a well-known technology, so detailed description is omitted. For example, there are methods such as: using any available coding mode, performing coding processing on the coding block, verifying the coding efficiency, and selecting the coding mode with the best coding efficiency from multiple available coding modes.
另外,编码控制部1对各个编码块的每个,决定压缩差分图像时使用的量化参数以及变换块尺寸,并且决定实施预测处理时使用的帧内预测参数或者帧间预测参数。Furthermore, the encoding control unit 1 determines, for each encoding block, a quantization parameter and a transform block size used when compressing a differential image, and determines intra-frame prediction parameters or inter-frame prediction parameters used when performing a prediction process.
编码控制部1将包含量化参数以及变换块尺寸的预测误差编码参数输出到变换·量化部7、反量化·反变换部8以及可变长度编码部13。另外,根据需要将预测误差编码参数输出到帧内预测部4。The encoding control unit 1 outputs prediction error coding parameters including quantization parameters and transform block size to the transform/quantization unit 7, inverse quantization/inverse transform unit 8, and variable length encoding unit 13. Furthermore, the prediction error coding parameters are output to the intra prediction unit 4 as needed.
块分割部2如果输入表示输入图像的视频信号,则将该视频信号表示的输入图像分割成由编码控制部1决定的最大尺寸的编码块,并且直到达到用编码控制部1决定的上限层数为止,分层分割该编码块。When a video signal representing an input image is input, the block division unit 2 divides the input image represented by the video signal into coding blocks of the maximum size determined by the coding control unit 1, and divides the coding blocks into layers until the upper limit number of layers determined by the coding control unit 1 is reached.
在此,图5是表示将最大尺寸的编码块分层分割成多个编码块的样子的说明图。Here, FIG5 is an explanatory diagram showing how a coding block of the maximum size is hierarchically divided into a plurality of coding blocks.
在图5的例子中,最大尺寸的编码块是第0层的编码块B0,按照亮度成分具有(L0,M0)的尺寸。In the example of FIG5 , the largest-sized coding block is coding block B 0 of layer 0, which has a size of (L 0 , M 0 ) according to the luminance component.
另外,在图5的例子中,将最大尺寸的编码块B0作为始点,用4叉树结构分层分割到另外确定的规定深度,由此得到编码块Bn。In the example of FIG. 5 , the maximum-sized coding block B 0 is used as a starting point, and the coding block B n is obtained by hierarchically dividing the coding block B 0 to a predetermined depth determined separately using a quad-tree structure.
在深度n中,编码块Bn是尺寸(Ln、Mn)的图像区域。At depth n, coding block Bn is an image region of size ( Ln , Mn ).
其中,Ln和Mn可以相同也可以不同,但在图5的例子中示出Ln=Mn的情况。However, Ln and Mn may be the same or different, but the example in FIG5 shows a case where Ln = Mn .
以后,编码块Bn的尺寸定义为编码块Bn的亮度成分中的(Ln,Mn)。Hereinafter, the size of coding block Bn is defined as ( Ln , Mn ) in the luma component of coding block Bn .
块分割部2因为进行4叉树分割,所以(Ln+1,Mn+1)=(Ln/2,Mn/2)始终成立。Since the block dividing unit 2 performs quad-tree division, (Ln +1 , Mn +1 ) = ( Ln /2, Mn /2) always holds.
其中,如RGB信号等那样,在全部的颜色成分具有相同样本数的彩色视频信号(4:4:4格式)中,全部的颜色成分的尺寸成为(Ln,Mn),但在处理4:2:0格式的情况下,对应的色差成分的编码块的尺寸是(Ln/2,Mn/2)。In a color video signal (4:4:4 format) where all color components have the same number of samples, such as an RGB signal, the size of all color components is ( Ln , Mn ). However, when processing the 4:2:0 format, the size of the coding block of the corresponding color difference component is ( Ln /2, Mn /2).
以后,将在第n层的编码块Bn中可以选择的编码模式标记为m(Bn)。Hereinafter, the coding mode that can be selected in the coding block Bn of the nth layer is denoted as m( Bn ).
当是具有多个颜色成分的彩色视频信号的情况下,编码模式m(Bn)对每种颜色成分可以以分别使用独立的模式的方式构成,但以后只要没有特别说明,就是设为是指对YUV信号4:2:0格式的编码块的亮度成分的编码模式的情况来进行说明。In the case of a color video signal having multiple color components, the coding mode m( Bn ) can be configured to use an independent mode for each color component. However, unless otherwise specified, the following description will be based on the coding mode for the luminance component of a coding block in a YUV signal 4:2:0 format.
在编码模式m(Bn)中有1个或多个帧内编码模式(总称为“INTRA”)、1个或多个帧间编码模式(总称为“INTER”),编码控制部1如上所述,从在该图片中可以利用的全部的编码模式或其子集中选择对编码块Bn编码效率最好的编码模式。Among the coding modes m( Bn ), there are one or more intra-frame coding modes (collectively referred to as "INTRA") and one or more inter-frame coding modes (collectively referred to as "INTER"). As described above, the coding control unit 1 selects the coding mode with the highest coding efficiency for the coding block Bn from all coding modes available for the picture or a subset thereof.
编码块Bn如图5所示,进一步分割成1个或多个预测处理单位(分区)。As shown in FIG5 , the coding block Bn is further divided into one or more prediction processing units (partitions).
以后,将属于编码块Bn的分区标记为Pi n(i:第n层的分区号码)。Hereinafter, the partitions belonging to the coding block Bn are denoted as Pin ( i: partition number of the nth layer).
将属于编码块Bn的分区Pi n的分割怎样进行的作为信息包含在编码模式m(Bn)中。The coding mode m(B n ) contains information on how the partition P i n belonging to the coding block B n is divided.
分区Pi n全部按照编码模式m(Bn)进行预测处理,但对于每个分区Pi n,可以选择独立的预测参数。All partitions P i n are predicted according to the coding mode m(B n ), but for each partition P i n , independent prediction parameters can be selected.
编码控制部1对于最大尺寸的编码块,例如生成图6所示那样的块分割状态,确定编码块Bn。The encoding control unit 1 generates a block division state as shown in FIG. 6 , for example, for the maximum-sized encoding block, and determines the encoding block B n .
图6(a)的阴影部分表示分割后的分区的分布,另外,图6(b)用4叉树图来表示将编码模式m(Bn)分配给分层分割后的分区的状况。The shaded areas in FIG6(a) represent the distribution of partitions after division, and FIG6(b) shows, using a quadtree diagram, how coding modes m( Bn ) are assigned to partitions after hierarchical division.
在图6(b)中,用四方形包围的节点表示被分配了编码模式m(Bn)的节点(编码块Bn)。In FIG6(b), the nodes surrounded by squares represent nodes (coding blocks Bn ) to which coding mode m( Bn ) is assigned.
在编码控制部1选择对各个编码块Bn的分区Pi n最佳的编码模式m(Bn)时,如果该编码模式m(Bn)是帧内编码模式(步骤ST3),则切换开关3将用块分割部2分割了的编码块Bn的分区Pi n输出到帧内预测部4。When the encoding control unit 1 selects the optimal encoding mode m( Bn ) for each partition P in the encoding block Bn , if the encoding mode m( Bn ) is an intra-frame encoding mode (step ST3), the switching switch 3 outputs the partition P in the encoding block Bn divided by the block dividing unit 2 to the intra-frame prediction unit 4.
另一方面,如果该编码模式m(Bn)是帧间编码模式(步骤ST3),则将用块分割部2分割了的编码块Bn的分区Pi n输出到运动补偿预测部5。On the other hand, if the coding mode m(B n ) is the inter-frame coding mode (step ST3 ), the partitions P in of the coding block B n divided by the block dividing unit 2 are output to the motion compensation prediction unit 5 .
在从切换开关3接收到编码块Bn的分区Pi n时,帧内编码预测部4使用帧内的已编码的图像信号,根据从编码控制部1输出的帧内预测参数,通过对该编码块Bn的分区Pi n实施帧内预测处理,生成帧内预测图像Pi n(步骤ST4)。Upon receiving the partition Pin of the coding block Bn from the switching switch 3, the intra-frame coding prediction unit 4 uses the encoded image signal within the frame and, based on the intra-frame prediction parameters output from the coding control unit 1, performs intra-frame prediction processing on the partition Pin of the coding block Bn to generate an intra-frame predicted image Pin ( step ST4).
但是,帧内预测部4在生成上述帧内预测图像Pi n后,从预先准备的1个以上的滤波器中,根据在运动图像译码装置中在生成和上述帧内预测图像Pi n相同的预测图像的时刻点已知的各种参数的状态来选择滤波器,使用该滤波器对该帧内预测图像Pi n实施滤波处理。However, after generating the intra-frame prediction image P in , the intra-frame prediction unit 4 selects a filter from one or more pre-prepared filters based on the states of various parameters known at the time of generating a prediction image identical to the intra-frame prediction image P in the motion picture decoding device, and performs filtering processing on the intra-frame prediction image P using the selected filter.
帧内预测部4当对帧内预测图像Pi n实施滤波处理时,将滤波处理后的帧内预测图像Pi n输出到减法部6以及加法部9,而即使在图2的运动图像译码装置中也因为能够生成相同的帧内预测图像Pi n,所以将该帧内预测参数输出到可变长度编码部13。When the intra-frame prediction unit 4 performs filtering processing on the intra-frame prediction image P in , it outputs the filtered intra-frame prediction image P in to the subtraction unit 6 and the addition unit 9. Since the same intra-frame prediction image P in can be generated even in the moving picture decoding apparatus of FIG. 2 , the intra-frame prediction parameters are output to the variable-length coding unit 13.
帧内预测部4的处理内容的概略如上所述,详细处理内容以后说明。The outline of the processing contents of the intra prediction unit 4 is as described above, and the details of the processing contents will be described later.
运动补偿预测部5在从切换开关3接收到编码块Bn的分区Pi n时,使用用运动补偿预测帧存储器12保存的1帧以上的参照图像,根据从编码控制部1输出的帧间预测参数,通过对该编码块Bn的分区Pi n实施运动补偿预测处理,生成帧间预测图像Pi n(步骤ST5)。Upon receiving the partition Pin of the coding block Bn from the switching switch 3, the motion-compensated prediction unit 5 uses one or more reference images stored in the motion-compensated prediction frame memory 12 and, based on the inter-frame prediction parameters output from the encoding control unit 1, performs motion-compensated prediction processing on the partition Pin of the coding block Bn , thereby generating an inter -frame prediction image Pin (step ST5).
而且,通过实施运动补偿预测处理来生成预测图像的技术因为是公知的技术所以省略详细的说明。Furthermore, the technology for generating a predicted image by performing motion compensation prediction processing is a well-known technology, and therefore detailed description thereof will be omitted.
在帧内预测部4或者运动补偿预测部5生成预测图像(帧内预测图像Pi n、帧间预测图像Pi n)时,减法部6通过从用块分割部2分割了的编码块Bn的分区Pi n中减去用帧内预测部4或者运动补偿预测部5生成的预测图像(帧内预测图像Pi n、帧间预测图像Pi n)来生成差分图像,将表示该差分图像的预测误差信号ei n输出到变换·量化部7(步骤ST6)。When the intra prediction unit 4 or the motion compensation prediction unit 5 generates a predicted image (intra prediction image Pin , inter prediction image Pin ), the subtraction unit 6 generates a difference image by subtracting the predicted image (intra prediction image Pin , inter prediction image Pin ) generated by the intra prediction unit 4 or the motion compensation prediction unit 5 from the partition Pin of the coding block Bn divided by the block division unit 2 , and outputs a prediction error signal ein representing the difference image to the transformation and quantization unit 7 (step ST6).
变换·量化部7在从减法部6接收表示差分图像的预测误差信号ei n时,以从编码控制部1输出的预测误差编码参数所包含的变换块尺寸为单位,实施该差分图像的变换处理(例如,DCT(离散余弦变换)、预先对特定的学习系列进行底层设计的KL变换等的正交变换处理),并且使用包含于该预测误差编码参数的量化参数,对该差分图像的变换系数进行量化,由此将该量化后的变换系数作为差分图像的压缩数据,输出到反量化·反变换部8以及可变长度编码部13(步骤ST7)。When the transform/quantization unit 7 receives the prediction error signal e i n representing the differential image from the subtraction unit 6, it performs transform processing (for example, DCT (discrete cosine transform), orthogonal transform processing such as KL transform that is pre-designed for a specific learning series) on the differential image in units of the transform block size included in the prediction error coding parameter output from the encoding control unit 1, and quantizes the transform coefficients of the differential image using the quantization parameter included in the prediction error coding parameter, thereby outputting the quantized transform coefficients as compressed data of the differential image to the inverse quantization/inverse transform unit 8 and the variable length coding unit 13 (step ST7).
反量化·反变换部8在从变换·量化部7接收到差分图像的压缩数据时,使用从编码控制部1输出的预测误差编码参数所包含的量化参数,对该差分图像的压缩数据进行反量化,以包含于该预测误差编码参数的变换块尺寸为单位,实施反量化的压缩数据的反变换处理(例如,反DCT(反离散余弦变换)、反KL变换等的反变换处理),由此将反变换处理后的压缩数据作为局部译码预测误差信号ei n帽(hat)(因电子申请的关系,附加在字母字符上的“^”记为帽),输出到加法部9(步骤ST8)。When receiving the compressed data of the differential image from the transform/quantization unit 7, the inverse quantization/inverse transformation unit 8 uses the quantization parameter included in the prediction error coding parameter output from the encoding control unit 1 to inverse quantize the compressed data of the differential image, and performs inverse transformation processing (for example, inverse transformation processing such as inverse DCT (inverse discrete cosine transform) and inverse KL transform) on the inverse quantized compressed data in units of the transform block size included in the prediction error coding parameter, thereby outputting the compressed data after the inverse transformation processing as a local decoding prediction error signal e in hat (due to electronic application, the "^" attached to the alphabetical character is represented as hat) to the addition unit 9 (step ST8).
加法部9在从反量化·反变换部8接收局部译码预测误差信号ei n帽时,通过将该局部译码预测误差信号ei n帽和表示用帧内预测部4或者运动补偿预测部5生成的预测图像(帧内预测图像Pi n、帧间预测图像Pi n)的预测信号相加,生成局部译码图像,该局部译码图像是局部译码分区图像Pi n帽或者作为其集合的局部译码编码块图像(步骤ST9)。Upon receiving the local decoding prediction error signal ein - hat from the inverse quantization/inverse transform unit 8, the adder 9 adds the local decoding prediction error signal ein- hat to a prediction signal representing a prediction image (intra-frame prediction image Pin , inter-frame prediction image Pin ) generated by the intra-frame prediction unit 4 or the motion compensation prediction unit 5 , thereby generating a local decoded image, which is the local decoding partition image Pin - hat or a local decoding coding block image as a collection thereof (step ST9).
加法部9在生成局部译码图像时,将表示该局部译码图像的局部译码图像信号保存于帧内预测用存储器10,并且将该局部译码图像信号输出到环路滤波器部11。When generating a local decoded image, the adder 9 stores a local decoded image signal representing the local decoded image in the intra prediction memory 10 , and outputs the local decoded image signal to the loop filter 11 .
步骤ST3~ST9的处理在对分层分割的全部的编码块Bn的处理结束之前重复实施,当对全部编码块Bn的处理结束时转移到步骤ST12的处理(步骤ST10,ST11)。The processing of steps ST3 to ST9 is repeatedly performed until the processing of all the hierarchically divided coding blocks Bn is completed. When the processing of all the coding blocks Bn is completed, the process shifts to step ST12 (steps ST10 and ST11).
可变长度编码部13对从变换·量化部7输出的压缩数据、从编码控制部1输出的编码模式(包含表示编码块的分割状态的信息)以及预测误差编码参数、从帧内预测部4输出的帧内预测参数或者从运动补偿预测部5输出的帧间预测参数进行熵编码。The variable length coding unit 13 performs entropy coding on the compressed data output from the transform/quantization unit 7, the coding mode (including information indicating the segmentation status of the coding block) and the prediction error coding parameters output from the coding control unit 1, the intra-frame prediction parameters output from the intra-frame prediction unit 4, or the inter-frame prediction parameters output from the motion compensation prediction unit 5.
可变长度编码部13复用作为熵编码的编码结果的压缩数据、编码模式、预测误差编码参数、帧内预测参数/帧间预测参数的编码数据,生成比特流(步骤ST12)。The variable length coding unit 13 multiplexes the compressed data as the encoding result of the entropy encoding, the encoding mode, the prediction error encoding parameter, and the encoded data of the intra prediction parameter/inter prediction parameter to generate a bit stream (step ST12).
环路滤波器部11在从加法器9接收局部译码图像信号时,补偿包含于该局部译码图像信号的编码失真,将编码失真补偿后的局部译码图像信号表示的局部译码图像作为参照图像而保存于运动补偿预测帧存储器12(步骤ST13)。When receiving the local decoded image signal from the adder 9, the loop filter unit 11 compensates for the coding distortion contained in the local decoded image signal, and stores the local decoded image represented by the local decoded image signal after coding distortion compensation as a reference image in the motion compensation prediction frame memory 12 (step ST13).
由环路滤波器部11进行的滤波处理既可以以从加法器9输出的局部译码图像信号的最大编码块或者各自的编码块为单位进行,也可以以汇集多个最大编码块了的单位来进行,或者在输出1图片量的局部译码图像信号后汇集1图片量来进行。The filtering processing performed by the loop filter unit 11 can be performed in units of the maximum coding block or individual coding blocks of the local decoded image signal output from the adder 9, or can be performed in units of a plurality of maximum coding blocks, or can be performed by aggregating one picture after outputting the local decoded image signal of one picture.
接着详细说明帧内预测部4的处理内容。Next, the processing contents of the intra prediction unit 4 will be described in detail.
图7是表示在编码块Bn内的各分区Pi n中可以选择的帧内预测参数(帧内预测模式)的一个例子的说明图。FIG7 is an explanatory diagram showing an example of intra-frame prediction parameters (intra-frame prediction modes) that can be selected in each partition Pin within the coding block Bn .
在图7的例子中表示帧内预测模式和该帧内预测模式表示的预测方向向量,设计成随着可以选择的帧内预测模式的个数的增加,预测方向向量彼此之间的相对角度减小。In the example of FIG. 7 , the intra prediction modes and the prediction direction vectors indicated by the intra prediction modes are shown, and the relative angles between the prediction direction vectors decrease as the number of selectable intra prediction modes increases.
帧内预测部4根据针对分区Pi n的帧内预测参数、在帧内预测图像Pi n的生成中使用的滤波器的选择参数,对分区Pi n实施帧内预测处理。The intra prediction unit 4 performs intra prediction processing on the partition Pin based on the intra prediction parameters for the partition Pin and the selection parameters of the filter used to generate the intra prediction image Pin .
以下,说明根据针对分区Pi n的亮度信号的帧内预测参数(帧内预测模式),生成亮度信号的帧内预测信号的帧内处理。The following describes the intra-frame processing of generating an intra-frame prediction signal of the luminance signal based on the intra-frame prediction parameters (intra-frame prediction mode) of the luminance signal for the partition Pin .
在此,假设分区Pi n的尺寸为li n×mi n像素。Here, it is assumed that the size of the partition Pin is lin × min pixels.
图8是表示在li n=mi n=4时,在生成分区Pi n内的像素预测值时使用的像素的一个例子的说明图。FIG8 is an explanatory diagram showing an example of pixels used when generating pixel prediction values within a partition Pin when lin = min = 4 .
在图8中,将与分区Pi n相邻的已编码的上分区的像素(2×li n+1)个和左分区的像素(2×mi n)个设为用于预测的像素,但在预测中使用的像素与图8所示的像素相比可多可少。In FIG8 , (2× lin +1) pixels of the coded upper partition and (2×min ) pixels of the left partition adjacent to partition Pin are used as pixels for prediction. However, the pixels used for prediction may be more or less than those shown in FIG8 .
另外,在图8中,虽然在预测中使用相邻的1行或者1列量的像素,但也可以在预测中使用2行或者2列,或者其以上的像素。In addition, although one adjacent row or column of pixels is used for prediction in FIG8 , two rows or two columns of pixels, or more, may be used for prediction.
当针对分区Pi n的帧内预测模式的索引值是2(平均值预测)的情况下,将上分区的相邻像素和左分区的相邻像素的平均值作为分区Pi n内的全部像素的预测值来生成中间预测图像。When the index value of the intra- frame prediction mode for partition Pin is 2 (average prediction), the average value of the adjacent pixels of the upper partition and the adjacent pixels of the left partition is used as the prediction value of all pixels in partition Pin to generate an intermediate prediction image.
当帧内预测模式索引值是2(平均值预测)以外的情况下,根据分区值表示的预测方向向量vp=(dx,dy)生成分区Pi n内的像素的预测值。When the intra prediction mode index value is other than 2 (average prediction), the predicted value of the pixel in the partition Pin is generated according to the prediction direction vector vp = (dx, dy) represented by the partition value.
其中,假设生成预测值的像素(预测对象像素)的分区Pi n内的相对坐标(以分区的左上像素为原点)为(x,y)。Here, it is assumed that the relative coordinates (with the upper left pixel of the partition as the origin) within the partition Pin of the pixel generating the predicted value (prediction target pixel) are (x, y).
在预测中使用的参照像素的位置变成在下述表示的A和相邻像素的交点。The position of the reference pixel used in the prediction becomes the intersection of A shown below and the adjacent pixel.
其中,k是负的标量值。Here, k is a negative scalar value.
当参照像素处于整数像素位置的情况下,将该整数像素设为预测对象像素的预测值。When the reference pixel is at an integer pixel position, the integer pixel is set as the predicted value of the prediction target pixel.
另一方面,当参照像素不处于整数像素位置的情况下,将根据与参照像素相邻的整数像素生成的插值像素设为预测值。On the other hand, when the reference pixel is not located at an integer pixel position, an interpolated pixel generated from integer pixels adjacent to the reference pixel is used as a predicted value.
在图8的例子中,因为参照像素不处于整数像素位置,所以根据与参照像素相邻的2个像素进行内插,计算预测值。但是,预测值并不限于相邻的2个像素,也可以根据相邻的2个以上的像素来生成插值像素并作为预测值。In the example of Figure 8, since the reference pixel is not at an integer pixel position, the predicted value is calculated by interpolation based on two pixels adjacent to the reference pixel. However, the predicted value is not limited to two adjacent pixels; an interpolated pixel can be generated based on two or more adjacent pixels and used as the predicted value.
而后,通过对用根据上述步骤生成的分区Pi n内的预测值构成的中间预测图像进行以后说明的滤波处理,取得最终的帧内预测图像Pi n,将该帧内预测图像Pi n输出到减法部6以及加法部9中。Then, by performing filtering processing (described later) on the intermediate prediction image formed by the prediction values within the partition Pin generated according to the above steps, a final intra-frame prediction image Pin is obtained, and the intra-frame prediction image Pin is output to the subtraction unit 6 and the addition unit 9.
另外,在帧内预测图像Pi n的生成中使用的帧内预测参数为了复用于比特流而输出到可变长度编码部13。In addition, the intra-frame prediction parameters used to generate the intra-frame prediction image P in are output to the variable-length coding unit 13 for multiplexing in the bit stream.
以下,说明具体的滤波处理。The following describes specific filtering processing.
从预先准备的至少一个以上的滤波器中,用以后说明的方法选择所使用的滤波器,对中间预测图像的各像素按照以下的式子(1)进行滤波处理。A filter to be used is selected from at least one filter prepared in advance by a method described later, and filtering processing is performed on each pixel of the intermediate prediction image according to the following equation (1).
在式(1)中,an(n=0,1,……,N)是包含参照像素涉及的系数(a0、a1、……、aN-1)和偏移系数aN的滤波器系数。In the formula (1), a n (n=0, 1, ..., N) is a filter coefficient including coefficients related to reference pixels (a 0 , a 1 , ..., a N-1 ) and an offset coefficient a N .
Pn(n=0,1,……,N-1)表示包含滤波处理对象p0的滤波器的参照像素。N是任意的参照像素数。P n (n=0, 1, ..., N-1) represents reference pixels of the filter including the filtering target p 0. N is an arbitrary number of reference pixels.
s(pn)表示各参照像素的亮度值,s帽(p0)表示在滤波处理对象像素p0中的滤波处理后的亮度值。s(p n ) represents the luminance value of each reference pixel, and s hat (p 0 ) represents the luminance value after filtering at the filtering target pixel p 0 .
但是,滤波器系数也可以设为没有偏移系数aN的系数来构成。另外,对于处在分区Pi n内的各参照像素的亮度值s(pn),既可以设为中间预测图像的各像素的亮度值,也可以设为只在滤波处理结束的像素位置进行滤波处理后的亮度值。However, the filter coefficients may be configured without the offset coefficient a N. Furthermore, the luminance value s(p n ) of each reference pixel within the partition P i n may be the luminance value of each pixel in the intermediate prediction image, or may be the luminance value after filtering is performed only at the pixel position where filtering is completed.
对于处于分区Pi n之外的各参照像素的亮度值s(pn),如果是已编码区域,则是编码后的亮度值(被译码的亮度值),如果尚且没有进行编码的区域,则从处于上述定义的分区Pi n内的各参照像素的亮度值s(pn)以及已编码区域的编码后的亮度值中按照规定的顺序选择代用的信号值(例如,在作为候补的像素中,选择位于最近的位置的信号值等)。For the luminance value s( pn ) of each reference pixel outside the partition P in a coded area, the coded luminance value (decoded luminance value) is used; if the area is not yet coded, a substitute signal value is selected in a prescribed order from the luminance values s( pn ) of each reference pixel within the partition P defined above and the coded luminance values of the coded area (for example, the signal value at the nearest position among the candidate pixels is selected).
图9是表示N=5时的参照像素配置的一个例子的说明图。FIG. 9 is an explanatory diagram showing an example of reference pixel arrangement when N=5.
在进行上述的滤波处理时,分区Pi n的尺寸(li n×mi n)越大,在输入图像中越容易存在非线性的边缘等,因为容易发生与中间预测图像的预测方向的偏差,所以优选对中间预测图像进行平滑处理。When performing the above - mentioned filtering process, the larger the size ( line × minute ) of the partition Pin , the more likely it is that nonlinear edges will exist in the input image, and since deviations from the prediction direction of the intermediate prediction image are likely to occur, it is preferable to smooth the intermediate prediction image.
进而,预测误差的量化值越大,在译码图像中产生的量化失真越大,因为根据与分区Pi n相邻的已编码的像素而生成的中间预测图像的预测精度降低,所以优先准备粗略地表现分区Pi n的那样的经过平滑处理的预测图像。Furthermore, the larger the quantization value of the prediction error, the greater the quantization distortion generated in the decoded image. Since the prediction accuracy of the intermediate prediction image generated based on the encoded pixels adjacent to the partition Pin is reduced, it is preferred to prepare a smoothed prediction image that roughly represents the partition Pin .
进而,即使是相同的分区Pi n内的像素,因为距离与在中间预测图像的生成中使用的分区Pi n相邻的已编码像素越远的像素,越容易在中间预测图像和输入图像之间发生边缘等的偏移,所以对预测图像进行平滑处理,抑制在发生偏移时的剧烈的预测误差的增加比较好。Furthermore, even for pixels within the same partition Pin , since the farther the pixel is from the encoded pixel adjacent to the partition Pin used in generating the intermediate prediction image, the more likely it is that an offset such as an edge will occur between the intermediate prediction image and the input image, it is better to smooth the prediction image to suppress the drastic increase in prediction error when an offset occurs.
另外,生成中间预测图像时的帧内预测被构成为,根据预测块内的预测值为全部相同的值的平均值预测和使用预测方向向量vp的预测这2种不同的方法之一的方法进行预测,进而即使在使用预测方向向量vp的预测的情况下,也根据将位于整数像素位置的参照像素的值直接作为预测值的像素、和至少2个像素以上的参照像素进行内插,生成不位于整数像素位置的像素,将该像素值作为预测值的像素的预测块内的配置因预测方向向量vp的方向而不同。因而,预测图像的性质因帧内预测模式而不同,最佳的滤波处理也不同,所以根据帧内预测模式的索引值来改变滤波器的强度、滤波器的参照像素数、参照像素配置等比较好。Furthermore, intra-frame prediction for generating an intermediate predicted image is configured to perform prediction using one of two different methods: average prediction, where all predicted values within a prediction block are the same, or prediction using a prediction direction vector vp . Furthermore, even when using prediction using the prediction direction vector vp , pixels not located at integer pixel positions are generated by interpolating between pixels whose predicted values are directly used as reference pixels at integer pixel positions and at least two reference pixels. The placement of pixels within the prediction block whose predicted values are used differs depending on the direction of the prediction direction vector vp . Therefore, the properties of the predicted image differ depending on the intra-frame prediction mode, and the optimal filtering process also differs. Therefore, it is preferable to change the filter strength, the number of reference pixels used by the filter, the reference pixel placement, and other factors according to the intra-frame prediction mode index value.
因而,在滤波器选择处理中被构成为考虑以下的4个参数(1)~(4)来选择滤波器。Therefore, the filter selection process is configured to select a filter in consideration of the following four parameters (1) to (4).
(1)分区Pi n的尺寸(li n×mi n)(1) Size of partition Pin (l in × min in )
(2)包含于预测误差编码参数的量化参数(2) Quantization parameters included in the prediction error coding parameters
(3)在中间预测图像的生成时使用的已编码的像素(图8所示的“在预测中使用的像素”)群和滤波处理对象像素的距离(3) The distance between the group of coded pixels used in generating the intermediate prediction image (“pixels used in prediction” shown in FIG8 ) and the pixel to be filtered
(4)生成中间预测图像时的帧内预测模式的索引值(4) Index value of the intra-frame prediction mode when generating the intermediate prediction image
具体地说被构成为:分区Pi n的尺寸(li n×mi n)越大,由量化参数决定的量化值越大,滤波处理对象像素和处于分区Pi n的左边以及上边的已编码的像素群的距离越远,越使用平滑处理强度强的滤波器或参照像素数多的滤波器。作为滤波处理对象像素和处于分区Pi n的左边以及上边的已编码像素群的距离的一个例子,列举出图11。另外,被构成为还利用帧内预测模式的索引值来切换滤波器的强度、滤波器的参照像素数、参照像素配置等。Specifically, the system is configured such that as the size (l in × m in ) of the partition Pin increases, the quantization value determined by the quantization parameter increases, and the distance between the target pixel to be filtered and the coded pixel groups to the left and above the partition Pin increases, a filter with a stronger smoothing strength or a filter with a larger number of reference pixels is used. Figure 11 is an example of the distance between the target pixel to be filtered and the coded pixel groups to the left and above the partition Pin. Furthermore, the system is configured to switch the filter strength, the number of reference pixels used in the filter, the reference pixel arrangement, and the like using the index value of the intra-frame prediction mode.
即,对于上述的参数组合的各个组合,通过从预先准备的滤波器群中进行适宜的滤波器的对应,实现与上述参数相应的滤波器的适应性选择。另外,例如,在组合参数(3)和(4)时,也可以根据参数(4)的“帧内预测模式”适宜地改变参数(3)的“与已编码像素群的距离”的定义。即,并不是如图11所示那样将与已编码像素群的距离的定义固定化,也可以设为距图8所示的“参照像素”的距离那样的依赖于预测方向的距离。通过这样处理,能够实现还考虑到参数(3)和(4)这样的多个参数关系的适应性的滤波处理。That is, for each combination of the above-mentioned parameter combinations, by matching an appropriate filter from a pre-prepared filter group, adaptive selection of a filter corresponding to the above-mentioned parameters is achieved. In addition, for example, when combining parameters (3) and (4), the definition of the "distance from the encoded pixel group" of parameter (3) can also be appropriately changed according to the "intra-frame prediction mode" of parameter (4). In other words, instead of fixing the definition of the distance from the encoded pixel group as shown in Figure 11, it can be set to a distance that depends on the prediction direction, such as the distance from the "reference pixel" shown in Figure 8. By processing in this way, adaptive filtering processing can be achieved that also takes into account the relationship between multiple parameters such as parameters (3) and (4).
另外,在这些参数的组合中,可以与“没有滤波处理”对应地准备没实施滤波处理的组合。进而,作为滤波器的强度的定义,也可以将最弱的滤波器定义为“没有滤波处理”。Furthermore, among these parameter combinations, a combination in which no filtering is performed may be prepared corresponding to “no filtering.” Furthermore, as a definition of filter strength, the weakest filter may be defined as “no filtering.”
另外,4个参数(1)~(4)因为即使在运动图像译码装置一侧中也是已知的参数,所以完全不产生为了进行上述的滤波处理所需要的应编码的附加信息。Furthermore, since the four parameters (1) to (4) are already known parameters even in the video decoding apparatus, no additional information to be encoded is generated in order to perform the above-mentioned filtering process.
而且,在上述的说明中,通过预先准备必要数量的滤波器来适应性选择,从而进行滤波器的切换,但也可以通过以根据上述滤波器选择参数的值计算滤波器的方式将滤波器定义为上述滤波器选择参数的函数,以实现滤波器的切换。Moreover, in the above description, the filter switching is performed by preparing the necessary number of filters in advance for adaptive selection, but the filter switching can also be achieved by defining the filter as a function of the above filter selection parameter in a manner of calculating the filter based on the value of the above filter selection parameter.
进而,在上述的说明中,表示了被构成为考虑4个参数(1)~(4)来选择滤波器的情况,但也可以构成为在4个参数(1)~(4)中,考虑至少1个以上的参数来选择滤波器。Furthermore, in the above description, a case is shown in which a filter is selected by considering four parameters (1) to (4), but a filter may be selected by considering at least one parameter among the four parameters (1) to (4).
以下,将使用参数(1)以及(4)的情况作为例子,表示通过对参数组合的各个组合从预先准备的滤波器群中使适宜的滤波对应起来从而进行滤波的适应性选择的滤波处理的构成例子。The following describes a configuration example of filtering processing in which appropriate filters are matched to each combination of parameter combinations from a pre-prepared filter group to perform adaptive selection of filters, using the case of using parameters (1) and (4) as an example.
将在上述滤波处理例子中使用的滤波器如以下那样定义。The filter used in the above-mentioned filtering process example is defined as follows.
滤波器索引1的滤波器(参照像素数N=3)Filter with filter index 1 (reference pixel number N=3)
a0=3/4,a1=1/8,a2=1/8a 0 = 3/4, a 1 = 1/8, a 2 = 1/8
滤波器索引2的滤波器(参照像素数N=3)Filter with filter index 2 (reference pixel number N=3)
a0=1/2,a1=1/4,a2=1/4a 0 =1/2, a 1 =1/4, a 2 =1/4
滤波器索引3的滤波器(参照像素数N=3)Filter with filter index 3 (reference pixel number N=3)
a0=1/4,a1=3/8,a2=3/8a 0 =1/4, a 1 =3/8, a 2 =3/8
滤波器索引4的滤波器(参照像素数N=5)Filter index 4 (reference pixel number N = 5)
a0=1/4,a1=3/16,a2=3/16,a3=3/16,a4=3/16a 0 =1/4, a 1 =3/16, a 2 =3/16, a 3 =3/16, a 4 =3/16
但是,假设滤波处理是设为没有偏移系数aN(aN=0)时的基于式(1)的滤波处理,假设此次使用的滤波器的种类是3种,滤波器的参照像素配置如图12所示。However, assuming that the filtering process is based on equation (1) with no offset coefficient a N (a N = 0), and assuming that three types of filters are used this time, the reference pixel arrangement of the filters is as shown in FIG. 12 .
接着,图13是示出表示在分区Pi n的每个尺寸的各帧内预测模式中使用的滤波器的表的一个例子的说明图。其中,假设可以取得的Pi n的尺寸是4×4像素、8×8像素、16×16像素、32×32像素、64×64像素,帧内预测模式的索引值和帧内预测方向的对应如图7所示。Next, FIG13 is an explanatory diagram showing an example of a table indicating filters used in each intra prediction mode for each size of a partition Pin . Assuming that the available Pin sizes are 4×4 pixels, 8×8 pixels, 16×16 pixels, 32×32 pixels, and 64×64 pixels, the correspondence between the intra prediction mode index value and the intra prediction direction is as shown in FIG7 .
另外,滤波器索引0表示不进行滤波处理。一般当使用方向性预测、平均值预测的情况下,因为存在以下的趋势,所以如图13所示的表那样,考虑帧内预测中的图像的特性,通过对参数(1)以及(4)的每个组合,在表中将使用哪个滤波器对应起来,从而能够通过参照表来实现适宜的滤波器切换。In addition, filter index 0 indicates that no filtering is performed. Generally, when directional prediction or average value prediction is used, because the following trend exists, as shown in the table in FIG13 , by considering the characteristics of the image in intra-frame prediction, the filter to be used is mapped in the table for each combination of parameters (1) and (4), thereby enabling appropriate filter switching to be achieved by referring to the table.
·一般在建筑物等人造物体中多数存在的水平·垂直方向的边缘的线性清晰的情况较多,所以多数情况可以通过水平·垂直方向预测进行高精度的预测。因而在水平·垂直方向预测时不进行平滑处理比较好。Horizontal and vertical edges are often found in artificial objects such as buildings, and they often have clear, linear edges. Therefore, horizontal and vertical prediction can often achieve high-precision prediction. Therefore, it is best not to perform smoothing during horizontal and vertical prediction.
·一般因为图像信号是在空间方向连续性高的信号,所以在使用与和分区Pi n相邻的已编码像素的连续性受损的平均值预测时,对分区Pi n的左边以及上边的块边界部周边的像素进行平滑处理来提高连续性比较好。Generally, because image signals have high spatial continuity, when using average value prediction that impairs the continuity of coded pixels adjacent to the partition Pin , it is preferable to improve continuity by smoothing the pixels around the block boundaries to the left and above the partition Pin .
·具有倾斜方向的方向性的区域因为面积越大,边缘等发生失真而具有非线性形状的情况越多,所以在使用倾斜方向预测时分区尺寸越大,越施加平滑强度强、参照像素数多的滤波器比较好。·The larger the area of a region with oblique directionality, the more likely it is that edges will be distorted and have nonlinear shapes. Therefore, when using oblique direction prediction, it is better to apply a larger partition size, a stronger smoothing filter, and a larger number of reference pixels.
·一般如果分区尺寸过大,则分区内的信号值的空间变化变得多样,在方向性预测、平均值预测中只能进行非常粗略的预测,难以高精度的预测的区域增加。在那样的区域中,因为进行平滑处理仅是使图像模糊而无法期待预测效率的改善效果,所以在那样的分区尺寸中,因为在不实施滤波处理的一方不增加不必要的计算量即可,所以是理想的(例如,在图13中,在32×32像素以上的分区尺寸中设定成不进行滤波处理)。Generally, if the partition size is too large, the spatial variation of signal values within the partition becomes diverse, resulting in only very coarse predictions for directional and average predictions, and increasing the number of areas where high-precision prediction is difficult. In such areas, smoothing simply blurs the image and cannot be expected to improve prediction efficiency. Therefore, it is ideal to omit filtering at such a partition size because this avoids unnecessary computational overhead (for example, in Figure 13, filtering is not performed for partition sizes larger than 32×32 pixels).
进而,当在滤波处理时的参照像素是分区Pi n内的像素时将中间预测图像的亮度值设为参照像素的亮度值来使用的情况下,有时可以简化滤波处理。例如,当帧内预测模式是平均值预测的情况下,针对分区Pi n的滤波处理对于图14所示的每个区域能够简化成以下那样的滤波处理。Furthermore, when the reference pixel in the filtering process is a pixel within the partition Pin , the luminance value of the intermediate prediction image is used as the luminance value of the reference pixel, which can sometimes simplify the filtering process. For example, when the intra prediction mode is average value prediction, the filtering process for the partition Pin can be simplified to the following filtering process for each region shown in FIG. 14 .
·区域A(分区Pi n的左上的像素)Region A (the upper left pixel of partition Pin )
滤波器索引1的滤波器(没有改变)Filter with filter index 1 (no change)
a0=3/4,a1=1/8,a2=1/8(参照像素数N=3)a 0 = 3/4, a 1 = 1/8, a 2 = 1/8 (reference pixel number N = 3)
滤波器索引2的滤波器(没有改变)Filter with filter index 2 (no change)
a0=1/2,a1=1/4,a2=1/4(参照像素数N=3)a 0 = 1/2, a 1 = 1/4, a 2 = 1/4 (reference pixel number N = 3)
滤波器索引3的滤波器(没有改变)Filter with filter index 3 (no change)
a0=1/4,a1=3/8,a2=3/8(参照像素数N=3)a 0 = 1/4, a 1 = 3/8, a 2 = 3/8 (refer to the number of pixels N = 3)
滤波器索引4的滤波器Filter with filter index 4
a0=5/8,a1=3/16,a2=3/16(参照像素数N=3)a 0 = 5/8, a 1 = 3/16, a 2 = 3/16 (referring to the number of pixels N = 3)
·区域B(区域A以外的分区Pi n的上端的像素)Region B (the pixels at the top of the partition Pin outside of region A)
滤波器索引1的滤波器Filter with filter index 1
a0=7/8,a2=1/8(参照像素数N=2)a 0 = 7/8, a 2 = 1/8 (reference pixel number N = 2)
滤波器索引2的滤波器Filter with filter index 2
a0=3/4,a2=1/4(参照像素数N=2)a 0 = 3/4, a 2 = 1/4 (reference pixel number N = 2)
滤波器索引3的滤波器(没有改变)Filter with filter index 3 (no change)
a0=5/8,a2=3/8(参照像素数N=2)a 0 = 5/8, a 2 = 3/8 (reference pixel number N = 2)
滤波器索引4的滤波器Filter with filter index 4
a0=13/16,a2=3/16(参照像素数N=2)a 0 = 13/16, a 2 = 3/16 (refer to the number of pixels N = 2)
·区域C(区域A以外的分区Pi n的左端的像素)Region C (the pixels at the left end of the partition Pin outside region A)
滤波器索引1的滤波器Filter with filter index 1
a0=7/8,a1=1/8(参照像素数N=2)a 0 = 7/8, a 1 = 1/8 (reference pixel number N = 2)
滤波器索引2的滤波器Filter with filter index 2
a0=3/4,a1=1/4(参照像素数N=2)a 0 = 3/4, a 1 = 1/4 (reference pixel number N = 2)
滤波器索引3的滤波器Filter with filter index 3
a0=5/8,a1=3/8(参照像素数N=2)a 0 = 5/8, a 1 = 3/8 (reference pixel number N = 2)
滤波器索引4的滤波器Filter with filter index 4
a0=13/16,a1=3/16(参照像素数N=2)a 0 = 13/16, a 1 = 3/16 (refer to the number of pixels N = 2)
·区域D(区域A、B、C以外的分区Pi n的像素)Region D (pixels in the partition Pin outside regions A, B, and C)
全部的滤波器索引的滤波器:All filter indexed filters:
没有滤波处理No filtering
即使如上述那样简化滤波处理,滤波处理结果也和简化前相同。Even if the filtering process is simplified as described above, the filtering process result is the same as before the simplification.
通过这样除去实际处理的冗余部分,能够谋求滤波处理的高速化。By eliminating redundant portions of actual processing in this manner, it is possible to increase the speed of filtering processing.
在上述例子中使用了图13的表,但也可以使用其他的表。例如,也可以考虑当与编码性能的改善效果的大小相比重视抑制处理量的情况下,代替图13的表而使用图19的表。在该表的情况下,因为只对分区Pi n的尺寸是4×4像素、8×8像素、16×16像素的平均值预测进行滤波处理,所以与使用图13的表的情况相比进行滤波处理的预测模式少,能够抑制伴随滤波处理的计算量的增加。此时,通过还利用上述的帧内预测模式是平均值预测时的滤波处理的简化,能够实现处理负荷非常低的滤波处理。While the table in Figure 13 was used in the above example, other tables may be used. For example, when reducing processing load is more important than improving coding performance, the table in Figure 19 may be used instead of the table in Figure 13. With this table, filtering is performed only for average value predictions of partitions Pin sizes of 4×4 pixels, 8×8 pixels, and 16×16 pixels. Therefore, fewer prediction modes are filtered than when the table in Figure 13 is used, thus suppressing the increase in computational complexity associated with filtering. In this case, by also utilizing the simplified filtering process when the intra-frame prediction mode is average value prediction, filtering with a significantly lower processing load can be achieved.
进而,当还重视实施的容易性的情况下,和上述的滤波处理一样假设只在平均值预测的情况下进行滤波处理,不根据分区Pi n的尺寸来切换进一步使用的滤波器,可以始终使用同样的滤波器(例如,滤波器索引2的滤波器)。这种情况下,虽然基于滤波器的编码性能的改善效果降低了与不进行与分区Pi n的尺寸相应的处理相应的量,但能够抑制实施时的电路规模(在软件的情况下是代码的行数)。本滤波处理是只考虑4个参数(1)~(4)中的参数(4)的滤波器。Furthermore, when ease of implementation is also important, as with the above-mentioned filtering process, it is assumed that filtering is performed only in the case of average value prediction, and the filter to be used further is not switched according to the size of the partition Pin , and the same filter (for example, the filter with filter index 2) can be always used. In this case, although the improvement effect of the coding performance based on the filter is reduced by the amount corresponding to not performing the processing corresponding to the size of the partition Pin , the circuit scale (the number of lines of code in the case of software) during implementation can be suppressed. This filtering process is a filter that considers only parameter (4) among the four parameters (1) to (4).
作为滤波处理的实施方式,即使不以通过参照表来选择对应的滤波器索引的滤波器的形式来实施,也可以是直接实施对分区Pi n的每个尺寸执行的滤波处理、或者直接实施对分区Pi n的各尺寸的每个像素位置执行的滤波处理这种直接实施滤波器的形式。这样,即使不是参照表的形式,只要是进行滤波处理的结果所得到的预测图像是等价的,则实施方式是何种形式都没有关系。As an implementation method for the filtering process, even if it is not implemented in a form in which a filter index corresponding to the filter is selected using a reference table, a form in which the filtering process is directly performed on each size of the partition Pin , or a form in which the filtering process is directly performed on each pixel position of each size of the partition Pin can be used. In this way, even if it is not implemented in a reference table, as long as the predicted image obtained as a result of the filtering process is equivalent, the implementation method is not relevant.
另外,在前面说明的例子中,说明了只用一个用于切换滤波器的表的方法,但也可以构成为准备多个上述表,通过用图15~图18中的某一个形式来将滤波器选择表索引100作为标题信息进行编码,从而可以用规定的单位来切换滤波器选择表。In addition, in the example described above, a method of using only one table for switching filters was described, but it is also possible to prepare multiple such tables and encode the filter selection table index 100 as header information in one of the formats shown in Figures 15 to 18, so that the filter selection table can be switched using a specified unit.
例如,如图15所示,通过对序列级标题加上滤波器选择表索引100,与只使用单一的表的情况相比,还可以进行与序列的特性相应的滤波处理。For example, as shown in FIG15 , by adding a filter selection table index 100 to the sequence level header, it is possible to perform filtering processing that is appropriate to the characteristics of the sequence, compared to the case where only a single table is used.
而且,与在前面说明的MPEG-4、AVC/H.264中的8×8像素的块的帧内预测时对参照图像实施的平滑处理一样,即使是构成为将在帧内预测部4中生成分区Pi n的中间预测图像时的参照像素作为对与分区Pi n相邻的已编码像素进行了平滑处理的像素的情况下,也能够进行和上述例子一样的对中间预测图像的滤波处理。Moreover, similar to the smoothing process performed on the reference image during the intra-frame prediction of the 8×8 pixel block in MPEG-4 and AVC/H.264 described above, even in a case where the reference pixels when generating the intermediate prediction image of the partition Pin in the intra-frame prediction unit 4 are used as pixels on which the coded pixels adjacent to the partition Pin are smoothed, the same filtering process as in the above example can be performed on the intermediate prediction image.
另一方面,因为基于在生成中间预测图像时对参照像素的平滑处理和对中间预测图像的滤波处理的效果有重复的部分,所以有时即使同时使用两个处理,与进行其中一方的处理的情况相比也只能得到些许的性能改善效果。因而,可以构成为:当重视抑制计算量的情况下,对于对在生成中间预测图像时的参照像素进行平滑处理的分区Pi n,不进行针对中间预测图像的滤波处理。例如,考虑这样的情况:对中间预测图像的滤波处理如图19的表那样只在平均值预测中进行滤波处理,针对在生成中间预测图像的参照像素的平滑处理参照只对图20那样的特定的方向性预测进行平滑处理的表来进行。其中,在图20中,‘1’表示进行平滑处理,‘0’表示不进行平滑处理。On the other hand, because the effects of smoothing and filtering the reference pixels when generating the intermediate prediction images overlap, sometimes using both processes simultaneously only yields a slight performance improvement compared to performing either process alone. Therefore, when prioritizing computational complexity, a configuration can be employed where filtering is not performed on the intermediate prediction images for the partition Pin where smoothing is performed on the reference pixels when generating the intermediate prediction images. For example, consider a case where filtering is performed only for average value prediction, as shown in the table in FIG19 , while smoothing is performed only for specific directional predictions, as shown in FIG20 . In FIG20 , '1' indicates smoothing is performed, and '0' indicates not smoothing.
在帧内预测图像Pi n的生成中使用的帧内预测参数为了复用于比特流中而输出到可变长度编码部13。The intra-frame prediction parameters used to generate the intra-frame prediction image P in are output to the variable-length coding unit 13 in order to be multiplexed in the bit stream.
对分区Pi n内的色差信号也按照和亮度信号同样的顺序,实施基于帧内预测参数(帧内预测模式)的帧内预测处理,将在帧内预测图像的生成中使用的帧内预测参数输出到可变长度编码部13。The color difference signal within the partition Pin is also subjected to intra-frame prediction processing based on the intra-frame prediction parameters (intra-frame prediction mode) in the same order as the luminance signal, and the intra-frame prediction parameters used in generating the intra-frame prediction image are output to the variable length coding unit 13.
但是,关于色差信号的帧内预测既可以构成为与亮度信号同样地进行在上述说明的滤波处理,也可以不进行在上述说明的滤波处理。However, the intra-frame prediction of the chrominance signal may be configured to perform the filtering process described above in the same manner as the luminance signal, or may not perform the filtering process described above.
接着,说明图2的运动图像译码装置的处理内容。Next, the processing contents of the moving picture decoding apparatus in FIG2 will be described.
可变长度译码部51在输入从图1的图像编码装置输出的比特流时,对该比特流实施可变长度译码处理,以具有1帧以上的图片的序列为单位或者以图片为单位译码帧尺寸的信息(图4的步骤ST21)。When the variable length decoding unit 51 receives the bit stream output from the image encoding apparatus of FIG. 1 , it performs variable length decoding on the bit stream and decodes frame size information in units of a sequence of pictures having one or more frames or in units of pictures (step ST21 of FIG. 4 ).
可变长度译码部51按照和图1的编码控制部1一样的步骤决定在实施帧内预测处理(帧内预测处理)或者运动补偿预测处理(帧间预测处理)时的成为处理单位的编码块的最大尺寸,并且决定分层分割最大尺寸的编码块时的上限层数(步骤ST22)。The variable length decoding unit 51 determines the maximum size of the coding block that serves as the processing unit when performing intra-frame prediction processing (intra-frame prediction processing) or motion compensation prediction processing (inter-frame prediction processing) in the same steps as the coding control unit 1 in Figure 1, and determines the upper limit number of layers when hierarchically dividing the coding block of the maximum size (step ST22).
例如,在图像编码装置中,当编码块的最大尺寸根据输入图像的分辨率被决定的情况下,根据之前译码的帧尺寸信息来决定编码块的最大尺寸。For example, in an image encoding apparatus, when the maximum size of a coding block is determined according to the resolution of an input image, the maximum size of the coding block is determined according to previously decoded frame size information.
而且,当在比特流中复用了表示编码块的最大尺寸以及上限层数的信息的情况下,参照根据该比特流译码的信息。Furthermore, when information indicating the maximum size of a coding block and the upper limit number of layers is multiplexed in a bitstream, the information decoded from the bitstream is referenced.
在被复用于比特流的最大尺寸的编码块B0的编码模式m(B0)中,包含表示最大尺寸的编码块B0的分割状态的信息,所以可变长度译码部51译码被复用于比特流的最大尺寸的编码块B0的编码模式m(B0),确定分层分割的各个编码块Bn(步骤ST23)。The coding mode m(B 0 ) of the maximum-sized coding block B 0 multiplexed in the bitstream includes information indicating the division state of the maximum-sized coding block B 0. Therefore, the variable-length decoding unit 51 decodes the coding mode m(B 0 ) of the maximum-sized coding block B 0 multiplexed in the bitstream and determines each hierarchically divided coding block B n (step ST23).
可变长度译码部51当确定各个编码块Bn时,译码该编码块Bn的编码模式m(Bn),根据属于该编码模式m(Bn)的分区Pi n信息,确定属于编码块Bn的分区Pi n。When determining each coding block Bn , the variable length decoding unit 51 decodes the coding mode m( Bn ) of the coding block Bn , and determines the partition Pin belonging to the coding block Bn based on the partition Pin information belonging to the coding mode m ( Bn ) .
可变长度译码部51当确定属于编码块Bn的分区Pi n时,针对每个分区Pi n,译码压缩数据、编码模式、预测误差编码参数、帧内预测参数/帧间预测参数(步骤ST24)。When the partition P in belonging to the coding block B n is determined, the variable length decoding unit 51 decodes the compressed data, the coding mode, the prediction error coding parameters, and the intra prediction parameters/inter prediction parameters for each partition P in (step ST24).
即,当分配给编码块Bn的编码模式m(Bn)是帧内编码模式的情况下,对属于编码块的每个分区Pi n译码帧内预测参数。That is, when the coding mode m(B n ) assigned to the coding block B n is the intra coding mode, intra prediction parameters are decoded for each partition P i n belonging to the coding block.
当分配给编码块Bn的编码模式m(Bn)是帧间编码模式的情况下,对属于编码块的每个分区Pi n译码帧间预测参数。When the coding mode m(B n ) assigned to the coding block B n is the inter coding mode, inter prediction parameters are decoded for each partition P i n belonging to the coding block.
成为预测处理单位的分区进一步根据预测误差编码参数所包含的变换块尺寸信息,被分割成成为变换处理单位的1个或多个分区,对成为变换处理单位的每个分区译码压缩数据(变换·量化后的变换系数)。The partition serving as the prediction processing unit is further divided into one or more partitions serving as the transform processing unit based on the transform block size information included in the prediction error coding parameter, and compressed data (transformed and quantized transform coefficients) is decoded for each partition serving as the transform processing unit.
在来自可变长度译码部51的属于编码块Bn的分区Pi n的编码模式m(Bn)是帧内编码模式的情况下(步骤ST25),切换开关52将从可变长度译码部51输出的帧内预测参数输出到帧内预测部53。When the coding mode m( Bn ) of the partition P in the coding block Bn from the variable length decoding unit 51 is the intra coding mode (step ST25), the switch 52 outputs the intra prediction parameters output from the variable length decoding unit 51 to the intra prediction unit 53.
另一方面,当分区Pi n的编码模式m(Bn)是帧间编码模式的情况下(步骤ST25),将从可变长度译码部51输出的帧间预测参数输出到运动补偿预测部54。On the other hand, when the coding mode m(B n ) of the partition Pin is the inter coding mode (step ST25 ), the inter prediction parameters output from the variable length decoding unit 51 are output to the motion compensation prediction unit 54 .
帧内预测部53在从切换开关52接收到帧内预测参数时,和图1的帧内预测部4一样,使用帧内的已译码的图像信号,根据该帧内预测参数,对编码块Bn的分区Pi n实施帧内预测处理,生成帧内预测图像Pi n(步骤ST26)。Upon receiving the intra-frame prediction parameters from the switch 52, the intra-frame prediction unit 53, like the intra-frame prediction unit 4 in FIG1 , uses the decoded image signal within the frame and, based on the intra-frame prediction parameters, performs intra-frame prediction processing on the partition P in the coding block Bn to generate an intra- frame predicted image P (step ST26).
但是,帧内预测部53在生成上述帧内预测图像Pi n后,用和图1的帧内预测部4一样的方法,从预先准备的1个以上的滤波器中,根据在生成上述帧内预测图像Pi n的时刻点已知的各种参数的状态来选择滤波器,使用该滤波器对该帧内预测图像Pi n实施滤波处理,将滤波处理后的帧内预测图像Pi n作为最终的帧内预测图像。However, after generating the intra-frame prediction image P in the same manner as the intra-frame prediction unit 4 in FIG. 1 , the intra-frame prediction unit 53 selects a filter from one or more pre-prepared filters based on the states of various parameters known at the time of generating the intra- frame prediction image P in the same manner as the intra- frame prediction unit 4 in FIG. 1 , and performs filtering processing on the intra-frame prediction image P in using the filter, and uses the filtered intra-frame prediction image P in as the final intra-frame prediction image.
即,用和在帧内预测部4中的滤波器选择中使用的参数相同的参数,用和在帧内预测部4中的滤波器选择方法一样的方法选择滤波器来进行滤波处理。That is, the filtering process is performed by selecting a filter using the same parameters as those used for filter selection in the intra prediction unit 4 and by using the same method as the filter selection method in the intra prediction unit 4 .
例如,帧内预测部4被构成为当把不进行滤波处理的情况与滤波器索引0对应起来,将预先准备的4个滤波器分别与滤波器索引1至4对应起来,参照图13的表进行滤波处理的情况下,帧内预测部53也定义和帧内预测部4一样的滤波器和滤波器索引,通过参照图13的表根据分区Pi n的尺寸以及作为帧内预测参数的帧内预测模式的索引进行滤波器选择,进行滤波处理。For example, the intra-frame prediction unit 4 is configured to correspond to the filter index 0 when no filtering processing is performed, and the four pre-prepared filters are respectively corresponded to the filter indexes 1 to 4. When filtering processing is performed with reference to the table in Figure 13, the intra-frame prediction unit 53 also defines the same filters and filter indexes as the intra-frame prediction unit 4, and performs filtering processing by selecting a filter based on the size of the partition Pin and the index of the intra -frame prediction mode as the intra-frame prediction parameter with reference to the table in Figure 13.
另外,如上述的例子那样,还可以构成为:在准备定义以参数的组合为单位使用的滤波器的表,通过参照该表来实现滤波器的切换的情况下,按照图15~图18中的某一个的形式将滤波器选择表索引100作为标题信息进行译码,从和预先准备的运动图像编码装置相同的表群中选择被译码了的滤波器选择表索引100表示的表,参照该表进行滤波器的选择。In addition, as in the example described above, it is also possible to configure such that: when a table defining filters used in units of parameter combinations is prepared and filters are switched by referring to the table, the filter selection table index 100 is decoded as header information in the form of one of Figures 15 to 18, and the table represented by the decoded filter selection table index 100 is selected from the same table group as that of the pre-prepared motion picture encoding device, and the filter is selected by referring to the table.
运动补偿预测部54在从切换开关52接收到帧间预测参数时,使用由运动补偿预测帧存储器59所保存的1帧以上的参照图像,根据该帧间预测参数,通过对编码块Bn的分区Pi n实施运动补偿预测处理,来生成帧间预测图像Pi n(步骤ST27)。Upon receiving the inter-frame prediction parameters from the switching switch 52, the motion-compensated prediction unit 54 uses one or more reference frames stored in the motion-compensated prediction frame memory 59 to perform motion-compensated prediction processing on the partition P in the coding block B according to the inter- frame prediction parameters, thereby generating an inter- frame prediction image P (step ST27).
反量化·反变换部55使用从可变长度译码部51输出的预测误差编码参数所包含的量化参数,对从可变长度译码部51输出的编码块涉及的压缩数据进行反量化,以该预测误差编码参数所包含的变换块尺寸为单位,通过实施反量化的压缩数据的反变换处理(例如,反DCT(反离散余弦变换)、反KL变换等的反变换处理),将反变换处理后的压缩数据作为译码预测误差信号(表示压缩前的差分图像的信号)输出到加法部56(步骤ST28)。The inverse quantization and inverse transformation unit 55 uses the quantization parameters included in the prediction error coding parameters output from the variable length decoding unit 51 to inverse quantize the compressed data related to the coding block output from the variable length decoding unit 51, and uses the transformation block size included in the prediction error coding parameters as a unit to perform inverse transformation processing (for example, inverse transformation processing such as inverse DCT (inverse discrete cosine transform) and inverse KL transform) on the inverse quantized compressed data, and outputs the compressed data after the inverse transformation processing as a decoded prediction error signal (a signal representing the differential image before compression) to the addition unit 56 (step ST28).
加法部56在从反量化·反变换部55接收到译码预测误差信号时,通过将该译码预测误差信号和表示由帧内预测部53或者运动补偿预测部54生成的预测图像的预测信号相加而生成译码图像,将表示该译码图像的译码图像信号保存于帧内预测用存储器57,并且将该译码图像信号输出到环路滤波器部58(步骤ST29)。When the addition unit 56 receives the decoded prediction error signal from the inverse quantization/inverse transformation unit 55, it generates a decoded image by adding the decoded prediction error signal and a prediction signal representing a predicted image generated by the intra-frame prediction unit 53 or the motion compensation prediction unit 54, stores the decoded image signal representing the decoded image in the intra-frame prediction memory 57, and outputs the decoded image signal to the loop filter unit 58 (step ST29).
步骤ST23~ST29的处理直到针对分层分割的全部的编码块Bn的处理结束之前被重复实施(步骤ST30)。The processes of steps ST23 to ST29 are repeatedly performed until the processes on all the hierarchically divided coding blocks Bn are completed (step ST30).
环路滤波器部58在从加法器56接收到译码图像信号时,补偿包含于该译码图像信号的编码失真,将编码失真补偿后的译码图像信号表示的译码图像作为参照图像而保存于运动补偿预测帧存储器59(步骤ST31)。When receiving the decoded image signal from the adder 56, the loop filter unit 58 compensates for the coding distortion contained in the decoded image signal, and stores the decoded image represented by the decoded image signal after coding distortion compensation as a reference image in the motion compensation prediction frame memory 59 (step ST31).
由环路滤波器部58进行的滤波处理既可以以从加法器56输出的译码图像信号的最大编码块或者各自的编码块为单位来进行,也可以在输出了与1画面量的宏块相当的译码图像信号后汇集1画面量来进行。The filtering process performed by the loop filter unit 58 can be performed in units of the largest coding block or each coding block of the decoded image signal output from the adder 56, or can be performed by aggregating the decoded image signals of one picture after outputting the decoded image signals of macroblocks equivalent to one picture.
从以上可知,根据本实施方式1,由于被构成为运动图像编码装置的帧内预测部4使用帧内的已编码的图像信号,在通过实施帧内预测处理来生成帧内预测图像时,从预先准备的1个以上的滤波器中,根据滤波处理对象块的编码所涉及的各种参数的状态来选择滤波器,使用该滤波器对预测图像实施滤波处理,所以起到能够降低在局部性地发生的预测误差、提高图像品质的效果。As can be seen from the above, according to the present embodiment 1, since the intra-frame prediction unit 4 constructed as a motion image encoding device uses the encoded image signal within the frame, when generating the intra-frame predicted image by performing intra-frame prediction processing, a filter is selected from one or more pre-prepared filters according to the states of various parameters involved in the encoding of the filtering object block, and the predicted image is filtered using the filter, thereby achieving the effect of reducing the prediction error that occurs locally and improving the image quality.
另外,根据本实施方式1,因为构成为帧内预测部4考虑(1)~(4)参数中的至少1个以上的参数来选择滤波器,所以可以得到抑制在进行方向性预测时编码对象图像的边缘些许地非线性地失真或者因角度偏差而产生的局部性预测误差、和在进行平均值预测时与相邻的已编码的信号的连续性丢失而产生的块的边界部分的预测误差的效果,起到可以改善预测效率的效果。所述(1)是分区Pi n的尺寸(li n×mi n)、(2)是包含于预测误差编码参数的量化参数、(3)是在中间预测图像的生成时使用的已编码像素群和滤波处理对象像素的距离、(4)是生成中间预测图像时的帧内预测模式的索引值。Furthermore, according to the first embodiment, since the intra-frame prediction unit 4 is configured to select a filter by taking into account at least one of the parameters (1) to (4), it is possible to suppress a local prediction error caused by slight nonlinear distortion of the edge of the encoding target image or angular deviation when performing directional prediction, and a prediction error at the boundary of a block caused by loss of continuity with adjacent encoded signals when performing average value prediction, thereby achieving an effect of improving prediction efficiency. The parameters (1) are the size ( lin × min ) of the partition Pin , (2) are the quantization parameters included in the prediction error coding parameters, (3) are the distances between the encoded pixel group used in generating the intermediate prediction image and the filtering target pixel, and (4) are the index values of the intra-frame prediction mode used in generating the intermediate prediction image.
根据本实施方式1,由于构成为运动图像译码装置的帧内预测部53使用帧内的已译码的图像信号,在通过实施帧内预测处理来生成帧内预测图像时,从预先准备的1个以上的滤波器中,根据滤波处理对象块的译码所涉及的各种参数的状态来选择滤波器,使用该滤波器对预测图像实施滤波处理,所以起到能够降低局部性地发生的预测误差,在运动图像译码装置一侧也生成和在运动图像编码装置一侧生成的帧内预测图像一样的帧内预测图像的效果。According to the first embodiment, since the intra-frame prediction unit 53 configured as a motion picture decoding device uses the decoded image signal within the frame, when generating an intra-frame predicted image by performing intra-frame prediction processing, a filter is selected from one or more pre-prepared filters based on the states of various parameters involved in the decoding of the filtering target block, and filtering processing is performed on the predicted image using the filter. This has the effect of reducing the prediction error that occurs locally, and generating an intra-frame predicted image on the motion picture decoding device side that is the same as the intra-frame predicted image generated on the motion picture encoding device side.
另外,根据本实施方式1,因为构成为帧内预测部53考虑(1)~(4)参数中的至少1个以上的参数来选择滤波器,所以可以得到抑制在进行方向性预测时编码对象图像的边缘些许地非线性地失真或者因角度偏差而产生的局部性预测误差、和在进行平均值预测时与相邻的已编码的信号的连续性丢失而产生的块的边界部分的预测误差的效果,起到在运动图像译码装置一侧也生成和在运动图像编码装置一侧生成的帧内预测图像一样的帧内预测图像的效果。所述(1)是分区Pi n的尺寸(li n×mi n)、(2)是包含于预测误差编码参数的量化参数、(3)是在中间预测图像的生成时使用的已编码像素群和滤波处理对象像素的距离、(4)是生成中间预测图像时的帧内预测模式的索引值。Furthermore, according to the first embodiment, since the intra prediction unit 53 is configured to select a filter by taking into account at least one of the parameters (1) to (4), it is possible to suppress a local prediction error caused by slight nonlinear distortion of the edge of the image to be coded or by angular deviation when performing directional prediction, and a prediction error at the boundary of a block caused by loss of continuity with adjacent coded signals when performing average value prediction, thereby achieving an effect of generating an intra prediction image on the side of the moving picture decoding apparatus that is the same as the intra prediction image generated on the side of the moving picture coding apparatus. The parameters (1) are the size ( lin × min ) of the partition Pin , (2) are the quantization parameters included in the prediction error coding parameters, (3) are the distances between the coded pixel group used in generating the intermediate prediction image and the pixel to be filtered, and (4) are the index values of the intra prediction mode used in generating the intermediate prediction image.
实施方式2Implementation Method 2
在上述实施方式1中,表示以下这样的方式,即、帧内预测部4使用帧内的已编码的图像信号,在通过实施帧内预测处理来生成帧内预测图像时,从预先准备的1个以上的滤波器中,根据滤波处理对象块的编码所涉及的各种参数的状态来选择滤波器,使用该滤波器对预测图像实施滤波处理,但当设计了编码对象的块与预测图像间的平方误差和为最小的维纳滤波器,使用上述维纳滤波器与使用从预先准备的1个以上的滤波器中选择的滤波器相比,预测误差的减少程度变高的情况下,也可以代替其选择的滤波器,使用上述维纳滤波器,对预测图像实施滤波处理。In the above-mentioned embodiment 1, the following method is represented, that is, the intra-frame prediction unit 4 uses the encoded image signal in the frame, and when generating the intra-frame predicted image by performing intra-frame prediction processing, a filter is selected from one or more pre-prepared filters according to the states of various parameters involved in the encoding of the filtering object block, and the predicted image is filtered using the filter. However, when a Wiener filter is designed in which the sum of square errors between the encoding object block and the predicted image is minimized, and the degree of reduction in prediction error becomes higher when the above-mentioned Wiener filter is used compared with the use of the filter selected from one or more pre-prepared filters, the predicted image can also be filtered using the above-mentioned Wiener filter instead of the selected filter.
以下,具体地说明处理内容。The following describes the processing details in detail.
在上述实施方式1中,帧内预测部4、53从预先准备的1个以上的滤波器中,根据滤波处理对象块的编码所涉及的各种参数的状态来选择滤波器。但是,当考虑4个参数(1)~(4)来选择滤波器的情况下,可以从选择候补中选择适宜的滤波器,而当在选择候补以外存在最佳滤波器的情况下,不能进行“最佳的滤波处理”。In the first embodiment described above, the intra-frame prediction units 4 and 53 select a filter from one or more pre-prepared filters based on the states of various parameters related to the encoding of the target block to be filtered. However, when the filter is selected based on the four parameters (1) to (4), an appropriate filter can be selected from the selection candidates. However, if an optimal filter exists outside the selection candidates, "optimal filtering" cannot be performed.
在本实施方式2中,其特征在于:以图片为单位在运动图像编码装置一侧设计最佳滤波器实施滤波处理,并且对该滤波器的滤波器系数等进行编码,在运动图像译码装置一侧通过译码该滤波器系数等,实施使用了该滤波器的滤波处理。In this embodiment 2, it is characterized in that: an optimal filter is designed on the side of the motion image encoding device in units of pictures to implement filtering processing, and the filter coefficients of the filter are encoded, and the filtering processing using the filter is implemented on the side of the motion image decoding device by decoding the filter coefficients.
运动图像编码装置的帧内预测部4和上述实施方式1一样,通过对编码块Bn的分区Pn实施帧内预测处理来生成帧内预测图像Pi n。The intra prediction unit 4 of the moving picture encoding device generates an intra prediction image P in by performing an intra prediction process on the partition P n of the coding block B n , similarly to the first embodiment.
另外,帧内预测部4用和上述实施方式1一样的方法,从预先准备的1个以上的滤波器中,根据滤波处理对象块的编码所涉及的各种参数的状态来选择滤波器,使用该滤波器对帧内预测图像Pi n进行滤波处理。In addition, the intra-frame prediction unit 4 uses the same method as the above-mentioned embodiment 1 to select a filter from one or more pre-prepared filters according to the states of various parameters related to the encoding of the filtering target block, and uses the filter to filter the intra-frame prediction image P i n .
帧内预测部4在图片内的全部的编码块Bn中决定了帧内预测参数后,对在图片内使用同一滤波器的区域(具有同一滤波器索引的区域)的每个设计该区域内的输入图像和帧内预测图像的平方误差和(对象区域内的均方误差)为最小的维纳滤波器。After the intra-frame prediction unit 4 determines the intra-frame prediction parameters in all the coding blocks Bn in the picture, it designs a Wiener filter for each area using the same filter in the picture (area with the same filter index) so that the sum of the square errors between the input image and the intra-frame prediction image in the area (the mean square error in the object area) is minimized.
维纳滤波器可以利用中间预测图像信号s’的自相关矩阵Rs’s’、输入图像信号s和中间预测图像信号s’的互相关矩阵Rss’根据以下的式(4)求出滤波器系数w。矩阵Rs’s’和Rss’的大小与所求的滤波器抽头数对应。The Wiener filter can calculate the filter coefficient w using the autocorrelation matrix R s's' of the intermediate predicted image signal s' and the cross-correlation matrix R ss' between the input image signal s and the intermediate predicted image signal s' according to the following equation (4). The sizes of the matrices R s's' and R ss' correspond to the number of filter taps required.
帧内预测部4在设计维纳滤波器时,设使用该维纳滤波器实施滤波处理时的滤波器设计对象区域内的平方误差和为D1,设编码涉及该维纳滤波器的信息(例如,滤波器系数)时的编码量为R1,设使用以与上述实施方式1同样的方法选择的滤波器来实施滤波处理的情况下的滤波器设计对象区域内的平方误差和为D2,确认下式(5)是否成立。When the intra-frame prediction unit 4 designs the Wiener filter, it assumes that the sum of square errors in the filter design object area when the Wiener filter is used to perform filtering processing is D1, the coding amount when encoding information related to the Wiener filter (for example, filter coefficients) is R1, and the sum of square errors in the filter design object area when the filter is selected in the same way as in the above-mentioned embodiment 1 to perform filtering processing is D2, and confirms whether the following formula (5) holds.
D1+λ·R1<D2 (5)D1+λ·R1<D2 (5)
其中,λ是常数。Here, λ is a constant.
帧内预测部4在式(5)成立的情况下,代替用与上述实施方式1同样的方法进行选择的滤波器,而使用该维纳滤波器来实施滤波处理。When equation (5) holds true, the intra prediction unit 4 performs filtering using the Wiener filter instead of the filter selected in the same manner as in the first embodiment.
另一方面,当式(5)不成立的情况下,使用以与上述实施方式1同样的方法进行选择的滤波器来实施滤波处理。On the other hand, when the equation (5) does not hold, filtering processing is performed using a filter selected in the same manner as in the first embodiment.
在此,用平方误差和D1、D2进行评价,但并不限于此,代替平方误差和D1、D2,也可以使用误差的绝对值和等的其他的表示预测失真的尺度来进行评价。Here, the evaluation is performed using the sum of squared errors D1 and D2, but the present invention is not limited thereto. Instead of the sum of squared errors D1 and D2, another measure representing prediction distortion, such as the sum of absolute values of errors, may be used for evaluation.
帧内预测部4在使用维纳滤波器实施滤波处理的情况下,需要表示该维纳滤波器的滤波器系数、将哪个索引的滤波器置换为维纳滤波器的滤波器更新信息。When the intra prediction unit 4 performs filtering using a Wiener filter, filter update information indicating the filter coefficients of the Wiener filter and the filter index to be replaced with the Wiener filter is required.
具体地说,当将通过使用了滤波器选择参数的滤波处理可以选择的滤波器的数量设为L,对各滤波器分配0~L-1的索引的情况下,对于各个索引,需要在使用设计好的维纳滤波器的情况下将“1”值作为滤波器更新信息进行编码,在使用预先准备的滤波器的情况下将“0”值作为滤波器更新信息进行编码。Specifically, when the number of filters that can be selected by filtering processing using filter selection parameters is set to L, and each filter is assigned an index of 0 to L-1, for each index, when using a designed Wiener filter, it is necessary to encode a "1" value as filter update information, and when using a pre-prepared filter, it is necessary to encode a "0" value as filter update information.
可变长度编码部13对从帧内预测部4输出的滤波器更新信息进行可变长度编码,在比特流中复用该滤波器更新信息的编码数据。The variable-length coding unit 13 performs variable-length coding on the filter update information output from the intra prediction unit 4 and multiplexes the coded data of the filter update information into a bit stream.
在此,虽然示出了对在图片内使用相同的滤波器的每个区域,设计该区域内的输入图像和预测图像的均方误差为最小的维纳滤波器的情况,但也可以构成为对不是图片单位的其他特定的区域单位,针对使用了相同的滤波器的每个区域设计该区域内的输入图像和预测图像的均方误差为最小的维纳滤波器,还可以是只对某一特定的图片进行滤波器设计,或者只在与特定的条件相应的情况下(例如,附加场景变化检测功能,检测到场景变化的图片的情况下),进行上述滤波器的设计。Here, although the case is shown in which a Wiener filter is designed so that the mean square error between the input image and the predicted image in each area using the same filter in the picture is minimized, it can also be constructed so that a Wiener filter is designed so that the mean square error between the input image and the predicted image in each area using the same filter is minimized for other specific area units that are not picture units. It can also be possible to design a filter only for a specific picture, or to design the above-mentioned filter only in accordance with specific conditions (for example, when a scene change detection function is added and a picture with a scene change is detected).
运动图像译码装置的可变长度译码部51从被复用于比特流的编码数据中对滤波器更新信息进行可变长度译码。The variable-length decoding unit 51 of the moving picture decoding device performs variable-length decoding on the filter update information from the encoded data multiplexed into the bit stream.
帧内预测部53与上述实施方式1同样地,通过对编码块Bn的分区Pi n实施帧内预测处理来生成帧内预测图像Pi n。The intra prediction unit 53 generates an intra prediction image Pin by performing an intra prediction process on the partition Pin of the coding block Bn , similarly to the first embodiment.
帧内预测部53在从可变长度译码部51接收到滤波器更新信息时,参照该滤波器更新信息,确认在相应的索引的滤波器中有无更新。Upon receiving the filter updating information from the variable-length decoding unit 51 , the intra prediction unit 53 refers to the filter updating information and checks whether or not the filter of the corresponding index has been updated.
帧内预测部53在确认的结果是当将某一区域的滤波器置换为维纳滤波器的情况下,读出包含于该滤波器更新信息的维纳滤波器的滤波器系数,确定该维纳滤波器,使用该维纳滤波器来实施帧内预测图像Pi n的滤波处理。When the result of confirmation is that the filter of a certain area is replaced by a Wiener filter, the intra-frame prediction unit 53 reads the filter coefficient of the Wiener filter contained in the filter update information, determines the Wiener filter, and uses the Wiener filter to perform filtering processing on the intra-frame prediction image P i n .
另一方面,在未置换为维纳滤波器的区域中,用与上述实施方式1一样的方法来选择滤波器,使用该滤波器来实施帧内预测图像Pi n的滤波处理。On the other hand, in the region not replaced by the Wiener filter, a filter is selected by the same method as in the first embodiment, and filtering processing is performed on the intra- frame prediction image Pin using this filter.
从以上可知,根据本实施方式2,因为构成为设计编码对象的块和预测图像间的平方误差和为最小的维纳滤波器,当使用该维纳滤波器与使用从预先准备的1个以上的滤波器中选择的滤波器相比,预测误差的减少程度变高的情况下,代替其选择的滤波器而使用该维纳滤波器,对预测图像实施滤波处理,所以能够起到与上述实施方式1相比进一步减少局部性地发生的预测误差的效果。As can be seen from the above, according to the present embodiment 2, since the Wiener filter is designed to minimize the sum of square errors between the block of the encoding object and the predicted image, when the degree of reduction in the prediction error becomes higher than that of using a filter selected from one or more pre-prepared filters, the Wiener filter is used instead of the selected filter to perform filtering processing on the predicted image, so that the effect of further reducing the locally occurring prediction error can be achieved compared with the above-mentioned embodiment 1.
而且,在本申请发明在其发明的范围内可以进行各实施方式的自由的组合、或者各实施方式的任意的构成要素的变形、或者在各实施方式中省略任意的构成要素。Furthermore, within the scope of the present invention, the various embodiments can be freely combined, arbitrary components of the various embodiments can be modified, or arbitrary components can be omitted in the various embodiments.
产业上的可利用性Industrial applicability
本发明适于需要高效率地编码图像的图像编码装置,另外,适于需要高效率地对被编码的图像进行译码的图像译码装置。The present invention is suitable for an image encoding device that needs to encode an image efficiently, and is also suitable for an image decoding device that needs to decode an encoded image efficiently.
Claims (4)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2011-004038 | 2011-01-12 | ||
| JP2011004038 | 2011-01-12 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| HK13113477.0A Addition HK1186323B (en) | 2011-01-12 | 2012-01-06 | Image encoding device, image decoding device, image encoding method, and image decoding method |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| HK13113477.0A Division HK1186323B (en) | 2011-01-12 | 2012-01-06 | Image encoding device, image decoding device, image encoding method, and image decoding method |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1224114A1 HK1224114A1 (en) | 2017-08-11 |
| HK1224114B true HK1224114B (en) | 2019-11-29 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN105959706B (en) | Image coding apparatus and method, and image decoding apparatus and method | |
| KR101580021B1 (en) | Image decoding device, image encoding device, image encoding method, and image decoding method | |
| HK1224114B (en) | Image encoding device and method, image decoding device and method | |
| HK1224113B (en) | Image encoding device and method, image decoding device and method | |
| HK1224112B (en) | Image encoding device and method, image decoding device and method | |
| HK1225541B (en) | Image encoding device and method, image decoding device and method | |
| HK1225203B (en) | Motion-video encoding apparatus and method and motion-video decoding apparatus and method | |
| HK1219593B (en) | Motion-video encoding apparatus, motion-video decoding apparatus, motion-video encoding method, and motion-video decoding method | |
| HK1225204B (en) | Motion-video encoding apparatus and method and motion-video decoding apparatus and method | |
| HK1186323B (en) | Image encoding device, image decoding device, image encoding method, and image decoding method | |
| HK1225204A1 (en) | Motion-video encoding apparatus and method and motion-video decoding apparatus and method | |
| HK1225203A1 (en) | Motion-video encoding apparatus and method and motion-video decoding apparatus and method | |
| HK1219594B (en) | Motion-video encoding apparatus, motion-video decoding apparatus, motion-video encoding method, and motion-video decoding method | |
| HK1182244B (en) | Motion-video encoding apparatus, motion-video decoding apparatus, motion-video encoding method, and motion-video decoding method | |
| HK1222072B (en) | Image encoding device, method thereof, image decoding device and method thereof |