JP4561701B2

JP4561701B2 - Video encoding device

Info

Publication number: JP4561701B2
Application number: JP2006183982A
Authority: JP
Inventors: 英樹竹原; 孝森重; 茂福島; 仁村崎; 一郎安藤
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2006-07-04
Filing date: 2006-07-04
Publication date: 2010-10-13
Anticipated expiration: 2026-07-04
Also published as: JP2008016985A

Description

本発明は動画像符号化装置に係り、特に動画像に関する画像データを所定画素数のブロックに分割し、前記ブロックを単位として動き補償予測を用いて圧縮符号化を行う動画像符号化装置に関する。 The present invention relates to a moving image encoding apparatus, and more particularly to a moving image encoding apparatus that divides image data related to a moving image into blocks having a predetermined number of pixels and performs compression encoding using motion compensated prediction in units of the blocks.

現在、動画像に関する画像データを高能率に圧縮して符号化する圧縮符号化規格が多数規格化され、磁気テープ、ＤＶＤ(Digital Versatile Disc)、半導体メモリなどの蓄積メディアや衛星デジタル放送、インターネットなどの放送通信分野で広く使用されている。 Currently, a number of compression coding standards that efficiently compress and encode image data related to moving images have been standardized, and storage media such as magnetic tape, DVD (Digital Versatile Disc), semiconductor memory, satellite digital broadcasting, the Internet, etc. Widely used in the broadcasting and communication field.

上記の圧縮符号化規格のうち代表的なＭＰＥＧ（Moving Picture Experts Group）などでは、インタレース走査により得られる動画像データを、所定のブロック単位でインタレース構造とフレーム構造を切り替えて符号化する。これは、フィールド構造の動画像データはフレーム構造の動画像データと比較し、垂直方向の画素が間引きとなるため垂直方向の画素空間相関（空間解像度）が低くなるが、１フレーム分の時間を２フィールドで表現するため画素時間相関（時間解像度）が高くなるため、時間的変化が少ない、動きの少ない画像データの場合は、空間解像度が高くなるフレーム構成で符号化を行うと効率が良くなり、逆に、時間的変化が大きい、動きの多い画像データの場合は、時間解像度が高くなるフィールド構成で符号化を行うと効率が良くなるためである。 In a typical MPEG (Moving Picture Experts Group) among the above compression coding standards, moving image data obtained by interlace scanning is encoded by switching between an interlace structure and a frame structure in a predetermined block unit. This is because the field structure moving image data is compared with the frame structure moving image data, and the vertical pixel spatial correlation (spatial resolution) is low because the vertical pixels are thinned out. Since the pixel time correlation (temporal resolution) is high because it is expressed in two fields, in the case of image data with little temporal change and little motion, encoding with a frame configuration with high spatial resolution will improve efficiency. On the contrary, in the case of image data with a large temporal change and a lot of movement, it is because efficiency is improved if encoding is performed with a field configuration with a high temporal resolution.

そこで、フレーム構造とフィールド構造の垂直方向の画素相関値を比較して、相関の高い方の構造を符号化する動画像符号化装置が開示されている（例えば、特許文献１参照）。 In view of this, there has been disclosed a moving image encoding apparatus that compares the pixel correlation values in the vertical direction of the frame structure and the field structure and encodes the structure having the higher correlation (see, for example, Patent Document 1).

特許第３１２５１４５号公報Japanese Patent No. 3125145

しかし、上記の特許文献１記載の動画像符号化装置では次のような課題がある。一般的に、動画像データをブロック単位で圧縮符号化する動画像符号化装置では、動画像データに対して画像内符号化のみを行うＩフレーム、片方向画像間予測符号化を行うＰフレーム、双方向画像間予測符号化を行うＢフレームの計３つのフレームタイプ（予測タイプとしてのピクチャタイプ）がある。Ｉフレームでは全てのブロックが画像内符号化され、Ｐフレームではブロック単位で画像内符号化と片方向画像間予測符号化が切り替えられ、Ｂフレームではブロック単位で画像内符号化、片方向画像間予測符号化、双方向画像間予測符号化が切り替えられる。上述した動画像符号化装置では、フレームタイプに関わらず、１画像内におけるフレーム構造とフィールド構造の垂直方向の画素相関値を比較して、相関の高い方の構造を符号化している。 However, the moving picture encoding apparatus described in Patent Document 1 has the following problems. In general, in a moving image encoding apparatus that compresses and encodes moving image data in units of blocks, an I frame that performs only intra-coding on moving image data, a P frame that performs unidirectional inter-image predictive encoding, There are a total of three frame types (picture type as a prediction type) of B frames that perform bidirectional inter-picture prediction encoding. In the I frame, all blocks are intra-coded, in the P frame, intra-frame coding and unidirectional inter-picture prediction coding are switched on a block basis, and in the B frame, intra-frame coding and unidirectional inter-picture coding on a block basis. Predictive coding and bidirectional inter-picture predictive coding are switched. In the above-described moving picture encoding apparatus, regardless of the frame type, the pixel correlation value in the vertical direction of the frame structure and the field structure in one image is compared, and the structure having the higher correlation is encoded.

画像内符号化を行う際には、アクティビティから求めた相関の高い構造を選択することで符号化効率が向上することが期待できるが、画像間予測符号化を行う際には、画像間予測効率の影響を受けるため、１画像内のアクティビティから求めた相関の高い構造を選択することが必ずしも符号化効率の向上には繋がらない。そこで、１画像内のアクティビティに加えて画像間予測効率を考慮することでより符号化効率の向上を図る。 When performing intra-picture coding, it can be expected that the coding efficiency will be improved by selecting a highly correlated structure obtained from the activity. However, when performing inter-picture prediction coding, the inter-picture prediction efficiency is improved. Therefore, the selection of a highly correlated structure obtained from the activity in one image does not necessarily lead to an improvement in coding efficiency. Therefore, the encoding efficiency is further improved by considering the inter-picture prediction efficiency in addition to the activity in one image.

すなわち、図６（ａ）に示すようにＦ０、Ｆ１、Ｆ２の順で１／３０秒のフレーム毎に表示が切り替わるフレーム構造での画像間予測距離は１／３０秒となり、一方、図６（ｂ）に示すようにあるフレーム０のトップフィールドｆＴ０及びボトムフィールドｆＢ０、次のフレーム１のトップフィールドｆＴ１、ボトムフィールドｆＴＢ１、次のフレーム２のトップフィールドｆＴ２、ボトムフィールドｆＴＢ２の順で１／６０秒のフィールド毎に表示が切り替わるフィールド構造での画像間予測距離は１／６０秒である。動画像には時間方向に相関性があるため、画像間予測距離が近いほど予測効率は高くなる。 That is, as shown in FIG. 6 (a), the inter-picture prediction distance in the frame structure in which the display is switched every frame of 1/30 seconds in the order of F0, F1, and F2 is 1/30 seconds, while FIG. As shown in b), the top field fT0 and the bottom field fB0 of the frame 0, the top field fT1 of the next frame 1, the bottom field fTB1, the top field fT2 of the next frame 2, and the bottom field fTB2 in this order are 1/60 seconds. The predicted distance between images in the field structure in which the display is switched for each field is 1/60 seconds. Since moving images are correlated in the time direction, the prediction efficiency increases as the inter-image prediction distance is shorter.

従って、画像間予測符号化を行うフレームタイプ（Ｐフレーム及びＢフレーム）については、フィールド構造の選択率を増加させることで、画像間予測効率を向上させることが可能となる。特に、フィールド／フレーム判定が曖昧になりやすい特徴を持った複雑で細かな動きが多い動画像について効果を期待できる。 Therefore, for frame types (P frames and B frames) for which inter-picture prediction encoding is performed, it is possible to improve inter-picture prediction efficiency by increasing the selection rate of the field structure. In particular, the effect can be expected for a moving image having a complicated and detailed movement with a feature in which field / frame determination tends to be ambiguous.

本発明は以上の点を鑑みてなされたもので、フレームタイプに応じてフレーム構造とフィールド構造の選択率を可変にすることで、画像間予測効率を向上させ、好適な画質の符号化信号を生成し得る動画像符号化装置を提供することを目的とする。 The present invention has been made in view of the above points. By changing the selection rate of the frame structure and the field structure according to the frame type, the inter-picture prediction efficiency is improved, and an encoded signal having a suitable image quality is obtained. It is an object of the present invention to provide a moving image encoding device that can be generated.

上記目的を達成するため、本発明は、インターレース走査による入来動画像信号を予め定められた第１の画素数の第１のブロックを最小符号化単位として、複数の第１のブロックを備えた所定の画像単位毎に画像内符号化又は画像間予測符号化して符号化画像信号を得る動画像符号化装置において、入来動画像信号を第１の画素数よりも少ない第２の画素数の第２のブロック単位で、フレーム構造の垂直画素間エネルギーを示すブロックフレームアクティビティを算出する第１の算出手段と、入来動画像信号を第２のブロック単位で、フィールド構造の垂直画素間エネルギーを示すブロックフィールドアクティビティを算出する第２の算出手段と、動画像信号の各画像単位を画像内符号化と画像間予測符号化のどちらで符号化するかを示す予測タイプ信号と、ブロックフレームアクティビティ及びブロックフィールドアクティビティとが供給され、ブロックフィールドアクティビティに基づいて現符号化対象の第１のブロックにおけるフィールドアクティビティに関する値であるＦＩを生成すると共に、現符号化対象の第１のブロックを含む画像単位の予測タイプ信号が画像内符号化を示しているときは、ブロックフレームアクティビティに基づいて現符号化対象の第１のブロックにおけるフレームアクティビティに関する値であるＦＲ１を生成し、現符号化対象の第１のブロックを含む画像単位の予測タイプ信号が画像間予測符号化を示しているときは、ブロックフレームアクティビティに基づいて現符号化対象の第１のブロックにおけるフレームアクティビティに関する値であり、ＦＲ１より大なる値のＦＲ２を生成する生成手段と、ＦＩがＦＲ１より大又はＦＩがＦＲ２より大のときは、現符号化対象の第１のブロックをフレーム構造で符号化させるように制御し、ＦＩがＦＲ１より小又はＦＩがＦＲ２より小のときは、現符号化対象の第１のブロックをフィールド構造で符号化させるように制御し、ＦＩがＦＲ１と等しい又はＦＩがＦＲ２と等しいときは、現符号化対象の第１のブロックをフィールド構造もしくはフレーム構造のどちらか一方の構造で符号化させるように制御する符号化構造判定手段とを有することをことを特徴とする。 In order to achieve the above object, the present invention comprises a plurality of first blocks, with an incoming moving image signal obtained by interlace scanning as a minimum encoding unit of a first block having a predetermined first number of pixels. In a moving image encoding apparatus that obtains an encoded image signal by performing intra-image encoding or inter-image predictive encoding for each predetermined image unit, an incoming moving image signal having a second number of pixels smaller than the first number of pixels. A first calculation means for calculating a block frame activity indicating a vertical inter-pixel energy of a frame structure in a second block unit; and an incoming moving image signal for a vertical inter-pixel energy of a field structure in a second block unit. A second calculating means for calculating the indicated block field activity, and a prediction indicating whether each image unit of the moving image signal is encoded by intra-picture encoding or inter-picture predictive encoding. A type signal and a block frame activity and a block field activity are supplied, and based on the block field activity, an FI that is a value related to the field activity in the first block to be encoded is generated, and the current encoding target When the prediction unit signal of the image unit including one block indicates intra-picture encoding, FR1 that is a value related to the frame activity in the first block to be encoded is generated based on the block frame activity; When the prediction type signal for each image including the first block to be encoded indicates inter-picture prediction encoding, a value related to the frame activity in the first block to be encoded based on the block frame activity In , Generating means for generating FR2 having a value greater than FR1, and when FI is greater than FR1 or FI is greater than FR2, control is performed so that the first block to be encoded is encoded in a frame structure. When FI is smaller than FR1 or FI is smaller than FR2, control is performed so that the first block to be encoded is encoded with a field structure. When FI is equal to FR1 or FI is equal to FR2 And coding structure determining means for controlling to encode the first block to be coded with either the field structure or the frame structure.

この発明では、画像内符号化を行うときにはブロックフィールドアクティビティに基づいて生成したＦＩと、ブロックフレームアクティビティに基づいて生成したＦＲ１とを比較し、画像間予測符号化を行うときには上記のＦＩとブロックフレームアクティビティに基づいて生成したＦＲ１より大なる値のＦＲ２とを比較し、ＦＲ１又はＦＲ２がＦＩより大なる値のときはフィールド構造で符号化した符号化画像信号を出力するが、ＦＲ２はＦＲ１よりも大であるので、画像間予測符号化を行うときには画像内符号化を行うときよりもフィールド構造で符号化した符号化画像信号をより高い確率で出力することができる。 In the present invention, when intra-picture encoding is performed, the FI generated based on the block field activity is compared with FR1 generated based on the block frame activity, and when performing inter-picture predictive encoding, the above-described FI and block frame are compared. Compared with FR2 having a value greater than FR1 generated based on the activity, and when FR1 or FR2 is greater than FI, an encoded image signal encoded with a field structure is output, but FR2 is more than FR1. Therefore, when inter-picture predictive coding is performed, a coded image signal coded with a field structure can be output with higher probability than when intra-picture coding is performed.

本発明によれば、フレームタイプに応じてフレーム構造とフィールド構造の選択率を可変にすることで、特に、フィールド／フレーム判定が曖昧になりやすい特徴を持った複雑で細かな動きが多い動画像について、好適な画質の動画像符号化装置を提供することができ、また、画像間予測符号化を行うときには画像内符号化を行うときよりもフィールド構造で符号化した符号化画像信号をより高い確率で出力するようにしたため、画像間予測符号化では予測効率の高いフィールド構造を優先することができ、よりフレームタイプに適合した最適な符号化画像信号の選択出力ができる。 According to the present invention, by making the selection rate of the frame structure and the field structure variable according to the frame type, in particular, a moving image with a lot of complicated and fine motion having a characteristic that field / frame determination is likely to be ambiguous. Therefore, it is possible to provide a moving image coding apparatus having a suitable image quality, and when performing inter-picture predictive coding, a coded image signal coded with a field structure is higher than when performing intra-picture coding. Since output is performed with a probability, a field structure with high prediction efficiency can be prioritized in inter-picture prediction encoding, and an optimal encoded image signal more suitable for the frame type can be selected and output.

次に、本発明の実施の形態について図面と共に説明する。図１は本発明になる動画像符号化装置の一実施の形態の構成図を示す。図２は本発明になる動画像符号化装置の一実施形態の全体的な動作を説明するフローチャートである。図１の実施の形態の構成及び動作について、図２を参照して説明する。まず、符号化対象のインターレース走査動画信号である入力画像信号ａは、画像信号ａの各画像単位を画像内符号化と画像間予測符号化のどちらで符号化するかを示す予測タイプ信号であるフレームタイプ信号ｂと共に符号化構造決定器１１に入力される。符号化構造決定器１１は後述する構成により、フレームタイプ信号ｂに応じて入力画像信号を基本ブロック毎にフレーム構造またはフィールド構造で符号化するのかを決定し（ステップＳ１００）、決定したフレーム／フィールド構造を示す符号化構造信号ｃと符号化対象画像信号ｄを出力する。 Next, embodiments of the present invention will be described with reference to the drawings. FIG. 1 shows a configuration diagram of an embodiment of a moving picture encoding apparatus according to the present invention. FIG. 2 is a flowchart for explaining the overall operation of an embodiment of the moving picture coding apparatus according to the present invention. The configuration and operation of the embodiment of FIG. 1 will be described with reference to FIG. First, an input image signal a which is an interlace scanning moving image signal to be encoded is a prediction type signal indicating whether each image unit of the image signal a is encoded by intra-picture encoding or inter-picture predictive encoding. It is input to the coding structure determiner 11 together with the frame type signal b. The coding structure determiner 11 determines whether to encode the input image signal with a frame structure or a field structure for each basic block in accordance with the frame type signal b with the configuration described later (step S100). An encoded structure signal c indicating the structure and an encoding target image signal d are output.

ここで、符号化対象画像信号ｄは、符号化構造信号ｃがフレーム構造を示していれば入力画像信号ａをフレーム構造で整列した画像信号となり、符号化構造信号ｃがフィールド構造を示していれば入力画像信号ａをフィールド構造で整列した画像信号となる。また、基本ブロックはマクロブロック単位（水平１６画素×垂直１６画素）を用いる。なお、基本ブロックはマクロブロックペア単位（水平１６画素×垂直３２画素）とすることもできる。 Here, the encoding target image signal d is an image signal in which the input image signal a is arranged in the frame structure if the encoding structure signal c indicates the frame structure, and the encoding structure signal c indicates the field structure. For example, the input image signal a is an image signal arranged in a field structure. The basic block uses a macroblock unit (16 horizontal pixels × 16 vertical pixels). Note that the basic block may be a macroblock pair unit (horizontal 16 pixels × vertical 32 pixels).

符号化構造信号ｃと符号化対象画像信号ｄは動き検出器１３に与えられる。動き検出器１３は符号化構造信号ｃに応じて、符号化対象画像信号ｄと画像メモリ２３からの参照画像信号から動きベクトルを求める（ステップＳ１０１）。動き補償予測器１４は動き検出器１３からの動きベクトルと画像メモリ２３からの参照画像信号とから予測信号を生成し（ステップＳ１０２）、減算器１２に送る。減算器１２は符号化対象画像信号ｄから動き補償予測器１４からの予測信号を減算して予測誤差信号を生成する（ステップＳ１０３）。 The encoded structure signal c and the encoding target image signal d are given to the motion detector 13. The motion detector 13 obtains a motion vector from the encoding target image signal d and the reference image signal from the image memory 23 according to the encoding structure signal c (step S101). The motion compensation predictor 14 generates a prediction signal from the motion vector from the motion detector 13 and the reference image signal from the image memory 23 (step S102), and sends it to the subtractor 12. The subtracter 12 subtracts the prediction signal from the motion compensated predictor 14 from the encoding target image signal d to generate a prediction error signal (step S103).

直交変換器１５は減算器１２からの予測誤差信号の直交変換処理を行い（ステップＳ１０４）、得られた係数を量子化器１６に与える。量子化器１６は所定の量子化ステップ幅で係数を量子化し（ステップＳ１０５）、量子化された係数をエントロピー符号化器１７と逆量子化器１９に与える。エントロピー符号化器１７は量子化された係数を予め定められた方法に従ってエントロピー符号化する（ステップＳ１０６）。多重化器１８では、フレームタイプ信号ｂ、符号化構造信号ｃ、動きベクトル及びエントロピー符号化した符号列などと共にヘッダ情報を多重化し（ステップＳ１０７）、動画像符号信号ｅを生成して出力する。 The orthogonal transformer 15 performs orthogonal transformation processing of the prediction error signal from the subtracter 12 (step S104), and gives the obtained coefficient to the quantizer 16. The quantizer 16 quantizes the coefficients with a predetermined quantization step width (step S105), and supplies the quantized coefficients to the entropy encoder 17 and the inverse quantizer 19. The entropy encoder 17 performs entropy encoding on the quantized coefficient according to a predetermined method (step S106). The multiplexer 18 multiplexes the header information together with the frame type signal b, the encoded structure signal c, the motion vector, the entropy-encoded code string, and the like (step S107), and generates and outputs a moving image code signal e.

一方、逆量子化器１９は量子化器１６からの量子化された係数を逆量子化し（ステップＳ１０８）、逆直交変換器２０は、逆量子化器１９からの逆量子化された信号に対して直交変換と逆の処理を行い、予測誤差信号を再生する（ステップＳ１０９）。加算器２１は逆直交変換器２０からの予測誤差信号と動き補償予測器１４からの予測信号とを加算し（ステップＳ１１０）、局部復号された再生画像信号を生成する。この再生画像信号は、符号化時に生じるブロック歪みを減少させるためのデブロック処理がループフィルタ２２によって行われた後（ステップＳ１１１）、画像メモリ２３に供給されて蓄えられる（ステップＳ１１２）。画像メモリ２３に蓄積された再生画像信号は、必要に応じて読み出され参照画像信号として動き検出器１３及び動き補償予測器１４にそれぞれ供給される。 On the other hand, the inverse quantizer 19 inversely quantizes the quantized coefficient from the quantizer 16 (step S108), and the inverse orthogonal transformer 20 applies the inverse quantized signal from the inverse quantizer 19 to the inversely quantized signal. Then, a process reverse to the orthogonal transform is performed to reproduce the prediction error signal (step S109). The adder 21 adds the prediction error signal from the inverse orthogonal transformer 20 and the prediction signal from the motion compensation predictor 14 (step S110), and generates a locally decoded reproduced image signal. The reproduced image signal is supplied to and stored in the image memory 23 (step S112) after deblocking processing for reducing block distortion occurring at the time of encoding is performed by the loop filter 22 (step S111). The reproduced image signal stored in the image memory 23 is read as necessary and supplied to the motion detector 13 and the motion compensation predictor 14 as a reference image signal.

次に、本実施の形態における動画像符号化装置の特徴的な構成要素である符号化構造決定器１１の構成及び動作について更に詳細に説明する。図３は本発明における動画像符号化装置の特徴的な構成要素である符号化構造決定器１１の一実施の形態のブロック図を示す。図３に示すように、符号化構造決定器１１は、フレームアクティビティ演算器１１１、フィールドアクティビティ演算器１１２及び判定器１１３から構成されている。 Next, the configuration and operation of the coding structure determiner 11 that is a characteristic component of the moving picture coding apparatus according to the present embodiment will be described in more detail. FIG. 3 shows a block diagram of an embodiment of a coding structure determiner 11 which is a characteristic component of the moving picture coding apparatus according to the present invention. As shown in FIG. 3, the coding structure determiner 11 includes a frame activity calculator 111, a field activity calculator 112, and a determiner 113.

フレームアクティビティ演算器１１１は図５（ａ）に示すように、白丸で示すトップフィールド（奇数フィールド）の画素と、黒丸で示すボトムフィールド（偶数フィールド）の画素のうち、アクティビティブロック単位（水平８画素×垂直８画素）毎に後述する（１）式によりブロックフレームアクティビティ(Block Frame Activity)を算出する。このブロックフレームアクティビティ(Block Frame Activity)は、垂直方向に隣接する２ラインの画素間のエネルギーの総和である。 As shown in FIG. 5A, the frame activity computing unit 111 is an activity block unit (horizontal 8 pixels) among the top field (odd field) pixels indicated by white circles and the bottom field (even field) pixels indicated by black circles. The block frame activity is calculated for each of (vertical 8 pixels) according to the formula (1) described later. This block frame activity is the sum of energy between two lines of pixels adjacent in the vertical direction.

また、フィールドアクティビティ演算器１１２は図５（ｂ）に示すように、白丸で示すトップフィールド（奇数フィールド）の画素のうち、又は黒丸で示すボトムフィールド（偶数フィールド）の画素のうち、アクティビティブロック単位（水平８画素×垂直８画素）毎に後述する（２）式によりブロックフィールドアクティビティ(Block Field Activity)を算出する。このブロックフィールドアクティビティ(Block Field Activity)は、垂直方向に隣接する２つの画素間のエネルギーの総和である。 In addition, as shown in FIG. 5B, the field activity calculator 112 is an activity block unit among the pixels in the top field (odd field) indicated by white circles or the pixels in the bottom field (even field) indicated by black circles. For each (8 horizontal pixels × 8 vertical pixels), a block field activity is calculated according to equation (2) described later. This block field activity is the sum of energy between two adjacent pixels in the vertical direction.

ここで、（１）式及び（２）式はエネルギーを垂直画素間絶対差分とみなしたアクティビティ算出方法の一例である。また、（１）式、（２）式中、Ｐ（ｘ，ｙ）はフレーム画像における水平座標ｘ、垂直座標ｙにおける画素値であり、Ｐ（ｘ，ｙ＋１）は画素値Ｐ（ｘ，ｙ）の画素より異なるフィールドの１ライン下側に隣接する画素の画素値であり、Ｐ（ｘ，ｙ＋２）は画素値Ｐ（ｘ，ｙ）の画素と同じフィールドの１ライン下側に隣接する画素の画素値である。

Here, equations (1) and (2) are examples of an activity calculation method in which energy is regarded as an absolute difference between vertical pixels. In the expressions (1) and (2), P (x, y) is the pixel value at the horizontal coordinate x and the vertical coordinate y in the frame image, and P (x, y + 1) is the pixel value P (x, y). ) Is a pixel value of a pixel adjacent to one line below the different field, and P (x, y + 2) is a pixel adjacent to one line below the same field as the pixel having the pixel value P (x, y). Pixel value.

また、図３において、判定器１１３は、フレームタイプ信号ｂと、フレームアクティビティ演算器１１１及びフィールドアクティビティ演算器１１２の各出力信号とを入力として受け、基本ブロック毎にフレーム構造またはフィールド構造で符号化するのかを決定して、フレーム／フィールド構造を示す符号化構造信号ｃと符号化対象画像信号ｄを出力する。 In FIG. 3, the determiner 113 receives the frame type signal b and the output signals of the frame activity calculator 111 and the field activity calculator 112 as inputs, and encodes each basic block with a frame structure or a field structure. The coding structure signal c indicating the frame / field structure and the coding target image signal d are output.

次に、図３の実施の形態の動作について、図４のフローチャートを併せ参照して説明する。図３において、入力画像信号ａはフレームアクティビティ演算器１１１とフィールドアクティビティ演算器１１２に入力される。フレームアクティビティ演算器１１１は入力画像信号ａに対して、アクティビティブロック単位（水平８画素×垂直８画素）毎に（１）式に基づいてブロックフレームアクティビティ（Block Frame Activity）を算出して出力する。また、これと並行してフィールドアクティビティ演算器１１２は入力画像信号ａに対して、アクティビティブロック単位（水平８画素×垂直８画素）毎に（２）式に基づいてブロックフィールドアクティビティ（Block Field Activity）を算出して出力する。 Next, the operation of the embodiment of FIG. 3 will be described with reference to the flowchart of FIG. In FIG. 3, an input image signal a is input to a frame activity calculator 111 and a field activity calculator 112. The frame activity calculator 111 calculates and outputs a block frame activity (Block Frame Activity) based on the equation (1) for each activity block (horizontal 8 pixels × vertical 8 pixels) for the input image signal a. In parallel with this, the field activity calculator 112 performs block field activity on the input image signal a based on the equation (2) for each activity block (horizontal 8 pixels × vertical 8 pixels). Is calculated and output.

判定器１１３は、フレームタイプ信号ｂと、フレームアクティビティ演算器１１１及びフィールドアクティビティ演算器１１２の各出力信号とを入力として受け、まず、フレームタイプ信号ｂから符号化対象の入力画像信号ａのフレームタイプがＩフレーム、Ｐフレーム、Ｂフレームのいずれであるかを判定する（ステップＳ２０１）。 The determiner 113 receives the frame type signal b and the output signals of the frame activity calculator 111 and the field activity calculator 112 as inputs. First, the frame type of the input image signal a to be encoded is determined from the frame type signal b. Is an I frame, a P frame, or a B frame (step S201).

また、判定器１１３は、フレームアクティビティ演算器１１１からのブロックフレームアクティビティ（Block Frame Activity）から基本ブロックのフレームアクティビティ（FrameActivity）を算出すると共に、フィールドアクティビティ演算器１１２からのブロックフィールドアクティビティ（Block Field Activity）から基本ブロックのフィールドアクティビティ（FieldActivity）を算出する。ここで、基本ブロックのフレームアクティビティ（FrameActivity）は、基本ブロックに含まれる４つのブロックフレームアクティビティ（Block Frame Activity）の和であり、また、基本ブロックのフィールドアクティビティ（FieldActivity）、基本ブロックに含まれる４つのブロックフィールドアクティビティ（Block Field Activity）の和であり、それぞれ次式で表される。 Further, the determiner 113 calculates the frame activity (FrameActivity) of the basic block from the block frame activity (Block Frame Activity) from the frame activity calculator 111 and at the same time the block field activity (Block Field Activity) from the field activity calculator 112. ) To calculate the basic block field activity (FieldActivity). Here, the frame activity (FrameActivity) of the basic block is the sum of the four block frame activities (Block Frame Activity) included in the basic block, and the field activity (FieldActivity) of the basic block and 4 included in the basic block. It is the sum of two block field activities and is represented by the following equation.

FrameActivity＝ΣBlock Frame Activity （３）
FieldActivity＝ΣBlock Field Activity （４）
なお、基本ブロックのフレーム（フィールド）アクティビティの算出方法は４つのブロックフレーム（フィールド）アクティビティの和に限定しない。４つのブロックフレーム（フィールド）アクティビティの最小値、最大値、平均値などを用いることもできる。 FrameActivity = ΣBlock Frame Activity (3)
FieldActivity = ΣBlock Field Activity (4)
The method for calculating the basic block frame (field) activity is not limited to the sum of the four block frame (field) activities. The minimum value, maximum value, average value, etc. of the four block frame (field) activities can also be used.

判定器１１３はステップＳ２０１でフレームタイプ信号ｂがＩフレームであると判定した場合には、次式
ＦＲ＝FrameActivity （５）
ＦＩ＝FieldActivity （６）
により、それぞれフレームアクティビティＦＲ、フィールドアクティビティＦＩを算出する（ステップＳ２０３）。 If the determination unit 113 determines in step S201 that the frame type signal b is an I frame, the following expression FR = FrameActivity (5)
FI = FieldActivity (6)
To calculate the frame activity FR and the field activity FI, respectively (step S203).

また、判定器１１３はステップＳ２０１でフレームタイプ信号ｂがＰフレームであると判定した場合には、次式
ＦＲ＝FrameActivity＋Op （７）
ＦＩ＝FieldActivity （８）
により、それぞれフレームアクティビティＦＲ、フィールドアクティビティＦＩを算出する（ステップＳ２０２）。また、判定器１１３はステップＳ２０１でフレームタイプ信号ｂがＢフレームであると判定した場合には、次式
ＦＲ＝FrameActivity＋Ob （９）
ＦＩ＝FieldActivity （10）
により、それぞれフレームアクティビティＦＲ、フィールドアクティビティＦＩを算出する（ステップＳ２０４）。すなわち、判定器１１３はフレームタイプがＰフレーム又はＢフレームのときには、ＩフレームのときよりもフレームアクティビティＦＲの値が大なるように、正の値のオフセット値Ｏｐ、Ｏｂ（すなわち、Ｏｐ＞０、Ｏｂ＞０）を加算する。 If the determination unit 113 determines in step S201 that the frame type signal b is a P frame, the following expression FR = FrameActivity + Op (7)
FI = FieldActivity (8)
To calculate the frame activity FR and the field activity FI, respectively (step S202). If the determination unit 113 determines in step S201 that the frame type signal b is a B frame, FR = FrameActivity + Ob (9)
FI = FieldActivity (10)
Thus, the frame activity FR and the field activity FI are calculated respectively (step S204). That is, when the frame type is P frame or B frame, the determiner 113 has positive offset values Op, Ob (that is, Op> 0, so that the value of the frame activity FR is larger than that of the I frame. Ob> 0) is added.

続いて、判定器１１３はステップ２０２、２０３又は２０４で算出した上記のフレームアクティビティＦＲ及びフィールドアクティビティＦＩが、ＦＲ≦ＦＩであるかどうか判定し（ステップＳ２０５）、ＦＲ≦ＦＩであるときにはフレーム構造と決定し（ステップＳ２０６）、ＦＲ＞ＦＩであるときにはフィールド構造と決定する（ステップＳ２０７）。ＦＲ及びＦＩのうち、相対的に値が小さな方が相関が高いからである。なお、ＦＲ＝ＦＩの時にはフィールド構造と決定するようにしてもよい（等号が成立する場合にはどちらの符号化構造でも選択可能としておくようにしてもよい。）。 Subsequently, the determiner 113 determines whether or not the frame activity FR and the field activity FI calculated in step 202, 203, or 204 satisfy FR ≦ FI (step S205). When FR ≦ FI, The field structure is determined (step S206). If FR> FI, the field structure is determined (step S207). This is because, among FR and FI, the smaller the value, the higher the correlation. When FR = FI, the field structure may be determined (if the equal sign holds, either coding structure may be selected).

判定器１１３はこのようにして決定したフレーム構造／フィールド構造を示す符号化構造信号ｃを生成して出力する。判定器１１３は、符号化構造信号ｃがフレーム構造を示していれば入力画像信号ａをフレーム構造として符号化させるように後段の符号化を制御し、符号化構造信号ｃがフィールド構造を示していれば入力画像信号ａをフィールド構造として符号化させるように後段の符号化を制御する。 The determiner 113 generates and outputs an encoded structure signal c indicating the frame structure / field structure determined in this way. The determination unit 113 controls the subsequent encoding so that the input image signal a is encoded as a frame structure if the encoding structure signal c indicates a frame structure, and the encoding structure signal c indicates a field structure. Then, the subsequent encoding is controlled so that the input image signal a is encoded as a field structure.

このように、本実施の形態では、画像間予測符号化を行うフレームタイプであるＰフレームとＢフレームにおいて、ＩフレームのときよりもフレームアクティビティＦＲの値が大なるように、正の値のオフセット値Ｏｐ、Ｏｂを加算したのは、画像間予測を行うＰフレームやＢフレームのフレームアクティビティが画像内符号化を行うＩフレームほど符号化効率との関連性が高くないため、ＰフレームとＢフレームでは予測効率の高いフィールド構造を優先するためである。 As described above, in the present embodiment, in the P frame and the B frame, which are frame types for performing inter-picture predictive coding, the positive value offset is set so that the value of the frame activity FR is larger than that in the I frame. The values Op and Ob are added because the P frame and the B frame are not related to the coding efficiency as much as the I frame in which the frame activity of the P frame or the B frame performing the inter-picture prediction performs the intra-picture coding. This is because priority is given to a field structure with high prediction efficiency.

上記のように、本実施の形態によれば、フレームタイプに応じてフレーム構造とフィールド構造の選択率を可変にすることで、特に、フィールド／フレーム判定が曖昧になりやすい特徴を持った複雑で細かな動きが多い動画像について、好適な画質の動画像符号化装置を提供することができる。 As described above, according to the present embodiment, by making the selection rate of the frame structure and the field structure variable according to the frame type, the field / frame determination is particularly complicated with the characteristic that the field / frame determination tends to be ambiguous. It is possible to provide a moving image encoding device having a suitable image quality for moving images with many fine movements.

なお、本発明は上記の実施の形態に限定されるものではなく、０＜Ｏｐ＜Ｏｂに設定し、Ｐフレームよりも更にＢフレームの方を予測効率の高いフィールド構造を優先するように設定してもよい。また、本実施の形態では予測タイプ信号としてのフレームタイプ信号に応じてオフセット値の制御を行ったが、これを予測タイプ信号としてのスライスタイプ信号に応じて制御することも可能である（フレームやフィールドよりも小さい画像単位であるスライス単位で予測タイプ［画像内予測、片方向予測、双方向予測］を設定している場合もあり、これに対応させるため）。 The present invention is not limited to the above embodiment, and 0 <Op <Ob is set, and the B frame is set to give priority to the field structure with higher prediction efficiency than the P frame. May be. In this embodiment, the offset value is controlled according to the frame type signal as the prediction type signal. However, it is also possible to control this according to the slice type signal as the prediction type signal (for example, a frame or The prediction type [intra-image prediction, unidirectional prediction, bidirectional prediction] may be set in slice units, which are image units smaller than the field, in order to cope with this).

また、本発明は図１及び図３に示した構成をコンピュータにより実行させる動画像符号化プログラムも包含するものである。ここで、この動画像符号化プログラムは、記録されている記録媒体からコンピュータに取り込まれてもよいし、通信ネットワークを介して配信されてコンピュータに取り込まれてもよく、更にはファームウェアとしてコンピュータに組み込まれていてもよい。 The present invention also includes a moving image encoding program that causes a computer to execute the configuration shown in FIGS. Here, the moving picture encoding program may be taken into a computer from a recorded recording medium, distributed via a communication network, taken into the computer, or further incorporated into the computer as firmware. It may be.

本発明の動画像符号化装置の一実施の形態のブロック図である。It is a block diagram of one embodiment of a moving picture coding apparatus of the present invention. 本発明の動画像符号化装置の一実施の形態の動作説明用フローチャートである。It is a flowchart for operation | movement description of one Embodiment of the moving image encoder of this invention. 本発明の動画像符号化装置の要部の符号化構造決定器の一実施の形態のブロック図である。It is a block diagram of one Embodiment of the encoding structure determination device of the principal part of the moving image encoder of this invention. 図３の符号化構造決定器の動作説明用フローチャートである。It is a flowchart for operation | movement description of the encoding structure determiner of FIG. 本発明の動画像符号化装置の実施例におけるフレームアクティビティとフィールドアクティビティを説明する図である。It is a figure explaining the frame activity and the field activity in the Example of the moving image encoder of this invention. フレーム構造とフィールド構造を説明する図である。It is a figure explaining a frame structure and a field structure.

Explanation of symbols

ａ入力画像信号
ｂフレームタイプ信号
ｃ符号化構造信号
ｄ符号化対象画像信号
ｅ動画像符号信号
１１符号化構造決定器
１２減算器
１３動き検出器
１４動き補償予測器
１５直交変換器
１６量子化器
１７エントロピー符号化器
１８多重化器
１９逆量子化器
２０逆直交変換器
２１加算器
２２ループフィルタ
２３画像メモリ
１１１フレームアクティビティ演算器
１１２フィールドアクティビティ演算器
１１３判定器 a input image signal b frame type signal c encoding structure signal d encoding target image signal e moving image code signal 11 encoding structure determiner 12 subtractor 13 motion detector 14 motion compensation predictor 15 orthogonal transformer 16 quantizer 17 Entropy Encoder 18 Multiplexer 19 Inverse Quantizer 20 Inverse Orthogonal Transformer 21 Adder 22 Loop Filter 23 Image Memory 111 Frame Activity Calculator 112 Field Activity Calculator 113 Determinator

Claims

Intra-picture coding is performed for each predetermined image unit including a plurality of the first blocks, with the first block having a predetermined first number of pixels as the incoming video signal by interlace scanning as a minimum coding unit. Alternatively, in a video encoding device that obtains an encoded image signal by performing inter-picture prediction encoding,
First calculation means for calculating a block frame activity indicating vertical inter-pixel energy of a frame structure in the second block unit having a second pixel number smaller than the first pixel number for the incoming moving image signal; ,
Second calculation means for calculating a block field activity indicating vertical inter-pixel energy of a field structure in the second block unit for the incoming moving image signal;
A prediction type signal indicating whether each image unit of the moving image signal is encoded by the intra-picture encoding or the inter-picture prediction encoding, the block frame activity, and the block field activity are supplied; Based on a block field activity, an FI that is a value related to a field activity in the first block to be encoded is generated, and the prediction type signal in units of images including the first block to be encoded is When intra-picture encoding is indicated, FR1 that is a value related to frame activity in the first block to be encoded is generated based on the block frame activity, and the first block to be encoded The prediction type signal in units of images including the image When showing the predictive coding, on the basis of the said block frame activity is a value for a frame activity in the first block of the current coded, a generating means for generating a FR2 large becomes than the FR1,
When the FI is larger than the FR1 or the FI is larger than the FR2, the first block to be encoded is controlled to be encoded in a frame structure, and the FI is smaller than the FR1 or the When FI is smaller than FR2, control is performed to encode the first block to be encoded with a field structure, and when FI is equal to FR1 or when FI is equal to FR2, A moving picture coding apparatus comprising coding structure determination means for controlling the first block to be coded as a field structure or a frame structure.