WO2014112076A1 - Video encoding device - Google Patents
Video encoding device Download PDFInfo
- Publication number
- WO2014112076A1 WO2014112076A1 PCT/JP2013/050814 JP2013050814W WO2014112076A1 WO 2014112076 A1 WO2014112076 A1 WO 2014112076A1 JP 2013050814 W JP2013050814 W JP 2013050814W WO 2014112076 A1 WO2014112076 A1 WO 2014112076A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- previous
- referenceable
- flash
- difference
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/87—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving scene cut or scene change detection in combination with video compression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/142—Detection of scene cut or scene change
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
Definitions
- the present invention relates to a moving picture coding apparatus that performs coding using inter-frame prediction, and more particularly to a moving picture coding apparatus that can prevent image disturbance immediately after flashing.
- Inter-frame prediction is used in video compression.
- a flash image has a low degree of similarity with an image adjacent thereto, it is often inappropriate to use the flash image as a reference frame in inter-frame prediction. Therefore, it has been proposed to use the frame before the flash as the reference frame of the frame after the flash (see, for example, Patent Document 1).
- a flash image is detected by using two of the inter-frame difference value of the accumulated luminance value and the difference for each pixel of the line in the first direction / first direction.
- the difference for each pixel of the line in the first direction / first direction is too sensitive to movement, the difference increases only by moving one pixel from the previous frame, and false detection increases.
- the present invention has been made to solve the above-described problems, and an object of the present invention is to obtain a moving picture coding apparatus capable of preventing image disturbance immediately after flashing.
- the present invention is a moving image encoding apparatus that performs encoding using inter-frame prediction, and includes an encoding unit that encodes a difference value between an encoding target frame and a prediction frame, and a current picture of the encoding target frame , A scene change detection unit for detecting the presence or absence of a scene change for each of the previous and second previous referenceable pictures, and the luminance of a block corresponding between the current picture and the previous referenceable picture A luminance block accumulating unit that calculates X as the difference of each accumulated value accumulated by one picture, and the difference between the accumulated luminance values of the previous referenceable picture and the current picture as Y1 ,
- the difference between the accumulated values of the luminances of the previous referenceable picture and the current picture is Y2, and the previous referenceable picture and the current picture are (1)
- the current picture and the 1 are the difference between the accumulated luminance values of Y3, Y3, the luminance accumulation unit for calculating the difference between them, the first threshold value Xth, and
- the previous referenceable picture is determined to be a flash image, and (2) the one When a scene change is detected between the previous referenceable picture and the previous two referenceable pictures, and
- a flash determination unit that determines that the previous and second previous reference pictures are flash images, and refers to the first previous reference picture when the first previous reference picture is not determined to be a flash image;
- the prediction frame is calculated, and when the previous referenceable picture is determined to be a flash image, the prediction frame is calculated with reference to the two previous referenceable pictures, and the previous and second Prediction frame calculation for calculating the prediction frame with reference to the three previous referenceable pictures when the previous referenceable picture is determined to be a flash image And a section.
- the block size may be considered arbitrary. The larger the block size, the more moderate the increase / decrease of the difference value with respect to the motion.
- FIG. 1 is a block diagram showing a moving picture coding apparatus according to an embodiment of the present invention. This moving image encoding apparatus performs encoding using inter-frame prediction.
- the input moving image signal is a moving image signal (video signal) reproduced by a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times.
- a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times.
- An I picture is a picture that is encoded only with its own picture information and is the most beautiful but requires the most code amount.
- a P picture is a picture that is encoded only with its own picture information and is encoded with reference to a picture earlier than itself, and is more beautiful than a B picture, but has a larger amount of code than a B picture.
- a B picture is a picture that is encoded with reference to a past / future picture from itself, and the amount of code can be reduced most among three pictures.
- an I picture and a P picture are referenceable pictures that can be referred to when other pictures are encoded, while a B picture is a non-referenceable picture.
- Video interface 1 inputs an input video signal and outputs a frame to be encoded.
- the subtracter 2 calculates the difference between the encoding target frame and the predicted frame.
- the discrete cosine transformer 3 performs a discrete cosine transform (DCT) on the output of the subtracter 2 in units of a fixed block.
- the quantization unit 4 quantizes the DCT coefficient data obtained by the discrete cosine transform.
- the inverse quantization unit 5 performs inverse quantization on the quantized DCT coefficient data.
- the inverse discrete cosine transformer 6 performs inverse discrete cosine transform (inverse DCT) on the inversely quantized DCT coefficient data.
- the adder 7 adds the output of the inverse discrete cosine transformer 6 and the prediction frame, and stores the addition result in the video memory 8 as a referenceable picture.
- the motion compensation prediction unit 9 does not perform motion vector detection when the encoding target frame is an I picture, and outputs 0 as a prediction frame.
- the motion compensation prediction unit 9 starts from the encoding target frame. A motion vector is detected for each macroblock, and a prediction frame is generated.
- Motion vector detection is performed in units of macroblocks between the encoding target frame and the referenceable picture stored in the video memory 8. Specifically, by detecting the macroblock having the highest correlation of the referenceable picture with respect to the macroblock of the encoding target frame, the macroblock of the encoding target frame has moved from which macroblock of the referenceable picture. Is detected as a motion vector.
- the motion compensation prediction unit 9 performs motion compensation on the referenceable picture using this motion vector to calculate a prediction frame.
- the motion compensation prediction unit 9 also outputs a prediction mode of motion compensation prediction and prediction mode / motion vector information indicating a motion vector.
- the entropy encoding unit 10 encodes the quantized DCT coefficient data and the prediction mode / motion vector information. In this way, the entropy encoding unit 10 encodes the difference value between the encoding target frame and the prediction frame.
- the header / syntax adding unit 11 adds syntax information to the encoded result.
- a number ref-idx is assigned as syntax information to an image to be a predicted frame.
- Ref-idx is assigned in order from 0 in order of time closest to the current picture being processed.
- the scene change detection unit 12 detects the presence / absence of a scene change (SC) for each of the current picture of the encoding target frame, the previous one picture, and the second previous reference picture.
- FIG. 2 is a diagram showing the relationship between a flash image and a scene change.
- a scene change is detected between the current picture and the previous referenceable picture (pattern 1).
- the entire front surface of 2 pictures is a flash image, and 1/3 of the screen before 1 picture is a flash image (1/3 of the screen is white)
- the previous referenceable picture and the previous 2 pictures A scene change is detected in the referenceable picture (pattern 2).
- FIG. 3 is a diagram showing a calculation method of the luminance block accumulation unit.
- the luminance block accumulating unit 13 calculates, as X, a value obtained by accumulating the difference for each accumulated value of luminance of the corresponding block between the current picture and the previous referenceable picture for one picture.
- the block is a macroblock of 16 pixels ⁇ 16 pixels here, but is not limited thereto, and may be a block of 4 pixels ⁇ 4 pixels, 8 pixels ⁇ 8 pixels, 16 pixels ⁇ 32 pixels, or the like.
- FIG. 4 is a diagram illustrating a calculation method of the luminance accumulation unit.
- the luminance accumulation unit 14 sets Y1 as the difference between the accumulated luminance values of the previous referenceable picture and the current picture, and determines the difference between the accumulated luminance values of the previous referenceable picture and the current picture.
- Y2 is set as Y2
- the difference between the accumulated values of the luminance of the three previous referenceable pictures and the current picture is set as Y3, and the difference between them is calculated.
- the flash determination unit 15 determines the flash image as follows by setting the first threshold value to Xth and the second threshold value to Yth. (1) When a scene change is detected between the current picture and the previous referenceable picture, and
- the flash determination unit 15 detects the flash image
- the flash determination unit 15 instructs to change the predicted frame to the frame immediately before the flash image.
- the motion compensated prediction unit 9 calculates a prediction frame by referring to the previous referenceable picture when the previous referenceable picture is not determined to be a flash image, and calculates the previous frame.
- a prediction frame is calculated with reference to the previous referenceable picture, and when the previous and second previous referenceable pictures are determined to be flash images, 3 is calculated.
- a prediction frame is calculated with reference to the previous referenceable picture.
- FIG. 5 and 6 are diagrams showing examples of selecting pictures to be referred to when calculating a prediction frame.
- FIG. 5 shows a case where the interval M to the referenceable picture (I or P picture) is 1.
- the previous picture is referred to, and in the case of 2 picture flash, the previous picture is referred to so that the flash image is not referred to.
- FIG. 6 shows a case where M is 3, the interval N to the next I picture is 15, and the fourth P picture is a flash image.
- the 1st I picture is referred to without referring to the 4th P picture.
- the I picture is a flash picture
- the picture immediately after that is changed to a P picture and encoded. After that, encoding is performed with a GOP configuration adapted to the change. Although the immediately following picture may be an I picture, a large amount of codes are generated.
- the B picture is a flash image, the B picture is not referred to in the first place, so that encoding is performed as usual.
- the flash determination unit 15 detects the flash image, the flash determination unit 15 instructs the header / syntax adding unit 11 to change ref-idx of the frame immediately before the flash image to 0.
- the calculation result X of the luminance block accumulating unit 13 is more responsive to the motion of the picture (increase / decrease in the difference value) than the accumulated luminance difference for each pixel.
- the difference between Y1, Y2, and Y3 calculated by the luminance accumulation unit 14 is more responsive to the motion of the picture (increase / decrease in the difference value) than the calculation result X of the luminance block accumulation unit 13. Accordingly, since the flash image is determined for each scene change calculation pattern based on these calculation results, erroneous detection of the flash image is reduced. Further, it is possible to cope with a two-picture flash in which a flash is generated over two frames. For this reason, the disturbance of the image immediately after the flash can be prevented.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
本発明は、フレーム間予測を用いた符号化を行う動画像符号化装置に関し、特にフラッシュ直後の画像の乱れを防ぐことができる動画像符号化装置に関する。 The present invention relates to a moving picture coding apparatus that performs coding using inter-frame prediction, and more particularly to a moving picture coding apparatus that can prevent image disturbance immediately after flashing.
動画像の圧縮においてフレーム間予測が用いられる。しかし、フラッシュ画像はそれに隣接する画像との類似度が低いため、フレーム間予測においてフラッシュ画像を参照フレームとするのは不適切な場合が多い。そこで、フラッシュの前のフレームをフラッシュ後のフレームの参照フレームとすることが提案されている(例えば、特許文献1参照)。 Inter-frame prediction is used in video compression. However, since a flash image has a low degree of similarity with an image adjacent thereto, it is often inappropriate to use the flash image as a reference frame in inter-frame prediction. Therefore, it has been proposed to use the frame before the flash as the reference frame of the frame after the flash (see, for example, Patent Document 1).
従来技術では、輝度値の累積値のフレーム間差分値と第1方向/第1方向のラインの画素ごとの差分の2つを用いて、フラッシュ画像を検出する。しかし、第1方向/第1方向のラインの画素ごとの差分は動きに敏感すぎるため、前のフレームから1画素動くだけで差分が大きくなり、誤検出が多くなる。また、輝度値の累積値のフレーム間差分値のみを用いた場合でも、誤検出が多くなる。これにより、フラッシュ直後の画像が乱れるという問題があった。 In the prior art, a flash image is detected by using two of the inter-frame difference value of the accumulated luminance value and the difference for each pixel of the line in the first direction / first direction. However, since the difference for each pixel of the line in the first direction / first direction is too sensitive to movement, the difference increases only by moving one pixel from the previous frame, and false detection increases. Even when only the inter-frame difference value of the accumulated luminance value is used, erroneous detection increases. As a result, there is a problem that the image immediately after the flash is disturbed.
本発明は、上述のような課題を解決するためになされたもので、その目的はフラッシュ直後の画像の乱れを防ぐことができる動画像符号化装置を得るものである。 The present invention has been made to solve the above-described problems, and an object of the present invention is to obtain a moving picture coding apparatus capable of preventing image disturbance immediately after flashing.
本発明は、フレーム間予測を用いた符号化を行う動画像符号化装置であって、符号化対象フレームと予測フレームとの差分値を符号化する符号化部と、符号化対象フレームのカレントピクチャ、その1つ前及び2つ前の参照可能ピクチャについてそれぞれシーンチェンジの有無を検出するシーンチェンジ検出部と、前記カレントピクチャと前記1つ前の参照可能ピクチャとの間で対応するブロックの輝度の累算値ごとの差を1ピクチャ分累算したものをXとして算出する輝度ブロック累算部と、前記1つ前の参照可能ピクチャと前記カレントピクチャの輝度の累算値同士の差分をY1とし、前記2つ前の参照可能ピクチャと前記カレントピクチャの輝度の累算値同士の差分をY2とし、前記3つ前の参照可能ピクチャと前記カレントピクチャの輝度の累算値同士の差分をY3として、互いの差を算出する輝度累算部と、第1の閾値をXthとし、第2の閾値をYthとして、(1)前記カレントピクチャと前記1つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y2|≧Ythの場合、前記1つ前の参照可能ピクチャをフラッシュ画像と判定し、(2)前記1つ前の参照可能ピクチャと前記2つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y2-Y3|≧Ythの場合、前記1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定し、(3)前記カレントピクチャと前記1つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y3|≧Ythの場合、前記1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定し、(4)前記カレントピクチャと前記2つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y3|≧Ythの場合、前記1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定するフラッシュ判定部と、前記1つ前の参照可能ピクチャがフラッシュ画像と判定されなかった場合に前記1つ前の参照可能ピクチャを参照して前記予測フレームを算出し、前記1つ前の参照可能ピクチャがフラッシュ画像と判定された場合に前記2つ前の参照可能ピクチャを参照して前記予測フレームを算出し、前記1つ前及び2つ前の参照可能ピクチャがフラッシュ画像と判定された場合に前記3つ前の参照可能ピクチャを参照して前記予測フレームを算出する予測フレーム算出部とを備えることを特徴とする。なお、ここでブロックのサイズは任意と考えてよい。ブロックのサイズが大きいほど動きに対する差分値の増減が緩やかになる。 The present invention is a moving image encoding apparatus that performs encoding using inter-frame prediction, and includes an encoding unit that encodes a difference value between an encoding target frame and a prediction frame, and a current picture of the encoding target frame , A scene change detection unit for detecting the presence or absence of a scene change for each of the previous and second previous referenceable pictures, and the luminance of a block corresponding between the current picture and the previous referenceable picture A luminance block accumulating unit that calculates X as the difference of each accumulated value accumulated by one picture, and the difference between the accumulated luminance values of the previous referenceable picture and the current picture as Y1 , The difference between the accumulated values of the luminances of the previous referenceable picture and the current picture is Y2, and the previous referenceable picture and the current picture are (1) The current picture and the 1 are the difference between the accumulated luminance values of Y3, Y3, the luminance accumulation unit for calculating the difference between them, the first threshold value Xth, and the second threshold value Yth. If a scene change is detected in the previous referenceable picture and | X | ≧ Xth and | Y1-Y2 | ≧ Yth, the previous referenceable picture is determined to be a flash image, and (2) the one When a scene change is detected between the previous referenceable picture and the previous two referenceable pictures, and | X | ≧ Xth and | Y2-Y3 | ≧ Yth, the previous and second previous referenceable pictures are (3) When a scene change is detected in the current picture and the previous referenceable picture, and | X | ≧ Xth and | Y1-Y3 | ≧ Yth, the previous and second Previous It is determined that the illuminable picture is a flash image. (4) If a scene change is detected in the current picture and the previous referenceable picture, and | X | ≧ Xth and | Y1-Y3 | ≧ Yth, A flash determination unit that determines that the previous and second previous reference pictures are flash images, and refers to the first previous reference picture when the first previous reference picture is not determined to be a flash image; The prediction frame is calculated, and when the previous referenceable picture is determined to be a flash image, the prediction frame is calculated with reference to the two previous referenceable pictures, and the previous and second Prediction frame calculation for calculating the prediction frame with reference to the three previous referenceable pictures when the previous referenceable picture is determined to be a flash image And a section. Here, the block size may be considered arbitrary. The larger the block size, the more moderate the increase / decrease of the difference value with respect to the motion.
本発明により、フラッシュ直後の画像の乱れを防ぐことができる。 According to the present invention, it is possible to prevent image disturbance immediately after flashing.
本発明の実施の形態に係る動画像符号化装置について図面を参照して説明する。図1は、本発明の実施の形態に係る動画像符号化装置を示すブロック図である。この動画像符号化装置はフレーム間予測を用いた符号化を行う。 A moving picture encoding apparatus according to an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a moving picture coding apparatus according to an embodiment of the present invention. This moving image encoding apparatus performs encoding using inter-frame prediction.
入力動画像信号は、同じ信号を繰り返し複数回再生可能なディジタルVTRやDVDシステムのようなビデオ記録再生機器によって再生される動画像信号(映像信号)である。入力動画像信号の符号化対象フレームにはIピクチャ、Pピクチャ及びBピクチャの3種類の画像タイプがある。 The input moving image signal is a moving image signal (video signal) reproduced by a video recording / reproducing device such as a digital VTR or a DVD system capable of repeatedly reproducing the same signal a plurality of times. There are three types of image types of I picture, P picture, and B picture in the encoding target frame of the input video signal.
Iピクチャは、自分のピクチャ情報のみで符号化を行うピクチャであり、最も綺麗だが、最も符号量を多く必要とする。Pピクチャは、自分のピクチャ情報のみの符号化と自分より過去のピクチャを参照して符号化を行うピクチャであり、Bピクチャより綺麗だが、Bピクチャより符号量が多い。Bピクチャは、自分より過去・未来のピクチャを参照して符号化を行うピクチャであり、3ピクチャ中最も符号量を少なくできる。また、一般的には、IピクチャとPピクチャは他のピクチャを符号化する際に参照できる参照可能ピクチャであるが、Bピクチャは参照不可ピクチャである。 An I picture is a picture that is encoded only with its own picture information and is the most beautiful but requires the most code amount. A P picture is a picture that is encoded only with its own picture information and is encoded with reference to a picture earlier than itself, and is more beautiful than a B picture, but has a larger amount of code than a B picture. A B picture is a picture that is encoded with reference to a past / future picture from itself, and the amount of code can be reduced most among three pictures. In general, an I picture and a P picture are referenceable pictures that can be referred to when other pictures are encoded, while a B picture is a non-referenceable picture.
ビデオインターフェース1は入力動画像信号を入力し、符号化対象フレームを出力する。減算器2は、符号化対象フレームと予測フレームとの差を算出する。離散コサイン変換器3は、減算器2の出力を一定の大きさのブロック単位で離散コサイン変換(DCT)する。量子化部4は、離散コサイン変換により得られたDCT係数データを量子化する。
逆量子化部5は、量子化されたDCT係数データを逆量子化する。逆離散コサイン変換器6は、逆量子化されたDCT係数データを逆離散コサイン変換(逆DCT)する。加算器7は、逆離散コサイン変換器6の出力と予測フレームを加算し、その加算結果を参照可能ピクチャとしてビデオメモリ8に記憶させる。
The
動き補償予測部9は符号化対象フレームがIピクチャの場合は動きベクトル検出を行わず、予測フレームとして0を出力し、符号化対象フレームがPピクチャ又はBピクチャの場合は、符号化対象フレームからマクロブロック毎に動きベクトルを検出し、予測フレームを生成する。
The motion
動きベクトルの検出は、符号化対象フレームとビデオメモリ8に記憶されている参照可能ピクチャとの間でマクロブロック単位で行われる。具体的には、符号化対象フレームのマクロブロックに対して参照可能ピクチャの最も相関の高いマクロブロックを検出することにより、符号化対象フレームのマクロブロックが参照可能ピクチャのどのマクロブロックから動いたものかを示す情報が動きベクトルとして検出される。動き補償予測部9は、この動きベクトルを用いて参照可能ピクチャに対して動き補償を行って予測フレームを算出する。
Motion vector detection is performed in units of macroblocks between the encoding target frame and the referenceable picture stored in the
動き補償予測部9からは動き補償予測の予測モードと動きベクトルを示す予測モード/動きベクトル情報も出力される。エントロピー符号化部10は、量子化されたDCT係数データと予測モード/動きベクトル情報をそれぞれ符号化する。このようにしてエントロピー符号化部10は、符号化対象フレームと予測フレームとの差分値を符号化する。
The motion
ヘッダ/シンタクス付加部11は、符号化結果にシンタクス情報を付加する。予測フレームとする画像には、シンタクス情報としてref-idxという番号を割り当てる。現在処理しているカレントピクチャに時間的に近い順にref-idxを0から順に振っていく。
The header /
シーンチェンジ検出部12は、符号化対象フレームのカレントピクチャ、1つ前及び2つ前の参照可能ピクチャについてそれぞれシーンチェンジ(SC)の有無を検出する。図2はフラッシュ画像とシーンチェンジの関係を示す図である。1ピクチャ前のみ全面がフラッシュ画像である場合、カレントピクチャと1つ前の参照可能ピクチャでシーンチェンジが検出される(パターン1)。2ピクチャ前の全面がフラッシュ画像であり、1ピクチャ前の画面の1/3がフラッシュ画像(画面の1/3が白くなっている)である場合、1つ前の参照可能ピクチャと2つ前の参照可能ピクチャでシーンチェンジが検出される(パターン2)。2ピクチャ前の画面の1/3がフラッシュ画像であり、1ピクチャ前の全面がフラッシュ画像である場合、カレントピクチャと1つ前の参照可能ピクチャでシーンチェンジが検出される(パターン3)。2ピクチャ前の全面がフラッシュ画像であり、1ピクチャ前の画面の2/3がフラッシュ画像(画面の2/3が白くなっている)である場合、カレントピクチャと2つ前の参照可能ピクチャでシーンチェンジが検出される(パターン4)。2ピクチャ前の画面の2/3がフラッシュ画像であり、1ピクチャ前の全面がフラッシュ画像である場合、カレントピクチャと2つ前の参照可能ピクチャでシーンチェンジが検出される(パターン5)。
The scene
図3は輝度ブロック累算部の算出方法を示す図である。輝度ブロック累算部13は、カレントピクチャと1つ前の参照可能ピクチャとの間で対応するブロックの輝度の累算値ごとの差を1ピクチャ分累算したものをXとして算出する。なお、ブロックは、ここでは16画素x16画素のマクロブロックであるが、これに限らず4画素x4画素、8画素x8画素、16画素x32画素等のブロックでもよい。
FIG. 3 is a diagram showing a calculation method of the luminance block accumulation unit. The luminance
図4は輝度累算部の算出方法を示す図である。輝度累算部14は、1つ前の参照可能ピクチャとカレントピクチャの輝度の累算値同士の差分をY1とし、2つ前の参照可能ピクチャとカレントピクチャの輝度の累算値同士の差分をY2とし、3つ前の参照可能ピクチャとカレントピクチャの輝度の累算値同士の差分をY3として、互いの差を算出する。
FIG. 4 is a diagram illustrating a calculation method of the luminance accumulation unit. The
フラッシュ判定部15は、第1の閾値をXthとし、第2の閾値をYthとして、以下のようにフラッシュ画像を判定する。
(1)カレントピクチャと1つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y2|≧Ythの場合、1つ前の参照可能ピクチャをフラッシュ画像と判定する(図2のパターン1に該当)。
(2)1つ前の参照可能ピクチャと2つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y2-Y3|≧Ythの場合、1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定する(図2のパターン2に該当)。
(3)カレントピクチャと1つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y3|≧Ythの場合、1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定する(図2のパターン3に該当)。
(4)カレントピクチャと2つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y3|≧Ythの場合、1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定する(図2のパターン4,5に該当)。
The
(1) When a scene change is detected between the current picture and the previous referenceable picture, and | X | ≧ Xth and | Y1-Y2 | ≧ Yth, the previous referenceable picture is determined to be a flash image ( Corresponds to
(2) When a scene change is detected in the previous referenceable picture and the previous referenceable picture, and | X | ≧ Xth and | Y2-Y3 | ≧ Yth, the previous and second previous references A possible picture is determined as a flash image (corresponding to
(3) When a scene change is detected between the current picture and the previous referenceable picture, and | X | ≧ Xth and | Y1-Y3 | ≧ Yth, the previous and second previous referenceable pictures are flash images. (Corresponding to
(4) When a scene change is detected between the current picture and the previous referenceable picture, and | X | ≧ Xth and | Y1-Y3 | ≧ Yth, the previous and second previous referenceable pictures are flash images. (Corresponding to
そして、フラッシュ判定部15は、フラッシュ画像を検出すると、フラッシュ画像の直前のフレームに予測フレームを変更するように指示する。この指示を受けた動き補償予測部9は、1つ前の参照可能ピクチャがフラッシュ画像と判定されなかった場合に1つ前の参照可能ピクチャを参照して予測フレームを算出し、1つ前の参照可能ピクチャがフラッシュ画像と判定された場合に2つ前の参照可能ピクチャを参照して予測フレームを算出し、1つ前及び2つ前の参照可能ピクチャがフラッシュ画像と判定された場合に3つ前の参照可能ピクチャを参照して予測フレームを算出する。
Then, when the
図5及び図6は、予測フレームを算出する際に参照するピクチャの選択例を示す図である。図5は参照可能ピクチャ(I又はPピクチャ)までの間隔Mが1の場合である。1ピクチャフラッシュの場合には1つ前のピクチャを参照し、2ピクチャフラッシュの場合には2つ前のピクチャを参照して、フラッシュ画像を参照しないようにする。 5 and 6 are diagrams showing examples of selecting pictures to be referred to when calculating a prediction frame. FIG. 5 shows a case where the interval M to the referenceable picture (I or P picture) is 1. In the case of 1 picture flash, the previous picture is referred to, and in the case of 2 picture flash, the previous picture is referred to so that the flash image is not referred to.
図6はMが3、次のIピクチャまでの間隔Nが15であり、4番のPピクチャがフラッシュ画像の場合である。7番のPピクチャを符号化する際には4番のPピクチャを参照せず、1番のIピクチャを参照する。 FIG. 6 shows a case where M is 3, the interval N to the next I picture is 15, and the fourth P picture is a flash image. When the 7th P picture is encoded, the 1st I picture is referred to without referring to the 4th P picture.
もしIピクチャがフラッシュ画像だった場合、その直後のピクチャをPピクチャに変更して符号化を行う。それ以降はその変更に合わせたGOP構成で符号化を行う。なお、直後のピクチャをIピクチャにしてもよいが、符号が大量に発生してしまう。また、Bピクチャがフラッシュ画像だった場合、そもそもBピクチャは参照されないので、通常通り符号化を行う。 If the I picture is a flash picture, the picture immediately after that is changed to a P picture and encoded. After that, encoding is performed with a GOP configuration adapted to the change. Although the immediately following picture may be an I picture, a large amount of codes are generated. If the B picture is a flash image, the B picture is not referred to in the first place, so that encoding is performed as usual.
また、参照する画像のref-idxが小さいほど符号量が軽減されるため、参照しないフラッシュ画像に一番符号量が有利になる番号ref-idx=0を割り当てるのはもったいない。そこで、フラッシュ判定部15は、フラッシュ画像を検出すると、フラッシュ画像の直前のフレームのref-idxを0に変更するようにヘッダ/シンタクス付加部11に指示する。
Also, since the code amount is reduced as the ref-idx of the referenced image is smaller, it is a waste to assign the number ref-idx = 0 that gives the most advantageous code amount to the flash image that is not referenced. Therefore, when the
本実施の形態において、輝度ブロック累算部13の算出結果Xは、1画素ごとに輝度の差を累積したものよりもピクチャの動きに対して反応(差分値の増減)が緩やかになる。輝度累算部14が算出するY1,Y2,Y3の互いの差は、輝度ブロック累算部13の算出結果Xよりも更にピクチャの動きに対して反応(差分値の増減)が緩やかになる。従って、これらの算出結果に基づいてシーンチェンジの算出パターンごとにフラッシュ画像を判定するため、フラッシュ画像の誤検出が少なくなる。また、2つのフレームにわたってフラッシュが発生する2ピクチャフラッシュにも対応できる。このため、フラッシュ直後の画像の乱れを防ぐことができる。
In the present embodiment, the calculation result X of the luminance
9 動き補償予測部(予測フレーム算出部)、10 エントロピー符号化部(符号化部)、12 シーンチェンジ検出部、13 輝度ブロック累算部、14 輝度累算部、15 フラッシュ判定部 9 motion compensated prediction unit (predicted frame calculation unit), 10 entropy coding unit (coding unit), 12 scene change detection unit, 13 luminance block accumulation unit, 14 luminance accumulation unit, 15 flash determination unit
Claims (1)
符号化対象フレームと予測フレームとの差分値を符号化する符号化部と、
前記符号化対象フレームのカレントピクチャ、その1つ前及び2つ前の参照可能ピクチャについてそれぞれシーンチェンジの有無を検出するシーンチェンジ検出部と、
前記カレントピクチャと前記1つ前の参照可能ピクチャとの間で対応するブロックの輝度の累算値ごとの差を1ピクチャ分累算したものをXとして算出する輝度ブロック累算部と、
前記1つ前の参照可能ピクチャと前記カレントピクチャの輝度の累算値同士の差分をY1とし、前記2つ前の参照可能ピクチャと前記カレントピクチャの輝度の累算値同士の差分をY2とし、前記3つ前の参照可能ピクチャと前記カレントピクチャの輝度の累算値同士の差分をY3として、互いの差を算出する輝度累算部と、
第1の閾値をXthとし、第2の閾値をYthとして、(1)前記カレントピクチャと前記1つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y2|≧Ythの場合、前記1つ前の参照可能ピクチャをフラッシュ画像と判定し、(2)前記1つ前の参照可能ピクチャと前記2つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y2-Y3|≧Ythの場合、前記1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定し、(3)前記カレントピクチャと前記1つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y3|≧Ythの場合、前記1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定し、(4)前記カレントピクチャと前記2つ前の参照可能ピクチャでシーンチェンジが検出され、|X|≧Xthかつ|Y1-Y3|≧Ythの場合、前記1つ前及び2つ前の参照可能ピクチャをフラッシュ画像と判定するフラッシュ判定部と、
前記1つ前の参照可能ピクチャがフラッシュ画像と判定されなかった場合に前記1つ前の参照可能ピクチャを参照して前記予測フレームを算出し、前記1つ前の参照可能ピクチャがフラッシュ画像と判定された場合に前記2つ前の参照可能ピクチャを参照して前記予測フレームを算出し、前記1つ前及び2つ前の参照可能ピクチャがフラッシュ画像と判定された場合に前記3つ前の参照可能ピクチャを参照して前記予測フレームを算出する予測フレーム算出部とを備えることを特徴とする動画像符号化装置。 A video encoding device that performs encoding using inter-frame prediction,
An encoding unit that encodes a difference value between the encoding target frame and the prediction frame;
A scene change detection unit for detecting the presence or absence of a scene change for the current picture of the encoding target frame, the previous and second referenceable pictures,
A luminance block accumulating unit that calculates, as X, a difference obtained by accumulating the luminance value of the corresponding block between the current picture and the previous referenceable picture for one picture;
The difference between the accumulated luminance values of the previous referenceable picture and the current picture is Y1, and the difference between the accumulated luminance values of the previous referenceable picture and the current picture is Y2. A luminance accumulation unit for calculating a difference between Y3 as a difference between the luminance accumulation values of the three previous referenceable pictures and the current picture;
When the first threshold value is Xth and the second threshold value is Yth, (1) a scene change is detected in the current picture and the previous referenceable picture, and | X | ≧ Xth and | Y1-Y2 | ≧ In the case of Yth, the previous referenceable picture is determined as a flash image, and (2) a scene change is detected in the previous referenceable picture and the second previous referenceable picture, and | X | ≧ If Xth and | Y2-Y3 | ≧ Yth, the previous and second previous referenceable pictures are determined to be flash images, and (3) a scene change occurs between the current picture and the previous previous referenceable picture. If | X | ≧ Xth and | Y1-Y3 | ≧ Yth, the previous and second previous referenceable pictures are determined to be flash images, and (4) the current picture and the previous When a scene change is detected in the previous referenceable picture, and | X | ≧ Xth and | Y1−Y3 | ≧ Yth, the flash determination for determining the previous and second previous referenceable pictures as flash images And
When the previous referenceable picture is not determined to be a flash picture, the prediction frame is calculated with reference to the previous referenceable picture, and the previous referenceable picture is determined to be a flash picture. The prediction frame is calculated by referring to the previous two referenceable pictures, and the third previous reference is determined when the previous and second previous referenceable pictures are determined to be flash images. A motion picture encoding apparatus comprising: a prediction frame calculation unit that calculates the prediction frame with reference to a possible picture.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2013/050814 WO2014112076A1 (en) | 2013-01-17 | 2013-01-17 | Video encoding device |
| JP2014557244A JP5880737B2 (en) | 2013-01-17 | 2013-01-17 | Video encoding device |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2013/050814 WO2014112076A1 (en) | 2013-01-17 | 2013-01-17 | Video encoding device |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2014112076A1 true WO2014112076A1 (en) | 2014-07-24 |
Family
ID=51209197
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2013/050814 Ceased WO2014112076A1 (en) | 2013-01-17 | 2013-01-17 | Video encoding device |
Country Status (2)
| Country | Link |
|---|---|
| JP (1) | JP5880737B2 (en) |
| WO (1) | WO2014112076A1 (en) |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH07193818A (en) * | 1993-12-27 | 1995-07-28 | Sony Corp | Image processing method and image processing apparatus |
| JP2000115778A (en) * | 1998-09-25 | 2000-04-21 | Sarnoff Corp | Detection and coding of flash frame in video data |
| JP2002010270A (en) * | 2000-06-27 | 2002-01-11 | Mitsubishi Electric Corp | Image encoding apparatus and image encoding method |
| JP2006115336A (en) * | 2004-10-15 | 2006-04-27 | Canon Inc | Video encoding apparatus and control method thereof |
| JP2007166408A (en) * | 2005-12-15 | 2007-06-28 | Toshiba Corp | Image processing apparatus and image processing method |
| JP2011205354A (en) * | 2010-03-25 | 2011-10-13 | Renesas Electronics Corp | Moving picture coder, moving picture coding method and program |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4585981B2 (en) * | 2006-03-10 | 2010-11-24 | キヤノン株式会社 | Image encoding method and image encoding apparatus |
| US20110211633A1 (en) * | 2008-11-12 | 2011-09-01 | Ferran Valldosera | Light change coding |
| EP2421249A4 (en) * | 2009-04-16 | 2013-02-20 | Panasonic Corp | IMAGING DEVICE, EXTERNAL FLASHING DETECTION METHOD, PROGRAM, AND INTEGRATED CIRCUIT |
-
2013
- 2013-01-17 WO PCT/JP2013/050814 patent/WO2014112076A1/en not_active Ceased
- 2013-01-17 JP JP2014557244A patent/JP5880737B2/en active Active
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH07193818A (en) * | 1993-12-27 | 1995-07-28 | Sony Corp | Image processing method and image processing apparatus |
| JP2000115778A (en) * | 1998-09-25 | 2000-04-21 | Sarnoff Corp | Detection and coding of flash frame in video data |
| JP2002010270A (en) * | 2000-06-27 | 2002-01-11 | Mitsubishi Electric Corp | Image encoding apparatus and image encoding method |
| JP2006115336A (en) * | 2004-10-15 | 2006-04-27 | Canon Inc | Video encoding apparatus and control method thereof |
| JP2007166408A (en) * | 2005-12-15 | 2007-06-28 | Toshiba Corp | Image processing apparatus and image processing method |
| JP2011205354A (en) * | 2010-03-25 | 2011-10-13 | Renesas Electronics Corp | Moving picture coder, moving picture coding method and program |
Also Published As
| Publication number | Publication date |
|---|---|
| JPWO2014112076A1 (en) | 2017-01-19 |
| JP5880737B2 (en) | 2016-03-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US7171052B2 (en) | Apparatus and method for correcting motion of image | |
| JP5197591B2 (en) | VIDEO ENCODING METHOD AND DECODING METHOD, DEVICE THEREOF, THEIR PROGRAM, AND RECORDING MEDIUM CONTAINING THE PROGRAM | |
| JP5426655B2 (en) | Compressed video encoding device, compressed video decoding device, compressed video encoding method, and compressed video decoding method | |
| JP2007235291A (en) | Adaptive quantization apparatus and adaptive quantization program | |
| KR100561398B1 (en) | Apparatus and method for detecting and compensating for luminance changes for each section in a moving picture | |
| EP3547684A1 (en) | Method, device and system for method of encoding a sequence of frames in a video stream | |
| US9066100B2 (en) | Image coding apparatus and image coding method | |
| US20170070555A1 (en) | Video data flow compression method | |
| JP2006135557A (en) | Moving picture encoding device | |
| JP5178616B2 (en) | Scene change detection device and video recording device | |
| JP2001251627A (en) | Encoding device, encoding method, and recording medium recording program | |
| JP3778208B2 (en) | Image coding apparatus and image coding method | |
| JP5880737B2 (en) | Video encoding device | |
| US20070140336A1 (en) | Video coding device and image recording/reproducing device | |
| JP3732760B2 (en) | Object recognition apparatus and object recognition method | |
| JP4911625B2 (en) | Image processing apparatus and imaging apparatus equipped with the same | |
| JPH07184206A (en) | Image coding device | |
| JP2008199521A (en) | Image processing apparatus and method | |
| KR101319916B1 (en) | Motion estimation method and apparatus, and image encoding method and apparatus using the same | |
| JP4690250B2 (en) | Fade detection device | |
| JP5718438B2 (en) | Compressed video encoding device, compressed video decoding device, compressed video encoding method, and compressed video decoding method | |
| KR20240084469A (en) | A method and device for prunning a video sequence | |
| JP2006086684A (en) | Motion picture encoder and encoding program | |
| KR101021538B1 (en) | H.264 High Speed Intra Mode Decision Method | |
| JP5268666B2 (en) | Image encoding device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13871567 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2014557244 Country of ref document: JP Kind code of ref document: A |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 13871567 Country of ref document: EP Kind code of ref document: A1 |