JP2001359096A

JP2001359096A - Image coding device

Info

Publication number: JP2001359096A
Application number: JP2000145904A
Authority: JP
Inventors: Toshiyuki Kondo; 敏志近藤; Kojiro Kawasaki; 弘二郎川▲崎▼; Hideki Fukuda; 秀樹福田; Kazuhiko Nakamura; 和彦中村; Shiro Iwasaki; 史朗岩▲崎▼
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1999-06-08
Filing date: 2000-05-18
Publication date: 2001-12-26

Abstract

(57)【要約】【課題】入力映像フレームに含まれる高周波成分に応
じた解像度で符号化を行うことにより、より少ない符号
量で高画質の映像を得る。【解決手段】入力映像フレームは、周波数帯域分析器
１０１に入力され、空間領域から周波数領域に変換され
る。解像度決定器１０２は、周波数帯域分析器１０１の
出力を分析し、それにより映像フレームの解像度を決定
する。解像度決定器１０２で決定された解像度に基づい
て、解像度変換器１０３は映像フレームに対して解像度
変換を行う。そしてその後、ブロック化器１０４、ＤＣ
Ｔ変換器１０５、量子化器１０６、可変長符号化器１０
７により符号化処理を施されて符号列として出力され
る。 (57) [Summary] [Problem] To obtain a high quality video with a smaller code amount by performing encoding at a resolution corresponding to a high frequency component included in an input video frame. An input video frame is input to a frequency band analyzer 101, and is converted from a spatial domain to a frequency domain. The resolution determiner 102 analyzes the output of the frequency band analyzer 101 and thereby determines the resolution of the video frame. Based on the resolution determined by the resolution determiner 102, the resolution converter 103 performs resolution conversion on the video frame. And then, the blocking unit 104, DC
T converter 105, quantizer 106, variable length encoder 10
7, and is output as a code string.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、映像信号を圧縮符
号化するための画像符号化装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image encoding apparatus for compressing and encoding a video signal.

【０００２】[0002]

【従来の技術】近年、映像信号の高能率圧縮符号化方式
の１つとしてＭＰＥＧ２（Moving Picutre Experts Gro
up 2）方式が広く知られている。ＭＰＥＧ２方式では、
映像フレーム（フィールド）を参照画像を用いずにフレ
ーム内符号化を行うＩ（intra）フレームと、フレーム
間予測符号化を行うＰ（predictive）又はＢ（bidirect
ionally predictive）フレームとに分類して符号化を行
う。また、各フレームは１６×１６画素の大きさを有す
るマクロブロックと呼ばれる単位に分割され、更に８×
８画素の大きさを有するブロックを単位として離散コサ
イン変換（Discrete Cosine Transform：ＤＣＴ）を施
すことにより符号化を行う。2. Description of the Related Art In recent years, MPEG2 (Moving Picture Experts Groove) has been used as one of high-efficiency compression coding methods for video signals.
up 2) The method is widely known. In the MPEG2 system,
An I (intra) frame for performing intra-frame encoding of a video frame (field) without using a reference image, and a P (predictive) or B (bidirect) for performing inter-frame predictive encoding
and encoding is performed. Each frame is divided into units called macroblocks having a size of 16 × 16 pixels, and further divided into 8 × 16 pixels.
Encoding is performed by performing discrete cosine transform (DCT) on a block having a size of 8 pixels.

【０００３】[0003]

【発明が解決しようとする課題】特開平９−２７１０２
６号公報に記載された従来技術によれば、常に発生符号
量を所定の符号量以内に抑えるために、量子化スケール
（量子化ステップの大きさ）と発生符号量との積が大き
くなったときや、バッファ占有量が大きくなったとき、
すなわち符号化が難しい複雑な映像が入力された場合に
は、以後の入力映像信号の解像度を落とす。したがっ
て、例えば発生符号量が所定の符号量を超えそうになっ
た場合には、入力映像信号が高域成分を多く含んでいて
も解像度を落としてしまうため、画質劣化が生じてしま
う。Problems to be Solved by the Invention
According to the related art described in Japanese Patent Application Laid-Open No. 6-64, the product of the quantization scale (quantization step size) and the generated code amount is increased in order to always keep the generated code amount within a predetermined code amount. When the buffer occupancy increases,
That is, when a complex video that is difficult to encode is input, the resolution of the subsequent input video signal is reduced. Therefore, for example, when the generated code amount is about to exceed the predetermined code amount, the resolution is lowered even if the input video signal contains many high-frequency components, so that the image quality is deteriorated.

【０００４】本発明の目的は、画質劣化を最小限に抑え
ながら発生符号量を大きく減らすことができる画像符号
化装置を提供することにある。An object of the present invention is to provide an image coding apparatus capable of greatly reducing the amount of generated codes while minimizing image quality deterioration.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、本発明は、入力映像信号の特性に応じた解像度変更
を行うこととしたものである。例えば、ある実施形態に
よれば、符号化が難しい複雑な映像が入力された場合に
は入力映像信号の解像度を上げ、単調な映像が入力され
た場合には入力映像信号の解像度を下げる。なお、詳細
は後述する。In order to achieve the above object, the present invention is to change the resolution according to the characteristics of an input video signal. For example, according to an embodiment, the resolution of an input video signal is increased when a complex video that is difficult to encode is input, and the resolution of the input video signal is reduced when a monotonous video is input. The details will be described later.

【０００６】[0006]

【発明の実施の形態】以下、図面を参照しながら本発明
の実施形態について説明する。Embodiments of the present invention will be described below with reference to the drawings.

【０００７】（第１の実施形態）本発明の第１の実施形
態を図１を用いて説明する。図１は、遅延器１００、周
波数帯域分析器１０１、解像度決定器１０２、解像度変
換器１０３、圧縮符号化器１１０から構成された画像符
号化装置のブロック図である。圧縮符号化器１１０は、
ブロック化器１０４、ＤＣＴ変換器１０５、量子化器１
０６、可変長符号化器１０７から構成される。ここでは
説明の簡単化のためにフレーム内符号化の場合のブロッ
ク図を示している。(First Embodiment) A first embodiment of the present invention will be described with reference to FIG. FIG. 1 is a block diagram of an image encoding device including a delay unit 100, a frequency band analyzer 101, a resolution determiner 102, a resolution converter 103, and a compression encoder 110. The compression encoder 110
Blocker 104, DCT transformer 105, quantizer 1
06, the variable length encoder 107. Here, a block diagram in the case of intra-frame encoding is shown for simplification of description.

【０００８】ディジタル信号である入力映像信号は、ま
ず周波数帯域分析器１０１に入力される。周波数帯域分
析器１０１では、映像フレームを空間領域から周波数領
域に変換する。周波数領域への変換は、例えば映像信号
をフーリエ変換することにより行うことができる。この
変換は、フレーム全体に対して行っても良いし、例えば
ブロックやマクロブロック程度の大きさに分割して、各
ブロックやマクロブロック毎に変換しても良い。そして
周波数帯域分析器１０１は、映像信号を周波数領域に変
換した結果を出力する。An input video signal, which is a digital signal, is first input to a frequency band analyzer 101. The frequency band analyzer 101 converts a video frame from a spatial domain to a frequency domain. The conversion to the frequency domain can be performed, for example, by performing a Fourier transform on the video signal. This conversion may be performed for the entire frame, or may be performed, for example, by dividing into blocks or macroblocks and for each block or macroblock. Then, the frequency band analyzer 101 outputs a result obtained by converting the video signal into a frequency domain.

【０００９】解像度決定器１０２は、周波数帯域分析器
１０１の出力である周波数領域に変換された映像信号を
分析し、それにより映像フレームの解像度（画素数×ラ
イン数）をどのように変更するかを決定する。この決定
方法を説明する。A resolution determiner 102 analyzes a video signal converted into a frequency domain, which is an output of the frequency band analyzer 101, and how to change the resolution (number of pixels × number of lines) of a video frame accordingly. To determine. This determination method will be described.

【００１０】図２（ａ）〜図２（ｄ）は解像度の例を示
す模式図である。ここでは、図２（ｂ）〜図２（ｄ）に
示す解像度Ａ、Ｂ、Ｃの３種類から解像度を選択するも
のとする。図２（ａ）は入力映像フレームの解像度であ
り、図２（ｂ）の解像度Ａは入力映像フレームの解像度
を水平方向に１／２にした解像度、図２（ｃ）の解像度
Ｂは入力映像フレームの解像度を水平方向に３／４にし
た解像度、図２（ｄ）の解像度Ｃは入力映像フレームの
解像度と同じ解像度である。FIGS. 2A to 2D are schematic diagrams showing examples of resolution. Here, it is assumed that a resolution is selected from three types of resolutions A, B, and C shown in FIGS. 2B to 2D. 2A shows the resolution of the input video frame, FIG. 2B shows the resolution A obtained by halving the resolution of the input video frame in the horizontal direction, and FIG. 2C shows the resolution B of the input video frame. The resolution obtained by reducing the frame resolution to 水平 in the horizontal direction, and the resolution C in FIG. 2D is the same as the resolution of the input video frame.

【００１１】図３は解像度決定のフローチャート、図４
（ａ）及び図４（ｂ）は周波数成分の例を示すグラフで
ある。解像度決定器１０２では、図３に示すフローチャ
ートにより解像度を決定する。まず、周波数領域に変換
された映像フレームの最高周波数成分ｆｍａｘを得る。
ｆｍａｘは周波数帯域分析器１０１での周波数変換がフ
レーム全体に対して行われている場合には、その周波数
成分の最大値として得ることができる。また、周波数帯
域分析器１０１での周波数変換がブロックやマクロブロ
ック毎に周波数変換している場合には、各ブロックやマ
クロブロックの最高周波数成分の最大値としてｆｍａｘ
を得ることができる。そして、ｆｍａｘが予め定めてお
いたしきい値ＴＨ１よりも小さいかの比較を行う。もし
ｆｍａｘがＴＨ１よりも小さい場合には解像度を解像度
Ａに決定する。ｆｍａｘがしきい値ＴＨ１以上である場
合には、ｆｍａｘが予め定めておいたしきい値ＴＨ２よ
りも小さいかの比較を行う。ここでＴＨ１＜ＴＨ２であ
る。もしｆｍａｘがＴＨ２よりも小さい場合には解像度
を解像度Ｂに決定する。ｆｍａｘがしきい値ＴＨ２以上
である場合には解像度を解像度Ｃに決定する。例えば周
波数帯域分析器１０１の出力である映像信号の周波数成
分及びしきい値ＴＨ１、ＴＨ２が図４（ａ）に示すよう
な値である場合には、図３のフローチャートに従って解
像度Ｂが選択される。また、図４（ｂ）は、周波数成分
が予め定めておいたしきい値ｆｔｈ以上となる最高周波
数をｆｍａｘとした場合であり、この場合には図３のフ
ローチャートに従って解像度Ａが選択される。FIG. 3 is a flowchart for determining the resolution, and FIG.
FIGS. 4A and 4B are graphs showing examples of frequency components. The resolution determiner 102 determines the resolution according to the flowchart shown in FIG. First, the highest frequency component fmax of the video frame converted into the frequency domain is obtained.
fmax can be obtained as the maximum value of the frequency component when the frequency conversion in the frequency band analyzer 101 is performed on the entire frame. If the frequency conversion in the frequency band analyzer 101 is performed for each block or macroblock, fmax is set as the maximum value of the highest frequency component of each block or macroblock.
Can be obtained. Then, a comparison is made as to whether fmax is smaller than a predetermined threshold value TH1. If fmax is smaller than TH1, the resolution is determined to be resolution A. If fmax is equal to or greater than the threshold value TH1, a comparison is made as to whether fmax is smaller than a predetermined threshold value TH2. Here, TH1 <TH2. If fmax is smaller than TH2, the resolution is determined to be resolution B. If fmax is equal to or larger than the threshold value TH2, the resolution is determined to be the resolution C. For example, when the frequency components of the video signal output from the frequency band analyzer 101 and the threshold values TH1 and TH2 are values as shown in FIG. 4A, the resolution B is selected according to the flowchart of FIG. . FIG. 4B shows a case where the maximum frequency at which the frequency component is equal to or higher than a predetermined threshold value fth is fmax. In this case, the resolution A is selected according to the flowchart of FIG.

【００１２】入力映像信号は、遅延器１００を介して解
像度変換器１０３に供給される。遅延器１００は、周波
数帯域分析器１０１及び解像度決定器１０２の動作に要
する時間だけ入力映像信号を遅延させるものである。一
方、解像度決定器１０２で決定された解像度は解像度変
換器１０３に出力される。これにより解像度変換器１０
３は、遅延器１００を介して与えられた映像フレームに
対して折り返し雑音除去フィルタを施した後、補間や間
引きを行って解像度を変換する。例えば図４（ａ）に示
す周波数成分を有する映像フレームの場合には、解像度
Ｂ、すなわち水平方向を元の映像の３／４に縮小する。
また、解像度Ｃが選択された場合には、解像度変換器１
０３は入力映像フレームをそのまま出力する。An input video signal is supplied to a resolution converter 103 via a delay unit 100. The delay unit 100 delays the input video signal by the time required for the operation of the frequency band analyzer 101 and the resolution determiner 102. On the other hand, the resolution determined by the resolution determiner 102 is output to the resolution converter 103. Thereby, the resolution converter 10
Reference numeral 3 applies a folding noise removal filter to the video frame provided via the delay unit 100, and then converts the resolution by performing interpolation or thinning. For example, in the case of a video frame having frequency components shown in FIG. 4A, the resolution B, that is, the horizontal direction is reduced to ３ of the original video.
When the resolution C is selected, the resolution converter 1
03 outputs the input video frame as it is.

【００１３】解像度変換器１０３から出力された映像フ
レームは、ブロック化器１０４に入力されてブロックに
分割される。ブロックは例えば８×８画素程度の大きさ
を有する。そして各ブロックはＤＣＴ変換器１０５に入
力される。ＤＣＴ変換器１０５では各ブロックに２次元
ＤＣＴを施してＤＣＴ係数に変換する。ＤＣＴ変換器１
０５の出力であるＤＣＴ係数は量子化器１０６に入力さ
れ、量子化処理により係数の丸め込みが行われる。量子
化器１０６の出力は可変長符号化器１０７に入力されて
可変長符号化を施された後、符号列として出力される。
ブロック化器１０４、ＤＣＴ変換器１０５、量子化器１
０６、可変長符号化器１０７においても解像度決定器１
０２で決定された解像度を元に符号化を制御する。The video frame output from the resolution converter 103 is input to a blocker 104 and is divided into blocks. The block has a size of, for example, about 8 × 8 pixels. Then, each block is input to the DCT converter 105. The DCT converter 105 applies two-dimensional DCT to each block to convert the blocks into DCT coefficients. DCT converter 1
The DCT coefficient which is the output of 05 is input to the quantizer 106, and the coefficient is rounded by a quantization process. The output of the quantizer 106 is input to the variable-length encoder 107, subjected to variable-length encoding, and then output as a code string.
Blocker 104, DCT transformer 105, quantizer 1
06, the resolution determiner 1 in the variable length encoder 107
The encoding is controlled based on the resolution determined in 02.

【００１４】以上のように第１の実施形態の画像符号化
装置は、入力映像信号に含まれる周波数成分を分析し、
その最高周波数成分がどの周波数帯域にあるかにより解
像度を決定する。この場合には、高い周波数成分を有す
るほど高い解像度となるように解像度を決定する。そし
て、入力映像をその決定した解像度に変換し符号化を行
う。したがって、第１の実施形態の画像符号化装置を用
いることにより、入力映像の周波数成分に応じた解像度
で入力映像を符号化することができる。つまり、入力映
像の周波数成分に応じて符号化するブロック数を減らす
ことができるため、少ない画質劣化で発生符号量を大き
く削減することができる。As described above, the image coding apparatus according to the first embodiment analyzes the frequency components included in the input video signal,
The resolution is determined based on which frequency band the highest frequency component is in. In this case, the resolution is determined such that the higher the frequency component, the higher the resolution. Then, the input video is converted to the determined resolution and encoded. Therefore, by using the image encoding device of the first embodiment, it is possible to encode an input video with a resolution corresponding to the frequency component of the input video. That is, since the number of blocks to be coded can be reduced according to the frequency components of the input video, the amount of generated codes can be greatly reduced with little image quality deterioration.

【００１５】なお、本実施形態では、周波数に対して２
つのしきい値ＴＨ１、ＴＨ２を設定し、３種類の解像度
（Ａ、Ｂ、Ｃ）のうちから１つの解像度を選択する場合
について説明したが、周波数に対するしきい値、解像度
の種類数は、本実施形態とは別の値であっても構わな
い。In this embodiment, the frequency is 2
The case where three threshold values TH1 and TH2 are set and one resolution is selected from three types of resolutions (A, B, and C) has been described. It may be a different value from the embodiment.

【００１６】図１の変形例を図５に示す。図５が図１と
異なる点は、入力映像信号がアナログ信号である点であ
る。図１中の解像度変換器１０３は標本化器１０３ａに
置き換えられる。図５によれば、解像度決定器１０２で
決定された解像度は標本化器１０３ａに出力される。こ
れにより標本化器１０３ａは、決定された解像度に応じ
たサンプリング周波数により、遅延器１００を介して与
えられた入力映像フレームの標本化を行う。また、標本
化に際してはサンプリング周波数に応じた折り返し歪み
除去フィルタを施す。例えばＣＣＩＲＲＥＣ．６０１
規格では、ＮＴＳＣの場合には輝度信号の水平方向の標
本化は１３．５ＭＨｚのサンプリング周波数で行うよう
に定められている。したがって、標本化器１０３ａで
は、解像度Ｃが選択された場合にはサンプリング周波数
は１３．５ＭＨｚ、解像度Ｂが選択された場合にはサン
プリング周波数は１０．１２５ＭＨｚ、解像度Ａが選択
された場合にはサンプリング周波数は６．７５ＭＨｚと
して標本化を行い、ディジタル信号として出力する。ブ
ロック化器１０４以下の動作は図１の場合と同様であ
る。FIG. 5 shows a modification of FIG. FIG. 5 differs from FIG. 1 in that the input video signal is an analog signal. The resolution converter 103 in FIG. 1 is replaced by a sampler 103a. According to FIG. 5, the resolution determined by the resolution determiner 102 is output to the sampler 103a. Thus, the sampler 103a samples the input video frame given via the delay unit 100 at a sampling frequency according to the determined resolution. Further, at the time of sampling, an aliasing distortion removal filter corresponding to the sampling frequency is applied. For example, CCIR REC. 601
According to the standard, in the case of NTSC, sampling of the luminance signal in the horizontal direction is performed at a sampling frequency of 13.5 MHz. Therefore, in the sampler 103a, the sampling frequency is 13.5 MHz when the resolution C is selected, the sampling frequency is 10.125 MHz when the resolution B is selected, and the sampling frequency is set when the resolution A is selected. Sampling is performed at a frequency of 6.75 MHz, and a digital signal is output. The operation after the block generator 104 is the same as that in FIG.

【００１７】（第２の実施形態）本発明の第２の実施形
態を図６を用いて説明する。図６は、遅延器１００、フ
ィルタ２０１，２０２、出力値比較器２０４，２０５、
解像度決定器２０７、解像度変換器１０３、圧縮符号化
器１１０から構成された画像符号化装置のブロック図で
ある。(Second Embodiment) A second embodiment of the present invention will be described with reference to FIG. FIG. 6 shows a delay unit 100, filters 201 and 202, output value comparators 204 and 205,
FIG. 2 is a block diagram of an image encoding device including a resolution determiner 207, a resolution converter 103, and a compression encoder 110.

【００１８】ディジタル信号である入力映像信号は、ま
ずフィルタ２０１，２０２に入力される。フィルタ２０
１，２０２は遮断周波数が異なる低域通過フィルタであ
る。フィルタ２０１，２０２の周波数特性例を図７に示
す。図７に示すように、ここではフィルタ２０１の遮断
周波数ｆｃ１よりもフィルタ２０２の遮断周波数ｆｃ２
の方が高いものとする。フィルタ２０１，２０２の出力
はそれぞれ出力値比較器２０４，２０５に入力される。An input video signal, which is a digital signal, is first input to filters 201 and 202. Filter 20
Reference numeral 1202 denotes low-pass filters having different cutoff frequencies. FIG. 7 shows an example of the frequency characteristics of the filters 201 and 202. As shown in FIG. 7, here, the cut-off frequency fc2 of the filter 202 is higher than the cut-off frequency fc1 of the filter 201.
Shall be higher. Outputs of the filters 201 and 202 are input to output value comparators 204 and 205, respectively.

【００１９】出力値比較器２０４，２０５では、フィル
タ出力を入力とし、フィルタ出力信号のエネルギーを計
算する。そして、そのエネルギー値と予め定めておいた
しきい値とを比較する。そしてその比較結果を解像度決
定器２０７に対して出力する。ここでは出力値比較器２
０４，２０５は、フィルタ出力がしきい値よりも大きい
場合は「大」、小さい場合は「小」を出力するものとす
る。The output value comparators 204 and 205 receive the filter output and calculate the energy of the filter output signal. Then, the energy value is compared with a predetermined threshold value. The comparison result is output to the resolution determiner 207. Here, the output value comparator 2
04 and 205 output “large” when the filter output is larger than the threshold, and output “small” when the filter output is smaller than the threshold.

【００２０】解像度決定器２０７は、出力値比較器２０
４，２０５の比較結果を受けて入力映像フレームの解像
度を決定する。ここでは、上記第１の実施形態と同様に
図２（ｂ）〜図２（ｄ）に示す３種類の解像度Ａ、Ｂ、
Ｃのいずれかに決定するものとする。その決定方法を図
８に示す。The resolution determiner 207 is provided for the output value comparator 20
The resolution of the input video frame is determined based on the comparison result of 4,205. Here, similarly to the first embodiment, three types of resolutions A, B, and B shown in FIGS. 2B to 2D are used.
C shall be determined. FIG. 8 shows the determination method.

【００２１】図８に示すように解像度決定器２０７は、
出力値比較器２０４の比較結果が「小」である場合に
は、最も低い解像度である解像度Ａを選択する。また、
出力値比較器２０４の比較結果が「大」であり、かつ出
力値比較器２０５の比較結果が「小」である場合には、
解像度Ｂを選択する。また、出力値比較器２０５の比較
結果が「大」である場合には、解像度Ｃを選択する。す
なわち、フィルタ２０１，２０２の出力値のうち予め定
められたしきい値よりも大きいものがある場合に、当該
フィルタの遮断周波数が高いほど高い解像度となるよう
に、解像度を決定する。As shown in FIG. 8, the resolution determiner 207
When the comparison result of the output value comparator 204 is “small”, the resolution A that is the lowest resolution is selected. Also,
When the comparison result of the output value comparator 204 is “large” and the comparison result of the output value comparator 205 is “small”,
Select resolution B. When the comparison result of the output value comparator 205 is “large”, the resolution C is selected. That is, when there is one of the output values of the filters 201 and 202 larger than a predetermined threshold value, the resolution is determined so that the higher the cutoff frequency of the filter, the higher the resolution.

【００２２】解像度決定器２０７で決定された解像度は
解像度変換器１０３に出力される。これにより解像度変
換器１０３は、遅延器１００を介して与えられた映像フ
レームに対して折り返し雑音除去フィルタを施した後、
補間や間引きを行って解像度を変換する。解像度変換器
１０３から出力された映像フレームは、圧縮符号化器１
１０により符号列に変換される。The resolution determined by the resolution determiner 207 is output to the resolution converter 103. Thereby, the resolution converter 103 applies the aliasing removal filter to the video frame given through the delay unit 100,
Convert the resolution by performing interpolation or thinning. The video frame output from the resolution converter 103 is output to the compression encoder 1
It is converted into a code string by 10.

【００２３】以上のように第２の実施形態の画像符号化
装置は、入力映像信号に対して異なる遮断周波数を有す
る低域通過フィルタを施し、そのフィルタ出力と予め定
めておいたしきい値とを比較する。そして、しきい値よ
りも大きい出力を出すフィルタの遮断周波数が高いほ
ど、高い解像度となるように解像度を決定する。そし
て、入力映像をその決定した解像度に変換し符号化を行
う。したがって、第２の実施形態の画像符号化装置を用
いることにより、入力映像の周波数成分に応じた解像度
で入力映像を符号化することができる。つまり、入力映
像の周波数成分に応じて符号化するブロック数を減らす
ことができるため、少ない画質劣化で発生符号量を大き
く削減することができる。As described above, the image coding apparatus according to the second embodiment applies a low-pass filter having a different cutoff frequency to an input video signal, and compares the filter output with a predetermined threshold value. Compare. Then, the resolution is determined so that the higher the cutoff frequency of the filter that outputs an output larger than the threshold value, the higher the resolution. Then, the input video is converted to the determined resolution and encoded. Therefore, by using the image encoding device of the second embodiment, it is possible to encode the input video with a resolution corresponding to the frequency component of the input video. That is, since the number of blocks to be coded can be reduced according to the frequency components of the input video, the amount of generated codes can be greatly reduced with little image quality deterioration.

【００２４】なお、本実施形態では、２つの異なる遮断
周波数を有するフィルタ２０１，２０２を用いて３種類
の解像度（Ａ、Ｂ、Ｃ）のうちから１つの解像度を選択
する場合について説明したが、フィルタ数、解像度の種
類数は本実施形態とは別の値であっても構わない。In this embodiment, a case has been described in which one of three resolutions (A, B, C) is selected by using the filters 201 and 202 having two different cutoff frequencies. The number of filters and the number of resolution types may be different values from the present embodiment.

【００２５】図６の変形例を図９に示す。図９が図６と
異なる点は、入力映像信号がアナログ信号である点であ
る。図６中の解像度変換器１０３は標本化器１０３ａに
置き換えられる。図９によれば、解像度決定器２０７で
決定された解像度は標本化器１０３ａに出力される。こ
れにより標本化器１０３ａは、決定された解像度に応じ
たサンプリング周波数により、遅延器１００を介して与
えられた入力映像フレームの標本化を行い、ディジタル
映像信号を圧縮符号化器１１０へ供給する。FIG. 9 shows a modification of FIG. FIG. 9 differs from FIG. 6 in that the input video signal is an analog signal. The resolution converter 103 in FIG. 6 is replaced by a sampler 103a. According to FIG. 9, the resolution determined by the resolution determiner 207 is output to the sampler 103a. Thus, the sampler 103a samples the input video frame given via the delay unit 100 at a sampling frequency corresponding to the determined resolution, and supplies a digital video signal to the compression encoder 110.

【００２６】（第３の実施形態）本発明の第３の実施形
態を図１０を用いて説明する。図１０は、遅延器１０
０、アクティビティ測定器３０１、解像度決定器３０
２、解像度変換器１０３、圧縮符号化器１１０から構成
された画像符号化装置のブロック図である。(Third Embodiment) A third embodiment of the present invention will be described with reference to FIG. FIG.
0, activity measuring device 301, resolution determining device 30
FIG. 2 is a block diagram of an image encoding device including a resolution converter 103 and a compression encoder 110.

【００２７】ディジタル信号である入力映像信号は、ま
ずアクティビティ測定器３０１に入力される。アクティ
ビティ測定器３０１では、映像フレームのアクティビテ
ィを計算する。ここでアクティビティとはフレーム内の
画素値の分散値である。Ｎ画素×Ｎラインからなるフレ
ームにおいて、ｉを水平方向の画素位置、ｊを垂直方向
の画素位置、Ｐijを画素値とするとき、平均値Ｐｍ及び
分散値ｖａｒは、Ｐｍ＝（１／Ｎ²）ΣΣＰij ｖａｒ＝（１／Ｎ²）ΣΣ（Ｐij−Ｐｍ）² により与えられる。ここで、ΣΣはｉについて０からＮ
−１まで総和をとり、かつｊについて０からＮ−１まで
総和をとる操作を意味するものとする。そしてアクティ
ビティ測定器３０１は、計算したアクティビティを出力
する。なお、マクロブロックやブロック毎の画素値の分
散値をアクティビティとしてもよい。An input video signal, which is a digital signal, is first input to the activity measuring device 301. The activity measuring device 301 calculates the activity of a video frame. Here, the activity is a variance of pixel values in the frame. In a frame composed of N pixels × N lines, when i is a pixel position in the horizontal direction, j is a pixel position in the vertical direction, and Pij is a pixel value, the average value Pm and the variance value var are Pm = (1 / N ^2). ) {Pij var = (1 / N ² )} (Pij−Pm) ² Where ΣΣ is from 0 to N for i
It means an operation of summing up to −1 and summing up j from 0 to N−1. Then, the activity measuring device 301 outputs the calculated activity. The activity may be a variance of pixel values for each macroblock or block.

【００２８】解像度決定器３０２は、アクティビティ測
定器３０１の出力であるアクティビティの値を用いて、
映像フレームの解像度をどのように変更するかを決定す
る。この決定方法を説明する。The resolution determining unit 302 uses the activity value output from the activity measuring unit 301 to
Determine how to change the resolution of the video frame. This determination method will be described.

【００２９】図１１は解像度決定のフローチャートであ
る。今、上記第１の実施形態と同様に、図２（ｂ）〜図
２（ｄ）に示す解像度Ａ、Ｂ、Ｃの３種類から解像度を
選択するものとする。FIG. 11 is a flowchart for determining the resolution. Now, as in the first embodiment, it is assumed that a resolution is selected from three types of resolutions A, B, and C shown in FIGS. 2B to 2D.

【００３０】解像度決定器３０２では、図１１に示すフ
ローチャートにより解像度を決定する。まず、アクティ
ビティａｃｔの決定を行う。例えばアクティビティ測定
器３０１においてフレーム内の画素値の分散値としてア
クティビティを求めた場合には、その値がａｃｔとな
る。また、アクティビティ測定器３０１において、マク
ロブロックやブロック毎の分散値としてアクティビティ
を求めた場合には、それらのアクティビティの１フレー
ム内の平均値をａｃｔとしてもよいし、フレーム内の最
大値をａｃｔとしてもよい。図１２は、後者の詳細手順
を示している。そして、図１１に示すように、ａｃｔが
予め定めておいたしきい値ＴＨ３よりも小さいかの比較
を行う。もしａｃｔがＴＨ３よりも小さい場合には解像
度を解像度Ａに決定する。ａｃｔがしきい値ＴＨ３以上
である場合には、ａｃｔが予め定めておいたしきい値Ｔ
Ｈ４よりも小さいかの比較を行う。ここでＴＨ３＜ＴＨ
４である。もしａｃｔがＴＨ４よりも小さい場合には解
像度を解像度Ｂに決定する。ａｃｔがしきい値ＴＨ４以
上である場合には解像度を解像度Ｃに決定する。The resolution determiner 302 determines the resolution according to the flowchart shown in FIG. First, the activity act is determined. For example, when the activity is obtained as the variance of the pixel values in the frame by the activity measuring device 301, the value is act. When the activity measuring device 301 calculates an activity as a variance value for each macroblock or block, the average value of those activities in one frame may be set as act, or the maximum value in the frame may be set as act. Is also good. FIG. 12 shows the latter detailed procedure. Then, as shown in FIG. 11, a comparison is made as to whether act is smaller than a predetermined threshold value TH3. If act is smaller than TH3, the resolution is determined to be resolution A. If act is equal to or greater than threshold TH3, act is a predetermined threshold T
A comparison is made to see if it is smaller than H4. Where TH3 <TH
4. If act is smaller than TH4, the resolution is determined to be resolution B. If act is equal to or greater than threshold value TH4, the resolution is determined to be resolution C.

【００３１】解像度決定器３０２で決定された解像度は
解像度変換器１０３に出力される。これにより解像度変
換器１０３は、遅延器１００を介して与えられた映像フ
レームに対して折り返し雑音除去フィルタを施した後、
補間や間引きを行って解像度を変換する。解像度変換器
１０３から出力された映像フレームは、圧縮符号化器１
１０により符号列に変換される。The resolution determined by the resolution determiner 302 is output to the resolution converter 103. Thereby, the resolution converter 103 applies the aliasing removal filter to the video frame given through the delay unit 100,
Convert the resolution by performing interpolation or thinning. The video frame output from the resolution converter 103 is output to the compression encoder 1
It is converted into a code string by 10.

【００３２】以上のように第３の実施形態の画像符号化
装置は、入力映像のアクティビティを計算し、そのアク
ティビティの値によって解像度を決定する。この場合に
は、アクティビティが大きいほど、高い解像度となるよ
うに解像度を決定する。そして、入力映像をその決定し
た解像度に変換し符号化を行う。したがって、第３の実
施形態の画像符号化装置を用いることにより、入力映像
のアクティビティに応じた解像度で入力映像を符号化す
ることができる。つまり、入力映像のアクティビティに
応じて符号化するブロック数を減らすことができるた
め、少ない画質劣化で発生符号量を大きく削減すること
ができる。As described above, the image coding apparatus according to the third embodiment calculates the activity of an input video, and determines the resolution according to the value of the activity. In this case, the resolution is determined so that the higher the activity, the higher the resolution. Then, the input video is converted to the determined resolution and encoded. Therefore, by using the image encoding device of the third embodiment, it is possible to encode an input video at a resolution corresponding to the activity of the input video. That is, since the number of blocks to be coded can be reduced according to the activity of the input video, the amount of generated codes can be greatly reduced with little image quality deterioration.

【００３３】なお、本実施形態では、アクティビティに
対して２つのしきい値ＴＨ３、ＴＨ４を設定し、３種類
の解像度（Ａ、Ｂ、Ｃ）のうちから１つの解像度を選択
する場合について説明したが、アクティビティに対する
しきい値の数、解像度の種類数は本実施形態とは別の値
であっても構わない。In this embodiment, the case where two threshold values TH3 and TH4 are set for the activity and one resolution is selected from three types of resolutions (A, B and C) has been described. However, the number of thresholds and the number of types of resolutions for activities may be different from those in the present embodiment.

【００３４】図１０の変形例を図１３に示す。図１３が
図１０と異なる点は、入力映像信号がアナログ信号であ
る点である。図１０中の解像度変換器１０３は標本化器
１０３ａに置き換えられる。図１３によれば、解像度決
定器３０２で決定された解像度は標本化器１０３ａに出
力される。これにより標本化器１０３ａは、決定された
解像度に応じたサンプリング周波数により、遅延器１０
０を介して与えられた入力映像フレームの標本化を行
い、ディジタル映像信号を圧縮符号化器１１０へ供給す
る。FIG. 13 shows a modification of FIG. FIG. 13 differs from FIG. 10 in that the input video signal is an analog signal. The resolution converter 103 in FIG. 10 is replaced by a sampler 103a. According to FIG. 13, the resolution determined by the resolution determiner 302 is output to the sampler 103a. Thus, the sampler 103a uses the sampling frequency corresponding to the determined resolution to set the delay
The sampling of the input video frame given via 0 is performed, and the digital video signal is supplied to the compression encoder 110.

【００３５】（第４の実施形態）本発明の第４の実施形
態を図１４を用いて説明する。図１４は、ＤＣＴ係数分
析器４０１、解像度決定器４０２、解像度変換器１０
３、圧縮符号化器１１０から構成された画像符号化装置
のブロック図である。圧縮符号化器１１０は、ブロック
化器１０４、ＤＣＴ変換器１０５、量子化器１０６、可
変長符号化器１０７から構成される。(Fourth Embodiment) A fourth embodiment of the present invention will be described with reference to FIG. FIG. 14 shows a DCT coefficient analyzer 401, a resolution determiner 402, and a resolution converter 10
FIG. 3 is a block diagram of an image encoding device including a compression encoder 110. The compression encoder 110 includes a block converter 104, a DCT transformer 105, a quantizer 106, and a variable length encoder 107.

【００３６】ディジタル信号である入力映像信号は、ま
ず解像度変換器１０３に入力される。そして解像度変換
器１０３は、後述する方法で決定された解像度に基づい
て映像フレームの解像度を変換する。解像度変換器１０
３から出力された映像フレームは、ブロック化器１０４
に入力されてブロックに分割される。ブロックは例えば
８×８画素程度の大きさを有する。そして各ブロックは
ＤＣＴ変換器１０５に入力される。ＤＣＴ変換器１０５
では各ブロックに２次元ＤＣＴを施してＤＣＴ係数に変
換する。ＤＣＴ変換器１０５の出力であるＤＣＴ係数は
量子化器１０６に入力され、量子化処理により係数の丸
め込みが行われる。量子化器１０６の出力は可変長符号
化器１０７に入力されて可変長符号化を施された後、符
号列として出力される。An input video signal, which is a digital signal, is first input to the resolution converter 103. Then, the resolution converter 103 converts the resolution of the video frame based on the resolution determined by a method described later. Resolution converter 10
3 is output to the blocker 104
And is divided into blocks. The block has a size of, for example, about 8 × 8 pixels. Then, each block is input to the DCT converter 105. DCT converter 105
Then, two-dimensional DCT is applied to each block to convert it into DCT coefficients. The DCT coefficients output from the DCT transformer 105 are input to a quantizer 106, where the coefficients are rounded by a quantization process. The output of the quantizer 106 is input to the variable-length encoder 107, subjected to variable-length encoding, and then output as a code string.

【００３７】一方、ＤＣＴ変換器１０５の出力であるＤ
ＣＴ係数は、ＤＣＴ係数分析器４０１に入力される。Ｄ
ＣＴ係数分析器４０１では、入力されたＤＣＴ係数を１
フレーム期間に渡って分析する。ＤＣＴ係数分析器４０
１での分析は、映像フレームがどの程度高域成分を有し
ているかを調べるために行う。これには様々な方法が考
えられるが、その例を図１５を用いて説明する。On the other hand, D which is the output of DCT converter 105
The CT coefficients are input to a DCT coefficient analyzer 401. D
In the CT coefficient analyzer 401, the input DCT coefficient is
Analyze over the frame period. DCT coefficient analyzer 40
The analysis in step 1 is performed to check how high-frequency components the video frame has. Various methods are conceivable for this, examples of which will be described with reference to FIG.

【００３８】図１５は、ＤＣＴ係数ブロックを示した模
式図である。図１５に示すようにＤＣＴ係数ブロックを
水平、垂直周波数成分によってゾーン１〜４に分割す
る。ＤＣＴ係数分析器４０１は、ＤＣＴ変換器１０５か
ら入力されたＤＣＴ係数ブロック毎に最も高い周波数成
分がどのゾーンに属するかを１フレームに渡って計数
し、その分析結果を解像度決定器４０２に対して出力す
る。例えば分析方法が上記の方法である場合には、各ゾ
ーンのカウント数を出力する。解像度決定器４０２は、
ＤＣＴ係数分析器４０１の分析結果を元にして次の映像
フレームの解像度をどのように変更するかを決定する。FIG. 15 is a schematic diagram showing a DCT coefficient block. As shown in FIG. 15, the DCT coefficient block is divided into zones 1 to 4 by horizontal and vertical frequency components. The DCT coefficient analyzer 401 counts which zone the highest frequency component belongs to for each DCT coefficient block input from the DCT converter 105 over one frame, and sends the analysis result to the resolution determiner 402. Output. For example, when the analysis method is the above method, the count number of each zone is output. The resolution determiner 402
Based on the analysis result of the DCT coefficient analyzer 401, how to change the resolution of the next video frame is determined.

【００３９】例えばＤＣＴ係数分析器４０１における分
析方法が上記の方法である場合について、図１６を用い
て解像度の決定方法を説明する。この場合には、ゾーン
４、３、２、１の順でカウント数と所定のしきい値との
比較を行い、カウント数がしきい値以上となるゾーンに
より解像度を決定する。ここで各ゾーンに対応する解像
度は、ゾーン１、２、３、４となるに従って高くなる。For example, in the case where the analysis method in the DCT coefficient analyzer 401 is the above method, a method of determining the resolution will be described with reference to FIG. In this case, the count number is compared with a predetermined threshold value in the order of zones 4, 3, 2, and 1, and the resolution is determined according to the zone where the count number is equal to or greater than the threshold value. Here, the resolution corresponding to each zone increases as the zones become 1, 2, 3, and 4.

【００４０】このようにして解像度決定器４０２で決定
された解像度は解像度変換器１０３に出力される。これ
により解像度変換器１０３は、次の映像フレームの解像
度変換を行う。The resolution determined by the resolution determiner 402 is output to the resolution converter 103. As a result, the resolution converter 103 performs resolution conversion of the next video frame.

【００４１】以上のように第４の実施形態の画像符号化
装置は、入力映像のＤＣＴ係数ブロックを分析し、その
ＤＣＴ係数の高域成分の分布によって解像度を決定す
る。この場合には、高域成分を有するブロックが多いほ
ど、高い解像度となるように入力映像の解像度を決定す
る。そして入力映像の次のフレームをその決定した解像
度に変換し符号化を行う。したがって、第４の実施形態
の画像符号化装置を用いることにより、入力映像の周波
数成分に応じた解像度で入力映像を符号化することがで
きる。つまり、入力映像の周波数成分に応じて符号化す
るブロック数を減らすことができるため、少ない画質劣
化で発生符号量を大きく削減することができる。As described above, the image coding apparatus according to the fourth embodiment analyzes a DCT coefficient block of an input video, and determines a resolution based on a distribution of a high frequency component of the DCT coefficient. In this case, the resolution of the input video is determined so that the higher the number of blocks having high-frequency components, the higher the resolution. Then, the next frame of the input video is converted to the determined resolution and encoded. Therefore, by using the image encoding device of the fourth embodiment, it is possible to encode an input video with a resolution corresponding to the frequency component of the input video. That is, since the number of blocks to be coded can be reduced according to the frequency components of the input video, the amount of generated codes can be greatly reduced with little image quality deterioration.

【００４２】なお、本実施形態では、ＤＣＴ係数ブロッ
クを４つのゾーンに分割して分析を行う場合について説
明したが、このゾーン数は別の値であっても構わない。In the present embodiment, the case where the analysis is performed by dividing the DCT coefficient block into four zones has been described. However, the number of zones may be another value.

【００４３】また、本実施形態では、ＤＣＴ係数ブロッ
クを１フレーム期間に渡って分析し、その結果を次のフ
レームの解像度の決定に用いる場合について説明した
が、このＤＣＴ係数ブロックの分析期間は異なる期間で
あっても構わない。In this embodiment, the case where the DCT coefficient block is analyzed over one frame period and the result is used to determine the resolution of the next frame has been described. However, the analysis period of the DCT coefficient block is different. It may be a period.

【００４４】図１４の変形例を図１７に示す。図１７が
図１４と異なる点は、入力映像信号がアナログ信号であ
る点である。図１４中の解像度変換器１０３は標本化器
１０３ａに置き換えられる。図１７によれば、解像度決
定器４０２で決定された解像度は標本化器１０３ａに出
力される。これにより標本化器１０３ａは、決定された
解像度に応じたサンプリング周波数により入力映像フレ
ームの標本化を行い、ディジタル映像信号を圧縮符号化
器１１０へ供給する。FIG. 17 shows a modification of FIG. FIG. 17 differs from FIG. 14 in that the input video signal is an analog signal. The resolution converter 103 in FIG. 14 is replaced by a sampler 103a. According to FIG. 17, the resolution determined by the resolution determiner 402 is output to the sampler 103a. As a result, the sampler 103a samples the input video frame at a sampling frequency corresponding to the determined resolution, and supplies the digital video signal to the compression encoder 110.

【００４５】（第５の実施形態）本発明の第５の実施形
態を図１８を用いて説明する。図１８は、平均量子化ス
テップ演算器５０１、発生符号量計数器５０２、解像度
決定器５０３、解像度変換器１０３、圧縮符号化器１１
０から構成された画像符号化装置のブロック図である。
圧縮符号化器１１０は、ブロック化器１０４、ＤＣＴ変
換器１０５、量子化器１０６、可変長符号化器１０７か
ら構成される。(Fifth Embodiment) A fifth embodiment of the present invention will be described with reference to FIG. FIG. 18 shows an average quantization step calculator 501, a generated code amount counter 502, a resolution determiner 503, a resolution converter 103, and a compression encoder 11.
FIG. 2 is a block diagram of an image encoding device composed of 0.
The compression encoder 110 includes a block converter 104, a DCT transformer 105, a quantizer 106, and a variable length encoder 107.

【００４６】ディジタル信号である入力映像信号は、ま
ず解像度変換器１０３に入力される。そして解像度変換
器１０３は、後述する方法で決定された解像度に基づい
て映像フレームの解像度を変換する。解像度変換器１０
３から出力された映像フレームは、ブロック化器１０４
に入力されてブロックに分割される。そして各ブロック
はＤＣＴ変換器１０５に入力される。ＤＣＴ変換器１０
５では各ブロックに２次元ＤＣＴを施してＤＣＴ係数に
変換する。ＤＣＴ変換器１０５の出力であるＤＣＴ係数
は量子化器１０６に入力され、量子化処理により係数の
丸め込みが行われる。量子化器１０６の出力は可変長符
号化器１０７に入力されて可変長符号化を施された後、
符号列として出力される。An input video signal, which is a digital signal, is first input to a resolution converter 103. Then, the resolution converter 103 converts the resolution of the video frame based on the resolution determined by a method described later. Resolution converter 10
3 is output to the blocker 104
And is divided into blocks. Then, each block is input to the DCT converter 105. DCT converter 10
In step 5, two-dimensional DCT is performed on each block to convert the blocks into DCT coefficients. The DCT coefficients output from the DCT transformer 105 are input to a quantizer 106, where the coefficients are rounded by a quantization process. The output of the quantizer 106 is input to a variable-length encoder 107 and subjected to variable-length encoding.
Output as a code string.

【００４７】一方、可変長符号化器１０７で生成した符
号列の発生符号量が発生符号量計数器５０２に入力され
る。発生符号量計数器５０２は、所定期間（例えば１フ
レーム）の発生符号量の累積和を求める。所定期間の発
生符号量の累積和は解像度決定器５０３に対して出力さ
れる。On the other hand, the generated code amount of the code string generated by the variable length encoder 107 is input to the generated code amount counter 502. The generated code amount counter 502 calculates the cumulative sum of the generated code amounts for a predetermined period (for example, one frame). The cumulative sum of the generated code amounts for a predetermined period is output to the resolution determiner 503.

【００４８】また、量子化器１０６でＤＣＴ係数の量子
化処理に用いられた量子化ステップが平均量子化ステッ
プ演算器５０１に入力される。平均量子化ステップ演算
器５０１では、所定期間（例えば１フレーム）の量子化
ステップの平均値を求める。求められた量子化ステップ
の平均値は、解像度決定器５０３に対して出力される。The quantization step used by the quantizer 106 for quantizing the DCT coefficient is input to the average quantization step calculator 501. The average quantization step calculator 501 calculates an average value of quantization steps in a predetermined period (for example, one frame). The obtained average value of the quantization step is output to the resolution determiner 503.

【００４９】解像度決定器５０３は、発生符号量計数器
５０２で求められた所定期間の発生符号量の累積和と、
平均量子化ステップ演算器５０１で求められた所定期間
の量子化ステップの平均値との積を計算する。この積の
値をコンプレキシティＸとする。コンプレキシティＸは
映像の符号化難易度を示す値であり、大きいほど、符号
化が難しい（同じ量子化ステップで符号化した場合に、
発生符号量が多くなる）ことを示している。The resolution determiner 503 calculates the cumulative sum of the generated code amount for a predetermined period obtained by the generated code amount counter 502,
The product of the average quantization step calculator 501 and the average value of the quantization steps for a predetermined period is calculated. The value of this product is defined as complexity X. Complexity X is a value indicating the degree of difficulty in encoding a video, and the larger it is, the more difficult it is to encode (when encoded in the same quantization step,
(The generated code amount increases).

【００５０】解像度決定器５０３では、図１９に示すフ
ローチャートにより解像度を決定する。ここでは、図２
（ｂ）〜図２（ｄ）に示す３つの解像度Ａ、Ｂ、Ｃのい
ずれかに解像度を決定するものとする。まず、所定期間
のコンプレキシティＸを得る。そして、Ｘと予め定めて
おいたしきい値ＴＨ５との比較を行う。もしＸがＴＨ５
よりも小さい場合には解像度を解像度Ａに決定する。Ｘ
がしきい値ＴＨ５以上である場合には、Ｘと予め定めて
おいたしきい値ＴＨ６との比較を行う。ここでＴＨ５＜
ＴＨ６である。もしＸがＴＨ６よりも小さい場合には解
像度を解像度Ｂに決定する。Ｘがしきい値ＴＨ６以上で
ある場合には解像度を解像度Ｃに決定する。The resolution determiner 503 determines the resolution according to the flowchart shown in FIG. Here, FIG.
It is assumed that the resolution is determined to one of the three resolutions A, B, and C shown in FIGS. First, a complexity X for a predetermined period is obtained. Then, X is compared with a predetermined threshold value TH5. If X is TH5
If smaller, the resolution is determined to be resolution A. X
Is greater than or equal to the threshold value TH5, X is compared with a predetermined threshold value TH6. Where TH5 <
TH6. If X is smaller than TH6, the resolution is determined to be resolution B. If X is greater than or equal to threshold value TH6, the resolution is determined to be resolution C.

【００５１】このようにして解像度決定器５０３で決定
された解像度は解像度変換器１０３に出力される。これ
により解像度変換器１０３は、次の映像フレームの解像
度変換を行う。The resolution determined by the resolution determiner 503 is output to the resolution converter 103. As a result, the resolution converter 103 performs resolution conversion of the next video frame.

【００５２】以上のように第５の実施形態の画像符号化
装置は、過去の所定期間の発生符号量及び量子化ステッ
プからコンプレキシティを計算し、そのコンプレキシテ
ィの大きさによって解像度を決定する。この場合には、
コンプレキシティが大きい、すなわち符号化難易度が高
い映像であるほど、高い解像度となるように入力映像の
解像度を決定する。そして入力映像の次のフレームをそ
の決定した解像度に変換し符号化を行う。したがって、
第５の実施形態の画像符号化装置を用いることにより、
入力映像の符号化難易度に応じた解像度で入力映像を符
号化することができる。つまり、入力映像の符号化難易
度に応じて符号化するブロック数を減らすことができる
ため、少ない画質劣化で発生符号量を大きく削減するこ
とができる。As described above, the image coding apparatus according to the fifth embodiment calculates the complexity from the generated code amount and the quantization step in the past predetermined period, and determines the resolution according to the magnitude of the complexity. I do. In this case,
The resolution of the input video is determined so that the higher the complexity, that is, the higher the video encoding difficulty, the higher the resolution. Then, the next frame of the input video is converted to the determined resolution and encoded. Therefore,
By using the image encoding device of the fifth embodiment,
The input video can be encoded at a resolution corresponding to the encoding difficulty of the input video. That is, since the number of blocks to be encoded can be reduced in accordance with the degree of difficulty of encoding the input video, the amount of generated codes can be greatly reduced with little deterioration in image quality.

【００５３】なお、本実施形態では、発生符号量の累積
値と量子化ステップの平均値とを求める所定期間が１フ
レームである場合について説明したが、これは異なる期
間であっても構わない。In this embodiment, the case where the predetermined period for obtaining the accumulated value of the generated code amount and the average value of the quantization step is one frame is described, but this may be a different period.

【００５４】また、本実施形態では、コンプレキシティ
に対して２つのしきい値ＴＨ５、ＴＨ６を設定し、３種
類の解像度（Ａ、Ｂ、Ｃ）のうちから１つの解像度を選
択する場合について説明したが、しきい値の数、解像度
の種類数は本実施形態とは別の値であっても構わない。In this embodiment, two threshold values TH5 and TH6 are set for the complexity, and one of the three resolutions (A, B, C) is selected. As described above, the number of thresholds and the number of types of resolutions may be different from those in the present embodiment.

【００５５】図１８の変形例を図２０に示す。図２０が
図１８と異なる点は、入力映像信号がアナログ信号であ
る点である。図１８中の解像度変換器１０３は標本化器
１０３ａに置き換えられる。図２０によれば、解像度決
定器５０３で決定された解像度は標本化器１０３ａに出
力される。これにより標本化器１０３ａは、決定された
解像度に応じたサンプリング周波数により入力映像フレ
ームの標本化を行い、ディジタル映像信号を圧縮符号化
器１１０へ供給する。FIG. 20 shows a modification of FIG. FIG. 20 differs from FIG. 18 in that the input video signal is an analog signal. The resolution converter 103 in FIG. 18 is replaced by a sampler 103a. According to FIG. 20, the resolution determined by the resolution determiner 503 is output to the sampler 103a. As a result, the sampler 103a samples the input video frame at a sampling frequency corresponding to the determined resolution, and supplies the digital video signal to the compression encoder 110.

【００５６】なお、上記第１〜第５の実施形態では、フ
レーム内符号化の場合について説明したが、フレーム間
予測符号化の場合でも同様に解像度変換を施すことがで
きることは言うまでもない。In the first to fifth embodiments, the case of intra-frame coding has been described. However, it goes without saying that resolution conversion can be similarly performed in the case of inter-frame predictive coding.

【００５７】また、上記第１〜第５の実施形態では水平
方向に解像度を落とす場合について説明したが、これは
垂直方向に解像度を落としても、水平方向と垂直方向の
両方の解像度を落としても構わない。In the first to fifth embodiments, the case where the resolution is lowered in the horizontal direction has been described. However, even if the resolution is lowered in the vertical direction, the resolution is lowered in both the horizontal direction and the vertical direction. No problem.

【００５８】また、上記第１〜第５の実施形態では、符
号化方式としてＭＰＥＧ２を想定して説明したが、これ
は他の符号化方式であっても構わない。Further, in the first to fifth embodiments, the description has been made on the assumption that MPEG2 is used as the encoding method. However, this may be another encoding method.

【００５９】以下の第６〜第９の実施形態は、特に映像
信号及び音声信号を圧縮符号化して得られたＡＶストリ
ームを書き換え可能な光ディスクであるＤＶＤ−ＲＡＭ
やハードディスク装置などの蓄積メディアに蓄積するの
に適したものである。The following sixth to ninth embodiments are particularly directed to a DVD-RAM which is an optical disk capable of rewriting an AV stream obtained by compression-encoding a video signal and an audio signal.
It is suitable for storing in a storage medium such as a hard disk drive.

【００６０】（第６の実施形態）本発明の第６の実施形
態を、図２１を用いて説明する。図２１は、圧縮符号化
器６１０、符号量制御器６２０、解像度変更ユニット６
３０から構成された画像符号化装置のブロック図であ
る。(Sixth Embodiment) A sixth embodiment of the present invention will be described with reference to FIG. FIG. 21 shows a compression encoder 610, a code amount controller 620, and a resolution changing unit 6.
FIG. 2 is a block diagram of an image encoding device constituted by 30.

【００６１】圧縮符号化器６１０は、不図示のブロック
化器、ＤＣＴ変換器、量子化器、可変長符号化器から構
成され、可変の割り当てビット量Ｂｔに応じた可変ビッ
トレートを有する符号化ビットストリームを出力するよ
うに映像信号を圧縮符号化する機能を有する。割り当て
ビット量Ｂｔは、圧縮符号化器６１０における量子化ス
テップを規定する。The compression encoder 610 is composed of a block generator, a DCT transformer, a quantizer, and a variable length encoder (not shown), and has a variable bit rate corresponding to a variable allocated bit amount Bt. It has a function of compression-encoding a video signal so as to output a bit stream. The allocated bit amount Bt defines a quantization step in the compression encoder 610.

【００６２】符号量制御器６２０は、発生符号量を制御
するための手段であって、符号化ビットストリームの所
定期間毎の発生ビット量Ｂｇを検出するための発生ビッ
ト量検出器６２１と、検出された発生ビット量Ｂｇと与
えられた平均目標符号量Ｂａとの差を逐次累積し累積誤
差Ｄを計算するための累積誤差計算器６２２と、圧縮符
号化器６１０への割り当てビット量Ｂｔを制御するため
のビット割り当て器６２３と、前記映像信号により表さ
れたシーンの複雑さを示す符号化難易度（コンプレキシ
ティ）Ｘを発生ビット量Ｂｇに基づいて検出するための
符号化難易度検出器６２４と、符号化難易度Ｘの変化に
基づいてシーンの切り替わりを検出するための難易度変
化検出器６２５とで構成されている。符号化難易度Ｘ
は、発生ビット量Ｂｇとそのときの量子化ステップとに
基づいて計算される。ΔＸは、シーンの切り替わり時に
ビット割り当て器６２３へ供給される難易度変化検出信
号である。Ｄｉは累積誤差の初期値を、Ｄｍａｘは累積
誤差の最大値をそれぞれ表している。ここで、累積誤差
Ｄとその最大値Ｄｍａｘとの差を余裕量Ｍとする。累積
誤差Ｄ及び符号化難易度Ｘは、例えば１ＧＯＰ（group
of pictures）期間毎に計算される。The code amount controller 620 is a means for controlling the generated code amount, and includes a generated bit amount detector 621 for detecting the generated bit amount Bg of the coded bit stream every predetermined period, A cumulative error calculator 622 for sequentially accumulating a difference between the generated generated bit amount Bg and a given average target code amount Ba and calculating a cumulative error D, and a bit amount Bt allocated to the compression encoder 610. And a coding difficulty detector for detecting a coding difficulty (complexity) X indicating the complexity of the scene represented by the video signal based on the generated bit amount Bg. 624 and a difficulty change detector 625 for detecting a scene change based on a change in the encoding difficulty X. Encoding difficulty X
Is calculated based on the generated bit amount Bg and the quantization step at that time. ΔX is a difficulty level change detection signal supplied to the bit allocator 623 when a scene is switched. Di represents the initial value of the accumulated error, and Dmax represents the maximum value of the accumulated error. Here, the difference between the accumulated error D and its maximum value Dmax is defined as a margin M. The accumulated error D and the encoding difficulty X are, for example, 1 GOP (group
of pictures) calculated for each period.

【００６３】解像度変更ユニット６３０は、ディジタル
信号である入力映像信号の解像度を変更して圧縮符号化
器６１０へ供給するための解像度変換器６３１と、３０
〜４０ＧＯＰ期間に渡って符号化難易度Ｘを累積して累
積難易度ＡＸを計算するための累積難易度計算器６３２
と、累積誤差Ｄの大きさと累積難易度ＡＸの大きさとの
うちの少なくとも一方に応じて解像度を決定するための
解像度決定器６３３とで構成されている。解像度変換器
６３１は、入力映像信号の元の解像度が例えば水平７０
４画素×垂直４８０画素であるときに、そのまま７０４
×４８０で出力する高解像度モードと、水平解像度を半
分にして３５２×４８０で出力する低解像度モードとを
持つ。解像度変更ユニット６３０は、累積誤差Ｄが予め
定められたしきい値Ｄｔｈ＿Ｕを超えて大きくなると３
５２×４８０の低解像度モードで映像信号を出力し、そ
の後に累積誤差ＤがＤｔｈ＿Ｕより小さいしきい値Ｄｔ
ｈ＿Ｌを下回ると映像信号の解像度を元の高解像度（７
０４×４８０）に戻す機能を有する。また、この解像度
変更ユニット６３０は、累積難易度ＡＸが予め定められ
たしきい値ＡＸｔｈ＿Ｕを超えて大きくなると３５２×
４８０の低解像度モードで映像信号を出力し、その後に
累積難易度ＡＸがＡＸｔｈ＿Ｕより小さいしきい値ＡＸ
ｔｈ＿Ｌを下回ると映像信号の解像度を元の高解像度
（７０４×４８０）に戻す機能をも有する。The resolution changing unit 630 changes the resolution of the input video signal, which is a digital signal, and supplies it to the compression encoder 610.
Cumulative difficulty calculator 632 for accumulating encoding difficulty X and calculating cumulative difficulty AX over ＯＰ40 GOP period
And a resolution determiner 633 for determining a resolution according to at least one of the magnitude of the accumulated error D and the magnitude of the accumulated difficulty AX. The resolution converter 631 determines that the original resolution of the input video signal is horizontal 70, for example.
When 4 pixels × vertical 480 pixels, 704
It has a high-resolution mode for outputting at × 480 and a low-resolution mode for outputting at 352 × 480 by halving the horizontal resolution. The resolution changing unit 630 outputs 3 when the cumulative error D exceeds a predetermined threshold value Dth_U.
A video signal is output in a low resolution mode of 52 × 480, and thereafter, a threshold Dt in which the accumulated error D is smaller than Dth_U
h_L, the resolution of the video signal is reduced to the original high resolution (7
04 × 480). When the cumulative difficulty AX exceeds a predetermined threshold AXth_U and increases, the resolution changing unit 630 determines that the cumulative difficulty AX becomes 352 ×
A video signal is output in the low resolution mode of 480, and then the threshold value AX at which the cumulative difficulty level AX is smaller than AXth_U
It also has a function of returning the resolution of the video signal to the original high resolution (704 × 480) when it falls below th_L.

【００６４】ビット割り当て器６２３は、シーンの切り
替わり直後には符号化難易度Ｘに応じて割り当てビット
量Ｂｔを新たに決定し、同一シーン内では直前の割り当
てビット量Ｂｔを逐次更新するようにフィードバック制
御を行う。すなわち、シーンの切り替わり直後のフレー
ムでは、ビット割り当て器６２３は、平均難易度Ｘａを
上回る符号化難易度Ｘを示す複雑なシーンに対しては平
均目標量Ｂａを上回るビット量を割り当てるように仮割
り当てビット量を決定し、かつこの時点で累積誤差Ｄが
所定量を上回る場合には、仮割り当てビット量を最大限
平均目標量Ｂａまで減ずるように累積誤差Ｄの大きさに
応じて仮割り当てビット量を補正することにより割り当
てビット量Ｂｔを決定する。ただし、平均難易度Ｘａを
下回る符号化難易度Ｘを示す単調なシーンには、平均目
標量Ｂａを下回るビット量Ｂｔが割り当てられる。同一
シーン内のフレームでは、ビット割り当て器６２３は、
累積誤差Ｄが予め定められた最大量Ｄｍａｘを上回らな
いように直前の割り当てビット量Ｂｔを逐次更新する。
この際、ビット割り当て器６２３は、累積誤差Ｄが最大
量Ｄｍａｘに近づくにつれて割り当てビット量Ｂｔが平
均目標量Ｂａに近づくように、更新された割り当てビッ
ト量Ｂｔを決定する。The bit allocator 623 newly determines the allocated bit amount Bt according to the encoding difficulty X immediately after the scene change, and performs feedback so as to sequentially update the immediately preceding allocated bit amount Bt within the same scene. Perform control. That is, in the frame immediately after the scene switching, the bit allocator 623 temporarily allocates a bit amount exceeding the average target amount Ba to a complex scene having the encoding difficulty X exceeding the average difficulty Xa. If the bit amount is determined and the accumulated error D exceeds the predetermined amount at this point, the provisionally assigned bit amount is set according to the magnitude of the accumulated error D so that the temporarily assigned bit amount is reduced to the maximum average target amount Ba. To determine the allocated bit amount Bt. However, a bit amount Bt smaller than the average target amount Ba is assigned to a monotonous scene indicating the encoding difficulty X lower than the average difficulty Xa. For frames in the same scene, the bit allocator 623
The immediately preceding allocated bit amount Bt is sequentially updated so that the accumulated error D does not exceed a predetermined maximum amount Dmax.
At this time, the bit allocator 623 determines the updated allocated bit amount Bt such that the allocated bit amount Bt approaches the average target amount Ba as the cumulative error D approaches the maximum amount Dmax.

【００６５】図２２（ａ）〜図２２（ｃ）は、図２１の
画像符号化装置において解像度変更なしとした場合の動
作例を表しており、符号化難易度の高いシーンが続いた
ものと仮定している。図２２（ａ）は符号化難易度Ｘ
の、図２２（ｂ）は割り当てビット量Ｂｔの、図２２
（ｃ）は累積誤差Ｄの各々の時間推移を示す。ここで
は、累積誤差の初期値Ｄｉ＝０、したがって初期状態で
の余裕量ＭはＤｍａｘに設定されているものとする。FIGS. 22 (a) to 22 (c) show an operation example in the case where the resolution is not changed in the image coding apparatus of FIG. 21, and it is assumed that scenes with high coding difficulty continue. I assume. FIG. 22A shows the encoding difficulty X.
FIG. 22B shows the allocation bit amount Bt of FIG.
(C) shows the time transition of each of the accumulated errors D. Here, it is assumed that the initial value Di = 0 of the accumulated error, and therefore the margin M in the initial state is set to Dmax.

【００６６】図２２（ｂ）及び図２２（ｃ）によれば、
余裕量Ｍが少なくなるにつれて割り当てビット量Ｂｔは
しだいに減少していく。このまま、累積誤差ＤがＤｍａ
ｘに達する、つまり余裕量Ｍがゼロになると、たとえ符
号化難易度Ｘが高いシーンであっても平均目標量Ｂａし
か与えない。したがって、総符号量が所定量を超えるこ
とがないため、所定量の中での記録時間が保証され、か
つ余裕量Ｍがなくなった場合にも固定ビットレートと同
等の画質は保証できる。According to FIGS. 22 (b) and 22 (c),
As the margin M decreases, the allocated bit amount Bt gradually decreases. As it is, the accumulated error D becomes Dma
When x reaches x, that is, when the margin M becomes zero, only the average target amount Ba is given even in a scene where the encoding difficulty X is high. Therefore, since the total code amount does not exceed the predetermined amount, the recording time within the predetermined amount is guaranteed, and the image quality equivalent to the fixed bit rate can be guaranteed even when the margin amount M disappears.

【００６７】しかしながら、余裕量Ｍがなくなってから
は固定ビットレート制御に等しくなってしまうため、入
力映像信号の各シーン毎の符号化難易度に応じた可変ビ
ットレート制御を行うことはできない。そのため、この
ような状態で符号化難易度の高い映像信号が入力される
と十分な符号量を与えることができず、ブロックノイ
ズ、モスキートノイズといったＤＣＴ符号化処理特有の
符号化ノイズが発生し、画質が著しく低下してしまう。However, since the bit rate becomes equal to the fixed bit rate control after the margin M disappears, it is not possible to perform the variable bit rate control according to the encoding difficulty of each scene of the input video signal. Therefore, if a video signal having a high degree of coding difficulty is input in such a state, a sufficient coding amount cannot be provided, and coding noise peculiar to DCT coding processing such as block noise and mosquito noise is generated. The image quality is significantly reduced.

【００６８】そこで、本実施形態では、余裕量Ｍが少な
くなり可変ビットレート制御を続けていくのが困難な状
態になったときなどに、入力映像信号の解像度を低下さ
せることとした。このようにして発生符号量を抑え気味
に推移させることで、継続して可変ビットレート制御が
行えるようにする。Therefore, in the present embodiment, the resolution of the input video signal is reduced, for example, when the margin M decreases and it becomes difficult to continue the variable bit rate control. In this way, the generated code amount is suppressed and slightly changed, so that the variable bit rate control can be continuously performed.

【００６９】例えば、累積誤差Ｄが増加していき最大累
積誤差Ｄｍａｘに近くなり余裕量Ｍ（＝Ｄｍａｘ−Ｄ）
が小さくなってくると、入力映像信号の解像度を低下さ
せ符号化難易度Ｘを下げることで、元々の解像度で符号
化を行った場合に比べて発生ビット量Ｂｇを低く抑える
ことができ、余裕量Ｍがそれ以上減少するのを抑制する
ことが可能となる。また、完全に余裕量Ｍがなくなって
おらず、継続して入力映像信号の持つ符号化難易度に応
じた可変ビットレート制御を行うことが可能なため、ブ
ロックノイズ、モスキートノイズといったＤＣＴ符号化
処理特有の符号化ノイズの抑制に効果がある。For example, the cumulative error D increases and approaches the maximum cumulative error Dmax, and the margin M (= Dmax−D)
Becomes smaller, the resolution of the input video signal is reduced and the encoding difficulty X is reduced, so that the generated bit amount Bg can be suppressed lower than when encoding is performed at the original resolution. It is possible to suppress the amount M from further decreasing. Further, since the margin M has not completely disappeared and the variable bit rate control can be continuously performed according to the encoding difficulty of the input video signal, DCT encoding processing such as block noise and mosquito noise is performed. This is effective in suppressing specific coding noise.

【００７０】図２３（ａ）〜図２３（ｃ）は、図２１の
画像符号化装置における累積誤差Ｄに基づく解像度変更
の効果を表している。ここでは、Ｄｔｈ＿Ｕ＝Ｄｍａｘ
×３／４であり、Ｄｔｈ＿Ｌ＝Ｄｍａｘ／２であるもの
としている。また、入力映像信号のモデルは図２２
（ａ）と同じものとした。ただし、低解像度に変換され
符号化を行っている時刻ｔ１からｔ２の間では、発生ビ
ット量Ｂｇから検出される符号化難易度Ｘは、元の解像
度のときに比べ約７０％程度に抑えられるものとした。FIGS. 23 (a) to 23 (c) show the effect of changing the resolution based on the accumulated error D in the image coding apparatus of FIG. Here, Dth_U = Dmax
× 3/4, and Dth_L = Dmax / 2. The model of the input video signal is shown in FIG.
Same as (a). However, between times t1 and t2 when the conversion is performed to the low resolution and the encoding is performed, the encoding difficulty X detected from the generated bit amount Bg is suppressed to about 70% as compared with the original resolution. It was taken.

【００７１】図２３（ａ）及び図２３（ｃ）によると、
時刻０〜ｔ１の間、入力映像信号は符号化難易度Ｘの高
い映像シーンが続いており、累積誤差Ｄは単調に増加し
ていき、時刻ｔ１付近でしきい値Ｄｔｈ＿Ｕを超える。
この時刻ｔ１において、解像度変更ユニット６３０によ
り入力映像信号の解像度が元の７０４×４８０から低解
像度モードの３５２×４８０に切り替えられる。According to FIGS. 23 (a) and 23 (c),
Between time 0 and time t1, the input video signal continues a video scene with a high encoding difficulty X, and the accumulated error D monotonically increases, and exceeds the threshold value Dth_U near time t1.
At this time t1, the resolution of the input video signal is switched from the original 704 × 480 to the low-resolution mode 352 × 480 by the resolution changing unit 630.

【００７２】低解像度モード時の入力映像信号の持つ符
号化難易度Ｘは、高解像度モードのときに比較して相対
的に低くなり、図２３（ａ）に示すような推移を示すこ
とになる。この期間に、累積誤差Ｄは、だいたい減少す
る方向に変化し、時刻ｔ２付近でしきい値Ｄｔｈ＿Ｌを
下回る。このとき、解像度変更ユニット６３０は低解像
度モードの３５２×４８０から元の７０４×４８０に戻
す。The encoding difficulty X of the input video signal in the low resolution mode is relatively lower than that in the high resolution mode, and shows a transition as shown in FIG. . During this period, the accumulated error D generally changes in a decreasing direction, and falls below the threshold value Dth_L near time t2. At this time, the resolution changing unit 630 returns from the low-resolution mode of 352 × 480 to the original 704 × 480.

【００７３】図２３（ｃ）には、解像度変更なしの場合
の累積誤差Ｄの推移を合わせて示しているが、時刻ｔ２
での本実施形態の累積誤差Ｄは図２２（ｃ）と比較して
十分な余裕量Ｍを持った状態となり、この時刻以降のビ
ット割り当てにおいて、入力映像信号の持つ符号化難易
度Ｘに応じた最適な割り当てビットＢｔが求められてい
ることが、図２３（ｂ）の割り当てビット量Ｂｔの推移
から分かる。FIG. 23 (c) also shows the transition of the accumulated error D when the resolution is not changed.
In this embodiment, the accumulated error D in this embodiment has a sufficient margin M as compared with FIG. 22C. In the bit allocation after this time, the accumulated error D depends on the encoding difficulty X of the input video signal. It can be seen from the transition of the allocated bit amount Bt in FIG. 23B that the optimum allocated bit Bt is obtained.

【００７４】図２４（ａ）〜図２４（ｃ）は、図２１の
画像符号化装置における累積難易度ＡＸに基づく解像度
変更の効果を表している。累積誤差Ｄに基づく解像度変
更の場合と同様の効果が得られることが分かる。FIGS. 24A to 24C show the effect of changing the resolution based on the cumulative difficulty level AX in the image coding apparatus of FIG. It can be seen that the same effect as in the case of changing the resolution based on the accumulated error D can be obtained.

【００７５】以上のように第６の実施形態の画像符号化
装置では、入力映像信号の解像度を累積誤差Ｄ又は累積
難易度ＡＸの大きさによって適応的に変化させ、入力映
像信号の持つ符号化難易度に応じた可変ビットレート制
御を継続して行うことを可能にし、符号化ノイズの抑制
に効果がある。しかも、解像度を低下させるしきい値
と、低下させた解像度を元に戻すしきい値の２種類のし
きい値を持ちヒステリシス特性を持たせることで、頻繁
に解像度が切り替わるのを抑えることできる。また、動
きの激しいシーンでは、解像度の低下が人の目に分かり
にくいため、解像度低下による悪影響は最低限に抑えら
れる。As described above, in the image coding apparatus according to the sixth embodiment, the resolution of the input video signal is adaptively changed according to the magnitude of the cumulative error D or the cumulative difficulty AX, and the coding of the input video signal is performed. This makes it possible to continuously perform variable bit rate control according to the degree of difficulty, and is effective in suppressing coding noise. Moreover, by providing two types of thresholds, that is, a threshold value for lowering the resolution and a threshold value for restoring the lowered resolution, and having a hysteresis characteristic, it is possible to suppress frequent switching of the resolution. In addition, in a scene where movement is intense, a decrease in resolution is difficult to be recognized by human eyes, so that an adverse effect due to the decrease in resolution is minimized.

【００７６】なお、本実施形態の解像度変換器６３１で
は２つの解像度の間を切り替えるものとしたが、３種類
以上の解像度の間を切り替えるようにしても構わない。
累積誤差Ｄの大きさと累積難易度ＡＸの大きさとを適宜
組み合わせることで、解像度を切り替える判断を行って
も良い。例えば、低い解像度への切り替えは累積誤差Ｄ
の大きさによって判断し、高い解像度へ戻す判断は累積
難易度ＡＸの大きさによって行うといった方法である。
また、２つの判断条件の両方を満たす、又は、どちらか
一方の条件を満たすなどとしても何ら問題はない。解像
度変換器６３１をアナログ入力の標本化器に置き換える
ことも可能である。Although the resolution converter 631 of this embodiment switches between two resolutions, it may switch between three or more resolutions.
The determination of switching the resolution may be made by appropriately combining the magnitude of the accumulated error D and the magnitude of the accumulated difficulty AX. For example, switching to a lower resolution requires a cumulative error D
Is determined based on the magnitude of the cumulative difficulty AX.
In addition, there is no problem even if both of the two determination conditions are satisfied or one of the two conditions is satisfied. It is also possible to replace the resolution converter 631 with an analog input sampler.

【００７７】（第７の実施形態）本発明の第７の実施形
態を図２５を用いて説明する。本実施形態は、解像度の
切り替え点が人の目に認識されにくい解像度切り替え手
段を持つ画像符号化装置を提供するものである。図２５
において、６４０は静止シーン検出器を、６４１はシー
ンチェンジ検出器をそれぞれ表している。(Seventh Embodiment) A seventh embodiment of the present invention will be described with reference to FIG. The present embodiment provides an image coding apparatus having a resolution switching unit that makes it difficult for the switching point of the resolution to be recognized by human eyes. FIG.
In, 640 represents a still scene detector, and 641 represents a scene change detector.

【００７８】静止シーン検出器６４０は、入力映像信号
から静止シーンであることを検出する機能を有し、静止
シーンが検出されると解像度変更ユニット６３０に対し
通知を行う。シーンチェンジ検出器６４１は、入力映像
信号の場面の切り替わりであるシーンチェンジを検出す
る機能を有し、シーンチェンジが検出されると解像度変
更ユニット６３０に対し通知を行う。静止シーンの検出
やシーンチェンジの検出は、フレーム間画素差分や輝
度、色差レベルの変化などの特徴量を用いる方法などあ
るが、どんな方法であっても良い。The still scene detector 640 has a function of detecting a still scene from an input video signal, and notifies the resolution changing unit 630 when a still scene is detected. The scene change detector 641 has a function of detecting a scene change, which is a scene change of an input video signal, and notifies the resolution change unit 630 when a scene change is detected. Detection of a still scene and detection of a scene change include a method using a feature amount such as a pixel difference between frames, a change in luminance and a color difference level, and any method may be used.

【００７９】解像度変更ユニット６３０は、符号量制御
器６２０から与えられた情報を判断条件とし、解像度の
切り替えの必要があった場合、すぐに解像度の切り替え
を実施するのではなく、静止シーン検出器６４０から通
知される静止シーン情報又はシーンチェンジ検出器６４
１から通知されるシーンチェンジ情報を待ち、これらに
同期して切り替えを行うようにする。The resolution change unit 630 uses the information provided from the code amount controller 620 as a determination condition, and when it is necessary to switch the resolution, does not immediately switch the resolution, but uses a still scene detector. Still scene information or scene change detector 64 notified from 640
Wait for the scene change information notified from 1 and switch in synchronization with them.

【００８０】以上のように第７の実施形態の画像符号化
装置では、解像度を切り替える必要があるとき、静止シ
ーンやシーンチェンジに同期して切り替えを行うように
することで、解像度の切り替わりが人の目に判別しにく
くすることが可能となる。As described above, in the image encoding apparatus according to the seventh embodiment, when the resolution needs to be switched, the switching is performed in synchronization with a still scene or a scene change. It is possible to make it difficult for the eyes to discriminate.

【００８１】なお、本実施形態では静止シーン及びシー
ンチェンジの両方を用いるようにしたが、片方だけ用い
るようにしても何ら問題はない。In this embodiment, both the still scene and the scene change are used, but there is no problem if only one of them is used.

【００８２】（第８の実施形態）本発明の第８の実施形
態を図２６を用いて説明する。本実施形態は、生成した
ＡＶストリームをその管理情報と併せてＤＶＤ−ＲＡＭ
などの蓄積メディア６５４に書き込む際に、ＡＶストリ
ームと管理情報との整合が取り易く、また管理情報のサ
イズが大きくなり過ぎることのない画像符号化装置を提
供するものである。図２６は、圧縮符号化器６１０、符
号量制御器６２０、解像度変更ユニット６３０、パラメ
ータ管理ユニット６５３から構成された画像符号化装置
のブロック図である。圧縮符号化器６１０は、ＧＯＰ構
造決定器６５０、ＶＯＢ切り替え器６５１、ＭＰＥＧ符
号化器６５２を備えている。(Eighth Embodiment) An eighth embodiment of the present invention will be described with reference to FIG. In the present embodiment, the generated AV stream is stored in a DVD-RAM together with its management information.
It is an object of the present invention to provide an image encoding apparatus that can easily match the AV stream with the management information when writing to the storage medium 654 such as the storage medium 654 and does not make the size of the management information too large. FIG. 26 is a block diagram of an image encoding device including a compression encoder 610, a code amount controller 620, a resolution change unit 630, and a parameter management unit 653. The compression encoder 610 includes a GOP structure determiner 650, a VOB switch 651, and an MPEG encoder 652.

【００８３】ＭＰＥＧ２規格においては、解像度の異な
る映像信号の符号化ビットストリームは区別され、同一
のシーケンスとして扱うことは許されていない。つま
り、解像度変更点の前後で別々のシーケンスとして符号
化ビットストリームを生成しなければならない。In the MPEG2 standard, encoded bit streams of video signals having different resolutions are distinguished from each other, and are not allowed to be treated as the same sequence. That is, the encoded bit stream must be generated as a separate sequence before and after the resolution change point.

【００８４】＜ＧＯＰの分割及びシーケンスの分割＞図
２６において、ＧＯＰ構造決定器６５０は、ＭＰＥＧ符
号化器６５２におけるＭＰＥＧ符号化の際のＧＯＰ構造
を決定する機能を持ち、また、解像度変更ユニット６３
０によって解像度変更が実施されるのに同期して、ＧＯ
Ｐを切断し新たなＧＯＰを開始する機能を持つ。図２７
（ａ）〜図２７（ｃ）にこのときの動作を示す。図２７
（ａ）は、Ｍ＝３、Ｎ＝１５（ＭはＩ又はＰフレームの
現れる周期、ＮはＧＯＰのフレーム数）で通常の動作を
しているときの状態で、図２７（ｂ）及び図２７（ｃ）
は、フレーム１０の前で解像度変更が行われたときのＧ
ＯＰ構造の変更例をそれぞれ示している。<Division of GOP and Division of Sequence> In FIG. 26, a GOP structure determiner 650 has a function of determining the GOP structure in MPEG encoding in the MPEG encoder 652, and the resolution changing unit 63
0 in synchronization with the resolution change being implemented,
It has a function to disconnect P and start a new GOP. FIG.
(A) to FIG. 27 (c) show the operation at this time. FIG.
FIG. 27 (b) and FIG. 27 (b) show a state in which a normal operation is performed at M = 3, N = 15 (M is a period in which an I or P frame appears, and N is the number of frames of a GOP). 27 (c)
Is G when the resolution is changed before frame 10.
Each of the OP structure changes is shown.

【００８５】図２７（ｂ）では、解像度変更後に新たに
始まるＧＯＰを、前のＧＯＰを参照しないクローズドＧ
ＯＰとする。具体的には、解像度変更後に新たに始まる
ＧＯＰの表示順序で最初のＩフレームより前にある２枚
のフレーム（フレーム１０及び１１）を、予測符号化に
際して前ＧＯＰに属するＰフレーム（フレーム９）を参
照しないように符号化することで、解像度変更前後のＧ
ＯＰ間に相関がないようにする。In FIG. 27B, a GOP newly started after the resolution change is set to a closed GOP that does not refer to the previous GOP.
OP. More specifically, two frames (frames 10 and 11) preceding the first I frame in the display order of a GOP newly started after the resolution change are replaced with P frames (frame 9) belonging to the previous GOP at the time of predictive encoding. Is encoded so as not to refer to the G before and after the resolution change.
There is no correlation between OPs.

【００８６】図２７（ｃ）では、解像度変更後に新たに
始まるＧＯＰを表示順序でＩフレームから始めること
で、同様に直前のＧＯＰとの相関がないようにする。In FIG. 27C, the GOP newly started after the resolution change is started from the I frame in the display order, so that there is no correlation with the immediately preceding GOP.

【００８７】上記ＧＯＰ構造決定器６５０を含む画像符
号化装置によって生成されるＭＰＥＧ２ストリームは、
解像度変更点においてＧＯＰを分割することが可能で、
かつ、解像度変更点の前後のＧＯＰが互いに相関を持た
ない。したがって、解像度変更直前ＧＯＰのビットスト
リームの終端にシーケンス終了コード（Ｓｅｑｕｅｎｃ
ｅ＿Ｅｎｄ＿Ｃｏｄｅ）を挿入し、新たに始まるＧＯＰ
の先頭にシーケンス開始コード（Ｓｅｑｕｅｎｃｅ＿Ｓ
ｔａｒｔ＿Ｃｏｄｅ）を挿入することが可能となり、解
像度変更点の前後で独立した２つの符号化ストリームシ
ーケンスに容易に分割することが可能となる。なお、シ
ーケンス開始コードについては、全てのＧＯＰの先頭に
挿入するようにしても特に問題はない。The MPEG2 stream generated by the image encoding device including the GOP structure determiner 650 includes:
GOP can be divided at the resolution change point,
In addition, the GOPs before and after the resolution change point have no correlation with each other. Therefore, a sequence end code (Sequenc) is placed at the end of the bit stream of the GOP immediately before the resolution change.
e_End_Code) and insert a new GOP
At the beginning of the sequence start code (Sequence_S
start_Code) can be inserted, and it can be easily divided into two independent coded stream sequences before and after the resolution change point. It should be noted that there is no particular problem if the sequence start code is inserted at the beginning of all GOPs.

【００８８】＜ＶＯＢの分割＞一連の映像及び音声など
の符号化ビットストリームがマルチプレクスされたＡＶ
ストリームをＤＶＤ−ＲＡＭなどの蓄積メディアに蓄積
する際に、これらＡＶストリームの管理単位としてＶＯ
Ｂ（video object）を考える。ＶＯＢは、その管理情報
として映像データのフレーム周波数・解像度・アスペク
ト比などや、音声データの符号化モード・チャンネル数
・符号化レートなどのパラメータを持ち、これらの管理
情報は、これに属するＡＶストリームを復号化する際に
デコーダにて参照されるものである。<Division of VOB> An AV in which a coded bit stream of a series of video and audio is multiplexed
When a stream is stored in a storage medium such as a DVD-RAM, a VO
Consider B (video object). The VOB has parameters such as the frame frequency, resolution, and aspect ratio of video data as its management information and the coding mode, number of channels, and coding rate of audio data. Are referred to by the decoder when decoding is performed.

【００８９】解像度変更ユニット６３０によって映像信
号の解像度が変更されると、映像データのパラメータが
変化することになり、ＶＯＢの分割が必要になる。When the resolution of the video signal is changed by the resolution changing unit 630, the parameters of the video data change, and VOB division is required.

【００９０】図２６において、ＶＯＢ切り替え器６５１
は、解像度変更ユニット６３０に対して解像度変更の許
可を与える機能と、実際に解像度変更が行われたとき
に、変更後の映像のパラメータとともにＶＯＢ分割実施
の旨の通知をパラメータ管理ユニット６５３に対して行
うという２つの機能を併せ持つ。パラメータ管理ユニッ
ト６５３は、作成されたＡＶストリームに対応する各パ
ラメータを蓄積メディア６５４に書き込む機能を持つ。Referring to FIG. 26, VOB switch 651
Is a function for giving the resolution change permission to the resolution change unit 630 and, when the resolution is actually changed, a notification of the VOB division execution together with the changed video parameters to the parameter management unit 653. It has both functions of performing The parameter management unit 653 has a function of writing each parameter corresponding to the created AV stream to the storage medium 654.

【００９１】上記ＶＯＢ切り替え器６５１を含む画像符
号化装置によって生成されるＭＰＥＧ２ストリームは、
解像度変更点においてＶＯＢを分割することが可能とな
る。したがって、解像度変更点の前後でＧＯＰが独立か
つＶＯＢが異なる符号化ビットストリームの生成が可能
となる。また、シーケンス分割で説明したシーケンス終
了コードをもってＶＯＢの終了を明示することが可能と
なる。The MPEG2 stream generated by the image coding device including the VOB switch 651 is:
The VOB can be divided at the resolution change point. Therefore, it is possible to generate an encoded bit stream having independent GOPs and different VOBs before and after the resolution change point. Further, the end of the VOB can be clearly indicated by the sequence end code described in the sequence division.

【００９２】＜ＶＯＢ分割の制限＞ＶＯＢを分割するこ
とで管理情報はどんどん増えていくため、例えば１枚の
ＤＶＤ−ＲＡＭディスクに記録可能なＶＯＢの最大数
（ＶＯＢ＿Ｍａｘ）を予め定めておく必要がある。<Restriction on VOB division> Since the management information is steadily increased by dividing the VOB, for example, the maximum number of VOBs (VOB_Max) that can be recorded on one DVD-RAM disk must be determined in advance. is there.

【００９３】ＶＯＢ切り替え器６５１は、ＶＯＢ数（Ｖ
ＯＢ＿Ｎｕｍ）をカウントする機能を備え、予め定めら
れたＶＯＢ最大数ＶＯＢ＿Ｍａｘに達したら、それ以降
のＶＯＢの切り替えを禁止する。こうすることで、管理
情報サイズを一定サイズ以下に抑えることが可能とな
る。The VOB switch 651 outputs the number of VOBs (V
OB_Num), and when a predetermined maximum number of VOBs VOB_Max is reached, switching of VOBs thereafter is prohibited. This makes it possible to suppress the management information size to a certain size or less.

【００９４】また、蓄積メディア６５４における記録媒
体の総容量（Ｖｏｌｕｍｅ）と符号化ビットストリーム
の目標平均レート（Ｒｓ）とから、Ｔｓ＝Ｖｏｌｕｍｅ／Ｒｓにより総記録時間（Ｔｓ）を求め、これをＶＯＢ最大数
ＶＯＢ＿Ｍａｘで割り、Ｔｍｉｎ＝Ｔｓ／ＶＯＢ＿Ｍａｘにより最小ＶＯＢ時間（Ｔｍｉｎ）を求める。ＶＯＢ切
り替え器６５１は、新たなＶＯＢが開始されてから、Ｔ
ｍｉｎの期間はＶＯＢを再度切り替えることを禁止する
ようにする。こうすることでも、本画像符号化装置にお
いて生成されるＶＯＢ総数がＶＯＢ最大数ＶＯＢ＿Ｍａ
ｘを超えないように制限することが可能となり、管理情
報サイズを一定サイズ以下に抑えることが可能となる。Further, from the total capacity (Volume) of the recording medium in the storage medium 654 and the target average rate (Rs) of the coded bit stream, a total recording time (Ts) is obtained by Ts = Volume / Rs, and this is calculated as VOB. Divide by the maximum number VOB_Max, and find the minimum VOB time (Tmin) by Tmin = Ts / VOB_Max. After a new VOB is started, the VOB switch 651
During the min period, switching of the VOB again is prohibited. By doing so, the total number of VOBs generated in the present image encoding device is the maximum number of VOBs VOB_Ma.
x can be limited so as not to exceed x, and the management information size can be suppressed to a certain size or less.

【００９５】例えば、Ｖｏｌｕｍｅ＝４．７ギガバイト
の容量を持つＤＶＤ−ＲＡＭに目標平均レートＲｓ＝５
Ｍｂｐｓで映像・音声を記録する場合、Ｔｓ＝７５２０
秒となる。このとき、ＶＯＢ最大数ＶＯＢ＿Ｍａｘを９
９９とすると、Ｔｍｉｎ＝７．５２秒となる。つまり、
ＶＯＢ切り替え後７．５２秒の間は再度ＶＯＢの切り替
えが行われないようにする。For example, a target average rate Rs = 5 in a DVD-RAM having a volume of 4.7 gigabytes.
When recording video / audio at Mbps, Ts = 7520
Seconds. At this time, the maximum number of VOBs VOB_Max is set to 9
If it is 99, Tmin = 7.52 seconds. That is,
VOB switching is not performed again for 7.52 seconds after VOB switching.

【００９６】なお、１つの記録媒体の中に目標平均レー
トの異なる複数のＶＯＢを記録する場合は、許容される
最も低い目標平均レートで総記録時間Ｔｓを求め、これ
を元に最小ＶＯＢ時間Ｔｍｉｎを求めるようにすること
で、記録媒体全体でのＶＯＢ総数の制限は十分保証でき
る。When a plurality of VOBs having different target average rates are recorded in one recording medium, the total recording time Ts is obtained at the lowest allowable target average rate, and the minimum VOB time Tmin is calculated based on the total recording time Ts. Is obtained, it is possible to sufficiently guarantee the limitation of the total number of VOBs in the entire recording medium.

【００９７】また、既にＡＶストリームが記録されてい
る記録媒体に追加してＡＶストリームを記録しようとす
る場合は、記録媒体の空き容量（Ｖｏｌｕｍｅ＿ｒｅｓ
ｔ）と符号化ビットストリームの目標平均レート（Ｒ
ｓ）から、Ｔｓ＿ｒｅｓｔ＝Ｖｏｌｕｍｅ＿ｒｅｓｔ／Ｒｓにより残記録時間（Ｔｓ＿ｒｅｓｔ）を求め、これをＶ
ＯＢ最大数ＶＯＢ＿Ｍａｘから既に記録されているＶＯ
Ｂ数を引いた残り（ＶＯＢ＿ａｖａｉｌ）で割り、Ｔｍｉｎ＝Ｔｓ＿ｒｅｓｔ／ＶＯＢ＿ａｖａｉｌにより最小ＶＯＢ時間Ｔｍｉｎを求める。ＶＯＢ切り替
え器６５１は、新たなＶＯＢが開始されてから、Ｔｍｉ
ｎの期間はＶＯＢを再度切り替えることを禁止するよう
にする。When an AV stream is to be recorded in addition to a recording medium on which an AV stream has already been recorded, the free space of the recording medium (Volume_res
t) and the target average rate (R
s), the remaining recording time (Ts_rest) is obtained from Ts_rest = Volume_rest / Rs, and
VO already recorded from the maximum number of OBs VOB_Max
The minimum VOB time Tmin is obtained from Tmin = Ts_rest / VOB_avail by dividing by the remainder (VOB_avail) after subtracting the B number. After a new VOB is started, the VOB switch 651
During the period of n, switching of the VOB again is prohibited.

【００９８】例えば、Ｖｏｌｕｍｅ＝４．７ギガバイト
の容量を持つＤＶＤ−ＲＡＭに目標平均レートＲｓ＝５
Ｍｂｐｓで映像・音声を記録する場合を想定する。既に
１時間分の映像・音声が１００個のＶＯＢで記録されて
いたとすると、Ｖｏｌｕｍｅ＿ｒｅｓｔ＝２．４５ギガ
バイト、Ｔｓ＿ｒｅｓｔ＝３９２０秒、ＶＯＢ＿ａｖａ
ｉｌ＝８９９個となり、Ｔｍｉｎ＝４．３６秒となる。
つまり、ＶＯＢ切り替え後４．３６秒の間は再度ＶＯＢ
の切り替えが行われないようにする。For example, a target average rate Rs = 5 in a DVD-RAM having a volume of 4.7 gigabytes.
It is assumed that video and audio are recorded at Mbps. Assuming that video and audio for one hour have already been recorded in 100 VOBs, Volume_rest = 2.45 gigabytes, Ts_rest = 3920 seconds, VOB_ava
il = 899, and Tmin = 4.36 seconds.
In other words, for 4.36 seconds after VOB switching,
Switch is not performed.

【００９９】なお、ＶＯＢ＿ａｖａｉｌは、上記のよう
にＶＯＢ＿Ｍａｘから既に使用されているＶＯＢ数を引
いた値でも良いし、この値と総記録時間に対する残記録
時間の比をＶＯＢ＿Ｍａｘに乗じて得られる値（上記の
例では、９９９×３９２０／７５２０＝５２０）とを比
較して小さい方の値を用いる（５２０個の方となる）よ
うにしても良い。VOB_avail may be a value obtained by subtracting the number of VOBs already used from VOB_Max as described above, or a value obtained by multiplying VOB_Max by the ratio of this value and the remaining recording time to the total recording time ( In the above example, 999 × 3920/7520 = 520) may be compared, and the smaller value may be used (520).

【０１００】ＶＯＢ数のカウントは、解像度変更による
ＶＯＢ分割で行うだけではなく、その他の要因、例えば
外部からの命令によってＶＯＢを分割した場合を含める
ようにしても問題はない。The counting of the number of VOBs is performed not only by the VOB division by changing the resolution but also by other factors, such as the case where the VOB is divided by an external command.

【０１０１】（第９の実施形態）本発明の第９の実施形
態を図２８を用いて説明する。図２８において、選択器
６６０は、解像度変更ユニット６３０からの解像度変更
要求に対して使用すべき解像度をリターンする機能を持
つ。この選択器６６０は、符号化ビットストリームの目
標平均レートを入力として、各目標平均レート毎に使用
できる解像度のテーブルを持つ。例えば、図２８に示し
たテーブル中のハッチングなしの解像度が選択可能であ
る。選択される解像度の範囲をこのように制限すること
で、目標平均レートに見合った解像度が選択されるよう
になり、解像度を高くし過ぎて符号量が多く出過ぎた
り、解像度を低くし過ぎて画質を損ねたりといったこと
を防ぐことが可能となる。(Ninth Embodiment) A ninth embodiment of the present invention will be described with reference to FIG. In FIG. 28, a selector 660 has a function of returning a resolution to be used in response to a resolution change request from the resolution change unit 630. The selector 660 has a table of resolutions that can be used for each target average rate with the target average rate of the coded bit stream as an input. For example, a resolution without hatching in the table shown in FIG. 28 can be selected. By limiting the range of resolutions selected in this way, the resolution that matches the target average rate will be selected, and if the resolution is set too high and the code amount will be too large, the resolution will be set too low and the image quality will be set too low. Can be prevented.

【０１０２】図２８の変形例を図２９に示す。ここで使
用する解像度は、操作者からの指定に従う。図２９の入
力器６６１は、操作者からの入力を受けて選択器６６０
に対して使用可能な解像度を通知する。例えば、操作者
により７０４×４８０の解像度だけ使用可能とする入力
があると、選択器６６０は解像度変更ユニット６３０か
ら解像度変更要求があったとしても、７０４×４８０以
外の解像度の選択を行わず、解像度変更は発生しない。
また、操作者により７０４×４８０と４８０×４８０の
２つの解像度が許可されていたとすると、選択器６６０
では、この２つの解像度の間で解像度の切り替えを行う
ように解像度変更ユニット６３０に対して指定を行うよ
うにする。このように操作者からの入力に応じて使用で
きる解像度を指定することで、操作者の好みに応じた解
像度の選択が可能となる。なお、操作者からの入力は、
具体的な解像度の指定ではなく、解像度変更を許可する
か否かの指定のみにすることも可能である。FIG. 29 shows a modification of FIG. The resolution used here depends on the designation from the operator. The input device 661 of FIG.
To the available resolutions. For example, if there is an input by the operator that only the resolution of 704 × 480 can be used, the selector 660 does not select a resolution other than 704 × 480 even if there is a resolution change request from the resolution change unit 630. No resolution change occurs.
If two resolutions of 704 × 480 and 480 × 480 are permitted by the operator, the selector 660
Then, the resolution change unit 630 is instructed to switch the resolution between the two resolutions. By specifying the resolution that can be used according to the input from the operator in this way, it is possible to select the resolution according to the preference of the operator. The input from the operator is
Instead of specifying a specific resolution, it is also possible to specify only whether or not to permit the resolution change.

【０１０３】以上のとおり、本発明の第１の実施形態に
よれば、入力映像信号に含まれる周波数成分を分析し、
その最高周波数成分がどの周波数帯域にあるかにより解
像度を決定する。この場合には、高い周波数成分を有す
るほど、高い解像度となるように入力映像信号の解像度
を決定する。そして入力映像信号をその決定した解像度
に変換、又は解像度に応じたサンプリング周波数で標本
化を行った後、圧縮符号化を行う。As described above, according to the first embodiment of the present invention, the frequency components included in the input video signal are analyzed,
The resolution is determined based on which frequency band the highest frequency component is in. In this case, the resolution of the input video signal is determined so that the higher the frequency component, the higher the resolution. Then, after converting the input video signal to the determined resolution or performing sampling at a sampling frequency corresponding to the resolution, compression encoding is performed.

【０１０４】また、本発明の第２の実施形態によれば、
入力映像信号に対して異なる遮断周波数を有する低域通
過フィルタを施し、そのフィルタ出力と予め定めておい
たしきい値を比較する。そして、しきい値よりも大きい
出力を出すフィルタの遮断周波数が高いほど、高い解像
度となるように入力映像信号の解像度を決定する。そし
て入力映像信号を決定した解像度に変換、又は解像度に
応じたサンプリング周波数で標本化を行った後、圧縮符
号化を行う。Further, according to the second embodiment of the present invention,
A low-pass filter having a different cutoff frequency is applied to the input video signal, and the output of the filter is compared with a predetermined threshold. Then, the resolution of the input video signal is determined so that the higher the cutoff frequency of the filter that outputs an output larger than the threshold, the higher the resolution. Then, after converting the input video signal to the determined resolution or performing sampling at a sampling frequency corresponding to the resolution, compression encoding is performed.

【０１０５】また、本発明の第３の実施形態によれば、
入力映像信号のアクティビティを計算し、そのアクティ
ビティの値によって解像度を決定する。この場合には、
アクティビティが大きいほど、高い解像度となるように
入力映像信号の解像度を決定する。そして入力映像信号
をその決定した解像度に変換、又は解像度に応じたサン
プリング周波数で標本化を行った後、圧縮符号化を行
う。Further, according to the third embodiment of the present invention,
The activity of the input video signal is calculated, and the resolution is determined according to the value of the activity. In this case,
The resolution of the input video signal is determined so that the higher the activity, the higher the resolution. Then, after converting the input video signal to the determined resolution or performing sampling at a sampling frequency corresponding to the resolution, compression encoding is performed.

【０１０６】また、本発明の第４の実施形態によれば、
入力映像信号のＤＣＴ係数ブロックを分析し、そのＤＣ
Ｔ係数の高域成分の分布によって解像度を決定する。こ
の場合には、高域成分を有するブロックが多いほど、高
い解像度となるように入力映像信号の解像度を決定す
る。そして入力映像信号の次のフレームをその決定した
解像度に変換、又は解像度に応じたサンプリング周波数
で標本化を行った後、圧縮符号化を行う。Further, according to the fourth embodiment of the present invention,
The DCT coefficient block of the input video signal is analyzed and its DC
The resolution is determined by the distribution of the high frequency component of the T coefficient. In this case, the resolution of the input video signal is determined so that the higher the number of blocks having high frequency components, the higher the resolution. Then, after the next frame of the input video signal is converted to the determined resolution or sampled at a sampling frequency according to the resolution, compression encoding is performed.

【０１０７】また、本発明の第５の実施形態によれば、
過去の所定期間の発生符号量及び量子化ステップからコ
ンプレキシティを計算し、そのコンプレキシティの大き
さによって解像度を決定する。この場合には、コンプレ
キシティが大きい、すなわち符号化難易度が高い映像で
あるほど、高い解像度となるように入力映像信号の解像
度を決定する。そして入力映像信号の次のフレームをそ
の決定した解像度に変換、又は解像度に応じたサンプリ
ング周波数で標本化を行った後、圧縮符号化を行う。Further, according to the fifth embodiment of the present invention,
The complexity is calculated from the generated code amount and the quantization step in the past predetermined period, and the resolution is determined according to the magnitude of the complexity. In this case, the resolution of the input video signal is determined so that the higher the complexity, that is, the higher the encoding difficulty, the higher the resolution. Then, after the next frame of the input video signal is converted to the determined resolution or sampled at a sampling frequency according to the resolution, compression encoding is performed.

【０１０８】したがって、本発明の第１〜第５の実施形
態に係る画像符号化装置を用いることにより、周波数成
分などの入力映像の特性に応じた解像度で入力映像を符
号化することができる。つまり、入力映像信号の特性に
応じて符号化するブロック数を減らすことができるた
め、少ない画質劣化で発生符号量を大きく削減すること
ができる。また、入力映像の符号化難易度に応じた解像
度で入力映像信号を符号化することができる。また、入
力映像信号の特性に応じた解像度変換が行われているこ
とから、解像度が切り替わった場合に、視覚的な劣化が
生じにくい。Therefore, by using the image encoding apparatuses according to the first to fifth embodiments of the present invention, it is possible to encode an input video with a resolution according to characteristics of the input video such as frequency components. That is, since the number of blocks to be coded can be reduced according to the characteristics of the input video signal, the amount of generated codes can be greatly reduced with little image quality deterioration. Further, the input video signal can be encoded at a resolution corresponding to the encoding difficulty of the input video. Further, since the resolution conversion is performed according to the characteristics of the input video signal, visual degradation hardly occurs when the resolution is switched.

【０１０９】特に、本発明の第１〜第３の実施形態によ
れば、符号化対象フレーム自体の特性に応じて解像度を
決定するため、例えば現在のフレームでシーンチェンジ
が起こったような場合であっても画質劣化が生じない。In particular, according to the first to third embodiments of the present invention, since the resolution is determined in accordance with the characteristics of the encoding target frame itself, for example, when a scene change occurs in the current frame, Even if there is, image quality does not deteriorate.

【０１１０】また、本発明の第６の実施形態によれば、
リアルタイムに記録時間を保証しながら可変レートの符
号化ビットストリームを生成するときに、入力映像信号
の符号化難易度に応じた可変ビット割り当てが困難な状
態になると、入力映像信号の解像度を低下させ引き続き
符号化難易度に応じた可変ビット割り当てを継続してい
くことを可能にし、ブロックノイズやモスキートノイズ
といったＤＣＴ符号化処理特有の符号化ノイズを低減さ
せ、画質を向上させることが可能となる。According to the sixth embodiment of the present invention,
When generating a variable rate coded bit stream while guaranteeing the recording time in real time, if it becomes difficult to assign variable bits according to the encoding difficulty of the input video signal, the resolution of the input video signal is reduced. It is possible to continue variable bit allocation according to the degree of difficulty of encoding, to reduce encoding noise peculiar to DCT encoding processing such as block noise and mosquito noise, and to improve image quality.

【０１１１】また、本発明の第７の実施形態によれば、
解像度の切り替えを入力映像信号のシーンチェンジや静
止に同期させるようにし、解像度の切り替わりが人の目
に認知されにくくすることが可能となる。According to the seventh embodiment of the present invention,
The switching of the resolution is synchronized with the scene change or the stillness of the input video signal, so that the switching of the resolution is hardly recognized by human eyes.

【０１１２】また、本発明の第８の実施形態によれば、
生成したＡＶストリームをその管理情報と併せてＤＶＤ
−ＲＡＭなどの蓄積メディアに書き込む際に、ＡＶスト
リームの管理単位であるＶＯＢを解像度の変更点に同期
して切り替えるようにし、ＡＶストリームと管理情報の
整合を取り易くし、また、管理情報サイズが大きくなり
過ぎることのないよう解像度切り替え回数を規定数以下
に抑えることが可能となる。According to the eighth embodiment of the present invention,
The generated AV stream is stored in a DVD together with its management information.
-When writing to a storage medium such as a RAM, a VOB, which is a management unit of an AV stream, is switched in synchronization with a change in resolution, thereby facilitating consistency between the AV stream and the management information. It is possible to suppress the number of resolution switching to a specified number or less so as not to become too large.

【０１１３】また、本発明の第９の実施形態によれば、
解像度切り替えに使用する解像度を目標平均レートや操
作者からの指定に応じて制限を設けることにより、符号
化条件や操作者の好みに応じた画質で符号化を行えると
いう効果がある。Further, according to the ninth embodiment of the present invention,
By setting a limit on the resolution to be used for switching the resolution in accordance with the target average rate and the designation from the operator, there is an effect that encoding can be performed with an image quality according to the encoding conditions and the preference of the operator.

【０１１４】[0114]

【発明の効果】以上のとおり、本発明によれば、入力映
像信号の特性に応じた解像度変更を行うこととしたの
で、画質劣化を最小限に抑えながら発生符号量を大きく
減らすことができる画像符号化装置を提供することがで
きる。As described above, according to the present invention, since the resolution is changed in accordance with the characteristics of the input video signal, the amount of generated codes can be greatly reduced while minimizing image quality deterioration. An encoding device can be provided.

[Brief description of the drawings]

【図１】本発明の第１の実施形態に係る画像符号化装置
のブロック図である。FIG. 1 is a block diagram of an image encoding device according to a first embodiment of the present invention.

【図２】（ａ）〜（ｄ）は選択可能な解像度の例を説明
するための模式図である。FIGS. 2A to 2D are schematic diagrams illustrating examples of selectable resolutions.

【図３】図１の画像符号化装置における解像度決定のフ
ローチャートである。FIG. 3 is a flowchart of resolution determination in the image encoding device of FIG. 1;

【図４】（ａ）及び（ｂ）は各々図３中の最高周波数成
分の定義を説明するためのグラフである。FIGS. 4A and 4B are graphs for explaining the definition of the highest frequency component in FIG. 3;

【図５】図１の画像符号化装置の変形例を示すブロック
図である。FIG. 5 is a block diagram showing a modified example of the image encoding device of FIG. 1;

【図６】本発明の第２の実施形態に係る画像符号化装置
のブロック図である。FIG. 6 is a block diagram of an image encoding device according to a second embodiment of the present invention.

【図７】図６中の２つのフィルタの各々の周波数特性の
例を示すグラフである。FIG. 7 is a graph showing an example of frequency characteristics of each of two filters in FIG. 6;

【図８】図６の画像符号化装置における解像度決定のフ
ローチャートである。FIG. 8 is a flowchart of resolution determination in the image encoding device of FIG. 6;

【図９】図６の画像符号化装置の変形例を示すブロック
図である。FIG. 9 is a block diagram illustrating a modification of the image encoding device of FIG. 6;

【図１０】本発明の第３の実施形態に係る画像符号化装
置のブロック図である。FIG. 10 is a block diagram of an image encoding device according to a third embodiment of the present invention.

【図１１】図１０の画像符号化装置における解像度決定
のフローチャートである。FIG. 11 is a flowchart of resolution determination in the image encoding device of FIG. 10;

【図１２】図１１中のアクティビティ決定ステップの詳
細フローチャートである。FIG. 12 is a detailed flowchart of an activity determination step in FIG. 11;

【図１３】図１０の画像符号化装置の変形例を示すブロ
ック図である。FIG. 13 is a block diagram illustrating a modification of the image encoding device in FIG. 10;

【図１４】本発明の第４の実施形態に係る画像符号化装
置のブロック図である。FIG. 14 is a block diagram of an image encoding device according to a fourth embodiment of the present invention.

【図１５】図１４中のＤＣＴ係数分析器の動作を説明す
るための、ＤＣＴ係数ブロックの模式図である。FIG. 15 is a schematic diagram of a DCT coefficient block for explaining the operation of the DCT coefficient analyzer in FIG.

【図１６】図１４の画像符号化装置における解像度決定
のフローチャートである。FIG. 16 is a flowchart of resolution determination in the image encoding device of FIG. 14;

【図１７】図１４の画像符号化装置の変形例を示すブロ
ック図である。FIG. 17 is a block diagram illustrating a modification of the image encoding device in FIG. 14;

【図１８】本発明の第５の実施形態に係る画像符号化装
置のブロック図である。FIG. 18 is a block diagram of an image encoding device according to a fifth embodiment of the present invention.

【図１９】図１８の画像符号化装置における解像度決定
のフローチャートである。19 is a flowchart of resolution determination in the image encoding device of FIG.

【図２０】図１８の画像符号化装置の変形例を示すブロ
ック図である。FIG. 20 is a block diagram illustrating a modification of the image encoding device in FIG. 18;

【図２１】本発明の第６の実施形態に係る画像符号化装
置のブロック図である。FIG. 21 is a block diagram of an image encoding device according to a sixth embodiment of the present invention.

【図２２】図２１の画像符号化装置において解像度変更
なしとした場合の動作例を表す図であり、（ａ）は符号
化難易度Ｘの、（ｂ）は割り当てビット量Ｂｔの、
（ｃ）は累積誤差Ｄの各々の時間推移を示す。22A and 22B are diagrams illustrating an operation example when the resolution is not changed in the image encoding device in FIG. 21, where FIG. 22A illustrates encoding difficulty X, FIG.
(C) shows the time transition of each of the accumulated errors D.

【図２３】図２１の画像符号化装置における解像度変更
ありの場合の動作例を表す図であり、（ａ）は符号化難
易度Ｘの、（ｂ）は割り当てビット量Ｂｔの、（ｃ）は
累積誤差Ｄの各々の時間推移を示す。23A and 23B are diagrams illustrating an operation example in the case where the resolution is changed in the image encoding device in FIG. 21, where FIG. 23A illustrates an encoding difficulty X, FIG. 23B illustrates an assigned bit amount Bt, and FIG. Indicates the time transition of each of the accumulated errors D.

【図２４】図２１の画像符号化装置における解像度変更
ありの場合の動作例を表す図であり、（ａ）は符号化難
易度Ｘの、（ｂ）は割り当てビット量Ｂｔの、（ｃ）は
累積難易度ＡＸの各々の時間推移を示す。24A and 24B are diagrams illustrating an operation example of the image encoding device in FIG. 21 when the resolution is changed, where FIG. 24A illustrates the encoding difficulty X, FIG. 24B illustrates the assigned bit amount Bt, and FIG. Indicates the time transition of each of the cumulative difficulty levels AX.

【図２５】本発明の第７の実施形態に係る画像符号化装
置のブロック図である。FIG. 25 is a block diagram of an image encoding device according to a seventh embodiment of the present invention.

【図２６】本発明の第８の実施形態に係る画像符号化装
置のブロック図である。FIG. 26 is a block diagram of an image encoding device according to an eighth embodiment of the present invention.

【図２７】図２６中のＧＯＰ構造決定器により決定され
るＧＯＰ構造例を表す図であり、（ａ）は通常の場合、
（ｂ）は新たに開始するＧＯＰをクローズドＧＯＰとし
た場合、（ｃ）は新たに開始するＧＯＰをＩフレーム開
始ＧＯＰとした場合をそれぞれ示す。27A and 27B are diagrams illustrating an example of a GOP structure determined by a GOP structure determiner in FIG. 26. FIG.
(B) shows a case where the newly started GOP is a closed GOP, and (c) shows a case where the newly started GOP is an I-frame start GOP.

【図２８】本発明の第９の実施形態に係る画像符号化装
置のブロック図である。FIG. 28 is a block diagram of an image encoding device according to a ninth embodiment of the present invention.

【図２９】図２８の画像符号化装置の変形例を示すブロ
ック図である。FIG. 29 is a block diagram illustrating a modification of the image encoding device in FIG. 28.

[Explanation of symbols]

１００遅延器１０１周波数帯域分析器１０２解像度決定器１０３解像度変換器１０３ａ標本化器１０４ブロック化器１０５ＤＣＴ変換器１０６量子化器１０７可変長符号化器１１０圧縮符号化器２０１，２０２フィルタ２０４，２０５出力値比較器２０７解像度決定器３０１アクティビティ測定器３０２解像度決定器４０１ＤＣＴ係数分析器４０２解像度決定器５０１平均量子化ステップ演算器５０２発生符号量計数器５０３解像度決定器６１０圧縮符号化器６２０符号量制御器６２１発生ビット量検出器６２２累積誤差計算器６２３ビット割り当て器６２４符号化難易度検出器６２５難易度変化検出器６３０解像度変更ユニット６３１解像度変換器６３２累積難易度計算器６３３解像度決定器６４０静止シーン検出器６４１シーンチェンジ検出器６５０ＧＯＰ構造決定器６５１ＶＯＢ切り替え器６５２ＭＰＥＧ符号化器６５３パラメータ管理ユニット６５４蓄積メディア６６０選択器６６１入力器ＡＸ累積難易度Ｂａ平均目標符号量Ｂｇ発生ビット量Ｂｔ割り当てビット量Ｄ累積誤差Ｄｉ初期累積誤差Ｄｍａｘ最大累積誤差Ｘ符号化難易度 ΔＸ難易度変化検出信号 REFERENCE SIGNS LIST 100 delay unit 101 frequency band analyzer 102 resolution determiner 103 resolution converter 103 a sampler 104 blocker 105 DCT converter 106 quantizer 107 variable length encoder 110 compression encoder 201, 202 filter 204, 205 Output value comparator 207 Resolution determiner 301 Activity measurer 302 Resolution determiner 401 DCT coefficient analyzer 402 Resolution determiner 501 Average quantization step calculator 502 Generated code amount counter 503 Resolution determiner 610 Compression encoder 620 Code amount Controller 621 Generated bit amount detector 622 Cumulative error calculator 623 Bit allocator 624 Encoding difficulty detector 625 Difficulty change detector 630 Resolution change unit 631 Resolution converter 632 Cumulative difficulty calculator 633 Resolution determiner 40 Still scene detector 641 Scene change detector 650 GOP structure determiner 651 VOB switch 652 MPEG encoder 653 Parameter management unit 654 Storage media 660 Selector 661 Input device AX Cumulative difficulty Ba Average target code amount Bg Generated bit amount Bt Allocated bit amount D Cumulative error Di Initial cumulative error Dmax Maximum cumulative error X Encoding difficulty ΔX Difficulty change detection signal

───────────────────────────────────────────────────── フロントページの続き (72)発明者福田秀樹大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者中村和彦大阪府門真市大字門真1006番地松下電器産業株式会社内 (72)発明者岩▲崎▼ 史朗大阪府門真市大字門真1006番地松下電器産業株式会社内Ｆターム(参考） 5C059 KK01 LB00 MA00 MA23 MC11 ME01 PP04 SS01 SS06 SS11 TA06 TB04 TC00 TC18 TC19 TD01 TD04 TD11 UA02 UA11 UA13 UA34 5J064 AA01 BA09 BA13 BA16 BC16 BC24 BD02 ──────────────────────────────────────────────────続き Continuing from the front page (72) Inventor Hideki Fukuda 1006 Kazuma Kadoma, Osaka Prefecture Matsushita Electric Industrial Co., Ltd. 72) Inventor Iwasaki Shiro 1006 Kazuma Kadoma, Osaka Prefecture F-term (reference) 5C059 KK01 LB00 MA00 MA23 MC11 ME01 PP04 SS01 SS06 SS11 TA06 TB04 TC00 TC18 TC19 TD01 TD04 TD11 UA11 UA11 UA13 UA34 5J064 AA01 BA09 BA13 BA16 BC16 BC24 BD02

Claims

[Claims]

1. A frequency band analyzing means for outputting a frequency component value of an input video signal, and detecting a highest frequency of the frequency component value output from the frequency band analyzing means, wherein the highest frequency is predetermined. Resolution determining means for determining a resolution according to which of the plurality of frequency bands belongs, resolution changing means for changing and outputting the resolution of the input video signal according to the determination of the resolution determining means, An image encoding apparatus, comprising: encoding means for compression-encoding the video signal output from the resolution changing means.

2. The image encoding apparatus according to claim 1, wherein the input video signal is a digital signal, and the resolution changing unit changes the number of pixels of the input video signal according to the determined resolution. An image encoding device comprising a resolution conversion unit for outputting.

3. The image encoding device according to claim 1, wherein the input video signal is an analog signal, and the resolution changing unit samples the input video signal at a sampling frequency according to the determined resolution. An image coding apparatus comprising a sampling means for converting a digital video signal into a digital video signal and outputting the digital video signal.

4. The image encoding device according to claim 1, wherein the resolution determining unit is configured to provide a higher resolution to the predetermined plurality of frequency bands as the highest frequency belongs to a higher frequency band. An image encoding apparatus for determining a resolution.

5. The image encoding apparatus according to claim 1, wherein the resolution determining means determines a threshold value of the frequency component value which is an output of the frequency band analysis means. An image coding apparatus for detecting a highest frequency which is equal to or higher than a value.

6. The image encoding apparatus according to claim 1, wherein the highest resolution determined by said resolution determining means is the same as the resolution of said input video signal.

7. A plurality of filter means for performing frequency filtering having different frequency characteristics on an input video signal, and comparing output values of the plurality of filter means with a predetermined threshold value, Comparing means for outputting a result; resolution determining means for determining a resolution from a comparison result of the comparing means; and resolution for changing and outputting the resolution of the input video signal in accordance with the determination of the resolution determining means. An image encoding apparatus, comprising: a changing unit; and an encoding unit for compressing and encoding a video signal output from the resolution changing unit.

8. The image encoding device according to claim 7, wherein the input video signal is a digital signal, and the resolution changing unit changes the number of pixels of the input video signal according to the determined resolution. An image encoding device comprising a resolution conversion unit for outputting.

9. The image encoding apparatus according to claim 7, wherein the input video signal is an analog signal, and the resolution changing unit samples the input video signal at a sampling frequency according to the determined resolution. An image coding apparatus comprising a sampling means for converting a digital video signal into a digital video signal and outputting the digital video signal.

10. The image encoding apparatus according to claim 7, wherein the plurality of filter units are low-pass filters each having a different cutoff frequency, and the resolution determining unit outputs an output of the plurality of filter units. An image encoding apparatus characterized in that, when there is a value larger than the predetermined threshold value, the resolution is determined such that the higher the cutoff frequency of the filter means, the higher the resolution.

11. The image encoding device according to claim 7, wherein the highest resolution determined by said resolution determining means is the same as the resolution of said input video signal.

12. An activity measuring means for obtaining a variance value of a pixel value with respect to an input video signal, and a resolution is determined by which of a predetermined range the variance value output from the activity measuring means belongs to. Resolution determining means for determining; resolution changing means for changing and outputting the resolution of the input video signal in accordance with the determination of the resolution determining means; and compression encoding of the video signal output from the resolution changing means. An image encoding apparatus comprising: an encoding unit for determining the resolution, wherein the resolution determining unit determines the resolution such that the higher the variance value output from the activity measuring unit, the higher the resolution. Image encoding device.

13. The image encoding device according to claim 12, wherein the input video signal is a digital signal, and wherein the resolution changing unit changes the number of pixels of the input video signal according to the determined resolution. An image encoding device comprising a resolution conversion unit for outputting.

14. The image encoding device according to claim 12, wherein the input video signal is an analog signal, and the resolution changing unit samples the input video signal at a sampling frequency according to the determined resolution. An image coding apparatus comprising a sampling means for converting a digital video signal into a digital video signal and outputting the digital video signal.

15. The image encoding apparatus according to claim 12, wherein said activity measuring means calculates and outputs a variance of pixel values for each predetermined unit, and said resolution determining means includes said activity measuring means. The average value or the maximum value of the variance value of each unit, which is the output of the unit, is determined over a predetermined video unit period, and the resolution is determined based on which of the predetermined ranges the average value or the maximum value belongs to. An image encoding device, characterized in that:

16. The image encoding apparatus according to claim 12, wherein the highest resolution determined by said resolution determining means is the same as the resolution of said input video signal.

17. A resolution changing means for changing and outputting a resolution of an input video signal, a block dividing means for dividing an output of the resolution changing means into blocks having a predetermined size, and Orthogonal transform means for orthogonally transforming the output of the means into a frequency domain; code string generating means for transforming the orthogonal transform coefficient output from the orthogonal transform means into a code string; and output of the orthogonal transform means. Orthogonal transform coefficient analyzing means for analyzing the frequency distribution of the orthogonal transform coefficient, and resolution determining means for determining the resolution used by the resolution changing means based on the analysis result output from the orthogonal transform coefficient analyzing means. An image encoding device, comprising:

18. The image encoding apparatus according to claim 17, wherein the input video signal is a digital signal, and wherein the resolution changing unit changes the number of pixels of the input video signal according to the determined resolution. An image encoding device comprising a resolution conversion unit for outputting.

19. The image encoding apparatus according to claim 17, wherein the input video signal is an analog signal, and the resolution changing unit samples the input video signal at a sampling frequency according to the determined resolution. An image coding apparatus comprising a sampling means for converting a digital video signal into a digital video signal and outputting the digital video signal.

20. The image coding apparatus according to claim 17, wherein the orthogonal transform coefficient analysis unit sets a plurality of zones obtained by dividing the orthogonal transform coefficient into frequency components, and outputs an output of the orthogonal transform unit. Counting which of the plurality of zones the highest frequency component of the orthogonal transform coefficient belongs to for each block over a predetermined video unit period, wherein the resolution determining means determines a resolution based on the counting result. An image encoding device characterized by the above-mentioned.

21. The image coding apparatus according to claim 20, wherein said resolution determining means determines the resolution such that the higher the count number of the high frequency component zone, the higher the resolution. Device.

22. The image encoding apparatus according to claim 17, wherein the highest resolution determined by said resolution determining means is the same as the resolution of said input video signal.

23. A resolution changing unit for changing the resolution of an input video signal and outputting the same, a block dividing unit for dividing an output of the resolution changing unit into blocks having a predetermined size, and the block division. Orthogonal transform means for orthogonally transforming the output of the means into a frequency domain; quantizing means for quantizing an orthogonal transform coefficient output from the orthogonal transform means using a variable quantization step; A code string generating means for converting an output of the means into a code string; a generated code amount counting means for obtaining and outputting a cumulative value of a code amount generated in the code string generating means within a predetermined period; Means for calculating and outputting an average value of the quantization step used in the quantization step within the predetermined period, and a generated code amount output from the generated code amount counting means. A complexity value, which is a product of the accumulated value and the average value of the quantization step output from the average quantization step calculation means, is determined, and the resolution used by the resolution changing means is determined based on the complexity value. And a resolution determining unit for determining the resolution, wherein the resolution determining unit is higher as the complexity value belongs to a larger value range among a plurality of predetermined ranges. An image coding apparatus characterized in that the resolution is determined so as to be the resolution.

24. The image encoding device according to claim 23, wherein the input video signal is a digital signal, and wherein the resolution changing unit changes the number of pixels of the input video signal according to the determined resolution. An image encoding device comprising a resolution conversion unit for outputting.

25. The image encoding device according to claim 23, wherein the input video signal is an analog signal, and the resolution changing unit samples the input video signal at a sampling frequency according to the determined resolution. An image coding apparatus comprising a sampling means for converting a digital video signal into a digital video signal and outputting the digital video signal.

26. The image encoding apparatus according to claim 23, wherein the highest resolution determined by said resolution determining means is the same as the resolution of said input video signal.

27. Compression encoding means for compressing and encoding an input video signal so as to output an encoded bit stream having a variable bit rate according to a variable allocated bit amount; A cumulative error calculating means for sequentially adding a difference between a generated bit amount for each predetermined period of the encoded bit stream and calculating a cumulative error; and Bit allocation means for controlling the amount of update, and resolution changing means for changing the resolution of the input video signal to low when the cumulative error exceeds a predetermined amount. Image encoding device.

28. The image coding apparatus according to claim 27, wherein said resolution changing means restores said resolution when said accumulated error falls below another specified amount smaller than said specified amount. Image encoding device.

29. The image encoding apparatus according to claim 27, wherein said resolution changing means selects one of three or more levels of resolution according to the magnitude of said accumulated error. Encoding device.

30. Compression encoding means for compressing and encoding an input video signal so as to output an encoded bit stream having a variable bit rate according to a variable allocated bit amount, and a given average target amount and A cumulative error calculating means for sequentially adding a difference between a generated bit amount for each predetermined period of the encoded bit stream and calculating a cumulative error; and Bit allocation means for controlling the amount of update; encoding difficulty detecting means for detecting the encoding difficulty indicating the complexity of the input video signal for each of the predetermined periods; and a plurality of encoding difficulty levels. Of the input video signal in accordance with at least one of the magnitude of the cumulative error and the magnitude of the cumulative difficulty. The image coding apparatus is characterized in that a resolution changing means for changing the Zodo.

31. The image encoding apparatus according to claim 30, wherein the resolution changing unit is configured to determine that the cumulative error exceeds a predetermined first specified amount or the cumulative difficulty is a predetermined amount. 2. An image coding apparatus according to claim 1, wherein the resolution of the input video signal is changed to a lower value when the specified amount exceeds 2.

32. The image encoding device according to claim 30, wherein the resolution changing unit is configured to determine that the accumulated error is smaller than a third designated amount smaller than the first designated amount, or the accumulated difficulty is smaller than the third designated amount. An image encoding apparatus, wherein the resolution is restored when a value falls below a fourth designated amount smaller than the second designated amount.

33. The image encoding device according to claim 30, wherein said resolution changing means changes the resolution of said input video signal to be low when said accumulated error exceeds a predetermined upper limit designated amount, and said accumulation difficulty is increased. An image encoding apparatus, wherein the resolution is restored when the degree falls below a predetermined lower limit specified amount.

34. A compression encoding unit for compressing and encoding an input video signal, a scene change detection unit for detecting a scene change of the input video signal, and the input video signal in synchronization with the scene change. And a resolution changing means for changing the resolution of the image.

35. A compression encoding unit for compressing and encoding an input video signal, a still scene detection unit for detecting a still scene of the input video signal, and the input unit when the still scene is detected. An image encoding apparatus comprising: a resolution changing unit configured to change a resolution of a video signal.

36. An input video signal is compression-encoded to an MPE.
An image encoding device comprising: a compression encoding unit for outputting a G stream; and a resolution changing unit for changing a resolution of the input video signal, wherein the compression encoding unit includes GO
GOP for determining P (group of pictures) structure
An image coding apparatus comprising a structure determining unit, wherein a GOP is switched at the same time as the resolution is changed.

37. The image encoding device according to claim 36, wherein the GOP structure determining unit is configured to determine the GOP immediately after the resolution is changed.
An image encoding apparatus, wherein an OP is a closed GOP that does not refer to a previous GOP.

38. The image encoding device according to claim 36, wherein the GOP structure determining unit determines a GOP immediately after the resolution is changed.
An image encoding apparatus characterized in that an OP starts from an I (intra) frame.

39. The image encoding apparatus according to claim 36, wherein the compression encoding unit is a VOB that is a set of the GOPs.
An image encoding device, comprising: a VOB switching unit for switching a video object, wherein the VOB is switched at the same time as the resolution is changed.

40. The image coding apparatus according to claim 39, wherein said VOB switching means does not switch the VOB when the number of VOBs reaches a predetermined maximum number of VOBs. apparatus.

41. The image encoding apparatus according to claim 39, wherein said VOB switching means does not switch the VOB until a specified time has elapsed after switching the VOB once.

42. The image encoding apparatus according to claim 41, wherein the specified time is a total recording time obtained from a total capacity of a recording medium on which the MPEG stream is to be recorded and a target average rate of the MPEG stream. Predetermined V
An image coding apparatus characterized by being obtained by dividing by the maximum number of OBs.

43. The image encoding apparatus according to claim 41, wherein the specified time is a remaining recording time obtained from a free space of a recording medium on which the MPEG stream is to be recorded and a target average rate of the MPEG stream. An image coding apparatus characterized by being obtained by dividing by a value obtained by subtracting the number of already recorded VOBs from a predetermined maximum number of VOBs.

44. The image encoding apparatus according to claim 39, wherein said VOB switching means inserts an end code indicating the end of the VOB into said MPEG stream when said VOB is switched. Device.

45. An MPE which compresses and encodes an input video signal.
Compression encoding means for outputting a G stream; resolution changing means for changing the resolution of the input video signal into a plurality of steps; and resolution used by the resolution changing means according to a given target average rate And a selecting means for selecting the image encoding device.

46. An MPE which compresses and encodes an input video signal.
Compression encoding means for outputting a G stream; resolution changing means for changing the resolution of the input video signal into a plurality of steps; input means for receiving input from an operator; Selecting means for selecting a resolution to be used by the resolution changing means in accordance with the input of the image encoding apparatus.