JP4041245B2

JP4041245B2 - Image encoding device

Info

Publication number: JP4041245B2
Application number: JP13478499A
Authority: JP
Inventors: 隆史遠藤
Original assignee: Kyocera Corp
Current assignee: Kyocera Corp
Priority date: 1999-05-14
Filing date: 1999-05-14
Publication date: 2008-01-30
Anticipated expiration: 2019-05-14
Also published as: JP2000333172A

Description

【０００１】
【発明の属する技術分野】
本発明は画像データを圧縮符号化する画像符号化装置に関し、特に出力符号量に制限がある場合に画質をなるべく良好に保ちながら符号量を制限内に収める画像符号化装置に関する。
【０００２】
【従来の技術】
従来離散コサイン変換を利用した画像の圧縮符号化方法の規格としてＪＰＥＧが知られている。これは画像データをブロックごとに離散コサイン変換し、変換値を量子化する方式である。変換符号化に分類される離散コサイン変換は画像データを空間周波数領域へ変換する公知の変換方式であり、画像データに対しては情報エントロピーを減少させる性能が高いことから広く使用されている。
【０００３】
離散コサイン変換と量子化によって画像は非可逆な歪を受け、情報エントロピーが減少する。離散コサイン変換とＪＰＥＧの量子化は動作上は一体であるので、まとめて変換符号化部とみなせる。これより後さらにエントロピー符号化が施される。エントロピー符号化は出力符号が情報エントロピーに近づくように符号化するものであり、符号化による歪は導入されない。これを実施する部分は係数符号化部である。
【０００４】
係数符号化部では、量子化して得られた係数のうち直流係数は前のブロックの直流係数値との差分を求め差分値をハフマン符号化する。一方交流係数はジグザグ順にスキャンして零の個数と零でない係数の組み合わせとして二次元ハフマン符号化する。離散コサイン変換と量子化はともに交流係数において零の出現頻度を多くするので、零の連続する個数と零でない係数の組み合わせに対してハフマン符号を割り当てている。このため係数符号化部は零でない係数を発見するか、ブロックの終わりに達すると符号を出力する。ＪＰＥＧに従って圧縮された画像は劣化を伴うが、量子化に使用する量子化テーブルを適切に設定することにより主観評価値を良好に保ったまま大幅なデータ圧縮が可能である。
【０００５】
また時々刻々変化する画像を伝送する画像通信を目的とした、ＴＶ会議用の動画像コーデックが従来知られている。動画像コーデックは圧縮率が高く優れているが、専用のＬＳＩが必要な上に高速なクロックで動作させる必要があり、消費電力が大きくなってしまう。携帯通信機器のように消費電力を極力抑える必要がある機器においては、動画像コーデックではなくＪＰＥＧコーデックを用いて画像伝送を行うことができる。ＪＰＥＧの処理方法は動画像コーデックに比べて簡易であり、専用のＬＳＩを使わずにＣＰＵとソフトウエアで実現することも可能である。動画像コーデックの演算量が大きくなる理由の一つは、フレーム間差分を符号化して伝送するためであり、このとき符号化器側ではローカルデコーダを用いて、相手が復号したものと同じ画像を生成しなけばならないことにある。これに対しＪＰＥＧコーデックを用いてフレーム間差分を利用しなければ要求される演算性能を低くし、このため消費電力も抑えられるというメリットがある。
【０００６】
しかしＪＰＥＧ符号化器では動画像コーデックほどの圧縮率が達成できないため、画像サイズ、画質、フレームレートなどを犠牲にして発生符号量を低減させる必要にせまられる。
【０００７】
従来符号量を制御する画像符号化装置として、特許公報第２６４７２７２号の画像符号化における伝送レート制御方式が知られており、これを図７に示してある。これによると単位時間当たりの平均符号量と瞬時符号量がレート計算回路２０５で計算され、ＶＬＣ制御回路２０４に通知され、ＶＬＣ制御回路２０４はそれらのレートが設定値を超えると、ＶＬＣ部２０３を制御して、途中までの係数の符号を送り、強制的に残りの係数の符号を送らないようにする。これによって符号量の制御が実現される。またこの際に量子化器２０２の特性を変更しないため、復号側では符号量制御がされているかどうかに配慮する必要がなく、汎用的で簡易な復号器を用いることができるという長所がある。また係数の零置換は高周波数側の係数から行われるため、符号量が制限された場合でも画質が大きくは損なわれないという長所がある。
【０００８】
また従来、ブロックの画像内での位置に応じて量子化器を制御して、重要な部分を精度よく、重要でない部分の情報を削減して符号化する画像符号化装置が特許公開平４−２４７７８９の符号化装置および復号化装置として知られている。
【０００９】
【発明が解決しようとする課題】
しかしながら従来の特許公報第２６４７２７２号の符号量を制御する画像符号化装置においては、画像における位置を考慮して画質に差を設けた符号量制御ができないという問題がある。画像の中で重要な部分はたいてい中央付近であり、周辺ほど重要性は少ない。画質を犠牲にして符号量を削減する必要が生じた場合には、周辺部の画質劣化は許容できるが、中央部の画質劣化は避けるべきである。このため画像の中央部に位置するブロックには多くの符号量を割り当て、画像の周辺に位置するブロックには少ない符号量を割り当てることが望ましいが、従来の画像符号化装置ではできなかった。この理由は、従来の装置においてブロックの係数を切り捨てる判定基準はブロックの画像における位置を考慮していないからである。
【００１０】
これに対してブロックの位置に応じて量子化器を制御する従来の特許公開平４−２４７７８９の画像符号化装置はこの問題を克服しているが、復号器が符号化器に対応した専用のものでなければならず、広く普及しているＪＰＥＧ復号器を用いることができず、画像の伝送は可能であるが、伝送した画像を他の用途に利用したり、電子メールに添付する際にはＪＰＥＧ符号化器で圧縮しなおさなければならないという問題がある。さらにこの画像符号化装置はブロックの位置に応じて重み付けを変えることはできるが、発生符号量を一定の制限内に収めるという符号量制御の機能がない。つまり個々のブロックの符号量のバランスは制御できるけれども、それらを合計した総符号量を制御できないという問題があった。
【００１１】
【課題を解決するための手段】
上記の問題を解決するために請求項１記載の画像符号化装置は、画像データをブロックごとに離散コサイン変換し量子化して係数を出力する変換符号化手段と、該変換符号化手段が出力する係数を符号化する係数符号化手段とからなる画像符号化装置において、
前記係数符号化手段の符号出力の許容回数を記憶する許容回数記憶手段と、
画像データの画像における位置に応じて前記許容回数記憶手段の記憶する許容回数を増加させる空間的許容回数増加手段と、
画像データの画像における位置に応じて係数の閾値を指定する空間的閾値指定手段と、を備え、
前記係数符号化手段は、前記変換符号化手段が出力する係数の空間周波数を前記空間的閾値指定手段が指定する閾値と比較し、前記係数の空間周波数が前記閾値より低いときは係数を符号化して符号を出力し、前記許容回数記憶手段の記憶する許容回数が正ならばその係数を符号化して符号を出力し、許容回数が正でなければその係数を零として扱うことにより符号を出力せず、
更に、前記係数符号化手段が符号を出力した場合、出力した前記符号の個数だけ前記許容回数記憶手段が記憶している許容回数を減少させるよう制御することを特徴とする。
【００１２】
【発明の実施の形態】
以下、本発明の実施の形態を図面に基づいて説明する。図１は本発明の好適な実施の形態を示すブロック図である。さらに図２はその動作の手順を示すフローチャートである。
【００１３】
本発明の画像符号化装置ではＭＣＵ（Minimum Coded Unit）の画像における位置をもとにして動作をする。ＭＣＵとは複数のブロックを含んだ符号化単位であり、その中には輝度成分のブロックと色差成分のブロックが含まれ、カラー画像としての最小の符号化単位である。画像符号化装置がＭＣＵを処理する順番は一定しているため、ＭＣＵに通し番号を定めて、これをＭＣＵカウンタで数えることによりＭＣＵの位置を特定することができる。各処理部にはＭＣＵカウンタの値を通知されることによりＭＣＵの画像における位置に応じた処理を行うことができる。
【００１４】
図２で処理を開始するとまず処理Ｓ０１においてＭＣＵを指定するＭＣＵカウンタを0 に初期化し、許容回数記憶手段１０５を0 に初期化する。
【００１５】
続く処理Ｓ０２は画像を構成する全てのＭＣＵを処理し終えたかを調べる判定処理である。判定結果がＹｅｓのとき１画像分の処理は終わりである。判定結果がＮｏのとき処理Ｓ０３へ進む。
【００１６】
処理Ｓ０３ではＭＣＵカウンタの値を空間的許容回数増加手段１０４に通知する。空間的許容回数増加手段１０４は許容回数の増分値をＭＣＵの番号に対応させて記憶している。空間的許容回数増加手段１０４は処理Ｓ０３において通知を受けると、この記憶値を用いて許容回数記憶手段１０５の記憶値を増加させ更新する。さらにＭＣＵカウンタの値は空間的閾値指定手段１０３へも通知される。空間的閾値指定手段１０３はそのＭＣＵの画像における位置に応じて閾値を設定する。
【００１７】
続いて処理Ｓ０４において、ＭＣＵカウンタで指定されたＭＣＵの画像データは８×８画素のブロック単位で、変換符号化部１０１に入力される。ひとつのＭＣＵには複数のブロックが存在しており、ブロックごとに順に処理される。変換符号化部１０１は入力されたブロックに対して処理Ｓ０４において二次元離散コサイン変換を実行し、さらに量子化を実行する。ＪＰＥＧに適合する量子化は離散コサイン変換された係数値に量子化ステップの逆数を乗算し、四捨五入によって整数化する処理である。量子化の動作は乗算であるから、乗算を大量に実行する離散コサイン変換と一体化し、両者を変換符号化部１０１としている。
【００１８】
量子化テーブルには、入力される画像の性質や画像符号化装置が出力する画像の画質や符号量に応じて、量子化ステップサイズをあらかじめ設定しておく。公知のように離散コサイン変換によって出力される８×８の係数ブロックは空間周波数領域上の係数であり、各係数のブロックにおける位置と空間周波数とが１対１に対応している。量子化ステップサイズは低周波数側の係数では小さく、高周波数側の係数では大きくすることにより、低周波数側の係数には多くの情報量を割り当て、高周波数側の係数には少ない情報量を割り当てることができる。また量子化テーブルは、データブロックが画像の輝度成分であるか色差成分であるかに応じて切り替えることができ、統計的な性質の異なる色成分ごとに適した量子化ステップを指定することができる。
【００１９】
処理Ｓ０５において、変換符号化部１０１が出力する係数は係数符号化部１０２に入力される。係数符号化部１０２は量子化された８×８の係数ブロックを入力とし、これをエントロピー符号化する。
【００２０】
ＭＣＵに含まれる複数のブロックの処理を終えたら処理Ｓ０６に移り、ＭＣＵカウンタはインクリメントされ、判定処理Ｓ０２へ戻る。
【００２１】
次に、本発明の特徴である空間的な位置を考慮した符号量制御が実行されるのは係数符号化部１０２である。図３はその動作を示すフローチャートであり、これをもとにして説明する。
【００２２】
係数ブロックの入力を受けた係数符号化部１０２は処理Ｓ１１において直流係数を符号化し符号を出力する。直流係数は同じ色成分の前の直流係数との差分を求め、差分値をそれが属するレベル値とそのレベル内での残りビットとに分解し、レベル値をハフマン符号化し、続いて残りビットを添付する。
【００２３】
係数符号化部１０２が交流係数を符号化するときは、低周波数から高周波数への順にブロックをジグザグにスキャンし、値が零の係数の数を数えながら零でない係数を探す。
【００２４】
ここでジグザグスキャンの順序を図４に示す。図４は変換符号化部１０１が離散コサイン変換と量子化を実行した結果である８×８の係数ブロックと、これをジグザグスキャンする順番を示しており、ブロックの左ほど、また上ほど空間周波数が低くなっている。係数符号化部１０２は図４において番号１から番号６３の順に係数をスキャンし零であるか零でないかを調べる。ただし番号０の係数は直流係数なのでジグザグスキャンは行わない。
【００２５】
図３の処理Ｓ１２はジグザグスキャンをはじめる準備処理であり、係数を指定するカウンタを1 に初期化し、零係数の数を数えるカウンタを0 に初期化する。
【００２６】
処理Ｓ１３はブロックのジグザグスキャンが終了したかどうかを調べる判定処理である。これは係数を指定するカウンタが６４未満であるかどうかを調べ、結果がＹｅｓなら処理Ｓ１４へ進み、結果がＮｏなら処理Ｓ１９へ進む。
【００２７】
処理Ｓ１４はカウンタが指定する係数が０であるかを調べる判定処理である。結果がＹｅｓであればその係数は０であるので処理Ｓ２１へ進み、Ｎｏであれば処理Ｓ１５へ進む。
【００２８】
処理Ｓ１５はその係数の空間周波数が空間的閾値指定手段１０３が指定する閾値と比較して低周波数側であるかどうかを調べる判定処理である。処理Ｓ１５ではその係数の位置を指定するカウンタ値を空間的閾値指定手段１０３の指定する閾値と比較し、カウンタ値が閾値未満であるかどうかを調べる。結果がＹｅｓの場合には処理Ｓ１６へ進み、Ｎｏの場合は処理Ｓ１８へ進む。空間的閾値指定手段１０３はＭＣＵカウンタ値に基づいて閾値を変化させる。ＭＣＵカウンタが画像の中央付近を指定している場合は画質を重視するために閾値は６４に設定される。閾値が６４に指定されると零でない係数は全て閾値未満と判定され処理Ｓ１６において符号化され符号が出力される。これに対してＭＣＵカウンタが画像の周辺を指定している場合には閾値は１５に設定される。これによって、係数カウンタが１から１４の間に零でない係数が存在した場合に、零に置換されず処理Ｓ１６で符号化されることが保証される。
【００２９】
このように画質を重視しない周辺のＭＣＵであっても低周波数側の係数を符号量制御の対象から外すのは、低周波数側の係数を零置換すると復号画像における影響が大きいためである。これに対して高周波数側の係数は零置換しても影響が比較的小さいため符号量制御を実行する際には係数を削除する。また空間的閾値指定手段１０３が通知されたＭＣＵの画像における位置に応じて閾値を変化させるのは、画像中央付近に位置する画像データブロックに対しては符号量制御を無効にして画質を保証し、画像周辺に位置するブロックに対しては画質の低下を許して符号量制御を有効にするためである。
【００３０】
処理Ｓ１６において係数符号化部１０２は符号化を実行して符号を出力し、許容回数記憶手段１０５を出力した符号の個数だけデクリメントする。符号化は次のように行う。零係数の数が１６以上の間はＺＲＬ符号を出力し零係数の数から１６を引く動作を繰り返す。ＺＲＬとはZero Run Length のことで、符号シンボルの一つとして二次元ハフマン符号のテーブルの中に存在するものである。零の係数の数が１６未満になったならば、符号化すべき零でない係数をその係数が属するレベル値とそのレベル内での残りビットとに分解し、零の係数の数と零でない係数のレベル値との組を二次元ハフマン符号化し、続いて残りビットを添付する。ＺＲＬ符号と二次元ハフマン符号との双方とも符号出力の度に許容回数記憶手段１０５を1 個デクリメントする。符号を出力した後、処理Ｓ１６においてさらに零係数のカウンタを0 に初期化する。この後処理Ｓ１７へ進む。
【００３１】
処理Ｓ１７では係数の位置を指定するカウンタをインクリメントして判定処理Ｓ１３へ戻る。
【００３２】
全ての交流係数のジグザグスキャンを終えた場合、判定処理Ｓ１３から処理Ｓ１９に至る。処理Ｓ１９は零係数の数が0 個であるかを調べる判定処理である。判定結果がＹｅｓであればそのブロックに対する符号化処理は終了する。結果がＮｏであればＥＯＢ符号を出力し、許容回数記憶手段１０５を1 個デクリメントする。ＥＯＢとは、End Of Blockのことで、符号シンボルの一つとして二次元ハフマン符号のテーブルの中に存在するものである。
【００３３】
係数符号化部１０２が二次元ハフマン符号を出力するのは、零でない交流係数を符号化する場合、ＺＲＬを出力する場合、ＥＯＢを出力する場合、のいずれかである。ブロックの最後に交流係数が出現することはまれであるので、多くの場合ブロックの最後にはＥＯＢ符号が出力される。本発明では係数符号化部の符号出力の回数と符号量との相関が高いことを利用しており、符号出力の回数を測定し、これが許容回数内に収まるように符号出力の回数を制御するものである。本実施の形態では符号出力の回数を数える際にＺＲＬやＥＯＢを含めることにするが、これらを含めずに数えるような画像符号化装置を作成してもよい。
【００３４】
本発明において符号量制御が働いて実際に削減動作が実行されるのは処理Ｓ１８である。処理Ｓ１８は許容回数記憶手段１０５の記憶する値が正であるか調べる判定処理である。判定結果がＹｅｓであれば処理Ｓ１６に進んで符号を出力する。Ｎｏの場合は処理Ｓ２１へ進む。処理Ｓ２１は値が零の係数の数を数えるカウンタを1 個インクリメントする処理であり、その後処理Ｓ１７へ進む。処理Ｓ１８から処理Ｓ２１へ進むときに、量子化時点では零でなかった係数が零として処理されることになるのである。
【００３５】
次に画像における位置に応じて画質を変えながら符号量制御を実現する方法について説明する。図５には画像１１０と画像１１０を構成するＭＣＵが示されている。ＭＣＵのサイズは横１６画素、縦１６画素であり、輝度成分のブロックが4 個と色差成分のブロックが２個含まれている。画像１１０の横方向には６個のＭＣＵ、縦方向には７個のＭＣＵが並び、計４２個のＭＣＵで構成されている。横のインデクスと縦のインデクスの組でＭＣＵを指定することにすると、（0,0 ）のＭＣＵはＭＣＵ１１１であり、（5,0 ）のＭＣＵはＭＣＵ１１６である。ＭＣＵカウンタとしては、縦のインデクス×6 ＋横のインデクスを用いることにより、一つのカウンタ値でＭＣＵの画像における位置を指定することができる。図５において画像１１０の中央付近に位置するＭＣＵ１１８などは実線で描かれ、画像１１０の周辺に位置するＭＣＵ１１１からＭＣＵ１１７などは点線で示されているが、これは点線で示されたＭＣＵは周辺に位置するため重要度が低く、実線で示されたＭＣＵは中央に位置するため重要度が高く、画質の優先度に差を設けていることを表している。
【００３６】
本発明の符号化装置は空間的許容回数増加手段１０４の動作によって、空間的な符号量配分を制御しており、図２の処理Ｓ０３において許容回数記憶手段１０５の記憶値をいくつ増加させるかを対象とする画像サイズと符号量に合わせて設計しなければならない。
【００３７】
まず二次元ハフマン符号を画像全体で何個出力してよいか、全体の許容回数を予め決める。符号化結果において画像の符号量と符号化された二次元ハフマン符号の数とは強い相関があるため、対象とする標準的な画像について目標符号量内で符号化できる量子化テーブルを試行錯誤によって求め、このとき画像全体で出力される二次元ハフマン符号の数を測定し、これをＮi とする。
【００３８】
次に求めたＮi を総ＭＣＵ数で割り、ＭＣＵの1 個あたりの二次元ハフマン符号の数を求めこれをＮm とする。ここではＭＣＵの1 個あたりの二次元ハフマン符号出力Ｎm を４０個に設定する。これはＭＣＵあたりの目標符号量を約１７ bitに設定し、量子化テーブルを選択した結果の数字である。
【００３９】
ブロックの画像における空間的位置に応じて符号量の配分を行うために、空間的許容回数増加手段１０４はＭＣＵの画像における位置に応じて許容回数増加値を記憶しており、ＭＣＵの画像における位置が入力されると許容回数増加値を読み出して許容回数記憶手段１０５の記憶値に加算し更新する。
【００４０】
各ＭＣＵに対する許容回数の増加値を図６に示す。図６は画像１１０を構成する各ＭＣＵの位置に許容回数の増加値を示したもので、空間的許容回数増加手段１０４はこの値をテーブルに記憶しておく。
【００４１】
まずＭＣＵ１１１に対して符号化を行うが、ＭＣＵの位置は（0,0 ）であり、この位置を表すＭＣＵカウンタの値が図２の処理Ｓ０３において空間的許容回数増加手段１０４へ通知され、許容回数増加手段１０４はテーブルの（0,0 ）の位置を参照し、許容回数記憶手段１０５の記憶値を４０増加させる。これはＮm が４０であることからＭＣＵ１個分の割り当てを行うことを意味し、４０個の符号を出力するまでは符号量制御が働かない。係数符号化部１０２はＭＣＵ１１１の係数を符号化し、二次元ハフマン符号を出力するたびに、許容回数記憶手段１０５の記憶値を1 だけ減少させる。
【００４２】
ＭＣＵ１１１は周辺に位置しており、空間的閾値指定手段１０３は閾値を小さく設定しているので、出力される符号が多い場合には許容回数記憶手段１０５の記憶値が正でなくなり、係数符号化部１０２は閾値以上の高周波数側の係数を零とみなして符号出力を制限するようになる。
【００４３】
ＭＣＵ１１１の符号化が終わると次に（0,1 ）の位置にあるＭＣＵ１１２を符号化する前に、空間的許容回数増加手段１０４は許容回数記憶手段１０５の記憶値を２４０増加させる。これはＭＣＵの６個分すなわち、６Ｎm 個の許容回数であり、（0,1 ）の位置にあるＭＣＵ１１２から、（1,0 ）の位置にあるＭＣＵ１１７までを符号化するための割り当てである。以後ＭＣＵ１１３からＭＣＵ１１７までの符号化に際しての許容回数増加値は０に設定する。これらの増加値は空間的許容回数増加手段１０４の中のテーブルに記憶される。
【００４４】
ＭＣＵ１１２から順に符号化していくと、先に符号化されるＭＣＵには許容回数が多くあるため、符号出力制限を受けることなく全ての係数が符号化される。このため先に符号化されるＭＣＵが多くの係数を出力すると、後で符号化されるＭＣＵでは許容回数記憶手段１０５の値が正でなくなり高周波数側係数の出力が制限されやすくなる。とくに位置（1,0 ）のＭＣＵ１１７と、位置（0,6 ）のＭＣＵ１１６において出力制限が発生しやすいため、画像の周辺の画質が低下する。一方位置（0,1 ）のＭＣＵ１１２やその後続のＭＣＵ１１３などにおいては、許容回数記憶手段１０５の値が豊富にあるため、符号の出力制限は発生せず、その画質は量子化テーブルで量子化したとおりの画質になる。
【００４５】
同様に図６に示すように、j を１から５までの整数として位置（1,j ）におけるＭＣＵに対する許容回数増加値を以降のＭＣＵの分を含めて２４０に設定し、他のＭＣＵに対する許容回数増加値を０に設定する。これによって位置（1,j ）のＭＣＵは常に良好な画質が保証され、位置（0,j+1 ）や位置（5,j ）にあるＭＣＵの画質は低下しやすくなる。j が６のときも同様に図６に示すように位置（1,6 ）にあるＭＣＵを符号化する前に、空間的許容回数増加手段１０４は許容回数記憶手段１０５の記憶値を２００増加させる。これは５個分すなわち５Ｎm 個の許容数である。
【００４６】
画像中央付近に位置するＭＣＵ、例えば図５のＭＣＵ１１８については、画質劣化を避けるために係数符号化部１０２による符号の出力制限を行わない。これは空間的閾値指定手段１０３が指定する閾値を画像データの画像における位置に応じて変化させることによって実現する。８×８の係数ブロックは図４に示す順番でジグザグスキャンされるので、係数の空間周波数は横と縦のインデクスの和が一定になるようなバンドに区分することができる。よって空間的閾値指定手段１０３は図４のジグザグスキャンの順番における閾値を通知し、これによって係数符号化部１０２は係数の空間周波数を判定する。
【００４７】
そこで図５で点線で示される周辺のＭＣＵにおいては、空間的閾値指定手段１０３は閾値として１５を通知する。係数の空間周波数が１５未満の係数とは、図４において番号が１から１４までの１４個の交流係数のことであり、係数の横と縦のインデクスの和が５未満であれば低周波数側と判定していることになる。係数符号化部１０２は図３の判定処理Ｓ１４で零でない係数を見つけると、判定処理Ｓ１５でその係数の位置を表すカウンタが閾値未満かどうか調べ、結果がＹｅｓであれば符号を出力し、Ｎｏであれば判定処理Ｓ１８において許容回数記憶手段１０５の値が正であるかを調べ、結果がＹｅｓであれば処理Ｓ１７で符号を出力し、Ｎｏであればその係数の二次元ハフマン符号の出力を制限する。
【００４８】
一方図５で実線で示されるＭＣＵが処理される際、図２の処理Ｓ０３でＭＣＵカウンタの値が空間的閾値指定手段１０３に通知されると閾値は６４に設定される。これにより全ての交流係数は判定処理Ｓ１５において閾値未満と判定されることになり、係数符号化部１０２は符号の出力制限を実行しない。つまり判定処理Ｓ１４において見つけた零でない係数全てが処理Ｓ１６において符号化され、許容回数記憶手段１０５の記憶値は符号の出力のたびにデクリメントされる。これによって画像中央付近の画質を良好に保ちながら、画像周辺部の画質と符号量の制御を実現する画像符号化装置が得られる。
【００４９】
なお本実施の形態では空間的許容回数増加手段１０４はテーブルに許容回数の増加値を記憶しておき、テーブルを参照して許容回数記憶手段１０５の値を更新しているが、他の方法を用いることもできる。例えばＭＣＵのアドレスをもとに演算によって増加値を求めて許容回数記憶手段１０５の値を更新してもよい。またこのプログラムには、過去の符号の発生状況を反映させて、次のＭＣＵにおいて必要となる許容回数増加値を予測し、許容回数の不足による画質劣化を近傍のＭＣＵに分散させるような適応的な制御を実行することもできる。
【００５０】
さらにまた本実施の形態ではＭＣＵを単位として、ＭＣＵの符号化の前に許容回数記憶手段１０５の値を更新するように、空間的許容回数増加手段１０４を動作させているが、ブロックの位置を入力し、より細かくブロック単位で空間的許容回数増加手段１０４を動作させることもできる。
【００５１】
【発明の効果】
本発明の画像符号化装置によれば次のような効果が得られる。一定の符号量内に収めるために符号量制御を実行しながら、画像における位置に応じて画質を制御することができる。とくに画質劣化が画像周辺部において発生し、画像中央部においては一定の画質が保証され、画像において重要な中央部の情報を劣化させることがない。また出力される符号は国際標準のＪＰＥＧ復号器でデコードすることができ、専用の復号器を必要とせず、普及している様々なＪＰＥＧ復号器を利用することができ、システムの自由度を大きくしかつ復号器の製作コストを削減できる。さらに出力された符号はフォーマット変換をすることなく様々な用途に再利用が可能である。
【図面の簡単な説明】
【図１】本発明による画像符号化装置のブロック図。
【図２】本発明の画像符号化装置の動作を表すフローチャート。
【図３】本発明における係数符号化部がブロックを符号化する動作を表すフローチャート。
【図４】８ｘ８の係数ブロックのジグザグスキャンの順番を示す図。
【図５】画像を構成するＭＣＵのうち周辺部に位置するＭＣＵと中央部に位置するＭＣＵを示す図。
【図６】各ＭＣＵを符号化する前に空間的許容回数増加手段が増加させる許容回数の増加値をＭＣＵの位置に対応させて示した図。
【図７】従来の画像符号化装置のブロック図。
【符号の説明】
１０１：変換符号化部
１０２：係数符号化部
１０３：空間的閾値指定手段
１０４：空間的許容回数増加手段
１０５：許容回数記憶手段
１１０：画像
１１１〜１１８：ＭＣＵ
２０１：ＤＣＴ演算部
２０２：量子化部
２０３：ＶＬＣ部
２０４：ＶＬＣ制御回路
２０５：レート計算回路[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an image encoding apparatus that compresses and encodes image data, and more particularly to an image encoding apparatus that keeps the code amount within the limit while keeping the image quality as good as possible when the output code amount is limited.
[0002]
[Prior art]
Conventionally, JPEG is known as a standard of an image compression encoding method using discrete cosine transform. In this method, image data is subjected to discrete cosine transform for each block, and the transform value is quantized. Discrete cosine transform classified as transform coding is a well-known transform method for transforming image data into the spatial frequency domain, and is widely used for image data because of its high ability to reduce information entropy.
[0003]
Discrete cosine transform and quantization causes the image to undergo irreversible distortion and reduce information entropy. Since the discrete cosine transform and JPEG quantization are integrated in operation, they can be collectively regarded as a transform coding unit. After this, entropy coding is further performed. Entropy encoding is performed so that the output code approaches information entropy, and distortion due to encoding is not introduced. The part that implements this is a coefficient encoding unit.
[0004]
In the coefficient encoding unit, among the coefficients obtained by quantization, the DC coefficient is obtained as a difference from the DC coefficient value of the previous block, and the difference value is Huffman encoded. On the other hand, AC coefficients are scanned in a zigzag order and are two-dimensionally Huffman encoded as a combination of the number of zeros and non-zero coefficients. Since both the discrete cosine transform and the quantization increase the appearance frequency of zero in the AC coefficient, a Huffman code is assigned to a combination of a continuous number of zeros and a non-zero coefficient. Therefore, the coefficient encoding unit finds a non-zero coefficient or outputs a code when the end of the block is reached. Although an image compressed according to JPEG is accompanied by deterioration, by appropriately setting a quantization table used for quantization, significant data compression is possible while maintaining a good subjective evaluation value.
[0005]
Conventionally, a video conference codec for video conferencing is known for the purpose of image communication for transmitting images that change from moment to moment. A moving image codec has a high compression rate and is excellent, but requires a dedicated LSI and must be operated with a high-speed clock, resulting in an increase in power consumption. In a device that needs to reduce power consumption as much as possible, such as a portable communication device, image transmission can be performed using a JPEG codec instead of a moving image codec. The JPEG processing method is simpler than a moving image codec, and can be realized by a CPU and software without using a dedicated LSI. One of the reasons why the amount of calculation of the moving image codec is large is that the difference between frames is encoded and transmitted. At this time, the encoder uses a local decoder to display the same image as that decoded by the other party. It must be generated. On the other hand, if the JPEG codec is not used and the inter-frame difference is not used, the required calculation performance is lowered, and there is an advantage that power consumption can be suppressed.
[0006]
However, since the JPEG encoder cannot achieve a compression rate as high as that of a moving image codec, it is necessary to reduce the amount of generated code at the expense of image size, image quality, frame rate, and the like.
[0007]
As a conventional image coding apparatus for controlling the code amount, a transmission rate control method in image coding of Japanese Patent No. 2647272 is known, which is shown in FIG. According to this, the average code amount and the instantaneous code amount per unit time are calculated by the rate calculation circuit 205 and notified to the VLC control circuit 204. When these rates exceed the set values, the VLC control circuit 204 The code of the coefficients up to the middle is sent, and the sign of the remaining coefficients is not forcibly sent. Thereby, control of the code amount is realized. At this time, since the characteristics of the quantizer 202 are not changed, there is no need to consider whether or not the code amount is controlled on the decoding side, and there is an advantage that a general-purpose and simple decoder can be used. In addition, since the coefficient zero replacement is performed from a coefficient on the high frequency side, there is an advantage that the image quality is not greatly impaired even when the code amount is limited.
[0008]
Conventionally, an image coding apparatus that performs coding by controlling a quantizer in accordance with the position of a block in an image to reduce important portions with high accuracy and reduce information on unimportant portions has been disclosed in Japanese Patent Application Laid-Open No. Hei 4- It is known as a 247789 encoder and decoder.
[0009]
[Problems to be solved by the invention]
However, in the conventional image coding apparatus that controls the code amount of Japanese Patent Publication No. 2647272, there is a problem that the code amount control with a difference in image quality cannot be performed in consideration of the position in the image. The important part of the image is usually near the center and less important as the periphery. When it is necessary to reduce the amount of code at the expense of image quality, the image quality deterioration in the peripheral portion can be tolerated, but the image quality deterioration in the central portion should be avoided. For this reason, it is desirable to allocate a large amount of code to the block located at the center of the image and assign a small amount of code to the block located around the image, but this has not been possible with the conventional image coding apparatus. The reason for this is that in the conventional apparatus, the criterion for truncating the block coefficient does not consider the position of the block in the image.
[0010]
On the other hand, the conventional image coding apparatus disclosed in Japanese Patent Laid-Open No. 4-247789 which controls the quantizer according to the position of the block overcomes this problem, but the decoder has a dedicated decoder corresponding to the encoder. The JPEG decoder that is widely used cannot be used and the image can be transmitted. However, when the transmitted image is used for other purposes or attached to an e-mail. Has the problem that it must be recompressed with a JPEG encoder. Further, although this image coding apparatus can change the weighting according to the block position, it does not have a code amount control function for keeping the generated code amount within a certain limit. That is, although the balance of the code amount of each block can be controlled, there is a problem that the total code amount obtained by adding them can not be controlled.
[0011]
[Means for Solving the Problems]
In order to solve the above problem, an image coding apparatus according to claim 1 is a transform coding means for discrete cosine transforming and quantizing image data for each block and outputting coefficients, and the transform coding means outputs In an image encoding device comprising coefficient encoding means for encoding a coefficient,
An allowable number storage means for storing the allowable number of code outputs of the coefficient encoding means;
Spatial allowable number increase means for increasing the allowable number of times stored in the allowable number storage means according to the position of the image data in the image;
Spatial threshold value specifying means for specifying a threshold value of the coefficient according to the position of the image data in the image,
The coefficient encoding means compares the spatial frequency of the coefficient output from the transform encoding means with the threshold specified by the spatial threshold specifying means, and encodes the coefficient when the spatial frequency of the coefficient is lower than the threshold. If the allowable number stored in the allowable number storage means is positive, the coefficient is encoded and the code is output.If the allowable number is not positive, the coefficient is output as zero. Without
Further, when the coefficient encoding means outputs a code, the allowable number of times stored in the allowable number storage means is controlled to be decreased by the number of the outputted codes.
[0012]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a preferred embodiment of the present invention. FIG. 2 is a flowchart showing the procedure of the operation.
[0013]
The image coding apparatus of the present invention operates based on the position of an MCU (Minimum Coded Unit) image. The MCU is an encoding unit including a plurality of blocks, and includes a luminance component block and a color difference component block, and is the minimum encoding unit as a color image. Since the order in which the image coding apparatus processes the MCU is fixed, the serial number is determined for the MCU, and the MCU position can be specified by counting this with the MCU counter. Each processing unit is notified of the value of the MCU counter, so that processing according to the position of the MCU in the image can be performed.
[0014]
When the processing is started in FIG. 2, first, an MCU counter for designating an MCU is initialized to 0 and an allowable number storage means 105 is initialized to 0 in processing S01.
[0015]
A subsequent process S02 is a determination process for checking whether all the MCUs constituting the image have been processed. When the determination result is Yes, the processing for one image is completed. When the determination result is No, the process proceeds to step S03.
[0016]
In step S03, the value of the MCU counter is notified to the spatial allowable number increase means 104. Spatial allowable number increase means 104 stores the increment value of the allowable number corresponding to the MCU number. When receiving the notification in step S03, the spatial allowable number increase means 104 increases and updates the stored value of the allowable number storage means 105 using this stored value. Further, the value of the MCU counter is also notified to the spatial threshold value specifying means 103. The spatial threshold designating unit 103 sets a threshold according to the position of the MCU in the image.
[0017]
Subsequently, in process S04, the image data of the MCU specified by the MCU counter is input to the transform coding unit 101 in units of 8 × 8 pixel blocks. A single MCU has a plurality of blocks, and the blocks are processed in order. In step S04, the transform coding unit 101 performs two-dimensional discrete cosine transform on the input block, and further performs quantization. Quantization conforming to JPEG is a process of multiplying a coefficient value obtained by discrete cosine transform by an inverse number of a quantization step and rounding to an integer. Since the quantization operation is multiplication, it is integrated with discrete cosine transform that executes a large amount of multiplication, and both are used as transform coding section 101.
[0018]
In the quantization table, a quantization step size is set in advance according to the properties of the input image and the image quality and code amount of the image output from the image encoding apparatus. As is well known, an 8 × 8 coefficient block output by discrete cosine transform is a coefficient in the spatial frequency domain, and the position of each coefficient in the block and the spatial frequency have a one-to-one correspondence. The quantization step size is small for the low frequency side coefficient and large for the high frequency side coefficient, so that a large amount of information is allocated to the low frequency side coefficient and a small amount of information is allocated to the high frequency side coefficient. be able to. The quantization table can be switched depending on whether the data block is a luminance component or a color difference component of the image, and a quantization step suitable for each color component having a different statistical property can be designated. .
[0019]
In the process S05, the coefficient output from the transform encoding unit 101 is input to the coefficient encoding unit 102. The coefficient encoding unit 102 receives the quantized 8 × 8 coefficient block and performs entropy encoding on this.
[0020]
When processing of a plurality of blocks included in the MCU is completed, the process proceeds to process S06, the MCU counter is incremented, and the process returns to the determination process S02.
[0021]
Next, it is the coefficient encoding unit 102 that performs code amount control in consideration of the spatial position, which is a feature of the present invention. FIG. 3 is a flowchart showing the operation, which will be described based on this.
[0022]
In response to the input of the coefficient block, the coefficient encoding unit 102 encodes the DC coefficient and outputs a code in step S11. For the DC coefficient, the difference from the previous DC coefficient of the same color component is obtained, the difference value is decomposed into the level value to which it belongs and the remaining bits within that level, the level value is Huffman encoded, and then the remaining bits are Attach.
[0023]
When the coefficient encoding unit 102 encodes the AC coefficient, the blocks are scanned zigzag in order from the low frequency to the high frequency, and the coefficient that is not zero is searched for while counting the number of coefficients having the value zero.
[0024]
Here, the order of zigzag scanning is shown in FIG. FIG. 4 shows an 8 × 8 coefficient block that is a result of the transform coding unit 101 performing discrete cosine transform and quantization, and the order in which the block is zigzag scanned. Is low. The coefficient encoding unit 102 scans the coefficients in the order of number 1 to number 63 in FIG. 4 to check whether they are zero or not. However, since the coefficient of number 0 is a DC coefficient, zigzag scanning is not performed.
[0025]
Process S12 in FIG. 3 is a preparatory process for starting a zigzag scan. A counter for specifying a coefficient is initialized to 1, and a counter for counting the number of zero coefficients is initialized to 0.
[0026]
The process S13 is a determination process for checking whether the zigzag scan of the block has been completed. This checks whether the counter specifying the coefficient is less than 64. If the result is Yes, the process proceeds to step S14. If the result is No, the process proceeds to step S19.
[0027]
The process S14 is a determination process for checking whether the coefficient designated by the counter is 0. If the result is Yes, the coefficient is 0 and the process proceeds to step S21. If the result is No, the process proceeds to step S15.
[0028]
Process S15 is a determination process for checking whether the spatial frequency of the coefficient is on the low frequency side as compared with the threshold value specified by the spatial threshold value specifying means 103. In process S15, the counter value specifying the position of the coefficient is compared with the threshold value specified by the spatial threshold value specifying means 103, and it is checked whether the counter value is less than the threshold value. If the result is Yes, the process proceeds to step S16, and if the result is No, the process proceeds to step S18. The spatial threshold designating unit 103 changes the threshold based on the MCU counter value. When the MCU counter designates the vicinity of the center of the image, the threshold value is set to 64 in order to emphasize the image quality. When the threshold value is designated as 64, all non-zero coefficients are determined to be less than the threshold value, and are encoded and output in step S16. On the other hand, when the MCU counter designates the periphery of the image, the threshold is set to 15. As a result, when there is a non-zero coefficient between 1 and 14 in the coefficient counter, it is guaranteed that the coefficient counter is not replaced with zero and is encoded in step S16.
[0029]
The reason why the low frequency side coefficient is excluded from the code amount control target even in the peripheral MCU that does not place importance on the image quality is that if the low frequency side coefficient is zero-substituted, the influence on the decoded image is large. On the other hand, the coefficient on the high frequency side has a relatively small effect even if zero substitution is performed, and therefore, the coefficient is deleted when executing the code amount control. In addition, the threshold value is changed according to the position of the MCU image notified by the spatial threshold value designating means 103 for the image data block located near the center of the image to invalidate the code amount control and guarantee the image quality. This is because the code amount control is enabled for the blocks located in the periphery of the image while allowing the image quality to deteriorate.
[0030]
In step S16, the coefficient encoding unit 102 executes encoding and outputs a code, and decrements the number of codes output from the allowable number storage means 105. Encoding is performed as follows. While the number of zero coefficients is 16 or more, the ZRL code is output and the operation of subtracting 16 from the number of zero coefficients is repeated. ZRL is Zero Run Length, which exists as one of the code symbols in the two-dimensional Huffman code table. If the number of zero coefficients is less than 16, the non-zero coefficient to be encoded is decomposed into the level value to which the coefficient belongs and the remaining bits within that level, and the number of zero coefficients and the non-zero coefficient The pair with the level value is two-dimensionally Huffman encoded, and then the remaining bits are attached. Both the ZRL code and the two-dimensional Huffman code decrement the allowable number storage means 105 for each code output. After outputting the code, a zero coefficient counter is further initialized to 0 in step S16. Thereafter, the process proceeds to S17.
[0031]
In the process S17, the counter for designating the coefficient position is incremented and the process returns to the determination process S13.
[0032]
When the zigzag scan of all the AC coefficients is completed, the process proceeds from the determination process S13 to the process S19. Process S19 is a determination process for checking whether the number of zero coefficients is zero. If the determination result is Yes, the encoding process for the block ends. If the result is No, an EOB code is output and the allowable number storage means 105 is decremented by one. The EOB is an end of block, and is present in the two-dimensional Huffman code table as one of the code symbols.
[0033]
The coefficient encoding unit 102 outputs the two-dimensional Huffman code when encoding a non-zero AC coefficient, when outputting ZRL, or when outputting EOB. Since an AC coefficient rarely appears at the end of a block, an EOB code is output at the end of the block in many cases. In the present invention, the fact that the correlation between the number of code outputs of the coefficient coding unit and the code amount is high is used, the number of code outputs is measured, and the number of code outputs is controlled so that this is within the allowable number. Is. In the present embodiment, ZRL and EOB are included when counting the number of code outputs, but an image encoding device that counts without including these may be created.
[0034]
In the present invention, the code amount control works and the actual reduction operation is executed in step S18. Process S18 is a determination process for checking whether the value stored in the allowable number storage means 105 is positive. If a determination result is Yes, it will progress to process S16 and will output a code | symbol. In No, it progresses to process S21. Process S21 is a process of incrementing the counter for counting the number of zero-valued coefficients by one, and then proceeds to process S17. When the process proceeds from the process S18 to the process S21, the coefficient that was not zero at the time of quantization is processed as zero.
[0035]
Next, a method for realizing the code amount control while changing the image quality according to the position in the image will be described. FIG. 5 shows an image 110 and MCUs constituting the image 110. The size of the MCU is 16 pixels horizontally and 16 pixels vertically, and includes four luminance component blocks and two color difference component blocks. Six MCUs are arranged in the horizontal direction of the image 110, and seven MCUs are arranged in the vertical direction. If the MCU is specified by a set of a horizontal index and a vertical index, the MCU of (0, 0) is the MCU 111, and the MCU of (5, 0) is the MCU 116. By using a vertical index × 6 + horizontal index as the MCU counter, the position of the MCU in the image can be specified by one counter value. In FIG. 5, the MCU 118 and the like located near the center of the image 110 are drawn by solid lines, and the MCU 111 to MCU 117 and the like located around the image 110 are indicated by dotted lines. The importance is low because it is located, and the MCU indicated by the solid line is high in importance because it is located in the center, indicating that there is a difference in the priority of image quality.
[0036]
The encoding apparatus of the present invention controls the spatial code amount distribution by the operation of the spatial allowable number increase means 104, and how many values to be stored in the allowable number storage means 105 are increased in step S03 in FIG. It must be designed according to the target image size and code amount.
[0037]
First, the total allowable number is determined in advance how many two-dimensional Huffman codes can be output for the entire image. Since there is a strong correlation between the code amount of the image and the number of encoded 2D Huffman codes in the encoding result, a quantization table that can be encoded within the target code amount for the target standard image is obtained by trial and error. At this time, the number of two-dimensional Huffman codes output in the entire image is measured, and this is defined as Ni.
[0038]
Next, the obtained Ni is divided by the total number of MCUs, and the number of two-dimensional Huffman codes per MCU is obtained, and this is defined as Nm. Here, 40 two-dimensional Huffman code outputs Nm per MCU are set. This is a number obtained as a result of selecting a quantization table by setting the target code amount per MCU to about 17 bits.
[0039]
In order to distribute the code amount according to the spatial position in the image of the block, the spatial allowable number increase means 104 stores the allowable number increase value according to the position in the MCU image, and the position in the MCU image. Is input, the allowable number increase value is read and added to the stored value of the allowable number storage means 105 to be updated.
[0040]
FIG. 6 shows an increase value of the allowable number for each MCU. FIG. 6 shows an increase value of the allowable number at the position of each MCU constituting the image 110, and the spatial allowable number increase means 104 stores this value in a table.
[0041]
First, encoding is performed on the MCU 111, and the position of the MCU is (0, 0), and the value of the MCU counter representing this position is notified to the spatial allowable number increasing means 104 in step S03 in FIG. The number increase means 104 refers to the position of (0,0) in the table and increases the stored value of the allowable number storage means 105 by 40. This means that since Nm is 40, one MCU is allocated, and code amount control does not work until 40 codes are output. The coefficient encoding unit 102 encodes the coefficient of the MCU 111 and decreases the value stored in the allowable number storage unit 105 by 1 each time a two-dimensional Huffman code is output.
[0042]
Since the MCU 111 is located in the vicinity, and the spatial threshold value specifying unit 103 sets a small threshold value, when the number of codes to be output is large, the stored value of the allowable number storage unit 105 is not positive, and the coefficient encoding is performed. The unit 102 regards the high frequency side coefficient equal to or higher than the threshold as zero and limits the code output.
[0043]
When encoding of the MCU 111 is completed, the spatial allowable number increasing means 104 increases the stored value of the allowable number storage means 105 by 240 before encoding the MCU 112 at the position (0, 1) next. This is an allowable number of 6 MCUs, that is, 6Nm, and is an allocation for encoding from the MCU 112 at the (0,1) position to the MCU 117 at the (1,0) position. Thereafter, the allowable number increase value at the time of encoding from MCU 113 to MCU 117 is set to zero. These increased values are stored in a table in the spatial allowable number increasing means 104.
[0044]
If encoding is performed in order from the MCU 112, since the number of allowable times is large in the previously encoded MCU, all coefficients are encoded without being limited in code output. For this reason, if the MCU that is encoded earlier outputs many coefficients, the value of the allowable number storage means 105 is not positive in the MCU that is encoded later, and the output of the high frequency side coefficient is likely to be limited. In particular, output restriction is likely to occur in the MCU 117 at the position (1,0) and the MCU 116 at the position (0,6), so that the image quality around the image is degraded. On the other hand, in the MCU 112 at the position (0, 1), the subsequent MCU 113, and the like, there are abundant values in the allowable number storage means 105, so there is no code output limitation, and the image quality is quantized by the quantization table. The image quality will be as shown.
[0045]
Similarly, as shown in FIG. 6, j is an integer from 1 to 5, and the allowable number increase value for the MCU at the position (1, j) is set to 240 including the subsequent MCUs, and the allowable values for other MCUs are set. Set the increment value to 0. As a result, the MCU at the position (1, j) always guarantees good image quality, and the image quality of the MCU at the position (0, j + 1) and the position (5, j) tends to deteriorate. Similarly, when j is 6, before encoding the MCU located at the position (1,6) as shown in FIG. 6, the spatial allowable number increase means 104 increases the stored value of the allowable number storage means 105 by 200. . This is an allowable number of 5 or 5Nm.
[0046]
For the MCU located near the center of the image, for example, the MCU 118 in FIG. 5, the code output by the coefficient encoding unit 102 is not limited in order to avoid image quality degradation. This is realized by changing the threshold specified by the spatial threshold specifying means 103 in accordance with the position of the image data in the image. Since the 8 × 8 coefficient block is zigzag scanned in the order shown in FIG. 4, the spatial frequency of the coefficient can be divided into bands in which the sum of the horizontal and vertical indexes is constant. Therefore, the spatial threshold designation unit 103 notifies the threshold in the zigzag scan order of FIG. 4, and the coefficient encoding unit 102 thereby determines the spatial frequency of the coefficient.
[0047]
Therefore, in the peripheral MCU indicated by the dotted line in FIG. 5, the spatial threshold value specifying means 103 notifies 15 as the threshold value. The coefficient whose spatial frequency is less than 15 is the 14 AC coefficients numbered 1 to 14 in FIG. 4. If the sum of the horizontal and vertical indices of the coefficient is less than 5, the low frequency side It will be judged. When the coefficient encoding unit 102 finds a non-zero coefficient in the determination process S14 of FIG. 3, the coefficient encoding unit 102 checks whether the counter representing the position of the coefficient is less than the threshold value in the determination process S15, and outputs a code if the result is Yes. If so, it is checked in the determination process S18 whether the value of the allowable number storage means 105 is positive. If the result is Yes, the code is output in the process S17, and if it is No, the output of the two-dimensional Huffman code of the coefficient is output. Restrict.
[0048]
On the other hand, when the MCU indicated by the solid line in FIG. 5 is processed, the threshold value is set to 64 when the value of the MCU counter is notified to the spatial threshold value specifying means 103 in step S03 of FIG. As a result, all AC coefficients are determined to be less than the threshold value in the determination process S15, and the coefficient encoding unit 102 does not perform code output restriction. That is, all the non-zero coefficients found in the determination process S14 are encoded in the process S16, and the stored value in the allowable number storage unit 105 is decremented each time a code is output. As a result, it is possible to obtain an image encoding device that can control the image quality and code amount of the peripheral portion of the image while maintaining the image quality near the center of the image.
[0049]
In this embodiment, the spatial allowable number increase means 104 stores the increase value of the allowable number in the table and updates the value of the allowable number storage means 105 with reference to the table. It can also be used. For example, an increase value may be obtained by calculation based on the MCU address, and the value in the allowable number storage means 105 may be updated. In addition, this program reflects the past code generation situation, predicts an allowable increase in the number of times required in the next MCU, and adaptively disperses image quality degradation due to insufficient allowable numbers to neighboring MCUs. It is also possible to execute various controls.
[0050]
Furthermore, in the present embodiment, the spatial allowable number increasing means 104 is operated so that the value of the allowable number storage means 105 is updated before encoding the MCU in units of MCUs. It is also possible to input and operate the spatial allowable number increasing means 104 more finely in units of blocks.
[0051]
【The invention's effect】
According to the image coding apparatus of the present invention, the following effects can be obtained. The image quality can be controlled according to the position in the image while executing the code amount control so as to be within a certain code amount. In particular, image quality degradation occurs in the peripheral portion of the image, and a constant image quality is guaranteed in the central portion of the image, and information in the central portion that is important in the image is not degraded. In addition, the output code can be decoded by an international standard JPEG decoder, no dedicated decoder is required, and various popular JPEG decoders can be used, increasing the degree of freedom of the system. In addition, the production cost of the decoder can be reduced. Furthermore, the output code can be reused for various purposes without format conversion.
[Brief description of the drawings]
FIG. 1 is a block diagram of an image encoding apparatus according to the present invention.
FIG. 2 is a flowchart showing the operation of the image coding apparatus according to the present invention.
FIG. 3 is a flowchart showing an operation of coding a block by a coefficient coding unit according to the present invention.
FIG. 4 is a diagram showing the order of zigzag scanning of an 8 × 8 coefficient block.
FIG. 5 is a diagram showing MCUs located in a peripheral part and MCUs located in a central part among MCUs constituting an image;
FIG. 6 is a diagram showing an increase value of the allowable number that is increased by the spatial allowable number increase means before encoding each MCU in correspondence with the position of the MCU.
FIG. 7 is a block diagram of a conventional image encoding device.
[Explanation of symbols]
101: Transform coding unit
102: Coefficient encoding unit
103: Spatial threshold designation means
104: Means for increasing the allowable number of times of space
105: Permissible number storage means
110: Image
111-118: MCU
201: DCT calculation unit
202: Quantization unit
203: VLC section
204: VLC control circuit
205: Rate calculation circuit

Claims

In an image coding apparatus comprising transform coding means for discrete cosine transform and quantizing image data for each block and outputting coefficients, and coefficient coding means for coding coefficients output by the transform coding means,
An allowable number storage means for storing the allowable number of code outputs of the coefficient encoding means;
Spatial allowable number increase means for increasing the allowable number of times stored in the allowable number storage means according to the position of the image data in the image;
Spatial threshold value specifying means for specifying a threshold value of a coefficient according to the position of the image data in the image;
With
The coefficient encoding unit compares a counter value indicating the position of the coefficient output by the transform encoding unit in the zigzag scan order with a threshold value specified by the spatial threshold value specifying unit, and when the counter value is lower than the threshold value Encodes the coefficient and outputs the code. If the allowable number stored in the allowable number storage means is positive, the coefficient is encoded and the code is output. If the allowable number is not positive, the coefficient is treated as zero. Does not output the sign,
Further, when the coefficient encoding means outputs a code, the image encoding apparatus controls to reduce the allowable number of times stored in the allowable number storage means by the number of the outputted codes.