JP2003047005A

JP2003047005A - Image encoding apparatus and method, program, and storage medium

Info

Publication number: JP2003047005A
Application number: JP2001232812A
Authority: JP
Inventors: Hiroshi Kajiwara; 浩梶原
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2001-07-31
Filing date: 2001-07-31
Publication date: 2003-02-14

Abstract

(57) [Abstract]
[Problem] To encode image data efficiently when it contains areas that require different resolution levels, and to generate a code string from which the resolution area of interest can be identified at an early stage during decoding.
[Solution] An area determination unit 102 generates a flag based on high-resolution area information input from a high-resolution area information input unit 101. A high-resolution image input unit 103 and a low-resolution image input unit 104 read the high-resolution and low-resolution images, respectively, according to the flag. A discrete wavelet transform unit 105 applies a two-dimensional discrete wavelet transform to the high-resolution image. A coefficient synthesis unit 106 combines the low-frequency subband of the high-resolution image with the low-resolution image. A discrete wavelet transform unit 107 applies a two-dimensional discrete wavelet transform to the combined subband, and a bit plane encoding unit 108 generates a code string.

Description

Detailed Description of the Invention

[0001]

[Field of the Invention] The present invention relates to an image encoding apparatus that encodes image data, a method therefor, a program, and a storage medium.

[0002]

[Description of the Related Art] In recent years, as image input devices such as digital cameras and scanners have improved, the resolution of the image data they capture has kept increasing. A low-resolution image contains little data and poses no problem for transmission or storage. As resolution rises, however, the amount of image data becomes enormous, so transmission takes a long time and storage requires a large capacity. For this reason, images are generally transmitted and stored using high-efficiency coding, which reduces the amount of data either by removing redundancy from the image or by altering the image within a visually acceptable range. A coding method that reproduces the original image exactly on decoding is called lossless coding; a method that yields a visually similar image but cannot reproduce the original exactly is called lossy coding. In lossy coding it is essential to reduce the code amount by altering portions where degradation is not visually noticeable, but which portions those are depends heavily on the characteristics of the image. Image data comes in many types: natural images, produced by photographing people or landscapes on silver halide film and scanning the result, or by shooting directly with a digital camera; character and line images, produced by rasterizing character and line information; and CG images, such as computer-generated two-dimensional image data or renderings of three-dimensional shapes. The resolution and the number of gradations needed for good reproduction quality are said to differ for each type. In general, character images and line images are considered to require higher resolution than natural images.

[0003] Methods based on the wavelet transform have conventionally been used as one technique for high-efficiency coding. In the conventional method, the image to be encoded is first divided into a plurality of frequency bands (subbands) using the discrete wavelet transform, and the transform coefficients of each subband are then quantized and entropy coded by various methods to generate a code string. To apply the wavelet transform to an image, a one-dimensional transform is applied in the horizontal and vertical directions to divide the image into four subbands, as shown in FIGS. 4(a), (b), and (c). In addition, it is common to repeat the division only on the low-frequency subband (LL subband). FIG. 5 shows an example in which the one-dimensional transform is repeated twice.

[0004] One advantage of image coding based on the wavelet transform is that progressive decoding by spatial resolution is easy to realize. When the wavelet transform is applied as in FIG. 5 and the coefficients of the subbands are encoded and transmitted in order from the low-frequency subband LL to the high-frequency subband HH2, the decoding side can decode the image at gradually increasing resolution: a reconstructed image at 1/4 resolution once the LL coefficients have been received, at 1/2 resolution once LL, LH1, HL1, and HH1 have been received, and at the original resolution once LH2, HL2, and HH2 have also been received.
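The progressive behaviour described in this paragraph can be sketched as a small lookup. This is only an illustration: the function name `decodable_resolution` and the use of `Fraction` are assumptions of the sketch, and the subband names and resolution fractions are the ones stated in the paragraph.

```python
from fractions import Fraction

def decodable_resolution(received):
    """Return the highest resolution (fraction of the original) that can be
    reconstructed from the received subbands, or None if LL is missing."""
    got = set(received)
    if "LL" not in got:
        return None
    resolution = Fraction(1, 4)              # LL alone
    if {"LH1", "HL1", "HH1"}.issubset(got):
        resolution = Fraction(1, 2)          # first refinement level
        if {"LH2", "HL2", "HH2"}.issubset(got):
            resolution = Fraction(1, 1)      # full resolution
    return resolution

quarter = decodable_resolution(["LL"])
half = decodable_resolution(["LL", "LH1", "HL1", "HH1"])
full = decodable_resolution(["LL", "LH1", "HL1", "HH1", "LH2", "HL2", "HH2"])
```

Receiving LH2, HL2, and HH2 without the level-1 subbands does not raise the resolution in this sketch, because each level refines the one below it.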

[0005] When portions requiring different resolutions coexist in one image, as in a mixed image of text and photographs, a method is used that exploits the spatial-resolution scalability of the wavelet transform: the data needed to decode at high resolution is encoded only for the portions that need it, and for areas that do not need high resolution that data is discarded.

[0006]

[Problems to be Solved by the Invention] However, with a conventional high-efficiency coding method such as the one described above, encoding image data that contains areas with different required resolutions, for example a mixed image of natural images and character or line images, requires the image data to be read at high resolution first, so the method cannot be called efficient.

[0007] The present invention has been made in view of the above problem, and an object thereof is to perform efficient image encoding when encoding image data that includes areas requiring different resolution levels.

[0008] Another object is to perform encoding that generates a code string from which the resolution area of interest can be identified at an early stage when the encoded image data is decoded.

[0009]

[Means for Solving the Problems] To achieve the objects of the present invention, an image encoding apparatus of the present invention has, for example, the following configuration.

[0010] That is, an image encoding apparatus for encoding an image comprises: first reading means for reading a first area of an original image at a first resolution and generating a first image including the read area; second reading means for reading, in the original image, a second area different from the first area at a second resolution higher than the first resolution and generating a second image including the read area; first frequency transform means for performing a frequency transform on the second image to obtain coefficients for each subband; synthesis means for combining the first image with a predetermined subband among the plurality of subbands produced by the first frequency transform means to generate a combined subband; second frequency transform means for performing a further frequency transform on the combined subband to obtain coefficients for each subband; and code string generation means for generating a code string from the subband coefficients obtained by the first frequency transform means and the second frequency transform means.

[0011] The apparatus further comprises flag generation means for generating a flag that designates the area to be read by the second reading means.

[0012] Alternatively, the apparatus further comprises flag generation means for generating a flag that indicates a predetermined area in the first image.

[0013] In this case, the apparatus further comprises mask generation means for generating a mask that indicates, among the coefficients constituting the subbands generated when the code string generation means performs the frequency transform on the combined subband, the coefficients related to the second image.

[0014]

[Embodiments of the Invention] The present invention will now be described in detail according to preferred embodiments with reference to the accompanying drawings.

[0015] [First Embodiment] FIG. 1 shows the functional configuration of the image encoding apparatus according to this embodiment. In the figure, 101 is a high-resolution area information input unit, 102 is a high-resolution scan area determination unit, 103 is a high-resolution image input unit, 104 is a low-resolution image input unit, 105 is a discrete wavelet transform unit, 106 is a coefficient synthesis unit, 107 is a discrete wavelet transform unit, 108 is a bit plane encoding unit, 109 is a code string forming unit, and 110 is a code output unit.

[0016] FIG. 10 shows the basic configuration of the image encoding apparatus according to this embodiment.

[0017] Reference numeral 1001 denotes a CPU, which controls the entire apparatus using programs and data stored in a RAM 1002 and a ROM 1003, and also performs the encoding processing described later.

[0018] Reference numeral 1002 denotes a RAM, which has an area for temporarily storing programs and data loaded from an external storage device 1004 or a storage medium drive 1009, as well as data to be processed, and also has a work area used by the CPU 1001 when it performs processing.

[0019] Reference numeral 1003 denotes a ROM, which stores programs and data for controlling the entire apparatus.

[0020] Reference numeral 1004 denotes an external storage device such as a hard disk, which stores programs and data read from the storage medium drive 1009 as files. When the work area that the CPU 1001 uses during processing can no longer be accommodated in the RAM 1002, the external storage device can also provide that work area as a file.

[0021] Reference numerals 1005 and 1006 denote a keyboard and a mouse, respectively, with which various instructions can be input to the apparatus.

[0022] Reference numeral 1007 denotes a display device, composed of a CRT, a liquid crystal screen, or the like, which can display images and characters.

[0023] Reference numeral 1008 denotes an image input device, composed of a digital camera, a scanner, or the like, which can input an image as data. It also includes circuits that process the input image (gamma correction, color correction, and so on).

[0024] Reference numeral 1009 denotes a storage medium drive, which reads programs, data, and the like from a storage medium such as a CD-ROM or DVD. The programs and data it reads are output to the external storage device 1004 or the RAM 1002.

[0025] Reference numeral 1010 denotes a bus that connects the above units.

[0026] As described above, the image encoding apparatus of this embodiment has the basic configuration shown in FIG. 10. A program having the functional configuration shown in FIG. 1 may be read from the storage medium drive 1009, the external storage device 1004, or the like and executed by the CPU 1001, so that the image encoding apparatus serves as an apparatus having the configuration shown in FIG. 1.

[0027] In this embodiment, a monochrome document is read and encoded as image data in which the luminance value of each pixel is expressed in 8 bits. The invention is not limited to this, however, and also applies when the document is read and encoded as image data whose luminance values are expressed with a bit depth other than 8 bits, such as 4, 10, or 12 bits. It also applies when the document is read as color image data in which each pixel is expressed by a plurality of color components such as RGB or CMYK, or by luminance and chromaticity/color-difference components such as YCrCb. In that case, each component of the color image data can be regarded as monochrome image data.

[0028] The function and operation of each unit in this embodiment are described below with reference to FIGS. 1 and 10. In this embodiment, the monochrome document to be encoded is read at a predetermined resolution and encoded, except that one part of it is read at twice that resolution in both the horizontal and vertical directions. Hereinafter, the predetermined resolution is called the low resolution, and the resolution doubled in both the horizontal and vertical directions is called the high resolution. The size of the monochrome document is assumed to be fixed, and the image size obtained when the entire document is read at the high resolution is denoted X and Y. Both X and Y are multiples of 4.

[0029] First, high-resolution area information, which specifies the part of the document to be read at the high resolution, is input from the high-resolution area information input unit 101. In this embodiment the area read at the high resolution is a rectangle. The high-resolution area information consists of the upper-left corner pixel position (Ulx, Uly) and the lower-right corner pixel position (LRx, LRy), expressed relative to the image size obtained when reading at the high resolution. FIG. 2 shows an example of a document and its high-resolution area information.

[0030] The high-resolution scan area determination unit 102 generates, from the upper-left and lower-right corner pixel positions of the high-resolution area input from the high-resolution area information input unit 101, a resolution flag F(x, y) that designates the pixel positions actually read at the high resolution. The generated resolution flag F(x, y) is temporarily stored in the RAM 1002. The resolution flag F(x, y) is set to 1 at pixel positions actually read at the high resolution and to 0 elsewhere. Specifically, F(x, y) is given by a defining expression [equation not reproduced in this text]. FIG. 7 shows an example of the resolution flag F(x, y) when the high-resolution area information is specified as in FIG. 2.
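Because the defining expression for F(x, y) is not available in this text, the following is only a hypothetical sketch of how such a flag could be built. It assumes that the rectangle is half-open, covering x from Ulx up to but not including LRx and y from Uly up to but not including LRy, and that all corner coordinates are even. The function name `make_resolution_flag` is also an assumption.

```python
def make_resolution_flag(X, Y, ulx, uly, lrx, lry):
    """Build F as a list of Y rows of X flags; F[y][x] is 1 inside the
    high-resolution rectangle and 0 elsewhere.

    Assumption (not from the source): the rectangle is half-open,
    [ulx, lrx) x [uly, lry), and its corner coordinates are even.
    """
    return [
        [1 if (x in range(ulx, lrx) and y in range(uly, lry)) else 0
         for x in range(X)]
        for y in range(Y)
    ]

# An 8 x 8 page whose high-resolution area is 4 pixels wide and 2 pixels tall.
F = make_resolution_flag(8, 8, 2, 2, 6, 4)
```

With even corners and a half-open rectangle, every aligned 2 x 2 block of flags is uniform, so a single flag per block is enough to decide how the block is read.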

[0031] Once the high-resolution scan area determination unit 102 has determined the area to be read at the high resolution, the high-resolution image input unit 103 and the low-resolution image input unit 104 read the high-resolution portion and the low-resolution portion of the image based on the resolution flag F(x, y). In practice, the image of the same document is input to both the high-resolution image input unit 103 and the low-resolution image input unit 104, and the input units 103 and 104 read the high-resolution portion and the low-resolution portion, respectively.

[0032] The high-resolution image input unit 103 reads pixel data from the monochrome document only where the resolution flag F(x, y) = 1, and forms high-resolution image data Ph(x, y) in the RAM 1002. At positions where F(x, y) = 0, it sets Ph(x, y) = 0 and performs no actual read operation.

[0033] The discrete wavelet transform unit 105 applies a two-dimensional discrete wavelet transform to the pixel data Ph(x, y) of the high-resolution image data input from the high-resolution image input unit 103, storing data in the RAM 1002 as needed, and decomposes the image into four subbands: LL, LH, HL, and HH. It then outputs the coefficients of each subband to an area of the RAM 1002 separate from the area holding the image data Ph(x, y). Hereinafter, the coefficients of each subband are denoted C(S, x, y), where S identifies the subband and is one of LL, LH, HL, and HH, and x and y are the horizontal and vertical coefficient positions, with the coefficient at the upper-left corner of each subband at (0, 0).

[0034] The two-dimensional discrete wavelet transform is realized by applying a one-dimensional transform (filtering) in the horizontal and vertical directions. FIG. 4 shows the two-dimensional discrete wavelet transform processing. As the figure shows, a one-dimensional discrete wavelet transform is first applied in the vertical direction to the image to be encoded (FIG. 4(a)), decomposing it into a low-frequency subband L and a high-frequency subband H (FIG. 4(b)). A one-dimensional discrete wavelet transform is then applied to each of these in the horizontal direction, decomposing them into the four subbands LL, HL, LH, and HH (FIG. 4(c)). In this image encoding apparatus, the one-dimensional discrete wavelet transform of a one-dimensional signal x(n) of N samples (n = 0 to N-1) is performed by the following equations.

[0035]
h(n) = x(2n) - x(2n+1)   (1)
l(n) = floor{(x(2n) + x(2n+1)) / 2}   (2)
Here, h(n) is a coefficient of the high-frequency subband and l(n) is a coefficient of the low-frequency subband, and floor{R} is a function that returns the largest integer not exceeding the real number R.
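Equations (1) and (2), and the vertical-then-horizontal decomposition built from them, can be sketched as follows. This is a minimal illustration. The function names are assumptions, and so is the labelling of the mixed subbands (HL taken as horizontally high and vertically low), since the text does not state which of HL and LH is which. The inverse step is not given in the text; it follows algebraically from (1) and (2) and is included only to show that the integer transform is reversible.

```python
def dwt_1d(x):
    """One level of the 1-D transform of equations (1) and (2).
    x must have even length; returns (l, h)."""
    half = len(x) // 2
    h = [x[2 * n] - x[2 * n + 1] for n in range(half)]
    # Python's // floors toward minus infinity, matching floor{R}.
    l = [(x[2 * n] + x[2 * n + 1]) // 2 for n in range(half)]
    return l, h

def idwt_1d(l, h):
    """Inverse of dwt_1d (derived here, not stated in the source)."""
    x = []
    for lo, hi in zip(l, h):
        a = lo + (hi + 1) // 2   # recovers x(2n)
        x.extend([a, a - hi])    # x(2n+1) = x(2n) - h(n)
    return x

def dwt_2d(img):
    """Vertical pass, then horizontal pass. img is a list of rows with
    even width and height. Returns a dict of the four subbands."""
    columns = [list(c) for c in zip(*img)]
    lo_cols, hi_cols = zip(*(dwt_1d(c) for c in columns))
    L = [list(r) for r in zip(*lo_cols)]   # vertically low band
    H = [list(r) for r in zip(*hi_cols)]   # vertically high band

    def horizontal(band):
        pairs = [dwt_1d(row) for row in band]
        return [p[0] for p in pairs], [p[1] for p in pairs]

    LL, HL = horizontal(L)
    LH, HH = horizontal(H)
    return {"LL": LL, "HL": HL, "LH": LH, "HH": HH}

low, high = dwt_1d([5, 2, -3, 4])
bands = dwt_2d([[10, 10, 20, 20],
                [10, 10, 20, 20],
                [30, 30, 40, 40],
                [30, 30, 40, 40]])
```

For the piecewise-constant test image, every 2 x 2 block is uniform, so all detail subbands are zero and LL holds one value per block.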

[0036] Meanwhile, the low-resolution image input unit 104 refers to the resolution flag generated by the high-resolution scan area determination unit 102, reads image data at the low resolution for pixels where F(x, y) = 0, and forms a low-resolution image Pl(lx, ly) in the RAM 1002 (in an area separate from the one holding the high-resolution image data Ph(x, y)). Here lx ranges from 0 to (X/2 - 1) and ly from 0 to (Y/2 - 1). Each pixel Pl(lx, ly) of the low-resolution image corresponds to four flags: F(lx×2, ly×2), F(lx×2+1, ly×2), F(lx×2, ly×2+1), and F(lx×2+1, ly×2+1). Because the coordinates of the four corners of the high-resolution area are even, these four flags always have the same value, so F(lx×2, ly×2) is used as the indicator of whether the pixel needs to be read. For lx, ly where F(lx×2, ly×2) = 1, Pl(lx, ly) is set to 0 and no actual read operation is performed.
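The flag-controlled low-resolution read can be sketched as below. The callback `scan_pixel(lx, ly)` is a hypothetical stand-in for the scanner hardware, and the function name and list-of-rows layout are assumptions of this sketch. Only the rule "read where F(lx×2, ly×2) = 0, otherwise store 0" comes from the text.

```python
def read_low_resolution(scan_pixel, F, X, Y):
    """Form Pl as (Y/2) rows of (X/2) pixels.

    scan_pixel(lx, ly) is a hypothetical scanner callback; it is called
    only for positions outside the high-resolution area.
    """
    Pl = []
    for ly in range(Y // 2):
        row = []
        for lx in range(X // 2):
            if F[2 * ly][2 * lx] == 1:
                row.append(0)                    # high-resolution area: skip
            else:
                row.append(scan_pixel(lx, ly))   # low-resolution read
        Pl.append(row)
    return Pl

calls = []

def fake_scan(lx, ly):
    """Toy scanner that records each access and returns a marker value."""
    calls.append((lx, ly))
    return 100 + 10 * ly + lx

# 4 x 4 page whose upper-left 2 x 2 block is the high-resolution area.
F = [[1, 1, 0, 0],
     [1, 1, 0, 0],
     [0, 0, 0, 0],
     [0, 0, 0, 0]]
Pl = read_low_resolution(fake_scan, F, 4, 4)
```

In this toy run the scanner is accessed three times instead of four, which is the saving the embodiment aims for on a larger scale.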

[0037] When the low-resolution image input by the low-resolution image input unit 104 and the subband decomposition of the high-resolution image by the discrete wavelet transform unit 105 are complete, the coefficient synthesis unit 106 refers to the resolution flag F(x, y) generated by the high-resolution scan area determination unit 102 and combines the low-frequency subband C(LL, x, y) generated by the discrete wavelet transform unit 105 with the low-resolution image Pl(lx, ly). The low-frequency subband C(LL, x, y) has the same size as the low-resolution image Pl(lx, ly) (X/2 horizontally and Y/2 vertically), so in the following description of the synthesis processing the variables giving the coefficient position within the subband are written lx, ly instead of x, y.

[0038] The coefficient synthesis unit 106 examines the resolution flag F(lx×2, ly×2) for every coefficient of C(LL, lx, ly) in raster scan order and, if F(lx×2, ly×2) = 0, replaces C(LL, lx, ly) with Pl(lx, ly). Performing this replacement for all coefficients of C(LL, lx, ly) replaces the coefficients in C(LL, lx, ly) that correspond to the low-resolution portion of the image with the low-resolution image Pl(lx, ly).
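The replacement rule of the coefficient synthesis unit 106 can be sketched as a raster-order loop. The function name `synthesize_ll` and the choice to return a copy rather than overwrite in place are assumptions of this sketch; the condition F(lx×2, ly×2) = 0 is taken from the text.

```python
def synthesize_ll(C_LL, Pl, F):
    """Return the combined LL subband: coefficients outside the
    high-resolution area are replaced by low-resolution pixels."""
    merged = [row[:] for row in C_LL]          # leave the input untouched
    for ly in range(len(merged)):              # raster scan order
        for lx in range(len(merged[ly])):
            if F[2 * ly][2 * lx] == 0:
                merged[ly][lx] = Pl[ly][lx]
    return merged

F = [[1, 1, 0, 0],
     [1, 1, 0, 0],
     [0, 0, 0, 0],
     [0, 0, 0, 0]]
C_LL = [[50, 0], [0, 0]]        # LL of the high-resolution image (toy values)
Pl = [[0, 101], [110, 111]]     # low-resolution image (toy values)
combined = synthesize_ll(C_LL, Pl, F)
```

Only the coefficient under the high-resolution block keeps its wavelet value; the rest of the combined subband is the low-resolution scan.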

[0039] The discrete wavelet transform unit 107 decomposes the new C(LL, lx, ly), obtained by the coefficient synthesis unit 106 combining C(LL, lx, ly) and Pl(lx, ly), into four subbands by the same processing as the discrete wavelet transform unit 105. FIG. 5 shows the combined LL subband further decomposed into four subbands by the discrete wavelet transform unit 107. In the figure, the LH, HL, and HH subbands generated by the discrete wavelet transform unit 105 are labeled LH2, HL2, and HH2, and those generated by the discrete wavelet transform unit 107 are labeled LH1, HL1, and HH1 to distinguish them. Note that LL in FIG. 5 is the result of combining the low-resolution image with the LL subband of FIG. 4(c) and decomposing it again; the two are not the same.
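The sizes of the seven resulting subbands follow from the two halving steps: the subbands from unit 105 are X/2 by Y/2, and those from unit 107 are X/4 by Y/4. A small sketch, with the function name `subband_sizes` as an assumption:

```python
def subband_sizes(X, Y):
    """Return {subband name: (width, height)} for the two-level layout,
    where X and Y are the full high-resolution image dimensions."""
    if X % 4 or Y % 4:
        raise ValueError("X and Y must be multiples of 4")
    first_level = (X // 2, Y // 2)    # produced by unit 105
    second_level = (X // 4, Y // 4)   # produced by unit 107
    sizes = {name: first_level for name in ("LH2", "HL2", "HH2")}
    sizes.update({name: second_level for name in ("LL", "LH1", "HL1", "HH1")})
    return sizes

sizes = subband_sizes(16, 8)
total = sum(w * h for w, h in sizes.values())
```

The coefficient count summed over all seven subbands equals X × Y, which is why X and Y are required to be multiples of 4.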

[0040] The bit plane encoding unit 108 encodes the coefficient values C(S, x, y) of the seven subbands LL, LH1, HL1, HH1, LH2, HL2, and HH2 produced by the discrete wavelet transform unit 105, the coefficient synthesis unit 106, and the discrete wavelet transform unit 107, and generates a code string. Methods are known that divide the coefficients of each subband into blocks and encode the blocks separately to facilitate random access, but to keep the description simple, encoding is performed here per subband. The coefficient values C(S, x, y) of each subband are encoded by expressing their absolute values as natural binary numbers and applying binary arithmetic coding bit plane by bit plane, from the most significant digit to the least significant digit. In the description below, the n-th bit from the bottom of a coefficient value C(S, x, y) in natural binary notation is written Cn(S, x, y). The variable n, which identifies the binary digit, is called the bit plane number, and the LSB is bit plane number 0.

[0041] FIG. 6 is a flowchart showing the flow of processing by which the bit plane encoding unit 108 encodes a subband S.

[0042] Step S601 obtains the maximum absolute value Mabs(S) of the coefficients in the subband S. Step S602 obtains the number of significant digits N_BP(S) needed to represent the absolute values of the coefficients in the subband. Step S603 assigns the number of significant digits to the variable n. Step S604 computes (n - 1) and assigns it to n. Step S605 encodes the bit plane of the n-th digit. Step S606 determines whether n is 0.

[0043] The flow of the encoding processing for a subband S in the bit plane encoding unit 108 is described below with reference to FIG. 6.

[0044] First, in step S601, the absolute values of the coefficients in the subband S to be encoded are examined and their maximum value Mabs(S) is obtained. Next, in step S602, the number of digits N_BP(S) needed to express Mabs(S) as a binary number is obtained by the following equation.

[0045]
N_BP(S) = ceil{log2(Mabs(S))}
Here, ceil{R} denotes the smallest integer greater than or equal to the real number R. In step S603, the number of significant digits N_BP(S) is assigned to the bit plane number n. In step S604, 1 is subtracted from the bit plane number n. In step S605, bit plane n is encoded using a binary arithmetic code. This embodiment uses the QM-Coder as the arithmetic code. The procedure for encoding a binary symbol that occurs in a given state (context) with the QM-Coder, and the initialization and termination procedures for arithmetic coding, are described in detail in the international still-image standard ITU-T Recommendation T.81 | ISO/IEC 10918-1 and elsewhere, so their description is omitted here. To keep the description simple, this embodiment arithmetically encodes every bit with a single context. The arithmetic encoder in the bit plane encoding unit 108 is initialized at the start of encoding each bit plane and terminated at the end. In addition, immediately after the first '1' encoded for each coefficient, the sign of that coefficient is represented as 0 or 1 and arithmetically encoded: 0 if positive, 1 if negative.

[0046] For example, when the coefficient is -5 and the number of significant digits N_BP(S) of the subband S to which it belongs is 6, the absolute value of the coefficient is represented by the binary number 000101 and is encoded from the upper digit to the lower digit as the bit planes are encoded. When bit plane number 2 is encoded (in this case, the fourth digit from the top), the first '1' is encoded, and immediately afterward the sign '1' is arithmetically encoded.
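
The worked example for the coefficient -5 can be traced with a short Python sketch. It lists the symbols that would be handed to the arithmetic coder for a single coefficient, in coding order. The function name `bitplane_symbols` and the tuple layout are illustrative assumptions; the actual encoder processes all coefficients of a subband plane by plane and passes the symbols to the QM-Coder, which is not modeled here.

```python
def bitplane_symbols(coeff, n_bp):
    """Return (plane, kind, value) tuples for one coefficient, MSB plane first."""
    magnitude = abs(coeff)
    seen_first_one = False
    symbols = []
    for n in range(n_bp - 1, -1, -1):       # planes n = N_BP-1 ... 0
        bit = (magnitude >> n) & 1
        symbols.append((n, "bit", bit))
        if bit == 1 and not seen_first_one:
            # Sign follows the first '1': 0 if positive, 1 if negative.
            symbols.append((n, "sign", 1 if coeff < 0 else 0))
            seen_first_one = True
    return symbols

for symbol in bitplane_symbols(-5, 6):
    print(symbol)
```

For -5 with six planes, the bits 0, 0, 0, 1, 0, 1 are emitted for planes 5 down to 0, and the sign symbol 1 appears right after the bit of plane 2.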

[0047] In step S606, the bit plane number n is compared with 0. If n = 0, that is, if the LSB plane was encoded in step S605, the encoding of the subband ends; otherwise, processing returns to step S604.

[0048] Through the above processing, all coefficients of subband S are encoded, and a code string CS(S, n) corresponding to each bit plane n is generated. The generated code strings are sent to the code string forming unit 109 and temporarily stored in the RAM 1002.
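
The loop of steps S603 to S606 can be sketched in Python as follows. This is only an outline under stated assumptions: the QM-Coder is replaced by a plain list of binary symbols per bit plane, and the names `encode_subband` and `streams` are hypothetical. Each list stands in for one code string CS(S, n).

```python
def encode_subband(coeffs, n_bp):
    """Return {n: symbols} with one symbol list per bit plane n (stand-in for CS(S, n))."""
    streams = {}
    if n_bp <= 0:
        return streams                      # nothing to encode
    significant = [False] * len(coeffs)     # has the first '1' been coded yet?
    n = n_bp                                # S603: n <- N_BP(S)
    while True:
        n -= 1                              # S604: n <- n - 1
        symbols = []                        # S605: code bit plane n
        for i, c in enumerate(coeffs):
            bit = (abs(c) >> n) & 1
            symbols.append(bit)
            if bit and not significant[i]:
                symbols.append(1 if c < 0 else 0)   # sign after the first '1'
                significant[i] = True
        streams[n] = symbols
        if n == 0:                          # S606: stop after the LSB plane
            break
    return streams

print(encode_subband([-5, 3], 3))
```

In a real encoder each symbol list would instead be fed to an arithmetic coder that is initialized at the start of the plane and terminated at its end.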

[0049] When the bit plane encoding unit 108 has finished encoding the coefficients of all subbands and all code strings have been stored in the RAM 1002, the code string forming unit 109 reads out the code strings stored in the RAM 1002 in a predetermined order, inserts the necessary additional information to form the final code string that is the output of this encoding apparatus, and outputs it to the code output unit 110.

[0050] The final code string generated by the code string forming unit 109 consists of a header and encoded data layered into three levels: level 0, level 1, and level 2. The level 0 encoded data consists of the code strings CS(LL, N_BP(LL)-1) to CS(LL, 0) obtained by encoding the coefficients of the LL subband. Level 1 consists of the code strings CS(LH1, N_BP(LH1)-1) to CS(LH1, 0), CS(HL1, N_BP(HL1)-1) to CS(HL1, 0), and CS(HH1, N_BP(HH1)-1) to CS(HH1, 0) obtained by encoding the coefficients of the LH1, HL1, and HH1 subbands. Level 2 consists of the code strings CS(LH2, N_BP(LH2)-1) to CS(LH2, 0), CS(HL2, N_BP(HL2)-1) to CS(HL2, 0), and CS(HH2, N_BP(HH2)-1) to CS(HH2, 0) obtained by encoding the coefficients of the LH2, HL2, and HH2 subbands. FIG. 3 shows the structure of the code string generated by the code string forming unit 109.
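
The ordering of the code strings within the three levels can be sketched as below. The sketch only enumerates which CS(S, n) follows which; the header contents and the byte layout are not specified in this passage and are left out. The function name `code_string_order` and the dictionary argument are assumptions for illustration.

```python
LEVELS = [
    ["LL"],                   # level 0
    ["LH1", "HL1", "HH1"],    # level 1
    ["LH2", "HL2", "HH2"],    # level 2
]

def code_string_order(n_bp):
    """List (level, subband, n) in the order the code strings CS(S, n) are arranged."""
    order = []
    for level, subbands in enumerate(LEVELS):
        for s in subbands:
            # CS(S, N_BP(S)-1) down to CS(S, 0)
            for n in range(n_bp[s] - 1, -1, -1):
                order.append((level, s, n))
    return order

example = {"LL": 2, "LH1": 1, "HL1": 1, "HH1": 0, "LH2": 1, "HL2": 0, "HH2": 2}
for entry in code_string_order(example):
    print(entry)
```

Because level 0 comes first, a decoder that stops after level 0 or level 1 can still reconstruct a lower-resolution image.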

[0051] The code output unit 110 outputs the code string generated by the code string forming unit 109 to the outside of the apparatus. The code output unit 110 is, for example, a storage device such as the external storage device 1004 or the RAM 1002, or, when a network interface is newly provided in the image coding apparatus, that interface.

[0052] As described above, the image data is captured separately at high resolution and at low resolution, and the low-resolution image is combined with the low-frequency subband obtained by subband decomposition of the high-resolution image and then encoded. This makes it possible to read and encode only the necessary portion at high resolution.

[0053] [Second Embodiment] FIG. 8 shows the functional configuration of the image coding apparatus according to this embodiment. Parts common to FIG. 1, used in the first embodiment, are denoted by the same reference numerals, and their description is omitted. The basic configuration of the image coding apparatus in this embodiment is that shown in FIG. 10.

[0054] In FIG. 8, reference numeral 801 denotes a low-resolution image input unit, 802 an image area determination unit, 803 a high-resolution image input unit, and 804 a coefficient synthesis unit.

[0055] In this embodiment, as in the first embodiment, a black-and-white original is read and encoded as image data in which the luminance value of each pixel is represented by 8 bits. However, the invention is not limited to this and is also applicable when the original is read and encoded as image data whose luminance values are represented by a bit depth other than 8 bits, such as 4, 10, or 12 bits. It is also applicable when each pixel is read as color image data represented by multiple color components such as RGB or CMYK, or by luminance and chromaticity/color-difference components such as YCrCb. In that case, each component of the color image data may be regarded as monochrome image data. In this embodiment, as in the first embodiment, the black-and-white original to be encoded is read at a predetermined resolution and encoded, but only a part of it is read at twice that resolution in both the horizontal and vertical directions. Hereinafter, the predetermined resolution is called the low resolution, and the resolution doubled in both the horizontal and vertical directions is called the high resolution. The size of the black-and-white original is assumed to be fixed, and the image size obtained when the entire original is read at high resolution is denoted by X and Y. Both X and Y are multiples of 4.

[0056] The function and operation of each unit of the image coding apparatus according to this embodiment are described below with reference to FIG. 8.

[0057] First, the low-resolution image input unit 801 reads the entire black-and-white original to be encoded at half the resolution (for example, by reading every other pixel in both the horizontal and vertical directions) and generates Pl(lx, ly). Here, lx ranges from 0 to (X/2-1), and ly ranges from 0 to (Y/2-1).
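
As a minimal sketch of the "every other pixel" example mentioned above, the following Python function subsamples a full-resolution image held in memory. Treating the original as a list of rows indexed `[y][x]` is an assumption made here; an actual scanner would read the original directly at the lower resolution. The name `read_low_resolution` is hypothetical.

```python
def read_low_resolution(original):
    """Keep every other pixel horizontally and vertically: Pl(lx, ly) = original(2*lx, 2*ly)."""
    return [row[::2] for row in original[::2]]

# A 4x4 original (X = Y = 4) yields a 2x2 low-resolution image.
original = [[y * 4 + x for x in range(4)] for y in range(4)]
print(read_low_resolution(original))
```

The result has X/2 columns and Y/2 rows, matching the index ranges given for lx and ly.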

[0058] The image area determination unit 802 detects character and line-drawing areas in the low-resolution image Pl(lx, ly) input from the low-resolution image input unit 801 and generates a resolution flag Fl(lx, ly). The resolution flag Fl(lx, ly) indicates whether each pixel of the low-resolution image Pl(lx, ly) is included in a portion determined to be a character/line-drawing area: it is set to 1 when Pl(lx, ly) is included in a character/line-drawing area and to 0 when it is not. In this embodiment, the specific method by which the image area determination unit 802 discriminates character and line-drawing areas does not matter.

[0059] The high-resolution image input unit 803 refers to the resolution flag Fl(lx, ly) generated by the image area determination unit 802, reads the black-and-white original at the resolution of the original image, and forms high-resolution image data Ph(x, y). For positions x, y where the resolution flag Fl(floor{x/2}, floor{y/2}) = 0, no actual reading is performed and Ph(x, y) = 0 is set. Here, x ranges from 0 to (X-1), and y ranges from 0 to (Y-1).
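
The rule for forming Ph(x, y) can be illustrated with the Python sketch below. It assumes the original and the flag array are lists of rows indexed `[y][x]` and `[ly][lx]`, and that the full-resolution pixels are already available in memory; in the apparatus, the unflagged positions are simply never scanned. The function name is hypothetical.

```python
def read_high_resolution(original, fl):
    """Ph(x, y) = original(x, y) where Fl(floor(x/2), floor(y/2)) = 1, otherwise 0."""
    height, width = len(original), len(original[0])
    return [
        [original[y][x] if fl[y // 2][x // 2] == 1 else 0 for x in range(width)]
        for y in range(height)
    ]

original = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
fl = [[1, 0], [0, 0]]          # only the top-left low-resolution pixel is flagged
print(read_high_resolution(original, fl))
```

Each flag covers a 2x2 block of high-resolution pixels, so a single flagged position keeps four pixels of the original.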

[0060] The discrete wavelet transform unit 105 subband-decomposes the pixel data Ph(x, y) of the high-resolution image data input by the high-resolution image input unit 803 in the same manner as in the first embodiment, and generates the coefficients C(S, x, y) of each subband (S is one of LL, LH, HL, and HH).

[0061] When the subband decomposition of the high-resolution image in the discrete wavelet transform unit 105 is complete, the coefficient synthesis unit 804 refers to the resolution flag Fl(lx, ly) generated by the image area determination unit 802 and combines the low-frequency subband C(LL, x, y) generated by the discrete wavelet transform unit 105 with the low-resolution image Pl(lx, ly). Since the low-frequency subband C(LL, x, y) has the same size as the low-resolution image Pl(lx, ly), the synthesis processing of the coefficient synthesis unit 804 is described with the variables for the coefficient position in the subband changed from x, y to lx, ly. The coefficient synthesis unit 804 checks the resolution flag Fl(lx, ly) for every coefficient of C(LL, lx, ly) in raster-scan order and, if Fl(lx, ly) = 0, replaces C(LL, lx, ly) with Pl(lx, ly).
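
The replacement rule of the coefficient synthesis unit 804 can be sketched in Python as follows, under the assumption that the LL subband, the low-resolution image, and the flags are equally sized lists of rows indexed `[ly][lx]`. The function name `synthesize_ll` is illustrative only.

```python
def synthesize_ll(c_ll, pl, fl):
    """Keep C(LL, lx, ly) where Fl(lx, ly) = 1; use Pl(lx, ly) where Fl(lx, ly) = 0."""
    return [
        [c_ll[ly][lx] if fl[ly][lx] == 1 else pl[ly][lx] for lx in range(len(row))]
        for ly, row in enumerate(c_ll)      # raster-scan order: row by row, left to right
    ]

c_ll = [[10, 20], [30, 40]]    # LL coefficients from the high-resolution image
pl = [[1, 2], [3, 4]]          # low-resolution image
fl = [[1, 0], [0, 1]]          # resolution flags
print(synthesize_ll(c_ll, pl, fl))
```

The combined subband thus carries wavelet coefficients inside the flagged areas and plain low-resolution pixels everywhere else.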

[0062] The subsequent encoding process, from the discrete wavelet transform unit 107 through the code output unit 110, is as described in the first embodiment.

[0063] As described above, the image data is captured separately at high resolution and at low resolution, and the low-resolution image is combined with the low-frequency subband obtained by subband decomposition of the high-resolution image and then encoded. This makes it possible to read and encode only the necessary portion at high resolution. In this embodiment, reading the original once at low resolution makes it possible to select and read the areas that require high resolution.

[0064] [Third Embodiment] FIG. 9 shows the functional configuration of the image coding apparatus according to this embodiment. Parts common to FIG. 1, used in the first embodiment, and FIG. 8, used in the second embodiment, are denoted by the same reference numerals, and their description is omitted. The basic configuration of the image coding apparatus in this embodiment is that shown in FIG. 10.

[0065] In FIG. 9, reference numeral 901 denotes a mask generation unit and 902 denotes a coefficient shift unit.

[0066] In this embodiment, as in the first and second embodiments, a black-and-white original is read and encoded as image data in which the luminance value of each pixel is represented by 8 bits. However, the invention is not limited to this and is also applicable when the original is read and encoded as image data whose luminance values are represented by a bit depth other than 8 bits, such as 4, 10, or 12 bits. It is also applicable when each pixel is read as color image data represented by multiple color components such as RGB or CMYK, or by luminance and chromaticity/color-difference components such as YCrCb. In that case, each component of the color image data may be regarded as monochrome image data. In this embodiment, as in the first embodiment, the black-and-white original to be encoded is read at a predetermined resolution and encoded, but only a part of it is read at twice that resolution in both the horizontal and vertical directions. Hereinafter, the predetermined resolution is called the low resolution, and the resolution doubled in both the horizontal and vertical directions is called the high resolution. The size of the black-and-white original is assumed to be fixed, and the image size obtained when the entire original is read at high resolution is denoted by X and Y. Both X and Y are multiples of 4.

[0067] The function and operation of each unit of the image coding apparatus according to this embodiment are described below with reference to FIG. 9.

[0068] The image coding apparatus of this embodiment is the image coding apparatus of the second embodiment with a mask generation unit 901 and a coefficient shift unit 902 added. The functions and operations of the other parts are the same as in the second embodiment, so only the operations of the added mask generation unit 901 and coefficient shift unit 902 are described.

[0069] From the resolution flag Fl(lx, ly) generated by the image area determination unit 802, the mask generation unit 901 generates and outputs mask information M(x, y). The mask information indicates whether each of the coefficients C(LL, x, y), C(LH1, x, y), C(HL1, x, y), and C(HH1, x, y), which are obtained when the discrete wavelet transform unit 107 subband-decomposes the C(LL, x, y) produced by the coefficient synthesis unit 804, relates to the high-resolution image Ph(x, y). The mask information is half the size of the resolution flag information: M(x, y) = 1 if any of Fl(2x, 2y), Fl(2x+1, 2y), Fl(2x, 2y+1), and Fl(2x+1, 2y+1) is 1, and M(x, y) = 0 if all of them are 0.
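
The mask rule above amounts to a logical OR over each 2x2 block of resolution flags. A minimal Python sketch, assuming the flags are a list of rows indexed `[ly][lx]` with even width and height, and using the hypothetical name `make_mask`:

```python
def make_mask(fl):
    """M(x, y) = 1 if any of Fl(2x, 2y), Fl(2x+1, 2y), Fl(2x, 2y+1), Fl(2x+1, 2y+1) is 1."""
    height, width = len(fl) // 2, len(fl[0]) // 2
    return [
        [
            1 if (fl[2 * y][2 * x] or fl[2 * y][2 * x + 1]
                  or fl[2 * y + 1][2 * x] or fl[2 * y + 1][2 * x + 1]) else 0
            for x in range(width)
        ]
        for y in range(height)
    ]

fl = [
    [0, 0, 0, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
print(make_mask(fl))
```

A single flagged position is enough to mark the whole corresponding mask entry, so the mask errs on the side of treating a coefficient as high-resolution.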

[0070] The coefficient shift unit 902 obtains the maximum value Nmax of the numbers of effective bits of the subbands. For the coefficient values C(S, x, y) of the LL, LH1, HL1, and HH1 subbands output from the discrete wavelet transform unit 107, it refers to the mask information M(x, y) generated by the mask generation unit 901 and, if M(x, y) is 1, shifts the subband coefficient C(S, x, y) up by Nmax bits. Consequently, when the coefficients of all subbands are examined, a coefficient whose number of effective bits is Nmax or more can be regarded as a coefficient corresponding to M(x, y) = 1, so the high-resolution area can be identified at an early stage.
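
The shift-up operation and the way a decoder could recognize shifted coefficients can be sketched as below. Two points are assumptions of this sketch rather than statements from the text: the number of effective bits is taken as the binary length of the absolute value (Python's `int.bit_length`), and the subbands are passed as a dictionary of equally sized lists of rows. The names `shift_up` and `is_high_resolution` are hypothetical.

```python
def shift_up(subbands, mask):
    """Shift coefficients with M(x, y) = 1 up by Nmax bits; return (shifted, Nmax)."""
    # Assumption: effective bits of a coefficient = bit length of its absolute value.
    nmax = max(abs(c).bit_length()
               for band in subbands.values() for row in band for c in row)
    shifted = {
        name: [
            [c * (1 << nmax) if mask[y][x] == 1 else c for x, c in enumerate(row)]
            for y, row in enumerate(band)
        ]
        for name, band in subbands.items()
    }
    return shifted, nmax

def is_high_resolution(coeff, nmax):
    """Decoder-side test: a magnitude of at least 2**Nmax can only come from a shifted coefficient."""
    return abs(coeff) >= (1 << nmax)

subbands = {"LL": [[5, -3], [2, 0]], "LH1": [[1, -7], [0, 4]]}
mask = [[0, 1], [0, 0]]
shifted, nmax = shift_up(subbands, mask)
print(nmax, shifted)
```

In this sketch a masked coefficient that is exactly zero stays zero after the shift, so the magnitude test cannot flag it; only nonzero masked coefficients are recognized.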

[0071] The subsequent processing up to the code output unit 110 is as described in the first and second embodiments. In this embodiment, however, when the code string forming unit 109 inserts the additional information into the encoded data, it inserts into the header the number of bits Nmax by which the coefficient shift unit 902 shifted the coefficients up.

[0072] In this embodiment, in addition to the effects described in the second embodiment, the receiving side of the code string can decode the character and line-drawing information at an early stage.

[0073] <Modifications> The present invention is not limited to the embodiments described above. For example, the first to third embodiments showed encoding using the discrete wavelet transform of equations (1) and (2), but the discrete wavelet transform is not limited to the one used in these embodiments, and the type of filter and the way it is applied may be changed. For example, a filter with a larger number of taps, such as a 9/7 filter, may be used instead, and the two-dimensional discrete wavelet transform may also be applied repeatedly to subbands other than the low-frequency subband. In that case, however, the range of influence of the filter must be taken into account in the coefficient synthesis processing.

[0074] Although a bit plane coding method using the QM-Coder has been shown as the coefficient coding method, the invention is not limited to the above embodiments. For example, an arithmetic coding method other than the QM-Coder, such as the MQ-Coder, may be applied, or another binary coding method such as MELCODE may be applied. A bit plane may also be divided into a plurality of sub-bit planes according to the state of the coefficients neighboring the coefficient of interest and encoded in multiple passes. Furthermore, a Golomb code or the like may be applied to entropy-code the coefficients as multi-valued data without decomposing them into binary values.

[0075] To simplify the description, the above embodiments described bit plane coding in units of subbands. However, to improve random accessibility, each subband may be further divided into small blocks, with bit plane coding applied to each small block.
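
One way to picture the small-block variant is to tile a subband into fixed-size blocks and encode each block independently. The text does not specify a block size or a tiling scheme, so the square tiling, the `size` parameter, and the name `split_into_blocks` in this Python sketch are all assumptions.

```python
def split_into_blocks(band, size):
    """Tile a subband (list of rows) into size x size blocks keyed by (block_row, block_col)."""
    blocks = {}
    for top in range(0, len(band), size):
        for left in range(0, len(band[0]), size):
            blocks[(top // size, left // size)] = [
                row[left:left + size] for row in band[top:top + size]
            ]
    return blocks

band = [[y * 4 + x for x in range(4)] for y in range(4)]
for key, block in split_into_blocks(band, 2).items():
    print(key, block)
```

Each block could then be bit-plane encoded on its own, so that a decoder can access one region without decoding the rest of the subband.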

[0076] In forming the code string, the code strings were arranged so that the receiving side can restore the image while gradually increasing the resolution. However, the arrangement is not limited to this, and the code string may instead be formed by arranging the coefficients in descending order of magnitude so that the image quality gradually improves.

[0077] The present invention may be applied as part of a system composed of a plurality of devices (for example, a host computer, an interface device, a reader, and a printer) or as part of an apparatus consisting of a single device (for example, a copying machine, a facsimile machine, or a digital camera).

[0078] The present invention is not limited to the apparatus and method for realizing the above embodiments. The scope of the present invention also includes the case where a software program for realizing the above embodiments is supplied to a computer (CPU or MPU) in the above system or apparatus, and the computer of the system or apparatus realizes the above embodiments by operating the various devices according to that program.

[0079] In this case, the software program itself realizes the functions of the above embodiments, and the program itself and the means for supplying the program to the computer, specifically a storage medium storing the program, are included in the scope of the present invention.

[0080] As the storage medium for storing such a program, for example, a floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM, magnetic tape, nonvolatile memory card, or ROM can be used.

[0081] Such a program is included in the scope of the present invention not only when the computer realizes the functions of the above embodiments by controlling the various devices according to the supplied program alone, but also when the above embodiments are realized by the program in cooperation with the OS (operating system) running on the computer or with other application software.

[0082] Furthermore, the scope of the present invention also includes the case where the supplied program is stored in a memory provided on a function expansion board of the computer or in a function expansion unit connected to the computer, after which a CPU or the like provided on the function expansion board or function expansion unit performs part or all of the actual processing based on the instructions of the program, and the above embodiments are realized by that processing.

[0083]

[Effects of the Invention] As described above, according to the present invention, image encoding can be performed efficiently when encoding image data that includes areas requiring different resolution levels. In addition, encoding can be performed that generates a code string from which the resolution area of interest can be identified at an early stage when the encoded image data is decoded.

[Brief description of drawings]

FIG. 1 is a diagram showing the functional configuration of the image coding apparatus according to the first embodiment of the present invention.

FIG. 2 is a diagram showing an example of an original and high-resolution area information.

FIG. 3 shows the structure of the code string generated by the code string forming unit 109.

FIG. 4 is a diagram showing two-dimensional discrete wavelet transform processing.

FIG. 5 is a diagram showing how the combined LL subband is further decomposed into four subbands by the discrete wavelet transform unit 107.

FIG. 6 is a flowchart showing the flow of processing for encoding subband S in the bit plane encoding unit 108.

FIG. 7 shows an example of the resolution flag F(x, y) when the high-resolution area information is designated as shown in FIG. 2.

FIG. 8 is a diagram showing the functional configuration of the image coding apparatus according to the second embodiment of the present invention.

FIG. 9 is a diagram showing the functional configuration of the image coding apparatus according to the third embodiment of the present invention.

FIG. 10 is a diagram showing the basic configuration of the image coding apparatus according to the first to third embodiments of the present invention.

Claims

[Claims]

1. An image coding apparatus for coding an image, comprising: first reading means for reading a first area of an original image at a first resolution and generating a first image including the read area; second reading means for reading a second area of the original image, different from the first area, at a second resolution higher than the first resolution and generating a second image including the read area; first frequency conversion means for performing frequency conversion on the second image to obtain coefficients for each subband; synthesizing means for combining the first image with a predetermined subband among the plurality of subbands obtained by the first frequency conversion means to generate a combined subband; second frequency conversion means for further performing frequency conversion on the combined subband to obtain coefficients for each subband; and code string generation means for generating a code string from the coefficients of the subbands obtained by the first frequency conversion means and the second frequency conversion means.

2. The image coding apparatus according to claim 1, further comprising flag generation means for generating a flag designating an area to be read by the second reading means.

3. The image coding apparatus according to claim 2, wherein the first reading means refers to the flag, reads an area other than the area read by the second reading means at the first resolution, and generates the first image.

4. The image coding apparatus according to claim 1, wherein the size of the first image is equal to the size of the predetermined subband.

5. The image coding apparatus according to claim 1, wherein the first resolution is half of the second resolution.

6. The image coding apparatus according to claim 2, wherein the synthesizing means uses the flag to identify coefficients included in a region of the predetermined subband corresponding to the first area, and replaces the identified coefficients with the first image.

7. The image coding apparatus according to claim 1, further comprising flag generation means for generating a flag indicating a predetermined area in the first image.

8. The image coding apparatus according to claim 7, wherein the predetermined area is a character or line drawing area.

9. The image coding apparatus according to claim 7, wherein the second reading means refers to the flag, reads the predetermined area of the original image at the second resolution, and generates the second image.

10. The image coding apparatus according to any one of claims 1 to 9, wherein the size of the first image is equal to the size of the predetermined subband.

11. The image coding apparatus according to claim 7, wherein the first resolution is half of the second resolution.

12. The image coding apparatus according to claim 7, wherein the synthesizing means uses the flag to identify coefficients included in a region of the predetermined subband corresponding to the first area, and replaces the identified coefficients with the first image.

13. The image coding apparatus according to claim 7, further comprising mask generation means for generating a mask indicating, among the coefficients constituting the subbands generated when the code string generation means frequency-converts the combined subband, the coefficients relating to the second image.

14. The image coding apparatus according to claim 13, wherein the mask generation means specifies a coefficient associated with the second image using the flag.

15. The image coding apparatus according to claim 1, wherein the code string generation means obtains the maximum value of the number of effective bits included in each subband generated by the code string generation means, and bit-shifts the coefficients relating to the second image by the maximum value.

16. The image coding apparatus according to claim 15, wherein the code string includes the maximum value.

17. The image coding apparatus according to claim 1, wherein the first frequency transforming means and the second frequency transforming means use discrete wavelet transforms.

18. The image coding apparatus according to claim 1, wherein the predetermined subband is an LL subband.

19. An image coding method for coding an image, comprising: a first reading step of reading a first area of an original image at a first resolution and generating a first image including the read area; a second reading step of reading a second area of the original image, different from the first area, at a second resolution higher than the first resolution and generating a second image including the read area; a first frequency conversion step of performing frequency conversion on the second image to obtain coefficients for each subband; a synthesizing step of combining the first image with a predetermined subband among the plurality of subbands obtained in the first frequency conversion step to generate a combined subband; a second frequency conversion step of further performing frequency conversion on the combined subband to obtain coefficients for each subband; and a code string generating step of generating a code string from the coefficients of the subbands obtained in the first frequency conversion step and the second frequency conversion step.

20. The image coding method according to claim 19, further comprising a flag generating step of generating a flag designating an area to be read in the second reading step.

21. The image encoding method according to claim 19, further comprising a flag generating step of generating a flag indicating a predetermined area in the first image.

22. The image coding method according to claim 21, further comprising a mask generation step of generating a mask indicating, among the coefficients constituting the subbands generated when frequency conversion is performed on the combined subband in the code string generating step, the coefficients relating to the second image.

23. A program for executing the image coding method according to any one of claims 19 to 22 on a computer.

24. A computer-readable storage medium that stores the program according to claim 23.