JP2003204439A

JP2003204439A - Image encoding apparatus and image encoding method

Info

Publication number: JP2003204439A
Application number: JP2002001602A
Authority: JP
Inventors: Hiroshi Kajiwara; 浩梶原
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2002-01-08
Filing date: 2002-01-08
Publication date: 2003-07-18

Abstract

(57)【要約】【課題】文字領域や写真領域等の必要な解像度レベル
が異なる領域を含む混在画像を、高画質の画像が復元可
能な状態で効率よく符号化することができる画像符号化
装置及びその制御方法を提供する。【解決手段】画像入力部１０１から入力された画像
は、タイル分割部１０３で複数のタイルに分割される。
画像縮小部１０４は、そのタイルから縮小画像を生成す
る。そして、各タイルについて、領域情報が領域情報入
力部１０２から入力され、領域判定部１０５で領域情報
が判定され、タイルまたは縮小タイルのどちらが、セレ
クタ１０６で選択される。さらに、離散ウェーブレット
変換部１０７で、選択されたタイルの離散ウェーブレッ
ト変換が行われ、所定数のサブバンドに分解される。サ
ブバンドの係数は、変換係数量子化部１０８で量子化さ
れ、ビットプレーン符号化部１０９で符号化され、符号
列形成部１１０でビット列となる。 (57) [Summary] [Image] An image encoding method capable of efficiently encoding a mixed image including areas having different required resolution levels such as a character area and a photograph area in a state where a high-quality image can be restored. An apparatus and a control method thereof are provided. An image input from an image input unit is divided into a plurality of tiles by a tile division unit.
The image reducing unit 104 generates a reduced image from the tile. Then, for each tile, the area information is input from the area information input unit 102, the area information is determined by the area determination unit 105, and the tile or the reduced tile is selected by the selector 106. Further, the discrete wavelet transform unit 107 performs a discrete wavelet transform on the selected tile and decomposes the tile into a predetermined number of subbands. The coefficients of the subbands are quantized by the transform coefficient quantization unit 108, coded by the bit plane coding unit 109, and converted into a bit sequence by the code sequence forming unit 110.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、文字と写真が混在
する画像データを符号化する画像符号化装置及びその制
御方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image coding apparatus for coding image data in which characters and photographs are mixed and a control method thereof.

【０００２】[0002]

【従来の技術】近年、ディジタルカメラやスキャナとい
った画像入力装置に関する技術の向上に伴って、画像入
力装置によって取り込まれる画像の解像度は、増加の一
途を辿っている。従来の画像入力装置を用いて取り込ま
れる画像は、低解像度であって画像のデータ量が少なか
ったので、画像データの伝送や蓄積等の各種画像処理に
おいて、大きな問題を起こすことはなかった。2. Description of the Related Art In recent years, the resolution of an image captured by an image input device has been increasing with the improvement of the technology relating to the image input device such as a digital camera and a scanner. An image captured using a conventional image input device has a low resolution and a small amount of image data, so that it does not cause a big problem in various image processing such as transmission and storage of image data.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、画像入
力装置で取り込まれる画像の解像度が高解像度になるに
つれ、画像のデータ量も膨大なものになってきている。
そのため、画像を伝送する際に多くの時間がかかった
り、画像の蓄積に多くの記憶容量が必要とされるという
問題が生じてきている。However, as the resolution of the image captured by the image input device becomes higher, the data amount of the image also becomes enormous.
For this reason, there are problems that it takes a lot of time to transmit an image and a large storage capacity is required to store the image.

【０００４】この問題を解決するために、通常、画像の
伝送や蓄積に際しては、高能率の符号化を用いることに
よって、画像の冗長性を除いたり、視覚的に許容できる
範囲で画像を加工してデータ量を削減することが行われ
ている。尚、符号化された画像データの復号によって元
の画像を完全に再現することができる符号化方式を可逆
符号化と呼び、視覚的に近い画像を得るものの完全には
元の画像を再現することができない符号化方式を非可逆
符号化と呼ぶ。In order to solve this problem, usually, at the time of image transmission and storage, high efficiency coding is used to remove image redundancy or to process the image within a visually permissible range. The amount of data is being reduced. An encoding method that can completely reproduce the original image by decoding the encoded image data is called lossless encoding. Although a visually similar image is obtained, the original image is completely reproduced. A coding method that cannot do this is called lossy coding.

【０００５】非可逆符号化においては、視覚的に劣化が
目立たない部分を変化させて符号量の削減を図ることが
重要であるが、これは画像の特性に大きく依存してい
る。画像にも様々な種類があって、人物・風景等を銀塩
写真で撮影した画像をスキャナで読み取って生成した画
像、あるいはディジタルカメラで人物・風景等を直接撮
影して生成される自然画像、紙等の媒体上の文字・線情
報をラスタ走査して入力することにより生成する文字・
線画像、コンピュータグラフィック（ＣＧ）によって描
画された画像や構築した３次元モデルをレンダリングす
ることによって生成されるＣＧ画像等がその一例として
挙げられる。In lossy encoding, it is important to reduce the code amount by changing a portion where deterioration is not visually noticeable, but this largely depends on the characteristics of an image. There are various types of images, and images generated by scanning a silver salt photograph of a person, landscape, etc. with a scanner, or a natural image generated by directly shooting a person, landscape, etc. with a digital camera, Characters generated by raster scanning and inputting character / line information on media such as paper
Examples thereof include line images, images drawn by computer graphics (CG), and CG images generated by rendering a constructed three-dimensional model.

【０００６】上述した様々な種類の画像データにおい
て、良好な画質を得るために必要な解像度や階調数は通
常異なっている。一般に、文字・線画像については、自
然画像に比べて高い解像度が必要である。In the various types of image data described above, the resolution and the number of gradations required to obtain good image quality are usually different. Generally, a character / line image requires higher resolution than a natural image.

【０００７】そこで、高能率な符号化の一手法として、
従来からウェーブレット変換を利用する方法が用いられ
ている。この方式では、まず、離散ウェーブレット変換
を用いて符号化対象画像を複数の周波数帯域（サブバン
ド）に分割する。次に、各サブバンドの変換係数をさま
ざまな方法で量子化、エントロピー符号化して、符号列
を生成する。図７を用いて、ウェーブレット変換の処理
過程の一例について示す。ウェーブレット変換の方法と
しては、図７（ａ）に示される原画像に対して、図７
（ｂ）に示されるように、１次元の変換処理が垂直方向
に適用され、次いで、図７（ｃ）に示されるように水平
方向に適用されて、４つのサブバンドに分割する方法が
用いられる。さらに、図７（ｃ）に示される低周波サブ
バンドＬＬのみに対して、４つのサブバンドへの分割を
繰り返す方法が一般的である。Therefore, as a highly efficient encoding method,
Conventionally, the method of utilizing the wavelet transform has been used. In this method, first, an image to be encoded is divided into a plurality of frequency bands (subbands) by using the discrete wavelet transform. Next, the transform coefficient of each subband is quantized and entropy-coded by various methods to generate a code string. An example of the process of wavelet transform will be described with reference to FIG. 7. As a method of the wavelet transform, as shown in FIG. 7 for the original image shown in FIG.
As shown in (b), a one-dimensional conversion process is applied in the vertical direction, and then in the horizontal direction as shown in FIG. 7 (c) to divide into four subbands. To be Further, it is general to repeat division into four subbands only for the low frequency subband LL shown in FIG. 7C.

【０００８】図８は、２次元のウェーブレット変換を２
回繰り返して行った場合の、サブバンドの分割例を示す
図である。ウェーブレット変換を用いた画像符号化の利
点の一つとして、空間解像度の段階的復号の実現が容易
であるということが挙げられる。図８に示されるように
２次元のウェーブレット変換を２回施して７つのサブバ
ンドに分解し、低周波サブバンドＬＬから高周波サブバ
ンドＨＨ２へと、順に各サブバンドの係数を符号化して
伝送した場合、復号側では低周波サブバンドＬＬの変換
係数を受信した段階で水平・垂直１／４の解像度の復元
画像を復号することができる。また、サブバンドＬＬ、
ＬＨ１、ＨＬ１、ＨＨ１を受信した段階で、水平・垂直
１／２の解像度の復元画像を復号することができる。さ
らに、サブバンドＬＨ２、ＨＬ２、ＨＨ２までを受信し
た場合には、元の解像度の復元画像を復号することがで
きる。このように、復号するための解像度を選択して復
号画像を生成することができる。FIG. 8 illustrates a two-dimensional wavelet transform.
It is a figure which shows the example of division | segmentation of a subband at the time of repeating and repeating. One of the advantages of image coding using wavelet transform is that it is easy to realize stepwise decoding of spatial resolution. As shown in FIG. 8, the two-dimensional wavelet transform is performed twice to decompose into seven subbands, and the coefficients of each subband are sequentially encoded and transmitted from the low frequency subband LL to the high frequency subband HH2. In this case, the decoding side can decode the restored image with the horizontal / vertical 1/4 resolution at the stage of receiving the transform coefficient of the low-frequency subband LL. Also, sub-band LL,
When LH1, HL1, and HH1 are received, the restored image with the horizontal / vertical 1/2 resolution can be decoded. Furthermore, when subbands LH2, HL2, and HH2 are received, the restored image of the original resolution can be decoded. In this way, the decoded image can be generated by selecting the resolution for decoding.

【０００９】また、文字・写真の混在画像に見られるよ
うに、画像中に必要とする解像度の異なる部分が混在す
る場合の一符号化方式として、画像をタイルと呼ばれる
独立に符号化される矩形領域に分割し、各タイル毎に必
要解像度を考慮して、高解像度を必要としないタイルに
ついては高周波サブバンドに相当する符号化データを破
棄する、あるいは高周波サブバンドの係数を符号化しな
いことにより、タイル毎に解像度を変えるという方法が
ある。例えば、各タイルを図８に示されるように７つの
サブバンドに分割する場合、文字・線画像領域に属する
タイルについては、高解像度が必要とされるため、Ｌ
Ｌ、ＬＨ１、ＨＬ１、ＨＨ１、ＬＨ２、ＨＬ２、ＨＨ２
の７つのサブバンドを符号化し、自然画像領域に属する
タイルについては、低解像度で十分なため、ＬＬ、ＬＨ
１、ＨＬ１、ＨＨ１の４つのサブバンドのみを符号化す
る。Further, as seen in a mixed image of characters and photographs, as a coding method in the case where parts having different required resolutions are mixed in the image, the image is called a tile and is a rectangular coded independently. By dividing into regions and considering the required resolution for each tile, for tiles that do not require high resolution, the coded data corresponding to the high frequency subband is discarded, or the coefficient of the high frequency subband is not encoded. There is a method to change the resolution for each tile. For example, when each tile is divided into 7 subbands as shown in FIG. 8, high resolution is required for tiles belonging to the character / line image area, and therefore L
L, LH1, HL1, HH1, LH2, HL2, HH2
For the tiles belonging to the natural image area, low resolution is sufficient, so LL and LH
Only four subbands of 1, HL1 and HH1 are encoded.

【００１０】しかしながら、上述した従来の高能率な符
号化方法では、自然画像、文字・線画像の混在画像等に
おいて、必要とする解像度の異なる領域が含まれる画像
データを符号化する場合、高解像度を必要としない領
域、例えば自然画像領域などについては復号側で得られ
る復元画像は、ウェーブレット変換により得られた低周
波成分であるため、エッジの鈍りや細線の消滅などが発
生し、必ずしも良い再生画像にはならなかった。However, in the above-mentioned conventional high-efficiency encoding method, in the case of encoding image data including regions having different required resolutions in a natural image, a mixed image of characters and line images, etc., a high resolution is required. For a region that does not require, for example, a natural image region, the restored image obtained on the decoding side is a low-frequency component obtained by the wavelet transform, so blunt edges and disappearance of thin lines occur, and it is not always good to reproduce. It didn't look like an image.

【００１１】本発明は、このような事情を考慮してなさ
れたものであり、文字領域や写真領域等の必要な解像度
レベルが異なる領域を含む混在画像を、高画質の画像が
復元可能な状態で効率よく符号化することができる画像
符号化装置及びその制御方法を提供することを目的とす
る。The present invention has been made in consideration of such circumstances, and a high-quality image can be restored from a mixed image including regions having different required resolution levels such as a character region and a photo region. It is an object of the present invention to provide an image coding apparatus and a control method thereof that can be coded efficiently in.

【００１２】[0012]

【課題を解決するための手段】上記課題を解決するた
め、本発明は、画像を所定の領域に分割して小画像を生
成する分割手段と、分割された各領域についての領域情
報を入力する入力手段と、分割された各領域について、
所定縮小率によって縮小画像を生成する生成手段と、分
割された各領域の領域情報に基づいて、各領域ごとに小
画像または縮小画像を周波数変換し、所定数のサブバン
ドに分解する分解手段と、分解された各サブバンドの変
換係数を符号化する符号化手段とを備えることを特徴と
する。In order to solve the above-mentioned problems, the present invention inputs a dividing means for dividing an image into a predetermined area to generate a small image, and area information for each divided area. For the input means and each divided area,
Generating means for generating a reduced image at a predetermined reduction ratio, and decomposing means for frequency-converting the small image or reduced image for each region based on the region information of each divided region, and decomposing into a predetermined number of subbands. , Encoding means for encoding the decomposed transform coefficients of each sub-band.

【００１３】[0013]

【発明の実施の形態】以下、図面を参照して、本発明の
実施形態について説明する。本実施形態では、符号化さ
れる画像として、文字領域及び写真領域が混在する白黒
原稿をスキャナ等で読み取って８ビットの輝度値で表現
した画像データを用いる。尚、写真領域とは、濃淡階調
で表現された文字領域以外の全領域を含んでもよい。ま
た、本発明において符号化可能な画像データは８ビット
の画像だけに限らず、４ビット、１０ビット、１２ビッ
ト等のビット数で輝度値を表現する画像データであって
も適用可能である。また、各画素をＲＧＢ、ＣＭＹＫ等
の複数の色成分、あるいはＹＣｂＣｒ等の輝度と色度／
色差成分で表現されるカラー画像データに対しても適用
可能である。このようなカラー画像データの場合、カラ
ー画像データ中の各成分データの一つ一つを白黒の画像
データであるとみなせばよい。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings. In the present embodiment, as an image to be encoded, image data obtained by reading a black-and-white document in which a character region and a photo region are mixed with a scanner or the like and expressing it with an 8-bit luminance value is used. The photographic area may include the entire area other than the character area expressed in gray scale. In addition, the image data that can be encoded in the present invention is not limited to an 8-bit image, and image data that expresses a luminance value with a bit number of 4 bits, 10 bits, 12 bits, or the like can be applied. In addition, each pixel has a plurality of color components such as RGB and CMYK, or luminance and chromaticity such as YCbCr.
It is also applicable to color image data represented by color difference components. In the case of such color image data, each of the component data in the color image data may be regarded as black and white image data.

【００１４】＜第１の実施形態＞図１は、本発明の第１
の実施形態による画像を符号化するための画像符号化装
置の構成を示すブロック図である。図１において、画像
入力部１０１は、本実施形態による画像符号化装置によ
って符号化される画像データを入力するための装置であ
る。<First Embodiment> FIG. 1 shows a first embodiment of the present invention.
FIG. 3 is a block diagram showing a configuration of an image encoding device for encoding an image according to the embodiment of FIG. In FIG. 1, an image input unit 101 is a device for inputting image data encoded by the image encoding device according to the present embodiment.

【００１５】ここで、本実施形態において入力される画
像データは高解像度画像であって、水平方向の画像サイ
ズがＸ画素であり、垂直方向の画像サイズがＹ画素であ
るとする。そして、入力された画像データの水平方向の
ｘ番目、垂直方向のｙ番目の画素データをＰ（ｘ，ｙ）
と表す。ここで、０≦ｘ＜Ｘ、０≦ｙ＜Ｙである。例え
ば、この画像入力部１０１を実現する装置として、スキ
ャナ、デジタルカメラ等の撮像部分がある。そして、こ
の撮像部分は、ＣＣＤ等の撮像デバイス及びガンマ補
正、シェーディング補正等の各種画像調整回路を含むも
のとなる。Here, it is assumed that the image data input in the present embodiment is a high-resolution image, the image size in the horizontal direction is X pixels, and the image size in the vertical direction is Y pixels. Then, the x-th pixel data in the horizontal direction and the y-th pixel data in the vertical direction of the input image data are set to P (x, y).
Express. Here, 0 ≦ x <X and 0 ≦ y <Y. For example, as a device that realizes the image input unit 101, there is an imaging part such as a scanner or a digital camera. The image pickup portion includes an image pickup device such as a CCD and various image adjustment circuits such as gamma correction and shading correction.

【００１６】次に、タイル分割部１０３は、画像入力部
１０１に接続され、画像入力部１０１から入力される画
像データを、適宜、不図示のバッファに格納し、入力さ
れた画像データを所定の幅ＴＷと高さＴＨのタイルに分
割する。ここで、画像データの水平方向及び垂直方向の
画像サイズのＸ、Ｙは、タイルの水平方向及び垂直方向
の画像サイズＴＷ、ＴＨの所定倍数であるとする。Next, the tile division unit 103 is connected to the image input unit 101, stores the image data input from the image input unit 101 in a buffer (not shown) as appropriate, and stores the input image data in a predetermined manner. Divide into tiles of width TW and height TH. Here, it is assumed that the horizontal and vertical image sizes X and Y of the image data are predetermined multiples of the horizontal and vertical image sizes TW and TH of the tile.

【００１７】図２は、複数のタイルに分割される画像デ
ータを説明するための図である。図２に示すように、画
像データの原点となる左上隅の画素を含むタイルをタイ
ルＴ０とし、タイルＴ０に隣接するタイルに対して、ラ
スタ走査順にＴ１、Ｔ２、・・・、Ｔｎという番号を与え
る。ここで、各タイル内の水平方向のｘ番目、垂直方向
のｙ番目の画素値を、Ｔｉ（ｘ，ｙ）と表す。但し、ｉ
＝０，・・・，ｎ、０≦ｘ＜ＴＷ、０≦ｙ＜ＴＨである。
また、Ｔｉ（０，０）は、タイルＴｉの左上隅の画素値
を表す。FIG. 2 is a diagram for explaining image data divided into a plurality of tiles. As shown in FIG. 2, the tile including the pixel at the upper left corner that is the origin of the image data is tile T0, and tiles adjacent to tile T0 are numbered T1, T2, ..., Tn in raster scan order. give. Here, the x-th pixel value in the horizontal direction and the y-th pixel value in the vertical direction in each tile are represented as Ti (x, y). However, i
= 0, ..., N, 0 ≦ x <TW, and 0 ≦ y <TH.
Ti (0,0) represents the pixel value at the upper left corner of the tile Ti.

【００１８】タイル分割部１０３は、さらに画像縮小部
１０４とセレクタ１０６に接続しており、分割された各
タイルの画像データをそれぞれの装置に対して、タイル
Ｔ０からＴｎのラスタ走査順に出力する。The tile division unit 103 is further connected to the image reduction unit 104 and the selector 106, and outputs the divided image data of each tile to each device in the raster scanning order of tiles T0 to Tn.

【００１９】一方、領域情報入力部１０２は、画像入力
部１０１から入力される画像データについて、高解像度
が必要とされる領域の範囲を示す高解像度符号化領域情
報が入力される。図３は、本実施形態による符号化対象
の画像データと高解像度符号化領域を説明するための概
要図である。図３に示すように、本実施形態において高
解像度を必要とする領域は、文字領域であるとする。そ
して、画像データ中の文字領域を包含する矩形領域の、
高解像度での画像サイズを基準としたときの左上隅の画
素位置（ＵＬ_x，ＵＬ_y）と、右下隅の画素位置（Ｌ
Ｒ_x，ＬＲ_y）の２つの座標情報が高解像度符号化領域情
報として領域情報入力部１０２から入力される。On the other hand, the area information input section 102 receives high resolution coded area information indicating a range of an area for which high resolution is required for the image data input from the image input section 101. FIG. 3 is a schematic diagram for explaining the image data to be encoded and the high resolution encoded area according to the present embodiment. As shown in FIG. 3, it is assumed that the region requiring high resolution in this embodiment is a character region. Then, in a rectangular area including the character area in the image data,
Pixel position (UL _x , UL _y ) at the upper left corner and pixel position (L at the lower right corner) based on the image size at high resolution
Two pieces of coordinate information (R _x , LR _y ) are input from the area information input unit 102 as high resolution encoded area information.

【００２０】領域判定部１０５は、領域情報入力部１０
２に接続され、領域情報入力部１０１から入力される画
像データにおける高解像度符号化領域の左上隅の画素位
置（ＵＬ_x，ＵＬ_y）と右下隅の画素位置（ＬＲ_x，Ｌ
Ｒ_y）とを用いて、タイル分割部１０３で生成される各
タイルＴｉが高解像度符号化領域を含むタイルである
か、高解像度符号化領域を含まないタイルであるかを判
定し、タイル毎に判定結果を出力する。この出力とし
て、タイルＴｉに対する判定結果をＦ（Ｔｉ）とし、タ
イルＴｉが高解像度符号化領域を含む場合にはＦ（Ｔ
ｉ）＝１、含まない場合にはＦ（Ｔｉ）＝０とする。図
４は、上述した方法によって画像データのタイルに対し
て２つの判定結果を出力する場合の一例を示す図であ
る。The area determination unit 105 includes an area information input unit 10
2 and the pixel position (UL _x , UL _y ) in the upper left corner and the pixel position (LR _x , L) in the lower right corner of the high resolution encoded area in the image data input from the area information input unit 101.
R _y ), it is determined whether each tile Ti generated by the tile dividing unit 103 is a tile including a high-resolution encoded area or a tile not including a high-resolution encoded area, and each tile Ti The judgment result is output to. As this output, the determination result for the tile Ti is F (Ti), and if the tile Ti includes a high-resolution coded area, then F (T
i) = 1, and if not included, F (Ti) = 0. FIG. 4 is a diagram showing an example of a case where two determination results are output for a tile of image data by the method described above.

【００２１】画像縮小部１０４は、タイル分割部１０３
及びセレクタ１０６とに接続されており、タイル分割部
１０３から出力される各タイルについて、水平方向・垂
直方向共１／２の大きさに縮小された縮小タイルＳＴｉ
を生成する。The image reduction unit 104 is a tile division unit 103.
The reduced tile STi connected to the selector 106 and each tile output from the tile division unit 103 is reduced in size to 1/2 in both the horizontal and vertical directions.
To generate.

【００２２】この画像縮小部１０４における画像の縮小
処理は、縮小画像の画質を重視した縮小処理である。後
述する離散ウェーブレット変換部１０７におけるＬＬサ
ブバンドよりも高画質な縮小画像が得られる方法であれ
ばどのような方法を用いても良く、この具体的な方法と
しては、公知であり、これまでにも多くの方法が提案さ
れているのでここでは説明を省略する。尚、縮小タイル
ＳＴｉ内の水平方向のｘ番目、垂直方向のｙ番目の画素
値を、ＳＴｉ（ｘ，ｙ）と表す。但し、ｉ＝０，・・・，
ｎ、０≦ｘ＜ＴＷ／２、０≦ｙ＜ＴＨ／２であって、Ｓ
Ｔｉ（０，０）は、縮小タイルＳＴｉの左上隅の画素値
を表す。The image reduction process in the image reduction unit 104 is a reduction process that emphasizes the image quality of the reduced image. Any method may be used as long as it can obtain a reduced image having a higher image quality than the LL subband in the discrete wavelet transform unit 107, which will be described later. However, since many methods have been proposed, description thereof will be omitted here. The x-th pixel value in the horizontal direction and the y-th pixel value in the vertical direction in the reduced tile STi are represented as STi (x, y). However, i = 0, ...,
n, 0 ≦ x <TW / 2, 0 ≦ y <TH / 2, and S
Ti (0,0) represents the pixel value at the upper left corner of the reduced tile STi.

【００２３】セレクタ１０６は、さらに領域判定部１０
５に接続している。セレクタ１０６では、着目するタイ
ルＴｉについて、領域判定部１０５からの領域判定情報
Ｆ（Ｔｉ）に基づいて、画像縮小部１０４から出力され
る縮小タイルＳＴｉの画像データ、またはタイル分割部
１０３から出力されるタイルＴｉの画像データのいずれ
かが選択されて出力される。例えば、タイルＴｉに関す
る領域判定情報Ｆ（Ｔｉ）が０である場合、すなわちタ
イルＴｉが文字領域である高解像度符号化領域を含まな
い写真領域である場合、縮小タイルＳＴｉの画像データ
が選択されて出力される。一方、タイルＴｉに関する領
域判定情報Ｆ（Ｔｉ）が１である場合、すなわちタイル
Ｔｉが文字領域である高解像度符号化領域を含む場合、
高解像度のタイルＴｉの画像データが選択されて出力さ
れる。The selector 106 further includes the area determination unit 10
Connected to 5. In the selector 106, for the tile Ti of interest, based on the area determination information F (Ti) from the area determination unit 105, the image data of the reduced tile STi output from the image reduction unit 104 or the tile division unit 103 outputs. One of the image data of the tiles Ti to be selected is output. For example, when the area determination information F (Ti) regarding the tile Ti is 0, that is, when the tile Ti is a photo area that does not include the high-resolution encoded area that is a character area, the image data of the reduced tile STi is selected. Is output. On the other hand, when the area determination information F (Ti) regarding the tile Ti is 1, that is, when the tile Ti includes a high-resolution encoded area that is a character area,
The image data of the high resolution tile Ti is selected and output.

【００２４】また、図１において離散ウェーブレット変
換部１０７は、セレクタ１０６と領域判定部１０５に接
続されている。そして、離散ウェーブレット変換部１０
７は、セレクタ１０６から入力されるタイルＴｉまたは
縮小タイルＳＴｉの画像データを、不図示の内部バッフ
ァに適宜格納しながら２次元の離散ウェーブレット変換
を施し、当該画像を複数のサブバンドに分解して各サブ
バンドの係数を出力する。Further, in FIG. 1, the discrete wavelet transform section 107 is connected to the selector 106 and the area determination section 105. Then, the discrete wavelet transform unit 10
Reference numeral 7 performs two-dimensional discrete wavelet transform while appropriately storing the image data of the tile Ti or the reduced tile STi input from the selector 106 in an internal buffer (not shown), and decomposes the image into a plurality of subbands. Output the coefficient of each subband.

【００２５】また、離散ウェーブレット変換部１０７で
行われる２次元離散ウェーブレット変換の適用回数は、
領域判定部１０５から入力される領域判定情報Ｆ（Ｔ
ｉ）によって異なるようにする。すなわち、Ｆ（Ｔｉ）
が０である場合には、１回の２次元離散ウェーブレット
変換をタイルＳＴｉに行う。図５は、タイルＳＴｉを４
つのサブバンドに分解する場合を説明するための概要図
である。図５に示すように、タイルＳＴｉは、ＬＬ、Ｈ
Ｌ１、ＬＨ１、ＨＨ１の４つのサブバンドに分解され
る。また、図６は、タイルＴｉを７つのサブバンドに分
解する場合を説明するための概要図である。図６に示す
ように、Ｆ（Ｔｉ）が１である場合には、２回の２次元
離散ウェーブレット変換により、ＬＬ、ＨＬ１、ＬＨ
１、ＨＨ１、ＨＬ２、ＬＨ２、ＨＨ２の７つのサブバン
ドに分解される。The number of applications of the two-dimensional discrete wavelet transform performed by the discrete wavelet transform unit 107 is
Area determination information F (T
It depends on i). That is, F (Ti)
Is 0, one-dimensional two-dimensional discrete wavelet transform is performed on the tile STi. Figure 5 shows tile STi 4
It is a schematic diagram for explaining the case where it decomposes | disassembles into one subband. As shown in FIG. 5, the tile STi is LL, H
It is decomposed into four subbands L1, LH1, and HH1. Further, FIG. 6 is a schematic diagram for explaining a case where the tile Ti is decomposed into seven subbands. As shown in FIG. 6, when F (Ti) is 1, LL, HL1, and LH are obtained by performing the two-dimensional discrete wavelet transform twice.
It is decomposed into seven subbands of 1, HH1, HL2, LH2, and HH2.

【００２６】以降、各サブバンドの係数をＣ（Ｓ，ｘ，
ｙ）と表す。但し、Ｓはサブバンドを表し、ＬＬ、ＨＬ
１、ＬＨ１、ＨＨ１、ＨＬ２、ＬＨ２、ＨＨ２のいずれ
かである。また、（ｘ，ｙ）は、各サブバンド内の左上
隅の係数位置を（０，０）としたときの、水平方向およ
び垂直方向の係数位置を表す。Thereafter, the coefficient of each subband is C (S, x,
y). However, S represents a subband, and LL and HL
1, LH1, HH1, HL2, LH2, or HH2. Further, (x, y) represents the coefficient position in the horizontal direction and the vertical direction when the coefficient position at the upper left corner in each subband is (0, 0).

【００２７】ここで、本実施形態で使用する２次元離散
ウェーブレット変換について説明する。２次元離散ウェ
ーブレット変換は、１次元の変換（フィルタ処理）を水
平・垂直方向それぞれに適用することによって実現する
ものである。図７は、符号化対象画像に対して２次元の
離散ウェーブレット変換の適用例を説明するための概要
図である。図７（ａ）は符号化対象画像を示しており、
この画像に対して垂直方向に１次元の離散ウェーブレッ
ト変換を適用し、図７（ｂ）に示すように低周波サブバ
ンドＬと高周波サブバンドＨに分解する。次に、それぞ
れのサブバンドに対して水平方向に１次元離散ウェーブ
レット変換を適用することにより、図７（ｃ）に示すよ
うにＬＬ、ＨＬ，ＬＨ，ＨＨの４つのサブバンドに分解
する。The two-dimensional discrete wavelet transform used in this embodiment will be described here. The two-dimensional discrete wavelet transform is realized by applying a one-dimensional transform (filter process) to each of the horizontal and vertical directions. FIG. 7 is a schematic diagram for explaining an application example of the two-dimensional discrete wavelet transform for the image to be encoded. FIG. 7A shows an image to be encoded,
A one-dimensional discrete wavelet transform is applied to this image in the vertical direction and decomposed into a low frequency subband L and a high frequency subband H as shown in FIG. 7B. Next, a one-dimensional discrete wavelet transform is applied to each subband in the horizontal direction to decompose into four subbands LL, HL, LH, and HH as shown in FIG. 7C.

【００２８】本実施形態による画像符号化装置では、Ｎ
個の１次元信号ｘ（ｎ），(ｎ＝０，・・・，Ｎ−１）に対
する１次元離散ウェーブレット変換は、次式を用いて行
われるものとする。In the image coding apparatus according to this embodiment, N
The one-dimensional discrete wavelet transform for each of the one-dimensional signals x (n), (n = 0, ..., N-1) is performed using the following equation.

【００２９】 h(n)=x(2n+1)-(x(2n)+x(2n+2))/2 （１） l(n)=x(2n)+(h(n-1)+h(n))/4 （２）ここで、ｈ（ｎ）は高周波サブバンドの変換係数、ｌ
（ｎ）は低周波サブバンドの変換係数を表す。尚、上式
の計算において必要となる１次元信号ｘ（ｎ）の両端ｘ
（ｎ），（ｎ＜０及びｎ≧Ｎ）は、公知の手法を用いて
１次元信号ｘ（ｎ），（０≦ｎ＜Ｎ）の値から求める。H (n) = x (2n + 1)-(x (2n) + x (2n + 2)) / 2 (1) l (n) = x (2n) + (h (n-1) + h (n)) / 4 (2) where h (n) is the conversion coefficient of the high frequency subband, l
(N) represents the conversion coefficient of the low frequency subband. It should be noted that both ends x of the one-dimensional signal x (n) required in the calculation of the above equation
(N) and (n <0 and n ≧ N) are obtained from the values of the one-dimensional signal x (n) and (0 ≦ n <N) using a known method.

【００３０】一方、サブバンドＴｉについては、上述し
た２次元離散ウェーブレット変換によって得られたサブ
バンドＬＬに対して、さらに繰り返して２次元離散ウェ
ーブレット変換を適用することにより、図８に示すよう
にＬＬ、ＨＬ１、ＬＨ１、ＨＨ１、ＨＬ２、ＬＨ２、Ｈ
Ｈ２の７つのサブバンドに分解する。図８は、サブバン
ドＴｉに対する２次元離散ウェーブレット変換を説明す
るための図である。尚、図８におけるサブバンドＬＬ
は、図７（ｃ）のサブバンドＬＬを再分解したものであ
り、両者は同一のものではない。On the other hand, for the sub-band Ti, the two-dimensional discrete wavelet transform is repeatedly applied to the sub-band LL obtained by the above-mentioned two-dimensional discrete wavelet transform, and as shown in FIG. , HL1, LH1, HH1, HL2, LH2, H
It decomposes into 7 subbands of H2. FIG. 8 is a diagram for explaining the two-dimensional discrete wavelet transform for the subband Ti. The sub-band LL in FIG.
7 is a re-decomposition of the subband LL of FIG. 7C, and the two are not the same.

【００３１】変換係数量子化部１０８は、離散ウェーブ
レット変換部１０７に接続され、生成された各サブバン
ドの係数Ｃ（Ｓ，ｘ，ｙ）を、各サブバンド毎に定めた
量子化ステップｄｅｌｔａ（Ｓ）を用いて量子化する。
量子化された係数値をＱ（Ｓ，ｘ，ｙ）と表すと、変換
係数量子化部１０８で行われる量子化処理は、次式によ
って表すことができる。The transform coefficient quantization unit 108 is connected to the discrete wavelet transform unit 107, and the generated quantization coefficient C (S, x, y) of each subband is determined for each subband by a quantization step delta ( Quantize using S).
When the quantized coefficient value is represented by Q (S, x, y), the quantization process performed by the transform coefficient quantization unit 108 can be represented by the following equation.

【００３２】 Q(S,x,y)=sign{C(S,x,y)}×floor{|C(S,x,y)|/delta(S)} （３）ここで、ｓｉｇｎ｛Ｉ｝は、整数Ｉの正負符号を表す関
数であって、Ｉが正の場合は１、負の場合は−１を返
す。また、ｆｌｏｏｒ｛Ｒ｝は、実数Ｒを超えない最大
の整数値を表すものとする。Q (S, x, y) = sign {C (S, x, y)} × floor {| C (S, x, y) | / delta (S)} (3) where sign { I} is a function that represents the sign of the integer I, and returns 1 when I is positive and -1 when I is negative. Also, floor {R} represents the maximum integer value that does not exceed the real number R.

【００３３】一方、ビットプレーン符号化部１０９は、
変換係数量子化部１０８に接続されており、離散ウェー
ブレット変換部１０７と変換係数量子化部１０８を経て
生成されたタイルＳＴｉについてのＬＬ、ＨＬ１、ＬＨ
１、ＨＨ１の４つのサブバンド、またはタイルＴｉにつ
いてのＬＬ、ＨＬ１、ＬＨ１、ＨＨ１、ＨＬ２、ＬＨ
２、ＨＨ２の７つのサブバンドの量子化された係数値Ｑ
（Ｓ，ｘ，ｙ）を符号化して符号列を生成する装置であ
る。On the other hand, the bit plane coding unit 109
The tiles LL, HL1, and LH are connected to the transform coefficient quantization unit 108, and the tiles STi generated through the discrete wavelet transform unit 107 and the transform coefficient quantization unit 108.
1, HL, HL1, LH1, HH1, HL2, LH for 4 subbands of HH1 or tile Ti
2, quantized coefficient values Q of 7 subbands of HH2
It is a device that encodes (S, x, y) to generate a code string.

【００３４】ビットプレーン符号化部１０９における符
号化方法としては、各サブバンドの係数をブロック分割
し、別々に符号化することによりランダムアクセス性を
向上させる方法等が知られているが、本実施形態では説
明を簡単にするためにサブバンド単位に符号化する。す
なわち、各サブバンドの量子化された係数値Ｑ（Ｓ，
ｘ，ｙ）の符号化は、サブバンド内の量子化された係数
値Ｑ（Ｓ，ｘ，ｙ）の絶対値を自然２進数で表現し、上
位の桁から下位の桁へとビットプレーン方向を優先して
２値算術符号化することによって行われる。本実施形態
では、各サブバンドの量子化された係数値Ｑ（Ｓ，ｘ，
ｙ）を自然２進表記した場合の下からｎ桁目のビット
を、Ｃｎ（Ｓ，ｘ，ｙ）と表記して説明する。また、２
進数の桁を表す変数ｎをビットプレーン番号と呼び、ビ
ットプレーン番号ｎはＬＳＢ（最下位ビット）を０桁目
とする。As a coding method in the bit plane coding unit 109, there is known a method of improving the random accessibility by dividing the coefficients of each subband into blocks and coding them separately. In the form, coding is performed in subband units for the sake of simplicity. That is, the quantized coefficient values Q (S,
(x, y) is encoded by expressing the absolute value of the quantized coefficient value Q (S, x, y) in the subband in a natural binary number, and from the upper digit to the lower digit in the bit plane direction. Is prioritized and binary arithmetic coding is performed. In the present embodiment, the quantized coefficient values Q (S, x,
The description will be made by describing the bit of the nth digit from the bottom when y) is expressed in natural binary as Cn (S, x, y). Also, 2
A variable n representing a digit of a decimal number is called a bit plane number, and the bit plane number n has the LSB (least significant bit) as the 0th digit.

【００３５】図９は、サブバンドＳを符号化するビット
プレーン符号化部１０９の動作手順を説明するためのフ
ローチャートである。まず、符号化対象となるサブバン
ドＳ内の係数の絶対値を調べ、その最大値Ｍａｂｓ
（Ｓ）が求められる（ステップＳ９０１）。次に、最大
値Ｍａｂｓ（Ｓ）を２進数で表現するために必要となる
桁数Ｎ_BP（Ｓ）を次式に基づいて求める（ステップＳ９
０２）。FIG. 9 is a flow chart for explaining the operation procedure of the bit plane coding unit 109 for coding the subband S. First, the absolute value of the coefficient in the sub-band S to be encoded is checked, and its maximum value Mabs
(S) is obtained (step S901). Next, the number of digits N _BP (S) required to represent the maximum value Mabs (S) in binary number is calculated based on the following equation (step S9).
02).

【００３６】N_BP(S)=ceil{log₂(Mabs(S))} （４）ここで、ｃｅｉｌ｛Ｒ｝は、実数Ｒに等しいか、あるい
はそれ以上の最小の整数値を表す。N _BP (S) = ceil {log ₂ (Mabs (S))} (4) where ceil {R} represents the smallest integer value equal to or greater than the real number R.

【００３７】そして、ビットプレーン番号ｎに有効桁数
Ｎ_BP（Ｓ）を代入する（ステップＳ９０３）。さらに、
ビットプレーン番号ｎから１を引く（ステップＳ９０
４）。そして、ビットプレーンｎを２値算術符号を用い
て符号化する（ステップＳ９０５）。本実施形態におい
ては、算術符号としてＱＭ−Ｃｏｄｅｒを用いる。尚、
ＱＭ−Ｃｏｄｅｒを用いて、ある状態（コンテクスト）
で発生した２値シンボルを符号化する手順、あるいは、
算術符号化処理のための初期化手順や終端手順について
は、静止画像の国際標準ITU-T Recommendation T.81 |
ISO/IEC10918-1勧告等に詳細に説明されているのでこ
こでは説明を省略する。Then, the number of significant digits N _BP (S) is substituted for the bit plane number n (step S903). further,
Subtract 1 from the bit plane number n (step S90
4). Then, the bit plane n is encoded using the binary arithmetic code (step S905). In this embodiment, QM-Coder is used as the arithmetic code. still,
A certain state (context) using QM-Coder
Procedure to encode the binary symbol generated in
Regarding the initialization procedure and termination procedure for arithmetic coding processing, the international standard for still images, ITU-T Recommendation T.81 |
Since it is explained in detail in ISO / IEC10918-1 recommendation etc., the explanation is omitted here.

【００３８】また、本実施形態では、説明を簡単にする
ため、単一のコンテクストで各ビットを算術符号化する
ものとする。そして、各ビットプレーンの符号化の開始
時に算術符号化器を初期化し、終了時に算術符号化器の
終端処理を行う。In the present embodiment, each bit is arithmetically encoded with a single context for the sake of simplicity. Then, the arithmetic encoder is initialized at the start of encoding each bit plane, and the termination processing of the arithmetic encoder is performed at the end.

【００３９】さらに、個々の係数について、最初に符号
化される”１”の直後に、その係数の正負符号を０また
は１で表して算術符号化する。本実施形態では、係数の
符号が正の場合は０、負の場合は１とする。例えば、係
数が−５であって、この係数の属するサブバンドＳの有
効桁数Ｎ_BP（Ｓ）が６の場合、係数の絶対値は２進数０
００１０１で表され、各ビットプレーンの符号化により
上位桁から下位桁へと符号化される。そして、２番目の
ビットプレーンの符号化時（上から４桁目）に最初の”
１”が符号化され、この直後に負の場合の正負符号”
１”を算術符号化するようにする。Further, for each coefficient, immediately after the first coded "1", the sign of the coefficient is represented by 0 or 1 and arithmetically coded. In this embodiment, 0 is set when the sign of the coefficient is positive, and 1 is set when the sign is negative. For example, when the coefficient is -5 and the number of significant digits N _BP (S) of the subband S to which this coefficient belongs is 6, the absolute value of the coefficient is binary 0.
It is represented by 00101 and is encoded from the upper digit to the lower digit by encoding each bit plane. Then, when encoding the second bit plane (fourth digit from the top), the first "
1 "is encoded, and immediately after this, a positive / negative sign when negative"
1 "is arithmetically encoded.

【００４０】また、ビットプレーン番号ｎを０と比較し
て、ｎが０であるか否かが判断される（ステップＳ９０
６）。その結果、ｎが０の場合（Ｙｅｓ）、すなわち、
ステップＳ９０５でＬＳＢプレーンの符号化が行われた
場合、サブバンドの符号化処理を終了する。一方、ｎが
０でない場合（Ｎｏ），ステップＳ９０４の処理が行わ
れる。Further, the bit plane number n is compared with 0 to determine whether n is 0 (step S90).
6). As a result, when n is 0 (Yes), that is,
If the LSB plane has been encoded in step S905, the subband encoding process ends. On the other hand, when n is not 0 (No), the process of step S904 is performed.

【００４１】上述した手順の処理によって、サブバンド
Ｓの全係数を符号化して、各ビットプレーンｎに対応す
る符号列ＣＳ（Ｓ，ｎ）が生成される。符号列形成部１
１０は、ビットプレーン符号化部１０９に接続してお
り、生成された符号列は符号列形成部１１０に送られ、
符号列形成部１１０内の不図示のバッファに一時的に格
納される。符号列形成部１１０は、さらに、符号出力部
１１１に接続している。そして、符号列形成部１１０
は、ビットプレーン符号化部１０９から出力されたすべ
てのタイル（Ｔ０〜Ｔｎ）についてのタイルＴｉまたは
縮小タイルＳＴｉの全サブバンドの係数の全符号列が内
部バッファに格納された後、所定の順序で内部バッファ
に格納される符号列を読み出す。そして、読み出した符
号列に必要な付加情報を挿入して、本符号化装置の出力
となる最終的な符号列を形成して、符号出力部１１１に
出力する。By the processing of the above-mentioned procedure, all the coefficients of the subband S are encoded to generate the code string CS (S, n) corresponding to each bit plane n. Code string forming unit 1
10 is connected to the bit plane encoding unit 109, the generated code string is sent to the code string forming unit 110,
It is temporarily stored in a buffer (not shown) in the code string forming unit 110. The code string forming unit 110 is further connected to the code output unit 111. Then, the code string forming unit 110
Is a predetermined order after all the code strings of the coefficients of all subbands of the tile Ti or the reduced tile STi for all tiles (T0 to Tn) output from the bit plane encoding unit 109 are stored in the internal buffer. Reads the code string stored in the internal buffer. Then, necessary additional information is inserted into the read code string to form a final code string that is the output of the present coding apparatus, and is output to the code output unit 111.

【００４２】図１０は、符号列形成部１１０において生
成される符号列の構造を示す図である。符号列形成部１
１０で生成される最終的な符号列は、タイルのヘッダと
タイルＴ０〜Ｔｎの各タイルの符号化データとから構成
される。また、各タイルの符号化データには、画像デー
タにおけるどのタイルかを識別するための識別情報やそ
れぞれのタイルに適用された離散ウェーブレット変換の
回数などの付加情報を格納したタイルヘッダと、レベル
０とレベル１の２つ、あるいは、レベル０とレベル１と
レベル２の３つに階層化された符号化データとから構成
される。FIG. 10 is a diagram showing the structure of a code string generated by the code string forming unit 110. Code string forming unit 1
The final code string generated in 10 includes a tile header and coded data of tiles T0 to Tn. In addition, in the encoded data of each tile, a tile header storing identification information for identifying which tile in the image data, additional information such as the number of discrete wavelet transforms applied to each tile, and a level 0 And level 1 or level 0, level 1 and level 2 in three layers.

【００４３】図１１は、Ｆ（Ｔｉ）＝０の場合のタイル
ＳＴｉの符号化データの構成を示す概要図である。ま
た、図１２は、Ｆ（Ｔｉ）＝１の場合のタイルＴｉの符
号化データの構成を示す概要図である。図１１及び図１
２に示すように、レベル０の符号化データは、ＬＬサブ
バンドの係数を符号化して得られるＣＳ（ＬＬ，Ｎ
_BP（ＬＬ）−１）からＣＳ（ＬＬ，０）までの符号列か
ら構成される。FIG. 11 is a schematic diagram showing the structure of the encoded data of the tile STi when F (Ti) = 0. Further, FIG. 12 is a schematic diagram showing the configuration of encoded data of tile Ti when F (Ti) = 1. 11 and 1
2, the encoded data of level 0 is CS (LL, N) obtained by encoding the coefficients of the LL subband.
_{It is} composed of code strings from _BP (LL) -1) to CS (LL, 0).

【００４４】また、レベル１は、ＬＨ１、ＨＬ１、ＨＨ
１の各サブバンドの係数を符号化して得られる符号列Ｃ
Ｓ（ＬＨ１，Ｎ_BP（ＬＨ１）−１）からＣＳ（ＬＨ１，
０）と、ＣＳ（ＨＬ１，Ｎ_BP（ＨＬ１）−１）からＣＳ
（ＨＬ１，０）と、ＣＳ（ＨＨ１，Ｎ_BP（ＨＨ１）−
１）からＣＳ（ＨＨ１，０）とから構成される。さら
に、レベル２は、ＬＨ２、ＨＬ２、ＨＨ２の各サブバン
ドの係数を符号化して得られる符号列ＣＳ（ＬＨ２，Ｎ
_BP（ＬＨ２）−１）からＣＳ（ＬＨ２，０）と、ＣＳ
（ＨＬ２，Ｎ_BP（ＨＬ２）−１）からＣＳ（ＨＬ２，
０）と、ＣＳ（ＨＨ２，Ｎ_BP（ＨＨ２）−１）からＣＳ
（ＨＨ２，０）とから構成される。Level 1 is LH1, HL1, HH
Code sequence C obtained by encoding the coefficients of each subband of 1
From S (LH1, N _BP (LH1) -1) to CS (LH1,
0) and CS (HL1, N _BP (HL1) -1) to CS
(HL1,0) and CS (HH1, N _BP (HH1)-
1) to CS (HH1,0). Further, level 2 has a code string CS (LH2, NH) obtained by coding the coefficients of the subbands LH2, HL2, and HH2.
_{From BP} (LH2) -1) to CS (LH2,0), CS
From (HL2, N _BP (HL2) -1) to CS (HL2,
0) and CS (HH2, N _BP (HH2) -1) to CS
(HH2,0).

【００４５】また、符号出力部１１１は、符号列形成部
１１０に接続され、符号列形成部１１０で生成された符
号列を装置外部へと出力するための装置である。符号出
力部１１１の一例として、ハードディスクやメモリとい
った記憶装置、ネットワーク回線のインタフェース等が
ある。The code output unit 111 is a device that is connected to the code string forming unit 110 and outputs the code string generated by the code string forming unit 110 to the outside of the device. Examples of the code output unit 111 include a storage device such as a hard disk and a memory, a network line interface, and the like.

【００４６】図１３は、上述した実施形態による画像符
号化装置の動作手順を説明するためのフローチャートで
ある。まず、画像入力部１０１から高解像度のＸ画素×
Ｙ画素の画像データが入力される（ステップＳ１３
１）。入力された画像データは、タイル分割部１０３に
おいて、ＴＷ×ＴＨの大きさのタイルに分割される（ス
テップＳ１３２）。そして、分割された各タイルについ
て、縦横とも１／２の縮小率によって縮小画像が生成さ
れる（ステップＳ１３３）。FIG. 13 is a flow chart for explaining the operation procedure of the image coding apparatus according to the above-mentioned embodiment. First, from the image input unit 101, high resolution X pixels x
Image data of Y pixels is input (step S13).
1). The input image data is divided by the tile division unit 103 into tiles having a size of TW × TH (step S132). Then, for each of the divided tiles, a reduced image is generated with a reduction ratio of 1/2 in both the vertical and horizontal directions (step S133).

【００４７】一方、分割された各領域についての領域情
報が領域情報入力部１０２から入力され、分割された各
領域の領域情報に基づいて、各領域ごとにタイルまたは
縮小タイルのどちらを選択するかが、領域判定部１０５
において判定される（ステップＳ１３４）。そして、セ
レクタ１０６は判定された方のタイルを選択する（ステ
ップＳ１３５）。さらに、離散ウェーブレット変換部１
０７において、選択されたタイルの画像データの離散ウ
ェーブレット変換が行われ、所定数のサブバンドに分解
される（ステップＳ１３６）。On the other hand, area information about each divided area is input from the area information input unit 102, and which of the tiles or reduced tiles is selected for each area based on the area information of each divided area. However, the area determination unit 105
Is determined (step S134). Then, the selector 106 selects the determined tile (step S135). Furthermore, the discrete wavelet transform unit 1
At 07, the discrete wavelet transform of the image data of the selected tile is performed and decomposed into a predetermined number of subbands (step S136).

【００４８】また、分解された各サブバンドの変換係数
が、変換係数量子化部１０８において量子化される（ス
テップＳ１３７）。量子化された変換係数は、ビットプ
レーン符号化部１０９において符号化され（ステップＳ
１３８）、符号列形成部１１０においてビット列が形成
される（ステップＳ１３９）。そして、生成された符号
化データは、符号出力部１１１から装置外部に出力され
る（ステップＳ１４０）。Further, the decomposed transform coefficient of each sub-band is quantized by the transform coefficient quantizing unit 108 (step S137). The quantized transform coefficient is coded by the bit plane coding unit 109 (step S
138), the bit string is formed in the code string forming unit 110 (step S139). Then, the generated encoded data is output from the code output unit 111 to the outside of the device (step S140).

【００４９】すなわち、本発明による画像符号化装置
は、画像を所定の領域に分割して小画像を生成する分割
手段（タイル分割部１０３）と、分割された各領域につ
いての領域情報を入力する入力手段（領域情報入力部１
０２、領域判定部１０５）と、分割された各領域につい
て、所定縮小率によって縮小画像を生成する生成手段
（画像縮小部１０４）と、分割された各領域の領域情報
に基づいて、各領域ごとに小画像または縮小画像を周波
数変換し、所定数のサブバンドに分解する分解手段（セ
レクタ１０６、離散ウェーブレット変換部１０７）と、
分解された各サブバンドの変換係数を符号化する符号化
手段（ビットプレーン符号化部１０９）とを備えること
を特徴とする。That is, the image coding apparatus according to the present invention inputs the dividing means (tile dividing unit 103) for dividing an image into predetermined regions to generate a small image, and the region information for each divided region. Input means (area information input section 1
02, a region determination unit 105), a generation unit (image reduction unit 104) that generates a reduced image at a predetermined reduction ratio for each divided region, and region information for each divided region. Decomposing means (selector 106, discrete wavelet transforming unit 107) for frequency-converting the small image or reduced image into a predetermined number of subbands,
Coding means (bit plane coding section 109) for coding the decomposed transform coefficient of each sub-band is provided.

【００５０】また、本発明による画像符号化装置は、画
像を周波数変換して所定数のサブバンドへ分解する手段
（離散ウェーブレット変換部１０７）が、離散ウェーブ
レット変換を用いることを特徴とする。さらに、本発明
による画像符号化装置は、分割された所定領域について
の分解されたサブバンドの変換係数のうち、所定の係数
を量子化する量子化手段（変換係数量子化部１０８）を
さらに備えることを特徴とする。さらにまた、本発明に
よる画像符号化装置は、符号化されたビット列に対し
て、所定のヘッダ情報を付与する付与手段（符号列形成
部１１０）をさらに備えることを特徴とする。Further, the image coding apparatus according to the present invention is characterized in that the means (discrete wavelet transform section 107) for frequency-transforming an image into a predetermined number of sub-bands uses discrete wavelet transform. Furthermore, the image coding apparatus according to the present invention further includes a quantizing unit (transform coefficient quantizing unit 108) that quantizes a predetermined coefficient of the transform coefficients of the decomposed subbands of the divided predetermined area. It is characterized by Furthermore, the image coding apparatus according to the present invention is further characterized by further including adding means (code string forming unit 110) for adding predetermined header information to the encoded bit string.

【００５１】以上に述べたように、本実施形態では、入
力された画像データを複数のタイルＴｉに分割し、タイ
ル単位で高解像度符号化領域か否かを判定して、高解像
度符号化領域に属するタイルについてはそのタイルＴｉ
を、高解像度符号化領域に属さないタイルについては適
切な縮小タイルＳＴｉを離散ウェーブレット変換して符
号化した。これによって、高解像度の画像が必要な部分
については高解像度再現のための情報を含んだ符号化デ
ータとして、また、高解像度を必要とない部分について
は好適な低解像度復元画像を得ることが可能な符号化デ
ータとして符号化データを生成することが可能になる。As described above, in the present embodiment, the input image data is divided into a plurality of tiles Ti, and it is judged in tile units whether or not the tiles are high-resolution encoded areas. Tiles that belong to
For the tiles that do not belong to the high-resolution encoded area, the appropriate reduced tile STi is discrete-wavelet-transformed and encoded. This makes it possible to obtain coded data that includes information for high resolution reproduction for areas that require high resolution images and suitable low resolution restored images for areas that do not require high resolution. It is possible to generate encoded data as various encoded data.

【００５２】＜第２の実施形態＞図１４は、本発明の第
２の実施形態による画像符号化装置の構成を示すブロッ
ク図である。図１４において、図１に示される第１の実
施形態による画像符号化装置で用いられた構成要素と同
じ動作・機能の装置については、同じ符号で示してお
り、それらの説明を省略する。<Second Embodiment> FIG. 14 is a block diagram showing the arrangement of an image coding apparatus according to the second embodiment of the present invention. 14, devices having the same operations and functions as those of the components used in the image encoding device according to the first embodiment shown in FIG. 1 are denoted by the same reference numerals, and the description thereof will be omitted.

【００５３】本実施形態における画像符号化装置も、第
１の実施形態と同様に各画素の輝度値を８ビットで表現
した白黒画像データを符号化する。また、本実施形態で
は、画像データは８ビットだけに限らず、４ビット、１
０ビット、１２ビット等のビット数で輝度値を表現する
画像データを符号化する場合にも適用可能である。さら
に、各画素をＲＧＢ、ＣＭＹＫ等の複数の色成分、ある
いはＹＣｂＣｒ等の輝度と色度／色差成分で表現するカ
ラー画像データを符号化する場合にも適用可能である。
この場合には、カラー画像データ中の各成分が白黒画像
データであるとみなせばよい。また、入力される画像デ
ータの大きさを第１の実施形態と同様にＸ画素×Ｙ画素
とし、所定の大きさ（ＴＷ×ＴＨ）で画像データを分割
して生成したタイルの縦横の大きさをさらに半分にした
ものを縮小画像とする。尚、本実施形態において、Ｘ、
Ｙは、共に４の倍数とする。The image encoding apparatus in this embodiment also encodes monochrome image data in which the brightness value of each pixel is represented by 8 bits, as in the first embodiment. Further, in the present embodiment, the image data is not limited to 8 bits, but 4 bits, 1
It can also be applied to the case of encoding image data expressing a brightness value with a bit number such as 0 bit or 12 bits. Furthermore, it is also applicable to the case of encoding color image data in which each pixel is expressed by a plurality of color components such as RGB and CMYK, or luminance and chromaticity / color difference components such as YCbCr.
In this case, each component in the color image data may be regarded as monochrome image data. Further, the size of the input image data is set to X pixels × Y pixels as in the first embodiment, and the vertical and horizontal sizes of tiles generated by dividing the image data into a predetermined size (TW × TH). The reduced image is a half of the above. In the present embodiment, X,
Both Y are multiples of 4.

【００５４】以下、図１４に示されるブロック図を用い
て、本実施形態に係る画像符号化装置の各部について説
明する。画像入力部１０１は、符号化される画像データ
を入力するための装置であり、入力された画像データを
所定の大きさのタイルに分割するタイル分割部１０３に
接続されている。タイル分割部１０３は、分割されたタ
イルから所定の縮小画像を生成する画像縮小部１０４
と、タイルが文字領域または写真領域であるか等の領域
情報を判定する領域判定部と、各タイルについて必要な
解像度を判定するセレクタ１０６に接続している。Each section of the image coding apparatus according to this embodiment will be described below with reference to the block diagram shown in FIG. The image input unit 101 is a device for inputting encoded image data, and is connected to a tile division unit 103 that divides the input image data into tiles of a predetermined size. The tile dividing unit 103 is an image reducing unit 104 that generates a predetermined reduced image from the divided tiles.
An area determination unit that determines area information such as whether the tile is a character area or a photograph area, and a selector 106 that determines a required resolution for each tile.

【００５５】また、セレクタ１０６は、さらに画像縮小
部１０４と判定されたタイルの画像データを離散ウェー
ブレット変換する離散ウェーブレット変換部１４０４に
接続している。離散ウェーブレット変換部１４０４は、
さらにビットプレーン符号化部１０９に接続されてお
り、変換された画像データを符号化するために出力す
る。ビットプレーン符号化部１０９は、符号出力部１１
１に接続した符号列形成部１４０３に接続されて、最下
位ＢＰ決定部１４０２で決定された最下位ビットに基づ
いて符号列を形成する。Further, the selector 106 is further connected to the discrete wavelet transform unit 1404 which performs discrete wavelet transform on the image data of the tile determined to be the image reduction unit 104. The discrete wavelet transform unit 1404
Further, it is connected to the bit plane coding unit 109 and outputs the converted image data for coding. The bit plane encoding unit 109 includes a code output unit 11
The code string forming unit 1403 connected to 1 forms a code string based on the least significant bit determined by the least significant BP determining unit 1402.

【００５６】図１５は、図１４に示される第２の実施形
態による画像符号化装置の動作手順を説明するためのフ
ローチャートである。まず、第１の実施形態と同様に、
本実施形態による画像符号化装置の符号化対象となる画
像データは、画像入力部１０１からラスタ走査順に入力
される（ステップＳ１５１）。入力された画像データ
は、タイル分割部１０３において、所定の幅ＴＷと高さ
ＴＨを有する複数のタイルに分割される（ステップＳ１
５２）。生成されたタイルＴ０〜Ｔｎは、ラスタ走査順
に、タイル分割部１０３に接続された領域判定部１４０
１、セレクタ１０６、画像縮小部１０４に出力される。FIG. 15 is a flow chart for explaining the operation procedure of the image coding apparatus according to the second embodiment shown in FIG. First, as in the first embodiment,
The image data to be encoded by the image encoding apparatus according to the present embodiment is input from the image input unit 101 in raster scan order (step S151). The input image data is divided by the tile division unit 103 into a plurality of tiles having a predetermined width TW and height TH (step S1).
52). The generated tiles T0 to Tn have area determination units 140 connected to the tile division unit 103 in raster scan order.
1, the selector 106, and the image reduction unit 104.

【００５７】領域判定部１４０１では、タイル分割部１
０３で生成されたタイルＴｉを不図示の内部バッファに
格納し、タイル内の画素値を調べる。そして、当該タイ
ルが高解像度符号化領域を含むタイルであるか、高解像
度符号化領域を含まないタイルかであるかを判定し、そ
の判定結果Ｆ（Ｔｉ）をタイルごとに出力する。In the area determination unit 1401, the tile division unit 1
The tile Ti generated in 03 is stored in an internal buffer (not shown), and the pixel value in the tile is checked. Then, it is determined whether the tile is a tile including the high-resolution encoded area or a tile not including the high-resolution encoded area, and the determination result F (Ti) is output for each tile.

【００５８】タイルＴｉの判定は、タイルＴｉの各画素
値Ｔｉ（ｘ，ｙ）についてエッジ判定処理が行われ、各
画素がエッジ部であるか否かを示したエッジ判定情報Ｅ
（Ｔｉ（ｘ，ｙ））が生成される。そして、生成された
エッジ判定情報Ｅ（Ｔｉ（ｘｘ，ｙ））の値は、画素Ｔ
ｉ（ｘ，ｙ）がエッジ部と判定された場合に１が付与さ
れ、エッジ部ではないと判定された場合に０が付与され
る。そして、タイルＴｉ内のすべての画素について前述
したエッジ判定が行われる（ステップＳ１５３ａ）。そ
の後、タイルＴｉ内のエッジの数、すなわちＥ（Ｔｉ
（ｘ，ｙ））の総和が求められ（ステップＳ１５３
ｂ）、その値を所定のしきい値Ｔｈと比較して、エッジ
数の総和がしきい値Ｔｈ以上の値の場合、タイルＴｉを
高解像度符号化領域を含む領域であると判断する。そし
て、判定結果Ｆ（Ｔｉ）として１を出力する。一方、総
和の値がしきい値Ｔｈに等しい、またはそれ以下である
場合、タイルＴｉを高解像度符号化領域を含まないと判
断して、判定結果Ｆ（Ｔｉ）として０を出力する（ステ
ップＳ１５３ｃ）。In determining the tile Ti, edge determination processing is performed for each pixel value Ti (x, y) of the tile Ti, and edge determination information E indicating whether each pixel is an edge portion or not.
(Ti (x, y)) is generated. The value of the generated edge determination information E (Ti (xx, y)) is the pixel T
When i (x, y) is determined to be an edge portion, 1 is given, and when it is determined not to be an edge portion, 0 is given. Then, the above-described edge determination is performed for all the pixels in the tile Ti (step S153a). Then the number of edges in the tile Ti, ie E (Ti
The sum of (x, y)) is calculated (step S153).
b) The value is compared with a predetermined threshold value Th, and when the sum of the number of edges is a value equal to or larger than the threshold value Th, it is determined that the tile Ti is an area including the high resolution coding area. Then, 1 is output as the determination result F (Ti). On the other hand, when the value of the total sum is equal to or less than the threshold value Th, it is determined that the tile Ti does not include the high resolution coding area, and 0 is output as the determination result F (Ti) (step S153c). ).

【００５９】一方、画像縮小部１０４では、上述した領
域判定と同時、あるいはその前後において、分割された
タイルから幅及び高さが半分の縮小画像が生成される
（ステップＳ１５４）。また、セレクタ１０６では、第
１の実施形態と同様にして、領域判定部１４０１の判定
結果Ｆ（Ｔｉ）に基づいて、着目するタイルＴｉ、また
は着目するタイルＴｉについて画像縮小部１０４で縮小
して生成されたＳＴｉを選択して、離散ウェーブレット
変換部１４０４に出力する（ステップＳ１５５）。On the other hand, the image reducing unit 104 generates a reduced image having a half width and a half height from the divided tiles at the same time as, or before and after the above-described area determination (step S154). Further, in the selector 106, similarly to the first embodiment, the tile Ti of interest or the tile Ti of interest is reduced by the image reduction unit 104 based on the determination result F (Ti) of the area determination unit 1401. The generated STi is selected and output to the discrete wavelet transform unit 1404 (step S155).

【００６０】離散ウェーブレット変換部１４０４は、セ
レクタ１０６において選択されたタイルＴｉまたは縮小
タイルＳＴｉを離散ウェーブレット変換した変換係数を
ビットプレーン符号化部１０９に出力する（ステップＳ
１５６）。離散ウェーブレット変換部１４０４における
処理は、変換に使用するフィルタを除いて、第１の実施
形態における離散ウェーブレット変換部１０７と同じで
ある。離散ウェーブレット変換部１４０４では、Ｎ個の
１次元信号ｘ（ｎ），（ｎ＝０，・・・，Ｎ−１に対する
１次元離散ウェーブレット変換は、次式によって行われ
るものとする。The discrete wavelet transform unit 1404 outputs the transform coefficient obtained by performing the discrete wavelet transform of the tile Ti or the reduced tile STi selected by the selector 106 to the bit plane coding unit 109 (step S).
156). The process in the discrete wavelet transform unit 1404 is the same as that in the discrete wavelet transform unit 107 in the first embodiment except for the filter used for the transform. In the discrete wavelet transform unit 1404, the one-dimensional discrete wavelet transform for N one-dimensional signals x (n), (n = 0, ..., N−1) is performed by the following equation.

【００６１】 h(n) = x(2n+1) - floor{( x(2n)+x(2n+2) ) / 2 } （５） l(n) = x(2n) + floor{( h(n-1) + h(n) + 2 ) / 4} （６）ここで、ｆｌｏｏｒ｛Ｒ｝は、Ｒを超えない最大の整数
値を得る関数であり、ｈ（ｎ）は高周波サブバンドの係
数、ｌ（ｎ）は低周波サブバンドの係数を表す。尚、上
式の計算において必要となる１次元信号ｘ（ｎ）の両端
ｘ（ｎ），（ｎ＜０及びｎ≧Ｎ）は公知の手法により１
次元信号ｘ（ｎ），（０≦ｎ＜Ｎ）の値から求める。H (n) = x (2n + 1)-floor {(x (2n) + x (2n + 2)) / 2} (5) l (n) = x (2n) + floor {(h (n-1) + h (n) + 2) / 4} (6) where floor {R} is a function that obtains a maximum integer value that does not exceed R, and h (n) is a high-frequency subband. , L (n) represents the coefficient of the low frequency subband. It should be noted that both ends x (n) and (n <0 and n ≧ N) of the one-dimensional signal x (n) required in the calculation of the above equation are set to 1 by a known method.
It is obtained from the values of the dimensional signals x (n) and (0 ≦ n <N).

【００６２】ビットプレーン符号化部１０９は、さらに
符号列形成部１４０３に接続されており、離散ウェーブ
レット変換された変換係数がビットプレーン符号化部１
０９において生成された各タイルごとの符号化データを
符号列形成部１４０３に出力する（ステップＳ１５
７）。The bit plane coding unit 109 is further connected to the code string forming unit 1403, and the transform coefficients obtained by the discrete wavelet transform are transmitted to the bit plane coding unit 1.
The encoded data for each tile generated in 09 is output to the code string forming unit 1403 (step S15).
7).

【００６３】また、最下位ＢＰ決定部１４０２は、領域
判定部１４０１の判定結果Ｆ（Ｔｉ）に基づいて、各サ
ブバンドＳのビットプレーン符号化データのうち最終的
に生成する符号列に含める最下位ビットプレーンＬＢＰ
（Ｓ）を決定する（ステップＳ１５８）。すなわち、最
下位ＢＰ決定部１４０３で決定される最下位ビットプレ
ーン以下の符号化データは破棄され、装置外部に出力さ
れる符号化データには含まれない。尚、この決定は、タ
イルごとの離散ウェーブレット変換の変換係数の符号化
と同時、あるいはその前後でもよい。Further, the lowest BP determining unit 1402 includes the lowest BP to be included in the finally generated code string of the bit plane encoded data of each subband S based on the determination result F (Ti) of the area determining unit 1401. Lower bit plane LBP
(S) is determined (step S158). That is, the coded data below the least significant bit plane determined by the least significant BP determination unit 1403 is discarded and is not included in the coded data output to the outside of the device. Note that this determination may be performed at the same time as or before or after the coding of the transform coefficient of the discrete wavelet transform for each tile.

【００６４】例えば、あるサブバンドＳについて最下位
ビットプレーンＬＢＰ（Ｓ）を２とした場合、そのサブ
バンドＳについてはビットプレーン番号１とビットプレ
ーン番号０の２つのビットプレーンの符号化データは伝
送されない。このように、Ｎ枚のビットプレーン符号化
データを破棄することは、そのサブバンドの係数値を２
のＮ乗で量子化することと同様の効果がある。図１６
は、Ｆ（Ｔｉ）＝０の場合に選択される最下位ビットプ
レーンＬＢＰ（Ｓ）の一例を示す図である。また、図１
７は、Ｆ（Ｔｉ）＝１の場合に選択される最下位ビット
プレーンＬＢＰ（Ｓ）の一例を示す図である。以降、図
１６及び図１７の最下位ビットプレーンＬＢＰ（Ｓ）を
使用する場合について説明する。For example, when the least significant bit plane LBP (S) of a certain subband S is 2, the coded data of the two bitplanes of the bitplane number 1 and the bitplane number 0 is transmitted for the subband S. Not done. Thus, discarding N bit-plane coded data sets the coefficient value of the subband to 2
There is the same effect as quantization by the Nth power of. FIG.
FIG. 7 is a diagram showing an example of the least significant bit plane LBP (S) selected when F (Ti) = 0. Also, FIG.
FIG. 7 is a diagram showing an example of the least significant bit plane LBP (S) selected when F (Ti) = 1. Hereinafter, a case of using the least significant bit plane LBP (S) of FIGS. 16 and 17 will be described.

【００６５】そして、符号列形成部１４０３は、ビット
プレーン符号化部１０９で生成されたタイルＴｉについ
ての符号化データを不図示の内部バッファに格納する。
但し、各タイル毎に最下位ＢＰ決定部１４０２において
決定されるサブバンドＳの最下位ビットプレーンＬＢＰ
（Ｓ）以下の符号化データについては、バッファに格納
しないものとする。Then, the code string forming unit 1403 stores the coded data for the tile Ti generated by the bit plane coding unit 109 in an internal buffer (not shown).
However, the least significant bit plane LBP of the subband S determined by the least significant BP determination unit 1402 for each tile.
(S) The encoded data below is not stored in the buffer.

【００６６】符号列形成部１４０３では、入力された画
像データのすべてのタイルＴ０〜Ｔｎについての符号列
が内部バッファに格納された後、所定の順序で内部バッ
ファに格納される符号列が読み出され、必要な付加情報
が挿入されて、本画像符号化装置の出力となる最終的な
符号列が形成されて、符号出力部１１１に出力される
（ステップＳ１５９）。In the code string forming unit 1403, after the code strings for all tiles T0 to Tn of the input image data are stored in the internal buffer, the code strings stored in the internal buffer are read in a predetermined order. Then, necessary additional information is inserted, a final code string which is an output of the image coding apparatus is formed, and is output to the code output unit 111 (step S159).

【００６７】符号列形成部１４０３で生成される最終的
な符号列は、ヘッダと、タイルＴ０〜Ｔｎの各タイルの
符号化データにより構成される。また、各タイルの符号
化データは、画像データにおけるタイルの位置を識別す
るための識別情報やそれぞれのタイルに適用された離散
ウェーブレット変換の回数などの付加情報を格納したタ
イルヘッダと、レベル０とレベル１の２つ、あるいは、
レベル０とレベル１とレベル２の３つに階層化された符
号化データとから構成される。The final code string generated by the code string forming unit 1403 is composed of a header and coded data of each tile of tiles T0 to Tn. The encoded data of each tile includes a tile header storing identification information for identifying the position of the tile in the image data, additional information such as the number of discrete wavelet transforms applied to each tile, and level 0. Two of level 1, or
It is composed of encoded data hierarchically divided into three levels of level 0, level 1 and level 2.

【００６８】図１８は、符号列形成部１４０３において
生成されるＦ（Ｔｉ）＝０の場合のタイルＳＴｉの符号
化データの構成を示す概要図である。また、図１９は、
符号列形成部１４０３において生成されるＦ（Ｔｉ）＝
１の場合のタイルＴｉの符号化データの構成を示す概要
図である。図１８及び図１９に示すように、レベル０の
符号化データは、ＬＬサブバンドの係数を符号化して得
られるＣＳ（ＬＬ，Ｎ _BP（ＬＬ）−１）からＣＳ（Ｌ
Ｌ，０）までの符号列から構成される。FIG. 18 shows the code string forming section 1403.
Sign of tile STi generated when F (Ti) = 0
It is a schematic diagram which shows the structure of the conversion data. In addition, FIG.
F (Ti) = generated in the code string forming unit 1403
Outline showing the structure of encoded data of tile Ti in the case of 1
It is a figure. As shown in FIG. 18 and FIG.
The encoded data is obtained by encoding the coefficients of the LL subband.
CS (LL, N _BP(LL) -1) to CS (L
L, 0).

【００６９】また、レベル１は、ＬＨ１、ＨＬ１、ＨＨ
１の各サブバンドの係数を符号化して得られる符号列Ｃ
Ｓ（ＬＨ１，Ｎ_BP（ＬＨ１）−１）からＣＳ（ＬＨ１，
０）と、ＣＳ（ＨＬ１，Ｎ_BP（ＨＬ１）−１）からＣＳ
（ＨＬ１，０）と、ＣＳ（ＨＨ１，Ｎ_BP（ＨＨ１）−
１）からＣＳ（ＨＨ１，０）とから構成される。さら
に、レベル２は、ＬＨ２、ＨＬ２、ＨＨ２の各サブバン
ドの係数を符号化して得られる符号列ＣＳ（ＬＨ２，Ｎ
_BP（ＬＨ２）−１）からＣＳ（ＬＨ２，０）と、ＣＳ
（ＨＬ２，Ｎ_BP（ＨＬ２）−１）からＣＳ（ＨＬ２，
０）と、ＣＳ（ＨＨ２，Ｎ_BP（ＨＨ２）−１）からＣＳ
（ＨＨ２，０）とから構成される。Level 1 is LH1, HL1, HH
Code sequence C obtained by encoding the coefficients of each subband of 1
From S (LH1, N _BP (LH1) -1) to CS (LH1,
0) and CS (HL1, N _BP (HL1) -1) to CS
(HL1,0) and CS (HH1, N _BP (HH1)-
1) to CS (HH1,0). Further, level 2 has a code string CS (LH2, NH) obtained by coding the coefficients of the subbands LH2, HL2, and HH2.
_{From BP} (LH2) -1) to CS (LH2,0), CS
From (HL2, N _BP (HL2) -1) to CS (HL2,
0) and CS (HH2, N _BP (HH2) -1) to CS
(HH2,0).

【００７０】そして、符号出力部１１１は、符号列形成
部１４０３で生成された符号列を装置外部へと出力する
（ステップＳ１６０）。この符号出力部１１１は、第１
の実施形態と同様に、ハードディスクやメモリといった
記憶装置、ネットワーク回線のインタフェース等で実現
される。Then, the code output unit 111 outputs the code string generated by the code string forming unit 1403 to the outside of the device (step S160). The code output unit 111 has a first
Similar to the above embodiment, it is realized by a storage device such as a hard disk or a memory, an interface of a network line, or the like.

【００７１】すなわち、本発明による画像符号化装置
は、入力手段（領域判定部１４０１）が、分割された領
域内の各画素について、エッジ画素か否かを判定する判
定部と、当該領域内のエッジ画素と判定された画素数の
総和を算出する算出部と、算出されたエッジ画素数と所
定のしきい値とを比較して、当該領域の領域情報を生成
する生成部とを備え、生成された領域情報を入力するこ
とを特徴とする。That is, in the image coding apparatus according to the present invention, the input unit (area determining section 1401) determines, for each pixel in the divided area, whether or not the pixel is an edge pixel, and the inside of the area. A generation unit that calculates the total sum of the number of pixels determined to be edge pixels and a generation unit that compares the calculated number of edge pixels with a predetermined threshold value and generates region information of the region, It is characterized by inputting the region information that has been created.

【００７２】また、本発明による画像符号化装置は、入
力された領域情報に基づいて、各領域における符号化さ
れたサブバンドの変換係数の最下位ビットを決定する決
定手段（最下位ＢＰ決定部１４０２）をさらに備え、付
与手段（符号列形成部１４０３）が、各領域のサブバン
ドの係数のビット列から、決定された最下位ビット以上
のビット列を用いて符号化データとすることを特徴とす
る。Further, the image coding apparatus according to the present invention determines the least significant bit of the transform coefficient of the coded subband in each region based on the inputted region information (the least significant BP determining unit). 1402) is further provided, and the assigning means (code string forming unit 1403) uses the bit string of the determined least significant bit or more from the bit string of the coefficient of the sub-band of each region as encoded data. .

【００７３】以上述べたように、第２の実施形態では、
画像データを所定の大きさの複数のタイルに分割し、タ
イル単位で高解像度符号化領域か否かを判定した。そし
て、高解像度符号化領域に属するタイルについてはその
タイルを、高解像度符号化領域に属さないタイルについ
ては所定の縮小率で縮小した縮小タイルを離散ウェーブ
レット変換して符号化した。これによって、高解像度の
必要な部分のみを高解像度再現のための情報を含み、か
つ、高解像度を必要とない部分については好適な低解像
度復元画像を得ることができる符号化データの生成が可
能となる。As described above, in the second embodiment,
The image data was divided into a plurality of tiles of a predetermined size, and it was determined in tile units whether or not it was a high-resolution encoded area. Then, the tiles that belong to the high-resolution coded area are coded by discrete wavelet transforming the tiles that do not belong to the high-resolution coded area and the reduced tiles that have been reduced at a predetermined reduction rate. As a result, it is possible to generate encoded data that contains information for high-resolution reproduction only in the part that requires high resolution, and that can obtain a suitable low-resolution restored image for parts that do not require high resolution. Becomes

【００７４】＜第３の実施形態＞図２０は、本発明の第
３の実施形態による画像符号化装置の構成を示すブロッ
ク図である。図２０においても、第１及び第２の実施形
態で用いた各部と共通する部分については同じ符号で示
し、それらの説明は省略する。図２０において、タイル
分割部１０３は、離散ウェーブレット変換部２００１
と、画像縮小部１０４とに接続されている。また、画像
縮小部１０４はさらに離散ウェーブレット変換部２００
２に接続され、離散ウェーブレット変換部２００２は変
換係数量子化部１０８に接続している。<Third Embodiment> FIG. 20 is a block diagram showing the arrangement of an image encoding apparatus according to the third embodiment of the present invention. Also in FIG. 20, parts common to the parts used in the first and second embodiments are denoted by the same reference numerals, and description thereof will be omitted. 20, the tile division unit 103 includes a discrete wavelet transformation unit 2001.
And the image reduction unit 104. The image reduction unit 104 further includes a discrete wavelet transform unit 200.
2 and the discrete wavelet transform unit 2002 is connected to the transform coefficient quantization unit 108.

【００７５】さらに、セレクタ２００３は、領域判定部
１０５と離散ウェーブレット変換部２００１と変換係数
量子化部１０８に接続されており、領域情報に基づい
て、タイルＴｉに関する変換係数、またはタイルＳＴｉ
に関する量子化された変換係数をビットプレーン符号化
部１０９に出力する。Further, the selector 2003 is connected to the area determining section 105, the discrete wavelet transform section 2001 and the transform coefficient quantizing section 108, and based on the area information, the transform coefficient for the tile Ti or the tile STi.
It outputs the quantized transform coefficient for the bit plane coding unit 109.

【００７６】本実施形態においても第１及び第２の実施
形態と同様に、各画素の輝度値を８ビットで表現した白
黒画像データを符号化するものとして説明する。しかし
ながら、本発明はこれに限らず、４ビット、１０ビッ
ト、１２ビット等の８ビット以外のビット数で輝度値を
表現する画像データを符号化する場合にも適用可能であ
る。また、各画素をＲＧＢ、ＣＭＹＫ等の複数の色成
分、あるいはＹＣｂＣｒ等の輝度と色度／色差成分で表
現するカラー画像データを符号化する場合にも適用可能
である。この場合、カラー画像データ中の各成分が白黒
画像データであるとみなせばよい。Also in this embodiment, as in the first and second embodiments, description will be made assuming that the monochrome image data in which the luminance value of each pixel is represented by 8 bits is encoded. However, the present invention is not limited to this, and is also applicable to the case of encoding image data expressing a luminance value with a bit number other than 8 bits such as 4 bits, 10 bits, and 12 bits. It is also applicable when encoding color image data in which each pixel is represented by a plurality of color components such as RGB and CMYK, or luminance and chromaticity / color difference components such as YCbCr. In this case, each component in the color image data may be regarded as monochrome image data.

【００７７】また、本実施形態では、第１及び第２の実
施形態と同様に、符号化される画像データを、分割した
タイルごとに２つの解像度のいずれかで符号化する。以
降、符号化対象画像データの解像度を高解像度、水平・
垂直方向ともにこれを半分にした解像度を低解像度と呼
ぶ。尚、符号化される画像データの大きさは固定である
ものとし、入力される画像の大きさをＸ画素×Ｙ画素と
する。また、Ｘ、Ｙともに４の倍数とする。Further, in the present embodiment, as in the first and second embodiments, the image data to be encoded is encoded in one of two resolutions for each divided tile. After that, the resolution of the image data to be encoded is high
The resolution obtained by halving both in the vertical direction is called low resolution. The size of the encoded image data is fixed, and the size of the input image is X pixels × Y pixels. Further, both X and Y are multiples of 4.

【００７８】図２１は、図２０に示される構成の画像符
号化装置の動作手順を説明するためのフローチャートで
ある。まず、第１の実施形態と同様に、本実施形態によ
る画像符号化装置において符号化される画像データが、
画像入力部１０１からラスタ走査順に入力される（ステ
ップＳ２１１）。入力された画像データは、タイル分割
部１０３において、上述した実施形態と同様にして、所
定の幅ＴＷと高さＴＨのタイルに分割される（ステップ
Ｓ２１２）。そして、分割されたＴ０からＴｎまでのタ
イルは、ラスタ走査順に離散ウェーブレット変換部２０
０１と画像縮小部１０４に出力される。FIG. 21 is a flow chart for explaining the operation procedure of the image coding apparatus having the configuration shown in FIG. First, as in the first embodiment, the image data encoded in the image encoding device according to the present embodiment is
Inputs are made in raster scan order from the image input unit 101 (step S211). The input image data is divided into tiles having a predetermined width TW and height TH in the tile division unit 103, similarly to the above-described embodiment (step S212). Then, the divided tiles from T0 to Tn are processed by the discrete wavelet transform unit 20 in raster scan order.
01 and the image reduction unit 104.

【００７９】離散ウェーブレット変換部２００１は、タ
イル分割部１０３から入力されたタイルＴｉの画像デー
タを、不図示の内部バッファに適宜格納しながら、２次
元の離散ウェーブレット変換を２回施し、図６で示した
ように、ＬＬ、ＨＬ１、ＬＨ１、ＨＨ１、ＨＬ２、ＬＨ
２、ＨＨ２の７つのサブバンドに分解する（ステップＳ
２１３）。以降、各サブバンドの係数をＣ（Ｓ，ｘ，
ｙ）と表す。ここで、Ｓはサブバンドを表し、ＬＬ、Ｈ
Ｌ１、ＬＨ１、ＨＨ１、ＨＬ２、ＬＨ２、ＨＨ２のいず
れか１つである。また、（ｘ，ｙ）は、各サブバンドＳ
内の左上隅の係数位置を（０，０）としたときの水平方
向および垂直方向の係数位置を表す。The discrete wavelet transform unit 2001 performs two-dimensional discrete wavelet transform twice while appropriately storing the image data of the tile Ti input from the tile division unit 103 in an internal buffer (not shown). As shown, LL, HL1, LH1, HH1, HL2, LH
2, decomposed into 7 subbands of HH2 (step S
213). After that, the coefficient of each subband is C (S, x,
y). Here, S represents a subband, and LL, H
Any one of L1, LH1, HH1, HL2, LH2, and HH2. Further, (x, y) is each subband S
The coefficient position in the horizontal direction and the vertical direction when the coefficient position in the upper left corner of the inside is (0, 0).

【００８０】本実施形態における画像符号化装置では、
Ｎ個の１次元信号ｘ（ｎ），（ｎ＝０，・・・，Ｎ−１）
に対する１次元離散ウェーブレット変換は、第１の実施
形態における離散ウェーブレット変換部１０７と同様に
（１）、（２）式によって行われるものとする。In the image coding apparatus according to this embodiment,
N one-dimensional signals x (n), (n = 0, ..., N-1)
It is assumed that the one-dimensional discrete wavelet transform for is performed by the equations (1) and (2) similarly to the discrete wavelet transform unit 107 in the first embodiment.

【００８１】一方、画像縮小部１０４では、タイル分割
部１０３から出力されるタイルＴｉについて、水平方向
・垂直方向ともに１／２の大きさの縮小タイルＳＴｉを
生成する（ステップＳ２１４）。画像縮小部１０４にお
ける画像縮小処理は、第１の実施形態と同様である。ま
た、縮小タイルＳＴｉ内の水平方向のｘ番目、垂直方向
のｙ番目の画素値を、ＳＴｉ（ｘ，ｙ），（ｉ＝０，・・
・，Ｎ，０≦ｘ＜ＴＷ／２，０≦ｙ＜ＴＨ／２）と表
す。尚、ＳＴｉ（０，０）は、縮小タイルＳＴｉの左上
隅の画素値である。On the other hand, the image reduction unit 104 generates a reduced tile STi that is 1/2 the size of the tile Ti output from the tile division unit 103 in both the horizontal and vertical directions (step S214). The image reduction processing in the image reduction unit 104 is the same as that in the first embodiment. In addition, the horizontal x-th pixel value and the vertical y-th pixel value in the reduced tile STi are represented by STi (x, y), (i = 0, ...
, N, 0 ≦ x <TW / 2, 0 ≦ y <TH / 2). Note that STi (0,0) is the pixel value at the upper left corner of the reduced tile STi.

【００８２】そして、離散ウェーブレット変換部２００
２は画像縮小部１０４から入力される縮小タイルＳＴｉ
を不図示の内部バッファに適宜格納しながら２次元の離
散ウェーブレット変換を１回施し、図５で示したよう
に、ＬＬ、ＨＬ１、ＬＨ１、ＨＨ１の４つのサブバンド
に分解する（ステップＳ２１５）。以降、各サブバンド
の係数をＣ（Ｓ，ｘ，ｙ）と表す。ここで、Ｓはサブバ
ンドを表し、ＬＬ、ＨＬ１、ＬＨ１、ＨＨ１のいずれか
１つである。また、（ｘ，ｙ）は、各サブバンドＳ内の
左上隅の係数位置を（０，０）としたときの、水平方向
及び垂直方向の係数位置を表す。Then, the discrete wavelet transform unit 200
2 is a reduced tile STi input from the image reduction unit 104.
Is appropriately stored in an internal buffer (not shown), the two-dimensional discrete wavelet transform is performed once, and as shown in FIG. 5, it is decomposed into four subbands LL, HL1, LH1, and HH1 (step S215). Hereinafter, the coefficient of each subband is represented as C (S, x, y). Here, S represents a subband, and is any one of LL, HL1, LH1, and HH1. Further, (x, y) represents the coefficient position in the horizontal direction and the vertical direction when the coefficient position at the upper left corner in each subband S is (0, 0).

【００８３】本画像符号化装置では、Ｎ個の１次元信号
ｘ（ｎ），（ｎ＝０，・・・，Ｎ−１）に対する１次元離
散ウェーブレット変換は、第２の実施形態における離散
ウェーブレット変換部１４０４と同様に、（５）、
（６）式によって行われるものとする。そして、離散ウ
ェーブレット変換部２００２で生成されたＬＬ、ＨＬ
１、ＬＨ１、ＨＨ１の各サブバンドの係数は、第１の実
施形態と同様に、変換係数量子化部１０８によって量子
化される（ステップＳ２１６）。尚、ステップＳ２１４
の縮小画像の生成からステップＳ２１６の量子化までの
処理は、ステップＳ２１３の離散ウェーブレット変換と
同時、あるいはその前後であってもよい。In this image coding apparatus, the one-dimensional discrete wavelet transform for N one-dimensional signals x (n), (n = 0, ..., N-1) is performed by the discrete wavelet in the second embodiment. Similar to the conversion unit 1404, (5),
It is assumed that this is performed by the equation (6). Then, the LL and HL generated by the discrete wavelet transform unit 2002
The coefficients of the subbands of 1, LH1, and HH1 are quantized by the transform coefficient quantization unit 108 as in the first embodiment (step S216). Incidentally, step S214
The processing from the generation of the reduced image to the quantization in step S216 may be performed simultaneously with or before or after the discrete wavelet transform in step S213.

【００８４】次に、第１の実施形態と同様に、領域情報
入力部１０２から入力された高解像度符号化領域を指定
する高解像度符号化領域情報から、領域判定部１０５に
おいて着目タイルＴｉについての判定結果Ｆ（Ｔｉ）が
生成される（ステップＳ２１７）。尚、領域情報入力部
１０２からの高解像度符号化領域情報の入力は、画像デ
ータの入力と同時、あるいはその前後であってもよい。
また、領域の判定結果Ｆ（Ｔｉ）の生成は、画像データ
からタイルの分割後であれば各領域の離散ウェーブレッ
ト変換と同時、あるいはその前後であってもよい。Next, as in the first embodiment, from the high resolution coded area information that specifies the high resolution coded area input from the area information input unit 102, the area determination unit 105 determines the tile Ti of interest. The determination result F (Ti) is generated (step S217). The high-resolution encoded area information may be input from the area information input unit 102 at the same time as the input of the image data, or before or after the input.
Further, the region determination result F (Ti) may be generated at the same time as or before or after the discrete wavelet transform of each region as long as the tile is divided from the image data.

【００８５】次に、セレクタ２００３は、着目するタイ
ルＴｉについて、領域判定情報Ｆ（Ｔｉ）に基づいて、
離散ウェーブレット変換部２００１で生成される変換係
数値、または変換係数量子化部１０８で生成される量子
化された係数値を選択して出力する（ステップＳ２１
８）。すなわち、タイルＴｉの領域判定情報Ｆ（Ｔｉ）
が０の場合は、タイルＴｉが文字領域等の高解像度符号
化領域を含まないとして量子化された係数値を選択し、
Ｆ（Ｔｉ）が１の場合は、タイルＴｉが高解像度符号化
領域を含むとして変換係数値を出力する。Next, the selector 2003 determines the tile Ti of interest based on the area determination information F (Ti).
The transform coefficient value generated by the discrete wavelet transform unit 2001 or the quantized coefficient value generated by the transform coefficient quantization unit 108 is selected and output (step S21).
8). That is, the area determination information F (Ti) of the tile Ti
Is 0, the tile Ti selects a quantized coefficient value that does not include a high-resolution encoded area such as a character area,
When F (Ti) is 1, it is determined that the tile Ti includes the high resolution coding area, and the transform coefficient value is output.

【００８６】セレクタ２００３で選択して出力された変
換係数値または量子化された係数値は、第１の実施形態
と同様に、ビットプレーン符号化部１０９、符号列形成
部１１０、符号出力部１１１において符号化されて外部
に出力される（ステップＳ２１９）。The transform coefficient value or the quantized coefficient value selected and output by the selector 2003 is the same as in the first embodiment, that is, the bit plane coding unit 109, the code string forming unit 110, and the code output unit 111. Is encoded and output to the outside (step S219).

【００８７】すなわち、本発明による画像符号化装置
は、画像を所定の領域に分割して小画像を生成する分割
手段（タイル分割部１０３）と、分割された小画像を周
波数変換し、所定数のサブバンドに分解する第１の分解
手段（離散ウェーブレット変換部２００１）と、分割さ
れた各領域についての領域情報を入力する入力手段（領
域情報入力部１０２、領域判定部１０５）と、分割され
た各領域について、所定縮小率によって縮小画像を生成
する生成手段（画像縮小部１０４）と、分割された各領
域の領域情報に基づいて、各領域の縮小画像を周波数変
換し、所定数のサブバンドに分解する第２の分解手段
（離散ウェーブレット変換部２００２）と、分解された
各サブバンドの変換係数を符号化する符号化手段（ビッ
トプレーン符号化部１０９）とを備えることを特徴とす
る。That is, the image coding apparatus according to the present invention divides an image into predetermined regions to generate a small image (tile dividing unit 103), and frequency-converts the divided small images into a predetermined number. First decomposing means (discrete wavelet transform section 2001) for decomposing into sub-bands and input means (area information input section 102, area determining section 105) for inputting area information about each divided area, For each region, a generation unit (image reduction unit 104) that generates a reduced image at a predetermined reduction ratio, and frequency conversion of the reduced image of each region based on the region information of each divided region, a predetermined number of sub-images. Second decomposing means (discrete wavelet transform section 2002) for decomposing into bands, and encoding means (bit plane encoding section 1 for encoding decomposing transform coefficients of each subband) Characterized in that it comprises 9) and.

【００８８】また、本発明による画像符号化装置は、分
割された所定領域についての分解されたサブバンドの変
換係数のうち、所定の係数を量子化する量子化手段（変
換係数量子化部１０８）をさらに備えることを特徴とす
る。Further, the image coding apparatus according to the present invention is a quantizing means (transform coefficient quantizing unit 108) for quantizing a predetermined coefficient of the transform coefficients of the decomposed subbands of the divided predetermined area. Is further provided.

【００８９】本実施形態の場合、領域判定結果に応じて
離散ウェーブレット変換や量子化によって得られるデー
タの選択方法を変えており、文字領域については可逆の
符号化を可能とするものである。In the case of the present embodiment, the method of selecting the data obtained by the discrete wavelet transform or the quantization is changed according to the area determination result, and the character area can be reversibly encoded.

【００９０】＜その他の実施形態＞尚、本発明は、上述
した実施形態に限定されるものではない。例えば、上述
した第１〜第３の実施形態においては、（１）、（２）
式、または（３）、（４）式による離散ウェーブレット
変換を用いた符号化の例について示したが、離散ウェー
ブレット変換については、本実施形態で使用したものに
限定されるものではない。例えば、フィルタの種類や適
応方法を変えてもよい。例えば、９／７フィルタ等よ
り、タップ数の長いフィルタに変えてもよく、低周波サ
ブバンド以外にも２次元離散ウェーブレット変換を繰り
返し適用してもよい。<Other Embodiments> The present invention is not limited to the above-described embodiments. For example, in the above-described first to third embodiments, (1), (2)
Although the example of the encoding using the discrete wavelet transform by the equation or the equations (3) and (4) has been shown, the discrete wavelet transform is not limited to the one used in the present embodiment. For example, the type of filter and the adaptation method may be changed. For example, instead of the 9/7 filter or the like, a filter having a long tap number may be used, and the two-dimensional discrete wavelet transform may be repeatedly applied to other than the low frequency subband.

【００９１】また、係数の符号化方式としてＱＭ−Ｃｏ
ｄｅｒを用いたビットプレーン符号化方式を示したが、
上述の実施の形態に限定されるものではない。例えば、
ＭＱ−Ｃｏｄｅｒ等のＱＭ−Ｃｏｄｅｒ以外の算術符号
化方法を適用してもよく、ＭＥＬＣＯＤＥ等のその他の
２値符号化方式を適用してもよい。さらに、ビットプレ
ーンを着目係数の近傍係数の状態に応じて、複数のサブ
ビットプレーンに分類し、複数回のパスで符号化しても
よい。さらにまた、Ｇｏｌｏｍｂ符号等を適用して、係
数を２値に分解することなく、多値のままエントロピー
符号化してもよい。QM-Co is used as a coefficient coding method.
Although the bit plane coding method using der is shown,
The present invention is not limited to the above embodiment. For example,
An arithmetic coding method other than QM-Coder such as MQ-Coder may be applied, or another binary coding method such as MELCODE may be applied. Furthermore, the bit plane may be classified into a plurality of sub-bit planes according to the state of the coefficient near the coefficient of interest, and may be encoded by a plurality of passes. Furthermore, the Golomb code or the like may be applied to entropy-encode the multi-value as it is without dividing the coefficient into binary.

【００９２】さらにまた、説明を簡単にするために、上
述した実施形態では、サブバンド単位のビットプレーン
符号化について説明したが、ランダムアクセス性を高め
るために、各サブバンドを更に小ブロックに分割して、
この小ブロック単位にビットプレーン符号化を適用して
もよい。さらにまた、符号列の形成にあたっては、受信
側で徐々に解像度を上げて画像を復元できるように並べ
たが、これに限らず、徐々に画質が向上するように値の
大きな係数から順に並べて符号列を形成しても構わな
い。Furthermore, in order to simplify the explanation, in the above-mentioned embodiment, bit-plane coding in sub-band units was explained, but in order to improve random accessibility, each sub-band is further divided into small blocks. do it,
Bit plane coding may be applied in units of this small block. Furthermore, when forming the code string, the codes are arranged so that the image can be restored by gradually increasing the resolution on the receiving side, but the present invention is not limited to this, and codes are arranged in order from the coefficient with the largest value so that the image quality gradually improves. Rows may be formed.

【００９３】尚、上述した実施形態では、文字・写真領
域が画像中に混在する画像データを入力することによっ
て、文字領域を高解像度が必要な領域とし、写真領域を
その他の領域としたが、この領域区分については文字や
写真の他の区分を用いて任意に設定してもよい。In the above-described embodiment, by inputting the image data in which the character / photo area is mixed in the image, the character area is set as an area requiring high resolution, and the photo area is set as the other area. This area division may be arbitrarily set using other divisions such as characters and photographs.

【００９４】尚、本発明は、複数の機器（例えば、ホス
トコンピュータ、インタフェース機器、リーダ、プリン
タ等）から構成されるシステムに適用しても、一つの機
器からなる装置（例えば、複写機、ファクシミリ装置
等）に適用してもよい。Even if the present invention is applied to a system composed of a plurality of devices (eg, host computer, interface device, reader, printer, etc.), a device composed of one device (eg, copying machine, facsimile). Device).

【００９５】また、本発明の目的は、前述した実施形態
の機能を実現するソフトウェアのプログラムコードを記
録した記録媒体（または記憶媒体）を、システムあるい
は装置に供給し、そのシステムあるいは装置のコンピュ
ータ（またはＣＰＵやＭＰＵ）が記録媒体に格納された
プログラムコードを読み出し実行することによっても、
達成されることは言うまでもない。この場合、記録媒体
から読み出されたプログラムコード自体が前述した実施
形態の機能を実現することになり、そのプログラムコー
ドを記録した記録媒体は本発明を構成することになる。
また、コンピュータが読み出したプログラムコードを実
行することにより、前述した実施形態の機能が実現され
るだけでなく、そのプログラムコードの指示に基づき、
コンピュータ上で稼働しているオペレーティングシステ
ム（ＯＳ）などが実際の処理の一部または全部を行い、
その処理によって前述した実施形態の機能が実現される
場合も含まれることは言うまでもない。Further, an object of the present invention is to supply a recording medium (or storage medium) recording a program code of software for realizing the functions of the above-described embodiment to a system or apparatus, and to supply a computer of the system or apparatus ( Alternatively, the CPU or MPU) reads and executes the program code stored in the recording medium,
It goes without saying that it will be achieved. In this case, the program code itself read from the recording medium realizes the functions of the above-described embodiments, and the recording medium recording the program code constitutes the present invention.
Further, by executing the program code read by the computer, not only the functions of the above-described embodiments are realized, but also based on the instruction of the program code,
An operating system (OS) running on the computer does some or all of the actual processing,
It goes without saying that the processing includes the case where the functions of the above-described embodiments are realized.

【００９６】さらに、記録媒体から読み出されたプログ
ラムコードが、コンピュータに挿入された機能拡張カー
ドやコンピュータに接続された機能拡張ユニットに備わ
るメモリに書込まれた後、そのプログラムコードの指示
に基づき、その機能拡張カードや機能拡張ユニットに備
わるＣＰＵなどが実際の処理の一部または全部を行い、
その処理によって前述した実施形態の機能が実現される
場合も含まれることは言うまでもない。Further, after the program code read from the recording medium is written in the memory provided in the function expansion card inserted into the computer or the function expansion unit connected to the computer, based on the instruction of the program code, , The CPU provided in the function expansion card or the function expansion unit performs some or all of the actual processing,
It goes without saying that the processing includes the case where the functions of the above-described embodiments are realized.

【００９７】本発明を上記記録媒体に適用する場合、そ
の記録媒体には、先に説明したフローチャートに対応す
るプログラムコードが格納されることになる。When the present invention is applied to the above recording medium, the recording medium stores the program code corresponding to the above-mentioned flowchart.

【００９８】[0098]

【発明の効果】以上説明したように、本発明によれば、
文字領域や写真領域等の必要な解像度レベルが異なる領
域を含む混在画像を、高画質の画像が復元可能な状態で
効率よく符号化することができる。As described above, according to the present invention,
It is possible to efficiently encode a mixed image including regions having different required resolution levels such as a character region and a photo region in a state in which a high-quality image can be restored.

[Brief description of drawings]

【図１】本発明の第１の実施形態による画像を符号化す
るための画像符号化装置の構成を示すブロック図であ
る。FIG. 1 is a block diagram showing a configuration of an image encoding device for encoding an image according to a first embodiment of the present invention.

【図２】複数のタイルに分割される画像データを説明を
するための図である。FIG. 2 is a diagram for explaining image data divided into a plurality of tiles.

【図３】本実施形態による符号化対象の画像データと高
解像度符号化領域を説明するための概要図である。FIG. 3 is a schematic diagram for explaining image data to be encoded and a high-resolution encoded area according to the present embodiment.

【図４】上述した方法によって画像データのタイルに対
して２つの判定結果を出力する場合の一例を示す図であ
る。FIG. 4 is a diagram showing an example of a case where two determination results are output for a tile of image data by the method described above.

【図５】タイルＳＴｉを４つのサブバンドに分解する場
合を説明するための概要図である。FIG. 5 is a schematic diagram for explaining a case where a tile STi is decomposed into four subbands.

【図６】タイルＴｉを７つのサブバンドに分解する場合
を説明するための概要図である。FIG. 6 is a schematic diagram for explaining a case where tile Ti is decomposed into seven subbands.

【図７】符号化対象画像に対して２次元の離散ウェーブ
レット変換の適用例を説明するための概要図である。FIG. 7 is a schematic diagram for explaining an application example of a two-dimensional discrete wavelet transform to an encoding target image.

【図８】２次元のウェーブレット変換を２回繰り返して
行った場合の、サブバンドの分割例を示す図である。FIG. 8 is a diagram showing an example of sub-band division when two-dimensional wavelet transform is repeated twice.

【図９】サブバンドＳを符号化するビットプレーン符号
化部１０９の動作手順を説明するためのフローチャート
である。FIG. 9 is a flowchart for explaining an operation procedure of a bit plane encoding unit 109 that encodes a subband S.

【図１０】符号列形成部１１０において生成される符号
列の構造を示す図である。10 is a diagram showing the structure of a code string generated by the code string forming unit 110. FIG.

【図１１】Ｆ（Ｔｉ）＝０の場合のタイルＳＴｉの符号
化データの構成を示す概要図である。FIG. 11 is a schematic diagram showing a configuration of encoded data of a tile STi when F (Ti) = 0.

【図１２】Ｆ（Ｔｉ）＝１の場合のタイルＴｉの符号化
データの構成を示す概要図である。FIG. 12 is a schematic diagram showing a configuration of encoded data of tile Ti when F (Ti) = 1.

【図１３】第１の実施形態による画像符号化装置の動作
手順を説明するためのフローチャートである。FIG. 13 is a flowchart for explaining an operation procedure of the image encoding device according to the first embodiment.

【図１４】本発明の第２の実施形態による画像符号化装
置の構成を示すブロック図である。FIG. 14 is a block diagram showing a configuration of an image encoding device according to a second embodiment of the present invention.

【図１５】図１４に示される第２の実施形態による画像
符号化装置の動作手順を説明するためのフローチャート
である。FIG. 15 is a flowchart for explaining an operation procedure of the image encoding device according to the second embodiment shown in FIG.

【図１６】Ｆ（Ｔｉ）＝０の場合に選択される最下位ビ
ットプレーンＬＢＰ（Ｓ）の一例を示す図である。FIG. 16 is a diagram showing an example of the least significant bit plane LBP (S) selected when F (Ti) = 0.

【図１７】Ｆ（Ｔｉ）＝１の場合に選択される最下位ビ
ットプレーンＬＢＰ（Ｓ）の一例を示す図である。FIG. 17 is a diagram showing an example of the least significant bit plane LBP (S) selected when F (Ti) = 1.

【図１８】符号列形成部１４０３において生成されるＦ
（Ｔｉ）＝０の場合のタイルＳＴｉの符号化データの構
成を示す概要図である。FIG. 18 is an F generated in a code string forming unit 1403.
It is a schematic diagram which shows the structure of the encoded data of the tile STi when (Ti) = 0.

【図１９】符号列形成部１４０３において生成されるＦ
（Ｔｉ）＝１の場合のタイルＴｉの符号化データの構成
を示す概要図である。FIG. 19 is an F generated in a code string forming unit 1403.
It is a schematic diagram which shows the structure of the encoded data of tile Ti in case of (Ti) = 1.

【図２０】本発明の第３の実施形態による画像符号化装
置の構成を示すブロック図である。FIG. 20 is a block diagram showing a configuration of an image encoding device according to a third embodiment of the present invention.

【図２１】図２０に示される構成の画像符号化装置の動
作手順を説明するためのフローチャートである。FIG. 21 is a flowchart for explaining an operation procedure of the image encoding device having the configuration shown in FIG.

[Explanation of symbols]

１０１画像入力部１０２領域情報入力部１０３タイル分割部１０４画像縮小部１０５、１４０１領域判定部１０６、２００３セレクタ１０７、１４０４、２００１、２００２離散ウェーブ
レット変換部１０８変換係数量子化部１０９ビットプレーン符号化部１１０、１４０３符号列形成部１４０２最下位ＢＰ決定部Reference Signs List 101 image input unit 102 region information input unit 103 tile division unit 104 image reduction unit 105, 1401 region determination unit 106, 2003 selectors 107, 1404, 2001, 2002 discrete wavelet transform unit 108 transform coefficient quantization unit 109 bit plane coding unit 110, 1403 code string forming unit 1402 lowest BP determining unit

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5C059 MA24 MC11 MC38 PP01 PP20 PP24 PP28 RB02 RB09 SS06 SS11 TA80 TC34 TD06 TD11 UA02 UA15 5C076 AA22 CA02 5C078 AA04 BA53 CA01 DA01 5J064 AA02 BA16 BB13 BC14 BC16 BC22 BC25 5L096 FA06 FA54 GA51 ─────────────────────────────────────────────────── ─── Continued front page F term (reference) 5C059 MA24 MC11 MC38 PP01 PP20 PP24 PP28 RB02 RB09 SS06 SS11 TA80 TC34 TD06 TD11 UA02 UA15 5C076 AA22 CA02 5C078 AA04 BA53 CA01 DA01 5J064 AA02 BA16 BB13 BC14 BC16 BC22 BC25 5L096 FA06 FA54 GA51

Claims

[Claims]

1. A dividing unit that divides an image into predetermined regions to generate a small image, an input unit that inputs region information about each divided region, and a predetermined reduction ratio for each divided region. A generation unit that generates a reduced image, a decomposition unit that frequency-converts a small image or a reduced image for each region based on the region information of each divided region, and a decomposition unit that decomposes the image into a predetermined number of subbands. An image coding apparatus, comprising: a coding unit that codes the transform coefficient of each subband.

2. Dividing means for dividing an image into a predetermined area to generate a small image; first dividing means for frequency-converting the divided small image to decompose into a predetermined number of subbands; Input means for inputting area information for each area, generation means for generating a reduced image at a predetermined reduction ratio for each divided area, and reduction for each area based on the area information for each divided area. An image coding apparatus comprising: a second decomposing means for frequency-converting an image to decompose it into a predetermined number of subbands; and an encoding means for encoding the decomposed transform coefficient of each subband.

3. The input unit calculates, for each pixel in the divided area, a determination unit for determining whether or not the pixel is an edge pixel, and a calculation for calculating a sum of the number of pixels determined to be an edge pixel in the area. And a generation unit that compares the calculated number of edge pixels with a predetermined threshold value to generate region information of the region, and inputs the generated region information. The image encoding device according to 1 or 2.

4. The image coding apparatus according to claim 1, wherein the means for frequency-converting the image to decompose it into a predetermined number of subbands uses a discrete wavelet transform. .

5. A quantizing means for quantizing a predetermined coefficient among the transform coefficients of the decomposed subbands for the divided predetermined area is further provided.
The image coding apparatus according to any one of items 1 to 4.

6. The image encoding apparatus according to claim 1, further comprising an attaching unit that attaches predetermined header information to the encoded bit string.

7. A determination means for determining the least significant bit of a transform coefficient of an encoded subband in each area based on the input area information is provided, wherein the assigning means determines the least significant bit of the subband of each area. 7. The image encoding apparatus according to claim 6, wherein encoded data is obtained by using a bit string having a least significant bit determined from the bit string of coefficients.

8. A dividing step of dividing an image into predetermined areas to generate a small image, a generating step of generating a reduced image at a predetermined reduction ratio for each divided area, and a dividing step for each divided area. An acquisition step of acquiring area information, a decomposition step of frequency-converting a small image or reduced image for each area based on the area information of each divided area, and decomposition into a predetermined number of subbands, and decomposition And an encoding step of encoding the transform coefficient of each subband.

9. A dividing step of dividing an image into predetermined areas to generate a small image, an acquisition step of acquiring area information about each of the divided areas, and an area information of each of the divided areas. , Frequency-converting a small image in each region and dividing it into a predetermined number of subbands
And the generation step of generating a reduced image with a predetermined reduction ratio for each divided area, and the reduced image of each area is frequency-converted based on the area information of each divided area, and a predetermined number of An image coding method comprising: a second decomposition step of decomposing into sub-bands; and an encoding step of encoding the decomposed transform coefficients of each sub-band.

10. The determining step of determining whether each pixel in the divided area is an edge pixel, and the calculation step of calculating the sum of the number of pixels determined to be an edge pixel in the area. The image code according to claim 8 or 9, further comprising: a step, and an area generation step of comparing the calculated number of edge pixels with a predetermined threshold value to generate area information of the area. Method.

11. The image according to claim 8, wherein the step of frequency-converting the image to decompose it into a predetermined number of subbands is performed by using a discrete wavelet transform. Encoding method.

12. The method according to claim 8, further comprising a quantization step of quantizing a predetermined coefficient among the transform coefficients of the decomposed subbands of the divided predetermined area. The image coding method according to item 1.

13. The image encoding method according to claim 8, further comprising an attaching step of attaching predetermined header information to the encoded bit string.

14. The method further comprises a determining step of determining the least significant bit of the transform coefficient of the coded subband in each area based on the acquired area information, wherein the assigning step includes the subband of each area. 14. The image coding method according to claim 13, wherein the coded data is formed by using a bit string having a least significant bit determined from the bit string of the coefficient of.

15. A computer program for controlling an image encoding apparatus for encoding an image, comprising a program code of a dividing step for dividing an image into predetermined areas to generate a small image, and each divided For a region, based on the program code of the generation process that generates a reduced image at a predetermined reduction ratio, the program code of the acquisition process that acquires the region information about each divided region, and the region information of each divided region, Program the code of the decomposing process that frequency-converts the small image or reduced image for each region and decomposes it into a predetermined number of subbands, and the program code of the encoding process that encodes the decomposed transform coefficients of each subband. A computer program having.

16. A recording medium on which the computer program according to claim 15 is stored.