JP2000092489A

JP2000092489A - Image encoding device, image encoding method, and medium recording program

Info

Publication number: JP2000092489A
Application number: JP25541098A
Authority: JP
Inventors: Kensuke Uehara; 堅助上原
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1998-09-09
Filing date: 1998-09-09
Publication date: 2000-03-31

Abstract

(57)【要約】【課題】複数のオブジェクト動画像を符号化し多重化し
て通信回線に伝送する方式では、符号化過程で高アクテ
ィビティのオブジェクトが発生すると予めそのオブジェ
クトに配分したビット量より符号化量が多くなり、通信
回線の伝送容量も増えるため、コマ落しにて伝送容量を
減らすが、コマ落としは抑制したい。【解決手段】各オブジェクト(obj) に対して優先度をつ
け、高アクティビティなobj が発生したとき各obj の間
でビットを移動させ、優先度の高いobj は画質を低下さ
せずまたは向上させるべくレート制御部117 で制御す
る。これにて通信回線の伝送容量増加させないようにし
てコマ落し回数の増加を抑制する。また各obj の符号化
の際に符号化率に応じたコマ落しによる時間的歪みと量
子化精度による空間的歪みとのバランスがとれ、視覚特
性上最適な動作点で符号化処理が行える。 (57) [Summary] In a method of encoding, multiplexing and transmitting a plurality of object moving images to a communication line, when an object having high activity occurs in an encoding process, encoding is performed based on a bit amount previously allocated to the object. Since the volume increases and the transmission capacity of the communication line increases, the transmission capacity is reduced by dropping frames, but it is desired to suppress dropping frames. A priority is assigned to each object (obj), and when a high activity obj occurs, bits are moved between the objs so that a high priority obj does not reduce or improve the image quality. It is controlled by the rate control unit 117. This suppresses an increase in the number of dropped frames without increasing the transmission capacity of the communication line. Further, at the time of encoding each obj, a balance between temporal distortion due to frame drop according to the encoding rate and spatial distortion due to quantization accuracy can be achieved, and encoding processing can be performed at an operating point that is optimal in visual characteristics.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は動画像を圧縮して得
た動画像データを通信回線に送出する際に、通信回線の
伝送容量対応に伝送量を制御すべく符号発生量を制御す
る技術に係わり、特に、複数のオブジェクトの間で画品
質に優先度を持たせて所望の画質を維持できるようにし
た画像符号化装置および画像符号化方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technique for controlling a code generation amount to control a transmission amount corresponding to a transmission capacity of a communication line when transmitting moving image data obtained by compressing a moving image to a communication line. More particularly, the present invention relates to an image encoding apparatus and an image encoding method that can maintain a desired image quality by giving priority to image quality among a plurality of objects.

【０００２】[0002]

【従来の技術】従来、テレビ電話システムの一つとして
Ｈ．２６１方式では、動画像についてそのフレーム毎に
データを８×８画素のブロックとし、このブロックに対
して２次元ＤＣＴ変換（離散コサイン変換）するフレー
ム内符号化、前フレームと現フレームの同位置の画像ブ
ロックにおいて両者の相関が強いときにフレーム間の差
分をとり、その差分に対して８×８画素のブロックを２
次元ＤＣＴ変換するフレーム間符号化、前フレームから
現フレームヘ類似した動画像ブロックが相対的に隣接移
動した場合に、これを検知してその動画像ブロックの移
動量と移動方向の情報を送るのみで動画像データそのも
のを送らずに済ませることで発生データ量を減らす動き
補償、ＤＣＴ変換後の各周波数毎の係数値が低周波数領
域では値が発生するが、高周波領域では値が発生しにく
く、ゼロ値が続くことを利用したゼロ・ランレングス符
号化、データの発生量に応じてデータの量子化ステップ
幅を変更することでデータの発生量を調整する量子化、
発生頻度の高いデータ・パターンに対しては短い符号値
を、発生頻度の低いデータ・パターンに対しては長い符
号量を割り当てることで、トータル的に発生したデータ
量よりも少ないデータ量に変換する可変長符号化、等が
使用されている。2. Description of the Related Art Conventionally, H.264 has been used as one of videophone systems. In the H.261 system, for a moving image, data is divided into 8 × 8 pixel blocks for each frame, and this block is subjected to two-dimensional DCT (discrete cosine transform) intra-frame encoding. When the correlation between the two is strong in the image block, the difference between the frames is calculated.
Inter-frame coding for dimensional DCT transformation. When a moving image block similar to the current frame moves relatively adjacently from the previous frame, this is detected and only information of the moving amount and moving direction of the moving image block is sent. Motion compensation that reduces the amount of generated data by not sending the moving image data itself. The coefficient value for each frequency after DCT conversion has a value in the low frequency region, but does not easily occur in the high frequency region. Zero run-length encoding using the value that follows, quantization that adjusts the amount of data generation by changing the quantization step width of data according to the amount of data generation,
By assigning a short code value to a frequently occurring data pattern and a long code amount to a less frequently occurring data pattern, the data pattern is converted into a smaller data amount than the total generated data amount. Variable length coding, etc. are used.

【０００３】そして、可変長符号化された圧縮データは
一旦、バッファメモリに保持された後、順次読み出され
て通信回線に送出させることになるが、バッファメモリ
がオーバーフローしないようにバッファメモリのデータ
蓄積量を基に、入力動画像の変化、シーン・チェンジに
応じて適応的に、量子化ステップ幅を変え、コマ落とし
を行い、バッファメモリから出力されるビットレートを
一定にしてから、通信回線に送出される。[0003] The compressed data that has been subjected to the variable-length coding is temporarily stored in a buffer memory and then sequentially read out and sent out to a communication line. Based on the accumulated amount, the quantization step width is adaptively changed according to changes in the input moving image and scene changes, frame skipping is performed, and the bit rate output from the buffer memory is kept constant before the communication line Sent to

【０００４】このような、Ｈ．２６１方式の他にも、国
際標準の動画像符号化方式であるＭＰＥＧ１（Moving P
icture imagecoding Experts Group 1）動画像符号化方
式あるいはＭＰＥＧ２（Moving Picture imagecoding E
xperts Group 2）動画像符号化方式などがあるが、これ
らはいずれも矩形画面を対象とする符号化方式である。[0004] Such H. In addition to the H.261 system, MPEG1 (Moving P
icture imagecoding Experts Group 1) Moving picture coding scheme or MPEG2 (Moving Picture imagecoding E)
xperts Group 2) There are moving picture coding methods and the like, and these are all coding methods for rectangular screens.

【０００５】しかし近来、ＭＰＥＧ４（Moving Picture
imagecoding Experts Group 4）動画像符号化方式のよ
うに画面に含まれている像をオブジェクト単位で符号化
し、これら複数のオブジェクトの圧縮データを受信側に
伝送して表示する方式が普及しつつある。However, recently, MPEG4 (Moving Picture)
imagecoding Experts Group 4) A method of encoding an image contained in a screen in units of objects, such as a moving image encoding method, and transmitting and displaying compressed data of the plurality of objects to a receiving side is becoming widespread.

【０００６】例えば、図１９のようにテレビカメラなど
で撮影した動画像を入力して（１００）、原画像１０１
についてオブジェクト生成部１０２で領域抽出して、原
画像１０１からシーン毎に人物および背景などのオブジ
ェクトを切り抜く。そして、各オブジェクトは符号化さ
れてから（１０３ａ、１０３ｂ）、多重化部１０４で多
重化されて、通信回線１０６に送出される。すなわち、
人物像は“オブジェクト１”（１０３ａ）、背景動画像
は“オブジェクト２”（１０３ｂ）としてそれぞれ別に
符号化して多重化部１０４に送る。そして、“オブジェ
クト１”と“オブジェクト２”の符号化データはこの多
重化部１０４で多重化され、ビットストリューム１０５
のデータ形式で通信回線１０６を介して受信側の多重分
離部１０７に伝送される。受信側では多重化されたオブ
ジェクトは多重分離部１０７に送られ、この多重分離部
１０７で“オブジェクト１”と“オブジェクト２”の符
号化データは分離され、更に復号化部において“オブジ
ェクト１”（１０８ａ）と“オブジェクト２”（１０８
ｂ）として復号化される。For example, as shown in FIG. 19, a moving image captured by a television camera or the like is input (100), and an original image 101 is input.
Are extracted by the object generation unit 102, and objects such as a person and a background are cut out from the original image 101 for each scene. Then, each object is encoded (103a, 103b), multiplexed by the multiplexing unit 104, and transmitted to the communication line 106. That is,
The human image is separately encoded as “object 1” (103a) and the background moving image is separately encoded as “object 2” (103b) and sent to the multiplexing unit 104. Then, the coded data of “Object 1” and “Object 2” are multiplexed by the multiplexing unit 104, and the bit stream 105
Is transmitted to the demultiplexing unit 107 on the receiving side via the communication line 106 in the data format described above. On the receiving side, the multiplexed object is sent to the demultiplexing unit 107, where the coded data of "Object 1" and "Object 2" are separated, and further the "Object 1" ( 108a) and "Object 2" (108
b) is decoded.

【０００７】そして、各オブジェクトは複合化されて
（１０８ａ、１０８ｂ）、コンボジター部１０９で画面
上の各オブジェクトの配置位置などを考慮して、表示部
で再現画像１１０として表示される。すなわち、複合化
された“オブジェクト１”（１０８ａ）と“オブジェク
ト２”（１０８ｂ）は、コンポジタ部１０９によって元
の指定された位置に配置されて再現画像１１０として表
示される。[0007] Each object is compounded (108a, 108b), and displayed on the display unit as a reproduced image 110 in consideration of the arrangement position of each object on the screen by the convoy unit 109. That is, the composited “object 1” (108a) and “object 2” (108b) are arranged at the original designated position by the compositor unit 109 and displayed as the reproduced image 110.

【０００８】この方式の場合、原画像１０１からオブジ
ェクト単位に動画像を切り出すと、必要とするオブジェ
クトのみ受信側に送ることができる。例えば、“オブジ
ェクト１”のように重要な人物像のみ送り、“オブジェ
クト２”のように必ずしも重要でない背景動画像の伝送
を停止し、受信側で別な背景動画像に“オブジェクト
１”を重ね合わせて表示することができる。背景動画像
を送らないことにより、トータル的な伝送容量が少なく
て済む利点がある。In the case of this method, when a moving image is cut out from the original image 101 in units of objects, only necessary objects can be sent to the receiving side. For example, only an important person image such as “Object 1” is transmitted, transmission of an unimportant background moving image such as “Object 2” is stopped, and “Object 1” is superimposed on another background moving image on the receiving side. They can be displayed together. By not sending the background moving image, there is an advantage that the total transmission capacity can be reduced.

【０００９】また、背景動画像を人物動画像より解像度
を下げて送ることにより、伝送容量を低減させることも
できる。Further, the transmission capacity can be reduced by transmitting the background moving image at a lower resolution than the person moving image.

【００１０】図２０は、図１９で説明した方式をハード
ウエアとして実現するための機器構成を示した構成図で
ある。図２０において、１１２は形状抽出部、１１４
ａ，１１４ｂ，〜１１４ｎは符号化部、１１７ａ，〜１
１７ｎはレート制御部、１１９ａ，〜１１９ｎはバッフ
ァメモリ、１２１は多重化部である。FIG. 20 is a configuration diagram showing a device configuration for realizing the method described in FIG. 19 as hardware. In FIG. 20, reference numeral 112 denotes a shape extraction unit;
a, 114b, to 114n are encoding units, 117a, to 1
17n is a rate control unit, 119a to 119n are buffer memories, and 121 is a multiplexing unit.

【００１１】この構成において、テレビカメラなどの撮
像装置で撮影した入力動画像１１１は形状抽出部１１２
と複数の符号化部１１４ａ，１１４ｂ，〜１１４ｎとに
それぞれ入力される。そして、入力動画像１１１が形状
抽出部１１２に入力されることにより、必要となる複数
のオブジェクトの各形状情報１１３ａ，１１３ｂ，〜１
１３ｎがこの形状抽出部１１２によって生成される。そ
して、これらの形状情報１１３ａ，１１３ｂ，〜１１３
ｎは符号化部１１４ａ，１１４ｂ，〜１１４ｎのうち、
それぞれ所定の符号化部１１４ａ，１１４ｂ，〜１１４
ｎに送られて符号化される。符号化部は１１４ａ，１１
４ｂ，〜１１４ｎと複数あるが、これは同時並行して符
号化処理できるようにするためであり、各オブジェクト
毎に専用で符号化するためである。In this configuration, an input moving image 111 captured by an imaging device such as a television camera is
And a plurality of encoding units 114a, 114b, to 114n. Then, when the input moving image 111 is input to the shape extracting unit 112, the necessary shape information 113a, 113b,.
13n is generated by the shape extraction unit 112. Then, these pieces of shape information 113a, 113b, to 113
n is one of the encoding units 114a, 114b, to 114n.
Each of the predetermined coding units 114a, 114b, to 114
n and encoded. The encoding units are 114a and 11
Although there are a plurality of 4b and -114n, this is to enable simultaneous and parallel encoding processing, and to encode exclusively for each object.

【００１２】そして、各オブジェクトは符号化部１１４
ａ〜１１４ｎで並列に、形状情報１１３を使用して、入
力動画像１１１からそれぞれの形状情報に対応する動画
像部分を領域抽出してから符号化（符号化情報１２３）
する。Each object is encoded by the encoding unit 114.
a to 114n in parallel, using the shape information 113, extracting a region of a moving image portion corresponding to each shape information from the input moving image 111, and then encoding (encoding information 123)
I do.

【００１３】このようにして符号化部１１４ａ，１１４
ｂ，〜１１４ｎにて得られた符号化情報１２３はそれぞ
れ符号化部１１４ａ，１１４ｂ，〜１１４ｎに対応する
バッファメモリ１１９ａ，１１９ｂ，〜１１９ｎに与え
られる。In this manner, the encoding units 114a and 114
The encoded information 123 obtained by b, .about.114n is provided to buffer memories 119a, 119b,... 119n corresponding to the encoding units 114a, 114b,.

【００１４】バッファメモリ１１９ａ，１１９ｂ，〜１
１９ｎは、上述したように、出力１２０を一定の伝送レ
ートに平滑化する目的の一時保持用のメモリである。The buffer memories 119a, 119b,.
19n is a temporary holding memory for smoothing the output 120 to a constant transmission rate, as described above.

【００１５】この構成においては、バッファメモリ１１
９ａ，１１９ｂ，〜１１９ｎのデータ蓄積量の情報をそ
れぞれのバッファメモリ１１９ａ，１１９ｂ，〜１１９
ｎに対応するレート制御部１１７ａ，１１７ｂ，〜１１
７ｎに与えている。In this configuration, the buffer memory 11
9a, 119b,... 119n are stored in the respective buffer memories 119a, 119b,.
n corresponding to the rate control units 117a, 117b,.
7n.

【００１６】そして、各レート制御部１１７ａ，１１７
ｂ，〜１１７ｎは自己の受け取った前記データ蓄積量の
情報から自己の対応するバッファメモリ１１９ａ，１１
９ｂ，〜１１９ｎの出力１２０ａ，〜１２０ｎを一定レ
ートにするために、符号化部１１４ａ〜１１４ｎにフィ
ードバックして（１１５ａ，〜１１５ｎ）、符号化部１
１４ａ〜１１４ｎの量子化ステップを変化させることで
バッファメモリ１１９ａ，１１９ｂ，〜１１９ｎに入力
されるフレームデータのデータ量を制御している。Then, each of the rate control units 117a, 117
b, .about.117n are based on the information of the data storage amount received by the corresponding buffer memories 119a, 119
In order to make the outputs 120a and 120n of 9b and 119n constant, they are fed back to the encoding units 114a to 114n (115a and 115n), and the encoding unit 1
The amount of frame data input to the buffer memories 119a, 119b, and 119n is controlled by changing the quantization steps of 14a to 114n.

【００１７】そして、場合によってはバッファメモリ１
１９ａ，１１９ｂ，〜１１９ｎに送られて一時的に保持
されるフレームデータを、フレームスキップ（コマ落と
し）することにより、全体としてバッファメモリ１１９
ａ，１１９ｂ，〜１１９ｎの読み出し出力１２０ａ〜１
２０ｎの伝送レートを一定にしている。In some cases, the buffer memory 1
19a, 119b,..., And 119n, the frame data temporarily stored is subjected to frame skipping (dropping of frames), so that the buffer memory 119 as a whole is skipped.
a, 119b, to 119n read output 120a to 1
The transmission rate of 20n is fixed.

【００１８】各バッファメモリ１１９ａ，１１９ｂ，〜
１１９ｎの読み出し出力１２０ａ〜１２０ｎは多重化部
１２１で多重化されて多重化出力１２３となり、通信回
線に送出される。各オブジェクトに対応した符号化デー
タの伝送レートが一定ならば、多重化出力１２３も一定
の伝送レートに制限される。そして、多重化出力１２３
は通信回線の伝送レートに相当している。Each of the buffer memories 119a, 119b,.
The read outputs 120a to 120n of 119n are multiplexed by the multiplexing unit 121 to become multiplexed outputs 123, which are sent out to the communication line. If the transmission rate of the encoded data corresponding to each object is constant, the multiplex output 123 is also limited to a constant transmission rate. And the multiplexed output 123
Corresponds to the transmission rate of the communication line.

【００１９】また、各バッファメモリ１１９ａ，１１９
ｂ，〜１１９ｎの読み出し出力１２０ａ〜１２０ｎの伝
送レートの割り当ては、対応するオブジェクトをどの程
度圧縮するか、あるいはどの程度コマ落としを許容する
かにかかっている。Each of the buffer memories 119a, 119
The assignment of the transmission rates of the read outputs 120a to 120n of b, to 119n depends on how much the corresponding object is compressed or how much frame dropping is allowed.

【００２０】すなわち、ユーザが各オブジェクトの重要
性、または、優先度をどのように配分するかにかかって
いる。That is, it depends on how the user allocates importance or priority of each object.

【００２１】例えば、あるオブジェクトは動きを忠実に
再現したければ、そのオブジェクト全体について多少圧
縮率を高くし、できるだけコマ落としを避けるようにレ
ート制御部１１７ａ，１１７ｂ，〜１１７ｎでレート制
御を行う必要がある。For example, if it is desired to faithfully reproduce the motion of a certain object, it is necessary to slightly increase the compression ratio for the entire object and perform rate control by the rate control units 117a, 117b, to 117n so as to avoid dropping frames as much as possible. There is.

【００２２】反対に、オブジェクトの精細度を重視する
ならば、そのオブジェクトの全体の圧縮率を下げ、ある
程度のコマ落としを許容するようにレート制御部１１９
でレート制御を行う必要がある。Conversely, if importance is placed on the definition of the object, the rate control section 119 reduces the overall compression ratio of the object and allows a certain amount of frame dropping.
Need to perform rate control.

【００２３】[0023]

【発明が解決しようとする課題】一般に、高品質の動画
像データを伝送するためには、画像データを圧縮する際
に、符号化データが増えたならば、量子化する際にもデ
ータ量を増加させるようにした方が、より自然な動画像
が再現できるようになる。Generally, in order to transmit high-quality moving image data, if the amount of coded data increases when compressing image data, the amount of data is also reduced when quantizing. When the number is increased, a more natural moving image can be reproduced.

【００２４】しかし、符号化データが増加すれば、遅延
なくデータを受信側に伝送するために伝送レートを増加
させなければならず、ＡＴＭ通信のように通信容量に余
裕のある高速な可変レートの通信が必要になってくる。However, if the encoded data increases, the transmission rate must be increased in order to transmit the data to the receiving side without delay, and a high-speed variable rate having a sufficient communication capacity such as ATM communication is required. Communication becomes necessary.

【００２５】しかし、低速回線を対象としたＨ．２６１
などの動画像符号化方式は、符号化データを固定レート
の通信回線を介して伝送することが一般的である。そし
て、低伝送レートの通信回線に動画像データを伝送する
ため、動画像が細かいパターンを含む高アクティビティ
状態のときは符号化データが多くなることから、空間的
な解像度を犠牲にして、量子化の際に符号化データの増
加を抑え、あるいはコマ落しにより時間的な歪みを許容
することで、符号化データの増加も抑えるようにし、こ
れによって一定レートの通信回線で伝送している。However, H.264 for low-speed lines 261
In such a moving image encoding method, it is general that encoded data is transmitted via a fixed-rate communication line. Since video data is transmitted over a communication line with a low transmission rate, the amount of encoded data increases when the video is in a high activity state including fine patterns, so quantization is sacrificed at the expense of spatial resolution. In this case, the increase of the encoded data is suppressed, or the temporal distortion is allowed by dropping frames, so that the increase of the encoded data is also suppressed, whereby the data is transmitted through a communication line having a constant rate.

【００２６】ここで、図１９のようなＭＰＥＧ４による
オブジェクト単位の動画像符号化方式でも、通信回線が
固定レートの場合、各々のオブジェクトに割り当てられ
る伝送レート、すなわち、ビット速度も固定にして配分
し、各オブジェクトの伝送レートを加算し、全体として
の伝送レートを一定に抑えることが一般的である。ま
た、各オブジェクトに対して面積に対する配分レートの
比率に優先度を持たせることで、優先度の高いオブジェ
クトは符号量をより多く割り当てることができるように
し、優先度の低いオブジェクトより精細に表示すること
が可能となる。Here, even in the moving picture coding method for each object by MPEG4 as shown in FIG. 19, when the communication line has a fixed rate, the transmission rate assigned to each object, that is, the bit rate is also fixed and distributed. In general, the transmission rate of each object is added to keep the overall transmission rate constant. In addition, by giving a priority to the ratio of the distribution rate to the area for each object, a high-priority object can be allocated a larger amount of code, and is displayed more precisely than a low-priority object. It becomes possible.

【００２７】例えば、図１９で人物像としての“オブジ
ェクト１”に対しては背景の“オブジェクト２”より面
積に対するビット配分を多くして、人物像を背景より、
精細に表示することが可能となる。For example, in FIG. 19, for "object 1" as a person image, the bit allocation to the area is larger than that for "object 2" in the background, so that the person image is
It is possible to display finely.

【００２８】ここで、図１９による各オブジェクトのレ
ート制御は図２０のブロック構成図で示した方法で行う
ことができる。テレビカメラ等から入力された入力動画
像１１１から形状抽出部１１２で人物像および背景など
必要とするオブジェクトについて領域抽出して形状情報
１１３ａ〜１１３ｎを算出する。各オブジェクトに対応
した符号化部１１４ａ〜１１４ｎで、前記形状情報１１
３ａ〜１１３ｎを使用して、原画像１１１から対応した
オブジェクトの動画像領域を切り出し、前記領域におけ
るテキスチャ、動き補償フレーム予測および形状情報に
ついて符号化する。The rate control of each object shown in FIG. 19 can be performed by the method shown in the block diagram of FIG. A shape extraction unit 112 extracts a region of a necessary object such as a human image and a background from an input moving image 111 input from a television camera or the like, and calculates shape information 113a to 113n. The encoding units 114a to 114n corresponding to each object generate the shape information 11
Using 3a to 113n, a moving image region of the corresponding object is cut out from the original image 111, and the texture, motion compensation frame prediction, and shape information in the region are encoded.

【００２９】各符号化データ１１６ａ〜１１６ｎはバッ
ファメモリ１１９ａ〜１１９ｎに印加される。レート制
御部１１７ａ〜１１７ｎは符号化部１１４ａ〜１１４ｎ
の量子化精度を制御することで、バッファメモリ１１９
ａに入力された符号化データ１１６ａ〜１１６ｎビット
量を変えて出力データ１２０ａ〜１２０ｎが予め各オブ
ジェクトに割り当てられたレートになるように制御す
る。Each of the encoded data 116a to 116n is applied to buffer memories 119a to 119n. The rate control units 117a to 117n are encoding units 114a to 114n.
Of the buffer memory 119 by controlling the quantization precision of
By changing the bit amount of the encoded data 116a to 116n input to “a”, the output data 120a to 120n is controlled so as to have a rate previously assigned to each object.

【００３０】また、レート制御部１１７ａ〜１１７ｎは
必要ならばフレーム単位でコマ落しを行うことで、バッ
ファメモリ１１９ａ〜１１９ｎの出力１２０ａ〜１２０
ｎのレートを一定になるように制御する。そして、バッ
ファメモリ１１９ａ〜１１９ｎの出力１２０ａ〜１２０
ｎは多重化部１２１で多重化され、ビットストリューム
としての出力１２３に変換されて通信回線に送出され
る。The rate control units 117a to 117n perform frame dropping on a frame basis if necessary, so that the outputs 120a to 120n of the buffer memories 119a to 119n are output.
The rate of n is controlled to be constant. The outputs 120a-120 of the buffer memories 119a-119n
n is multiplexed by the multiplexing unit 121, converted into an output 123 as a bit stream, and transmitted to the communication line.

【００３１】図１９でレート制御部１１７ａ〜１１７ｎ
は各オブジェクトに割り当てられたレートについて、バ
ッファメモリ１１９ａ〜１１９ｎの出力１２０ａ〜１２
０ｎのレートを、予め決められたレートに抑えるように
することで、全体としてのレート、すなわち、多重化部
１２１の出力１２３を通信回線の伝送レートに適合させ
ている。In FIG. 19, the rate control units 117a to 117n
Are the outputs 120a-12 of the buffer memories 119a-119n for the rate assigned to each object.
By suppressing the rate of 0n to a predetermined rate, the overall rate, that is, the output 123 of the multiplexing unit 121 is adapted to the transmission rate of the communication line.

【００３２】ここで、従来からのＨ．２６１方式のよう
な矩形画面での表示の場合、画面全体で低アクティビテ
ィの部分から高アクティビティの部分にビット配分を移
動させることで、全体として一定レートに抑えていると
云える。Here, the conventional H.264 standard is used. In the case of a display on a rectangular screen as in the H.261 system, it can be said that the bit rate is moved from a low activity portion to a high activity portion on the entire screen, so that the overall rate is suppressed to a constant rate.

【００３３】すなわち、動きの大きい部分など高アクテ
ィビティの部分と、動きの少ない低アクティビティ部分
を自動的に抽出してビットをバランス良く配分すること
で、全体として低レートで高画質な表示を実現している
とも云える。That is, by automatically extracting a high activity portion such as a portion having a large motion and a low activity portion having a small motion and allocating bits in a well-balanced manner, a high-quality display at a low rate as a whole is realized. It can be said that.

【００３４】ところが、図１９のようにオブジェクトと
して動画像を分割した場合に、そのオブジェクトの形状
内で前記ビット配分を行わなければならない。優先度の
高いオブジェクトにはビット配分を多くしても、そのオ
ブジェクトが高アクティビティ状態になった場合、自身
のオブジェクト内で処理できなくなり、量子化精度を落
としたり、コマ落しをしたりして対処しなければならな
くなり、画質の低下につながる。However, when a moving image is divided as an object as shown in FIG. 19, the bit distribution must be performed within the shape of the object. Even if the bit allocation is increased for an object with a high priority, if the object enters a high activity state, it cannot be processed in its own object, and the quantization accuracy is reduced or frames are dropped. Must be performed, leading to a decrease in image quality.

【００３５】そして、特にこの場合、面積の小さなオブ
ジェクトにあっては、ビット配分のための余裕度が小さ
くなり、高アクティビティ状態になると直ぐに画品質が
低下することになる。In this case, in particular, in the case of an object having a small area, the margin for bit allocation is reduced, and the image quality is immediately reduced when the high activity state is reached.

【００３６】そこで、優先度の高いオブジェクトは最初
から十分に余裕を持ってビットを多く配分しておくこと
になる。しかし、このようにすると、今度は低アクティ
ビティ状態では配分ビット数の余裕があり過ぎ、必要以
上に高画質な動画像を生成することになる。Therefore, an object with a high priority has to allocate a large number of bits with a sufficient margin from the beginning. However, in this case, in the low activity state, there is too much room for the number of allocated bits, and a moving image with a higher image quality than necessary is generated.

【００３７】一方、背景などのオブジェクトは面積が広
いため、量子化精度を落したとしても全体としての配分
ビット数は多くなる。しかし、終始低アクティビティの
ために全体としてもオブジェクト内での配分ビット数の
移動は少なく、配分ビット数の変動に対する余裕があり
過ぎるとも云える。On the other hand, since objects such as backgrounds have a large area, the number of allocated bits as a whole increases even if quantization accuracy is reduced. However, the movement of the number of allocated bits in the object as a whole is small due to the low activity throughout, and it can be said that there is too much room for the fluctuation of the number of allocated bits.

【００３８】それ故、各オブジェクトに如何にして最適
配分できるようにするか、その合理的な技術手法の開発
が急がれる。Therefore, it is urgently necessary to develop a rational technical method for how to optimally allocate the objects.

【００３９】従って、本発明の目的とするところは、複
数オブジェクトの動画像を符号化した画像データを多重
化して通信回線に送出するシステムにおいて、符号化に
あたって、優先度の高いオブジェクトについては画質を
保証できるようにすると共に、視覚特性上、最適な動作
点で符号化処理が行うことができるようにして、各オブ
ジェクトについてバランスのとれた表示を可能とする画
像符号化装置および画像符号化方法を提供することにあ
る。Accordingly, an object of the present invention is to provide a system for multiplexing image data obtained by encoding moving images of a plurality of objects and transmitting the multiplexed image data to a communication line. An image encoding apparatus and an image encoding method capable of performing an assurance and performing an encoding process at an optimal operating point in terms of visual characteristics, thereby enabling a balanced display for each object. To provide.

【００４０】[0040]

【課題を解決するための手段】上記目的を達成するた
め、本発明は次のようにする。すなわち、ＭＰＥＧ４の
如き画面が複数のオブジェクトで構成される動画像を、
オブジェクト単位で符号化し、これら複数のオブジェク
トの各符号化データを多重化して通信回線に送出する符
号化装置において、オブジェクトを符号化する手段であ
って、対象とするオブジェクトについて与えられた画品
質の優先度対応に前記多重化を行うためのビットレート
の配分値とが設定され、配分されたビットレートに見合
う発生符号量で前記対象のオブジェクトを符号化する符
号化手段と、前記オブジェクトを符号化する過程で高ア
クティビティなオブジェクトが発生した場合、前記優先
度の高いオブジェクトの画品質を落さず、または向上さ
せるように前記オブジェクトの間で前記ビットレートを
再配分させるべく、符号化手段のビットレート配分を設
定制御する制御手段とを具備する。To achieve the above object, the present invention is as follows. That is, a moving image in which a screen such as MPEG4 is composed of a plurality of objects,
In an encoding device that encodes each object and multiplexes each of the encoded data of the plurality of objects and sends the multiplexed data to a communication line, the encoding device encodes the objects. A bit rate distribution value for performing the multiplexing corresponding to the priority is set, and coding means for coding the target object with a generated code amount corresponding to the allocated bit rate; and coding the object. When an object having high activity occurs in the process of performing, the bit rate of the encoding means is set so as to redistribute the bit rate among the objects so that the image quality of the high-priority object is not reduced or improved. Control means for setting and controlling the rate distribution.

【００４１】また、前記制御手段には、前記各オブジェ
クトをフレーム毎に符号化する際に、符号化率に応じた
コマ落しによる時間歪みと量子化精度による空間的な歪
みとのバランスがとれた視覚特性上最適な動作点を確保
すべく、ビットレート配分を設定制御する機能を更に具
備する。Further, when encoding the objects for each frame, the control means balances the time distortion due to frame drop according to the encoding rate and the spatial distortion due to quantization accuracy. It further has a function of setting and controlling the bit rate distribution so as to secure an optimum operating point in terms of visual characteristics.

【００４２】また、前記制御手段には、前記オブジェク
トの間で前記ビットレートを移動させる過程で、前記通
信回線に送出する符号化データ全体で時間的歪みが少な
くなるよう、前記ビットレートの配分を設定制御する機
能を更に具備する。Further, in the control means, in the process of moving the bit rate between the objects, the distribution of the bit rate is reduced so that temporal distortion is reduced in the entire coded data transmitted to the communication line. A function for setting and controlling is further provided.

【００４３】このような構成の本システムは、画面が複
数のオブジェクトで構成される動画像を、オブジェクト
単位で符号化し、これら複数のオブジェクトの各符号化
データを多重化して通信回線に送出するが、符号化にあ
たっては、オブジェクトを符号化すると共に、対象とす
るオブジェクトについて、そのオブジェクトの画品質の
優先度対応に前記多重化を行うための与えられたビット
レートの配分値相当となる発生符号量で前記対象のオブ
ジェクトを符号化する。但し、前記オブジェクトを符号
化する過程で高アクティビティなオブジェクトが発生し
た場合、前記優先度の高いオブジェクトの画品質を落さ
ず、または向上させるように前記オブジェクトの間で前
記ビットレートを再配分させるべく、符号化ステップの
ビットレート配分を設定制御する。In this system having such a configuration, a moving image whose screen is composed of a plurality of objects is encoded in units of objects, and each encoded data of the plurality of objects is multiplexed and transmitted to a communication line. In encoding, the amount of generated code is equivalent to the given bit rate distribution value for performing the multiplexing for encoding the object and performing the multiplexing with respect to the priority of the image quality of the object. Encodes the target object. However, if an object having high activity occurs in the process of encoding the object, the bit rate is redistributed among the objects so that the image quality of the high priority object is not reduced or improved. For this purpose, the bit rate distribution of the encoding step is set and controlled.

【００４４】すなわち、伝送にあたってはオブジェクト
を符号化して得たデータをバッファメモリに送り、この
バッファメモリから伝送路の許容伝送レートに従ってデ
ータを順次読み出して伝送路に送るが、画面を構成する
複数のオブジェクトについて全体として１つのバッファ
メモリでレート制御を行うことで、伝送路である通信回
線に対して一定の伝送レートに適合するように制御す
る。その際、初期状態として各オブジェクトに対して優
先度と配分ビットを決め、各オブジェクトは量子化精度
による空間的歪みとコマ落しによる時間的歪みとのバラ
ンスがとれた視覚特性上最適な動作点になるように符号
化処理を行う。そして、動画像符号化の過程で優先度が
高いオブジェクトが高アクティビティ状態になったと
き、全体として通信回線の伝送レートを遵守しながら優
先度の低いオブジェクトから配分ビットを移動させるよ
うにレート制御を行う。また、優先度の低いオブジェク
トが高アクティビティ状態になったとき該当オブジェク
トのレートを下げることで、多重化した場合に、全体と
してコマ落しを少なくする。That is, upon transmission, data obtained by encoding an object is sent to a buffer memory, and data is sequentially read out from the buffer memory according to an allowable transmission rate of the transmission line and sent to the transmission line. By performing rate control on the object as a whole with a single buffer memory, control is performed so as to conform to a constant transmission rate for a communication line as a transmission path. At that time, the priority and allocation bits are determined for each object as an initial state, and each object is set to the optimal operating point in terms of visual characteristics that balances spatial distortion due to quantization accuracy and temporal distortion due to frame dropping. The encoding process is performed so that Then, when an object with a high priority enters a high activity state in the process of moving image encoding, rate control is performed so as to move the allocation bits from the object with a low priority while observing the transmission rate of the communication line as a whole. Do. Also, when an object with a low priority enters the high activity state, the rate of the object is reduced, so that when frames are multiplexed, dropped frames are reduced as a whole.

【００４５】また、複数のオブジェクトについて符号化
の際には、オブジェクト毎に画品質を考慮しながらビッ
卜配分をして、多重化処理を行う。これにより、オブジ
ェクト全体の表示で優先度の高いオブジェクトほど高品
質の動画像表示が実現できるようになり、多重化処理の
際にオブジェクト全体としてのコマ落しの回数も減らす
こともできて、全体としてバランスのとれた符号化処理
を行うことが可能となる。When encoding a plurality of objects, multiplexing processing is performed by allocating bits while taking into account image quality for each object. As a result, the higher the priority of the object in the display of the entire object, the higher the quality of the moving image can be displayed, and the number of dropped frames of the entire object during the multiplexing process can be reduced. A balanced encoding process can be performed.

【００４６】[0046]

【発明の実施の形態】以下、本発明の実施形態につい
て、図面を参照して説明する。本発明は、複数オブジェ
クトの動画像を符号化した画像データを多重化して通信
回線に送出するシステムにおいて、オブジェクトの符号
化の際に優先度をつけ、符号化の際に全体としてコマ落
しが多くなると推定されたとき、オブジェクト間で配分
ビットを移動させ、優先度の高いオブジェクトの画品質
を維持、且つ、向上させるようにし、そして、各オブジ
ェクトの符号化処理の過程で、符号化率に応じたコマ落
しによる時間的歪みと量子化精度による空間的歪みとの
バランスがとれて、しかも、視覚特性上、最適な動作点
で符号化処理が行われるように制御することで、各オブ
ジェクトについてバランスのとれた再生画像が再生側で
表示が可能となるようにするものであり、以下、詳細を
説明する。Embodiments of the present invention will be described below with reference to the drawings. The present invention provides a system for multiplexing image data obtained by encoding moving images of a plurality of objects and transmitting the multiplexed image data to a communication line, wherein priorities are assigned when encoding objects, and a large number of dropped frames occur when encoding. When it is estimated that the object will be allocated, the allocation bits are moved between the objects to maintain and improve the image quality of the high-priority object, and in the course of the encoding process of each object, according to the encoding rate. The balance between temporal distortion due to dropped frames and spatial distortion due to quantization accuracy is achieved, and the control is performed so that the encoding process is performed at the optimal operating point in terms of visual characteristics. This makes it possible to display the reproduced image on the reproducing side, and the details will be described below.

【００４７】＜第１の実施形態＞ここでは、基本である
矩形画面での符号化制御のための符号化装置の例を説明
する。図１は本発明にかかる符号化装置の構成例を示す
ブロック図であり、また、図２は当該符号化装置におけ
るバッファメモリでの滞留量の時間的変化を示した図で
ある。<First Embodiment> Here, an example of an encoding apparatus for encoding control on a basic rectangular screen will be described. FIG. 1 is a block diagram illustrating a configuration example of an encoding device according to the present invention, and FIG. 2 is a diagram illustrating a temporal change of a staying amount in a buffer memory in the encoding device.

【００４８】図１の符号化装置は、Ｈ．２６１などを実
現する符号化器であって、符号化部１２８と、この符号
化部１２８を制御する符号化制御部１２４とから構成さ
れている。これらのうち、符号化部１２８は、動き補償
フレーム間予測部１２９と、ＤＣＴ変換部１３０と、量
子化部１３３と、第１の可変長符号化部１３５と、第２
の可変長符号化部１３１と、多重化部１３７とを含む。The encoding apparatus shown in FIG. 261 and the like, and includes an encoding unit 128 and an encoding control unit 124 that controls the encoding unit 128. Among these, the encoding unit 128 includes a motion compensation inter-frame prediction unit 129, a DCT transform unit 130, a quantization unit 133, a first variable length encoding unit 135, and a second
And a multiplexing unit 137.

【００４９】符号化部１２８の構成要素である動き補償
フレーム間予測部１２９は、１６×１６画素（マクロブ
ロック）の範囲で動き補償が可能であって、フレーム間
の誤差を算出するものであり、符号化部１２８の構成要
素であるＤＣＴ変換部１３０は、その予測誤差信号を８
×８のブロック単位でＤＣＴ変換（離散コサイン変換；
（直交変換））して空間座標データを周波数座標データ
に変換するものである。The motion-compensated inter-frame prediction unit 129, which is a component of the encoding unit 128, can perform motion compensation within a range of 16 × 16 pixels (macroblock) and calculates an error between frames. , A DCT transform unit 130, which is a component of the encoding unit 128,
DCT transform (discrete cosine transform;
(Orthogonal transformation)) to convert the spatial coordinate data into frequency coordinate data.

【００５０】また、符号化部１２８の構成要素である量
子化部１３３は、ＤＣＴ変換部１３０によってＤＣＴ変
換された変換係数を直線量子化（線形量子化）するもの
であり、符号化部１２８の構成要素である第１の可変長
符号化部１３５は、量子化部１３３によって量子化され
た変換係数をハフマン符号化するものである。The quantization unit 133, which is a component of the encoding unit 128, performs linear quantization (linear quantization) on the transform coefficients DCT-transformed by the DCT transform unit 130. The first variable length coding unit 135, which is a component, performs Huffman coding on the transform coefficient quantized by the quantization unit 133.

【００５１】また、符号化部１２８の構成要素である第
２の可変長符号化部１３１は、動き補償フレーム間予測
部１２９が動き補償に用いた動きベクトルをハフマン符
号化するものであり、符号化部１２８の構成要素である
多重化部１３７は、第１の可変長符号化部１３５で符号
化された主情報と第２の可変長符号化部１３１で符号化
されたサイド情報とを多重化して伝送フレームを構成す
るものである。また、符号化部１２８の構成要素である
バッファメモリ１３９は、多重化部１３７により多重化
された圧縮データを一旦、保持し、その後、順次読み出
して通信回線に送出させるためのものであって、バッフ
ァメモリ１３９のデータ蓄積量の情報を符号化制御部１
２４に与える構成となっている。The second variable length coding unit 131, which is a component of the coding unit 128, performs Huffman coding on the motion vector used for motion compensation by the motion compensation inter-frame prediction unit 129. A multiplexing unit 137 that is a component of the encoding unit 128 multiplexes the main information encoded by the first variable length encoding unit 135 and the side information encoded by the second variable length encoding unit 131. To form a transmission frame. A buffer memory 139, which is a component of the encoding unit 128, temporarily stores the compressed data multiplexed by the multiplexing unit 137, and then sequentially reads out the compressed data and sends it out to a communication line. The information on the amount of data stored in the buffer memory 139 is stored in the encoding control unit 1
24.

【００５２】また、符号化部１２８の構成要素であるロ
ーカル復号化部１３４は、ＤＣＴ変換部１３０の出力を
逆ＤＣＴ変換することにより復号して符号化制御部１２
４に与えるものである。The local decoding unit 134, which is a component of the encoding unit 128, decodes the output of the DCT transform unit 130 by performing an inverse DCT transform to decode the output.
4

【００５３】符号化制御部１２４は、このような構成の
符号化部１２８を制御するためのものであって、バッフ
ァメモリ１３９のデータ蓄積量の情報と、ローカル復号
化部１３４の出力する前記ＤＣＴ変換部１３０の出力を
復号した出力とを入力とし、バッファメモリ１３９がオ
ーバーフローしないようにバッファメモリ１３９のデー
タ蓄積量の情報を基に、入力動画像の変化、シーン・チ
ェンジに応じて適応的に、量子化部１３３での量子化ス
テップ幅を変えたり、コマ落としをしたり、バッファメ
モリ１３９から出力されるビットレートが一定になるよ
うに制御しながら、バッファメモリ１３９の蓄積データ
を通信回線に送出するように制御するものである。The encoding control unit 124 is for controlling the encoding unit 128 having such a configuration, and includes information on the amount of data stored in the buffer memory 139 and the DCT output from the local decoding unit 134. An output obtained by decoding the output of the conversion unit 130 is used as an input, and based on information on the amount of data stored in the buffer memory 139 so that the buffer memory 139 does not overflow, adaptively in response to a change in an input moving image or a scene change. The data stored in the buffer memory 139 is transmitted to the communication line while changing the quantization step width in the quantization unit 133, dropping frames, and controlling the bit rate output from the buffer memory 139 to be constant. It is controlled so as to be transmitted.

【００５４】このような構成において、テレビカメラな
どの撮像装置で撮影した入力動画像データ１２７を本シ
ステムに入力すると、この入力動画像データ１２７は、
符号化部１２８における動き補償フレーム間予測部１２
９に与えられる。In such a configuration, when input moving image data 127 captured by an imaging device such as a television camera is input to the present system, the input moving image data 127
Motion compensation inter-frame prediction unit 12 in encoding unit 128
9 given.

【００５５】動き補償フレーム間予測部１２９ではこの
入力動画像データ１２７と前入力動画像データとの間
で、フレーム間の誤差を算出し、予測誤差信号を求め
る。ＤＣＴ変換部１３０は、この動き補償フレーム間予
測部１２９からの予測誤差信号を８×８のブロック単位
でＤＣＴ変換して空間座標データを周波数座標データに
変換し、量子化部１３３は、このＤＣＴ変換した変換係
数を直線量子化して第１の可変長符号化部１３５に与え
る。The motion compensation inter-frame prediction section 129 calculates an error between frames between the input moving image data 127 and the previous input moving image data to obtain a prediction error signal. The DCT transform unit 130 performs DCT transform on the prediction error signal from the motion compensation inter-frame predictive unit 129 in units of 8 × 8 blocks to transform spatial coordinate data into frequency coordinate data. The converted transform coefficients are linearly quantized and provided to the first variable length coding unit 135.

【００５６】すると、第１の可変長符号化部１３５はこ
の量子化した変換係数をハフマン符号化して多重化部１
３７に与える。Then, the first variable-length coding unit 135 performs Huffman coding on the quantized transform coefficients, and
Give 37.

【００５７】一方、動き補償フレーム間予測部１２９か
らの予測誤差信号は第２の可変長符号化部１３１にも与
えられ、第２の可変長符号化部１３１はこの予測誤差信
号から動き補償に用いる動きベクトルを得てこれをハフ
マン符号化し、多重化部１３７に与える。On the other hand, the prediction error signal from the motion compensation inter-frame prediction unit 129 is also supplied to the second variable length coding unit 131, and the second variable length coding unit 131 converts the prediction error signal into motion compensation. A motion vector to be used is obtained, Huffman-encoded, and provided to the multiplexing unit 137.

【００５８】多重化部１３７は、第１の可変長符号化部
１３５で符号化された主情報と第２の可変長符号化部１
３１で符号化されたサイド情報とを多重化してバッファ
メモリ１３９に与える。このようにして、多重化部１３
７の出力は、バッファメモリ１３９を介して伝送路１４
０に送出される。The multiplexing section 137 includes the main information coded by the first variable length coding section 135 and the second variable length coding section 1.
31 is multiplexed with the side information coded at 31 and supplied to the buffer memory 139. Thus, the multiplexing unit 13
7 is transmitted to the transmission line 14 via the buffer memory 139.
Sent to 0.

【００５９】一方、ＤＣＴ変換部１３０の出力はローカ
ル復号化部１３４にも与えられ、ここで逆ＤＣＴ変換す
ることにより復号される。そして、これは符号化制御部
１２４に与えられる。そして、符号化制御部１２４はロ
ーカル復号化部１３４からの復号された情報とバッファ
メモリ１３９からのデータ蓄積量をもとに、量子化での
量子化幅を調整したり、画像のコマ落としをする等の制
御を符号化部１２８に対して実施することで、バッファ
メモリ１３９が溢れないように符号量を制御しつつ、所
定のレートで符号化データが伝送路１４０に送出される
ようにする。On the other hand, the output of DCT transform section 130 is also supplied to local decoding section 134, where it is decoded by inverse DCT transform. Then, this is given to the encoding control unit 124. Then, the encoding control unit 124 adjusts the quantization width in the quantization based on the decoded information from the local decoding unit 134 and the amount of data stored in the buffer memory 139, and performs image frame skipping. By performing such control as to the encoding unit 128, the encoded data is transmitted to the transmission path 140 at a predetermined rate while controlling the code amount so that the buffer memory 139 does not overflow. .

【００６０】ここで、図１の構成における符号化装置で
は、１単位時間当たり、１枚分のフレームが入力動画像
データ１２７として入力され、これが動き補償フレーム
予測され、更にＤＣＴ変換される等の符号化がなされ
る。画像データを符号化すると、これを復号化した場合
での復号画像には空間的歪み（ＳＮ比）が残る。Here, in the coding apparatus having the configuration shown in FIG. 1, one frame is input as input moving image data 127 per unit time, and the input moving image data 127 is subjected to motion compensation frame prediction and further subjected to DCT transformation. Encoding is performed. When image data is encoded, a spatial distortion (S / N ratio) remains in a decoded image obtained by decoding the image data.

【００６１】復号画像１２６の持つ空間的歪み（ＳＮ
比）をＤ、多重化部１３７で多重化された符号量をＲビ
ットと表すとすると、この場合、例えば、時刻ｔにおけ
る入力画像データを符号化した結果、これを復号化した
場合での復号画像１２６中の空間的歪み（ＳＮ比）はＤ
（ｔ）であり、多重化部１３７の出力１３８すなわち、
多重化部１３７で多重化されて発生するデータの符号量
はＲ（ｔ）ビットとなる。The spatial distortion (SN) of the decoded image 126
Assuming that the ratio is D and the code amount multiplexed by the multiplexing unit 137 is R bits, in this case, for example, as a result of encoding the input image data at the time t, decoding in a case where this is decoded The spatial distortion (SN ratio) in the image 126 is D
(T), and the output 138 of the multiplexing unit 137, that is,
The code amount of the data generated by multiplexing in the multiplexing unit 137 is R (t) bits.

【００６２】多重化部１３７で発生したデータはバッフ
ァメモリ１３９に蓄積された後、伝送路１４０へ送出さ
れる。但し、当該バッファメモリ１３９から伝送路１４
０へ送出される符号量は、単位時間当たりＬビット分で
ある。従って、バッファメモリ１３９での滞留量１２５
をＢ（ｔ）とすると、時刻ｔ時点の入力動画像が符号化
された場合においてのバッファメモリ１３９での蓄積情
報量Ｂ（ｔ＋１）は、Ｂ（ｔ＋１）＝Ｂ（ｔ）＋Ｒ（ｔ）−Ｌであり、また、当該時刻ｔ時点の入力動画像がコマ落し
された場合においてのバッファメモリ１３９での蓄積情
報量Ｂ（ｔ＋１）は、Ｂ（ｔ＋１）＝Ｂ（ｔ）−Ｌとなる。The data generated by the multiplexing unit 137 is stored in the buffer memory 139 and then transmitted to the transmission line 140. However, the transmission path 14
The code amount transmitted to 0 is L bits per unit time. Therefore, the amount of stay 125 in the buffer memory 139
Is B (t), the amount of information B (t + 1) stored in the buffer memory 139 when the input moving image at the time t is encoded is: B (t + 1) = B (t) + R (t) −L, and the amount of information B (t + 1) stored in the buffer memory 139 when the input moving image at the time t is dropped, is B (t + 1) = B (t) −L. .

【００６３】ここで、本システムでは、以下に示す方法
でコマ落しを行うものとする。すなわち、Ｂ（ｔ）＞Ｌのとき、“コマ落しモード” Ｌ ≧ Ｂ（ｔ）＞０のとき、“符号化モード” とし、“コマ落しモード”のときはコマ落としを実施
し、“符号化モード”のときはコマ落しは行わず、符号
化を実施する。Here, in the present system, it is assumed that frame drop is performed by the following method. That is, when B (t)> L, “frame drop mode” When L ≧ B (t)> 0, “encoding mode” is set. When “frame drop mode”, frame drop is performed, and In the "encoding mode", encoding is performed without performing frame dropping.

【００６４】図２に、本発明システムにおけるバッファ
メモリ１３９での滞留量Ｂの時間的変化の例を示す。こ
の例では、時刻ｔの時点（符号１４２で指し示す位置）
で多重化部１３７からＲ（ｔ）の符号量が発生し、これ
がバッファメモリ１３９に蓄積される。従って、バッフ
ァメモリ１３９はこの新たなＲ（ｔ）分のデータが蓄積
される結果、蓄積可能な符号量の余裕が無くなる。その
ため、時刻ｔ＋１の時点で符号化制御部１２４は“コマ
落しモード”とし、以降、符号化部１２８を当該“コマ
落しモード”で制御することとなって、時刻ｔ＋１の時
点（符号１４３で指し示す位置）、ｔ＋２の時点（符号
１４４で指し示す位置）、ｔ＋３の時点（符号１４５で
指し示す位置）ではコマ落しが行われる。そのため、バ
ッファメモリ１３９には新たなデータは蓄積されない。FIG. 2 shows an example of a temporal change of the staying amount B in the buffer memory 139 in the system of the present invention. In this example, the time point t (the position indicated by reference numeral 142)
Then, the multiplexing unit 137 generates a code amount of R (t), which is stored in the buffer memory 139. Therefore, the buffer memory 139 stores the new R (t) data, so that there is no room for the code amount that can be stored. Therefore, at time t + 1, the encoding control unit 124 sets the “frame-drop mode”, and thereafter controls the encoding unit 128 in the “frame-drop mode”, and at the time t + 1 (indicated by reference numeral 143). At the time point t + 2 (position indicated by reference numeral 144) and at the time point t + 3 (position indicated by reference numeral 145), a frame drop is performed. Therefore, no new data is stored in the buffer memory 139.

【００６５】しかし、バッファメモリ１３９からはフレ
ームのタイミング毎にＬなる符号量相当分のデータが送
り出されるので時間経過に伴って次第に滞留量Ｂが減少
する。そして時刻ｔ＋４の時点では３×Ｌ分の符号量が
無くなり、バッファメモリ１３９に十分な余裕ができ
る。そのため、当該時刻ｔ＋４の時点で符号化制御部１
２４は“符号化モード”とし、符号化部１２８を当該
“符号化モード”で制御することとなって、次の新たな
データが符号化され、バッファメモリ１３９に蓄積され
る。However, since the data corresponding to the code amount of L is transmitted from the buffer memory 139 at each frame timing, the staying amount B gradually decreases with time. At time t + 4, the code amount corresponding to 3 × L is lost, so that the buffer memory 139 has a sufficient margin. Therefore, at the time t + 4, the encoding control unit 1
Numeral 24 denotes an “encoding mode”, and the encoding unit 128 is controlled in the “encoding mode”, so that the next new data is encoded and stored in the buffer memory 139.

【００６６】このように、バッファメモリ１３９の蓄積
容量に余裕があるか否かに応じて符号化モードにした
り、コマ落としモードにしたりしてバッファメモリ１３
９がオーバフローしないように、一定のレートで符号化
データを伝送するように制御を続ける。As described above, depending on whether the storage capacity of the buffer memory 139 has a margin or not, the encoding mode or the frame drop mode is set to change the buffer memory 13.
Control is continued so that encoded data is transmitted at a constant rate so that 9 does not overflow.

【００６７】ここで、本システムでは、復号した画像の
画質に過剰品質や大きな劣化がないようバランスのとれ
た画質を維持できるように符号化制御部１２４は符号化
部１２８を制御している。これは復号動画像の時間的歪
みを示す指標を用いることによって次のようにして実現
している。Here, in the present system, the encoding control unit 124 controls the encoding unit 128 so that the image quality of the decoded image can be maintained in a balanced manner so that there is no excessive quality or large deterioration. This is realized as follows by using an index indicating the temporal distortion of the decoded moving image.

【００６８】すなわち、復号動画像の時間的歪みを示す
指標として、ある時刻ｔの入力フレームが符号化される
確率を用いる。伝送路１４０での伝送可能なビットレー
トは、単位時間当たりＬビットであり、従って、Ｒ
（ｔ）の符号量を持つ情報がある場合に、この情報を伝
送するためには、Ｒ（ｔ）／Ｌ単位時間を要する。That is, the probability that an input frame at a certain time t is encoded is used as an index indicating the temporal distortion of a decoded moving image. The bit rate that can be transmitted on the transmission line 140 is L bits per unit time.
When there is information having the code amount of (t), it takes R (t) / L unit time to transmit this information.

【００６９】従って、時刻ｔ前後での平均的な入力フレ
ームの符号化確率（以下、符号化率と呼ぶ）をＳ（ｔ）
とおくと、当該Ｓ（ｔ）は、Ｓ（ｔ）＝Ｌ／Ｒ（ｔ）（但し、時刻ｔは符号化モード）と表わすことができ
る。Therefore, the average coding probability of the input frame before and after time t (hereinafter referred to as the coding rate) is represented by S (t)
In other words, S (t) can be expressed as S (t) = L / R (t) (where time t is an encoding mode).

【００７０】ここで、入力画像を符号化する際、その入
力画像の画像データは量子化をすることになるが、その
際の量子化精度により復号動画像は変化する。すなわ
ち、量子化精度を高めれば高いＳＮ比が確保できて復号
動画像の画質が保持できるが、反面、符号化率が減少す
るので、符号量が増大する。逆に、符号量を少なくする
ために符号化率を高めれば量子化精度を下げてＳＮ比を
落とすことになり、画質が低下する。この関係を、画品
質トレードオフ関数Ｓｓ＝Ｇ［Ｄｓ］と呼ぶ。Here, when encoding the input image, the image data of the input image is quantized, but the decoded moving image changes depending on the quantization precision at that time. That is, if the quantization precision is increased, a high SN ratio can be secured and the image quality of the decoded moving image can be maintained. However, the coding rate decreases, and the code amount increases. Conversely, if the coding rate is increased in order to reduce the code amount, the quantization accuracy will be reduced and the SN ratio will be reduced, and the image quality will be reduced. This relationship is called an image quality trade-off function Ss = G [Ds].

【００７１】画品質トレードオフ関数Ｓｓ＝Ｇ［Ｄｓ］
の一例を図３に示す。なお、図３において、横軸はＳＮ
比を表し、縦軸は符号化率を表している。画品質トレー
ドオフ関数Ｓｓ＝Ｇ［Ｄｓ］は図３に符号１４８を付し
て示してあるように、符号化率が大きいとＳＮが悪く、
符号化率が小さくなるとＳＮが向上する右肩下がりの特
性曲線を描く。従って、入力フレームが大きな動きやパ
ターン等を含むとき、右下に移動し、動きが少ないとき
は左上に移動する。Image quality trade-off function Ss = G [Ds]
FIG. 3 shows an example. In FIG. 3, the horizontal axis is SN
The vertical axis indicates the coding rate. As shown by reference numeral 148 in FIG. 3, the image quality trade-off function Ss = G [Ds] has a poor SN when the coding rate is large,
A characteristic curve is drawn in which the SN increases as the coding rate decreases. Therefore, when the input frame includes a large motion, a pattern, or the like, it moves to the lower right, and when there is little motion, it moves to the upper left.

【００７２】一方、前述した符号化率と量子化精度の組
が視覚的に最適となる点（符号１４９を付して指し示す
点）が、それぞれの入力画像に対する画品質トレードオ
フ関数Ｓｓ＝Ｇ［Ｄｓ］上に１点ずつ存在する。この点
を結んだ線が目的関数Ｓｏ＝Ｏ［Ｄｏ］であり、図３に
符号１４７を付して指し示す曲線がこれである。On the other hand, the point at which the combination of the above-mentioned coding rate and quantization accuracy is visually optimal (point indicated by reference numeral 149) is the image quality trade-off function Ss = G [ Ds]. The line connecting these points is the objective function So = O [Do], and this is the curve indicated by reference numeral 147 in FIG.

【００７３】画品質トレードオフ関数を正確に求めるた
めには、量子化精度を変化させつつ、そのフレームを何
度も実際に符号化し、その都度符号化率を測定する必要
がある。しかし、処理時間の観点からこのような処理は
非現実的である。In order to accurately determine the picture quality trade-off function, it is necessary to actually encode the frame many times while changing the quantization precision, and to measure the coding rate each time. However, such processing is impractical in terms of processing time.

【００７４】そこで、現実的な手法として文献『加藤ら
の“動画像符号化方式における時空間歪み最適配分型符
号化制御アルゴリズム”電子情報通信学会論文誌ＢＶ
ｏｌ．Ｊ７１−ＢＮｏ．８Ｐ．９４５−９５４１
９８８年８月』に開示された手法があり、ここには、現
符号化フレームから画品質トレードオフ関数を決定する
２つの方法が提案されている。Therefore, as a practical method, the document “Kato et al.,“ Spatio-temporal distortion optimal allocation type coding control algorithm in video coding ”, IEICE Transactions on Communications, BV
ol. J71-B No. 8P. 945-954 1
August 988 ", which proposes two methods for determining a picture quality trade-off function from the current coded frame.

【００７５】まず、上記文献に記載された第１の方法に
ついて説明する。動き補償フレーム予測及びＤＣＴを終
えた段階で、現フレームの予測誤差信号ＤＣＴ係数、動
きベクトルが得られる。このとき、１フレーム分のＤＣ
Ｔ係数ヒストグラムおよび動きベクトル符号量から、発
生符号量Ｒ（ｑ）と、ＳＮ比Ｄ（ｑ）とが、量子化精度
ｑの関数として求められる。First, the first method described in the above document will be described. After the motion compensation frame prediction and the DCT are completed, a prediction error signal DCT coefficient and a motion vector of the current frame are obtained. At this time, DC for one frame
From the T coefficient histogram and the motion vector code amount, the generated code amount R (q) and the SN ratio D (q) are obtained as functions of the quantization accuracy q.

【００７６】従って、符号化率Ｓ（ｑ）＝Ｌ／Ｒ（ｑ）
と、ＳＮ比Ｄ（ｑ）との関係である画品質トレードオフ
関数Ｓｓ＝Ｇ［Ｄｓ］が精度よく定まる。Therefore, the coding rate S (q) = L / R (q)
And the image quality trade-off function Ss = G [Ds], which is the relationship between the image quality and the SN ratio D (q), is determined with high accuracy.

【００７７】図４は、上記文献に記載されている画品質
トレードオフ関数Ｓｓ＝Ｇ［Ｄｓ］の例を示しており、
いずれも右肩下がりではあるが、図からわかるように異
なる入力フレームに対して異なる曲線になっている。FIG. 4 shows an example of the image quality trade-off function Ss = G [Ds] described in the above document.
Each of them has a downward slope but has different curves for different input frames as can be seen from the figure.

【００７８】次に、上記文献に記載された第２の方法に
ついて説明する。当該第２の方法は、幾つかの符号化率
Ｓ（ｑ）およびＳＮ比Ｄ（ｑ）の候補を予め定めてお
き、その中から直前の符号化フレームの符号化結果と適
合する特性を選定し、その選定結果に基づいて現符号化
フレームの画品質トレードオフ関数を算出する方法であ
る。Next, the second method described in the above document will be described. In the second method, several coding rate S (q) and SN ratio D (q) candidates are determined in advance, and a characteristic matching the coding result of the immediately preceding coding frame is selected from the candidates. Then, based on the selection result, a picture quality trade-off function of the current coded frame is calculated.

【００７９】図５は、符号化率Ｓ（ｑ）を本方法により
推定する方法を示している。図５中、特性曲線に付した
数字［Ｎｏ．１］〜［Ｎｏ．６］は符号化率対量子化ス
テップサイズ特性候補の番号を表し、点Ｃは前符号化フ
レームの符号化率と量子化ステップサイズを表してい
る。図５の例では、番号［Ｎｏ．３］の特性が最も点Ｃ
に近く、番号［Ｎｏ．３］の特性を時刻ｔの符号化率Ｓ
（ｑ）と推定している。ＳＮ比Ｄｓ（ｑ）も同様の方法
で推定される。FIG. 5 shows a method for estimating the coding rate S (q) by the present method. In FIG. 5, the numbers [No. 1] to [No. 6] represents the number of the coding rate vs. quantization step size characteristic candidate, and point C represents the coding rate and the quantization step size of the previous encoded frame. In the example of FIG. 3] is the most point C
, And the number [No. 3] is changed to the coding rate S at time t.
(Q). The SN ratio Ds (q) is estimated in a similar manner.

【００８０】以上、第１の実施形態の符号化装置は、複
数オブジェクトの動画像を符号化した画像データを多重
化して通信回線に送出するシステムにおいて、オブジェ
クトの符号化の際に優先度をつけ、符号化の際に全体と
してコマ落しが多くなると推定されたとき、オブジェク
ト間で配分ビットを移動させ、優先度の高いオブジェク
トの画品質を維持、且つ、向上させるようにしたもので
あり、各オブジェクトの符号化処理の過程で、符号化率
に応じたコマ落しによる時間的歪みと量子化精度による
空間的歪みとのバランスがとれ、視覚特性上最適な動作
点で符号化処理を行うようにしたことで、各オブジェク
トについてバランスのとれた表示が可能となるものであ
る。As described above, the encoding apparatus according to the first embodiment assigns priorities when encoding objects in a system for multiplexing image data obtained by encoding moving images of a plurality of objects and transmitting the multiplexed image data to a communication line. In coding, when it is estimated that the number of dropped frames is increased as a whole, the allocation bits are moved between the objects to maintain and improve the image quality of the high-priority object. During the object encoding process, the temporal distortion due to frame dropping according to the encoding rate and the spatial distortion due to quantization accuracy are balanced, and the encoding process is performed at the optimal operating point for visual characteristics. By doing so, a balanced display can be achieved for each object.

【００８１】以上、矩形画面での符号化制御のための符
号化装置の例を説明した。次に、本発明の主要な目的で
ある複数オブジェクトにおけるレート制御について第２
の実施形態として説明する。The example of the coding apparatus for controlling the coding on the rectangular screen has been described above. Next, the second object of the rate control in a plurality of objects, which is the main object of the present invention, is described.
An embodiment will be described.

【００８２】＜第２の実施形態＞〔複数オブジェクトにおける本発明のレート制御〕図６
は複数オブジェクトにおけるレート制御を、単一のバッ
ファメモリで実施する方式の装置の構成図である。図６
において、１１２は形状抽出部、１１４ａ，１１４ｂ，
〜１１４ｎは符号化部、１１７はレート制御部、１５３
はマルチプレクサ、１１９はバッファメモリである。<Second Embodiment> [Rate Control of the Present Invention for a Plurality of Objects] FIG. 6
FIG. 1 is a configuration diagram of an apparatus of a system that performs rate control on a plurality of objects with a single buffer memory. FIG.
, 112 is a shape extraction unit, 114a, 114b,
To 114n are an encoding unit, 117 is a rate control unit, 153
Is a multiplexer, and 119 is a buffer memory.

【００８３】形状抽出部１１２は、テレビカメラなどの
撮像装置で撮影して得た入力動画像１１１から画面を構
成している各オブジェクトについてそれぞれ輪郭を元に
した領域抽出をして、各オブジェクトの形状情報１１３
ａ，１１３ｂ，〜１１３ｎをそれぞれ生成するものであ
って、１フレームの画面から複数のオブジェクト形状を
抽出するものである。また、この形状抽出部１１２には
抽出したオブジェクトの形状情報から相互の輪郭が接触
しているオブジェクト同士を組として分類し、その情報
（分類表）をレート制御部１１７に出力すると云った機
能を備えている。The shape extraction unit 112 extracts a region based on the outline of each object constituting the screen from the input moving image 111 obtained by photographing with an imaging device such as a television camera, and extracts the region of each object. Shape information 113
a, 113b,..., and 113n, respectively, for extracting a plurality of object shapes from a screen of one frame. The shape extraction unit 112 also has a function of classifying objects whose outlines are in contact with each other as a set based on the shape information of the extracted objects and outputting the information (classification table) to the rate control unit 117. Have.

【００８４】符号化部１１４ａ〜１１４ｎは前記各オブ
ジェクト毎に対応してそれぞれの受け持ちのオブジェク
トを符号化する符号化部であり、各オブジェクトに対応
した形状情報１１３ａ，１１３ｂ，〜１１３ｎを使用
し、当該形状情報の範囲の画像について、レート制御部
１１７の制御のもとに与えられる割り当て符号量（配分
されるビット量対応の符号量）となるよう、符号化する
ものである。The coding units 114a to 114n are coding units for coding the respective assigned objects corresponding to the respective objects, and use the shape information 113a, 113b, to 113n corresponding to each object. The image in the range of the shape information is encoded so as to have the assigned code amount (the code amount corresponding to the allocated bit amount) given under the control of the rate control unit 117.

【００８５】また、符号化部１１４ａ〜１１４ｎには、
前記符号化処理により得られた符号化データを復号化し
て元のデータに戻す機能を有しており、この元に戻した
データはローカル復号化データ１５２ａ〜１５２ｎとし
てレート制御部１１７に出力する機能を持つ。The encoding units 114a to 114n include:
It has a function of decoding the coded data obtained by the above-mentioned coding process and returning it to the original data, and a function of outputting the restored data to the rate control unit 117 as local decoded data 152a to 152n. have.

【００８６】マルチプレクサ１５３はデータを多重化す
るためのものであって、符号化部１１４ａ〜１１４ｎに
より出力されるオブジェクトを符号化した出力データ１
１６ａ，〜１１６ｎを受けて、これらを多重化し、バッ
ファメモリ１１９に送るものである。バッファメモリ１
１９の出力１２０が伝送路への伝送出力となる。The multiplexer 153 is for multiplexing data. The output data 1 is obtained by encoding the objects output from the encoding units 114a to 114n.
16a, .about.116n, and multiplexes them, and sends them to the buffer memory 119. Buffer memory 1
The output 120 of 19 is the transmission output to the transmission line.

【００８７】レート制御部１１７はローカル復号化デー
タ１５２ａ〜１５２ｎを受けて各オブジェクトに対する
割り当て符号量をオブジェクト毎の優先度に応じた個別
の画質と、フレーム全体の画質バランスを考慮して適宜
に配分することにより、定めると共に、バッファメモリ
１１９よりそのデータ蓄積量の情報１５４を受けて当該
蓄積量の情報１５４からバッファメモリ１１９の空き状
態を調べ、バッファメモリ１１９の出力１２０が割り当
てレートになるように、符号化部１１４ａ，〜１１４ｎ
の出力データ１１６ａ，〜１１６ｎについて、コマ落と
ししたりすると云った調整制御をするためのものであ
る。また、このレート制御部１１７には前記形状抽出部
１１２からの分類表から各組に属しているオブジェクト
をまとめてレート制御及びコマ落としを行うと云った調
整制御機能をも備えている。The rate control unit 117 receives the local decoded data 152a to 152n and appropriately allocates the code amount to be assigned to each object in consideration of the balance between the individual image quality according to the priority of each object and the image quality of the entire frame. By doing so, the data storage amount information 154 is received from the buffer memory 119, the empty state of the buffer memory 119 is checked from the storage amount information 154, and the output 120 of the buffer memory 119 is set to the assigned rate. , Encoding units 114a, to 114n
The output data 116a,..., 116n are subjected to adjustment control such as frame skipping. The rate control unit 117 also has an adjustment control function of performing rate control and frame dropping for objects belonging to each group from the classification table from the shape extraction unit 112.

【００８８】このような構成において、テレビカメラな
どの撮像装置で撮影した入力動画像１１１を本システム
に入力すると、この入力動画像１１１は、形状抽出部１
１２と符号化部１１４ａ，１１４ｂ，〜１１４ｎとに、
それぞれ与えられる。In such a configuration, when an input moving image 111 captured by an imaging device such as a television camera is input to the present system, the input moving image 111
12 and the encoding units 114a, 114b, to 114n,
Each given.

【００８９】入力動画像１１１を受けた形状抽出部１１
２は、この入力動画像１１１から画面を構成している各
オブジェクトについてそれぞれ輪郭を元にした領域抽出
をし、各オブジェクト別の形状情報１１３ａ，〜１１３
ｎを生成する。そして、符号化部１１４ａ，１１４ｂ，
〜１１４ｎに対して、生成した各オブジェクト別の形状
情報１１３ａ，〜１１３ｎのうちの、対応するオブジェ
クトの形状情報をそれぞれ与える。The shape extracting unit 11 which has received the input moving image 111
2 extracts a region from each of the objects constituting the screen from the input moving image 111 based on the outline, and obtains shape information 113a, 113 to 113 for each object.
Generate n. Then, the encoding units 114a, 114b,
To 114n, the shape information of the corresponding object among the generated shape information 113a, 113n for each object is given.

【００９０】また、形状抽出部１１２は抽出したオブジ
ェクトの形状情報から相互の輪郭が接触しているオブジ
ェクト同士を組として分類する。そして、前記各組に対
してどのオブジェクトが属しているかの分類表情報を、
レート制御部１１７に送る。The shape extracting unit 112 classifies the objects whose outlines are in contact with each other as a set based on the extracted object shape information. Then, classification table information indicating which object belongs to each of the sets,
Send to rate control section 117.

【００９１】また、入力動画像１１１を受けた符号化部
１１４ａ〜１１４ｎでは、それぞれ自己に与えられた形
状情報１１３ａ，〜１１３ｎを使用して、入力動画像１
１１から形状情報に対応する動画像部分を領域抽出し、
その抽出した動画像部分を用いて動き補償情報を求め、
この動き補償情報と前記抽出した動画像部分のテクスチ
ャ情報とを符号化処理して出力する（出力データ１１６
ａ，〜１１６ｎ）。このとき、符号化処理での発生符号
量は、レート制御部１１７により与えられた割り当て符
号量相当のビット量になるように、量子化幅を設定変更
するなどして、調整制御される（レート制御）。The encoding units 114a to 114n receiving the input moving image 111 use the input moving image 1 by using the shape information 113a to 113n respectively given to themselves.
Region extraction of a moving image portion corresponding to shape information from 11 is performed,
The motion compensation information is obtained using the extracted moving image portion,
The motion compensation information and the extracted texture information of the moving image portion are encoded and output (output data 116
a, ~ 116n). At this time, the generated code amount in the encoding process is adjusted and controlled by changing the quantization width so as to be a bit amount corresponding to the allocated code amount given by the rate control unit 117 (rate). control).

【００９２】また、符号化部１１４ａ〜１１４ｎは、前
記符号化処理により得られた符号化データを復号化して
元のデータに戻す機能を有しており、この元に戻したデ
ータはローカル復号化データ１５２ａ〜１５２ｎとして
レート制御部１１７に出力する。The encoding units 114a to 114n have a function of decoding the coded data obtained by the above-mentioned encoding process and returning the data to the original data. The data is output to the rate control unit 117 as data 152a to 152n.

【００９３】符号化部１１４ａ〜１１４ｎからの出力デ
ータ１１６ａ，〜１１６ｎはマルチプレクサ１５３に送
られてここで多重化される。そして、このマルチプレク
サ１５３にて多重化された出力１５５はバッファメモリ
１１９に与えられる。バッファメモリ１１９はこれを取
り込み、一定レートで読み出す。The output data 116a and 116n from the encoders 114a to 114n are sent to the multiplexer 153 where they are multiplexed. The output 155 multiplexed by the multiplexer 153 is supplied to the buffer memory 119. The buffer memory 119 takes in this and reads it out at a constant rate.

【００９４】この結果、バッファメモリ１１９は全ての
オブジェクトの符号化データについて平滑化することに
なり、当該バッファメモリ１１９から読み出される出力
１２０は一定レートで回線に出力されることになる。As a result, the buffer memory 119 smoothes the encoded data of all objects, and the output 120 read from the buffer memory 119 is output to the line at a constant rate.

【００９５】また、バッファメモリ１１９のデータ滞留
量の情報はレート制御部１１７に送られる。そして、レ
ート制御部１１７は前記滞留量の情報などから必要に応
じて各符号化部１１４ａ〜１１４ｎのコマ落としの制御
を行い、発生符号量を調整してバッファメモリ１１９に
入力される符号化データがオーバーフローしないように
制御する。さらに、レート制御部１１７は各符号化部１
１４ａ〜１１４ｎからのローカル復号化データ１５２ａ
〜１５２ｎを受けて各オブジェクトに対する割り当て符
号量を、オブジェクト毎の優先度に応じた個別の画質
と、フレーム全体の画質バランスを考慮して適宜に配分
し、その個別の情報を各符号化部１１４ａ〜１１４ｎの
うち、対応の符号化部に与える。Information on the amount of data stored in the buffer memory 119 is sent to the rate control unit 117. Then, the rate control unit 117 controls the frame dropping of each of the encoding units 114a to 114n as necessary based on the information of the staying amount and the like, adjusts the generated code amount, and controls the encoded data input to the buffer memory 119. Is controlled not to overflow. Further, the rate control unit 117 controls each encoding unit 1
14a-114n local decrypted data 152a
To 152n, the code amount allocated to each object is appropriately distributed in consideration of the balance between the individual image quality according to the priority of each object and the image quality of the entire frame, and the individual information is encoded by each encoding unit 114a. １１４114n to the corresponding encoding unit.

【００９６】また一方、形状抽出部１１２は抽出したオ
ブジェクトの形状情報から相互の輪郭が接触しているオ
ブジェクト同士を組として分類しており、そして、前記
各組に対してどのオブジェクトが属しているかの分類表
を作成してレート制御部１１７に与えている。そして、
レート制御部１１７はレート制御及びコマ落としの必要
が生じたときは、この分類表の情報からそれぞれ組み単
位で、その組に属しているオブジェクトをまとめてコマ
落としを行う。On the other hand, the shape extracting unit 112 classifies the objects whose outlines are in contact with each other as a set based on the extracted shape information of the object, and determines which object belongs to each set. Is created and given to the rate control unit 117. And
When the need for rate control and frame dropping arises, the rate control unit 117 collectively drops objects belonging to the group for each group from the information in this classification table.

【００９７】つまり、コマ落としにあたっては、レート
制御部１１７は分類表の情報をもとに、コマ落としを組
み単位で行うようにする。In other words, when dropping frames, the rate control unit 117 performs dropping of frames on a group basis based on the information in the classification table.

【００９８】図６の構成の符号化装置の動作の概要はこ
のようなものであるが、この符号化装置は複数オブジェ
クトにおけるレート制御を、１つのバッファメモリで実
施すると共に、複数のオブジェクトについて符号化の際
に、オブジェクト毎に画品質を考慮しながらビッ卜配分
をして、多重化処理を行うようにし、オブジェクト全体
の表示で優先度の高いオブジェクトほど高品質の動画像
表示ができ、多重化処理の際にオブジェクト全体として
のコマ落しの回数も減らすこともできるようにして、全
体としてバランスのとれた符号化処理を行うことを可能
にする点に特徴がある。The outline of the operation of the encoding apparatus having the configuration shown in FIG. 6 is as described above. This encoding apparatus implements rate control for a plurality of objects with one buffer memory, and performs encoding for a plurality of objects. At the time of multiplexing, multiplexing processing is performed by allocating bits while considering image quality for each object, and the higher the priority of the entire object is displayed, the higher the quality of the moving image can be displayed. It is characterized in that the number of dropped frames in the entire object can be reduced during the encoding process, thereby enabling a balanced encoding process as a whole.

【００９９】すなわち、複数のオブジェクトについて全
体として１つのバッファメモリでレート制御を行うこと
で、通信回線に対して一定の伝送レートに適合するよう
に制御する。初期状態として各オブジェクトに対して優
先度と配分ビットを決め、各オブジェクトは量子化精度
による空間的歪みとコマ落しによる時間的歪みとのバラ
ンスがとれた視覚特性上最適な動作点になるように符号
化処理を行う。そして、動画像符号化の過程で優先度が
高いオブジェクトが高アクティビティ状態になったと
き、全体として通信回線の伝送レートを遵守しながら優
先度の低いオブジェクトから配分ビットを移動させるよ
うにレート制御を行う。また、優先度の低いオブジェク
トが高アクティビティ状態になったとき該当オブジェク
トのレートを下げることで、多重化した場合に、全体と
してコマ落しを少なくする。That is, by controlling the rate of a plurality of objects with one buffer memory as a whole, the communication line is controlled to conform to a certain transmission rate. The priority and allocation bits are determined for each object as an initial state, and each object is set to the optimal operating point in terms of visual characteristics that balances spatial distortion due to quantization accuracy and temporal distortion due to frame dropping. Perform encoding processing. Then, when an object with a high priority enters a high activity state in the process of moving image encoding, rate control is performed so as to move the allocation bits from the object with a low priority while observing the transmission rate of the communication line as a whole. Do. Also, when an object with a low priority enters the high activity state, the rate of the object is reduced, so that when frames are multiplexed, dropped frames are reduced as a whole.

【０１００】複数のオブジェクトについて符号化の際
に、オブジェクト毎に画品質を考慮しながらビッ卜配分
をして、多重化処理を行うようにすると、オブジェクト
全体の表示で優先度の高いオブジェクトほど高品質の動
画像表示が実現でき、多重化処理の際にオブジェクト全
体としてのコマ落しの回数も減らすこともでき、全体と
してバランスのとれた符号化処理を行うことが可能とな
る。When encoding a plurality of objects, multiplexing is performed by allocating bits while taking into account image quality for each object. A high-quality moving image display can be realized, the number of dropped frames of the entire object can be reduced during the multiplexing process, and a balanced encoding process can be performed as a whole.

【０１０１】その制御の詳細を次に説明する。The details of the control will be described below.

【０１０２】第２の実施形態における符号化装置におい
ては、入力された原画像が、形状抽出部１３１において
形状抽出処理されることにより、必要なオブジェクトに
ついての領域抽出が行われ、これにより、それらオブジ
ェクトの形状が抽出される。In the encoding apparatus according to the second embodiment, the input original image is subjected to shape extraction processing by the shape extraction unit 131, thereby extracting a region of a necessary object. The shape of the object is extracted.

【０１０３】ここで一般的な動画像からオブジェクトの
輪郭を元に、該当の領域を抽出するには、例えば、特開
平６−２５１１４８号公報に記載されている如きの手法
を用いることができる。Here, in order to extract a corresponding area from a general moving image based on the contour of an object, for example, a method described in JP-A-6-251148 can be used.

【０１０４】そして、符号化部１１４ａ〜１１４ｎで、
前記領域におけるテキスチャ、動き補償フレーム予測お
よび形状情報について符号化する。そして、符号化部１
１４ａ〜１１４ｎで符号化されたテキスチャ、動き補償
フレーム予測および形状情報はマルチプレクサ１５３に
送られてここで多重化され、バッファメモリ１１９に送
られる。ここで、各符号化部１１４ａ〜１１４ｎ毎に別
々のオブジェクトを扱っており、従って、各符号化部１
１４ａ〜１１４ｎ毎にそれぞれ独立のチャネルと考える
と、マルチプレクサ１５３で多重化された出力１５５は
各チャネルのデータが多重化されたデータである。Then, in the encoding units 114a to 114n,
The texture, motion-compensated frame prediction, and shape information in the area are encoded. And the encoding unit 1
The texture, motion-compensated frame prediction, and shape information encoded by 14a to 114n are sent to the multiplexer 153, where they are multiplexed and sent to the buffer memory 119. Here, different objects are handled for each of the encoding units 114a to 114n, and accordingly,
Assuming that channels 14a to 114n are independent channels, the output 155 multiplexed by the multiplexer 153 is data obtained by multiplexing the data of each channel.

【０１０５】第１の実施形態と同様に、レート制御のた
めに、バッファメモリでの滞留量の情報がレート制御部
１１７に送られるが、第２の実施形態での本システムで
は、バッファメモリ全体での滞留量ではなく、各チャネ
ル別の滞留量とする。すなわち、バッファメモリ１１９
での各チャネル毎の現在のデータ量情報がレート制御部
１１７に送られるようになっている。As in the first embodiment, information on the amount of data stored in the buffer memory is sent to the rate control unit 117 for rate control. However, in the present system according to the second embodiment, the entire buffer memory is used. Instead of the amount of stagnation in, the amount of stagnation for each channel. That is, the buffer memory 119
The current data amount information for each channel in the above is sent to the rate control unit 117.

【０１０６】一方、レート制御部１１７は符号化部１１
４ａ〜１１４ｎから与えられたローカル復号化データ１
５２ａ〜１５２ｎと前記各チャネル別のデータ量情報か
ら各チャネルのバッファメモリ１１９の割り当てメモリ
量を算出して、算出結果から量子化精度制御情報１５１
ａ〜１５１ｎを作成し、符号化部１１４ａ〜１１４ｎに
対して当該量子化精度制御情報１５１ａ〜１５１ｎを与
える。符号化部１１４ａ〜１１４ｎは量子化精度制御情
報１５１ａ〜１５１ｎにより指定された量子化精度に設
定して符号化を行う。On the other hand, the rate control section 117
4a to 114n, local decoded data 1 given from
The amount of memory allocated to the buffer memory 119 for each channel is calculated from the data amount information for each channel and the quantization accuracy control information 151 from the calculation result.
a to 151n, and gives the quantization accuracy control information 151a to 151n to the encoding units 114a to 114n. The encoding units 114a to 114n perform encoding by setting the quantization accuracy specified by the quantization accuracy control information 151a to 151n.

【０１０７】ここで符号化部１１４の構成例について触
れておく。＜符号化部１１４の構成例＞図７は符号化部１１４ａ〜
１１４ｎの内部構成を示した構成図である。符号化部１
１４は符号化制御部１５６、符号化被制御部１５８及び
形状符号化部１５７および多重化部１６５とから構成さ
れている。Here, an example of the configuration of the encoding unit 114 will be described. <Example of Configuration of Encoding Unit 114> FIG.
It is a block diagram showing the internal configuration of 114n. Encoding unit 1
14 comprises an encoding control unit 156, an encoded controlled unit 158, a shape encoding unit 157, and a multiplexing unit 165.

【０１０８】これらのうち、符号化制御部１５６は、レ
ート制御部１１７からの量子化精度制御情報１５１をも
とに、符号化被制御部１５８での量子化処理段での量子
化幅を調整したり、コマ落とし制御したり、形状符号化
部１５７の制御をしたりするものであり、形状符号化部
１５７は形状抽出部１１２から与えられる形状情報１１
３について符号化するものであって、形状そのものの符
号化を行うものであり、符号化被制御部１５８は、符号
化制御部１５６の制御のもとに入力動画像データ１１１
中から、形状抽出部１１２の与える形状情報１１３対応
の形状領域について動き補償フレーム予測を行い、これ
を可変符号化し、また、動き補償フレーム予測情報をＤ
ＣＴ処理してＤＣＴ変換係数を得、これを量子化した
後、可変符号化し、また、ＤＣＴ変換係数を復号化して
ローカル復号化データ１５２として出力する機能を有す
る。Among these, the encoding control unit 156 adjusts the quantization width in the quantization processing stage in the encoded controlled unit 158 based on the quantization accuracy control information 151 from the rate control unit 117. The frame encoding control section 157 controls the shape encoding section 157. The shape encoding section 157 controls the shape encoding section 157.
The encoding control unit 158 encodes the input moving image data 111 under the control of the encoding control unit 156.
From among them, motion-compensated frame prediction is performed for a shape area corresponding to the shape information 113 provided by the shape extracting unit 112, and this is variably encoded.
It has a function of obtaining a DCT transform coefficient by performing a CT process, quantizing it, variably encoding it, decoding the DCT transform coefficient, and outputting it as local decoded data 152.

【０１０９】ここで、符号化被制御部１５８は、動き補
償フレーム間予測部１５９、ＤＣＴ変換部１６０、第２
の可変長符号化部１６１、量子化部１６２、ローカル復
号化部１６３、第１の可変長符号化部１６４とから構成
される。Here, the coded controlled unit 158 includes a motion compensation inter-frame predicting unit 159, a DCT transforming unit 160, a second
, A quantization unit 162, a local decoding unit 163, and a first variable length coding unit 164.

【０１１０】動き補償フレーム間予測部１５９は、形状
抽出部１１２の与える形状情報１１３対応の形状領域に
ついて１６×１６画素の範囲で動き補償が可能でフレー
ム間の誤差を算出し、動き補償フレーム予測情報を得る
ものであり、ＤＣＴ変換部１６０は、この動き補償フレ
ーム予測情報をＤＣＴ処理するものであって、具体的に
は予測誤差情報を８×８のブロック単位でＤＣＴ変換し
て空間座標データを周波数座標データに変換するもので
あり、第２の可変長符号化部１６１は、動き補償フレー
ム予測情報を可変符号化して出力するものである。The motion-compensated inter-frame predicting section 159 can perform motion compensation within a range of 16 × 16 pixels for a shape area corresponding to the shape information 113 provided by the shape extracting section 112, calculate an inter-frame error, and perform motion-compensated frame prediction. The DCT transform unit 160 performs DCT processing on the motion-compensated frame prediction information. Specifically, the DCT transform unit 160 performs DCT transform on the prediction error information in units of 8 × 8 blocks to obtain spatial coordinate data. Is converted to frequency coordinate data, and the second variable-length encoding unit 161 variably encodes the motion-compensated frame prediction information and outputs the information.

【０１１１】また、量子化部１６２は、与えられた量子
化幅でＤＣＴ変換部１６０の変換出力を量子化して出力
するものであり、第１の可変長符号化部１６４は、この
量子化出力を可変符号化して出力するものであって、具
体的には量子化した変換係数をハフマン符号化するもの
であり、ローカル復号化部１６３は、量子化部１６２に
て得たＤＣＴ変換係数を復号化してローカル復号化デー
タ１５２として出力するものである。The quantizing section 162 quantizes the transform output of the DCT transform section 160 with a given quantization width and outputs the result. The first variable length coding section 164 outputs the quantized output. Is variably encoded and output. Specifically, the quantized transform coefficient is subjected to Huffman encoding, and the local decoding unit 163 decodes the DCT transform coefficient obtained by the quantization unit 162. And outputs it as local decrypted data 152.

【０１１２】なお、上記多重化部１６５は形状符号化部
１５７と第１の可変長符号化部１６４及び第２の可変長
符号化部１６１の出力を多重化して出力データ１１６と
して出力するものである。The multiplexing section 165 multiplexes the outputs of the shape coding section 157, the first variable length coding section 164, and the second variable length coding section 161 and outputs the multiplexed data as output data 116. is there.

【０１１３】このような構成の符号化部１１４ｉ（ｉは
ａ，〜ｎのうちの一つ）は、形状抽出部１１２から与え
られる形状情報１１３をもとにオブジェクトの輪郭自体
を符号化する。すなわち、複数ある符号化部１１４ａ，
〜１１４ｎは、複数あるオブジェクトのうちのそれぞれ
特定オブジェクトを符号化処理の対象として処理する。
従って、符号化部１１４ａ，〜１１４ｎは、複数ある形
状情報１１３ａ，〜１１３ｎのうち、それぞれ異なる特
定のオブジェクトの形状情報１１３ｉ（ｉはａ，〜ｎの
うちの一つ）をもとにオブジェクトの輪郭自体を符号化
する。The encoding unit 114i (i is one of a,..., N) having such a configuration encodes the outline of the object itself based on the shape information 113 given from the shape extraction unit 112. That is, a plurality of encoding units 114a,
To 114n process a specific object among a plurality of objects as a target of the encoding process.
Therefore, the encoding units 114a to 114n determine the object based on the shape information 113i (i is one of a and n) of the different specific object from among the plurality of shape information 113a and 113n. Encode the contour itself.

【０１１４】また、符号化部１１４ｉには入力動画像１
１１が入力されており、この入力動画像１１１が符号化
被制御部１５８に入力される。Further, the input moving image 1 is
11 is input, and the input moving image 111 is input to the coded controlled unit 158.

【０１１５】符号化被制御部１５８では、入力動画像１
１１が入力されると、動き補償フレーム間予測部１５９
はこれと形状抽出部１１２からの形状情報１１３とを用
いて１６×１６画素の範囲で動き補償が可能なフレーム
間の誤差を算出し、予測誤差信号を得る。そして、ＤＣ
Ｔ変換部１６０は形状抽出部１１２からの形状情報１１
３を参照して前記予測誤差信号を８×８のブロック単位
でＤＣＴ変換することにより、空間座標データを周波数
座標データに変換し、量子化部１６２は、このＤＣＴ変
換部１６０がＤＣＴ変換した変換係数を直線量子化す
る。In the coded controlled unit 158, the input moving image 1
When 11 is input, the motion compensation inter-frame prediction unit 159
Uses this and the shape information 113 from the shape extraction unit 112 to calculate an error between frames that can be motion compensated in a range of 16 × 16 pixels, and obtain a prediction error signal. And DC
The T conversion unit 160 receives the shape information 11 from the shape extraction unit 112.
3, the spatial error data is converted into frequency coordinate data by subjecting the prediction error signal to DCT transform in units of 8 × 8 blocks, and the quantization unit 162 converts the DCT transform by the DCT transform unit 160. The coefficients are linearly quantized.

【０１１６】第１の可変長符号化部１６４は、この量子
化された変換係数を形状抽出部１１２からの形状情報１
１３を参照してハフマン符号化（テクスチャ符号化）
し、多重化部１６５に与える。The first variable length coding section 164 uses the quantized transform coefficient as the shape information 1 from the shape extraction section 112.
Huffman coding (texture coding) with reference to FIG.
Then, it is given to the multiplexing unit 165.

【０１１７】また一方、動き補償フレーム間予測部１５
９において動き補償に用いた動きベクトルを第２の可変
長符号化部１６１は形状抽出部１１２からの形状情報１
１３を参照してハフマン符号化（動き補償符号化）し、
多重化部１６５に与える。また、形状符号化部１５７は
形状抽出部１１２から与えられる形状情報１１３につい
て符号化することにより、形状そのものの符号化を行
い、多重化部１６５に与える。On the other hand, the motion compensation inter-frame prediction unit 15
9, the second variable length coding unit 161 uses the motion vector used for motion compensation in the shape information 1 from the shape extraction unit 112.
Huffman coding (motion compensation coding) with reference to FIG.
This is provided to the multiplexing unit 165. Also, the shape encoding unit 157 encodes the shape itself by encoding the shape information 113 provided from the shape extraction unit 112, and supplies the encoded information to the multiplexing unit 165.

【０１１８】すると、多重化部１６５では、第１の可変
長符号化部１６４で符号化された主情報、第２の可変長
符号化部１６１で符号化されたサイド情報および形状符
号化部１５７で符号化された形状符号化情報を多重化し
て伝送フレームを形成し、マルチプレクサ１５３へと出
力する。Then, in multiplexing section 165, main information coded in first variable length coding section 164, side information coded in second variable length coding section 161 and shape coding section 157 Are multiplexed to form a transmission frame, and output to the multiplexer 153.

【０１１９】符号化制御部１５６は、レート制御部１１
７からの量子化精度制御情報１５１をもとに、符号化被
制御部１５８での量子化処理段での量子化幅を調整した
り、コマ落とし制御したり、形状符号化部１５７の制御
をする。The encoding control section 156 is provided for the rate control section 11
7 based on the quantization accuracy control information 151 from the encoding control unit 158, adjusts the quantization width at the quantization processing stage in the encoding controlled unit 158, controls frame dropping, and controls the shape encoding unit 157. I do.

【０１２０】このように、入力動画像１１１が入力され
た符号化被制御部１５８では、符号化制御部１５６の制
御のもとに入力動画像１１１中から、形状抽出部１１２
の与える形状情報１１３対応の形状領域について動き補
償フレーム予測を行い、これを可変符号化して多重化部
１６５に与え、また、動き補償フレーム予測情報をＤＣ
Ｔ処理し、さらにこれを量子化した後、可変符号化して
多重化部１６５に与え、多重化部１６５は形状符号化部
１５７および第１の可変長符号化部１６４および第２の
可変長符号化部１６１からの出力を多重化しその多重化
出力を、マルチプレクサ１５３に送出する。As described above, in the coded control unit 158 to which the input moving image 111 is input, the shape extraction unit 112 is controlled from the input moving image 111 under the control of the coding control unit 156.
Of the shape area corresponding to the shape information 113 given by the motion compensation frame prediction, variably encodes the result, and supplies the result to the multiplexing unit 165.
After performing T processing and further quantizing this, it is variably coded and provided to a multiplexing unit 165. The multiplexing unit 165 comprises a shape coding unit 157, a first variable length coding unit 164, and a second The output from the multiplexing unit 161 is multiplexed, and the multiplexed output is sent to the multiplexer 153.

【０１２１】また、ローカル復号化部１６３では、形状
抽出部１１２から与えられる形状情報１１３を参照しな
がらＤＣＴ変換係数を逆ＤＣＴ変換することにより、復
号化してローカル復号化データ１５２として出力し、レ
ート制御部１１７に与える。Further, the local decoding section 163 performs inverse DCT transform of the DCT transform coefficient with reference to the shape information 113 given from the shape extracting section 112, thereby decoding and outputting as local decoded data 152, This is given to the control unit 117.

【０１２２】動作概要はこのようなものであり、動き補
償フレーム間予測部１５９、第２の可変長符号化部１６
１、ＤＣＴ変換部１６０、ローカル復号化部１６３およ
び第１の可変長符号化部１６４では原画像（入力動画像
１１１）について形状情報１１３により指定された領域
についてのみ処理を行う。The operation summary is as described above. The motion compensation inter-frame prediction unit 159 and the second variable length coding unit 16
1. The DCT transform unit 160, the local decoding unit 163, and the first variable length coding unit 164 perform processing only on the original image (input moving image 111) with respect to the area specified by the shape information 113.

【０１２３】形状情報の符号化、動き補償およびテキス
チャ符号化の方法は、例えば、ISO/IEC JTC1/SC29/WG11
N1796,"MPEG-4 Video Verfication Model Version 8.
0."July 1997 に詳しく記載されている。Methods for encoding shape information, motion compensation and texture encoding are described in, for example, ISO / IEC JTC1 / SC29 / WG11.
N1796, "MPEG-4 Video Verfication Model Version 8.
0. "July 1997.

【０１２４】図８（ａ）の特性グラフ１６６と図８
（ｂ）の特性グラフ１６７は、第１の実施形態での構成
における図１の単一のバッファメモリ１３９における滞
留量の例を示している。特性グラフ１６６はＢ（ｔ＋３）＝Ｂ（ｔ）＋Ｒ１（ｔ）−３・Ｌ１となり、時刻ｔ＋１とｔ＋２でコマ落しが起こってい
る。ここで、Ｌ１は単位時間当たりの伝送路へ送出され
るデータの伝送量である。The characteristic graph 166 of FIG.
The characteristic graph 167 of (b) shows an example of the staying amount in the single buffer memory 139 of FIG. 1 in the configuration according to the first embodiment. In the characteristic graph 166, B (t + 3) = B (t) + R1 (t) −3 · L1, and frames are dropped at times t + 1 and t + 2. Here, L1 is the amount of data transmitted to the transmission path per unit time.

【０１２５】同様に特性グラフ１６７はＢ（ｔ＋３）＝Ｂ（ｔ）＋Ｒ１（ｔ）−３・Ｌ２となる。Similarly, the characteristic graph 167 becomes B (t + 3) = B (t) + R1 (t) −3 · L2.

【０１２６】上記特性グラフ１６６と１６７の制御形態
を、第２の実施形態での構成における図６の並列符号化
方式のシステムに仮に適用してみる。オブジェクトは２
個と仮定すると、この場合、符号化部は１１４ａと１１
４ｂの２個の構成になり、従来のバッファメモリの制御
形態（図２０）を適用すると符号化部１１４ａと符号化
部１１４ｂについて２個のバッファメモリによる平滑化
を行う必要が生じることになる。従って、図６のシステ
ム構成の場合、バッファメモリ１１９で前記２個のバッ
ファメモリの平滑化を行う必要が生じる。この場合、前
記特性グラフ１６６と特性グラフ１６７をそのまま、線
形加算してバッファメモリ１１９で平滑化を行うと、図
８（ｃ）に示した特性グラフ１６８の如きとなり、この
特性グラフ１６８は、式で表すとＢ（ｔ＋３）＝Ｂ（ｔ）＋Ｒ１（ｔ）＋Ｒ２（ｔ）−３
・Ｌ３となる。ここで、Ｌ３はＬ３＝Ｌ１＋１２としている。
また、このＬ３は図６におけるバッファメモリ１１９の
出力１２０で単位時間当たりの伝送路へ送出されるデー
タの伝送量になる。そして、コマ落しは時刻ｔ＋１とｔ
＋２で発生している。The control modes of the characteristic graphs 166 and 167 will be temporarily applied to the parallel encoding system shown in FIG. 6 in the configuration of the second embodiment. Object is 2
In this case, the encoding units 114a and 11
4b, and applying the conventional buffer memory control mode (FIG. 20), the encoding unit 114a and the encoding unit 114b need to be smoothed by the two buffer memories. Therefore, in the case of the system configuration in FIG. 6, it is necessary to smooth the two buffer memories in the buffer memory 119. In this case, if the characteristic graphs 166 and 167 are linearly added as they are and smoothed by the buffer memory 119, a characteristic graph 168 shown in FIG. 8C is obtained. B (t + 3) = B (t) + R1 (t) + R2 (t) -3
・ It becomes L3. Here, L3 is L3 = L1 + 12.
This L3 is the amount of data transmitted to the transmission line per unit time at the output 120 of the buffer memory 119 in FIG. Then, the frames are dropped at times t + 1 and t
+2.

【０１２７】図９（ａ）は特性グラフ１６６について目
的関数（“Ｓｏ１＝Ｏ１［Ｄｏ］”）１６９と画品質ト
レードオフ関数（“Ｓｓ１＝Ｇ１［Ｄｓ］”）１７０を
示している。目的関数１６９とトレードオフ関数１７０
の交点１７１は時刻ｔで、視覚的にバランスの良い復号
画像が得られる動作点である。前記動作点については上
記の方法で求めることができる。FIG. 9A shows an objective function (“So1 = O1 [Do]”) 169 and an image quality trade-off function (“Ss1 = G1 [Ds]”) 170 for the characteristic graph 166. Objective function 169 and trade-off function 170
Is an operating point at which a visually balanced decoded image is obtained at time t. The operating point can be obtained by the above method.

【０１２８】同様に図９（ｂ）は特性グラフ１６７につ
いて目的関数（“Ｓｏ２＝Ｏ２［Ｄｏ］”）１７２と画
品質トレードオフ関数（“Ｓｓ２＝Ｇ２［Ｄｓ］”）１
７３を示している。目的関数１７２とトレードオフ関数
１７３の交点１７４は時刻ｔで、視覚的にバランスの良
い復号画像が得られる動作点である。そして、バッファ
メモリ１１９で特性グラフ１６６と１６７に従った制御
をそのまま一緒に行うようにすべく、符号化制御部１５
６によりこのような制御を実行するようレート制御部１
１７で制御すれば、図９における動作点１７１と１７４
が保証され、バッファメモリ１１９の出力１２０として
は、前記２つのオブジェクトにおける最適な動作点での
動画像データ出力となり、当該動画像データが通信回線
に伝送されることになる。Similarly, FIG. 9B shows an objective function (“So2 = O2 [Do]”) 172 and an image quality trade-off function (“Ss2 = G2 [Ds])) 1 for the characteristic graph 167.
73 is shown. An intersection 174 of the objective function 172 and the trade-off function 173 is an operating point at time t at which a visually balanced decoded image is obtained. Then, the encoding control unit 15 controls the buffer memory 119 so that the control according to the characteristic graphs 166 and 167 is performed together.
6 to execute such control.
17, the operating points 171 and 174 in FIG.
Is output, and the output 120 of the buffer memory 119 is video data output at the optimal operating point for the two objects, and the video data is transmitted to the communication line.

【０１２９】ところで、図１０（ｂ）の特性グラフ１７
６は（“オブジェクト２”）時刻ｔでの符号化データＲ
２（ｔ）を時刻ｔ＋Ｌおよびｔ＋２の２回のコマ落しを
行うことにより、通信回線に送出している。By the way, the characteristic graph 17 in FIG.
6 is (“object 2”) encoded data R at time t
2 (t) is transmitted to the communication line by performing frame dropping twice at times t + L and t + 2.

【０１３０】しかし、図１０（ａ）の特性グラフ１７５
（“オブジェクト２”）のように、Ｒ１（ｔ）の値が通
常より（例えば時刻ｔ−３など）大きく、時刻ｔ＋１、
ｔ＋２およびｔ＋３の３回のコマ落しを行わないと時刻
ｔでの符号化データを通信回線に送出できず、受信側で
の動画像表示での遅延が通常より大きくなる。そして、
図６の１個のバッファメモリ１１９でレート制御部１１
７で制御すると、図１０（ｃ）の特性グラフ１７７のよ
うに時刻ｔ＋１、ｔ＋２およびｔ＋４でコマ落ちが起こ
ってしまう。However, the characteristic graph 175 shown in FIG.
(“Object 2”), the value of R1 (t) is larger than usual (for example, time t−3), and the time t + 1,
Unless the frame is dropped three times at t + 2 and t + 3, the encoded data at the time t cannot be transmitted to the communication line, and the delay in displaying a moving image on the receiving side becomes longer than usual. And
In one buffer memory 119 of FIG.
When the control is performed by 7, the frame drop occurs at times t + 1, t + 2, and t + 4 as shown in a characteristic graph 177 of FIG.

【０１３１】すなわち、特性グラフ１７６の特性に従っ
た符号化データではコマ落しが２回で済むのに対して、
特性グラフ１７５の特性に従った符号化データを一緒に
してレート制御を行うようにすると、コマ落しが３回に
なってしまう。That is, in the case of encoded data according to the characteristics of the characteristic graph 176, the number of dropped frames is only two, whereas
If the encoded data according to the characteristics of the characteristic graph 175 are subjected to the rate control together, the number of dropped frames becomes three.

【０１３２】そこで、図６のレート制御部１１７で特性
グラフ１７５および特性グラフ１７６の特性に従った符
号化データについて、まとめて制御する場合には、以下
のように行う。Therefore, when the encoded data according to the characteristics of the characteristic graph 175 and the characteristic graph 176 are collectively controlled by the rate control unit 117 in FIG. 6, the following is performed.

【０１３３】まず、優先度の高い“オブジェクト１”
（特性グラフ１７５）の符号化データについては一時的
に時刻ｔのとき量子化精度を上げ画品質を良くし、符号
化データＲ１（ｔ）を大きくしてＲ１′（ｔ）とし、バ
ッファメモリ１１９から回線に対するデータ伝送量をＬ
１からＬ１′に上げて（特性グラフ１７８）、コマ落し
の回数を時刻ｔ＋１とｔ＋２の２回に減少させる。First, “object 1” having a high priority
For the encoded data of the (characteristic graph 175), the quantization precision is temporarily increased at time t to improve the image quality, the encoded data R1 (t) is increased to R1 '(t), and the buffer memory 119 is used. From the data transmission amount to the line
From 1 to L1 '(characteristic graph 178), the number of dropped frames is reduced to two times t + 1 and t + 2.

【０１３４】一方、優先度の低い“オブジェクト２”
（特性グラフ１７６）の符号化データについては一時的
に時刻ｔのときに符号化データＲ２（ｔ）を小さくして
Ｒ２′（ｔ）とし、量子化精度を下げて画品質を劣化さ
せるのと同時に、バッファメモリ１１９から回線に対す
るデータ伝送量をＬ２からＬ２′に下げる（特性グラフ
１７９）。On the other hand, “object 2” having a low priority
Regarding the encoded data of the (characteristic graph 176), at time t, the encoded data R2 (t) is temporarily reduced to R2 ′ (t), and the quantization precision is reduced to degrade the image quality. At the same time, the amount of data transmitted from the buffer memory 119 to the line is reduced from L2 to L2 '(characteristic graph 179).

【０１３５】上記の“オブジェクト１”と“オブジェク
ト２”による符号化データは図６のバッファメモリ１１
９に格納されて個別に行った上記のレート制御と同様に
レート制御部１１７でまとめて制御する。図１１（ｃ）
は特性グラフ１７８と特性グラフ１７９の符号化データ
を加算した場合で、バッファメモリ１１９の伝送量の滞
留状態の時間的変化を示している。The coded data of the above “Object 1” and “Object 2” is stored in the buffer memory 11 of FIG.
9 and are collectively controlled by the rate control unit 117 in the same manner as the above-described rate control individually performed. FIG. 11 (c)
Represents the temporal change in the stagnant state of the transmission amount of the buffer memory 119 when the encoded data of the characteristic graph 178 and the encoded data of the characteristic graph 179 are added.

【０１３６】すなわち、フレーム時刻ｔの時点での符号
化データの量を、Ｒ１（ｔ）＋Ｒ２（ｔ）からＲ１′
（ｔ）＋Ｒ２′（ｔ）に下げ、通信回線の単位時間当た
りデータ伝送量はＬ３として、変化しないようにしてい
る。That is, the amount of encoded data at the frame time t is calculated from R1 (t) + R2 (t) to R1 '.
(T) + R2 '(t), and the data transmission amount per unit time of the communication line is set to L3 so as not to change.

【０１３７】以上のように、優先度の高いオブジェクト
でコマ落しが多く発生する可能性がある場合、そのオブ
ジェクトについては量子化精度を変えることで符号化デ
ータ量を変え、同時に通信回線の単位時間当たりのデー
タ伝送量を大きくする。As described above, when there is a possibility that many frames are dropped in an object having a high priority, the amount of coded data is changed for the object by changing the quantization accuracy, and at the same time, the unit time of the communication line is changed. Increase the amount of data transmission per unit.

【０１３８】一方、優先度の低いオブジェクトについて
は同時刻で量子化精度を下げ、符号化データ量を少なく
し、同時に通信回線の単位時間当たりのデータ伝送量を
下げる。On the other hand, for objects with low priorities, the quantization accuracy is reduced at the same time, the amount of coded data is reduced, and at the same time, the data transmission amount per unit time of the communication line is reduced.

【０１３９】このように制御することで、複数のオブジ
ェクトを１つのバッファメモリでまとめて管理する場合
に、全体として、コマ落しを少なくすることができる。By performing such control, when a plurality of objects are collectively managed in one buffer memory, the number of dropped frames can be reduced as a whole.

【０１４０】また、符号化処理を“オブジェクト１”に
ついては特性グラフ１７５から特性グラフ１７８に、
“オブジェクト２”については特性グラフ１７６から特
性１７９の特性に変更する場合に、以下の視覚的に最適
な動作点を満足するように変える。For the encoding process of “object 1”, the characteristic graph 175 changes to the characteristic graph 178.
When changing the characteristic of the “object 2” from the characteristic graph 176 to the characteristic of the characteristic 179, the characteristic is changed so as to satisfy the following visually optimum operating point.

【０１４１】図１２（ａ）における“オブジェクト１”
においては、目的関数が１８１で、特性グラフ１７５の
場合ではトレードオフ関数が１８２、そして、動作点は
１８４の場合に、視覚的に最適な動作点が得られた。し
かし、特性グラフ１７８に移動するとトレードオフ関数
は１８３になり、視覚的に最適な動作点は１８５に移動
する。"Object 1" in FIG.
In, when the objective function is 181, the trade-off function is 182 in the case of the characteristic graph 175, and the operating point is 184, a visually optimum operating point is obtained. However, when moving to the characteristic graph 178, the trade-off function becomes 183, and the visually optimum operating point moves to 185.

【０１４２】すなわち、動作点１８５は動作点１８４に
比べて、よりＳＮ比が良くなり、符号化率も大きくなっ
ているから、画質が一時的に向上したことを示してい
る。That is, the operating point 185 has a higher SN ratio and a higher coding rate than the operating point 184, indicating that the image quality has been temporarily improved.

【０１４３】一方、図１２（ｂ）における“オブジェク
ト２”についてみてみると目的関数は１８８で、特性グ
ラフ１７６ではトレードオフ関数が１８７、そして、動
作点が１９０の場合に、視覚的に最適な動作点であっ
た。しかし、特性グラフ１７９に移動するとトレードオ
フ関数は１８６になり、視覚的に最適な動作点は１８９
に移動する。すなわち、動作点１８９は動作点１９０に
比べてよりＳＮ比が劣化し、符号化率も小さくなってい
るから、画質が一時的に劣化したことを示している。On the other hand, looking at “object 2” in FIG. 12B, the objective function is 188, the trade-off function is 187 in the characteristic graph 176, and the operating point is 190. It was an operating point. However, when moving to the characteristic graph 179, the trade-off function becomes 186, and the visually optimal operating point is 189.
Go to In other words, the operating point 189 indicates that the image quality has temporarily deteriorated because the SN ratio has deteriorated and the coding rate has decreased as compared with the operating point 190.

【０１４４】以上のように、“オブジェクト１”と“オ
ブジェクト２”は予め決められたレートに収まるように
配分して符号化処理を行うべく、レート制御部１１７で
符号化制御部１５６により制御するようすれば、図６の
構成におけるバッファメモリ１１９でも、各オブジェク
ト個別に行っていたレート制御を、まとめて行うことが
可能である。そして、各オブジェクトのレート制御は、
各オブジェクトに最適な目的関数とトレードオフ関数と
の交点による動作点で行うようにすることにより、視覚
的に最適な動作点になる。As described above, the "object 1" and the "object 2" are controlled by the encoding control unit 156 in the rate control unit 117 so as to distribute the encoding so as to be within a predetermined rate and perform the encoding process. In this way, the rate control that has been performed individually for each object can also be collectively performed in the buffer memory 119 in the configuration of FIG. And the rate control of each object is
By performing the operation at the intersection of the optimal objective function and the trade-off function for each object, the operation point becomes visually optimal.

【０１４５】しかし、優先度の高い“オブジェクト１”
が一時的に符号化量が多くなり、コマ落しが多くなると
推定される場合には、その時刻におけるそのオブジェク
トの符号化データ量Ｒ′及びそのオブジェクトについて
の単位時間当たりのデータ伝送量Ｌ′を変えることによ
り、画品質を向上させ、コマ落しの回数を減らすことが
できる。However, “object 1” having a high priority
If it is estimated that the amount of coding temporarily increases and the number of dropped frames increases, the coded data amount R ′ of the object at that time and the data transmission amount L ′ per unit time of the object are calculated. By changing, the image quality can be improved and the number of dropped frames can be reduced.

【０１４６】一方、優先度の低いオブジェクトは前記時
刻にそのオブジェクトの符号化データ量Ｒ′及びそのオ
ブジェクトの単位時間当たりのデータ伝送量Ｌ′を変え
て画品質を下げるようにする。このケースでは、“オブ
ジェクト１”と“オブジェクト２”を１つのバッファメ
モリで一緒に管理する場合に、コマ落しの回数が増えな
いと云う特徴がある。On the other hand, for an object with a low priority, the picture quality is lowered by changing the coded data amount R 'of the object and the data transmission amount L' of the object per unit time at the time. In this case, when "object 1" and "object 2" are managed together in one buffer memory, the feature is that the number of dropped frames does not increase.

【０１４７】このことについて、説明する。This will be described.

【０１４８】図１３（ａ）において、“オブジェクト
１”の符号化量Ｒ１は特性グラフ１９１で示している。
コマ落しは時刻ｔ＋１とｔ＋２の時点の計２回、起きて
いる。そして、“オブジェクト２”についてはその符号
化量Ｒ２は図１４（ｂ）に特性グラフ１９２で示す如き
であって、この図の特性グラフ１９２で示しているよう
に、コマ落しは時刻ｔ＋１の時点、時刻ｔ＋２の時点お
よび時刻ｔ＋３の時点の計３回、起きている。In FIG. 13A, the encoding amount R 1 of “object 1” is shown by a characteristic graph 191.
The frame drop has occurred twice in total at the times t + 1 and t + 2. The encoding amount R2 of “object 2” is as shown by a characteristic graph 192 in FIG. 14B, and as shown in the characteristic graph 192 of FIG. , Three times at time t + 2 and at time t + 3.

【０１４９】従って、“オブジェクト１”と“オブジェ
クト２”の符号化処理を１つのバッファメモリで管理す
ると、特性グラフ１９３のようにコマ落しが時刻ｔ＋１
の時点、時刻ｔ＋２の時点および時刻ｔ＋３の時点の計
３回起きてしまう。Therefore, if the encoding processing of “Object 1” and “Object 2” is managed by one buffer memory, the frame drop occurs at time t + 1 as shown in the characteristic graph 193.
, The time t + 2, and the time t + 3, a total of three times.

【０１５０】そこで、本発明では図１４（ａ），（ｂ）
のように“オブジェクト１”はそのままで、優先度の低
い“オブジェクト２”について、時刻ｔでの符号化量を
Ｒ２（ｔ）からＲ２′（ｔ）に減少させる（特性グラフ
１９４）。“オブジェクト１”と“オブジェクト２”に
ついて、１つのバッファメモリでまとめて管理する場合
に、符号化制御部１５６によりこのような制御を実行す
るようレート制御部１１７で制御すれば、バッファメモ
リの滞留量を示す特性グラフは図１４（ｃ）に示す１９
５の如きになり、コマ落しは時刻ｔ＋１の時点と時刻ｔ
＋２の時点の計２回に減少する。Therefore, in the present invention, FIGS. 14 (a) and 14 (b)
, The encoding amount at time t is reduced from R2 (t) to R2 ′ (t) for “object 2” having a low priority (characteristic graph 194). When the “object 1” and the “object 2” are collectively managed in one buffer memory, if the encoding control unit 156 controls the rate control unit 117 to execute such control, the buffer memory can be stored. A characteristic graph showing the amount is shown in FIG.
5 and the frame drop occurs at the time t + 1 and the time t
It is reduced to a total of two times at the time of +2.

【０１５１】そして、この場合での目的関数とトレード
オフ関数との関係は、図１５のようになる。すなわち、
図１５（ａ）に示すように、“オブジェクト１”につい
ては目的関数１９６とトレードオフ関数１９７の関係は
変わらず、視覚的に最適な動作点は１９８になる。しか
し、“オブジェクト２”に関しては時刻ｔの時点で符号
化量を減らしたため、図１５（ｂ）に示すように、トレ
ードオフ関数が２００から２０１に移動し、従って、視
覚的に最適な動作点は２０３に移動し、ＳＮと符号化率
が劣化したことになる。FIG. 15 shows the relationship between the objective function and the trade-off function in this case. That is,
As shown in FIG. 15A, the relation between the objective function 196 and the trade-off function 197 for “Object 1” does not change, and the visually optimum operating point is 198. However, as for the “object 2”, the coding amount was reduced at the time t, so that the trade-off function moved from 200 to 201 as shown in FIG. Has moved to 203, which means that the SN and the coding rate have deteriorated.

【０１５２】図１６は本発明システムにおけるバッファ
メモリ１１９でのデータ滞留量を示している。図１６
（ａ）における“オブジェクト１”のバッファメモリ滞
留量の特性グラフ２０４と、図１６（ｂ）における“オ
ブジェクト２”のバッファメモリ滞留量の特性グラフ２
０５は、共に時刻ｔ＋１の時点、時刻ｔ＋２の時点およ
び時刻ｔ＋３の時点の計３回、コマ落しが起きている。
従って、“オブジェクト１”と“オブジェクト２”につ
いて一つのバッファメモリで管理した場合、バッファメ
モリ滞留量の変化は図１６（ｃ）に示す特性グラフ２０
６の如きとなり、時刻ｔ＋１、ｔ＋２およびｔ＋３の各
時点の計３回、コマ落しが起きる。FIG. 16 shows the amount of data staying in the buffer memory 119 in the system of the present invention. FIG.
A characteristic graph 204 of the buffer memory retention amount of “Object 1” in (a) and a characteristic graph 2 of the buffer memory retention amount of “Object 2” in FIG.
In the case of 05, the frame drop occurs three times in total at the time t + 1, the time t + 2, and the time t + 3.
Accordingly, when “object 1” and “object 2” are managed by one buffer memory, the change in the buffer memory staying amount is changed by the characteristic graph 20 shown in FIG.
6, and a frame drop occurs three times at each of the times t + 1, t + 2, and t + 3.

【０１５３】そこで、図１７（ａ）のように特性グラフ
２０４に関しては符号化データの量Ｒ１′（ｔ）と、単
位時間当たりのデータ伝送量Ｌ１′を大きくして画品質
を向上させながら特性グラフ２０７の如きにしてコマ落
しを時刻ｔ＋１とｔ＋２の時点の計２回に減らす。Therefore, as shown in FIG. 17A, regarding the characteristic graph 204, the encoded data amount R1 '(t) and the data transmission amount L1' per unit time are increased to improve the image quality. As shown in a graph 207, the number of dropped frames is reduced to a total of two times at times t + 1 and t + 2.

【０１５４】図１７（ｂ）の特性グラフ２０５に関して
は、符号化データの量Ｒ２′（ｔ）と、単位時間当たり
のデータ伝送量Ｌ２′を小さくして画品質を劣化させ、
特性グラフ２０８の如きにしてコマ落しを時刻ｔ＋１と
ｔ＋２の時点の計２回に減らす。Regarding the characteristic graph 205 of FIG. 17B, the encoded data amount R2 '(t) and the data transmission amount L2' per unit time are reduced to degrade image quality.
As shown in the characteristic graph 208, the number of dropped frames is reduced to a total of two times at times t + 1 and t + 2.

【０１５５】従って、図１７（ａ）の特性グラフ２０７
と図１７（ｂ）の特性グラフ２０８の動作について１個
のバッファメモリ１１９で管理すべく、符号化制御部１
５６によりこのような制御を実行するようレート制御部
１１７で制御すれば、バッファメモリ１１９のデータ滞
留量の特性グラフは図１７（ｃ）における２０９の如き
となり、コマ落しが時刻ｔ＋１とｔ＋２の時点の計２回
に減少する。Therefore, the characteristic graph 207 of FIG.
In order to manage the operation of the characteristic graph 208 of FIG.
If the control is performed by the rate control unit 117 to execute such control by 56, the characteristic graph of the data retention amount in the buffer memory 119 is as shown in 209 in FIG. To a total of two times.

【０１５６】この場合に、目的関数とトレードオフ関数
の関係を考えてみると、図１８に示す如きとなる。すな
わち、この場合、“オブジェクト１”は図１８（ａ）に
示すように、トレードオフ関数が２１０から２１１に移
動したことになり、動作点が２１３から２１４に移動
し、ＳＮ比と符号化率が上昇したことになる。FIG. 18 shows the relationship between the objective function and the trade-off function in this case. That is, in this case, as shown in FIG. 18A, the trade-off function of “Object 1” has shifted from 210 to 211, the operating point has shifted from 213 to 214, and the S / N ratio and coding rate have been changed. Has risen.

【０１５７】一方、“オブジェクト２”に関しては図１
８（ｂ）に示すように、トレードオフ関数が２１６から
２１７に移動したことになり、動作点が２１８から２１
９に移動し、ＳＮ比と符号化率が低下したことになる。On the other hand, regarding “Object 2”, FIG.
As shown in FIG. 8B, the trade-off function has moved from 216 to 217, and the operating point has changed from 218 to 21.
9, the SN ratio and the coding rate are reduced.

【０１５８】このように、第２の実施形態においては、
複数のオブジェクトがある場合に、各オブジェクトは予
め決められたレートに収まるように符号化処理を行うべ
く、レート制御部１１７で符号化制御部１５６により制
御するようしたことで、一つのバッファメモリしかない
場合でも各オブジェクト個別に行っていたレート制御
を、まとめて行うことができるようにしたものであり、
各オブジェクトのレート制御は、各オブジェクトに最適
な目的関数とトレードオフ関数との交点による動作点で
行うようにすることで、視覚的に最適な動作点になるよ
うに制御できるようになるものである。As described above, in the second embodiment,
When there are a plurality of objects, each object is controlled by the encoding control unit 156 by the rate control unit 117 in order to perform the encoding process so as to be within a predetermined rate, so that only one buffer memory is used. Even if there is no object, the rate control that had been performed individually for each object can now be performed collectively,
By controlling the rate of each object at the operating point at the intersection of the optimal objective function and trade-off function for each object, it becomes possible to control the visually optimal operating point. is there.

【０１５９】また、優先度の高いオブジェクトが一時的
に符号化量が多くなって、コマ落しが多くなると推定さ
れるシーンの場合には、その時刻におけるそのオブジェ
クトの符号化データ量Ｒ′及びそのオブジェクトについ
ての単位時間当たりのデータ伝送量Ｌ′を変えるように
し、これにより、画品質を向上させ、コマ落しの回数を
減らすことができるようにし、他方、優先度の低いオブ
ジェクトは前記時刻にそのオブジェクトの符号化データ
量Ｒ′及びそのオブジェクトの単位時間当たりのデータ
伝送量Ｌ′を小さくして画品質を下げるように制御する
ことで、コマ落しの回数が増えないように制御できるも
のである。In the case of a scene in which an object with a high priority is temporarily estimated to have a large amount of coding and a large number of dropped frames, the coded data amount R 'of the object at that time and its coded data amount R' By changing the data transmission amount L 'per unit time for the object, the image quality can be improved and the number of dropped frames can be reduced, while the low priority object is set at the time. By controlling the encoded data amount R 'of the object and the data transmission amount L' of the object per unit time to reduce the image quality, it is possible to control the number of dropped frames so as not to increase. .

【０１６０】なお、以上の説明は通信回線が固定レート
で、各オブジェクトに対して予め初期にビットを配分を
していて、基本的には各オブジェクトも定レートになっ
ている。そして、符号化の前段階でオブジェクトの内、
あるオブジェクトがアクティブになり、そのオブジェク
トの影響で通信回線に送出されるビットストリュームに
対してコマ落しが多く発生すると推定された場合に、優
先度の高いオブジェクトの画品質を落さず、または向上
させるように各オブジェクトの間でビットの移動をさせ
て、前記通信回線に送出されるビットストリュームに対
するコマ落しも減少させる。そして、各オブジェクトの
符号化処理の過程で、符号化率に応じたコマ落しによる
時間的歪みと量子化精度による空間的歪みとのバランス
がとれ、視覚特性上最適な動作点で符号化処理を行うこ
とを目的としている。In the above description, the communication line has a fixed rate, bits are initially allocated to each object in advance, and basically each object has a constant rate. And, before the encoding,
If an object is activated and it is estimated that many drop frames occur for the bitstream transmitted to the communication line due to the effect of the object, the image quality of the high-priority object is not reduced, or By moving bits between each object so as to improve, the drop of frames for the bit stream transmitted to the communication line is also reduced. In the process of encoding each object, the temporal distortion due to frame drop according to the encoding rate and the spatial distortion due to quantization accuracy are balanced, and the encoding process is performed at the optimal operating point in terms of visual characteristics. It is intended to do.

【０１６１】しかし、本発明の適用対象は通信回線が定
レートに限定されるものではない。例えば、無線回線の
ように伝送エラーが頻繁に発生し、実質的に伝送レート
が変化するような通信回線、あるいはＡＴＭ通信のよう
に可変レート通信回線にも適用できる。However, the application of the present invention is not limited to a communication line having a constant rate. For example, the present invention can be applied to a communication line in which a transmission error frequently occurs and a transmission rate substantially changes like a wireless line, or a variable rate communication line such as an ATM communication.

【０１６２】可変レート通信回線の場合、回線のレート
変動の都度、各オブジェクトのビット配分を変える。配
分方法として、優先度の高いオブジェクトと低いオブジ
ェクトについて、配分比率で決める方法がある。また、
優先度の高いオブジェクトの配分ビットを一定とするこ
とで、常時一定の画品質を確保できるようにして、残り
のビットを優先度の低いオブジェクトに配分する方法が
ある。In the case of a variable rate communication line, the bit allocation of each object is changed each time the line rate changes. As an allocation method, there is a method of determining an object having a high priority and an object having a low priority by an allocation ratio. Also,
There is a method of distributing the remaining bits to low-priority objects by keeping the distribution bits of the high-priority objects constant to ensure a constant image quality at all times.

【０１６３】上記の方法により各オブジェクトに配分し
たビットから、本発明の方法により、更に通信回線に送
出されるビットストリュームでのコマ落しを減少させ、
且つ優先度の高いオブジェクトの画品質の維持あるいは
向上させるように、各オブジェクトのビット配分を行
い、符号化を実施する。そして、各オブジェクトの符号
化処理の過程で、符号化率に応じたコマ落しによる時間
的歪みと量子化精度による空間的歪みとのバランスがと
れ、視覚特性上最適な動作点で符号化処理を行うこと
で、可変レート通信回線でも、各オブジェクトの動画像
についてバランスのとれた表示が可能となる。From the bits allocated to each object by the above method, the method of the present invention further reduces the drop of frames in the bit stream transmitted to the communication line,
In addition, bits are allocated to each object and coding is performed so as to maintain or improve the image quality of objects with high priority. In the process of encoding each object, the temporal distortion due to frame drop according to the encoding rate and the spatial distortion due to quantization accuracy are balanced, and the encoding process is performed at the optimal operating point in terms of visual characteristics. By doing so, a well-balanced display of the moving image of each object is possible even with a variable rate communication line.

【０１６４】尚、本発明は上述した実施形態に限定され
るものではなく、種々変形して実施可能である。また、
本発明の実施形態に記載した手法は、コンピュータに実
行させることのできるプログラムとして、磁気ディスク
（フロッピーディスク、ハードディスクなど）、光ディ
スク（ＣＤ−ＲＯＭ、ＤＶＤなど）、半導体メモリなど
の記録媒体に格納して頒布することもできる。Note that the present invention is not limited to the above-described embodiments, and can be implemented with various modifications. Also,
The method described in the embodiment of the present invention is stored in a recording medium such as a magnetic disk (floppy disk, hard disk, etc.), an optical disk (CD-ROM, DVD, etc.), a semiconductor memory, or the like as a program that can be executed by a computer. Can also be distributed.

【０１６５】[0165]

【発明の効果】以上、詳述したように本発明は、複数オ
ブジェクトの動画像を符号化した画像データを多重化し
て通信回線に送出するシステムにおいて、オブジェクト
の符号化の際に優先度をつけ、符号化の際に全体として
コマ落しが多くなると推定されたとき、オブジェクト間
で配分ビットを移動させ、優先度の高いオブジェクトの
画品質を維持、且つ、向上させるようにし、そして、各
オブジェクトの符号化処理の過程で、符号化率に応じた
コマ落しによる時間的歪みと量子化精度のよる空間的歪
みとのバランスがとれ、視覚特性上最適な動作点で符号
化処理が行えるようにしたものであり、これにより、各
オブジェクトについて再生側ではバランスのとれた再生
画像の表示が可能となる。As described above in detail, according to the present invention, in a system for multiplexing image data obtained by encoding moving images of a plurality of objects and transmitting the multiplexed image data to a communication line, priorities are set at the time of object encoding. When it is estimated that the number of dropped frames is increased as a whole during encoding, the allocation bits are moved between the objects to maintain and improve the image quality of the high-priority object, and In the course of the encoding process, the temporal distortion due to frame drop according to the encoding rate and the spatial distortion due to the quantization accuracy are balanced, and the encoding process can be performed at the optimal operating point in terms of visual characteristics. Thus, a balanced reproduced image can be displayed on the reproducing side for each object.

[Brief description of the drawings]

【図１】本発明を説明するための図であって、本発明に
おける第１の実施形態である矩形画面に表示させるため
の符号化方式を説明するための構成図。FIG. 1 is a diagram for explaining the present invention, and is a configuration diagram for explaining an encoding method for displaying on a rectangular screen according to a first embodiment of the present invention.

【図２】本発明を説明するための図であって、本発明に
おける第１の実施形態である矩形画面に表示させるため
の符号化方式でのバッファメモリのデータ滞留量を示し
たグラフの一例。FIG. 2 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount of a buffer memory in an encoding method for displaying on a rectangular screen according to the first embodiment of the present invention; .

【図３】本発明を説明するための図であって、本発明に
おける第１の実施形態である矩形画面に表示させるため
の符号化方式での画品質を定義する目的関数とトレード
オフ関数を示したグラフの一例。FIG. 3 is a diagram for explaining the present invention, and illustrates an objective function and a trade-off function that define image quality in an encoding method for displaying on a rectangular screen according to the first embodiment of the present invention. An example of the graph shown.

【図４】本発明を説明するための図であって、本発明に
おける第１の実施形態である矩形画面に表示させるため
の符号化方式での画品質を定義するトレードオフ関数を
示したグラフの一例。FIG. 4 is a diagram for explaining the present invention, and is a graph showing a trade-off function that defines image quality in an encoding method for displaying on a rectangular screen according to the first embodiment of the present invention. An example.

【図５】本発明を説明するための図であって、本発明に
おける第１の実施形態である矩形画面に表示させるため
の符号化方式での画品質を定義する符号化率と量子化精
度の関係を示したグラフの一例。FIG. 5 is a diagram for explaining the present invention, and shows a coding rate and a quantization accuracy that define image quality in a coding method for displaying on a rectangular screen according to the first embodiment of the present invention. 6 is an example of a graph showing the relationship.

【図６】本発明を説明するための図であって、本発明に
おける第２の実施形態である複数オブジェクトの符号化
方式の構成図。FIG. 6 is a diagram for explaining the present invention, and is a configuration diagram of an encoding method for a plurality of objects according to a second embodiment of the present invention.

【図７】本発明を説明するための図であって、本発明に
おける第２の実施形態である複数オブジェクトの符号化
方式での符号化部の構成図。FIG. 7 is a diagram for explaining the present invention, and is a configuration diagram of an encoding unit in a multi-object encoding method according to a second embodiment of the present invention.

【図８】本発明を説明するための図であって、本発明に
おける第２の実施形態である複数オブジェクトの符号化
方式でのバッファメモリのデータ滞留量を示したグラフ
の一例。FIG. 8 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図９】本発明を説明するための図であって、本発明に
おける第２の実施形態である複数オブジェクトの符号化
方式での各オブジェクトの目的関数とトレードオフ関数
を示したグラフの一例。FIG. 9 is a diagram for explaining the present invention, and is an example of a graph showing an objective function and a trade-off function of each object in a multi-object encoding method according to the second embodiment of the present invention.

【図１０】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式でのバッファメモリのデータ滞留量を示したグラ
フの一例。FIG. 10 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図１１】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式でのバッファメモリのデータ滞留量を示したグラ
フの一例。FIG. 11 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図１２】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式での各オブジェクトの目的関数とトレードオフ関
数を示したグラフの一例。FIG. 12 is a diagram for explaining the present invention, and is an example of a graph showing an objective function and a trade-off function of each object in a multi-object encoding method according to the second embodiment of the present invention.

【図１３】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式でのバッファメモリのデータ滞留量を示したグラ
フの一例。FIG. 13 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図１４】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式でのバッファメモリのデータ滞留量を示したグラ
フの一例。FIG. 14 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図１５】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式での各オブジェクトの目的関数とトレードオフ関
数を示したグラフの一例。FIG. 15 is a diagram for explaining the present invention, and is an example of a graph showing an objective function and a trade-off function of each object in the encoding method of a plurality of objects according to the second embodiment of the present invention.

【図１６】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式でのバッファメモリのデータ滞留量を示したグラ
フの一例。FIG. 16 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図１７】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式でのバッファメモリのデータ滞留量を示したグラ
フの一例。FIG. 17 is a diagram for explaining the present invention, and is an example of a graph showing a data retention amount in a buffer memory in a multi-object encoding method according to the second embodiment of the present invention.

【図１８】本発明を説明するための図であって、本発明
における第２の実施形態である複数オブジェクトの符号
化方式での各オブジェクトの目的関数とトレードオフ関
数を示したグラフの一例。FIG. 18 is a diagram for explaining the present invention, and is an example of a graph showing an objective function and a trade-off function of each object in a multi-object encoding method according to the second embodiment of the present invention.

【図１９】従来のオブジェクトに分割された動画像を表
示させる方式の構成図。FIG. 19 is a configuration diagram of a conventional method of displaying a moving image divided into objects.

【図２０】従来のオブジェクトに分割された動画像を表
示させる方式での詳細な構成図。FIG. 20 is a detailed configuration diagram of a conventional method of displaying a moving image divided into objects.

[Explanation of symbols]

１１１…入力動画像１１２…形状抽出部１１４，１１４ａ，１４４ｂ，〜１４４ｎ…符号化部１１７…レート制御部１１９…バッファメモリ１２１…多重化部１２４、１５６…符号化制御部１２８…符号化部１２９，１５９…動き補償フレーム間予測部１３０，１６０…ＤＣＴ変換部１３１，１６１…第２の可変長符号化部１３３，１６２…量子化部１３４，１６３…ローカル復号化部１３５，１６４…第１の可変長符号化部１３７，１６５…多重化部１３９…バッファメモリ１５３…マルチプレクサ１５７…形状符号化部１５８…符号化被制御部 Reference numeral 111: input moving image 112: shape extraction unit 114, 114a, 144b, to 144n: encoding unit 117: rate control unit 119: buffer memory 121: multiplexing unit 124, 156: encoding control unit 128 , 159... Motion-compensated inter-frame predictors 130 and 160... DCT converters 131 and 161... Second variable-length encoders 133 and 162. Variable length coding units 137, 165: Multiplexing unit 139: Buffer memory 153: Multiplexer 157: Shape coding unit 158: Coding controlled unit

───────────────────────────────────────────────────── フロントページの続きＦターム(参考） 5C052 AA17 AB04 DD04 GA01 GA07 GA08 GB06 GC05 GC07 GF06 5C059 KK01 LB07 MA00 MA05 MA23 MB01 MB14 MB16 MC11 ME02 NN01 NN21 NN28 NN43 PP04 RB01 RB18 RC16 RC19 SS07 SS20 TA07 TA46 TB18 TC03 TC10 TC14 TC15 TC37 TC47 TD11 TD16 UA02 UA32 UA38 UA39 ──────────────────────────────────────────────────続き Continued on the front page F term (reference) 5C052 AA17 AB04 DD04 GA01 GA07 GA08 GB06 GC05 GC07 GF06 5C059 KK01 LB07 MA00 MA05 MA23 MB01 MB14 MB16 MC11 ME02 NN01 NN21 NN28 NN43 PP04 RB01 RB18 RC16 RC19 SS07 SS20 TC07 TB46 TC14 TC15 TC37 TC47 TD11 TD16 UA02 UA32 UA38 UA39

Claims

[Claims]

An image encoding apparatus for encoding a moving image whose screen is composed of a plurality of objects in units of objects, multiplexing each encoded data of the plurality of objects, and transmitting the multiplexed data to a communication line. Means for encoding, wherein a distribution value of a bit rate for performing the multiplexing is set in accordance with a priority of a given image quality for a target object, and a generated code amount corresponding to the allocated bit rate. Encoding means for encoding the target object, and when an object with high activity occurs in the process of encoding the object, the image quality of the high-priority object is not reduced or improved. Set the bit rate distribution of the encoding means to redistribute the bit rate among the objects Image encoding device characterized by comprising a Gosuru control means.

2. The encoding device according to claim 1, wherein, when encoding each of the objects on a frame-by-frame basis, a balance between temporal distortion due to frame drop according to a coding rate and spatial distortion due to quantization accuracy is achieved. 2. The image encoding apparatus according to claim 1, further comprising a function of setting and controlling a bit rate distribution so as to secure an optimum operating point in terms of visual characteristics.

3. The control means according to claim 1, wherein, in the step of moving the bit rate between the objects, the bit rate distribution is controlled so that temporal distortion is reduced in the entire coded data transmitted to the communication line. 2. The image encoding apparatus according to claim 1, further comprising a function of performing setting control.

4. An image encoding method for encoding a moving image whose screen is composed of a plurality of objects on an object basis, multiplexing each encoded data of the plurality of objects, and transmitting the multiplexed data to a communication line. In addition to encoding, for the target object, the target object is encoded with a generated code amount corresponding to a given bit rate distribution value for performing the multiplexing in accordance with the priority of the image quality of the object. Encoding step, when a high activity object occurs in the process of encoding the object, the bit rate between the objects so as not to reduce or improve the image quality of the high priority object To set and control the bit rate distribution of the encoding step to redistribute Picture coding method characterized by comprising the steps.

5. In the control step, when each of the objects is encoded for each frame, a balance between time distortion due to frame drop according to a coding rate and spatial distortion due to quantization accuracy is achieved. 5. The image encoding method according to claim 4, further comprising a function of setting and controlling bit rate distribution so as to secure an optimal operating point in terms of visual characteristics.

6. The method according to claim 1, wherein, in the step of moving the bit rate between the objects, the bit rate is distributed so that temporal distortion is reduced in the entire coded data transmitted to the communication line. The image encoding method according to claim 4, further comprising a function of performing setting control.

7. An image encoding program for encoding a moving image whose screen is composed of a plurality of objects on an object basis, multiplexing each encoded data of the plurality of objects, and transmitting the multiplexed data to a communication line, Along with encoding the object, for the target object, the target object is generated with a generated code amount equivalent to a given bit rate distribution value for performing the multiplexing in accordance with the priority of the image quality of the object. An encoding step of encoding, and when a high activity object occurs in the process of encoding the object, the object quality is reduced between the objects so as not to reduce or improve the image quality of the high priority object. Set and control the bit rate distribution of the encoding step to redistribute the bit rate. Control and step, readable by a computer, characterized by comprising, and, a medium recording an executable program.