[go: up one dir, main page]

WO1996034495A2 - Dispositif et procede de codage d'images video - Google Patents

Dispositif et procede de codage d'images video Download PDF

Info

Publication number
WO1996034495A2
WO1996034495A2 PCT/IB1996/000348 IB9600348W WO9634495A2 WO 1996034495 A2 WO1996034495 A2 WO 1996034495A2 IB 9600348 W IB9600348 W IB 9600348W WO 9634495 A2 WO9634495 A2 WO 9634495A2
Authority
WO
WIPO (PCT)
Prior art keywords
die
coefficients
amplitude
bit rate
amplitude distribution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB1996/000348
Other languages
English (en)
Other versions
WO1996034495A3 (fr
Inventor
Gerrit Johan Keesman
Willem Marie Julia Coene
Eduard Willem Salomons
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Philips Norden AB
Original Assignee
Philips Electronics NV
Philips Norden AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Electronics NV, Philips Norden AB filed Critical Philips Electronics NV
Priority to AU51608/96A priority Critical patent/AU705914B2/en
Priority to EP96908305A priority patent/EP0768007B1/fr
Priority to JP8532314A priority patent/JPH10502791A/ja
Priority to DE69605523T priority patent/DE69605523T2/de
Publication of WO1996034495A2 publication Critical patent/WO1996034495A2/fr
Publication of WO1996034495A3 publication Critical patent/WO1996034495A3/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/189Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
    • H04N19/19Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding using optimisation based on Lagrange multipliers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/124Quantisation
    • H04N19/126Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/147Data rate or code amount at the encoder output according to rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/152Data rate or code amount at the encoder output by measuring the fullness of the transmission buffer

Definitions

  • the invention relates to a device for coding video pictures, comprising: a picture transformer for transforming blocks of pixels into coefficients; coding means for encoding said coefficients, the bit rate R and distortion D of the encoded picture being determined by a coding parameter; means (3) for selecting one from a series of coding parameter values for which the sum D+ ⁇ R is minimal for an applied value of ⁇ ; and calculation means for calculating the value of ⁇ .
  • the invention also relates to a method of coding video pictures.
  • the article discloses a method of improving the picture quality of an MPEG encoder.
  • MPEG defines an encoder comprising a picture transformer to obtain coefficients, and a quantizer for imaging the coefficients to a proximate quantization level.
  • the quantization level constitutes a coding parameter.
  • the picture quality is improved by tresholding the quantized coefficients, i.e. by selecting the quantization level zero rather than the conventional quantization level if that it is better in a rate vs. distortion sense.
  • the prior art method will briefly be summarized with reference to Fig. 1.
  • the reference numeral 100 denotes a rate-distortion curve (hereinafter abbreviated to RD curve) for a range of quantization step sizes ⁇ .
  • RD curve rate-distortion curve
  • the bitrate will be lower because fewer non-zero coefficients are transmitted, and the distortion will be larger.
  • curve 101 represents an RD curve if the thresholding operation is performed on coefficients which have been quantized with a step size ⁇ l.
  • Curve 102 represents an RD curve if the thresholding operation is performed on coefficients which have been quantized with a larger step size ⁇ 2.
  • the curves 101 and 102 will hereinafter be referred to as thresholding curves.
  • the Lagrangian cost is defined as:
  • b_ represents a set of coefficient values to be encoded.
  • D(b and RQ>) are the distortion and the bitrate, respectively, upon encoding of these coefficients.
  • the Lagrang multiplier ⁇ is a natural number which is larger than or equal to 0.
  • a tresholdin operation can be found which yields a set fe of coefficients for which L(fe, ⁇ ) is minimal.
  • Th distortion DQj) and bitrate R ⁇ ) applying to this set constitutes a point on the thresholding curve.
  • increases, the distortion will be larger and the bitrate lower.
  • a given bitrate Rl can be achieved both by (i) quantizing the coefficients with step size ⁇ 2 and transmitting all quantized coefficients, and by (ii) quantizing the coefficients with a smaller step size ⁇ l and thresholding selected coefficients.
  • option (ii) is more favourable because it yield less distortion.
  • the curves 101 and 102 shown are only two curves of many possible thresholding curves.
  • the envelope thereof, denoted by 103 in the Figure, constitutes the optimal RD curve for a given picture. It is completely below curve 100.
  • thresholding may result in a step size ⁇ and a Lagrange multiplier ⁇ which yields a smaller distortion than when thresholding is not carried out.
  • the way in which the optimal operating point of an encoder is determined is, however, very intensive from a computation point of view. It is an iterative procedure in which the optimal set of coefficients is searched for a large number of values of ⁇ .
  • the invention is based on the recognition that the searched Lagrange multiplier ⁇ for a desired bit rate is equal to the negative value of the derivative of the RD curve, and utilizes the fact that at least a good approximation of the RD curve can be derive from the distribution of coefficient amplitudes for each spatial frequency.
  • Said amplitude distribution can easily be measured by statistic pre-analysis of the picture to be coded. For example, the amplitude distribution can be obtained by counting, for each spatial frequency, the number of times when a coefficient indicative of said spatial frequency assumes the same amplitude.
  • the means for estimating the bit rate R is adapted to cumulatively add, for all spatial frequencies, the product of the number of bits to encode a given amplitude with the number of occurrences when a coefficient assumes said amplitude as defined by the amplitude distribution.
  • the means for estimating the bit rate R is adapted to calculate the entropy from said amplitude distribution, said entropy being indicative for the bit rate R.
  • Fig. 1 already described, shows some rate-distortion curves to explain a prior-art device.
  • Fig. 2 shows a device for coding a video signal according to the invention.
  • Fig. 3 shows the block diagram of a ⁇ calculator shown in Fig. 2.
  • Fig. 4 shows a flow chart to explain the operation of a computing circuit shown in Fig. 3.
  • Fig. 5 shows some entropy-distortion diagrams to explain the operation of the ⁇ calculator shown in Fig. 3.
  • Figs. 6 and 7 show block diagrams of further embodiments of the ⁇ calculator.
  • Fig. 8 shows a quantization diagram to explain the operation of a quantizer shown in Fig. 2.
  • Figs. 9 and 10 show flow charts to explain the operation of the quantizer shown in Fig. 2. DESCRIPTION OF EMBODIMENTS
  • Fig. 2 shows a device for coding a video signal according to die invention.
  • the device comprises a delay 1, a picture transformer 2, a quantizer 3, a variabl length coding circuit 4, a bitrate control circuit 5 and a ⁇ calculator 6.
  • An important aspect of the invention is that the video signal can be encoded in accordance with the MPEG standard.
  • the picture transformer 2, the variable-length coding circuit 4 and th bitrate control circuit 5 in this embodiment are identical to the corresponding elements of th generally known MPEG encoder. Therefore, they are not described in detail.
  • Fig. 2 only shows the elements which are necessary for intraframe coding.
  • the invention is also applicable to interframe coding.
  • the applied picture is then first subtracted from a motion-compensated prediction picture, whereafter the residue thus obtained is code
  • the applied picture is block-sequentially subjected to a picture transform.
  • Discrete Cosine Transform DCT
  • Any suitable frequency transform may, however, be used, including sub-band filterin
  • the DCT transforms each picture block of, for example 8*8 pixels into a block of 8*8 coefficients c in which i 0 ... 63 indicates the order of increasing extent of picture detail.
  • the DCT coefficients c are applied to the quantizer 3 which images eac coefficient c ( at a discrete quantization level.
  • the quantization levels are spaced apart by a step size ⁇ j .
  • the step size ⁇ ( is coefficient-dependent in conformity with weighting factors W t which are stored in a quantization matrix.
  • the quantization step size is controlled per block by the bitrate control circuit 5 which applies a step size ⁇ to the quantizer.
  • the step size ⁇ is calculated in accordance with the expression in which i denotes a spatial frequency, W s is the weighting factor for said spatial frequency, and c is a constant.
  • the quantization matrix is generally different for intra coded (I) picture and inter coded (P,B) pictures.
  • d e quantizer images each coefficient at one of d e two nearest quantization levels.
  • quantizer 3 also receives a Lagrange multiplier ⁇ .
  • the quantizer now images a coefficient C j at a different than the nearest quantization level. More particularly, the quantizer forces a coefficient to assume a different level if ⁇ at is more efficient in a rate- distortion sense.
  • Embodiments of quantizer 3 and ⁇ -calculator 6 will be described in greate detail.
  • the quantized coefficients q are subsequently applied to a variable-length coding circuit 4.
  • This circuit forms a variable-length codeword for each non-zero coefficient and a possibly preceding series of non-zero coefficients in conformity with the MPEG standard.
  • the codewords formed are transmitted after buffering (not shown) via a transmission channel.
  • the codewords are also applied to the bitrate control circuit 5.
  • This circuit controls the step size ⁇ in further known manner in such a way that the number of bits per block is always as much as possible in conformity with a predetermined target R,.
  • the Lagrange multiplier ⁇ is applied to the quantizer by ⁇ -calculator 6.
  • This calculator calculates ⁇ by analysis of the picture to be coded. To this end, d e video signal is directly applied to the ⁇ -calculator, while the actual coding of the signal (transform, quantization, variable-length coding) takes place after it has been delayed by one picture period via delay 1.
  • Fig. 3 shows a block diagram of ⁇ -calculator 6.
  • the calculator comprises a picture transformer 600, a histogram forming circuit 601, an RD estimating circuit 602 and a computing circuit 603.
  • the index i indicates the spatial frequency.
  • the coefficients have an amplitude n in die range [-2047,2048].
  • the histogram forming circuit 601 forms, for each spatial frequency i, a histogram of the coefficient amplitude distribution. For example, the circuit 601 counts the number of times when coefficient C
  • the RD estimating circuit 602 uses the histograms h i n to generate an estimation for the RD curve.
  • computing circuit 603 calculates the Lagrange multiplier ⁇ from the estimated RD curve.
  • the searched Lagrange multiplier ⁇ is constituted by d e derivative -dD/dR of the RD curve at a given target bit rate or a given distortion.
  • RD estimating circuit 602 The operation of RD estimating circuit 602 will be explained with reference to a flow chart of operations shown in Fig. 4.
  • a step 40 a value is assigned to a quantization step size ⁇ .
  • the bit rate R and distortion D for this step size are computed in a step 41.
  • Embodiments of this step 41 will be described below in greater detail. If the step 41 were performed for a large number of step sizes ⁇ , then it would yield an RD curve denoted by 500 in Fig. 5. However, it is not necessary to have the RD curve completely available. Because die Lagrange multiplier ⁇ is die derivative -dD/dR at point P (see Fig.
  • step 4 the part of die RD curve around die point P is to be found.
  • Known numerical algorithms can be used to determine point P, for example a "bi-section algorithm". This is denoted in Fig. 4 by means of a step 42 in whic is checked whedier die bit rate R at the current step size ⁇ is sufficiently equal to die targe number of bits R,. As long as this is not the case, a different step size ⁇ is selected (step 4 and the computing step 41 is performed with the different step size. It is recalled diat (the relevant part of) the RD curve can be computed from the available histogram n . It is not necessary to transform the picture again for each computation of R and D.
  • me step size ⁇ is indicative of the spacing ⁇ , between discrete quantization levels.
  • the RD estimating circuit 602 determines from h ⁇ how often die amplitude n of coefficient c, corresponds to a quantization level r , i.e. how often the amplitude n is withi an interval
  • the number of occurrences of c, wim an amplitude t ⁇ n ⁇ t k+ is:
  • diat ⁇ t may be different for I-pictures and P- or B-pictures.
  • the distortion for the current picture dien is:
  • bit rate R is assum to be equivalent widi d e entropy H of the picture.
  • the entropy H is calculated in die following manner.
  • the probability p k of coefficient c, assuming the quantization level r k is calculated from the histogram h i n as:
  • the entropy H and die distortion D are bodi a function of the current step size ⁇ . It is recalled diat the RD curve 500 in Fig. 5 is obtained by calculating H and D for various stepsizes. The RD curve 500 appears to correspond to a very satisfactory extent to die actual RD curve after encoding die relevant picture.
  • variable-length tables used by variable-length coding circuit denoted 4 in Fig. 2.
  • DC coefficients i.e. coefficients c, for which i— 0
  • This code tables specifies for each DC amplitude n a codeword-length 1 D . Recalling that the number of occurrences of DC coefficients CQ with amplitude n is stored in histogram hu n , the number of bits to encode all DC coefficients is:
  • AC coefficients coefficients Cj for which i ⁇ O
  • the MPEG standard comprises an AC code table specifying die codeword-lengd s of "events", each event having a value (the quantization level r k of a non-zero coefficient) and a run length of zero coefficients preceding said non-zero coefficient.
  • the number of events and d eir values can be derived from d e histograms h . What cannot be derived from d e histograms is the run lengdi for each event. Therefore, an average run length for each value r k is used. The average run length is derived from the AC code table, using die assumption that said table applies to typical pictures.
  • An average codeword lengdi I* corresponding with said average run lengdi is also derived from the code table.
  • the number of bits to encode the AC coefficients can dius be approximated by multiplying the number of occurrences of events with the value r k , i.e. d e number of occurrences of coefficient c, with an amplitude t k ⁇ n ⁇ t k+1 , which is:
  • EOB end-of-block code
  • computing circuit 603 calculates the Lagrange multiplier ⁇ as the derivative -dD/d in d e point P of operation. If (R J .D J ) is die point of operation P and two points (R,,D,) an (R 3 ,D 3 ) are in its proximity, ⁇ follows from:
  • has the same value for all picture blocks of the picture.
  • Fig. 5 illustrates how die encoder reacts to die applied ⁇ .
  • die reference numeral 50 denotes d e RD curve as computed by RD estimation circuit 602.
  • the encoder would operate at the point P because the bitra control circuit 5 (see Fig. 2) generates a step size ( ⁇ 2) which actually leads to die target number of bits.
  • d e coefficients are imaged at different quantization levels under die influence of ⁇ . This causes die number of bits to be reduced.
  • the bitrate control circuit reacts thereto by reducing die step size. More particularly, the bitrate control circuit automatically controls the step size to that value ( ⁇ l) at which the derivative of rate distortion curve 501 at point Q remains equal to the derivative of die RD curve at point P.
  • the RD curve for die image has been estimated using die same step size ⁇ for all blocks constituting said image. It has been found diat die estimated RD curve corresponds to a satisfactory extent to the actual RD curve for the current image. Nevertheless, a yet more accurate RD curve can be estimated if it is realised diat die actual MPEG encoder adaptively varies die quantization step size from (macro-)block to (macro-)block. Generally, the step size depends on die activity of said block. Blocks having a low activity are quantized widi a smaller step size than blocks having a high activity, because high activity (e.g. much image detail) masks the effect of courser quantization. In accordance with ISO-IEC/JTC1/SC29 WG11/N0400 (better known as MPEG2 Test Model 5), April 1993, page 57, an adequate step size ⁇ m for die m-th (macro-)block is:
  • ⁇ _calculator 6 which takes block activity into account for die purpose of estimating the RD curve.
  • the ⁇ -calculator comprises an activity determining circuit 604 which calculates die activity A,,, of each block m from its pixel values, for example in accordance with page 57 of MPEG2 Test Model 5.
  • a correction circuit 605 modifies each DCT coefficient c- into an activity-corrected coefficient c' ⁇ in accordance widi die expression:
  • the circuit elements 600-603 are die same as in Fig. 3. As before, the histogram forming circuit 601 counts the number of occurrences of coefficients having the same amplitude. However, the coefficients originally having an amplitude n now have an amplitude n/f(A .
  • a more accurate RD curve can also be estimated if it is realised diat d actual MPEG encoder may encode selected (macro-)blocks of inter coded (P,B) pictures i the intra coding mode. This implies that an inter quantization matrix with weighting factor W e i is used to quantize d e majority of blocks of the inter coded picture, whereas a differ intra quantization matrix widi weighting factors W is used to quantize intra blocks.
  • Fig. 7 shows a yet further embodiment of ⁇ -calculator 6 which takes t intra coding mode of blocks of P- or B-pictures into account for the purpose of estimating RD curve.
  • the ⁇ -calculator comprises a switch 606 which is activated by a switching sign S when a block to be intra coded is received.
  • the signal S is generated by a pre-analysis circuit 607 which determines whe ier inter of intra coding of a block is more efficient.
  • a multiplier 608 modifies each DCT coefficient c t of intra blocks into a coding-mode-corrected coefficient c', in accordance widi die expression:
  • the circuit elements 600-603 are die same as in Fig. 3. As before, die histogram forming circuit 601 counts d e coefficients having the same amplitude. However, d e coefficients o intra coded blocks originally having an amplitude n now have an amplitude n*(W e i /W a i ). RD estimating circuit 602 calculates a plurality of (R,D) points for different step sizes. Applying step size ⁇ to amplitude n*(W e ,/W lti ) has die same effect as applying d e step si to amplitude n. The effect of applying the overall weighting factor W e , t all blocks of the P- or B-picture is thus cancelled for intra blocks and replaced by applyin d e intra weighting factor W a i .
  • Fig. 8 shows a scale of possible values of DCT coefficients Cj.
  • Said quantization levels are spaced apart by a step size ⁇ ⁇ as described before in response to d e step size ⁇ which is applied by bitrate control circuit 5 (see Fig. 2).
  • Fig. shows a flow chart of operations performed by die quantizer on coefficient c 4 .
  • d e value of c is imaged at me most proximate quantization level r k .
  • the "Lagrangian cost" L is subsequently computed for this quantization level r k in accorda wid the formula:
  • (C j -r 2 is a measure of me distortion which is a result of die approximation of c, by r , and R k is the number of bits required for transmitting .
  • the computed Lagrangian cost L is saved as L ⁇ .
  • Subsequendy, d e coefficient c 4 is imaged at a lower quantization level r k .,.
  • Lagrangian cost L is computed for this new quantization level.
  • diis Lagrangian cost L is smaller than L ⁇ . If diis is die case, then the lower quantization level r k . j is apparently more favourable in terms of rate-distortion d an r k . In other words, die distortion increases but die gain in the number of bits is more important.
  • the quantizer dien performs the steps 72-75 again so as to check whether an even lower quantization level is still more favourable.
  • the search for the minimum Lagrangian cost in this way is discontinued as soon as it has been found in step 75 that L increases again.
  • the quantization level corresponding to L ⁇ is subsequendy selected in a step 76 for quantization of die coefficient c,.
  • the algorithm shown in Fig. 9 is performed for all coefficients c 4 of a picture block.
  • diat die level 0 may be reached when a lower quantization level is chosen (step 73).
  • zero coefficients are not coded diemselves but are included in die code for the next non-zero coefficient.
  • the consequences thereof for the number of bits are included in die term R k of the formula for L (step 74).
  • the quantization process described above is referred to as bitrate- constrained quantization of DCT coefficients. It provides a significant improvement of the picture quality in comparison widi conventional MPEG coding in which die most proximate quantization level is coded and transmitted for each coefficient. A further improvement of die picture quality is obtained by subsequently subjecting die coefficients q s dius obtained to diresholding. As already previously noted, thresholding is understood to mean mat selected coefficients are rendered zero.
  • Fig. 10 shows a flow chart of the operations which are performed for this purpose on each non-zero coefficient O j . In a step 80, die Lagrangian cost LI is computed for this coefficient q ⁇ 0 given die current value of ⁇ . It holds for LI diat:
  • ⁇ Rj is the number of bits which is saved by rendering qj zero (the coding of q—0 i included in die coding of d e next non-zero coefficient).
  • LI > L2 the number of bits which is saved by rendering qj zero (the coding of q—0 i included in die coding of d e next non-zero coefficient).
  • the thresholding algorithm shown in Fig. 10 determines once per coefficient whether it is more favourable to maintain mis coefficient or render it zero. This is considerably simpler and less intensive in computation than the algorithm which is described in the afore-mentioned article "Rate-Distortion Optimal Fast Thresholding widi complete JPEG/MPEG Decoder Compatibility" which computes all possible thresholding options and selects the most favourable of em. Nevertheless, die form of thresholding described yields a considerable improvement of the picture quality. It has particularly been found diat the combination of bitrate- constrained quantization and thresholding significantly improves the picture quality.
  • the calculated value of ⁇ can also be used, inter alia, to select optimal motion vectors, to select between field or frame coding mode, to select one from a plurality of prediction modes (forward, backward), etc.
  • the picture quality of an MPEG-coded video signal can be improved considerably by modifying selected coefficients after conventional quantization, for example by rendering d em zero.
  • the modification is such d at the Lagrangian cost D+ ⁇ R (D is distortion, R is bitrate) is minimal for a given value of a Lagrange multiplier ⁇ .
  • D distortion
  • R bitrate
  • a relatively simple process is disclosed in which the value of ⁇ is calculated by means of statistical analysis (6) of the picture to be coded.
  • the statistical analysis comprises the estimation of the RD curve on die basis of the amplitude distribution of die coefficients.
  • the searched ⁇ is the derivative of diis curve at the desired bitrate.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

On peut améliorer considérablement la qualité d'image d'un signal vidéo codé par MPEG par modification de coefficients sélectionnés après avoir effectué une quantification classique, par exemple, en les mettant à zéro. La modification est telle que l'évaluation de Lagrange D+μR (D représente la distorsion, R représente le débit binaire) est minimum pour une valeur donnée d'un multiplicateur de Lagrange μ. L'invention décrit un procédé relativement simple d'après lequel on calcule la valeur de μ au moyen d'une analyse statistique (6) de l'image à coder. L'analyse statistique consiste à évaluer la courbe RD sur la base de la répartition d'amplitude des coefficients. Le μ recherché est la dérivée de cette courbe au débit binaire souhaité.
PCT/IB1996/000348 1995-04-25 1996-04-18 Dispositif et procede de codage d'images video Ceased WO1996034495A2 (fr)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU51608/96A AU705914B2 (en) 1995-04-25 1996-04-18 Device and method for coding video pictures
EP96908305A EP0768007B1 (fr) 1995-04-25 1996-04-18 Dispositif et procede de codage d'images video
JP8532314A JPH10502791A (ja) 1995-04-25 1996-04-18 映画像符号化装置および方法
DE69605523T DE69605523T2 (de) 1995-04-25 1996-04-18 Vorrichtung und verfahren zur kodierung von videobildern.

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP95201055 1995-04-25
EP95201055.1 1995-04-25

Publications (2)

Publication Number Publication Date
WO1996034495A2 true WO1996034495A2 (fr) 1996-10-31
WO1996034495A3 WO1996034495A3 (fr) 1997-01-16

Family

ID=8220221

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB1996/000348 Ceased WO1996034495A2 (fr) 1995-04-25 1996-04-18 Dispositif et procede de codage d'images video

Country Status (7)

Country Link
US (1) US5691770A (fr)
EP (1) EP0768007B1 (fr)
JP (1) JPH10502791A (fr)
CN (1) CN1157080A (fr)
AU (1) AU705914B2 (fr)
DE (1) DE69605523T2 (fr)
WO (1) WO1996034495A2 (fr)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0859520A1 (fr) * 1997-02-13 1998-08-19 Rockwell Semiconductor Systems Codage des signaux vidéo à quantification adaptive
EP1940176A3 (fr) * 2006-12-28 2008-07-09 Siemens Aktiengesellschaft Procédé et dispositif de codage d'une séquence d'images numérisées
WO2008080881A1 (fr) * 2006-12-28 2008-07-10 Siemens Aktiengesellschaft Procédé et dispositif de codage d'une séquence d'images numérisées
WO2008081461A3 (fr) * 2007-01-03 2009-05-14 Human Monitoring Ltd Compression d'images basée sur une insuffisance entropique
WO2009089370A1 (fr) * 2008-01-08 2009-07-16 Qualcomm Incorporated Quantification basée sur une modélisation du compromis débit/distorsion pour codeurs cabac
EP1708510A3 (fr) * 2005-03-29 2009-09-02 Fujitsu Limited Appareil de codage d'image
WO2010033565A1 (fr) * 2008-09-16 2010-03-25 Dolby Laboratories Licensing Corporation Commande adaptative de codeur vidéo
US7822118B2 (en) 2002-11-08 2010-10-26 Apple Inc. Method and apparatus for control of rate-distortion tradeoff by mode selection in video encoders
US8208536B2 (en) 2005-04-28 2012-06-26 Apple Inc. Method and apparatus for encoding using single pass rate controller
RU2483468C2 (ru) * 2008-02-21 2013-05-27 Квэлкомм Инкорпорейтед Двухпроходное квантование для кодеров, реализующих контекстно-зависимое адаптивное двоичное арифметическое кодирование
US8634462B2 (en) 2007-03-13 2014-01-21 Matthias Narroschke Quantization for hybrid video coding
US8781002B2 (en) 2003-01-08 2014-07-15 Apple Inc. Method and apparatus for improved coding mode selection
USRE46777E1 (en) 2007-03-13 2018-04-03 Realtime Adaptive Streaming Llc Quantization for hybrid video coding
US11310505B2 (en) 2018-04-27 2022-04-19 Interdigital Vc Holdings, Inc. Method and apparatus for adaptive context modeling in video encoding and decoding

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1997017797A2 (fr) * 1995-10-25 1997-05-15 Sarnoff Corporation Appareil et procede d'evaluation du mouvement de blocs de longueur variable fondee sur une quadripartition
US5778192A (en) * 1995-10-26 1998-07-07 Motorola, Inc. Method and device for optimal bit allocation between different sources of information in digital video compression
US5909513A (en) * 1995-11-09 1999-06-01 Utah State University Bit allocation for sequence image compression
JPH10174103A (ja) * 1996-12-13 1998-06-26 Matsushita Electric Ind Co Ltd 画像符号化装置、符号化画像記録媒体、画像復号化装置、画像符号化方法、および符号化画像伝送方法
US6633609B1 (en) * 1996-12-24 2003-10-14 Intel Corporation Method and apparatus for bit rate control in a digital video environment for arbitrary bandwidth
US6263020B1 (en) * 1996-12-24 2001-07-17 Intel Corporation Method and apparatus for bit rate control in a digital video system
DE69832469T2 (de) * 1997-02-03 2006-07-13 Sharp K.K. Mit embedded coding arbeitender bildcodierer mit ratenstörungsoptimierung
KR100249223B1 (ko) * 1997-09-12 2000-03-15 구자홍 엠팩(mpeg)-4의움직임벡터코딩방법
US6118822A (en) * 1997-12-01 2000-09-12 Conexant Systems, Inc. Adaptive entropy coding in adaptive quantization framework for video signal coding systems and processes
US6037987A (en) * 1997-12-31 2000-03-14 Sarnoff Corporation Apparatus and method for selecting a rate and distortion based coding mode for a coding system
US6122321A (en) * 1998-05-12 2000-09-19 Hitachi America, Ltd. Methods and apparatus for reducing the complexity of inverse quantization operations
US6263021B1 (en) * 1998-09-18 2001-07-17 Sarnoff Corporation Treating non-zero quantized transform coefficients as zeros during video compression processing
EP1005233A1 (fr) * 1998-10-12 2000-05-31 STMicroelectronics S.r.l. Contrôle de codage à débit constant dans un codeur vidéo par une préanalyse des tranches d'images
US6167162A (en) * 1998-10-23 2000-12-26 Lucent Technologies Inc. Rate-distortion optimized coding mode selection for video coders
CN1159916C (zh) * 1999-06-22 2004-07-28 日本胜利株式会社 活动图象编码装置及其方法
US7096481B1 (en) 2000-01-04 2006-08-22 Emc Corporation Preparation of metadata for splicing of encoded MPEG video and audio
US6600836B1 (en) * 2000-01-28 2003-07-29 Qualcomm, Incorporated Quality based image compression
US6871006B1 (en) 2000-06-30 2005-03-22 Emc Corporation Processing of MPEG encoded video for trick mode operation
US6771703B1 (en) * 2000-06-30 2004-08-03 Emc Corporation Efficient scaling of nonscalable MPEG-2 Video
US6937770B1 (en) 2000-12-28 2005-08-30 Emc Corporation Adaptive bit rate control for rate reduction of MPEG coded video
US7023924B1 (en) 2000-12-28 2006-04-04 Emc Corporation Method of pausing an MPEG coded video stream
US6980594B2 (en) 2001-09-11 2005-12-27 Emc Corporation Generation of MPEG slow motion playout
US6959116B2 (en) * 2001-09-18 2005-10-25 Emc Corporation Largest magnitude indices selection for (run, level) encoding of a block coded picture
US6968091B2 (en) * 2001-09-18 2005-11-22 Emc Corporation Insertion of noise for reduction in the number of bits for variable-length coding of (run, level) pairs
US8406301B2 (en) * 2002-07-15 2013-03-26 Thomson Licensing Adaptive weighting of reference pictures in video encoding
WO2004006188A2 (fr) * 2002-07-09 2004-01-15 Aware, Inc. Technique iterative de commande de parametres de compression destinee a des images
US7106907B2 (en) * 2002-10-18 2006-09-12 Mitsubishi Electric Research Laboratories, Inc Adaptive error-resilient video encoding using multiple description motion compensation
EP1933569A3 (fr) 2002-11-08 2010-07-07 Apple Inc. Encodeur et procédé de contrôle de la distorsion du taux avec multiplicateur de Lagrange et/ou value de quantificateur
CN1708991B (zh) * 2002-11-08 2010-08-18 苹果公司 利用拉格朗日乘子和视觉掩蔽来控制速率-失真折中的方法和装置
CN101651831B (zh) * 2003-01-08 2013-07-17 苹果公司 用于改进的编码模式选择的方法和装置
US7672523B2 (en) * 2003-03-03 2010-03-02 The Hong Kong University Of Science And Technology Efficient rate allocation for multi-resolution coding of data
US7327786B2 (en) * 2003-06-02 2008-02-05 Lsi Logic Corporation Method for improving rate-distortion performance of a video compression system through parallel coefficient cancellation in the transform
WO2004109586A1 (fr) 2003-06-05 2004-12-16 Aware, Inc. Techniques de controle de qualite d'images
JP2005004866A (ja) * 2003-06-11 2005-01-06 Sony Corp 情報処理装置および方法、記録媒体、並びにプログラム
US7280597B2 (en) * 2003-06-24 2007-10-09 Mitsubishi Electric Research Laboratories, Inc. System and method for determining coding modes, DCT types and quantizers for video coding
KR20050053297A (ko) * 2003-12-02 2005-06-08 학교법인 성균관대학 동영상 부호화 방법 및 장치
US7746927B1 (en) * 2004-03-26 2010-06-29 Apple Inc. Robust single-pass variable bit rate encoding
US8005139B2 (en) 2004-06-27 2011-08-23 Apple Inc. Encoding with visual masking
KR100631714B1 (ko) 2004-06-30 2006-10-09 엘지전자 주식회사 휴대단말기의 개선된 영상신호 레이트 콘트롤 장치 및 방법
US8296649B2 (en) * 2005-05-31 2012-10-23 Sorenson Media, Inc. Method, graphical interface and computer-readable medium for generating a preview of a reformatted preview segment
US7885979B2 (en) * 2005-05-31 2011-02-08 Sorenson Media, Inc. Method, graphical interface and computer-readable medium for forming a batch job
US7975219B2 (en) * 2005-05-31 2011-07-05 Sorenson Media, Inc. Method, graphical interface and computer-readable medium for reformatting data
US7423642B2 (en) * 2005-12-14 2008-09-09 Winbond Electronics Corporation Efficient video frame capturing
US7944965B2 (en) * 2005-12-19 2011-05-17 Seiko Epson Corporation Transform domain based distortion cost estimation
KR101377833B1 (ko) 2006-08-30 2014-03-26 톰슨 라이센싱 분석적이고 경험적인 하이브리드 인코딩 왜곡 모델링을 위한 방법 및 장치
CN101595736B (zh) * 2006-12-15 2013-04-24 汤姆森特许公司 失真度估算
US20080273113A1 (en) * 2007-05-02 2008-11-06 Windbond Electronics Corporation Integrated graphics and KVM system
US9338463B2 (en) 2011-10-06 2016-05-10 Synopsys, Inc. Visual quality measure for real-time video processing
US9781449B2 (en) * 2011-10-06 2017-10-03 Synopsys, Inc. Rate distortion optimization in image and video encoding

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR0176474B1 (ko) * 1992-12-01 1999-04-15 김광호 고속화상 재생에 적합한 디지탈 비디오 테이프레코더용 데이타 코딩기술
US5550590A (en) * 1994-03-04 1996-08-27 Kokusai Denshin Denwa Kabushiki Kaisha Bit rate controller for multiplexer of encoded video

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
IEEE TRANSACTIONS ON IMAGE PROCESSING, Volume 3, No. 5, Sept. 1994, KANNAN RAMCHANDRAN et al., "Rate-Distortion Optimal Fast Thresholding with Complete JPEG/MPEG Decoder Compatibility", pages 700-704. *
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, Volume 1-7, No. 1, 1985, G. STEPHEN ZABELE et al., "Fourier Encoding of Closed Planar Boundaries", pages 98-102. *
INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, Volume III, October 1995, (Washington DC), JUNGWOO LEE, "Optimal Quadtree for Variable Block Size Motion Estimation", pages 480-483. *
PROCEEDINGS OF THE SPIE - THE INTERNATIONAL SOCIETY FOR OPTICAL ENGINEERING CONFERENCE, TITLE: PROC. SPIE INT. SOC. OPT. ENG. (USA), Vol. 2727, Pt. 2. Conference Date: 17-20 March 1996, SCHUSTER G.M., KATSAGGELOS A.K., "Fast and Efficient Mode and Quantizer Selection in Rate Distortion Sense for H. 263", pages 784-95. *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0859520A1 (fr) * 1997-02-13 1998-08-19 Rockwell Semiconductor Systems Codage des signaux vidéo à quantification adaptive
US8355436B2 (en) 2002-11-08 2013-01-15 Apple Inc. Method and apparatus for control of rate-distortion tradeoff by mode selection in video encoders
US7822118B2 (en) 2002-11-08 2010-10-26 Apple Inc. Method and apparatus for control of rate-distortion tradeoff by mode selection in video encoders
US8781002B2 (en) 2003-01-08 2014-07-15 Apple Inc. Method and apparatus for improved coding mode selection
US7646926B2 (en) 2005-03-29 2010-01-12 Fujitsu Limited Image encoding apparatus
EP1708510A3 (fr) * 2005-03-29 2009-09-02 Fujitsu Limited Appareil de codage d'image
US8208536B2 (en) 2005-04-28 2012-06-26 Apple Inc. Method and apparatus for encoding using single pass rate controller
WO2008080881A1 (fr) * 2006-12-28 2008-07-10 Siemens Aktiengesellschaft Procédé et dispositif de codage d'une séquence d'images numérisées
EP1940176A3 (fr) * 2006-12-28 2008-07-09 Siemens Aktiengesellschaft Procédé et dispositif de codage d'une séquence d'images numérisées
US8467617B2 (en) 2007-01-03 2013-06-18 Human Monitoring Ltd. Compressing high resolution images in a low resolution video
US8019167B2 (en) 2007-01-03 2011-09-13 Human Monitoring Ltd. Compressing high resolution images in a low resolution video
WO2008081461A3 (fr) * 2007-01-03 2009-05-14 Human Monitoring Ltd Compression d'images basée sur une insuffisance entropique
US8634462B2 (en) 2007-03-13 2014-01-21 Matthias Narroschke Quantization for hybrid video coding
USRE46777E1 (en) 2007-03-13 2018-04-03 Realtime Adaptive Streaming Llc Quantization for hybrid video coding
US8891615B2 (en) 2008-01-08 2014-11-18 Qualcomm Incorporated Quantization based on rate-distortion modeling for CABAC coders
WO2009089370A1 (fr) * 2008-01-08 2009-07-16 Qualcomm Incorporated Quantification basée sur une modélisation du compromis débit/distorsion pour codeurs cabac
US9008171B2 (en) 2008-01-08 2015-04-14 Qualcomm Incorporated Two pass quantization for CABAC coders
RU2475999C2 (ru) * 2008-01-08 2013-02-20 Квэлкомм Инкорпорейтед Квантование, основанное на моделировании искажения скорости передачи, для кодеров cabac
RU2483468C2 (ru) * 2008-02-21 2013-05-27 Квэлкомм Инкорпорейтед Двухпроходное квантование для кодеров, реализующих контекстно-зависимое адаптивное двоичное арифметическое кодирование
WO2010033565A1 (fr) * 2008-09-16 2010-03-25 Dolby Laboratories Licensing Corporation Commande adaptative de codeur vidéo
KR101271512B1 (ko) * 2008-09-16 2013-06-05 돌비 레버러토리즈 라이쎈싱 코오포레이션 적응형 비디오 인코더 제어 방법
US8654835B2 (en) 2008-09-16 2014-02-18 Dolby Laboratories Licensing Corporation Adaptive video encoder control
US11310505B2 (en) 2018-04-27 2022-04-19 Interdigital Vc Holdings, Inc. Method and apparatus for adaptive context modeling in video encoding and decoding

Also Published As

Publication number Publication date
AU5160896A (en) 1996-11-18
US5691770A (en) 1997-11-25
AU705914B2 (en) 1999-06-03
EP0768007B1 (fr) 1999-12-08
EP0768007A2 (fr) 1997-04-16
DE69605523T2 (de) 2000-07-06
CN1157080A (zh) 1997-08-13
DE69605523D1 (de) 2000-01-13
WO1996034495A3 (fr) 1997-01-16
JPH10502791A (ja) 1998-03-10

Similar Documents

Publication Publication Date Title
EP0768007B1 (fr) Dispositif et procede de codage d'images video
US5768436A (en) Device and method for coding video pictures
US5638126A (en) Method and apparatus for deciding quantization parameter
US5933194A (en) Method and circuit for determining quantization interval in image encoder
EP0857394B1 (fr) Dispositif et procede d'optimisation de commande de debit dans un systeme de codage
EP2130380B1 (fr) Contrôle de la quantité de données compressées
US6546049B1 (en) Parameterized quantization matrix adaptation for video encoding
US6690833B1 (en) Apparatus and method for macroblock based rate control in a coding system
US8259793B2 (en) System and method of fast MPEG-4/AVC quantization
US7218674B2 (en) Method and device for controlling quantization scales of a video encoding bit stream
US20010033139A1 (en) Digital image coding system having self-adjusting selection criteria for selecting a transform function
WO1999003279A1 (fr) Appareil et procede de regulation de debit en fonction de l'objet dans un systeme de codage
US6252905B1 (en) Real-time evaluation of compressed picture quality within a digital video encoder
WO1998037701A1 (fr) Appareil et procede destines a optimiser la regulation de vitesse dans un systeme de codage
JPH09307904A (ja) 映像信号符号化システム用量子化器
EP1516493B1 (fr) Procede et systeme d'optimisation de la nettete d'images pendant le codage et l'amelioration d'images
US20020009146A1 (en) Adaptively encoding a picture of contrasted complexity having normal video and noisy video portions
WO1997015146A1 (fr) Procede pour coder des images video
JP4295828B2 (ja) 領域情報で符号化する方法
US20100158108A1 (en) System and method for video encoding
KR101956443B1 (ko) 매크로블록 레벨 비트 할당 방법 및 시스템
Dai et al. Intraframe bitrate prediction method for H. 264/AVC
Hu Bit rate control for real-time multipoint video conferencing

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 96190638.3

Country of ref document: CN

AK Designated states

Kind code of ref document: A2

Designated state(s): AU CN JP

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1996908305

Country of ref document: EP

AK Designated states

Kind code of ref document: A3

Designated state(s): AU CN JP

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1996908305

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 1996908305

Country of ref document: EP