[go: up one dir, main page]

US20110310968A1 - Method and apparatus for determining a second picture for temporal direct-mode block prediction - Google Patents

Method and apparatus for determining a second picture for temporal direct-mode block prediction Download PDF

Info

Publication number
US20110310968A1
US20110310968A1 US13/220,933 US201113220933A US2011310968A1 US 20110310968 A1 US20110310968 A1 US 20110310968A1 US 201113220933 A US201113220933 A US 201113220933A US 2011310968 A1 US2011310968 A1 US 2011310968A1
Authority
US
United States
Prior art keywords
reference picture
picture
current block
index
inter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/220,933
Inventor
Lowell L. Winger
Simon Booth
Elliot N. Linzer
Ho-Ming Leung
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US13/220,933 priority Critical patent/US20110310968A1/en
Publication of US20110310968A1 publication Critical patent/US20110310968A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/423Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • the present invention relates to video generally and, more particularly, to a method and apparatus for determining a second picture for temporal direct-mode block prediction.
  • the H.264/MPEG4-AVC video standard allows multiple different reference pictures for inter-prediction.
  • the different reference pictures are potentially signaled down to 8 ⁇ 8 partitions which reference a picture to use for inter-prediction.
  • the standard also allows the choice, in a flexible manner, of which reference pictures to use, and the order in which the reference pictures, are available for any given slice (i.e., a group of macroblocks) of video.
  • a direct-mode block is a bi-predictive predicted block in a B-frame that does not signal either references or motion vectors. Rather, references and motion vectors are derived from a co-located block in a previously decoded picture.
  • the overhead of the derived block mode is very low and provides a very important prediction mode that is often used to significantly reduce the rate of B-frames.
  • the reference pictures for each slice of video are arranged into two ordered lists (i.e., List 0 and List 1 ).
  • List 0 and List 1 For bi-predictive and direct-mode predicted blocks, one picture from each list should be indicated for use for inter-prediction by two reference-indices (one into each list) indicating an ordered number of one of the reference pictures from each list.
  • Previous H.264 implementations of direct-modes use the following sequence to determine which two current reference pictures should be used for inter-prediction of each block of direct-mode block.
  • Previous H.264 implementations find the co-located picture (i.e., reference 0 , the first reference picture from List 1 ) and block for the current block. This co-located picture will be the first reference picture used for direct-mode prediction.
  • the co-located block will be used to derive the reference indices and motion vectors for the current block.
  • previous H.264 implementations determine the List 0 reference picture that is used by the co-located block to refer to a ‘direct-mode reference’.
  • the reference index in the co-located picture of this direct mode reference is called the direct-mode reference index.
  • the direct mode reference index is used by the current block to determine the second reference picture to use for inter prediction. Specifically, the direct-mode reference index is directly used in the reference picture list of the current slice. Finally, the motion vectors for the current block are interpolated from the motion vectors used in the co-located block according to the temporal distances between the current picture and the two reference pictures.
  • the second reference picture does not necessarily refer to the same physical reference picture for direct-prediction that was used for inter-prediction by the co-located block.
  • the reference picture used in the co-located block and the second picture used in the direct-mode prediction of the current block are the same physical picture only if the direct mode reference picture was present in the same position (i) in List 0 of the current slice of the current picture being decoded and (ii) in List 0 of the co-located slice of the co-located picture.
  • the intent of direct-mode prediction is that it uses the physical reference picture used by the co-located block as a reference picture for the current block.
  • Reference picture re-ordering is the ability to flexibly order reference lists for each slice to use different pictures that are best inter-predicted from various other previously encoded/decoded pictures. If the encoder has the ability to specify which pictures are best for the current picture, then prediction residuals may be reduced.
  • a particular example of where the ability to re-order reference pictures is useful is to adaptively choose whether to code an I or P-picture as two fields (the second of which is inter-predicted from the first) or as a single picture without inter-prediction between fields.
  • the reference pictures may be re-ordered between the current picture and the co-located picture such that the same reference picture does not occur in the same position in the respective List 0 .
  • the direct-mode prediction could be seriously compromised with the existing solution since the intended use of the direct-mode is that the same reference picture would be used.
  • the present invention concerns a method for determining a first and a second reference picture used for inter-prediction of a macroblock, comprising the steps of (A) finding a co-located picture and block, (B) determining a reference index, (C) mapping the reference index to a lowest valued reference index in a current reference list and (D) using the reference index to determine the second reference picture.
  • the objects, features and advantages of the present invention include providing a method and/or apparatus that may (i) determine a second picture for temporal direct-mode block prediction and/or (ii) map a reference index to a lowest valued reference index in a current reference list.
  • FIG. 1 is a flow diagram of an implementation of the present invention
  • FIG. 2 is a diagram illustrating an implementation of the present invention
  • FIG. 3 is a partial block diagram of an example implementation of an encoder apparatus.
  • FIG. 4 is a partial block diagram of an example implementation of a decoder apparatus.
  • the present invention may determine which two current reference pictures should be used for inter-prediction of each block of an H.264 direct-mode macroblock.
  • the flow diagram 100 generally comprises a state 102 , a state 104 , a state 106 and a state 108 .
  • the state 102 finds the co-located (e.g., reference 0 , a first reference picture from List 1 ) picture and block for the current block being processed.
  • the co-located picture will be the first reference picture used for direct-mode prediction.
  • the state 104 determines the reference picture that was used by the co-located block to refer to a ‘direct-mode reference’ (i) in reference List 0 of the co-located slice (if a reference picture from List 0 was used for inter-prediction of the co-located macroblock), or (ii) if no List 0 reference picture was used, reference List 1 of the co-located slice (if a reference picture from List 1 was used for inter prediction of the co-located macroblock).
  • the state 106 maps the reference picture from the state 104 to the lowest valued reference index in the current reference List 0 .
  • the state 106 references the same reference picture that was referenced by the co-located picture in the state 104 .
  • the state 108 provides the List 0 reference index found in the state 106 (e.g., the reference index is normally generated in response to remapping).
  • the reference index is generally used with the reference list 0 of the current slice to determine the second reference picture to be used for inter-prediction.
  • the state 106 searches the current List 0 to determine the lowest valued reference index referring to the same reference picture that was referred by the co-located picture.
  • the step 106 implements a number of operations and data not outlined in the current H.264 specification.
  • a unique identifier for each reference picture is stored.
  • the unique identifier is normally correctly associated from the unique identifier that was used as an inter-reference in the co-located picture.
  • the unique identifier is made available as a potential List 0 inter-reference for the current picture.
  • a unique identifier to the actual ‘direct-mode reference picture’ is stored.
  • a module searches the current reference List 0 for the lowest valued reference index identified by the unique identifier and return the value of that reference index.
  • a search in the current reference List 0 provides the potential for increasing coding efficiency of B-frames and provides flexibility to the encoder to be able to use a truly interpolative direct-mode prediction along with an arbitrary choice for the picture referred by the first reference index (index 0 ) of List 0 . These two options were mutually exclusive in the previous H.264 implementations discussed in the background section.
  • the flow diagram 100 ′ generally comprises a block (or circuit) 112 , a block (or circuit) 114 , a block (or circuit) 116 , a block (or circuit) 118 and a block (or circuit) 119 .
  • the block 112 is shown implementing an encoder/decoder signal to construct direct-mode prediction.
  • the block 114 generally sets a co-located picture (e.g., COLPIC) as equal to a value in the List 1 [ 0 ] (e.g., List 1 at index 0 ).
  • the block 116 generally determines whether the second picture is a picture from either List 0 or a List 1 .
  • the block 118 finds the index in the current List 0 that refers to “other picture”.
  • the state 119 creates an interprediction by the weighted average of pixels of COLPIC and OTHERPIC.
  • the present invention may be particularly useful under circumstances when accurate direct-mode prediction is useful. Having a low-overhead/efficient reference to a reference frame other than the two pictures that yield the interpolative direct-mode prediction is desirable.
  • the first entry in List 1 of the current B-picture is generally chosen to give a good direct-mode prediction.
  • Higher compression may be achieved by using a picture other than the index 0 entry in List 0 of the co-located picture as the index 0 entry in List 0 of the current picture.
  • Another feature of the present invention is the choice of the order of pictures in the reference lists of the current picture being decoupled and independent from the choice of the order of reference pictures in the co-located picture. Decoupling the reference pictures potentially significantly simplifies the design of an encoder incorporating the present invention. For example, the lists of the co-located picture need not be taken into account when designing the lists for the current picture.
  • a unique reference index is normally found for the second reference picture for direct-mode predicted blocks.
  • the lowest valued index in List 0 is specifically chosen to use the same physical reference frame.
  • the encoder is not unnecessarily constrained to refer to the same physical frame with the same index in List 0 of the current picture as was used for the ‘direct-mode reference’ of the co-located picture.
  • a meaningful and useful direct-mode prediction that effectively finds the current frame as a temporally interpolated intermediate estimate between the co-located picture and a corresponding ‘direct-mode L 0 reference’.
  • the index 0 of List 0 may be chosen to maximize the coding efficiency of the reference indices rather than to make sure that a good direct-mode prediction is available.
  • the reference index prediction and context-based coding may be improved by using the LOWEST possible reference index.
  • Reference indices will often be ordered from most-frequent to least-frequent in expected occurrence frequency.
  • the predicted index entropy should be expected to be reduced with the present invention.
  • the present invention may be implemented in all H.264 compliant decoders. While encoding may be implemented using the techniques described in the background section, the efficiency of such a system is generally reduced when compared with the present invention. Advanced encoders may realize a benefit by exploiting the improved flexibility possible with the use of the present invention.
  • a content-addressable-memory may provide an efficient hardware structure for implementing the present invention.
  • the present invention may also be implemented in software with a ‘for’ loop search beginning at index 0 of List 0 and proceeding towards the end of List 0 . Such a software implementation may exit early from the loop when the desired reference frame is found.
  • the present invention may be implemented with a variable that holds the lowest reference index in the current List 0 that refers to the ‘direct-mode L 0 reference picture’. Such a variable would be set once before the decoding of the temporal direct-mode macroblocks of each new slice (e.g., if temporal direct rather than spatial direct is used for the slice) using the software table search ‘for’ loop method mentioned above.
  • the encoder apparatus 120 may be implemented as a video bitstream encoder apparatus or system.
  • the encoder apparatus 120 generally comprises a circuit 122 , a circuit 124 , a circuit 126 and a memory 128 .
  • the circuit 122 may receive a bitstream or signal (e.g., TIN).
  • a bitstream or signal (e.g., TOUT) may be generated by the circuit 126 .
  • the memory 128 may hold the List 0 and the List 1 for each of the reference index values.
  • the circuit 122 may be implemented as a compression circuit or module.
  • the compression circuit 122 may be operational to compress the blocks within the signal TIN thereby generating motion vectors. Compression may be determined by a signal (e.g., PRED) received from the circuit 124 .
  • a signal e.g., MV
  • a signal may exchange motion vectors between the compression circuit 122 and the memory 128 .
  • the motion vectors may be written to the memory 128 .
  • the motion vectors may be read from the memory 128 .
  • the circuit 124 may be implemented as a code control circuit.
  • the circuit 124 may generate the signal PRED conveying the prediction type used by the macroblocks.
  • the code control circuit 124 may also generate a signal (e.g., CNT).
  • the signal CNT may provide coding controls to the circuit 126 .
  • the circuit 126 may be implemented as a coding circuit.
  • the coding circuit 126 may be an entropy coding circuit.
  • the entropy coding circuit 126 may receive the blocks and the associated groups of motion vectors from the compression circuit 122 via a bitstream or signal (e.g., TBS).
  • the entropy coding circuit 126 may be configured to encode the signal TBS to generate the signal TOUT for transmission and/or storage.
  • the signal TOUT may be implemented as a Network Abstraction. Layer defined by the H.264 standard.
  • the memory 128 may be implemented as an external memory.
  • the memory 128 is generally operational to store the motion vectors for the blocks while the blocks are being encoded.
  • the memory 128 may be configured to store other data used for encoding the bitstream data.
  • Other types of memories may be implemented to meet the criteria of a particular application.
  • the decoder apparatus 130 may be implemented as a video bitstream decoder or system.
  • the decoder apparatus 130 generally comprises a circuit 132 , a circuit 134 , a circuit 136 and a memory 138 .
  • the circuit 132 may receive an input bitstream or signal (e.g., RIN).
  • the circuit 136 may generate an output bitstream or signal (e.g., ROUT).
  • the circuit 132 may be implemented as a decoder circuit.
  • the decoder circuit 132 may be implemented as an entropy decoder circuit 132 .
  • a decoded bitstream or signal (e.g., RBS) may be presented by the entropy decoder circuit 132 to the circuits 134 and 136 .
  • the circuit 134 may be implemented as a prediction circuit.
  • the prediction circuit 134 may be operational to determine if inter or intra prediction has been implemented for the various macroblocks of the pictures in the signal RBS.
  • the prediction circuit 134 may generate a command signal (e.g., CMD) to the circuit 136 indicating the prediction type.
  • CMD command signal
  • the circuit 136 may be implemented as a decompression circuit.
  • the decompression circuit 136 may examine the compressed groups to determine how the motion vectors should be used.
  • the decompression circuit 136 may store the motion vectors from decoded blocks that may be used for inferring motion vectors of co-located blocks the memory 128 via a signal (e.g., MV).
  • the stored motion vectors may be read from the memory 138 to calculate the motion vectors for B-slice blocks coded under the direct mode (e.g., no associated motion vectors were transmitted in the signal TOUT).
  • the direct mode generally refers to a macroblock or macroblock partition.
  • the inferred motion vectors may then be used in generating the signal ROUT.
  • the memory 138 may be implemented as an external memory.
  • the memory 138 is generally operational to store the motion vectors for the blocks for later use in calculating inferred motion vectors for the co-located blocks.
  • the memory 138 may be configured to store other data used for decoding the bitstream data. Other types of memories may be implemented to meet the criteria of a particular application.
  • the memory 138 may hold the List 0 and the List 1 for each of the reference index values.
  • the present invention may be implemented in decoders of the professional version (PExt) of the H.264 standard that use B-frames, and also for other future extensions of the H.264 standard.
  • PExt professional version of the H.264 standard that use B-frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

A method for determining a first and a second reference picture used for inter-prediction of a macroblock, comprising the steps of (A) finding a co-located picture and block, (B) determining a reference index, (C) mapping the reference index to a lowest valued reference index in a current reference list and (D) using the reference index to determine the second reference picture.

Description

  • This is a continuation of U.S. Ser. No. 10/785,273, filed Feb. 24, 2004, which is incorporated by reference.
  • FIELD OF THE INVENTION
  • The present invention relates to video generally and, more particularly, to a method and apparatus for determining a second picture for temporal direct-mode block prediction.
  • BACKGROUND OF THE INVENTION
  • The H.264/MPEG4-AVC video standard allows multiple different reference pictures for inter-prediction. The different reference pictures are potentially signaled down to 8×8 partitions which reference a picture to use for inter-prediction. The standard also allows the choice, in a flexible manner, of which reference pictures to use, and the order in which the reference pictures, are available for any given slice (i.e., a group of macroblocks) of video.
  • Such flexibility leaves the direct (i.e., spatial and temporal) block prediction modes open to a wide variety of different implementations. A direct-mode block is a bi-predictive predicted block in a B-frame that does not signal either references or motion vectors. Rather, references and motion vectors are derived from a co-located block in a previously decoded picture. The overhead of the derived block mode is very low and provides a very important prediction mode that is often used to significantly reduce the rate of B-frames.
  • The reference pictures for each slice of video are arranged into two ordered lists (i.e., List0 and List1). For bi-predictive and direct-mode predicted blocks, one picture from each list should be indicated for use for inter-prediction by two reference-indices (one into each list) indicating an ordered number of one of the reference pictures from each list.
  • Previous H.264 implementations of direct-modes use the following sequence to determine which two current reference pictures should be used for inter-prediction of each block of direct-mode block. First, previous H.264 implementations find the co-located picture (i.e., reference 0, the first reference picture from List1) and block for the current block. This co-located picture will be the first reference picture used for direct-mode prediction. Next, the co-located block will be used to derive the reference indices and motion vectors for the current block. Specifically, previous H.264 implementations determine the List0 reference picture that is used by the co-located block to refer to a ‘direct-mode reference’. The reference index in the co-located picture of this direct mode reference is called the direct-mode reference index. The direct mode reference index is used by the current block to determine the second reference picture to use for inter prediction. Specifically, the direct-mode reference index is directly used in the reference picture list of the current slice. Finally, the motion vectors for the current block are interpolated from the motion vectors used in the co-located block according to the temporal distances between the current picture and the two reference pictures.
  • Such an implementation has the disadvantage that the second reference picture does not necessarily refer to the same physical reference picture for direct-prediction that was used for inter-prediction by the co-located block. The reference picture used in the co-located block and the second picture used in the direct-mode prediction of the current block are the same physical picture only if the direct mode reference picture was present in the same position (i) in List0 of the current slice of the current picture being decoded and (ii) in List0 of the co-located slice of the co-located picture.
  • The intent of direct-mode prediction is that it uses the physical reference picture used by the co-located block as a reference picture for the current block. However, since H.264 supports reference picture re-ordering, this condition is not necessarily met. Reference picture re-ordering is the ability to flexibly order reference lists for each slice to use different pictures that are best inter-predicted from various other previously encoded/decoded pictures. If the encoder has the ability to specify which pictures are best for the current picture, then prediction residuals may be reduced.
  • A particular example of where the ability to re-order reference pictures is useful is to adaptively choose whether to code an I or P-picture as two fields (the second of which is inter-predicted from the first) or as a single picture without inter-prediction between fields. The reference pictures may be re-ordered between the current picture and the co-located picture such that the same reference picture does not occur in the same position in the respective List0. The direct-mode prediction could be seriously compromised with the existing solution since the intended use of the direct-mode is that the same reference picture would be used.
  • It would be desirable to identify reference index that spatial and temporal direct-mode prediction modes should use to reference the picture that was the primary reference of the co-located macroblock.
  • SUMMARY OF THE INVENTION
  • The present invention concerns a method for determining a first and a second reference picture used for inter-prediction of a macroblock, comprising the steps of (A) finding a co-located picture and block, (B) determining a reference index, (C) mapping the reference index to a lowest valued reference index in a current reference list and (D) using the reference index to determine the second reference picture.
  • The objects, features and advantages of the present invention include providing a method and/or apparatus that may (i) determine a second picture for temporal direct-mode block prediction and/or (ii) map a reference index to a lowest valued reference index in a current reference list.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • These and other objects, features and advantages of the present invention will be apparent from the following detailed description and the appended claims and drawings in which:
  • FIG. 1 is a flow diagram of an implementation of the present invention;
  • FIG. 2 is a diagram illustrating an implementation of the present invention;
  • FIG. 3 is a partial block diagram of an example implementation of an encoder apparatus; and
  • FIG. 4 is a partial block diagram of an example implementation of a decoder apparatus.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • Referring to FIG. 1, a flow diagram 100 of the present invention is shown. The present invention may determine which two current reference pictures should be used for inter-prediction of each block of an H.264 direct-mode macroblock. The flow diagram 100 generally comprises a state 102, a state 104, a state 106 and a state 108.
  • The state 102 finds the co-located (e.g., reference 0, a first reference picture from List1) picture and block for the current block being processed. The co-located picture will be the first reference picture used for direct-mode prediction. The state 104 determines the reference picture that was used by the co-located block to refer to a ‘direct-mode reference’ (i) in reference List0 of the co-located slice (if a reference picture from List0 was used for inter-prediction of the co-located macroblock), or (ii) if no List0 reference picture was used, reference List1 of the co-located slice (if a reference picture from List1 was used for inter prediction of the co-located macroblock). The state 106 maps the reference picture from the state 104 to the lowest valued reference index in the current reference List0. The state 106 references the same reference picture that was referenced by the co-located picture in the state 104. The state 108 provides the List0 reference index found in the state 106 (e.g., the reference index is normally generated in response to remapping). The reference index is generally used with the reference list0 of the current slice to determine the second reference picture to be used for inter-prediction.
  • The state 106 searches the current List0 to determine the lowest valued reference index referring to the same reference picture that was referred by the co-located picture. The step 106 implements a number of operations and data not outlined in the current H.264 specification.
  • First, a unique identifier for each reference picture is stored. The unique identifier is normally correctly associated from the unique identifier that was used as an inter-reference in the co-located picture. The unique identifier is made available as a potential List0 inter-reference for the current picture. Next, a unique identifier to the actual ‘direct-mode reference picture’ is stored. Next, a module (or method) searches the current reference List0 for the lowest valued reference index identified by the unique identifier and return the value of that reference index.
  • A search in the current reference List0 provides the potential for increasing coding efficiency of B-frames and provides flexibility to the encoder to be able to use a truly interpolative direct-mode prediction along with an arbitrary choice for the picture referred by the first reference index (index0) of List0. These two options were mutually exclusive in the previous H.264 implementations discussed in the background section.
  • Referring to FIG. 2, a diagram 100′ is shown in accordance with the present invention. The flow diagram 100′ generally comprises a block (or circuit) 112, a block (or circuit) 114, a block (or circuit) 116, a block (or circuit) 118 and a block (or circuit) 119. The block 112 is shown implementing an encoder/decoder signal to construct direct-mode prediction. The block 114 generally sets a co-located picture (e.g., COLPIC) as equal to a value in the List1 [0] (e.g., List1 at index 0). The block 116 generally determines whether the second picture is a picture from either List0 or a List1. The block 118 finds the index in the current List0 that refers to “other picture”. The state 119 creates an interprediction by the weighted average of pixels of COLPIC and OTHERPIC.
  • The present invention may be particularly useful under circumstances when accurate direct-mode prediction is useful. Having a low-overhead/efficient reference to a reference frame other than the two pictures that yield the interpolative direct-mode prediction is desirable. For example, the first entry in List1 of the current B-picture is generally chosen to give a good direct-mode prediction. Higher compression may be achieved by using a picture other than the index 0 entry in List0 of the co-located picture as the index 0 entry in List0 of the current picture.
  • Another feature of the present invention is the choice of the order of pictures in the reference lists of the current picture being decoupled and independent from the choice of the order of reference pictures in the co-located picture. Decoupling the reference pictures potentially significantly simplifies the design of an encoder incorporating the present invention. For example, the lists of the co-located picture need not be taken into account when designing the lists for the current picture.
  • A unique reference index is normally found for the second reference picture for direct-mode predicted blocks. The lowest valued index in List0 is specifically chosen to use the same physical reference frame. The encoder is not unnecessarily constrained to refer to the same physical frame with the same index in List0 of the current picture as was used for the ‘direct-mode reference’ of the co-located picture. A meaningful and useful direct-mode prediction that effectively finds the current frame as a temporally interpolated intermediate estimate between the co-located picture and a corresponding ‘direct-mode L0 reference’. For example, the index 0 of List0 may be chosen to maximize the coding efficiency of the reference indices rather than to make sure that a good direct-mode prediction is available.
  • The reference index prediction and context-based coding (e.g., with CABAC entropy-coding) may be improved by using the LOWEST possible reference index. Reference indices will often be ordered from most-frequent to least-frequent in expected occurrence frequency. The predicted index entropy should be expected to be reduced with the present invention.
  • The present invention may be implemented in all H.264 compliant decoders. While encoding may be implemented using the techniques described in the background section, the efficiency of such a system is generally reduced when compared with the present invention. Advanced encoders may realize a benefit by exploiting the improved flexibility possible with the use of the present invention.
  • A content-addressable-memory (CAM) may provide an efficient hardware structure for implementing the present invention. The present invention may also be implemented in software with a ‘for’ loop search beginning at index 0 of List0 and proceeding towards the end of List0. Such a software implementation may exit early from the loop when the desired reference frame is found. In other implementations, the present invention may be implemented with a variable that holds the lowest reference index in the current List0 that refers to the ‘direct-mode L0 reference picture’. Such a variable would be set once before the decoding of the temporal direct-mode macroblocks of each new slice (e.g., if temporal direct rather than spatial direct is used for the slice) using the software table search ‘for’ loop method mentioned above.
  • Referring to FIG. 3, a partial block diagram of an example implementation of an encoder apparatus 120 is shown. The encoder apparatus 120 may be implemented as a video bitstream encoder apparatus or system. The encoder apparatus 120 generally comprises a circuit 122, a circuit 124, a circuit 126 and a memory 128. The circuit 122 may receive a bitstream or signal (e.g., TIN). A bitstream or signal (e.g., TOUT) may be generated by the circuit 126. The memory 128 may hold the List0 and the List1 for each of the reference index values.
  • The circuit 122 may be implemented as a compression circuit or module. The compression circuit 122 may be operational to compress the blocks within the signal TIN thereby generating motion vectors. Compression may be determined by a signal (e.g., PRED) received from the circuit 124. A signal (e.g., MV) may exchange motion vectors between the compression circuit 122 and the memory 128. During compression, the motion vectors may be written to the memory 128. During reconstruction of a reference block the motion vectors may be read from the memory 128.
  • The circuit 124 may be implemented as a code control circuit. The circuit 124 may generate the signal PRED conveying the prediction type used by the macroblocks. The code control circuit 124 may also generate a signal (e.g., CNT). The signal CNT may provide coding controls to the circuit 126.
  • The circuit 126 may be implemented as a coding circuit. In one embodiment, the coding circuit 126 may be an entropy coding circuit. The entropy coding circuit 126 may receive the blocks and the associated groups of motion vectors from the compression circuit 122 via a bitstream or signal (e.g., TBS). The entropy coding circuit 126 may be configured to encode the signal TBS to generate the signal TOUT for transmission and/or storage. In one embodiment, the signal TOUT may be implemented as a Network Abstraction. Layer defined by the H.264 standard.
  • The memory 128 may be implemented as an external memory. The memory 128 is generally operational to store the motion vectors for the blocks while the blocks are being encoded. The memory 128 may be configured to store other data used for encoding the bitstream data. Other types of memories may be implemented to meet the criteria of a particular application.
  • Referring to FIG. 4, a partial block diagram of an example implementation of a decoder apparatus 130 is shown. The decoder apparatus 130 may be implemented as a video bitstream decoder or system. The decoder apparatus 130 generally comprises a circuit 132, a circuit 134, a circuit 136 and a memory 138. The circuit 132 may receive an input bitstream or signal (e.g., RIN). The circuit 136 may generate an output bitstream or signal (e.g., ROUT).
  • The circuit 132 may be implemented as a decoder circuit. In one embodiment, the decoder circuit 132 may be implemented as an entropy decoder circuit 132. The entropy decoder circuit 132 may be operational to decode the bitstream signal TOUT generated by the entropy coding circuit 126 (e.g., TOUT=RIN). A decoded bitstream or signal (e.g., RBS) may be presented by the entropy decoder circuit 132 to the circuits 134 and 136.
  • The circuit 134 may be implemented as a prediction circuit. The prediction circuit 134 may be operational to determine if inter or intra prediction has been implemented for the various macroblocks of the pictures in the signal RBS. The prediction circuit 134 may generate a command signal (e.g., CMD) to the circuit 136 indicating the prediction type.
  • The circuit 136 may be implemented as a decompression circuit. The decompression circuit 136 may examine the compressed groups to determine how the motion vectors should be used. The decompression circuit 136 may store the motion vectors from decoded blocks that may be used for inferring motion vectors of co-located blocks the memory 128 via a signal (e.g., MV). The stored motion vectors may be read from the memory 138 to calculate the motion vectors for B-slice blocks coded under the direct mode (e.g., no associated motion vectors were transmitted in the signal TOUT). The direct mode generally refers to a macroblock or macroblock partition. The inferred motion vectors may then be used in generating the signal ROUT.
  • The memory 138 may be implemented as an external memory. The memory 138 is generally operational to store the motion vectors for the blocks for later use in calculating inferred motion vectors for the co-located blocks. The memory 138 may be configured to store other data used for decoding the bitstream data. Other types of memories may be implemented to meet the criteria of a particular application. The memory 138 may hold the List0 and the List1 for each of the reference index values.
  • The present invention may be implemented in decoders of the professional version (PExt) of the H.264 standard that use B-frames, and also for other future extensions of the H.264 standard.
  • While the invention has been particularly shown and described with reference to the preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made without departing from the spirit and scope of the invention.

Claims (20)

1. A method for determining a plurality of reference pictures of a current block, comprising the steps of:
(A) determining a first reference picture of a co-located block of said current block, wherein an identifier is associated with said first reference picture;
(B) mapping a reference index to a lowest valued reference index associated with said identifier, wherein said identifier is associated with said first reference picture from (i) when said first reference picture was used to inter-predict said co-located block to (ii) when said first reference picture is made available as a potential reference picture to inter-predict said current block; and
(C) inter-predicting said current block using (i) said first reference picture associated with said lowest valued reference index and (ii) a second reference picture comprising said co-located block.
2. The method according to claim 1, wherein said current block comprises an H.264 direct-mode macroblock or an H.264 direct-mode macroblock partition.
3. The method according to claim 1, wherein step (B) further comprises:
storing a respective identifier associated with each of said reference pictures.
4. The method according to claim 1, wherein said first reference picture resides in a first list of said current block and said second reference picture resides in a second list of said current block.
5. The method according to claim 1, wherein said inter-predicting operates on (i) a macroblock when in a first configuration and (ii) a macroblock partition when in a second configuration.
6. The method according to claim 1, wherein step (B) comprises:
searching for said lowest valued reference index identified by said identifier; and
returning a value of said lowest valued reference index.
7. The method according to claim 1, wherein said method further comprising the step of:
implementing an interpolative direct mode prediction and a flexible choice for a picture referenced by a finite index reference.
8. The method according to claim 1, wherein said method is implemented in a video encoder.
9. The method according to claim 1, wherein said method is implemented in a video decoder.
10. An apparatus comprising:
a circuit configured to (i) determine a first reference picture of a co-located block of a current block, wherein an identifier is associated with said first reference picture, (ii) map a reference index to a lowest valued reference index associated with said identifier, wherein said identifier is associated with said first reference picture from (a) when said first reference picture was used to inter-predict said co-located block to (b) when said first reference picture is made available as a potential reference picture to inter-predict said current block and (iii) inter-predict said current block using (a) said first reference picture associated with said lowest valued reference index and (b) a second reference picture comprising said co-located block.
11. The apparatus according to claim 10, wherein said current block comprises an H.264 direct-mode macroblock or an H.264 direct-mode macroblock partition.
12. The apparatus according to claim 10, further comprising a memory configured to store a respective identifier associated with each of said reference pictures.
13. The apparatus according to claim 10, wherein said first reference picture resides in a first list of said current block and said second reference picture resides in a second list of said current block.
14. The apparatus according to claim 10, wherein said inter-prediction operates on (i) a macroblock when in a first configuration and (ii) a macroblock partition when in a second configuration.
15. The apparatus according to claim 10, wherein said map (i) searches for said lowest valued reference index identified by said identifier and (ii) returns a value of said lowest valued reference index.
16. The apparatus according to claim 10, wherein said circuit is further configured to implement an interpolative direct mode prediction and a flexible choice for a picture referenced by a finite index reference.
17. The apparatus according to claim 10, wherein said apparatus is implemented in a video encoder.
18. The apparatus according to claim 10, wherein said apparatus is implemented in a video decoder.
19. The apparatus according to claim 10, further comprising a content addressable memory configured to store said reference pictures.
20. An apparatus comprising:
means for determining a first reference picture of a co-located block of a current block, wherein an identifier is associated with said first reference picture;
means for mapping a reference index to a lowest valued reference index associated with said identifier, wherein said identifier is associated with said first reference picture from (i) when said first reference picture was used to inter-predict said co-located block to (ii) when said first reference picture is made available as a potential reference picture to inter-predict said current block; and
means for inter-predicting said current block using (i) said first reference picture associated with said lowest valued reference index and (ii) a second reference picture comprising said co-located block.
US13/220,933 2004-02-24 2011-08-30 Method and apparatus for determining a second picture for temporal direct-mode block prediction Abandoned US20110310968A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/220,933 US20110310968A1 (en) 2004-02-24 2011-08-30 Method and apparatus for determining a second picture for temporal direct-mode block prediction

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/785,273 US8036271B2 (en) 2004-02-24 2004-02-24 Method and apparatus for determining a second picture for temporal direct-mode block prediction
US13/220,933 US20110310968A1 (en) 2004-02-24 2011-08-30 Method and apparatus for determining a second picture for temporal direct-mode block prediction

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/785,273 Continuation US8036271B2 (en) 2004-02-24 2004-02-24 Method and apparatus for determining a second picture for temporal direct-mode block prediction

Publications (1)

Publication Number Publication Date
US20110310968A1 true US20110310968A1 (en) 2011-12-22

Family

ID=34861593

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/785,273 Expired - Fee Related US8036271B2 (en) 2004-02-24 2004-02-24 Method and apparatus for determining a second picture for temporal direct-mode block prediction
US13/220,933 Abandoned US20110310968A1 (en) 2004-02-24 2011-08-30 Method and apparatus for determining a second picture for temporal direct-mode block prediction

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/785,273 Expired - Fee Related US8036271B2 (en) 2004-02-24 2004-02-24 Method and apparatus for determining a second picture for temporal direct-mode block prediction

Country Status (1)

Country Link
US (2) US8036271B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018090600A1 (en) * 2016-11-18 2018-05-24 华为技术有限公司 Method for establishing virtual reference frame, and device

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7003035B2 (en) 2002-01-25 2006-02-21 Microsoft Corporation Video coding methods and apparatuses
US20040001546A1 (en) 2002-06-03 2004-01-01 Alexandros Tourapis Spatiotemporal prediction for bidirectionally predictive (B) pictures and motion vector prediction for multi-picture reference motion compensation
US7154952B2 (en) 2002-07-19 2006-12-26 Microsoft Corporation Timestamp-independent motion vector prediction for predictive (P) and bidirectionally predictive (B) pictures
US7519116B2 (en) * 2004-07-27 2009-04-14 Broadcom Corporation Teritary content addressable memory based motion estimator
US8074248B2 (en) 2005-07-26 2011-12-06 Activevideo Networks, Inc. System and method for providing video content associated with a source image to a television in a communication network
JP4527677B2 (en) * 2006-03-15 2010-08-18 富士通株式会社 Moving picture coding method, moving picture coding apparatus, moving picture coding program
US20080101474A1 (en) * 2006-11-01 2008-05-01 Yi-Jen Chiu Optimizing the storage and reducing the computation of reference picture list processing in video decoding
KR100922275B1 (en) * 2006-12-15 2009-10-15 경희대학교 산학협력단 Derivation process of a boundary filtering strength and deblocking filtering method and apparatus using the derivation process
US9355681B2 (en) 2007-01-12 2016-05-31 Activevideo Networks, Inc. MPEG objects and systems and methods for using MPEG objects
US9826197B2 (en) 2007-01-12 2017-11-21 Activevideo Networks, Inc. Providing television broadcasts over a managed network and interactive content over an unmanaged network to a client device
KR20080107965A (en) * 2007-06-08 2008-12-11 삼성전자주식회사 Method and apparatus for encoding and decoding video using object boundary based partition
US8526499B2 (en) 2007-06-15 2013-09-03 Sungkyunkwan University Foundation For Corporate Collaboration Bi-prediction coding method and apparatus, bi-prediction decoding method and apparatus, and recording medium
WO2008153262A1 (en) 2007-06-15 2008-12-18 Sungkyunkwan University Foundation For Corporate Collaboration Bi-prediction coding method and apparatus, bi-prediction decoding method and apparatus, and recording midium
US8254455B2 (en) 2007-06-30 2012-08-28 Microsoft Corporation Computing collocated macroblock information for direct mode macroblocks
US8411754B2 (en) * 2008-11-10 2013-04-02 Activevideo Networks, Inc. System, method, and computer program product for translating an element of a static encoded image in the encoded domain
US8189666B2 (en) * 2009-02-02 2012-05-29 Microsoft Corporation Local picture identifier and computation of co-located information
EP2432226B1 (en) * 2009-05-11 2019-06-26 NTT DoCoMo, Inc. Moving image encoding device, method, and program, and moving image decoding device, method, and program
CN102668561A (en) * 2009-12-17 2012-09-12 瑞典爱立信有限公司 Method and arrangement for video coding
KR20130138263A (en) 2010-10-14 2013-12-18 액티브비디오 네트웍스, 인코포레이티드 Streaming digital video between video devices using a cable television system
KR102221331B1 (en) 2010-11-24 2021-03-03 벨로스 미디어 인터내셔널 리미티드 Motion vector calculation method, image coding method, image decoding method, motion vector calculation device and image coding/decoding device
CN106851306B (en) 2011-01-12 2020-08-04 太阳专利托管公司 Moving picture decoding method and moving picture decoding device
US9008176B2 (en) 2011-01-22 2015-04-14 Qualcomm Incorporated Combined reference picture list construction for video coding
WO2012114694A1 (en) 2011-02-22 2012-08-30 パナソニック株式会社 Moving image coding method, moving image coding device, moving image decoding method, and moving image decoding device
US10171813B2 (en) 2011-02-24 2019-01-01 Qualcomm Incorporated Hierarchy of motion prediction video blocks
JP5358746B2 (en) 2011-03-03 2013-12-04 パナソニック株式会社 Moving picture coding method, moving picture coding apparatus, and program
US8934552B2 (en) 2011-03-31 2015-01-13 Qualcomm Incorporated Combined reference picture list construction and mapping
WO2012138660A2 (en) 2011-04-07 2012-10-11 Activevideo Networks, Inc. Reduction of latency in video distribution networks using adaptive bit rates
MX2014000159A (en) * 2011-07-02 2014-02-19 Samsung Electronics Co Ltd Sas-based semiconductor storage device memory disk unit.
US10034018B2 (en) * 2011-09-23 2018-07-24 Velos Media, Llc Decoded picture buffer management
WO2013106390A1 (en) 2012-01-09 2013-07-18 Activevideo Networks, Inc. Rendering of an interactive lean-backward user interface on a television
US9800945B2 (en) 2012-04-03 2017-10-24 Activevideo Networks, Inc. Class-based intelligent multiplexing over unmanaged networks
US9123084B2 (en) 2012-04-12 2015-09-01 Activevideo Networks, Inc. Graphical application integration with MPEG objects
US9672584B2 (en) * 2012-09-06 2017-06-06 Imagination Technologies Limited Systems and methods of partial frame buffer updating
WO2014145921A1 (en) 2013-03-15 2014-09-18 Activevideo Networks, Inc. A multiple-mode system and method for providing user selectable video content
WO2014197879A1 (en) 2013-06-06 2014-12-11 Activevideo Networks, Inc. Overlay rendering of user interface onto source video
US9219922B2 (en) 2013-06-06 2015-12-22 Activevideo Networks, Inc. System and method for exploiting scene graph information in construction of an encoded video sequence
US9294785B2 (en) 2013-06-06 2016-03-22 Activevideo Networks, Inc. System and method for exploiting scene graph information in construction of an encoded video sequence
US10313682B2 (en) * 2013-08-26 2019-06-04 Qualcomm Incorporated Determining regions when performing intra block copying
WO2015054812A1 (en) * 2013-10-14 2015-04-23 Microsoft Technology Licensing, Llc Features of base color index map mode for video and image coding and decoding
EP3058736B1 (en) 2013-10-14 2019-02-27 Microsoft Technology Licensing, LLC Encoder-side options for intra block copy prediction mode for video and image coding
RU2654129C2 (en) 2013-10-14 2018-05-16 МАЙКРОСОФТ ТЕКНОЛОДЖИ ЛАЙСЕНСИНГ, ЭлЭлСи Features of intra block copy prediction mode for video and image coding and decoding
MX360926B (en) 2014-01-03 2018-11-22 Microsoft Technology Licensing Llc Block vector prediction in video and image coding/decoding.
US10390034B2 (en) 2014-01-03 2019-08-20 Microsoft Technology Licensing, Llc Innovations in block vector prediction and estimation of reconstructed sample values within an overlap area
US11284103B2 (en) 2014-01-17 2022-03-22 Microsoft Technology Licensing, Llc Intra block copy prediction with asymmetric partitions and encoder-side search patterns, search ranges and approaches to partitioning
US10542274B2 (en) 2014-02-21 2020-01-21 Microsoft Technology Licensing, Llc Dictionary encoding and decoding of screen content
US9788029B2 (en) 2014-04-25 2017-10-10 Activevideo Networks, Inc. Intelligent multiplexing using class-based, multi-dimensioned decision logic for managed networks
CN105493505B (en) 2014-06-19 2019-08-06 微软技术许可有限责任公司 Unified Intra Block Copy and Inter Prediction Modes
CA2959682C (en) 2014-09-30 2022-12-06 Microsoft Technology Licensing, Llc Rules for intra-picture prediction modes when wavefront parallel processing is enabled
WO2016197314A1 (en) 2015-06-09 2016-12-15 Microsoft Technology Licensing, Llc Robust encoding/decoding of escape-coded pixels in palette mode
US10986349B2 (en) 2017-12-29 2021-04-20 Microsoft Technology Licensing, Llc Constraints on locations of reference blocks for intra block copy prediction

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040066848A1 (en) * 2002-10-04 2004-04-08 Lg Electronics Inc. Direct mode motion vector calculation method for B picture
US20040136461A1 (en) * 2002-04-19 2004-07-15 Satoshi Kondo Motion vector calculating method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1976455B (en) * 2002-04-19 2011-12-07 松下电器产业株式会社 Image coding method and device
KR100491530B1 (en) * 2002-05-03 2005-05-27 엘지전자 주식회사 Method of determining motion vector
EP1427216A1 (en) * 2002-07-02 2004-06-09 Matsushita Electric Industrial Co., Ltd. Image encoding method and image decoding method
AU2003244072B2 (en) * 2002-07-11 2007-08-16 Godo Kaisha Ip Bridge 1 Filtering Strength Determination Method, Moving Picture Coding Method and Moving Picture Decoding Method
KR100941123B1 (en) * 2003-10-09 2010-02-10 톰슨 라이센싱 Direct Mode Derivation Process for Error Concealment

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040136461A1 (en) * 2002-04-19 2004-07-15 Satoshi Kondo Motion vector calculating method
US20040066848A1 (en) * 2002-10-04 2004-04-08 Lg Electronics Inc. Direct mode motion vector calculation method for B picture

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018090600A1 (en) * 2016-11-18 2018-05-24 华为技术有限公司 Method for establishing virtual reference frame, and device

Also Published As

Publication number Publication date
US20050185713A1 (en) 2005-08-25
US8036271B2 (en) 2011-10-11

Similar Documents

Publication Publication Date Title
US8036271B2 (en) Method and apparatus for determining a second picture for temporal direct-mode block prediction
US12301836B2 (en) Image decoding method and apparatus using the same
US10779002B2 (en) Limitation of the MVP derivation based on decoder-side motion vector derivation
US11265551B2 (en) Decoder-side motion vector derivation
US11647207B2 (en) Inter prediction method and apparatus
US10911769B2 (en) Motion-based priority for the construction of candidate lists in video coding
US11463707B2 (en) Techniques of multi-hypothesis motion compensation
EP2767090B1 (en) Tracking a reference picture based on a designated picture on an electronic device
US7664180B2 (en) Moving picture coding method and moving picture decoding method for performing inter picture prediction coding and inter picture predection decoding using previously processed pictures as reference pictures
US7298913B2 (en) Video encoding method and apparatus employing motion compensated prediction interframe encoding, and corresponding video decoding method and apparatus
CN103460698A (en) Combined reference picture list construction and mapping
JP2013514718A (en) Video coding method and apparatus
JP2023164845A (en) Video encoder, video decoder, and corresponding method
US11924440B2 (en) Techniques of multi-hypothesis motion compensation
US20250193437A1 (en) Encoder and decoder for coded pictures having regions with common motion models
CN119586128A (en) Method, device and medium for video processing
JP2003189313A (en) Inter prediction coding method and inter prediction decoding method
HK1238041B (en) Tracking a reference picture based on a designated picture on an electronic device

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION