WO2013069269A1 - Dispositif de codage vidéo, procédé de codage vidéo, programme de codage vidéo, dispositif de transmission, procédé de transmission, programme de transmission, dispositif de décodage vidéo, procédé de décodage vidéo, programme de décodage vidéo, dispositif de réception, procédé de réception et programme de réception - Google Patents
Dispositif de codage vidéo, procédé de codage vidéo, programme de codage vidéo, dispositif de transmission, procédé de transmission, programme de transmission, dispositif de décodage vidéo, procédé de décodage vidéo, programme de décodage vidéo, dispositif de réception, procédé de réception et programme de réception Download PDFInfo
- Publication number
- WO2013069269A1 WO2013069269A1 PCT/JP2012/007130 JP2012007130W WO2013069269A1 WO 2013069269 A1 WO2013069269 A1 WO 2013069269A1 JP 2012007130 W JP2012007130 W JP 2012007130W WO 2013069269 A1 WO2013069269 A1 WO 2013069269A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- prediction
- motion information
- candidate
- block
- candidate list
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/593—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention relates to a moving image encoding and decoding technique using motion compensated prediction, and more particularly to a moving image encoding and decoding technique for encoding and decoding motion information used in motion compensated prediction.
- motion compensation prediction is used.
- the motion compensated prediction is performed by dividing the target image into fine blocks, using the decoded image as a reference image, and moving from the processing target block of the target image to the reference block of the reference image based on the amount of motion indicated by the motion vector. This is a technique for generating a signal as a prediction signal.
- a motion vector of an encoded block adjacent to the processing target block is set as a prediction motion vector (also simply referred to as “prediction vector”), and a difference between the motion vector of the processing target block and the prediction vector is obtained.
- the compression efficiency is improved by transmitting the vector as an encoded vector.
- MPEG-4 AVC MPEG-4 AVC / H.
- MPEG-4 AVC moving picture compression coding such as H.264 (hereinafter referred to as MPEG-4 AVC)
- motion compensation prediction with high accuracy is made possible by making the block size for motion compensation prediction fine and diverse.
- the code amount of the encoded vector is increased by reducing the block size.
- Temporal direct motion compensated prediction is used to achieve motion compensated prediction without transmission.
- Patent Document 1 paying attention to the continuity of motion in the spatial direction, the encoded vector is transmitted using the motion vector of the processed block adjacent to the processing target block as the motion vector of the processing target block.
- a method for realizing motion compensated prediction without the need is disclosed.
- the present invention has been made in view of such circumstances, and an object of the present invention is to provide a moving image encoding and decoding technique capable of realizing motion compensation prediction with high efficiency without transmitting an encoding vector. is there.
- a moving picture coding apparatus divides a coded block into one or a plurality of prediction blocks based on a division type and performs motion compensation prediction to code a moving picture.
- a candidate derivation unit (140) that uses the motion information of the selected encoded prediction block as a selection candidate, the division type of the prediction block to be encoded, and the code of the prediction block to be encoded
- the motion information of a specific prediction block in a plurality of prediction blocks adjacent to the prediction block to be encoded is not set as the selection candidate based on the position in the encoding block.
- a candidate list generation unit (140) for generating a candidate list including the selection candidates, motion information in one prediction direction of selection candidates included in the candidate list, and included in the candidate list
- An additional unit (140) that generates a new selection candidate for bi-prediction by combining with motion information in the other prediction direction of the other selection candidates and adds the selection candidate to the candidate list, and the prediction block to be encoded from the candidate list
- a determination unit (141) for determining motion information used for the motion compensation prediction of the first and second encodings
- an encoding unit (104) for encoding information for specifying the determined motion information in the candidate list.
- This apparatus is a moving picture encoding apparatus that performs motion compensation prediction by dividing an encoded block into one or a plurality of prediction blocks based on a division type, and includes a division type of a prediction block to be encoded and the code Based on the position of the prediction block to be encoded in the encoded block, the first motion information group consisting of motion information of a plurality of encoded adjacent blocks adjacent to the prediction block to be encoded
- a recessive joint motion information candidate determination unit (166) that selects a recessive joint motion information candidate capable of motion compensation prediction with a prediction block having a size larger than the size of the prediction block to be encoded, and the first Combining combined motion information candidates for use in the prediction block to be encoded using the second motion information group obtained by removing the recessive combined motion information candidates from the motion information group
- Still another aspect of the present invention is a video encoding method.
- This method is a moving picture coding method for coding a moving picture by performing motion compensation prediction by dividing a coding block into one or a plurality of prediction blocks based on a division type, and a prediction block to be coded Is selected from the plurality of prediction blocks adjacent to the encoded prediction block for which the motion information including the motion vector information and the reference image information is valid, and the motion information of the selected encoded prediction block is selected.
- Still another aspect of the present invention is a transmission device.
- This apparatus packetizes an encoded stream encoded by a moving image encoding method that encodes a moving image by performing motion compensation prediction by dividing an encoded block into one or a plurality of prediction blocks based on a division type.
- a packet processing unit that obtains encoded data and a transmission unit that transmits the packetized encoded data.
- the moving image encoding method selects an encoded prediction block in which motion information including motion vector information and reference image information is valid from a plurality of prediction blocks adjacent to a prediction block to be encoded.
- Still another aspect of the present invention is a transmission method.
- an encoded stream encoded by a moving image encoding method that performs motion compensated prediction and encodes a moving image by dividing an encoded block into one or a plurality of prediction blocks based on a division type is packetized.
- the moving image encoding method selects an encoded prediction block in which motion information including motion vector information and reference image information is valid from a plurality of prediction blocks adjacent to a prediction block to be encoded.
- a moving picture decoding apparatus is a moving picture decoding apparatus that performs motion compensation prediction by dividing a decoded block into one or a plurality of prediction blocks based on a division type, and decodes a coded stream of the moving picture.
- a decoded prediction block in which motion information including motion vector information and reference image information is valid is selected from a plurality of prediction blocks adjacent to the prediction block to be decoded, and the selected decoded block Based on the candidate derivation unit (230) that selects the motion information of the prediction block as a selection candidate, the division type of the prediction block to be decoded, and the position of the prediction block to be decoded in the decoding block
- the invalidation unit (230) that does not select motion information of a specific prediction block in a plurality of prediction blocks adjacent to the prediction block as the selection candidate, and the selection
- An addition unit (230) that generates a new bi-predictive selection candidate in combination with the motion information and adds it to the candidate list; a code string analysis unit (201) that decodes information indicating a position in the
- This apparatus is a moving picture decoding apparatus that performs motion compensation prediction by dividing a decoded block into one or a plurality of prediction blocks based on a division type, and for specifying a combined motion information candidate in a combined motion information candidate list.
- a decoding unit (201) that decodes the candidate identification index from a code string that is encoded as a candidate identification index, a division type of a prediction block to be decoded, and a decoding block of the prediction block to be decoded in the decoding block Based on the position, a size larger than the size of the prediction block to be decoded is selected from the first motion information group including motion information of a plurality of decoded adjacent blocks adjacent to the prediction block to be decoded.
- Recessive combined motion information candidate determination that selects a recessive combined motion information candidate capable of motion compensation prediction with a prediction block (166) and the second motion information group obtained by removing the recessive combined motion information candidate from the first motion information group, and the combined motion information candidate combined motion to be used for the prediction block to be decoded
- a combined motion information selection unit (231) that selects one combined motion information candidate from the combined motion information candidate list based on a specific index and uses the selected motion information as motion information of the prediction block to be decoded;
- Still another aspect of the present invention is a moving picture decoding method.
- This method is a moving picture decoding method in which a decoded block is divided into one or a plurality of prediction blocks based on a division type, motion compensated prediction is performed, and a coded stream of a moving picture is decoded.
- Still another aspect of the present invention is a receiving device.
- This apparatus is a receiving apparatus that performs decoding processing by performing motion compensation prediction by dividing a decoded block into one or a plurality of prediction blocks in an encoded stream of a received moving image, and the encoded block based on a division type
- a receiving unit that receives encoded data in which an encoded stream of a moving image that is encoded by performing motion compensation prediction by dividing the image into one or a plurality of prediction blocks, and the received encoded data
- Motion information including motion vector information and reference image information is valid from a restoration unit that performs packet processing to restore the original encoded stream and a plurality of prediction blocks adjacent to the prediction block to be decoded.
- a candidate derivation unit (230) that selects motion information of the selected decoded prediction block as a selection candidate, and a prediction target to be decoded. Based on the block division type and the position of the prediction block to be decoded in the decoding block, the motion information of a specific prediction block in a plurality of prediction blocks adjacent to the prediction block to be decoded is selected as the selection candidate.
- Still another aspect of the present invention is a receiving method.
- This method is a receiving method of performing decoding processing by performing motion compensation prediction by dividing a decoded block into one or a plurality of prediction blocks in an encoded stream of a received moving image, and the encoded block based on the division type
- Motion information including motion vector information and reference image information from a plurality of prediction blocks adjacent to the prediction block to be decoded and a decoding step that recovers the original encoded stream by packet processing has been decoded.
- a candidate derivation step in which motion information of the selected decoded prediction block is selected as a selection candidate, and the decoding The motion information of a specific prediction block in a plurality of prediction blocks adjacent to the prediction block to be decoded is based on the division type of the prediction block of the elephant and the position of the prediction block to be decoded in the decoding block.
- An invalidation step that is not a selection candidate a candidate list generation step that generates a candidate list that includes the selection candidate, motion information in one prediction direction of the selection candidate that is included in the candidate list, and another that is included in the candidate list
- a decoding step for decoding the decoding target from the candidate list based on the information indicating the decoded position and the code string analyzing step for decoding from the encoded stream And a selection step of selecting the motion information used to lock the motion compensated prediction.
- motion compensation prediction can be realized with high efficiency without transmitting an encoded vector.
- FIGS. 1A and 1B are diagrams for explaining an encoded block.
- 2A to 2D are diagrams for explaining the prediction block size type. It is a figure explaining a prediction block size type. It is a figure explaining prediction coding mode. It is a figure explaining the relationship between a merge index and a code sequence. It is a figure explaining an example of the syntax of a prediction block.
- 1 is a diagram illustrating a configuration of a moving image encoding device according to Embodiment 1.
- FIG. It is a figure which shows the structure of the motion information generation part of FIG. It is a figure explaining the structure of the merge mode determination part of FIG.
- FIG. 10 is a flowchart illustrating an operation of a combined motion information candidate list generation unit in FIG. 9.
- FIGS. 14A to 14D are diagrams for explaining recessive combined motion information candidates. It is a flowchart explaining the determination process of a recessive joint motion information candidate.
- FIG. 10 is a diagram illustrating an example of an effect according to the first embodiment. It is a figure explaining another example of the effect by Embodiment 1.
- FIG. 10 is a flowchart explaining the determination process of a recessive joint motion information candidate.
- FIG. 10 is a diagram illustrating a configuration of a combined motion information candidate list generation unit according to the second embodiment.
- FIG. 10 is a diagram for explaining an operation of a combined motion information candidate list generation unit according to the second embodiment. It is a figure explaining operation
- FIG. It is a figure explaining a duplication joint motion information candidate.
- MPEG-4AVC International Telecommunication Union Telecommunication Standardization Sector
- a plurality of adjacent blocks and another decoded image block are used as a candidate block group, and one candidate block is selected from these candidate block groups, and information on the selected candidate block is encoded and decoded.
- a merge mode is being considered.
- the input image signal is divided into units of maximum encoding blocks, and the divided maximum encoding blocks are processed in a raster scan order.
- the encoded block has a hierarchical structure, and can be made smaller encoded blocks by sequentially equally dividing into 4 in consideration of the encoding efficiency. Note that the encoded blocks divided into four are encoded in the zigzag scan order. An encoded block that cannot be further reduced is called a minimum encoded block.
- An encoded block is a unit of encoding, and the maximum encoded block is also an encoded block when the number of divisions is zero. In this embodiment, the maximum coding block is 64 pixels ⁇ 64 pixels, and the minimum coding block is 8 pixels ⁇ 8 pixels.
- 1 (a) and 1 (b) are diagrams for explaining an encoded block.
- the encoded block is divided into ten.
- CU0, CU1 and CU9 are 32 ⁇ 32 pixel coding blocks
- CU2, CU3 and CU8 are 16 ⁇ 16 pixel coding blocks
- CU4, CU5, CU6 and CU7 are 8 ⁇ 8 pixel coding blocks. It has become.
- the encoded block is divided into one.
- the encoded block is further divided into prediction blocks.
- the encoded block is divided into one or more prediction blocks according to a prediction block size type (also referred to as “division type”).
- FIGS. 2A to 2D are diagrams for explaining the prediction block size type.
- 2 (a) is 2N ⁇ 2N that does not divide the encoded block
- FIG. 2 (b) is 2N ⁇ N that is horizontally divided
- FIG. 2 (c) is N ⁇ 2N that is vertically divided
- FIG. d) shows N ⁇ N which is divided into 4 parts horizontally and vertically.
- 2N ⁇ 2N is one prediction block
- 2N ⁇ N and N ⁇ 2N are two prediction blocks 0 and 1
- N ⁇ N is four prediction blocks 0, prediction block 1, prediction block 2, and prediction It consists of block 3.
- FIG. 3 is a diagram for explaining the prediction block size according to the number of divisions of the encoded block and the prediction block size type.
- the prediction block size in the present embodiment is from 64 pixels ⁇ 64 pixels in which the number of CU divisions is 0 and the prediction block size type is 2N ⁇ 2N to 3 in the number of CU divisions, and the prediction block size type is N ⁇ N. There are 13 predicted block sizes up to 4 pixels x 4 pixels.
- the maximum encoding block is 64 pixels ⁇ 64 pixels and the minimum encoding block is 8 pixels ⁇ 8 pixels, but the present invention is not limited to this combination.
- the prediction block division patterns are shown in FIGS. 2A to 2D, the combination is not limited to this as long as it is a combination that is divided into one or more.
- FIG. 4 is a diagram for explaining the predictive coding mode.
- the prediction direction of motion compensation prediction is single prediction (L0 prediction) and the number of coding vectors is PredL0
- the prediction direction of motion compensation prediction is single prediction (L1 prediction).
- PredL1 in which the number of encoding vectors is 1
- PredBI in which the prediction direction of motion compensation prediction is bi-prediction (BI prediction) and the number of encoding vectors is 2
- the prediction direction in motion compensation prediction is single prediction
- merge mode MERGE
- BI prediction bi-prediction
- BI prediction bi-prediction
- BI prediction bi-prediction
- the prediction direction in motion compensation prediction is single prediction
- MERGE merge mode
- BI prediction bi-prediction
- Intra intra mode
- PredL0, PredL1, and PredBI are prediction vector modes.
- the prediction direction is any of L0 prediction / L1 prediction / BI prediction. This is because the prediction direction of the merge mode takes over the prediction direction of the candidate block selected from the candidate block group as it is, or is decoded information. This is because it is derived from In the merge mode, the encoded vector is not encoded. This is because the merge mode encoding vector inherits the motion vector of the candidate block selected from the candidate block group as it is or is derived from the decoded information.
- the reference image used in the motion compensation prediction is encoded as a reference image index together with the encoded vector.
- the reference image index used in motion compensation prediction is a numerical value of 0 or more. If the motion compensation prediction is uni-prediction, one reference index is used, and if the motion compensation prediction is bi-prediction, two reference indexes are used (FIG. 4).
- Reference index is not encoded in merge mode. This is because the reference index of the merge mode inherits the reference index of the candidate block selected from the candidate block group as it is or is derived from the decoded information.
- the reference index list includes a reference index list L0 and a reference index list L1.
- L0 prediction using a reference image in the reference index list L0 or L1 prediction using a reference image in the reference index list L1 is used.
- BI prediction using the reference index list L0 and the reference index list L1 is used.
- the maximum number of reference images that can be registered in each reference index list is 16.
- Merge index In the present embodiment, in the merge mode, a plurality of adjacent blocks in the processing target image and blocks around the same position as the processing target block in another encoded image are used as candidate block groups. A candidate block having the optimal predictive coding mode, motion vector, and reference index is selected from the block group, and a merge index for indicating the selected candidate block is encoded and decoded. Only one merge index is used in the merge mode (FIG. 4).
- the maximum number of merge indexes (also referred to as the maximum number of merge candidates) is 5, and the merge index is an integer from 0 to 4.
- the maximum number of merge indexes is set to 5, but it may be 2 or more, and is not limited to this.
- the motion information of the candidate block that is the target of the merge index is referred to as a combined motion information candidate, and the aggregate of combined motion information candidates is referred to as a combined motion information candidate list.
- the motion information includes a prediction direction, a motion vector, and a reference index.
- FIG. 5 is a diagram for explaining the relationship between the merge index and the code string.
- the code string when the merge index is 0 is '0'
- the code string when the merge index is 1 is '10'
- the code string when the merge index is 2 is '110'
- the code string when the merge index is 3 The code string when the column is “1110” and the merge index is 4 is “11110”, and the code string is set longer as the merge index becomes larger. Therefore, it is possible to improve the encoding efficiency by assigning a small merge index to a candidate block with a high selection rate.
- Predicted vector index In the present embodiment, in order to improve the accuracy of the prediction vector, a plurality of adjacent blocks and blocks around the same position as the processing target block of another encoded image are used as candidate block groups, and the candidate block group is used. A candidate block having an optimal motion vector is selected as a prediction vector, and a prediction vector index for indicating the selected candidate block is encoded and decoded. If the motion compensation prediction is uni-prediction, one prediction vector index is used, and if the motion compensation prediction is bi-prediction, two prediction vector indexes are used (FIG. 4). The maximum number of prediction vector indexes (also referred to as the maximum number of prediction vector candidates) is 2, and the prediction vector index is an integer of 0 or 1. Although the maximum number of prediction vector candidates is 2 here, it may be 2 or more, and is not limited to this.
- a motion vector of a candidate block that is a target of a prediction vector index is referred to as a prediction vector candidate
- a set of prediction vector candidates is referred to as a prediction vector candidate list.
- the prediction block (PU in FIG. 6) includes a merge flag (merge_flag), a merge index (merge_idx), an inter prediction type (inter_pred_type), a reference index for L0 prediction (ref_idx_10), a difference vector (mvd_10 [0]) for L0 prediction, mvd_l0 [1]), prediction vector index for L0 prediction (mvp_idx_l0), reference index for L1 prediction (ref_idx_l1), difference vector for L1 prediction (mvd_l1 [0], mvd_l1 [1]), and prediction vector index for L1 prediction ( mvp_idx_l1) is installed. [0] of the difference vector indicates a horizontal component and [1] indicates a vertical component.
- inter_pred_type indicates a prediction direction (also referred to as inter prediction type) of motion compensation prediction, and three types of Pred_L0 (uni prediction of L0 prediction), Pred_L1 (uni prediction of L1 prediction) and Pred_BI (bi prediction of BI prediction). There is.
- inter_pred_type is Pred_L0 or Pred_BI, information about L0 prediction is installed, and when inter_pred_type is Pred_L1 or Pred_BI, information about L1 prediction is installed.
- FIG. 7 shows the configuration of moving picture coding apparatus 100 according to the first embodiment.
- the moving image encoding apparatus 100 is an apparatus that encodes a moving image signal in units of prediction blocks for performing motion compensated prediction.
- the coding block division, the prediction block size type determination, the prediction block size and the position of the prediction block in the coding block (position information of the prediction block), and the determination whether the prediction coding mode is intra are not shown. It is assumed that it is determined by the higher-order coding control unit, and in the first embodiment, a case where the predictive coding mode is not intra will be described. In the first embodiment, a B picture corresponding to bi-prediction will be described, but L1 prediction may be omitted for a P picture not corresponding to bi-prediction.
- the moving picture encoding apparatus 100 is realized by hardware such as an information processing apparatus including a CPU (Central Processing Unit), a frame memory, a hard disk, and the like.
- the moving image encoding apparatus 100 realizes functional components described below by operating the above components. Note that the position information of the prediction block to be processed, the prediction block size, and the prediction direction of motion compensated prediction are assumed to be shared in the video encoding device 100 and are not shown.
- the moving image encoding apparatus 100 includes a prediction block image acquisition unit 101, a subtraction unit 102, a prediction error encoding unit 103, a code string generation unit 104, a prediction error decoding unit 105, a motion compensation unit 106, and an addition unit. 107, a motion vector detection unit 108, a motion information generation unit 109, a frame memory 110, and a motion information memory 111.
- the prediction block image acquisition unit 101 acquires the image signal of the prediction block to be processed from the image signal supplied from the terminal 10 based on the position information and the prediction block size of the prediction block, and subtracts the image signal of the prediction block 102, and supplied to the motion vector detection unit 108 and the motion information generation unit 109.
- the motion vector detection unit 108 obtains motion vectors and reference images of the L0 prediction and the L1 prediction from the image signal supplied from the prediction block image acquisition unit 101 and image signals corresponding to a plurality of reference images stored therein. Detect the reference index indicated.
- the motion vector of the L0 prediction and the L1 prediction and the reference index of the L0 prediction and the L1 prediction are supplied to the motion information generation unit 109.
- the motion vector detection unit 108 uses image signals corresponding to a plurality of reference images stored therein as reference images, but a reference image stored in the frame memory 110 can also be used.
- a general motion vector detection method calculates an error evaluation value for an image signal of a target image and a prediction signal of a reference image moved by a predetermined movement amount from the same position, and a movement amount that minimizes the error evaluation value.
- a motion vector is detected for each reference image, and a reference image having a minimum error evaluation value is selected.
- SAD Sud of Absolute Difference
- MSE Mel Square Error
- the motion information generation unit 109 indicates L0 prediction and L1 prediction motion vectors supplied from the motion vector detection unit 108, reference indexes for L0 prediction and L1 prediction, candidate block groups supplied from the motion information memory 111, and reference indexes.
- the prediction encoding mode is determined from the reference image in the frame memory 110 and the image signal supplied from the prediction block image acquisition unit 101.
- the merge flag, the merge index, the prediction direction of motion compensation prediction, the reference index of L0 prediction and L1 prediction, the difference vector of L0 prediction and L1 prediction, and the prediction vector of L0 prediction and L1 prediction is supplied to the code string generation unit 104 as necessary.
- the motion compensation prediction direction, the reference index of L0 prediction and L1 prediction, and the motion vector of L0 prediction and L1 prediction are supplied to the motion compensation unit 106 and the motion information memory 111. Details of the motion information generation unit 109 will be described later.
- the motion compensation unit 106 stores the frame compensation information in the frame memory 110 indicated by the LN prediction reference index supplied from the motion information generation unit 109.
- the reference image is motion-compensated based on the LN prediction motion vector supplied from the motion information generation unit 109 to generate a prediction signal for LN prediction.
- N is 0 or 1.
- the prediction direction of motion compensation prediction is bi-prediction, the average value of the prediction signals of L0 prediction and L1 prediction is the prediction signal. Note that the prediction signals of the L0 prediction and the L1 prediction may be weighted.
- the motion compensation unit 106 supplies the prediction signal to the subtraction unit 102.
- the subtraction unit 102 subtracts the image signal supplied from the prediction block image acquisition unit 101 and the prediction signal supplied from the motion compensation unit 106 to calculate a prediction error signal, and calculates the prediction error signal to the prediction error encoding unit 103. To supply.
- the prediction error encoding unit 103 performs processing such as orthogonal transformation and quantization on the prediction error signal supplied from the subtraction unit 102 to generate prediction error encoded data, and encodes the prediction error encoded data.
- the data is supplied to the column generation unit 104 and the prediction error decoding unit 105.
- the code string generation unit 104 includes prediction error encoded data supplied from the prediction error encoding unit 103, a merge flag, a merge index, and a motion compensation prediction prediction direction (inter prediction type) supplied from the motion information generation unit 109.
- the reference index of L0 prediction and L1 prediction, the difference vector of L0 prediction and L1 prediction, and the prediction vector index of L0 prediction and L1 prediction are entropy-coded according to the syntax sequence shown in FIG. Supply the column to terminal 11.
- Entropy coding is performed by a method including variable length coding such as arithmetic coding or Huffman coding.
- the prediction error decoding unit 105 performs a process such as inverse quantization or inverse orthogonal transform on the prediction error encoded data supplied from the prediction error encoding unit 103 to generate a prediction error signal, and the prediction error signal Is supplied to the adder 107.
- the addition unit 107 adds the prediction error signal supplied from the prediction error decoding unit 105 and the prediction signal supplied from the motion compensation unit 106 to generate a decoded image signal, and supplies the decoded image signal to the frame memory 110. To do.
- the frame memory 110 stores the decoded image signal supplied from the adding unit 107. In addition, for a decoded image in which decoding of the entire image is completed, a predetermined number of images of 1 or more is stored as a reference image.
- the frame memory 110 supplies the stored reference image signal to the motion compensation unit 106 and the motion information generation unit 109.
- the storage area for storing the reference image is a FIFO (First In First Out) method.
- the motion information memory 111 stores the motion information supplied from the motion information generation unit 109 for a predetermined number of images in units of the minimum predicted block size.
- the motion information of the adjacent block of the prediction block to be processed is set as a space candidate block group.
- the motion information memory 111 uses the motion information of the block on the ColPic and the surrounding blocks at the same position as the prediction block to be processed as a time candidate block group.
- the motion information memory 111 supplies the spatial candidate block group and the temporal candidate block group to the motion information generation unit 109 as candidate block groups.
- the motion information memory 111 is synchronized with the frame memory 110 and is controlled by a FIFO (First In First Out) method.
- ColPic is a decoded image different from the prediction block to be processed, and is stored in the frame memory 110 as a reference image.
- ColPic is a reference image decoded immediately before the processing target image.
- ColPic is a reference image decoded immediately before the processing target image.
- it may be a decoded image, for example, the reference image immediately before in the display order or the reference image immediately after in the display order. Alternatively, it can be specified in the encoded stream.
- the motion information is stored in each memory area in units of the smallest prediction block.
- Each memory area stores at least a prediction direction, a motion vector for L0 prediction, a reference index for L0 prediction, a motion vector for L1 prediction, and a reference index for L1 prediction.
- the predictive coding mode is the intra mode
- (0, 0) is stored as a motion vector for L0 prediction and L1 prediction
- “ ⁇ 1” is stored as a reference index for L0 prediction and L prediction.
- H represents a horizontal component
- V represents a vertical component.
- the reference index “ ⁇ 1” may be any value as long as it can be determined that the mode does not perform motion compensation prediction. From this point onward, unless expressed otherwise, the term “block” refers to the smallest predicted block unit when expressed simply as a block.
- LX direction (X is 0 or 1) is valid when the reference index in the LX direction is 0 or more.
- the LX direction is invalid (not valid).
- the reference index in the LX direction is “ ⁇ 1”. It is to be.
- “outside area” generally indicates an area outside a picture to which a prediction block to be processed belongs or an area outside a slice.
- FIG. 8 shows a configuration of the motion information generation unit 109.
- the motion information generation unit 109 includes a prediction vector mode determination unit 120, a merge mode determination unit 121, and a prediction encoding mode determination unit 122.
- the terminal 12 is in the motion information memory 111
- the terminal 13 is in the motion vector detection unit 108
- the terminal 14 is in the frame memory 110
- the terminal 15 is in the prediction block image acquisition unit 101
- the terminal 16 is in the code string generation unit 104
- the terminal 50 Are connected to the motion compensation unit 106
- the terminal 51 is connected to the motion information memory 111, respectively.
- the prediction vector mode determination unit 120 includes a candidate block group supplied from the terminal 12, a motion vector for L0 prediction and L1 prediction supplied from the terminal 13, a reference index for L0 prediction and L1 prediction, and a reference index supplied from the terminal 14.
- the inter-prediction type is determined from the reference image shown in FIG. 5 and the image signal supplied from the terminal 15, and the prediction vector index of the L0 prediction and the L1 prediction is selected according to the inter prediction type, and the difference vector between the L0 prediction and the L1 prediction , A prediction error is calculated, and a rate distortion evaluation value is calculated. Then, motion information, a difference vector, a prediction vector index, and a rate distortion evaluation value based on the inter prediction type are supplied to the prediction coding mode determination unit 122.
- the merge mode determination unit 121 generates a combined motion information candidate list from the candidate block group supplied from the terminal 12, the reference image supplied from the terminal 14, and the image signal supplied from the terminal 15, and the combined motion information
- One merged motion information candidate is selected from the candidate list, a merge index is determined, and a rate distortion evaluation value is calculated. Then, the motion information of the combined motion information candidate, the merge index, and the rate distortion evaluation value are supplied to the predictive coding mode determination unit 122. Details of the merge mode determination unit 121 will be described later.
- the prediction coding mode determination unit 122 compares the rate distortion evaluation value supplied from the prediction vector mode determination unit 120 and the rate distortion evaluation value supplied from the merge mode determination unit 121 to determine a merge flag.
- the merge flag is set to “0”.
- the prediction encoding mode determination unit 122 supplies the merge flag, the inter prediction type, the reference index, the difference vector, and the prediction vector index supplied from the prediction vector mode determination unit 120 to the terminal 16, and the prediction vector mode determination unit 120 The supplied motion information is supplied to the terminal 50 and the terminal 51.
- the merge flag is set to “1”.
- the predictive coding mode determination unit 122 supplies the merge flag and the merge index supplied from the merge mode determination unit 121 to the terminal 16, and supplies the motion information supplied from the merge mode determination unit 121 to the terminal 50 and the terminal 51. To do.
- a specific calculation method of the rate distortion evaluation value is not the main point of the present invention, detailed description is omitted, but the prediction error amount per code amount is calculated from the prediction error and the code amount, and the rate distortion evaluation value is calculated.
- the evaluation value has a characteristic that the encoding efficiency increases as the value decreases. Therefore, encoding efficiency can be improved by selecting a predictive encoding mode with a small rate distortion evaluation value.
- FIG. 9 is a diagram for explaining the configuration of the merge mode determination unit 121.
- the merge mode determination unit 121 includes a combined motion information candidate list generation unit 140 and a combined motion information selection unit 141.
- the combined motion information candidate list generation unit 140 is also installed in the video decoding device 200 that decodes the code sequence generated by the video encoding device 100 according to Embodiment 1 in the same manner.
- the moving image decoding apparatus 200 generates the same combined motion information list.
- the combined motion information candidate list generation unit 140 generates a combined motion information candidate list including the maximum number of merge candidate motion candidates from the candidate block group supplied from the terminal 12, and uses the combined motion information candidate list as the combined motion information. This is supplied to the selection unit 141. A detailed configuration of the combined motion information candidate list generation unit 140 will be described later.
- the combined motion information selection unit 141 selects the optimum combined motion information candidate from the combined motion information candidate list supplied from the combined motion information candidate list generation unit 140, and is information indicating the selected combined motion information candidate. A certain merge index is determined and the merge index is supplied to the terminal 17.
- a prediction error amount is calculated from the reference image supplied from the terminal 14 obtained by motion compensation prediction based on the prediction direction, motion vector, and reference index of the combined motion information candidate, and the image signal supplied from the terminal 15. .
- a rate distortion evaluation value is calculated from the code amount of the merge index and the prediction error amount, and a combined motion information candidate that minimizes the rate distortion evaluation value is selected as an optimal combined motion information candidate.
- the candidate block group includes a spatial candidate block group and a temporal candidate block group.
- FIG. 10 shows adjacent blocks of the prediction block to be processed when the prediction block size to be processed is 16 pixels ⁇ 16 pixels.
- the space candidate block group is assumed to be five blocks of block A1, block C, block D, block B1, and block E shown in FIG.
- the spatial candidate block group is five blocks of block A1, block C, block D, block B1, and block E, but the spatial candidate block group is at least one or more processed adjacent to the prediction block to be processed. Any block may be used, and the present invention is not limited to these. For example, all of block A1, block A2, block A3, block A4, block B1, block B2, block B3, block B4, block C, block D, and block E may be spatial candidate blocks.
- the time candidate block group includes two blocks, block H and block I6 shown in FIG.
- the time candidate block group is two blocks of block H and block I6 on ColPic, but the time candidate block group is at least one block on a decoded image different from the prediction block to be processed.
- the block H may be used.
- the block A4 is expressed as a block A
- the block B4 is expressed as a block B
- the block I6 is expressed as a block I
- the blocks H and I6 are expressed as a time block.
- FIG. 12 is a diagram for explaining the configuration of the combined motion information candidate list generation unit 140.
- the terminal 19 is connected to the combined motion information selection unit 141.
- the combined motion information candidate list generating unit 140 includes a spatially combined motion information candidate generating unit 160, a temporally combined motion information candidate generating unit 161, a redundant combined motion information candidate deleting unit 162, a recessive combined motion information candidate adding unit 163, and a first combined motion.
- An information candidate supplementing unit 164 and a second combined motion information candidate supplementing unit 165 are included.
- the spatially coupled motion information candidate generation unit 160 includes a recessive coupled motion information candidate determination unit 166.
- FIG. 13 is a flowchart for explaining the operation of the combined motion information candidate list generation unit 140.
- the combined motion information candidate list generation unit 140 initializes a combined motion information candidate list (S100). There are no combined motion information candidates in the initialized combined motion information candidate list.
- the recessive combined motion information candidate determination unit 166 selects a recessive combined motion information candidate and stores the recessive combined motion information candidate (S101).
- the stored recessive combined motion information candidates are shared in the combined motion information candidate list generation unit 140.
- the detailed operation of the recessive combined motion information candidate determination unit 166 will be described later.
- the spatially coupled motion information candidate generation unit 160 generates the maximum number of spatially coupled motion information candidates from 0 candidate block groups supplied from the terminal 12 and adds them to the combined motion information candidate list.
- the combined motion information candidate list and the candidate block group are supplied to the temporal combined motion information candidate generation unit 161.
- the detailed operation of the spatially coupled motion information candidate generation unit 160 will be described later.
- the maximum number of spatially coupled motion information candidates will be described later.
- the time combination motion information candidate generation unit 161 generates zero or time combination motion information candidate maximum number of time combination motion information candidates from the candidate block group supplied from the space combination motion information candidate generation unit 160 to generate a space. This is added to the combined motion information candidate list supplied from the combined motion information candidate generating unit 160 (S103), and the combined motion information candidate list is supplied to the recessive combined motion information candidate adding unit 163.
- the detailed operation of the time combination motion information candidate generation unit 161 will be described later. Further, the maximum number of temporally combined motion information candidates will be described later.
- the redundant combined motion information candidate deletion unit 162 examines the combined motion information candidates registered in the combined motion information candidate list supplied from the temporal combined motion information candidate generation unit 161, and combines the same motion information.
- one combined motion information candidate is left and other combined motion information candidates are deleted (S104), and the combined motion information candidate list is supplied to the recessive combined motion information candidate adding unit 163. . Accordingly, all of the combined motion information candidates registered in the combined motion information candidate list become different combined motion information candidates.
- the recessive combined motion information candidate adding unit 163 adds the recessive combined motion information candidate to the tail of the combined motion information candidate list supplied from the redundant combined motion information candidate deleting unit 162. (S105), and the combined motion information candidate list is supplied to the first combined motion information candidate supplementing unit 164. If the recessive combined motion information candidate is not valid, the combined motion information candidate list supplied from the redundant combined motion information candidate deleting unit 162 is supplied to the first combined motion information candidate supplementing unit 164. The detailed operation of the recessive combined motion information candidate adding unit 163 will be described later.
- the first combined motion information candidate supplementing unit 164 includes 0 to 2 first supplemental combining from the combined motion information candidates registered in the combined motion information candidate list supplied from the recessive combined motion information candidate adding unit 163.
- a motion information candidate is generated and added to the combined motion information candidate list (S106), and the combined motion information candidate list is supplied to the second combined motion information candidate supplementing unit 165.
- the detailed operation of the first combined motion information candidate supplement unit 164 will be described later.
- the second combined motion information candidate supplementing unit 165 generates 0 to 2 second supplemental combined motion information candidates that do not depend on the combined motion information candidate list supplied from the first combined motion information candidate supplementing unit 164. Then, it is added to the combined motion information candidate list supplied from the first combined motion information candidate supplementing unit 164 (S107), and the combined motion information candidate list is supplied to the terminal 19.
- S107 the combined motion information candidate supplementing unit 165
- FIGS. 14A to 14D are diagrams for explaining a recessive combined motion information candidate.
- an encoded block can be divided into 1 or 2 or 4 prediction blocks.
- FIG. 14 shows an example in which the encoded block is 16 ⁇ 16.
- the prediction error related to the coding block can be minimized, while the syntax overhead related to the prediction block shown in FIG. 6 increases by the number of prediction blocks. Therefore, a plurality of prediction blocks having the same motion information and the same prediction error can be combined into one prediction block, and the coding efficiency can be improved by suppressing the number of divisions of the prediction block.
- FIG. 14A is a diagram for explaining Example 1 in which candidate block A is a recessive combined motion information candidate.
- the encoded block is the merge mode, and the position of the spatially coupled motion information candidate of the prediction block 1 when the prediction block size type is encoded as N ⁇ 2N is shown.
- the prediction block 0 and the prediction block 1 are collectively encoded as 2N ⁇ 2N, thereby generating syntax overhead related to the prediction block. And the coding efficiency can be improved.
- the candidate block A corresponding to the candidate block when encoded as 2N ⁇ 2N is selected as a combined motion information candidate.
- the candidate block A is set as a recessive combined motion information candidate.
- FIG. 14B is a diagram for explaining Example 2 in which candidate block A is a recessive combined motion information candidate.
- the encoded block is the merge mode, and the position of the spatially coupled motion information candidate of the prediction block 3 when the prediction block size type is encoded as N ⁇ N is shown. If the motion information of the prediction block 0 and the motion information of the prediction block 1 are the same, and the motion information of the prediction block 2 and the motion information of the prediction block 3 are the same, the prediction block 0 and the prediction block 1 are combined.
- the prediction block 2 and the prediction block 3 together as 2N ⁇ N, it is possible to reduce the syntax overhead related to the prediction block and improve the encoding efficiency. Therefore, when the prediction block size type is N ⁇ N and the motion information of the prediction block 0 and the motion information of the prediction block 1 are the same and the prediction block 3, the candidate block A is set as a recessive combined motion information candidate.
- FIG. 14C is a diagram for explaining Example 1 in which candidate block B is a recessive combined motion information candidate.
- the coding block is in the merge mode, and the prediction block size type is 2N ⁇ N.
- the prediction block 0 and the prediction block 1 are collectively encoded as 2N ⁇ 2N, thereby generating syntax overhead related to the prediction block.
- the coding efficiency can be improved. Therefore, when the prediction block size type is 2N ⁇ N and the processing target block is the prediction block 1, the candidate block B is set as a recessive combined motion information candidate.
- FIG. 14D is a diagram for explaining an example 2 in which the candidate block B is a recessive combined motion information candidate.
- the encoded block is the merge mode, and the position of the spatially coupled motion information candidate of the prediction block 3 when the prediction block size type is encoded as N ⁇ N is shown.
- the prediction block 0 and the prediction block 2 are the same, and the motion information of the prediction block 1 and the motion information of the prediction block 3 are the same, the prediction block 0 and the prediction block 2 are combined.
- N ⁇ 2N By encoding the prediction block 1 and the prediction block 3 together as N ⁇ 2N, it is possible to reduce the syntax overhead related to the prediction block and improve the encoding efficiency. Therefore, when the prediction block size type is N ⁇ N and the motion information of the prediction block 0 and the motion information of the prediction block 2 are the same and the prediction block 3, the candidate block B is set as a recessive combined motion information candidate.
- a combined motion information candidate that is more appropriate to be predicted in terms of coding efficiency is selected as a recessive combined motion information candidate.
- the recessive combined motion information candidate is a combined motion information candidate that is improved in coding efficiency by being combined with a larger predicted block size.
- the recessive combined motion information is motion information of a recessive combined motion information candidate.
- FIG. 15 is a flowchart for explaining an example of the determination process of the recessive combined motion information candidate determination unit 166.
- the predicted block size type is 2N ⁇ 2N (S150). If the predicted block size type is 2N ⁇ 2N (Y in S150), the process ends. If the predicted block size type is not 2N ⁇ 2N (N in S150), the predicted block size type is checked (S151). If the predicted block size type is N ⁇ 2N (N ⁇ 2N in S151), it is checked whether it is a predicted block 1 (S152). If the predicted block size type is 2N ⁇ N (2N ⁇ N in S151), it is checked whether it is a predicted block 1 (S154). If the predicted block size type is N ⁇ N (N ⁇ N in S151), it is checked whether it is a predicted block 3 (S156).
- block A is set as a recessive combined motion information candidate (S153). If the predicted block size type is N ⁇ 2N and the predicted block size is not 1 (N in S152), the process ends. If the predicted block size type is 2N ⁇ N and the predicted block is 1 (Y in S154), block B is set as a recessive combined motion information candidate (S155). If the predicted block size type is 2N ⁇ N and the predicted block size is not 1 (N in S154), the process ends.
- the prediction block size type is N ⁇ N and the prediction block 3 (Y in S156)
- the block B is set as a recessive combined motion information candidate (S160). If the motion information of the prediction block 0 and the prediction block 1 is not the same (N in S159), the process is terminated.
- the recessive combined motion information candidate when the recessive combined motion information candidate is determined, the recessive combined motion information candidate is valid, and when the recessive combined motion information candidate is not determined, the recessive combined motion information candidate is not effective. To do.
- FIG. 15 is an example of the determination process of the recessive combined motion information candidate determination unit 166, which is inferior using at least the number of divisions of the encoded block into the prediction block and / or the position of the prediction block in the encoded block. It is only necessary to be able to determine the combined motion information candidate, and the present invention is not limited to this.
- the motion block identity determination between the prediction block 0 and the prediction block 1 step S159
- the prediction block 0 and the prediction block 2 step S157.
- block A can always be an inferior combined motion information candidate
- block B can always be an inferior combined motion information candidate.
- the recessive combined motion information candidate may not be determined. it can. This means that additional control of the recessive joint motion information is performed based on the number of divisions of the encoded block into the prediction blocks.
- FIG. 16 is a flowchart for explaining the operation of the spatially coupled motion information candidate generation unit 160.
- the spatially coupled motion information candidate generating unit 160 repeatedly performs the following processing in the order of block A, block B, block C, block E, and block D, which are candidate blocks included in the spatial candidate block group of the candidate block group (from S110) S115).
- the candidate block is valid (S111). That the candidate block is valid means that at least one of the reference indexes of the L0 prediction and the L1 prediction of the candidate block is 0 or more. If the candidate block is valid (Y in S111), it is checked whether the recessive combined motion information candidate is valid (S112). If the candidate block is not valid (N in S111), step S112 to step S115 are skipped and the next candidate block is inspected (S116). If the recessive combined motion information candidate is valid (Y in S112), it is checked whether the motion information of the candidate block is the same as the motion information of the recessive combined motion information candidate (S113).
- the motion information of the candidate block is added to the combined motion information candidate list as a spatial combined motion information candidate (S114). If the motion information of the candidate block is the same as the motion information of the recessive combined motion information candidate (Y in S113), step S114 and step S115 are skipped and the next candidate block is examined (S116). If the motion information of the candidate block is not the same as the motion information of the recessive combined motion information candidate (N in S113), the motion information of the candidate block is added to the combined motion information candidate list as a spatial combined motion information candidate (S114).
- step S114 it is checked whether the number of spatially combined motion information candidates added to the combined motion information candidate list is the maximum number of spatially combined motion information candidates (S115).
- the maximum number of spatially coupled motion information candidates is 4. If the number of spatially combined motion information candidates added to the combined motion information candidate list is not the maximum number of spatially combined motion information candidates (N in S115), the next candidate block is examined (S115). If the number of spatially combined motion information candidates added to the combined motion information candidate list is the maximum number of spatially combined motion information candidates (Y in S115), the process ends.
- the processing information is processed so that the motion information of block A and block B, which are generally considered to have a long tangent to the processing target block and generally have a high correlation with the processing target block, can be preferentially registered in the combined motion information candidate list.
- the order is block A, block B, block C, block E, and block D
- the combined motion information candidates may be registered in the combined motion information candidate list in a highly correlated order, and the present invention is not limited to this.
- the maximum number of spatially coupled motion information candidates is four, the maximum number of spatially coupled motion information candidates may be 1 or more and less than or equal to the number of candidate blocks included in the spatial candidate block group, and is not limited to this.
- Step S112 and Step S113 can be omitted. If the candidate block is valid (Y in S111), the motion information of the candidate block is converted into the spatially coupled motion information. The candidate may be added to the combined motion information candidate list as a candidate (S114).
- FIG. 17 is a flowchart for explaining the operation of the time combination motion information candidate generation unit 161.
- the following processing is repeated for each prediction direction LX of L0 prediction and L1 prediction (S120 to S127).
- X is 0 or 1.
- the following processing is repeated in the order of block H and block I, which are candidate blocks included in the time candidate block group of the candidate block group (S121 to S126).
- the temporal combination motion information candidate generation unit 161 checks whether the LN prediction of the candidate block is valid (S122).
- N is 0 or 1.
- N is the same as X. That the LN prediction of the candidate block is valid means that the reference index of the LN prediction of the candidate block is 0 or more. If the LN prediction of the candidate block is valid (Y in S122), the LN prediction motion vector of the candidate block is set as the reference motion vector (S123). If the LN prediction of the candidate block is not valid (N in S122), step 123 to step 126 are skipped and the next candidate block is examined (S126).
- a reference image for LX prediction of a temporally coupled motion information candidate is determined (S124).
- the LX prediction reference image of the temporal combination motion information candidate is a reference image that is most frequently used as a reference image for LX prediction of candidate blocks included in the spatial candidate block group.
- the base motion vector is scaled to match the distance between the processing target image and the LX prediction reference image of the temporally combined motion information candidate to calculate a scaling vector, and the scaling vector is used for the temporally combined motion information candidate LX prediction.
- a motion vector is used (S125), and the next prediction direction is processed (S127).
- the scaling vector calculation formula is the same as the scaling vector calculation formula for temporal direct motion compensated prediction in MPEG-4 AVC.
- step S127 it is checked whether at least one of the L0 prediction and the L1 prediction of the temporally coupled motion information candidate is valid (S128).
- the fact that the LN prediction of the temporally combined motion information candidate is effective means that the reference image for the LN prediction of the temporally combined motion information candidate is determined. If at least one of the L0 prediction and the L1 prediction of the temporally combined motion information candidate is valid (Y in S128), the inter prediction type of the temporally combined motion information candidate is determined, and the temporally combined motion information candidate is combined with the motion. The information is added to the information candidate list (S129).
- the inter prediction type is determined as Pred_L0 if only the L0 prediction is valid, and if only the L1 prediction is valid, the inter prediction type of the temporally combined motion information candidate is determined. Is Pred_L1, and if both the L0 prediction and the L1 prediction are valid, the inter prediction type of the temporally combined motion information candidate is Pred_BI.
- FIG. 17 omits the processing corresponding to step S115 shown in FIG. 16 which is a flowchart for explaining the operation of the spatially coupled motion information candidate generation unit 160, but the maximum number of temporally coupled motion information candidate candidates is 2 or more. In this case, a process corresponding to step S115 can be added after step S129.
- the reference image for LX prediction of temporal combination motion information candidates is a reference image that is most frequently used as a reference image for LX prediction of candidate blocks included in the spatial candidate block group, but is not limited thereto, and is not limited to this.
- the reference image may be determined to be 0.
- the inter prediction type is any one of Pred_L0, Pred_L1, and Pred_BI, it may be only Pred_BI and is not limited to this.
- the spatial candidate block group is 5
- the spatial combined motion information candidate maximum number is 4
- the temporal combined motion information candidate maximum number is 1, which is relative to the recessive combined motion information candidate instead of the recessive combined motion information candidate.
- the block D having a high selection probability can be registered in the combined motion information candidate list.
- the spatially combined motion information candidate or temporal combination having a relatively higher selection probability than the recessive combined motion information candidate
- the motion information candidates may be registered in the combined motion information candidate list, and the present invention is not limited to this combination.
- the maximum number of spatially combined motion information candidates is 5
- the maximum number of temporally combined motion information candidates is 1, and when the recessive combined motion information candidate is not valid, the combined motion information candidates are given priority to the five spatially combined motion information candidates.
- the temporal combined motion information candidate having a higher selection probability than the recessive combined motion information candidate may be added to the combined motion information candidate list. it can.
- FIG. 18 is a flowchart for explaining the operation of the recessive combined motion information candidate adding unit 163.
- S140 it is checked whether the recessive combined motion information candidate is valid (S140). If the recessive combined motion information candidate is valid (Y in S140), the motion information of the recessive combined motion information candidate is added to the tail of the combined motion information candidate list (S141). If the recessive combined motion information candidate is not valid (N in S140), the process ends.
- the motion information of the recessive combined motion information candidate is added to the tail of the combined motion information candidate list.
- the recessive combined motion information candidate is valid and the motion information candidate list does not have the same motion information as the motion information of the recessive combined motion information candidate, it is added to the tail of the combined motion information candidate list It can also be done. In that case, the presence of redundant combined motion information candidates is excluded from the combined motion information candidate list, and instead, the first supplemental combined motion information candidate and the second supplemental combined motion information candidate are added to the combined motion information candidate list.
- candidate block A and candidate block B are combined motion information candidates with a relatively high selection rate compared to other blocks, a merge index with a short code length is assigned.
- a merge index with a short code length is assigned.
- the motion information candidate generation unit 160 temporarily removes the same motion information as the recessive combined motion information candidate from the combined motion information candidate list, and the recessive combined motion information candidate adding unit 163 adds the recessive combined motion information candidate to the combined motion information.
- the recessive combined motion information candidate determination unit 166 determines the motion information identity between the prediction block 0 and the prediction block 1 (step S159) or the prediction block 0.
- the motion information identity determination of the prediction block 2 step S157
- the blocks A and B are always inferior combined motion information candidates in the prediction block 3
- the operation of FIG. It is also possible to repeatedly process the two inferior combined motion information candidates of B.
- FIG. 19 is a flowchart for explaining the operation of the first combined motion information candidate supplementing unit 164.
- the first supplemental combined motion information candidate MaxNumGenCand which is the maximum number for generating, is calculated from Equation 1 (S170).
- MaxNumGenCand MaxNumMergeCand-NumCandList; (NumCandList> 0)
- MaxNumGenCand is larger than 0 (S171). If MaxNumGenCand is not greater than 0 (N in S171), the process ends. If MaxNumGenCand is greater than 0 (Y in S171), the following processing is performed. First, loopTimes that is the number of combination inspections is determined. LoopTimes is set to NumCandList ⁇ NumCandList. However, if loopTimes exceeds 8, loopTimes is limited to 8 (S172). Here, loopTimes is an integer from 0 to 7. The following processing is repeated for loopTimes (S172 to S180). A combination of the combined motion information candidate M and the combined motion information candidate N is determined (S173).
- FIG. 20 is a diagram for explaining the relationship between the number of combination inspections, the combined motion information candidate M, and the combined motion information candidate N.
- M and N are different values, and are set in order of decreasing total value of M and N. It is checked whether the L0 prediction of the combined motion information candidate M is valid and the L1 prediction of the combined motion information candidate N is valid (S174). If the L0 prediction of the combined motion information candidate M is valid and the L1 prediction of the combined motion information candidate N is valid (N in S174), the L0 prediction reference image of the combined motion information candidate M and the motion vector are combined motion information candidates.
- bi-coupled motion information is generated by combining the L0 prediction of a certain combined motion information candidate and the L1 prediction motion information of a different combined motion information candidate. If the reference image for L0 prediction of the combined motion information candidate M and the reference image for L1 prediction of the combined motion information candidate N are the same (N in S175), the next combination is processed. Subsequent to step S176, it is checked whether or not there is a combined motion information candidate in the combined motion information candidate list (S177). If the combined motion information candidate does not exist in the combined motion information candidate list (Y of S177), the combined motion information candidate is added to the combined motion information candidate list (S178).
- step S178 is skipped. Subsequent to step S177 or step S178, it is checked whether the number of generated double coupled motion information is MaxNumGenCand (S179). If the number of generated double coupled motion information is MaxNumGenCand (Y in S179), the process ends. If the number of generated double coupled motion information is not MaxNumGenCand (N in S179), the next combination is processed.
- the first supplemental combined motion information candidate is set as an L0 prediction motion vector of a certain combined motion information candidate registered in the combined motion information candidate list and the reference image, and the L1 prediction motion vector of another combined motion information candidate.
- a bi-coupled motion information candidate in which the direction of motion compensation prediction is bidirectional is used, but the present invention is not limited to this.
- combined motion information in which the direction of motion compensated prediction in which an offset value such as +1 is added to the motion vector of L0 prediction and the motion vector of L1 prediction of a certain combined motion information candidate registered in the combined motion information candidate list is bidirectional.
- Combined motion information in which the direction of motion compensated prediction obtained by adding an offset value such as +1 to the motion vector of L0 prediction or the motion vector of L1 prediction of a certain combined motion information candidate registered in the candidate, combined motion information candidate list is unidirectional Can also be a candidate.
- a motion vector of L1 prediction is obtained by scaling based on a motion vector of L0 prediction of a combined motion information candidate registered in the combined motion information candidate list, and these are combined.
- a new combined motion information candidate in which the direction of motion compensation prediction is bidirectional may be generated. Moreover, you may combine them arbitrarily.
- the first supplemental combined motion information candidate is a combined motion information candidate when there is a slight difference between the motion information of the combined motion information candidate registered in the combined motion information candidate list and the motion information candidate to be processed. Coding efficiency can be improved by correcting the motion information of the combined motion information candidates registered in the list to generate effective combined motion information candidates.
- either the L0 prediction or the L1 prediction is the same as the motion information of the processing target block even if the motion information is a recessive combined motion information candidate that is unlikely to be selected as a combined motion information candidate.
- the motion information of the processing target block may be slightly deviated from the motion information of the recessive combined motion information candidate. Therefore, without deleting the motion information of the recessive combined motion information candidate, the motion information of the recessive combined motion information candidate is combined with the motion information of another combined motion information candidate, or the motion information of the recessive combined motion information candidate is corrected.
- the first supplemental combined motion information candidate having a higher selectivity than the recessive combined motion information candidate, it is possible to increase the encoding efficiency.
- at least two combined motion information candidates are required when using the dual combined motion information candidate, only one combined motion information candidate other than the recessive combined motion information candidate is registered in the combined motion information candidate list. If not, the motion information of the recessive combined motion information candidate is combined with the motion information of another combined motion information candidate without deleting the motion information of the recessive combined motion information candidate, so that the coding efficiency can be improved.
- FIG. 21 is a flowchart for explaining the operation of the second combined motion information candidate supplementing unit 165.
- NumCandList the number of combined motion information candidates registered in the combined motion information candidate list supplied from the first combined motion information candidate supplementing unit 164 and the maximum number of merge candidates (MaxNumMergeCand)
- MaxNumGenCand MaxNumMergeCand-NumCandList; (NumCandList> 0)
- the second supplemental combined motion in which the motion vector for L0 prediction is (0,0), the reference index is i, the motion vector for L1 prediction is (0,0), and the inter prediction type for which the reference index is i is Pred_BI.
- Information candidates are generated (S192). It is checked whether the second supplementary combined motion information candidate exists in the combined motion information candidate list (S193). If the second supplementary combined motion information candidate does not exist in the combined motion information candidate list (Y in S193), the second supplemental combined motion information candidate is added to the combined motion information candidate list (S194). If the second supplementary combined motion information candidate exists in the combined motion information candidate list (N in S193), the next i is processed (S195).
- the second supplementary combined motion information candidate has a motion vector for L0 prediction of (0, 0), a reference index of i, a motion vector of L1 prediction of (0, 0), and a reference index of i.
- the combined motion information candidate whose inter prediction type is Pred_BI is used. This is because, in a general moving image, the frequency of occurrence of combined motion information candidates in which the motion vector for L0 prediction and the motion vector for L1 prediction are (0, 0) is statistically high.
- the present invention is not limited to this as long as it is a combined motion information candidate that is statistically frequently used without depending on the motion information of the combined motion information candidate registered in the combined motion information candidate list.
- the motion vectors of L0 prediction and L1 prediction may be vector values other than (0, 0), respectively, and may be set so that the reference indexes of L0 prediction and L1 prediction are different.
- the second supplementary combined motion information candidate can be set as an encoded image or motion information with a high occurrence frequency of a part of the encoded image, encoded in an encoded stream, and transmitted.
- the combined motion information candidate registered in the combined motion information candidate list When the number is zero, it is possible to use the merge mode and improve the encoding efficiency.
- the motion information of the combined motion information candidate registered in the combined motion information candidate list is different from the motion information candidate motion to be processed, new combined motion information that is statistically frequently used in the combined motion information candidate list Encoding efficiency can be improved by adding candidates and expanding the range of options.
- FIG. 22 is a diagram showing a configuration of the video decoding device 200 according to Embodiment 1.
- the video decoding device 200 is a device that generates a playback image by decoding the code string encoded by the video encoding device 100.
- the video decoding device 200 is realized by hardware such as an information processing device including a CPU (Central Processing Unit), a frame memory, and a hard disk.
- the moving picture decoding apparatus 200 realizes functional components described below by operating the above components.
- the coding block division, the prediction block size type determination, the prediction block size and the position of the prediction block in the coding block (position information of the prediction block), and the determination whether the prediction coding mode is intra are not shown. It is assumed that it is determined by the upper control unit, and here, a case where the predictive coding mode is not intra will be described. Note that the position information and the prediction block size of the prediction block to be decoded are shared in the video decoding device 200 and are not shown.
- the moving picture decoding apparatus 200 includes a code string analysis unit 201, a prediction error decoding unit 202, an addition unit 203, a motion information reproduction unit 204, a motion compensation unit 205, a frame memory 206, and a motion information memory 207.
- the code string analysis unit 201 analyzes the code string supplied from the terminal 30 to generate prediction error encoded data, a merge flag, a merge index, a motion compensation prediction direction (inter prediction type), a reference index, a difference vector, and Entropy decodes the prediction vector index according to the syntax. Entropy decoding is performed by a method including variable length coding such as arithmetic coding or Huffman coding. Then, the prediction error encoded data is supplied to the prediction error decoding unit 202, and the merge flag, the merge index, the inter prediction type, the reference index, the difference vector, and the prediction vector index are supplied to the motion information reproduction unit 204. To do.
- the motion information reproduction unit 204 includes a merge flag, a merge index, an inter prediction type, a reference index, a difference vector, and a prediction vector index supplied from the code string analysis unit 201 and a candidate block group supplied from the motion information memory 207.
- the motion information is reproduced, and the motion information is supplied to the motion compensation unit 205 and the motion information memory 207.
- a detailed configuration of the motion information reproducing unit 204 will be described later.
- the motion compensation unit 205 performs motion compensation on the reference image indicated by the reference index in the frame memory 206 based on the motion information supplied from the motion information reproduction unit 204, and generates a prediction signal. If the prediction direction is bi-prediction, an average of the prediction signals of the L0 prediction and the L1 prediction is generated as a prediction signal, and the prediction signal is supplied to the adding unit 203.
- the prediction error decoding unit 202 performs a process such as inverse quantization or inverse orthogonal transform on the prediction error encoded data supplied from the code string analysis unit 201 to generate a prediction error signal, and the prediction error signal is It supplies to the addition part 203.
- the adding unit 203 adds the prediction error signal supplied from the prediction error decoding unit 202 and the prediction signal supplied from the motion compensation unit 205 to generate a decoded image signal, and the decoded image signal is stored in the frame memory 206 and Supply to terminal 31.
- the frame memory 206 and the motion information memory 207 have the same functions as the frame memory 110 and the motion information memory 111 of the moving picture coding apparatus 100.
- the frame memory 206 stores the decoded image signal supplied from the adding unit 203.
- the motion information memory 207 stores the motion information supplied from the motion information reproducing unit 204 in units of the minimum predicted block size.
- FIG. 23 shows the configuration of the motion information playback unit 204.
- the motion information playback unit 204 includes an encoding mode determination unit 210, a motion vector playback unit 211, and a combined motion information playback unit 212.
- the terminal 32 is connected to the code string analysis unit 201
- the terminal 33 is connected to the motion information memory 207
- the terminal 34 is connected to the motion compensation unit 205
- the terminal 36 is connected to the motion information memory 207.
- the encoding mode determination unit 210 determines whether the merge flag supplied from the code string analysis unit 201 is “0” or “1”. If the merge flag is “0”, the inter prediction type, reference index, difference vector, and prediction vector index supplied from the code string analysis unit 201 are supplied to the motion vector reproduction unit 211. If the merge flag is “1”, the merge index supplied from the code string analysis unit 201 is supplied to the combined motion information reproduction unit 212.
- the motion vector reproduction unit 211 reproduces a motion vector from the inter prediction type, the reference index, the difference vector, and the prediction vector index supplied from the encoding mode determination unit 210 and the candidate block group supplied from the terminal 33. Motion information is generated and supplied to the terminals 34 and 36.
- the combined motion information reproduction unit 212 reproduces motion information from the merge index supplied from the encoding mode determination unit 210 and the candidate block group supplied from the terminal 33 and supplies the motion information to the terminal 34 and the terminal 36.
- FIG. 24 shows the configuration of the combined motion information playback unit 212.
- the combined motion information reproduction unit 212 includes a combined motion information candidate list generation unit 230 and a combined motion information selection unit 231.
- the terminal 35 is connected to the encoding mode determination unit 210.
- the combined motion information candidate list generation unit 230 has the same function as the combined motion information candidate list generation unit 140 of the video encoding device 100, and is the same as the combined motion information candidate list generation unit 140 of the video encoding device 100.
- the combined motion information candidate list is generated by the operation, and the combined motion information candidate list is supplied to the combined motion information selection unit 231.
- the combined motion information selection unit 231 selects a combined motion information candidate indicated by the merge index supplied from the terminal 35 from the combined motion information candidate list supplied from the combined motion information candidate list generation unit 230 and combines motion The information is determined, and the motion information of the combined motion information is supplied to the terminal 34 and the terminal 36.
- FIG. 25 is a diagram for explaining an example of the effect according to the first embodiment of the present invention.
- FIG. 25 shows the effect of backward addition of a recessive combined motion information candidate to the combined motion information candidate list when candidate block A is a recessive combined motion information candidate.
- candidate block A, candidate block B, candidate block C, candidate block E, and candidate block T are registered in the combined motion information candidate list.
- merge index 0 (candidate block A), merge index 1 (candidate block B), merge index 2 (candidate block C), merge index 3 (candidate block E), and merge index 4 (candidate block T) are respectively Suppose that they are 2%, 60%, 14%, 12%, and 12%. Strictly speaking, the merge index selection probability varies between when the recessive combined motion information candidate is added in front of the combined motion information candidate list and when the recessive combined motion information candidate is added behind the combined motion information candidate list. Here, for simplification of description, it is assumed that the merge index selection probability does not change. The same applies to examples for explaining the subsequent effects. In this case, the expected code length of the forward added merge index is 2.6 bits (Formula 3).
- the expected value of the code length of the merge index added later is 1.8 bits (Formula 4). Therefore, it can be seen that the expected value of the code length of the merge index added backward is 0.8 bits shorter than the forward addition, and the coding efficiency is improved.
- 0.02x1 + 0.6x2 + 0.14x3 + 0.12x4 + 0.12x4 2.6 Equation 3
- 0.6x1 + 0.14x2 + 0.12x3 + 0.12x4 + 0.02x4 1.8 Equation 4
- the inferior combined motion information candidate is added to the tail of the combined motion information candidate list in which the code length of the merge index is longer than that of the spatially combined motion information candidate or the temporally combined motion information candidate, and the selectivity is relatively Coding efficiency can be improved by assigning a merge index having a short code length to candidate blocks other than the inferior joint motion information candidate that increases.
- FIG. 26 is a diagram for explaining another example of the effect according to the first embodiment of the present invention. Here, it is assumed that candidate block A, candidate block B, and candidate block T are valid in the candidate block group, and the motion compensation prediction directions of candidate block A and candidate block B are both Pred_BI.
- the determination process of the recessive joint motion information candidate determination unit 166 according to Embodiment 1 includes at least either the number of divisions of the encoded block into the prediction block and / or the position of the prediction block in the encoded block as shown in FIG. It was assumed that a recessive combined motion information candidate could be determined. For example, as a condition determination simplification method that does not determine a recessive combined motion information candidate when the prediction block type is N ⁇ N, as shown in FIG. The recessive combined motion information candidate can also be determined using the block height.
- FIG. 27 is a flowchart for explaining an example of the determination process of the recessive combined motion information candidate determination unit 166.
- the operation of the recessive combined motion information candidate determination unit 166 according to the second modification of the first embodiment will be described with reference to FIG.
- the prediction block 1 (S200). If it is the prediction block 1 (Y in S200), it is checked whether the height of the prediction block is larger than the width of the prediction block (S201). If it is not the prediction block 1 (N of S200), a process will be complete
- block B is set as a recessive combined motion information candidate (S204), and the process ends. If the height of the prediction block is not smaller than the width of the prediction block (N in S203), the process is terminated.
- the recessive combined motion information candidate adding unit 163 is installed after the redundant combined motion information candidate deleting unit 162 as shown in FIG.
- the unit 163 may be installed after the spatially coupled motion information candidate generation unit 160.
- the recessive combined motion information candidate cannot be registered after the temporal combined motion information candidate in the combined motion information candidate list, but the process of adding the recessive combined motion information candidate is performed in the spatial combined motion information candidate generation unit 160. Therefore, the independence of the spatially coupled motion information candidate generating unit 160 and the temporally coupled motion information candidate generating unit 161 can be increased, and circuit design and software design can be facilitated.
- the recessive combined motion information candidate adding unit 163 is installed after the redundant combined motion information candidate deleting unit 162 as shown in FIG.
- the unit 163 may be installed after the first combined motion information candidate supplementing unit 164 and the second combined motion information candidate supplementing unit 165.
- the first combined motion information candidate supplementing unit 164 when the first supplemental combined motion information candidate is generated from the combined motion information candidates other than the recessive combined motion information candidate, the first combined motion information candidate supplementing unit 164 is relatively more than the recessive combined motion information candidate. Encoding efficiency can be improved by allocating a merge index having a code length shorter than that of the recessive combined motion information candidate to the first supplemental combined motion information candidate having a high selection probability. When the second supplemental combined motion information candidate has a relatively higher selection probability than the recessive combined motion information candidate, the second supplemental combined motion information candidate has a shorter code length than the recessive combined motion information candidate. By assigning the merge index, the encoding efficiency can be improved.
- the same motion information as the recessive combined motion information candidate is combined with the motion information of another combined motion information candidate, or the recessive combined motion information candidate.
- the encoding efficiency can be improved.
- since at least two combined motion information candidates are required when using the dual combined motion information candidate only one combined motion information candidate other than the recessive combined motion information candidate is registered in the combined motion information candidate list. If not, the same motion information as the recessive combined motion information candidate is combined with the motion information of another combined motion information candidate without deleting the same motion information as the recessive combined motion information candidate, thereby increasing the coding efficiency. be able to.
- the spatial combined motion information candidate generation unit 160 includes the recessive combined motion information candidate determination unit 166 as shown in FIG.
- the recessive combined motion information candidate is not registered in the combined motion information candidate list before the unit 163, and the present invention is not limited to this.
- the redundant combined motion information candidate deletion unit 162 may include a recessive combined motion information candidate determination unit 166.
- the redundant combined motion information candidate deletion unit 162 can exclude the spatially combined motion information candidate having the same motion information as the motion information of the recessive combined motion information candidate from the combined motion information candidate list.
- the redundant combined motion information candidate deletion unit 162 can exclude the spatially combined motion information candidate having the same motion information as the motion information of the recessive combined motion information candidate from the combined motion information candidate list.
- FIG. 28 is a diagram for explaining the configuration of the combined motion information candidate list generation unit 140 according to the second embodiment. 12 differs from the combined motion information candidate list generation unit 140 of Embodiment 1 in that a recessive combined motion information candidate deletion unit 167 is added after the first combined motion information candidate supplement unit 164.
- FIG. 29 is a diagram for explaining the operation of the combined motion information candidate list generation unit 140 according to the second embodiment.
- the operation of the combined motion information candidate list generation unit 140 of Embodiment 1 in FIG. 13 is that the recessive combined motion information candidate deletion unit 167 deletes the recessive combined motion information candidate from the combined motion information candidate list following step S106.
- step (S108) is installed.
- FIG. 30 is a diagram for explaining the operation of the recessive combined motion information candidate deletion unit 167.
- the following processing is repeated for the combined motion information candidates M corresponding to the number (NumCandList) of combined motion information candidates registered in the combined motion information candidate list supplied from the first combined motion information candidate supplementing unit 164 (S210 to S213).
- M is an integer from 0 to NumCandList-1. Whether the candidate block M is recessive combined motion information is checked (S211).
- the recessive combined motion information candidate is deleted from the combined motion information candidate list (S212), and the process is terminated. If the candidate block M is not the recessive combined motion information (N in S211), the next combined motion information candidate is processed (S213).
- Embodiment 2 (Modification of Embodiment 2)
- the operation of the recessive combined motion information candidate deletion unit 167 according to Embodiment 2 is simply to delete the recessive combined motion information candidate registered in the combined motion information candidate list as shown in FIG. 32, but it is a recessive combined motion information candidate. It can also be determined whether or not to delete the recessive combined motion information candidate registered in the combined motion information candidate list by determining whether or not the combined motion information of the prediction block is a duplicate combined motion information candidate.
- FIG. 31 is a diagram for explaining the operation of the recessive combined motion information candidate deletion unit 167 in the modification of the second embodiment. It differs from FIG. 30 in that step S213 is added. It is checked whether the combined motion information of the prediction block including the recessive combined motion information candidate is a redundant combined motion information candidate (step S213). If the combined motion information candidate of the prediction block including the recessive combined motion information candidate is an overlapping combined motion information candidate (Y in step S213), the recessive combined motion information candidate is deleted from the combined motion information candidate list (S212), and processing is performed. Exit. If the combined motion information of the prediction block including the recessive combined motion information candidate is not a redundant combined motion information candidate (N in step S213), the process ends.
- FIG. 32 is a diagram for explaining overlapping combined motion information candidates.
- FIG. 32 shows an example in which the encoded block is 16 ⁇ 16.
- FIG. 32A is a diagram for explaining a candidate for overlapping joint motion information when the prediction block size type is N ⁇ 2N, and the recessive joint motion of the prediction block 1 when the prediction block size type is N ⁇ 2N.
- a candidate block A, a candidate block B, a candidate block C, a candidate block D, and a candidate block E that are spatial combination motion information candidates of the prediction block 0 that is an information candidate are shown.
- 32B shows candidate block A, candidate block B, candidate block C, candidate block D, and candidate block E, which are spatially coupled motion information candidates in the case of the prediction block 0 having a prediction block size type of 2N ⁇ 2N. Show. In FIG. 32A and FIG. 32B, it can be seen that the positions of candidate block A, candidate block D, and candidate block E overlap. A candidate block that overlaps with a position of a candidate block of a larger predicted block size type in a certain predicted block size type is set as a redundant combined motion information candidate.
- the prediction block 0 that is the recessive combined motion information candidate of the prediction block 1 when the prediction block size type is N ⁇ 2N
- the prediction block Since selecting the overlapped joint motion information candidate in 1 is equivalent to the case of selecting the overlapped joint motion information candidate as 2N ⁇ 2N, it always involves a decrease in coding efficiency.
- FIG. 32 (c) is a diagram showing that the overlapped joint motion information candidates for the prediction block 0 of 2N ⁇ N prediction block size type are candidate block B, candidate block C, and candidate block D.
- FIG. 32 (d) is a diagram showing that the overlapped joint motion information candidates in the case of the prediction block 2 with the prediction block size type of N ⁇ N are candidate block A, candidate block D, and candidate block E.
- FIG. 32 (e) is a diagram showing that the overlapped joint motion information candidates for the prediction block 1 of which the prediction block size type is N ⁇ N are candidate block B, candidate block C, and candidate block D.
- the recessive combined motion information candidate registered in the combined motion information candidate list is obtained. By determining whether or not to delete, encoding efficiency can be improved.
- the moving image encoded stream output from the moving image encoding apparatus of the embodiment described above has a specific data format so that it can be decoded according to the encoding method used in the embodiment. Therefore, the moving picture decoding apparatus corresponding to the moving picture encoding apparatus can decode the encoded stream of this specific data format.
- the encoded stream When a wired or wireless network is used to exchange an encoded stream between a moving image encoding device and a moving image decoding device, the encoded stream is converted into a data format suitable for the transmission form of the communication path. It may be transmitted.
- a video transmission apparatus that converts the encoded stream output from the video encoding apparatus into encoded data in a data format suitable for the transmission form of the communication channel and transmits the encoded data to the network, and receives the encoded data from the network Then, a moving image receiving apparatus that restores the encoded stream and supplies the encoded stream to the moving image decoding apparatus is provided.
- the moving image transmitting apparatus is a memory that buffers the encoded stream output from the moving image encoding apparatus, a packet processing unit that packetizes the encoded stream, and transmission that transmits the packetized encoded data via the network.
- the moving image receiving apparatus generates a coded stream by packetizing the received data, a receiving unit that receives the packetized coded data via a network, a memory that buffers the received coded data, and packet processing. And a packet processing unit provided to the video decoding device.
- the above processing relating to encoding and decoding can be realized as a transmission, storage, and reception device using hardware, and is stored in a ROM (Read Only Memory), a flash memory, or the like. It can also be realized by firmware or software such as a computer.
- the firmware program and software program can be recorded on a computer-readable recording medium, provided from a server through a wired or wireless network, or provided as a data broadcast of terrestrial or satellite digital broadcasting Is also possible.
- the present invention can be used for a moving picture coding technique for coding motion information used in motion compensation prediction.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Selon l'invention, une unité de génération de liste d'informations de mouvement couplé candidates (140), lors de la génération d'une liste de candidates, sélectionne un bloc de prédiction codé précédemment dans lequel des informations de mouvement comprenant des informations sur un vecteur de mouvement et des informations sur une image de référence sont valides, parmi une pluralité de blocs de prédiction adjacents à un bloc de prédiction destiné à être codé, et prend, en tant que candidates à une sélection, des informations de mouvement du bloc de prédiction codé précédemment sélectionné, mais ne prend pas, en tant que candidates à une sélection, des informations de mouvement d'un bloc de prédiction spécifique dans la pluralité de blocs de prédiction qui est adjacent au bloc de prédiction destiné à être codé, sur la base d'un type de segmentation du bloc de prédiction destiné à être codé et de la position du bloc de prédiction destiné à être codé dans un bloc de codage. Une unité de sélection d'informations de mouvement couplé (141) détermine les informations de mouvement qui doivent être utilisées pour prédire une compensation de mouvement pour le bloc de prédiction destiné à être codé, à partir de la liste de candidates. Une unité de génération de chaîne de codes (104) code des informations pour spécifier les informations de mouvement ainsi déterminées dans la liste de candidates.
Applications Claiming Priority (8)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2011244801 | 2011-11-08 | ||
| JP2011244802 | 2011-11-08 | ||
| JP2011-244802 | 2011-11-08 | ||
| JP2011-244801 | 2011-11-08 | ||
| JP2012-244596 | 2012-11-06 | ||
| JP2012244595A JP2013123217A (ja) | 2011-11-08 | 2012-11-06 | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、送信装置、送信方法及び送信プログラム |
| JP2012-244595 | 2012-11-06 | ||
| JP2012244596A JP2013123218A (ja) | 2011-11-08 | 2012-11-06 | 動画像復号装置、動画像復号方法、動画像復号プログラム、受信装置、受信方法及び受信プログラム |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2013069269A1 true WO2013069269A1 (fr) | 2013-05-16 |
Family
ID=48289476
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2012/007130 Ceased WO2013069269A1 (fr) | 2011-11-08 | 2012-11-07 | Dispositif de codage vidéo, procédé de codage vidéo, programme de codage vidéo, dispositif de transmission, procédé de transmission, programme de transmission, dispositif de décodage vidéo, procédé de décodage vidéo, programme de décodage vidéo, dispositif de réception, procédé de réception et programme de réception |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2013069269A1 (fr) |
-
2012
- 2012-11-07 WO PCT/JP2012/007130 patent/WO2013069269A1/fr not_active Ceased
Non-Patent Citations (6)
| Title |
|---|
| HIDEKI TAKEHARA ET AL.: "Bi-derivative merge candidate", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11 6TH MEETING, July 2011 (2011-07-01), TORINO, IT * |
| HIROYA NAKAMURA ET AL.: "Unification of derivation process for merge mode and MVP", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT- VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/ WG11 6TH MEETING, July 2011 (2011-07-01), TORINO, IT * |
| JIANLE CHEN ET AL.: "MVP index parsing with fixed number of candidates", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11 6TH MEETING, July 2011 (2011-07-01), TORINO, IT * |
| TOSHIYASU SUGIO ET AL.: "arsing Robustness for Merge/AMVP", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11 6TH MEETING, July 2011 (2011-07-01), TORINO, IT * |
| YNUFEI ZHENG ET AL.: "Unified Motion Vector Predictor Selection for Merge and AMVP", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11 5TH MEETING, March 2011 (2011-03-01), GENEVA * |
| YUNFEI ZHENG ET AL.: "Merge Candidate Selection in 2NxN, Nx2N, and NxN Mode", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11 6TH MEETING, July 2011 (2011-07-01), TORINO, IT * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2013099285A1 (fr) | Dispositif de codage vidéo, procédé de codage vidéo et programme de codage vidéo, et dispositif de décodage vidéo, procédé de décodage vidéo et programme de décodage vidéo | |
| WO2013099283A1 (fr) | Dispositif de codage vidéo, procédé de codage vidéo et programme de codage vidéo, et dispositif de décodage vidéo, procédé de décodage vidéo et programme de décodage vidéo | |
| JP6065088B2 (ja) | 動画像符号化装置、動画像符号化方法、及び動画像符号化プログラム、並びに、送信装置、送信方法、及び送信プログラム | |
| JP2013121167A (ja) | 画像符号化装置、画像符号化方法及び画像符号化プログラム | |
| JP6256539B2 (ja) | 動画像復号装置、動画像復号方法、及び動画像復号プログラム | |
| JP5725118B2 (ja) | 動画像復号装置、動画像復号方法、及び動画像復号プログラム、並びに、受信装置、受信方法、及び受信プログラム | |
| JP5725006B2 (ja) | 動画像復号装置、動画像復号方法、及び動画像復号プログラム、並びに、受信装置、受信方法、及び受信プログラム | |
| JP2013118626A (ja) | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、送信装置、送信方法及び送信プログラム | |
| JP2013123218A (ja) | 動画像復号装置、動画像復号方法、動画像復号プログラム、受信装置、受信方法及び受信プログラム | |
| JP5843041B1 (ja) | 動画像復号装置、動画像復号方法、及び動画像復号プログラム、並びに、受信装置、受信方法、及び受信プログラム | |
| WO2013065300A1 (fr) | Dispositif de codage d'image en mouvement, procédé de codage d'image en mouvement, programme de codage d'image en mouvement, transmetteur, procédé de transmission et programme de transmission; et dispositif de décodage d'image en mouvement, procédé de décodage d'image en mouvement, programme de décodage d'image en mouvement, récepteur, procédé de réception et programme de réception | |
| WO2013069269A1 (fr) | Dispositif de codage vidéo, procédé de codage vidéo, programme de codage vidéo, dispositif de transmission, procédé de transmission, programme de transmission, dispositif de décodage vidéo, procédé de décodage vidéo, programme de décodage vidéo, dispositif de réception, procédé de réception et programme de réception | |
| JP2013121163A (ja) | 画像符号化装置、画像符号化方法及び画像符号化プログラム | |
| JP2013121164A (ja) | 画像復号装置、画像復号方法及び画像復号プログラム | |
| JP2013123217A (ja) | 動画像符号化装置、動画像符号化方法、動画像符号化プログラム、送信装置、送信方法及び送信プログラム | |
| JP2013121168A (ja) | 画像復号装置、画像復号方法及び画像復号プログラム | |
| JP2013121166A (ja) | 画像復号装置、画像復号方法及び画像復号プログラム | |
| JP2013121165A (ja) | 画像符号化装置、画像符号化方法及び画像符号化プログラム |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12848045 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 12848045 Country of ref document: EP Kind code of ref document: A1 |