RU2340115C1

RU2340115C1 - Method of coding video signals, supporting fast algorithm of precise scalability on quality

Info

Publication number: RU2340115C1
Application number: RU2007139817/09A
Authority: RU
Inventors: Воо-Дзин ХАН (KR); Воо-Дзин ХАН; Кио-хиук ЛИ (KR); Кио-Хиук ЛИ; Санг-чанг ЧА (KR); Санг-чанг ЧА
Original assignee: Самсунг Электроникс Ко., Лтд.
Priority date: 2005-04-29
Filing date: 2006-04-20
Publication date: 2008-11-27
Also published as: JP2008539646A; EP1878261A1; CA2609648A1; AU2006241637A1; WO2006118383A1

Abstract

FIELD: information technology.

SUBSTANCE: video decoding method supports an algorithm of precise scalability on quality (FGS). The method involves: obtaining a predicted image for the current frame, using a motion vector, evaluated with a predetermined accuracy, quantisation of the difference between the current frame and the predicted image, inverse quantisation of the difference and formation of a reconstructed image of the current frame, compensation of motion on the reference frame of the FGS level and on the reference frame of the main level, using an estimated motion vector, calculation of the difference between the FGS level reference frame with compensated motion and the reference frame of the main level with compensated motion, subtraction of the reconstructed image from the current frame and the calculated remainder from the current frame and coding of the subtraction result.

EFFECT: reduced volume of calculations, required for a multi-level algorithm of progressive precise scalability on quality (PFGS).

49 cl, 12 dwg

Description

Область техники, к которой относится изобретениеFIELD OF THE INVENTION

Способы и устройства, соответствующие настоящему изобретению, относятся к кодированию видеосигналов и, более конкретно, к кодированию видеоинформации, которое уменьшает объем вычислений, требуемых для основанного на многоуровневости алгоритма прогрессивной точной масштабируемости по качеству (Progressive Fine Granular Scalability, PFGS).The methods and devices of the present invention relate to encoding video signals and, more specifically, to encoding video information, which reduces the amount of computation required for the Progressive Fine Granular Scalability (PFGS) algorithm based on the multilevel algorithm.

Уровень техникиState of the art

С развитием технологии передачи информации, в том числе, Интернета, увеличился объем мультимедийных услуг, содержащих различные виды информации, такой как текст, видео-, аудиоинформация и так далее. Мультимедийные данные требуют большой емкости носителей данных и широкой полосы пропускания для передачи, поскольку объем мультимедийных данных обычно большой. Соответственно, для передачи мультимедийных данных, в том числе текста, видео- и аудиоинформации, способ кодирования со сжатием является необходимостью.With the development of information transmission technology, including the Internet, the volume of multimedia services containing various types of information, such as text, video, audio information and so on, has increased. Multimedia data requires a large capacity of storage media and a wide bandwidth for transmission, since the volume of multimedia data is usually large. Accordingly, for transmitting multimedia data, including text, video and audio information, a compression encoding method is a necessity.

Основной принцип сжатия данных заключается в исключении избыточности данных. Данные могут быть сжаты посредством удаления пространственной избыточности, при которой в изображении повторяется один и тот же цвет или объект, временной избыточности, при которой между соседними кадрами движущегося изображения имеются лишь небольшие изменения или в аудиоинформации повторяется один и тот же звук, или психической визуальной избыточности, учитывающей особенности человеческого зрения и ограниченное восприятие высокой частоты. При обычном видеокодировании временная избыточность удаляется с помощью временной фильтрации, основанной на компенсации движения, а пространственная избыточность удаляется пространственным преобразованием.The basic principle of data compression is to eliminate data redundancy. Data can be compressed by removing spatial redundancy, in which the same color or object is repeated in the image, temporary redundancy, in which there are only small changes between adjacent frames of the moving image, or the same sound is repeated in the audio information, or mental visual redundancy taking into account the peculiarities of human vision and the limited perception of high frequency. In conventional video coding, temporal redundancy is removed by temporal filtering based on motion compensation, and spatial redundancy is removed by spatial transformation.

Чтобы передать мультимедийные данные, созданные после удаления избыточности данных, требуются средства связи. Различные типы средств связи для мультимедийных данных имеют различные характеристики. Используемые в настоящее время средства связи имеют различные скорости передачи. Например, сеть ультраскоростной связи может передавать данные со скоростью нескольких десятков мегабит в секунду, в то время как сеть мобильной связи имеет скорость передачи 384 килобит в секунду. Для поддержки средств связи, обладающих различными скоростями передачи, или для передачи мультимедийных данных в мультимедийной среде могут быть пригодны способы кодирования данных, обладающие масштабируемостью.Communication media is required to transmit multimedia data created after data redundancy removal. Different types of media for multimedia have different characteristics. Currently used communication tools have different transmission speeds. For example, an ultra-fast communication network can transmit data at a speed of several tens of megabits per second, while a mobile communication network has a transmission speed of 384 kilobits per second. Scalable data encoding methods may be suitable for supporting communication media having different transmission rates or for transmitting multimedia data in a multimedia environment.

Масштабируемость указывает на способность частично декодировать единый сжатый поток битов. Масштабируемость содержит пространственную масштабируемость, указывающую видеоразрешение, масштабируемость по отношению "сигнал/шум" (SNR), указывающую уровень качества видеоинформации, и временную масштабируемость, указывающую скорость передачи кадров.Scalability indicates the ability to partially decode a single compressed bit stream. Scalability includes spatial scalability indicating video resolution, signal-to-noise scalability (SNR), indicating the level of video quality, and temporal scalability, indicating the frame rate.

Работа по стандартизации для реализации многоуровневой масштабируемости, основанной на технологии Scalable Extension H.264 (в дальнейшем, будет упоминаться как "H.264 SE"), ведется в настоящее время совместной видеогруппой (JVT) MPEG (группа экспертов по кинематографии) и ITU (Международное Телекоммуникационное Общество). Для поддержки масштабируемости по отношению "сигнал/шум" (SNR) группой JVT внедряются существующие технологии точной масштабируемости по качеству (Fine Granular Scalability, FGS).Standardization work to implement multi-level scalability based on Scalable Extension H.264 technology (hereinafter referred to as “H.264 SE”) is currently being carried out by the joint video group (JVT) MPEG (cinematography expert group) and ITU ( International Telecommunication Society). To support signal-to-noise ratio (SNR) scalability, JVT implements existing Fine Granular Scalability (FGS) technologies.

На фиг.1 показана схема для объяснения традиционного способа FGS. Кодек на основе FGS выполняет кодирование путем деления потока битов видеоинформации на основной уровень и уровень FGS. В настоящем описании знак верхнего штриха (') используется для обозначения восстановленного изображения, полученного после квантования/инверсного квантования. Более конкретно, блок PB, спрогнозированный заранее из блока MB' в восстановленном левом кадре 11 основного уровня, и блока NB' в восстановленном правом кадре 12 основного уровня, используя вектор движения, вычитается из блока О в исходном текущем кадре 12, для получения разностного блока RB. Таким образом, разностный блок RB может быть определен уравнением (1):1 is a diagram for explaining a conventional FGS method. The FGS-based codec performs encoding by dividing the video bitstream into the main layer and the FGS layer. In the present description, a dash ()) is used to indicate a reconstructed image obtained after quantization / inverse quantization. More specifically, a block PB predicted in advance from block MB ′ in the reconstructed left frame of the main layer 11 and block NB ′ in the reconstructed right frame 12 of the main level using the motion vector is subtracted from block O in the original current frame 12 to obtain a difference block RB. Thus, the difference block RB can be defined by equation (1):

R_B = О - P_B = O - (M_B' + N_B')/2R _B = O - P _B = O - (M _B '+ N _B ') / 2 (1)(one)

Разностный блок R_В квантуется с помощью шага квантования QP_В (RB^Q) основного уровня и затем инверсно квантуется, чтобы получить восстановленный разностный блок R_В'. Разность между неквантованным разностным блоком R_В и восстановленным разностным блоком R_В', блок Δ, соответствующий разности, квантуется с размером шага квантования QP_F, по размеру меньшим, чем шаг квантования основного уровня QP_В (коэффициент сжатия уменьшается по мере уменьшения размера шага квантования). Квантованный блок Δ обозначается как Δ^Q. Квантованный разностный блок R_B ^Q разности на основном уровне и квантованный блок Δ^Q на уровне FGS, в конечном счете, передаются в декодер.The difference block R _B is quantized using the quantization step QP _B (RB ^Q ) of the main layer and then is inverted quantized to obtain the reconstructed difference block R _B ′. The difference between the non-quantized difference block R _B and the restored difference block R _B ', the block Δ corresponding to the difference is quantized with a quantization step size QP _F smaller than the quantization step of the main level QP _B (the compression ratio decreases as the size of the quantization step decreases ) The quantized block is denoted as Δ Δ ^Q. The quantized difference block R _B ^{Q of the} difference at the fundamental level and the quantized block Δ ^Q at the FGS level are ultimately transmitted to the decoder.

На фиг.2 показана схема для объяснения работы традиционного способа прогрессивной точной масштабируемости по качеству (PFGS). Обычный способ FGS использует восстановленную разность R_В' квантованного основного уровня для уменьшения объема данных на уровне FGS. Со ссылкой на фиг.2, способ PFGS использует тот факт, что качество левого и правого опорных кадров на уровне FGS также улучшается с помощью способа FGS. То есть, способ PFGS содержит вычисление нового разностного R_F, использующего заново обновленные левый опорный кадр 21 и правый опорный кадр 23 и квантование разности между новым разностным блоком R_F и квантованным блоком R_F' основного уровня, улучшая, таким образом, характеристики кодирования. Новый разностный блок RF определяется уравнением (2):Figure 2 shows a diagram for explaining the operation of the traditional method of progressive precision scalability in quality (PFGS). The conventional FGS method uses the reconstructed difference R _B ′ of the quantized base layer to reduce the amount of data at the FGS level. With reference to FIG. 2, the PFGS method exploits the fact that the quality of the left and right reference frames at the FGS level is also improved by the FGS method. That is, the PFGS method comprises computing a new difference R _F using the newly updated left reference frame 21 and the right reference frame 23 and quantizing the difference between the new difference block R _F and the quantized base layer block R _F ', thereby improving coding characteristics. The new difference block RF is defined by equation (2):

RF = O - P_F = O - (M_F' + N_F')/2RF = O - P _F = O - (M _F '+ N _F ') / 2 (2)(2)

где M_F' и N_F' соответственно обозначают области в восстановленных левом опорном кадре 21 и правом опорном кадре 23 на уровне FGS согласно соответствующим векторам движения.where M _F 'and N _F ' respectively denote the areas in the restored left reference frame 21 and the right reference frame 23 at the FGS level according to the corresponding motion vectors.

Способ PFGS обладает преимуществом перед способом FGS, заключающимся в том, что объем данных на уровне FGS может быть уменьшен благодаря высокому качеству левого и правого опорных кадров. Поскольку уровень FGS также требует отдельной компенсации движения, объем вычислений увеличивается. То есть, хотя способ PFGS улучшил характеристики по сравнению с обычным способом FGS, он требует большого объема вычислений, поскольку компенсация движения выполняется для каждого уровня FGS, чтобы создавать прогнозированный сигнал и разностный сигнал между прогнозированным сигналом и исходным сигналом. Недавно разработанные видеокодеки интерполируют сигнал изображения для компенсации движения с точностью 1/2 или 1/4 пиксела. Когда компенсация движения выполняется с точностью 1/4 пиксела, должно создаваться изображение с размером, соответствующим четырехкратной разрешающей способности первоначального изображения.The PFGS method has the advantage over the FGS method in that the data volume at the FGS level can be reduced due to the high quality of the left and right reference frames. Since the FGS level also requires separate motion compensation, the amount of computation increases. That is, although the PFGS method has improved performance compared to the conventional FGS method, it requires a lot of computation because motion compensation is performed for each FGS level to create a predicted signal and a difference signal between the predicted signal and the original signal. Recently developed video codecs interpolate an image signal to compensate for motion with an accuracy of 1/2 or 1/4 pixel. When motion compensation is performed with an accuracy of 1/4 pixel, an image should be created with a size corresponding to four times the resolution of the original image.

Сущность изобретенияSUMMARY OF THE INVENTION

Техническая проблемаTechnical problem

Способ SE по стандарту H.264 использует шестиполюсный фильтр в качестве фильтра интерполяции с 1/2 пиксела, который обладает значительной сложностью вычислений, требуя большого объема вычислений для компенсации движения. Это усложняет процессы кодирования и декодирования, требуя, таким образом, повышенных ресурсов системы. В частности, этот недостаток может быть наиболее проблематичен в полевых условиях, требующих кодирования и декодирования в реальном времени, таких как прямое радиовещание или видеоконференция.The H.264 SE method uses a six-pole filter as a 1/2 pixel interpolation filter, which has significant computational complexity, requiring a large amount of computation to compensate for motion. This complicates the encoding and decoding processes, thus requiring increased system resources. In particular, this drawback can be most problematic in the field, requiring real-time encoding and decoding, such as live broadcasting or video conferencing.

Техническое решениеTechnical solution

Настоящее изобретение обеспечивает способ и устройство для сокращения объема вычислений, требующихся для компенсации движения при сохранении характеристик алгоритма прогрессивной точной масштабируемости по качеству (PFGS).The present invention provides a method and apparatus for reducing the amount of computation required to compensate for motion while maintaining the characteristics of the Progressive Fine Quality Scalability Algorithm (PFGS).

Согласно аспекту настоящего изобретения, обеспечивается способ видеокодирования, поддерживающий FGS, способ видеокодирования, содержащий этапы, на которых получают прогнозированное изображение для текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, проводят квантование разности между текущим кадром и прогнозированным изображением, проводят инверсное квантование квантованной разности и создают восстановленное изображение для текущего кадра, выполняя компенсацию движения на эталонном кадре уровня FGS и эталонном кадре основного уровня, используя оцененный вектор движения, вычисляют разность между эталонным кадром уровня FGS с компенсированным движением и эталонным кадром основного уровня с компенсированным движением, вычитают восстановленное изображение для текущего кадра и вычисленные разности от текущего кадра и кодируют результат вычитания.According to an aspect of the present invention, there is provided a video encoding method supporting FGS, a video encoding method comprising the steps of obtaining a predicted image for the current frame using a motion vector estimated with predetermined accuracy, quantizing the difference between the current frame and the predicted image, inverting quantization quantized difference and create a reconstructed image for the current frame, performing motion compensation on the reference frame level The FGS and the reference frame of the main level, using the estimated motion vector, calculate the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level with compensated movement, subtract the reconstructed image for the current frame and the calculated differences from the current frame and encode the result of subtraction.

Согласно другому аспекту настоящего изобретения, обеспечивается способ видеокодирования, поддерживающий FGS, способ видеокодирования, содержащий этапы, на которых получают прогнозированное изображение текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, выполняют квантование разности между текущим кадром и прогнозированным изображением, выполняют инверсное квантование квантованной разности и создают восстановленное изображение для текущего кадра, выполняют компенсацию движения на опорном кадре уровня FGS и на опорном кадре основного уровня, используя оцененный вектор движения, и создают прогнозированный кадр для уровня FGS и прогнозированный кадр для основного уровня, соответственно, вычисляют разность между прогнозированным кадром для уровня FGS и прогнозированным кадром для основного уровня, вычитают восстановленное изображение и разность из текущего кадра и кодируют результат вычитания.According to another aspect of the present invention, there is provided a video coding method supporting FGS, a video coding method comprising the steps of obtaining a predicted image of a current frame using a motion vector estimated with predetermined accuracy, quantizing the difference between the current frame and the predicted image, performing inverse quantization quantized difference and create a reconstructed image for the current frame, perform motion compensation on the reference frame the FGS and the reference frame of the main level, using the estimated motion vector, and create a predicted frame for the FGS level and the predicted frame for the main level, respectively, calculate the difference between the predicted frame for the FGS level and the predicted frame for the main level, subtract the reconstructed image and the difference from the current frame and encode the result of subtraction.

Согласно еще одному аспекту настоящего изобретения, обеспечивается способ видеокодирования, поддерживающий FGS, способ видеокодирования, содержащий получение прогнозированного изображения для текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, выполняют квантование разности между текущим кадром и прогнозированным изображением, выполняют инверсное квантование квантованной разности и создают восстановленное изображение для текущего кадра, вычисляют разность между опорным кадром для уровня FGS и опорным кадром для основного уровня, выполняют компенсацию движения на разности, используя оценочный вектор движения, вычитают восстановленное изображение и результат компенсированного движения из текущего кадра и кодируют результат вычитания.According to another aspect of the present invention, there is provided a video coding method supporting FGS, a video coding method comprising obtaining a predicted image for a current frame using a motion vector estimated with predetermined accuracy, quantizing the difference between the current frame and the predicted image, performing inverse quantization of the quantized difference and create a reconstructed image for the current frame, calculate the difference between the reference frame for the FGS level and nym frame for the base layer, performing motion compensation on the difference from the estimated motion vector, subtracting the reconstructed image and the motion-compensated result from the current frame, and encoding the result of subtraction.

Согласно еще одному другому аспекту настоящего изобретения, обеспечивают способ видеокодирования, поддерживающий алгоритм точной масштабируемости по качеству (FGS), способ видеокодирования, содержащий этапы получения прогнозированного изображения для текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, выполняют компенсацию движения на опорном кадре уровня FGS и на опорном кадре основного уровня, используя вектор движения с более низкой точностью, чем точность оцененного вектора движения, вычисляют разность между опорным кадром уровня FGS с компенсированным движением и опорным кадром основного уровня, вычитают прогнозированное изображение из текущего кадра и кодируют результат вычитания.According to yet another aspect of the present invention, there is provided a video coding method supporting an accurate quality scalability algorithm (FGS), a video coding method comprising the steps of obtaining a predicted image for a current frame using a motion vector estimated with predetermined accuracy, performing motion compensation on a reference frame the FGS level and on the reference frame of the main level, using the motion vector with lower accuracy than the accuracy of the estimated motion vector, calculate the value between the reference frame of the FGS level with compensated movement and the reference frame of the main level, subtract the predicted image from the current frame and encode the result of the subtraction.

Согласно еще одному другому аспекту настоящего изобретения, обеспечивают способ видеокодирования, поддерживающий FGS, способ видеокодирования, содержащий этапы, на которых получают прогнозированное изображение для текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, выполняют компенсацию движения на опорном кадре уровня FGS и на опорном кадре основного уровня, используя вектор движения с более низкой точностью, чем точность оцененного вектора движения, и создают прогнозированный кадр для уровня FGS и прогнозированный кадр для основного уровня, соответственно, вычисляют разность между прогнозированным кадром для уровня FGS и прогнозированным кадром для основного уровня, вычитают прогнозированное изображение и вычисленную разность из текущего кадра и кодируют результат вычитания.According to yet another aspect of the present invention, there is provided a video encoding method supporting FGS, a video encoding method comprising the steps of obtaining a predicted image for the current frame using a motion vector estimated with predetermined accuracy, performing motion compensation on the reference frame of the FGS level and on the reference frame of the main level using a motion vector with lower accuracy than the accuracy of the estimated motion vector, and create a predicted frame for the FGS level and the predicted frame for the main level, respectively, the difference between the predicted frame for the FGS level and the predicted frame for the main level is calculated, the predicted image and the calculated difference are subtracted from the current frame, and the subtraction result is encoded.

Согласно еще одному другому аспекту настоящего изобретения, обеспечивается способ видеокодирования, поддерживающий FGS, способ видеокодирования, содержащий этапы, на которых получают прогнозированное изображение для текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, вычисляют разность между опорным кадром уровня FGS и опорным кадром основного уровня, выполняют компенсацию движения на разности, используя вектор движения с более низкой точностью, чем точность оцененного вектора движения, вычитают восстановленное изображение и результат компенсированного движения текущего кадра и кодируют результат вычитания.According to yet another aspect of the present invention, there is provided a video encoding method supporting FGS, a video encoding method comprising the steps of obtaining a predicted image for the current frame using a motion vector estimated with predetermined accuracy, calculating the difference between the reference frame of the FGS level and the reference frame ground level, perform motion compensation on the difference using a motion vector with lower accuracy than the accuracy of the estimated motion vector, subtract The tanned image and the result of the compensated movement of the current frame also encode the result of subtraction.

Согласно другому аспекту настоящего изобретения, обеспечивается способ видеодекодирования, поддерживающий FGS, способ видеодекодирования, содержащий этапы, на которых извлекают данные текстуры основного уровня и данные текстуры уровня FGS и векторы движения из входного потока битов, восстановление кадра основного уровня из данных текстуры основного уровня, выполнение компенсации движения на опорном кадре уровня FGS и на опорном кадре основного уровня, используя векторы движения, вычисляют разность между опорным кадром уровня FGS с компенсированным движением и опорным кадром основного уровня с компенсированным движением и складывают вместе кадр основного уровня, данные текстуры уровня FGS и разность.According to another aspect of the present invention, there is provided a video decoding method supporting FGS, a video decoding method comprising the steps of extracting texture layer data of the main layer and texture data of the FGS layer and motion vectors from the input bit stream, restoring the frame of the basic layer from the texture layer data of the main layer, motion compensation on the reference frame of the FGS level and on the reference frame of the main level, using the motion vectors, calculate the difference between the reference frame of the FGS level with compensation motion and the reference frame of the main level with compensated movement and add together the frame of the main level, texture data of the FGS level and the difference.

Согласно другому аспекту настоящего изобретения, обеспечивается видеокодер на основе FGS, содержащий элемент, получающий прогнозированное изображение для текущего кадра, используя вектор движения, оцененный с заранее определенной точностью, элемент, выполняющий квантование разности между текущим кадром и прогнозированным изображением, инверсное квантование квантованной разности и создание восстановленного изображения для текущего кадра, элемент, выполняющий компенсацию движения на опорном кадре уровня FGS и опорном кадре основного уровня, используя оцененный вектор движения, элемент, вычисляющий разность между опорным кадром уровня FGS с компенсированным движением и опорным кадром основного уровня с компенсированным движением, элемент, вычитающий восстановленное изображение и разность из текущего кадра, и элемент, кодирующий результат вычитания.According to another aspect of the present invention, there is provided an FGS-based video encoder comprising an element obtaining a predicted image for the current frame using a motion vector estimated with predetermined accuracy, an element that quantizes the difference between the current frame and the predicted image, inverts quantization of the quantized difference and creates of the reconstructed image for the current frame, an element that performs motion compensation on the reference frame of the FGS level and the reference frame of the main level using the estimated motion vector, an element calculating the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level with compensated movement, an element subtracting the reconstructed image and the difference from the current frame, and an element encoding the result of subtraction.

Согласно еще одному дополнительному аспекту настоящего изобретения, обеспечивается видеодекодер на основе FGS, видеодекодер, содержащий элемент, извлекающий данные текстуры основного уровня, данные текстуры уровня FGS и векторы движения из входного потока битов, элемент, восстанавливающий кадр основного уровня из данных текстуры основного уровня, элемент, выполняющий компенсацию движения на опорном кадре уровня FGS и на опорном кадре основного уровня, используя вектор движения, и создающий прогнозированный кадр уровня FGS и прогнозированный кадр основного уровня, элемент, вычисляющий разность между прогнозированным кадром уровня FGS и прогнозированным кадром основного уровня, и элемент, складывающий вместе данные текстуры, восстановленный кадр основного уровня и разность.According to another further aspect of the present invention, there is provided an FGS-based video decoder, a video decoder comprising an element extracting ground layer texture data, FGS layer texture data and motion vectors from an input bit stream, an element restoring a main layer frame from the main layer texture data, element performing motion compensation on the reference frame of the FGS level and on the reference frame of the main level using the motion vector, and creating a predicted frame of the FGS level and predicted th frame of the core layer, element, calculating the difference between the predicted FGS layer frame and the predicted frame of the core layer, and an element adding together the texture data, the reconstructed base layer frame and the difference.

Описание чертежейDescription of drawings

Вышеупомянутые и другие аспекты настоящего изобретения станут более ясными при подробном описании примеров вариантов его осуществления со ссылкой на прилагаемые чертежи, на которых:The above and other aspects of the present invention will become clearer with a detailed description of examples of variants of its implementation with reference to the accompanying drawings, in which:

Фиг.1 - схема для объяснения обычного способа FGS.1 is a diagram for explaining a conventional FGS method.

Фиг.2 - схема для объяснения обычного способа прогрессивной PFGS.2 is a diagram for explaining a conventional progressive PFGS method.

Фиг.3 - схема для объяснения быстрой прогрессивной точной масштабируемости по качеству (PFGS) в соответствии с примером варианта осуществления настоящего изобретения.FIG. 3 is a diagram for explaining fast progressive accurate quality scalability (PFGS) in accordance with an example embodiment of the present invention.

Фиг.4 - блок-схема видеокодера в соответствии с примером варианта осуществления настоящего изобретения.4 is a block diagram of a video encoder in accordance with an example embodiment of the present invention.

Фиг.5 - блок-схема видеокодера согласно другому примеру варианта осуществления настоящего изобретения.5 is a block diagram of a video encoder according to another example of an embodiment of the present invention.

Фиг.6 и 7 - блок-схемы видеокодеров в соответствии с дополнительным примером варианта осуществления настоящего изобретения.6 and 7 are block diagrams of video encoders in accordance with a further example embodiment of the present invention.

Фиг.8 - блок-схема видеокодера в соответствии с примером варианта осуществления настоящего изобретения.8 is a block diagram of a video encoder in accordance with an example embodiment of the present invention.

Фиг.9 - блок-схема видеодекодера согласно другому примеру варианта осуществления настоящего изобретения.9 is a block diagram of a video decoder according to another example of an embodiment of the present invention.

Фиг.10 и 11 - блок-схемы видеодекодеров в соответствии с дополнительным примером варианта осуществления настоящего изобретения.10 and 11 are block diagrams of video decoders in accordance with a further example embodiment of the present invention.

Фиг.12 - блок-схема системы для выполнения процесса кодирования или декодирования в соответствии с примером варианта осуществления настоящего изобретения.12 is a block diagram of a system for performing an encoding or decoding process in accordance with an example embodiment of the present invention.

Описание предпочтительных вариантов осуществления изобретенияDescription of preferred embodiments of the invention

Настоящее изобретение теперь будет описано более подробно со ссылкой на сопроводительные чертежи, на которых показаны примеры вариантов осуществления изобретения.The present invention will now be described in more detail with reference to the accompanying drawings, in which examples of embodiments of the invention are shown.

Преимущества и признаки настоящего изобретения и способы его осуществления могут стать более понятны при рассмотрении последующего подробного описания примеров вариантов осуществления и сопроводительных чертежей. Настоящее изобретение может, однако, быть реализовано во многих различных формах и не должно истолковываться как ограничивающееся изложенными здесь примерами вариантов осуществления. Скорее, эти примеры вариантов осуществления представлены в этом описании для того, чтобы это описание было всесторонним и законченным и полностью раскрывало концепцию изобретения для тех, кто являются специалистами в данной области техники, и настоящее изобретение будет определяться только прилагаемой формулой изобретения. Одни и те же номера относятся к одним и тем же элементам по всему описанию.The advantages and features of the present invention and methods for its implementation may become more apparent when considering the following detailed description of examples of embodiments and accompanying drawings. The present invention may, however, be embodied in many different forms and should not be construed as being limited to the exemplary embodiments set forth herein. Rather, these examples of embodiments are presented in this description so that this description is comprehensive and complete and fully discloses the concept of the invention to those who are specialists in this field of technology, and the present invention will be defined only by the attached claims. The same numbers refer to the same elements throughout the description.

На фиг.3 показана схема PFGS, соответствующая первому примеру варианта осуществления настоящего изобретения.FIG. 3 shows a PFGS diagram corresponding to a first example of an embodiment of the present invention.

Со ссылкой на фиг.3, как и на фиг.2, Δ на уровне FGS будет квантоваться в соответствии с алгоритмом PFGS и определяется простым уравнением (3):With reference to figure 3, as in figure 2, Δ at the FGS level will be quantized in accordance with the PFGS algorithm and is determined by simple equation (3):

Δ = R_F- R_B'Δ = R _F - R _B ' (3)(3)

R_Fопределяется приведенным выше уравнением (2), и R_B' определяется уравнением (4):R _{F is} determined by the above equation (2), and R _B 'is determined by the equation (4):

R_B' = O' - P_В = O' - (М_В' +N_B')/2R _B '= O' - P _B = O '- (M _B ' + N _B ') / 2 (4)(four)

где O' является изображением, восстановленным путем квантования первоначального изображения O с размером шага квантования основного уровня QP_В и последующего инверсного квантования квантованного изображения.where O 'is an image reconstructed by quantizing an original image O with the core layer quantization step size QP _B and subsequent inverse quantization of the quantized image.

Подстановка уравнений (2) и (4) в уравнение (3) дает уравнение (5):Substitution of equations (2) and (4) into equation (3) gives equation (5):

Δ = O - (М_F' + N_F')/2 - [O' - (М_В' + N_B')/2]Δ = O - (M _F '+ N _F ') / 2 - [O '- (M _B ' + N _B ') / 2] (5)(5)

Со ссылкой на фиг.3, Δ_М и Δ_N обозначают разность между левыми опорными кадрами М_F' и М_В' на основном уровне и на уровне FGS и разность между правыми опорными кадрами N_F' и N_B' на основном уровне и на уровне FGS, соответственно, и определяются уравнениями (6):With reference to FIG. 3, Δ _M and Δ _N denote the difference between the left reference frames M _F 'and M _B ' at the main level and at the FGS level and the difference between the right reference frames N _F 'and N _B ' at the main level and at level FGS, respectively, and are determined by equations (6):

Δ_М = М_F' - М_В'Δ _M = M _F '- M _B ' Δ_N = N_F' - N_B'Δ _N = N _F '- N _B ' (6)(6)

Путем подстановки уравнения (6) в уравнение (5), Δ может быть определена уравнением (7):By substituting equation (6) into equation (5), Δ can be determined by equation (7):

Δ = O - O' - (Δ_М + Δ_N)/2Δ = O - O '- (Δ _M + Δ _N ) / 2 (7)(7)

Как видно из уравнения (7), кодер может получить Δ путем вычитания восстановленного изображения O' основного уровня, полученного квантованием исходного изображения O с шагом квантования основного уровня размером QP_В и последующего инверсного квантования квантованного изображения и среднего значения разностей между каждым из опорных кадров основного уровня и опорных кадров уровня FGS и первоначальным изображением O, то есть (Δ_М+ Δ_N)/2. Декодер восстанавливает исходное изображение O путем сложения изображения О' основного уровня, Δ и среднего значения разностей между опорным кадром основного уровня и опорным кадром уровня FGS.As can be seen from equation (7), the encoder can obtain Δ by subtracting the reconstructed image of the main level O 'obtained by quantizing the original image O with a quantization step of the main level of size QP _B and then inverting the quantized image and the average value of the differences between each of the reference frames of the main level and reference frames of the FGS level and the original image O, that is (Δ _M + Δ _N ) / 2. The decoder restores the original image O by adding the image O 'of the main level, Δ and the average value of the differences between the reference frame of the main level and the reference frame of the FGS level.

В обычном алгоритме PFGS компенсация движения выполняется, используя вектор движения с точностью одного пиксела или части пиксела (1/2 пиксела или 1/4 пиксела), полученный при оценке движения. В последнее время, чтобы увеличивать эффективность сжатия, оценка движения и компенсация обычно выполняются с различными точностями, выраженными в пикселах, такими как точность полпиксела или точность четверть пиксела. В обычном алгоритме PFGS прогнозированное изображение, созданное путем компенсации движения с точностью, например, 1/4 пиксела, упаковывается в целочисленные пикселы. Затем квантование выполняется на разности между первоначальным изображением и прогнозированным изображением. В этом случае, упаковка является процессом восстановления 4-x кратного интерполированного опорного изображения в изображение первоначального размера, выполняя оценку движения с точностью 1/4 пиксела. Например, в течение упаковочного процесса может быть выбран один из каждых четырех пикселов.In the conventional PFGS algorithm, motion compensation is performed using a motion vector with an accuracy of one pixel or part of a pixel (1/2 pixel or 1/4 pixel) obtained from motion estimation. Recently, in order to increase compression efficiency, motion estimation and compensation are usually performed with various precision expressed in pixels, such as half pixel accuracy or quarter pixel accuracy. In the conventional PFGS algorithm, a predicted image created by motion compensation with an accuracy of, for example, 1/4 pixel is packed into integer pixels. Then, quantization is performed on the difference between the original image and the predicted image. In this case, packaging is the process of reconstructing a 4-x interpolated reference image into an image of its original size, performing motion estimation with an accuracy of 1/4 pixel. For example, during the packaging process, one out of every four pixels can be selected.

Однако, данные Δ на FGS уровне, которые должны квантоваться для быстрого PFGS согласно настоящему изобретению, как определено уравнением (7), которые приводят к малой эффективности сжатия, не должны подвергаться оценке движения с высокой пиксельной точностью. Оценка движения и компенсация применяются только к третьему члену (Δ_М + Δ_N)/2 в правой стороне уравнения (7). Однако, поскольку третий член представляется как разности между опорными кадрами на промежуточном уровне, выполнение оценки движения и компенсации с высокой пиксельной точностью не дает высокой эффективности. То есть, поскольку результирующее разностное изображение между изображением на основном уровне с компенсированным движением с заранее определенной пиксельной точностью и изображением на уровне расширения с компенсированным движением с пиксельной точностью нечувствительно к пиксельной точности, быстрый алгоритм PFGS позволяет производить оценку движения и компенсацию с более низкой пиксельной точностью, чем обычный PFGS.However, the Δ data at the FGS level, which must be quantized for fast PFGS according to the present invention, as defined by equation (7), which lead to low compression efficiency, should not be subject to motion estimation with high pixel accuracy. Motion estimation and compensation apply only to the third term (Δ _M + Δ _N ) / 2 on the right side of equation (7). However, since the third term is represented as the difference between the reference frames at the intermediate level, performing motion estimation and compensation with high pixel accuracy does not give high efficiency. That is, since the resulting difference image between the image at the basic level with compensated motion with a predetermined pixel accuracy and the image at the expansion level with compensated motion with pixel accuracy is insensitive to pixel accuracy, the fast PFGS algorithm allows motion estimation and compensation with lower pixel accuracy than regular PFGS.

Согласно второму примеру варианта осуществления, Δ в уравнении (5) в первом примере варианта осуществления может быть также представлена как разность между прогнозированными сигналами P_F и P_В, как показано в уравнении (8). P_F и P_В равны (М_F' + N_F')/2 и (М_В' + NAccording to the second example of the embodiment, Δ in equation (5) in the first example of the embodiment can also be represented as the difference between the predicted signals P _F and P _B , as shown in equation (8). P _F and P _B are equal (M _F '+ N _F ') / 2 and (M _B '+ N

Δ = О - О' - (P_F - P_В)Δ = O - O '- (P _F - P _B ) (8)(8)

Первый и второй примеры варианта осуществления отличаются друг от друга следующим образом. В первом примере варианта осуществления, разности Δ_Ми Δ_N между опорными изображениями уровня FGS и опорными изображениями основного уровня сначала вычисляются и затем делятся на 2. Во втором примере варианта осуществления разность P_F-P_В между прогнозированным изображением P_F уровня FGS и прогнозированным изображением P_В основного уровня вычисляется после вычисления прогнозированных изображений P_F и P_В на этих двух уровнях. То есть, хотя быстрые алгоритмы PFGS, соответствующие первому и второму примерам вариантов осуществления, реализуются разными способами, может быть получен один и тот же результат вычисления (Δ).The first and second examples of the embodiment differ from each other as follows. In the first example of the embodiment, the differences Δ _M and Δ _N between the reference images of the FGS level and the reference images of the main level are first calculated and then divided by 2. In the second example of the embodiment, the difference P _F -P _B between the predicted image P _{F of} the FGS level and the predicted image P _{In the} main level is calculated after calculating the predicted images P _F and P _In these two levels. That is, although fast PFGS algorithms corresponding to the first and second examples of embodiments are implemented in different ways, the same calculation result (Δ) can be obtained.

Как в первом, так и во втором примерах вариантов осуществления, сначала выполняется компенсация движения, а затем вычисляется разность между изображениями. В третьем примере варианта осуществления настоящего изобретения, сначала может вычисляться разность между опорными изображениями на различных уровнях, а затем выполняться компенсация движения. Таким образом, в соответствии с третьим примером варианта осуществления настоящего изобретения, поскольку компенсация движения выполняется для разности, заполнение на границах мало влияет на результирующее изображение. Таким образом, процесс заполнения на границах может быть пропущен. Заполнение на границах является процессом дублирования пиксел, расположенных в непосредственной близости от границ, учитывая, что во время оценки движения совпадение блоков на границе кадра ограничивается.In both the first and second examples of embodiments, motion compensation is first performed, and then the difference between the images is calculated. In a third example of an embodiment of the present invention, the difference between the reference images at various levels can be calculated first, and then motion compensation can be performed. Thus, in accordance with a third example of an embodiment of the present invention, since motion compensation is performed for the difference, filling at the borders has little effect on the resulting image. Thus, the filling process at the borders can be skipped. Filling at borders is a process of duplicating pixels located in close proximity to borders, given that during motion estimation, coincidence of blocks at the frame boundary is limited.

В третьем примере варианта осуществления настоящего изобретения, разность Δ может быть определена уравнением (9):In a third example of an embodiment of the present invention, the difference Δ can be determined by equation (9):

Δ = О - О' -[(mc(М_F' - М_В') + mc(N_F' - N_B'))/2Δ = O - O '- [(mc (M _F ' - M _B ') + mc (N _F ' - N _B ')) / 2 (9)(9)

где mc(.) означает функцию для выполнения компенсации движения.where mc (.) means a function to perform motion compensation.

В то время, как обычный PFGS используется для выполнения прямого прогнозирования (оценка движения и компенсация) для вычисления R_F или R_B, определяемых уравнением (3), быстрые алгоритмы PFGS, соответствующие примерам с первого по третий вариантов осуществления настоящего изобретения, используются для вычисления разности между прогнозированными изображениями или прогнозирования разности между опорными изображениями. Таким образом, характеристики быстрых PFGS настоящего изобретения лишь немного затрагиваются или являются нечувствительными к интерполяции, используемой для повышения пиксельной точности вектора движения.While conventional PFGS is used to perform direct prediction (motion estimation and compensation) to calculate R _F or R _B defined by equation (3), fast PFGS algorithms corresponding to Examples 1 through 3 of the present invention are used to calculate the difference between the predicted images or predicting the difference between the reference images. Thus, the fast PFGS characteristics of the present invention are only slightly affected or insensitive to the interpolation used to improve the pixel accuracy of the motion vector.

Таким образом, интерполяция с четвертью или половиной пиксела может быть пропущена. Кроме того, вместо полупиксельного интерполяционного фильтра, используемого в стандарте H.264 и требующего большого объема вычислений, может использоваться билинейный фильтр, требующий меньшего объема вычислений. Например, билинейный фильтр может применяться к третьим членам в правых частях уравнений (7)-(9). Это может уменьшить ухудшение характеристик по сравнению с тем, когда билинейный фильтр непосредственно применяется к прогнозированному сигналу для получения R_F и R_B как в обычном алгоритме PFGS.Thus, a quarter or half pixel interpolation can be skipped. In addition, instead of the half-pixel interpolation filter used in the H.264 standard and requiring a large amount of computation, a bilinear filter can be used that requires less computation. For example, a bilinear filter can be applied to the third terms in the right-hand sides of equations (7) - (9). This can reduce performance degradation compared to when a bilinear filter is directly applied to the predicted signal to obtain R _F and R _B as in the normal PFGS algorithm.

Принцип действия с первого по третий примеров вариантов осуществления настоящего изобретения основан на уравнении (3). Другими словами, реализация этих примеров вариантов осуществления начинается с допущения, что разность между разностью R_F уровня FGS и разностью R_Bосновного уровня должна быть кодированной. Однако, когда разность, полученная от уровня FGS, очень мала, то есть, когда временная корреляция очень близкая, вышеупомянутые быстрые алгоритмы PFGS, соответствующие первым трем примерам вариантов осуществления, могут значительно ухудшать характеристики кодирования. В этом случае, кодирование только разности, полученной от уровня FGS, то есть, R_F в уравнении (3), может предложить лучшие характеристики кодирования. То есть, согласно четвертому примеру варианта осуществления настоящего изобретения, уравнения (7)-(9) могут быть преобразованы в уравнения (10)-(12), соответственно:The principle of operation from first to third examples of embodiments of the present invention is based on equation (3). In other words, the implementation of these examples of embodiments begins with the assumption that the difference between the difference R _{F of} the FGS level and the difference R _{B of the} main level must be encoded. However, when the difference obtained from the FGS level is very small, that is, when the time correlation is very close, the aforementioned fast PFGS algorithms corresponding to the first three examples of embodiments can significantly degrade the encoding performance. In this case, encoding only the difference obtained from the FGS level, that is, R _F in equation (3), can offer better encoding characteristics. That is, according to a fourth example of an embodiment of the present invention, equations (7) to (9) can be converted to equations (10) to (12), respectively:

Δ = O - P_В - (Δ_М + Δ_N)/2Δ = O - P _B - (Δ _M + Δ _N ) / 2 (10)(10) Δ = О - P_B - (P_F-P_B)Δ = O - P _B - (P _F -P _B ) (11)(eleven) Δ = О - P_B - [(mc(М_F' - М_В') + mc(N_F' - N_B'))/2Δ = O - P _B - [(mc (M _F '- M _B ') + mc (N _F '- N _B ')) / 2 (12)(12)

В уравнениях (10)-(12) восстановленное изображение O' основного уровня заменяется прогнозированным изображением P_В для изображения основного уровня. Конечно, интерполяция может не применяться к третьим членам в правой части уравнений (10)-(12), или для интерполяции может использоваться билинейный фильтр, требующий меньшего объема вычислений.In equations (10) - (12), the reconstructed main-level image O 'is replaced by the predicted image P _B for the main-level image. Of course, interpolation may not apply to the third terms on the right-hand side of equations (10) - (12), or a bilinear filter that requires less computation may be used for interpolation.

Прогнозированное изображение P_В, встречающееся дважды в уравнении (11), не обязательно является одним и тем же. Оцененный вектор движения может использоваться во время компенсации движения для создания прогнозированного изображения P_В во втором члене. С другой стороны, для создания P_В и P_F в третьем члене во время компенсации движения могут использоваться вектор движения с точностью, более низкой, чем у оцененного вектора движения, или фильтр, требующий меньшего объема вычислений (например, билинейный фильтр).The predicted image P _B occurring twice in equation (11) is not necessarily the same. An estimated motion vector can be used during motion compensation to create a predicted image P _B in the second term. On the other hand, to create P _B and P _F in the third term during motion compensation, a motion vector can be used with an accuracy lower than that of the estimated motion vector, or a filter that requires less computation (for example, a bilinear filter).

Алгоритм PFGS, в котором текущий кадр восстанавливается, используя оба восстановленные левый и правый опорные кадры, страдает от ошибки, вызванной дрейфом, когда ухудшение качества изображения как в левом, так и в правом опорных кадрах кумулятивно отражается в текущем кадре. Ошибка дрейфа может быть уменьшена с помощью технологии прогнозирования с пропусканием, использующей прогнозированное изображение, созданное взвешенной суммой прогнозированного изображения, полученного от обоих опорных кадров, и прогнозированного изображения, полученного от основного уровня.The PFGS algorithm, in which the current frame is restored using both the restored left and right reference frames, suffers from an error caused by drift when image quality deterioration in both the left and right reference frames is cumulatively reflected in the current frame. The drift error can be reduced using transmission prediction technology using a predicted image created by a weighted sum of the predicted image obtained from both reference frames and the predicted image obtained from the main level.

Согласно технологии прогнозирования с пропусканием, используемой в обычном PFGS, кодируемое на уровне FGS, выражается уравнением (13):According to the transmission prediction technology used in conventional PFGS, encoded at the FGS level, is expressed by equation (13):

Δ = О - [αP_F + (l-α)P_B]Δ = O - [αP _F + (l-α) P _B ] (13)(13)

Уравнение (13) может быть преобразовано в уравнение (14) в соответствии с пятым примером варианта осуществления настоящего изобретения:Equation (13) can be converted to equation (14) in accordance with a fifth example of an embodiment of the present invention:

Δ = О - P_B- α(P_F - P_B)Δ = O - P _B - α (P _F - P _B ) (14)(fourteen)

Чтобы получить уравнение (14), весовой коэффициент α может применяться только к разности (P_F - P_B) между прогнозированными изображениями в уравнении (11). Таким образом, настоящее изобретение может также применяться к технологии прогнозирования с пропусканием. То есть, интерполяция может быть пропущена или интерполяция может быть применена к разности (P_F - P_B), используя билинейный фильтр, требующий меньший объем вычислений. В последнем случае, результат интерполяции умножается на весовой коэффициент α.To obtain equation (14), the weight coefficient α can only be applied to the difference (P _F - P _B ) between the predicted images in equation (11). Thus, the present invention can also be applied to transmission prediction technology. That is, interpolation can be skipped or interpolation can be applied to the difference (P _F - P _B ) using a bilinear filter that requires less computation. In the latter case, the result of the interpolation is multiplied by the weight coefficient α.

На фиг.4 показана блок-схема видеокодера 100, соответствующая первому примеру варианта осуществления настоящего изобретения.4 is a block diagram of a video encoder 100 corresponding to a first example of an embodiment of the present invention.

Хотя изобретение описывается в отношении каждого блока как основного модуля оценки движения со ссылкой на фиг.1-3, быстрый алгоритм PFGS далее будет описан в отношении каждого кадра, содержащего блок. Для последовательности изложения, идентификатор блока обозначается для "F" нижним индексом, указывающим кадр. Например, кадр, содержащий блок с маркировкой R_B, обозначается как F_RB. И конечно, верхний штрих (') используется, чтобы обозначить восстановленные данные, полученные после квантования/инверсного квантования.Although the invention is described with respect to each block as a basic motion estimation module with reference to FIGS. 1-3, a quick PFGS algorithm will now be described with respect to each frame containing the block. For the sequence of presentation, the block identifier is indicated for “F” by a subscript indicating the frame. For example, a frame containing a block labeled R _B is denoted as F _RB . And of course, the top stroke (') is used to indicate the recovered data obtained after quantization / inverse quantization.

Текущий кадр F_о подается в устройство 105 оценки движения, вычитающее устройство 115 и устройство 170 вычисления разности.The current frame F _o is supplied to the motion estimation device 105, the subtracting device 115 and the difference calculating device 170.

Устройство 105 оценки движения выполняет оценку движения для текущего кадра F_о, используя соседние кадры, чтобы получить векторы движения MV. Соседние кадры, которые упоминаются во время оценки движения, в дальнейшем здесь называются "опорные кадры". Алгоритм согласования блоков (BMA) обычно используется для оценки движения заданного блока. В BMA заданный блок перемещается в пределах зоны поиска в опорном кадре с точностью до пиксела или доли пиксела и смещение с минимальной ошибкой определяется как вектор движения. Хотя для оценки движения используется движущийся блок фиксированного размера, оценка движения может делаться, используя технологию иерархического согласования блоков переменного размера (HVSBM).The motion estimation device 105 performs a motion estimation for the current frame F _o using neighboring frames to obtain motion vectors MV. The adjacent frames that are mentioned during motion estimation are hereinafter referred to as “reference frames”. A block matching algorithm (BMA) is typically used to evaluate the motion of a given block. In BMA, a given block moves within the search zone in a reference frame accurate to a pixel or a fraction of a pixel, and the offset with a minimum error is defined as a motion vector. Although a motion block of a fixed size is used to estimate the motion, motion estimation can be done using the hierarchical matching of variable size blocks (HVSBM).

Когда оценка движения выполняется с точностью до долей пиксела, опорные кадры должны быть укрупнены или интерполированы до заранее определенной разрешающей способности. Например, когда оценка движения выполняется с точностями 1/2 и 1/4 пиксела, опорные кадры должны быть видоизменены или интерполированы с коэффициентом два и четыре, соответственно.When motion estimation is performed to within a fraction of a pixel, reference frames should be enlarged or interpolated to a predetermined resolution. For example, when motion estimation is performed with 1/2 and 1/4 pixel precision, the reference frames should be modified or interpolated with a factor of two and four, respectively.

Когда кодер 100 имеет структуру кодека с разомкнутым контуром, кадры F_M и F_N, граничащие с оригиналом, используются как опорные кадры. Когда кодер 100 имеет структуру кодека с замкнутым контуром, восстановленные соседние кадры F_MB' и F_NB' на основном уровне используются как опорные кадры. Хотя здесь принимается, что кодер 100 имеет структуру кодека с замкнутым контуром, кодер 100 может иметь структуру кодека с разомкнутым контуром.When the encoder 100 has an open-loop codec structure, frames F _M and F _N adjacent to the original are used as reference frames. When the encoder 100 has a closed-loop codec structure, the reconstructed adjacent frames F _MB 'and F _NB ' at the main level are used as reference frames. Although it is assumed here that the encoder 100 has a closed-loop codec structure, the encoder 100 may have an open-loop codec structure.

Векторы движения MV, вычисленные устройством 105 оценки движения, передаются в компенсатор 110 движения. Компенсатор 110 движения выполняет компенсацию движения на опорных кадрах F_MB' и F_NB', используя векторы движения MV, и создает прогнозированный кадр F_PB для текущего кадра. Когда используется двунаправленное прогнозирование, прогнозированное изображение может быть рассчитано как среднее значение опорных кадров с компенсированным движением. Когда используется однонаправленное прогнозирование, прогнозированное изображение может быть тем же самым, что и опорный кадр с компенсированным движением. Хотя здесь далее предполагается, что оценка движения и компенсация используют двунаправленные опорные кадры, специалистам в данной области техники должно быть очевидно, что настоящее изобретение может использовать однонаправленный опорный кадр.The motion vectors MV calculated by the motion estimation device 105 are transmitted to the motion compensator 110. The motion compensator 110 performs motion compensation on the reference frames F _MB 'and F _NB ' using the motion vectors MV, and creates a predicted frame F _PB for the current frame. When bidirectional prediction is used, the predicted image can be calculated as the average value of the reference frames with compensated movement. When using unidirectional prediction, the predicted image may be the same as the reference frame with compensated movement. Although it is further assumed here that motion estimation and compensation use bidirectional reference frames, it should be apparent to those skilled in the art that the present invention can use a unidirectional reference frame.

Вычитающее устройство 115 вычисляет разность F_RB между прогнозированным изображением и текущим изображением для передачи на преобразователь 120.The subtractor 115 calculates the difference F _RB between the predicted image and the current image for transmission to the converter 120.

Преобразователь 120 выполняет пространственное преобразование разности F_RB, чтобы создать коэффициент преобразования F_RB ^T. Способ пространственного преобразования может содержать дискретное косинусное преобразование (DCT), или импульсное преобразование. Конкретно, коэффициенты DCT могут быть созданы в случае, когда используется DCT, и импульсные коэффициенты могут быть созданы в случае, когда разрешено импульсное преобразование.Transducer 120 performs a spatial transform of the difference F _RB to create a transform coefficient F _RB ^T. The spatial transform method may comprise a discrete cosine transform (DCT), or a pulse transform. Specifically, DCT coefficients can be created when DCT is used, and pulse coefficients can be created when pulsed conversion is enabled.

Устройство 125 квантования применяет квантование к коэффициенту преобразования F_RB ^T. Квантование означает процесс выражения коэффициентов преобразования, сформированных среди произвольных реальных значений с помощью дискретных значений, и согласование дискретных значений с индексами, соответствующими заранее определенной таблице квантования. Например, устройство 125 квантования может разделить реально оцененный коэффициент преобразования на шаг квантования с заранее определенным размером и округлить результирующее значение до ближайшего целого числа. В целом, размер шага квантования основного уровня больше, чем размер шага квантования уровня FGS.The quantizer 125 applies quantization to the transform coefficient F _RB ^T. Quantization means the process of expressing transform coefficients generated among arbitrary real values using discrete values, and matching discrete values with indices corresponding to a predetermined quantization table. For example, quantization device 125 may divide the actually estimated transform coefficient into a quantization step with a predetermined size and round the resulting value to the nearest integer. In general, the quantization step size of the base layer is larger than the quantization step size of the FGS level.

Результат квантования, то есть, коэффициент квантования F_RB ^Q, полученный устройством 125 квантования, подается на блок 150 статистического кодирования и устройство 130 инверсного квантования.The quantization result, that is, the quantization coefficient F _RB ^Q obtained by the quantization device 125, is supplied to the statistical encoding unit 150 and the inverse quantization device 130.

Устройство 130 инверсного квантования осуществляет квантование коэффициента квантования F_RB ^Q. Инверсное квантование означает процесс инверсного квантования для восстановления значений, совпадающих с индексами, созданными во время квантования, используя тот же самый шаг квантования, который использовался при квантовании.The inverse quantization device 130 quantizes the quantization coefficient F _RB ^Q. Inverse quantization means the inverse quantization process to restore values that match the indices created during quantization using the same quantization step that was used in the quantization.

Инверсный преобразователь 135 принимает результат инверсного квантования и выполняет инверсное преобразование принятого результата. Инверсное пространственное преобразование может быть, например, инверсным DCT или инверсным импульсным преобразованием, выполненным в порядке, обратном тому, в котором преобразование было выполнено преобразователем 120. Сумматор 140 складывает инверсно преобразованный результат с прогнозированным изображением F_PB, полученным от компенсатора 110 движения, чтобы создать восстановленное изображение.The inverse converter 135 receives the inverse quantization result and performs an inverse transformation of the received result. The inverse spatial transform may be, for example, an inverse DCT or an inverse pulse transform performed in the reverse order to that in which the transform was performed by the transducer 120. The adder 140 adds the inverted transformed result to the predicted image F _PB obtained from the motion compensator 110 to create restored image.

В буфере 145 хранится результат сложения, принятый от сумматора 140. В буфере 145 хранится восстановленное изображение F_о' для текущего кадра, а также предварительно восстановленные опорные кадры F_MB' и F_NB' основного уровня.Buffer 145 stores the addition result received from adder 140. Buffer 145 stores the reconstructed image F _o ′ for the current frame, as well as the previously reconstructed reference frames F _MB ′ and F _NB ′ of the main layer.

Преобразователь 155 вектора движения изменяет точность принятого вектора движения MV. Например, вектор движения MV с точностью 1/4 пиксела может иметь значение 0, 0,25, 0,5 или 0,75. Как описано выше, в соответствии с примерами вариантов осуществления настоящего изобретения, существует небольшое различие в характеристиках кодирования, когда компенсация движения на уровне FGS выполняется для вектора движения MV с более низкой пиксельной точностью, чем на основном уровне. Таким образом, преобразователь 155 вектора движения преобразует вектор движения MV с точностью 1/4 пиксела в вектор движения MV₁ с пиксельной точностью, более низкой, чем точность в 1/4 пиксела, такой как 1/2 пиксела или 1 пиксел. Такая процедура преобразования может быть выполнена простым отбрасыванием или округлением десятичной части пиксельной точности для первоначального вектора движения.The motion vector transducer 155 changes the accuracy of the received motion vector MV. For example, the motion vector MV with an accuracy of 1/4 pixel may have a value of 0, 0.25, 0.5 or 0.75. As described above, in accordance with examples of embodiments of the present invention, there is a slight difference in coding characteristics when motion compensation at the FGS level is performed for the motion vector MV with lower pixel accuracy than at the main level. Thus, the motion vector converter 155 converts the motion vector MV with an accuracy of 1/4 pixel to the motion vector MV ₁ with a pixel accuracy lower than the accuracy of 1/4 pixel, such as 1/2 pixel or 1 pixel. Such a conversion procedure can be performed by simply dropping or rounding the decimal part of the pixel accuracy for the initial motion vector.

Буфер 165 временно хранит эталонные кадры F_MF'_,F_NF' уровня FGS. Хотя на чертеже это не показано, восстановленные кадры F_MF' и F_NF' уровня FGS или первоначальный кадр, соседний с текущим кадром, могут использоваться в качестве опорных кадров уровня FGS.Buffer 165 temporarily stores reference frames F _MF ' _, F _NF ' of the FGS level. Although not shown in the drawing, the reconstructed FGS frames F _MF 'and F _NF ' or the original frame adjacent to the current frame can be used as reference frames of the FGS level.

Компенсатор 160 движения использует преобразованный вектор движения MV₁ для выполнения компенсации движения на восстановленных опорных кадрах F_MB' и F_NB' основного уровня, принятых от буфера 145, и восстановленных опорных кадрах F_MF' и F_NF' уровня FGS, принятых от буфера 165, и обеспечивает передачу кадров с компенсированным движением mc(F_MB'), mc(F_NB'), mc(F_MF') и mc(F_NF') в устройство 170 вычисления разности. F_MF' и F_NF' означают опорные предыдущий и последующий кадры на уровне FGS, соответственно. F_MB' и F_NB' означают опорные предыдущий и последующий кадры на основном уровне, соответственно.Motion compensator 160 uses the converted motion vector MV ₁ to perform motion compensation on the reconstructed reference layer frames F _MB ′ and F _NB ′ received from the buffer 145 and the restored reference frames F _MF ′ and F _NF ′ of the FGS level received from the buffer 165 , and provides the motion-compensated frame transmission mc (F _MB '), mc (F _NB '), mc (F _MF ') and mc (F _NF ') to the difference calculator 170. F _MF 'and F _NF ' mean reference previous and subsequent frames at the FGS level, respectively. F _MB 'and F _NB ' mean reference previous and subsequent frames at the main level, respectively.

Когда для компенсации движения требуется интерполяция, компенсатор 160 движения может использовать фильтр с другим типом интерполяции, чем фильтр, используемый для устройства 105 оценки движения или компенсатора 110 движения. Когда, например, используется вектор движения MV₁ с точностью 1/2 пиксела, билинейный фильтр, требующий малого объема вычислений, может использоваться для интерполяции вместо шестиполюсного фильтра, используемого в стандарте H.264. Поскольку разность между кадром основного уровня с компенсированным движением и кадром уровня FGS с компенсированным движением вычисляется после интерполяции, процесс интерполяции мало влияет на эффективность сжатия.When interpolation is required for motion compensation, motion compensator 160 may use a filter with a different type of interpolation than the filter used for motion estimation device 105 or motion compensator 110. When, for example, a motion vector MV ₁ with an accuracy of 1/2 pixel is used, a bilinear filter that requires little computation can be used for interpolation instead of the six-pole filter used in the H.264 standard. Since the difference between the frame of the main level with compensated motion and the frame of the FGS level with compensated motion is calculated after interpolation, the interpolation process has little effect on the compression efficiency.

Устройство 170 вычисления разности вычисляет разность между опорным кадром mc(F_MF'), mc(F_NF') уровня FGS с компенсированным движением и опорным кадром mc(F_MB'), mc(F_NB') основного уровня с компенсированным движением. То есть, устройство 170 вычисления разности вычисляет Δ_М= mc(F_MF') - mc(F_MB') и Δ_N= mc(F_NF') - mc(F_NB'). Конечно, когда используется однонаправленный опорный кадр, может быть вычислена только одна разность.The difference calculator 170 calculates the difference between the reference frame mc (F _MF '), mc (F _NF ') of the compensated motion level FGS and the reference frame mc (F _MB '), mc (F _NB ') of the main level with compensated motion. That is, the difference calculator 170 calculates Δ _M = mc (F _MF ') - mc (F _MB ') and Δ _N = mc (F _NF ') - mc (F _NB '). Of course, when a unidirectional reference frame is used, only one difference can be calculated.

Затем, устройство 170 вычисления разности вычисляет среднее значение разностей Δ_Ми Δ_Nи вычитает восстановленное изображение F_о' и среднее значение разностей Δ_Ми Δ для текущего кадра F_о. Когда используется однонаправленный опорный кадр, процесс вычисления среднего значения не требуется.Then, the difference calculator 170 calculates the average of the differences Δ _M and Δ _N and subtracts the reconstructed image F _o 'and the average of the differences Δ _M and Δ for the current frame F _about . When a unidirectional reference frame is used, an average value calculation process is not required.

Результат вычитания F_Δ, полученный устройством 170 вычисления разности, подвергается пространственному преобразованию преобразователем 175 и затем квантуется устройством 180 квантования. Квантованный результат F_Δ ^Q передается на блок 150 статистического кодирования. Размер шага квантования, используемый устройством 180 квантования, обычно меньше того, который используется в устройстве 125 квантования.The subtraction result F _Δ obtained by the difference calculating device 170 is subjected to spatial transformation by the transducer 175 and then quantized by the quantization device 180. The quantized result F _Δ ^Q is transmitted to the statistical coding unit 150. The quantization step size used by the quantizer 180 is typically smaller than that used by the quantizer 125.

Модуль 150 статистического кодирования без потерь кодирует вектор движения MV, оцененный устройством 105 оценки движения, коэффициент квантования F_RB ^Q, принятый от устройства 125 квантования, и квантованный результат F_Δ ^Q, принятый от устройства 180 квантования, в поток битов. Существует множество способов кодирования без потерь, в том числе, арифметическое кодирование, кодирование с переменной длиной и т.п.The lossless statistical encoding unit 150 encodes the motion vector MV estimated by the motion estimation device 105, the quantization coefficient F _RB ^Q received from the quantizer 125, and the quantized result F _Δ ^Q received from the quantizer 180 into a bit stream. There are many lossless coding methods, including arithmetic coding, variable length coding, etc.

Альтернативно, хотя это не показано на чертеже, видеокодер, соответствующий второму примеру варианта осуществления настоящего изобретения, может иметь ту же самую конфигурацию и работать как видеокодер 100, показанный на фиг. 4, за исключением устройства вычисления разности.Alternatively, although not shown, the video encoder corresponding to the second example of the embodiment of the present invention may have the same configuration and operate as the video encoder 100 shown in FIG. 4, with the exception of the difference calculator.

То есть, устройство вычисления разности в соответствии со вторым примером варианта осуществления настоящего изобретения создает прогнозированный кадр для каждого уровня перед вычислением разности между кадрами на различных уровнях. Другими словами, устройство вычисления разности создает прогнозированный кадр уровня FGS и прогнозированный кадр основного уровня, используя опорный кадр уровня FGS с компенсированным движением и опорный кадр основного уровня с компенсированным движением. Прогнозированный кадр может быть рассчитан простым усреднением двух опорных кадров с компенсированным движением. Конечно, когда используется однонаправленное прогнозирование, кадр с компенсированным движением может быть самим прогнозированным кадром.That is, a difference calculating device according to a second example of an embodiment of the present invention creates a predicted frame for each level before calculating the difference between frames at different levels. In other words, the difference calculating device creates a predicted frame of the FGS layer and a predicted frame of the main level using the reference frame of the FGS level with compensated motion and the reference frame of the main level with compensated motion. The predicted frame can be calculated by simply averaging two reference frames with compensated movement. Of course, when unidirectional prediction is used, the frame with compensated motion may be the predicted frame itself.

Устройство вычисления разности затем вычисляет разность между прогнозированными кадрами и вычитает восстановленное изображение и вычисленную разность из текущего кадра.The difference calculator then calculates the difference between the predicted frames and subtracts the reconstructed image and the calculated difference from the current frame.

На фиг.5 показана блок-схема видеокодера 300, соответствующего третьему примеру варианта осуществления настоящего изобретения. Со ссылкой на фиг.5, в то время как в первом и втором примерах вариантов осуществления разность между опорным изображением основного уровня и опорным изображением уровня FGS вычисляется после выполнения компенсации движения, показанный видеокодер 300 выполняет компенсацию движения после вычисления разности между опорными кадрами на этих двух уровнях. Во избежание повторного объяснения, приведенное далее описание сосредоточит внимание на отличительных признаках между первым и вторым примерами вариантов осуществления.5 is a block diagram of a video encoder 300 according to a third example of an embodiment of the present invention. Referring to FIG. 5, while in the first and second examples of embodiments, the difference between the reference image of the main layer and the reference image of the FGS level is calculated after performing motion compensation, the video encoder 300 shown performs motion compensation after calculating the difference between the reference frames on these two levels. In order to avoid re-explanation, the following description will focus on the distinguishing features between the first and second examples of embodiments.

Вычитающее устройство 390 вычитает восстановленные опорные кадры F_MB' и F_NB' основного уровня, которые приняты от буфера 345, из опорных кадров F_MF' и F_NF' уровня FGS, которые приняты от буфера 365, и обеспечивает передачу результатов вычитания F_MF' - F_MB' и F_NF' - F_NB' на компенсатор движения 360. Когда используется однонаправленный опорный кадр, существует только одна разность.Subtractor 390 subtracts the reconstructed basic layer reference frames F _MB 'and F _NB ', which are received from buffer 345, from reference frames F _MF 'and F _NF ' of FGS level, which are received from buffer 365, and provides transmission of subtraction results F _MF ' - F _MB 'and F _NF ' - F _NB 'to the motion compensator 360. When a unidirectional reference frame is used, there is only one difference.

Компенсатор 360 движения использует измененный вектор движения MV₁, принятый от преобразователя 355 вектора движения, для выполнения компенсации движения на разностях F_MF' - F_MB' и F_NF' - F_NB' между опорным кадром уровня FGS и опорным кадром основного уровня, принятыми от вычитающего устройства 390. Когда во время компенсации движения используется вектор движения MV₁ с точностью 1/2 пиксела, для интерполяции может использоваться билинейный фильтр, требующий небольшого объема вычислений, вместо шестиполюсного фильтра, используемого в стандарте H.264. Как описано выше, интерполяция мало влияет на эффективность сжатия.The motion compensator 360 uses the modified motion vector MV ₁ received from the motion vector converter 355 to perform motion compensation on the differences F _MF '- F _MB ' and F _NF '- F _NB ' between the reference frame of the FGS level and the reference frame of the main level received from subtractor 390. When a motion vector MV ₁ with 1/2 pixel accuracy is used during motion compensation, a bilinear filter that requires a small amount of computation can be used for interpolation instead of the six-pole filter used in the H.264 standard. As described above, interpolation has little effect on compression efficiency.

Устройство 370 вычисления разности вычисляет среднее значение между разностями (F_MF'-F_MB') и (F_NF'-F_NB') компенсированного движения и вычитает восстановленное изображение F_о' и среднее значение из текущего кадра F_о. Когда используется однонаправленный опорный кадр, процесс усреднения не требуется.The difference calculator 370 calculates an average value between the differences (F _MF '-F _MB ') and (F _NF '-F _NB ') of the compensated movement and subtracts the reconstructed image F _o 'and the average value from the current frame F _o . When a unidirectional reference frame is used, an averaging process is not required.

На фиг.6 и 7 показаны блок-схемы примеров видеокодеров 400 и 600, соответствующих четвертому примеру варианта осуществления настоящего изобретения. Обращаясь сначала к фиг.6, в отличие от первого примера варианта осуществления, показанного на фиг.4, устройство 470 вычисления разности в видеокодере 400 примера варианта осуществления вычитает прогнозированный кадр F_PB основного уровня вместо восстановленного кадра F_о' основного уровня из текущего кадра F_о.6 and 7 are block diagrams of examples of video encoders 400 and 600 corresponding to a fourth example of an embodiment of the present invention. Referring first to FIG. 6, in contrast to the first example of the embodiment shown in FIG. 4, the difference calculator 470 in the video encoder 400 of the example embodiment subtracts the predicted main layer frame F _PB instead of the reconstructed main level frame F _o ′ from the current frame F _about .

Видеокодеры 400 и 600 четвертого примера варианта осуществления, показанные на фиг.6 и 7, соответствуют фиг.4 и 5, показывающим видеокодеры 100 и 300 первого и третьего примеров вариантов осуществления. Обращаясь сначала к фиг.6, устройство 470 вычисления разности вычитает прогнозированное изображение F_PB основного уровня, принятое от компенсатора 410 движения, вместо восстановленного изображения F_о' основного уровня, из текущего кадра. Таким образом, устройство 470 вычисления разности вычитает прогнозированное изображение F_PB и среднее значение разностей Δ_Ми Δ_Nиз текущего кадра F_о, чтобы получить результат вычитания F_Δ.The video encoders 400 and 600 of the fourth example embodiment shown in FIGS. 6 and 7 correspond to FIGS. 4 and 5 showing the video encoders 100 and 300 of the first and third example embodiments. Turning first to FIG. 6, the difference calculator 470 subtracts the prediction image F _{PB of the} main layer received from the motion compensator 410 instead of the reconstructed image F _o ′ of the main level from the current frame. Thus, the difference calculator 470 subtracts the predicted image F _PB and the average value of the differences Δ _M and Δ _N from the current frame F _o to obtain a subtraction result F _Δ .

Точно так же, со ссылкой на фиг.7, устройство 670 вычисления разности вычитает прогнозированное изображение F_PB и среднее значение разностей компенсированного движения mc(F_MF' - F_MB') и mc(F_NF' - F_NB') из текущего кадра F_о, чтобы получить результат вычитания F_Δ.Similarly, with reference to FIG. 7, the difference calculator 670 subtracts the predicted image F _PB and the average value of the differences of the compensated movement mc (F _MF '- F _MB ') and mc (F _NF '- F _NB ') from the current frame F _about to get the result of subtraction F _Δ .

Пример видеокодера, соответствующего четвертому примеру варианта осуществления, совпадающего со вторым примером варианта осуществления (не показан), может иметь ту же самую конфигурацию и выполнять ту же самую операцию, как показано на фиг.6, за исключением операции, выполняемой устройством 470 вычисления разности. В видеокодере четвертого примера варианта осуществления, соответствующего второму примеру варианта осуществления, устройство 470 вычисления разности создает прогнозированный кадр F_PB уровня FGS и прогнозированный кадр F_BF основного уровня, используя опорный кадр mc(F_MF'), mc(F_NF') с компенсированным движением уровня FGS и опорный кадр mc(F_MB'), mc(F_NB') с компенсированным движением основного уровня, соответственно. Устройство 470 вычисления разности также вычисляет разность F_PF - F_PB между прогнозированными кадрами F_PF и F_PB и вычитает восстановленное изображение F_о' и разность F_PF - F_PB из текущего кадра F, чтобы получить результат вычитания F_Δ.An example of a video encoder corresponding to a fourth example of an embodiment matching the second example of an embodiment (not shown) may have the same configuration and perform the same operation as shown in FIG. 6, except for the operation performed by the difference calculator 470. In the video encoder of the fourth exemplary embodiment corresponding to the second exemplary embodiment, the difference calculator 470 creates the predicted FGS level frame F _PB and the main level predicted frame F _BF using the reference frame mc (F _MF ′), mc (F _NF ′) with compensated the movement of the FGS level and the reference frame mc (F _MB '), mc (F _NB ') with compensated movement of the main level, respectively. The difference calculator 470 also calculates the difference F _PF - F _PB between the predicted frames F _PF and F _PB and subtracts the reconstructed image F _o 'and the difference F _PF - F _PB from the current frame F to obtain a subtraction result F _Δ .

Если применяется прогнозирование с пропусканием, устройство вычисления разности 470 умножает весовой коэффициент α на разность F_PF - F_PB и вычитает восстановленное изображение F_о' и произведение (α × (F_PF - F_PB) из текущего кадра F_о, чтобы получить результат вычитания F_Δ.If transmission prediction is used, the difference calculator 470 multiplies the weight coefficient α by the difference F _PF - F _PB and subtracts the reconstructed image F _о 'and the product (α × (F _PF - F _PB ) from the current frame F _o to obtain the subtraction result F _Δ .

На фиг.8 показана блок-схема видеодекодера 700, соответствующая первому примеру варианта осуществления настоящего изобретения. Со ссылкой на фиг.8, блок 701 статистического декодирования без потерь декодирует входящий поток битов, чтобы извлечь данные текстуры F_PB ^Q основного уровня, данные текстуры F_Δ ^Q уровня FGS и векторы движения MV. Декодирование без потерь является инверсным процессом кодирования без потерь.FIG. 8 shows a block diagram of a video decoder 700 according to a first example of an embodiment of the present invention. With reference to FIG. 8, a lossless statistical decoding unit 701 decodes an input bitstream to extract main layer texture data F _PB ^Q , FGS layer texture data F _Δ ^Q , and motion vectors MV. Lossless decoding is an inverse lossless coding process.

Данные текстуры F_PB ^Q основного уровня и данные текстуры F_Δ ^Q уровня FGS подаются на устройства 705 и 745 инверсного квантования, соответственно, и векторы движения MV подаются на компенсатор 720 движения и преобразователь 730 вектора движения.The ground level texture data F _PB ^Q and the FGS level texture data F _Δ ^Q are supplied to inverse quantization devices 705 and 745, respectively, and the motion vectors MV are supplied to the motion compensator 720 and the motion vector converter 730.

Устройство 705 инверсного квантования применяет инверсное квантование к данным текстуры F_PB ^Qосновного уровня, принятым от блока 701 статистического декодирования. Инверсное квантование выполняется в порядке, обратном порядку квантования, выполненному преобразователем, чтобы восстановить значения, совпадающие с индексами, созданными во время квантования в соответствии с заранее определенным шагом квантования, используемым при квантовании.The inverse quantization device 705 applies inverse quantization to the main layer texture data F _PB ^Q received from the statistical decoding unit 701. Inverse quantization is performed in the reverse order of quantization performed by the transducer to restore values that match the indices created during quantization in accordance with the predetermined quantization step used in the quantization.

Инверсный преобразователь 710 выполняет инверсное преобразование инверсного квантованного результата. Инверсное преобразование выполняется в порядке, обратном порядку преобразования, выполненному преобразователем. Конкретно, может использоваться инверсное DCT-преобразование или инверсное импульсное преобразование.Inverse transformer 710 performs an inverse transform of the inverse quantized result. The inverse transform is performed in the reverse order of the transform performed by the converter. Specifically, an inverse DCT transform or an inverse pulse transform can be used.

Восстановленная разность F_RB' подается на сумматор 715.The reconstructed difference F _RB 'is supplied to the adder 715.

Компенсатор 720 движения выполняет компенсацию движения на предварительно восстановленных опорных кадрах F_MB' и F_NB' основного уровня, хранящихся в буфере 725, используя извлеченные векторы движения MV для создания прогнозированного изображения F_PB, которое затем посылается на сумматор 715.The motion compensator 720 performs motion compensation on the previously reconstructed reference frames F _MB ′ and F _NB ′ stored in the buffer 725 using the extracted motion vectors MV to create a predicted image F _PB , which is then sent to the adder 715.

Когда используется двунаправленное прогнозирование, прогнозированное изображение F_PB вычисляется путем усреднения опорных кадров с компенсированным движением. При использовании однонаправленного прогнозирования, прогнозированное изображение F_PB получается как опорный кадр с компенсированным движением.When bidirectional prediction is used, the predicted image F _{PB is} calculated by averaging reference frames with compensated motion. When using unidirectional prediction, the predicted image F _{PB is} obtained as a reference frame with compensated movement.

Сумматор 715 складывает входные кадры F_RB и F_PB, чтобы получить на выходе восстановленное изображение F_о', которое затем сохраняется в буфере 725.An adder 715 adds the input frames F _RB and F _PB to output the reconstructed image F _o ′, which is then stored in the buffer 725.

Инверсное устройство 745 квантования применяет инверсное квантование к данным текстуры F_Δ ^Q уровня FGS, а инверсный преобразователь 750 выполняет инверсное преобразование инверсно квантованного результата F_Δ ^Т, чтобы получить восстановленный кадр F_Δ(F_Δ'), который затем подается на устройство 755 восстановления кадров.The inverse quantizer 745 applies inverse quantization to FGS level texture data F _Δ ^Q , and the inverse transformer 750 inverts the inverse quantized result F _Δ ^T to obtain a reconstructed frame F _Δ (F _Δ '), which is then supplied to the frame recovery device 755 .

Преобразователь 730 вектора движения понижает точность извлеченного вектора движения MV. Например, вектор движения MV с точностью 1/4 пиксела может иметь точность 0, 0,25, 0,5 или 0,75 пиксела. Преобразователь 730 вектора движения изменяет вектор движения MV с точностью 1/4 пиксела в вектор движения MV₁ с точностью, меньшей, чем 1/4 пиксела, такой как 1/2 пиксела или 1 пиксел.Motion vector converter 730 lowers the accuracy of the extracted MV motion vector. For example, a motion vector MV with an accuracy of 1/4 pixel may have an accuracy of 0, 0.25, 0.5, or 0.75 pixels. The motion vector converter 730 changes the motion vector MV with an accuracy of 1/4 pixel to the motion vector MV ₁ with an accuracy of less than 1/4 pixel, such as 1/2 pixel or 1 pixel.

Компенсатор 735 движения использует измененный вектор движения MV₁, чтобы выполнить компенсацию движения на восстановленных опорных кадрах F_MB' и F_NBосновного уровня, принятых от буфера 725, и на восстановленных опорных кадрах F_MF' и F_NF' уровня FGS, принятых от буфера 740, и подает кадр mc(F_MB'), mc(F_NB') основного уровня с компенсированным движением и кадр mc(F_MF'), mc(F_NF') уровня FGS с компенсированным движением на устройство 755 восстановления кадров.The motion compensator 735 uses the modified motion vector MV ₁ to perform motion compensation on the reconstructed basic layer frames F _MB ′ and F _NB received from the buffer 725 and on the reconstructed reference frames F _MF ′ and F _NF ′ of the FGS level received from the buffer 740, and feeds the compensated motion frame mc (F _MB '), mc (F _NB ') and the motion compensated frame mc (F _MF '), mc (F _NF ') to the frame recovery device 755.

Когда вектор движения MV с точностью, например, 1/2 пиксела используется во время компенсации движения, билинейный фильтр, требующий небольшого объема вычислений, может использоваться для интерполяции вместо шестиполюсного фильтра, используемого в стандарте H.264. Процесс интерполяции мало влияет на эффективность сжатия.When the MV motion vector with an accuracy of, for example, 1/2 pixel is used during motion compensation, a bilinear filter that requires little computation can be used for interpolation instead of the six-pole filter used in the H.264 standard. The interpolation process has little effect on compression efficiency.

Устройство 755 восстановления кадров вычисляет разность Δ_Ммежду опорными кадрами с компенсированным движением уровня FGS и основного уровня mc(F_MF') и mc(F_MB'), то есть Δ_М= mc(F_MF') - mc(F_MB') и разность между опорными кадрами с компенсированным движением уровня FGS и основного уровня mc(F_NF') и mc(F_NB'), то есть, Δ_N= mc(F_NF') - mc(F_NB'). Конечно, когда используется однонаправленный опорный кадр, может быть вычислена только одна разность.The frame recovery device 755 calculates the difference Δ _M between the reference frames with the compensated movement of the FGS level and the main level mc (F _MF ') and mc (F _MB '), that is, Δ _M = mc (F _MF ') - mc (F _MB ' ) and the difference between the reference frames with the compensated movement of the FGS level and the main level mc (F _NF ') and mc (F _NB '), that is, Δ _N = mc (F _NF ') - mc (F _NB '). Of course, when a unidirectional reference frame is used, only one difference can be calculated.

Устройство 755 восстановления кадров также вычисляет среднее значение разностей Δ_Ми Δ_Nи складывает среднее значение, F_Δ', и восстановленное изображение F_о' основного уровня, чтобы создать восстановленное изображение F_OF' уровня FGS. Когда используется однонаправленный опорный кадр, процесс усреднения не требуется.The frame recovery device 755 also calculates the average value of the differences Δ _M and Δ _N and adds the average value, F _Δ ', and the reconstructed main level image F _o ' to create a reconstructed image F _OF 'of the FGS level. When a unidirectional reference frame is used, an averaging process is not required.

Буфер 740 затем хранит восстановленное изображение F_OF'. Конечно, предварительно восстановленные изображения F_MF' и F_BF' могут быть сохранены в буфере 740.Buffer 740 then stores the reconstructed image F _OF '. Of course, pre-reconstructed images F _MF 'and F _BF ' can be stored in buffer 740.

Альтернативно, видеодекодер, соответствующий второму примеру варианта осуществления настоящего изобретения, может иметь ту же самую конфигурацию и выполнять ту же самую операцию, как показано на фиг. 8, за исключением операции, выполняемой устройством восстановления кадров. То есть, устройство восстановления кадров, соответствующее второму примеру варианта осуществления, создает прогнозированный кадр для каждого уровня перед вычислением разности между кадрами на этих двух уровнях. То есть, можно сказать, что устройство восстановления кадров создает прогнозированный кадр на уровне FGS и прогнозированный кадр на основном уровне, используя опорные кадры уровня FGS с компенсированным движением и кадры основного уровня с компенсированным движением. Прогнозированные кадры могут быть созданы простым усреднением двух опорных кадров с компенсированным движением. Конечно, когда используется однонаправленное прогнозирование, прогнозированный кадр является самим кадром с компенсированным движением.Alternatively, the video decoder corresponding to the second example of an embodiment of the present invention may have the same configuration and perform the same operation as shown in FIG. 8, except for the operation performed by the frame recovery apparatus. That is, the frame recovery device according to the second example of the embodiment creates a predicted frame for each level before calculating the difference between frames at these two levels. That is, we can say that the frame recovery device creates a predicted frame at the FGS level and a predicted frame at the main level using reference frames of the FGS level with compensated motion and frames of the main level with compensated motion. Predicted frames can be created by simply averaging two reference frames with compensated movement. Of course, when using unidirectional prediction, the predicted frame is the frame itself with the compensated movement.

Устройство восстановления кадров затем вычисляет разность между прогнозированными кадрами и складывает вместе данные текстуры, восстановленный кадр основного уровня и разность.The frame recovery device then calculates the difference between the predicted frames and adds together the texture data, the reconstructed main level frame, and the difference.

На фиг.9 показана блок-схема видеодекодера 900, соответствующая третьему примеру варианта осуществления настоящего изобретения. Со ссылкой на фиг.9, в отличие от первого и второго примеров вариантов осуществления, в которых компенсация движения выполняется до вычисления разности между опорным изображением уровня FGS и опорным изображением основного уровня, видеодекодер 900 выполняет компенсацию движения после вычисления разности между опорными кадрами на этих двух уровнях. Во избежание повторного объяснения, последующее описание сосредоточит внимание на отличительных признаках первого примера варианта осуществления, показанного на фиг.4.FIG. 9 shows a block diagram of a video decoder 900 corresponding to a third example of an embodiment of the present invention. With reference to Fig. 9, in contrast to the first and second examples of embodiments in which motion compensation is performed before calculating the difference between the reference image of the FGS level and the reference image of the main level, video decoder 900 performs motion compensation after calculating the difference between the reference frames on these two levels. In order to avoid re-explanation, the following description will focus on the distinguishing features of the first example embodiment shown in FIG.

Вычитающее устройство 960 вычитает восстановленные опорные кадры F_MB' и F_NB' основного уровня, принятые от буфера 925 из опорных кадров F_MF', F_NF' уровня FGS, и подает результаты вычитания F_MF' - F_MB' и F_NF' - F_NB' на компенсатор 935 движения. Когда используется однонаправленный опорный кадр, существует только одна разность.Subtractor 960 subtracts the reconstructed reference frames F _MB 'and F _NB ' received from the buffer 925 from the reference frames F _MF ', F _NF ' of the FGS level, and provides subtraction results F _MF '- F _MB ' and F _NF '- F _NB 'on the compensator 935 movement. When a unidirectional reference frame is used, there is only one difference.

Компенсатор 935 движения использует измененный вектор движения MV₁, принятый от преобразователя 930 вектора движения, чтобы выполнить компенсацию движения на разностях F_MF' - F_MB' и F_NF' - F_NB' между опорными кадрами на уровне FGS и на основном уровне, принятыми от вычитающего устройства 960. Когда во время компенсации движения используется вектор движения MV₁ с точностью 1/2, для интерполяции может использоваться билинейный фильтр, требующий малого объема вычислений, вместо шестиполюсного фильтра, используемого в стандарте H.264. Как описано выше, интерполяция мало влияет на эффективность сжатия.The motion compensator 935 uses the modified motion vector MV ₁ received from the motion vector transducer 930 to perform motion compensation on the differences F _MF '- F _MB ' and F _NF '- F _NB ' between the reference frames at the FGS level and at the main level received from a subtractor 960. When a motion vector MV ₁ with 1/2 accuracy is used during motion compensation, a bilinear filter that requires a small amount of computation can be used for interpolation instead of the six-pole filter used in the H.264 standard. As described above, interpolation has little effect on compression efficiency.

Устройство 955 восстановления кадров вычисляет среднее значение разностей компенсированного движения, то есть, среднее значение между mc(F_MF' - F_MB') и mc(F_NF' - F_NB') и складывает с вычисленным средним значением, F_Δ', принятым от инверсного преобразователя 950, и восстановленным изображением F_о' основного уровня. При использовании однонаправленного опорного кадра процесс усреднения не требуется.The frame recovery device 955 calculates the average value of the differences of the compensated movement, that is, the average value between mc (F _MF '- F _MB ') and mc (F _NF '- F _NB ') and adds to the calculated average value, F _Δ ', taken from the inverse transformer 950, and the reconstructed image F _o 'of the main level. When using a unidirectional reference frame, the averaging process is not required.

На фиг.10 и 11 показаны блок-схемы примеров видеодекодеров 1000 и 1200, соответствующих четвертому примеру варианта осуществления настоящего изобретения.10 and 11 are block diagrams of examples of video decoders 1000 and 1200 corresponding to a fourth example of an embodiment of the present invention.

Со ссылкой на фиг.10 и 11, соответствующую фиг.8 и 9, показывающих видеодекодеры 700 и 900, соответствующие первому и третьему примерам вариантов осуществления, устройства 1055 и 1255 восстановления кадров прибавляют прогнозированный кадр F_PB основного уровня вместо восстановленного кадра F_о' основного уровня.With reference to FIGS. 10 and 11, corresponding to FIGS. 8 and 9, showing video decoders 700 and 900 corresponding to the first and third exemplary embodiments, frame recovery devices 1055 and 1255 add a predicted base layer frame F _PB instead of a reconstructed base frame F _o ′ level.

Видеодекодеры 1000 и 1200 четвертого примера варианта осуществления, показанного на фиг.10 и 11, соответствуют декодерам первого и третьего примеров вариантов осуществления, показанных на фиг.8 и 9, соответственно.The video decoders 1000 and 1200 of the fourth example embodiment shown in FIGS. 10 and 11 correspond to decoders of the first and third example embodiments shown in FIGS. 8 and 9, respectively.

Со ссылкой сначала на фиг.10, соответствующую фиг.8, компенсатор 1020 движения подает опорное изображение F_PB основного уровня на устройство 1055 восстановления кадров вместо восстановленного изображения F_о'. Таким образом, устройство 470 восстановления кадров складывает вместе F_Δ', полученный от инверсного преобразователя 1050, прогнозированное изображение F_PB основного уровня и среднее значение разностей Δ_Ми Δ_N для получения восстановленного изображения F_OF' основного уровня.With reference first to FIG. 10 corresponding to FIG. 8, the motion compensator 1020 supplies the reference layer image F _PB to the frame recovery device 1055 instead of the reconstructed image F _o ′. Thus, the frame reconstructor 470 adds together the device F _Δ ', received from inverse converter 1050, the predicted image F _PB core layer and the average value of the differences Δ _M and Δ _N to obtain the reconstructed image F _OF' of the core layer.

Точно так же, со ссылкой на фиг.11, устройство 1255 восстановления файлов складывает вместе F_Δ', принятый от инверсного преобразователя 1250, прогнозированное изображение F_PB основного уровня, принятое от компенсатора 1220 движения, и среднее значение разностей mc(F_MF' - F_MB') и mc(F_NF' - F_NB') с компенсированным движением, чтобы получить восстановленное изображение F_OF' основного уровня.Similarly, with reference to FIG. 11, the file recovery device 1255 adds together F _Δ 'received from the inverter 1250, the predicted main level image F _PB received from the motion compensator 1220, and the average difference value mc (F _MF ' - F _MB ') and mc (F _NF ' - F _NB ') with a compensated movement to obtain a reconstructed image of the basic level F _OF '.

Между тем, видеодекодер четвертого примера варианта осуществления, соответствующий видеодекодеру второго примера варианта осуществления (не показан), может иметь ту же самую конфигурацию и выполнять ту же самую операцию, как показано на фиг.8, за исключением операции, выполняемой устройством 1255 восстановления кадров. В видеодекодере четвертого примера варианта осуществления, соответствующего второму примеру варианта осуществления, устройство 1255 восстановления кадров создает прогнозированный кадр F_PF уровня FGS и прогнозированный кадр F_BFосновного уровня, используя опорные кадры mc(F_MF') и mc(F_NF') уровня FGS с компенсированным движением и опорные кадры mc(F_MB'), mc(F_NB') основного уровня с компенсированным движением. Устройство 1255 восстановления кадров также вычисляет разность F_PF-F_PB между прогнозированным кадром F_Pуровня FGS и прогнозированным кадром F_PB основного уровня и складывает с F_о', принятым от инверсного преобразователя 1250, прогнозированным изображением F_PB, принятым от компенсатора 1220 движения, и разностью F_PF-F_PB, чтобы получить восстановленное изображение F_OF'.Meanwhile, the video decoder of the fourth example embodiment, corresponding to the video decoder of the second example of the embodiment (not shown), can have the same configuration and perform the same operation as shown in Fig. 8, except for the operation performed by the frame recovery device 1255. In the video decoder of the fourth example of the embodiment corresponding to the second example of the embodiment, the frame recovery device 1255 creates the predicted frame F _{PF of} the FGS layer and the predicted frame F _{BF of the} main layer using the reference frames mc (F _MF ') and mc (F _NF ') of the FGS level with compensated movement and reference frames mc (F _MB '), mc (F _NB ') of the main level with compensated movement. The frame recovery device 1255 also calculates the difference F _PF -F _PB between the predicted FGS level frame F _P and the main level predicted frame F _PB and adds to F _o ′ received from the inverter 1250, the predicted image F _PB received from the motion compensator 1220, and a difference F _PF -F _PB to obtain a reconstructed image F _OF '.

При применении прогнозирования с пропусканием (пятый пример варианта осуществления) устройство 1255 восстановления кадров умножает весовой коэффициент α на разность F_PF-F_PBпромежуточного слоя и складывает вместе F_Δ' F_о' произведение α x (F_PF-F_PB), чтобы получить F_OF'.When applying transmission prediction (fifth example of an embodiment), the frame recovery device 1255 multiplies the weight coefficient α by the difference F _PF -F _{PB of the} intermediate layer and adds together F _Δ 'F _o ' the product α x (F _PF -F _PB ) to obtain F _OF '.

На фиг.12 показана блок-схема системы для выполнения процесса кодирования или декодирования, используя видеокодер 100, 300, 400, 600 или видеодекодер 700, 900, 1000, 1200 в соответствии с примером вариантов осуществления настоящего изобретения. Система может быть телевизором, компьютерной приставкой к телевизору (STB), настольным компьютером, ноутбуком или портативным компьютером, специализированным карманным компьютером (PDA), устройством хранения видеоинформации или изображений (например, видеомагнитофон или устройство цифровой видеозаписи). Система может быть комбинацией устройств, перечисленных выше, или другим устройством, содержащим их в себе. Кроме того, система может быть комбинацией вышеупомянутых устройств или одним из устройств, которое содержит в себе часть другого устройства из перечисленных. Система содержит, по меньшей мере, один источник 1310 видеосигнала, по меньшей мере, один модуль 1320 ввода-вывода, процессор 1340, запоминающее устройство 1350 и дисплейный блок 1330.12 is a block diagram of a system for performing an encoding or decoding process using a video encoder 100, 300, 400, 600 or a video decoder 700, 900, 1000, 1200 in accordance with an example embodiment of the present invention. The system may be a television, a television set-top box (STB), a desktop computer, laptop or portable computer, a specialized handheld computer (PDA), a storage device for video information or images (for example, a VCR or digital video recorder). The system may be a combination of the devices listed above, or another device containing them. In addition, the system may be a combination of the above devices or one of the devices, which contains part of another device from the above. The system comprises at least one video source 1310, at least one input / output module 1320, a processor 1340, a storage device 1350, and a display unit 1330.

Источник 1310 видеосигнала может быть телевизионным приемником, видеомагнитофоном или другим устройством, хранящим видеоинформацию. Источник 1310 видеосигнала может означать, по меньшей мере, одно сетевое подключение для приема видеоинформации или изображения от сервера, использующего Интернет, глобальную сеть (WAN), локальную сеть (LAN), территориальную радиовещательную систему, кабельную сеть, сеть спутниковых связей, радиосеть, телефонную сеть или подобное. Кроме того, источник 1310 видеосигнала может быть комбинацией сетей или одной сетью, содержащей часть другой сети среди сетей.The video source 1310 may be a television receiver, VCR, or other device that stores video information. Video source 1310 may mean at least one network connection for receiving video information or images from a server using the Internet, wide area network (WAN), local area network (LAN), territorial broadcasting system, cable network, satellite communications network, radio network, telephone network or the like. In addition, the video source 1310 may be a combination of networks or one network containing a portion of another network among networks.

Устройство 1320 ввода-вывода, процессор 1340 и запоминающее устройство 1350 связываются друг с другом через среду 1360 связи. Среда 1360 связи может быть шиной связи, сетью связи или, по меньшей мере, одной внутренней схемой связи. Входные видеоданные, принятые от источника 1310 видеосигнала, могут быть обработаны процессором 1340, используя, по меньшей мере, одну программу, которая хранится в запоминающем устройстве 1350 и может быть выполнена процессором 1340, чтобы создать выходные видеоданные, подаваемые на дисплейный блок 1330.An input / output device 1320, a processor 1340, and a storage device 1350 communicate with each other through a communication medium 1360. The communication medium 1360 may be a communication bus, a communication network, or at least one internal communication circuit. The input video data received from the video source 1310 may be processed by the processor 1340 using at least one program that is stored in the storage device 1350 and may be executed by the processor 1340 to create output video data supplied to the display unit 1330.

В частности, программа, хранящаяся в запоминающем устройстве 1350, содержит в себе масштабируемый на импульсной основе кодек, выполняющий способ настоящего изобретения. Кодек может храниться в запоминающем устройстве 1350, может считываться с носителя данных, такого как постоянное запоминающее устройство на компакт-диске (CD-ROM) или дискета, или может быть загружен с заранее определенного сервера через множество сетей.In particular, the program stored in the storage device 1350 includes a pulse-scalable codec that implements the method of the present invention. The codec may be stored in memory 1350, read from a storage medium, such as read-only memory on a compact disc (CD-ROM) or floppy disk, or may be downloaded from a predetermined server via multiple networks.

Промышленная применимостьIndustrial applicability

Как описано выше, настоящее изобретение обеспечивает видеокодирование, которое может значительно снизить объем вычислений, требующихся для осуществления алгоритма PFGS. Поскольку, согласно настоящему изобретению, процесс декодирования изменяется в соответствии с процессом видеокодирования, настоящее изобретение может применяться к стандартизированному документу H.264 SE.As described above, the present invention provides video coding, which can significantly reduce the amount of computation required to implement the PFGS algorithm. Since, according to the present invention, the decoding process is changed in accordance with the video encoding process, the present invention can be applied to a standardized H.264 SE document.

Хоты настоящее изобретение было показано и описано, в частности, со ссылкой на примеры его вариантов осуществления, специалистам в данной области техники должно быть понятно, что в форме и деталях в нем могут быть сделаны различные изменения без отхода от объема и сущности настоящего изобретения, как они определяются приведенной ниже формулой изобретения.Although the present invention has been shown and described, in particular with reference to examples of its embodiments, it will be understood by those skilled in the art that various changes can be made in the form and details without departing from the scope and spirit of the present invention, as they are defined by the claims below.

Claims

1. A video coding method supporting accurate scalability in quality (FGS), the method comprises the steps of obtaining a predicted image for the current frame using a first motion vector estimated with a predetermined accuracy; calculating the difference between the current frame and the predicted image; performing quantization of the difference between the current frame and the predicted image, performing inverse quantization of the quantized difference between the current frame and the predicted image, and creating a reconstructed image for the current frame by adding the inverted transformed result of the quantized difference with the predicted image for the current frame; performing motion compensation on the reference frame at the FGS level and on the reference frame at the main level, using a second motion vector with lower accuracy than the first motion vector; calculating the difference between the reference frame at the FGS level with compensated movement and the reference frame at the main level with compensated movement; subtracting the reconstructed image for the current frame and the calculated difference from the current frame and encode the result of the subtraction.

2. The method according to claim 1, in which the execution of the motion compensation comprises creating a second motion vector by changing the accuracy of the first motion vector, and the accuracy of the second motion vector used in performing the motion compensation is lower than the accuracy of the first motion vector used in obtaining predicted image for the current frame.

3. The method according to claim 1, in which the calculated difference is the average value of the first difference between the subsequent reference frame of the FGS level and the subsequent reference frame of the main level and the second difference between the previous reference frame of the FGS level and the previous reference frame of the main level.

4. The method according to claim 2, in which if interpolation is performed to compensate for the movement, then an interpolation filter of a different type is used for interpolation than the filter used to obtain the predicted image of the current frame.

5. The method according to claim 1, in which the encoding of the subtraction result comprises the steps of converting the subtraction result to create a conversion coefficient; quantization of the transform coefficient is performed to create a quantization coefficient, and a lossless quantization coefficient is encoded.

6. The method according to claim 1, in which obtaining a predicted image for the current frame comprises the steps of evaluating the first motion vector using the current frame and at least one reconstructed frame of the main level as a reference frame; performing motion compensation on the reference frames using the first motion vector; get the predicted image by averaging the reference frames with compensated movement.

7. The method according to claim 1, in which obtaining a predicted image for the current frame comprises the steps of evaluating the first motion vector using the current frame and the original frame adjacent to the current frame as a reference frame; performing motion compensation on the reference frame using the first motion vector, and a predicted image is obtained by averaging the reference frames with compensated movement.

8. The method according to claim 1, in which the reference frame at the FGS level is the source frame adjacent to the reference frame of the FGS level, and the reference frame of the main level is an adjacent frame restored from the main level.

9. The method according to claim 1, in which the reference frame at the FGS level is an adjacent frame recovered from the FGS level, and the reference frame of the main level is an adjacent frame recovered from the main level.

10. The method according to claim 5, in which the quantization step size used in the quantization of the transform coefficient is smaller than that used in the quantization of the difference.

11. A video coding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of obtaining a predicted image for the current frame using a first motion vector estimated with a predetermined accuracy; calculating the difference between the current frame and the predicted image; performing quantization of the difference between the current frame and the predicted image, performing inverse quantization of the quantized difference between the current frame and the predicted image, and creating a reconstructed image for the current frame by adding the inverted transformed result of the quantized difference with the predicted image for the current frame; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using the second motion vector with lower accuracy than the first motion vector, and create a predicted frame for the FGS level and a predicted frame for the main level, respectively; calculating the difference between the predicted frame for the FGS level and the predicted frame for the main level; subtracting the reconstructed image and the difference from the current frame; and encode the result of the subtraction.

12. The method according to claim 11, in which the execution of the motion compensation comprises creating a second motion vector by changing the accuracy of the first motion vector, and the accuracy of the second motion vector used in performing the motion compensation is lower than the accuracy of the first motion vector used in obtaining the predicted images for the current frame.

13. The method according to claim 11, in which the predicted frame of the FGS level is the average value of the reference frames of the F6S level with compensated movement, and the predicted frame of the main level is the average value of the reference frames of the main level with compensated movement.

14. The method according to item 12, in which if the interpolation is performed to compensate for the movement, then the filter type is used for interpolation, different from the interpolation filter, which is used to obtain the predicted image for the current interpolation frame.

15. The method according to claim 11, in which the encoding of the subtraction result comprises the steps of converting the subtraction result to create a conversion coefficient; quantization of the transform coefficient is performed to create a quantization coefficient, and a lossless quantization coefficient is encoded.

16. The method of claim 15, wherein the quantization step size used in quantizing the transform coefficient is smaller than the step size used in quantizing the difference.

17. A video coding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of obtaining a predicted image for the current frame using a first motion vector estimated with a predetermined accuracy; calculating the difference between the current frame and the predicted image; performing quantization of the difference between the current frame and the predicted image, inverse quantization of the quantized difference between the current frame and the predicted image, and creating a reconstructed image for the current frame by adding the inverted transformed result of the quantized difference with the predicted image for the current frame; calculating the difference between the reference frame of the exact scalability level in quality (FGS) and the reference frame of the main level; performing motion compensation for the difference using the second motion vector with lower accuracy than the first motion vector; subtracting the reconstructed image for the current frame and the result of motion compensation from the current frame; encode the result of the subtraction.

18. The method according to 17, in which performing motion compensation for the difference comprises creating a second motion vector by changing the accuracy of the first motion vector, and the accuracy of the second motion vector used when performing motion compensation for the difference is lower than the accuracy of the first motion vector used upon receipt of the predicted image for the current frame.

19. The method according to 17, in which the result of the motion compensation for the difference to be subtracted is the average value of the differences with compensated movement.

20. The method according to p. 18, in which, if the interpolation is performed to compensate for the difference, then the filter type is used for interpolation, different from the interpolation filter, which is used to obtain the predicted image for the current frame.

21. The method according to 17, in which the encoding of the subtraction result comprises the steps of converting the subtraction result to create a conversion coefficient; quantizing a transform coefficient to create a quantization coefficient; encode lossless quantization coefficient.

22. The method according to item 21, in which the quantization step size used in the quantization of the converted coefficient is smaller than the step size used in the quantization of the difference.

23. A video coding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of obtaining a predicted image for the current frame using a first motion vector estimated with a predetermined accuracy; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using the second motion vector with lower accuracy than the accuracy of the first motion vector; calculating the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level; subtracting the predicted image and the difference from the current frame; encode the result of the subtraction.

24. A video coding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of obtaining a predicted image for the current frame using a first motion vector estimated with a predetermined accuracy; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using the second motion vector with lower accuracy than the accuracy of the first motion vector, and create a predicted frame for the FGS level and a predicted frame for the main level, respectively; calculating the difference between the predicted frame for the FGS layer and the reference frame for the main layer; subtracting the predicted image and the calculated difference from the current frame; encode the result of the subtraction.

25. The method according to paragraph 24, further comprising the step of multiplying the difference between the predicted frame of the FGS level and the predicted frame of the main level by the weight coefficient a, in which the calculated difference when subtracting the predicted image is the product of the weight coefficient a and the difference between the predicted frame for the FGS level and predicted frame for the main level.

26. A video coding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of obtaining a predicted image for the current frame using a first motion vector estimated with a predetermined accuracy; calculating the difference between the reference frame of the FGS layer and the reference frame of the main level; performing motion compensation for a difference between the reference frame of the FGS level and the reference frame of the main level using the second motion vector with lower accuracy than the accuracy of the first motion vector; creating a reconstructed image for the current frame by adding the inverted transformed result of the quantized difference with the predicted image for the current frame; subtracting the reconstructed image and the result of motion compensation from the current frame; encode the result of the subtraction.

27. A video decoding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of extracting the texture layer data of the main layer and texture data of the FGS layer and the first motion vector from the input bit stream; restoring the frame of the main level from the texture data of the main level, while restoring the frame of the main level contains the steps of performing inverse quantization of texture data of the main level; performing inverse transformation of the inverse quantization result; creating a predicted image from a previously reconstructed reference frame of the main level using the first motion vector; and add together the predicted image and the result of the inverse transform; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using the second motion vector with lower accuracy than the first motion vector; calculating the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level with compensated movement; add together the frame of the main level, texture data of the FGS level and the difference.

28. The method according to item 27, in which the second motion vector used when performing motion compensation, has a lower accuracy than the first motion vector.

29. The method according to item 27, in which the calculated difference is the average value of the first difference between the previous reference frame of the FGS level and the previous reference frame of the reference level and the second difference between the subsequent reference frame of the FGS level and the subsequent reference frame of the main level.

30. The method according to p, in which, if interpolation is used to compensate for the movement, then an interpolation filter of a different type than that used when restoring the frame of the main level is used for interpolation.

31. The method according to item 27, in which the texture data of the FGS level, used when adding to the frame of the main level, are obtained by performing inverse quantization and inverse transformation of the extracted texture data of the FGS level.

32. The method of claim 31, wherein the quantization step size used in the inverse quantization applied to the texture data of the FGS layer is smaller than the step size used in the inverse quantization performed when reconstructing the main layer frame.

33. The method of video decoding that supports the algorithm for accurate scalability in quality (FGS), the method comprises the steps of extracting texture data of the main layer and texture data of the FGS level and the first motion vector from the input bit stream; restoring the frame of the main level from the texture data of the main level, while restoring the frame of the main level contains the steps of performing inverse quantization of texture data of the main level; performing inverse transformation of the inverse quantization result; creating a predicted image from a previously reconstructed reference frame of the main level using the first motion vector; and add together the predicted image and the result of the inverse transform; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using a second motion vector with lower accuracy than the first motion vector, and creating a predicted frame of the FGS level and a predicted frame of the main level; calculating the difference between the predicted reference frame of the FGS level and the predicted reference frame of the main level; and add together the texture data, the reconstructed frame of the main level and the difference.

34. The method of claim 33, wherein the second motion vector used to perform the motion compensation has lower accuracy than the first motion vector.

35. The method according to clause 34, in which, if interpolation is used to compensate for the movement, then an interpolation filter of a different type than that used when restoring the frame of the main level is used for interpolation.

36. The method according to clause 33, in which the texture data of the FGS level used when adding the frame of the main level, texture data of the FGS level and the remainder are obtained by performing inverse quantization and inverse conversion of the extracted texture data of the FGS level.

37. A video decoding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of extracting texture level data and texture level FGS data and a first motion vector from an input bit stream; restoring the frame of the main level from the texture data of the main level, while restoring the frame of the main level contains the steps of performing inverse quantization of texture data of the main level; performing inverse transformation of the inverse quantization result; creating a predicted image from a previously reconstructed reference frame of the main level using the first motion vector; and add together the predicted image and the result of the inverse transform; calculating the difference between the reference frame of the FGS layer and the reference frame of the main level; performing motion compensation on the difference using the second motion vector with lower accuracy than the first motion vector, and add together the texture data of the FGS level, the reconstructed frame of the main level, and the result of the motion compensation.

38. The method according to clause 37, in which the result of the motion compensation subjected to addition is the average value of the differences with compensated movement.

39. The method according to clause 37, in which the second motion vector used when performing motion compensation, has a lower accuracy than the first motion vector.

40. The method according to § 39, in which, if interpolation is performed to compensate for the movement, then an interpolation filter is used for interpolation, different from the interpolation filter used when restoring the frame of the main level.

41. The method according to clause 37, in which the FGS level texture data for addition is obtained by performing inverse quantization and inverse transformation on the extracted FGS level texture data.

42. A video decoding method supporting an accurate quality scalability algorithm (FGS), the method comprises the steps of extracting texture layer data of a basic layer, texture data of an FGS layer and a first motion vector from an input bit stream; restoring the predicted image for the frame of the main level from the texture data of the main level using the first motion vector; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using the second motion vector with an accuracy lower than the accuracy of the first motion vector; calculating the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level with compensated movement; add together the texture data of the FGS level, the predicted image and the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level with compensated movement.

43. A video decoding method supporting an accurate quality scalability algorithm (FGS), comprising the steps of extracting texture level data and FGS level texture data and first motion vectors from an input bit stream; restoring the predicted image for the frame of the main level from the texture data of the main level using the first motion vector; performing motion compensation on the reference frame of the FGS level and the reference frame of the main level using a second motion vector with an accuracy lower than the first motion vector, and creating a predicted frame of the FGS level and a predicted frame of the main level; calculating the difference between the predicted frame of the FGS level and the predicted frame of the main level; and add together the texture data of the FGS level, the predicted images and the calculated difference.

44. The method according to item 43, further comprising the step of multiplying the difference between the predicted frame of the FGS level and the predicted frame of the main level by the weight coefficient a, in which the calculated difference used in addition is the product of the weight coefficient a and the difference between the predicted frame of the FGS level and predicted frame of the main level.

45. The method of video decoding that supports the algorithm for accurate scalability in quality (FGS), comprising the steps of extracting texture data of the main level, texture data of the FGS level and the first motion vector from the input bit stream; restoring the predicted image for the frame of the main level from the texture data of the main level using the first motion vector; calculating the difference between the reference frame of the FGS layer and the reference frame of the main level; performing motion compensation for the difference using the second motion vector with an accuracy lower than the accuracy of the first vector; and add together the texture data of the FGS level, the predicted image and the difference.

46. A video encoder based on the Fine Quality Scalability Algorithm (FGS), comprising means for obtaining a predicted image for a current frame using a first motion vector estimated with predetermined accuracy; means for calculating the difference between the current frame and the predicted image; means for quantizing the difference between the current frame and the predicted image; means for inverting quantization of the quantized difference between the current frame and the predicted image; means for creating a reconstructed image for the current frame; motion compensation means for the reference frame of the FGS layer and the reference frame of the main layer using the second motion vector; means for calculating the difference between the reference frame of the FGS level with compensated movement and the reference frame of the main level with compensated movement; means for subtracting the reconstructed image and the difference from the current frame; means for encoding the result of the subtraction.

47. The encoder of claim 46, wherein the accuracy of the second motion vector used to perform the motion compensation is lower than the accuracy of the first motion vector used to obtain the predicted image for the current frame.

48. A video decoder based on the Fine Quality Scalability Algorithm (FGS), comprising: means for extracting core layer texture data, FGS layer texture data and a first motion vector from an input bit stream; means for reconstructing a frame of the base layer from texture data of the base layer; means of inverse quantization of texture data of the main level; means for inverting the inverse quantization result; means for performing motion compensation for the reference frame of the FGS level and the reference frame of the main level using the second motion vector with lower accuracy than the first motion vector; means for creating a predicted frame of the FGS level and a predicted frame of the main level; means for calculating the difference between the predicted frame of the FGS level and the predicted frame of the main level; and means for adding together the texture data, the reconstructed frame of the base layer and the difference.

49. The video decoder of claim 48, wherein the accuracy of the second motion vector used to perform the motion compensation is lower than the accuracy of the first motion vector extracted from the input bit stream.