HK1211754B - Arithmetic encoding device and arithmetic decoding device - Google Patents
Arithmetic encoding device and arithmetic decoding device Download PDFInfo
- Publication number
- HK1211754B HK1211754B HK15112355.7A HK15112355A HK1211754B HK 1211754 B HK1211754 B HK 1211754B HK 15112355 A HK15112355 A HK 15112355A HK 1211754 B HK1211754 B HK 1211754B
- Authority
- HK
- Hong Kong
- Prior art keywords
- background
- spectral coefficients
- probability density
- coefficients
- spectral
- Prior art date
Links
Description
本申请是申请日为2010年10月01日、申请号为201080045319.6、发明名称为“算术编码或算术解码的方法和设备”的发明专利申请的分案申请。This application is a divisional application of the invention patent application with application date of October 1, 2010, application number 201080045319.6, and invention name “Method and device for arithmetic coding or arithmetic decoding”.
技术领域Technical Field
本发明涉及多媒体数据的算术编码和解码。The present invention relates to arithmetic coding and decoding of multimedia data.
背景技术Background Art
算术编码是一种数据无损压缩的方法。算术编码基于概率密度函数 (PDF)。为了达到压缩效果,编码所基于的概率密度函数必须与数据实际遵循的实际概率密度函数相同或至少相似一越接近越好.Arithmetic coding is a method for lossless data compression. Arithmetic coding is based on a probability density function (PDF). To achieve compression, the PDF used for coding must be identical to, or at least similar to, the actual PDF of the data—the closer the better.
如果算术编码基于适当概率密度函数,则可以实现导致至少几乎最佳代码的显著压缩。因此,在编码和解码系数序列的音频、语音或视频编码中,算术编码是一种频繁使用技术,其中这些系数是用二进制表示的视频像素或音频或语音信号样本值的量化时频变换。If arithmetic coding is based on an appropriate probability density function, significant compression leading to at least nearly optimal codes can be achieved. Therefore, arithmetic coding is a frequently used technique in audio, speech or video coding for encoding and decoding sequences of coefficients, where these coefficients are quantized time-frequency transforms of the values of video pixels or audio or speech signal samples represented in binary.
为了进一步提高压缩,算术编码可以基于一组概率密度函数,其中用于编码当前系数的概率密度函数取决于所述当前系数的背景。也就是说,取决于出现具有相同量化值的系数的背景,可以将不同概率密度函数用于编码所述相同量化值。系数的背景通过包含在与各自系数相邻的一个或多个相邻系数的邻域,例如,序列中相邻地在要编码或要解码的各自系数前面的一个或多个已编码或已解码系数的子序列中的系数的量化值来定义。邻域可能出现的每种不同可能定义每一种被映射成概率密度函数的不同可能背景。To further improve compression, arithmetic coding can be based on a set of probability density functions, where the probability density function used to encode a current coefficient depends on the context of the current coefficient. That is, depending on the context in which coefficients with the same quantized value appear, different probability density functions can be used to encode the same quantized value. The context of a coefficient is defined by the quantized values of the coefficient contained in a neighborhood of one or more adjacent coefficients adjacent to the respective coefficient, for example, a subsequence of one or more coded or decoded coefficients that immediately precede the respective coefficient to be coded or decoded in the sequence. Each different possibility in which a neighborhood may appear defines a different possible context that is mapped to a probability density function.
事实上,只有当邻域足够大时所述压缩提高才变得明显。随之而来的是不同可能背景的数量的组合激增以及相应数量巨大的可能概率密度函数或相应复杂映射。In fact, the compression improvement becomes noticeable only when the neighborhood is large enough. This is accompanied by a combinatorial explosion in the number of different possible backgrounds and a correspondingly huge number of possible probability density functions or correspondingly complex mappings.
在如下文献中可以找到基于背景算术编码方案的一个例子:ISO/IEC JTC1/SC29/WG11 N10215,October 2008,Busan,Korea,提出统一语音和音频编码(USAC)的参考模型。按照该建议,将已经编码的4元组(4-tuples) 考虑为背景。An example of a background arithmetic coding scheme can be found in ISO/IEC JTC1/SC29/WG11 N10215, October 2008, Busan, Korea, Proposing a Reference Model for Unified Speech and Audio Coding (USAC). According to this proposal, already coded 4-tuples are considered as background.
在如下文献中可以找到基于USAC相关背景算术编码的另一个例子: ISO/IECJTC1/SC29/WG11 N10847,July 2009,London,UK。Another example of arithmetic coding based on a USAC related background can be found in ISO/IEC JTC1/SC29/WG11 N10847, July 2009, London, UK.
为了降低高阶条件熵编码中的复杂性,美国专利5,298,896提出了限定码元(symbol)的非均匀量化。In order to reduce the complexity of high-order conditional entropy coding, US Pat. No. 5,298,896 proposes non-uniform quantization of limited symbols.
发明内容Summary of the Invention
与要处理的数量巨大背景相对应,存在需要存储,检索和处理的数量巨大概率密度函数或从背景到概率密度函数的至少相应复杂映射。这提高了编码/解码延迟和存储容量要求的至少一种。因此,在技术上需要一种允许在降低编码/解码延迟和存储容量要求的至少一种的同时,几乎一样好地实现压缩的可替代解决方案。The large number of backgrounds to be processed corresponds to a large number of probability density functions, or at least a correspondingly complex mapping from backgrounds to probability density functions, that must be stored, retrieved, and processed. This increases at least one of encoding/decoding latency and storage capacity requirements. Therefore, there is a need in the art for an alternative solution that allows for compression that is nearly as good while reducing at least one of encoding/decoding latency and storage capacity requirements.
为了解决这种需要,本发明提出了根据实施例的用于使用前谱系数算术解码当前谱系数的设备和用于使用前谱系数算术编码当前谱系数的设备,如文中所述。In order to address this need, the present invention proposes an apparatus for arithmetically decoding a current spectral coefficient using previous spectral coefficients and an apparatus for arithmetically encoding a current spectral coefficient using previous spectral coefficients according to an embodiment, as described herein.
在从属权利要求中规定了进一步提出实施例的特征。Characteristic features of further proposed embodiments are specified in the dependent claims.
一种使用前谱系数算术解码当前谱系数的设备,所述前谱系数是已经解码的,其中所述前谱系数和所述当前谱系数被包含在对时频变换的视频、音频或语音信号样本值进行量化所得的一个或多个量化谱中,所述设备包含:A device for arithmetically decoding a current spectral coefficient using previous spectral coefficients, the previous spectral coefficients being already decoded, wherein the previous spectral coefficients and the current spectral coefficient are contained in one or more quantized spectra obtained by quantizing sample values of a time-frequency transformed video, audio or speech signal, the device comprising:
-处理前谱系数的处理部件;-processing unit for processing pre-spectral coefficients;
-从至少两个不同背景类别确定背景类别的第一部件,所述第一部件适用于将处理后前谱系数用于确定背景类别,- a first means for determining a background class from at least two different background classes, said first means being adapted to use the processed front spectral coefficients for determining the background class,
其中将前谱系数的量化绝对值之和用于背景类别的确定;The sum of the quantized absolute values of the front spectrum coefficients is used to determine the background category;
-确定概率密度函数的第二部件,所述第二部件适用于将所确定背景类别和从至少两个不同背景类别到至少两个不同概率密度函数的映射用于确定概率密度函数;- second means for determining a probability density function, said second means being adapted to use the determined background class and the mapping from at least two different background classes to at least two different probability density functions for determining the probability density function;
-根据所确定概率密度函数算术解码当前谱系数的算术解码器,- an arithmetic decoder for arithmetically decoding the current spectral coefficient according to the determined probability density function,
其中所述处理部件适用于非均匀地量化前谱系数的绝对值以便在背景类别的确定中使用;wherein the processing means is adapted to non-uniformly quantize the absolute values of the front spectral coefficients for use in the determination of the background class;
其中所述第二部件被配置为使用查找表或散列表实现所述映射。The second component is configured to implement the mapping using a lookup table or a hash table.
一种使用前谱系数算术编码当前谱系数的设备,所述前谱系数是已经编码的,其中所述前谱系数和所述当前谱系数被包含在对时频变换的视频、音频或语音信号样本值进行量化所得的一个或多个量化谱中,所述设备包含:A device for arithmetically encoding a current spectral coefficient using previous spectral coefficients, wherein the previous spectral coefficients are already encoded, wherein the previous spectral coefficients and the current spectral coefficient are contained in one or more quantized spectra obtained by quantizing sample values of a time-frequency transformed video, audio or speech signal, the device comprising:
-处理前谱系数的处理部件;-processing unit for processing pre-spectral coefficients;
-从至少两个不同背景类别确定背景类别的第一部件,所述第一部件适用于将处理后前谱系数用于确定背景类别,- a first means for determining a background class from at least two different background classes, said first means being adapted to use the processed front spectral coefficients for determining the background class,
其中将前谱系数的量化绝对值之和用于背景类别的确定;The sum of the quantized absolute values of the front spectrum coefficients is used to determine the background category;
-确定概率密度函数的第二部件,所述第二部件适用于将所确定背景类别和从至少两个不同背景类别到至少两个不同概率密度函数的映射用于确定概率密度函数;- second means for determining a probability density function, said second means being adapted to use the determined background class and the mapping from at least two different background classes to at least two different probability density functions for determining the probability density function;
-根据所确定概率密度函数算术编码当前谱系数的算术编码器,- an arithmetic coder for arithmetically coding the current spectral coefficient according to the determined probability density function,
其中所述处理部件适用于非均匀地量化前谱系数的绝对值以便在背景类别的确定中使用;wherein the processing means is adapted to non-uniformly quantize the absolute values of the front spectral coefficients for use in the determination of the background class;
其中所述第二部件被配置为使用查找表或散列表实现所述映射。The second component is configured to implement the mapping using a lookup table or a hash table.
算术编码或解码的所述方法将前谱系数分别用于当前谱系数的算术编码或解码,其中所述前谱系数是已经分别编码或解码的。所述前谱系数和所述当前谱系数两者都被包含在视频、音频或语音信号样本值的量化时频变换所得的一个或多个量化谱中。所述方法进一步包含处理前谱系数,将处理后前谱系数用于确定作为至少两个不同背景类别之一的背景类别,将所确定背景类别和从至少两个不同背景类别到至少两个不同概率密度函数的映射用于确定概率密度函数,以及根据所确定概率密度函数算术分别编码或解码当前谱系数。该方法的一个特征是处理前谱系数包含非均匀地量化前谱系数的绝对值。The method of arithmetic coding or decoding uses the previous spectrum coefficients for arithmetic coding or decoding of the current spectrum coefficients, respectively, wherein the previous spectrum coefficients have been encoded or decoded respectively. Both the previous spectrum coefficients and the current spectrum coefficients are contained in one or more quantized spectra obtained by quantizing the time-frequency transformation of the sample values of the video, audio or speech signal. The method further includes processing the previous spectrum coefficients, using the processed previous spectrum coefficients to determine a background category as one of at least two different background categories, using the determined background category and the mapping from the at least two different background categories to at least two different probability density functions to determine a probability density function, and arithmetically encoding or decoding the current spectrum coefficients according to the determined probability density function. A feature of the method is that the processed previous spectrum coefficients include the absolute values of the non-uniformly quantized previous spectrum coefficients.
将背景类别替代背景用于确定概率密度函数便于将得出不同但非常相似概率密度函数的两个或更多个不同背景分组成映射到单个概率密度函数的单个背景类别。该分组是通过将前谱系数的非均匀量化绝对值用于确定背景类别实现的。Using background classes instead of backgrounds for determining the probability density function facilitates grouping two or more different backgrounds that yield different but very similar probability density functions into a single background class that is mapped to a single probability density function. This grouping is achieved by using the non-uniform quantized absolute values of the front spectral coefficients for determining the background class.
例如,存在处理前谱系数包含确定前谱系数的量化绝对值之和以便用在确定背景类别中的实施例。类似地,存在算术编码设备的相应实施例,以及算术解码设备的相应实施例,其中的处理部件适用于确定前谱系数的量化绝对值之和以便用于确定背景类别。For example, there are embodiments in which processing the front spectral coefficients includes determining the sum of the quantized absolute values of the front spectral coefficients for use in determining the background class. Similarly, there are corresponding embodiments of the arithmetic encoding device and corresponding embodiments of the arithmetic decoding device, in which the processing means are adapted to determine the sum of the quantized absolute values of the front spectral coefficients for use in determining the background class.
在设备的进一步实施例中,所述处理部件适用于使处理前谱系数进一步包含按照第一量化方案量化前谱系数的绝对值的第一量化、确定按照第一量化方案量化的前谱系数的绝对值的方差的方差确定、将所确定方差用于选择至少两种不同非线性第二量化方案之一、和按照所选择非线性第二量化方案进一步量化按照第一量化方案量化的前谱系数的绝对值的第二量化。所述方法的进一步实施例包含相应步骤。所述方差确定可以包含确定按照第一量化方案量化的前谱系数的绝对值之和,并将所确定和值与至少一个阈值相比较。In a further embodiment of the device, the processing means is adapted to cause the processing of the pre-spectral coefficients to further comprise a first quantization of the absolute values of the pre-spectral coefficients according to a first quantization scheme, a variance determination of determining a variance of the absolute values of the pre-spectral coefficients quantized according to the first quantization scheme, using the determined variance to select one of at least two different non-linear second quantization schemes, and a second quantization of the absolute values of the pre-spectral coefficients quantized according to the first quantization scheme according to the selected non-linear second quantization scheme. A further embodiment of the method comprises the corresponding steps. The variance determination may comprise determining a sum of the absolute values of the pre-spectral coefficients quantized according to the first quantization scheme, and comparing the determined sum value with at least one threshold value.
在进一步实施例中,每种设备的所述处理部件可以适用于使处理导致第一后果或至少一个不同第二后果。然后,确定背景类别进一步包含确定对其处理导致第一后果的那些前谱系数的数量,并将所确定数量用于确定背景类别。In a further embodiment, the processing means of each device may be adapted to cause the processing to result in a first consequence or at least one different second consequence. Determining the context class then further comprises determining a number of those prespectral coefficients for which the processing results in the first consequence, and using the determined number for determining the context class.
每种设备可以包含接收模式切换信号和复位信号的至少一种的部件,其中所述设备适用于将所接收信号的至少一种用于控制背景类别的确定。Each device may comprise means for receiving at least one of a mode switch signal and a reset signal, wherein the device is adapted to use at least one of the received signals for controlling the determination of the context category.
所述至少两个不同概率密度函数可以通过将代表性数据集用于确定至少两个不同概率密度函数事先确定,所述映射可以使用查找表或散列表实现。The at least two different probability density functions may be determined in advance by using a representative data set to determine the at least two different probability density functions, and the mapping may be implemented using a lookup table or a hash table.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
本发明的示范性实施例例示在附图中,并在如下描述中得到更详细说明。这些示范性实施例只是为了阐明本发明,而不是限制定义在权利要求书中的本发明范围和精神而说明的。Exemplary embodiments of the present invention are illustrated in the accompanying drawings and described in more detail in the following description. These exemplary embodiments are only provided to illustrate the present invention, rather than to limit the scope and spirit of the present invention as defined in the claims.
在附图中:In the attached figure:
图1示范性地描绘了本发明编码器的一个实施例;FIG1 schematically illustrates an embodiment of an encoder according to the present invention;
图2示范性地描绘了本发明解码器的一个实施例;FIG2 exemplarily depicts an embodiment of a decoder according to the present invention;
图3示范性地描绘了确定背景类别的背景分类器的第一实施例;FIG3 exemplarily depicts a first embodiment of a background classifier for determining background categories;
图4示范性地描绘了确定背景类别的背景分类器的第二实施例;FIG4 exemplarily depicts a second embodiment of a background classifier for determining background categories;
图5a示范性地描绘了要在频域模式下编码或解码的当前谱区(bin)之前的前谱区的第一邻域;FIG5 a exemplarily depicts a first neighborhood of a previous spectral bin preceding a current spectral bin to be encoded or decoded in frequency domain mode;
图5b示范性地描绘了要在加权线性预测变换模式下编码或解码的当前谱区之前的前谱区的第二邻域;FIG5 b exemplarily depicts a second neighborhood of a previous spectral region preceding a current spectral region to be encoded or decoded in a weighted linear prediction transform mode;
图6a示范性地描绘了要在频域模式下编码或解码的当前最低频谱区之前的前谱区的第三邻域;FIG6 a exemplarily depicts a third neighborhood of the preceding spectral region preceding the current lowest spectral region to be encoded or decoded in frequency domain mode;
图6b示范性地描绘了要在频域模式下编码或解码的当前次最低频谱区之前的前谱区的第四邻域;FIG6 b exemplarily depicts a fourth neighboring region of a preceding spectral region preceding a current second lowest spectral region to be encoded or decoded in frequency domain mode;
图7a示范性地描绘了要在加权线性预测变换模式下编码或解码的当前最低频谱区之前的前谱区的第五邻域;FIG7 a exemplarily depicts a fifth neighborhood of the preceding spectral region preceding the current lowest spectral region to be encoded or decoded in the weighted linear prediction transform mode;
图7b示范性地描绘了要在加权线性预测变换模式下编码或解码的当前次最低频谱区之前的前谱区的第六邻域;FIG7 b exemplarily depicts a sixth neighbor of a previous spectral region preceding a current second lowest spectral region to be encoded or decoded in a weighted linear prediction transform mode;
图7c示范性地描绘了要在加权线性预测变换模式下编码或解码的当前第三最低频谱区之前的前谱区的第七邻域;FIG7 c exemplarily depicts a seventh neighbor of the previous spectral region preceding the current third lowest spectral region to be encoded or decoded in the weighted linear prediction transform mode;
图7d示范性地描绘了要在加权线性预测变换模式下编码或解码的当前第四最低频谱区之前的前谱区的第八邻域;FIG7 d exemplarily depicts an eighth neighbor of the preceding spectral region preceding the current fourth lowest spectral region to be encoded or decoded in the weighted linear prediction transform mode;
图8示范性地描绘了要编码或解码的不同谱区的邻域,所述不同谱区包含在要在频域模式下开始编码/解码或出现复位信号之后编码或解码的第一频谱中;以及FIG8 exemplarily depicts a neighborhood of different spectral regions to be encoded or decoded, said different spectral regions being included in a first spectrum to be encoded or decoded after starting encoding/decoding in frequency domain mode or after occurrence of a reset signal; and
图9示范性地描绘了要在加权线性预测变换模式下编码或解码的不同谱区的进一步邻域,所述不同谱区被包含在要在在加权线性预测变换模式下开始编码/解码或出现复位信号之后编码或解码的第二频谱中。FIG9 exemplarily depicts further neighborhoods of different spectral regions to be encoded or decoded in the weighted linear prediction transform mode, said different spectral regions being contained in a second spectrum to be encoded or decoded after the start of encoding/decoding in the weighted linear prediction transform mode or the occurrence of a reset signal.
具体实施方式DETAILED DESCRIPTION
本发明可以在包含相应适配的处理设备的任何电子设备上实现。例如,算术解码的设备可以在电视机、移动电话、个人计算机、mp3播放器、导航系统或汽车音响系统中实现。算术编码的设备可以在移动电话、个人计算机、有源汽车导航系统、数字照相机、数字摄像机或录音机等中实现。The present invention can be implemented in any electronic device that includes a correspondingly adapted processing device. For example, the device for arithmetic decoding can be implemented in a television, mobile phone, personal computer, MP3 player, navigation system, or car audio system. The device for arithmetic coding can be implemented in a mobile phone, personal computer, active car navigation system, digital camera, digital video camera, or audio recorder.
下文描述的示范性实施例涉及量化多媒体样本的时频变换所得的量化谱区的编码或解码。The exemplary embodiments described below relate to encoding or decoding of quantized spectral regions resulting from time-frequency transformation of quantized multimedia samples.
本发明基于将已经发射量化谱区,例如,序列中当前量化谱区BIN之前的前量化谱区用于确定用于分别算术编码和解码当前量化谱区BIN的概率密度函数的方式。The present invention is based on the method of using an already transmitted quantized spectral region, for example, a previous quantized spectral region before a current quantized spectral region BIN in a sequence, to determine a probability density function for respectively arithmetic encoding and decoding the current quantized spectral region BIN.
算术编码或算术解码的方法和设备的所述示范性实施例分别包含用于非均匀量化的步骤或部件。所有步骤或部件一起提供最高编码效率,但每个步骤或部件已单独实现本发明的构思,并提供与编码/解码延迟和/或存储要求有关的好处。因此,详细的描述应该理解为描述只实现所述的步骤或部件之一的示范性实施例,以及描述实现所述的步骤或部件的两个或更多个步骤或部件的组合的示范性实施例。The exemplary embodiments of the methods and apparatus for arithmetic coding or arithmetic decoding each include steps or components for non-uniform quantization. While all steps or components together provide the highest coding efficiency, each step or component individually implements the concepts of the present invention and offers benefits related to encoding/decoding latency and/or storage requirements. Therefore, the detailed description should be understood as describing exemplary embodiments that implement only one of the steps or components, as well as exemplary embodiments that implement a combination of two or more steps or components.
可以但无需包括在本方法的示范性实施例中的第一步骤是决定应该使用哪种一般变换模式的切换步骤。例如,在USAC无噪编码方案中,一般变换模式可以是频域(FD)模式或加权线性预测变换(wLPT)模式。每种一般模式可以将已编码或解码谱区的不同邻域,即,不同的选择用于确定PDF。The first step that may, but need not, be included in an exemplary embodiment of the present method is a switching step that determines which general transform mode should be used. For example, in the USAC noiseless coding scheme, the general transform mode may be a frequency domain (FD) mode or a weighted linear predictive transform (wLPT) mode. Each general mode may use a different neighborhood of the coded or decoded spectral region, i.e., a different selection, for determining the PDF.
此后,可以在模块背景生成COCL下确定当前谱区BIN的背景。根据确定的背景,通过分类背景确定背景类别,其中在分类之前,最好但未必通过背景的谱区的非均匀量化NUQ1处理背景。分类可以包含估计背景的方差 VES并将方差与至少一个阈值相比较。或者,直接从背景中确定方差估计值。然后将方差估计值用于控制最好但未必非线性的进一步量化NUQ2。Thereafter, the background of the current spectral region BIN can be determined in the Background Generation COCL module. Based on the determined background, a background class is determined by background classification, wherein the background is preferably, but not necessarily, processed by non-uniform quantization of the spectral region NUQ1 before classification. The classification can include estimating the variance of the background VES and comparing the variance to at least one threshold. Alternatively, a variance estimate can be determined directly from the background. The variance estimate is then used to control a further quantization NUQ2, preferably, but not necessarily non-linear.
在示范性地描绘在图1中的编码过程中,确定适当概率密度函数(PDF) 来编码当前量化谱区BIN。为此,只能使用在解码器方也已知的信息。也就是说,只能使用前编码或解码量化谱区。这是在背景分类器块COCL中完成的。在那里,所选前谱区定义用于确定实际背景类别的邻域NBH。背景类别可以通过背景类别号表示。背景类别号用于经由映射MAP,例如,经由查找表或散列表从PDF存储器MEM1中检索相应PDF。背景类别的确定可能取决于允许视所选模式而定使用不同邻域的一般模式开关GMS。如上所述,对于USAC,可能存在两种一般模式(FD模式和wLPT模式)。如果一般模式开关GMS是在编码器方实现的,则模式改变信号或当前一般信号必须被包含在位流中,以便解码器也知道它。例如,在ISO/IEC JTC1/SC29/WG11N10847, 2009年7月,英国伦敦,提出的统一语音和音频编码(USAC)的参考模型中,存在为发送一般模式而提出的表格4.4core_mode和表格4.5 core_mode0/1。In the encoding process exemplarily depicted in FIG1 , a suitable probability density function (PDF) is determined for encoding the current quantized spectral region BIN. For this purpose, only information that is also known on the decoder side can be used. That is, only the previously encoded or decoded quantized spectral region can be used. This is done in the background classifier block COCL. There, the selected previous spectral region defines a neighborhood NBH for determining the actual background class. The background class can be represented by a background class number. The background class number is used to retrieve the corresponding PDF from the PDF memory MEM1 via a mapping MAP, for example, via a lookup table or a hash table. The determination of the background class may depend on a general mode switch GMS that allows the use of different neighborhoods depending on the selected mode. As mentioned above, for USAC, there may be two general modes (FD mode and wLPT mode). If the general mode switch GMS is implemented on the encoder side, the mode change signal or the current general signal must be included in the bit stream so that the decoder also knows it. For example, in the reference model of Unified Speech and Audio Coding (USAC) proposed by ISO/IEC JTC1/SC29/WG11N10847, July 2009, London, UK, there are Table 4.4 core_mode and Table 4.5 core_mode0/1 proposed for transmitting general modes.
在确定了适合算术编码器AEC编码当前量化谱区BIN的PDF之后,将当前量化谱区BIN馈入邻域存储器MEM2中,即,当前谱区BIN变成前谱区。包含在邻域存储器MEM2中的前谱区可以被块COCL用于编码下一个谱区BIN。在存储当前谱区BIN期间,之前或之后,通过算术编码器AEC算术编码所述当前谱区BIN。将算术编码器AEC的输出存储在位缓冲器BUF中或直接写入位流中。After determining a PDF suitable for encoding the current quantized spectral region BIN by the arithmetic encoder (AEC), the current quantized spectral region BIN is fed into the neighborhood memory MEM2. That is, the current spectral region BIN becomes the previous spectral region. The previous spectral region contained in the neighborhood memory MEM2 can be used by the block COCL to encode the next spectral region BIN. The current spectral region BIN is arithmetically encoded by the arithmetic encoder (AEC) during, before, or after the storage of the current spectral region BIN. The output of the arithmetic encoder (AEC) is stored in the bit buffer BUF or written directly to the bitstream.
可以经由,例如,电缆或卫星发送或广播位流或缓冲器BUF的内容。或者,可以将算术编码谱区写在像DVD、硬盘、蓝光盘等那样的存储媒体上。 PDF存储器MEM1和邻域存储器MEM2可以在单个物理存储器中实现。The bit stream or the contents of the buffer BUF may be transmitted or broadcasted via, for example, cable or satellite. Alternatively, the arithmetically coded spectral region may be written on a storage medium like a DVD, hard disk, Blu-ray disc, etc. The PDF memory MEM1 and the neighborhood memory MEM2 may be implemented in a single physical memory.
复位开关RS可以便于不用知道前谱地在可以开始编码和解码的专用帧上不时重新开始编码或解码,专用帧被称为解码入口点。如果复位开关RS 是在编码器方实现的,则复位信号必须被包含在位流中,以便解码器也知道它。例如,在ISO/IEC JTC1/SC29/WG11N10847,2009年7月,英国伦敦,提出的统一语音和音频编码(USAC)的参考模型中,在WD表格4.10和表格4.14中存在arith_reset_flag。A reset switch, RS, allows for the occasional restart of encoding or decoding at a dedicated frame, known as a decoding entry point, without requiring prior knowledge of the preceding spectrum. If the reset switch, RS, is implemented on the encoder side, the reset signal must be included in the bitstream so that the decoder is also aware of it. For example, in the Unified Speech and Audio Coding (USAC) reference model, as proposed in ISO/IEC JTC1/SC29/WG11N10847, July 2009, London, UK, the arith_reset_flag is included in WD Tables 4.10 and 4.14.
在图2中示范性地描绘基于相应邻域的解码方案。它包含与编码方案相似的块。要用于算术解码的PDF的确定与编码方案相同,以保证在编码器和解码器两者中,确定的PDF相同。算术解码从位缓冲器BUF中或直接从位流中获取位,并使用确定的PDF解码当前量化谱区BIN。之后,解码的量化谱区馈入背景类别号确定块COCL的邻域存储器MEM2中,并可以用于解码下一个谱区。FIG2 exemplarily illustrates a decoding scheme based on a corresponding neighborhood. It contains blocks similar to the encoding scheme. The PDF to be used for arithmetic decoding is determined in the same way as the encoding scheme, ensuring that the determined PDF is identical in both the encoder and decoder. Arithmetic decoding retrieves bits from the bit buffer BUF or directly from the bitstream and uses the determined PDF to decode the current quantized spectral region BIN. The decoded quantized spectral region is then fed into the neighborhood memory MEM2 of the background category number determination block COCL and can be used to decode the next spectral region.
图3更详细地示范性描绘了确定背景类别的背景分类器COCL的第一实施例。FIG3 exemplarily depicts a first embodiment of a background classifier COCL for determining a background class in more detail.
在将当前量化谱区BIN存储在谱存储器MEM2中之前,可以在块NUQ1 中对其进行非均匀量化。这具有两方面好处:其一,使通常是16位带码元整数值的量化谱区的存储更有效,其二,减少了每个量化谱区具有的值的数量。这使得在块CLASS中的背景类别确定过程中极大地减少了可能背景类别。更进一步,由于在背景类别确定中,可能舍弃了量化谱区的码元,所以可以在非均匀量化块NUQ1中包括绝对值计算。在表1中,示出了如块NUQ1可以进行的示范性非均匀量化。在本例中,在非均匀量化之后,每个谱区可能有三个不同值。但是,一般说来,非均匀量化的唯一约束是减少一个谱区可能采用的值的数量。Before the current quantized spectral region BIN is stored in the spectral memory MEM2, it can be subjected to non-uniform quantization in the block NUQ1. This has two advantages: firstly, it makes the storage of the quantized spectral regions, which are usually 16-bit integer values with symbol values, more efficient, and secondly, it reduces the number of values that each quantized spectral region has. This results in a significant reduction of the possible background categories during the background category determination in the block CLASS. Furthermore, since the symbols of the quantized spectral region may be discarded in the background category determination, the absolute value calculation may be included in the non-uniform quantization block NUQ1. In Table 1, an exemplary non-uniform quantization as can be performed by the block NUQ1 is shown. In this example, after the non-uniform quantization, each spectral region may have three different values. However, in general, the only constraint of non-uniform quantization is to reduce the number of values that a spectral region may take.
表1 包括绝对值计算的示范性非均匀量化步骤Table 1 Example non-uniform quantization steps including absolute value calculation
将非均匀量化/映射谱区存储在谱存储器MEM2中。按照所选一般模式选择GMS,对于要编码的每个谱区的背景类别确定CLASS,选择谱区的所选邻域NBH。The non-uniformly quantized/mapped spectral regions are stored in the spectral memory MEM2. According to the selected general mode, GMS is selected, the background class CLASS is determined for each spectral region to be coded, and the selected neighborhood NBH of the spectral region is selected.
图5a示范性地描绘了要编码或解码的谱区的第一示范性邻域NBH。FIG5 a exemplarily depicts a first exemplary neighborhood NBH of a spectral region to be encoded or decoded.
在这个例子中,只将实际或当前频谱(帧)的谱区和一个前频谱(帧) 的谱区定义成邻域NBH。当然,可以将不止一个前频谱的谱区用作邻域的一部分,这样会变得更加复杂,但最终也可能提供更高编码效率。注意,相对于实际频谱,只有已经发送的谱区才可以用于定义邻域NBH,因为在解码器上也必须可访问它们。这里,以及在下面的例子中,假设谱区的发送顺序从低频到高频。In this example, only the spectral regions of the actual or current spectrum (frame) and the spectral regions of one previous spectrum (frame) are defined as the neighborhood NBH. Of course, more than one spectral region of the previous spectrum can be used as part of the neighborhood, which increases complexity but may ultimately provide higher coding efficiency. Note that only spectral regions that have been transmitted relative to the actual spectrum can be used to define the neighborhood NBH, as they must also be accessible at the decoder. Here, and in the following examples, it is assumed that the spectral regions are transmitted in order from low frequency to high frequency.
然后,将所选邻域NBH用作背景类别确定块COCL的输入。在下文中,在描述特定实现之前,首先说明背景类别确定以及简化形式后面的总体思路。The selected neighborhood NBH is then used as input to the background category determination block COCL. In the following, the general idea behind background category determination as well as a simplified form is first explained before describing a specific implementation.
背景类别确定后面的总体思路是使要编码的谱区的方差得到可靠估计。这个预测方差可以再次用于获取要编码的谱区的PDF的估计值。对于方差估计,没有必要评估邻域中的谱区的码元。因此,在存储在谱存储器MEM2中之前的量化步骤中已经可以舍弃码元。极简单背景类别确定看起来可能像如下那样:谱区BIN的邻域NBH看起来可能像图5a中那样,由7个谱区组成。如果使用在表1中所示的示范性非均匀量化,每个谱区可以具有3个值。这导致37=2187个可能背景类别。The general idea behind background class determination is to obtain a reliable estimate of the variance of the spectral region to be coded. This predicted variance can be used again to obtain an estimate of the PDF of the spectral region to be coded. For variance estimation, it is not necessary to evaluate the symbols of the spectral regions in the neighborhood. Therefore, symbols can already be discarded in the quantization step before storage in the spectrum memory MEM2. A very simple background class determination may look like the following: the neighborhood NBH of the spectral region BIN may look like in FIG5 a, consisting of 7 spectral regions. If the exemplary non-uniform quantization shown in Table 1 is used, each spectral region can have 3 values. This results in 3 7 = 2187 possible background classes.
为了进一步减少这种可能背景类别的数量,可以舍弃邻域NBH中的每个谱区的相对位置。因此,只计数分别具有0,1或2值的谱区的数量,其中0 谱区的数量、1谱区的数量和2谱区的数量之和当然等于邻域中谱区的总数量。在包含每一个可以呈现三个不同值当中的一个的n个谱区的邻域NBH中,存在0.5*(n2+3*n+2)种背景类别。例如,在7个谱区的邻域中,存在36个可能背景类别,而在6个谱区的邻域中,存在28个可能背景类别。To further reduce the number of possible background categories, the relative position of each bin in the neighborhood NBH can be discarded. Thus, only bins with values of 0, 1, or 2 are counted, where the sum of the number of 0 bins, the number of 1 bins, and the number of 2 bins is equal to the total number of bins in the neighborhood. In a neighborhood NBH containing n bins that can each assume one of three different values, there are 0.5*( n² + 3*n+2) possible background categories. For example, in a neighborhood of 7 bins, there are 36 possible background categories, while in a neighborhood of 6 bins, there are 28 possible background categories.
更复杂但仍然相当简单背景类别确定考虑到有研究表明前谱相同频率的谱区特别重要(在图5a,5b,6a,6b,7a,7b,7c,8和9中用带点圆圈描绘的谱区)。对于邻域中的其它谱区,即,在各自图形中描绘成带横线圆圈的那些谱区,相关位置较不相关。因此,将前谱中相同频率的谱区显性地用于背景类别确定,而对于其它6个谱区,只计数0谱区的数量、1谱区的数量和2谱区的数量。这导致了3×28=84个可能背景类别。实验表明,这样的背景分类对于FD模式非常有效。A more complex but still quite simple background class determination takes into account studies showing that the spectral regions of the same frequency as the previous spectrum are particularly important (the spectral regions depicted with dotted circles in Figures 5a, 5b, 6a, 6b, 7a, 7b, 7c, 8 and 9). For the other spectral regions in the neighborhood, i.e., those depicted as circles with horizontal lines in the respective figures, the relevant positions are less relevant. Therefore, the spectral regions of the same frequency in the previous spectrum are explicitly used for background class determination, while for the other 6 spectral regions, only the number of 0 spectral regions, the number of 1 spectral regions and the number of 2 spectral regions are counted. This results in 3×28=84 possible background classes. Experiments have shown that such background classification is very effective for FD mode.
背景类别确定可以通过控制第二非均匀量化NUQ2的方差估计VES扩展。这使背景类别生成COCL更好地适用于要编码的谱区的预测方差的更高动态范围。在图4中示范性地示出扩展背景类别确定的相应框图。The background class determination can be extended by controlling the variance estimate VES of the second non-uniform quantization NUQ2. This makes the background class generation COCL better adapted to the higher dynamic range of the prediction variance of the spectral region to be coded. A corresponding block diagram for the extended background class determination is shown exemplarily in FIG4.
在图4中所示的例子中,将非均匀量化分成两个步骤,其中前一个步骤提供较细量化(块NUQ1),后一步骤提供较粗量化(块NUQ2)。这便于该量化适用于,例如,邻域的方差。邻域的方差是在方差估计块VES中估计的,其中该方差估计基于块NUQ1中邻域NBH中的谱区的所述前较细量化。方差的估计无需精确,而是可以非常粗略。例如,将USAC应用于确定所述较细量化之后邻域NBH中的谱区的绝对值是否达到或超过方差阈值就足够了,也就是说,在高低方差之间切换就足够了。In the example shown in FIG4 , the non-uniform quantization is divided into two steps, wherein the first step provides a finer quantization (block NUQ1) and the second step provides a coarser quantization (block NUQ2). This facilitates the quantization to be applied, for example, to the variance of a neighborhood. The variance of the neighborhood is estimated in the variance estimation block VES, wherein the variance estimate is based on the previous finer quantization of the spectral region in the neighborhood NBH in block NUQ1. The estimate of the variance does not need to be exact, but can be very rough. For example, it is sufficient to apply USAC to determine whether the absolute value of the spectral region in the neighborhood NBH after the finer quantization reaches or exceeds a variance threshold, that is, it is sufficient to switch between high and low variance.
2步非均匀量化看起来可能像表2那样。在这个例子中,低方差模式对应于在表2中所示的1步量化。2-step non-uniform quantization may look like Table 2. In this example, the low variance mode corresponds to the 1-step quantization shown in Table 2.
表2描绘了示范性2步非均匀量化;第二或随后步骤依赖于方差被估计成高还是低地不同地量化Table 2 depicts an exemplary 2-step non-uniform quantization; the second or subsequent steps are quantized differently depending on whether the variance is estimated to be high or low
块CLASS中的最终背景类别确定与图3的简化版本中相同。可以按照方差模式使用不同背景类别确定。也可以使用不止两种方差模式,当然这导致背景类别的数量增加和复杂性的增加。The final background class determination in block CLASS is the same as in the simplified version of FIG3 . Different background class determinations can be used according to the variance pattern. More than two variance patterns can also be used, which of course results in an increased number of background classes and increased complexity.
对于频谱中的第一谱区,像在图5a或5b中所示那样的邻域是不可应用的,因为对于第一谱区,根本不存在或不是存在所有较低频的谱区。对于这些特殊情况的每一种,可以定义自身邻域。在进一步的实施例中,将预定值填入不存在谱区中。对于在图5a中给出的示范性邻域,频谱中要发送的第一谱区的定义邻域像在图6a和图6b中所示那样。其思路是将邻域扩展到较高频谱区,以便于将相同背景类别确定功能用于频谱的其余部分。这也意味着相同的背景类别,并且至少可以使用相同PDF。如果仅仅减小邻域的尺寸(当然这也是一种选择),则这是不可能的。For the first spectral region in the spectrum, a neighborhood like that shown in Figures 5a or 5b is not applicable, because for the first spectral region, not all lower frequency spectral regions exist or do not exist. For each of these special cases, a own neighborhood can be defined. In a further embodiment, predetermined values are filled in the non-existent spectral regions. For the exemplary neighborhood given in Figure 5a, the defined neighborhood for the first spectral region to be transmitted in the spectrum is as shown in Figures 6a and 6b. The idea is to extend the neighborhood to higher spectral regions so that the same background category determination function can be used for the rest of the spectrum. This also means the same background category and at least the same PDF can be used. This is not possible if the size of the neighborhood is simply reduced (which is of course also an option).
复位通常发生在编码新谱之前。如上所述,有必要将专用起点用于解码。例如,如果解码过程从某个帧/谱开始,实际上解码过程必须从最后复位的点开始,以便相继解码前帧直到所希望的开始谱。这意味着,复位发生得越多,解码退出的入口点就越多。但是,在复位后的频谱中编码效率更低。A reset typically occurs before encoding a new spectrum. As mentioned above, a dedicated starting point is necessary for decoding. For example, if the decoding process begins at a specific frame/spectrum, the decoding process must actually begin at the point of the last reset, decoding the previous frames sequentially until the desired starting spectrum is reached. This means that the more resets occur, the more exit points there are for decoding. However, encoding efficiency is lower in the spectrum after the reset.
在发生复位之后,没有前谱可用于邻域定义。这意味着只有实际频谱的前谱区才可以用在邻域中。但是,一般过程可能没有发生变化,并且可以使用相同“工具”。并且,必须与前节已经所述不同地对待第一谱区。After a reset occurs, there is no previous spectrum available for neighborhood definition. This means that only the first spectral region of the actual spectrum can be used in the neighborhood. However, the general process may not change, and the same "tools" can be used. However, the first spectral region must be treated differently than described in the previous section.
在图8中,示出了示范性复位邻域定义。这种定义可以用于在USAC的 FD模式下复位的情况下。An exemplary reset neighborhood definition is shown in Figure 8. This definition can be used in the case of reset in FD mode of USAC.
如图8的例子所示的附加背景类别的数量(如果使用量化步骤1之后的值,使用最终具有3个可能量化值或6个值的表格的量化)如下:对第1谱区的处理增加1个背景类别,对第2谱区的处理增加6个背景类别(使用量化步骤1之后的值),对第3谱区的处理增加6个背景类别,并且对第4谱区的处理增加10个背景类别。如果另外考虑两种(高低)方差模式,则这种背景类别的数量几乎加倍(只有对于没有信息可用的第一谱区、和使用量化步骤1之后的谱区的值的第二谱区没有加倍)。8 (using quantization resulting in a table with 3 possible quantized values or 6 values if the values after quantization step 1 are used) is as follows: 1 background category is added for the processing of spectral region 1, 6 background categories are added for the processing of spectral region 2 (using the values after quantization step 1), 6 background categories are added for the processing of spectral region 3, and 10 background categories are added for the processing of spectral region 4. If two (high and low) variance modes are additionally considered, the number of such background categories is almost doubled (only for the first spectral region, for which no information is available, and for the second spectral region using the values of the spectral region after quantization step 1).
对于复位的处理,在本例中导致了1+6+2×6+2×10=39个附加背景类别。The resetting process results in 1+6+2×6+2×10=39 additional background categories in this example.
映射块MAP采用块COCL确定的背景分类,例如,确定的背景类别号,并从PDF存储器MEM1中选择相应PDF。在这个步骤中,通过将单个PDF 用于不止一个背景类别,可以进一步减小必要存储容量。也就是说,具有相似PDF的背景类别可以使用联合PDF。这些PDF可以在训练阶段使用足够大代表性数据集预定。这种训练可以包括识别与相似PDF相对应的背景类别并合并相应PDF的优化阶段。取决于数据统计,这可以导致必须存储在存储器中的数量相当少PDF。在USAC的示范性实验形式中,成功地应用了从822 个背景类别到64个PDF的映射。The mapping block MAP uses the background classification determined by the block COCL, for example, the determined background category number, and selects the corresponding PDF from the PDF memory MEM1. In this step, the necessary storage capacity can be further reduced by using a single PDF for more than one background category. That is, background categories with similar PDFs can use a joint PDF. These PDFs can be predetermined during a training phase using a sufficiently large representative dataset. This training can include an optimization phase in which background categories corresponding to similar PDFs are identified and the corresponding PDFs are merged. Depending on the data statistics, this can result in a significantly smaller number of PDFs that must be stored in the memory. In an exemplary experimental form of USAC, a mapping from 822 background categories to 64 PDFs was successfully applied.
如果背景类别的数量不太大,这种映射功能MAP的实现可以是简单的查找表。如果数量越来越大,则由于效率原因,可以应用散列表搜索。If the number of background categories is not too large, the implementation of this mapping function MAP can be a simple lookup table. If the number is getting larger, a hash table search can be applied for efficiency reasons.
如上所述,一般模式开关GMS允许在频域模式(FD)与加权线性预测变换模式(wLPT)之间切换。依赖于模式,可以使用不同邻域。实验表明在图5a,图6a和6b和图8中描绘的示范性邻域对于FD模式来说足够大。但对于wLPT模式,发现如示范性地在图5b,图7a,7b和7c和图9中描绘的较大邻域是有利的。As described above, the general mode switch GMS allows switching between frequency domain mode (FD) and weighted linear predictive transform mode (wLPT). Depending on the mode, different neighborhoods can be used. Experiments have shown that the exemplary neighborhoods depicted in Figures 5a, 6a and 6b, and 8 are sufficiently large for FD mode. However, for wLPT mode, larger neighborhoods, as exemplarily depicted in Figures 5b, 7a, 7b, and 7c, and 9, have been found to be advantageous.
也就是说,在图9中描绘了wLPT模式下的示范性复位处理。在图7a, 7b,7c和7d中分别描述了频谱中最低、次最低、第三最低和第四最低谱区在 wLPT模式下的示范性邻域。并且,在图5b中描绘了频谱中所有谱区在wLPT 模式下的示范性邻域。That is, an exemplary reset process in wLPT mode is depicted in FIG9 . Exemplary neighborhoods of the lowest, second lowest, third lowest, and fourth lowest spectral regions in the spectrum in wLPT mode are depicted in FIG7 a , 7 b , 7 c , and 7 d , respectively. Furthermore, exemplary neighborhoods of all spectral regions in the spectrum in wLPT mode are depicted in FIG5 b .
在图5b中描绘的示范性邻域引起的背景类别的数量是3×91=273个背景类别。因子3是由在与当前要编码或当前要解码的一个谱区相同的频率上对一个谱区的特殊处理引起的。按照上面给出的公式,对于邻域中的其余12个谱区,存在0.5*((12*12)+3*12+2)=91种具有值2,1或0的谱区的号码的组合。在依赖于邻域的方差是否达到或超过阈值区分背景类别的实施例中,将 273个背景类别加倍。The number of background classes resulting from the exemplary neighborhood depicted in FIG5 b is 3×91=273 background classes. The factor 3 is due to the special treatment of a spectral region at the same frequency as a spectral region currently being encoded or decoded. According to the formula given above, for the remaining 12 spectral regions in the neighborhood, there are 0.5*((12*12)+3*12+2)=91 combinations of spectral region numbers having the values 2, 1, or 0. In an embodiment that relies on whether the variance of the neighborhood meets or exceeds a threshold to distinguish between background classes, the 273 background classes are doubled.
如图9所示的示范性复位处理也可以增加背景类别的数量。The exemplary reset process shown in FIG. 9 may also increase the number of background categories.
在在实验中产生良好结果的示范性测试实验中,存在以下表1中分解的 822个可能背景类别。In an exemplary test experiment that produced good results in experiments, there were 822 possible background categories broken down in Table 1 below.
表1 MPEG USAC CE建议的分解可能背景类别Table 1 Possible background categories for decomposition recommended by MPEG USAC CE
在示范性测试实施例中,将这822个可能背景类别映射成64个PDF。如上所述,该映射在训练阶段确定。所得64个PDF必须存储在ROM表中,例如,对于定点算术编码器,以16位精度存储。这里,揭露了所提方案的另一个优点:在在背景技术部分提到的USAC标准化的当前工作草案中,利用单个码字共同编码四元组(quadruple)(包含4个谱区的矢量)。即使矢量中的每个分量的动态范围极小,这也导致极大的码簿(例如,每个分量可能具有值[-4,..., 3]→84=4096个可能不同矢量)。但是,标量的编码使得即使每个谱区的动态范围很大码簿也极小。在示范性测试实施例中使用的码簿具有提供从-15到+15的谱区动态范围和转义码字(Esc-codeword)(对于该情况,谱区的值在这个范围之外)的32个项目。这意味着只有64×32个16位值必须存储在ROM表中。In the exemplary test embodiment, these 822 possible background categories are mapped to 64 PDFs. As mentioned above, this mapping is determined during the training phase. The resulting 64 PDFs must be stored in a ROM table, for example, with 16-bit precision for a fixed-point arithmetic coder. Here, another advantage of the proposed scheme is revealed: in the current working draft of the USAC standardization mentioned in the background section, a quadruple (a vector containing four spectral bins) is jointly encoded using a single codeword. Even if the dynamic range of each component in the vector is extremely small, this results in a very large codebook (e.g., each component can have the value [-4, ..., 3] → 8 4 = 4096 possible different vectors). However, scalar encoding allows for an extremely small codebook even with a large dynamic range for each spectral bin. The codebook used in the exemplary test embodiment has 32 entries, providing a spectral bin dynamic range from -15 to +15 and an escape codeword (for the case where the spectral bin value is outside this range). This means that only 64×32 16-bit values must be stored in the ROM table.
上面描述了使用前谱系数算术编码当前谱系数的方法,其中所述前谱系数是已经编码的,并且所述前谱系数和当前谱系数两者都被包含在视频、音频或语音信号样本值的量化时频变换所得的一个或多个量化谱中。在一个实施例中,所述方法包含处理前谱系数,将处理后前谱系数用于确定作为至少两个不同背景类别之一的背景类别,将所确定背景类别和从至少两个不同背景类别到至少两个不同概率密度函数的映射用于确定概率密度函数,以及根据所确定概率密度函数算术编码当前谱系数,其中处理前谱系数包含非均匀地量化前谱系数。The above describes a method for arithmetically encoding a current spectral coefficient using a previous spectral coefficient, wherein the previous spectral coefficient is already encoded and both the previous spectral coefficient and the current spectral coefficient are contained in one or more quantized spectra obtained by quantizing the time-frequency transform of the sample values of the video, audio or speech signal. In one embodiment, the method includes processing the previous spectral coefficient, using the processed previous spectral coefficient to determine a background category as one of at least two different background categories, using the determined background category and the mapping from the at least two different background categories to at least two different probability density functions to determine a probability density function, and arithmetically encoding the current spectral coefficient according to the determined probability density function, wherein the processed previous spectral coefficient includes non-uniformly quantized previous spectral coefficients.
在另一个示范性实施例中,使用已经编码的前谱系数算术编码当前谱系数的设备包含处理部件、确定背景类别的第一部件、存储至少两个不同概率密度函数的存储器、检索概率密度的第二部件、和算术编码器。In another exemplary embodiment, an apparatus for arithmetically encoding current spectral coefficients using already encoded previous spectral coefficients comprises a processing component, a first component for determining a background category, a memory for storing at least two different probability density functions, a second component for retrieving the probability density, and an arithmetic encoder.
然后,所述处理部件适用于通过非均匀量化处理已经编码的前谱系数,并且所述第一部件适用于将处理结果用于确定作为至少两个不同背景类别之一的背景类别。所述存储器存储至少两个不同概率密度函数、和便于检索与所确定背景类别相对应的概率密度函数的从至少两个不同背景类别到至少两个不同概率密度函数的映射。所述第二部件适用于从所述存储器中检索与所确定背景类别相对应的概率密度函数,并且所述算术编码器适用于根据检索的概率密度函数算术编码当前谱系数。Then, the processing means is adapted to process the encoded previous spectral coefficients by non-uniform quantization, and the first means is adapted to use the processing result to determine a background category as one of at least two different background categories. The memory stores at least two different probability density functions and a mapping from the at least two different background categories to the at least two different probability density functions for facilitating retrieval of a probability density function corresponding to the determined background category. The second means is adapted to retrieve the probability density function corresponding to the determined background category from the memory, and the arithmetic encoder is adapted to arithmetically encode the current spectral coefficient according to the retrieved probability density function.
存在使用已经编码的前谱系数算术解码当前谱系数的设备的对应另一个示范性实施例,该设备包含处理部件、确定背景类别的第一部件、存储至少两个不同概率密度函数的存储器、检索概率密度的第二部件、和算术解码器。There is a corresponding another exemplary embodiment of a device for arithmetically decoding current spectral coefficients using already encoded previous spectral coefficients, the device comprising a processing component, a first component for determining a background category, a memory for storing at least two different probability density functions, a second component for retrieving the probability density, and an arithmetic decoder.
然后,所述处理部件适用于通过非均匀量化处理已经编码的前谱系数,并且所述第一部件适用于将处理结果用于确定作为至少两个不同背景类别之一的背景类别。所述存储器存储至少两个不同概率密度函数、和便于检索与所确定背景类别相对应的概率密度函数的从至少两个不同背景类别到至少两个不同概率密度函数的映射。所述第二部件适用于从所述存储器中检索与所确定背景类别相对应的概率密度函数,并且所述算术解码器适用于根据检索的概率密度函数算术解码当前谱系数。Then, the processing means is adapted to process the encoded previous spectral coefficients by non-uniform quantization, and the first means is adapted to use the processing result to determine a background category as one of at least two different background categories. The memory stores at least two different probability density functions and a mapping from the at least two different background categories to the at least two different probability density functions for facilitating retrieval of a probability density function corresponding to the determined background category. The second means is adapted to retrieve the probability density function corresponding to the determined background category from the memory, and the arithmetic decoder is adapted to arithmetically decode the current spectral coefficient according to the retrieved probability density function.
Claims (2)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP09305961A EP2315358A1 (en) | 2009-10-09 | 2009-10-09 | Method and device for arithmetic encoding or arithmetic decoding |
| EP09305961.6 | 2009-10-09 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1211754A1 HK1211754A1 (en) | 2016-05-27 |
| HK1211754B true HK1211754B (en) | 2019-10-18 |
Family
ID=
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12463660B2 (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| HK1217824B (en) | Device for arithmetic encoding and device for arithmetic decoding | |
| HK1217827B (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| HK1217825B (en) | Arithmetic encoding device and arithmetic decoding device | |
| HK1211754B (en) | Arithmetic encoding device and arithmetic decoding device | |
| HK1211753B (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| HK1217826B (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| HK1211755B (en) | Arithmetic encoding device and arithmetic decoding device | |
| HK40064994B (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| HK40053509A (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| HK40053509B (en) | Method and device for arithmetic encoding or arithmetic decoding | |
| TW202549282A (en) | Method and apparatus for decoding spectral coefficients |