CN1150516C - Voice coding method and voice coder - Google Patents
Voice coding method and voice coder Download PDFInfo
- Publication number
- CN1150516C CN1150516C CNB971034516A CN97103451A CN1150516C CN 1150516 C CN1150516 C CN 1150516C CN B971034516 A CNB971034516 A CN B971034516A CN 97103451 A CN97103451 A CN 97103451A CN 1150516 C CN1150516 C CN 1150516C
- Authority
- CN
- China
- Prior art keywords
- reflection coefficient
- speech
- vector
- code
- codebook
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 title claims abstract description 22
- 239000013598 vector Substances 0.000 claims abstract description 128
- 230000008569 process Effects 0.000 claims abstract description 5
- 238000012545 processing Methods 0.000 claims description 14
- 230000005284 excitation Effects 0.000 claims description 13
- 230000004044 response Effects 0.000 claims description 11
- 230000006870 function Effects 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 abstract description 2
- 239000000654 additive Substances 0.000 abstract 1
- 230000000996 additive effect Effects 0.000 abstract 1
- 238000013139 quantization Methods 0.000 description 24
- 230000007774 longterm Effects 0.000 description 11
- 230000003595 spectral effect Effects 0.000 description 9
- 238000004891 communication Methods 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000013459 approach Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000005311 autocorrelation function Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000001953 sensory effect Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/083—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/135—Vector sum excited linear prediction [VSELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Analogue/Digital Conversion (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Abstract
矢量相加激励的线性预测编码(VSELP)语音编码器提供了改进的质量,减少了通常语音编码器的复杂性。VSELP使用代码簿,它有一预先定义的结构使得代码簿搜索过程所要求的计算大为减少。这个VSELP语音编码器运用基于定点阵技术(FLAT)的反射系数的单一或多段矢量量化器。此外,这个语音编码器使用预量化器来减小搜索的复杂性,使用高分辨标量量化器来减小所需要的存贮总量。以减少计算量和存贮要求。
A Vector Additive Excited Linear Predictive Coding (VSELP) speech coder provides improved quality and reduces the complexity of conventional speech coders. VSELP uses codebooks, which have a predefined structure so that the computation required for the codebook search process is greatly reduced. The VSELP speech coder employs single- or multi-segment vector quantizers based on reflection coefficients of fixed-array technology (FLAT). In addition, this speech coder uses a prequantizer to reduce the complexity of the search and a high-resolution scalar quantizer to reduce the amount of memory required. To reduce computation and storage requirements.
Description
技术领域technical field
本发明一般地涉及使用代码激励的线性预测编码(CELP),使用随机编码或矢量激励的语音编码方法,以及更具体地讲,涉及用于矢量相加激励的线性预测编码(VSELP)的矢量量化器方法。The present invention relates generally to code-excited linear predictive coding (CELP), speech coding methods using random coding or vector-excited coding, and more particularly to vector quantization for vector-additionally excited linear predictive coding (VSELP) device method.
背景技术Background technique
代码激励的线性预测(CELP)是一用于产生高质量合成语音的语音编码技术。这种语音编码也称为矢量激励的线性预测,被用于许多的语音通信和语音合成应用中。CELP特别是应用于数字语音加密和数字无线电话通信系统,这里语音质量,数据速率,大小和费用都是重要的问题。Code Excited Linear Prediction (CELP) is a speech coding technique used to generate high quality synthetic speech. This type of speech coding, also known as vector-excited linear prediction, is used in many speech communication and speech synthesis applications. CELP finds particular application in digital voice encryption and digital wireless telephone communication systems where voice quality, data rate, size and cost are all important issues.
在CELP语音编码器中,模拟输入语音信号特性的长期(音调(pitch))和短期(峰段(formant))预测器包含在一套时变滤波器内。具体地,可使用一个长期和一个短期滤波器。用于滤波器的激励信号从已存贮的改进序列的代码簿或码矢(codevectors)中选择。In a CELP speech coder, long-term (pitch) and short-term (formant) predictors of the characteristics of the analog input speech signal are contained within a set of time-varying filters. Specifically, one long-term and one short-term filter can be used. The excitation signal for the filter is selected from a stored codebook or codevectors of modified sequences.
对于每一帧语音,选择一最佳的激励信号。语音编码器施加一单独码矢到滤波器以产生一重建的语音信号,该重建的语音信号和原始输入的语音信号相比较,产生一误差信号。该误差信号通过一谱噪声加权滤波器被加权。谱噪声加权滤波器有一以人体听力感觉为基础的响应。最佳激励信号是一选出的码矢,对当时的语音帧它以最小能量产生加权的误差信号。For each frame of speech, an optimal excitation signal is selected. The speech encoder applies a single code vector to the filter to generate a reconstructed speech signal, which is compared with the original input speech signal to generate an error signal. The error signal is weighted by a spectral noise weighting filter. The spectral noise weighting filter has a response based on the human perception of hearing. The optimal excitation signal is a selected code vector which produces a weighted error signal with the least energy for the current speech frame.
典型地,线性预测编码(LPC)是用来模拟采样区段内的短期信号相关,也称为短期滤波器。短期信号相关表示声道的谐振频率。LPC系数是一组语音模型参量。其它的参量组可用来表示加于短期预测滤波器的激励信号的特性。这些其它的语音模型参量包括:线谱频率(LSF),逆谱(cepstral)系数,反射系数,对数面积比和反正弦。Typically, Linear Predictive Coding (LPC) is used to model short-term signal correlations within a sample segment, also known as a short-term filter. The short-term signal correlation indicates the resonant frequency of the vocal tract. The LPC coefficients are a set of speech model parameters. Other sets of parameters can be used to characterize the excitation signal applied to the short-term prediction filter. These other speech model parameters include: line spectral frequency (LSF), inverse spectral (cepstral) coefficients, reflection coefficient, log area ratio and arcsine.
语音编码器通常对激励信号进行矢量量化以减少为表示信号特性所必需的比特数。LPC系数可在量化前被转换为其它的上述的参量组。这些系数可被单独地量化(标量量化)或被一组地量化(矢量量化)。标量量化不如矢量量化那样有效。然而标量量化在计算和存贮要求上比矢量量化费便宜。LPC参量的矢量量化是应用于主要考虑编码效率的地方。Speech coders typically vector quantize the excitation signal to reduce the number of bits necessary to represent the signal characteristics. The LPC coefficients can be converted to other above-mentioned parameter groups before quantization. These coefficients can be quantized individually (scalar quantization) or as a group (vector quantization). Scalar quantization is not as efficient as vector quantization. However, scalar quantization is cheaper than vector quantization in terms of computation and storage requirements. The vector quantization of LPC parameters is applied where the coding efficiency is mainly considered.
多段矢量量化可被用来均衡编码效率,矢量量化器搜索复杂性,和矢量量化器存贮要求等。第一类多段矢量量化是将一Np元素的LPC参量矢量划分为n段,n段的每一段是单独地被矢量量化。第二类多段矢量量化是在n个矢量的代码簿中划分LPC参量,这里每个矢量代码簿复盖所有Np矢量元素。为了说明矢量量化,假设Np=10个元素,每个元素以2比特代表。传统的矢量量化每10个元素将要求220个码矢来表示所有可能码矢的可能性。第一类带有两段的多段矢量量化每5个元素将要求210+210个码矢。第二类带有2段的多段矢量量化每5个元素将要求210+210个码矢。这些矢量量化的每一种方法对于编码效率,搜索复杂性和存贮要求等方面都提供不同的利益。这样,现代技术的语音编码器将会从矢量量化方法和设备(这些设备和方法增加了编码效率,或减少了搜索复杂性或存贮要求而无需改变相应的要求)中得益。Multi-segment vector quantization can be used to balance coding efficiency, vector quantizer search complexity, and vector quantizer storage requirements. The first type of multi-segment vector quantization is to divide an LPC parameter vector of Np elements into n segments, and each segment of the n segments is vector quantized separately. The second type of multi-segment vector quantization divides LPC parameters in codebooks of n vectors, where each vector codebook covers all Np vector elements. To illustrate vector quantization, it is assumed that Np=10 elements, and each element is represented by 2 bits. Every 10 elements of traditional vector quantization will require 220 code vectors to represent the possibility of all possible code vectors. The first type of multi-segment vector quantization with two segments will require 2 10 +2 10 code vectors per 5 elements. The second type of multi-segment vector quantization with 2 segments will require 2 10 +2 10 code vectors per 5 elements. Each of these methods of vector quantization offers different benefits in terms of coding efficiency, search complexity, and storage requirements. Thus, state-of-the-art speech coders would benefit from vector quantization methods and devices that increase coding efficiency, or reduce search complexity or storage requirements without changing the corresponding requirements.
发明内容Contents of the invention
针对现有技术的缺陷,本发明提供了一种语音编码方法,包括步骤:For the defects of prior art, the invention provides a kind of speech coding method, comprising steps:
(a)利用M个基本矢量,构建2M个代码矢量的激励代码簿,其中M是自然数;(a) Utilize M basic vectors to construct an incentive codebook of 2 M code vectors, where M is a natural number;
(b)接收输入的语音;(b) receiving input speech;
(c)响应于该输入语音,计算对应于表示该输入语音的语音参数的反射系数值;(c) in response to the input speech, calculating a reflection coefficient value corresponding to a speech parameter representing the input speech;
(d)在一个表格中存储2N个反射系数值,每个反射系数值利用一个N比特代码来寻址,其中N是自然数;(d) store 2 N reflection coefficient values in a table, each reflection coefficient value is addressed by an N-bit code, where N is a natural number;
(e)处理代码矢量以产生一个合成语音;(e) processing the code vectors to produce a synthesized speech;
(f)从该激励代码簿中选择一个代码矢量,其中该代码矢量使该合成语音相对于该输入语音的加权误差实现最小,包括:(f) selecting a code vector from the excitation codebook, wherein the code vector minimizes the weighted error of the synthesized speech relative to the input speech, comprising:
(f1)当处理过程中需要反射系数值时,向该表格提供对应的N比特代码,以查寻反射系数值,(f1) When the reflection coefficient value is needed during processing, the corresponding N-bit code is provided to the table to look up the reflection coefficient value,
(f2)当处理过程中不需要反射系数值时,在处理期间仅存储该N比特代码,从而使反射系数值的存储要求实现最小。(f2) When the reflection coefficient value is not required during processing, only the N-bit code is stored during processing, thereby minimizing the storage requirements for the reflection coefficient value.
根据本发明的另一个方面,提供了一种语音编码器,包括:According to another aspect of the present invention, a speech encoder is provided, comprising:
一个代码簿生成器,其生成具有利用M个基本矢量形成的2M个代码矢量的激励代码簿,其中M是自然数;a codebook generator that generates a stimulus codebook with 2 M codevectors formed from M basis vectors, where M is a natural number;
接收输入的语音并且产生数据矢量的输入装置;an input device that receives input speech and generates a data vector;
与该输入装置相耦合的编码装置,其生成对应于表示该输入语音的语音参数的反射系数,并且该编码装置处理这些代码矢量,以生成合成语音;encoding means coupled to the input means to generate reflection coefficients corresponding to speech parameters representing the input speech, and the encoding means to process the code vectors to generate synthesized speech;
用于量化该反射系数的矢量量化器,该矢量量化器包括一个矢量量化器存储器,该矢量量化器存储器被配置成存储2N个反射系数值,且具有一个N比特的输入端和一个输出端,而且该矢量量化器存储器响应于该N比特输入端所接收的一个N比特地址,在输出端提供2N个反射系数值中的一个,其中N是自然数;A vector quantizer for quantizing the reflection coefficient, the vector quantizer comprising a vector quantizer memory configured to store 2N reflection coefficient values and having an N-bit input and an output , and the vector quantizer memory provides one of 2 N reflection coefficient values at the output, where N is a natural number, in response to an N-bit address received at the N-bit input;
与代码簿生成器相耦合的代码簿搜索控制器,用于从激励代码簿中选择一个代码矢量,以使合成语音和数据矢量之间的加权误差实现最小,该代码簿搜索控制器又与该矢量量化器相耦合,并且当处理过程中需要反射系数值时,向该矢量量化器提供一个相应的N比特代码,以查寻用于处理的反射系数值,当处理过程中不需要反射系数值时,该代码簿搜索控制器仅存储该N比特代码。a codebook search controller coupled to the codebook generator for selecting a code vector from the excitation codebook such that the weighted error between the synthesized speech and data vectors is minimized, the codebook search controller being coupled to the The vector quantizer is coupled, and the vector quantizer is provided with a corresponding N-bit code to look up the reflection coefficient value for processing when a reflection coefficient value is required for processing, and when a reflection coefficient value is not required for processing , the codebook search controller only stores the N-bit code.
根据本发明,减少了通常语音编码器的复杂性,并且减少了计算量和存储要求。According to the invention, the complexity of conventional speech coders is reduced, and the computation and storage requirements are reduced.
附图说明Description of drawings
图1是一无线电通信系统的方框图,它包括依据本发明的语音编码器。Figure 1 is a block diagram of a radio communication system including a speech encoder according to the invention.
图2是依据本发明的语音编码器的方框图。Figure 2 is a block diagram of a speech encoder according to the invention.
图3是依据本发明所使用的反正弦函数的曲线图。Figure 3 is a graph of the arcsine function used in accordance with the present invention.
具体实施方式Detailed ways
这里描述代码激励线性预测编码(CELP)的一种变形,被称为矢量相加激励的线性预测编码(VSELP)是本发明的优选实施例。VSELP使用一具有预先定义的结构的激励代码簿,这样,代码簿搜索过程所要求的计算就大大减少。这个VSELP语音编码器使用基于定点点阵技术(FLAT)的反射系数的单一或多段矢量量化器。此外,这个语音编码器使用了预量化器以减少矢量代码簿搜索复杂性,和使用高分辨率标量量化器以减少为存贮反射系数矢量代码簿所需要的存贮器总量。这就得到一反射系数的高性能的矢量量化器,它在计算上是有效的并已减少了存贮要求。A variant of Code Excited Linear Predictive Coding (CELP) described herein, known as Vector Addition Excited Linear Predictive Coding (VSELP), is the preferred embodiment of the invention. VSELP uses an excitation codebook with a predefined structure, so that the computation required for the codebook search process is greatly reduced. The VSELP vocoder uses single or multi-segment vector quantizers based on reflection coefficients in Fixed Point Lattice Technology (FLAT). In addition, this speech coder uses a prequantizer to reduce the vector codebook search complexity, and a high resolution scalar quantizer to reduce the total amount of memory required to store the reflection coefficient vector codebook. This results in a high performance vector quantizer of reflection coefficients which is computationally efficient and has reduced memory requirements.
图1是无线电通信系统100的方框图。无线电通信系统100包括两个收发机101,113,他们互相发送和接收语音数据。这两个收发机101,113可以是中继无线电系统,或无线电话通信系统或任何其它发送和接收语音数据的无线电通信系统的一部分。在发射机中,语音信号被输入到话筒108,语音编码器选择被量化的语音模型参量。被量化的参量的代码被发送到另一收发信机113。在另一收发信机113,所发送的被量化参量的代码被接收机121接收,并用来在语音译码器123中再生语音。再生的语音输出到扬声器124。FIG. 1 is a block diagram of a
图2是一VSELP语音编码器200的方框图。VSELP语音编码器200使用所接收的代码来决定使用代码簿中的激励矢量。VSELP编码器使用有2M个码矢(它由M个基本矢量构成)的激励代码簿。字度Vm(n)为第m个基本矢量,定义ui(n)为代码簿中的第i个码矢。FIG. 2 is a block diagram of a
于是then
这里0≤i-≤2M-1;0≤n≤N-1。换句话说,代码簿中的每一个码矢被构成为M个基本矢量的线性组合。线性组合由Q参量定义。Here 0≤i- ≤2M -1; 0≤n≤N-1. In other words, each code vector in the codebook is constituted as a linear combination of M basis vectors. Linear combinations are defined by Q parameters.
θim被定义为θ im is defined as
θim=+1如果码字i的比特m=1θ im =+1 if bit m=1 of codeword i
θim=-1如果码字i的比特m=0码矢i被构成为M个基本矢量之和,这里每一基本矢量的符号(正号或负号)由码字i中相应的比特状态来决定。注意如果我们求补码字i中全部比特数,相应的码矢就是码矢i的负数。所以,对于每个码矢,它的负数也是代码簿中的一个码矢。由于为相应的码字彼此互补,这些码矢对被称为互补码矢。θ im =-1 If the bit m=0 of the codeword i, the codevector i is constituted as the sum of M basic vectors, where the sign (positive or negative) of each basic vector is determined by the corresponding bit state in the codeword i to decide. Note that if we find the number of bits in the complement codeword i, the corresponding code vector is the negative of code vector i. So, for each codevector, its negative is also a codevector in the codebook. Since the corresponding codewords are complementary to each other, these codevector pairs are called complementary codevectors.
适宜的矢量被选择后,增益单元205对所选择的矢量乘以增益项γ换算。增益单元205的输出加到一组线性滤波器207和209以获得重建语音的N个样值。滤波器包括一“长期”(或“音调”)滤波器207,它将音调周期性插入激励中。“长期”滤波器207的输出又加到“短期”(或“峰段”)滤波器209。短期滤波器209给信号加上谱包络。After a suitable vector is selected, the
长期滤波器207包括长期预测器系数(LTP)。长期滤波器207试图从一个或多个很远以前的样值中预测下一个输出样值。如果仅有一个过去的样值被应用于预测器中,那么,预测器就是一单抽头预测器。典型地使用一到三个抽头。含有一个单抽头长期预测器的长期(“音调”)滤波器207,其传递函数由(1.1)给出The long-
B(z)的特性由两个量L和β表示。L称为“滞后”。对于声频的语音,L典型地应是音调周期或是它的倍数。L也可能是一非整数值。如果L是一非整数,一个内插的有限脉冲响应(FIR)滤波器被用来产生部分延迟的样值。β是长期(或“音调”)预测器系数。B(z) is characterized by two quantities L and β. L is called "lag". For audible speech, L would typically be the pitch period or a multiple thereof. L may also be a non-integer value. If L is a non-integer, an interpolating finite impulse response (FIR) filter is used to generate partially delayed samples. β is the long-term (or "pitch") predictor coefficient.
短期滤波器209包括短期预测器系数αi,它试图从前面的Np个输出样值中预测下一个输出样值。Np典型的范围是从8到12。在本优选的实施例中,Np等于10。短期滤波器209相当于传统的Lpc合成滤波器。短期滤波器209的传递函数由(1.2)给出The short-
短期滤波器209的特性由αi参量表示。对于全极点“合成”滤波器,它是直接形式滤波器系数。关于αi参量的详情可在下面见到。The characteristics of the short-
各种参量(代码,增益,滤波器参量)并不以相同的速率发送到合成器(语音译码器)。典型地,短期参量的更新通常比代码少。我们将定义短期参量更新率为“帧频”,更新之间的间隔为一“帧”。代码更新率由矢量长度N决定。我们将定义代码更新率为“子帧频”,代码更新间隔为“子帧”。一帧通常由整数个子帧组成。增益和长期参量可以子帧频,帧频或根据语音编码器设计的两者之间的某些速率更新。The various parameters (code, gain, filter parameters) are not sent to the synthesizer (speech coder) at the same rate. Typically, short-term parameters are updated less often than code. We will define the short-term parameter update rate as "frame rate", and the interval between updates as one "frame". The code update rate is determined by the vector length N. We will define the code update rate as "subframe frequency" and the code update interval as "subframe". A frame usually consists of an integer number of subframes. The gain and long-term parameters can be updated at the subframe rate, the frame rate, or some rate in between depending on the vocoder design.
代码簿搜索过程包括尝试每一个码矢作为可能激励CELP合成器的码矢。合成语音S′(n)在比较器211和输入语音S(n)相比较,而产生一差值信号ei。这个差值信号ei(n)由一谱加权滤波器W(z)213(也可能由第二加权滤波器C(z)加以滤波以产生一加权的误差信号e′(n)。e′(n)中的功率在能量计算器215上计算。产生最小加权误差功率的码矢被选为该子帧的码矢。谱加权滤波器213用来对基于感官考虑的误差谱进行加权。这个加权滤波器213是语音谱的一个函数,并能以短期(谱)滤波器209的α参量来表示。The codebook search process consists of trying each codevector as a possible codevector to excite the CELP synthesizer. The synthesized speech S'(n) is compared with the input speech S(n) in the
有两种方法能用来计算增益γ。可以在根据残余能量的代码簿搜索之前决定增益。这个增益然后可被固定用于代码簿搜索。另一个方法是当代码簿搜索时优化每个码矢的增益。产生最小加权误差的码矢将被选择,并且它相应的最佳增益将被用于γ。由于增益是对每个码矢的优化,所以后一方法一般地能产生更好的效果。这个方法也意味着增益项必须以子帧频更新。此技术的最佳代码和增益可计算如下:There are two methods that can be used to calculate the gain γ. The gain can be determined prior to the codebook search from the residual energy. This gain can then be fixed for codebook searches. Another approach is to optimize the gain of each codevector when the codebook is searched. The codevector that yields the smallest weighting error will be chosen, and its corresponding best gain will be used for γ. Since the gain is optimized for each code vector, the latter approach generally yields better results. This approach also means that the gain term must be updated at the subframe rate. The optimal code and gain for this technique can be calculated as follows:
1.对于子帧计算已加权的输入信号y(n)。1. Compute the weighted input signal y(n) for a subframe.
2.计算d(n),子帧的B(z)和W(z)(和(C(z),如果使C(z)的情况下)滤波器的零输入响应。(零输入响应是滤波器没有输入时的响应;滤波器状态的衰减)。2. Calculate d(n), B(z) and W(z) of the subframe (and (C(z), if C(z) is used) the zero-input response of the filter. (The zero-input response is filter response when there is no input; decay of the filter state).
3.在整个子帧计算p(n)=y(n)-d(n)(0≤n≤N-1)3. Calculate p(n)=y(n)-d(n) (0≤n≤N-1) in the entire subframe
4.对于每个代码i。4. For each code i.
a.计算g(n),即码矢i的B(z)和W(z)(和C(z),如果使用的情况下)的零状态响应。(零状态响应是将起始滤波状态置为零的滤波器输出)。a. Compute g(n), the zero-state response of B(z) and W(z) (and C(z), if used) for code vector i. (The zero state response is the filter output with the initial filter state set to zero).
b.计算b. Calculate
即,已滤波的码矢i和p(n)之间的互相关That is, the cross-correlation between the filtered codevectors i and p(n)
c.计算c. Calculate
即,已滤波的码矢i中的功率。That is, the power in the filtered codevector i.
5.选择i,它使
6.使用所选择的码字和它相应的量化增益,更新B(z)和W(z)(和C(z),如使用的情况下)滤波器的滤波状态,以得到合成器在步骤2的下一子帧起始的同样滤波状态。码矢i的最佳增益由(1.8)式给出6. Using the chosen codeword and its corresponding quantization gain, update the filtering states of the B(z) and W(z) (and C(z), if used) filters to obtain the synthesizer at step The same filtering state at the start of the next subframe of 2. The optimal gain for code vector i is given by (1.8)
使用最佳增益γi的码矢i的总加权误差由(1.9)给出The total weighted error for code vector i using the optimal gain γ i is given by (1.9)
短期预测器参量是图2短期滤波器209的αi′。这些是标准的LPC直接形式滤波器系数,许多LPC分析技术可被用来决定这些系数。在本优选实施例中,使用了一个快速定点协方差点阵算法(FLAT)。FLAT具有点阵算法的全部优点,包括有保证的滤波器稳定性,非窗口分析和在循环内量化反射系数的能力。此外,FLAT在数值上是健全的,并能在一定点处理器上很容易实现。The short-term predictor parameter is α i ' of the short-
短期预测器参量是从输入语音计算的,没有使用预加重。用于参量计算的分析长度是170个样值(NA=170)。预测器的阶数是10(Np=10)。The short-term predictor parameters are computed from the input speech without using pre-emphasis. The analysis length used for parameter calculation is 170 samples (N A =170). The order of the predictor is 10 (Np=10).
这部分将详细描述FLAT算法。令落入分析间隔内输入语音的样值用S(n)表示;0≤n≤NA-1。因为FLAT是一点阵算法,可以认为该技术是试图逐级地建立一最佳(它使残余能量最小)的逆点阵滤波器。定义bj(n)为来自逆点阵滤波器第j级的反向余项,fj(n)为来自逆点阵滤波器第j级的正向余项,我们可定义This section will describe the FLAT algorithm in detail. Let the samples of input speech falling within the analysis interval be denoted by S(n); 0≤n≤NA - 1. Because FLAT is a lattice algorithm, the technique can be thought of as an attempt to progressively build an optimal (which minimizes residual energy) inverse lattice filter. Define b j (n) as the reverse remainder from the jth stage of the inverse lattice filter, and f j (n) as the forward remainder from the jth stage of the inverse lattice filter, we can define
为fj(n)的自相关;is the autocorrelation of f j (n);
为bj(n-1)的自相关;以及is the autocorrelation of b j (n-1); and
为fj(n)和bj(n-1)之间的互相关。is the cross-correlation between f j (n) and b j (n-1).
令rj表示逆点阵第j级的反射系数。于是Let r j denote the reflection coefficient of the jth order of the inverse lattice. then
和and
和and
我们已选择的用来决定rj的公式可表示为The formula we have chosen to determine rj can be expressed as
FLAT算法现可描述如下:The FLAT algorithm can now be described as follows:
1.首先从输入语音中计算协方差(自相关)矩阵1. First calculate the covariance (autocorrelation) matrix from the input speech
2.2.
F0(i,k)=f(i,k) 0≤i,k≤NP-1 (2.9)F0(i,k)=f(i,k) 0≤i,k≤NP-1 (2.9)
B0(i,k)=f(i+1,k+1) 0≤i,k≤NP-1 (2.10)B0(i,k)=f(i+1,k+1) 0≤i,k≤NP-1 (2.10)
C0(i,k)=f(i,k+1) 0≤i,k≤NP-1 (2.11)C0(i,k)=f(i,k+1) 0≤i,k≤NP-1 (2.11)
3.设j=13. Let j=1
4.用(2.7)式计算rj4. Use formula (2.7) to calculate rj
5.如j=Np,于是结束5. If j=Np, then end
6.用(2.4)计算Fj(i,k) 0≤i,k≤Np-j-16. Use (2.4) to calculate F j (i, k) 0≤i, k≤Np-j-1
用(2.5)计算B(i,k) 0≤i,k≤Np-j-1Use (2.5) to calculate B(i, k) 0≤i, k≤Np-j-1
用(2.6)计算Cj(i,k) 0≤i,k≤Np-j-1Use (2.6) to calculate C j (i, k) 0≤i, k≤Np-j-1
7.j=j+1;转到47. j=j+1; go to 4
在解出反射系数之前,用给自相关函数开窗的方法修正φ数值Before solving the reflection coefficient, correct the value of φ by windowing the autocorrelation function
φ′(i,k)=φ′(i,k)w(|i-k|) (2.12)φ′(i, k)=φ′(i, k)w(|i-k|) (2.12)
在计算反射系数之前,给自相关函数开窗通称为谱平滑(SST)。Windowing the autocorrelation function before calculating the reflection coefficient is known as spectral smoothing (SST).
从反射系数rj,可计算短期LPC预测系数αi。From the reflection coefficient r j , the short-term LPC prediction coefficient α i can be calculated.
使用反射系数的28比特三段矢量量化器。矢量量化器的段分别复盖反射系数r1-r3,r4-r6,和r7-r10。矢量量化器段的比特分配是28-bit three-segment vector quantizer using reflection coefficients. The segments of the vector quantizer cover reflection coefficients r 1 -r 3 , r 4 -r 6 , and r 7 -r 10 , respectively. The bit allocation of the vector quantizer section is
Q1 11比特Q 1 11 bits
Q2 9比特Q 2 9 bits
Q3 8比特为避免穷举的矢量量化器搜索的计算复杂性,在每一段使用了反射系数矢量的预量化器。每段预量化器的大小是:Q 3 8 bits To avoid the computational complexity of an exhaustive vector quantizer search, a prequantizer of reflection coefficient vectors is used in each segment. The size of each prequantizer is:
P1 6比特P 1 6 bits
P2 5比特P 2 5 bits
P3 4比特P 3 4 bits
在一给定段,来自预量化器的每个矢量的残余误差被计算并存入暂时存贮器中。搜索这个表以识别最低失真的4个预量化器矢量。每个选出的预量化器矢量的指数是用来计算进入矢量量化器表中的偏移,与预量化器矢量相连系的量化器矢量邻接的子集从这里开始。在第k段每个矢量量化器子集的大小由下式给出:At a given stage, the residual error for each vector from the prequantizer is calculated and stored in temporary memory. Search this table to identify the 4 prequantizer vectors with the lowest distortion. The index of each selected prequantizer vector is used to calculate the offset into the vector quantizer table from which the contiguous subset of quantizer vectors associated with the prequantizer vector begins. The size of each vector quantizer subset in segment k is given by:
与选出的预量化器矢量相连系的量化器矢量的4个子集被检索用来寻找产生最低残余误差的量化器矢量。这样,在第一段估算64个预量化器矢量和128个量化器矢量,在第二段估算32个预量化器矢量和64个量化器矢量,在第三段估算16个预量化器矢量和64个量化器矢量。通过上述具有频带扩展的FLAT技术所计算的最佳反射系数在矢量量化之前被转换为一自相关矢量。The 4 subsets of quantizer vectors associated with the selected prequantizer vector are searched to find the quantizer vector that produces the lowest residual error. Thus, 64 prequantizer vectors and 128 quantizer vectors are estimated in the first stage, 32 prequantizer vectors and 64 quantizer vectors are estimated in the second stage, and 16 prequantizer vectors and 64 quantizer vectors. The optimal reflection coefficient calculated by the above-mentioned FLAT technique with band extension is converted into an autocorrelation vector before vector quantization.
FLAT算法的自相关方案AFLAT是被用来计算正在被估算的反射系数矢量的残余误差能量。象FLAT一样,当计算最佳反射系数或从当前段的矢量量化器中选择反射系数时,这个算法具有部分地补偿来自以前点阵级中反射系数量化误差的能力。这个改善对于具有高反射系数量化失真的帧是很重要的。现在描述AFLAT算法(在带有预量化器的多段矢量量化器的范围内):The autocorrelation scheme AFLAT of the FLAT algorithm is used to calculate the residual error energy of the reflection coefficient vector being estimated. Like FLAT, this algorithm has the ability to partially compensate reflection coefficient quantization errors from previous lattice stages when calculating the best reflection coefficient or selecting reflection coefficients from the current stage's vector quantizer. This improvement is important for frames with high reflectance quantization distortion. Now describe the AFLAT algorithm (in the context of a multi-segment vector quantizer with prequantizer):
在0≤i≤Np范围内,根据最佳反射系数计算自相关序列R(i)。或者,可根据其它的LPC参量表示(如直接形式LPC预测器系数αi′)来计算自相关序列,或直接根据输入语音来计算。In the range of 0≤i≤Np, the autocorrelation sequence R(i) is calculated according to the optimal reflection coefficient. Alternatively, the autocorrelation sequence can be calculated according to other LPC parameter representations (such as direct form LPC predictor coefficients α i '), or directly from the input speech.
定义AFLAT循环的起始条件:Define the starting conditions for the AFLAT loop:
P0(i)=R(i),0≤i≤Np-1 (2.14)P 0 (i)=R(i), 0≤i≤N p -1 (2.14)
V0(i)=R(|i+1|),1-Np≤i≤Np-1 (2.15)V 0 (i)=R(|i+1|), 1-N p ≤ i ≤ N p -1 (2.15)
初始化矢量量化器段指数k:Initialize the vector quantizer segment index k:
k=1 (2.16)令Il(k)是第k段中第一点阵级的指数,Ih(k)是第k段的最后点阵级的指数。在第k段估算来自点阵级Ih(k)的残余误差的循环,已知r,来自预量化器的反射系数矢量或来自量化器的反射系数矢量表示如下。k = 1 (2.16) Let I l (k) be the index of the first lattice stage in the kth segment and Ih (k) be the index of the last lattice stage in the kth segment. The cycle of estimating the residual error from the lattice level Ih (k) at segment k, knowing r, the reflection coefficient vector from the prequantizer or the reflection coefficient vector from the quantizer is expressed as follows.
初始化点阵级的指数j,以指向第k段的开始:Initialize the index j of the lattice level to point to the start of the kth segment:
j=Il(k) (2.17)j=I l (k) (2.17)
设置Pj-1和Vj-1的起始条件为Set the initial conditions of P j-1 and V j-1 as
Pj-1(i)=Pj-1(i),0≤i≤Ih(k)-Il(k)+1 (2.18)P j-1 (i) = P j-1 (i), 0≤i≤I h (k)-I l (k)+1 (2.18)
Vj-1(i)= Vj-1(i),-Ih(k)+Il(k)-1≤i≤Ih(k)-Il(k)+1(2.19)V j-1 (i) = V j-1 (i), -I h (k)+I l (k)-1≤i≤I h (k)-I l (k)+1 (2.19)
用下式计算Vj和Pj数值,Calculate the V j and P j values using the following formula,
j增加:j increase:
j=j+1 (2.22)j=j+1 (2.22)
如j≤Ih(k)转到(2.20)式。Such as j≤I h (k) to (2.20) formula.
已知反射系数矢量 来自点阵级Lh(k)的残余误差由下式给出:Known reflection coefficient vector The residual error from lattice level L h (k) is given by:
使用概述的AFLAT循环,‘估算来自第k段预量化器的每个矢量引起的残余误差,识别要搜索的量化器矢量的4个子集,并计算来自所选出的4个子集的每个量化器矢量引起的残余误差。在4个子集中所有量化器矢量范围内使Er为最小的量化器矢量 的指数,被用Qk比特编码。如k<3,于是在k+1段进行循环的起始条件需要计算。设点阵级指数j等于:Using the outlined AFLAT loop, 'Estimate the residual error induced by each vector from the k-th segment prequantizer, identify 4 subsets of quantizer vectors to search for, and compute each quantizer from the 4 selected subsets The residual error caused by the device vector. The quantizer vector that minimizes Er over the range of all quantizer vectors in the 4 subsets The exponent of is coded with Q k bits. If k<3, then the initial condition for looping in segment k+1 needs to be calculated. Let the lattice index j be equal to:
j=Il(k) (2.24)j=I l (k) (2.24)
计算:calculate:
j增加,j increases,
j=j+1 (2.27)j=j+1 (2.27)
如j≤Ih(k),转到式(2.25)If j≤I h (k), go to formula (2.25)
增加矢量量化器分段指数k:Increase the vector quantizer segment index k:
k=k+1 (2.28)k=k+1 (2.28)
如k≤3,转到(2.17)。否则,选择三段的反射系数矢量的指数,并终止反射系数矢量量化器的搜索。If k≤3, go to (2.17). Otherwise, the index of the reflection coefficient vector of the three segments is selected and the search of the reflection coefficient vector quantizer is terminated.
为了使反射系数矢量量化器的存贮要求为最小,单个反射系数的八位代码而不是实际的反射系数值被存贮于矢量量化器表中,该代码被用来从有256项的标量量化表中查寻反射系数值。八位代码表示通过均匀采样图3所示的反正弦函数所得到的反射系数值。反射系数值从-1到+1变化。当值是接近于+/-1的极限时,在反射系数域(X轴)中的非线性间隔提供更精确的反射系数。而当值是近于0时,提供稍差的精确度。假设为256量化级时,和在反射系数域中均匀采样相比较,这减少了由于反射系数的标量量化引起的谱失真。In order to minimize the storage requirements of the reflection coefficient vector quantizer, the eight-bit code of the individual reflection coefficient, rather than the actual reflection coefficient value, is stored in the vector quantizer table. Look up reflectance values in the table. The eight-bit code represents the reflection coefficient value obtained by uniformly sampling the arcsine function shown in Figure 3. The reflectance value varies from -1 to +1. Non-linear spacing in the reflection coefficient domain (X-axis) provides more accurate reflection coefficients when values are near the limit of +/-1. And when the value is close to 0, slightly worse accuracy is provided. Assuming 256 quantization levels, this reduces spectral distortion due to scalar quantization of reflection coefficients compared to uniform sampling in the reflection coefficient domain.
Claims (6)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US3779393A | 1993-03-26 | 1993-03-26 | |
| US037793 | 1993-03-26 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN94190277A Division CN1051392C (en) | 1993-03-26 | 1994-03-07 | vector quantizer method |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1166019A CN1166019A (en) | 1997-11-26 |
| CN1150516C true CN1150516C (en) | 2004-05-19 |
Family
ID=21896370
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN94190277A Expired - Lifetime CN1051392C (en) | 1993-03-26 | 1994-03-07 | vector quantizer method |
| CNB971034516A Expired - Lifetime CN1150516C (en) | 1993-03-26 | 1997-03-12 | Voice coding method and voice coder |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN94190277A Expired - Lifetime CN1051392C (en) | 1993-03-26 | 1994-03-07 | vector quantizer method |
Country Status (12)
| Country | Link |
|---|---|
| US (2) | US5826224A (en) |
| JP (1) | JP3042886B2 (en) |
| CN (2) | CN1051392C (en) |
| AU (2) | AU668817B2 (en) |
| BR (1) | BR9404725A (en) |
| CA (1) | CA2135629C (en) |
| DE (2) | DE4492048T1 (en) |
| FR (1) | FR2706064B1 (en) |
| GB (2) | GB2282943B (en) |
| SE (2) | SE518319C2 (en) |
| SG (1) | SG47025A1 (en) |
| WO (1) | WO1994023426A1 (en) |
Families Citing this family (39)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6006174A (en) * | 1990-10-03 | 1999-12-21 | Interdigital Technology Coporation | Multiple impulse excitation speech encoder and decoder |
| IT1277194B1 (en) * | 1995-06-28 | 1997-11-05 | Alcatel Italia | METHOD AND RELATED APPARATUS FOR THE CODING AND DECODING OF A CHAMPIONSHIP VOICE SIGNAL |
| FR2738383B1 (en) * | 1995-09-05 | 1997-10-03 | Thomson Csf | METHOD FOR VECTOR QUANTIFICATION OF LOW FLOW VOCODERS |
| JP3680380B2 (en) * | 1995-10-26 | 2005-08-10 | ソニー株式会社 | Speech coding method and apparatus |
| TW307960B (en) * | 1996-02-15 | 1997-06-11 | Philips Electronics Nv | Reduced complexity signal transmission system |
| JP2914305B2 (en) * | 1996-07-10 | 1999-06-28 | 日本電気株式会社 | Vector quantizer |
| FI114248B (en) * | 1997-03-14 | 2004-09-15 | Nokia Corp | Method and apparatus for audio coding and audio decoding |
| US6826524B1 (en) | 1998-01-08 | 2004-11-30 | Purdue Research Foundation | Sample-adaptive product quantization |
| US6453289B1 (en) | 1998-07-24 | 2002-09-17 | Hughes Electronics Corporation | Method of noise reduction for speech codecs |
| IL129752A (en) | 1999-05-04 | 2003-01-12 | Eci Telecom Ltd | Telecommunication method and system for using same |
| GB2352949A (en) * | 1999-08-02 | 2001-02-07 | Motorola Ltd | Speech coder for communications unit |
| US6910007B2 (en) * | 2000-05-31 | 2005-06-21 | At&T Corp | Stochastic modeling of spectral adjustment for high quality pitch modification |
| JP2002032096A (en) * | 2000-07-18 | 2002-01-31 | Matsushita Electric Ind Co Ltd | Noise section / speech section determination device |
| US7171355B1 (en) * | 2000-10-25 | 2007-01-30 | Broadcom Corporation | Method and apparatus for one-stage and two-stage noise feedback coding of speech and audio signals |
| CA2733453C (en) * | 2000-11-30 | 2014-10-14 | Panasonic Corporation | Lpc vector quantization apparatus |
| JP4857468B2 (en) * | 2001-01-25 | 2012-01-18 | ソニー株式会社 | Data processing apparatus, data processing method, program, and recording medium |
| US7003454B2 (en) * | 2001-05-16 | 2006-02-21 | Nokia Corporation | Method and system for line spectral frequency vector quantization in speech codec |
| US6584437B2 (en) * | 2001-06-11 | 2003-06-24 | Nokia Mobile Phones Ltd. | Method and apparatus for coding successive pitch periods in speech signal |
| US7110942B2 (en) * | 2001-08-14 | 2006-09-19 | Broadcom Corporation | Efficient excitation quantization in a noise feedback coding system using correlation techniques |
| US7206740B2 (en) * | 2002-01-04 | 2007-04-17 | Broadcom Corporation | Efficient excitation quantization in noise feedback coding with general noise shaping |
| EP1489599B1 (en) * | 2002-04-26 | 2016-05-11 | Panasonic Intellectual Property Corporation of America | Coding device and decoding device |
| CA2388358A1 (en) * | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for multi-rate lattice vector quantization |
| US7337110B2 (en) * | 2002-08-26 | 2008-02-26 | Motorola, Inc. | Structured VSELP codebook for low complexity search |
| US7047188B2 (en) * | 2002-11-08 | 2006-05-16 | Motorola, Inc. | Method and apparatus for improvement coding of the subframe gain in a speech coding system |
| US7054807B2 (en) * | 2002-11-08 | 2006-05-30 | Motorola, Inc. | Optimizing encoder for efficiently determining analysis-by-synthesis codebook-related parameters |
| US7272557B2 (en) * | 2003-05-01 | 2007-09-18 | Microsoft Corporation | Method and apparatus for quantizing model parameters |
| EP1673764B1 (en) * | 2003-10-10 | 2008-04-09 | Agency for Science, Technology and Research | Method for encoding a digital signal into a scalable bitstream, method for decoding a scalable bitstream |
| US8473286B2 (en) * | 2004-02-26 | 2013-06-25 | Broadcom Corporation | Noise feedback coding system and method for providing generalized noise shaping within a simple filter structure |
| US7697766B2 (en) * | 2005-03-17 | 2010-04-13 | Delphi Technologies, Inc. | System and method to determine awareness |
| JP4871894B2 (en) | 2007-03-02 | 2012-02-08 | パナソニック株式会社 | Encoding device, decoding device, encoding method, and decoding method |
| CN101030377B (en) * | 2007-04-13 | 2010-12-15 | 清华大学 | A Method of Improving the Quantization Accuracy of Vocoder Pitch Period Parameters |
| BRPI0915450B1 (en) * | 2008-07-10 | 2020-03-10 | Voiceage Corporation | Device and method for inversely quantizing and quantizing lpc filters in a superframe |
| US8363957B2 (en) * | 2009-08-06 | 2013-01-29 | Delphi Technologies, Inc. | Image classification system and method thereof |
| CN101968778A (en) * | 2010-08-13 | 2011-02-09 | 广州永日电梯有限公司 | Lattice serial display method |
| BR112014022848B1 (en) * | 2012-03-29 | 2021-07-20 | Telefonaktiebolaget Lm Ericsson (Publ) | METHOD FOR PEAK REGION ENCODING PERFORMED BY A TRANSFORM CODEC, TRANSFORM CODEC, MOBILE TERMINAL, AND, COMPUTER-READABLE STORAGE MEDIA |
| KR102745244B1 (en) * | 2014-03-28 | 2024-12-20 | 삼성전자주식회사 | Method and apparatus for quantizing linear predictive coding coefficients and method and apparatus for dequantizing linear predictive coding coefficients |
| ES2982894T3 (en) | 2014-05-07 | 2024-10-18 | Industry Univ Cooperation Foundationhanyang Univ Erica Campus | Device for quantifying the linear predictive coefficient |
| PH12017500352B1 (en) * | 2014-08-28 | 2022-07-06 | Nokia Technologies Oy | Audio parameter quantization |
| CN109887519B (en) * | 2019-03-14 | 2021-05-11 | 北京芯盾集团有限公司 | Method for improving voice channel data transmission accuracy |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4544919A (en) * | 1982-01-03 | 1985-10-01 | Motorola, Inc. | Method and means of determining coefficients for linear predictive coding |
| JPS59116698A (en) * | 1982-12-23 | 1984-07-05 | シャープ株式会社 | Voice data compression |
| US4896361A (en) * | 1988-01-07 | 1990-01-23 | Motorola, Inc. | Digital speech coder having improved vector excitation source |
| US4817157A (en) * | 1988-01-07 | 1989-03-28 | Motorola, Inc. | Digital speech coder having improved vector excitation source |
| DE3871369D1 (en) * | 1988-03-08 | 1992-06-25 | Ibm | METHOD AND DEVICE FOR SPEECH ENCODING WITH LOW DATA RATE. |
| DE3883519T2 (en) * | 1988-03-08 | 1994-03-17 | Ibm | Method and device for speech coding with multiple data rates. |
| JPH02250100A (en) * | 1989-03-24 | 1990-10-05 | Mitsubishi Electric Corp | Speech encoding device |
| US4974099A (en) * | 1989-06-21 | 1990-11-27 | International Mobile Machines Corporation | Communication signal compression system and method |
| US5012518A (en) * | 1989-07-26 | 1991-04-30 | Itt Corporation | Low-bit-rate speech coder using LPC data reduction processing |
| US4975956A (en) * | 1989-07-26 | 1990-12-04 | Itt Corporation | Low-bit-rate speech coder using LPC data reduction processing |
| US4963030A (en) * | 1989-11-29 | 1990-10-16 | California Institute Of Technology | Distributed-block vector quantization coder |
| JP2626223B2 (en) * | 1990-09-26 | 1997-07-02 | 日本電気株式会社 | Audio coding device |
| JP3129778B2 (en) * | 1991-08-30 | 2001-01-31 | 富士通株式会社 | Vector quantizer |
| US5307460A (en) * | 1992-02-14 | 1994-04-26 | Hughes Aircraft Company | Method and apparatus for determining the excitation signal in VSELP coders |
| US5351338A (en) * | 1992-07-06 | 1994-09-27 | Telefonaktiebolaget L M Ericsson | Time variable spectral analysis based on interpolation for speech coding |
-
1994
- 1994-03-07 DE DE4492048T patent/DE4492048T1/en active Pending
- 1994-03-07 AU AU63970/94A patent/AU668817B2/en not_active Expired
- 1994-03-07 WO PCT/US1994/002370 patent/WO1994023426A1/en not_active Ceased
- 1994-03-07 CA CA002135629A patent/CA2135629C/en not_active Expired - Lifetime
- 1994-03-07 GB GB9422823A patent/GB2282943B/en not_active Expired - Lifetime
- 1994-03-07 BR BR9404725A patent/BR9404725A/en not_active IP Right Cessation
- 1994-03-07 SG SG1996002558A patent/SG47025A1/en unknown
- 1994-03-07 DE DE4492048A patent/DE4492048C2/en not_active Expired - Lifetime
- 1994-03-07 CN CN94190277A patent/CN1051392C/en not_active Expired - Lifetime
- 1994-03-07 JP JP6522073A patent/JP3042886B2/en not_active Expired - Lifetime
- 1994-03-23 FR FR9403414A patent/FR2706064B1/en not_active Expired - Lifetime
- 1994-11-25 SE SE9404086A patent/SE518319C2/en not_active IP Right Cessation
-
1996
- 1996-02-29 US US08/609,027 patent/US5826224A/en not_active Expired - Lifetime
- 1996-03-08 US US08/611,608 patent/US5675702A/en not_active Expired - Lifetime
- 1996-08-01 AU AU60843/96A patent/AU678953B2/en not_active Expired
-
1997
- 1997-03-12 CN CNB971034516A patent/CN1150516C/en not_active Expired - Lifetime
-
1998
- 1998-02-12 GB GBGB9802900.2A patent/GB9802900D0/en active Pending
-
2002
- 2002-04-12 SE SE0201109A patent/SE524202C2/en not_active IP Right Cessation
Also Published As
| Publication number | Publication date |
|---|---|
| SE518319C2 (en) | 2002-09-24 |
| AU6397094A (en) | 1994-10-24 |
| JPH07507885A (en) | 1995-08-31 |
| AU678953B2 (en) | 1997-06-12 |
| AU668817B2 (en) | 1996-05-16 |
| SG47025A1 (en) | 1998-03-20 |
| GB9802900D0 (en) | 1998-04-08 |
| SE9404086D0 (en) | 1994-11-25 |
| CA2135629C (en) | 2000-02-08 |
| SE0201109D0 (en) | 2002-04-12 |
| CN1166019A (en) | 1997-11-26 |
| DE4492048C2 (en) | 1997-01-02 |
| FR2706064B1 (en) | 1997-06-27 |
| US5675702A (en) | 1997-10-07 |
| WO1994023426A1 (en) | 1994-10-13 |
| CN1109697A (en) | 1995-10-04 |
| AU6084396A (en) | 1996-10-10 |
| DE4492048T1 (en) | 1995-04-27 |
| JP3042886B2 (en) | 2000-05-22 |
| CA2135629A1 (en) | 1994-10-13 |
| GB9422823D0 (en) | 1995-01-04 |
| FR2706064A1 (en) | 1994-12-09 |
| GB2282943B (en) | 1998-06-03 |
| SE524202C2 (en) | 2004-07-06 |
| US5826224A (en) | 1998-10-20 |
| CN1051392C (en) | 2000-04-12 |
| SE9404086L (en) | 1995-01-25 |
| SE0201109L (en) | 2002-04-12 |
| GB2282943A (en) | 1995-04-19 |
| BR9404725A (en) | 1999-06-15 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN1150516C (en) | Voice coding method and voice coder | |
| CN1158648C (en) | Method and apparatus for variable rate speech coding | |
| CN1241170C (en) | Method and system for line spectral frequency vector quantization in speech codec | |
| US6393390B1 (en) | LPAS speech coder using vector quantized, multi-codebook, multi-tap pitch predictor and optimized ternary source excitation codebook derivation | |
| CN1198262C (en) | Method and apparatus for searching in an algebraic codebook to encode a sound signal | |
| CN1121683C (en) | Speech coding | |
| CN1202514C (en) | Method for encoding and decoding speech and its parameters, encoder, decoder | |
| CN1820306A (en) | Method and device for gain quantization in variable bit rate wideband speech coding | |
| CN1347550A (en) | CELP transcoding | |
| CN1151492C (en) | Synthesis-Analysis of Gain Quantization Methods in Linear Predictive Speech Coding | |
| CN1708907A (en) | Method and apparatus for fast CELP parameter mapping | |
| CN1145143C (en) | Speech coding method of synthesis analysis | |
| CN1159691A (en) | Method for linear predictive analyzing audio signals | |
| CN1152164A (en) | Code excitation linear predictive coding device | |
| CN1173938A (en) | Comprehensive Analysis of Speech Coding Methods | |
| CN1139912C (en) | CELP voice encoder | |
| CN1173940A (en) | Comprehensive Analysis of Speech Coding Methods | |
| CN1325529A (en) | Adaptive criterion for speech coding | |
| CN1751338A (en) | Method and apparatus for speech coding | |
| CN1145925C (en) | Transmitter with improved speech encoder and decoder | |
| CN1496556A (en) | Sound encoding device and method and sound decoding device and method | |
| CN1711589A (en) | Method and apparatus for coding gain information in a speech coding system | |
| CN1218296C (en) | Pitch cycle search range setting device | |
| CN1124590C (en) | Method for improving performance of voice coder | |
| Sadek et al. | An enhanced variable bit-rate CELP speech coder |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C06 | Publication | ||
| PB01 | Publication | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| ASS | Succession or assignment of patent right |
Owner name: RESEARCH IN MOTION LTD. Free format text: FORMER OWNER: MOTOROLA INC. Effective date: 20110212 |
|
| C41 | Transfer of patent application or patent right or utility model | ||
| COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: ILLINOIS, USA TO: ONTARIO, CANADA |
|
| TR01 | Transfer of patent right |
Effective date of registration: 20110212 Address after: Ontario, Canada Patentee after: Research In Motion Ltd. Address before: Illinois Instrunment Patentee before: Motorola Inc. |
|
| C17 | Cessation of patent right | ||
| CX01 | Expiry of patent term |
Expiration termination date: 20140307 Granted publication date: 20040519 |