JPH08147000A

JPH08147000A - System for encoding and decoding by vector quantization

Info

Publication number: JPH08147000A
Application number: JP6309855A
Authority: JP
Inventors: Akitoshi Saito; 彰利斉藤
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 1994-11-18
Filing date: 1994-11-18
Publication date: 1996-06-07
Anticipated expiration: 2018-01-07
Also published as: JP3362534B2

Abstract

PURPOSE: To prevent an error in vector quantization from increasing extremely even in the case where a speaker is changed in a learning vector quantization which updates the contents of a code book so that they are specialized to be adapted to a specific speaker. CONSTITUTION: The system for encoding and decoding by the learning vector quantization which adaptively updates the contents of the code book 6 on the basis of the result of matching between a vector to be quantized and a code vector detects a change of feature parameters of a speech signal by a speaker change detection part 12 and judges that the speaker changes when the change is detected, thereby setting an initialization flag. A rewriting control part 8 initializes the contents of the code book 6 with an initialization code book 7 holding the contents of the contents of the code book 6 in its initial state once the initialization flag is set. Consequently, the contents of the code book after the speaker changes are initialized into averaged contents for an unspecified speaker and the vector quantization error is prevented from increasing.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、コートブックを用い
て音声や画像等を高能率に圧縮して伝送又は記録するた
めのベクトル量子化による符号化復号方式に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an encoding / decoding system by vector quantization for highly efficiently compressing voices, images and the like for transmission or recording using a coatbook.

【０００２】[0002]

【従来の技術】従来より、音声や画像を高能率に圧縮し
て伝送又は記録するためのベクトル量子化が知られてい
る。ベクトル量子化では、量子化すべき信号波形を一定
区間について切り出し、この区間の波形パターンを１つ
の符号（インデックス）で表現する。このため予め種々
の波形パターンを蓄えておき、各波形パターンにインデ
ックスを付加しておく。種々の波形パターンは、サンプ
ル数の次元数で表現されたベクトル情報であり、これら
のベクトル情報をコードベクトル、これらのコードベク
トルとインデックスとの対応表をコードブックと呼ぶ。2. Description of the Related Art Conventionally, vector quantization has been known for compressing voices and images with high efficiency for transmission or recording. In vector quantization, a signal waveform to be quantized is cut out for a certain section, and the waveform pattern of this section is expressed by one code (index). Therefore, various waveform patterns are stored in advance and an index is added to each waveform pattern. Various waveform patterns are vector information expressed by the number of dimensions of the number of samples. These vector information are called code vectors, and the correspondence table between these code vectors and indexes is called a codebook.

【０００３】例えば音声の符号化方式の一つとして知ら
れているＣＥＬＰ（"Code-ExcitedLinear Prediction :
High Quality Speech at Very low Bit Rate" M.R.Sch
roeder, B.S.Atal ICASSP,1985 ）では、音声の予測残
差成分をサブフレームと呼ばれる４０サンプル程のデー
タに分割し、この各サブフレームに対してベクトル量子
化を適用している。しかし、このベクトル量子化に使用
されるコードブックは、多数の話者の平均的なものとな
るので、特定の個人向けには最適化されていない。この
ため、量子化されるベクトルとコードブックのベクトル
との誤差が大きく、良質の音声を伝送することができな
い。コードブックとして蓄えられるコードベクトルの数
が多ければ、音質は向上するが、反面、コードブックの
量が大きくなりコストアップになると共に、インデック
スの情報量も増すために伝送量も増す。また、コードブ
ックのサーチに時間がかかるという問題もある。[0003] For example, CELP ("Code-Excited Linear Prediction:") which is known as one of audio coding systems.
High Quality Speech at Very low Bit Rate "MRSch
Roeder, BSAtal ICASSP, 1985) divides a prediction residual component of speech into data of about 40 samples called subframes, and applies vector quantization to each subframe. However, the codebook used for this vector quantization is not optimized for any particular individual as it will be average of many speakers. For this reason, the error between the quantized vector and the codebook vector is large, and good quality speech cannot be transmitted. If the number of code vectors stored as a codebook is large, the sound quality is improved, but on the other hand, the amount of codebook is large and the cost is increased, and the information amount of the index is also increased, so that the transmission amount is also increased. There is also a problem that it takes time to search the codebook.

【０００４】そこで、少ないコードブックの情報量で量
子化歪の発生を極力抑えるようにした学習ベクトル量子
化器（ＬＶＱ）が、特開平２−１８６８３６号に開示さ
れている。この学習ベクトル量子化器は、ベクトル量子
化時の最小歪が所定値よりも大きい場合に、その被量子
化ベクトルを復号装置側にも送信して、コードブック中
の最も過去に選択された量子化代表ベクトル（コードベ
クトル）と書き換えることにより、コードブックの内容
を被量子化ベクトルの特性に応じて適応的に更新するも
のである。Therefore, a learning vector quantizer (LVQ) that suppresses the occurrence of quantization distortion with a small amount of codebook information is disclosed in Japanese Patent Laid-Open No. 2-1886836. This learning vector quantizer also transmits the quantized vector to the decoding device side when the minimum distortion at the time of vector quantization is larger than a predetermined value, and selects the earliest selected quantum in the codebook. By rewriting the coded representative vector (code vector), the contents of the codebook are adaptively updated according to the characteristics of the quantized vector.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上述し
た従来の学習ベクトル量子化による符号化復号装置で
は、平均的な多くの話者にまんべんなく合致するように
作成された元のコードブックを、特定の話者に特化して
いくように順次更新していくように構成されている。従
って、話者が変わった場合、交替前の話者に特化されて
いたコードブックが災いし、ベクトル量子化誤差が増大
し、復号音声の悪化をもたらすという問題がある。However, in the above-mentioned conventional coding / decoding apparatus for learning vector quantization, the original codebook created so as to be evenly matched with many average speakers is specified. It is configured to be updated sequentially so as to be specialized for the speaker. Therefore, when the speaker changes, there is a problem in that the codebook specialized for the speaker before the replacement is damaged, the vector quantization error increases, and the decoded speech deteriorates.

【０００６】この発明は、このような問題点に鑑みなさ
れたもので、話者に特化していくようにコードブックの
内容を更新していく学習ベクトル量子化において、話者
が変化した場合でも、ベクトル量子化誤差が極端に増大
することがないベクトル量子化による符号化復号方式を
提供することを目的とする。The present invention has been made in view of such a problem, and in the learning vector quantization for updating the contents of the codebook so as to be specialized for the speaker, even when the speaker changes. An object of the present invention is to provide a coding / decoding method by vector quantization in which the vector quantization error does not increase extremely.

【０００７】[0007]

【課題を解決するための手段】この発明に係るベクトル
量子化による符号化復号方式は、所定区間内のいくつか
の代表的な信号パターンを示すコードベクトルとそれら
のコードベクトルをそれぞれ特定するインデックスとの
対応表を記憶してなるコードブックを、符号化装置及び
復号装置の双方に設け、前記符号化装置では、符号化対
象信号に基づく量子化すべき被量子化ベクトルと最もマ
ッチングする前記コードブック内のコードベクトルを選
択してそのインデックスを出力し、前記復号装置では、
前記符号化装置側で得られたインデックスに対応するコ
ードベクトルを前記コードブックを参照して求めること
により前記被量子化ベクトルを復号すると共に、前記符
号化装置での前記被量子化ベクトルと前記コードベクト
ルとのマッチング結果に基づいて前記コードブックの内
容を適応的に更新するコードブック書換手段を前記符号
化装置及び復号装置の双方に設けたベクトル量子化によ
る符号化復号方式において、初期状態の前記コードブッ
クの内容を保持した初期化コードブックを前記符号化装
置及び復号装置の双方に設けると共に、前記符号化装置
は、前記符号化対象信号の特徴パラメータを抽出する特
徴抽出手段と、この特徴抽出手段で抽出された特徴パラ
メータの変化を検出する特徴変化検出手段とを備え、前
記コードブック書換手段は、前記特徴変化検出手段で前
記特徴パラメータの変化が検出された場合には、前記コ
ードブックの内容を前記初期化コードブックの内容に初
期化するものであることを特徴とする。A coding / decoding method by vector quantization according to the present invention includes a code vector indicating some typical signal patterns in a predetermined section and an index for specifying each of the code vectors. A codebook storing the correspondence table of is provided in both the encoding device and the decoding device, and in the encoding device, the codebook that best matches the quantized vector to be quantized based on the encoding target signal , And outputs the index, and in the decoding device,
While decoding the quantized vector by obtaining the code vector corresponding to the index obtained on the encoding device side with reference to the codebook, the quantized vector and the code in the encoding device In a coding / decoding method by vector quantization provided in both the coding apparatus and the decoding apparatus, codebook rewriting means for adaptively updating the contents of the codebook based on the result of matching with a vector, in the initial state, An initialization codebook holding the contents of the codebook is provided in both the encoding device and the decoding device, and the encoding device extracts the characteristic parameter of the signal to be encoded, and the feature extraction means. And a feature change detecting means for detecting a change in the feature parameter extracted by the means. Means, when said change in said characteristic parameters characteristic change detecting means is detected, characterized in that said that the contents of the codebook intended to initialize the contents of said initial codebook.

【０００８】また、この発明の更に好ましい態様によれ
ば、前記コードブックの初期化に先立って前記コードブ
ックの内容及び特徴パラメータを一時的に記憶する一時
記憶手段を更に備え、前記コードブック書換手段は、前
記特徴変化検出手段で前記特徴パラメータの変化が検出
された場合に、先ず前記特徴抽出手段で抽出された特徴
パラメータと前記一時記憶手段に記憶された特徴パラメ
ータとを比較して、両者が近似している場合には、前記
コードブックの内容を前記一時記憶手段に記憶されたコ
ードブックの内容に書換え、前記両特徴パラメータが近
似していない場合には、前記コードブックの内容を前記
一時記憶手段に退避したのちに前記初期化コードブック
の内容に初期化するものであることを特徴とする。Further, according to a further preferred aspect of the present invention, further comprising a temporary storage means for temporarily storing the contents and characteristic parameters of the codebook prior to the initialization of the codebook, the codebook rewriting means. When a change in the characteristic parameter is detected by the characteristic change detection means, first, the characteristic parameter extracted by the characteristic extraction means is compared with the characteristic parameter stored in the temporary storage means, and both are compared. If they are similar, the contents of the codebook are rewritten to the contents of the codebook stored in the temporary storage means, and if the feature parameters are not similar, the contents of the codebook are temporarily changed. The contents are stored in the storage means and then initialized to the contents of the initialization codebook.

【０００９】なお、前記符号化装置は、コードブックの
内容が初期化コードブックの内容に書き換えられたこと
を示す初期化情報を前記復号装置へ供給し、前記復号装
置は、供給された前記初期化情報に基づいてコードブッ
クの内容を初期化コードブックの内容に初期化するもの
でもよいし、前記符号化装置は、出力する符号化結果を
一旦復号し、この復号結果に対して前記特徴抽出手段に
よる特徴パラメータの抽出を実行し、抽出された特徴パ
ラメータの変化が検出されたときに前記コードブックの
内容を初期化コードブックの内容に初期化するものであ
り、前記復号装置は、入力された符号化結果を復号し、
この復号結果に対して前記特徴抽出手段による特徴パラ
メータの抽出を実行し、抽出された特徴パラメータの変
化が検出されたときに前記コードブックの内容を初期化
コードブックの内容に初期化するものでもよい。The encoding device supplies initialization information indicating that the contents of the codebook are rewritten to the contents of the initialization codebook to the decoding device, and the decoding device supplies the supplied initial information. The contents of the codebook may be initialized to the contents of the initialization codebook based on the encoding information, and the encoding device once decodes the encoding result to be output, and the feature extraction is performed on the decoding result. Extracting the characteristic parameters by means, and initializing the contents of the codebook to the contents of the initialization codebook when a change in the extracted characteristic parameters is detected. Decoding the encoded result,
A feature parameter may be extracted by the feature extracting means with respect to this decoding result, and the contents of the codebook may be initialized to the contents of the initialization codebook when a change in the extracted feature parameter is detected. Good.

【００１０】また、前記符号化対象信号が音声信号であ
る場合、前記特徴変化検出手段は、前記特徴抽出手段か
ら出力される前記音声信号の有音／無音の変化、ピッチ
変化、特徴係数の変化、スペクトラム包絡線情報の変
化、及びゲイン変化、並びに前記符号化装置内での符号
化の過程で出力される前記音声信号の有音／無音の変
化、ピッチ変化、特徴係数の変化、ゲイン変化、及びベ
クトル量子化誤差の変化の少なくとも一つに基づいて前
記コードブック書換手段を制御するものでよい。When the signal to be encoded is a voice signal, the feature change detection means changes the presence / absence of voice, the pitch change, and the feature coefficient of the voice signal output from the feature extraction means. , A change in spectrum envelope information, a change in gain, a change in voiced / silent of the voice signal output in the process of encoding in the encoding device, a change in pitch, a change in characteristic coefficient, a change in gain, And the codebook rewriting means may be controlled based on at least one of the changes in the vector quantization error.

【００１１】[0011]

【作用】この発明によれば、被量子化ベクトとコードベ
クトルとのマッチング結果に基づいてコードブックの内
容を適応的に更新する学習ベクトル量子化による符号化
復号方式において、符号化対象信号の特徴パラメータの
変化を検出し、この変化が検出された場合には話者が変
わったものと判断して、コードブックの内容を初期化す
るようにしているので、話者交替後のコードブックの内
容を不特定話者向けの平均的な内容とすることができ、
ベクトル量子化誤差の増大を防ぐことができる。According to the present invention, in the encoding / decoding system by the learning vector quantization for adaptively updating the contents of the codebook based on the matching result of the vector to be quantized and the code vector, the characteristics of the signal to be encoded are Changes in parameters are detected, and if this change is detected, it is determined that the speaker has changed, and the contents of the codebook are initialized. Can be average content for unspecified speakers,
It is possible to prevent an increase in vector quantization error.

【００１２】なお、電話においては、ほんの少しの間だ
け別の話者が電話に出たり、一時的にメロディー等の別
の音が入ることが良くある。このような場合、話者が変
化したことを検出して、その度にコードブックを初期化
するのは効率的でない。そこで、コードブックの初期化
に先立って、コードブックの内容及び特徴パラメータを
一時的に記憶しておく一時記憶手段を設け、特徴パラメ
ータの変化が検出された場合には、まず、その特徴パラ
メータと一時記憶手段に退避してある特徴パラメータと
を比較する。そして、両者が近似している場合には、前
回の話者に戻ったと判断して、コードブックの内容を一
時記憶手段に記憶された内容に書き換え、近似していな
い場合には、全く別人であるとしてコードブックを初期
化する。これにより、一時的に別の話者が電話に出たよ
うな場合には、前回の学習結果が反映されたコードブッ
クを再び使用することができ、正確なベクトル量子化を
図ることができる。In a telephone, another speaker often answers the telephone for a little while, or another sound such as a melody is temporarily input. In such a case, it is not efficient to detect that the speaker has changed and initialize the codebook each time. Therefore, prior to the initialization of the codebook, a temporary storage means for temporarily storing the contents of the codebook and the characteristic parameters is provided, and when a change in the characteristic parameters is detected, first, the characteristic parameters and The characteristic parameters saved in the temporary storage means are compared. If they are close to each other, it is determined that the previous speaker has returned, and the contents of the codebook are rewritten to the contents stored in the temporary storage means. Initialize the codebook as if there were. As a result, when another speaker answers the telephone temporarily, the codebook in which the previous learning result is reflected can be used again, and accurate vector quantization can be achieved.

【００１３】また、コードブックの初期化は、符号化装
置と復号装置の双方で行われる必要があるが、符号化装
置から復号装置へ初期化情報を供給することにより、符
号化装置に同期した復号装置でのコードブックの初期化
が可能である。更に、符号化装置と復号装置とで同一の
アルゴリズムに従って、コードブックの初期化を行うよ
うに設定すると、上記のような初期化情報を復号装置側
に供給しなくても、符号化装置と復号装置とで初期化タ
イミングの同期をとることができる。例えば、符号化装
置内では、符号化結果を一旦復号し（局部復号）、この
復号結果に対して特徴抽出を行ってその変化を検出す
る。また、復号装置でも、受信された符号化結果を復号
し、この復号結果に対して特徴抽出を行ってその変化を
検出する。このように、同一の情報に対して同一のアル
ゴリズムを適用することにより、符号化装置と復号装置
とで同期したコードブックの初期化処理が可能になる。Further, the initialization of the codebook needs to be performed by both the encoding device and the decoding device, but by supplying the initialization information from the encoding device to the decoding device, it is synchronized with the encoding device. The codebook can be initialized in the decoding device. Furthermore, if the encoding device and the decoding device are set to perform the initialization of the codebook according to the same algorithm, the encoding device and the decoding device can be decoded without supplying the above initialization information to the decoding device side. The initialization timing can be synchronized with the device. For example, in the encoding device, the encoded result is once decoded (local decoding), and the feature is extracted from the decoded result to detect the change. The decoding device also decodes the received coding result, performs feature extraction on the decoding result, and detects the change. In this way, by applying the same algorithm to the same information, it is possible to perform a codebook initialization process that is synchronized between the encoding device and the decoding device.

【００１４】[0014]

【実施例】以下、図面を参照して、この発明の実施例を
説明する。図１は、この発明の一実施例に係るＣＥＬＰ
方式によるベクトル量子化を含む音声信号の符号化装
置、図２は同じく復号化装置を示すブロック図である。
図１において、符号化装置に入力される符号化対象信号
である音声信号は、Ａ／Ｄ変換器１でＡ／Ｄ変換された
のち、長期予測器２及び特徴抽出部１１に供給される。
長期予測器２では、入力されたディジタル音声信号を、
例えば１６０サンプル毎のフレームに分割し、各フレー
ム毎に分析して、パーコール係数、ゲイン、ピッチ等の
長期予測係数を多重化部３に出力する。また、長期予測
器２は、入力されたディジタル音声信号から長期予測係
数によって線形予測された信号を差し引いて長期予測残
差成分を求め、これをフレーム分割部４に出力する。フ
レーム分割部４では、フレームタイマを用いて、この１
６０サンプルの長期予測残差成分を、例えば４０サンプ
ル毎のサブフレームに分割し、４０次元の予測残差ベク
トルとしてベクトル量子化部５に出力する。ＣＥＬＰで
は、この予測残差ベクトルがベクトル量子化すべき被量
子化ベクトルとなる。ベクトル量子化部５は、予測残差
ベクトルをベクトルの大きさが１となるように正規化す
ると共に、正規化後の予測残差ベクトルと最も距離的に
近いコードベクトルをコードブック６から検索し、その
インデックスＩＤＸと正規化係数とを多重化部３に出力
する。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 shows a CELP according to an embodiment of the present invention.
FIG. 2 is a block diagram showing a speech signal coding apparatus including vector quantization according to the method, and FIG. 2 also showing a decoding apparatus.
In FIG. 1, a speech signal, which is a signal to be encoded and is input to the encoding device, is A / D converted by the A / D converter 1 and then supplied to the long-term predictor 2 and the feature extraction unit 11.
The long-term predictor 2 converts the input digital voice signal into
For example, it is divided into frames of 160 samples and analyzed for each frame, and long-term prediction coefficients such as percoll coefficients, gains, and pitches are output to the multiplexing unit 3. Further, the long-term predictor 2 subtracts the signal linearly predicted by the long-term prediction coefficient from the input digital audio signal to obtain a long-term prediction residual component, and outputs this to the frame division unit 4. The frame division unit 4 uses the frame timer to
The long-term prediction residual component of 60 samples is divided into, for example, 40-sample subframes, and is output to the vector quantization unit 5 as a 40-dimensional prediction residual vector. In CELP, this prediction residual vector becomes a quantized vector to be vector-quantized. The vector quantizer 5 normalizes the prediction residual vector so that the magnitude of the vector becomes 1, and searches the codebook 6 for the code vector closest in distance to the normalized prediction residual vector. , The index IDX and the normalization coefficient are output to the multiplexing unit 3.

【００１５】また、この符号化装置には、コードブック
６の他に、コードブック６を初期化する際の内容を保持
した初期化コードブック７が設けられている。コードブ
ック６は、書換制御部８によって順次その内容を更新さ
れる。このため、コードブック６は、書き換え可能なＲ
ＡＭ等のメモリから構成される。これに対し、初期化コ
ードブック７の内容は固定的なものであるため、ＲＯＭ
等のメモリにより構成される。Further, in addition to the codebook 6, this coding apparatus is provided with an initialization codebook 7 which holds the contents when the codebook 6 is initialized. The contents of the codebook 6 are sequentially updated by the rewriting control unit 8. Therefore, the codebook 6 is a rewritable R
It is composed of a memory such as AM. On the other hand, since the contents of the initialization codebook 7 are fixed, the ROM
And the like.

【００１６】ベクトル量子化部５に入力される予測残差
ベクトルと、ベクトル量子化によって選択されたコード
ベクトルとの誤差は、減算器９によって計算される。減
算器９からの誤差ベクトルは、誤差量検出部１０に与え
られている。誤差量検出部１０は、誤差ベクトルの大き
さが所定値よりも大きいときには、誤差送信フラグ
“１”にして、このフラグと誤差ベクトルとを更新情報
として書換制御部８及び多重化部３に供給する。これに
より、書換制御部８が制御され、コードブック６の内容
が更新される。また、誤差量検出部１０は、誤差ベクト
ルの大きさが所定量よりも小さいときには、誤差送信フ
ラグを“０”にして、これを書換制御部８及び多重化部
３に供給する。An error between the prediction residual vector input to the vector quantizer 5 and the code vector selected by vector quantization is calculated by the subtractor 9. The error vector from the subtractor 9 is given to the error amount detector 10. When the magnitude of the error vector is larger than the predetermined value, the error amount detection unit 10 sets the error transmission flag “1” and supplies this flag and the error vector as update information to the rewrite control unit 8 and the multiplexing unit 3. To do. As a result, the rewrite control unit 8 is controlled and the content of the codebook 6 is updated. Further, when the magnitude of the error vector is smaller than the predetermined amount, the error amount detection unit 10 sets the error transmission flag to “0” and supplies it to the rewrite control unit 8 and the multiplexing unit 3.

【００１７】また、音声特徴抽出部１１に供給された音
声信号は、音声特徴抽出部１１によって特徴パラメータ
を抽出される。この特徴パラメータは、音声信号の有音
／無音の変化、ピッチ変化、特徴係数の変化、スペクト
ラム包絡線情報の変化、及びゲイン変化等、話者に特有
の情報である。これらの特徴パラメータは、話者変化検
出部１２に供給されている。話者変化検出部１２は、特
徴パラメータの変化を検出すると、話者が変化したと認
識して初期化フラグを“１”にする。この初期化フラグ
も書換制御部８に供給されており、書換制御部８は、初
期化フラグが“１”のときに、コードブック６の内容を
初期化コードブック７の内容に初期化する。多重化部３
は、長期予測係数、ＩＤＸ、その正規化係数、更新情報
及び初期化フラグを符号化及び多重化して伝送路に送信
又は記録媒体に記録する。Further, the voice signal supplied to the voice feature extraction unit 11 is extracted with a feature parameter by the voice feature extraction unit 11. This characteristic parameter is information peculiar to the speaker such as a change in voiced / non-voiced sound signal, a change in pitch, a change in characteristic coefficient, a change in spectrum envelope information, and a change in gain. These characteristic parameters are supplied to the speaker change detection unit 12. When the change in the characteristic parameter is detected, the speaker change detection unit 12 recognizes that the speaker has changed and sets the initialization flag to "1". This initialization flag is also supplied to the rewrite control unit 8, and the rewrite control unit 8 initializes the contents of the codebook 6 to the contents of the initialization codebook 7 when the initialization flag is “1”. Multiplexing unit 3
Encodes and multiplexes the long-term prediction coefficient, IDX, its normalization coefficient, update information, and the initialization flag, and transmits them to the transmission path or records them on a recording medium.

【００１８】一方、復号化装置側では、図２に示すよう
に、受信又は読み出した信号は、逆多重化部２１に供給
され、逆符号化・逆多重化されることにより、長期予測
係数及び正規化係数と、インデックスＩＤＸと、更新情
報と、初期化フラグとに分割される。長期予測係数及び
正規化係数は、長期予測合成部２２に供給される。ま
た、復号化装置側にも符号化装置と同一内容のコードブ
ック２４及び初期化コードブック２５が備えられてお
り、ベクトル逆量子化部２３は、供給されたインデック
スＩＤＸからコードブック２４を参照することにより、
コードベクトルを選択する。これによりコードブック２
４から出力されるコードベクトルは、長期予測合成部２
２に供給される。長期予測合成部２２は、コードベクト
ルを正規化係数によって元の信号レベルに戻し、この信
号と長期予測係数から線形予測して求められた信号とを
合成する。この合成信号がＤ／Ａ変換器２６でＤ／Ａ変
換されて復号された音声信号として出力される。また、
逆多重化部２１で分離された更新情報及び初期化フラグ
は、書換制御部２７に供給されている。書換制御部２７
は、符号化装置の書換制御部８と同様の機能を有するも
ので、更新情報に基づいてコードブック２４の内容を更
新し、初期化フラグに基づいてコードブック２４の内容
を初期化コードブック２５の内容に初期化する。On the other hand, on the decoding device side, as shown in FIG. 2, the received or read signal is supplied to the demultiplexing unit 21 and is decoded / demultiplexed to obtain the long-term prediction coefficient and It is divided into a normalization coefficient, an index IDX, update information, and an initialization flag. The long-term prediction coefficient and the normalization coefficient are supplied to the long-term prediction synthesis unit 22. Further, the decoding device side is also provided with the codebook 24 and the initialization codebook 25 having the same contents as the encoding device, and the vector dequantization unit 23 refers to the codebook 24 from the supplied index IDX. By
Select a code vector. Codebook 2
The code vector output from 4 is the long-term prediction synthesis unit 2
2 is supplied. The long-term prediction synthesizing unit 22 restores the code vector to the original signal level by the normalization coefficient, and synthesizes this signal and the signal obtained by linear prediction from the long-term prediction coefficient. This synthesized signal is D / A converted by the D / A converter 26 and output as a decoded audio signal. Also,
The update information and the initialization flag separated by the demultiplexing unit 21 are supplied to the rewriting control unit 27. Rewriting control unit 27
Has the same function as the rewrite control unit 8 of the encoding device, updates the contents of the codebook 24 based on the update information, and initializes the contents of the codebook 24 based on the initialization flag. To the contents of.

【００１９】次に、このように構成された符号化復号シ
ステムの動作について説明する。音声が入力されると、
その特徴的な部分、即ち、パーコール係数、ピッチ、ゲ
イン等は、長期予測部２及び特徴抽出部１１により抽出
されることになるが、その残りの残差成分は、コードブ
ック６，２４により、符号化及び復号されることにな
る。図３は、コードブック６，２４の内容を示す図であ
る。コードブック６，２４には、初期化時には、不特定
のあらゆる話者に対応可能なように、多種の言語、人種
及び性別等から選択された例えば５１２種類の代表的な
予測残差成分のコードベクトルが予め格納されている。
また、コードブック６，２４には、各コードベクトルと
対応する形で、そのコードベクトルを特定するインデッ
クスＩＤＸが格納されている。また、この他、各コード
ベクトルの書換可否を示すフラグや、各コードベクトル
の選択優先度の重み付けの情報等を追加するようにして
もよい。Next, the operation of the encoding / decoding system configured as described above will be described. When voice is input,
The characteristic part, that is, the Percoll coefficient, the pitch, the gain, etc. are extracted by the long-term predicting unit 2 and the feature extracting unit 11, but the remaining residual components are It will be encoded and decoded. FIG. 3 is a diagram showing the contents of the code books 6 and 24. At the time of initialization, the codebooks 6 and 24 include, for example, 512 typical prediction residual components selected from various languages, races, genders, etc., so as to be compatible with all unspecified speakers. Code vectors are stored in advance.
Further, the codebooks 6 and 24 store the index IDX for identifying the code vector in a form corresponding to each code vector. In addition to this, a flag indicating whether or not each code vector can be rewritten, information regarding weighting of selection priority of each code vector, and the like may be added.

【００２０】コードベクトルの数が多ければ、量子化誤
差が少なくなり、高品質の符号化・復号化が実現できる
が、反面、メモリの記憶容量を増さなくてはならずコス
トアップとなると共に、サーチにも時間がかかる。ま
た、例えば５１２種類のコードベクトルであれば、イン
デックスＩＤＸとして９ビットの情報を伝送すれば良い
が、コードベクトルの種類が増えると、伝送すべきビッ
ト数も増える。If the number of code vectors is large, the quantization error is reduced, and high quality encoding / decoding can be realized, but on the other hand, the memory capacity must be increased and the cost is increased. , Search also takes time. Further, for example, if there are 512 types of code vectors, 9-bit information may be transmitted as the index IDX, but if the number of types of code vectors increases, the number of bits to be transmitted also increases.

【００２１】コードブック６，２４を構成するパターン
ベクトルを簡単のために２次元のベクトルとして２次元
平面に表現すると、図４（ａ）の黒丸のようになる。こ
こで、黒丸に併記された数値は各コードベクトルのイン
デックスである。いま、量子化すべき予測残差ベクトル
として図中“×”で示すベクトルＶi がベクトル量子化
部５に入力されると、このベクトルＶi はベクトルＶ12
に最も近いので、インデックス１２がベクトル量子化部
５での量子化結果として選択されることになる。When the pattern vectors forming the codebooks 6 and 24 are expressed as a two-dimensional vector on a two-dimensional plane for the sake of simplicity, a black circle in FIG. 4A is obtained. Here, the numerical value written in black circles is the index of each code vector. When a vector Vi indicated by "x" in the figure is input to the vector quantizer 5 as a prediction residual vector to be quantized, this vector Vi is converted into the vector V12.
Since the index 12 is closest to, the index 12 is selected as the quantization result in the vector quantization unit 5.

【００２２】ベクトル量子化部５に入力される残差成分
の傾向は話者毎に特有であるため、コードブック６，２
４は、学習によって内部のコードベクトルを最良の内容
に更新していく必要がある。そこで、符号化装置の誤差
量検出部１０において、特定のコードベクトルについて
の量子化誤差が連続的に一定値を越えたことが検出され
た場合に、コードブック６で選択されたコードベクトル
と、量子化誤差の平均値とを加算したベクトルを新たな
コードベクトルとし、元のコードベクトルと置き換える
ことにより、コードブック６の内容を更新する。また、
このとき、誤差送信フラグ“１”と共に前記量子化誤差
の平均値を復号装置側にも転送して、コードブック２４
の内容を更新する。図４（ｂ）は、同図（ａ）のコード
ベクトルＶ12と、その量子化誤差から新たに生成された
コードベクトルとを置き換えた例を示している。このと
き、新たなコードベクトルと所定時間当たりの最も選択
頻度の低いコードベクトルとを置き換えるようにしても
よい。この場合、ベクトル空間の外縁に存在するプリミ
ティブなベクトルについては、置き換えを禁止するフラ
グを設定するようにしてもよい。なお、コードブック２
４の更新処理を、このようなコードベクトルの書換とせ
ずに、各コードベクトルの重み付けを変化させる処理と
してもよい。Since the tendency of the residual component input to the vector quantizer 5 is peculiar to each speaker, the codebooks 6 and 2 are used.
In No. 4, it is necessary to update the internal code vector to the best contents by learning. Therefore, when the error amount detection unit 10 of the encoding device detects that the quantization error for a specific code vector continuously exceeds a certain value, the code vector selected by the codebook 6 and The contents of the codebook 6 are updated by replacing the original code vector with a vector obtained by adding the average value of the quantization error to a new code vector. Also,
At this time, the average value of the quantization error is transferred to the decoding device side together with the error transmission flag “1”, and the codebook 24
Update the contents of. FIG. 4B shows an example in which the code vector V12 shown in FIG. 4A is replaced with the code vector newly generated from the quantization error. At this time, a new code vector may be replaced with the code vector with the lowest selection frequency per predetermined time. In this case, a flag that prohibits replacement may be set for the primitive vector existing at the outer edge of the vector space. Codebook 2
The updating process of No. 4 may be a process of changing the weighting of each code vector without rewriting such a code vector.

【００２３】このようなコードブック６，２４の更新に
よってコードブックの内容を話者の特性に応じて適応的
に変化させることができるが、電源投入時や、話者が変
わって音声入力の特性が急変したような場合には、学習
されたコードブックの内容では、かえって都合が悪い場
合がある。この場合には、コードブック６，２４の内容
を当初の内容に初期化することが望ましい。そこで、音
声特徴抽出部１１で音声信号から話者の特徴パラメータ
を抽出し、話者変化検出部１２で話者の特徴パラメータ
が変化したことを検出する。話者の特徴パラメータとし
ては、次のものが考えられる。By updating the codebooks 6 and 24 as described above, the contents of the codebook can be adaptively changed according to the characteristics of the speaker. However, when the power is turned on or the speaker changes, the characteristics of voice input can be changed. In case of sudden change, the contents of the learned codebook may be rather inconvenient. In this case, it is desirable to initialize the contents of the codebooks 6 and 24 to the original contents. Therefore, the voice feature extraction unit 11 extracts the speaker's feature parameter from the voice signal, and the speaker change detection unit 12 detects that the speaker's feature parameter has changed. The following are possible speaker characteristic parameters.

【００２４】（ａ）有音／無音有音／無音の割合は話者毎に異なる。また、無音部が一
般の喋り言葉に含まれる平均よりも長く発生した場合に
は、話者が変化する可能性が高い。（ｂ）音声のピッチ音声のピッチは、声帯の振動周波数、ひいては声の高さ
であるので、その周波数の分布（時間平均、分散等）は
個人の特性を反映する。（ｃ）音声の特徴係数音声の特徴係数には、声道を全極型フィルタで表現した
パーコール係数（反射係数）があるが、これらの時間平
均、その分散等は話者毎に異なる。（ｄ）スペクトラム包絡線音声を周波数分析し、そのスペクトラム包絡線の形状、
その時間的変化率及び変化の分散についても（ｃ）と同
様の傾向がある。（ｅ）音声のゲイン音声のゲインは、話者のくせによって異なる。また、ゲ
インの変動率（分散）も、抑揚の付け方等に応じて変化
するので、個人判別に使用することができる。（ｆ）ベクトル量子化誤差第１話者に特化したコードブックができあがっていると
きに、話者が変化すると、当然、量子化誤差も増大す
る。従って、この量子化誤差の変化を監視することによ
っても話者変化を判別することができる。(A) Sound / Silence The ratio of sound / silence differs for each speaker. Further, when the silent portion occurs longer than the average included in the general speaking language, the speaker is likely to change. (B) Voice Pitch Since the voice pitch is the vibration frequency of the vocal cords, and by extension, the voice pitch, the frequency distribution (time average, variance, etc.) reflects individual characteristics. (C) Speech Feature Coefficients As speech feature coefficients, there is a Percoll coefficient (reflection coefficient) that represents the vocal tract with an all-pole filter, but the time average, variance, etc. thereof differ from speaker to speaker. (D) Spectrum envelope The frequency of the voice is analyzed and the shape of the spectrum envelope is
The temporal change rate and the change variance also have the same tendency as in (c). (E) Voice gain The voice gain varies depending on the habit of the speaker. Further, the rate of change (variance) of the gain also changes according to the method of applying intonation, etc., so that it can be used for individual discrimination. (F) Vector quantization error When the codebook specialized for the first speaker is completed, if the speaker changes, the quantization error naturally increases. Therefore, the speaker change can also be discriminated by monitoring the change in the quantization error.

【００２５】なお、以上の各特徴パラメータは、特徴パ
ラメータ抽出部１１で抽出されたものでもよいし、ベク
トル量子化の過程で求められたものでもよい。例えば、
ＣＥＬＰ方式の符号化においては、ピッチ、パーコール
係数、ゲイン、有音／無音フラグ等を、ＣＥＬＰのアル
ゴリズム内で発生させるので、これを利用することがで
ある。また、スペクトル分析を行って、その包絡線情報
を用いて情報を圧縮する形式の符号化復号方式では、
（ｄ）のスペクトル包絡線パラメータをそのまま利用す
ることが可能である。The above-mentioned characteristic parameters may be those extracted by the characteristic parameter extraction unit 11 or those obtained in the process of vector quantization. For example,
In CELP encoding, pitch, Percoll coefficient, gain, voiced / non-voiced flag, etc. are generated in the CELP algorithm, which can be used. In addition, in a coding / decoding method of a format that performs spectrum analysis and compresses information using the envelope information,
It is possible to use the spectrum envelope parameter of (d) as it is.

【００２６】これらのパラメータが話者変化検出部１２
に入力されると、話者変化検出部１２は、先ず、話者の
話し方などに起因した量子化誤差等の一時的な増大や、
例えば口笛を吹いた場合等のピッチや特徴係数等の一時
的な変化を、話者の変化であると検出しないように、入
力パラメータにＬＰＦ（ローパス・フィルタ）等の前処
理を施す。そして、例えば以下のような判定式に基づい
て総合評価値ｅtotalを求める。These parameters are used as the speaker change detection unit 12
, The speaker change detection unit 12 firstly temporarily increases the quantization error and the like due to the way the speaker speaks,
For example, preprocessing such as LPF (low-pass filter) is applied to the input parameters so that a temporary change in pitch, characteristic coefficient, etc., such as when a whistle is blown, is not detected as a change in the speaker. Then, for example, the total evaluation value etotal is obtained based on the following determination formula.

【００２７】[0027]

【数１】ｅtotal ＝ｋa Ｐa ＋ｋb Ｐb ＋ｋc Ｐc ＋ｋ
d Ｐd ＋ｋe Ｐe ＋ｋf Ｐf Ｐa 〜Ｐf ：各パラメータｋa 〜ｋf ：各パラメータの評価の重み付け係数[Equation 1] e total = ka Pa + kb Pb + kc Pc + k
d Pd + ke Pe + kf Pf Pa to Pf: Each parameter ka to kf: Weighting coefficient for evaluation of each parameter

【００２８】このようにして求められた総合評価値ｅto
tal に対し、更に後処理としてＬＰＦをかけたのち、所
定のしきい値と比較して、しきい値を超えるようであれ
ば話者が変化したと判別すればよい。The total evaluation value eto thus obtained
After tal is further subjected to LPF as post-processing, it is compared with a predetermined threshold value, and if it exceeds the threshold value, it can be determined that the speaker has changed.

【００２９】話者変化検出部１２が話者の変化を検出し
た場合には、初期化フラグが“１”になり、書換制御部
８がコードブック６の内容を初期化コードブック７の内
容に初期化する。初期化フラグは、送信又は記録される
ので、復号装置においても、同様のコードブック２４の
初期化が行われることになる。このとき、コードブック
６，２４は、全ての内容が書き換えられるようにしても
よいが、書き換えの効率を考慮すると、例えば個人情報
のあまり入らないプリミティブなコードベクトルには、
書き換え禁止フラグ等を付与しておくことにより、書き
換えすべきデータ量を削減するようにしても良い。ま
た、初期化フラグと共に、例えばピッチ情報等の特徴パ
ラメータを書換制御部８に供給し、このパラメータを参
照して男声／女声判別を行う。そして、その判別結果に
基づいてコードブックの初期値を男声の典型例、又は女
声の典型例で初期化すれば、話者変化後のベクトル量子
化誤差を減少させることができる。When the speaker change detecting unit 12 detects a speaker change, the initialization flag becomes "1", and the rewriting control unit 8 changes the contents of the codebook 6 to the contents of the initialization codebook 7. initialize. Since the initialization flag is transmitted or recorded, the same initialization of the codebook 24 is performed also in the decoding device. At this time, all the contents of the codebooks 6 and 24 may be rewritten, but considering the efficiency of rewriting, for example, a primitive code vector that does not contain much personal information is
The amount of data to be rewritten may be reduced by adding a rewrite prohibition flag or the like. Further, together with the initialization flag, characteristic parameters such as pitch information are supplied to the rewriting control unit 8, and male / female voice discrimination is performed by referring to these parameters. Then, if the initial value of the codebook is initialized with a typical example of male voice or a typical example of female voice based on the determination result, the vector quantization error after the speaker change can be reduced.

【００３０】なお、前述したように、初期化コードブッ
ク７，２５をＲＯＭで構成して、全ての初期化データを
格納しておくようにしても良いが、初期化コードブック
７，２５を書換可能なメモリで構成して、書き換えられ
たベクトル情報のみを格納しておくようにしてもよい。
また、初期化コードブック７，２５に全初期化データが
格納されている場合でも、コードブック６，２４の書き
換えは、書き換えられたデータについてのみ行うことに
より、書き換え時間を短縮するようにしてもよい。As described above, the initialization codebooks 7 and 25 may be configured by ROM so as to store all the initialization data, but the initialization codebooks 7 and 25 are rewritten. It may be configured with a possible memory so that only the rewritten vector information is stored.
Even if all the initialization data are stored in the initialization codebooks 7 and 25, the rewriting of the codebooks 6 and 24 is performed only on the rewritten data, so that the rewriting time can be shortened. Good.

【００３１】図５は、この発明の他の実施例に係る符号
化復号方式における符号化装置を示すブロック図であ
る。この実施例のシステムが前述した実施例のシステム
と異なる点は、コードブック及び特徴パラメータの一時
退避用の一時記憶用メモリ３１と、その退避時間を計測
するタイマ３２とを設けた点にある。FIG. 5 is a block diagram showing an encoding apparatus in an encoding / decoding system according to another embodiment of the present invention. The system of this embodiment is different from the system of the above-described embodiment in that a temporary storage memory 31 for temporarily saving the codebook and the characteristic parameters and a timer 32 for measuring the saving time are provided.

【００３２】即ち、電話においては、「ちょっと代わる
ね」といって、ほんの少しの間だけ話者が代わることが
ある。そこで、初期化フラグが“１”になったら、書換
制御部８は、前話者の特徴パラメータと書き換え前のコ
ードブック６の内容とを一時記憶用メモリ３１に退避し
たのちに、タイマ３２を起動して、コードブック６の内
容を初期化する。タイマ３２を例えば数分に設定してお
き、タイムアウトが生ずる前に初期化フラグが“１”に
なったときには、一時記憶用メモリ３１に退避されてい
る特徴パラメータと、今回抽出された特徴パラメータと
を比較して、両者が近似している場合には、前の話者に
戻ったものと判断して、コードブック６に一時記憶用メ
モリ３１に退避されているコードブックの内容をコード
ブック６に書き戻す。話者が切り替わった際にタイムア
ウトが生じていた場合には、前の話者が話す確率は低い
と考え、コードブック６の内容及び特徴パラメータを退
避後、コードブック６を初期化する。また、話者が切り
替わったときに、今回の特徴パラメータと退避されてい
た特徴パラメータとが近似していない場合には、全く別
の話者に代わったものと判断してコードブック６の内容
及び特徴パラメータを退避後、コードブック６を初期化
する。パラメータが近似しているかどうかは、次のよう
な判別式を用いて判別すれば良い。In other words, on the telephone, the speaker may change for a little while, saying "I'm a little different". Therefore, when the initialization flag becomes “1”, the rewrite control unit 8 saves the characteristic parameters of the previous speaker and the contents of the codebook 6 before the rewriting in the temporary storage memory 31, and then sets the timer 32. It starts and initializes the contents of the codebook 6. The timer 32 is set to, for example, several minutes, and when the initialization flag becomes “1” before the timeout occurs, the feature parameter saved in the temporary storage memory 31 and the feature parameter extracted this time When the two are close to each other, it is determined that the previous speaker has been returned, and the contents of the codebook saved in the temporary storage memory 31 are stored in the codebook 6. Write back to. If a time-out occurs when the speaker is switched, the probability that the previous speaker speaks is considered to be low, and the contents and characteristic parameters of the codebook 6 are saved, and then the codebook 6 is initialized. Further, when the speaker is switched, if the feature parameter of this time and the saved feature parameter are not close to each other, it is determined that another speaker has been completely replaced, and the contents of the codebook 6 and After saving the characteristic parameters, the codebook 6 is initialized. Whether the parameters are close to each other can be determined by using the following discriminant equation.

【００３３】[0033]

【数２】ｄperson＝√［ｋa'（Ｐma−Ｐa ）² ＋ｋb'（Ｐmb−Ｐb ）² ＋ …… …… ＋ｋf'（Ｐmf−Ｐf ）² ］ｄperson ：パラメータが近似しているかの評価値Ｐma〜Ｐmf：一時記憶しておいたパラメータＰa 〜Ｐf ：話者が切り替わった後に得られたパラメー
タｋa'〜ｋf'：各パラメータに対する判定重み付け係数[Equation 2] dperson = √ [ka '(Pma-Pa) ² + kb' (Pmb-Pb) ² + ......... + kf '(Pmf-Pf) ² ] dperson: Evaluation value Pma of whether the parameters are close to each other -Pmf: Parameters temporarily stored Pa-Pf: Parameters obtained after the speaker is switched ka'-kf ': Judgment weighting coefficient for each parameter

【００３４】パラメータＰma〜Ｐmf、Ｐa 〜Ｐf につい
ては、この場合にもＬＰＦ等の前処理を施しておくのが
望ましい。符号化装置と復号装置のコードブック６，２
４の整合を図るためには、一時記憶用メモリ３１の内容
が復活したことを示す復活フラグを、初期化フラグに加
えて符号化装置から送信又は記録すれば良い。復号装置
は、復活フラグ及び初期化フラグに基づいてコードブッ
ク２４の内容を復活又は初期化させる。Regarding the parameters Pma to Pmf and Pa to Pf, it is desirable to carry out preprocessing such as LPF also in this case. Codebooks 6 and 2 of the encoding device and the decoding device
In order to achieve the consistency of 4, the restoration flag indicating that the contents of the temporary storage memory 31 have been restored may be transmitted or recorded from the encoding device in addition to the initialization flag. The decoding device restores or initializes the contents of the codebook 24 based on the restoration flag and the initialization flag.

【００３５】なお、以上の実施例では、初期化フラグや
復活フラグを符号化装置から復号装置に供給したが、符
号化装置及び復号装置で全く同一のアルゴリズムによっ
てこれらの情報を生成するようにすれば、これらの情報
転送は不要となる。図６及び図７は、このようなシステ
ムの例を示す図で、図６は符号化装置、図７は復号装置
のブロック図である。例えばＣＥＬＰ方式のように分析
合成手法を用いている場合、符号化装置の内部で一旦合
成音声が作成される。以下、この合成音声を局部復号デ
ータと呼ぶ。長期予測部２で予測残差ベクトルを求める
際には、長期予測部２の内部で局部復号データを得てい
る。この局部復号データは、復号装置の復号処理によっ
て合成される復号データと一致する。そこで、符号化装
置では、この局部復号データから音声特徴抽出部１１で
特徴パラメータを抽出し、この抽出された特徴パラメー
タと長期予測部２で求められた特徴係数、ピッチ等を話
者変化検出部１２に供給して特徴パラメータの変化を検
出する。ディレイ部４１は、ベクトル量子化での処理の
遅れを補償するための遅延手段である。In the above embodiment, the initialization flag and the restoration flag are supplied from the encoding device to the decoding device. However, the encoding device and the decoding device may generate these pieces of information by using exactly the same algorithm. For example, the transfer of such information becomes unnecessary. 6 and 7 are diagrams showing an example of such a system, FIG. 6 is a block diagram of an encoding device, and FIG. 7 is a block diagram of a decoding device. For example, when an analysis and synthesis method such as the CELP method is used, synthetic speech is temporarily created inside the encoding device. Hereinafter, this synthesized voice is referred to as locally decoded data. When the long-term prediction unit 2 obtains a prediction residual vector, locally decoded data is obtained inside the long-term prediction unit 2. This locally decoded data matches the decoded data synthesized by the decoding process of the decoding device. Therefore, in the encoding device, the speech feature extraction unit 11 extracts a feature parameter from the locally decoded data, and the extracted feature parameter and the feature coefficient, pitch, etc. obtained by the long-term prediction unit 2 are used as a speaker change detection unit. 12 to detect changes in the characteristic parameters. The delay unit 41 is a delay unit for compensating for a delay in processing in vector quantization.

【００３６】一方、復号装置でも、全く同様に、長期予
測合成部２２で合成された復号データから音声特徴抽出
部５１で特徴パラメータを抽出し、この抽出された特徴
パラメータを話者変化検出部５２に供給すると共に、受
信又は読み出された特徴係数、ピッチ等の情報をディレ
イ部５３を介して話者変化検出部５２に供給する。これ
により、符号化装置と全く同一のデータ、同一のアルゴ
リズムにて話者変化を検出することができるので、符号
化装置と復号装置との間で初期化フラグを供給する必要
がない。このため、データの送信又は記憶容量を削減す
ることができる。On the other hand, also in the decoding device, the voice feature extraction unit 51 extracts feature parameters from the decoded data synthesized by the long-term prediction synthesis unit 22, and the speaker feature detection unit 52 uses the extracted feature parameters. And the received or read characteristic coefficient, pitch, and other information to the speaker change detection unit 52 via the delay unit 53. As a result, the speaker change can be detected with the same data and the same algorithm as the encoding device, so that it is not necessary to supply an initialization flag between the encoding device and the decoding device. Therefore, data transmission or storage capacity can be reduced.

【００３７】なお、以上の実施例では、音声信号の予測
残差ベクトルのベクトル量子化を例にとったが、音声信
号そのもの、又は画像信号等のベクトル量子化にも同様
に適用可能であることはいうまでもない。In the above embodiments, the vector quantization of the prediction residual vector of the voice signal is taken as an example, but the same applies to the vector quantization of the voice signal itself or the image signal. Needless to say.

【００３８】[0038]

【発明の効果】以上述べたように、この発明によれば、
被量子化ベクトとコードベクトルとのマッチング結果に
基づいてコードブックの内容を適応的に更新する学習ベ
クトル量子化による符号化復号方式において、符号化対
象信号の特徴パラメータの変化を検出し、この変化が検
出された場合には話者が変わったものと判断して、コー
ドブックの内容を初期化するようにしているので、話者
交替後のコードブックの内容を不特定話者向けの平均的
な内容とすることができ、ベクトル量子化誤差の増大を
防ぐことができる。As described above, according to the present invention,
In a coding / decoding method by learning vector quantization that adaptively updates the contents of the codebook based on the matching result between the quantized vector and the code vector, the change in the characteristic parameter of the signal to be coded is detected, and this change is detected. If is detected, it is determined that the speaker has changed, and the contents of the codebook are initialized.Therefore, the contents of the codebook after the speaker change is averaged for unspecified speakers. It is possible to prevent the increase of vector quantization error.

[Brief description of drawings]

【図１】この発明の一実施例に係るベクトル量子化に
よる符号化装置のブロック図である。FIG. 1 is a block diagram of an encoding device by vector quantization according to an embodiment of the present invention.

【図２】同ベクトル量子化による復号装置のブロック
図である。FIG. 2 is a block diagram of a decoding device using the same vector quantization.

【図３】同符号化装置及び復号化装置におけるコード
ブックの内容を説明するための図である。FIG. 3 is a diagram for explaining the contents of a codebook in the encoding device and the decoding device.

【図４】同コードブックの更新前及び更新後のコード
ベクトルの内容を模式的に示す図である。FIG. 4 is a diagram schematically showing the contents of code vectors before and after updating the codebook.

【図５】この発明の他の実施例に係るベクトル量子化
による符号化装置のブロック図である。FIG. 5 is a block diagram of an encoding device by vector quantization according to another embodiment of the present invention.

【図６】この発明の更に他の実施例に係るベクトル量
子化による符号化装置のブロック図である。FIG. 6 is a block diagram of an encoding apparatus by vector quantization according to still another embodiment of the present invention.

【図７】同復号装置のブロック図である。FIG. 7 is a block diagram of the decoding device.

[Explanation of symbols]

１…Ａ／Ｄ変換器、２…長期予測部、３…多重化部、４
…フレーム分割部、５…ベクトル量子化部、６，２４…
コードブック、７，２５…初期化コードブック、８，２
７…書換制御部、９…減算器、１０…誤差量検出部、１
１，５１…音声特徴抽出部、１２，５２…話者変化検出
部、２１…逆多重化部、２２…長期予測合成部、２３…
ベクトル逆量子化部、２６…Ｄ／Ａ変換器、３１…一時
記憶用メモリ、３２…タイマ、４１，５３…ディレイ。1 ... A / D converter, 2 ... Long-term prediction unit, 3 ... Multiplexing unit, 4
... Frame division section, 5 ... Vector quantization section, 6, 24 ...
Codebook, 7, 25 ... Initialization codebook, 8, 2
7 ... Rewrite control unit, 9 ... Subtractor, 10 ... Error amount detection unit, 1
1, 51 ... Voice feature extraction unit, 12, 52 ... Speaker change detection unit, 21 ... Demultiplexing unit, 22 ... Long-term prediction synthesis unit, 23 ...
Vector dequantization unit, 26 ... D / A converter, 31 ... Temporary storage memory, 32 ... Timer, 41, 53 ... Delay.

Claims

[Claims]

1. A codebook, which stores a correspondence table of code vectors indicating some typical signal patterns in a predetermined section and indexes for specifying the code vectors, an encoding device and a decoding device. Provided in both, in the encoding device, select the code vector in the codebook that best matches the quantized vector to be quantized based on the encoding target signal and output the index, in the decoding device, While decoding the quantized vector by obtaining the code vector corresponding to the index obtained on the encoding device side with reference to the codebook, the quantized vector and the code in the encoding device A codebook book that adaptively updates the contents of the codebook based on the result of matching with a vector In an encoding / decoding method by vector quantization in which a conversion unit is provided in both the encoding device and the decoding device, an initialization codebook holding the contents of the codebook in the initial state is provided in both the encoding device and the decoding device. In addition to the above, the encoding device includes a feature extraction unit that extracts a feature parameter of the encoding target signal, and a feature change detection unit that detects a change in the feature parameter extracted by the feature extraction unit, The codebook rewriting means initializes the content of the codebook to the content of the initialization codebook when the characteristic change detecting means detects a change in the characteristic parameter. Encoding and decoding method by vector quantization.

2. The method further comprises: a temporary storage unit for temporarily storing the contents and characteristic parameters of the codebook prior to the initialization of the codebook, wherein the codebook rewriting unit is the characteristic change detecting unit. When a change in the parameters is detected, first, the characteristic parameters extracted by the characteristic extracting means are compared with the characteristic parameters stored in the temporary storage means, and when the two are approximate, the code The contents of the book are rewritten to the contents of the codebook stored in the temporary storage means, and when the two characteristic parameters are not close to each other, the contents of the codebook are saved in the temporary storage means and then the initialization is performed. The encoding / decoding method by vector quantization according to claim 1, wherein the encoding / decoding method is initialized to the contents of a codebook.

3. The encoding device supplies initialization information indicating that the content of the codebook has been rewritten with the content of the initialization codebook to the decoding device, and the decoding device provides the supplied information. 3. The coding / decoding method by vector quantization according to claim 1, wherein the content of the codebook is initialized to the content of the initialized codebook based on the initialization information.

4. The encoding device temporarily decodes the output encoding result, and the feature extracting means extracts the feature parameter from the decoded result, and a change in the extracted feature parameter is detected. The content of the codebook is initialized to the content of the initialization codebook when the decoding device decodes the input coding result, and the decoding result is characterized by the feature extraction means. 3. The parameter extraction is executed, and when the change of the extracted characteristic parameter is detected, the contents of the codebook are initialized to the contents of the initialization codebook.
An encoding / decoding method by the described vector quantization.

5. The encoding target signal is a voice signal, and the feature change detection means changes voiced / unvoiced changes, pitch changes, and feature coefficient changes of the voice signal output from the feature extraction means. , A change in spectrum envelope information, a change in gain, a change in voiced / silent of the voice signal output in the process of encoding in the encoding device, a change in pitch, a change in characteristic coefficient, a change in gain, And controlling the codebook rewriting means on the basis of at least one of the change of the vector quantization error.
An encoding / decoding method by vector quantization according to any one of 1.

6. A codebook, which stores a correspondence table of code vectors showing some typical signal patterns in a predetermined section and indexes for specifying those code vectors, and a codebook based on a signal to be coded. Vector quantizing means for selecting the code vector in the code book that most matches the quantized vector to be quantized and outputting the index, and based on the matching result of the quantized vector and the code vector Codebook rewriting means for adaptively updating the contents of the codebook, an initialization codebook holding the contents of the codebook in the initial state, and a feature extracting means for extracting feature parameters of the signal to be coded, A characteristic change detecting means for detecting a change in the characteristic parameter extracted by the characteristic extracting means, The codebook rewriting means initializes the content of the codebook to the content of the initialization codebook when a change in the characteristic parameter is detected by the characteristic change detecting means. An encoding device using vector quantization.

7. A codebook that stores a correspondence table of code vectors indicating some typical signal patterns in a predetermined section and indexes that respectively specify those code vectors, and a codebook that corresponds to received indexes. Vector dequantization means for decoding the quantized vector by obtaining the code vector by referring to the codebook, and codebook rewriting means for updating the content of the codebook based on the received update information, An initialization codebook holding the contents of the codebook in an initial state, wherein the codebook rewriting means is based on an initialization flag received based on a change in a characteristic parameter of a signal to be coded. Is initialized to the contents of the initialization codebook. Decoder according Le quantization.