JP2002116799A

JP2002116799A - Audio signal encoding device

Info

Publication number: JP2002116799A
Application number: JP2000308274A
Authority: JP
Inventors: Sadahiro Yasura; 定浩安良
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2000-10-06
Filing date: 2000-10-06
Publication date: 2002-04-19

Abstract

PROBLEM TO BE SOLVED: To perform fade processing when an audio signal is encoded so that noise such as a click sound is not generated. SOLUTION: When a quantization and encoding part 13 quantizes an audio spectrum signal output from a time-frequency conversion part 11 by using an auditory parameter SMR (signal-to-mask ratio) output from an auditory model part 12 and outputs the quantized signal, the quantization and encoding part 13 quantizes the audio spectrum signal by using a 1st variable for controlling the code amount of the quantized signal and a 2nd variable for controlling the quantization distortion of the quantized signal, and then varies the 1st variable with a parameter fp for fading.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an audio signal encoding apparatus that encodes an input audio signal after converting it into the frequency domain.

[0002]

[Description of the Related Art] Conventional audio signal encoding methods that encode an input audio signal by high-efficiency coding include, for example, Adaptive Spectral Perceptual Entropy Coding (ASPEC), MPEG (Moving Picture Experts Group) 1 Audio Layer 3, and MPEG-2 Audio AAC (Advanced Audio Coding).

[0003] A conventional audio signal encoding apparatus to which the above audio signal encoding methods are applied will be described with reference to FIGS. 6 and 7.

[0004] FIG. 6 is a block diagram showing a conventional audio signal encoding apparatus, FIG. 7 is a flowchart showing the double-loop operation during quantization in the conventional audio signal encoding apparatus, and FIG. 8 is a schematic diagram showing the correspondence between the audio spectrum signal and the bands sfb.

[0005] The conventional audio signal encoding apparatus 10B shown in FIG. 6 comprises a time-frequency conversion unit 11, an auditory model unit 12, a quantization encoding unit 13, and a bit stream generation unit 14.

[0006] First, a digital audio signal on the time axis (hereinafter referred to as an audio PCM signal) is input in parallel to the time-frequency conversion unit 11 and the auditory model unit 12 in units of frames, each of which is a substantially constant processing period.

[0007] The time-frequency conversion unit 11 converts the input audio PCM (Pulse-Code Modulation) signal from the time axis to the frequency axis using FFT (Fast Fourier Transform) processing, MDCT (Modified Discrete Cosine Transform) processing, or the like, and sends the resulting audio spectrum signal to the quantization encoding unit 13.

[0008] Meanwhile, the auditory model unit 12 sends to the quantization encoding unit 13 a signal-to-mask ratio SMR, an auditory parameter obtained by calculating the masking level on the basis of human psychoacoustics.

[0009] Next, the quantization encoding unit 13 quantizes, band by band, the audio spectrum signal output from the time-frequency conversion unit 11. When quantizing the audio spectrum signal, a double loop is formed by the iteration loop described later, which performs nonlinear quantization and Huffman coding, and this double loop controls the code amount and the quantization distortion at the time of quantization.

[0010] In this double loop, the outer loop contains the inner loop. The inner loop produces a quantized signal from the audio spectrum signal supplied by the time-frequency conversion unit 11 while controlling the number of bits used so that it falls within a predetermined number of bits. The outer loop then calculates the quantization distortion of the quantized signal and controls it so that it is at or below the allowable noise level derived from the signal-to-mask ratio SMR output from the auditory model unit 12. When the quantization distortion is at or below the allowable noise level, the quantized signal is sent to the bit stream generation unit 14.

[0011] More specifically, as shown in FIG. 7(a), when the iteration loop ITR is started in step 1, the process immediately moves in step 2 to the inner loop IR within the outer loop OR, and the inner loop IR is started as shown in FIG. 7(b).

[0012] When the inner loop IR is started, the audio spectrum signal from the time-frequency conversion unit 11 is quantized band by band in step 2a to obtain a quantized signal. Next, in step 2b, the number of bits used by the quantized signal is calculated by Huffman coding. Next, in step 2c, it is determined whether the number of bits used by the quantized signal is within the predetermined number of bits. If it is not (NO), global_gain is increased in step 2d, which changes the level uniformly for all bands in the quantized signal; the process then returns to step 2a, and steps 2a to 2c are repeated until the number of bits used by the quantized signal is within the predetermined number of bits in step 2c. If, on the other hand, the number of bits used is within the predetermined number of bits in step 2c (YES), global_gain is fixed. The inner loop IR then ends in step 2e, and the process returns to the outer loop OR.
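The inner loop of steps 2a to 2e can be sketched roughly as follows. This is a minimal illustration, not the encoder's actual implementation: the quantizer form (a 3/4 power law with `scalefactor` in the numerator and `global_gain` in the denominator), the `count_bits` stand-in for the Huffman bit count, the `band_of` lookup table, and the `max_gain` safety limit are all assumptions introduced for this example.

```python
def quantize(spectrum, global_gain, scalefactors, band_of):
    """Step 2a (assumed form): power-law quantizer, scalefactor up, global_gain down."""
    out = []
    for k, x in enumerate(spectrum):
        sf = scalefactors[band_of[k]]
        scaled = abs(x) * 2.0 ** ((sf - global_gain) / 4.0)
        out.append(int(scaled ** 0.75 + 0.5))
    return out


def count_bits(quantized):
    """Step 2b stand-in (hypothetical): magnitude bits plus a sign bit, not real Huffman coding."""
    return sum(q.bit_length() + (1 if q else 0) for q in quantized)


def inner_loop(spectrum, scalefactors, band_of, bit_budget, global_gain=0, max_gain=255):
    """Raise global_gain until the frame fits the bit budget (steps 2a to 2e)."""
    while True:
        quantized = quantize(spectrum, global_gain, scalefactors, band_of)  # step 2a
        used = count_bits(quantized)                                        # step 2b
        if used <= bit_budget or global_gain >= max_gain:                   # step 2c
            return quantized, global_gain, used                             # step 2e
        global_gain += 1                                                    # step 2d
```

Calling `inner_loop([1000.0, -500.0, 250.0, 10.0], [0, 0], [0, 0, 1, 1], bit_budget=12)` returns the first `global_gain` at which the toy bit count no longer exceeds 12.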

[0013] The predetermined number of bits in step 2c means the number of bits available in one audio frame, which is derived from the preset bit rate.
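The text does not give the formula for this per-frame bit count. A common way to derive an average budget, shown here only as a sketch, is to multiply the bit rate by the frame duration. The function name and the example values (128 kbit/s, 48 kHz, 1024 samples per frame) are assumptions, and real encoders may adjust the budget further, for example with a bit reservoir.

```python
def bits_per_frame(bit_rate_bps, sample_rate_hz, samples_per_frame):
    """Average number of bits available for one frame at a constant bit rate."""
    # frame duration = samples_per_frame / sample_rate_hz seconds
    return bit_rate_bps * samples_per_frame // sample_rate_hz


# Example with assumed values: 128 kbit/s, 48 kHz sampling, 1024 samples per frame.
budget = bits_per_frame(128_000, 48_000, 1024)  # 2730 bits
```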

[0014] When the audio spectrum signal from the time-frequency conversion unit 11 is quantized band by band in step 2a of the inner loop IR, the quantization is performed according to the quantization formula shown in [Equation 1] below.

[0015]

[Equation 1: equation image not included in this text]

In [Equation 1] above, quant(k) denotes the quantized value for index k of the quantized signal within the frame. global_gain, which appears in the denominator of [Equation 1], is the first variable, used to vary the level of all audio spectrum signals in the frame; this first variable takes an integer value. |mdct_line(k)|, which appears in the numerator of [Equation 1], denotes the absolute level of the audio spectrum signal at index k. scalefactor(sfb), which also appears in the numerator of [Equation 1], is the second variable, used to control the quantization distortion by varying the level of the audio spectrum signal in units of bands sfb. The correspondence between the audio spectrum signal and the bands sfb is shown schematically in FIG. 8.
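Because the image of [Equation 1] is not available in this text, the following is only a sketch of one form that is consistent with the description above (|mdct_line(k)| and scalefactor(sfb) on the numerator side, global_gain on the denominator side) and with the factor 2^(-fp/4) used later in the dB calculation. The 3/4 power law and the rounding operator are assumptions borrowed from the nonlinear quantizers commonly described for MPEG-style encoders; they are not taken from this document.

```latex
\mathrm{quant}(k) =
\operatorname{round}\!\left[
  \left(
    \frac{\lvert \mathrm{mdct\_line}(k) \rvert \cdot 2^{\frac{1}{4}\,\mathrm{scalefactor}(sfb)}}
         {2^{\frac{1}{4}\,\mathrm{global\_gain}}}
  \right)^{3/4}
\right]
```

In this assumed form, increasing global_gain by 1 lowers the level of every line in the frame by the same factor before quantization, while increasing scalefactor(sfb) raises the level of one band only.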

[0016] Next, when the inner loop IR of step 2 has ended, the process moves to step 3 of the outer loop OR shown in FIG. 7(a). In step 3, inverse quantization is performed on the quantization result obtained in the inner loop IR, and the quantization distortion is calculated for each band. The inverse quantization is performed according to the inverse quantization formula shown in [Equation 2] below.

[0017]

[Equation 2: equation image not included in this text]

Next, in step 4, it is determined whether the quantization distortion calculated for each band is within the allowable distortion derived from the signal-to-mask ratio SMR of the auditory model unit 12. If the quantization distortion is not within the allowable distortion (NO), scalefactor(sfb) of that band is increased in step 5. In this case, because at least one band has quantization distortion outside the allowable distortion, the process returns to step 2 and steps 2 to 4 are performed again; this is repeated until the quantization distortion falls within the allowable distortion in step 4. If, on the other hand, the quantization distortion is within the allowable distortion in step 4 (YES), scalefactor(sfb) is fixed. Then, in step 6, the quantized signal, the first variable global_gain, and the second variable scalefactor(sfb) are output to the bit stream generation unit 14, and the iteration loop ITR ends in step 7.
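Steps 3 and 4 of the outer loop (inverse quantization, per-band distortion, and the comparison against the allowable distortion) might look like the sketch below. The inverse quantizer form (a 4/3 power law scaled by 2^((global_gain - scalefactor)/4)), the squared-error distortion measure, and the helper names are assumptions for illustration; the text only states that distortion is calculated per band and compared with a limit derived from the SMR.

```python
def dequantize(quantized, global_gain, scalefactors, band_of):
    """Step 3 (assumed form): inverse of a 3/4 power-law quantizer."""
    return [
        (q ** (4.0 / 3.0)) * 2.0 ** ((global_gain - scalefactors[band_of[k]]) / 4.0)
        for k, q in enumerate(quantized)
    ]


def band_distortion(spectrum, quantized, global_gain, scalefactors, band_of):
    """Sum of squared reconstruction errors per band (assumed distortion measure)."""
    rec = dequantize(quantized, global_gain, scalefactors, band_of)
    dist = [0.0] * len(scalefactors)
    for k, (x, r) in enumerate(zip(spectrum, rec)):
        dist[band_of[k]] += (abs(x) - r) ** 2
    return dist


def bands_over_limit(distortion, allowed):
    """Step 4: indices of bands whose distortion exceeds the allowable distortion."""
    return [b for b, (d, a) in enumerate(zip(distortion, allowed)) if d > a]
```

Each band index returned by `bands_over_limit` is a candidate for the scalefactor(sfb) increase of step 5, after which the inner loop would be run again.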

[0018] Returning to FIG. 6, the bit stream generation unit 14 multiplexes the quantized signal output from the quantization encoding unit 13 with global_gain and scalefactor(sfb) and outputs a bit stream, which completes the encoding of the audio signal.

[0019]

[Problems to Be Solved by the Invention] When an audio PCM signal (content) is encoded with the conventional audio signal encoding apparatus described above, it is desirable that the content already be faded at program changes and track changes before it is encoded. If the content has not been faded (that is, if the junction is discontinuous), a level fluctuation occurs when the signal is encoded and decoded, producing abnormal sounds such as clicks, which is a problem.

[0020]

[Means for Solving the Problems] The present invention has been made in view of the above problem, and provides an audio signal encoding apparatus for encoding an audio signal, comprising: a time-frequency conversion unit that converts the input audio signal from the time axis to the frequency axis and outputs an audio spectrum signal; an auditory model unit that calculates, from the input audio signal, an auditory parameter based on human auditory characteristics and outputs it; a fade parameter input unit that, in order to fade the audio signal, checks an input fade parameter and outputs the fade parameter; a quantization encoding unit that quantizes the audio spectrum signal output from the time-frequency conversion unit using the auditory parameter output from the auditory model unit and outputs a quantized signal; and a bit stream generation unit that converts the quantized signal output from the quantization encoding unit into a bit stream and outputs it, wherein the quantization encoding unit quantizes the audio spectrum signal using a first variable for controlling the code amount of the quantized signal and a second variable for controlling the quantization distortion of the quantized signal, and then changes the first variable according to the fade parameter.

[0021]

[Embodiments of the Invention] An embodiment of the audio signal encoding apparatus according to the present invention will be described in detail below with reference to FIGS. 1 to 5.

[0022] FIG. 1 is a block diagram showing an audio signal encoding apparatus according to the present invention. FIG. 2 is a flowchart showing the double-loop operation during quantization in the audio signal encoding apparatus according to the present invention. FIG. 3 is a flowchart showing the operation of the fade processing shown in FIG. 2. FIG. 4 is a block diagram showing an audio signal decoding apparatus for decoding a bit stream output from the audio signal encoding apparatus according to the present invention. FIG. 5(a) is a waveform diagram showing an example of the decoded output when the fade parameter is not used, and FIG. 5(b) is a waveform diagram showing an example of the decoded output when the fade parameter is used.

[0023] For convenience of explanation, components identical to those shown in the conventional example are given the same reference numerals and described as needed, components that differ from the conventional example are given new reference numerals, and this embodiment is described with a focus on the differences from the conventional example.

[0024] The audio signal encoding apparatus according to the present invention is characterized in particular by performing fade processing when encoding an audio signal so that abnormal sounds such as clicks are not generated.

[0025] The audio signal encoding apparatus 10A according to the present invention shown in FIG. 1 comprises a time-frequency conversion unit 11, an auditory model unit 12, a quantization encoding unit 13, a bit stream generation unit 14, and a newly added fade parameter input unit 15.

[0026] The input audio PCM signal is sent to the time-frequency conversion unit 11 and the auditory model unit 12 in units of frames, each of which is a substantially constant processing period. The time-frequency conversion unit 11 converts the signal from the time axis to the frequency axis and sends the resulting audio spectrum signal to the quantization encoding unit 13. Meanwhile, the auditory model unit 12 sends to the quantization encoding unit 13 the signal-to-mask ratio SMR, an auditory parameter obtained by calculating the masking level on the basis of human psychoacoustics.

[0027] Next, the fade parameter input unit 15, which is the main part of the present invention, checks the fade parameter input to it from outside in order to fade the input audio PCM signal in units of frames, and supplies the fade parameter fp to the quantization encoding unit 13.

[0028] The fade parameter fp supplied to the quantization encoding unit 13 is 0 or a natural number (1, 2, ..., n) set for each frame. When the range of frames to be faded is set for the audio PCM signal, the fade parameter fp is set for each frame to 0 or an appropriate natural number according to how that frame is to be faded, so that the quantization encoding unit 13 can change the amplitude level during the fade.

[0029] Next, when quantizing the audio spectrum signal output from the time-frequency conversion unit 11 band by band, the quantization encoding unit 13 completes the iteration loop ITR in the same manner as described for the conventional example, and then applies fade processing to the quantized signal obtained at the end of the iteration loop ITR, on the basis of the fade parameter fp sent from the fade parameter input unit 15.

[0030] That is, as shown in FIGS. 2(a) and 2(b), the quantization encoding unit 13 performs the same double loop of the iteration loop ITR as described for the conventional example. Stating only the results of the inner loop IR and the outer loop OR within the iteration loop ITR: when the steps of the inner loop IR are complete, the number of bits used by the quantized signal is within the predetermined number of bits, and global_gain, the first variable for varying the level of all bands in the quantized signal, is fixed.

[0031] Then, when the steps of the outer loop OR are complete, the quantization distortion of each band is at or below the allowable noise level, and scalefactor(sfb), the second variable for controlling the quantization distortion, is also fixed.

[0032] In step 6' of the outer loop OR, the quantized signal, the first variable global_gain, and the second variable scalefactor(sfb) are not output to the bit stream generation unit 14 as in the conventional example, but are passed to the fade processing described below within the quantization encoding unit 13.

[0033] Then, when the iteration loop ITR ends in step 7, the process enters the fade processing F of step 8. In the fade processing F, as shown in FIG. 3, step 8a takes the fade parameter fp into account for the global_gain fixed when the iteration loop ITR ended, and replaces the supplied (global_gain) with {(global_gain) - fp}.
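Step 8a changes only the side information, not the quantized values. A minimal sketch of that replacement is shown below; the `FrameSideInfo` container and the `apply_fade` name are hypothetical, and the valid range of the transmitted global_gain field is not considered here.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class FrameSideInfo:
    """Result of the iteration loop for one frame (hypothetical container)."""
    quantized: List[int]
    global_gain: int
    scalefactors: List[int]


def apply_fade(frame: FrameSideInfo, fp: int) -> FrameSideInfo:
    """Step 8a: replace global_gain with (global_gain - fp); fp = 0 means no fade."""
    if fp < 0:
        raise ValueError("fp must be 0 or a natural number")
    # The quantized signal and the scalefactors are passed through unchanged.
    return replace(frame, global_gain=frame.global_gain - fp)
```

Because the quantized signal is untouched, the fade costs no extra quantization pass; only the value written to the bit stream for global_gain differs.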

[0034] Then, in step 8b, the quantized signal, the replaced first variable {(global_gain) - fp}, and the second variable scalefactor(sfb) are sent from the quantization encoding unit 13 to the bit stream generation unit 14.

[0035] Returning to FIG. 1, the bit stream generation unit 14 multiplexes the quantized signal output from the quantization encoding unit 13 with the replaced {(global_gain) - fp} and scalefactor(sfb) and outputs a bit stream, which completes the encoding of the audio signal.

[0036] Next, a bit stream obtained by fade processing based on the fade parameter fp in the audio signal encoding apparatus 10A according to the present invention is decoded using the audio signal decoding apparatus 20 shown in FIG. 4, which is briefly described below.

[0037] In the audio signal decoding apparatus 20, the bit stream output from the bit stream generation unit 14 of the audio signal encoding apparatus 10A according to the present invention is input to a bit stream decomposition unit 21. The bit stream decomposition unit 21 separates the multiplexed bit stream into a signal corresponding to the quantized signal, a signal corresponding to {(global_gain) - fp}, and a signal corresponding to scalefactor(sfb), and sends them to an inverse quantization decoding unit 22. The inverse quantization decoding unit 22 inverse-quantizes and decodes these signals, and the decoded signals are obtained as signals on the frequency axis.

[0038] Here, the quantized signal is the result of quantization by [Equation 1] described above, using the global_gain before it was replaced using the fade parameter fp. This quantized signal would normally be inverse-quantized by [Equation 2] described above; because the replaced first variable {(global_gain) - fp} is used instead, the result is equivalent to inverse quantization according to the inverse quantization formula shown in [Equation 3] below, and the amplitude level after decoding can be attenuated.
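The equivalence can be made concrete under an assumed form of the inverse quantizer. The images of [Equation 2] and [Equation 3] are not available in this text, so the expressions below are only a sketch: the 4/3 power law and the placement of scalefactor(sfb) are assumptions, chosen so that global_gain enters as a factor 2^(global_gain/4), which matches the 2^(-fp/4) factor in the dB calculation. The symbols x_2(k) and x_3(k) are introduced here for the outputs of [Equation 2] and [Equation 3].

```latex
\begin{aligned}
x_{2}(k) &= \mathrm{quant}(k)^{4/3} \cdot
            2^{\frac{1}{4}\left(\mathrm{global\_gain} - \mathrm{scalefactor}(sfb)\right)} \\
x_{3}(k) &= \mathrm{quant}(k)^{4/3} \cdot
            2^{\frac{1}{4}\left(\mathrm{global\_gain} - fp - \mathrm{scalefactor}(sfb)\right)}
          = 2^{-fp/4}\, x_{2}(k)
\end{aligned}
```

Under this assumed form, every spectral line of the frame is scaled by the same factor 2^(-fp/4), so changing only the transmitted global_gain attenuates the whole decoded frame.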

[0039]

[Equation 3: equation image not included in this text]

As described above, the value of the fade parameter fp used here is set according to how the frame is to be faded. For example, the dB value for the fade parameter fp = 1 is calculated as follows. The dB value is given by $20 \log_{10}\{[\text{Equation 2}]/[\text{Equation 3}]\}$, that is, $20 \log_{10}\{2^{(-fp/4)}\}$ dB; substituting fp = 1 into this expression gives $20 \log_{10}\{2^{(-1/4)}\}\ \mathrm{dB} = -1.505\ \mathrm{dB}$.
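The dB calculation above can be checked numerically. The function name below is an assumption; the formula is the one stated in the text, 20 × log10(2^(-fp/4)).

```python
import math


def fade_attenuation_db(fp):
    """Level change in dB caused by lowering global_gain by fp."""
    return 20.0 * math.log10(2.0 ** (-fp / 4.0))


print(round(fade_attenuation_db(1), 3))  # -1.505
print(round(fade_attenuation_db(4), 3))  # -6.021
```

Each unit of fp therefore corresponds to a step of about 1.5 dB, and fp = 4 halves the amplitude.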

[0040] Furthermore, to attenuate the amplitude level after decoding continuously, the value of the fade parameter fp in {(global_gain) - fp} of [Equation 3] must be increased from frame to frame. For example, the fade parameter fp may be increased by one at a time, as in 1, 2, 3, ..., or at a suitable interval, as in 1, 3, 5, .... For this reason, when the fade processing F is performed in the quantization encoding unit 13, the values of the fade parameter fp corresponding to the chosen method must be input to the fade parameter input unit 15.
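The two example sequences (1, 2, 3, ... and 1, 3, 5, ...) can be generated per frame as in the sketch below, which also shows the amplitude factor each value implies. The function names and the `step` and `start` parameters are assumptions for illustration; the text only requires that fp increase from frame to frame.

```python
def fade_out_schedule(num_frames, step=1, start=1):
    """fp value for each successive frame of a fade-out (hypothetical helper)."""
    return [start + i * step for i in range(num_frames)]


def frame_gain(fp):
    """Linear amplitude factor implied by fp, using the 2^(-fp/4) relation."""
    return 2.0 ** (-fp / 4.0)


slow = fade_out_schedule(4)          # [1, 2, 3, 4]
fast = fade_out_schedule(3, step=2)  # [1, 3, 5]
gains = [frame_gain(fp) for fp in slow]  # decreasing, ending at 0.5
```

A larger `step` gives a steeper fade over the same number of frames, at the cost of a bigger level jump between adjacent frames.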

[0041] The signals inverse-quantized and decoded by the inverse quantization decoding unit 22 are then sent to a frequency-time conversion unit 23, which converts them from the frequency axis to the time axis, restores the original audio PCM signal, and outputs it.

[0042] Next, an example of the decoded output when the fade parameter fp is not used and an example of the decoded output when the fade parameter fp is used are compared with reference to FIGS. 5(a) and 5(b).

[0043] First, FIG. 5(a) shows an example of the decoded output obtained when a bit stream encoded without the fade parameter fp is decoded by the audio signal decoding apparatus 20. The fade parameter fp is set for neither frame 1 nor frame 2. When a long window, one type of window function, is applied to frame 1 and frame 2 in this state and the two frames are joined, no fade has been applied at the junction, as the summed result in the figure shows. If the junction is discontinuous, its level is not attenuated, and the result is unpleasant to listen to.

[0044] In contrast, FIG. 5(b) shows an example of the decoded output obtained when a bit stream encoded with the fade parameter fp is decoded by the audio signal decoding apparatus 20. The fade parameter fp is not set for frame 1 but is set for frame 2. When the long window is applied to frame 1 and frame 2 in this state and the two frames are joined, a fade has been applied at the junction, as the summed result in the figure shows, so the level at the junction is attenuated and the sound is smooth and pleasant to listen to.

[0045]

[Effects of the Invention] The audio signal encoding apparatus according to the present invention described in detail above comprises: a time-frequency conversion unit that converts the input audio signal from the time axis to the frequency axis and outputs an audio spectrum signal; an auditory model unit that calculates, from the input audio signal, an auditory parameter based on human auditory characteristics and outputs it; a fade parameter input unit that, in order to fade the audio signal, checks an input fade parameter and outputs the fade parameter; a quantization encoding unit that quantizes the audio spectrum signal output from the time-frequency conversion unit using the auditory parameter output from the auditory model unit and outputs a quantized signal; and a bit stream generation unit that converts the quantized signal output from the quantization encoding unit into a bit stream and outputs it. In particular, the quantization encoding unit quantizes the audio spectrum signal using a first variable for controlling the code amount of the quantized signal and a second variable for controlling the quantization distortion of the quantized signal, and then changes the first variable according to the fade parameter. Fade processing can therefore be performed when the audio signal is encoded, so that abnormal sounds such as clicks are not generated.

[Brief description of the drawings]

FIG. 1 is a block diagram showing an audio signal encoding apparatus according to the present invention.

FIG. 2 is a flowchart showing the double-loop operation during quantization in the audio signal encoding apparatus according to the present invention.

FIG. 3 is a flowchart showing the operation of the fade processing shown in FIG. 2.

FIG. 4 is a block diagram showing an audio signal decoding apparatus for decoding a bit stream output from the audio signal encoding apparatus according to the present invention.

FIG. 5(a) is a waveform diagram showing an example of the decoded output when the fade parameter is not used, and FIG. 5(b) is a waveform diagram showing an example of the decoded output when the fade parameter is used.

FIG. 6 is a block diagram showing a conventional audio signal encoding apparatus.

FIG. 7 is a flowchart showing the double-loop operation during quantization in the conventional audio signal encoding apparatus.

FIG. 8 is a schematic diagram showing the correspondence between the audio spectrum signal and the bands sfb.

[Explanation of symbols]

10A: audio signal encoding apparatus according to the present invention; 11: time-frequency conversion unit; 12: auditory model unit; 13: quantization encoding unit; 14: bit stream generation unit; 15: fade parameter input unit; F: fade processing; fp: fade parameter.

Claims


1. An audio signal encoding apparatus for encoding an audio signal, comprising: a time-frequency conversion unit that converts the input audio signal from a time axis to a frequency axis and outputs an audio spectrum signal; an auditory model unit that calculates, from the input audio signal, an auditory parameter based on human auditory characteristics and outputs the auditory parameter; a fade parameter input unit that, in order to perform fade processing on the audio signal, checks an input fade parameter and outputs the fade parameter; a quantization encoding unit that quantizes the audio spectrum signal output from the time-frequency conversion unit using the auditory parameter output from the auditory model unit and outputs a quantized signal; and a bit stream generation unit that converts the quantized signal output from the quantization encoding unit into a bit stream and outputs the bit stream, wherein the quantization encoding unit quantizes the audio spectrum signal using a first variable for controlling a code amount of the quantized signal and a second variable for controlling a quantization distortion of the quantized signal, and then changes the first variable according to the fade parameter.