JP2000349643A

JP2000349643A - Digital audio signal compression apparatus, digital audio signal compression method, and storage medium

Info

Publication number: JP2000349643A
Application number: JP11161337A
Authority: JP
Inventors: Takayuki Kikuchi; 孝之菊池
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1999-06-08
Filing date: 1999-06-08
Publication date: 2000-12-15

Abstract

(57)【要約】【課題】オーディオ信号を圧縮処理するのに必要な演
算処理時間を短縮できるようにする。【解決手段】ディジタルオーディオ信号を圧縮するデ
ィジタルオーディオ信号圧縮装置において、各サブバン
ドにおけるマスク対雑音比からマスク対雑音比向上指標
値を予め求め、上記求めたマスク対雑音比向上指標値を
超えるまで、予め定められたサブバンド毎にマスク対雑
音比演算を行うことにより、最小マスク対雑音比演算処
理を省略できるようにして、最小マスク対雑音比演算処
理を省略して演算処理に要する時間を短縮できるように
する。 (57) [Summary] [PROBLEMS] To reduce the processing time required for compressing an audio signal. SOLUTION: In a digital audio signal compression apparatus for compressing a digital audio signal, a mask-to-noise ratio improvement index value is obtained in advance from a mask-to-noise ratio in each subband, and the mask-to-noise ratio improvement index value is obtained until the above-mentioned mask-to-noise ratio improvement index value is exceeded. By performing the mask-to-noise ratio calculation for each predetermined sub-band, the minimum mask-to-noise ratio calculation process can be omitted, and the time required for the calculation process can be reduced by omitting the minimum mask-to-noise ratio calculation process. Be able to shorten.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明はディジタルオーディ
オ信号圧縮装置、ディジタルオーディオ信号圧縮方法及
び記憶媒体に関し、特に、ディジタルオーディオ信号を
圧縮する際のビット割り当て演算に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a digital audio signal compression apparatus, a digital audio signal compression method, and a storage medium, and more particularly to a bit allocation operation for compressing a digital audio signal.

【０００２】[0002]

【従来の技術】ディジタルオーディオ信号を圧縮する手
法にＩＳＯ／ＩＥＣ１１７２−３に示されるＭＰＥＧオ
ーディオ圧縮方式がある。図２は、ＭＰＥＧオーディオ
圧縮装置の概略構成を示すブロック図である。2. Description of the Related Art There is an MPEG audio compression method described in ISO / IEC1172-3 as a method for compressing a digital audio signal. FIG. 2 is a block diagram showing a schematic configuration of the MPEG audio compression device.

【０００３】図２において、１はデジタルオーディオ入
力端、２はデジタルオーディオ入力端１からのディジタ
ルオーディオ信号を所定の時間間隔に分割し、複数の周
波数帯域に写像するサブバンド符号化部、３はサブバン
ド符号化部２と対応した期間のデジタルオーディオ入力
端１からのディジタルオーディオ信号をＦＦＴ解析する
ＦＦＴ演算部である。In FIG. 2, 1 is a digital audio input terminal, 2 is a sub-band encoding unit that divides a digital audio signal from the digital audio input terminal 1 into predetermined time intervals, and maps them to a plurality of frequency bands. An FFT operation unit that performs FFT analysis on the digital audio signal from the digital audio input terminal 1 during a period corresponding to the subband encoding unit 2.

【０００４】４はＦＦＴ演算部３の結果を音圧に換算
し、マスク量を演算するマスク量演算部であり、５はマ
スク量演算部４で算出したマスク量を用いて各サブバン
ドに割り当てるビット数を見積もるビット割り当て演算
部であり、６はビット割り当て演算部５で割り当てられ
たビット数に対応してサブバンド符号化部２からの出力
であるサブバンド信号を量子化するデータ量子化部であ
る。[0004] Reference numeral 4 denotes a mask amount calculator for converting the result of the FFT calculator 3 into sound pressure and calculating a mask amount. Reference numeral 5 denotes a mask amount calculated by the mask amount calculator 4, which is assigned to each subband. A bit allocation operation unit for estimating the number of bits, and a data quantization unit 6 for quantizing a subband signal output from the subband encoding unit 2 in accordance with the number of bits allocated by the bit allocation operation unit 5 It is.

【０００５】７はデータ量子化部６の出力である量子化
データや、ビット割り当て演算部５からのビット割り当
て情報等を所定の伝送フォーマットにフォーマットする
伝送フォーマット形成部であり、８は圧縮データ出力端
８である。[0005] Reference numeral 7 denotes a transmission format forming unit for formatting the quantized data output from the data quantization unit 6 and the bit allocation information from the bit allocation calculation unit 5 into a predetermined transmission format. End 8.

【０００６】以下に、オーディオ信号圧縮装置の圧縮動
作について説明する。デジタルオーディオ入力端１に入
力されたデータはサブバンド符号化部２及び、ＦＦＴ演
算部３に入力される。The compression operation of the audio signal compression device will be described below. The data input to the digital audio input terminal 1 is input to the subband encoding unit 2 and the FFT operation unit 3.

【０００７】まず、サブバンド符号化部２ではオーバラ
ップ・ポリフェイズ・フィルタ・バンク（ＯＰＦＢ）に
より、入力信号を所定の時間間隔毎（オーディオフレー
ムと称す）に７５０Ｈｚ単位の帯域分割データに写像す
る。このデータは、後に符号化されるデータとなる。一
方、ＦＦＴ演算部３では、サンプルデータの周波数分布
を分析するためにＦＦＴ分析を行なう。First, the subband encoder 2 maps an input signal into band division data in units of 750 Hz at predetermined time intervals (referred to as audio frames) by an overlap polyphase filter bank (OPFB). . This data is data to be encoded later. On the other hand, the FFT operation unit 3 performs FFT analysis in order to analyze the frequency distribution of the sample data.

【０００８】この結果は、マスク量演算部４に入力され
る。マスク量演算部４において、ＦＦＴ解析の結果は音
圧値［ｄＢ］に換算される。そして、計算された音圧値
から、他の周波数軸上の音圧成分を妨害するマスカ成分
を抽出し、その近傍の周波数位置ｉに及ぼすマスク量Ｌ
Ｔｍ（ｉ）を算出する。[0008] The result is input to the mask amount calculation unit 4. In the mask amount calculation unit 4, the result of the FFT analysis is converted into a sound pressure value [dB]. Then, from the calculated sound pressure value, a masker component that interferes with a sound pressure component on another frequency axis is extracted, and a mask amount L exerted on a nearby frequency position i is extracted.
Calculate Tm (i).

【０００９】上記マスカ抽出、マスク量演算処理を全帯
域にわたって行い、最終的に周波数位置ｉのマスク量は
静的聴覚限界値と複数のマスカによるマスク量との和で
与えられることになる。The above-described masker extraction and mask amount calculation processing are performed over the entire band, and finally the mask amount at the frequency position i is given by the sum of the static hearing limit value and the mask amounts by a plurality of maskers.

【００１０】マスク量演算部４で算出されたマスク量
は、ビット割り当て演算部５に出力される。ビット割り
当て演算部５の動作を図７のフローチャートと図８の概
念図を参照しながら説明する。ここでは、説明を簡単に
するために図８（ｃ）に示すように、全サブバンド数を
「８」とし、また、割り当て可能なビット数Ｂａｌｃを
３０００ビットと仮定している。The mask amount calculated by the mask amount calculator 4 is output to the bit allocation calculator 5. The operation of the bit allocation calculation unit 5 will be described with reference to the flowchart of FIG. 7 and the conceptual diagram of FIG. Here, for the sake of simplicity, it is assumed that the total number of subbands is “8” and the number of bits Balc that can be allocated is 3000 bits, as shown in FIG.

【００１１】ビット割り当て演算部５では、図８（ａ）
のようにサブバンド単位にマスク量の最小値を検出し、
最小マスク量ＬＴｍｉｎ［ｓｂ］とする（図７のステッ
プＳ７００）。ここで、サフィックスのｓｂは、サブバ
ンド番号の関数である。FIG. 8 (a)
, The minimum value of the mask amount is detected for each sub-band,
The minimum mask amount LTmin [sb] is set (step S700 in FIG. 7). Here, the suffix sb is a function of the subband number.

【００１２】また、サブバンドデータの最大値を最大音
圧値Ｌ［ｓｂ］として、この最大音圧値Ｌ［ｓｂ］と最
小マスク量ＬＴｍｉｎ［ｓｂ］との差分から、図８
（ｂ）に示した信号対マスク比を計算する（図７のステ
ップＳ７０１）。The maximum value of the sub-band data is defined as the maximum sound pressure value L [sb], and the difference between the maximum sound pressure value L [sb] and the minimum mask amount LTmin [sb] is calculated as shown in FIG.
The signal-to-mask ratio shown in (b) is calculated (step S701 in FIG. 7).

【００１３】さらに、この信号対マスク比［ｓｂ］と、
信号対雑音比ＳＮＲ（ｓｔｅｐ）との差をとることによ
り、図８（ｃ）のようなＭＮＲ（マスク対雑音比）を算
出する（図７のステップＳ７０２）。このＳＮＲ［ｓｔ
ｅｐ］は、予め定められている量子化ステップとＳＮＲ
（信号対雑音比）との関係を定量的に示したテーブルで
あり、図６に示したような関係になっている。Further, the signal-to-mask ratio [sb],
By taking the difference from the signal-to-noise ratio SNR (step), an MNR (mask-to-noise ratio) as shown in FIG. 8C is calculated (step S702 in FIG. 7). This SNR [st
ep] is a predetermined quantization step and SNR
FIG. 6 is a table quantitatively showing a relationship with (signal-to-noise ratio), and has a relationship as shown in FIG.

【００１４】なお、ＭＮＲ算出にあたっては、量子化ス
テップを与えない状態、すなわち、図６のｓｔｅｐを
「０」として計算するので、実質的にはＭＮＲ＝−ＳＭ
Ｒ［ｓｂ］…（１）としているにすぎない。上記ＭＮＲ値の例を図８（ｃ）
に示す。In the calculation of the MNR, no quantization step is applied, that is, since the step in FIG. 6 is calculated as "0", the MNR is substantially equal to -SM.
R [sb] (1) FIG. 8C shows an example of the above MNR value.
Shown in

【００１５】全てのサブバンドにおいてＭＮＲ計算が終
了した時点で、圧縮フォーマットで許されているヘッダ
情報以外の圧縮サンプルデータの最大割り当てビット可
能数Ｂａｌｃを設定する（図７のステップＳ７０３）。When the MNR calculation is completed for all the subbands, the maximum allocable bit number Balc of the compressed sample data other than the header information permitted in the compression format is set (step S703 in FIG. 7).

【００１６】このＢａｌｃは上記の仮定のとうり「３０
００」とする。そして、先ほど計算したＭＮＲ［ｓｂ］
の中で最小の値を検索する（図７のステップＳ７０
４）。そして、検索した結果を、ＭＮＲｍｉｎ［ＳＢ］
とする。さらに、ＭＮＲｍｉｎに対応するサブバンドの
量子化ステップを１段階上の値に変更する（図７のステ
ップＳ７０５）。これは、量子化ステップを小さくする
ことに相当する。This Balc is "30", based on the above assumption.
00 ”. Then, the MNR [sb] calculated earlier
Is searched for the minimum value (step S70 in FIG. 7).
4). Then, the search result is represented by MNRmin [SB].
And Further, the quantization step of the subband corresponding to MNRmin is changed to a value one step higher (step S705 in FIG. 7). This corresponds to reducing the quantization step.

【００１７】例えば、図８（ｃ）の例であれば、最小値
はサブバンドＳＢ１のＭＮＲ、すなわち、９０［ｄＢ］
となるので、現在「０」であるｓｔｅｐ［ＳＢ］を１段
向上させ、サブバンドＳＢ１についての量子化ステップ
を小さくする。For example, in the example of FIG. 8C, the minimum value is the MNR of the sub-band SB1, that is, 90 [dB].
Therefore, step [SB], which is currently “0”, is improved by one step, and the quantization step for the subband SB1 is reduced.

【００１８】量子化ステップを小さくしたことに対応し
て、これによって使用されるビット数を示す割り当て済
みビット数Ｂｕｓｅを計算する（図７のステップＳ７０
６）。Ｂｕｓｅは、量子化ステップとサブバンド単位の
使用ビット数との関数であり、図６に示されているよう
に、サブバンドあたりのサンプル数（３６サンプル）
と、割り当てビット数の積になっている。したがって、
例ではＳＢ１に７２ビットが割り当てられたことにな
る。In response to the reduction in the quantization step, an assigned bit number Buse indicating the number of bits used by the quantization step is calculated (step S70 in FIG. 7).
6). Buse is a function of the quantization step and the number of bits used per subband, and as shown in FIG. 6, the number of samples per subband (36 samples)
And the number of allocated bits. Therefore,
In the example, 72 bits are allocated to SB1.

【００１９】そして、上記割り当て可能なビット数Ｂａ
ｌｃと上記割り当て済みビット数Ｂｕｓｅの数値比較を
行い（図７のステップＳ７０７）、上記Ｂｕｓｅが上記
Ｂａｌｃに達していない場合はビット割り当て可能と判
断し、ビット割り当て量を保持しているバッファである
ＡＬＣ［ＳＢ］の内容を更新する（図７のステップＳ７
０８）。The number Ba of bits that can be allocated is
lc is compared with the number of allocated bits Buse (Step S707 in FIG. 7). If the Buse does not reach the Balc, it is determined that bit allocation is possible, and the buffer holds the bit allocation amount. Update the contents of ALC [SB] (Step S7 in FIG. 7)
08).

【００２０】そして、新しいＳＮＲ［ｓｔｅｐ［Ｓ
Ｂ］］値を用いて再度マスク対雑音比を計算し、ＭＮＲ
［ＳＢ］値を更新する（図７のステップＳ７０９）。Then, a new SNR [step [S
B]] value to calculate the mask-to-noise ratio again,
[SB] The value is updated (step S709 in FIG. 7).

【００２１】図８（ｃ）の斜線部は、ＳＮＲ［ｓｔｅｐ
［ＳＢ］］値を示していてＳＢ１のＭＮＲ値はその差分
となる。すなわち、図６から１段上の量子化ステップで
あるステップ“２”にすることで、ＳＮＲが「７ｄＢ」
となり、その結果、ＳＮＲ（７ｄＢ）一信号対マスク比
（９０ｄｂ）から新しいＭＮＲが「８３ｄＢ」が導かれ
る。その後、処理を図７のステップＳ７０３に戻し、最
小ＭＮＲを探索するループ処理を行う。The hatched portion in FIG. 8C indicates the SNR [step
[SB]], and the MNR value of SB1 is the difference. That is, by making the quantization step one step higher than that of FIG. 6 as “2”, the SNR becomes “7 dB”.
As a result, a new MNR “83 dB” is derived from the SNR (7 dB) one signal-to-mask ratio (90 db). Thereafter, the process returns to step S703 in FIG. 7, and a loop process for searching for the minimum MNR is performed.

【００２２】図８に示している例では、ＳＢ２の「−８
７ｄＢ」がＭＮＲｍｉｎとして判定され、上記に示した
処理が繰り返される。そして、図７のステップＳ７０７
で、ビット割り当てが不可能と判断された時、すなわ
ち、割り当て可能なビット数Ｂａｌｃ＜割り当て済みビット
数Ｂｕｓｅとなった時点でループ処理を終了し、ビット割り当て演
算処理を終える。In the example shown in FIG. 8, SB-8 "-8"
7 dB "is determined as MNRmin, and the processing described above is repeated. Then, step S707 in FIG.
When it is determined that bit allocation is impossible, that is, when the number of allocable bits Balc <the number of allocated bits Buse, the loop processing ends, and the bit allocation calculation processing ends.

【００２３】図８の例では、（ｄ）に示すように、ビッ
ト割り当てＡＬＣ及びＮＭＲが定まった時点でＢｕｓｅ
が「２９８８ビット」となり、これ以上割り当てが不能
となるので、図７のステップＳ７０７にてビット割り当
て量演算を終了する。In the example of FIG. 8, as shown in (d), when the bit allocation ALC and NMR are determined, the Buse
Becomes "2988 bits" and cannot be allocated any more, and the bit allocation amount calculation ends in step S707 in FIG.

【００２４】そして、このビット割り当て演算処理部５
の結果であるビット割り当て量を保持しているバッファ
であるＡＬＣ［ｓｂ］をもとにして、データ量子化部６
でサブバンドデータの量子化を行い、伝送フォーマット
形成部７でビット割り当て情報などのサイド情報ととも
に量子化データを所定のスタイルにフォーマットし、圧
縮データ出力端８に出力する。このような圧縮処理は、
一般的にＤＳＰ等を利用しでソフト的な処理によって具
現化される。The bit allocation operation processing unit 5
Based on ALC [sb], which is a buffer holding the bit allocation amount resulting from
Then, the sub-band data is quantized, and the transmission format forming unit 7 formats the quantized data together with side information such as bit allocation information in a predetermined style, and outputs it to the compressed data output terminal 8. Such a compression process
Generally, it is realized by software processing using a DSP or the like.

【００２５】[0025]

【発明が解決しようとする課題】しかしながら、上記従
来例によると、ビット割り当て量演算の際に、１つのサ
ブバンドの量子化ステップを１段階ずつ小さくしていく
度に全てのサブバンドから最小ＭＮＲを改めて検索する
ようにしていた。However, according to the above-mentioned prior art, when calculating the bit allocation amount, every time the quantization step of one sub-band is reduced by one step, the minimum MNR is reduced from all the sub-bands. Was searched again.

【００２６】例えば、ＩＳＯ／ＩＥＣ１１７２−３、い
わゆるＭＰＥＧオーディオ圧縮方式のレイア２，３８４
ｋｂｐｓにおいては、全対象サブバンドについておおむ
ね２５０段階量子化ステップを振り分けることができる
ので、この検索処理が３００ステップ必要としたときに
はこの検索処理だけで「７５ｋ処理」という、例えば３
０ＭＨｚで動作するＤＳＰを用いた場合には「２４ｍ
ｓ」のオーディオフレームに対して約「２．５ｍｓ」の
処理時間が必用であった。For example, ISO / IEC1172-3, a so-called MPEG audio compression layer 2,384
In kbps, approximately 250 quantization steps can be assigned to all the target subbands. Therefore, when this search process requires 300 steps, the search process alone requires a “75k process”.
When a DSP operating at 0 MHz is used, “24 m
A processing time of about "2.5 ms" was required for the audio frame "s".

【００２７】したがって、従来は、オーディオ信号を圧
縮処理するのに、膨大な演算処理時間がかかるという問
題があった。本発明は上述の問題点にかんがみ、オーデ
ィオ信号を圧縮処理するのに必要な演算処理時間を短縮
できるようにすることを目的としている。Therefore, conventionally, there has been a problem that it takes an enormous amount of processing time to compress the audio signal. SUMMARY OF THE INVENTION The present invention has been made in view of the above-described problems, and has as its object to reduce the processing time required for compressing an audio signal.

【００２８】[0028]

【課題を解決するための手段】本発明のディジタルオー
ディオ信号圧縮装置は、ディジタルオーディオ信号を圧
縮するディジタルオーディオ信号圧縮装置において、各
サブバンドにおけるマスク対雑音比からマスク対雑音比
向上指標値を求め、上記求めたマスク対雑音比向上指標
値を超えるまで上記各サブバンド毎にビット割り当て処
理を行うことを特徴としている。また、本発明のディジ
タルオーディオ信号圧縮装置の他の特徴とするところ
は、ディジタルオーディオ信号を入力する入力手段と、
上記入力手段を介して入力されたディジタルオーディオ
信号を直交変換する直交変換手段と、上記直交変換手段
の結果であるパワースペクトラムから、音声のマスキン
グ効果に基づいてマスク量を計算するマスク量計算手段
と、上記パワースペクトラム、もしくは上記ディジタル
オーディオ信号を上記直交変換手段とは異なる手法で周
波数軸に写像した写像信号のいずれかと、上記マスク量
計算手段により得られたマスク量との比を予め定められ
た周波数数幅単位で計算する信号対マスク比演算手段
と、所定のビット割り当て量に対するＳＮ比を定義する
信号対雑音比定義手段と、上記所定のビット割り当て量
を変更する毎に上記信号対雑音比定義手段の結果を更新
し、上記信号対マスク比演算手段の結果とのマスク対雑
音比の演算を行う繰り返し演算手段と、上記繰り返し演
算手段の結果により上記予め定められた周波数数幅単位
で圧縮データのビット割り当て数を決定するビット割り
当て手段とを有するディジタルオーディオ信号圧縮装置
において、上記予め定められた周波数数幅単位で共通な
予め定められたビット割り当て目標値を設定するビット
割り当て目標値設定手段と、上記信号対雑音比定義手段
の結果が上記ビット割り当て目標値よりも大きい最小値
になる値を選択する選択手段と、上記選択手段の結果を
利用してビット割り当て演算を行うビット割り当て演算
手段とを具備することを特徴としている。また、本発明
のディジタルオーディオ信号圧縮装置のその他の特徴と
するところは、ディジタルオーディオ信号を入力する入
力手段と、入力された上記ディジタルオーディオ信号を
オーディオフレームなる予め定められた期間単位に直交
変換する直交変換手段と、上記直交変換手段により得ら
れたディジタルオーディオ信号のパワースペクトラムか
ら音声のマスキング効果に基づいて予め定められた周波
数数幅単位であるサブバンド毎にマスク量を計算するた
めのマスク量計算手段と、上記パワースペクトラムもし
くは上記ディジタルオーディオ信号を上記直交変換手段
とは異なる手法で周波数軸に写像した写像信号のいずれ
かと上記マスク量計算手段により得られたマスク量との
差分を上記サブバンド毎に計算し、上記オーディオフレ
ームにおける信号対マスク比を求める信号対マスク比演
算手段と、予め定められている量子化ステップに対する
信号対雑音比をテーブルとして格納している信号対雑音
比記憶手段と、上記信号対雑音比記憶手段の出力である
上記信号対雑音比と上記信号対マスク比演算手段により
得られた信号対マスク比との差分演算を上記サブバンド
毎に行いマスク対雑音比を求めるマスク対雑音比演算手
段と、上記マスク対雑音比演算手段により得られたマス
ク対雑音比と上記信号対雑音比と、オーディオ信号圧縮
の際に、上記オーディオフレームに割り当て可能なビッ
トを用いて上記サブバンド毎の圧縮データのビット割り
当て数を決定するビット割り当て手段を有するディジタ
ルオーディオ信号圧縮装置において、マスク対雑音比演
算手段により得られたマスク対雑音比について、所定の
サブバンドにおける平均値を計算するマスク対雑音比平
均値演算手段を有し、上記ビット割り当て手段では上記
所定のサブバンドごとの上記マスク対雑音比と上記マス
ク対雑音比平均値と上記信号対雑音比、オーディオ信号
圧縮の際に上記オーディオフレームに割り当て可能なビ
ットを用いて上記サブバンド毎の圧縮データのビット割
り当て数を決定することを特徴としている。また、本発
明のディジタルオーディオ信号圧縮装置のその他の特徴
とするところは、上記サブバンドの１つについて、上記
マスク対雑音比演算手段において上記マスク対雑音比平
均値を超えるまで上記量子化ステップを細かくしていく
ように上記マスク対雑音比を更新する繰り返しマスク対
雑音比演算手段を有し、その後、他の上記サブバンドに
ついて同様の演算を繰り返すことを特徴としている。ま
た、本発明のディジタルオーディオ信号圧縮装置のその
他の特徴とするところは、上記サブバンドの１つについ
て、上記マスク対雑音比演算手段において上記マスク対
雑音比平均値を超える上記信号対雑音比を上記信号対雑
音比検索手段を用いて検索する信号対雑音比検索手段を
有し、その後他のサブバンドについて同様の演算を繰り
返すことを特徴としている。また、本発明のディジタル
オーディオ信号圧縮装置のその他の特徴とするところ
は、上記所定のサブバンドについて、上記マスク対雑音
比が上記マスク対雑音比平均値を超えたのち、上記サブ
バンドごとのマスク対雑音比と、上記マスク対雑音比の
平均値と、上記信号対雑音比と、オーディオ信号圧縮の
際に上記オーディオフレームに割り当て可能なビットと
を用いて上記サブバィド毎圧縮データのビット割り当て
数を決定することを特徴としている。また、本発明のデ
ィジタルオーディオ信号圧縮装置のその他の特徴とする
ところは、ディジタルオーディオ信号を入力する入力手
段と、入力された上記ディジタルオーディオ信号をオー
ディオフレームなる予め定められた期間単位に直交変換
する直交変換手段と、上記直交変換手段により得られた
ディジタルオーディオ信号のパワースペクトラムから音
声のマスキング効果に基づいて予め定められた周波数数
幅単位であるサブバンド毎にマスク量を計算するための
マスク量計算手段と、上記パワースペクトラムもしくは
上記ディジタルオーディオ信号を上記直交変換手段とは
異なる手法で周波数軸に写像した写像信号のいずれかと
上記マスク量計算手段により得られたマスク量との差分
を上記サブバンド毎に計算し上記オーディオフレームに
おける信号対マスク比を求める信号対マスク比演算手段
と、予め定められている量子化ステップに対する信号対
雑音比をテーブルとして格納している信号対雑音比記憶
手段と、上記信号対雑音比記憶手段の出力である上記信
号対雑音比と上記信号対マスク比演算手段により得られ
た信号対マスク比との差分演算を上記サブバンド毎に行
いマスク対雑音比を求めるマスク対雑音比演算手段と、
上記マスク対雑音比演算手段により得られたマスク対雑
音比と上記信号対雑音比と、オーディオ信号圧縮の際に
上記オーディオフレームに割り当て可能なビットを用い
て上記サブバンド毎の圧縮データのビット割り当て数を
決定するビット割り当て手段を有するディジタルオーデ
ィオ信号圧縮装置において、予めオーディオ圧縮の際
に、上記オーディオフレームに割り当て可能なビット数
及び上記量子化ステップ、上記信号対雑音比の関係から
上記オーディオフレームに割り当てられるマスク対雑音
比向上量を計算するマスク対雑音比向上量演算手段と、
上記サブバンドにおいて、上記マスク対雑音比演算手段
により得られたマスク対雑音比の総和値を計算するマス
ク対雑音比総和演算手段と、上記マスク対雑音比向上量
演算手段の結果であるマスク対雑音比向上量と上記マス
ク対雑音比総和演算手段により得られたマスク対雑音比
総和との和を求めるとともに、上記所定のサブバンド数
で割り算を行いビット割り当て目標値を求める目標値演
算手段を有することを特徴としている。また、本発明の
ディジタルオーディオ信号圧縮装置のその他の特徴とす
るところは、上記サブバンドの１つについて、上記マス
ク対雑音比演算手段において上記ビット割り当て目標値
を超える上記信号対雑音比を上記信号対雑音比検索手段
を用いて検索する信号対雑音比検索手段を有し、その後
他のサブバンドについて同様の演算を繰り返すことを特
徴としている。また、本発明のディジタルオーディオ信
号圧縮装置のその他の特徴とするところは、上記サブバ
ンドの１つにについて上記マスク対雑音比演算手段にお
いて上記ビット割り当て目標値を超えるまで上記量子化
ステップを細かくしていくように上記マスク対雑音比を
更新する繰り返しマスク対雑音比演算手段を有し、その
後他のサブバンドについて同様の演算を繰り返すことを
特徴としている。また、本発明のディジタルオーディオ
信号圧縮装置のその他の特徴とするところは、上記所定
のサブバンドについて、上記マスク対雑音比が上記ビッ
ト割り当て目標値を超えたのち、上記所定のサブバンド
ごとの上記マスク対雑音比と上記マスク対雑音比平均値
と上記信号対雑音比そして、オーディオ信号圧縮の際に
上記オーディオフレームに割り当て可能なビットを用い
て上記サブバンド毎圧縮データのビット割り当て数を決
定することを特徴としている。また、本発明のディジタ
ルオーディオ信号圧縮装置のその他の特徴とするところ
は、上記ビット割り当て手段は、ビット割り当てを必要
とする全てのサブバンドについてビット割り当てを行う
ことを特徴としている。A digital audio signal compressor according to the present invention is a digital audio signal compressor for compressing a digital audio signal, wherein a mask-to-noise ratio improvement index value is obtained from a mask-to-noise ratio in each subband. The bit allocation process is performed for each sub-band until the calculated mask-to-noise ratio improvement index value is exceeded. Another feature of the digital audio signal compression device of the present invention is that input means for inputting a digital audio signal,
Orthogonal transform means for orthogonally transforming the digital audio signal input via the input means; and a mask amount calculating means for calculating a mask amount based on a sound masking effect from a power spectrum as a result of the orthogonal transform means. The ratio between the power spectrum or the mapped signal obtained by mapping the digital audio signal on the frequency axis by a method different from that of the orthogonal transform means and the mask amount obtained by the mask amount calculating means is predetermined. Signal-to-mask ratio calculating means for calculating in units of frequency width, signal-to-noise ratio defining means for defining an S / N ratio for a predetermined bit allocation amount, and signal-to-noise ratio each time the predetermined bit allocation amount is changed Updating the result of the defining means and calculating the mask-to-noise ratio with the result of the signal-to-mask ratio calculating means; A digital audio signal compression apparatus comprising: a return operation unit; and a bit allocation unit that determines a bit allocation number of compressed data in the predetermined frequency width unit based on a result of the repetition operation unit. A bit allocation target value setting means for setting a common predetermined bit allocation target value for each unit of width, and selecting a value at which a result of the signal-to-noise ratio defining means is a minimum value larger than the bit allocation target value. And a bit allocation operation means for performing a bit allocation operation using the result of the selection means. Another feature of the digital audio signal compression device of the present invention is that input means for inputting a digital audio signal and orthogonally transform the input digital audio signal into a predetermined period unit of an audio frame. Orthogonal transform means, and a mask amount for calculating a mask amount for each subband, which is a predetermined frequency band unit, based on a sound masking effect from a power spectrum of the digital audio signal obtained by the orthogonal transform means Calculating means for calculating the difference between the power spectrum or the mapped signal obtained by mapping the digital audio signal on the frequency axis by a method different from the orthogonal transform means and the mask amount obtained by the mask amount calculating means; Calculate each audio frame above Signal-to-mask ratio calculating means for obtaining a signal-to-mask ratio in the memory, signal-to-noise ratio storing means for storing a signal-to-noise ratio for a predetermined quantization step as a table, and the signal-to-noise ratio storing means Mask-to-noise ratio calculating means for performing a difference calculation between the signal-to-noise ratio, which is the output of the signal, and the signal-to-mask ratio obtained by the signal-to-mask ratio calculating means, for each subband to obtain a mask-to-noise ratio; The mask-to-noise ratio obtained by the mask-to-noise ratio calculation means, the signal-to-noise ratio, and the bits of the compressed data for each sub-band using bits that can be assigned to the audio frame during audio signal compression. In a digital audio signal compression device having bit allocation means for deciding the number of allocations, the digital audio signal compression means A mask-to-noise ratio average value calculating means for calculating an average value of the mask-to-noise ratio in a predetermined sub-band, wherein the bit allocating means includes the mask-to-noise ratio and the mask-to-noise ratio for each of the predetermined sub-bands. It is characterized in that the number of bits of compressed data for each subband is determined by using a ratio average value, the signal-to-noise ratio, and bits that can be allocated to the audio frame during audio signal compression. Another feature of the digital audio signal compression apparatus of the present invention is that, for one of the sub-bands, the mask-to-noise ratio calculating means performs the quantization step until the mask-to-noise ratio average value is exceeded. It is characterized by having a repetitive mask-to-noise ratio calculating means for updating the mask-to-noise ratio so as to make it finer, and thereafter repeating the same calculation for the other sub-bands. Another feature of the digital audio signal compression apparatus of the present invention is that, for one of the subbands, the mask-to-noise ratio calculating means sets the signal-to-noise ratio exceeding the mask-to-noise ratio average value. There is provided a signal-to-noise ratio search means for searching using the signal-to-noise ratio search means, and thereafter, the same operation is repeated for other subbands. Another feature of the digital audio signal compression apparatus of the present invention is that, for the predetermined subband, after the mask-to-noise ratio exceeds the mask-to-noise ratio average value, the mask for each of the subbands The number of bits of the compressed data for each sub-band is determined by using the noise-to-noise ratio, the average value of the mask-to-noise ratio, the signal-to-noise ratio, and the bits that can be allocated to the audio frame during audio signal compression. It is characterized by the decision. Another feature of the digital audio signal compression device of the present invention is that input means for inputting a digital audio signal and orthogonally transform the input digital audio signal into a predetermined period unit of an audio frame. Orthogonal transform means, and a mask amount for calculating a mask amount for each subband, which is a predetermined frequency band unit, based on a sound masking effect from a power spectrum of the digital audio signal obtained by the orthogonal transform means Calculating means for calculating the difference between the power spectrum or the mapped signal obtained by mapping the digital audio signal on the frequency axis by a method different from the orthogonal transform means and the mask amount obtained by the mask amount calculating means; Calculated for each audio frame Signal-to-mask ratio calculating means for obtaining a signal-to-mask ratio in the memory, signal-to-noise ratio storing means for storing a signal-to-noise ratio for a predetermined quantization step as a table, and the signal-to-noise ratio storing means Mask-to-noise ratio calculating means for performing a difference calculation between the signal-to-noise ratio, which is the output of the signal, and the signal-to-mask ratio obtained by the signal-to-mask ratio calculating means, for each subband to obtain a mask-to-noise ratio;
Bit allocation of compressed data for each sub-band by using the mask-to-noise ratio and the signal-to-noise ratio obtained by the mask-to-noise ratio calculating means, and the bits that can be allocated to the audio frame during audio signal compression In a digital audio signal compression apparatus having bit allocation means for determining the number, the number of bits that can be allocated to the audio frame, the quantization step, and the relationship between the signal-to-noise ratio and the relationship between the signal-to-noise ratio in the audio compression in advance Mask-to-noise ratio improvement amount calculating means for calculating an assigned mask-to-noise ratio improvement amount;
In the sub-band, a mask-to-noise ratio sum calculating means for calculating a total value of the mask-to-noise ratio obtained by the mask-to-noise ratio calculating means, and a mask pair which is a result of the mask-to-noise ratio improvement amount calculating means. A target value calculating means for calculating the sum of the noise ratio improvement amount and the mask-to-noise ratio total obtained by the mask-to-noise ratio total calculating means, and dividing by the predetermined number of sub-bands to obtain a bit allocation target value. It is characterized by having. Another feature of the digital audio signal compression apparatus of the present invention is that, for one of the sub-bands, the mask-to-noise ratio calculating means sets the signal-to-noise ratio exceeding the bit allocation target value to the signal. It is characterized by having signal-to-noise ratio search means for searching using the noise-to-noise ratio search means, and thereafter repeating the same calculation for other subbands. Another feature of the digital audio signal compression apparatus of the present invention is that the mask-to-noise ratio calculation means makes the quantization step finer for one of the sub-bands until the bit allocation target value is exceeded. And a repetitive mask-to-noise ratio calculating means for updating the mask-to-noise ratio so that the same calculation is repeated for the other subbands. Another feature of the digital audio signal compression device of the present invention is that, for the predetermined subband, after the mask-to-noise ratio exceeds the bit allocation target value, The mask-to-noise ratio, the mask-to-noise ratio average value, the signal-to-noise ratio, and the number of bits to be assigned to the compressed data for each sub-band are determined by using bits that can be assigned to the audio frame during audio signal compression. It is characterized by: Another feature of the digital audio signal compression device of the present invention is that the bit allocation means performs bit allocation for all subbands requiring bit allocation.

【００２９】本発明のディジタルオーディオ信号圧縮方
法は、ディジタルオーディオ信号を圧縮するディジタル
オーディオ信号圧縮方法において、各サブバンドにおけ
るマスク対雑音比からマスク対雑音比向上指標値を求
め、上記求めたマスク対雑音比向上指標値を超えるまで
上記各サブバンド毎にビット割り当て処理を行うことを
特徴としている。また、本発明のディジタルオーディオ
信号圧縮方法の他の特徴とするところは、ディジタルオ
ーディオ信号を入力する入力処理と、入力された上記デ
ィジタルオーディオ信号を直交変換する直交変換処理
と、上記直交変換処理の結果であるパワースペクトラム
から音声のマスキング効果に基づいてマスク量を計算す
るためのマスク量計算処理と、上記パワースペクトラ
ム、もしくは上記ディジタルオーディオ信号を上記直交
変換処理とは異なる手法で周波数軸に写像した写像信号
のいずれかと、上記マスク量計算処理により得られたマ
スク量との比を予め定められた周波数数幅単位で計算す
る信号対マスク比演算処理と、所定のビット割り当て量
に対するＳＮ比を定義する信号対雑音比定義処理と、上
記所定のビット割り当て量を変更する毎に上記信号対雑
音比定義処理の結果を更新し、上記信号対マスク比演算
処理の結果とのマスク対雑音比の演算を行う繰り返し演
算処理と、上記繰り返し演算処理の結果により上記予め
定められた周波数数幅単位で圧縮データのビット割り当
て数を決定するビット割り当て処理を行うディジタルオ
ーディオ信号圧縮方法において、上記予め定められた周
波数数幅単位で共通な予め定められたビット割り当て目
標値を設定するビット割り当て目標値設定処理と、上記
信号対雑音比定義処理の結果が上記ビット割り当て目標
値設定処理により得られたビット割り当て目標値よりも
大きい最小値になる値を選択する選択処理と、上記選択
処理の結果を利用してビット割り当て演算とを行うこと
を特徴としている。また、本発明のディジタルオーディ
オ信号圧縮方法のその他の特徴とするところは、ディジ
タルオーディオ信号を入力する入力処理と、入力された
上記ディジタルオーディオ信号をオーディオフレームな
る予め定められた期間単位に直交変換する直交変換処理
と、上記直交変換処理により得られたディジタルオーデ
ィオ信号のパワースペクトラムから音声のマスキング効
果に基づいて予め定められた周波数数幅単位であるサブ
バンド毎にマスク量を計算するためのマスク量計算処理
と、上記パワースペクトラムもしくは上記ディジタルオ
ーディオ信号を上記直交変換処理とは異なる手法で周波
数軸に写像した写像信号のいずれかと上記マスク量計算
処理により得られたマスク量との差分を上記サブバンド
毎に計算し、上記オーディオフレームにおける信号対マ
スク比を求める信号対マスク比演算処理と、予め定めら
れている量子化ステップに対する信号対雑音比をテーブ
ルとして格納している信号対雑音比記憶処理と、上記信
号対雑音比記憶処理の出力である上記信号対雑音比と上
記信号対マスク比演算処理により得られた信号対マスク
比との差分演算を上記サブバンド毎に行いマスク対雑音
比を求めるマスク対雑音比演算処理と、上記マスク対雑
音比演算処理により得られたマスク対雑音比と上記信号
対雑音比と、オーディオ信号圧縮の際に、上記オーディ
オフレームに割り当て可能なビットを用いて上記サブバ
ンド毎の圧縮データのビット割り当て数を決定するビッ
ト割り当て処理を行うディジタルオーディオ信号圧縮方
法において、マスク対雑音比演算処理により得られたマ
スク対雑音比について、所定のサブバンドにおいての平
均値を計算するマスク対雑音比平均値演算処理を有し、
上記ビット割り当て処理では上記所定のサブバンドごと
の上記マスク対雑音比と上記マスク対雑音比平均値と上
記信号対雑音比、オーディオ信号圧縮の際に上記オーデ
ィオフレームに割り当て可能なビットを用いて上記サブ
バンド毎の圧縮データのビット割り当て数を決定するこ
とを特徴としている。また、本発明のディジタルオーデ
ィオ信号圧縮方法のその他の特徴とするところは、上記
サブバンドの１つについて、上記マスク対雑音比演算処
理において上記マスク対雑音比平均値を超えるまで上記
量子化ステップを細かくしていくように上記マスク対雑
音比を更新する繰り返しマスク対雑音比演算処理を有
し、その後、他のサブバンドについて同様の演算を繰り
返すことを特徴としている。また、本発明のディジタル
オーディオ信号圧縮方法のその他の特徴とするところ
は、上記サブバンドの１つについて、上記マスク対雑音
比演算処理において上記マスク対雑音比平均値を超える
上記信号対雑音比を上記信号対雑音比検索処理を用いて
検索する信号対雑音比検索処理を有し、その後他のサブ
バンドについて同様の演算を繰り返すことを特徴として
いる。また、本発明のディジタルオーディオ信号圧縮方
法のその他の特徴とするところは、上記所定のサブバン
ドについて上記マスク対雑音比が上記マスク対雑音比平
均値を超えたのち、上記サブバンドごとのマスク対雑音
比と、上記マスク対雑音比の平均値と、上記信号対雑音
比と、オーディオ信号圧縮の際に上記オーディオフレー
ムに割り当て可能なビットとを用いて上記サブバィド毎
圧縮データのビット割り当て数を決定することを特徴と
している。また、本発明のディジタルオーディオ信号圧
縮方法のその他の特徴とするところは、ディジタルオー
ディオ信号を入力する入力処理と、入力された上記ディ
ジタルオーディオ信号をオーディオフレームなる予め定
められた期間単位に直交変換する直交変換処理と、上記
直交変換処理により得られたディジタルオーディオ信号
のパワースペクトラムから音声のマスキング効果に基づ
いて予め定められた周波数数幅単位であるサブバンド毎
にマスク量を計算するためのマスク量計算処理と、上記
パワースペクトラムもしくは上記ディジタルオーディオ
信号を上記直交変換処理とは異なる手法で周波数軸に写
像した写像信号のいずれかと上記マスク量計算処理によ
り得られたマスク量との差分を上記サブバンド毎に計算
し上記オーディオフレームにおける信号対マスク比を求
める信号対マスク比演算処理と、予め定められている量
子化ステップに対する信号対雑音比をテーブルとして格
納している信号対雑音比記憶処理と、上記信号対雑音比
記憶処理の出力である上記信号対雑音比と上記信号対マ
スク比演算処理により得られた信号対マスク比との差分
演算を上記サブバンド毎に行いマスク対雑音比を求める
マスク対雑音比演算処理と、上記マスク対雑音比演算処
理により得られたマスク対雑音比と上記信号対雑音比
と、オーディオ信号圧縮の際に上記オーディオフレーム
に割り当て可能なビットを用いて上記サブバンド毎の圧
縮データのビット割り当て数を決定するビット割り当て
処理を行うディジタルオーディオ信号圧縮方法におい
て、予めオーディオ圧縮の際に、上記オーディオフレー
ムに割り当て可能なビット数及び上記量子化ステップ、
上記信号対雑音比の関係から上記オーディオフレームに
割り当てられるマスク対雑音比向上量を計算するマスク
対雑音比向上量演算処理と、上記サブバンドにおいて上
記マスク対雑音比演算処理により得られたマスク対雑音
比の総和値を計算するマスク対雑音比総和演算処理と、
上記マスク対雑音比向上量演算処理の結果であるマスク
対雑音比向上量と上記マスク対雑音比総和演算処理によ
り得られたマスク対雑音比総和との和を求めるととも
に、上記サブバンド数で割り算を行い、ビット割り当て
目標値を求める目標値演算処理を行うことを特徴として
いる。また、本発明のディジタルオーディオ信号圧縮方
法のその他の特徴とするところは、上記サブバンドの１
つについて、上記マスク対雑音比演算処理において上記
ビット割り当て目標値を超える上記信号対雑音比を上記
信号対雑音比検索処理を用いて検索する信号対雑音比検
索処理を有し、その後他のサブバンドについて同様の演
算を繰り返すことを特徴としている。また、本発明のデ
ィジタルオーディオ信号圧縮方法のその他の特徴とする
ところは、上記サブバンドの１つにについて上記マスク
対雑音比演算処理において上記ビット割り当て目標値を
超えるまで上記量子化ステップを細かくしていくように
上記マスク対雑音比を更新する繰り返しマスク対雑音比
演算処理を有し、その後他のサブバンドについて同様の
演算を繰り返すことを特徴としている。また、本発明の
ディジタルオーディオ信号圧縮方法のその他の特徴とす
るところは、上記所定のサブバンドについて、上記マス
ク対雑音比が上記ビット割り当て目標値を超えたのち、
上記サブバンドごとの上記マスク対雑音比と上記マスク
対雑音比平均値と上記信号対雑音比そして、オーディオ
信号圧縮の際に上記オーディオフレームに割り当て可能
なビットを用いて上記サブバンド毎圧縮データのビット
割り当て数を決定することを特徴としている。また、本
発明のディジタルオーディオ信号圧縮方法のその他の特
徴とするところは、上記ビット割り当て処理は、ビット
割り当てを必要とする全てのサブバンドについてビット
割り当てを行うことを特徴としている。According to the digital audio signal compression method of the present invention, in the digital audio signal compression method for compressing a digital audio signal, a mask-to-noise ratio improvement index value is obtained from a mask-to-noise ratio in each subband, and the obtained mask pair It is characterized in that bit allocation processing is performed for each of the above subbands until the noise ratio improvement index value is exceeded. Further, other features of the digital audio signal compression method of the present invention include an input process of inputting a digital audio signal, an orthogonal transform process of orthogonally transforming the input digital audio signal, and an orthogonal transform process of the orthogonal transform process. A mask amount calculation process for calculating a mask amount based on a sound masking effect from a power spectrum as a result, and the power spectrum or the digital audio signal is mapped to a frequency axis by a method different from the orthogonal transformation process. A signal-to-mask ratio calculation process for calculating the ratio between any of the mapping signals and the mask amount obtained by the mask amount calculation process in a predetermined frequency width unit, and defining an SN ratio for a predetermined bit allocation amount Signal-to-noise ratio definition processing, and every time the predetermined bit allocation amount is changed. Updating the result of the signal-to-noise ratio defining process and performing a mask-to-noise ratio calculation with the result of the signal-to-mask ratio calculating process; and In a digital audio signal compression method for performing a bit allocation process for determining a bit allocation number of compressed data in a unit of a number of bits, a bit allocation for setting a predetermined predetermined bit allocation target value common to the unit of a predetermined frequency width A target value setting process, a selection process of selecting a value at which a result of the signal-to-noise ratio definition process is a minimum value larger than a bit allocation target value obtained by the bit allocation target value setting process, It is characterized in that a bit allocation operation is performed using the result. Other features of the digital audio signal compression method of the present invention include an input process of inputting a digital audio signal, and an orthogonal transformation of the input digital audio signal into a predetermined period unit of an audio frame. An orthogonal transform process, and a mask amount for calculating a mask amount for each subband, which is a predetermined frequency width unit, based on a sound masking effect from the power spectrum of the digital audio signal obtained by the orthogonal transform process Calculating the difference between one of the power spectrum or the mapped signal obtained by mapping the digital audio signal on the frequency axis by a method different from the orthogonal transform processing and the mask amount obtained by the mask amount calculating process, Calculate each audio frame above Signal-to-mask ratio calculation processing for obtaining a signal-to-mask ratio in the memory, signal-to-noise ratio storage processing for storing a signal-to-noise ratio for a predetermined quantization step as a table, and the signal-to-noise ratio storage processing A mask-to-noise ratio calculation process of obtaining a mask-to-noise ratio by performing a difference calculation between the signal-to-noise ratio and the signal-to-mask ratio obtained by the signal-to-mask ratio calculation process for each subband, The mask-to-noise ratio and the signal-to-noise ratio obtained by the mask-to-noise ratio calculation processing, and the bits of the compressed data for each sub-band using bits that can be assigned to the audio frame during audio signal compression. In a digital audio signal compression method for performing bit allocation processing for determining the number of allocations, a mask obtained by a mask-to-noise ratio calculation processing. For click-to-noise ratio, it has a mask-to-noise ratio average value calculation process that calculates the average value of the predetermined sub-band,
In the bit allocation process, the mask-to-noise ratio, the mask-to-noise ratio average value, and the signal-to-noise ratio for each of the predetermined sub-bands, using the bits that can be assigned to the audio frame during audio signal compression, It is characterized in that the number of bits of compressed data for each subband is determined. Another feature of the digital audio signal compression method of the present invention is that, for one of the subbands, the quantization step is performed in the mask-to-noise ratio calculation process until the mask-to-noise ratio average value is exceeded. It is characterized in that it has a repetitive mask-to-noise ratio calculation process for updating the mask-to-noise ratio so as to make it finer, and then repeats the same calculation for other subbands. Another feature of the digital audio signal compression method of the present invention is that, for one of the sub-bands, the signal-to-noise ratio exceeding the mask-to-noise ratio average value in the mask-to-noise ratio calculation processing. It has a signal-to-noise ratio search process for searching using the above-described signal-to-noise ratio search process, and thereafter repeats the same calculation for other subbands. Another feature of the digital audio signal compression method of the present invention is that, after the mask-to-noise ratio exceeds the mask-to-noise ratio average value for the predetermined subband, the mask pair for each subband is set. Using the noise ratio, the average value of the mask-to-noise ratio, the signal-to-noise ratio, and the bits that can be allocated to the audio frame when compressing the audio signal, determine the number of bits to be allocated to the compressed data for each sub-band. It is characterized by doing. Other features of the digital audio signal compression method of the present invention include an input process of inputting a digital audio signal, and an orthogonal transformation of the input digital audio signal into a predetermined period unit of an audio frame. An orthogonal transform process, and a mask amount for calculating a mask amount for each subband, which is a predetermined frequency width unit, based on a sound masking effect from the power spectrum of the digital audio signal obtained by the orthogonal transform process Calculating the difference between one of the power spectrum or the mapped signal obtained by mapping the digital audio signal on the frequency axis by a method different from the orthogonal transform processing and the mask amount obtained by the mask amount calculating process, Calculated for each audio frame Signal-to-mask ratio calculation processing for obtaining a signal-to-mask ratio in the memory, signal-to-noise ratio storage processing for storing a signal-to-noise ratio for a predetermined quantization step as a table, and signal-to-noise ratio storage processing A mask-to-noise ratio calculation process of obtaining a mask-to-noise ratio by performing a difference calculation between the signal-to-noise ratio and the signal-to-mask ratio obtained by the signal-to-mask ratio calculation process for each subband, Bit allocation of compressed data for each sub-band using the mask-to-noise ratio obtained by the mask-to-noise ratio calculation process, the signal-to-noise ratio, and bits that can be allocated to the audio frame during audio signal compression In a digital audio signal compression method for performing a bit allocation process for determining the number of audio signals, the above-described audio file The number of possible allocated bits to over arm and the quantization step,
A mask-to-noise ratio improvement amount calculation process for calculating a mask-to-noise ratio improvement amount assigned to the audio frame from the signal-to-noise ratio relationship; and a mask pair obtained by the mask-to-noise ratio calculation process in the subband. Mask-to-noise ratio sum calculation processing for calculating the sum of the noise ratios;
The sum of the mask-to-noise ratio improvement amount obtained as a result of the mask-to-noise ratio improvement amount calculation process and the mask-to-noise ratio total sum obtained by the mask-to-noise ratio total calculation process is obtained, and divided by the number of subbands. And performs a target value calculation process for obtaining a bit allocation target value. Another feature of the digital audio signal compression method according to the present invention is that the sub-band 1
And a signal-to-noise ratio search process for searching the signal-to-noise ratio exceeding the bit allocation target value using the signal-to-noise ratio search process in the mask-to-noise ratio calculation process. It is characterized in that the same calculation is repeated for the band. Another feature of the digital audio signal compression method of the present invention is that, in one of the sub-bands, the quantization step is finely performed in the mask-to-noise ratio calculation process until the bit allocation target value is exceeded. And a repetitive mask-to-noise ratio calculation process for updating the mask-to-noise ratio so that the same calculation is repeated for the other subbands. Another feature of the digital audio signal compression method of the present invention is that, for the predetermined subband, after the mask-to-noise ratio exceeds the bit allocation target value,
The mask-to-noise ratio, the mask-to-noise ratio average value, the signal-to-noise ratio, and the bits that can be assigned to the audio frame at the time of audio signal compression for each of the subbands. It is characterized in that the number of allocated bits is determined. Another feature of the digital audio signal compression method of the present invention is that, in the bit allocation processing, bit allocation is performed for all subbands requiring bit allocation.

【００３０】本発明の記憶媒体は、上記オーディオ信号
圧縮装置を構成する各手段としてコンピュータを機能さ
せるためのプログラムをコンピュータから読み出し可能
に格納したことを特徴としている。また、本発明の記憶
媒体の他の特徴とするところは、上記オーディオ信号圧
縮方法の手順をコンピュータに実行させるためのプログ
ラムをコンピュータから読み出し可能に格納したことを
特徴としている。[0030] The storage medium of the present invention is characterized in that a program for causing a computer to function as each means constituting the audio signal compression device is stored so as to be readable from the computer. Another feature of the storage medium of the present invention is that a program for causing a computer to execute the procedure of the audio signal compression method is stored so as to be readable from the computer.

【００３１】[0031]

【作用】本発明は上記技術手段を有するので、ディジタ
ルオーディオ信号を圧縮するディジタルオーディオ信号
圧縮方法において、各サブバンドにおけるマスク対雑音
比からマスク対雑音比向上指標値が求められ、上記求め
られたマスク対雑音比向上指標値を超えるまで、上記各
サブバンド毎にビット割り当て処理が行われることによ
り、最小マスク対雑音比演算処理を省略することが可能
となる。Since the present invention has the above technical means, in a digital audio signal compression method for compressing a digital audio signal, a mask-to-noise ratio improvement index value is obtained from a mask-to-noise ratio in each subband. Until the mask-to-noise ratio improvement index value is exceeded, the bit allocation process is performed for each of the subbands, thereby making it possible to omit the minimum mask-to-noise ratio calculation process.

【００３２】[0032]

【発明の実施の形態】［第１の実施の形態］図１は、本
実施の形態のビット割り当て演算に関するフローチャー
トである。また、図２はブロック図、図３は第１の実施
の形態の動作概念図である。以下、図１〜図３を参照し
ながら本実施の形態の動作について説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS [First Embodiment] FIG. 1 is a flowchart relating to a bit allocation operation according to the present embodiment. FIG. 2 is a block diagram, and FIG. 3 is an operation conceptual diagram of the first embodiment. Hereinafter, the operation of the present embodiment will be described with reference to FIGS.

【００３３】図２において、１はデジタルオーディオ入
力端、２はデジタルオーディオ入力端１からのディジタ
ルオーディオ信号を所定の時間間隔に分割し、複数の周
波数帯域に写像するサブバンド符号化部、３はサブバン
ド符号化部２と対応した期間のデジタルオーディオ入力
端１からのディジタルオーディオ信号をＦＦＴ解析する
ＦＦＴ演算部である。In FIG. 2, 1 is a digital audio input terminal, 2 is a sub-band encoding unit that divides the digital audio signal from the digital audio input terminal 1 into predetermined time intervals, and maps them to a plurality of frequency bands. An FFT operation unit that performs FFT analysis on the digital audio signal from the digital audio input terminal 1 during a period corresponding to the subband encoding unit 2.

【００３４】４はＦＦＴ演算部３の結果を音圧に換算
し、マスク量を演算するマスク量演算部であり、５はマ
スク量演算部４で算出したマスク量を用いて各サブバン
ドに割り当てるビット数を見積もるビット割り当て演算
部であり、６はビット割り当て演算部５で割り当てられ
たビット数に対応してサブバンド符号化部２からの出力
であるサブバンド信号を量子化するデータ量子化部であ
る。Reference numeral 4 denotes a mask amount calculation unit for converting the result of the FFT calculation unit 3 into sound pressure and calculating a mask amount. Reference numeral 5 designates a mask amount calculated by the mask amount calculation unit 4, which is assigned to each subband. A bit allocation operation unit for estimating the number of bits, and a data quantization unit 6 for quantizing a subband signal output from the subband encoding unit 2 in accordance with the number of bits allocated by the bit allocation operation unit 5 It is.

【００３５】そして、７はデータ量子化部６の出力であ
る量子化データや、ビット割り当て演算部５からのビッ
ト割り当て情報等を所定の伝送フォーマットにフォーマ
ットする伝送フォーマット形成部であり、８は圧縮デー
タ出力端である。以下に、圧縮動作について、説明す
る。Reference numeral 7 denotes a transmission format forming unit for formatting the quantized data output from the data quantization unit 6 and the bit allocation information from the bit allocation calculating unit 5 into a predetermined transmission format. Data output end. Hereinafter, the compression operation will be described.

【００３６】デジタルオーディオ入力端１に入力された
データは、サブバンド符号化部２、及びＦＦＴ演算部３
に入力される。まず、サブバンド符号化部２ではオーバ
ラップ・ポリフェイズ・フィルタ・バンク（ＯＰＦＢ）
により、入力信号を所定の時間間隔毎（オーディオフレ
ームと称す）に７５０Ｈｚ単位の帯域分割データに写像
する。このデータは、後に符号化されるデータとなる。The data input to the digital audio input terminal 1 is supplied to a sub-band encoder 2 and an FFT processor 3
Is input to First, in the subband encoding unit 2, an overlap polyphase filter bank (OPFB)
Thus, the input signal is mapped to band division data in units of 750 Hz at predetermined time intervals (referred to as audio frames). This data is data to be encoded later.

【００３７】一方、ＦＦＴ演算部３では、サンプルデー
タの周波数分布を分析するためにＦＦＴ分析を行なう。
この結果は、マスク量演算部４に入力される。マスク量
演算部４において、ＦＦＴ解析の結果は音圧値［ｄＢ］
に換算される。そして、計算された音圧値から、他の周
波数軸上の音圧成分を妨害するマスカ成分を抽出し、そ
の近傍の周波数位置ｉに及ぼすマスク量ＬＴｍ（ｉ）を
算出する。On the other hand, the FFT operation unit 3 performs an FFT analysis in order to analyze the frequency distribution of the sample data.
This result is input to the mask amount calculation unit 4. In the mask amount calculation unit 4, the result of the FFT analysis is a sound pressure value [dB].
Is converted to Then, a masker component that interferes with a sound pressure component on another frequency axis is extracted from the calculated sound pressure value, and a mask amount LTm (i) that affects a nearby frequency position i is calculated.

【００３８】上記マスカ抽出、マスク量演算処理を全帯
域にわたって行い、最終的に周波数位置ｉのマスク量
は、静的聴覚限界値と複数のマスカによるマスク量との
和で与えられることになる。マスク量演算部４で算出さ
れたマスク量はビット割り当て演算部５に出力される。The above-described masker extraction and mask amount calculation processing are performed over the entire band, and finally the mask amount at the frequency position i is given by the sum of the static hearing limit value and the mask amounts by a plurality of maskers. The mask amount calculated by the mask amount calculator 4 is output to the bit allocation calculator 5.

【００３９】ビット割り当て演算部５では、図８（ａ）
を参照しながら説明したように、サブバンド単位にマス
ク量の最小値を検出し、最小マスク量ＬＴｍｉｎ［ｓ
ｂ］とする（図１のステップＳ１００）。ここで、サフ
ィックスの［ｓｂ］はサブバンド番号の関数である。FIG. 8 (a)
, The minimum value of the mask amount is detected for each sub-band, and the minimum mask amount LTmin [s
b] (Step S100 in FIG. 1). Here, the suffix [sb] is a function of the subband number.

【００４０】また、サブバンドデータの最大値を最大音
圧値Ｌ［ｓｂ］として、この最大音圧値Ｌ［ｓｂ］と最
小マスク量ＬＴｍｉｎ［ｓｂ］との差分から、マスク対
雑音比（ＭＮＲ）を算出する（図１のステップＳ１０
１）。全てのサブバンドにおいて、マスク対雑音比計算
が終了し各サブバンドのマスク対雑音比が図３（ａ）に
示した値になったとする。The maximum value of the sub-band data is defined as the maximum sound pressure value L [sb], and the mask-to-noise ratio (MNR) is calculated from the difference between the maximum sound pressure value L [sb] and the minimum mask amount LTmin [sb]. (Step S10 in FIG. 1)
1). It is assumed that the calculation of the mask-to-noise ratio is completed for all the sub-bands and the mask-to-noise ratio of each sub-band has the value shown in FIG.

【００４１】このＮＭＲから全サブバンドのＮＭＲ平均
値ＭＮＲａｖｅ−５３［ｄＢ］を計算する（図１のステ
ップＳ１０２）。そして、圧縮フォーマットで許されて
いるヘッダ情報以外の圧縮サンプルデータの最大割り当
てビット可能数Ｂａｌｃ（３０００）を設定する（図１
のステップＳ１０３）。From this NMR, the NMR average value MNRave-53 [dB] of all the sub-bands is calculated (step S102 in FIG. 1). Then, the maximum allocable bit number Balc (3000) of the compressed sample data other than the header information permitted in the compression format is set (FIG. 1).
Step S103).

【００４２】そして、サブバンドを指標するループカウ
ンタｓｂをリセットしておき（図１のステップＳ１０
４）、ＭＮＲ［ｓｂ］とＭＮＲａｖｅとの比較を行い
（図１のステップＳ１０５）、ＭＮＲ［ｓｂ］が小であ
る場合にはそのサブバンドの量子化ステップを１段階上
の値に変更する（図１のステップＳ１０６）。これは、
量子化ステップを小さくすることに相当する。Then, the loop counter sb indicating the sub-band is reset (step S10 in FIG. 1).
4) MNR [sb] is compared with MNRave (step S105 in FIG. 1), and if MNR [sb] is small, the quantization step of the subband is changed to a value one step higher (step S105). Step S106 in FIG. 1). this is,
This corresponds to reducing the quantization step.

【００４３】量子化ステップを小さくしたことに対応し
て、これによって使用されるビット数を示す割り当て済
みビット数Ｂｕｓｅを計算する（図１のステップＳ１０
７）。そして、ビット割り当て量を保持しているバッフ
ァであるＡＬＣ［ＳＢ］の内容を更新（図１のステップ
Ｓ１０８）し、新しいＳＮＲ［ｓｔｅｐ［ＳＢ］］値を
用いて再度マスク対雑音比を計算し、ＭＮＲ［ＳＢ］値
を更新する（図１のステップＳ１０９）。In response to the reduction in the quantization step, an assigned bit number Buse indicating the number of bits used by the quantization step is calculated (step S10 in FIG. 1).
7). Then, the contents of ALC [SB], which is the buffer holding the bit allocation amount, are updated (step S108 in FIG. 1), and the mask-to-noise ratio is calculated again using the new SNR [step [SB]] value. , MNR [SB] value (step S109 in FIG. 1).

【００４４】その後、処理を図１のステップＳ１０５に
戻し、処理対象となっているサブバンドのＭＮＲ値がＭ
ＮＲａｖｅより大になるまで量子化ステップを小さくし
ていく。例えば、図３（ａ）の例であれば、ｓｂカウン
タが「０」であるので、ＭＮＲ［ＳＢ０］（一８５ｄ
Ｂ）と、ＭＮＲａｖｅ（一５３ｄＢ）との比較を行う。Thereafter, the process returns to step S105 in FIG. 1, and the MNR value of the sub-band to be processed is M
The quantization step is reduced until it becomes larger than NRave. For example, in the example of FIG. 3A, since the sb counter is “0”, the MNR [SB0] (-85d
B) is compared with MNRave (53 dB).

【００４５】その結果、ＭＮＲ［ＳＢ０］がＭＮＲａｖ
ｅ以下であるので、サブバンドＳＢ０についての量子化
ステップを小さくする。それにより、図６のステップ
“２”が選ばれ、ＳＮＲは「７ｄＢ」へ更新される。つ
まり、ＳＢ０のＭＮＲは「−８３ｄＢ」となる。As a result, MNR [SB0] becomes MNRav
e, the quantization step for the sub-band SB0 is reduced. Thereby, step “2” in FIG. 6 is selected, and the SNR is updated to “7 dB”. That is, the MNR of SB0 is “−83 dB”.

【００４６】しかし、ＭＮＲａｖｅに達していないの
で、ステップ“２”からステップ“３”へＳＢ０の量子
化ステップをさらに更新し、ＳＮＲを「７ｄＢ」から
「１６ｄＢ」に変更してＭＮＲを再計算する。However, since the MNRave has not been reached, the quantization step of SB0 is further updated from step "2" to step "3", the SNR is changed from "7 dB" to "16 dB", and the MNR is recalculated. .

【００４７】このようなループ処理を、ＭＮＲがＭＮＲ
ａｖｅ以上になるまで行う。その結果、ＳＢ０において
は、ステップ“５”でＳＮＲ（３２ｄＢ）−ＳＭＲ（８
５ｄＢ）＝ＭＮＲ（一５３ｄＢ）となり、図１のステッ
プＳ１０５の分岐でループから脱出する。Such a loop process is performed by setting the MNR to the MNR.
Repeat until ave or more. As a result, in SB0, in step "5", SNR (32 dB) -SMR (8
5 dB) = MNR (-53 dB), and the process exits the loop at the branch of step S105 in FIG.

【００４８】なお、図３（ａ）の太斜線部はこのループ
処理により求められたＳＮＲ［ｓｔｅｐ［ＳＢ０］］値
「５３ｄＢ」を示しており、またＳＢ０のＭＮＲ値はそ
の差分となっている。The thick shaded portion in FIG. 3A indicates the SNR [step [SB0]] value "53 dB" obtained by this loop processing, and the MNR value of SB0 is the difference. .

【００４９】図１のステップＳ１０５で処理対象となっ
ているサブバンドのＭＮＲ値がＭＮＲａｖｅより大とい
う判断がなされた場合、処理対象となっているサブバン
ドが予め定められている最大サブバンド数ｓｂｍａｘで
あるか判断する（図１のステップＳ１１０）。When it is determined in step S105 of FIG. 1 that the MNR value of the subband to be processed is larger than MNRave, the subband to be processed is determined by the predetermined maximum number of subbands sbmax. Is determined (step S110 in FIG. 1).

【００５０】この判断の結果、ｓｂ＜ｓｂｍａｘであれ
ば、ｓｂカウンタ値をインクリメントし（図１のステッ
プＳ１１１）、次のサブバンドＳＢ１のＭＮＲ処理（図
１のステップＳ１０５〜ステップＳ１０９）を上述に従
って繰り返す。If the result of this determination is that sb <sbmax, the sb counter value is incremented (step S111 in FIG. 1), and the MNR processing of the next subband SB1 (steps S105 to S109 in FIG. 1) is performed as described above. repeat.

【００５１】全てのサブバンドについてＭＮＲ処理（図
１のステップＳ１０５〜１０９）を施した時点で、図３
（ｂ）太斜線部で示したようなＳＮＲ値が求まり、白抜
き部が現在のビット割り当て量に対応したＭＮＲ値とな
っている。At the time when the MNR process (steps S105 to S109 in FIG. 1) is performed on all the subbands,
(B) An SNR value as shown by a thick shaded portion is obtained, and an outlined portion is an MNR value corresponding to the current bit allocation amount.

【００５２】その後、従来例と同様の処理を行う。すな
わち、いままで計算してきたＭＮＲ［ｓｂ１の中で最小
の値を検索する（図１のステップＳ１１２）。その検索
結果をＭＮＲｍｉｎ［ＳＢ］とする。さらに、ＭＮＲｍ
ｉｎに対応するサブバンドの量子化ステップを１段階上
の値に変更する（図１のステップＳ１１３）。そして、
量子化ステップを小さくしたことに対応して、これによ
って使用されるビット数を示す割り当て済みビット数Ｂ
ｕｓｅを計算する（図１のステップＳ１１４）。Thereafter, the same processing as in the conventional example is performed. That is, the minimum value is searched for in the MNR [sb1 calculated so far (step S112 in FIG. 1). The search result is set to MNRmin [SB]. Further, MNRm
The quantization step of the subband corresponding to in is changed to a value one step higher (step S113 in FIG. 1). And
The number of allocated bits B indicating the number of bits used by the quantization step, corresponding to the reduced quantization step.
Use is calculated (step S114 in FIG. 1).

【００５３】次に、割り当て可能なビット数Ｂａｌｃと
割り当て済みビット数Ｂｕｓｅの数値比較を行い（図１
のステップＳ１１５）、ＢｕｓｅがＢａｌｃに達してい
ない場合はビット割り当て可能と判断し、ビット割り当
て量を保持しているバッファであるＡＬＣ［ＳＢ］の内
容を更新する（図１のステップＳ１１６）。Next, a numerical comparison is made between the number of allocable bits Balc and the number of allocated bits Buse (FIG. 1).
If Bus does not reach Balc, it is determined that bit allocation is possible, and the contents of ALC [SB], which is a buffer holding the bit allocation amount, are updated (step S116 in FIG. 1).

【００５４】そして、新しいＳＮＲ［ｓｔｅｐ［Ｓ
Ｂ］］値を用いて再度マスク対雑音比を計算しＭＮＲ
［ＳＢ］値を更新する（図１のステップＳ１１６）。そ
の後、処理を図１のステップＳ１１２に戻し、再度最小
ＭＮＲを探索するループ処理を行う。このループ処理
を、図１のステップＳ１１５でビット割り当てが不可能
と判断されるまで、すなわちＢａｌｃ＜Ｂｕｓｅとなる
時点まで行う。Then, a new SNR [step [S
B]] value to calculate the mask-to-noise ratio again and calculate the MNR
[SB] The value is updated (step S116 in FIG. 1). Thereafter, the process returns to step S112 in FIG. 1, and a loop process for searching for the minimum MNR is performed again. This loop process is performed until it is determined in step S115 in FIG. 1 that bit allocation is impossible, that is, until Balc <Buse.

【００５５】Ｂａｌｃ＜Ｂｕｓｅとなった時点で、図３
（ｄ）の細斜線で示された量だけＭＮＲ値が向上するこ
とになり、かつ、サブバンド間のＭＮＲばらつきも少な
い状態でビット割り当て演算処理を終える。At the time when Balc <Buse, FIG.
The bit allocation calculation process is completed in a state where the MNR value is improved by the amount indicated by the thin oblique line in (d) and the MNR variation between subbands is small.

【００５６】そして、このビット割り当て演算処理部５
の結果であるビット割り当て量を保持しているバッファ
であるＡＬＣ［ｓｂ］をもとにして、データ量子化部６
でサブバンドデータの量子化を行い、伝送フォーマット
形成部７でビット割り当て情報などのサイド情報ととも
に量子化データを所定のスタイルにフォーマットし、圧
縮データ出力端８に出力する。Then, the bit allocation operation processing section 5
Based on ALC [sb], which is a buffer holding the bit allocation amount resulting from
Then, the sub-band data is quantized, and the transmission format forming unit 7 formats the quantized data together with side information such as bit allocation information in a predetermined style, and outputs it to the compressed data output terminal 8.

【００５７】以上のようなサブバンド間のＭＮＲ平均値
まで量子化ステップを一意的に小さくしていく構成をと
ることによって、量子化ステップを１ステップでも変更
する度に最小ＮＭＲｍｉｎを検索する等のステップを踏
むこと無しに最適なビット割り当て演算が可能となる。By adopting a configuration in which the quantization step is uniquely reduced to the MNR average value between the subbands as described above, the minimum NMRmin is searched every time the quantization step is changed even by one step. Optimal bit allocation calculation can be performed without taking steps.

【００５８】［第２の実施の形態］図４は、本発明の第
２の実施の形態のビット割り当て演算の手順を説明する
フローチャートである。以下、図２のブロック図及び図
５の動作概念図を参照しながら本発明の第２の実施の形
態の動作について説明する。なお、ここでは、圧縮全体
の処理説明については第１の実施の形態と等価であるの
で、説明を省略する。[Second Embodiment] FIG. 4 is a flowchart for explaining a procedure of a bit allocation operation according to a second embodiment of the present invention. Hereinafter, the operation of the second exemplary embodiment of the present invention will be described with reference to the block diagram of FIG. 2 and the operation conceptual diagram of FIG. Here, the description of the entire compression processing is equivalent to that of the first embodiment, and thus the description thereof will be omitted.

【００５９】ビット割り当て演算部５では、図５（ａ）
のようにサブバンド単位にマスク量の最小値を検出し、
最小マスク量ＬＴｍｉｎ［ｓｂ］とする（図４のステッ
プＳ４００）。ここで、サフィックスの［ｓｂ］はサブ
バンド番号の関数である。FIG. 5A shows the bit allocation operation section 5.
, The minimum value of the mask amount is detected for each sub-band,
The minimum mask amount LTmin [sb] is set (step S400 in FIG. 4). Here, the suffix [sb] is a function of the subband number.

【００６０】また、サブバンドデータの最大値を最大音
圧値Ｌ［ｓｂ］として、この最大音圧値Ｌ［ｓｂ］と最
小マスク量ＬＴｍｉｎ［ｓｂ］との差分から、ＭＮＲ
（マスク対雑音比）を算出する（図４のステップＳ４０
１）。The maximum value of the sub-band data is defined as the maximum sound pressure value L [sb], and the MNR is calculated from the difference between the maximum sound pressure value L [sb] and the minimum mask amount LTmin [sb].
(Mask to noise ratio) is calculated (Step S40 in FIG. 4)
1).

【００６１】次に、圧縮フォーマットで許されているヘ
ッダ情報以外の圧縮サンプルデータの最大割り当てビッ
ト可能数Ｂａｌｃから定まるＭＮＲ向上量ＭＮＲｕｐを
設定する（図４のステップＳ４０２）。Next, an MNR improvement amount MNRup determined from the maximum allocable bit number Balc of the compressed sample data other than the header information permitted in the compression format is set (step S402 in FIG. 4).

【００６２】ＭＮＲｕｐ値は、Ｂａｌｃのビット数で向
上できるデシベル量、すなわち、１つのサブバンドのＭ
ＮＲを「１ｄＢ」向上するために必要なビット数から割
り当て可能なビット数Ｂａｌｃで与えられるビット数で
ＭＮＲを向上できるデシベル量を算出した値であり、た
とえば図６のＳＮＲテーブルから換算できる。The MNRup value is a decibel amount that can be improved by the number of bits of Balc, that is, the MNRup value of one subband.
This is a value calculated from the number of bits necessary for improving the NR by “1 dB” and the decibel amount that can improve the MNR with the number of bits given by the allocatable bit number Balc, and can be converted from, for example, the SNR table in FIG.

【００６３】すなわち、図６に示したＳＮＲと割り当て
ビット数の関係から、例えばステップ“１６”の「９８
ｄＢ」に注目すれば、ビット割り当て１ビットあたりで
向上できるＳＮＲはＳＮＲ（９８ｄＢ）／サブバンドあ
たりのビット割り当て最（５７６ｂｉｔ）＝１ビットあ
たりで向上最（０．１７ｄＢ）となる。これから、割り
当て可能なビット数Ｂａｌｃで向上できるＳＮＲは、１
ビットあたりで向上量（０．１７ｄＢ）×Ｂａｌｃ（３
０００）＝ＭＮＲｕｐ（５１０ｄＢ）となる。That is, from the relationship between the SNR and the number of allocated bits shown in FIG.
Paying attention to "dB", the SNR that can be improved per bit allocation is SNR (98 dB) / the maximum bit allocation per subband (576 bits) = the maximum improvement (0.17 dB) per bit. From this, the SNR that can be improved by the number of assignable bits Balc is 1
Improvement per bit (0.17 dB) x Balc (3
000) = MNRup (510 dB).

【００６４】一方、全てのサブバンドにおいて、ＭＮＲ
計算が終了した時点で、全サブバンドのＭＮＲ値の総和
ＭＮＲａｌｌを計算し（図４のステップＳ４０３）、そ
してＭＮＲｕｐ値を各サブバンドに分配し、その結果各
サブバンドのＮＭＲが一定になるようなアロケーション
ターゲット値ＡＬＣｔｇｔを、ＡＬＣｔｇｔ＝（ＭＮＲ
ｕｐ十ＭＮＲａｌｌ）／ＳＢＳＢ：ビット割り当てが必
要なサブバンド数から求める（図４のステップＳ４０
４）。図５の例では、ＭＮＲａｌｌが「−４２４ｄＢ」
となるので、ＡＬＣｔｇｔは１０ｄＢとなる。On the other hand, in all subbands, the MNR
When the calculation is completed, the sum MNRall of the MNR values of all subbands is calculated (step S403 in FIG. 4), and the MNRup value is distributed to each subband, so that the NMR of each subband becomes constant. ALCtgt = (MNR
up tens MNRall) / SBSB: Obtained from the number of subbands requiring bit allocation (step S40 in FIG. 4)
4). In the example of FIG. 5, MNRall is "-424 dB".
Therefore, ALCtgt is 10 dB.

【００６５】そして、サブバンドを指標するループカウ
ンタｓｂをリセットしておき（図４のステップＳ４０
５）、ＡＬＣｔｇｔとＭＮＲ［ｓｂ］との差分計算の解
をＭＮＲｔｍｐに格納し（図４のステップＳ４０６）、
図６で示されるようなＳＮＲテーブルを検索し、ＳＮＲ
［ｓｔｅｐ］がＭＮＲｔｍｐより大きな最小値になるよ
うにｓｔｅｐ値を定める（図４のステップＳ処理４０７
〜ステップＳ４１０）。Then, the loop counter sb indicating the sub-band is reset (step S40 in FIG. 4).
5), the solution of the difference calculation between ALCtgt and MNR [sb] is stored in MNRtmp (step S406 in FIG. 4),
The SNR table is searched as shown in FIG.
The step value is determined so that [step] becomes the minimum value larger than MNRtmp (step S 407 in FIG. 4).
-Step S410).

【００６６】ＳＮＲ［ｓｔｅｐ］がＭＮＲｔｍｐより大
きな最小値という関係が成立（図４のステップＳ４０
８）するか、ｓｔｅｐ値が、所定の量子化ステップを超
えるとき、例えば図６の場合には１８以上のステップに
なる場合（図４のステップＳ４０９）には、ｓｔｅｐ＝
１７として、そのｓｔｅｐを、ビット割り当て量を保持
するバッファであるＡＬＣ［ＳＢ０］に格納する（図４
のステップＳ４１１）。The relationship that SNR [step] is the minimum value larger than MNRtmp is established.
8) or when the step value exceeds a predetermined quantization step, for example, in the case of FIG. 6, when the number of steps is 18 or more (step S409 in FIG. 4), step =
As step 17, the step is stored in ALC [SB0] which is a buffer for holding the bit allocation amount (FIG. 4).
Step S411).

【００６７】例えば、図５の例であれば、現在ｓｂカウ
ンタが「０」であるので、「ＳＢ０」についての処理を
まず行う。For example, in the example shown in FIG. 5, since the sb counter is currently "0", the process for "SB0" is performed first.

【００６８】ＭＮＲ［ＳＢ０］（−８５ｄＢ）とＡＬＣ
ｔｇｔ（１０ｄＢ）からＭＮＲｔｍｐ（９５ｄＢ）を導
き、ＭＮＲｔｍｐ（９５ｄＢ）からＳＮＲテーブルを用
いてｓｔｅｐ値１６を定め、これをビット割り当て量を
保持するバッファであるＡＬＣ［ＳＢ０］に格納する。MNR [SB0] (-85 dB) and ALC
MNRtmp (95 dB) is derived from tgt (10 dB), a step value 16 is determined from the MNRtmp (95 dB) using an SNR table, and this is stored in ALC [SB0], which is a buffer that holds the bit allocation amount.

【００６９】したがって、図５（ａ）斜線部のようなＭ
ＮＲ改善が１度のＭＮＲｔｍｐ計算で実現される。そし
て、次のサブバンドＳＢ１についても同様な処理を行う
が、この場合、ＭＮＲｔｍｐが「１００ｄＢ」となり、
量子化ステップの最大値を超えることになるので、図５
（ｂ）のようにＡＬＣｔｇｔ（１０ｄＢ）に到達しない
で図４のステップＳ４０４でｓｔｅｐが最大値“１６”
であるという条件によりＳＢ１のループ処理を脱出す
る。Therefore, M shown in FIG.
The NR improvement is realized by one MNRtmp calculation. Then, the same processing is performed for the next subband SB1, but in this case, the MNRtmp becomes “100 dB”, and
Since the maximum value of the quantization step is exceeded, FIG.
As shown in (b), the step does not reach ALCtgt (10 dB), and the step has the maximum value “16” in step S404 in FIG.
The process exits the loop processing of SB1 under the condition that

【００７０】このような処理を繰り返し行い、現在の処
理対象となっているサブバンドが予め定められている最
大サブバンド数ｓｂｍａｘであるか判断し（図４のステ
ップＳ４１２）、ｓｂ＜ｓｂｍａｘであれば、ｓｂカウ
ンタ値をインクリメントし（図４のステップＳ４１
３）、次のサブバンドのビット割り当て演算処理（図４
のステップＳ４０６〜４１０）を全てのサブバンドにつ
いて上述に従って繰り返す。Such processing is repeatedly performed to determine whether the number of subbands currently being processed is the predetermined maximum number of subbands sbmax (step S412 in FIG. 4), and if sb <sbmax. For example, the sb counter value is incremented (step S41 in FIG. 4).
3), the next sub-band bit allocation operation (FIG. 4)
Steps S406 to S410) are repeated as described above for all subbands.

【００７１】以上のような処理を行うことによって、量
子化ステップを最小ＮＭＲｍｉｎを検索する等のステッ
プを踏むこと無しに最適なビット割り当て演算が可能と
なる。さらに、ビット割り当て可能ビット数Ｂａｌｃを
要素に含むアロケーションターゲット値を設定している
ため、各サブバンドにつき１度の演算処理で最適なビッ
ト割り当て演算が可能となるので処理の高速化が図れ
る。By performing the above-described processing, an optimal bit allocation operation can be performed without performing a quantization step such as searching for the minimum NMRmin. Furthermore, since the allocation target value including the number of bits that can be allocated, Balc, as an element is set, an optimal bit allocation operation can be performed by one operation for each subband, so that the processing can be speeded up.

【００７２】（本発明の他の実施形態）本発明は複数の
機器（例えば、ホストコンピュータ、インタフェース機
器、リーダ、プリンタ等）から構成されるシステムに適
用しても１つの機器からなる装置に適用しても良い。(Other Embodiments of the Present Invention) The present invention is applied to a system composed of a plurality of devices (for example, a host computer, an interface device, a reader, a printer, etc.) and is applied to an apparatus composed of one device. You may.

【００７３】また、上述した実施形態の機能を実現する
ように各種のデバイスを動作させるように、上記各種デ
バイスと接続された装置あるいはシステム内のコンピュ
ータに対し、上記実施形態の機能を実現するためのソフ
トウェアのプログラムコードを供給し、そのシステムあ
るいは装置のコンピュータ（ＣＰＵあるいはＭＰＵ）に
格納されたプログラムに従って上記各種デバイスを動作
させることによって実施したものも、本発明の範疇に含
まれる。Further, in order to realize the functions of the above-described embodiments, a device connected to the above-mentioned various devices or a computer in a system is realized so as to operate various devices so as to realize the functions of the above-described embodiments. The present invention also includes programs implemented by supplying the program code of the software described above and operating the various devices according to programs stored in a computer (CPU or MPU) of the system or apparatus.

【００７４】また、この場合、上記ソフトウェアのプロ
グラムコード自体が上述した実施形態の機能を実現する
ことになり、そのプログラムコード自体、およびそのプ
ログラムコードをコンピュータに供給するための手段、
例えばかかるプログラムコードを格納した記憶媒体は本
発明を構成する。かかるプログラムコードを記憶する記
憶媒体としては、例えばフロッピーディスク、ハードデ
ィスク、光ディスク、光磁気ディスク、ＣＤ−ＲＯＭ、
磁気テープ、不揮発性のメモリカード、ＲＯＭ等を用い
ることができる。In this case, the program code of the software implements the functions of the above-described embodiment, and the program code itself and means for supplying the program code to the computer are provided.
For example, a storage medium storing such a program code constitutes the present invention. As a storage medium for storing such a program code, for example, a floppy disk, hard disk, optical disk, magneto-optical disk, CD-ROM,
A magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.

【００７５】また、コンピュータが供給されたプログラ
ムコードを実行することにより、上述の実施形態の機能
が実現されるだけでなく、そのプログラムコードがコン
ピュータにおいて稼働しているＯＳ（オペレーティング
システム）あるいは他のアプリケーションソフト等の共
同して上述の実施形態の機能が実現される場合にもかか
るプログラムコードは本発明の実施形態に含まれること
は言うまでもない。When the computer executes the supplied program code, not only the functions of the above-described embodiment are realized, but also the OS (operating system) or other operating system running on the computer. Needless to say, even when the functions of the above-described embodiments are realized in cooperation with application software or the like, such program codes are included in the embodiments of the present invention.

【００７６】さらに、供給されたプログラムコードがコ
ンピュータの機能拡張ボードやコンピュータに接続され
た機能拡張ユニットに備わるメモリに格納された後、そ
のプログラムコードの指示に基づいてその機能拡張ボー
ドや機能拡張ユニットに備わるＣＰＵ等が実際の処理の
一部または全部を行い、その処理によって上述した実施
形態の機能が実現される場合にも本発明に含まれること
は言うまでもない。Further, after the supplied program code is stored in a memory provided in a function expansion board of a computer or a function expansion unit connected to the computer, the function expansion board or the function expansion unit is specified based on the instruction of the program code. It is needless to say that the present invention also includes a case where the CPU or the like provided in the first embodiment performs part or all of the actual processing, and the processing realizes the functions of the above-described embodiments.

【００７７】[0077]

【発明の効果】本発明によれば、各サブバンドにおける
マスク対雑音比からマスク対雑音比向上指標値を予め求
め、上記各サブバンド毎に一意的に、マスク対雑音比演
算をマスク対雑音比向上指標値を超えるまでビット割り
当て処理を行うようにしたので、最小マスク対雑音比演
算処理を省略して演算処理に要する時間を短縮すること
ができる。According to the present invention, a mask-to-noise ratio improvement index value is obtained in advance from the mask-to-noise ratio in each sub-band, and the mask-to-noise ratio calculation is performed uniquely for each of the sub-bands. Since the bit allocation process is performed until the ratio improvement index value is exceeded, the time required for the calculation process can be reduced by omitting the minimum mask-to-noise ratio calculation process.

【００７８】また、本発明の他の特徴によれば、各サブ
バンドで１度のビット割り当て演算処理で適正なビット
割り当てを行うことが可能となり、演算処理時間を大幅
に短縮することができる。According to another feature of the present invention, it is possible to perform appropriate bit allocation in a single bit allocation calculation process for each subband, and it is possible to greatly reduce the calculation processing time.

【００７９】また、本発明のその他の特徴によれば、ビ
ット割り当て可能ビット数Ｂａｌｃを要素に含むアロケ
ーションターゲット値を設定したので、各サブバンドに
つき１度の演算処理で最適なビット割り当て演算を行う
ことが可能となり、処理の高速化を図ることができる。Further, according to another feature of the present invention, since the allocation target value including the number of allocable bits Balc as an element is set, the optimum bit allocation operation is performed by one operation for each subband. It is possible to increase the processing speed.

[Brief description of the drawings]

【図１】本発明の第１の実施の形態のビット割り当て方
式のフローチャートである。FIG. 1 is a flowchart of a bit allocation method according to a first embodiment of the present invention.

【図２】ディジタルオーディオ信号圧縮のブロック図で
ある。FIG. 2 is a block diagram of digital audio signal compression.

【図３】本発明の第１の実施の形態の動作概念図であ
る。FIG. 3 is an operation conceptual diagram of the first embodiment of the present invention.

【図４】本発明の第２の実施の形態のビット割り当て方
式のフローチャートである。FIG. 4 is a flowchart of a bit allocation method according to a second embodiment of the present invention.

【図５】本発明の第２の実施の形態の動作概念図であ
る。FIG. 5 is an operation conceptual diagram according to a second embodiment of the present invention.

【図６】量子化ステップと信号対雑音比のテーブル例を
示す図である。FIG. 6 is a diagram illustrating an example of a table of quantization steps and a signal-to-noise ratio.

【図７】従来例のビット割り当て方式のフローチャート
である。FIG. 7 is a flowchart of a conventional bit allocation method.

【図８】ビット割り当て演算部の動作を説明する図であ
る。FIG. 8 is a diagram for explaining the operation of a bit allocation operation unit.

[Explanation of symbols]

１デジタルオーディオ入力端子２サブバンド符号化部３ＦＦＴ演算部４マスク量演算部５ビット割り当て演算部６データ量子化部７伝送フォーマット形成部８圧縮データ出力端子 Reference Signs List 1 digital audio input terminal 2 subband encoding unit 3 FFT operation unit 4 mask amount operation unit 5 bit allocation operation unit 6 data quantization unit 7 transmission format formation unit 8 compressed data output terminal

Claims

[Claims]

1. A digital audio signal compression apparatus for compressing a digital audio signal, wherein a mask-to-noise ratio improvement index value is obtained from a mask-to-noise ratio in each subband until the calculated mask-to-noise ratio improvement index value is exceeded. A digital audio signal compression device, wherein a bit allocation process is performed for each subband.

2. An input means for inputting a digital audio signal, an orthogonal transform means for orthogonally transforming the digital audio signal input via the input means, and a sound spectrum obtained from a power spectrum as a result of the orthogonal transform means. A mask amount calculating means for calculating a mask amount based on a masking effect; and either the power spectrum or a mapped signal obtained by mapping the digital audio signal on a frequency axis by a method different from that of the orthogonal transform means; A signal-to-mask ratio calculating means for calculating a ratio of the mask amount obtained by the means in a predetermined frequency range unit; a signal-to-noise ratio defining means for defining an SN ratio for a predetermined bit allocation amount; The result of the signal-to-noise ratio definition means is updated each time the predetermined bit allocation amount is changed, and And a bit for determining the bit allocation number of the compressed data in the predetermined frequency width unit based on the result of the iterative operation means. A digital audio signal compression device having an allocation unit, wherein the bit allocation target value setting unit sets a common predetermined bit allocation target value in the predetermined frequency band unit, and the signal to noise ratio defining unit. Selecting means for selecting a value that results in a minimum value greater than the bit allocation target value, and bit allocation calculating means for performing a bit allocation operation using the result of the selecting means. Digital audio signal compression device.

3. An input means for inputting a digital audio signal, an orthogonal transform means for orthogonally transforming the input digital audio signal in a predetermined period unit of an audio frame, and a digital signal obtained by the orthogonal transform means. Mask amount calculating means for calculating a mask amount for each sub-band, which is a predetermined frequency range unit, based on a sound masking effect from a power spectrum of an audio signal, and the power spectrum or the digital audio signal The difference between one of the mapped signals mapped on the frequency axis by a method different from the orthogonal transformation means and the mask amount obtained by the mask amount calculation means is calculated for each subband, and the signal-to-mask ratio in the audio frame is calculated. The signal-to-mask ratio calculation means to be determined and a predetermined Signal-to-noise ratio storage means for storing a signal-to-noise ratio for a given quantization step as a table, and signal-to-noise ratio and signal-to-mask ratio calculation means output from the signal-to-noise ratio storage means. A mask-to-noise ratio calculating means for calculating a difference between the signal-to-mask ratio obtained by the above for each sub-band to obtain a mask-to-noise ratio; and a mask-to-noise ratio obtained by the mask-to-noise ratio calculating means. In a digital audio signal compression apparatus having a signal-to-noise ratio and bit allocation means for determining the number of allocated bits of compressed data for each sub-band using bits that can be allocated to the audio frame during audio signal compression, An average value in a predetermined sub-band is calculated for the mask-to-noise ratio obtained by the mask-to-noise ratio calculating means. A mask-to-noise ratio average value calculating means, wherein the bit allocating means performs the mask-to-noise ratio, the mask-to-noise ratio average value, the signal-to-noise ratio, and the audio-signal compression for each of the predetermined subbands. A digital audio signal compression device, wherein a bit allocation number of compressed data for each subband is determined using bits that can be allocated to the audio frame.

4. The mask-to-noise ratio of one of the sub-bands is updated by the mask-to-noise ratio calculating means so that the quantization step is refined until the mask-to-noise ratio average value is exceeded. 4. The digital audio signal compression apparatus according to claim 3, further comprising a repetition mask-to-noise ratio calculation unit, and thereafter repeating the same calculation for the other subbands.

5. A signal pair for one of the sub-bands, wherein the signal-to-noise ratio search means uses the signal-to-noise ratio search means to search for the signal-to-noise ratio exceeding the mask-to-noise ratio average value in the mask-to-noise ratio calculation means. 4. The digital audio signal compression device according to claim 3, further comprising a noise ratio search unit, wherein the same operation is repeated for other subbands thereafter.

6. The mask-to-noise ratio for each of the sub-bands and the mask-to-noise ratio for each of the sub-bands after the mask-to-noise ratio exceeds the mask-to-noise ratio average value. A digital audio signal compression apparatus for determining the number of bits to be allocated to the compressed data for each sub-band by using the signal-to-noise ratio and bits that can be allocated to the audio frame when compressing the audio signal.

7. An input means for inputting a digital audio signal, an orthogonal transform means for orthogonally transforming the input digital audio signal in a predetermined period unit of an audio frame, and a digital signal obtained by the orthogonal transform means. Mask amount calculation means for calculating a mask amount for each sub-band, which is a predetermined frequency range unit, based on a sound masking effect from a power spectrum of an audio signal, and the power spectrum or the digital audio signal The difference between one of the mapped signals mapped on the frequency axis by a method different from the orthogonal transform means and the mask amount obtained by the mask amount calculating means is calculated for each of the subbands, and the signal-to-mask ratio in the audio frame is obtained. Signal-to-mask ratio calculation means and predetermined Signal-to-noise ratio storage means for storing a signal-to-noise ratio for a given quantization step as a table; and signal-to-noise ratio and signal-to-mask ratio calculation means output from the signal-to-noise ratio storage means. A mask-to-noise ratio calculating means for performing a difference calculation with respect to the signal-to-mask ratio obtained for each sub-band to obtain a mask-to-noise ratio; and a mask-to-noise ratio obtained by the mask-to-noise ratio calculating means. In a digital audio signal compression apparatus having a signal-to-noise ratio and bit allocation means for determining a bit allocation number of compressed data for each sub-band by using bits that can be allocated to the audio frame at the time of audio signal compression, In audio compression, the number of bits that can be allocated to the audio frame and the quantization step, Mask-to-noise ratio improvement amount calculating means for calculating a mask-to-noise ratio improvement amount assigned to the audio frame from a signal-to-noise ratio relationship, and a mask pair obtained by the mask-to-noise ratio calculation means in the subband. A mask-to-noise ratio sum calculating means for calculating the sum of the noise ratios; a mask-to-noise ratio improvement amount as a result of the mask-to-noise ratio improvement amount calculating means; and a mask obtained by the mask-to-noise ratio sum calculating means. A digital audio signal compression device comprising a target value calculating means for obtaining a sum with a noise-to-noise ratio sum and dividing by the predetermined number of subbands to obtain a bit allocation target value.

8. A signal-to-noise ratio for one of the sub-bands, wherein the mask-to-noise ratio calculating means searches for the signal-to-noise ratio exceeding the bit allocation target value using the signal-to-noise ratio searching means. 8. The digital audio signal compression apparatus according to claim 7, further comprising a search unit, wherein the same operation is repeated for other subbands thereafter.

9. A repetitive mask for updating the mask-to-noise ratio for one of the sub-bands so that the mask-to-noise ratio calculating means refines the quantization step until the bit allocation target value is exceeded. 8. The digital audio signal compression apparatus according to claim 7, further comprising noise-to-noise ratio calculation means, and thereafter repeating the same calculation for other subbands.

10. The mask-to-noise ratio, the mask-to-noise ratio average value, and the mask-to-noise ratio for each of the predetermined sub-bands, after the mask-to-noise ratio exceeds the bit allocation target value. Signal to noise ratio and
10. The digital audio signal compression apparatus according to claim 8, wherein the number of bits allocated to the compressed data for each sub-band is determined by using bits that can be allocated to the audio frame when compressing the audio signal.

11. A digital audio signal compression method for compressing a digital audio signal, comprising: obtaining a mask-to-noise ratio improvement index value from a mask-to-noise ratio in each subband until the mask-to-noise ratio improvement index value exceeds the calculated mask-to-noise ratio improvement index value. A digital audio signal compression method, wherein a bit allocation process is performed for each sub-band.

12. An input process for inputting a digital audio signal, an orthogonal transform process for orthogonally transforming the input digital audio signal, and a mask based on a speech masking effect from a power spectrum which is a result of the orthogonal transform process. A mask amount calculation process for calculating the amount, and either the power spectrum or the mapped signal obtained by mapping the digital audio signal on the frequency axis by a method different from the orthogonal transformation process, and the mask amount calculation process. A signal-to-mask ratio calculation process for calculating a ratio with the mask amount in a predetermined frequency range unit; a signal-to-noise ratio definition process for defining an SN ratio for a predetermined bit allocation amount; Each time the amount is changed, the result of the signal-to-noise ratio definition process is updated, and the signal-to-mask A repetition calculation process for calculating the mask-to-noise ratio with the result of the ratio calculation process; and a bit allocation process for determining the bit allocation number of the compressed data in the predetermined frequency width unit based on the result of the repetition calculation process. In the digital audio signal compression method to be performed, a bit allocation target value setting process for setting a common predetermined bit allocation target value in the predetermined frequency range unit, and a result of the signal-to-noise ratio defining process are as described above. A digital processing unit for performing a selection process of selecting a value that is smaller than a bit allocation target value obtained by a bit allocation target value setting process, and performing a bit allocation operation using a result of the selection process. Audio signal compression method.

13. An input process for inputting a digital audio signal, an orthogonal transform process for orthogonally transforming the input digital audio signal in a predetermined period unit of an audio frame, and a digital signal obtained by the orthogonal transform process. A mask amount calculation process for calculating a mask amount for each sub-band which is a predetermined frequency range unit based on a sound masking effect from a power spectrum of an audio signal, and converting the power spectrum or the digital audio signal to The difference between one of the mapped signals mapped on the frequency axis by a method different from the orthogonal transformation process and the mask amount obtained by the mask amount calculation process is calculated for each subband, and the signal-to-mask ratio in the audio frame is calculated. The required signal-to-mask ratio calculation A signal-to-noise ratio storage process for storing a signal-to-noise ratio for a quantization step being set as a table, and a calculation of the signal-to-noise ratio and the signal-to-mask ratio output from the signal-to-noise ratio storage process A mask-to-noise ratio calculation process for calculating a difference between the signal-to-mask ratio obtained by the processing for each sub-band to obtain a mask-to-noise ratio; and a mask-to-noise ratio obtained by the mask-to-noise ratio calculation process. In the digital audio signal compression method, the signal-to-noise ratio and a bit allocation process for determining a bit allocation number of compressed data for each sub-band by using bits that can be allocated to the audio frame during audio signal compression. Calculating an average value in a predetermined sub-band of the mask-to-noise ratio obtained by the mask-to-noise ratio calculation processing. In the bit allocation process, the mask-to-noise ratio, the mask-to-noise ratio average value, the signal-to-noise ratio, and the audio-to-noise ratio compression process are performed in the bit allocation process. Determining a bit allocation number of compressed data for each of said subbands by using bits that can be allocated to said audio frame.

14. The mask-to-noise ratio for one of the sub-bands is updated in the mask-to-noise ratio calculation process such that the quantization step is refined until the average value exceeds the mask-to-noise ratio. 14. The digital audio signal compression method according to claim 13, further comprising a repetitive mask-to-noise ratio calculation process, and thereafter repeating the same calculation for other subbands.

15. A signal pair for searching one of the subbands for the signal-to-noise ratio exceeding the mask-to-noise ratio average value in the mask-to-noise ratio calculation process using the signal-to-noise ratio search process. 14. The digital audio signal compression method according to claim 13, further comprising a noise ratio search process, and thereafter repeating the same operation for other subbands.

16. After the mask-to-noise ratio exceeds the mask-to-noise ratio average value for the predetermined sub-band, the mask-to-noise ratio for each sub-band, the mask-to-noise ratio average value, A digital audio signal compression method, comprising: determining a bit allocation number of compressed data for each sub-band by using the signal-to-noise ratio and bits that can be allocated to the audio frame during audio signal compression.

17. An input process for inputting a digital audio signal, an orthogonal transform process for orthogonally transforming the input digital audio signal in a predetermined period unit of an audio frame, and a digital signal obtained by the orthogonal transform process. A mask amount calculation process for calculating a mask amount for each sub-band which is a predetermined frequency width unit based on a sound masking effect from a power spectrum of an audio signal, and the power spectrum or the digital audio signal A difference between one of the mapped signals mapped on the frequency axis by a method different from the orthogonal transformation process and the mask amount obtained by the mask amount calculation process is calculated for each of the subbands, and a signal-to-mask ratio in the audio frame is obtained. Signal-to-mask ratio calculation processing and preset A signal-to-noise ratio storing process for storing a signal-to-noise ratio for a quantization step being set as a table; and calculating the signal-to-noise ratio and the signal-to-mask ratio output from the signal-to-noise ratio storing process. A mask-to-noise ratio calculation process for calculating a difference between the signal-to-mask ratio obtained by the processing for each sub-band to obtain a mask-to-noise ratio; and a mask-to-noise ratio obtained by the mask-to-noise ratio calculation process. In the digital audio signal compression method, the signal-to-noise ratio and a bit allocation process of determining a bit allocation number of compressed data for each of the sub-bands using bits that can be allocated to the audio frame during audio signal compression, The number of bits that can be allocated to the audio frame and the quantization step during audio compression in advance; A mask-to-noise ratio improvement amount calculation process for calculating a mask-to-noise ratio improvement amount assigned to the audio frame from a relationship between the signal-to-noise ratio, and a mask-to-noise ratio obtained by the mask-to-noise ratio calculation process in the subband. A mask-to-noise ratio sum calculation process for calculating the sum of the ratios; a mask-to-noise ratio improvement amount as a result of the mask-to-noise ratio improvement amount calculation process; and a mask pair obtained by the mask-to-noise ratio sum calculation process. A digital audio signal compression method comprising: obtaining a sum with a noise ratio sum; performing a division by the number of sub-bands; and performing a target value calculation process for obtaining a bit allocation target value.

18. A signal-to-noise ratio for searching one of the sub-bands using the signal-to-noise ratio search process to find the signal-to-noise ratio exceeding the bit allocation target value in the mask-to-noise ratio calculation process. 18. The digital audio signal compression method according to claim 17, further comprising a search process, and thereafter repeating the same operation for other subbands.

19. A repetitive mask for updating the mask-to-noise ratio for one of the sub-bands so as to refine the quantization step until the bit-allocation target value is exceeded in the mask-to-noise ratio calculation process. 18. The digital audio signal compression method according to claim 17, further comprising a noise-to-noise ratio calculation process, and thereafter repeating the same calculation for other subbands.

20. For the predetermined sub-band, after the mask-to-noise ratio exceeds the bit allocation target value, the mask-to-noise ratio, the mask-to-noise ratio average value, and the signal pair for each of the sub-bands. 19. The noise ratio and the number of bits to be allocated to the compressed data for each sub-band are determined by using bits that can be allocated to the audio frame when compressing the audio signal.
Or a digital audio signal compression method according to item 19.

21. A storage medium storing a program for causing a computer to function as each means constituting the audio signal compression apparatus according to claim 1 so as to be readable from the computer.

22. A storage medium storing a program for causing a computer to execute the procedure of the audio signal compression method according to claim 11 so as to be readable from the computer.

23. The digital audio signal compression apparatus according to claim 2, wherein said bit allocation means performs bit allocation for all subbands requiring bit allocation.

24. The bit allocation process according to claim 11, wherein bit allocation is performed for all subbands requiring bit allocation.
3. A digital audio signal compression method according to claim 1.