JP2004004530A

JP2004004530A - Encoding apparatus, decoding apparatus and method thereof

Info

Publication number: JP2004004530A
Application number: JP2003012952A
Authority: JP
Inventors: Naoya Tanaka; 直也田中; Mineo Tsushima; 峰生津島; Takeshi Norimatsu; 武志則松
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2002-01-30
Filing date: 2003-01-22
Publication date: 2004-01-08
Anticipated expiration: 2023-01-22
Also published as: JP4272897B2

Abstract

【課題】音響信号を高効率に符号化し、広い帯域幅の音響信号を高品質に復号化できる符号化装置および復号化装置を提供する。
【解決手段】符号化装置２００は、入力信号２０７を低域信号２０８と高域信号２０９とに分割する帯域分割部２０１と、低域信号２０８を符号化し、低域符号２１３を生成する低域符号化部２０２と、高域信号２０９と低域信号２０８との類似度を判定して類似度情報２１１と切換情報２１０とを生成する類似度判定部２０３と、高域信号２０９を、異なる符号化方法で符号化し、高域符号２１２を生成するｎ個の高域符号化部２０５と、切換情報２１０に基づいて、高域符号化部２０５の１つを選択して、選択された高域符号化部２０５に符号化を実行させる切換部２０４と、低域符号２１３、高域符号２１２および切換情報２１０を多重化して、出力符号２１４を生成する符号多重化部２０６とを備える。
【選択図】図１An encoding apparatus and a decoding apparatus capable of encoding an acoustic signal with high efficiency and decoding an acoustic signal with a wide bandwidth with high quality are provided.
An encoding apparatus includes: a band dividing unit that divides an input signal into a low-frequency signal and a high-frequency signal; and a low-frequency code that generates a low-frequency code. The encoding unit 202, the similarity determination unit 203 that determines the similarity between the high frequency signal 209 and the low frequency signal 208 and generates the similarity information 211 and the switching information 210, and the high frequency signal 209 are encoded with different codes. N high-frequency encoding units 205 that generate a high-frequency code 212 and select one of the high-frequency encoding units 205 based on the switching information 210 and select the selected high frequency band A switching unit 204 that causes the encoding unit 205 to perform encoding, and a code multiplexing unit 206 that multiplexes the low frequency code 213, the high frequency code 212, and the switching information 210 to generate an output code 214 are provided.
[Selection] Figure 1

Description

【０００１】
【発明の属する技術分野】
本発明は、入力された音響信号を符号化する符号化装置およびその符号化装置によって得られた符号を蓄積もしくは伝送した後、復号化を行って再び音響信号を再生する復号化装置に関し、また、前記符号化装置および前記復号化装置における符号化方法および復号化方法に関する。
【０００２】
【従来の技術】
一般的な音響信号を、少ないビット量で符号化し、かつ、高品質な再生信号を得るための技術として、帯域分割符号化を利用する方法が広く知られている。これは、入力された音響信号を、帯域分割フィルタを用いて複数の周波数帯域の信号に分割するか、もしくはフーリエ変換等の時間−周波数変換を用いて周波数軸の信号に変換した後、周波軸上で複数の帯域に分割した上で、分割された各帯域に適切な符号化ビット割当を行うことにより、実現されるものである。帯域分割符号化を用いることにより、少ないビット量の符号から高品質な再生信号を得られる理由は、符号化段階において人間の聴覚特性に基づいた処理を行うことができることにある。
【０００３】
一般に、人間の聴覚は、１０ｋＨｚ程度以上の高い周波数の音に対しては感度が下がり、レベルの低い音は感知されにくくなる。また、周波数マスキングと呼ばれる現象も良く知られており、ある特定の周波数帯域に高いレベルの音が存在する場合、その周辺帯域のレベルの低い音は感知されにくくなる。このような、聴覚的な特性によって感知されにくい部分については、多くのビットを割り当てて符号化を行っても再生信号の品質向上にはほとんど寄与せず、符号化する意味をなさない。逆に、聴覚的特性を考慮しないままこの部分に割り当てられていた符号化ビットを、他の聴覚的に敏感な部分に割当て直すことによって、聴覚的に敏感な部分を詳細に符号化し、再生信号の品質を向上することができる。
【０００４】
このような帯域分割を利用した符号化の代表例として、ＩＳＯ国際標準規格ＭＰＥＧ−４　ＡＡＣ（ＩＳＯ／ＩＥＣ　１４４９６−３）がある。以下、図を参照しながら、ＭＰＥＧ−４　ＡＡＣ（以下ＡＡＣと表記する）の動作を説明する。
【０００５】
図２６は、従来のＡＡＣ方式を用いた符号化装置１００の構成を示すブロック図である。符号化装置１００は、入力信号１０９を聴覚特性に従って評価し、評価に応じたビット割当てにより符号化する符号化装置であって、聴覚特性評価部１０１、変換ブロック長選択部１０２、ＭＤＣＴ変換部１０３、帯域分割部１０４、スペクトル信号処理部１０５、ビット割当部１０６、量子化部１０７および符号多重化部１０８を備える。
【０００６】
入力信号１０９は、１０２４サンプル／フレームの基本フレーム長で分割され聴覚特性評価部１０１およびＭＤＣＴ変換部１０３に入力される。聴覚特性評価部１０１は、人間の聴覚特性に従って入力信号１０９を評価し、聴覚特性評価値１１０を出力する。変換ブロック長選択部１０２は、聴覚特性評価値１１０から、入力信号１０９を符号化するのに適切な変換ブロック長を選択し、ＭＤＣＴ変換部１０３に出力する。ＭＤＣＴ変換部１０３は、選択された変換ブロック長を適用して、入力信号１０９をＭＤＣＴ係数１１１に変換する。ＡＡＣの場合、変換ブロック長は１２８サンプルもしくは１０２４サンプルであり、通常、入力信号１０９が過渡的な場合には短い変換ブロック長、定常的な場合には長い変換ブロック長が割り当てられる。
【０００７】
ここで用いられるＭＤＣＴ（Ｍｏｄｉｆｉｅｄ　Ｄｉｓｃｒｅｔｅ　Ｃｏｓｉｎｅ　Ｔｒａｎｓｆｏｒｍ）はコサイン変換の一種であり、算出されたＭＤＣＴ係数１１１は、入力信号１０９の周波数スペクトルを表現する係数となる。算出されたＭＤＣＴ係数１１１は、帯域分割部１０４において複数の周波数バンド（サブバンド）に分割される。その後、スペクトル信号処理部１０５は、各周波数バンドに分割されたＭＤＣＴ係数１１２に対して、より高効率な符号化を可能にするための予測、および聴覚特性評価値１１０に基づくノイズシェイピング処理を行う。
【０００８】
また、スペクトル信号処理部１０５では、さらに、入力信号１０９がステレオ信号等の複数チャネルからなる信号の場合には、チャネル間の信号の相関を利用して符号化効率を高めるジョイントステレオと呼ばれる処理も行われる。さらに、ＰＮＳ（Ｐｅｒｃｅｐｔｕａｌ　Ｎｏｉｓｅ　Ｓｕｂｓｔｉｔｕｔｉｏｎ）と呼ばれる処理が行われる場合があるが、これについては後で詳しく説明する。
【０００９】
なお、スペクトル信号処理部１０５においてどのような処理が行われたかについての情報は、補助情報符号１１４として出力される。ビット割当部１０６は、聴覚特性評価値１１０に基づいて、量子化に必要なビット割当１１５を算出し、量子化部１０７に出力する。量子化部１０７は、スペクトル信号処理部１０５で処理されたＭＤＣＴ係数１１３を、ビット割当１１５で示されるビット数で量子化する。量子化は、スケールファクタと呼ばれるサブバンドごとの正規化ゲイン情報と、スケールファクタによって正規化されたＭＤＣＴ係数の値の組み合わせに対して行われる。
【００１０】
符号多重化部１０８は、スペクトル信号処理部１０５から出力される補助情報符号１１４と、量子化部１０７から出力されるスペクトル符号１１６とを多重化し、所定のフォーマットに並べ換えて出力符号１１７として出力する。なお、ＡＡＣの場合、基本フレームに対して割り当てるビット数は、フレームごとに任意に定めることができるため、基本的に可変ビットレートでの符号化となるが、最終出力の前段にビットリザーバと呼ばれるバッファを設け、フレームごとのビットレートの変動をバッファによって吸収することにより、固定ビットレートでの伝送を可能にしている。
【００１１】
次に、ＰＮＳの処理について説明する。ＰＮＳでは、上記の各サブバンドについて、そのバンドが聴覚的にノイズ成分として表せるかどうかを判定し、ノイズ成分として表せると判定した場合には、そのバンドのＭＤＣＴ係数を、ランダムに生成されるノイズ信号で置き換える。ノイズ信号で置き換えられたバンドに対しては、ＭＤＣＴ係数の値の量子化を行う必要が無くなり、スケールファクタに対応するゲイン情報のみ量子化すれば良いので、量子化に必要な符号化ビット数を大幅に削減することが可能となる。
【００１２】
このような符号化処理を行うことにより、ＭＰＥＧ−４　ＡＡＣでは、例えば、９６ｋｂｐｓ程度のビットレートにおいて、２０Ｈｚ−１６ｋＨｚもしくはそれ以上の広帯域のステレオ信号を高音質に符号化することができる。
【００１３】
【発明が解決しようとする課題】
しかしながら、ビットレートをさらに低下させた場合、例えばステレオ信号に対して４８ｋｂｐｓ程度のビットレートとした場合には、高音質に符号化できる帯域が狭くなり、聴感上こもった感じの音になってしまうという問題点がある。
【００１４】
また、ビットレートを低下させる段階で、ＭＤＣＴ係数の符号化ビット数を削減するためにＰＮＳを多用すると、ノイズ信号によって置き換えられる部分が増加し、聴感上でもノイズや歪みの多い音となってしまうという問題がある。
【００１５】
上記課題に鑑み、本発明は、符号化装置で符号化された音響信号の符号を低ビットレートで伝送する場合において、符号を受け取った復号化装置側で、広い帯域幅の音響信号を高品質に復号化することができる符号化装置、復号化装置、符号化方法および復号化方法を提供することを目的とする。
【００１６】
【課題を解決するための手段】
本発明の符号化装置は、上記目的を達成するために、入力信号を低い周波数帯域の信号を表す低域信号と高い周波数帯域の信号を表す高域信号とに分割する帯域分割手段と、分割された低域信号を符号化し、低域符号を生成する低域符号化手段と、前記高域信号と前記低域信号との類似度を判定する類似度判定手段と、前記高域信号を、異なる符号化方法で符号化し、高域符号を生成する複数の高域符号化手段と、前記類似度に基づいて、前記複数の高域符号化手段の１つを選択して、選択された高域符号化手段に符号化を実行させ、選択された高域符号化手段の符号化方法を特定するための選択情報を生成する符号化選択手段と、前記低域符号化手段からの低域符号、選択された前記高域符号化手段からの高域符号および前記符号化選択手段からの選択情報を多重化して、出力符号を生成する符号多重化手段とを備える。
【００１７】
また、この符号化装置によって符号化された出力符号を復号化する復号化装置は、入力符号から、低い周波数帯域の信号を表す低域信号を復号化する低域復号化手段と、入力符号から、異なる復号化方法を用いて、高い周波数帯域の信号を表す高域信号を復号化する複数の高域復号化手段と、入力符号に含まれる選択情報に基づいて、前記高域復号化手段の１つを選択し、選択した高域復号化手段に復号化を実行させる復号化選択手段と、前記低域復号化手段によって復号化された前記低域信号と、選択された前記高域復号化手段によって復号化された前記高域信号とを合成して、出力復号化信号を生成する信号合成手段とを備える。
【００１８】
【発明の実施の形態】
以下、本発明の実施の形態について図面を参照しながら詳細に説明する。
（実施の形態１）
図１は、本発明の実施の形態１に係る符号化装置２００の構成を示すブロック図である。符号化装置２００は、入力信号について、周波数高域部の信号と周波数低域部の信号との類似度を調べ、高域部信号を、より少ないビット量で符号化する符号化装置であって、帯域分割部２０１、低域符号化部２０２、類似度判定部２０３、切換部２０４、高域符号化部２０５および符号多重化部２０６を備える。これにおいて、帯域分割部２０１は、入力信号２０７を帯域分割し、低い周波数帯域の信号のみを含む低域信号２０８と高い周波数帯域の信号のみを含む高域信号２０９とを出力する。
【００１９】
低域信号２０８に含まれる上限周波数と高域信号に含まれる下限周波数の境界周波数は任意に定めてよく、また、両者にオーバーラップがあってもよい。低域符号化部２０２は、低域信号２０８を符号化し、低域符号２１３を出力する。ここで、低域符号化部２０２としては公知のどのような符号化部を用いても良い。次に、類似度判定部２０３は、低域信号２０８と高域信号２０９との類似度を判定し、切換情報２１０と類似度情報２１１とを出力する。類似度の判定方法としては、例えば予測誤差を用いることができるが、具体的な方法については後で詳細を説明する。
【００２０】
高域符号化部２０５は、切換情報２１０および類似度情報２１１の値に応じて高域信号２０９を最適に符号化する。具体的には、高域符号化部２０５は、第１〜第ｎ（ｎは自然数）からなるｎ個の高域符号化部によって構成されており、切換情報２１０に応じてそのうちの１つが選択される。第１〜第ｎの各高域符号化部は、低域信号２０８と高域信号２０９との類似度によって分類される特性の異なる高域信号２０９を最適に符号化できるように構成されている。
【００２１】
切換部２０４は、類似度判定部２０３が出力した切換情報２１０に基づいて、第１〜第ｎの符号化部のうちからあらかじめ定められた最適な高域符号化部を選択し、選択された高域符号化部は、類似度情報２１１を参照しながら高域信号２０９を符号化して、高域符号２１２を出力する。類似度判定部２０３の一例としては、類似度が低い場合には、入力信号２０７はランダムなノイズ性の信号であると判断して、第１〜第ｎの高域符号化部のうちから、ノイズを用いて高域信号２０９を符号化する高域符号化部が選択されるような切換情報２１０を出力し、逆に類似度が高い場合には、類似度が高いと判定された低域信号２０８を利用して高域信号２０９から低域信号２０８を差し引いた差分信号を符号化する高域符号化部が選択されるような切換情報２１０を出力する。
【００２２】
また、類似度判定部２０３は、低域信号２０８と高域信号２０９との類似度が高い場合には、高域符号化部２０５に対し、類似度情報２１１の一部として類似度の高い部分の低域信号２０８または高域信号２０９と類似度の高い低域信号２０８との差分値を出力し、低域信号２０８と高域信号２０９との類似度が低い場合には、類似度情報２１１として「０」を出力するような構成が考えられる。
【００２３】
一方、高域符号化部２０５において、ノイズを用いる符号化では、選択された高域符号化部は、符号化対象の高域信号をランダムに生成されるノイズ信号とみなし、高域信号の強度を示すゲイン情報のみを符号化するので、高域信号の値を直接符号化するのと比較して符号のビット量を大幅に削減できる。また、類似度が高い場合には、差分信号を用いる符号化では、高域信号２０９と類似する低域信号２０８との差分を量子化することによって、符号化対象である高域信号２０９の符号化に必要なビット量を削減できる。
【００２４】
なお、類似度が非常に高い場合や聴覚特性上あまり重要でない非常に高い周波数帯域の信号を符号化する際には、差分信号を符号化せずに、対象となる高域信号２０９を類似の低域信号２０８で置き換えても良い。このようにすれば、置き換える元の低域信号２０８を特定するための情報だけを符号化すればよいので、高域信号２０９の符号化に必要なビット量をさらに削減することができる。低域符号化部２０２、高域符号化部２０５および類似度判定部２０３の各符号化処理の出力結果である低域符号２１３、高域符号２１２および切換情報２１０は、符号多重化部２０６において多重化され、出力符号２１４として出力される。
【００２５】
次に、類似度の判定方法の一例として、予測誤差を用いる方法を説明する。低域信号２０８と高域信号２０９とは、入力信号を帯域分割して生成された信号であるので、信号に含まれる周波数成分が異なっている。したがって、それぞれの信号を時間信号として見る限りにおいては両者の類似性を判定することは困難であるが、これらの信号を周波数信号として見た場合には、それぞれの周波数成分の分布比較の形で両者の類似性の判定が容易となる。以下に説明する図２は、この原理を表したものであり、周波数成分の異なる信号の例として、同じ振幅の１ｋＨｚの正弦波と２ｋＨｚの正弦波を時間信号と周波数信号（例としてＦＦＴパワスペクトル係数）として示している。
【００２６】
図２は、２つの信号を時間信号としてみた場合と周波数信号としてみた場合との相違を示す図である。図２（ａ）は、周波数の異なる２つの正弦波を時間信号として示す図である。図２（ｂ）は、周波数の異なる２つの正弦波を周波数信号として示す図である。図２（ａ）において、時間信号として表された１ｋＨｚの正弦波３０１と２ｋＨｚの正弦波３０２との間には容易に判定できる類似点は無い。これに対して、図２（ｂ）において、周波数信号として表された１ｋＨｚの正弦波３０３と２ｋＨｚの正弦波３０４は共に１本の線スペクトルとして表され、両者の類似度が非常に大きいことが容易に判定できる。すなわち、両者の違いは「周波数」で表される位置情報が異なるだけであり、例えば、図２（ｂ）において２ｋＨｚの正弦波３０４の位置を基準点３０５として、基準点３０５から取り出した信号と、基準点３０５から１ｋＨｚの正弦波までの距離ｋ（＝シフト数３０６）だけ離れた位置から取り出した信号とを比較すると、２つの信号は一致する。
【００２７】
このような原理に基づいて、低域信号２０８と高域信号２０９とは、類似度判定部２０３において、時間−周波数変換を用いて入力信号２０７の周波数成分を示すスペクトル係数に変換され、類似度判定が行われる。使用される時間−周波数変換としては、フーリエ変換、コサイン変換、フィルタバンクなど、公知のいかなる変換を用いても良い。図２の例で示した様に、低域信号２０８と高域信号２０９との類似度は、一方の信号を周波数軸上でシフトして他方の信号に重ね合わせることで判定でき、予測誤差は信号を重ね合わせた時に一致しない部分を表す尺度として与えられる。本発明では、高域信号２０９を符号化するために低域信号２０８を利用するので、高域信号２０９をターゲットとして固定し、低域信号２０８をシフトして類似度を判定する構成が望ましい。この構成において、予測誤差は、スペクトル係数に変換された低域信号２０８をｘ（ｉ）、高域信号２０９をｙ（ｊ）として、
【００２８】
【数１】

【００２９】
で表される。ここで、ｘ（ｋ，ｊ）はｘ（ｊ）を基準点３０５としてｋサンプルだけシフトした位置から取り出した信号、ａはゲイン、ｍは低域信号ｘ（ｋ，ｊ）および高域信号ｙ（ｊ）に含まれるスペクトル係数の個数である。スペクトル係数の数ｍは符号化対象となる高域信号ｙ（ｊ）の範囲を指定するもので、任意に決めて良い。基準点３０５の位置および低域信号のシフト数ｋの上限（ｋ＿ＭＡＸ）と下限（ｋ＿ＭＩＮ）とは任意に定めることが可能であり、設定されたシフト数範囲内でｋを変化させ、予測誤差Ｅ（ｋ）を評価して最小値を選択すれば、ターゲットとなる高域信号ｙ（ｊ）に対して最も類似度の高い低域信号ｘ（ｋ，ｊ）とシフト数ｋおよびゲインａを確定することができる。ｘ（ｋ，ｊ）はシフト数ｋが確定すれば一意に定まるので、類似度情報２１１としては予測誤差Ｅ（ｋ）とシフト数ｋとゲインａを使用すれば良い。
【００３０】
第１〜第ｎの高域符号化部の切り換えは、予測誤差Ｅ（ｋ）を評価尺度として行われ、予測誤差Ｅ（ｋ）があらかじめ定められた閾値よりも小さければ、類似度が高い場合に使用される高域符号化部が選択される切換情報２１０を出力し、逆に大きければ類似度が低い場合に使用される高域符号化部が選択される切換情報２１０を出力する。
【００３１】
なお、上記の説明では、類似度判定部２０３における類似度判定の方法としてスペクトル係数の予測誤差を用いたが、それ以外に、例えばフィルタバンクにより帯域分割された信号に対しては、時間信号の予測誤差を用いることもできる。また、上記の説明ではノイズを用いて符号化する高域符号化部と、低域信号２０８と高域信号２０９との類似度を利用する高域符号化部を用いたが、例えば、類似度が中程度の場合には、類似度を利用する符号化とノイズによる符号化とを組み合わせた高域符号化部を用いるとしてもよい。なお、入力信号がステレオ信号など複数チャネルの信号である場合には、高域信号２０９をチャネル毎に独立に符号化する方法に加えて、ジョイントステレオ符号化技術を用いて符号化するように構成することもできる。
【００３２】
以上で説明したように、入力信号２０７を高域信号２０９と低域信号２０８とに帯域分割し、高域信号２０９を、当該高域信号２０９と低域信号２０８との類似度を利用して符号化することにより、より少ないビット量の符号から高品質な再生信号を得ることができる。
【００３３】
図３は、実施の形態１に係る復号化装置４００の構成を示すブロック図である。復号化装置４００は、図１に示した符号化装置２００によって符号化された出力符号２１４を復号化し、音響信号を再生する復号化装置であって、符号分割部４０１、低域復号化部４０２、切換部４０３、高域復号化部４０４および信号合成部４０５を備える。
【００３４】
図３において、入力符号４０６は図１における符号化装置２００の出力符号２１４に相当する。符号分割部４０１は、入力符号４０６を、低域符号４０７、高域符号４０８および切換情報４０９に分割する。低域復号化部４０２は、低域符号４０７を復号化して低域信号４１０を出力する。低域復号化部４０２としては、低域符号４０７を復号化する機能を有していれば、公知のどのような復号化部を使用しても良い。
【００３５】
高域復号化部４０４は第１〜第ｎからなるｎ個の高域復号化部によって構成されており、それぞれの高域復号化部は、あらかじめ定められた方法に従って高域符号を復号化するように構成されている。切換部４０３は、切換情報４０９によって指定される高域復号化部を選択し、選択された高域復号化部は、低域信号４１０を参照しながら高域符号４０８を復号化し高域信号４１１を出力する。信号合成部４０５は、低域信号４１０と高域信号４１１とを合成し、出力信号４１２を生成する。
【００３６】
ここで、先に説明した符号化装置２００側の高域符号化部２０５が、ノイズを用いて符号化する高域符号化部と、類似と判定された低域信号を利用して、高域信号から低域信号を差し引いた差分信号を符号化する高域符号化部とから構成される場合、復号化装置４００においても、これに対応する高域復号化部４０４を備える必要がある。以下では、その構成例およびその動作を説明する。説明のため、図３において、ノイズを用いて符号化された符号を復号化するための復号化部を第１の高域復号化部、低域信号と高域信号との差分信号として符号化された符号を復号化する復号化部を第２の高域復号化部とする。
【００３７】
切換情報４０９によって、高域符号４０８が第１の復号化部に入力されることが示された場合、復号化されるべき高域信号はランダムなノイズ信号で表現される。したがって、高域符号４０８には信号強度を表すゲイン情報のみが含まれる。これに応じて、第１の高域復号化部は、高域符号４０８に含まれるゲイン情報を復号化するとともに、内部でランダムなノイズ信号を生成し、生成されたノイズ信号に前記ゲイン情報で示されるゲインを乗じることによって高域信号４１１を生成する。
【００３８】
これに対して、高域符号４０８が第２の高域復号化部に入力されることが示された場合には、高域符号４０８には例えば、低域信号２０８において高域信号２０９と類似度の高い部分を特定するためのシフト数、ゲインおよび差分信号などの類似度情報が符号化されて含まれている。また、高域信号は、低域復号化部４０２で得られた低域信号４１０から抽出された低域信号と、第２の高域復号化部で得られた差分信号との和によって表現される。したがって、第２の高域復号化部は、まず高域符号４０８を復号化し、その高域符号４０８に含まれる類似度情報に基づいて、高域信号２０９に類似する低域信号２０８を低域信号４１０から抽出し、復号化したゲインを乗じた後、復号化した差分信号を加算することにより高域信号４１１を生成する。このようにして得られた低域信号４１０と高域信号４１１とは信号合成部４０５において合成され、出力信号４１２として出力される。
【００３９】
以上で説明したように、高域信号と低域信号とに分割されて符号化されている入力符号から、高域信号を、高域信号と低域信号との類似度を利用して復号化することにより、より効率的に符号化された高域符号を利用することができるので、より少ないビット量の符号から高品質な再生信号を得ることができる。
（実施の形態２）
本発明の実施の形態２における符号化装置の構成を、図４を参照しながら説明する。図４は、実施の形態２に係る符号化装置５００の構成を示すブロック図である。
【００４０】
図４に示される符号化装置５００は、図１に示される実施の形態１における符号化装置２００に対して、新たに低域復号化部５０１を設け、低域符号化部２０２で符号化された低域符号２１３を復号化して低域復号化信号５１０を生成し、類似度判定部５０３に対して、低域信号２０８の代わりに低域復号化信号５１０を入力させるように構成したものである。このような符号化装置５００は、帯域分割部２０１、低域符号化部２０２、低域復号化部５０１、類似度判定部５０３、切換部２０４、高域符号化部２０５および符号多重化部２０６を備える。
【００４１】
なお、図４において、以下の図においても同様、図１に示した符号化装置２００と同様の構成要素については既に説明しているので、同一の参照符号を付し説明を省略する。このように構成された符号化装置５００において、類似度判定部２０３は、低域符号化部２０２によって一旦、符号化された後、低域復号化部５０１によって復号化された低域復号化信号５１０を用いて高域信号２０９との類似度を判定するので、復号化装置において、原音の低域信号２０８を用いて高域信号２０９との類似度を判定した符号化装置２００からの出力符号２１４を復号化する場合と比べて、より原音に近い高域信号を得ることができるという効果がある。
【００４２】
具体的には、図３の復号化装置４００の例で示したように、復号化装置側では、原音の低域信号ではなく、一旦、符号化された低域符号を復号化して得られる低域信号を利用して、高域信号を復号化する。このことは、高域信号をより正確に符号化するためには、一旦、符号化された後、復号化された低域信号と符号化のターゲットとなる高域信号との類似度に基づいて、高域信号の符号化を行う必要があることを示している。したがって、符号化装置５００においては、類似度判定部５０３により算出される類似度情報５１１は、高域信号２０９と低域復号化信号５１０との類似度となる。
【００４３】
これにより、高域符号化部２０５は、その類似度情報５１１と切換情報５１３にしたがって高域信号２０９を符号化するので、復号化装置において高域信号をより正確に復号化できる高域符号５１２を出力することができる。この結果、符号化装置５００では、このようにして得られた切換情報５１３、高域符号５１２および低域符号２１３が多重化された出力符号５１４を出力し、復号化装置における復号化信号の品質を向上させることができる。
【００４４】
なお、実施の形態１の符号化装置２００と実施の形態２の符号化装置５００とは、類似度を算出するために使用する低域信号が異なるのみであり、高域符号化部と低域符号化部の構成および符号化の対象となる高域信号と低域信号は同一であるので、実施の形態２の符号化装置５００に対応する復号化装置の構成は、図３に示される実施の形態１の復号化装置４００と同様の構成で良い。
（実施の形態３）
本発明の実施の形態３における符号化装置の構成を、図５を参照しながら説明する。図５は、実施の形態３に係る符号化装置６００の構成を示すブロック図である。
【００４５】
図５に示される符号化装置６００は、図１に示される実施の形態１における符号化装置２００に対して、新たにダウンサンプリング部６０１を設け、低域符号化部２０２に対して、帯域分割された原音の低域信号２０８の代わりに、ダウンサンプリング部６０１によってダウンサンプリングされた低域信号６１６を入力させ、符号化結果である低域符号６１３を出力させるように構成したものである。
【００４６】
一般に帯域分割部２０１によって帯域分割された低域信号２０８および高域信号２０９は、入力信号２０７と同じサンプリング周波数の信号である。一例として、入力信号２０７のサンプリング周波数を４８ｋＨｚ、帯域分割部２０１によって分割される境界周波数を８ｋＨｚとすると、サンプリング定理により入力信号２０７には０−２４ｋＨｚの信号が含まれているので、低域信号２０８に含まれる信号の帯域は０−８ｋＨｚ、高域信号２０９は８−２４ｋＨｚとなるが、低域信号２０８、高域信号２０９共にサンプリング周波数は４８ｋＨｚで変化しない。この低域信号２０８をそのまま低域符号化部２０２によって符号化することは可能であるが、一般に、低域符号化部２０２として用いる公知の符号化部は、入力信号２０７に含まれる信号帯域が、入力される信号のサンプリング周波数で決まる最大周波数（＝サンプリング周波数／２）である時に最も効率的に動作するように構成されている。
【００４７】
したがって、サンプリング周波数４８ｋＨｚで２４ｋＨｚまでの信号を含むことを前提として構成された低域符号化部２０２に、前提の１／３である８ｋＨｚまでしか含まない低域信号２０８を入力させて符号化を行うことによって、低域符号化の効率が低下するという問題が発生する。
【００４８】
この問題を解決するためには、低域符号化部２０２に入力される低域信号２０８のサンプリング周波数を低域信号２０８に含まれる信号の周波数帯域に合わせてダウンサンプリングし、低域符号化部２０２の効率を上げるようにすればよい。より具体的には、ダウンサンプリング部６０１に入力される低域信号２０８が時間信号である場合、ダウンサンプリング部６０１は所望のサンプリング周波数となるよう、サンプリングされたデータを間引き、入力される低域信号２０８が周波数変換されたスペクトルデータである場合、低域符号化部２０２に低域信号２０８のデータ長を知らせる。本例では、低域信号２０８に含まれる信号の周波数帯域は０−８ｋＨｚであるので、低域信号２０８を表現するのに必要なサンプリング周波数は１６ｋＨｚとなる。１６ｋＨｚにダウンサンプリングされた低域信号６１６を低域符号化部２０２に入力させれば、低域符号化部２０２では入力された低域信号２０８を最も効率的に符号化することができ、復号化信号の品質を向上させながらビット量の少ない出力符号６１４を得ることができるという効果がある。
【００４９】
次に、実施の形態３における復号化装置７００の構成を、図６を参照しながら説明する。図６は、図５に示した符号化装置６００によって符号化された出力符号６１４を復号化する復号化装置７００の構成を示すブロック図である。図６に示される復号化装置７００は、図３に示した実施の形態１における復号化装置４００に対して、新たにアップサンプリング部７０１を設け、低域復号化部４０２によって復号化された低域信号７１０をアップサンプリング部７０１においてアップサンプリングし、アップサンプリングされた低域信号４１０を高域復号化部４０４および信号合成部４０５に入力するようにした構成である。この復号化装置７００は、符号分割部４０１、低域復号化部４０２、アップサンプリング部７０１、切換部４０３、高域復号化部４０４および信号合成部４０５を備える。同図において、図３に示した復号化装置４００と同様の構成要素については既に説明しているので、同一の参照符号を付し説明を省略する。
【００５０】
図６において、入力符号７０６は、図５の符号化装置６００における出力符号６１４に対応する。復号化装置７００において、符号分割部４０１から出力された低域符号７０７は、符号化装置６００側でダウンサンプリングされた低域信号６１６を符号化したものであるため、低域復号化部４０２において復号化された低域信号７１０のサンプリング周波数は、符号化装置６００側で使用した低域信号６１６のサンプリング周波数（例えば、１６ｋＨｚ）に等しい。
【００５１】
その一方で、高域符号４０８は、ダウンサンプリングされていない高域信号２０９を符号化したものであり、復号化される高域符号４０８のサンプリング周波数は、符号化装置６００に入力された入力信号２０７のサンプリング周波数（例えば、４８ｋＨｚ）に等しい。したがって、復号化装置７００は、復号化された低域信号７１０を用いて高域信号の復号化処理を行うために、低域信号７１０をアップサンプリングして復号化される高域信号のサンプリング周波数（＝出力信号のサンプリング周波数、例えば、４８ｋＨｚ）に合わせてから高域復号化部４０４に入力するように構成する。同様に、信号合成部４０５においても、復号化された高域信号４１１とアップサンプリングされた低域信号４１０とを用いて信号合成処理を行い、出力信号４１２を生成する。
【００５２】
以上で説明したように、高域符号４０８とダウンサンプリングされた低域符号７０７とに分割されて符号化されている入力符号７０６から、低域信号７１０を復号化した後にアップサンプリングし、高域信号４１１を高域信号４１１とアップサンプリングされた低域信号４１０との類似度を利用して復号化することにより、より効率的に符号化された低域符号を利用して、より少ないビット量の符号から高品質な再生信号を得ることができるという効果がある。
【００５３】
（実施の形態４）
以下では、本発明の実施の形態４における符号化装置８００の構成を、図７を参照しながら詳細に説明する。図７は、実施の形態４に係る符号化装置８００において高域符号８１７および切換情報８１６を生成する部分の構成を示すブロック図である。図７に破線で囲んで示される類似度判定部８１８および高域符号化部８１９の部分は、本発明の実施の形態１から実施の形態３における符号化装置２００および符号化装置５００において、低域信号２０８と高域信号２０９との類似度を判定する類似度判定部２０３および類似度判定部５０３と、その判定結果に応じて高域信号２０９を符号化する高域符号化部２０５とに対応しており、それぞれの類似度判定部および高域符号化部として、より望ましい形態を表すものである。
【００５４】
このような符号化装置８００は、主に、高域信号２０９を符号化する部分として、時間−周波数変換部８０１、帯域分割部８０２、ノイズ性判定部８０３、予測誤差算出部８０４、切換判定部８０５、切換部８０６、第１の高域符号化部８０７および第２の高域符号化部８０８を備える。
【００５５】
図７において、時間−周波数変換部８０１には、例えば、図示しない帯域分割部２０１によってあらかじめ高域部と低域部とに帯域分割された高域信号２０９と低域信号２０８とが入力される。低域信号２０８としては、入力信号から直接分割された低域信号、もしくは、図示しない低域符号化部によって一旦符号化された後、復号化された低域復号化信号のいずれを用いても良い。時間−周波数変換部８０１は、入力された高域信号２０９および低域信号２０８をスペクトル係数に変換し、それぞれ高域係数８１１と低域係数８１２として出力する。時間−周波数変換部８０１において使用する時間−周波数変換としては、フーリエ変換、コサイン変換、フィルタバンクなど、公知のいかなる変換を用いても良いが、ここではＡＡＣで用いられるＭＤＣＴを用いた場合を例として説明する。
【００５６】
帯域分割部８０２は、高域信号２０９を変換したＭＤＣＴ係数である高域係数８１１を、さらに複数の帯域（以下、高域符号化サブバンドと記す。）に分割し、分割された高域係数８１３は、ノイズ性判定部８０３、予測誤差算出部８０４および切換部８０６に出力される。ここで、帯域分割部８０２によって分割された各高域符号化サブバンドに対する以下の処理は同様なので、以下では、ある特定の高域符号化サブバンドにおける処理を説明する。高域係数の符号化を完了するためには、すべての高域符号化サブバンドについて以下の処理を繰り返せばよい。
【００５７】
ノイズ性判定部８０３は、高域係数８１３のノイズ性を判定しノイズ性判定値８１４を出力する。ここでノイズ性判定を行う理由は、符号化対象である高域係数８１３がノイズ性の信号ならば、低域係数８１２との類似度が低く予測誤差が大きいことがあらかじめ推定できることにより、以降の予測誤差算出に係る処理を省略することができるためである。ノイズには、ノイズ性信号を表すＭＤＣＴ係数の値の大きさはほぼ一様な分布を示し、ある帯域に含まれる複数のＭＤＣＴ係数を調べると、（定常的な）ノイズの場合は、ある値（例えば、平均値）の近傍に多くの値が存在するのに対して、非ノイズの場合には、平均値よりも大きく離れたところにも比較的多くの値が存在するという特徴がある。
【００５８】
このような特徴を利用して、帯域分割された高域係数８１３のノイズ性を判定するために、ノイズ性判定部８０３は、例えば、各高域符号化サブバンド内におけるＭＤＣＴ係数の大きさの分布を評価し、あらかじめ定められた閾値に対して、分布の幅が大きければ非ノイズ性、分布の幅が小さければノイズ性の信号であると判定すればよい。ＭＤＣＴ係数の分布を評価する尺度としては、高域符号化サブバンド内のＭＤＣＴ係数の最小値と最大値との差、平均値と最大値との差、分散、標準偏差等のいかなる統計的評価値を用いても良い。
【００５９】
ノイズ性判定部８０３は、判定の結果、高域係数８１３のノイズ性／非ノイズ性を示すノイズ性判定値８１４を予測誤差算出部８０４および切換判定部８０５に出力する。ノイズ性判定値８１４は、高域係数８１３がノイズ性である場合には、例えば「０」、非ノイズ性である場合には、例えば「１」とする。ノイズ性判定値８１４の具体値は、これ以外の値としてもよい。
【００６０】
予測誤差算出部８０４は、ノイズ性判定部８０３の判定結果として、ノイズ性判定値８１４により高域係数８１３が非ノイズ性の信号であると通知された場合には、高域係数８１３と低域係数８１２との類似度を判定するために予測誤差の算出と評価とを行う。予測誤差の算出は、実施の形態１において開示した方法と同様に行うことができる。すなわち、予測誤差Ｅ（ｋ）は、低域係数８１２をｘ（ｉ）、高域符号化サブバンドに含まれる高域係数８１３をｙ（ｊ）、ｍを高域符号化サブバンドに含まれるＭＤＣＴ係数の個数とすれば、数式１で算出することができる。この数式１で、シフト数ｋをあらかじめ定められた下限値Ｋ＿ＭＩＮと上限値Ｋ＿ＭＡＸの間で変化させることにより、類似度情報８１５として予測誤差Ｅ（ｋ）を最小にする低域係数ｘ（ｋ，ｊ）とシフト数ｋとゲインａを求めることができる。
【００６１】
切換判定部８０５は、ノイズ性判定値８１４と類似度情報８１５とから、帯域分割された高域係数８１３の符号化に使用する高域符号化部を決定し、切換情報８１６を出力して切換部８０６により第１の高域符号化部８０７と第２の高域符号化部８０８とを切り換える。第１の高域符号化部８０７は、類似度が低い高域係数８１３を符号化するのに適するように構成された符号化部、第２の高域符号化部８０８は、類似度が高い高域係数８１３を符号化するのに適するように構成された符号化部である。切換部８０６は、▲１▼ノイズ性判定値８１４がノイズ性信号を示す場合と、▲２▼ノイズ性判定値８１４が非ノイズ性信号を示し、かつ、類似度が低い場合とには、第１の高域符号化部８０７を選択し、ノイズ性判定値が非ノイズ性信号を示し、かつ、類似度が高い場合には第２の高域符号化部８０８を選択する。
【００６２】
次に、高域符号化部８１９の動作について図面を参照しながら詳しく説明する。図８（ａ）は、第１の高域符号化部８０７の符号化方法を示す図であり、図８（ｂ）は、第２の高域符号化部８０８の符号化方法を示す図である。第１の高域符号化部８０７に入力される高域係数８１３は、符号化境界周波数９０５において分割された低域符号化バンド９０２と高域符号化バンド９０３とのうち、高域符号化バンド９０３の範囲にあるＭＤＣＴ係数（スペクトル係数）９０１である。さらに高域符号化バンド９０３は、複数の高域符号化サブバンド９０４に分割されている。
【００６３】
第１の高域符号化部８０７に入力されるスペクトル係数９０１は、ノイズ性もしくは低域符号化バンド９０２のＭＤＣＴ係数との類似度が低い係数であるので、第１の高域符号化部８０７は、類似度情報８１５を使用せず、高域符号化サブバンド９０４内の係数をランダムなノイズとみなすことにより符号化する。すなわち、高域符号化サブバンド９０４内の係数は、平均振幅９０６で代表されるランダムな系列として表され、平均振幅９０６（振幅情報）のみが符号化される。なお、ランダムな系列の代表値としては、平均振幅の他に、平均エネルギー、平均パワ等を用いてもよい。
【００６４】
同様に、第２の高域符号化部８０８において、入力される高域係数８１３は、符号化境界周波数９１１において分割された低域符号化バンド９０８と高域符号化バンド９０９とのうち、高域符号化バンド９０９の範囲にあるＭＤＣＴ係数（スペクトル係数）９０７である。さらに高域符号化バンド９０９は、複数の高域符号化サブバンド９１０に分割されている。第２の高域符号化部８０８に入力されるスペクトル係数９０７は、低域符号化バンド９０８のスペクトル係数との類似度が高い係数であるので、第２の高域符号化部８０８は類似度情報８１５を使用して符号化する。高域符号化サブバンド９１０の一つを高域区間Ａ９１５とすると、先に説明した予測誤差算出部８０４において、低域符号化バンド９０８の係数のうちから高域区間Ａ９１５に対して同じ個数のスペクトル係数を含む最も類似度の高い低域区間Ｂ９１４が選択され、類似度情報８１５が出力される。
【００６５】
類似度情報８１５のうちシフト数９１３ｋは、符号化境界周波数９１１を基準点とし、符号化境界周波数９１１から低域区間Ｂ９１４の下限周波数までの距離とする。またゲインａは、高域区間Ａ９１５のスペクトル係数９０７と低域区間Ｂ９１４のスペクトル係数９０７との振幅比となる。第２の高域符号化部８０８では、得られた類似度情報８１５として、シフト数９１３ｋ、ゲインａ、および高域区間Ａ９１５と低域区間Ｂ９１４とのスペクトル係数のサンプルごとの差分信号を符号化する。
【００６６】
なお、高域区間Ａ９１５と低域区間Ｂ９１４との差分信号の符号化は省略することもでき、その場合にはシフト数９１３ｋおよびゲインａのみが符号化される。もしくは、高域区間Ａ９１５と低域区間Ｂ９１４との差分信号をノイズ成分で代表させることもでき、シフト数ｋおよびゲインａに加えてノイズ成分のゲインに関する情報を符号化すればよい。
【００６７】
また、ゲインａの表現方法としては、ゲインａを、▲１▼高域区間Ａ９１５のスペクトル係数９０７と低域区間Ｂ９１４のスペクトル係数９０７との振幅比として表す他に、第１の高域符号化部８０７における振幅情報と同様に、各高域符号化サブバンド９１０内の▲２▼平均振幅９１２、もしくは各高域符号化サブバンド９１０内の▲３▼平均エネルギー、▲４▼平均パワ等で表し、それを符号化しても良い。高域符号化サブバンド９１０ごとに符号化された差分信号の符号は、シフト数９１３ｋおよびゲインａなどの情報と多重化され、高域符号８１７として出力される。
【００６８】
なお、多重化の段階において数学的手法を用いて符号を圧縮し、高域符号８１７のビット量を削減することも可能である。例えば、符号間の差分情報を再符号化する方法、複数の符号をベクトル情報として再符号化する方法、符号をハフマン符号等の可変長符号に置き換える方法などが利用できる。
【００６９】
また、本例では、高域符号化サブバンド９１０のスペクトル係数９０７を求めるにあたって、高域信号２０９をＭＤＣＴ係数に変換した後、ＭＤＣＴ係数８１１上で帯域分割を行っているが、等価な構成として、高域信号２０９を、フィルタバンク等を用いて高域符号化サブバンド９１０の帯域に分割した後、それぞれの帯域の信号をＭＤＣＴ変換することによって高域符号化サブバンド９１０のＭＤＣＴ係数を求めてもよい。
【００７０】
また、符号化されるシフト数９１３ｋは、低域区間Ｂ９１４を特定するための情報であるので、周波数軸上のサンプル数を表す値である必要は無く、例えば、あらかじめ低域符号化バンド９０８に高域符号化サブバンド９１２と同じ周波数幅の複数の低域区間を設定しておき、そのうちから、どの低域区間が選択されたかを示す識別情報で表すようにしても良い。
【００７１】
さらに、本例では、帯域分割部８０２を時間−周波数変換部８０１の後段に備えたが、本発明はこれに限定されず、帯域分割部８０２を時間−周波数変換部８０１の前段に備えるようにしてもよい。
【００７２】
以上で説明したように、高域のＭＤＣＴ係数を複数の高域符号化サブバンドに分割し、高域符号化サブバンドごとに高域のＭＤＣＴ係数のノイズ性と、高域のＭＤＣＴ係数および低域のＭＤＣＴ係数間の類似度とを判定し、判定結果に基づいて最適な高域符号化部を用いて符号化を行うことにより、より少ないビット量の符号から高品質な再生信号を得ることができる。
【００７３】
次に本発明の実施の形態４における復号化装置の構成を、図９を参照しながら説明する。図９は、図７に示した符号化装置８００に対応する復号化装置において高域符号８１７を復号化する部分（高域復号化装置１０００）の構成を示すブロック図である。図９に示される高域復号化部分（高域復号化装置１０００）は、本発明の実施の形態１から実施の形態３における復号化装置４００および復号化装置７００において、高域符号を復号化する部分として、より望ましい形態を表すものである。
【００７４】
この高域復号化装置１０００は、時間−周波数変換部１００１、第１の高域復号化部１００３、第２の高域復号化部１００４、スペクトル合成部１００５および周波数−時間変換部１００６を備える。以下、高域復号化装置１０００においても、前記の符号化装置８００と同様にスペクトル係数としてＭＤＣＴ係数を使用する場合を例として説明する。図９において、高域復号化装置１０００に入力される情報は、高域符号１００７、切換情報１００８、低域復号化信号１００９である。
【００７５】
高域符号１００７は、例えば、図７に示した符号化装置８００の第１の高域符号化部８０７または第２の高域符号化部８０８によって出力された高域符号８１７である。また、切換情報１００８は、例えば、符号化装置８００の切換判定部８０５によって出力された切換情報８１６である。さらに、低域復号化信号１００９は、例えば、図３に示した復号化装置４００の低域復号化部４０２によって出力された低域信号４１０である。
【００７６】
時間−周波数変換部１００１は、入力された低域復号化信号１００９をＭＤＣＴ変換し、ＭＤＣＴ係数で表された低域係数１０１０を出力する。また、第１の高域復号化部１００３は、ノイズを用いて符号化された高域符号１００７を復号化するための復号化部であり、第２の高域復号化部１００４は、低域係数８１２と高域係数８１３との類似度を利用して符号化された高域符号１００７を復号化する復号化部である。
【００７７】
符号化装置８００において高域係数８１３はあらかじめ定められた高域サブバンドに分割されて符号化されているので、それに対応する復号化処理は各高域サブバンドに対応する符号に基づいて行われる。すなわち、高域符号１００７に含まれる符号を高域符号化サブバンドごとの符号に分割し、すべての高域符号化サブバンドについて、分割された符号に基づいて復号化処理を繰り返す。ここで、前記符号化装置８００側において、各高域符号化サブバンドに対応する符号が数学的手法で圧縮されている場合には、数学的手法による復号化も行われる。
【００７８】
切換情報１００８によって、高域符号１００７が第１の復号化部１００３に入力されることが示された場合、復号化の結果得られる高域係数１０１１はランダムなノイズ信号で表現される。したがって、高域符号１００７には信号強度を表すゲイン情報のみが含まれる。これに対応して、第１の高域復号化部１００３は内部でランダムなノイズ信号を生成し、復号化されたゲイン情報で示されるゲインを生成されたノイズ信号に乗じることによって帯域分割された高域係数１０１１を生成する。
【００７９】
これに対して、高域符号１００７が第２の高域復号化部１００４に入力されることが示された場合には、復号化結果として得られるべき高域係数１０１１は、シフト数に従って低域係数１０１０から抽出された特定の低域区間内の低域係数と、高域符号１００７から抽出された差分との和によって表現される。したがって、高域符号１００７には、低域係数を抽出する位置を示すシフト数ｋ、高域係数の例えば、平均振幅を示すゲインａ、および高域符号化サブバンド内の高域係数と最も類似する低域区間内の低域係数との差分を表す情報が含まれる。
【００８０】
高域係数はスペクトル係数の一種であるＭＤＣＴ係数として符号化されているため、高域係数を復号化するために入力された低域復号化信号１００９もＭＤＣＴ係数として表現されている必要がある。一般に、低域復号化信号１００９は時間信号として与えられるので、時間−周波数変換信号部１００１によってＭＤＣＴ係数に変換され、低域係数１０１０として第２の高域復号化部１００４に入力される。低域復号化信号１００９がＭＤＣＴ係数として与えられるならば、低域信号１００９は低域係数１０１０とみなすことができ、時間−周波数変換信号部１００１は省略できる。
【００８１】
第２の高域復号化部１００４は、低域係数１０１０のあらかじめ定められた基準点からシフト数ｋだけシフトした位置から、あらかじめ定められた数のＭＤＣＴ係数を抽出する。次いで、第２の高域復号化部１００４は、抽出した低域ＭＤＣＴ係数にゲインａを乗じ、差分を表すスペクトル係数を加算することによって、高域係数１０１１を表すＭＤＣＴ係数を生成する。なお、差分を表すスペクトル係数が省略されている場合は、抽出したＭＤＣＴ係数にゲインａを乗じたものが高域係数１０１１を表すＭＤＣＴ係数となる。
【００８２】
第１の高域復号化部１００３もしくは第２の高域復号化部１００４によって復号化され、高域符号化サブバンドの周波数幅に帯域分割された高域係数１０１１は、スペクトル合成部１００５によって合成され、高域符号化バンド内でサンプルが周波数順に並べられた高域係数１０１２として出力される。周波数‐時間変換部１００６は、高域係数１０１２を時間信号に変換し、高域信号１０１３を生成する。
【００８３】
なお、上記説明において、シフト数ｋが周波数軸上のサンプル数を示さず、あらかじめ定められた特定の低域区間を表す識別情報である場合は、識別情報に従って、あらかじめ定められた特定低域区間のＭＤＣＴ係数を抽出すればよい。
【００８４】
以上で説明したように、高域信号と低域信号に分割されて符号化されている入力符号から、高域信号を、高域信号と低域信号との類似度を利用して復号化することにより、より効率的に符号化された高域符号を利用することができるので、より少ないビット量の符号から高品質な再生信号を得ることができる。
（実施の形態５）
本発明の実施の形態５における符号化装置の構成を、図１０を参照しながら説明する。図１０は、実施の形態５に係る符号化装置１１００の構成を示すブロック図である。図１０に示される符号化装置１１００は、図７に示した符号化装置８００における類似度判定部８１８および高域符号化部８１９を備え、あらかじめ時間‐周波数変換された低域スペクトル係数を用いて低域部を符号化する。
【００８５】
低域符号化部が符号化するスペクトル係数は、フーリエ変換係数、コサイン変換係数など、公知のどのようなスペクトル係数を用いても良いが、ここでは例として、低域符号化部はＡＡＣ方式の符号化を行い、スペクトル係数はＭＤＣＴ係数を使用する。図１０において、符号化装置１１００は、時間−周波数変換部１１０１、帯域分割部１１０２、類似度判定部８１８（ノイズ性判定部８０３、予測誤差算出部８０４、切換判定部８０５）、高域符号化部８１９（第１の高域符号化部８０７、第２の高域符号化部８０８）、低域符号化部１１０９および符号多重化部１１１０を備える。時間−周波数変換部１１０１は、低域信号と高域信号とを含む入力信号２０７を一括してＭＤＣＴ変換し、ＭＤＣＴ係数１１１２を出力する。
【００８６】
低域符号化部１１０９、第１の高域符号化部８０７および第２の高域符号化部８０８はすべてＭＤＣＴ係数を符号化する符号化部であるので、時間−周波数変換部１１０１が入力信号２０７を一括してＭＤＣＴ係数１１１２に変換することにより、低域符号化部と高域符号化部とで別々にＭＤＣＴ変換する場合と比べて、必要とするＭＤＣＴ係数を一回の変換で求めることができるため、ＭＤＣＴ係数算出に係る処理量を削減することができる。
【００８７】
帯域分割部１１０２は、ＭＤＣＴ係数１１１２を低域係数１１１４と高域係数８１３とに分割する。これにおいて、帯域分割処理がＭＤＣＴ係数上で行われるため、帯域分割部１１０２における帯域分割フィルタなどの処理を省くことができ、帯域分割部１１０２の処理量を削減することができる。また高域係数８１３は、この段階において高域符号化サブバンドにまで帯域分割することが可能である。帯域分割された高域係数８１３の処理は、既に説明したように、図７に示した類似度判定部８１８および高域符号化部８１９によって行われ、切換情報８１６と高域符号８１７として出力される。
【００８８】
また低域符号化部１１０９は、低域係数１１１４をＡＡＣ方式で符号化し、低域符号１１１９として出力する。符号多重化部１１１０は、切換判定部８０５から出力された切換情報８１６と、高域符号化部８１９から出力された高域符号８１７と、低域符号化部１１０９から出力された低域符号１１１９とを多重化し、出力符号１１２０を生成する。
【００８９】
以上のように、符号化装置１１００をこのような構成とすることにより、より少ない処理量で高域係数および低域係数を符号化することができる。
次に、本発明の実施の形態５における復号化装置の構成を、図１１を参照しながら説明する。図１１は、図１０に示した符号化装置１１００の出力符号１１２０である入力符号１２０８を復号化する復号化装置１２００の構成を示すブロック図である。図１１に示される復号化装置１２００は、図９に示した高域復号化装置１０００を備え、低域復号化部はＭＤＣＴ係数として得られた低域スペクトル係数を復号化する。
【００９０】
低域復号化部が復号化するスペクトル係数としては、フーリエ変換係数、コサイン変換係数など、公知のどのようなスペクトル係数を用いても良いが、ここでは例えば、低域復号化部はＡＡＣ方式による復号化を行い、スペクトル係数はＭＤＣＴ係数を使用する。この復号化装置１２００は、高域復号化部１２０３（切換部１００２、第１の高域復号化部１００３、第２の高域復号化部１００４）、符号分割部１２０１、低域復号化部１２０２、スペクトル合成部１２０６および周波数‐時間変換部１２０７を備える。
【００９１】
復号化装置１２００において、図９に示した高域復号化装置１０００に対応する部分は、切換部１００２、第１の高域復号化部１００３、第２の高域復号化部１００４、低域復号化部１２０２、スペクトル合成部１２０６および周波数‐時間変換部１２０７であるが、低域復号化部１２０２は、低域ＭＤＣＴ係数が符号化された低域符号１２１０を復号化する点で、図９に示した時間−周波数変換部１００１と異なる。スペクトル合成部１２０６は、高域係数１０１１のみならず、スペクトル合成された高域係数１０１１にさらに低域係数１０１０を合成する点で、図９に示したスペクトル合成部１００５と異なる。また、復号化装置１２００の周波数‐時間変換部１２０７は、高域係数１０１２にさらに低域係数１０１０が合成されたＭＤＣＴ係数１２１４をＩＭＤＣＴする点で、時間軸上の低域復号化信号１００９をＭＤＣＴ変換する図９の周波数‐時間変換部１００６と異なる。
【００９２】
図１１の復号化装置１２００において、符号分割部１２０１は、入力符号１２０８から切換情報１００８と高域符号１００７と低域符号１２１０とを抽出する。これに対し、低域復号化部１２０２は、抽出された低域符号１２１０をＡＡＣ方式で復号化し、低域係数１０１０を出力する。入力符号１２０８から抽出された切換情報１００８と高域符号１００７とは共に、高域復号化部１２０３に入力される。高域復号化部１２０３内では、切換情報１００８にしたがって選択された第１の高域復号化部１００３または第２の高域復号化部１００４が高域符号１００７を復号化し、高域係数１０１１を出力する。
【００９３】
ここで、高域係数１０１１は、低域係数との類似度を用いて復号化されるため、低域復号化部の出力する低域復号化信号を高域係数１０１１と同じＭＤＣＴ係数に変換する必要がある。これに対し、この構成においては符号化装置１１００側において低域係数も高域係数も、ともにＭＤＣＴ係数が符号化されているので、低域係数１０１０がＭＤＣＴ係数として出力される。したがって、低域係数１０１０の変換処理が不要となり処理量が削減される。
【００９４】
スペクトル合成部１２０６は、低域復号化部１２０２から出力された低域係数１０１０と、高域復号化部１２０３から出力された高域係数１０１１とを周波数軸上で合成する。周波数−時間変換部１２０７は、スペクトル合成部１２０６によって合成されたＭＤＣＴ係数１２１４を一括して時間信号に変換し、出力信号１２１５を生成する。
【００９５】
以上のように、符号化装置側では、一括して時間−周波数変換された高域係数と低域係数とを用いて符号化し、復号化装置側では、このように符号化された符号を一括して周波数−時間変換する構成とすることにより、復号化装置側ではより少ない処理量で高域スペクトル係数および低域スペクトル係数を復号化し出力信号を得ることができる。
【００９６】
なお、上記構成の符号化装置１１００および復号化装置１２００においても、実施の形態３で示したように、符号化装置側にダウンサンプリング部、復号化装置側にアップサンプリング部を設け、低域符号化を入力信号のサンプリング周波数より低いサンプリング周波数において行うようにしてもよい。特に、本構成においては、ダウンサンプリング方法として、符号化装置１１００の帯域分割部１１０２によって分割された低域係数１１１４を、そのまま、ダウンサンプリング後のサンプリング周波数の信号を変換して得られた係数とみなして処理を行うことができる。
【００９７】
例として、サンプリング周波数４８ｋＨｚの入力信号を半分の２４ｋＨｚにダウンサンプリングする場合を示す。入力信号２０４８サンプルを変換して２０４８点のＭＤＣＴ係数を得たとすると、２０４８点のＭＤＣＴ係数は０から２４ｋＨｚの信号を表す係数である。このうち、低域係数として２０４８点のＭＤＣＴ係数の低域側１０２４点を取り出せば、低域係数は０から１２ｋＨｚの信号を表す係数となる。この低域係数１０２４点を、逆ＭＤＣＴ変換すれば１０２４サンプルの時間信号が得られることとなり、時間信号のサンプル数が半分となる、つまり、半分のサンプリング周波数にダウンサンプリングされたことになる。低域符号化部がＭＤＣＴ係数を用いる場合は、逆ＭＤＣＴ変換処理は不要であるので、ダウンサンプリングに係る処理を簡素化できる。アップサンプリングの場合も同様に、復号化された低域係数を、そのままアップサンプリング後のサンプリング周波数の信号を変換して得られた係数とみなして処理すればよい。（実施の形態６）
【００９８】
図１２は、実施の形態６に係る符号化装置１３００の構成を示すブロック図である。符号化装置１３００は、入力信号が微小時間に急峻な振幅変化をする信号であるか否かに応じて入力信号の変換ブロック長を切り換えて時間‐周波数変換を行う符号化装置であって、帯域分割部２０１、低域符号化部２０２、変換ブロック長選択部１３０１、時間−周波数変換部１３０２、帯域分割部１３０３、類似度判定部１３１２（ノイズ性判定部１３０４、予測誤差算出部１３０５、切換判定部１３０６）、切換部１３０７、高域符号化部１３１３（第１の高域符号化部１３０８、第２の高域符号化部１３０９）および符号多重化部１３２１を備える。
【００９９】
同図に示した高域符号化部１３１３および類似度判定部１３１２と、図７に示した符号化装置８００における高域符号化部８１９および類似度判定部８１８とは、変換ブロック長選択部１３０１の選択に応じ、異なるサンプル数の高域係数を単位として処理を行う点が異なるだけで、構成および処理の内容は同じである。また、高域符号１３２０も、変換ブロック長選択部１３０１の選択に応じて、異なる変換ブロック長で符号化されている点が高域符号８１７と異なるだけである。同じく、ノイズ性判定値１３１７、類似度情報１３１８および切換情報１３１９も、処理対象である高域係数１３１６の変換ブロック長が変換ブロック長選択部１３０１の選択に応じて変化する点だけが、図７に示したノイズ性判定値８１４、類似度情報８１５および切換情報８１６と異なる。図７に示した高域符号化部８１９および類似度判定部８１８の構成および動作についてはすでに説明しているので、以下では、新たな処理に関連のない場合は、高域符号化部１３１３および類似度判定部１３１２についての説明を省略する。
【０１００】
図１２において、変換ブロック長選択部１３０１は、入力信号２０７の特性に応じて、時間−周波数変換部１３０２において用いる変換ブロック長を決定する。変換ブロック長選択部１３０１は、例えば、微小時間Δｔ内における入力信号２０７の平均信号レベルの変化量を測定し、測定された変化量が所定の値を超える場合には短い変換ブロック長を選択する。また、測定された変化量が所定の値以下である場合には長い変換ブロック長を選択する。決定される変換ブロック長は任意の長さでも良いし、あらかじめ定められた複数の選択肢から選択するようにしても良い。以下の説明では、２種類の変換ブロック長を切り換えて使用することとし、長い変換ブロック長として１０２４サンプル、短い変換ブロック長として１２８サンプルを用い、それぞれを基本変換ブロックおよび変換サブブロックと呼ぶこととする。この構成によれば、１個の基本変換ブロックは８個の変換サブブロックに分割することができることになる。変換ブロック長選択部１３０１は、選択した変換ブロック長を表すブロック長情報１３１０を生成する。
【０１０１】
時間−周波数変換部１３０２は、変換ブロック長選択部１３０１によって生成されたブロック長情報１３１０にしたがって、帯域分割部２０１で帯域分割された低域信号２０８と高域信号２０９とを変換し、それぞれ低域係数１３１５、高域係数１３１４を出力する。時間−周波数変換部１３０２が使用する時間−周波数変換としては、フーリエ変換、コサイン変換、フィルタバンクなど、公知のいかなる変換を用いても良いが、以下の説明ではＭＤＣＴを用いるものとする。帯域分割部１３０３は、ＭＤＣＴ係数で表される高域係数１３１４をさらに複数の高域符号化サブバンドに分割した後、ノイズ性判定部１３０４、予測誤差算出部１３０５および切換部１３０７の各部に出力する。
【０１０２】
ここで、帯域分割された高域係数１３１６と、以降の符号化処理との関係を図面を参照しながら説明する。図１３は、図１２に示した変換ブロック長選択部１３０１において基本変換ブロック１４０１が選択された場合の高域係数１３１６と低域係数１３１５とを示す図であり、図１４は、図１２に示した変換ブロック長選択部１３０１において変換サブブロック１５０６が選択された場合の高域係数１３１６と低域係数１３１５とを示す図である。
【０１０３】
図１３、図１４共に、時間信号として、例えば、１フレーム１０２４サンプルの信号が入力された際の高域係数１３１６と低域係数１３１５とを表している。図１３に示すように、図１２の変換ブロック長選択部１３０１によって基本変換ブロック１４０１が選択されたときには、時間−周波数変換部１３０２は、１フレーム１０２４サンプルであらわされている高域信号２０９に対して、１０２４サンプルの変換ブロック長による変換を１回行うだけである。低域信号２０８についても同様である。
【０１０４】
したがって、高域符号化バンド１４０２内の高域係数１３１４と低域符号化バンド１４０３内の低域係数１３１５とは、時間軸方向については１回の変換で求められる１組の係数からなる。すなわち、この係数は、この１フレームのサンプリング時間内での時間変化には依存しない係数とみることができる。しかし、周波数軸方向については、高域符号化バンド１４０２内の高域係数１３１４はさらに、帯域分割部１３０３によって複数の高域符号化サブバンド１４０４に分割され、複数組の高域符号化サブバンド１４０４からなる高域係数１３１６となる。
【０１０５】
各基本変換ブロック１４０１に含まれる高域係数１３１４は、高域符号化バンド１４０２の周波数帯域にあたる係数のみが値を持ち、残りの係数は０である。各基本変換ブロック１４０１に含まれる低域係数１３１５も同様に、低域符号化バンド１４０３の周波数帯域にあたる係数のみが値を持ち、残りの係数は０である。図１３では、値を持つ低域係数と高域係数のみを同一平面上に並べている。なお、各高域係数１３１６に対する以降の符号化処理は、周波数軸方向の高域符号化サブバンド１４０４ごとに行われるので、基本変換ブロック１４０１使用時の符号化処理の単位は、図１３において斜線で示される各高域符号化サブバンド１４０４内の領域１４０５（以下、「符号化セル」と呼ぶ）となる。
【０１０６】
これに対して、図１４に示すように、変換ブロック長選択部１３０１によって変換サブブロック１５０６が選択されたときには、時間−周波数変換部１３０２は、１フレーム１５０１内において１０２４サンプルで表される高域信号２０９に対し、１２８サンプルの変換ブロック長を単位とする変換を、１０２４／１２８＝８回行う。低域符号化バンド１５０３内の低域信号２０８についても同様である。したがって、時間−周波数変換部１３０２による変換の結果得られた高域係数１３１４と低域係数１３１５とは、いずれも時間軸方向について８組の変換サブブロック１５０６からなる係数となる。
【０１０７】
各変換サブブロック１５０６に含まれる高域係数１３１４は、それぞれ、１２８サンプルであり、高域符号化バンド１５０２の周波数帯域にあたる係数のみが値を持ち、残りの係数は０である。各変換サブブロック１５０６に含まれる低域係数１３１５も同様に、それぞれ、１２８サンプルであり、低域符号化バンド１５０３の周波数帯域にあたる係数のみが値を持ち、残りの係数は０である。図１５では、値を持つ低域係数と高域係数のみを同一平面上に並べている。高域符号化バンド１５０２内の高域係数１３１４は、帯域分割部１３０３によって周波数軸方向に、さらに複数の高域符号化サブバンド１５０４に分割される。よって、以降の符号化処理の単位は、図１４において斜線で示される符号化セル１５０５となる。
【０１０８】
図１３および図１４から明らかなように、変換ブロック長として基本変換ブロック１４０１を用いるよりも変換サブブロック１５０６を用いた方が、符号化セルが、時間軸方向により細かく分割されるため、時間軸方向により精度の高い符号化が可能となる。ただし、周波数軸方向への分割数が同じであれば、単位時間当たりの符号化セルの数は増加することになるため、必然的に符号化に必要なビット量が増加する。これを防ぐためには、図１４に示す変換サブブロック１５０６使用時には、周波数軸方向への分割数つまり高域符号化サブバンド１５０４の数を、図１３に示した基本変換ブロック１４０１使用時における高域符号化サブバンド１４０４の数よりも減少させれば良い。
【０１０９】
結果として、基本変換ブロック１４０１使用時には、周波数軸方向に細かく分割された符号化セル１４０５が使用され、変換サブブロック１５０６使用時には、時間軸方向に細かく分割された符号化セル１５０５が使用されることになる。このように符号化装置１３００は、入力信号２０７の特性に応じて、符号化セル１４０５と符号化セル１５０５とを切り換えて使用することにより、より効率的に入力信号２０７の符号化を行うことが可能となる。
【０１１０】
符号化装置１３００の各処理部が各符号化セルに対して行う処理は基本変換ブロック１４０１の場合でも、変換サブブロック１５０６の場合でも同様である。したがって、１フレームにおける高域係数１３１６の符号化を完了するためには、すべての符号化セルについて類似度判定部１３１２と高域符号化部１３１３との符号化処理を繰り返せばよい。
【０１１１】
符号多重化部１３２１は、変換ブロック長選択部１３０１によって生成されたブロック長情報１３１０、切換判定部１３０６からの切換情報１３１９、低域符号化部２０２からの低域符号２１３および高域符号化部１３１３からの高域符号１３２０を多重化し、出力符号１３１１を生成する。
【０１１２】
なお、本実施の形態では、高域符号化サブバンドのＭＤＣＴ係数を求めるにあたって、高域信号２０９をＭＤＣＴ係数に変換した後、ＭＤＣＴ係数上で帯域分割を行っているが、等価な構成として、高域信号２０９をフィルタバンク等を用いてあらかじめ高域符号化サブバンドの帯域に分割した後、それぞれの帯域の信号をＭＤＣＴ変換することによって高域符号化サブバンドのＭＤＣＴ係数を求めてもよい。
【０１１３】
以上で説明したように、高域のＭＤＣＴ係数を複数の符号化セルに分割し、符号化セルごとにノイズ性と、低域のＭＤＣＴ係数との類似度とを判定し、判定結果に基づいて最適な高域符号化部を用いて符号化を行うことにより、より少ないビット量の符号から入力信号２０７の特性に応じた高品質な再生信号を得ることができるという効果がある。
【０１１４】
次に、本実施の形態６における復号化装置について、図１５を参照しながら説明する。図１５は、図１２に示した符号化装置１３００によって出力された出力符号１３１１を復号化する復号化装置１６００の構成を示すブロック図である。以下では、前記の符号化装置１３００の場合と同様、周波数スペクトル係数としてＭＤＣＴ係数を使用する場合を例として説明する。
【０１１５】
図１５において、復号化装置１６００は、基本変換ブロックおよび変換サブブロックのいずれかのブロック長で符号化されている高域符号を、入力符号に含まれているブロック長情報に従って復号化する復号化装置であって、符号分割部１６０１、低域復号化部１６０２、時間−周波数変換部１６０３、切換部１６０４、第１の高域復号化部１６０５、第２の高域復号化部１６０６、スペクトル合成部１６０７および周波数‐時間変換部１６０８を備える。符号分割部１６０１は、入力符号１６０９からブロック長情報１６１０、切換情報１６１１、低域符号１６１２および高域符号１６１３を抽出する。
【０１１６】
低域復号化部１６０２は、抽出された低域符号１６１２を復号化し、時間信号である低域信号１６１４を出力する。時間−周波数変換部１６０３は、ブロック長情報１６１０で示される変換ブロック長で、低域信号１６１４を変換し、低い周波数成分を表すＭＤＣＴ係数である低域係数１６１５を算出する。
【０１１７】
切換部１６０４は、抽出された切換情報１６１１に従って、第１の高域復号化部１６０５と第２の高域復号化部１６０６とを切り換え、選択された高域復号化部に対してブロック長情報１６１０と高域符号１６１３とを入力する。ここで、第１の高域復号化部１６０５は、ノイズを用いて符号化された符号を復号化するための復号部、第２の高域復号部１６０６は低域係数１６１５と高域係数との類似度を利用して符号化された符号を復号化する復号部である。高域係数は、変換ブロック長と高域符号化サブバンドにしたがって、あらかじめ定められた符号化セルに分割されて符号化されているので、復号化処理は各符号化セルに対応する符号に対して行われる。
【０１１８】
すなわち、高域符号１６１３に含まれる符号を符号化セルごとの符号に分割し、すべての符号化セルについて、分割された符号に対する復号化処理を繰り返す。ここで、前記符号化装置１３００側において、各符号化セルに対応する符号が数学的手法で圧縮されている場合には、数学的手法による復号化も行われる。
【０１１９】
切換情報１６１１によって、高域符号１６１３が第１の高域復号化部１６０５に入力されることが示された場合、復号化される高域係数はランダムなノイズ信号で表現され、高域符号１６１３には信号強度を表すゲイン情報のみが含まれる。したがって、第１の高域復号化部１６０５は、内部でランダムなノイズ信号を生成し、復号したゲインを乗じることによって帯域分割された高域係数１６１６を生成する。
【０１２０】
これに対して、高域符号１６１３が第２の高域復号化部１６０６に入力されることが示された場合には、帯域分割された高域係数１６１６は、低域係数１６１５から抽出された係数と、高域符号１６１３内に符号化されている高域係数および低域係数間の差分との和によって表現される。高域符号１６１３には、低域係数を抽出するための位置を示すシフト数ｋ、ゲインａおよび差分を表す情報が含まれる。第２の高域復号化部１６０６は、入力された低域係数１６１５について、あらかじめ定められた基準点からシフト数ｋだけシフトした位置から、あらかじめ定められた数のＭＤＣＴ係数を抽出する。抽出されるＭＤＣＴ係数のサンプル数は、ブロック長によって異なる。
【０１２１】
第２の高域復号化部１６０６は、抽出したＭＤＣＴ係数にゲインａを乗じ、復号化した差分情報を表すスペクトル係数を加算することによって、高域係数を表すＭＤＣＴ係数を生成する。なお、差分を表す情報が省略されている場合は、抽出したＭＤＣＴ係数にゲインａを乗じたものが高域係数を表すＭＤＣＴ係数となる。また、差分を表す情報がノイズ成分で代表されている場合には、抽出したＭＤＣＴ係数にゲインａを乗じたものに、復号したゲインを乗じたノイズ成分を加算したものが高域係数を表すＭＤＣＴ係数となる。
【０１２２】
スペクトル合成部１６０７は、第１の高域復号化部１６０５もしくは第２の高域復号化部１６０６によって復号化された、高域符号化サブバンドに帯域分割された高域係数１６１６および低域係数１６１５を合成し、合成スペクトル係数１６１７として出力する。周波数‐時間変換部１６０８は、合成スペクトル係数１６１７の高域部については、ブロック長情報１６１０によって示される変換ブロック長で時間信号に変換し、出力信号１６１８を生成する。
【０１２３】
なお、第２の高域復号化部１６０６において、シフト数ｋが周波数軸上のサンプル数を示さず、あらかじめ定められた特定の低域区間を表す識別情報を示している場合は、その識別情報に従って、あらかじめ定められた特定区間のＭＤＣＴ係数を抽出すればよい。
【０１２４】
以上で説明したように、実施の形態６では、符号化装置１３００において、入力信号２０７の特性に応じて、選択的に高域信号２０９の変換ブロック長を切り換えて符号化し、復号化装置１６００において、このように符号化された入力符号１６０９をブロック長情報１６１０に基づいて復号化するので、実施の形態１から実施の形態５における効果に加えて、復号化装置１６００では、原音の時間変化に対する追従性が向上された、より品質の高い再生信号を得ることができるという効果がある。
【０１２５】
（実施の形態７）
本発明の実施の形態７における符号化装置の構成を図１６を参照しながら説明する。図１６は、実施の形態７に係る符号化装置１７００の構成を示すブロック図である。図１６の符号化装置１７００は、図１２に示した符号化装置１３００に対して、低域符号化部が、高域符号化部と同一形式の変換係数を用いて符号化を行い、かつ、低域符号化部においても、変換ブロック長選択部によって決定された変換ブロック長にしたがって符号化処理を行う点が異なる。
【０１２６】
この符号化装置１７００は、変換ブロック長選択部１３０１、切換部１３０７、時間−周波数変換部１７０１、帯域分割部１７０２、低域符号化部１７０３、類似度判定部１３１２（ノイズ性判定部１３０４、予測誤差算出部１３０５、切換判定部１３０６）、高域符号化部１３１３（第１の高域符号化部１３０８、第２の高域符号化部１３０９）および符号多重化部１７０４を備える。なお、同図において、図１２に示した符号化装置１３００と同様の構成要素についてはすでに説明しているので、同一の参照符号を付し、説明を省略する。
【０１２７】
図１６において、変換ブロック長選択部１３０１は、入力信号２０７の特性に応じて変換ブロック長を決定する。時間−周波数変換部１７０１は、変換ブロック長選択部１３０１からのブロック長情報１３１０にしたがって、低域および高域からなる全帯域の入力信号２０７を一括して時間‐周波数変換し、全域係数１７０５を出力する。時間−周波数変換部１７０１は、公知のどのような変換方法を用いても良いが、ここではＭＤＣＴを用いる例を説明する。
【０１２８】
帯域分割部１７０２は、基本変換ブロックおよび変換サブブロックのいずれかのブロック長のＭＤＣＴ係数で表された全域係数１７０５を低域および高域の各帯域に分割し、高域についてはさらに複数の高域符号化サブバンドに分割して、低域係数１７０６と高域係数１３１６とを出力する。低域符号化部１７０３は、可変変換ブロック長のＭＤＣＴ係数であらわされた低域係数１７０６を符号化して、低域符号１７０７を出力する。この低域符号化部１７０３としては、可変変換ブロック長に対応できる公知のどのような符号化を用いても良いが、ここではそのような符号化部の一例として、ＭＰＥＧ−４　ＡＡＣ方式を使用する。ＭＰＥＧ−４　ＡＡＣ方式では、１０２４サンプルと１２８サンプルとの２種類の変換ブロック長に対応しており、必要に応じて切り換えて使用することが可能である。
【０１２９】
ここで、図１２に示した符号化装置１３００のように、高域符号化部で用いる変換ブロック長と、低域符号化部で用いる変換ブロック長とが異なっている場合、それぞれの符号化において時間−周波数変換を行う必要がある。そこで、高域符号化部で用いる変換ブロック長についても、１０２４サンプルと１２８サンプルの２種類の変換ブロック長から選択するようにすれば、低域符号化と高域符号化の変換ブロック長は一致し、両者の時間−周波数変換処理は同一の処理となる。すなわち、低域符号化部と高域符号化部とにおいて時間−周波数変換処理を共有化することができる。符号多重化部１７０４は、変換ブロック長選択部１３０１からのブロック長情報１３１０、切換判定部１３０６からの切換情報１３１９、高域符号化部１３１３からの高域符号１３２０、低域符号化部１７０３からの低域符号１７０７を多重化して、出力符号１７０８を出力する。
【０１３０】
以下では、上記のように構成された符号化装置１７００の動作について、データの流れに沿って説明する。時間−周波数変換部１７０１は、ブロック長情報１３１０に従って入力信号２０７を一括してＭＤＣＴ係数に変換する。変換された全帯域の周波数成分を含んだ全域係数１７０５は、帯域分割部１７０２に入力され、低域係数１７０６とさらに複数の帯域に分割された高域係数１３１６として出力される。
【０１３１】
低域係数１７０６は低域符号化部１７０３において符号化され、帯域分割された高域係数１３１６は、第１の高域符号化部１３０８もしくは第２の高域符号化部１３０９において符号化される。
【０１３２】
ブロック長情報１３１０、切換情報１３１９、高域符号化部１３１３による高域係数１３１６の符号化処理の結果、得られた高域符号１３２０および低域符号１７０７は、符号多重化部１７０４において多重化され、出力符号１７０８が生成される。
【０１３３】
符号化装置１７００は、上記のような構成を備えることにより、低域符号化と高域符号化とにおける時間−周波数変換処理を共有化できると共に、帯域分割部１７０２における帯域分割処理を周波数領域の係数、たとえばＭＤＣＴ係数上で行うことが可能となるため帯域分割処理に必要な演算量を削減することができる。
【０１３４】
次に、本発明の実施の形態７の復号化装置について、図１７を参照しながら説明する。図１７は、図１６に示した符号化装置１７００によって符号化された出力符号１７０８を復号化する復号化装置１８００の構成を示すブロック図である。図１７に表される復号化装置１８００は、実施の形態６の復号化装置１６００に対して、低域復号化部が、高域と同一形式の変換係数を用いて低域の復号化を行い、かつ、変換ブロック長選択部から受取ったブロック長情報で示される変換ブロック長にしたがって低域の復号化処理を行う点が異なる。
【０１３５】
この復号化装置１８００は、符号分割部１６０１、切換部１６０４、第１の高域復号化部１６０５、第２の高域復号化部１６０６、スペクトル合成部１６０７、周波数‐時間変換部１６０８および低域復号化部１８０１を備える。なお、同図において、図１５に示した復号化装置１６００と同様の構成要素については、すでに説明しているので、同一の参照符号を付し、説明を省略する。以下、本復号化装置１８００についても、前記の符号化装置１７００と同様に周波数スペクトル係数としてＭＤＣＴ係数、低域復号化方法としてＡＡＣ方式を使用する場合を例として説明する。低域復号化部１８０１は、入力符号１８０２から分離された低域符号１８０３を、ブロック長情報１６１０に従って復号化し、低域係数１６１５を出力する。
【０１３６】
以下では、上記のように構成された復号化装置１８００の動作について説明する。図１７の復号化装置１８００において、入力符号１８０２は、符号分割部１６０１によって、ブロック長情報１６１０、切換情報１６１１、低域符号１８０３および高域符号１６１３に分割される。それらのうち、低域符号１８０３は、ブロック長情報１６１０で示される変換ブロック長にしたがって、低域復号化部１８０１によって復号化される。復号化された低域符号１８０３は、ＡＡＣ方式の符号化パラメータであるＭＤＣＴ係数で表されており、低域係数１６１５として出力される。
【０１３７】
一方、入力符号１８０２から分離された切換情報１６１１によって切換部１６０４が切り換えられ、第１の高域復号化部１６０５もしくは第２の高域復号化部１６０６のいずれかが選択される。選択された高域復号化部によって高域符号１６１３が復号化され、高域係数１６１６が出力される。第１の高域復号化部１６０５と第２の高域復号化部１６０６との動作については、図１５に示した実施の形態７の復号化装置１６００と同じであるので、説明を省略する。それぞれ復号化された低域係数１６１５と高域係数１６１６とは、同じ変換ブロック長に基づくＭＤＣＴ係数であるので、両者の合成は単純な加算処理で実現することができる。スペクトル合成部１６０７は、低域係数１６１５と高域係数１６１６とを加算し、合成スペクトル係数１６１７を算出する。周波数−時間変換部１６０８は、合成スペクトル係数１６１７を逆ＭＤＣＴ変換して、出力信号１６１８を出力する。
【０１３８】
本実施の形態によれば、復号化装置１８００がこのような構成をとることにより、低域復号化と高域復号化の周波数−時間変換処理を共有化できるので、復号化装置１６００で必要であった低域信号を低域係数に変換するための時間−周波数変換部を省略することができ、復号化装置１８００をよりコンパクトに実装することができるという効果がある。さらに、本実施の形態によれば、スペクトル合成部における合成処理を周波数領域の係数、たとえばＭＤＣＴ係数上で行うことが可能となるため、合成処理に必要な演算量を削減することができるという効果がある。
【０１３９】
（実施の形態８）
本発明の実施の形態８における符号化装置の構成を図１８を参照しながら説明する。図１８は、実施の形態８に係る符号化装置１９００の構成を示すブロック図である。図１８に示す符号化装置１９００は、図１６に示した符号化装置１７００に対し、低域部の符号化に、ブロックグルーピングと呼ばれる符号化方法に対応したＭＰＥＧ−４　ＡＡＣ方式等の方式を採用し、低域符号化部から与えられるグルーピング情報を利用して高域符号化を行うようにした点が異なる。
【０１４０】
この符号化装置１９００は、変換ブロック長選択部１３０１、類似度判定部１３１２（ノイズ性判定部１３０４、予測誤差算出部１３０５、切換判定部１３０６）、時間−周波数変換部１７０１、帯域分割部１７０２、低域符号化部１９０１、高域符号化部１９０４（第１の高域符号化部１９０２、第２の高域符号化部１９０３）および符号多重化部１９０５を備える。なお、同図において、図１２または図１６に示した符号化装置１３００または符号化装置１７００の構成要素と同様の構成要素については、すでに説明しているので、同一の参照符号を付し、説明を省略する。
【０１４１】
ブロックグルーピングとは、互いにスペクトルが類似する変換サブブロックからなるグループにおいて符号化パラメータを共有することによって、出力符号に含まれる符号化パラメータの数を削減する方法である。以下に、ＡＡＣ方式で用いられるブロックグルーピングについて、図１９を参照しながら、より詳細に説明する。図１９（ａ）は、図１８に示した符号化装置１９００の時間−周波数変換部１７０１からブロック長情報１３１０にしたがって出力される全域係数１７０５の一例を示す図である。図１９（ｂ）は、図１８に示した符号化装置１９００の高域符号化部１９０４に入力される高域係数１３１６の一例を示す図である。図１９（ｃ）は、図１８に示した符号化装置１９００の低域符号化部１９０１に入力される低域係数１７０６の一例を示す図である。
【０１４２】
ＡＡＣ方式は１０２４サンプル（ロングブロック＝基本変換ブロック）および１２８サンプル（ショートブロック＝変換サブブロック）の２種類の変換ブロック長に対応しているが、ショートブロック時には複数の隣接するショートブロック内のＭＤＣＴ係数を比較し、それらの類似性が高い場合には一部の符号化パラメータを共有化することによって符号化パラメータの数を減少させ、入力信号の符号化に必要なビット数を削減することができる構成となっている。
【０１４３】
図１８に示した時間−周波数変換部１７０１は、変換ブロック長選択部１３０１からのブロック長情報１３１０が、低域および高域それぞれにつき１２８サンプルからなるショートブロックを示している場合、例えば、入力信号２０７の１２８サンプルごとにＭＤＣＴ変換を行い、図１９（ａ）に示すような全域係数１７０５を出力する。このように出力された全域係数１７０５は、１つの基本変換ブロック（１０２４サンプル）につき、８個のショートブロックから構成されており、各ショートブロックのＭＤＣＴ係数は、サンプリング周波数によって定まる低域から高域までの周波数帯域を表している。このような全域係数１７０５は、帯域分割部１７０２によって低域係数１７０６と、高域係数１３１６とに分割され、それぞれ低域符号化部１９０１と高域符号化部１９０４とに出力される。
【０１４４】
低域符号化部１９０１では、隣接しあうショートブロックのＭＤＣＴ係数を比較し、ショートブロック間の類似度が高い場合には、図１９（ｃ）に示すように、該当する２つ以上のショートブロックを１つのグループに分類する。１つのグループに属するショートブロックでは、例えば、ショートブロック内の符号化単位ごとに算出される正規化係数などの符号化パラメータが共有される。このとき、符号化パラメータを共有する隣接ショートブロックの組をショートブロックグループと呼び、１０２４サンプル長のフレーム内に含まれる８個のショートブロックがどのようにグループ分けされるかの情報は、グルーピング情報１９１１として生成され、高域符号化部１９０４に出力される。
【０１４５】
例えば、図１９（ｃ）に示すように、低域符号化部１９０１において、ショートブロック１とショートブロック２とがグルーピングされている場合、それを示すグルーピング情報１９１１が出力される。このグルーピング情報１９１１は、低域符号化部１９０１によって低域係数１７０６とともに符号化される。高域符号化部１９０４では、帯域分割部１７０２から入力される高域係数１３１６は、低域係数１７０６よりさらに複数の帯域に分割されており、高域符号化部１９０４は、この帯域ごとに高域係数１３１６を符号化する。このとき、高域符号化部１９０４は、グルーピング情報１９１１から、低域のショートブロック１とショートブロック２とがグルーピングされている場合には、高域において対応する変換サブブロック１と変換サブブロック２との間で符号化パラメータを共有させて符号化する。
【０１４６】
ここで、符号化装置１９００の構成では、高域部における変換サブブロックと低域部におけるＡＡＣ方式のショートブロックとの長さはいずれも１２８サンプルで同一であり、かつ、１２８サンプルの変換ブロック長で変換された高域ＭＤＣＴ係数と１２８サンプルのショートブロックで変換された低域ＭＤＣＴ係数とのそれぞれが同じ信号の周波数高域成分および低域成分を保持している。したがって、両者に含まれるＭＤＣＴ係数同士は密接に関連しており、例えば、ある低域側の隣接するショートブロック同士が類似していれば、対応する高域側のサブブロック同士も類似である可能性が高い。したがって、ＡＡＣ方式による低域符号化部から出力されるグルーピング情報１９１１を高域符号化側でも利用し、類似するサブブロック間の符号化パラメータを共有化すれば、符号化パラメータの数を減少させ、符号化に必要なビット数を削減することができる。
【０１４７】
図１８の符号化装置１９００において、低域符号化部１９０１から出力されたグルーピング情報１９１１は、切換部１３０７を介して第１の高域符号化部１９０２もしくは第２の高域符号化部１９０３に入力される。切換部１３０７によって選択された高域符号化部では、前記グルーピング情報１９１１に基づいて、共有されるパラメータを、グループ内の１つの変換サブブロックについてのみ符号化し、残りの変換サブブロックについては符号化を省略する。ここで、グルーピング情報１９１１に基づくパラメータの具体的な共有方法について説明する。
【０１４８】
図２０は、図１８に示した符号化装置１９００の低域符号化部１９０１においてグルーピングが行われる場合の高域係数１３１６と低域係数１７０６との関係を示す図である。図２０に示される様に、変換ブロック長選択部１３０１において変換ブロック長として変換サブブロック２１０６が選択された場合、１０２４サンプルの基本変換ブロック２１０１の入力信号２０７に対して、１２８サンプル単位の変換ブロック長で変換された８組のＭＤＣＴ係数（全帯域係数１７０５）が得られる。
【０１４９】
この全帯域係数１７０５は、帯域分割部１７０２によって、低域符号化バンド２１０３に属する低域係数１７０６と高域符号化バンド２１０２に属する高域係数１３１６とに分割される。低域符号化部１９０１は、低域係数１７０６の隣接ショートブロック間の類似度を調べ、互いに類似度の高いショートブロックをグルーピングする。この際に、どのショートブロックとどのショートブロックとをグルーピングしたかを示すグルーピング情報１９１１を生成し、高域符号化部１９０４に出力する。例えば、低域符号化部１９０１は、互いに隣接するショートブロック２１０８とショートブロック２１０９とをグルーピングしたことを示すグルーピング情報１９１１を生成したとする。高域符号化部１９０４は、低域符号化部１９０１から与えられるグルーピング情報１９１１によって、互いに類似する隣接変換サブブロック２１０６の組が示されるので、それにしたがって８組の変換サブブロック２１０６を複数の変換サブブロックグループに分類する。
【０１５０】
例えば、高域符号化バンド２１０２において、８組の変換サブブロック２１０６は、変換サブブロックグループ２１１０、変換サブブロックグループ２１１１および変換サブブロックグループ２１１２に分類する。例えば、変換サブブロックグループ２１１０は、１つの変換サブブロック２１０６から構成され、変換サブブロックグループ２１１１は、図示しない５つの変換サブブロック２１０６から構成される。変換サブブロックグループ２１１２は、低域符号化バンド２１０３におけるショートブロック２１０８とショートブロック２１０９とに対応する、７番目と８番目との２つの変換サブブロック２１０６から構成される。このような変換サブブロックグループ内で符号化パラメータを共有するためには、元々高域符号化サブバンド２１０４ごとの変換サブブロック２１０６ごとに設定されていた符号化セルを、図２０の符号化セル２１０５で示されるように、各変換サブブロックグループに対して設定すればよい。
【０１５１】
この例では、符号化セル２１０５に含まれる２組の高域ＭＤＣＴ係数は、１組の符号化パラメータで代表され符号化される。例えば、第２の高域符号化部１９０３が符号化セル２１０５に含まれる高域係数１３１６を符号化する場合、この２組の変換サブブロック２１０６内のＭＤＣＴ係数は、同一のシフト数ｋとゲインａで表され符号化される。これに対し、サブブロックグルーピングを行わない場合には、第２の高域符号化部１９０３は、２組の変換サブブロック２１０６内のＭＤＣＴ係数それぞれに対して、独立にシフト数ｋとゲインａとを符号化しなければならない。このように、例えば、２つの変換サブブロック２１０６を１つの変換サブブロックグループ２１１２にグルーピングすることにより、符号化パラメータの数を半分に低減することができる。
【０１５２】
なお、グルーピングされる変換サブブロック２１０６の数に制限は無く、さらに多くの変換サブブロック２１０６をグループ化すれば、さらに符号化パラメータの数を減少させることができる。結果として、このようなグルーピングを行うことによって、全帯域の符号化パラメータの符号化に必要なビット数を削減することができる。符号多重化部１９０５では、低域符号化部１９０１からの▲１▼低域符号１９１０、切換判定部１３０６からの▲２▼切換情報１３１９、変換ブロック長選択部１３０１からの▲３▼ブロック長情報１３１０および高域符号化部１９０４からの▲４▼高域符号１９１２を多重化して、出力符号１９１３を生成する。
【０１５３】
なお、符号化パラメータ共有化のためのグループ化は、時間軸方向の複数の変換サブブロックに対して行うのと同様に、周波数軸方向の複数の高域符号化サブバンドに対して行うことも可能である。
【０１５４】
なお、上記の例では、低域符号化部１９０１から得られるグルーピング情報１９１１（第１のグルーピング情報とする）にしたがって変換サブブロック２１０６のグループ化を行ったが、低域側のグルーピング情報１９１１を用いずに高域側で独自にグルーピングを行い、そのグルーピング結果を示す第２のグルーピング情報を生成して高域符号化部における変換サブブロック２１０６のグループ化を行うように構成しても良い。
【０１５５】
図２１は、実施の形態８に係る他の符号化装置２２００の構成を示すブロック図である。符号化装置２２００は、高域の変換サブブロック２１０６ごとに独自にグルーピングを行う符号化装置であって、変換ブロック長選択部１３０１、類似度判定部１３１２（ノイズ性判定部１３０４、予測誤差算出部１３０５、切換判定部１３０６）、時間−周波数変換部１７０１、帯域分割部１７０２、低域符号化部１７０３、高域符号化部１９０４（第１の高域符号化部１９０２、第２の高域符号化部１９０３）、グルーピング情報生成部２２０１および符号多重化部２２０２を備える。
【０１５６】
グルーピング情報生成部２２０１は、帯域分割部１７０２によって帯域分割された高域係数１３１６における変換サブブロック２１０６の類似度を調べ、互いに類似度の高い変換サブブロック２１０６同士をグルーピングする。さらに、グルーピング結果を表す第２のグルーピング情報２２１０を生成し、高域符号化部１９０４および符号多重化部２２０２に出力する。
【０１５７】
切換情報１３１９によって選択された高域符号化部１９０４は、入力された第２のグルーピング情報２２１０にしたがって高域部の符号化セル２１０５を決定し、符号化セル２１０５内の各変換サブブロックに符号化パラメータを共有させた高域符号２２１１を生成する。符号多重化部２２０２は、▲１▼グルーピング情報生成部２２０１からの第２のグルーピング情報２２１０、▲２▼高域符号化部１９０４からの高域符号２２１１、▲３▼低域符号化部１７０３からの低域符号１７０７、▲４▼切換判定部１３０６からの切換情報１３１９および▲５▼変換ブロック長選択部１３０１からのブロック長情報１３１０を多重化して、出力符号２２１２を生成する。
【０１５８】
図１８に示した低域符号化部１９０１から得られる第１のグルーピング情報１９１１は、低域符号化で使用される低域係数１７０６をもとに生成されているため、必ずしも高域係数１３１６のクループ化に適しているとは限らない。これに対して、高域係数１３１６をもとに高域側独自にグルーピング情報２２１０を生成すれば、高域係数１３１６の符号化により適したグルーピングを行うことが可能となり、高域部における符号化の品質を向上させることができる。ただし、この場合には高域側で独自に生成した第２のグルーピング情報２２１０も符号化し、出力符号２２１２に多重化する必要がある。
【０１５９】
なお、符号化装置２２００の構成では、低域符号化部１７０３においてグルーピングを行わないが、符号化装置２２００において低域符号化部１７０３の代わりに、図１８に示した符号化装置１９００における低域符号化部１９０１を備え、低域符号化部１９０１においてもショートブロックのグルーピングを行うように構成してもよい。
【０１６０】
また、図２１に示した符号化装置２２００では、低域と高域とを別個にグルーピングしたが、低域符号化部から得られる第１のグルーピング情報と、高域符号化側で独自に生成した第２のグルーピング情報とのいずれかを選択して使用する構成としても良い。図２２は、実施の形態８に係るさらに他の符号化装置２３００の構成を示すブロック図である。符号化装置２３００は、低域ショートブロックのグルーピングを示す第１のグルーピング情報と、高域変換サブブロックのグルーピングを示す第２のグルーピング情報とのいずれかを選択的に使用して高域における符号化セルを決定する符号化装置であって、変換ブロック長選択部１３０１、類似度判定部１３１２（ノイズ性判定部１３０４、予測誤差算出部１３０５、切換判定部１３０６）、時間−周波数変換部１７０１、帯域分割部１７０２、低域符号化部１９０１、高域符号化部１９０４（第１の高域符号化部１９０２、第２の高域符号化部１９０３）、グルーピング情報生成部２３０１、グルーピング情報切換部２３０２および符号多重化部２３０３を備える。
【０１６１】
グルーピング情報生成部２３０１は、いったん、第１のグルーピング情報１９１１にしたがって高域係数１３１６のグルーピングを行う。この処理において、グルーピング情報生成部２３０１は、例えば、同じグループにグルーピングされた変換サブブロック間の差分誤差を計算し、算出された差分誤差の値が、あらかじめ定めた閾値を超える場合には、第１のグルーピング情報１９１１に基づくグルーピングをキャンセルし、高域係数１３１６に基づいて独自にグルーピングを行う。グルーピングの結果、グルーピング情報生成部２３０１は、高域独自のグルーピング結果（すなわち、どの変換サブブロックとどの変換サブブロックとをグルーピングしたか）を示す第２のグルーピング情報２２１０を生成する。
【０１６２】
また、グルーピング情報切換部２３０２にグルーピング情報切換情報２３１１を出力し、グルーピング情報切換部２３０２が第２のグルーピング情報２２１０を選択するよう切り換える。このように選択されたグルーピング情報２３１０にしたがって、高域符号化部１９０４は、低域係数１７０６との類似度情報１３１８に基づいて高域係数１３１６を符号化し、高域符号２３１２を出力する。符号多重化部２３０３は、このように生成された高域符号化部１９０４からの▲１▼高域符号２３１２、低域符号化部１９０１からの▲２▼低域符号１９１０、グルーピング情報切換情報２３１１にしたがってグルーピング情報切換部２３０２によって第２のグルーピング情報２２１０が選択された場合には、その▲３▼グルーピング情報２３１０、変換ブロック長選択部１３０１からの▲４▼ブロック長情報１３１０および切換判定部１３０６からの▲５▼切換情報１３１９を多重化して、出力符号２３１３を生成する。
【０１６３】
上記のように、この構成においては、通常は第１のグルーピング情報１９１１を用いて高域係数１３１６のグループ化を行い、第１のグルーピング情報１９１１が高域係数１３１６のグループ化に適さない場合にのみ、高域符号化側で独自に生成した第２のグルーピング情報２２１０を利用するようにできるので、第２のグルーピング情報２２１０を表すために必要な符号ビット数を削減することができ、かつ、符号化品質を向上することが可能である。この構成においては、第２のグルーピング情報２２１０が選択された場合にのみ、第２のグルーピング情報２２１０を符号化し、出力符号２３１３に多重化する。
【０１６４】
なお、ここでは符号多重化部２３０３は、グルーピング情報生成部２３０１からのグルーピング情報切換情報２３１１に基づいて第２のグルーピング情報２２１０を符号化し、出力符号２３１３に多重化したが、本発明はこれに限定されない。例えば、グルーピング情報生成部２３０１は、第２のグルーピング情報２２１０に高域独自のグルーピング情報であることを示す識別コードを付加しておくものとする。これに対し、選択された高域符号化部１９０４は、第２のグルーピング情報２２１０を示す識別コードが付加されている場合には、その第２のグルーピング情報２２１０を高域符号２３１２に組み込んで符号化するが、識別コードが付加されていない第１のグルーピング情報１９１１は高域符号２３１２に組み込まないとしてもよい。この場合、符号多重化部２３０３は、グルーピング情報２３１０およびグルーピング情報切換情報２３１１を符号化および多重化する必要はなく、符号多重化部２３０３は切換情報１３１９、ブロック長情報１３１０、低域符号１９１０および高域符号２３１２を多重化するだけでよい。
【０１６５】
次に、本実施の形態８における復号化装置の構成について図２３を参照しながら説明する。図２３は、図１８に示した符号化装置１９００によって生成された出力符号１９１３を復号化する復号化装置２４００の構成を示すブロック図である。図２３に示した復号化装置２４００は、低域復号化部として、図１８に示した符号化装置１９００におけるＭＰＥＧ−４　ＡＡＣ方式等のブロックグルーピングと呼ばれる符号化方法に対応した復号化方式を使用し、低域復号化部から与えられるグルーピング情報を利用して高域の復号化を行うようにした復号化装置であって、符号分割部２４０１、低域復号化部２４０２、第１の高域復号化部２４０３、第２の高域復号化部２４０４、切換部２４０５、スペクトル合成部１６０７および周波数‐時間変換部１６０８を備える。
【０１６６】
図２３において、符号分割部２４０１は、入力符号２４１０をブロック長情報１６１０、切換情報１６１１、低域符号２４１１および高域符号２４１２に分割する。低域復号化部２４０２は、低域符号２４１１を、ブロック長情報１６１０で示される変換ブロック長と、低域符号２４１１に含まれるグルーピング情報２４１３とにしたがって復号化し、ＡＡＣ方式の符号化パラメータであるＭＤＣＴ係数で表された低域係数２４１５を出力する。これと併せて、低域復号化部２４０２は、低域符号２４１１から得られたグルーピング情報２４１３を切換部２４０５に出力する。このグルーピング情報２４１３は、切換部２４０５を介して、第１の高域復号化部２４０３および第２の高域復号化部２４０４のうち、切換情報１６１１によって選択された一方に入力される。選択されたほうの第１の高域復号化部または第２の高域復号化部は、ブロック長情報１６１０、高域符号２４１２、低域係数２４１５（第２の高域復号化部２４０４が選択された場合のみ）およびグルーピング情報２４１３から、高域係数２４１４を生成する。
【０１６７】
すなわち、選択された高域復号化部は、ブロック長情報１６１０により、基本変換ブロックが使用されることが示された場合には、基本変換ブロックに基づく符号化セル単位で復号化処理を行い、逆に変換サブブロックが使用されることが示された場合には、グルーピング情報２４１３によって示される変換サブブロックグループに基づく符号化セル単位で復号化処理を行う。
【０１６８】
変換サブブロックグループに基づく符号化セルには複数の変換サブブロックに対応するＭＤＣＴ係数の組が含まれているが、それぞれのＭＤＣＴ係数の組は、復号化された同一の符号化パラメータにしたがって復号化される。符号化パラメータから符号化セル内のＭＤＣＴ係数を算出する、第１の高域復号化部２４０３と第２の高域復号化部２４０４との動作については、グルーピング情報２４１３によって示される変換サブブロックグループに基づく符号化セル単位で復号化処理を行う以外、図１７に示した復号化装置１８００における第１の高域復号化部１６０５および第２の高域復号化部１６０６と同じであるので、説明を省略する。
【０１６９】
このように、それぞれ復号された低域係数２４１５と高域係数２４１４とは、同じ変換ブロック長に基づくＭＤＣＴ係数であるので、両者の合成は単純な加算処理で実現できる。スペクトル合成部１６０７は、低域係数２４１５と高域係数２４１４とを加算し、合成スペクトル係数２４１６を算出する。周波数−時間変換部１６０８は、得られた合成スペクトル係数２４１６を逆ＭＤＣＴ変換することにより、出力信号２４１７を生成し出力する。
【０１７０】
このような構成をとることにより、符号化装置１９００側では変換サブブロックをグループ化することによってより少ないビット量で入力信号２０７を符号化し、復号化装置２４００ではこのように少ないビット量で符号化された入力符号２４１０を正しく復号化することができる。
【０１７１】
また、図２１に示した符号化装置２２００による出力符号２２１２のように、復号化装置の入力符号に、高域符号化側で独自に生成された第２のグルーピング情報が多重化されている場合には、図２４に示すように第２のグルーピング情報に基づいて復号化を行う構成となる。図２４は、図２１に示した符号化装置２２００による出力符号２２１２を復号化する復号化装置２５００の構成を示すブロック図である。復号化装置２５００は、符号化装置において低域と高域とで別個にブロックグルーピングを行った場合の出力符号を、入力符号として復号化する復号化装置であって、符号分割部２５０１、低域復号化部２５０２、第１の高域復号化部２５０３、第２の高域復号化部２５０４、切換部２４０５、スペクトル合成部１６０７および周波数‐時間変換部１６０８を備える。
【０１７２】
符号分割部２５０１は、入力符号２５１０を低域符号２４１１、第２のグルーピング情報２５１１、高域符号２５１２、ブロック長情報１６１０および切換情報１６１１に分割する。低域復号化部２５０２は、低域符号２４１１に含まれている第１のグルーピング情報を切換部２４０５に出力しない点が図２３に示した低域復号化部２４０２と異なる。
【０１７３】
それ以外では、低域復号化部２５０２は、低域復号化部２４０２と同様、ショートブロックでブロックグルーピングが行われている場合には、低域符号２４１１に含まれている第１のグルーピング情報にしたがって低域符号２４１１を復号化し、ＭＤＣＴ係数で表された低域係数２４１５をスペクトル合成部１６０７と第２の高域復号化部２５０４とに出力する。
【０１７４】
第１の高域復号化部２５０３および第２の高域復号化部２５０４は、第２のグルーピング情報２５１１にしたがって、高域独自にグルーピングされた高域符号２５１２を復号化する点が、図２３に示した第１の高域復号化部２４０３および第２の高域復号化部２４０４と異なる。ただし、ハードウェア的に見れば、切換部２４０５を介して入力されるグルーピング情報にしたがって高域符号２４１２を復号化するという点では、第１の高域復号化部２５０３および第２の高域復号化部２５０４と、第１の高域復号化部２４０３および第２の高域復号化部２４０４とは、まったく同じものである。
【０１７５】
高域符号２５１２は、符号化装置２２００において、低域のショートブロックにおけるブロックグルーピングとは関係なく、高域における隣接変換サブブロック間の類似性に基づいてブロックグルーピングされている。第２のグルーピング情報２５１１には、高域符号２５１２の変換サブブロックがどのようにブロックグルーピングされているかが示されているので、第１の高域復号化部２５０３および第２の高域復号化部２５０４は、この第２のグルーピング情報２５１１にしたがって、正しく高域符号２５１２の符号化セルを判別することができる。
【０１７６】
このように、高域符号２５１２は、符号化装置２２００において高域の変換サブブロックの類似性に基づいてブロックグルーピングされているので、低域のブロックグルーピングをそのまま高域に転用して得られた復号化装置２４００の高域係数２４１４と比べて、より原音に忠実に符号化されているといえる。復号化装置２５００がこのような構成をとることにより、符号化装置２２００において変換サブブロックを高域係数に対して最適になるようにグループ化され、より効率的に符号化された入力符号を、復号化装置２５００において復号し、より高品質な出力信号を得ることができる。
【０１７７】
さらに、図２２に示した符号化装置２３００からの出力符号のように、高域係数と低域係数との類似度に応じて、高域係数のブロックグルーピングに用いられたグルーピング情報が異なっている場合の入力符号を復号化する復号化装置について説明する。図２５は、図２２に示した符号化装置２３００からの出力符号を入力符号として復号化する復号化装置２６００の構成を示すブロック図である。復号化装置２６００は、入力符号に第２のグルーピング情報が多重化されている場合には第２のグルーピング情報に従って高域符号を復号化し、入力符号に第２のグルーピング情報が多重化されていない場合には第１のグルーピング情報に従って高域符号を復号化する復号化装置であって、符号分割部２６０１、グルーピング情報切換部２６０２、第１の高域復号化部２６０３、第２の高域復号化部２６０４、低域復号化部２４０２、切換部２４０５、スペクトル合成部１６０７および周波数‐時間変換部１６０８を備える。
【０１７８】
符号分割部２６０１は、入力符号２６１０を、第２のグルーピング情報２６１１、高域符号２６１２、切換情報１６１１、ブロック長情報１６１０および低域符号２４１１に分割する。この入力符号２６１０には、高域について独自に生成された第２のグルーピング情報２６１１が使用される場合にのみ、第２のグルーピング情報２６１１が多重化されている。
【０１７９】
グルーピング情報切換部２６０２には、符号分割部２６０１から出力される第２のグルーピング情報２６１１と、低域復号化部２４０２から出力される第１のグルーピング情報２４１３とが入力される。グルーピング情報切換部２６０２は、入力符号２６１０に第２のグルーピング情報２６１１が多重化されている場合には、前記２入力のうち、符号分割部２６０１から出力される第２のグルーピング情報２６１１が高域復号化部に入力されるよう、第２のグルーピング情報２６１１を選択する。逆に、入力符号２６１０に第２のグルーピング情報２６１１が多重化されていない場合には、第１のグルーピング情報２４１３が高域復号化部に入力されるよう、第１のグルーピング情報２４１３を選択する。
【０１８０】
これによって、切換情報１６１１に従って選択されたほうの第１の高域復号化部２６０３または第２の高域復号化部２６０４は、グルーピング情報切換部２６０２からの出力である、選択された方のグルーピング情報に基づいて高域符号２６１２の復号化を行うことができる。すなわち、入力符号２６１０に第２のグルーピング情報２６１１が多重化されている場合には、第２のグルーピング情報２６１１に基づいて復号化を行い、入力符号２６１０に第２のグルーピング情報２６１１が多重化されていない場合には、低域復号化部２４０２からの第１のグルーピング情報２４１３に基づいて復号化を行う。
【０１８１】
なお、ここでは、グルーピング情報切換部２６０２が、第２のグルーピング情報２６１１のデータの有無を検知して、第２のグルーピング情報２６１１のデータがあれば第２のグルーピング情報２６１１を選択し、なければ第１のグルーピング情報２４１３を選択する機能を備えているが、それ以外の方法を用いて、第１のグルーピング情報２４１３と第２のグルーピング情報２６１１とを切り換えるようにしてもよい。
【０１８２】
例えば、符号分割部２６０１からグルーピング情報切換部２６０２への制御線を設けておき、符号分割部２６０１が入力符号２６１０に第２のグルーピング情報２６１１が多重化されているか否かを判断し、この判断結果に基づいてグルーピング情報切換部２６０２を切り換えるための制御信号をグルーピング情報切換部２６０２に出力するとしてもよい。
【０１８３】
あるいは、図２２に示した符号化装置２３００側で、例えば、グルーピング情報生成部２３０１がグルーピング情報切換部２３０２の切り換え結果を示すグルーピング情報切換情報２３１１を生成し、これを高域符号化部１９０４において高域符号２３１２に多重化しておいてもよい。この場合も例えば、符号分割部２６０１からグルーピング情報切換部２６０２への制御線を設けておき、符号分割部２６０１が高域符号２６１２からグルーピング情報切換情報２３１１を分離して制御線に出力し、グルーピング情報切換部２６０２を切り換えるようにしてもよい。
【０１８４】
さらに、グルーピング情報切換部２６０２を設けないで、第１のグルーピング情報２４１３および第２のグルーピング情報２６１１が両方ともに、切換部２４０５を介して選択されたほうの高域復号化部に入力されるようにしておき、選択された高域復号化部において、第２のグルーピング情報２６１１のデータがあれば、第２のグルーピング情報２６１１を第１のグルーピング情報２４１３に優先して用いる、としてもよい。
【０１８５】
このような構成をとることにより、符号化装置２３００において変換サブブロックが高域係数に対して最適になるようにグループ化され、より少ないビット量で符号化された入力符号を復号化装置２６００において復号化することができ、より高品質な出力信号を得ることができる。復号化装置２６００では、復号化装置２４００と比較して入力符号のビット量は大きいが、より原音に近い高品質な出力信号を得ることができる。また、復号化装置２６００では、復号化装置２５００と比較して入力符号のビット量は小さいが、それほど品質が劣化しない出力信号を得ることができる。
【０１８６】
なお、前記の実施の形態７および実施の形態８に係る符号化装置および復号化装置の説明においては、高域符号化部と低域符号化部とで同一のサンプリング周波数に基づく信号もしくは係数を用いたが、それぞれに異なるサンプリング周波数を用いることもできる。一例として、基本変換ブロックによる変換を行い、低域符号化において２４ｋＨｚ、高域符号化において４８ｋＨｚのサンプリング周波数を使用する場合を説明する。
【０１８７】
まず基本変換ブロック長として２０４８サンプルを使用し、サンプリング周波数４８ｋＨｚの入力信号２０４８サンプルを変換して２０４８点のＭＤＣＴ係数を得たとすると、このＭＤＣＴ係数は０から２４ｋＨｚの信号を表す係数である。このうち、低域係数として２０４８点のＭＤＣＴ係数の低域側１０２４点を取り出せば、低域係数は０から１２ｋＨｚの信号を表す係数となる。この低域係数１０２４点を、逆ＭＤＣＴ変換すれば１０２４サンプルの時間信号が得られることとなり、時間信号のサンプル数が半分となる、つまり、半分のサンプリング周波数にダウンサンプリングされたことになる。
【０１８８】
入力信号を直接ＭＤＣＴ変換して得られた２０４８点のＭＤＣＴ係数がサンプリング周波数４８ｋＨｚに基づく係数であれば、その低域側１０２４点を取り出すことによって得られた低域ＭＤＣＴ係数は、サンプリング周波数２４ｋＨｚに基づく係数であるといえる。この様にして得られた１０２４点の低域ＭＤＣＴ係数と２０４８点の全域ＭＤＣＴ係数を、それぞれ低域符号化部と高域符号化部に入力するようにすれば、低域符号化部と高域符号化部とで異なるサンプリング周波数を用いる構成が実現できる。
【０１８９】
この構成においては、高域符号化部の変換ブロック長２０４８サンプルに対して、低域符号化の変換ブロック長は１０２４サンプルとなり、両者は異なるが、それぞれのサンプリング周波数が４８ｋＨｚおよび２４ｋＨｚであるため、時間としての変換ブロック長は２０４８／４８０００＝１０２４／２４０００で等しくなる。逆に言うと、低域符号化の変換ブロック長と高域符号化の変換ブロック長とが時間的に等しいことが、本構成を実現するために必要な条件となる。
【０１９０】
復号化についても同様に、低域復号化部の変換ブロック長と高域復号化部の変換ブロック長とが時間的に等しくなるように設定し、低域復号化部で復号された２４ｋＨｚのサンプリング周波数に基づく１０２４点の低域ＭＤＣＴ係数を、アップサンプリング後の４８ｋＨｚのサンプリング周波数に基づく２０４８点のＭＤＣＴ係数の低域１０２４点部分とみなし、４８ｋＨｚサンプリングに基づくＭＤＣＴ係数として復号化された高域ＭＤＣＴ係数と合成するように構成すればよい。
【０１９１】
また、前記の実施の形態６から実施の形態８に係る符号化装置および復号化装置の説明においては、符号化セルごとに、第１と第２の高域符号化部もしくは第１と第２の高域復号化部を切り換えて使用できる構成を示したが、変換サブブロックが用いられる場合については、各変換サブブロックまたは変換サブブロックグループごとに切り換えて使用できる構成としても良い。符号化において変換サブブロックが使用される場合、入力信号は、時間軸方向の変化は大きいが、周波数軸方向の変化は小さいことが多い。したがって、同一の変換サブブロックもしくは変換サブブロックグループに含まれる、周波数軸方向の複数の高域符号化サブバンドに対しては、同一の高域符号化部を選択することができる。この構成によれば、符号化部の切換情報を、符号化セル単位では無く、変換サブブロックまたは変換サブブロックグループ単位で符号化すれば良いので、切換情報の符号化に必要なビット量を削減できる。
【０１９２】
なお、本発明の実施の形態１〜実施の形態８に係る符号化装置および復号化装置は、各部の機能を実現する処理手順を連携することによって、符号化方法および復号化方法として実現することができる。各処理手順は、それぞれを、プログラミング言語を用いてソフトウェアとして記述し、プロセッサ上でソフトウェアを実行することによって実現される。プログラミング言語を用いて記述されたソフトウェアは、ＲＯＭ（読み出し専用メモリ）やハードディスクドライブ等の記録媒体に記録され、記録媒体に直接もしくはインターネット等のネットワークを介して接続されたプロセッサ上で実行することにより、本発明の実施の形態１から実施の形態８に係る符号化装置および復号化装置の機能を実現することができる。
【０１９３】
なお、本発明に係る符号化装置は、ＢＳおよびＣＳを含む衛星放送の放送局に備えられる音響符号化装置として、またインターネットなどの通信ネットワークを介してコンテンツを配信するコンテンツ配信サーバの音響符号化装置として、さらに、汎用のコンピュータによって実行される音響信号符号化用のプログラムおよびそれを記録した記録媒体として有用である。
【０１９４】
また、本発明に係る復号化装置は、家庭のＳＴＢに備えられる音響復号化装置としてだけでなく、汎用のコンピュータによって実行される音響信号復号化用のプログラムおよびそれを記録した記録媒体として、またＳＴＢまたは汎用のコンピュータに備えられる音響信号復号化用の専用の回路基板、ＬＳＩなどとして、さらにＳＴＢまたは汎用のコンピュータに挿入されるＩＣカードとして有用である。
【０１９５】
【発明の効果】
本発明の符号化装置は、上記目的を達成するために、入力信号を低い周波数帯域の信号を表す低域信号と高い周波数帯域の信号を表す高域信号とに分割する帯域分割手段と、分割された低域信号を符号化し、低域符号を生成する低域符号化手段と、前記高域信号と前記低域信号との類似度を判定する類似度判定手段と、前記高域信号を、異なる符号化方法で符号化し、高域符号を生成する複数の高域符号化手段と、前記類似度に基づいて、前記複数の高域符号化手段の１つを選択して、選択された高域符号化手段に符号化を実行させ、選択された高域符号化手段の符号化方法を特定するための選択情報を生成する符号化選択手段と、前記低域符号化手段からの低域符号、選択された前記高域符号化手段からの高域符号および前記符号化選択手段からの選択情報を多重化して、出力符号を生成する符号多重化手段とを備える。
【０１９６】
また、この符号化装置によって符号化された出力符号を復号化する復号化装置は、入力符号から、低い周波数帯域の信号を表す低域信号を復号化する低域復号化手段と、入力符号から、異なる復号化方法を用いて、高い周波数帯域の信号を表す高域信号を復号化する複数の高域復号化手段と、入力符号に含まれる選択情報に基づいて、前記高域復号化手段の１つを選択し、選択した高域復号化手段に復号化を実行させる復号化選択手段と、前記低域復号化手段によって復号化された前記低域信号と、選択された前記高域復号化手段によって復号化された前記高域信号とを合成して、出力復号化信号を生成する信号合成手段とを備える。
【０１９７】
以上のような構成をとることにより、入力信号を低い周波数帯域の信号と高い周波数帯域の信号に分離し、高い周波数帯域の信号を低い周波数帯域の信号との類似度に基づいて少ないビット量で適切に符号化することができるので、より少ないビット量の符号から高品質な再生信号を得ることができる。
【０１９８】
また、本発明の他の符号化装置および復号化装置によれば、類似度を算出する際に低い周波数帯域の信号として低域符号化手段により符号化された符号を復号化して得られた信号を用いることにより、類似度としてより適正な値を得ることができるため、より少ないビット量の符号から高品質な再生信号を得ることができる。
【０１９９】
また、本発明のさらに他の符号化装置および復号化装置によれば、低い周波数帯域の信号をダウンサンプリングすることによって低域符号化手段の符号化効率を高めることができるため、より少ないビット量の符号から高品質な再生信号を得ることができる。
【０２００】
また、本発明のさらに他の符号化装置および復号化装置によれば、高い周波数帯域のスペクトル係数を分割し、分割されたそれぞれの係数に対して最適な符号化方法を適用することができるので、より少ないビット量の符号から高品質な再生信号を得ることができる。
【０２０１】
本発明のさらに他の符号化装置および復号化装置によれば、低域符号化手段と高域符号化手段とにおいて符号化されるスペクトル係数を一括して算出することができるため、時間−周波数変換に必要な演算量を削減することができる。さらに、低域復号化手段と高域復号化手段とで復号化されたスペクトル係数を一括して時間信号に変換できるため、周波数−時間変換に必要な演算量を削減することができる。
【０２０２】
また、本発明のさらに他の符号化装置および復号化装置によれば、入力信号を低い周波数帯域の信号と高い周波数帯域の信号に分離した上で、低い周波数帯域の信号に対しては従来の符号化方法を適用し、高い周波数帯域の信号に対しては、高い周波数帯域の信号と低い周波数帯域の信号との類似度を判定する類似度判定手段と、類似度判定手段の判定結果に基づいて、低い周波数帯域の信号との類似度を利用もしくはランダムノイズ信号を用いて符号化する高域符号化手段を適用する。このような構成をとることより、例えばステレオで６４ｋｂｐｓ以下といった低ビットレートにおいて、従来の方法では符号化できなかった高い周波数帯域の信号を符号化および復号化することを可能にし、符号化された音響信号の聴感的な品質を向上させることができる。
【０２０３】
以上の説明から明らかなように、本発明の符号化装置および復号化装置によれば、入力信号を低い周波数帯域の信号と高い周波数帯域の信号に分割し、高い周波数帯域の信号について、高い周波数帯域の信号と低い周波数帯域の類似度を利用して符号化を行うことにより、高い周波数帯域の信号を効率的に符号化し、より少ないビット量の符号から高音質な復号化信号を得ることができる。
【０２０４】
また、低い周波数帯域の符号化と高い周波数帯域の符号化に同一形式のスペクトル係数を使用することにより、より少ない処理量で高音質な復号化信号を得ることができる。
【図面の簡単な説明】
【図１】実施の形態１に係る符号化装置の構成を示すブロック図である。
【図２】図２（ａ）は、周波数の異なる２つの正弦波を時間信号として示す図である。図２（ｂ）は、周波数の異なる２つの正弦波を周波数信号として示す図である。
【図３】実施の形態１に係る復号化装置の構成を示すブロック図である。
【図４】実施の形態２に係る符号化装置の構成を示すブロック図である。
【図５】実施の形態３に係る符号化装置の構成を示すブロック図である。
【図６】図５に示した符号化装置によって符号化された出力符号を復号化する復号化装置の構成を示すブロック図である。
【図７】実施の形態４に係る符号化装置において高域符号および切換情報を生成する部分の構成を示すブロック図である。
【図８】図８（ａ）は、第１の高域符号化部の符号化方法を示す図である。図８（ｂ）は、第２の高域符号化部の符号化方法を示す図である。
【図９】図７に示した符号化装置に対応する復号化装置において高域符号を復号化する部分（高域復号化装置）の構成を示すブロック図である。
【図１０】実施の形態５に係る符号化装置の構成を示すブロック図である。
【図１１】図１０に示した符号化装置の出力符号である入力符号を復号化する復号化装置の構成を示すブロック図である。
【図１２】実施の形態６に係る符号化装置の構成を示すブロック図である。
【図１３】図１２に示した変換ブロック長選択部において基本変換ブロックが選択された場合の高域係数と低域係数とを示す図である。
【図１４】図１２に示した変換ブロック長選択部において変換サブブロックが選択された場合の高域係数と低域係数とを示す図である。
【図１５】図１２に示した符号化装置によって出力された出力符号を復号化する復号化装置の構成を示すブロック図である。
【図１６】実施の形態７に係る符号化装置の構成を示すブロック図である。
【図１７】図１６に示した符号化装置によって符号化された出力符号を復号化する復号化装置の構成を示すブロック図である。
【図１８】実施の形態８に係る符号化装置の構成を示すブロック図である。
【図１９】図１９（ａ）は、図１８に示した符号化装置の時間−周波数変換部からブロック長情報にしたがって出力される全域係数の一例を示す図である。図１９（ｂ）は、図１８に示した符号化装置の高域符号化部に入力される高域係数の一例を示す図である。図１９（ｃ）は、図１８に示した符号化装置の低域符号化部に入力される低域係数の一例を示す図である。
【図２０】図１８に示した符号化装置の低域符号化部においてグルーピングが行われる場合の高域係数と低域係数との関係を示す図である。
【図２１】図２１は、実施の形態８に係る他の符号化装置の構成を示すブロック図である。
【図２２】図２２は、実施の形態８に係るさらに他の符号化装置の構成を示すブロック図である。
【図２３】図１８に示した符号化装置によって生成された出力符号を復号化する復号化装置の構成を示すブロック図である。
【図２４】図２１に示した符号化装置による出力符号を復号化する復号化装置の構成を示すブロック図である。
【図２５】図２２に示した符号化装置からの出力符号を入力符号として復号化する復号化装置の構成を示すブロック図である。
【図２６】従来のＡＡＣ方式を用いた符号化装置の構成を示すブロック図である。
【符号の説明】
２００　　符号化装置
２０１　　帯域分割部
２０２　　低域符号化部
２０３　　類似度判定部
２０４　　切換部
２０５　　高域符号化部
２０６　　符号多重化部
２０７　　入力信号
２０８　　低域信号
２０９　　高域信号
２１０　　切換情報
２１１　　類似度情報
２１２　　高域符号
２１３　　低域符号
２１４　　出力符号
４００　　復号化装置
４０１　　符号分割部
４０２　　低域復号化部
４０３　　切換部
４０４　　高域復号化部
４０５　　信号合成部
４０６　　入力符号
４０７　　低域符号
４０８　　高域符号
４０９　　切換情報
４１０　　低域信号
４１１　　高域信号
４１２　　出力信号[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an encoding apparatus that encodes an input acoustic signal, and a decoding apparatus that reproduces an acoustic signal again by performing decoding after accumulating or transmitting a code obtained by the encoding apparatus, and The present invention relates to an encoding method and a decoding method in the encoding device and the decoding device.
[0002]
[Prior art]
As a technique for encoding a general acoustic signal with a small bit amount and obtaining a high-quality reproduction signal, a method using band division coding is widely known. This is because the input acoustic signal is divided into a plurality of frequency band signals using a band division filter, or converted to a frequency axis signal using time-frequency conversion such as Fourier transform, and then the frequency axis. This can be realized by dividing the data into a plurality of bands and assigning appropriate coded bits to the divided bands. The reason why a high-quality reproduction signal can be obtained from a code with a small bit amount by using band division coding is that processing based on human auditory characteristics can be performed at the coding stage.
[0003]
In general, human hearing is less sensitive to sounds with a high frequency of about 10 kHz or more, and sounds with low levels are less likely to be detected. Also, a phenomenon called frequency masking is well known, and when a high level sound exists in a specific frequency band, a low level sound in the peripheral band becomes difficult to detect. For such a portion that is difficult to detect due to auditory characteristics, even if many bits are assigned and encoded, it hardly contributes to improving the quality of the reproduced signal and does not make sense to encode. Conversely, by reassigning the coded bits assigned to this part without considering the auditory characteristics to other auditory sensitive parts, the auditory sensitive parts are encoded in detail and the reproduced signal is reproduced. Can improve the quality.
[0004]
A typical example of encoding using such band division is ISO international standard MPEG-4 AAC (ISO / IEC 14496-3). The operation of MPEG-4 AAC (hereinafter referred to as AAC) will be described below with reference to the drawings.
[0005]
FIG. 26 is a block diagram showing a configuration of a coding apparatus 100 using a conventional AAC scheme. The encoding apparatus 100 is an encoding apparatus that evaluates an input signal 109 according to auditory characteristics, and encodes the signals by bit allocation according to the evaluation, and includes an auditory characteristic evaluation unit 101, a transform block length selection unit 102, and an MDCT conversion unit 103. , A band division unit 104, a spectrum signal processing unit 105, a bit allocation unit 106, a quantization unit 107, and a code multiplexing unit 108.
[0006]
The input signal 109 is divided by a basic frame length of 1024 samples / frame and input to the auditory characteristic evaluation unit 101 and the MDCT conversion unit 103. The auditory characteristic evaluation unit 101 evaluates the input signal 109 according to the human auditory characteristic and outputs an auditory characteristic evaluation value 110. The transform block length selection unit 102 selects a transform block length suitable for encoding the input signal 109 from the auditory characteristic evaluation value 110, and outputs it to the MDCT transform unit 103. The MDCT conversion unit 103 converts the input signal 109 into the MDCT coefficient 111 by applying the selected conversion block length. In the case of AAC, the transform block length is 128 samples or 1024 samples. Usually, a short transform block length is assigned when the input signal 109 is transient, and a long transform block length is assigned when the input signal 109 is stationary.
[0007]
The MDCT (Modified Discrete Cosine Transform) used here is a kind of cosine transform, and the calculated MDCT coefficient 111 is a coefficient representing the frequency spectrum of the input signal 109. The calculated MDCT coefficient 111 is divided into a plurality of frequency bands (subbands) in the band dividing unit 104. After that, the spectrum signal processing unit 105 performs prediction for enabling more efficient encoding and noise shaping processing based on the auditory characteristic evaluation value 110 for the MDCT coefficient 112 divided into each frequency band. .
[0008]
Further, in the spectrum signal processing unit 105, when the input signal 109 is a signal composed of a plurality of channels such as a stereo signal, a process called joint stereo is also performed to increase the coding efficiency by utilizing the correlation of signals between channels. Done. Furthermore, there is a case where a process called PNS (Perceptual Noise Substation) is performed, which will be described in detail later.
[0009]
Information about what processing is performed in the spectrum signal processing unit 105 is output as the auxiliary information code 114. The bit allocation unit 106 calculates a bit allocation 115 necessary for quantization based on the auditory characteristic evaluation value 110 and outputs the bit allocation 115 to the quantization unit 107. The quantization unit 107 quantizes the MDCT coefficient 113 processed by the spectrum signal processing unit 105 with the number of bits indicated by the bit allocation 115. The quantization is performed on a combination of normalized gain information for each subband called a scale factor and MDCT coefficient values normalized by the scale factor.
[0010]
The code multiplexing unit 108 multiplexes the auxiliary information code 114 output from the spectrum signal processing unit 105 and the spectrum code 116 output from the quantization unit 107, rearranges them into a predetermined format, and outputs the result as an output code 117. . In the case of AAC, since the number of bits allocated to a basic frame can be arbitrarily determined for each frame, encoding is basically performed at a variable bit rate, but it is called a bit reservoir before the final output. Transmission at a fixed bit rate is possible by providing a buffer and absorbing fluctuations in the bit rate for each frame by the buffer.
[0011]
Next, PNS processing will be described. In the PNS, for each of the subbands, it is determined whether the band can be expressed as a noise component audibly, and if it is determined that the band can be expressed as a noise component, the MDCT coefficient of the band is randomly generated. Replace with signal. For the band replaced with the noise signal, it is not necessary to quantize the MDCT coefficient value, and only the gain information corresponding to the scale factor needs to be quantized. It becomes possible to reduce significantly.
[0012]
By performing such encoding processing, in MPEG-4 AAC, for example, a wideband stereo signal of 20 Hz to 16 kHz or more can be encoded with high sound quality at a bit rate of about 96 kbps.
[0013]
[Problems to be solved by the invention]
However, when the bit rate is further reduced, for example, when the bit rate is about 48 kbps with respect to a stereo signal, the band that can be encoded with high sound quality is narrowed, and the sound becomes audible. There is a problem.
[0014]
Also, if PNS is used extensively in order to reduce the number of encoded bits of MDCT coefficients at the stage of lowering the bit rate, the number of parts replaced by noise signals will increase, resulting in a sound with a lot of noise and distortion in terms of hearing. There is a problem.
[0015]
In view of the above problems, the present invention provides a high-quality audio signal having a wide bandwidth on the decoding device side that has received the code when transmitting the code of the audio signal encoded by the encoding device at a low bit rate. It is an object of the present invention to provide an encoding device, a decoding device, an encoding method, and a decoding method that can be decoded.
[0016]
[Means for Solving the Problems]
In order to achieve the above object, the encoding apparatus of the present invention splits an input signal into a low frequency signal representing a low frequency band signal and a high frequency signal representing a high frequency band signal, A low-frequency encoding unit that encodes the generated low-frequency signal and generates a low-frequency code; a similarity determination unit that determines a similarity between the high-frequency signal and the low-frequency signal; and the high-frequency signal. Based on the similarity, a plurality of high frequency encoding units that encode with different encoding methods and generate a high frequency code, and select one of the plurality of high frequency encoding units to select the selected high frequency encoding unit. A coding selection unit for causing the band coding unit to perform coding and generating selection information for specifying a coding method of the selected high band coding unit; and a low band code from the low band coding unit The high-frequency code from the selected high-frequency encoding means and the encoding selection method Multiplexes selection information from, and a code multiplexing unit configured to generate an output code.
[0017]
Further, the decoding device for decoding the output code encoded by the encoding device includes a low frequency decoding means for decoding a low frequency signal representing a signal of a low frequency band from the input code, and an input code. A plurality of high frequency decoding means for decoding a high frequency signal representing a signal of a high frequency band using different decoding methods, and the high frequency decoding means based on the selection information included in the input code. Decoding selection means for selecting one and causing the selected high-frequency decoding means to perform decoding, the low-frequency signal decoded by the low-frequency decoding means, and the selected high-frequency decoding Signal synthesizing means for synthesizing the high frequency signal decoded by the means to generate an output decoded signal.
[0018]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
(Embodiment 1)
FIG. 1 is a block diagram showing a configuration of coding apparatus 200 according to Embodiment 1 of the present invention. The encoding device 200 is an encoding device that checks the similarity between a high frequency signal and a low frequency signal for an input signal, and encodes the high frequency signal with a smaller amount of bits. , A band division unit 201, a low frequency encoding unit 202, a similarity determination unit 203, a switching unit 204, a high frequency encoding unit 205, and a code multiplexing unit 206. In this case, the band dividing unit 201 divides the input signal 207 into a band and outputs a low frequency signal 208 including only a low frequency band signal and a high frequency signal 209 including only a high frequency band signal.
[0019]
The boundary frequency between the upper limit frequency included in the low-frequency signal 208 and the lower limit frequency included in the high-frequency signal may be arbitrarily determined, and both may overlap. The low frequency encoding unit 202 encodes the low frequency signal 208 and outputs a low frequency code 213. Here, any known encoding unit may be used as the low-frequency encoding unit 202. Next, the similarity determination unit 203 determines the similarity between the low frequency signal 208 and the high frequency signal 209, and outputs the switching information 210 and the similarity information 211. For example, a prediction error can be used as the similarity determination method, but a specific method will be described later in detail.
[0020]
The high frequency encoding unit 205 optimally encodes the high frequency signal 209 according to the values of the switching information 210 and the similarity information 211. Specifically, the high frequency encoding unit 205 includes n high frequency encoding units including first to nth (n is a natural number), one of which is selected according to the switching information 210. Is done. Each of the first to nth high frequency encoding units is configured to optimally encode the high frequency signal 209 having different characteristics classified according to the similarity between the low frequency signal 208 and the high frequency signal 209. .
[0021]
Based on the switching information 210 output from the similarity determination unit 203, the switching unit 204 selects and selects a predetermined optimum high-frequency encoding unit from the first to n-th encoding units. The high frequency encoding unit encodes the high frequency signal 209 with reference to the similarity information 211 and outputs a high frequency code 212. As an example of the similarity determination unit 203, when the similarity is low, it is determined that the input signal 207 is a random noise signal, and among the first to nth high frequency encoding units, The switching information 210 is selected so that a high frequency encoding unit that encodes the high frequency signal 209 using noise is selected. On the contrary, when the similarity is high, the low frequency determined to be high Using the signal 208, the switching information 210 is output so that the high frequency encoding unit that encodes the differential signal obtained by subtracting the low frequency signal 208 from the high frequency signal 209 is selected.
[0022]
Further, when the similarity between the low-frequency signal 208 and the high-frequency signal 209 is high, the similarity determination unit 203 provides a high-similarity part as a part of the similarity information 211 to the high-frequency encoding unit 205. When the similarity between the low frequency signal 208 and the high frequency signal 209 is low and the similarity between the low frequency signal 208 and the high frequency signal 209 is low, the similarity information 211 is output. A configuration is possible in which “0” is output.
[0023]
On the other hand, in the encoding using noise in the high frequency encoding unit 205, the selected high frequency encoding unit regards the high frequency signal to be encoded as a noise signal generated at random, and the intensity of the high frequency signal. Since only the gain information indicating is encoded, it is possible to greatly reduce the bit amount of the code as compared with the case where the value of the high frequency signal is directly encoded. When the similarity is high, the encoding using the differential signal quantizes the difference between the high frequency signal 209 and the low frequency signal 208 similar to the code of the high frequency signal 209 to be encoded. Can reduce the amount of bits required for conversion.
[0024]
In addition, when encoding a signal of a very high frequency band that is not very important for auditory characteristics when the similarity is very high, the target high frequency signal 209 is similar to that of the target without encoding the difference signal. The low frequency signal 208 may be replaced. In this way, since only the information for specifying the original low-frequency signal 208 to be replaced needs to be encoded, the amount of bits necessary for encoding the high-frequency signal 209 can be further reduced. The low frequency code 213, the high frequency code 212, and the switching information 210, which are output results of the encoding processes of the low frequency encoding unit 202, the high frequency encoding unit 205, and the similarity determination unit 203, Multiplexed and output as an output code 214.
[0025]
Next, a method using a prediction error will be described as an example of a similarity determination method. Since the low-frequency signal 208 and the high-frequency signal 209 are signals generated by dividing the input signal into bands, the frequency components included in the signals are different. Therefore, as long as each signal is viewed as a time signal, it is difficult to determine the similarity between the two, but when these signals are viewed as frequency signals, the distribution of frequency components is compared. It is easy to determine the similarity between the two. FIG. 2 described below illustrates this principle. As examples of signals having different frequency components, a 1 kHz sine wave and a 2 kHz sine wave having the same amplitude are used as a time signal and a frequency signal (for example, an FFT power spectrum). (Coefficient).
[0026]
FIG. 2 is a diagram illustrating a difference between a case where two signals are viewed as time signals and a case where they are viewed as frequency signals. FIG. 2A shows two sine waves having different frequencies as time signals. FIG. 2B is a diagram showing two sine waves having different frequencies as frequency signals. In FIG. 2A, there is no similarity that can be easily determined between the 1 kHz sine wave 301 and the 2 kHz sine wave 302 represented as a time signal. On the other hand, in FIG. 2B, the 1 kHz sine wave 303 and the 2 kHz sine wave 304 represented as frequency signals are both represented as one line spectrum, and the similarity between them is very large. Easy to judge. That is, the difference between the two is only the position information represented by “frequency”. For example, in FIG. 2B, the position of the sine wave 304 of 2 kHz is set as the reference point 305 and the signal extracted from the reference point 305 When the signal taken out from the position separated from the reference point 305 by the distance k (= shift number 306) from the 1 kHz sine wave is compared, the two signals match.
[0027]
Based on such a principle, the low-frequency signal 208 and the high-frequency signal 209 are converted into spectral coefficients indicating the frequency component of the input signal 207 by using the time-frequency conversion in the similarity determination unit 203, and the similarity is determined. A determination is made. As the time-frequency transformation used, any known transformation such as Fourier transformation, cosine transformation, and filter bank may be used. As shown in the example of FIG. 2, the similarity between the low frequency signal 208 and the high frequency signal 209 can be determined by shifting one signal on the frequency axis and superimposing it on the other signal. It is given as a measure that represents the part that does not match when the signals are superimposed. In the present invention, since the low-frequency signal 208 is used to encode the high-frequency signal 209, a configuration in which the similarity is determined by fixing the high-frequency signal 209 as a target and shifting the low-frequency signal 208 is desirable. In this configuration, the prediction error is defined as x (i) for the low frequency signal 208 converted to the spectral coefficient and y (j) for the high frequency signal 209.
[0028]
[Expression 1]

[0029]
It is represented by Here, x (k, j) is a signal extracted from a position shifted by k samples from x (j) as a reference point 305, a is a gain, m is a low-frequency signal x (k, j), and a high-frequency signal y. This is the number of spectral coefficients included in (j). The number m of spectral coefficients specifies the range of the high frequency signal y (j) to be encoded, and may be arbitrarily determined. The position of the reference point 305 and the upper limit (k_MAX) and the lower limit (k_MIN) of the shift number k of the low-frequency signal can be arbitrarily determined, and the prediction error E is varied by changing k within the set shift number range. If (k) is evaluated and the minimum value is selected, the low frequency signal x (k, j) having the highest similarity to the target high frequency signal y (j), the shift number k, and the gain a are determined. can do. Since x (k, j) is uniquely determined when the shift number k is determined, the prediction error E (k), the shift number k, and the gain a may be used as the similarity information 211.
[0030]
Switching between the first to nth high frequency encoding units is performed using the prediction error E (k) as an evaluation measure, and if the prediction error E (k) is smaller than a predetermined threshold, the similarity is high. The switching information 210 for selecting the high-frequency encoding unit used for the output is output. On the other hand, if it is large, the switching information 210 for selecting the high-frequency encoding unit used when the similarity is low is output.
[0031]
In the above description, the prediction error of the spectrum coefficient is used as the similarity determination method in the similarity determination unit 203. However, other than that, for example, a signal divided into bands by a filter bank is Prediction errors can also be used. In the above description, the high frequency encoding unit that encodes using noise and the high frequency encoding unit that uses the similarity between the low frequency signal 208 and the high frequency signal 209 are used. In the case of a medium level, a high-frequency encoding unit that combines encoding using similarity and encoding using noise may be used. If the input signal is a multi-channel signal such as a stereo signal, the high-frequency signal 209 is encoded using a joint stereo encoding technique in addition to the method of encoding the high-frequency signal 209 independently for each channel. You can also
[0032]
As described above, the input signal 207 is band-divided into a high frequency signal 209 and a low frequency signal 208, and the high frequency signal 209 is divided using the similarity between the high frequency signal 209 and the low frequency signal 208. By encoding, a high-quality reproduction signal can be obtained from a code having a smaller bit amount.
[0033]
FIG. 3 is a block diagram showing a configuration of decoding apparatus 400 according to Embodiment 1. The decoding apparatus 400 is a decoding apparatus that decodes the output code 214 encoded by the encoding apparatus 200 illustrated in FIG. 1 and reproduces an acoustic signal. The decoding apparatus 400 includes a code division unit 401 and a low frequency decoding unit 402. , A switching unit 403, a high frequency decoding unit 404, and a signal synthesis unit 405.
[0034]
In FIG. 3, an input code 406 corresponds to the output code 214 of the encoding device 200 in FIG. The code division unit 401 divides the input code 406 into a low frequency code 407, a high frequency code 408, and switching information 409. The low frequency decoding unit 402 decodes the low frequency code 407 and outputs a low frequency signal 410. As the low frequency decoding unit 402, any known decoding unit may be used as long as it has a function of decoding the low frequency code 407.
[0035]
The high frequency decoding unit 404 includes n high frequency decoding units including first to nth, and each high frequency decoding unit decodes a high frequency code according to a predetermined method. It is configured as follows. The switching unit 403 selects a high frequency decoding unit designated by the switching information 409, and the selected high frequency decoding unit decodes the high frequency code 408 while referring to the low frequency signal 410, and the high frequency signal 411. Is output. The signal synthesis unit 405 synthesizes the low frequency signal 410 and the high frequency signal 411 to generate an output signal 412.
[0036]
Here, the high frequency encoding unit 205 on the encoding device 200 side described above uses the high frequency encoding unit that encodes using noise and the low frequency signal determined to be similar to the high frequency encoding unit 205. In the case where the decoding apparatus 400 includes a high-frequency encoding unit that encodes a differential signal obtained by subtracting a low-frequency signal from a signal, the decoding device 400 needs to include a high-frequency decoding unit 404 corresponding thereto. Below, the structural example and its operation | movement are demonstrated. For the sake of explanation, in FIG. 3, a decoding unit for decoding a code encoded using noise is encoded as a first high frequency decoding unit, a difference signal between a low frequency signal and a high frequency signal. A decoding unit that decodes the generated code is a second high frequency decoding unit.
[0037]
When the switching information 409 indicates that the high frequency code 408 is input to the first decoding unit, the high frequency signal to be decoded is represented by a random noise signal. Accordingly, the high frequency code 408 includes only gain information indicating the signal strength. In response to this, the first high frequency decoding unit decodes the gain information included in the high frequency code 408, generates a random noise signal therein, and adds the gain information to the generated noise signal. A high frequency signal 411 is generated by multiplying the indicated gain.
[0038]
On the other hand, when it is indicated that the high frequency code 408 is input to the second high frequency decoding unit, the high frequency code 408 is similar to the high frequency signal 209 in the low frequency signal 208, for example. Similarity information such as the number of shifts, gain, and difference signal for specifying a high-frequency portion is encoded and included. The high frequency signal is expressed by the sum of the low frequency signal extracted from the low frequency signal 410 obtained by the low frequency decoding unit 402 and the difference signal obtained by the second high frequency decoding unit. The Therefore, the second high frequency decoding unit first decodes the high frequency code 408 and, based on the similarity information included in the high frequency code 408, converts the low frequency signal 208 similar to the high frequency signal 209 into the low frequency signal. After extracting from the signal 410 and multiplying by the decoded gain, the high frequency signal 411 is generated by adding the decoded differential signal. The low-frequency signal 410 and the high-frequency signal 411 obtained in this way are synthesized by the signal synthesis unit 405 and output as an output signal 412.
[0039]
As described above, the high frequency signal is decoded using the similarity between the high frequency signal and the low frequency signal from the input code which is divided and encoded into the high frequency signal and the low frequency signal. By doing so, it is possible to use a high-frequency code encoded more efficiently, so that a high-quality reproduction signal can be obtained from a code with a smaller bit amount.
(Embodiment 2)
The configuration of the encoding apparatus according to Embodiment 2 of the present invention will be described with reference to FIG. FIG. 4 is a block diagram showing a configuration of coding apparatus 500 according to Embodiment 2.
[0040]
The encoding apparatus 500 shown in FIG. 4 is newly provided with a low frequency decoding unit 501 with respect to the encoding apparatus 200 in Embodiment 1 shown in FIG. The low frequency code 213 is decoded to generate a low frequency decoded signal 510, and the similarity determination unit 503 is configured to input the low frequency decoded signal 510 instead of the low frequency signal 208. is there. Such an encoding apparatus 500 includes a band dividing unit 201, a low frequency encoding unit 202, a low frequency decoding unit 501, a similarity determination unit 503, a switching unit 204, a high frequency encoding unit 205, and a code multiplexing unit 206. Is provided.
[0041]
In FIG. 4, the same components as those of the encoding device 200 shown in FIG. 1 have already been described in the following drawings, and thus the same reference numerals are given and description thereof is omitted. In the encoding apparatus 500 configured as described above, the similarity determination unit 203 is a low-frequency decoded signal that is once encoded by the low-frequency encoding unit 202 and then decoded by the low-frequency decoding unit 501. Since the similarity with the high frequency signal 209 is determined using 510, the output code from the encoding device 200 that has determined the similarity with the high frequency signal 209 using the low frequency signal 208 of the original sound in the decoding device. Compared with decoding 214, there is an effect that a high-frequency signal closer to the original sound can be obtained.
[0042]
Specifically, as shown in the example of the decoding apparatus 400 in FIG. 3, the decoding apparatus side is not a low-frequency signal of the original sound, but a low-frequency code obtained by decoding the encoded low-frequency code once. The high frequency signal is decoded using the high frequency signal. In order to encode the high frequency signal more accurately, this is based on the similarity between the encoded low frequency signal and the high frequency signal that is the target of encoding. This indicates that it is necessary to encode a high frequency signal. Therefore, in encoding apparatus 500, similarity information 511 calculated by similarity determination section 503 is the similarity between high frequency signal 209 and low frequency decoded signal 510.
[0043]
As a result, the high frequency encoding unit 205 encodes the high frequency signal 209 in accordance with the similarity information 511 and the switching information 513, so that the high frequency code 512 that can decode the high frequency signal more accurately in the decoding device. Can be output. As a result, the encoding apparatus 500 outputs the output code 514 in which the switching information 513, the high-frequency code 512, and the low-frequency code 213 obtained in this way are multiplexed, and the quality of the decoded signal in the decoding apparatus. Can be improved.
[0044]
Note that the coding apparatus 200 according to the first embodiment and the coding apparatus 500 according to the second embodiment differ only in the low frequency signal used for calculating the similarity, and the high frequency coding unit and the low frequency signal are different. Since the configuration of the encoding unit and the high frequency signal and the low frequency signal to be encoded are the same, the configuration of the decoding device corresponding to the encoding device 500 of the second embodiment is the implementation shown in FIG. The same configuration as that of the decoding device 400 of the first embodiment may be used.
(Embodiment 3)
The configuration of the encoding apparatus according to Embodiment 3 of the present invention will be described with reference to FIG. FIG. 5 is a block diagram showing a configuration of coding apparatus 600 according to Embodiment 3.
[0045]
5 is newly provided with a downsampling unit 601 with respect to the encoding device 200 in Embodiment 1 shown in FIG. 1, and band division is performed with respect to the low-frequency encoding unit 202. Instead of the original low-frequency signal 208, the low-frequency signal 616 down-sampled by the down-sampling unit 601 is input, and the low-frequency code 613 that is the encoding result is output.
[0046]
In general, the low-frequency signal 208 and the high-frequency signal 209 that are band-divided by the band dividing unit 201 are signals having the same sampling frequency as that of the input signal 207. As an example, if the sampling frequency of the input signal 207 is 48 kHz and the boundary frequency divided by the band dividing unit 201 is 8 kHz, the input signal 207 includes a signal of 0-24 kHz according to the sampling theorem. The signal band included in 208 is 0-8 kHz and the high-frequency signal 209 is 8-24 kHz, but the sampling frequency of both the low-frequency signal 208 and the high-frequency signal 209 does not change at 48 kHz. Although it is possible to encode the low frequency signal 208 as it is by the low frequency encoding unit 202, generally, a known encoding unit used as the low frequency encoding unit 202 has a signal band included in the input signal 207. The maximum frequency determined by the sampling frequency of the input signal (= sampling frequency / 2) is configured to operate most efficiently.
[0047]
Therefore, encoding is performed by inputting a low-frequency signal 208 including only up to 8 kHz, which is 1/3 of the premise, to the low-frequency encoding unit 202 configured on the assumption that the sampling frequency is 48 kHz and includes a signal up to 24 kHz. This causes a problem that the efficiency of low-frequency encoding is reduced.
[0048]
In order to solve this problem, the sampling frequency of the low frequency signal 208 input to the low frequency encoding unit 202 is down-sampled according to the frequency band of the signal included in the low frequency signal 208, and the low frequency encoding unit The efficiency of 202 may be increased. More specifically, when the low-frequency signal 208 input to the down-sampling unit 601 is a time signal, the down-sampling unit 601 thins out the sampled data so that the desired sampling frequency is obtained, and the low-frequency signal is input. When the signal 208 is spectrum data subjected to frequency conversion, the data length of the low frequency signal 208 is notified to the low frequency encoding unit 202. In this example, since the frequency band of the signal included in the low frequency signal 208 is 0-8 kHz, the sampling frequency required to express the low frequency signal 208 is 16 kHz. If the low frequency signal 616 down-sampled to 16 kHz is input to the low frequency encoding unit 202, the low frequency encoding unit 202 can encode the input low frequency signal 208 most efficiently, There is an effect that the output code 614 with a small bit amount can be obtained while improving the quality of the digitized signal.
[0049]
Next, the configuration of decoding apparatus 700 according to Embodiment 3 will be described with reference to FIG. FIG. 6 is a block diagram showing a configuration of decoding apparatus 700 that decodes output code 614 encoded by encoding apparatus 600 shown in FIG. The decoding apparatus 700 shown in FIG. 6 is provided with an upsampling unit 701 in addition to the decoding apparatus 400 in Embodiment 1 shown in FIG. The band signal 710 is up-sampled by the up-sampling unit 701, and the up-sampled low-frequency signal 410 is input to the high-frequency decoding unit 404 and the signal synthesis unit 405. The decoding apparatus 700 includes a code division unit 401, a low frequency decoding unit 402, an upsampling unit 701, a switching unit 403, a high frequency decoding unit 404, and a signal synthesis unit 405. In the same figure, since the same components as those of the decoding device 400 shown in FIG. 3 have already been described, the same reference numerals are given and description thereof is omitted.
[0050]
In FIG. 6, an input code 706 corresponds to the output code 614 in the encoding device 600 of FIG. In the decoding device 700, the low frequency code 707 output from the code division unit 401 is obtained by encoding the low frequency signal 616 down-sampled on the encoding device 600 side. The sampling frequency of the decoded low frequency signal 710 is equal to the sampling frequency (for example, 16 kHz) of the low frequency signal 616 used on the encoding device 600 side.
[0051]
On the other hand, the high frequency code 408 is obtained by encoding the high frequency signal 209 that is not down-sampled, and the sampling frequency of the high frequency code 408 to be decoded is the input signal input to the encoding device 600. Equal to a sampling frequency of 207 (eg 48 kHz). Therefore, the decoding apparatus 700 performs sampling processing of the high frequency signal that is decoded by up-sampling the low frequency signal 710 in order to perform the decoding process of the high frequency signal using the decoded low frequency signal 710. (= Sampling frequency of output signal, for example, 48 kHz) and then input to high frequency decoding section 404. Similarly, the signal synthesis unit 405 also performs signal synthesis processing using the decoded high frequency signal 411 and the upsampled low frequency signal 410 to generate an output signal 412.
[0052]
As described above, after the low frequency signal 710 is decoded from the input code 706 that is divided and encoded into the high frequency code 408 and the down-sampled low frequency code 707, the high frequency code 408 is upsampled. By decoding the signal 411 using the similarity between the high-frequency signal 411 and the up-sampled low-frequency signal 410, a lower bit amount is obtained by using a low-frequency code encoded more efficiently. There is an effect that a high-quality reproduction signal can be obtained from the above code.
[0053]
(Embodiment 4)
Below, the structure of the encoding apparatus 800 in Embodiment 4 of this invention is demonstrated in detail, referring FIG. FIG. 7 is a block diagram showing a configuration of a part that generates high-frequency code 817 and switching information 816 in coding apparatus 800 according to Embodiment 4. In FIG. The portions of the similarity determination unit 818 and the high frequency encoding unit 819 shown by being surrounded by a broken line in FIG. 7 are low in the encoding device 200 and the encoding device 500 in the first to third embodiments of the present invention. A similarity determination unit 203 and a similarity determination unit 503 that determine the similarity between the high frequency signal 209 and the high frequency signal 209, and a high frequency encoding unit 205 that encodes the high frequency signal 209 according to the determination result. It corresponds, and represents a more desirable form as each similarity determination unit and high frequency encoding unit.
[0054]
Such an encoding apparatus 800 mainly includes a time-frequency conversion unit 801, a band division unit 802, a noise determination unit 803, a prediction error calculation unit 804, and a switching determination unit as a part for encoding the high frequency signal 209. 805, a switching unit 806, a first high frequency encoding unit 807, and a second high frequency encoding unit 808.
[0055]
In FIG. 7, for example, a high-frequency signal 209 and a low-frequency signal 208 that are band-divided into a high-frequency part and a low-frequency part in advance by a band dividing unit 201 (not shown) are input to the time-frequency conversion unit 801. . As the low-frequency signal 208, either a low-frequency signal directly divided from the input signal or a low-frequency decoded signal that has been once encoded by a low-frequency encoding unit (not shown) and then decoded can be used. good. The time-frequency conversion unit 801 converts the input high frequency signal 209 and low frequency signal 208 into spectral coefficients, and outputs them as a high frequency coefficient 811 and a low frequency coefficient 812, respectively. As the time-frequency conversion used in the time-frequency conversion unit 801, any known conversion such as Fourier transform, cosine transform, filter bank, etc. may be used, but here, the case where MDCT used in AAC is used as an example Will be described.
[0056]
The band dividing unit 802 further divides the high frequency coefficient 811 that is an MDCT coefficient obtained by converting the high frequency signal 209 into a plurality of bands (hereinafter referred to as a high frequency encoding subband), and divides the high frequency coefficient. 813 is output to the noise determination unit 803, the prediction error calculation unit 804, and the switching unit 806. Here, since the following processing for each high frequency encoding subband divided by the band dividing unit 802 is the same, processing in a specific high frequency encoding subband will be described below. In order to complete the encoding of the high frequency coefficient, the following process may be repeated for all the high frequency encoding subbands.
[0057]
The noise property determination unit 803 determines the noise property of the high frequency coefficient 813 and outputs a noise property determination value 814. The reason for performing the noise determination here is that if the high frequency coefficient 813 to be encoded is a noise signal, it can be estimated in advance that the similarity with the low frequency coefficient 812 is low and the prediction error is large. This is because the processing related to the prediction error calculation can be omitted. In the noise, the magnitude of the value of the MDCT coefficient representing the noisy signal shows a substantially uniform distribution. When a plurality of MDCT coefficients included in a certain band are examined, a certain value is obtained in the case of (stationary) noise. While many values exist in the vicinity of (for example, the average value), in the case of non-noise, there is a characteristic that a relatively large number of values exist in places far away from the average value.
[0058]
In order to determine the noise characteristic of the band-divided high frequency coefficient 813 using such characteristics, the noise characteristic determination unit 803, for example, determines the size of the MDCT coefficient in each high frequency encoding subband. The distribution may be evaluated, and it may be determined that the signal is non-noise if the distribution width is large and the noise signal if the distribution width is small with respect to a predetermined threshold. As a measure for evaluating the distribution of the MDCT coefficients, any statistical evaluation such as the difference between the minimum value and the maximum value of the MDCT coefficient in the high frequency coding subband, the difference between the average value and the maximum value, variance, standard deviation, etc. A value may be used.
[0059]
As a result of the determination, the noise property determination unit 803 outputs a noise property determination value 814 indicating the noise property / non-noise property of the high frequency coefficient 813 to the prediction error calculation unit 804 and the switching determination unit 805. The noise determination value 814 is, for example, “0” when the high frequency coefficient 813 is noise, and is “1” when the high frequency coefficient 813 is non-noise. The specific value of the noise determination value 814 may be a value other than this.
[0060]
The prediction error calculation unit 804 determines that the high frequency coefficient 813 and the low frequency when the high frequency coefficient 813 is notified by the noise characteristic determination value 814 as a non-noise signal as a determination result of the noise characteristic determination unit 803. In order to determine the degree of similarity with the coefficient 812, a prediction error is calculated and evaluated. The calculation of the prediction error can be performed in the same manner as the method disclosed in the first embodiment. That is, the prediction error E (k) includes the low frequency coefficient 812 as x (i), the high frequency coefficient 813 included in the high frequency encoding subband as y (j), and m as the high frequency encoding subband. Assuming the number of MDCT coefficients, it can be calculated by Equation 1. In Equation 1, by changing the shift number k between a predetermined lower limit value K_MIN and an upper limit value K_MAX, the low frequency coefficient x (k, k, which minimizes the prediction error E (k) as the similarity information 815. j), the shift number k, and the gain a can be obtained.
[0061]
The switching determination unit 805 determines a high frequency encoding unit to be used for encoding the band-divided high frequency coefficient 813 from the noise characteristic determination value 814 and the similarity information 815, and outputs the switching information 816 to perform switching. The unit 806 switches between the first high frequency encoding unit 807 and the second high frequency encoding unit 808. The first high frequency encoding unit 807 is configured to be suitable for encoding the high frequency coefficient 813 having a low similarity, and the second high frequency encoding unit 808 has a high similarity. This is an encoding unit configured to be suitable for encoding the high frequency coefficient 813. The switching unit 806 determines whether the (1) noise characteristic determination value 814 indicates a noise characteristic signal and (2) the noise characteristic determination value 814 indicates a non-noise signal and the similarity is low. 1 high frequency encoding unit 807 is selected, and when the noise determination value indicates a non-noise signal and the similarity is high, the second high frequency encoding unit 808 is selected.
[0062]
Next, the operation of the high frequency encoding unit 819 will be described in detail with reference to the drawings. FIG. 8A is a diagram illustrating an encoding method of the first high frequency encoding unit 807, and FIG. 8B is a diagram illustrating an encoding method of the second high frequency encoding unit 808. is there. The high frequency coefficient 813 input to the first high frequency encoding unit 807 is a high frequency encoding band out of the low frequency encoding band 902 and the high frequency encoding band 903 divided at the encoding boundary frequency 905. The MDCT coefficient (spectral coefficient) 901 is in the range of 903. Further, the high frequency encoding band 903 is divided into a plurality of high frequency encoding subbands 904.
[0063]
The spectral coefficient 901 input to the first high frequency encoding unit 807 is a coefficient having low noise characteristics or a low similarity to the MDCT coefficient of the low frequency encoding band 902, and thus the first high frequency encoding unit 807. Does not use the similarity information 815, and encodes the coefficients in the high frequency encoding subband 904 as random noise. That is, the coefficients in the high frequency encoding subband 904 are represented as a random sequence represented by the average amplitude 906, and only the average amplitude 906 (amplitude information) is encoded. In addition to the average amplitude, average energy, average power, or the like may be used as the representative value of the random series.
[0064]
Similarly, in the second high frequency encoding unit 808, the input high frequency coefficient 813 is the high frequency coefficient 913 divided between the low frequency encoding band 908 and the high frequency encoding band 909 divided at the encoding boundary frequency 911. MDCT coefficients (spectral coefficients) 907 are in the range of the area coding band 909. Further, the high frequency encoding band 909 is divided into a plurality of high frequency encoding subbands 910. Since the spectral coefficient 907 input to the second high frequency encoding unit 808 is a coefficient having a high similarity with the spectral coefficient of the low frequency encoding band 908, the second high frequency encoding unit 808 determines the similarity. Encode using information 815. Assuming that one of the high frequency encoding subbands 910 is a high frequency interval A915, the prediction error calculation unit 804 described above has the same number of coefficients for the high frequency interval A915 out of the coefficients of the low frequency encoding band 908. The low frequency section B 914 having the highest similarity including the spectrum coefficient is selected, and the similarity information 815 is output.
[0065]
The shift number 913k in the similarity information 815 is a distance from the encoding boundary frequency 911 to the lower limit frequency of the low frequency section B 914, with the encoding boundary frequency 911 as a reference point. The gain a is an amplitude ratio between the spectral coefficient 907 of the high frequency section A915 and the spectral coefficient 907 of the low frequency section B914. The second high frequency encoding unit 808 encodes the difference signal for each sample of the shift number 913k, the gain a, and the spectral coefficient of the high frequency interval A 915 and the low frequency interval B 914 as the obtained similarity information 815. To do.
[0066]
Note that encoding of the difference signal between the high frequency section A 915 and the low frequency section B 914 can be omitted. In this case, only the shift number 913 k and the gain a are encoded. Alternatively, the difference signal between the high frequency section A 915 and the low frequency section B 914 can be represented by a noise component, and information relating to the gain of the noise component may be encoded in addition to the shift number k and the gain a.
[0067]
As a method of expressing the gain a, the gain a is represented as (1) the amplitude ratio between the spectral coefficient 907 of the high frequency section A 915 and the spectral coefficient 907 of the low frequency section B 914, and the first high frequency encoding. Similarly to the amplitude information in the unit 807, (2) average amplitude 912 in each high frequency encoding subband 910 or (3) average energy, (4) average power in each high frequency encoding subband 910, etc. And may be encoded. The code of the differential signal encoded for each high frequency encoding subband 910 is multiplexed with information such as the number of shifts 913k and gain a, and is output as a high frequency code 817.
[0068]
It is also possible to reduce the bit amount of the high-frequency code 817 by compressing the code using a mathematical method at the multiplexing stage. For example, a method of re-encoding difference information between codes, a method of re-encoding a plurality of codes as vector information, a method of replacing a code with a variable length code such as a Huffman code, and the like can be used.
[0069]
Further, in this example, in obtaining the spectral coefficient 907 of the high frequency encoding subband 910, the high frequency signal 209 is converted into the MDCT coefficient and then the band division is performed on the MDCT coefficient 811. Then, after dividing the high-frequency signal 209 into bands of the high-frequency encoding subband 910 using a filter bank or the like, the MDCT coefficients of the high-frequency encoding subband 910 are obtained by performing MDCT conversion on the signals of the respective bands. May be.
[0070]
Further, the number of shifts 913k to be encoded is information for specifying the low-frequency section B 914, and therefore does not need to be a value representing the number of samples on the frequency axis. A plurality of low frequency sections having the same frequency width as that of the high frequency encoding subband 912 may be set, and may be represented by identification information indicating which low frequency section has been selected.
[0071]
Furthermore, in this example, the band dividing unit 802 is provided in the subsequent stage of the time-frequency converting unit 801. However, the present invention is not limited to this, and the band dividing unit 802 is provided in the preceding stage of the time-frequency converting unit 801. May be.
[0072]
As described above, the high frequency MDCT coefficient is divided into a plurality of high frequency encoding subbands, and the noise characteristics of the high frequency MDCT coefficient, the high frequency MDCT coefficient, and the low frequency are divided for each high frequency encoding subband. To obtain a high-quality reproduction signal from a code with a smaller bit amount by determining the similarity between the MDCT coefficients of the regions and performing encoding using the optimum high-frequency encoding unit based on the determination result Can do.
[0073]
Next, the configuration of the decoding apparatus according to Embodiment 4 of the present invention will be described with reference to FIG. FIG. 9 is a block diagram showing a configuration of a portion (high frequency decoding apparatus 1000) that decodes high frequency code 817 in a decoding apparatus corresponding to encoding apparatus 800 shown in FIG. The highband decoding part (highband decoding apparatus 1000) shown in FIG. 9 decodes the highband code in decoding apparatus 400 and decoding apparatus 700 in Embodiments 1 to 3 of the present invention. This represents a more desirable form.
[0074]
The high frequency decoding apparatus 1000 includes a time-frequency conversion unit 1001, a first high frequency decoding unit 1003, a second high frequency decoding unit 1004, a spectrum synthesis unit 1005, and a frequency-time conversion unit 1006. Hereinafter, in the high frequency decoding apparatus 1000 as well, the case where MDCT coefficients are used as the spectrum coefficients as in the encoding apparatus 800 will be described as an example. In FIG. 9, information input to highband decoding apparatus 1000 is highband code 1007, switching information 1008, and lowband decoded signal 1009.
[0075]
The high frequency code 1007 is, for example, the high frequency code 817 output by the first high frequency encoding unit 807 or the second high frequency encoding unit 808 of the encoding device 800 illustrated in FIG. The switching information 1008 is switching information 816 output by the switching determination unit 805 of the encoding device 800, for example. Furthermore, the low frequency decoded signal 1009 is, for example, the low frequency signal 410 output by the low frequency decoding unit 402 of the decoding device 400 illustrated in FIG.
[0076]
The time-frequency conversion unit 1001 performs MDCT conversion on the input low-frequency decoded signal 1009 and outputs a low-frequency coefficient 1010 represented by MDCT coefficients. The first high frequency decoding unit 1003 is a decoding unit for decoding the high frequency code 1007 encoded using noise, and the second high frequency decoding unit 1004 This is a decoding unit that decodes the high frequency code 1007 encoded using the similarity between the coefficient 812 and the high frequency coefficient 813.
[0077]
In encoding apparatus 800, high frequency coefficient 813 is divided and encoded into predetermined high frequency sub-bands, and therefore decoding processing corresponding thereto is performed based on the code corresponding to each high frequency sub-band. . That is, the code included in high frequency code 1007 is divided into codes for each high frequency encoded subband, and the decoding process is repeated for all high frequency encoded subbands based on the divided codes. Here, when the code corresponding to each high frequency encoding subband is compressed by the mathematical method on the encoding device 800 side, decoding by the mathematical method is also performed.
[0078]
When the switching information 1008 indicates that the high frequency code 1007 is input to the first decoding unit 1003, the high frequency coefficient 1011 obtained as a result of decoding is expressed by a random noise signal. Therefore, the high frequency code 1007 includes only gain information indicating the signal strength. Correspondingly, the first high-frequency decoding unit 1003 generates a random noise signal internally, and is divided into bands by multiplying the generated noise signal by the gain indicated by the decoded gain information. A high frequency coefficient 1011 is generated.
[0079]
On the other hand, when it is shown that the high frequency code 1007 is input to the second high frequency decoding unit 1004, the high frequency coefficient 1011 to be obtained as the decoding result is the low frequency according to the shift number. This is expressed by the sum of the low-frequency coefficient in a specific low-frequency section extracted from the coefficient 1010 and the difference extracted from the high-frequency code 1007. Therefore, the high frequency code 1007 is most similar to the high frequency coefficient in the high frequency encoding subband and the shift number k indicating the position where the low frequency coefficient is extracted, the gain a indicating the average amplitude of the high frequency coefficient, for example. The information which represents the difference with the low frequency coefficient in the low frequency section to be included is included.
[0080]
Since the high frequency coefficient is encoded as an MDCT coefficient which is a kind of spectrum coefficient, the low frequency decoded signal 1009 input for decoding the high frequency coefficient needs to be expressed as the MDCT coefficient. In general, since the low frequency decoded signal 1009 is given as a time signal, it is converted into an MDCT coefficient by the time-frequency conversion signal unit 1001 and input to the second high frequency decoding unit 1004 as the low frequency coefficient 1010. If the low frequency decoded signal 1009 is given as an MDCT coefficient, the low frequency signal 1009 can be regarded as the low frequency coefficient 1010, and the time-frequency conversion signal unit 1001 can be omitted.
[0081]
The second high frequency decoding unit 1004 extracts a predetermined number of MDCT coefficients from a position shifted by a shift number k from a predetermined reference point of the low frequency coefficients 1010. Next, the second high frequency decoding unit 1004 generates the MDCT coefficient representing the high frequency coefficient 1011 by multiplying the extracted low frequency MDCT coefficient by the gain a and adding the spectrum coefficient representing the difference. When the spectrum coefficient representing the difference is omitted, the MDCT coefficient representing the high frequency coefficient 1011 is obtained by multiplying the extracted MDCT coefficient by the gain a.
[0082]
The high frequency coefficient 1011 decoded by the first high frequency decoding unit 1003 or the second high frequency decoding unit 1004 and band-divided into the frequency width of the high frequency encoding subband is synthesized by the spectrum synthesis unit 1005. And output as a high frequency coefficient 1012 in which samples are arranged in order of frequency within the high frequency encoding band. The frequency-time conversion unit 1006 converts the high frequency coefficient 1012 into a time signal and generates a high frequency signal 1013.
[0083]
In the above description, when the shift number k does not indicate the number of samples on the frequency axis and is identification information representing a predetermined specific low frequency section, the specific low frequency section determined in advance according to the identification information What is necessary is just to extract MDCT coefficient.
[0084]
As described above, the high frequency signal is decoded using the similarity between the high frequency signal and the low frequency signal from the input code that is divided and encoded into the high frequency signal and the low frequency signal. As a result, a high-frequency code encoded more efficiently can be used, so that a high-quality reproduction signal can be obtained from a code with a smaller bit amount.
(Embodiment 5)
The configuration of the encoding apparatus according to Embodiment 5 of the present invention will be described with reference to FIG. FIG. 10 is a block diagram showing a configuration of encoding apparatus 1100 according to Embodiment 5. In FIG. An encoding apparatus 1100 shown in FIG. 10 includes a similarity determination unit 818 and a high frequency encoding unit 819 in the encoding apparatus 800 shown in FIG. 7, and uses low-frequency spectral coefficients that have been time-frequency converted in advance. Encode the low-frequency part.
[0085]
Any known spectral coefficient such as a Fourier transform coefficient or a cosine transform coefficient may be used as the spectral coefficient encoded by the low frequency encoding unit, but here, as an example, the low frequency encoding unit is an AAC system Encoding is performed, and MDCT coefficients are used as spectral coefficients. In FIG. 10, an encoding apparatus 1100 includes a time-frequency conversion unit 1101, a band division unit 1102, a similarity determination unit 818 (noise determination unit 803, prediction error calculation unit 804, switching determination unit 805), high frequency encoding. Section 819 (first highband encoding section 807, second highband encoding section 808), lowband encoding section 1109, and code multiplexing section 1110. The time-frequency conversion unit 1101 collectively performs MDCT conversion on the input signal 207 including the low-frequency signal and the high-frequency signal, and outputs an MDCT coefficient 1112.
[0086]
Since the low-frequency encoding unit 1109, the first high-frequency encoding unit 807, and the second high-frequency encoding unit 808 are all encoding units that encode MDCT coefficients, the time-frequency conversion unit 1101 receives the input signal. By converting 207 to MDCT coefficients 1112 in a lump, the required MDCT coefficients can be obtained by a single conversion, compared to the case where the MDCT conversion is separately performed by the low-frequency encoding unit and the high-frequency encoding unit. Therefore, the processing amount related to the MDCT coefficient calculation can be reduced.
[0087]
Band division section 1102 divides MDCT coefficient 1112 into low frequency coefficient 1114 and high frequency coefficient 813. In this case, since the band division processing is performed on the MDCT coefficient, processing such as a band division filter in the band division unit 1102 can be omitted, and the processing amount of the band division unit 1102 can be reduced. Further, the high frequency coefficient 813 can be divided into high frequency encoded subbands at this stage. As described above, the processing of the band-divided high frequency coefficient 813 is performed by the similarity determination unit 818 and the high frequency encoding unit 819 shown in FIG. 7, and is output as the switching information 816 and the high frequency code 817. The
[0088]
The low frequency encoding unit 1109 encodes the low frequency coefficient 1114 using the AAC method and outputs the low frequency coefficient 1114 as the low frequency code 1119. The code multiplexing unit 1110 includes the switching information 816 output from the switching determination unit 805, the high frequency code 817 output from the high frequency encoding unit 819, and the low frequency code 1119 output from the low frequency encoding unit 1109. And the output code 1120 is generated.
[0089]
As described above, by configuring the encoding apparatus 1100 with such a configuration, it is possible to encode the high frequency coefficient and the low frequency coefficient with a smaller processing amount.
Next, the configuration of the decoding apparatus according to Embodiment 5 of the present invention will be described with reference to FIG. FIG. 11 is a block diagram showing a configuration of decoding apparatus 1200 that decodes input code 1208 that is output code 1120 of encoding apparatus 1100 shown in FIG. The decoding apparatus 1200 shown in FIG. 11 includes the high frequency decoding apparatus 1000 shown in FIG. 9, and the low frequency decoding unit decodes the low frequency spectrum coefficients obtained as MDCT coefficients.
[0090]
As a spectral coefficient to be decoded by the low-frequency decoding unit, any known spectral coefficient such as a Fourier transform coefficient or a cosine transform coefficient may be used, but here, for example, the low-frequency decoding unit is based on the AAC method. Decoding is performed, and MDCT coefficients are used as spectral coefficients. The decoding apparatus 1200 includes a high frequency decoding unit 1203 (switching unit 1002, first high frequency decoding unit 1003, second high frequency decoding unit 1004), code division unit 1201, and low frequency decoding unit 1202. A spectrum synthesis unit 1206 and a frequency-time conversion unit 1207 are provided.
[0091]
In the decoding apparatus 1200, portions corresponding to the high frequency decoding apparatus 1000 shown in FIG. 9 are a switching unit 1002, a first high frequency decoding unit 1003, a second high frequency decoding unit 1004, a low frequency decoding. 9, the spectrum synthesizing unit 1206, and the frequency-time conversion unit 1207, but the low frequency decoding unit 1202 decodes the low frequency code 1210 in which the low frequency MDCT coefficients are encoded in FIG. It differs from the time-frequency converter 1001 shown. The spectrum synthesizing unit 1206 is different from the spectrum synthesizing unit 1005 shown in FIG. 9 in that not only the high frequency coefficient 1011 but also the low frequency coefficient 1010 is further synthesized with the spectrum synthesized high frequency coefficient 1011. Also, the frequency-time conversion unit 1207 of the decoding device 1200 performs IMDCT on the MDCT coefficient 1214 in which the low-frequency coefficient 1010 is further combined with the high-frequency coefficient 1012, and the low-frequency decoded signal 1009 on the time axis is MDCT Different from the frequency-time conversion unit 1006 of FIG.
[0092]
In decoding apparatus 1200 in FIG. 11, code division section 1201 extracts switching information 1008, high frequency code 1007, and low frequency code 1210 from input code 1208. On the other hand, the low frequency decoding unit 1202 decodes the extracted low frequency code 1210 by the AAC method, and outputs a low frequency coefficient 1010. The switching information 1008 extracted from the input code 1208 and the high frequency code 1007 are both input to the high frequency decoding unit 1203. In the high frequency decoding unit 1203, the first high frequency decoding unit 1003 or the second high frequency decoding unit 1004 selected according to the switching information 1008 decodes the high frequency code 1007 and sets the high frequency coefficient 1011. Output.
[0093]
Here, since the high frequency coefficient 1011 is decoded using the similarity with the low frequency coefficient, the low frequency decoded signal output from the low frequency decoding unit is converted into the same MDCT coefficient as the high frequency coefficient 1011. There is a need. On the other hand, in this configuration, since both the low frequency coefficient and the high frequency coefficient are encoded on the encoding device 1100 side, the low frequency coefficient 1010 is output as the MDCT coefficient. Therefore, the conversion process of the low frequency coefficient 1010 is not required, and the processing amount is reduced.
[0094]
The spectrum synthesis unit 1206 synthesizes the low frequency coefficient 1010 output from the low frequency decoding unit 1202 and the high frequency coefficient 1011 output from the high frequency decoding unit 1203 on the frequency axis. The frequency-time conversion unit 1207 collectively converts the MDCT coefficients 1214 synthesized by the spectrum synthesis unit 1206 into a time signal, and generates an output signal 1215.
[0095]
As described above, on the encoding device side, encoding is performed using the high-frequency coefficients and low-frequency coefficients that are collectively time-frequency converted, and on the decoding device side, the encoded codes are collectively displayed. By adopting a configuration that performs frequency-time conversion, the decoding apparatus can decode the high-frequency spectrum coefficient and the low-frequency spectrum coefficient with a smaller amount of processing to obtain an output signal.
[0096]
In the encoding apparatus 1100 and decoding apparatus 1200 configured as described above, as shown in the third embodiment, a downsampling unit is provided on the encoding apparatus side, and an upsampling unit is provided on the decoding apparatus side. The conversion may be performed at a sampling frequency lower than the sampling frequency of the input signal. In particular, in this configuration, as a down-sampling method, the low-frequency coefficient 1114 divided by the band dividing unit 1102 of the encoding device 1100 is directly converted into a coefficient obtained by converting a signal of the sampling frequency after down-sampling. It can be considered and processed.
[0097]
As an example, a case where an input signal having a sampling frequency of 48 kHz is down-sampled to 24 kHz, which is a half, is shown. Assuming that 2048 points of MDCT coefficients are obtained by converting 2048 samples of the input signal, the 2048 points of MDCT coefficients are coefficients representing a signal of 0 to 24 kHz. Among these, if the low frequency side 1024 points of the 2048 MDCT coefficients are taken out as the low frequency coefficient, the low frequency coefficient becomes a coefficient representing a signal of 0 to 12 kHz. If the low frequency coefficient 1024 points are subjected to inverse MDCT conversion, a time signal of 1024 samples can be obtained, and the number of samples of the time signal is halved, that is, down-sampled to a half sampling frequency. When the low-frequency encoding unit uses MDCT coefficients, the inverse MDCT conversion process is unnecessary, and thus the process related to downsampling can be simplified. Similarly, in the case of up-sampling, the decoded low-frequency coefficient may be processed as if it were a coefficient obtained by converting the signal of the sampling frequency after up-sampling as it is. (Embodiment 6)
[0098]
FIG. 12 is a block diagram showing a configuration of coding apparatus 1300 according to Embodiment 6. In FIG. The encoding device 1300 is an encoding device that performs time-frequency conversion by switching the conversion block length of an input signal according to whether or not the input signal is a signal that changes sharply in a minute time. Dividing unit 201, low frequency encoding unit 202, transform block length selecting unit 1301, time-frequency converting unit 1302, band dividing unit 1303, similarity determining unit 1312 (noise characteristic determining unit 1304, prediction error calculating unit 1305, switching determination Unit 1306), a switching unit 1307, a high frequency encoding unit 1313 (a first high frequency encoding unit 1308, a second high frequency encoding unit 1309), and a code multiplexing unit 1321.
[0099]
The high-frequency encoding unit 1313 and the similarity determination unit 1312 shown in the figure, and the high-frequency encoding unit 819 and the similarity determination unit 818 in the encoding device 800 shown in FIG. According to the selection, the configuration and the content of the processing are the same except that the processing is performed in units of high frequency coefficients of different numbers of samples. The high frequency code 1320 is also different from the high frequency code 817 in that it is encoded with a different transform block length according to the selection of the transform block length selection unit 1301. Similarly, in the noise determination value 1317, the similarity information 1318, and the switching information 1319, only the point that the transform block length of the high frequency coefficient 1316 to be processed changes according to the selection of the transform block length selection unit 1301 is shown in FIG. Are different from the noise determination value 814, similarity information 815, and switching information 816 shown in FIG. Since the configurations and operations of the high frequency encoding unit 819 and the similarity determination unit 818 illustrated in FIG. 7 have already been described, in the following, when there is no relation to new processing, the high frequency encoding unit 1313 and A description of the similarity determination unit 1312 is omitted.
[0100]
In FIG. 12, the transform block length selection unit 1301 determines the transform block length used in the time-frequency transform unit 1302 according to the characteristics of the input signal 207. For example, the transform block length selection unit 1301 measures the amount of change in the average signal level of the input signal 207 within a minute time Δt, and selects a short transform block length when the measured amount of change exceeds a predetermined value. . In addition, when the measured change amount is equal to or less than a predetermined value, a long transform block length is selected. The determined transform block length may be an arbitrary length, or may be selected from a plurality of predetermined options. In the following description, two types of transform block lengths are switched and used, and 1024 samples are used as a long transform block length, and 128 samples are used as a short transform block length, which are called a basic transform block and a transform subblock, respectively. To do. According to this configuration, one basic transform block can be divided into eight transform sub-blocks. The transform block length selection unit 1301 generates block length information 1310 representing the selected transform block length.
[0101]
The time-frequency conversion unit 1302 converts the low frequency signal 208 and the high frequency signal 209 band-divided by the band division unit 201 according to the block length information 1310 generated by the conversion block length selection unit 1301, respectively. A band coefficient 1315 and a high band coefficient 1314 are output. As the time-frequency conversion used by the time-frequency conversion unit 1302, any known conversion such as Fourier transform, cosine transform, and filter bank may be used, but MDCT is used in the following description. The band dividing unit 1303 further divides the high frequency coefficient 1314 represented by the MDCT coefficient into a plurality of high frequency encoded subbands, and then outputs the high frequency coefficient 1314 to each unit of the noise determination unit 1304, the prediction error calculation unit 1305, and the switching unit 1307. To do.
[0102]
Here, the relationship between the band-divided high frequency coefficient 1316 and the subsequent encoding process will be described with reference to the drawings. FIG. 13 is a diagram showing a high frequency coefficient 1316 and a low frequency coefficient 1315 when the basic transform block 1401 is selected by the transform block length selection unit 1301 shown in FIG. 12, and FIG. It is a figure which shows the high-pass coefficient 1316 and the low-pass coefficient 1315 when the conversion subblock 1506 is selected in the converted block length selection part 1301.
[0103]
Both FIG. 13 and FIG. 14 represent a high frequency coefficient 1316 and a low frequency coefficient 1315 when, for example, a signal of 1024 samples per frame is input as a time signal. As shown in FIG. 13, when the basic transform block 1401 is selected by the transform block length selection unit 1301 in FIG. 12, the time-frequency conversion unit 1302 applies the high frequency signal 209 represented by one frame 1024 samples. Thus, the conversion with the conversion block length of 1024 samples is performed only once. The same applies to the low frequency signal 208.
[0104]
Therefore, the high frequency coefficient 1314 in the high frequency encoding band 1402 and the low frequency coefficient 1315 in the low frequency encoding band 1403 are composed of a set of coefficients obtained by one conversion in the time axis direction. That is, this coefficient can be regarded as a coefficient that does not depend on a time change within the sampling time of this one frame. However, in the frequency axis direction, the high frequency coefficient 1314 in the high frequency encoding band 1402 is further divided into a plurality of high frequency encoding subbands 1404 by the band dividing unit 1303, and a plurality of sets of high frequency encoding subbands are obtained. A high frequency coefficient 1316 consisting of 1404 is obtained.
[0105]
Of the high frequency coefficients 1314 included in each basic transform block 1401, only the coefficient corresponding to the frequency band of the high frequency encoding band 1402 has a value, and the remaining coefficients are zero. Similarly, the low frequency coefficient 1315 included in each basic transform block 1401 has a value only for the coefficient corresponding to the frequency band of the low frequency encoding band 1403, and the remaining coefficients are zero. In FIG. 13, only the low frequency coefficient and high frequency coefficient having values are arranged on the same plane. Since the subsequent encoding process for each high frequency coefficient 1316 is performed for each high frequency encoding subband 1404 in the frequency axis direction, the unit of the encoding process when using the basic transform block 1401 is a diagonal line in FIG. This is a region 1405 (hereinafter referred to as “encoded cell”) in each high-band encoded subband 1404 shown in FIG.
[0106]
On the other hand, as shown in FIG. 14, when the transform subblock 1506 is selected by the transform block length selection unit 1301, the time-frequency transform unit 1302 has a high frequency represented by 1024 samples in one frame 1501. The signal 209 is converted by 1024/128 = 8 times with a conversion block length of 128 samples as a unit. The same applies to the low frequency signal 208 in the low frequency encoding band 1503. Therefore, both the high frequency coefficient 1314 and the low frequency coefficient 1315 obtained as a result of the conversion by the time-frequency conversion unit 1302 are coefficients composed of eight sets of conversion sub-blocks 1506 in the time axis direction.
[0107]
Each high frequency coefficient 1314 included in each transform sub-block 1506 is 128 samples, and only the coefficient corresponding to the frequency band of the high frequency encoding band 1502 has a value, and the remaining coefficients are zero. Similarly, the low frequency coefficient 1315 included in each transform sub-block 1506 is 128 samples, only the coefficient corresponding to the frequency band of the low frequency encoding band 1503 has a value, and the remaining coefficients are zero. In FIG. 15, only the low frequency coefficient and the high frequency coefficient having values are arranged on the same plane. The high frequency coefficient 1314 in the high frequency encoding band 1502 is further divided into a plurality of high frequency encoding subbands 1504 in the frequency axis direction by the band dividing unit 1303. Therefore, the unit of the subsequent encoding process is an encoding cell 1505 indicated by hatching in FIG.
[0108]
As apparent from FIG. 13 and FIG. 14, the coding cell is more finely divided in the time axis direction when the transform subblock 1506 is used than the basic transform block 1401 as the transform block length. Encoding with high accuracy is possible depending on the direction. However, if the number of divisions in the frequency axis direction is the same, the number of encoded cells per unit time increases, so the amount of bits necessary for encoding inevitably increases. In order to prevent this, when the transform subblock 1506 shown in FIG. 14 is used, the number of divisions in the frequency axis direction, that is, the number of highband encoding subbands 1504 is set to the high frequency when using the basic transform block 1401 shown in FIG. What is necessary is just to reduce rather than the number of the encoding subbands 1404.
[0109]
As a result, when the basic transform block 1401 is used, the encoded cell 1405 finely divided in the frequency axis direction is used, and when the transform subblock 1506 is used, the encoded cell 1505 finely divided in the time axis direction is used. become. As described above, the encoding apparatus 1300 can more efficiently encode the input signal 207 by switching between the encoding cell 1405 and the encoding cell 1505 according to the characteristics of the input signal 207. It becomes possible.
[0110]
The processing performed by each processing unit of the encoding device 1300 for each encoded cell is the same for both the basic transform block 1401 and the transform sub-block 1506. Therefore, in order to complete the encoding of the high frequency coefficient 1316 in one frame, the encoding process of the similarity determination unit 1312 and the high frequency encoding unit 1313 may be repeated for all the encoded cells.
[0111]
The code multiplexing unit 1321 includes block length information 1310 generated by the transform block length selection unit 1301, switching information 1319 from the switching determination unit 1306, low frequency code 213 from the low frequency encoding unit 202, and high frequency encoding unit. The high frequency code 1320 from 1313 is multiplexed to generate an output code 1311.
[0112]
In this embodiment, when the MDCT coefficient of the high frequency encoding subband is obtained, band conversion is performed on the MDCT coefficient after converting the high frequency signal 209 to the MDCT coefficient. After dividing the high frequency signal 209 into high frequency encoding subband bands using a filter bank or the like, the MDCT coefficients of the high frequency encoding subband may be obtained by performing MDCT conversion on the signals of the respective bands. .
[0113]
As described above, the high-frequency MDCT coefficient is divided into a plurality of encoded cells, and noise characteristics and similarity between the low-frequency MDCT coefficients are determined for each encoded cell. By performing encoding using the optimum high-frequency encoding unit, there is an effect that a high-quality reproduction signal corresponding to the characteristics of the input signal 207 can be obtained from a code with a smaller bit amount.
[0114]
Next, a decoding apparatus according to the sixth embodiment will be described with reference to FIG. FIG. 15 is a block diagram showing a configuration of decoding apparatus 1600 that decodes output code 1311 output by encoding apparatus 1300 shown in FIG. Hereinafter, as in the case of the encoding apparatus 1300, a case where MDCT coefficients are used as frequency spectrum coefficients will be described as an example.
[0115]
In FIG. 15, decoding apparatus 1600 decodes a high-frequency code encoded with a block length of either a basic transform block or a transform subblock according to block length information included in the input code. A code division unit 1601, a low-band decoding unit 1602, a time-frequency conversion unit 1603, a switching unit 1604, a first high-band decoding unit 1605, a second high-band decoding unit 1606, spectrum synthesis Unit 1607 and a frequency-time conversion unit 1608. The code dividing unit 1601 extracts block length information 1610, switching information 1611, a low frequency code 1612, and a high frequency code 1613 from the input code 1609.
[0116]
The low frequency decoding unit 1602 decodes the extracted low frequency code 1612 and outputs a low frequency signal 1614 that is a time signal. The time-frequency conversion unit 1603 converts the low frequency signal 1614 with the converted block length indicated by the block length information 1610, and calculates a low frequency coefficient 1615 that is an MDCT coefficient representing a low frequency component.
[0117]
The switching unit 1604 switches between the first highband decoding unit 1605 and the second highband decoding unit 1606 according to the extracted switching information 1611, and block length information for the selected highband decoding unit. 1610 and high frequency code 1613 are input. Here, the first high frequency decoding unit 1605 is a decoding unit for decoding a code encoded using noise, and the second high frequency decoding unit 1606 is a low frequency coefficient 1615 and a high frequency coefficient. It is a decoding part which decodes the code | symbol encoded using the similarity degree of. Since the high frequency coefficient is divided and encoded into predetermined encoding cells according to the transform block length and the high frequency encoding subband, the decoding process is performed for the code corresponding to each encoding cell. Done.
[0118]
That is, the code included in highband code 1613 is divided into codes for each coding cell, and the decoding process for the divided codes is repeated for all the coding cells. Here, on the side of the encoding apparatus 1300, when the code corresponding to each encoded cell is compressed by the mathematical method, decoding by the mathematical method is also performed.
[0119]
When the switching information 1611 indicates that the high frequency code 1613 is input to the first high frequency decoding unit 1605, the decoded high frequency coefficient is represented by a random noise signal, and the high frequency code 1613 is displayed. Includes only gain information representing signal strength. Accordingly, the first high frequency decoding unit 1605 generates a random noise signal internally, and multiplies the decoded gain to generate a band-divided high frequency coefficient 1616.
[0120]
On the other hand, when it is indicated that the high frequency code 1613 is input to the second high frequency decoding unit 1606, the band-divided high frequency coefficient 1616 is extracted from the low frequency coefficient 1615. This is expressed by the sum of the coefficient and the difference between the high-frequency coefficient encoded in the high-frequency code 1613 and the low-frequency coefficient. The high frequency code 1613 includes information indicating a shift number k indicating a position for extracting a low frequency coefficient, a gain a, and a difference. The second high frequency decoding unit 1606 extracts a predetermined number of MDCT coefficients from the position shifted by the shift number k from a predetermined reference point for the input low frequency coefficients 1615. The number of samples of the extracted MDCT coefficient varies depending on the block length.
[0121]
The second high frequency decoding unit 1606 generates an MDCT coefficient representing a high frequency coefficient by multiplying the extracted MDCT coefficient by a gain a and adding a spectrum coefficient representing the decoded difference information. When the information indicating the difference is omitted, the MDCT coefficient representing the high frequency coefficient is obtained by multiplying the extracted MDCT coefficient by the gain a. When the information representing the difference is represented by a noise component, the MDCT representing the high frequency coefficient is obtained by multiplying the extracted MDCT coefficient by the gain a and adding the noise component multiplied by the decoded gain. It becomes a coefficient.
[0122]
The spectrum synthesizer 1607 is decoded by the first high frequency decoding unit 1605 or the second high frequency decoding unit 1606, and is divided into a high frequency encoding subband and a high frequency coefficient 1616 and a low frequency coefficient 1615 is synthesized and output as a synthesized spectral coefficient 1617. The frequency-time conversion unit 1608 converts the high frequency part of the combined spectral coefficient 1617 into a time signal with the conversion block length indicated by the block length information 1610, and generates an output signal 1618.
[0123]
In the second high frequency decoding unit 1606, when the shift number k does not indicate the number of samples on the frequency axis but indicates identification information indicating a predetermined low frequency section, the identification information Thus, the MDCT coefficient of a predetermined specific section may be extracted.
[0124]
As described above, in Embodiment 6, in encoding apparatus 1300, encoding is performed by selectively switching the transform block length of highband signal 209 in accordance with the characteristics of input signal 207, and decoding apparatus 1600 Since the input code 1609 encoded in this way is decoded based on the block length information 1610, in addition to the effects in the first to fifth embodiments, the decoding apparatus 1600 can cope with the time change of the original sound. There is an effect that it is possible to obtain a higher-quality playback signal with improved followability.
[0125]
(Embodiment 7)
The configuration of the encoding apparatus according to Embodiment 7 of the present invention will be described with reference to FIG. FIG. 16 is a block diagram showing the configuration of encoding apparatus 1700 according to Embodiment 7. In FIG. In the encoding device 1700 of FIG. 16, the low frequency encoding unit performs encoding using the transform coefficient in the same format as the high frequency encoding unit with respect to the encoding device 1300 illustrated in FIG. 12, and The low-frequency encoding unit also differs in that encoding processing is performed according to the transform block length determined by the transform block length selection unit.
[0126]
The encoding apparatus 1700 includes a transform block length selection unit 1301, a switching unit 1307, a time-frequency conversion unit 1701, a band division unit 1702, a low frequency encoding unit 1703, a similarity determination unit 1312 (a noise determination unit 1304, a prediction An error calculation unit 1305, a switching determination unit 1306), a high frequency encoding unit 1313 (a first high frequency encoding unit 1308, a second high frequency encoding unit 1309), and a code multiplexing unit 1704. In the figure, the same components as those of the encoding device 1300 shown in FIG. 12 have already been described, and thus the same reference numerals are assigned and description thereof is omitted.
[0127]
In FIG. 16, the transform block length selection unit 1301 determines the transform block length according to the characteristics of the input signal 207. The time-frequency conversion unit 1701 performs time-frequency conversion of the input signals 207 of the entire low band and high band in accordance with the block length information 1310 from the conversion block length selection unit 1301 and performs a global coefficient 1705. Output. The time-frequency conversion unit 1701 may use any known conversion method, but here, an example using MDCT will be described.
[0128]
Band division section 1702 divides overall coefficient 1705 represented by the MDCT coefficient of the block length of either the basic transform block or the transform sub-block into each of the low frequency band and the high frequency band. A low frequency coefficient 1706 and a high frequency coefficient 1316 are output after being divided into area encoded subbands. The low frequency encoding unit 1703 encodes the low frequency coefficient 1706 represented by the MDCT coefficient of the variable transform block length, and outputs a low frequency code 1707. As this low-frequency encoding unit 1703, any known encoding that can cope with the variable transform block length may be used. Here, the MPEG-4 AAC system is used as an example of such an encoding unit. To do. The MPEG-4 AAC system supports two types of conversion block lengths of 1024 samples and 128 samples, and can be switched and used as necessary.
[0129]
Here, when the transform block length used in the high frequency encoding unit is different from the transform block length used in the low frequency encoding unit as in the encoding device 1300 illustrated in FIG. 12, It is necessary to perform time-frequency conversion. Therefore, if the transform block length used in the high frequency encoding unit is selected from two types of transform block lengths of 1024 samples and 128 samples, the transform block length of the low frequency encoding and the high frequency encoding is one. And both time-frequency conversion processes are the same. That is, the time-frequency conversion process can be shared between the low-frequency encoding unit and the high-frequency encoding unit. The code multiplexing unit 1704 includes block length information 1310 from the transform block length selection unit 1301, switching information 1319 from the switching determination unit 1306, high frequency code 1320 from the high frequency encoding unit 1313, and low frequency encoding unit 1703. The low-frequency code 1707 is multiplexed and an output code 1708 is output.
[0130]
Hereinafter, the operation of encoding apparatus 1700 configured as described above will be described along the flow of data. The time-frequency conversion unit 1701 collectively converts the input signal 207 into MDCT coefficients according to the block length information 1310. The converted whole area coefficient 1705 including the frequency components of the entire band is input to the band dividing unit 1702, and is output as the low band coefficient 1706 and the high band coefficient 1316 further divided into a plurality of bands.
[0131]
The low frequency coefficient 1706 is encoded by the low frequency encoding unit 1703, and the band-divided high frequency coefficient 1316 is encoded by the first high frequency encoding unit 1308 or the second high frequency encoding unit 1309. .
[0132]
As a result of encoding processing of the high frequency coefficient 1316 by the block length information 1310, the switching information 1319, and the high frequency encoding unit 1313, the high frequency code 1320 and the low frequency code 1707 obtained are multiplexed in the code multiplexing unit 1704. , An output code 1708 is generated.
[0133]
By providing the above configuration, the encoding apparatus 1700 can share the time-frequency conversion processing in the low frequency encoding and the high frequency encoding, and can perform the frequency division processing in the frequency band dividing unit 1702 in the frequency domain. Since the calculation can be performed on the coefficient, for example, the MDCT coefficient, the amount of calculation required for the band division processing can be reduced.
[0134]
Next, a decoding apparatus according to Embodiment 7 of the present invention will be described with reference to FIG. FIG. 17 is a block diagram showing a configuration of decoding apparatus 1800 that decodes output code 1708 encoded by encoding apparatus 1700 shown in FIG. In the decoding apparatus 1800 shown in FIG. 17, the low frequency decoding unit performs low frequency decoding on the decoding device 1600 according to Embodiment 6 using a transform coefficient in the same format as the high frequency. In addition, the low frequency decoding process is performed in accordance with the transform block length indicated by the block length information received from the transform block length selection unit.
[0135]
The decoding apparatus 1800 includes a code division unit 1601, a switching unit 1604, a first high frequency decoding unit 1605, a second high frequency decoding unit 1606, a spectrum synthesis unit 1607, a frequency-time conversion unit 1608, and a low frequency A decoding unit 1801 is provided. In the figure, since the same components as those of the decoding device 1600 shown in FIG. 15 have already been described, the same reference numerals are assigned and description thereof is omitted. Hereinafter, the decoding apparatus 1800 will be described as an example in which MDCT coefficients are used as frequency spectrum coefficients and the AAC scheme is used as a low-frequency decoding method, similar to the encoding apparatus 1700 described above. The low frequency decoding unit 1801 decodes the low frequency code 1803 separated from the input code 1802 according to the block length information 1610, and outputs a low frequency coefficient 1615.
[0136]
Hereinafter, an operation of decoding apparatus 1800 configured as described above will be described. In decoding apparatus 1800 in FIG. 17, input code 1802 is divided into block length information 1610, switching information 1611, low band code 1803 and high band code 1613 by code dividing section 1601. Among these, the low frequency code 1803 is decoded by the low frequency decoding unit 1801 according to the transform block length indicated by the block length information 1610. The decoded low frequency code 1803 is represented by MDCT coefficients that are AAC coding parameters, and is output as a low frequency coefficient 1615.
[0137]
On the other hand, the switching unit 1604 is switched by the switching information 1611 separated from the input code 1802, and either the first highband decoding unit 1605 or the second highband decoding unit 1606 is selected. The selected high frequency decoding unit decodes the high frequency code 1613, and outputs a high frequency coefficient 1616. The operations of first high frequency decoding section 1605 and second high frequency decoding section 1606 are the same as those of decoding apparatus 1600 of Embodiment 7 shown in FIG. Since the decoded low frequency coefficient 1615 and high frequency coefficient 1616 are MDCT coefficients based on the same transform block length, the combination of both can be realized by a simple addition process. The spectrum synthesizer 1607 adds the low frequency coefficient 1615 and the high frequency coefficient 1616 to calculate a synthesized spectral coefficient 1617. The frequency-time conversion unit 1608 performs inverse MDCT conversion on the combined spectral coefficient 1617 and outputs an output signal 1618.
[0138]
According to the present embodiment, since decoding apparatus 1800 has such a configuration, frequency-time conversion processing for low-band decoding and high-band decoding can be shared, so that decoding apparatus 1600 requires this. The time-frequency conversion unit for converting the existing low-frequency signal into the low-frequency coefficient can be omitted, and there is an effect that the decoding apparatus 1800 can be mounted more compactly. Furthermore, according to the present embodiment, the synthesis process in the spectrum synthesis unit can be performed on a frequency domain coefficient, for example, an MDCT coefficient, so that the amount of calculation required for the synthesis process can be reduced. There is.
[0139]
(Embodiment 8)
The configuration of the encoding apparatus according to Embodiment 8 of the present invention will be described with reference to FIG. FIG. 18 is a block diagram showing the configuration of encoding apparatus 1900 according to Embodiment 8. In FIG. The encoding device 1900 shown in FIG. 18 employs a method such as the MPEG-4 AAC method corresponding to an encoding method called block grouping for encoding the low frequency band, compared to the encoding device 1700 shown in FIG. However, the difference is that high-frequency encoding is performed using grouping information given from the low-frequency encoding unit.
[0140]
The encoding apparatus 1900 includes a transform block length selection unit 1301, a similarity determination unit 1312 (noise determination unit 1304, prediction error calculation unit 1305, switching determination unit 1306), time-frequency conversion unit 1701, band division unit 1702, A low-frequency encoding unit 1901, a high-frequency encoding unit 1904 (a first high-frequency encoding unit 1902, a second high-frequency encoding unit 1903) and a code multiplexing unit 1905 are provided. In the figure, the same components as those of the encoding device 1300 or the encoding device 1700 shown in FIG. 12 or FIG. 16 have already been described. Is omitted.
[0141]
Block grouping is a method of reducing the number of coding parameters included in an output code by sharing coding parameters in groups of transform sub-blocks having similar spectra. Hereinafter, block grouping used in the AAC scheme will be described in more detail with reference to FIG. FIG. 19A is a diagram illustrating an example of the global coefficient 1705 output according to the block length information 1310 from the time-frequency conversion unit 1701 of the encoding device 1900 illustrated in FIG. FIG. 19B is a diagram illustrating an example of the high frequency coefficient 1316 input to the high frequency encoding unit 1904 of the encoding device 1900 illustrated in FIG. FIG. 19C is a diagram illustrating an example of the low frequency coefficient 1706 input to the low frequency encoding unit 1901 of the encoding device 1900 illustrated in FIG.
[0142]
The AAC method supports two conversion block lengths of 1024 samples (long block = basic conversion block) and 128 samples (short block = conversion sub-block). However, in the case of a short block, MDCT in a plurality of adjacent short blocks is used. When the coefficients are compared and their similarity is high, the number of encoding parameters can be reduced by sharing some encoding parameters, and the number of bits required for encoding the input signal can be reduced. It can be configured.
[0143]
When the block length information 1310 from the transform block length selection unit 1301 indicates a short block consisting of 128 samples for each of the low frequency range and the high frequency range, the time-frequency conversion unit 1701 illustrated in FIG. MDCT conversion is performed for every 128 samples of 207, and a global coefficient 1705 as shown in FIG. The global coefficient 1705 output in this way is composed of 8 short blocks per basic conversion block (1024 samples), and the MDCT coefficient of each short block is from a low frequency to a high frequency determined by the sampling frequency. Represents the frequency band up to. Such a whole area coefficient 1705 is divided into a low frequency coefficient 1706 and a high frequency coefficient 1316 by the band dividing unit 1702 and output to the low frequency encoding unit 1901 and the high frequency encoding unit 1904, respectively.
[0144]
The low frequency encoding unit 1901 compares the MDCT coefficients of adjacent short blocks, and if the similarity between the short blocks is high, as shown in FIG. Are classified into one group. In short blocks belonging to one group, for example, an encoding parameter such as a normalization coefficient calculated for each encoding unit in the short block is shared. At this time, a set of adjacent short blocks sharing the encoding parameter is called a short block group, and information on how eight short blocks included in a frame of 1024 sample length are grouped is grouping information. 1911 is output to the high frequency encoding unit 1904.
[0145]
For example, as shown in FIG. 19C, when the short block 1 and the short block 2 are grouped in the low frequency encoding unit 1901, grouping information 1911 indicating the grouping information 1911 is output. This grouping information 1911 is encoded together with the low frequency coefficient 1706 by the low frequency encoding unit 1901. In the high frequency encoding unit 1904, the high frequency coefficient 1316 input from the band dividing unit 1702 is further divided into a plurality of bands than the low frequency coefficient 1706, and the high frequency encoding unit 1904 outputs a high frequency for each band. The domain coefficient 1316 is encoded. At this time, when the low frequency short block 1 and the short block 2 are grouped from the grouping information 1911, the high frequency encoding unit 1904 converts the conversion sub block 1 and the conversion sub block 2 corresponding to the high frequency. Coding is performed by sharing the encoding parameter with the.
[0146]
Here, in the configuration of encoding apparatus 1900, the lengths of the transform sub-block in the high frequency part and the short block of the AAC scheme in the low frequency part are all equal to 128 samples, and the transform block length of 128 samples. Each of the high frequency MDCT coefficient converted in step 1 and the low frequency MDCT coefficient converted in a short block of 128 samples holds the frequency high frequency component and low frequency component of the same signal. Therefore, the MDCT coefficients included in both are closely related. For example, if adjacent short blocks on a low frequency side are similar, corresponding sub blocks on the high frequency side may be similar. High nature. Therefore, if the grouping information 1911 output from the low frequency encoding unit using the AAC scheme is also used on the high frequency encoding side and the encoding parameters between similar sub-blocks are shared, the number of encoding parameters can be reduced. The number of bits required for encoding can be reduced.
[0147]
In the encoding device 1900 of FIG. 18, the grouping information 1911 output from the low frequency encoding unit 1901 is sent to the first high frequency encoding unit 1902 or the second high frequency encoding unit 1903 via the switching unit 1307. Entered. In the high frequency encoding unit selected by the switching unit 1307, based on the grouping information 1911, the shared parameters are encoded only for one transform subblock in the group, and the remaining transform subblocks are encoded. Is omitted. Here, a specific parameter sharing method based on the grouping information 1911 will be described.
[0148]
FIG. 20 is a diagram illustrating a relationship between the high frequency coefficient 1316 and the low frequency coefficient 1706 when grouping is performed in the low frequency encoding unit 1901 of the encoding device 1900 illustrated in FIG. As shown in FIG. 20, when the transform subblock 2106 is selected as the transform block length in the transform block length selection unit 1301, a transform block of 128 sample units is input to the input signal 207 of the 1024 sample basic transform block 2101. Eight sets of MDCT coefficients (full band coefficient 1705) converted by the length are obtained.
[0149]
The entire band coefficient 1705 is divided by the band dividing unit 1702 into a low frequency coefficient 1706 belonging to the low frequency encoding band 2103 and a high frequency coefficient 1316 belonging to the high frequency encoding band 2102. The low frequency encoding unit 1901 examines the similarity between adjacent short blocks of the low frequency coefficient 1706 and groups the short blocks having a high similarity. At this time, grouping information 1911 indicating which short blocks and which short blocks are grouped is generated and output to the high frequency encoding unit 1904. For example, it is assumed that the low frequency encoding unit 1901 generates grouping information 1911 indicating that the short block 2108 and the short block 2109 adjacent to each other are grouped. The high-band coding unit 1904 indicates a set of adjacent transform sub-blocks 2106 similar to each other by the grouping information 1911 given from the low-band coding unit 1901. Accordingly, the eight transform sub-blocks 2106 are converted into a plurality of transforms accordingly. Classify into sub-block groups.
[0150]
For example, in the high frequency band 2102, eight sets of transform sub-blocks 2106 are classified into transform sub-block group 2110, transform sub-block group 2111, and transform sub-block group 2112. For example, the transformation sub-block group 2110 is composed of one transformation sub-block 2106, and the transformation sub-block group 2111 is composed of five transformation sub-blocks 2106 (not shown). The transform sub-block group 2112 includes two transform sub-blocks 2106 corresponding to the short block 2108 and the short block 2109 in the low frequency band 2103, which are the seventh and eighth transform blocks. In order to share coding parameters within such a transform subblock group, the coding cell originally set for each transform subblock 2106 for each highband coding subband 2104 is replaced with the coding cell of FIG. What is necessary is just to set with respect to each conversion subblock group as 2105 shows.
[0151]
In this example, two sets of high frequency MDCT coefficients included in the encoding cell 2105 are represented by a set of encoding parameters and encoded. For example, when the second high frequency encoding unit 1903 encodes the high frequency coefficient 1316 included in the encoding cell 2105, the MDCT coefficients in the two sets of transform sub-blocks 2106 have the same shift number k and gain. Represented by a and encoded. On the other hand, when the sub-block grouping is not performed, the second high frequency encoding unit 1903 independently sets the shift number k and the gain a for each of the MDCT coefficients in the two sets of transform sub-blocks 2106. Must be encoded. Thus, for example, by grouping two transform sub-blocks 2106 into one transform sub-block group 2112, the number of coding parameters can be reduced to half.
[0152]
Note that the number of transform sub-blocks 2106 to be grouped is not limited. If more transform sub-blocks 2106 are grouped, the number of encoding parameters can be further reduced. As a result, by performing such grouping, it is possible to reduce the number of bits necessary for encoding the encoding parameters of the entire band. In the code multiplexing unit 1905, (1) the low frequency code 1910 from the low frequency encoding unit 1901, (2) the switching information 1319 from the switching determination unit 1306, and (3) the block length information from the conversion block length selection unit 1301. The output code 1913 is generated by multiplexing the high frequency code 1912 from 1310 and the high frequency encoding unit 1904.
[0153]
Note that grouping for sharing the encoding parameter may be performed for a plurality of high frequency encoding subbands in the frequency axis direction in the same manner as for a plurality of transform subblocks in the time axis direction. Is possible.
[0154]
In the above example, the transform sub-block 2106 is grouped according to the grouping information 1911 (first grouping information) obtained from the low frequency encoding unit 1901. However, the low frequency grouping information 1911 is Instead, the grouping may be performed independently on the high frequency side, the second grouping information indicating the grouping result may be generated, and the transform sub-block 2106 may be grouped in the high frequency encoding unit.
[0155]
FIG. 21 is a block diagram showing a configuration of another encoding apparatus 2200 according to Embodiment 8. In FIG. The encoding apparatus 2200 is an encoding apparatus that independently performs grouping for each high-frequency transform subblock 2106, and includes a transform block length selection unit 1301, a similarity determination unit 1312 (a noise determination unit 1304, a prediction error calculation unit). 1305, switching determination unit 1306), time-frequency conversion unit 1701, band division unit 1702, low frequency encoding unit 1703, high frequency encoding unit 1904 (first high frequency encoding unit 1902, second high frequency code). A grouping information generation unit 2201 and a code multiplexing unit 2202.
[0156]
The grouping information generation unit 2201 examines the similarity of the transform subblock 2106 in the high frequency coefficient 1316 band-divided by the band divider 1702 and groups the transform subblocks 2106 having high similarity. Further, second grouping information 2210 representing the grouping result is generated and output to highband encoding section 1904 and code multiplexing section 2202.
[0157]
The high frequency encoding unit 1904 selected by the switching information 1319 determines the encoding cell 2105 of the high frequency unit according to the input second grouping information 2210, and encodes each transform subblock in the encoding cell 2105. Generating a high-frequency code 2211 in which the conversion parameters are shared. The code multiplexing unit 2202 includes: (1) second grouping information 2210 from the grouping information generation unit 2201, (2) highband code 2211 from the highband coding unit 1904, and (3) lowband coding unit 1703. The low-frequency code 1707, (4) the switching information 1319 from the switching determination unit 1306 and (5) the block length information 1310 from the converted block length selection unit 1301 are multiplexed to generate an output code 2212.
[0158]
Since the first grouping information 1911 obtained from the low frequency encoding unit 1901 illustrated in FIG. 18 is generated based on the low frequency coefficient 1706 used in the low frequency encoding, the first grouping information 1911 is not necessarily the high frequency coefficient 1316. It is not always suitable for grouping. On the other hand, if the grouping information 2210 is generated uniquely on the high frequency side based on the high frequency coefficient 1316, it is possible to perform grouping more suitable for encoding the high frequency coefficient 1316, and encoding in the high frequency part Can improve the quality. In this case, however, the second grouping information 2210 uniquely generated on the high frequency side needs to be encoded and multiplexed to the output code 2212.
[0159]
In the configuration of the encoding device 2200, no grouping is performed in the low frequency encoding unit 1703, but the low frequency in the encoding device 1900 shown in FIG. 18 is used instead of the low frequency encoding unit 1703 in the encoding device 2200. An encoding unit 1901 may be provided, and the low-frequency encoding unit 1901 may be configured to perform short block grouping.
[0160]
In the encoding device 2200 shown in FIG. 21, the low frequency band and the high frequency band are grouped separately, but the first grouping information obtained from the low frequency encoding unit and the high frequency encoding side independently generate it. It is also possible to select and use any of the second grouping information. FIG. 22 is a block diagram showing a configuration of still another encoding apparatus 2300 according to Embodiment 8. In FIG. The encoding apparatus 2300 selectively uses either the first grouping information indicating the grouping of the low-frequency short blocks or the second grouping information indicating the grouping of the high-frequency transform sub-block to encode the high-frequency band. An encoding apparatus that determines a coded cell, which includes a transform block length selection unit 1301, a similarity determination unit 1312 (noise determination unit 1304, prediction error calculation unit 1305, switching determination unit 1306), time-frequency conversion unit 1701, Band division section 1702, low band encoding section 1901, high band encoding section 1904 (first high band encoding section 1902, second high band encoding section 1903), grouping information generation section 2301, grouping information switching section 2302 and a code multiplexing unit 2303.
[0161]
The grouping information generation unit 2301 once groups the high frequency coefficient 1316 according to the first grouping information 1911. In this process, for example, the grouping information generation unit 2301 calculates the difference error between the transform sub-blocks grouped in the same group, and if the calculated difference error value exceeds a predetermined threshold, The grouping based on one grouping information 1911 is canceled, and the grouping is performed independently based on the high frequency coefficient 1316. As a result of the grouping, the grouping information generation unit 2301 generates second grouping information 2210 indicating the high band-specific grouping result (that is, which conversion subblock and which conversion subblock are grouped).
[0162]
Further, grouping information switching information 2311 is output to the grouping information switching unit 2302, and the grouping information switching unit 2302 is switched to select the second grouping information 2210. According to the grouping information 2310 selected in this way, the high frequency encoding unit 1904 encodes the high frequency coefficient 1316 based on the similarity information 1318 with the low frequency coefficient 1706, and outputs a high frequency code 2312. The code multiplexing unit 2303 generates (1) the high frequency code 2312 from the high frequency encoding unit 1904, (2) the low frequency code 1910 from the low frequency encoding unit 1901, and grouping information switching information 2311 generated in this way. Accordingly, when the second grouping information 2210 is selected by the grouping information switching unit 2302, the (3) grouping information 2310, the (4) block length information 1310 from the conversion block length selection unit 1301, and the switching determination unit 1306 are selected. The switching information 1319 from (5) is multiplexed to generate an output code 2313.
[0163]
As described above, in this configuration, usually, the first grouping information 1911 is used to group the high frequency coefficients 1316, and the first grouping information 1911 is not suitable for grouping the high frequency coefficients 1316. Since the second grouping information 2210 uniquely generated on the high frequency encoding side can be used only, the number of code bits necessary to represent the second grouping information 2210 can be reduced, and It is possible to improve the encoding quality. In this configuration, the second grouping information 2210 is encoded and multiplexed to the output code 2313 only when the second grouping information 2210 is selected.
[0164]
Here, the code multiplexing unit 2303 encodes the second grouping information 2210 based on the grouping information switching information 2311 from the grouping information generation unit 2301 and multiplexes the second grouping information 2210 into the output code 2313. It is not limited. For example, the grouping information generation unit 2301 adds an identification code indicating that the grouping information is unique to the high frequency band to the second grouping information 2210. On the other hand, when the identification code indicating the second grouping information 2210 is added, the selected high frequency encoding unit 1904 incorporates the second grouping information 2210 into the high frequency code 2312 and encodes it. However, the first grouping information 1911 to which no identification code is added may not be incorporated into the high frequency code 2312. In this case, the code multiplexing unit 2303 does not need to encode and multiplex the grouping information 2310 and the grouping information switching information 2311. The code multiplexing unit 2303 includes the switching information 1319, the block length information 1310, the low frequency code 1910, and It is only necessary to multiplex the high frequency code 2312.
[0165]
Next, the configuration of the decoding apparatus according to the eighth embodiment will be described with reference to FIG. FIG. 23 is a block diagram showing a configuration of decoding apparatus 2400 that decodes output code 1913 generated by encoding apparatus 1900 shown in FIG. 23 uses a decoding method corresponding to an encoding method called block grouping such as the MPEG-4 AAC method in the encoding device 1900 shown in FIG. 18 as a low-frequency decoding unit. In addition, the decoding apparatus is configured to perform high-frequency decoding using grouping information given from the low-frequency decoding unit, which includes a code division unit 2401, a low-frequency decoding unit 2402, and a first high frequency A decoding unit 2403, a second high frequency decoding unit 2404, a switching unit 2405, a spectrum synthesis unit 1607, and a frequency-time conversion unit 1608 are provided.
[0166]
In FIG. 23, the code division unit 2401 divides the input code 2410 into block length information 1610, switching information 1611, a low frequency code 2411, and a high frequency code 2412. The low frequency decoding unit 2402 decodes the low frequency code 2411 according to the transform block length indicated by the block length information 1610 and the grouping information 2413 included in the low frequency code 2411, and is an AAC encoding parameter. A low frequency coefficient 2415 expressed by the MDCT coefficient is output. At the same time, the low frequency decoding unit 2402 outputs grouping information 2413 obtained from the low frequency code 2411 to the switching unit 2405. This grouping information 2413 is input via the switching unit 2405 to one of the first highband decoding unit 2403 and the second highband decoding unit 2404 selected by the switching information 1611. The selected first high frequency decoding unit or second high frequency decoding unit includes block length information 1610, high frequency code 2412, and low frequency coefficient 2415 (selected by the second high frequency decoding unit 2404). The high frequency coefficient 2414 is generated from the grouping information 2413.
[0167]
That is, if the selected high frequency decoding unit indicates that the basic transform block is used by the block length information 1610, the selected high frequency decoding unit performs a decoding process in units of encoded cells based on the basic transform block, Conversely, when it is indicated that a transform subblock is used, decoding processing is performed in units of coded cells based on the transform subblock group indicated by the grouping information 2413.
[0168]
The coding cell based on the transform sub-block group includes a set of MDCT coefficients corresponding to a plurality of transform sub-blocks, and each MDCT coefficient pair is decoded according to the same coded decoding parameter. It becomes. Regarding the operations of the first highband decoding section 2403 and the second highband decoding section 2404 that calculate the MDCT coefficients in the coding cell from the coding parameters, the transform subblock group indicated by the grouping information 2413 17 is the same as the first high frequency decoding unit 1605 and the second high frequency decoding unit 1606 in the decoding apparatus 1800 shown in FIG. Is omitted.
[0169]
As described above, since the decoded low frequency coefficient 2415 and high frequency coefficient 2414 are MDCT coefficients based on the same transform block length, the synthesis of both can be realized by a simple addition process. The spectrum synthesizer 1607 adds the low frequency coefficient 2415 and the high frequency coefficient 2414 to calculate a synthesized spectral coefficient 2416. The frequency-time conversion unit 1608 generates and outputs an output signal 2417 by performing inverse MDCT conversion on the obtained combined spectral coefficient 2416.
[0170]
By adopting such a configuration, the encoding apparatus 1900 encodes the input signal 207 with a smaller bit amount by grouping the transform sub-blocks, and the decoding apparatus 2400 encodes with such a small bit amount. The received input code 2410 can be correctly decoded.
[0171]
When the second grouping information uniquely generated on the high frequency encoding side is multiplexed on the input code of the decoding device, as in the output code 2212 by the encoding device 2200 shown in FIG. As shown in FIG. 24, the decoding is performed based on the second grouping information. FIG. 24 is a block diagram showing a configuration of decoding apparatus 2500 that decodes output code 2212 by encoding apparatus 2200 shown in FIG. The decoding device 2500 is a decoding device that decodes, as an input code, an output code when block grouping is separately performed in a low frequency and a high frequency in the encoding device, and includes a code division unit 2501, a low frequency A decoding unit 2502, a first high frequency decoding unit 2503, a second high frequency decoding unit 2504, a switching unit 2405, a spectrum synthesis unit 1607, and a frequency-time conversion unit 1608 are provided.
[0172]
The code dividing unit 2501 divides the input code 2510 into a low frequency code 2411, second grouping information 2511, high frequency code 2512, block length information 1610, and switching information 1611. The low frequency decoding unit 2502 is different from the low frequency decoding unit 2402 illustrated in FIG. 23 in that the first grouping information included in the low frequency code 2411 is not output to the switching unit 2405.
[0173]
In other cases, the low frequency decoding unit 2502 uses the first grouping information included in the low frequency code 2411 when the block grouping is performed in the short block, similarly to the low frequency decoding unit 2402. Therefore, the low frequency code 2411 is decoded, and the low frequency coefficient 2415 expressed by the MDCT coefficient is output to the spectrum synthesis unit 1607 and the second high frequency decoding unit 2504.
[0174]
The first high frequency decoding unit 2503 and the second high frequency decoding unit 2504 decode the high frequency code 2512 uniquely grouped according to the second grouping information 2511 as shown in FIG. Different from the first high frequency decoding unit 2403 and the second high frequency decoding unit 2404 shown in FIG. However, in terms of hardware, the first high frequency decoding unit 2503 and the second high frequency decoding are the same in that the high frequency code 2412 is decoded in accordance with the grouping information input via the switching unit 2405. The converting unit 2504, the first high frequency decoding unit 2403, and the second high frequency decoding unit 2404 are exactly the same.
[0175]
In the encoding device 2200, the high frequency code 2512 is block-grouped based on the similarity between adjacent transform sub-blocks in the high frequency, regardless of the block grouping in the low frequency short block. Since the second grouping information 2511 indicates how block subgroups of the high frequency code 2512 are grouped, the first high frequency decoding unit 2503 and the second high frequency decoding The unit 2504 can correctly determine the encoded cell of the high-frequency code 2512 according to the second grouping information 2511.
[0176]
As described above, the high frequency code 2512 is block grouped based on the similarity of the high frequency transform sub-blocks in the encoding device 2200, and thus is obtained by diverting the low frequency block grouping to the high frequency as it is. Compared to the high frequency coefficient 2414 of the decoding device 2400, it can be said that the encoding is more faithful to the original sound. With the decoding apparatus 2500 having such a configuration, the input code obtained by grouping the transform sub-blocks in the encoding apparatus 2200 so as to be optimal with respect to the high frequency coefficients and encoding more efficiently is obtained. Decoding is performed in the decoding device 2500, and a higher quality output signal can be obtained.
[0177]
Further, like the output code from the encoding apparatus 2300 shown in FIG. 22, the grouping information used for the block grouping of the high frequency coefficients differs according to the similarity between the high frequency coefficients and the low frequency coefficients. A decoding apparatus for decoding the input code in this case will be described. FIG. 25 is a block diagram showing a configuration of decoding apparatus 2600 that decodes an output code from encoding apparatus 2300 shown in FIG. 22 as an input code. Decoding apparatus 2600 decodes the high frequency code according to the second grouping information when the second grouping information is multiplexed with the input code, and the second grouping information is not multiplexed with the input code. In this case, the decoding apparatus decodes the high frequency code according to the first grouping information, and includes a code division unit 2601, a grouping information switching unit 2602, a first high frequency decoding unit 2603, and a second high frequency decoding. A conversion unit 2604, a low-band decoding unit 2402, a switching unit 2405, a spectrum synthesis unit 1607, and a frequency-time conversion unit 1608.
[0178]
The code dividing unit 2601 divides the input code 2610 into second grouping information 2611, a high frequency code 2612, switching information 1611, block length information 1610, and a low frequency code 2411. In this input code 2610, the second grouping information 2611 is multiplexed only when the second grouping information 2611 uniquely generated for the high frequency band is used.
[0179]
The grouping information switching unit 2602 receives the second grouping information 2611 output from the code division unit 2601 and the first grouping information 2413 output from the low frequency decoding unit 2402. When the second grouping information 2611 is multiplexed on the input code 2610, the grouping information switching unit 2602 uses the second grouping information 2611 output from the code dividing unit 2601 among the two inputs as a high frequency. The second grouping information 2611 is selected to be input to the decoding unit. Conversely, when the second grouping information 2611 is not multiplexed on the input code 2610, the first grouping information 2413 is selected so that the first grouping information 2413 is input to the high frequency decoding unit. .
[0180]
Accordingly, the first high frequency decoding unit 2603 or the second high frequency decoding unit 2604 selected according to the switching information 1611 outputs the grouping of the selected one that is an output from the grouping information switching unit 2602. Based on the information, the high-frequency code 2612 can be decoded. That is, when the second grouping information 2611 is multiplexed on the input code 2610, decoding is performed based on the second grouping information 2611, and the second grouping information 2611 is multiplexed on the input code 2610. If not, decoding is performed based on the first grouping information 2413 from the low frequency decoding unit 2402.
[0181]
Here, the grouping information switching unit 2602 detects the presence / absence of data in the second grouping information 2611, and if there is data in the second grouping information 2611, selects the second grouping information 2611. Although the function of selecting the first grouping information 2413 is provided, other methods may be used to switch between the first grouping information 2413 and the second grouping information 2611.
[0182]
For example, a control line from the code division unit 2601 to the grouping information switching unit 2602 is provided, and the code division unit 2601 determines whether or not the second grouping information 2611 is multiplexed on the input code 2610. A control signal for switching the grouping information switching unit 2602 based on the result may be output to the grouping information switching unit 2602.
[0183]
Alternatively, on the encoding device 2300 side illustrated in FIG. 22, for example, the grouping information generation unit 2301 generates grouping information switching information 2311 indicating the switching result of the grouping information switching unit 2302, and this is processed by the high frequency encoding unit 1904. It may be multiplexed with the high frequency code 2312. Also in this case, for example, a control line from the code division unit 2601 to the grouping information switching unit 2602 is provided, and the code division unit 2601 separates the grouping information switching information 2311 from the high frequency code 2612 and outputs it to the control line. The information switching unit 2602 may be switched.
[0184]
Further, without providing the grouping information switching unit 2602, both the first grouping information 2413 and the second grouping information 2611 are input to the selected high frequency decoding unit via the switching unit 2405. In addition, if there is data of the second grouping information 2611 in the selected high frequency decoding unit, the second grouping information 2611 may be used in preference to the first grouping information 2413.
[0185]
By adopting such a configuration, the encoding apparatus 2300 groups transform sub-blocks so as to be optimal with respect to the high frequency coefficient, and the decoding apparatus 2600 converts the input code encoded with a smaller bit amount. It can be decoded and a higher quality output signal can be obtained. In the decoding device 2600, although the bit amount of the input code is larger than that of the decoding device 2400, a high-quality output signal closer to the original sound can be obtained. In addition, the decoding apparatus 2600 can obtain an output signal whose input code bit amount is smaller than that of the decoding apparatus 2500 but whose quality is not deteriorated so much.
[0186]
In the description of the encoding device and the decoding device according to the seventh embodiment and the eighth embodiment, signals or coefficients based on the same sampling frequency are used in the high frequency encoding unit and the low frequency encoding unit. Although used, different sampling frequencies can be used for each. As an example, a case will be described in which conversion is performed using a basic conversion block and a sampling frequency of 24 kHz is used in low-frequency encoding and 48 kHz in high-frequency encoding.
[0187]
First, assuming that 2048 samples are used as the basic transform block length and 2048 samples of the input signal having a sampling frequency of 48 kHz are converted to obtain 2048 MDCT coefficients, this MDCT coefficient is a coefficient representing a signal of 0 to 24 kHz. Among these, if the low frequency side 1024 points of the 2048 MDCT coefficients are taken out as the low frequency coefficient, the low frequency coefficient becomes a coefficient representing a signal of 0 to 12 kHz. If the low frequency coefficient 1024 points are subjected to inverse MDCT conversion, a time signal of 1024 samples can be obtained, and the number of samples of the time signal is halved, that is, down-sampled to a half sampling frequency.
[0188]
If the 2048 MDCT coefficients obtained by direct MDCT conversion of the input signal are based on a sampling frequency of 48 kHz, the low-frequency MDCT coefficient obtained by extracting the low-frequency side 1024 points is set to a sampling frequency of 24 kHz. It can be said that the coefficient is based on this. If the 1024 low frequency MDCT coefficients and the 2048 global MDCT coefficients obtained in this way are input to the low frequency encoding unit and the high frequency encoding unit, respectively, the low frequency encoding unit and the high frequency encoding unit A configuration using a different sampling frequency in the area encoding unit can be realized.
[0189]
In this configuration, the transform block length of the low-frequency encoding is 1024 samples with respect to the transform block length of 2048 samples of the high-frequency encoding unit, and both are different, but the respective sampling frequencies are 48 kHz and 24 kHz. The transform block length as time is equal to 2048/48000 = 1024/24000. In other words, it is a necessary condition for realizing this configuration that the transform block length of the low frequency encoding and the transform block length of the high frequency encoding are temporally equal.
[0190]
Similarly for decoding, the conversion block length of the low-frequency decoding unit and the conversion block length of the high-frequency decoding unit are set to be temporally equal, and sampling at 24 kHz decoded by the low-frequency decoding unit The 1024-point low frequency MDCT coefficient based on the frequency is regarded as a low-frequency 1024-point portion of the 2048-point MDCT coefficient based on the 48 kHz sampling frequency after the upsampling, and is decoded as the MDCT coefficient based on the 48 kHz sampling. What is necessary is just to comprise so that it may synthesize | combine with a coefficient.
[0191]
In the description of the encoding device and the decoding device according to the sixth to eighth embodiments, the first and second high-frequency encoding units or the first and second high-frequency encoding units are provided for each encoding cell. However, when a transform subblock is used, a configuration that can be switched and used for each transform subblock or transform subblock group may be used. When transform sub-blocks are used in encoding, the input signal has a large change in the time axis direction, but often has a small change in the frequency axis direction. Therefore, the same high frequency encoding unit can be selected for a plurality of high frequency encoding subbands in the frequency axis direction included in the same transform subblock or transform subblock group. According to this configuration, it is only necessary to encode the switching information of the encoding unit not in units of encoding cells but in units of transform subblocks or transform subblock groups, thereby reducing the bit amount necessary for encoding the switching information. it can.
[0192]
Note that the encoding device and the decoding device according to Embodiments 1 to 8 of the present invention are realized as an encoding method and a decoding method by linking processing procedures for realizing the functions of the respective units. Can do. Each processing procedure is realized by describing each as software using a programming language and executing the software on the processor. Software written using a programming language is recorded on a recording medium such as a ROM (Read Only Memory) or a hard disk drive, and is executed on a processor connected to the recording medium directly or via a network such as the Internet. The functions of the encoding device and the decoding device according to Embodiments 1 to 8 of the present invention can be realized.
[0193]
The encoding apparatus according to the present invention is an acoustic encoding apparatus provided in a satellite broadcasting station including BS and CS, and an acoustic encoding of a content distribution server that distributes content via a communication network such as the Internet. Further, the apparatus is useful as an audio signal encoding program executed by a general-purpose computer and a recording medium on which the program is recorded.
[0194]
The decoding device according to the present invention is not only an acoustic decoding device provided in a home STB, but also as a program for decoding an acoustic signal executed by a general-purpose computer and a recording medium recording the program. It is useful as a dedicated circuit board for decoding acoustic signals, LSI, etc. provided in an STB or general-purpose computer, and further as an IC card inserted into the STB or general-purpose computer.
[0195]
【The invention's effect】
In order to achieve the above object, the encoding apparatus of the present invention splits an input signal into a low frequency signal representing a low frequency band signal and a high frequency signal representing a high frequency band signal, A low-frequency encoding unit that encodes the generated low-frequency signal and generates a low-frequency code; a similarity determination unit that determines a similarity between the high-frequency signal and the low-frequency signal; and the high-frequency signal. Based on the similarity, a plurality of high frequency encoding units that encode with different encoding methods and generate a high frequency code, and select one of the plurality of high frequency encoding units to select the selected high frequency encoding unit. A coding selection unit for causing the band coding unit to perform coding and generating selection information for specifying a coding method of the selected high band coding unit; and a low band code from the low band coding unit The high-frequency code from the selected high-frequency encoding means and the encoding selection method Multiplexes selection information from, and a code multiplexing unit configured to generate an output code.
[0196]
Further, the decoding device for decoding the output code encoded by the encoding device includes a low frequency decoding means for decoding a low frequency signal representing a signal of a low frequency band from the input code, and an input code. A plurality of high frequency decoding means for decoding a high frequency signal representing a signal of a high frequency band using different decoding methods, and the high frequency decoding means based on the selection information included in the input code. Decoding selection means for selecting one and causing the selected high-frequency decoding means to perform decoding, the low-frequency signal decoded by the low-frequency decoding means, and the selected high-frequency decoding Signal synthesizing means for synthesizing the high frequency signal decoded by the means to generate an output decoded signal.
[0197]
By adopting the above configuration, the input signal is separated into a low frequency band signal and a high frequency band signal, and the high frequency band signal is reduced with a small amount of bits based on the similarity to the low frequency band signal. Since encoding can be performed appropriately, a high-quality reproduction signal can be obtained from a code having a smaller bit amount.
[0198]
Further, according to another encoding device and decoding device of the present invention, a signal obtained by decoding a code encoded by the low frequency encoding means as a low frequency band signal when calculating the similarity. Since a more appropriate value can be obtained as the similarity, a high-quality reproduction signal can be obtained from a code having a smaller bit amount.
[0199]
In addition, according to another encoding device and decoding device of the present invention, it is possible to increase the encoding efficiency of the low-frequency encoding means by down-sampling a signal in a low frequency band, so that a smaller bit amount A high-quality reproduction signal can be obtained from the above code.
[0200]
Further, according to still another encoding device and decoding device of the present invention, it is possible to divide a spectrum coefficient in a high frequency band and apply an optimum encoding method to each of the divided coefficients. Therefore, a high-quality reproduction signal can be obtained from a code having a smaller bit amount.
[0201]
According to still another encoding device and decoding device of the present invention, spectral coefficients encoded by the low-frequency encoding means and the high-frequency encoding means can be calculated in a lump, so that time-frequency The amount of calculation required for conversion can be reduced. Furthermore, since the spectrum coefficients decoded by the low frequency decoding means and the high frequency decoding means can be converted into a time signal at once, the amount of calculation required for frequency-time conversion can be reduced.
[0202]
Further, according to still another encoding device and decoding device of the present invention, the input signal is separated into a low frequency band signal and a high frequency band signal, and a conventional method is applied to a low frequency band signal. Applying the coding method, for high frequency band signals, based on the similarity determination means for determining the similarity between the high frequency band signal and the low frequency band signal, and the determination result of the similarity determination means Thus, a high-frequency encoding unit that uses a similarity with a signal in a low frequency band or encodes using a random noise signal is applied. By adopting such a configuration, for example, at a low bit rate of 64 kbps or less in stereo, it is possible to encode and decode a signal in a high frequency band that could not be encoded by the conventional method. The auditory quality of the acoustic signal can be improved.
[0203]
As is apparent from the above description, according to the encoding device and the decoding device of the present invention, the input signal is divided into a signal of a low frequency band and a signal of a high frequency band, By encoding using the similarity between the band signal and the low frequency band, it is possible to efficiently encode the high frequency band signal and obtain a high-quality decoded signal from a code with a smaller bit amount. it can.
[0204]
Further, by using spectral coefficients of the same format for low frequency band coding and high frequency band coding, a high-quality decoded signal can be obtained with a smaller amount of processing.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 1. FIG.
FIG. 2 (a) is a diagram showing two sine waves having different frequencies as time signals. FIG. 2B is a diagram showing two sine waves having different frequencies as frequency signals.
FIG. 3 is a block diagram showing a configuration of a decoding apparatus according to the first embodiment.
4 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 2. FIG.
FIG. 5 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 3.
6 is a block diagram showing a configuration of a decoding apparatus that decodes an output code encoded by the encoding apparatus shown in FIG. 5. FIG.
FIG. 7 is a block diagram showing a configuration of a part that generates a high-frequency code and switching information in the encoding apparatus according to Embodiment 4;
FIG. 8 (a) is a diagram illustrating a coding method of a first highband coding unit. FIG. 8B is a diagram illustrating an encoding method of the second high frequency encoding unit.
9 is a block diagram showing a configuration of a portion (high frequency decoding device) that decodes a high frequency code in a decoding device corresponding to the encoding device shown in FIG. 7;
10 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 5. FIG.
11 is a block diagram illustrating a configuration of a decoding apparatus that decodes an input code that is an output code of the encoding apparatus illustrated in FIG. 10;
12 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 6. FIG.
13 is a diagram illustrating a high frequency coefficient and a low frequency coefficient when a basic transform block is selected in the transform block length selection unit illustrated in FIG.
14 is a diagram illustrating a high frequency coefficient and a low frequency coefficient when a transform sub-block is selected in the transform block length selection unit illustrated in FIG.
15 is a block diagram showing a configuration of a decoding device that decodes an output code output by the encoding device shown in FIG. 12. FIG.
FIG. 16 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 7.
17 is a block diagram showing a configuration of a decoding device that decodes an output code encoded by the encoding device shown in FIG. 16. FIG.
FIG. 18 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 8.
FIG. 19 (a) is a diagram illustrating an example of a global coefficient output in accordance with block length information from the time-frequency conversion unit of the encoding device illustrated in FIG. FIG.19 (b) is a figure which shows an example of the high frequency coefficient input into the high frequency encoding part of the encoding apparatus shown in FIG. FIG.19 (c) is a figure which shows an example of the low frequency coefficient input into the low frequency encoding part of the encoding apparatus shown in FIG.
20 is a diagram illustrating a relationship between a high frequency coefficient and a low frequency coefficient when grouping is performed in the low frequency encoding unit of the encoding device illustrated in FIG.
FIG. 21 is a block diagram showing a configuration of another encoding apparatus according to Embodiment 8. FIG.
FIG. 22 is a block diagram showing a configuration of still another encoding device according to Embodiment 8. FIG.
FIG. 23 is a block diagram illustrating a configuration of a decoding apparatus that decodes an output code generated by the encoding apparatus illustrated in FIG. 18;
24 is a block diagram showing a configuration of a decoding apparatus that decodes an output code by the encoding apparatus shown in FIG. 21. FIG.
25 is a block diagram showing a configuration of a decoding apparatus that decodes an output code from the encoding apparatus shown in FIG. 22 as an input code.
FIG. 26 is a block diagram illustrating a configuration of an encoding device using a conventional AAC scheme.
[Explanation of symbols]
200 Encoder
201 Band division unit
202 Low frequency encoding unit
203 Similarity determination unit
204 switching part
205 High frequency encoding unit
206 Code multiplexer
207 Input signal
208 Low frequency signal
209 High frequency signal
210 Switching information
211 Similarity information
212 High frequency code
213 Low frequency code
214 Output code
400 Decoding device
401 code division unit
402 Low frequency decoding unit
403 switching unit
404 High-frequency decoding unit
405 Signal synthesis unit
406 Input code
407 Low frequency code
408 High frequency code
409 Switching information
410 Low frequency signal
411 High frequency signal
412 Output signal

Claims

Band dividing means for dividing an input signal into a low-frequency signal representing a signal in a low frequency band and a high-frequency signal representing a signal in a high frequency band;
Low frequency encoding means for encoding the divided low frequency signal and generating a low frequency code;
Similarity determination means for determining the similarity between the high frequency signal and the low frequency signal;
A plurality of high frequency encoding means for encoding the high frequency signal with different encoding methods to generate a high frequency code;
An encoding method of the selected high-frequency encoding unit by selecting one of the plurality of high-frequency encoding units based on the similarity and causing the selected high-frequency encoding unit to perform encoding. Encoding selection means for generating selection information for specifying
Code multiplexing means for multiplexing the low frequency code from the low frequency encoding means, the selected high frequency code from the high frequency encoding means, and the selection information from the encoding selection means to generate an output code An encoding device comprising:

The encoding device further includes:
Comprising low frequency decoding means for decoding the low frequency code generated by the low frequency encoding means,
The similarity determination unit determines a similarity between the high frequency signal obtained from an input signal and a low frequency band decoded signal generated by the low frequency decoding unit. The encoding device according to 1.

The encoding device further includes:
It has a downsampling means for downsampling a low frequency signal obtained by band division,
The encoding apparatus according to claim 1, wherein the low-frequency encoding means encodes the down-sampled low-frequency signal.

The encoding device further includes:
A second band dividing means for dividing the high frequency signal into a plurality of subbands;
The similarity determination means determines the similarity between the signal in the subband and the low-frequency signal for each subband,
The encoding selection unit selects the high-frequency encoding unit corresponding to the similarity for each subband,
The encoding apparatus according to claim 1, wherein the selected high-frequency encoding unit encodes the signal in the subband.

The encoding selection unit selects a high-frequency encoding unit that encodes only the amplitude parameter of the signal in the subband when the similarity indicates a low similarity. The encoding device according to claim 4.

When the similarity indicates a high similarity, the encoding selection means includes a positional parameter indicating a positional relationship between a signal in the subband and a similar low frequency signal, and a difference between the two signals. 5. The encoding apparatus according to claim 4, wherein a high-frequency encoding unit that encodes a difference parameter indicating λ and an amplitude parameter of a signal in the subband is selected.

When the similarity indicates very high similarity, the encoding selection means includes a positional parameter indicating a positional relationship between a signal in the subband and a low frequency signal similar to the subband, and the subband 5. The encoding apparatus according to claim 4, wherein high-frequency encoding means for encoding the amplitude parameter of the signal is selected.

The encoding device further includes:
Time-frequency conversion means for converting the band-divided high-frequency signal and low-frequency signal into spectral coefficients each representing a frequency component;
A second band dividing means for dividing a spectral coefficient of a high frequency band into a plurality of subbands;
The similarity determination means includes a noise characteristic determination unit that determines noise characteristics of the divided spectrum coefficients of each subband, and a prediction error between the spectrum coefficient of each subband and a spectrum coefficient selected from a low frequency band. A prediction error calculation unit that calculates the degree of similarity between the two,
The plurality of high frequency encoding means generate an amplitude parameter representing the amplitude of the spectral coefficient of each subband, and a first high frequency encoding means for encoding only the generated amplitude parameter; Based on the position parameter of the low frequency spectrum coefficient similar to the spectral coefficient of each subband, the amplitude parameter of each subband, and the difference parameter between the spectral coefficient of each subband and the low frequency spectrum coefficient similar thereto Second high-frequency encoding means for
The low frequency encoding means encodes a spectral coefficient representing the low frequency component,
The encoding apparatus according to claim 1, wherein the encoding selection unit selects one of the high-frequency encoding units by comprehensively determining the determined noise characteristic and the similarity. .

The encoding device further includes:
Analyzing the characteristics of the input signal, determining the transform block length of the time-frequency transform for the input signal, and comprising transform block length selection means for generating block length information representing the determined transform block length,
The time-frequency conversion means converts a high frequency signal and a low frequency signal into a high frequency coefficient and a low frequency coefficient, respectively, with the conversion block length selected by the conversion block length selection means,
9. The encoding apparatus according to claim 8, wherein the code multiplexing means multiplexes the block length information in addition to the low frequency code, the high frequency code and the selection information.

The encoding device further includes:
Time-frequency conversion means for converting the band-divided high-frequency signal and low-frequency signal into spectral coefficients each representing a frequency component;
A second band dividing means for dividing a spectral coefficient of a high frequency band into a plurality of subbands;
The similarity determination means includes a noise characteristic determination unit that determines noise characteristics of the divided spectrum coefficients of each subband, and a prediction error between the spectrum coefficient of each subband and a spectrum coefficient selected from a low frequency band. A prediction error calculation unit that calculates the degree of similarity between the two,
The plurality of high frequency encoding means generate an amplitude parameter representing the amplitude of the spectral coefficient of each subband, and a first high frequency encoding means for encoding only the generated amplitude parameter; And a third high-frequency encoding means for encoding only two pieces of information, a low-band spectral coefficient position parameter similar to the spectral coefficient of each sub-band and an amplitude parameter of each sub-band,
The low frequency encoding means encodes a spectral coefficient representing the low frequency component,
The encoding apparatus according to claim 1, wherein the encoding selection unit selects one of the high-frequency encoding units by comprehensively determining the determined noise characteristic and the similarity. .

The encoding device further includes:
A second band dividing means for dividing the divided high frequency signal into a plurality of subbands;
A time-frequency conversion means for converting the divided low-frequency signal and the high-frequency signal divided into the plurality of subbands into spectral coefficients each representing a frequency component;
The similarity determination means includes a noise characteristic determination unit that determines noise characteristics of the divided spectrum coefficients of each subband, and a prediction error between the spectrum coefficient of each subband and a spectrum coefficient selected from a low frequency band. A prediction error calculation unit that calculates the degree of similarity between the two,
The plurality of high frequency encoding means generate an amplitude parameter representing the amplitude of the spectral coefficient of each subband, and a first high frequency encoding means for encoding only the generated amplitude parameter; Based on the position parameter of the low band spectral coefficient similar to the spectral coefficient of each subband, the amplitude parameter of each subband, and the difference parameter between the spectral coefficient of each subband and the low band spectral coefficient similar thereto. Second high frequency encoding means for encoding,
The low frequency encoding means encodes a spectral coefficient representing the low frequency component,
The encoding apparatus according to claim 1, wherein the encoding selection unit selects one of the high-frequency encoding units by comprehensively determining the determined noise characteristic and the similarity. .

The encoding device further includes:
A time-frequency conversion means for converting the input time signal into spectral coefficients indicating frequency components in all frequency bands of the input signal;
The band dividing means divides the spectral coefficient into a low frequency band coefficient and a high frequency band coefficient,
The low frequency encoding means encodes the coefficient of the divided low frequency band,
The similarity determination means determines the similarity between the high frequency band coefficient and the low frequency band coefficient,
2. The encoding apparatus according to claim 1, wherein the plurality of high frequency encoding means generate the high frequency code by encoding the coefficient of the high frequency band.

The band dividing means further divides the spectral coefficient of a high frequency band into a plurality of subbands,
The similarity determination means includes a noise characteristic determination unit that determines noise characteristics of the divided spectrum coefficients of each subband, and a prediction error between the spectrum coefficient of each subband and a spectrum coefficient selected from a low frequency band. A prediction error calculation unit that calculates the degree of similarity between the two,
The plurality of high frequency encoding means generate an amplitude parameter representing the amplitude of the spectral coefficient of each subband, and a first high frequency encoding means for encoding only the generated amplitude parameter; Based on the position parameter of the low band spectral coefficient similar to the spectral coefficient of each subband, the amplitude parameter of each subband, and the difference parameter between the spectral coefficient of each subband and the low band spectral coefficient similar thereto. Second high frequency encoding means for encoding,
13. The encoding apparatus according to claim 12, wherein the encoding selection unit selects one of the high-frequency encoding units by comprehensively determining the determined noise characteristic and the similarity. .

The encoding device further includes:
Analyzing the characteristics of the input signal, determining the transform block length of the time-frequency transform for the input signal, and comprising transform block length selection means for generating block length information representing the determined transform block length,
The time-frequency conversion means converts the input signal into a spectral coefficient indicating the frequency components of all frequency bands of the input signal with the conversion block length selected by the conversion block length selection means,
14. The encoding apparatus according to claim 13, wherein the code multiplexing means further multiplexes the block length information in addition to the low frequency code, the high frequency code and the selection information.

The encoding device further includes:
When a short transform block length is selected by the transform block length selection unit, the similarity of high frequency coefficients included in each transform block having the transform block length as a transform unit is determined, and the similarity is determined to be high. Classification means for classifying the high frequency coefficients of the transform block into groups;
Classification information generating means for generating classification information indicating a classification result of the classification means,
The plurality of high frequency encoding means collectively encode high frequency coefficients in a plurality of transform blocks included in one group,
15. The encoding according to claim 14, wherein the code multiplexing means further multiplexes the classification information in addition to the low frequency code, the high frequency code, the selection information, and the block length information. apparatus.

The classification means compares the low-frequency coefficients included in the transform block for each transform block, and if the low-frequency coefficients of the transform blocks are similar, the high-frequency coefficient included in the same transform block is 16. The encoding apparatus according to claim 15, wherein it is determined that the similarity is high every time, and the low frequency coefficient and the high frequency coefficient are classified into the same group.

The classification means includes
When the low frequency coefficients included in the transform block are compared for each transform block and the low frequency coefficients of the transform block are similar, the high frequency coefficient included in the same transform block is also similar for each transform block. A first determination unit that determines that the
A second determination unit that compares the high frequency coefficients included in the transform block for each transform block and determines the similarity of the high frequency coefficients of the transform block;
The classification means selects either the determination by the first determination unit or the determination by the second determination unit, and classifies the high frequency coefficients of the transform block into groups based on the selected determination. The encoding device according to claim 15.

The band dividing means further divides the spectral coefficient of a high frequency band into a plurality of subbands,
The similarity determination means includes a noise characteristic determination unit that determines noise characteristics of the divided spectrum coefficients of each subband, and a prediction error between the spectrum coefficient of each subband and a spectrum coefficient selected from a low frequency band. A prediction error calculation unit that calculates the degree of similarity between the two,
The plurality of high frequency encoding means generate an amplitude parameter representing the amplitude of the spectral coefficient of each subband, and a first high frequency encoding means for encoding only the generated amplitude parameter; And a third high-frequency encoding means for encoding only two pieces of information, a low-band spectral coefficient position parameter similar to the spectral coefficient of each sub-band and an amplitude parameter of each sub-band,
13. The encoding apparatus according to claim 12, wherein the encoding selection unit selects one of the high-frequency encoding units by comprehensively determining the determined noise characteristic and the similarity. .

Low-frequency decoding means for decoding a low-frequency signal representing a signal of a low frequency band from an input code;
A plurality of high frequency decoding means for decoding a high frequency signal representing a signal of a high frequency band from the input code using different decoding methods;
A decoding selection unit that selects one of the high frequency decoding units based on selection information included in an input code, and causes the selected high frequency decoding unit to perform decoding;
Signal synthesis for generating an output decoded signal by synthesizing the low-frequency signal decoded by the low-frequency decoding means and the high-frequency signal decoded by the selected high-frequency decoding means And a decoding device.

The decoding device further includes:
Comprising upsampling means for upsampling the decoded low-frequency signal;
The signal synthesizing means synthesizes the up-sampled low-frequency signal and the high-frequency signal decoded by the high-frequency decoding means to generate an output decoded signal. Item 20. The decoding device according to Item 19.

The decoding selection means selects the high frequency decoding means according to the selection information for each subband of a high frequency band divided into a plurality of subbands,
20. The decoding apparatus according to claim 19, wherein the selected high frequency decoding means decodes the high frequency signal in the subband.

22. The plurality of high frequency decoding means includes first high frequency decoding means for decoding a high frequency signal using a predetermined noise signal and the amplitude parameter of the subband. The decoding apparatus as described.

The plurality of high frequency decoding means indicate a positional parameter indicating a positional relationship between the decoded low frequency signal, a signal in the subband and a similar portion of the low frequency signal, and a difference between the two signals. The decoding apparatus according to claim 21, further comprising second high-frequency decoding means for decoding the high-frequency signal using the difference parameter and the amplitude parameter of the signal in the subband.

The plurality of highband decoding means, the decoded lowband signal, a positional parameter indicating a positional relationship between the signal in the subband and a similar portion of the lowband signal, and the signal in the subband The decoding apparatus according to claim 21, further comprising third high frequency decoding means for decoding the high frequency signal using the amplitude parameter.

The decoding device further includes:
A time-frequency conversion means for converting the low-frequency signal decoded by the low-frequency decoding means into a low-frequency coefficient that is a spectral coefficient with the transform block length indicated in the block length information included in the input code;
The plurality of high frequency decoding means are first high frequency decoding means for decoding a high frequency coefficient converted by the transform block length using a predetermined noise signal and an amplitude parameter included in an input code. A low frequency coefficient obtained by the conversion by the time-frequency conversion means, a position parameter indicating a position of a portion similar to the corresponding high frequency coefficient in the low frequency coefficient, an amplitude parameter of the high frequency coefficient, Second high-frequency decoding means for decoding the high-frequency coefficient using the low-frequency coefficient and the difference parameter of the high-frequency coefficient, and the signal synthesizing means is decoded by the selected high-frequency decoding means A spectral coefficient synthesizer for synthesizing the converted high frequency coefficient and the low frequency coefficient decoded by the low frequency decoding means, and a spectral coefficient decoded by the transform block length indicated in the block length information The Frequency converting between signal - decoding apparatus according to claim 19, characterized in that it comprises a time conversion unit.

The low frequency signal is a low frequency coefficient representing a spectral coefficient of a low frequency band, and the high frequency signal is a high frequency coefficient representing a spectral coefficient of a high frequency band divided into a plurality of subbands,
The plurality of high frequency decoding means, first high frequency decoding means for decoding the high frequency coefficient only by an amplitude parameter included in an input code, and the high frequency coefficient decoded (1) a position parameter indicating the positional relationship between the low frequency coefficient and the high frequency coefficient and a similar portion of the low frequency coefficient included in the input code, (2) the amplitude parameter, and the low frequency A second high-frequency decoding means for decoding according to the difference parameter between the coefficient and the high-frequency coefficient (3),
The decoding selection means selects one of the high frequency decoding means for each subband according to selection information included in an input code,
The signal synthesizing unit includes a spectrum synthesizing unit that synthesizes the decoded low-frequency coefficient and the selected high-frequency coefficient, and a synthesized spectrum coefficient. 20. The decoding apparatus according to claim 19, further comprising a frequency-time conversion unit for converting into a signal.

The low frequency coefficient and the high frequency coefficient are subjected to time-frequency conversion with the conversion block length indicated in the block length information included in the input code,
27. The decoding apparatus according to claim 26, wherein the frequency-time conversion unit converts the decoded spectral coefficient into a time signal with the conversion block length indicated in the block length information included in the input code. .

The selected high frequency decoding means, when the transform block length indicated in the block length information is a short transform block length, according to the classification information included in the input code, all the transform blocks in the same group 28. The decoding apparatus according to claim 27, wherein decoding is performed using a set of common encoding parameters for a plurality of transform blocks included in the group.

The selected high frequency decoding means, when the conversion block length indicated in the block length information is a short conversion block length, the conversion block in the same group according to the classification information decoded together with the low frequency coefficient 29. The decoding apparatus according to claim 28, wherein the decoding is performed.

The low-band decoding means, based on the first classification information included in the input code, sets all the transform blocks belonging to the same group to a set of common encoding parameters for a plurality of transform blocks included in the group. Decrypt using
The selected high frequency decoding means selects either one of the first classification information obtained from the low frequency decoding means and the second classification information included in the input code, and the selected classification information 29. The decoding apparatus according to claim 28, wherein the high frequency coefficient is decoded according to:

The low frequency signal is a low frequency coefficient representing a spectral coefficient of a low frequency band, and the high frequency signal is a high frequency coefficient representing a spectral coefficient of a high frequency band divided into a plurality of subbands,
The plurality of high frequency decoding means, the first high frequency decoding means for decoding the encoded high frequency coefficient only by the amplitude parameter of the high frequency coefficient, and the encoded high frequency coefficient (1) a position parameter indicating the positional relationship between the decoded low frequency coefficient, the high frequency coefficient and the low frequency coefficient included in the input code, and (2) the high frequency coefficient A third high frequency decoding means for decoding according to the amplitude parameter,
The decoding selection means selects one of the high frequency decoding means for each subband according to selection information included in an input code,
The signal synthesizing unit includes a spectrum synthesizing unit that synthesizes the decoded low-frequency coefficient and the selected high-frequency coefficient, and a synthesized spectrum coefficient. The decoding apparatus according to claim 19, further comprising: a frequency-time conversion unit that converts the signal into a signal.

The decoding device further includes:
Code division means for dividing the multiplexed input code into a plurality of codes,
The low frequency decoding means decodes the low frequency coefficient from the divided input code,
The plurality of high frequency decoding means decodes the high frequency coefficient from the divided input code,
The decoding selection means selects one of a plurality of high frequency decoding means based on the selection information included in the divided input code,
The signal synthesizing unit synthesizes the decoded low-frequency coefficient and high-frequency coefficient to generate a spectral coefficient for the entire band, and converts the synthesized spectral coefficient into a time signal. 20. The decoding device according to claim 19, further comprising a frequency-time conversion unit.

The decoding device further includes:
Code division means for dividing the multiplexed input code into a plurality of codes,
The low frequency decoding means decodes the low frequency coefficient from the divided input code,
The plurality of highband decoding means includes first highband decoding means for decoding the highband coefficients using only amplitude parameters included in the divided input codes, and the decoded lowband decoding means. (1) a position parameter, (2) an amplitude parameter, the low frequency coefficient, and the high frequency coefficient of the high frequency coefficient and the low frequency coefficient included in the divided input code. And (3) second high frequency decoding means for decoding the encoded high frequency coefficient using the difference parameter,
The decoding selection means selects one of the high-frequency decoding means for each subband based on selection information included in an input code,
The signal synthesizing unit synthesizes the decoded low-frequency coefficient and the high-frequency coefficient to generate a spectral coefficient of the entire band, and converts the synthesized spectral coefficient into a time signal. The decoding apparatus according to claim 19, further comprising: a frequency-time conversion unit that performs the processing.

Code dividing means for dividing the multiplexed input code into a plurality of codes;
Low frequency decoding means for decoding the low frequency coefficient from an input code;
The plurality of high frequency decoding means include first high frequency decoding means for decoding the high frequency coefficient encoded by only the amplitude parameter included in the input code, the high frequency coefficient and the low frequency coefficient. A third high-frequency decoding means for decoding the high-frequency coefficient using only the positional parameter indicating the positional relationship with the high-frequency coefficient and the amplitude parameter of the high-frequency coefficient;
The decoding selection means selects one of the high frequency decoding means based on the selection information included in an input code,
The signal synthesizing unit synthesizes the decoded low-frequency coefficient and the high-frequency coefficient to generate a spectral coefficient of the entire band, and converts the synthesized spectral coefficient into a time signal. The decoding apparatus according to claim 19, further comprising: a frequency-time conversion unit that performs the processing.

A band division step for dividing the input signal into a low-frequency signal representing a signal in a low frequency band and a high-frequency signal representing a signal in a high frequency band;
A low frequency encoding step of encoding the divided low frequency signal and generating a low frequency code;
A similarity determination step for determining a similarity between the high-frequency signal and the low-frequency signal;
A plurality of high frequency encoding steps for encoding the high frequency signal with different encoding methods to generate a high frequency code;
Based on the similarity, one of the plurality of high frequency encoding steps is selected, encoding is performed in the selected high frequency encoding step, and an encoding method of the selected high frequency encoding step An encoding selection step for generating selection information for identifying
A code multiplexing step for generating an output code by multiplexing the low frequency code from the low frequency encoding step, the selected high frequency code from the high frequency encoding step, and the selection information from the encoding selection step. And an encoding method comprising:

A low frequency decoding step for decoding a low frequency signal representing a signal of a low frequency band from an input code;
A plurality of high frequency decoding steps for decoding a high frequency signal representing a signal of a high frequency band from the input code using different decoding methods;
A decoding selection step of selecting one of the high-frequency decoding steps based on selection information included in an input code, and performing decoding in the selected high-frequency decoding step;
Signal synthesis for synthesizing the low frequency signal decoded by the low frequency decoding step and the high frequency signal decoded by the selected high frequency decoding step to generate an output decoded signal A decoding method comprising the steps of:

A band division step for dividing the input signal into a low-frequency signal representing a signal in a low frequency band and a high-frequency signal representing a signal in a high frequency band;
A low frequency encoding step of encoding the divided low frequency signal and generating a low frequency code;
A similarity determination step for determining a similarity between the high-frequency signal and the low-frequency signal;
A plurality of high frequency encoding steps for encoding the high frequency signal with different encoding methods to generate a high frequency code;
Based on the similarity, one of the plurality of high frequency encoding steps is selected, encoding is performed in the selected high frequency encoding step, and an encoding method of the selected high frequency encoding step An encoding selection step for generating selection information for identifying
A code multiplexing step for generating an output code by multiplexing the low frequency code from the low frequency encoding step, the selected high frequency code from the high frequency encoding step, and the selection information from the encoding selection step. A program that causes a computer to execute.

A low frequency decoding step for decoding a low frequency signal representing a signal of a low frequency band from an input code;
A plurality of high frequency decoding steps for decoding a high frequency signal representing a signal of a high frequency band from the input code using different decoding methods;
A decoding selection step of selecting one of the high-frequency decoding steps based on selection information included in an input code, and performing decoding in the selected high-frequency decoding step;
Signal synthesis for synthesizing the low frequency signal decoded by the low frequency decoding step and the high frequency signal decoded by the selected high frequency decoding step to generate an output decoded signal A program for causing a computer to execute steps.

A band division step for dividing the input signal into a low-frequency signal representing a signal in a low frequency band and a high-frequency signal representing a signal in a high frequency band;
A low frequency encoding step of encoding the divided low frequency signal and generating a low frequency code;
A similarity determination step for determining a similarity between the high-frequency signal and the low-frequency signal;
A plurality of high frequency encoding steps for encoding the high frequency signal with different encoding methods to generate a high frequency code;
Based on the similarity, one of the plurality of high frequency encoding steps is selected, encoding is performed in the selected high frequency encoding step, and an encoding method of the selected high frequency encoding step An encoding selection step for generating selection information for identifying
A code multiplexing step for generating an output code by multiplexing the low frequency code from the low frequency encoding step, the selected high frequency code from the high frequency encoding step, and the selection information from the encoding selection step. A computer-readable recording medium on which a program for causing a computer to execute is recorded.

A low frequency decoding step for decoding a low frequency signal representing a signal of a low frequency band from an input code;
A plurality of high frequency decoding steps for decoding a high frequency signal representing a signal of a high frequency band from the input code using different decoding methods;
A decoding selection step of selecting one of the high-frequency decoding steps based on selection information included in an input code, and performing decoding in the selected high-frequency decoding step;
Signal synthesis for generating an output decoded signal by synthesizing the low frequency signal decoded by the low frequency decoding step and the high frequency signal decoded by the selected high frequency decoding step. A computer-readable recording medium recording a program for causing a computer to execute the steps.