[go: up one dir, main page]

TWI279693B - Method and device of audio compression - Google Patents

Method and device of audio compression Download PDF

Info

Publication number
TWI279693B
TWI279693B TW94102404A TW94102404A TWI279693B TW I279693 B TWI279693 B TW I279693B TW 94102404 A TW94102404 A TW 94102404A TW 94102404 A TW94102404 A TW 94102404A TW I279693 B TWI279693 B TW I279693B
Authority
TW
Taiwan
Prior art keywords
audio signal
bit
compression
dimensional
audio
Prior art date
Application number
TW94102404A
Other languages
Chinese (zh)
Other versions
TW200627190A (en
Inventor
Yi-Hsin Tao
Chia-Hsing Lin
Cheng-Tao Hwang
Chao-Chin Hsu
Original Assignee
Etoms Electronics Corp
Techsoft Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Etoms Electronics Corp, Techsoft Technology Co Ltd filed Critical Etoms Electronics Corp
Priority to TW94102404A priority Critical patent/TWI279693B/en
Publication of TW200627190A publication Critical patent/TW200627190A/en
Application granted granted Critical
Publication of TWI279693B publication Critical patent/TWI279693B/en

Links

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The present invention discloses a method and device of audio compression. The present invention utilizes wavelet transform technology to transform audio signal into two folds of sub-band via wavelet function: the low frequency sub-band which represented the important part of audio signal, and the high frequency sub-band which represented the low sensitivity part of audio signal for human hearing. The present invention creates a code block, which includes several sub-bands of the one dimensional wavelet audio signal, then processes the code block through EBCOT and arithmetic coding. The present invention can provide a real time and high compression rate audio compression.

Description

1279693 九、發明說明: 【發明所屬之技術領域】 本發明係關於-種音頻訊號壓縮方法,特狀關於一 用一維離散小波轉換技術之音頻訊號壓縮方法盥 【先前技術】 〃衣1 -般的音頻滅_働,大部分都是在_(F卿 Donmn)上做Psychoacoustic分析後,再進行壓縮。藉由將頻 域上較不重要的高鱗分以餘元輪行編碼,並賴域上較1279693 IX. Description of the Invention: [Technical Field] The present invention relates to an audio signal compression method, and relates to an audio signal compression method using a one-dimensional discrete wavelet transform technique. [Prior Art] The audio is off, most of it is done after Psychoacoustic analysis on _(F Qing Donmn). By encoding the less important high-scales in the frequency domain with the remainder of the round, and

重要的低頻部分以高位元率進行編碼,即可獲得相#高的 比0 、 其中,最著名的是音頻訊號壓縮方法為MPEG丨L III,亦即,MP3。MP3的壓縮率可以達到1〇倍,不過,卻仍 然能保留接近原音的音質。所以,Mp3的相關產品目前廣受消 費大眾的歡迎。 〜 但是,一般的頻域音頻訊號壓縮技術,必須使用傅利葉轉 換(Fourier Transform)將時域(Time D〇main)音頻訊號轉#奐為 頻,,頻訊號後,再進行分析及後續壓縮。因此需要相當大的 計异量。使得以此種技術來製作的壓縮引擎硬體,僧柊 T 〇 ^ MP3 時,要一段時間的時域音頻訊號,所以,Mp3的壓縮會產生依 各系統的Latency Delay。若要達到接近即時(real time)的 壓縮,則需要使用更高的工作頻率而使得耗電隨之增加。 /為了達到高壓縮率、低運算量的目的,另一種頻域轉換的 技術一小波轉換(Wavelet Transform)是可採用的技術。藉由 小波函數(Wavelet function)的轉換,將音頻訊號經過轉換 後,即可獲得代表音頻訊號重要資料的中低頻頻帶訊號(L〇w frequency sub-band)與人類耳朵較不敏感的高頻頻帶訊號The important low-frequency part is encoded at a high bit rate, and the ratio #0 of the phase # is obtained, and the most famous one is that the audio signal compression method is MPEG丨L III, that is, MP3. MP3's compression ratio can be up to 1 times, but it still retains the sound quality close to the original sound. Therefore, Mp3 related products are currently popular among consumers. ~ However, the general frequency domain audio signal compression technology must use the Fourier Transform to convert the Time D〇main audio signal to #,, after the frequency signal, and then analyze and subsequent compression. Therefore, a considerable amount of difference is required. The compression engine hardware made by this technology, when T 〇 ^ MP3, takes a period of time domain audio signal, so the compression of Mp3 will generate the Latency Delay according to each system. To achieve near real time compression, you need to use a higher operating frequency to increase power consumption. / In order to achieve high compression ratio and low computational complexity, another technique for frequency domain conversion, Wavelet Transform, is a technique that can be employed. By converting the audio signal (Wavelet function), the audio signal is converted, and the low frequency band signal (L〇w frequency sub-band) representing the important data of the audio signal is obtained, and the high frequency band which is less sensitive to the human ear is obtained. Signal

(High frequency sub-band)。上述的壓縮方法,即利用DWT 的此種特性,來進行即時的音頻訊號壓縮。 6 1279693 Μ热新二ΐ將小波轉換技術運用於音頻訊號的處理上’仍 曰二二領域’ θ而’尚有相當大的發展與努力空間。 尚未見到任何運用小波轉換技術來處理音頻訊號 1= 所以’此為一可開發的技術領域。 【發明内容】 馨?以士驾知技術的_,本發明提供_種音頻訊號壓縮 方法:猎以達到能即時壓縮,且具有高壓縮率的目的。 =明尚有7目的在於,藉由提供—種音頻訊號壓縮方 p相♦ 可動相整_率的目的,此功能能應用於目前不 穩疋頻寬的網路傳輸。 於;述目的’本發明之提供—種音頻訊雜縮方法,係 3W0Kt)之驗,將_貞訊號進行 旦^处、佳/匕3下列步驟··進行N階一維離散小波轉換;進行 Γί元難編斗·係以M軸單位_之該經N階-@小波轉換之音頻訊號形成一碼塊(c〇de Block),並將該 :、2位面排列的方式進行壓縮編碼;進行算術編碼:將 。兀堊縮編碼的資料進行進行算術編碼;以及,輸出位元 方法配if述之音頻壓縮方法,本發明更提供一種音頻解壓縮 馆辦、’+、ti下列步驟:輸入位元串流:將經申請專利範圍第1 入·頻壓縮方法所壓縮之音頻訊號以位元串流方式輸 的挪异術解碼;進行位元壓縮解碼··將Μ筆單位時間(1:) 淮解碼_音頻減以碼塊的方式進行位元壓縮解碼; 仃里化,進行Ν階逆一維離散小波轉換;以及,還原訊號。 縮/解壓法具體化’本發服供一種音頻訊號壓 緩衝記憶體,用以儲存由外部所輸入之一單位時間(t) 1二頻汛號、經過一維離散小波轉換或一維逆離散小波轉換 的曰頻訊號; 1279693 一維離散小波轉換器,與該緩衝記憶體相互配合,以完成 N次的一維離散小波轉換,或N次的一維逆離散小波轉換; 立一位元編碼單元,將該Μ筆經過n次一維離散小波轉換的 音頻訊號形成碼塊後以位元平面方式編碼,或進行逆位元平面 方式解碼; 。、一f術壓縮單元,將該經過位元平面方式編碼之該音頻訊 ?虎進行算術壓縮,或將壓縮完成的音頻訊號進行算術解壓縮; 以及 、一先進先出單元(FIFO),將該壓縮完成的音頻訊號加以輸 出為位το串流,或將壓縮完成的音頻訊號之位元串流送至該算 術壓縮單元進行解壓縮。 此外’本發明更可單獨提供一種音頻訊號壓縮裝置與音頻 訊號解壓縮褒置。 為讓本發明之上述和其他目的、特徵、和優點能更明顯易 •,下文特舉數個較佳實施例,並配合所附圖式,作詳細說明 如下: 【實施方式】 本發明運用離散小波轉換⑽)所提供賴祕㈣㈣(High frequency sub-band). The above compression method uses the characteristics of the DWT to perform instant audio signal compression. 6 1279693 Μ热新二ΐ applies the wavelet transform technology to the processing of audio signals. 'There is still a lot of room for development and hard work.' I have not seen any use of wavelet transform technology to process audio signals. 1= So this is a technical field that can be developed. SUMMARY OF THE INVENTION The present invention provides an audio signal compression method: hunting for instant compression and high compression ratio. There are 7 purposes for Ming, which can be applied to the current unstable network bandwidth by providing the audio signal compression side p phase ♦ movable phase _ rate. The purpose of the present invention is to provide an N-order one-dimensional discrete wavelet transform by performing the following steps: performing the following steps: performing the following steps: performing the N-order one-dimensional discrete wavelet transform on the _贞 signal, and performing the following steps: Γί元难编斗············································· Perform arithmetic coding: will. The collapsed encoded data is subjected to arithmetic coding; and the output bit method is equipped with an audio compression method as described, and the present invention further provides an audio decompression library, '+, ti the following steps: input bit stream: The audio signal compressed by the first input/frequency compression method of the patent application scope is decoded by bit stream method; the bit compression decoding is performed. · The unit time (1:) is decoded by the unit. The bit compression decoding is performed in a code block manner; the Ν 化, the inverse one-dimensional discrete wavelet transform is performed; and the signal is restored. The shrink/decompression method is embodied in an audio signal buffer memory for storing one unit time (t) of the input by the outside, a two-frequency nickname, one-dimensional discrete wavelet transform or one-dimensional inverse discrete Wavelet-converted chirped frequency signal; 1279693 one-dimensional discrete wavelet converter, interacting with the buffer memory to complete N-dimensional one-dimensional discrete wavelet transform, or N-order one-dimensional inverse discrete wavelet transform; The unit, after the n-times one-dimensional discrete wavelet transform audio signal is formed into a code block, is coded in a bit-plane manner, or is decoded in an inverse bit-plane manner; a compression unit that performs arithmetic compression on the audio channel encoded by the bit plane method, or arithmetically decompresses the compressed audio signal; and a first in first out unit (FIFO) The compressed audio signal is output as a bit το stream, or a bit stream of the compressed audio signal is streamed to the arithmetic compression unit for decompression. In addition, the present invention further provides an audio signal compression device and an audio signal decompression device separately. The above and other objects, features, and advantages of the present invention will become more apparent from the aspects of the appended claims. Wavelet transform (10)) provides the secret (4) (four)

位元Μ縮編碼以及鼻術編碼工作。於是, 間。 、 定程度的相似性,有利於之後的 作。於是,可以增加壓縮率的空 此外,由於音頻(Audio)訊號為一維訊號Bit collapse coding and nasal coding work. So, between. The degree of similarity is conducive to the subsequent work. Therefore, the compression ratio can be increased. In addition, since the audio (Audio) signal is a one-dimensional signal

1279693 ’ 用 EBCOT(Embedded Block Coding with Optimized Truncation) 的壓縮技術’因為此項壓縮技術對於經由小波轉換後的資料, • 具有很好的壓縮效率與品質;同時,可讓音頻訊號具有可調 (scalability)、隨意存取(random access)等優點。 特別的是’本發明所採用的EBC0T技術,僅採用EBC0T技 術中位元壓縮編碼以及算術編碼的部分,並運用了自創的新穎 演算法來達到適應性(adaptive)的功效。 々首先,請參考「第2圖」,本發明之音頻訊號壓縮方法之 第一具體實施例,包含下列步驟:取樣(步驟11〇);進行N階 籲 =維離散小波轉換(步驟120);進行量化(步驟130);進行位 元壓縮編碼(步驟140);進行算術編碼(步驟15〇);輪出位元 串流(步驟160)。 在輸入音頻號後’即以固定的取樣率對音頻訊號進行取 樣(步驟110),例如44· 1kHz,每次共取得Μ筆音頻訊號資料。 接著,即可將取樣後的單位時間(t)内的音頻訊號反覆進'行一 維離散小波轉換,共進行N次,以獲得N+1個子頻帶,此即進 行N階一維離散小波轉換(步驟12〇)之步驟。接下來,因為每 個子頻帶内含的音頻資訊的重要程度不同,因此對所有的子頻 待的音頻訊號個別進行量化(步驟130)。此量化為純量量化,、 • 例士只取音頻訊號第16高位元、12高位元、或8高位元...。 接著,對所有量化後的子頻帶音頻訊號進行位元壓縮編碼(步 驟140) ’其係將μ筆的音頻訊號形成一碼塊再進行位元編碼; 亦=,將Μ筆含有所有子頻帶資料的資料包(一個單位時間内 的音頻訊號)組成一個碼塊(c〇de-block)後,再將該碼塊分成 不同的fL元平面(bit plane),以位元平面做為壓縮處理的對 象。接著’進行算術編碼(ArithmeticCoding;AC)(步驟150), - 例如’運用姒―編碼器。最後,輸出位元串流(bit Stream)(步 ,160),亦即,將算術編碼所完成的壓縮資料加以輸出,即 / 完成所有的壓縮程序。 9 1279693 本么明的特色即在於將音頻訊號一維訊號的多個子頻帶 組合為一個碼塊,此與JPEG 2000對於影像壓縮時將各個DWT 子頻帶再細切為壓縮碼塊的做法不同。 在資料的解壓縮上,則以相反的程序進行即可,如「第3 圖示者,其包含了以下步驟:輸入位元串流(步驟210); ,行^術解碼(步驟220);進行位元壓縮解碼(步驟23〇);進 =里t匕i步驟24〇);進行N階逆一維離散小波轉換(步驟 250),還原訊號(步驟260)。 〆娜 步驟210中,係將經過本發明的壓縮方法所產生的 ^串&加以輸入。❿步驟22〇即將此經壓 算,進?算術解碼。接著,依據編碼時= 進仃位兀壓縮解碼動作(步驟23〇)。再來,依據原 ,的里化位階,進行解量化(步驟240)。最後,將^ 行Ν階逆-維離散小波轉換(步驟25。),在二後= 料加^原(步驟·,即可輸出為原先的音頻=換後的貝 斟於立了不同的應㈣求,例如,系統頻寬大小不同或者 黄於音質的要求不同,本發明更可以不同的不壓巧 ,J,. )、量化位階為胸ts、二聲二 22 ^(64kb^·^ ° ^ 儲断音頻訊號的存 以·ϋ體各讀小或頻寬較小時,以較高的麗缩率 (如以128kpbs或64kbps)來壓縮音頻訊號。门他細羊 方法求,種音触號的壓縮 ,眚浐仓丨—人」本务月之音頻訊號麼縮方法第二且 驟 320),·進Α_ _,·取樣4 驟wm·分二广,士维離放小波轉換(步驟330);進行量化(步 驟340),依觀縮率Α進行位元塵縮編碼(步驟^里進= 1279693 術編碼(步驟360);輸出位元串流(步驟370)。 首先,由使用者選擇壓縮率A(步驟310),此壓縮率的選 項係為事先設定,如前所述者。接著,即依據固定的取樣率對 音頻訊號進行取樣(步驟320)。取樣後,將取樣後的音頻訊號 反覆進行N階一維離散小波轉換(步驟330),以獲得N+1^ 頻帶。接下來,進行量化(步驟340),量化位階可以取16高 位元、12咼位元、8高位元…不等。接著,依據壓縮率Α進行 位元壓縮編碼(步驟350),此步驟係依據EBCOT可調的特性來 進行,詳細的做法將於「第10圖」描述。最後,進行管 碼(步驟360),再輸出位元串流(步驟37〇)。 开、 此外,對音頻訊號來講,運用不同的取樣率,同樣可 Ρ爷低訊號資料量的目的。亦即,本發明亦可透過選取不 樣率以降低資料量。例如,卩441]ίΗζ,22·_ζ刪2 11kHz,8kHz…的取樣率,會獲得不同的資料密度。樣‘ 者,音質的表現上較佳。 你午趣呵 方、、者第「2」闰的實施例’將其反過來就成為解塵縮的 一 f 口月多考弟5圖」所不者。其包含了以下步驟:輸入 =串流(步驟410);進行算術解碼(步驟42〇) 進行位元壓騎碼(步驟侧);進行锻化(步驟_ 心皆1^維Ϊ散]、波轉換(步驟450);還原訊號(步驟働f。 朽:L1410中’係將經過本發明的壓縮方法所產生的 串流加以輸人。而步驟即將此經壓 1 壓啦A而健⑷解碼。接著,依據編碼時所採用的 以率A而適應性_的壓縮方 (步驟430)。再來,佑摅盾止从曰 細解碼動作 •最後 音==換後的資料加以還原(步驟46°) ’即可輸出ί 可調整的壓縮率’必須有相應的:#料壓縮技術。由「第4 1279693 的方法可知,本發明係透過位元壓縮編碼的技術來達成壓 縮率的調整。位元壓縮編碼,係為EBCOT當中的主要技術,其 係以位元平面(bit-piane)對每個子頻帶進行壓縮。 一不過,本發明與用於影像壓縮的EBCOT方法有些許的不 同,此不同點成為本發明的創新處。用於影像壓縮處理的 EBCOT ’ +係將各個二維矩陣式的子頻帶個別進行處理,·而音頻 訊-的資料係為一維⑽的資料,因此,其並無EBC〇T處理影 像貧料時所謂的碼塊(c〇de—block)資料。 、 口為了運用EBCOT的壓縮方法來壓縮經DWT轉換的音頻訊 明步處理M _段的音頻罐資料並將所有的子 ί ΓΛΓ成一個碼塊(code—block),再來一起做EBC〇T處理。 八 的選擇端視對硬體的需求而定。於是,本發明可將一 資料,變成二維的DWT音頻訊號資料。例如, ί資1個子頻帶(4L,4H,3H,2H,1H)當中的五個子頻 ,2H,1H),M個時段的DWT資料一併做E_1279693 'Embedded Block Coding with Optimized Truncation' compression technology' because this compression technology has good compression efficiency and quality for wavelet-transformed data. At the same time, the audio signal can be adjusted (scalability). ), random access and other advantages. In particular, the EBCOT technology employed in the present invention uses only the bit compression coding and arithmetic coding portions of the EBC0T technology, and uses a novel algorithm developed to achieve adaptive effects. 々 First, please refer to FIG. 2, the first embodiment of the audio signal compression method of the present invention includes the following steps: sampling (step 11〇); performing N-order call-dimensional discrete wavelet transform (step 120); Quantization is performed (step 130); bit compression coding is performed (step 140); arithmetic coding is performed (step 15A); bit stream is rotated (step 160). After the audio number is input, the audio signal is sampled at a fixed sampling rate (step 110), for example, 44·1 kHz, and the audio signal data is acquired each time. Then, the audio signal in the unit time (t) after sampling can be reversed into the 'one-dimensional discrete wavelet transform, and N times are performed to obtain N+1 sub-bands, which is to perform N-order one-dimensional discrete wavelet transform. (Step 12〇) steps. Next, since the audio information contained in each sub-band is different in importance, the audio signals of all the sub-frequency are individually quantized (step 130). This quantization is scalar quantization, • The case only takes the 16th high bit, 12 high bits, or 8 high bits of the audio signal. Then, all the quantized sub-band audio signals are bit-compress-encoded (step 140). 'The audio signal of the μ-pen is formed into a code block and then bit-encoded; and also, the Μ pen contains all sub-band data. After the data packet (audio signal in one unit time) is composed of a code block (c〇de-block), the code block is divided into different fL element planes, and the bit plane is used as a compression process. Object. Next, 'Arithmetic Coding (AC) is performed (step 150), for example, 'Using 姒-encoder. Finally, the bit stream is output (step, 160), that is, the compressed data completed by the arithmetic coding is output, that is, / all the compression procedures are completed. 9 1279693 The feature of this is that the multiple sub-bands of the audio signal 1D signal are combined into one code block, which is different from the JPEG 2000 method of finely cutting each DWT sub-band into a compressed code block for image compression. In the decompression of the data, the reverse procedure may be performed, such as "the third figure, which includes the following steps: inputting a bit stream (step 210); and decoding (step 220); Perform bit compression decoding (step 23 〇); enter = ri 匕 i step 24 〇); perform an N-th order inverse one-dimensional discrete wavelet transform (step 250), restore the signal (step 260). The string & generated by the compression method of the present invention is input. In step 22, the compression is performed, and the arithmetic decoding is performed. Then, according to the encoding, the compression decoding operation is performed (step 23). Then, according to the original, the grading level, dequantization is performed (step 240). Finally, the inverse-dimensional discrete wavelet transform is performed (step 25), and after the second = material plus (original) , can be output as the original audio = after the change of the beggar in the different (four) seeking, for example, the system bandwidth size is different or yellow in the sound quality requirements, the invention can be different, not compact, J, . ), the quantization level is chest ts, two sounds 22 ^ (64kb ^ · ^ ° ^ storage audio signal storage When the reading is small or the bandwidth is small, the audio signal is compressed at a high sag rate (for example, 128kpbs or 64kbps). The method of squeezing the sound of the seeding, the compression of the seeding horn, the 眚浐 丨 人 人 人 人 人The audio signal of the moon is the second method and the step 320), the input _ _, the sampling 4 the step wm · the second wide, the Shi Wei off the wavelet transform (step 330); the quantization (step 340), the view The reduction rate Α performs bit dust reduction coding (step 里 = 1279693 operative code (step 360); output bit stream (step 370). First, the user selects compression ratio A (step 310), the compression ratio The option is set in advance, as described above. Then, the audio signal is sampled according to a fixed sampling rate (step 320). After sampling, the sampled audio signal is repeatedly subjected to N-order one-dimensional discrete wavelet transform ( Step 330), to obtain the N+1^ frequency band. Next, quantization is performed (step 340), and the quantization level can be unequal to 16 high bits, 12 咼 bits, 8 high bits, etc. Then, according to the compression ratio Α Meta-compression coding (step 350), this step is based on the EBCOT adjustable characteristics, detailed The method will be described in "Figure 10." Finally, the tube code is executed (step 360), and the bit stream is output (step 37). On, in addition, for audio signals, different sampling rates can be used. The purpose of the data volume of the grandfather is low. That is, the present invention can also reduce the amount of data by selecting the sample rate. For example, 取样 441] Ηζ 22, 22 _ ζ 2 2 11 kHz, 8 kHz ... sampling rate, will be different The data density is the same as that of the sampler. The performance of the sound quality is better. You will have a good afternoon, and the embodiment of the "2" ' will turn it into a dusty one. Nothing. It includes the following steps: input=streaming (step 410); performing arithmetic decoding (step 42〇) performing bit pressing code (step side); performing forging (step _ heart is 1 ^ dimensional dispersion), wave Conversion (step 450); restore signal (step 働f. 朽: L1410' is to input the stream generated by the compression method of the present invention, and the step is to compress the A and the (4) decoding. Next, the compression side is adapted according to the rate A used in the encoding (step 430). Then, the 摅 摅 止 还原 曰 • • • • • • • • • ( ( ( ( ( ( ( ( ( ( ( ( ( ) 'Easy to output ί adjustable compression ratio' must have a corresponding: #料压缩技术. As can be seen from the method of No. 4 1279693, the present invention achieves compression ratio adjustment through bit compression coding techniques. Compression coding is the main technique in EBCOT, which compresses each sub-band with a bit-piane. However, the present invention is slightly different from the EBCOT method for image compression. Become the innovation of the present invention. It is used for image compression processing. EBCOT ' + system separates the sub-bands of each two-dimensional matrix, and the audio-data is one-dimensional (10) data. Therefore, there is no so-called code block when EBC 〇T processes image poor materials ( C〇de-block) data. In order to use the EBCOT compression method to compress the audio channel data of the M_ segment processed by the DWT-converted audio step and divide all the sub-rules into a code block. Then, I will do EBC〇T processing together. The choice of eight depends on the hardware requirements. Therefore, the present invention can convert a data into two-dimensional DWT audio signal data. For example, one sub-band (4L) , 5H among 3H, 3H, 2H, 1H), 2H, 1H), DWT data for M time periods are E_ together

j者’4做到較高的壓縮率,也可將ιη的部分直接删 除’而將五個子頻帶資料Γ4丨仙叫 1接冊JThe j's 4 achieve a higher compression ratio, and the part of the ιη can be directly deleted, and the data of the five sub-bands is Γ4丨仙叫1

資料一併做_τ _料(L,4H,3H,2H,1H),M個時段的DWT 縮。可將1H資料直接刪除的原因是,其 ’直接刪除並不會對音質造成太大的影 不宜太長。 姻雜時財—定的限制’ 以下,我們將以直接刪除1H的例子, 應性壓縮/解壓縮方法。 』丁水》兄明本發明的適The data is done together with _τ _ material (L, 4H, 3H, 2H, 1H), DWT shrinkage of M time periods. The reason why the 1H data can be deleted directly is that its direct deletion does not cause too much impact on the sound quality. In the case of marriage and wealth - the limit is limited. Below, we will directly remove the 1H example, and compress/decompress the method. "Ding Shui" brother Ming Ming's invention

音頻訊號㈣16位元量化的數位資 展^句為DWT 「第7圖」的魏資㈣之位元平面(bi==,。即成為 12 1279693 EBCOT係以位元平面為壓縮處理的基礎,本發明運用相同 的處理技術。不過,為了要能達到壓縮率可調(scalable)的目 的’本發明採用了與JPEG2000用於影像處理有些許差显的方 法。 ’、 EBC0T編碼日令’係於母個碼塊(c〇de-bl〇ck)當中的位元平 面的最顯著位元平面(Most significant bit-plane,MSB)開 埤進行編碼。以「第7圖」為例,由16位元的量化資料所組 成的碼塊資料30中,第一個位元平面係為符號平面(sign bit-plane),其後則為MSB,最後一個位元平面則為最不顯著Audio signal (4) 16-bit quantized digital asset exhibition sentence is the bit plane of Wei (4) of DWT "Fig. 7" (bi==, which becomes 12 1279693 EBCOT is based on the bit plane as the basis of compression processing, this The invention uses the same processing technique. However, in order to achieve the scalability of the scalability, the present invention adopts a method which is somewhat inferior to that of JPEG2000 for image processing. ', EBC0T encoding Japanese order' is in the mother The most significant bit-plane (MSB) of the bit-plane in the block (c〇de-bl〇ck) is coded. For example, in Figure 7, the 16-bit is used. In the code block data 30 composed of the quantized data, the first bit plane is the symbol bit-plane, followed by the MSB, and the last bit plane is the least significant.

位元平面(Least significant bit-plane,LSB)。EBC0T 的編 碼係從MSB開始到LSB進行編碼’並搜尋第一個具有1的顯著 (「^gnificant)值,以從此一顯著值之後,開始進行編碼二以 第7圖」為例,從第八位元平面3〇_8開始,即有第一個顯 著值營生。由於未出現丨之前的位元平面,其值均為零,因此, y以早-㈣代表其巾數各位元平面的龍,可大幅增加壓縮 率。 本f明即以此為基礎’更進一步將音頻訊號的特色加以發 達至動態選取壓縮率的目的。本發明係利用音頻訊號 =色-日1大時,微弱的訊號將非常不明顯,人耳也不容易 ’音頻峨將會頃向於針在_在特定的數個位 m二是’只要將音頻訊號所集中的必要資訊加以保 邊,不必要的賢訊則視為〇,即可達到有效壓縮的目的。 私,ίΐίΐ法為:從第—個具有1的㈣值之位元平面開 口,又據所、取的壓縮率來選擇要處理 - 爾的音頻訊號(4L,银3Η,2Η,職例,本 人 麟分卿取ηι滅留 分別斜4L 4H叫者9U再依據此nl,n2,n3,n4四個處理位元數, 於進杆η/ ?,/四個子頻帶從第—個具有1的顯著位元開 口 ,n,n,n4四個處理位元數的位元壓縮處理,其餘者 13 1279693 則以,〇方式進行處理。如此,即可獲得很好的壓縮效率。而 在曰1小的情形,其資料分布將可能會分布在附近,此 時’處理位元數大於第一個具有丨的顯著值的位元平面將可能 小於處理位元數。此種情形下,本發明則直接從該第一個具有 1的顯著值的位元平面進行EBC0T壓縮。本發明稱此為適應性 位元平面調整方法。 —因此,處理位元數的值,雖然已經事先指定,不過,在經 過資料分析後而解讀出第一個具有1的顯著值之位元平面 時,即可判斷是否採用此處理位元數的設定。因此,本方法係 具有適應性的效果。 而此111,112,113,114的值,可以實驗來獲得。以「第8圖」 的例子來說,本發明在4L,4H,3H,2H四個子頻帶各取一個垂直Least significant bit-plane (LSB). The encoding of EBC0T is encoded from the MSB to the LSB's and searches for the first significant ("^gnificant" value with 1 to start encoding 2 to 7th from this significant value." The bit plane starts at 3〇_8, which means that there is the first significant value to live. Since the bit plane before the 丨 is not present, its value is zero. Therefore, y is a dragon whose early - (four) represents its element plane, which can greatly increase the compression ratio. Based on this, Ben Fen further developed the characteristics of the audio signal to dynamically select the compression ratio. The present invention utilizes audio signal = color - day 1 when the weak signal will be very inconspicuous, and the human ear is not easy. 'Audio 峨 will be directed at _ in a certain number of bits m is 'as long as The necessary information gathered by the audio signal is preserved, and the unnecessary information is regarded as a flaw, and the purpose of effective compression can be achieved. Private, ίΐίΐ method: from the first opening of a bit plane with a value of 1 (four), and according to the compression ratio of the location, the audio signal (4L, silver Lin Ziqing takes ηι 灭 留 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 The bit opening, n, n, n4 are the bit compression processing of the four processing bit numbers, and the rest 13 13279693 are processed in the 〇 mode. Thus, a good compression efficiency can be obtained. In case, the data distribution will probably be distributed nearby, in which case the number of processing bits larger than the first significant value with a significant value of 丨 will probably be smaller than the number of processing bits. In this case, the present invention directly The first bit plane having a significant value of 1 performs EBCOT compression. The present invention refers to this as an adaptive bit plane adjustment method. - Therefore, the value of the number of processed bits, although already specified in advance, is in the past After analysis, the first bit with a significant value of 1 is interpreted. In the plane, it can be judged whether the setting of the number of processing bits is used. Therefore, the method has an adaptive effect. The values of 111, 112, 113, 114 can be obtained experimentally. For example, the present invention takes one vertical in each of the four sub-bands 4L, 4H, 3H, and 2H.

切面 Cl,C2, C3, C4,並取(nl,n2, n3, n4>(5, 4, 3, 3)。由於 4L 的子頻帶的音頻訊號具有最有用資料量,因此取5個處理位 元’ 4Η其次,因此取4個處理位元,3Η取3、2Η取3。 1,η2, η3, η4)的值,以(5, 4, 3, 3)來取,再以EBCOT的位元壓 縮編碼方式編碼,經過還原後仍可獲得相當高的音質。 Μ然,(111,112,113,114)的值視壓縮率的需求來進行選取, 例如,較低的壓縮率可取(6, 5, 4, 4)或(7, 6, 5, 5)···不等。 y本發明的壓縮效果,可由以下的計算結果得知:假設取樣 後的總資料量為1,原先五個子頻帶的資料量為1/16,1/16, 1/8’ 1/4,1/2 ’由於直接刪除了 1H,因此,處理的資料量總 和為1/2。經過適應性位元平面調整方法處理後,以(5, 4, 3, 3) 為例,整體的資料量將可降’低’為 1/16*5/15+1/16*4/15+1/8*3/15+1/4*3/15=9/80,甚至更低, 資料1大幅降低。此資料量為尚未進行位元壓縮編碼與算術編 碼的結果。也就是,經過位元壓縮編碼與算術編竭後,' 更高的壓縮效果。 適應性位元平面調整方法的說明,請參考「第8A圖」與 1279693 「第8B圖」。「第8A圖」說明了 4L,4H兩個子頻帶的處理位元 數分別為(5, 4) ’「第8B圖」則說明了 3H,2H兩個子頻帶的處 , 理位元數為(3, 3)。由於第8位元平面為首次出現顯著值1者, 此顯著值出現後,即依據(5, 4, 3, 3)分別對各個子頻帶的位元 平面進行位元壓縮處理。 此外’為了要讓處理位元數的位元平面能更順利地運作, 本發明將疋義好的處理位元數的位元平面整體向LSB平移,並 只,紀錄平移的位元數即可。如此,更可增加壓縮的效率。如 _「第8C圖」、「第8D圖」所示者,本發明將原先處理的處理位 • 元平面,由第8位元平面開始,分別平移至第η位元(C1)、 第12位元平面(C2)、第13位元平面(C3)、第13位元平面(C4) 開始。平移後,其他部分的值皆以補〇的方式加以處理。整個 電路只要紀錄平移的值即可。 ”上述說明了本發明如何藉由「處理位元數」的調整來調整 壓縮率,進而達到動態調整壓縮率的目的。 ”具體的流程,請參考「第9A圖」,其為本發明之動態調整 壓縮率的具體流程,包含下列步驟··讀取壓縮率之值^步驟 610);依據壓縮率定義處理位元數(步驟62〇);進行位元 處理(步驟630)。 • 首先,先讀取壓縮率之值(步驟610),此值係由系统事先 定義,並由使用者所選擇。接著,即依據壓縮率定義處理位元 數(步驟620),較咼的壓縮率,處理位元數的值較小;反之, 較低的壓縮率,處理位元數的值相對較高。最後,進行位元 面編碼(步驟630)。 一其中的步驟620,又包含了幾個子步驟,如「第9β圖」 所示^搜尋最顯著位元(步驟621);依據壓縮率給定子頻帶處 理位,數nL(步=622);將所處理位元調整至LSB(步驟623)。 首先,先搜哥敢顯著位元(步驟621),並從最顯著位元所 處的位元平關始進行㈣平Φ處理。接下來,依據壓縮率給 15 1279693 定子頻帶處理位元數nL(步驟622),L代表同步處理的時段。 广同步處理四個時段為例,上述的(5,4,3,3)、(6,5,4,4)、 7’6’5’5)…各種組合將可形成不同的壓縮率選項。最後,再 里位元調整至LSB(步驟623),卿,將所保留的處理 位7L貝料’平移至LSB,不進行處理的部分則補為〇。 解^的流程,則請參考「第9C圖」,包含兩個步驟:進 订亚面解碼(步驟71G),將所處理位元由LSB調整至原位 ^的貝科進仃位几平面解碼後,還原為碼塊㈣e bi〇ck)資 本發明的音頻壓縮方法之具體電路裝置,請參考「第1〇 Γι」’Λ為本發明音賴麵縮/麵、雜置的第—具體實施 =少取樣單元η、增加取樣單元12、緩 單元16域狄料W早70 15、鼻術壓縮 其中,減少取樣單元丨丨係用來將單位時間(t)内 的資料’以減少取樣的方式,將取樣的資i依據 p .lkHZ的取樣資料,經設定為以_ζ 作為其取樣率,即可以減少取樣單元u達 2維用ί將儲存於緩衝記憶體13當中的經過 加ί樣ίίΐϊί的θ頻賴以原細減少的取樣倍數增 iii 2或者將經過一維逆離散小波轉 訊號 在進行N次一維離散小波轉換的處理的音頻 合,3二=散小波轉換器14與緩衝記憶體13相互配The cut surfaces Cl, C2, C3, C4, and take (nl, n2, n3, n4> (5, 4, 3, 3). Since the audio signal of the sub-band of 4L has the most useful data amount, 5 processing bits are taken. Yuan '4 Η second, so take 4 processing bits, 3 3 3, 2 3 3. 1, η2, η3, η4), take (5, 4, 3, 3) to take, then EBCOT bit The meta-encoding coding method can still obtain a relatively high sound quality after being restored. Suddenly, the value of (111, 112, 113, 114) is selected according to the demand of the compression ratio. For example, a lower compression ratio may be (6, 5, 4, 4) or (7, 6, 5, 5). ··· Not waiting. y The compression effect of the present invention can be obtained from the following calculation results: assuming that the total amount of data after sampling is 1, the data amount of the original five sub-bands is 1/16, 1/16, 1/8' 1/4, 1 /2 'Because 1H is directly deleted, the sum of the processed data is 1/2. After the adaptive bit plane adjustment method is processed, taking (5, 4, 3, 3) as an example, the overall data volume can be reduced to 'low' to 1/16*5/15+1/16*4/15. +1/8*3/15+1/4*3/15=9/80, even lower, data 1 is greatly reduced. This amount of data is the result of no bit compression coding and arithmetic coding. That is, after bit compression coding and arithmetic editing, 'higher compression effect. For an explanation of the adaptive bit plane adjustment method, please refer to "8A" and 1279693 "8B". "8A" shows that the number of processing bits in the 4L and 4H sub-bands is (5, 4) ''8B'' shows the 3H and 2H sub-bands, and the number of parity bits is (3, 3). Since the 8th bit plane is the first time that a significant value of 1 occurs, after the significant value appears, the bit plane of each sub-band is subjected to bit compression processing according to (5, 4, 3, 3). In addition, in order to make the bit plane of the processing bit number work more smoothly, the present invention shifts the bit plane of the well-processed bit number as a whole to the LSB, and only records the bit number of the translation. . In this way, the efficiency of compression can be increased. As shown in _ "8C" and "8D", the present invention shifts the processing bit/meta plane originally processed from the 8th bit plane to the nth bit (C1), 12th. The bit plane (C2), the 13th bit plane (C3), and the 13th bit plane (C4) start. After panning, the values of other parts are processed in a complementary manner. The entire circuit only needs to record the value of the translation. The above describes how the present invention adjusts the compression ratio by adjusting the "number of processing bits" to achieve the purpose of dynamically adjusting the compression ratio. For the specific process, please refer to "FIG. 9A", which is a specific process of dynamically adjusting the compression ratio of the present invention, including the following steps: · reading the value of the compression ratio ^ step 610); defining the number of processing bits according to the compression ratio (Step 62); Perform bit processing (step 630). • First, the value of the compression ratio is read (step 610), which is defined in advance by the system and selected by the user. Then, the number of processing bits is defined according to the compression ratio (step 620). The value of the processing bit number is smaller than the lower compression ratio; conversely, the lower compression ratio, the value of the processing bit number is relatively higher. Finally, bit-plane coding is performed (step 630). Step 620, which further includes several sub-steps, such as "the 9th figure", searches for the most significant bit (step 621); the bit is processed according to the compression rate, the number is nL (step = 622); The processed bit is adjusted to the LSB (step 623). First, the first search brother dares to significant bits (step 621), and performs (four) flat Φ processing from the level of the most significant bit. Next, the number of bits nL is processed in accordance with the compression ratio to the 15 1279693 stator band (step 622), where L represents the period of synchronization processing. For example, the above four (5, 4, 3, 3), (6, 5, 4, 4), 7'6'5'5)... various combinations will form different compression ratio options. . Finally, the bit is adjusted to the LSB (step 623), and the reserved processing bit 7L is shifted to the LSB, and the portion that is not processed is added as 〇. For the process of solving ^, please refer to "9C", which consists of two steps: sub-plane decoding (step 71G), and the processed bit is adjusted from LSB to in-situ ^. After the reduction to the code block (4) e bi〇ck) The specific circuit device of the audio compression method of the capital invention, please refer to "1st 」"" Λ is the first part of the invention, the surface of the sound surface, the miscellaneous - the specific implementation = The sampling unit η, the sampling unit 12, the buffer unit 12, the wafer W are 70 、, and the nasal compression is performed therein, and the sampling unit is used to reduce the data in the unit time (t) to reduce the sampling. According to the sample data of p.lkHZ, the sampled data is set to _ζ as its sampling rate, that is, the sampling unit u can be reduced to 2 dimensions, and the sample stored in the buffer memory 13 is added with ί ίίί The θ frequency is increased by iii 2 with a reduction factor of the original fine reduction or the audio combination of the one-dimensional inverse discrete wavelet signal is subjected to N-dimensional one-dimensional discrete wavelet conversion, the 3 bis = scatter wavelet converter 14 and the buffer memory 13 mutual match

i料St i離散小麟換。娜完錢,將所有的轉換 、、位几、、扁碼單元15進行編碼。編碼的方式,係以Eg0T 1279693 的位元平面方式編 法進行編碼,即可造^可配合本發明的壓縮率動態調整的方 在位元編碼單元^疋壓縮率來進行編碼的目的。 16進行算術編媽的動^完料送至算術_單元 出單元17,輪出位4馬元成的貧料,隨即送至先進先 缩單元Π以進行算^先,將資料位4流輸入算賴 元編碼單元崎。解敬成的資料,送至位 石馬方式,並配合原*選擇方式、’同樣運用EBCOT的解 來,再透過一維離散小波^哭^進=態的解碼。接下 體13共同維離散小波轉換器Η與緩衝記憶 樣器12增錄樣,物先雑;魏由增加取 本發ϊϊ將ΐίΐ:,小波轉換14的n次轉換過程中, dir t =要將此資料加以補齊。於是,再增加取樣ϊ 12做^^㈣動作時,係直接將高頻的㈣作補Q ^ 此^ ’由於電路面積與製造成本成正比關係,因此,減少 t面積對程式設計師來講是項挑戰。在「第10圖」中,緩 =記憶體13制來紀錄單位時間隨過減少取樣的 1 ,。由於記憶體往往佔了電路相當大的面積,因此,本發明即 «減少缓衝記憶體13的容量著手。 請參考「第11圖」,其為本發明音頻訊號壓縮/解壓縮 ,的第二具體實施例,包含有··減少取樣單元21、增加取^ 單元22、緩衝記憶體23、第--維離散小波轉換器24、位元 碥碼單元25、算術壓縮單元26、先進先出單元(fif〇)27與第 〜一維離散小波轉換器28。 與「第10圖」比較可發現,本發明增加了 一個第二一維 17 1279693 .黎侗雪玖下緩衝圮憶體23比緩衝記憶體13的容量 i;iS:L=ri轉換器28,並同 M nri ^ ^ 兩者4 一減,總體電路面積實質降 料加以捨棄後,只^下固子頻7 ’f,.1子頻帶的資 參 28所增加的電路面積’社於苐二一維離散小波轉換器 此外,要節省記憶體空間的另一 ·一^ 單中的記憶體來作為緩衝記憶體。亦即,「第 二圖」的緩衝記憶體23 ’事實上“其 整個操作流程‘「第、當中的記憶酿供。 費述。 圖」、第η圖」的流程相同,不再 定本^如上所述,議非用以限 圍内,告相關技藝者,在不脫離本發明之精神和範 須視本許之1動細飾’因此本發明之專利保護範圍 i圖中請專利範圍所界定者為準。 ϊ ί =ίΐ頻訊號經過四次1DDWT後的五個子頻帶示意圖; 第3 發明之音頻訊麵縮方法之第—具體實施例; 第4Ht發明之音頻訊號解壓縮方法之第—具體實施例; 圖係為本發明之音頻職_方法第二具體實施例; 圖係為本發明之音頻訊號解壓縮方法第二具體實施例; =係為同步處理4個時段(M=4)的DWT音頻訊號的示意圖,· 圖係為碼塊之位元平面(bi t-plane)圖; 8A圖係為第8圖之礼,4H兩個子頻帶的處理位元數示意圖; 18 1279693 ί ί ϊίΛ之观2H兩個子頻帶的處理位元數示意圖; ϋ ®㈣平移後U兩個子頻帶的處理位 兀歎不思圖,· 圖^第9B ®經過平移後之3H,2H兩個子㈣的處理位 第9A圖係為本發明之動態調整壓縮率的具體流程· 第9B圖係為依據壓縮率定義處理位元數[子二程: ^ 9C圖係為本發明之動態調整壓縮率之解壓“程圖·, 第.10圖係為本發明的音頻壓縮/解壓縮裝置的第—具體實施i material St i discrete small Lin change. After completing the money, Na will encode all the conversion, position, and flat code units 15. The coding method is coded in the bit-plane manner of Eg0T 1279693, so that the compression rate can be matched with the compression factor of the present invention. 16 perform the arithmetic editing of the mother's movement ^ finished material to the arithmetic _ unit out unit 17, the round out of the 4 Ma Yuancheng's poor material, then sent to the advanced first shrinking unit Π to calculate ^ first, the data bit 4 stream input count Meta coding unit. Xie Jingcheng's information is sent to the Shima way, and with the original * selection method, 'the same use of the EBCOT solution, and then through the one-dimensional discrete wavelet ^ cry ^ into the state of decoding. Connected to the body 13 common dimensional discrete wavelet converter 缓冲 and buffer memory sample 12 to increase the sample, the first 雑; Wei by the increase of the hair ϊϊ ΐ ΐ ,:, wavelet conversion 14 n conversion process, dir t = to be This information is added. Therefore, when the sampling is further increased ϊ 12 to do ^^ (4) action, the high frequency (4) is directly added to the Q ^ This ^ ' because the circuit area is directly proportional to the manufacturing cost, therefore, reducing the t area is for the programmer Challenge. In "Pic 10", the memory = 13 system is used to record the unit time to reduce the sampling 1 . Since the memory tends to occupy a considerable area of the circuit, the present invention is to reduce the capacity of the buffer memory 13. Please refer to FIG. 11 , which is a second embodiment of the audio signal compression/decompression of the present invention, including a reduction sampling unit 21, an incrementing unit 22, a buffer memory 23, and a first-dimensional dimension. The discrete wavelet transformer 24, the bit weight unit 25, the arithmetic compression unit 26, the first in first out unit (fif) 27, and the first to first dimensional discrete wavelet converter 28. Compared with "Fig. 10", it can be found that the present invention adds a second one dimension 17 1279693. The volume i of the buffer memory 23 is smaller than the buffer memory 13; iS: L = ri converter 28, And with M nri ^ ^ both 4 minus, the overall circuit area is substantially reduced, after the discarding, only the lower sub-frequency 7 'f, the sub-band of the sub-band 28 increased the circuit area '社于苐二One-Dimensional Discrete Wavelet Converter In addition, it is necessary to save memory in another memory space of the memory space as a buffer memory. That is to say, the buffer memory 23' of the "second picture" is in fact "the entire operation flow" "memory of the first and middle memories. The flow of the description, the picture, and the figure η" is the same, no longer fixed The discussion is not intended to limit the scope of the invention, and the relevant artisan, without departing from the spirit and scope of the present invention, shall be regarded as the finest decoration of the present invention. Prevail. ϊ ί = ί ΐ 五个 五个 经过 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个 五个The second embodiment of the audio service method of the present invention is a second embodiment of the audio signal decompression method of the present invention; = is a DWT audio signal for synchronously processing 4 time periods (M=4) Schematic diagram, the diagram is the bit plane of the code block (bi t-plane); 8A is the diagram of the 8th diagram, the number of processing bits of the 4H two subbands; 18 1279693 ί ί ϊ Λ 观 2H Schematic diagram of the number of processing bits in the two sub-bands; ϋ ® (4) The processing bits of the two sub-bands of the U after the translation are stunned, · Figure 9BB: The processed bits of the 3H, 2H and the two sub-fourths after the translation Fig. 9A is a specific flow of the dynamic adjustment compression ratio of the present invention. Fig. 9B is a process for defining the number of processing bits according to the compression ratio [sub-two-way: ^ 9C diagram is the decompression of the dynamic adjustment compression ratio of the present invention. Fig. 10 is the first embodiment of the audio compression/decompression apparatus of the present invention. Detailed Description

第11圖係為本發明音頻訊號壓縮/解壓縮裝置的第二具體實 施例0 〆、' 【主要元件符號說明】 11 減少取樣器 12 增加取樣器 13 緩衝記憶體 14 一維離散小波轉換器 15 位元壓縮編碼單元 16 算術壓縮單元 17 先進先出單元 21 減少取樣器 22 增加取樣器 23 緩衝記憶體 24 第一^•維離散小波轉換器 25 位元壓縮編碼單元 26 算術壓縮單元 27 先進先出單元 28 第二一維離散小波轉換器 30 碼塊 1279693 30-8第八位元平面Figure 11 is a second embodiment of the audio signal compression/decompression apparatus of the present invention. 〆, ' [Key element symbol description] 11 Reducer 12 Add sampler 13 Buffer memory 14 One-dimensional discrete wavelet converter 15 Bit compression coding unit 16 Arithmetic compression unit 17 First in first out unit 21 Reducer 22 Add sampler 23 Buffer memory 24 First dimension discrete wavelet converter 25 Bit compression coding unit 26 Arithmetic compression unit 27 First in first out Unit 28 second one-dimensional discrete wavelet converter 30 code block 1279693 30-8 eighth bit plane

Claims (1)

1279693 45年丨0月巧日修ί黑)正本 十、申請專利範圍: 】· :ϊΐ頻訊號壓縮方法,係於接故取樣—單位時_之音頻 、巧後’將該音頻訊舰行_處理,包含 進行Ν階一維離散小波轉換; 進行量化; 私t 兀壓縮編碼:係以Μ筆該單位時間之該經ν階一維離 政小波轉換之音頻訊號形成一碼塊後,進行編碼· t,彳Γ二彳f編碼·將該祕元魏_之音頻訊舰行算術編 碼,以及輸出位元串流。 2· ϋ請專利範圍第i項所述之音頻訊號壓縮方法,其中於進 ^ ^一維離散小波轉換前’更包含一選擇壓縮率之步驟, 其係以一適應性調整之壓縮方法達成。 3· 清專利範圍第2項所述之音頻訊號壓縮方法,其中該適 μ生調整之壓縮;^法係於進行位元壓縮編碼步驟巾進 含下列子步驟: 依據該選擇之壓縮率定義每個子頻帶各一處理位元數; 對碼塊進行位元平面分析,以確認第一個具有i的顯著位 兀十曲,1279693 45 years 丨 月 巧 巧 修 ) ) ) ) ) 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 、 Processing, including performing one-dimensional one-dimensional discrete wavelet transform; performing quantization; private t 兀 compression coding: encoding the OV-order one-dimensional political wavelet converted audio signal of the unit time to form a code block · t, 彳Γ二彳f encoding · The secret element Wei _ the audio ship line arithmetic coding, and the output bit stream. 2) The audio signal compression method described in item i of the patent scope, wherein the step of selecting a compression ratio before the one-dimensional discrete wavelet transform is achieved by an adaptively adjusted compression method. 3. The audio signal compression method according to item 2 of the patent scope, wherein the method is adapted to perform bit compression adjustment, and the method comprises the following sub-steps: defining a compression ratio according to the selected Each sub-band has a processing bit number; a bit-plane analysis is performed on the code block to confirm the first significant position of the ten-curve of i, 進行位元平_整:第-個具有丨賴著位元平面以 理位元數對各子解進減理,非魏理位元數的部分 ^ 设疋為0’,以及進行位元平面編碼。 4.如申請專利範圍第3項所述之音頻訊號壓縮方法,其中 兀平面調整,更包含將該處理位元數之位元平面,平g至 LSB之步驟。 5.如申請專利範圍第3項所述之音頻訊號壓縮方法,其 理位元數之值係經由實驗獲得。 μ处 法的音葙醢 壓縮方法,包含下列步驟·· ~一'一~ 輸入位元串流:將經申請專利範圍第丨項所述之音頻壓縮方 21 1279693 法所壓縮之音頻訊號以位元串流方式輸入; 進行算術解碼; 進行位元壓縮解碼:將Μ筆單位時間⑴的經算術解碼的該音 頻訊號以碼塊的方式進行位元壓縮解碼; 進行解量化; 進行Ν階逆一維離散小波轉換·,以及 還原訊號。 7·,申請專利範圍第6項所述之音頻訊號解壓縮方法,其中當 遠使用者選擇壓縮料,該進行位元麵解碼之步驟,即依 據適應性調整之壓縮方法所設定之處理位元數進行位元平 面逆處理。 8· 請專利範圍第丨項所述之音頻訊號解壓縮方法,其中進 二量化之步驟係為純量量化,可取16高位元、12高位元、8 高位元…等Ν位高位元。 9· 一種音頻壓縮/解壓縮電路裝置,包含: 立一緩衝§己憶體’用以儲存由外部所輸入之一單位時間⑴内 ,音頻訊號、經過一維離散小波轉換或一維逆離散小波轉換的 音頻訊號; a一維離散小波轉換器,與該緩衝記憶體相互配合,以完成 N次的一維離散小波轉換,或N次的一維逆離散小波轉換; 立二位元編碼單元,將該Μ筆經過N次一維離散小波轉換的 曰頻汛號形成碼塊後以位元平面方式編碼,或進行逆位元平面 方式解碼; ,、Γΐ術壓縮單元,將該經過位元平面方式編碼之該音頻訊 進行算術壓縮,或將壓縮完成的音頻訊號進行算術解壓; 以及 ,將該壓縮完成的音頻訊號加以輪 /為位元串流’或將壓縮完成的音頻訊號之位元串流送至該算 術壓縮單元進行解壓縮。 22 % 1279693 10·如申請專利範圍第9項所述之音頻壓縮/解壓縮電路裝置,其 中更包含: 一減少取樣單元,用以將該單位時間(t)内的音頻訊號所經 過取樣後的資料,以減少取樣的方式,將取樣的資料佑 之取樣率進行減少取樣;以及 ^ 一增加取樣單元,用以將儲存於該緩衝記憶體當中的經過 N次一維逆離散小波轉換的音頻訊號加以增加取樣,以還原為 該音頻訊號。 ^ 11·如申請專利範圍第9項所述之音頻壓縮/解壓縮電路裝置,其 中該緩衝記憶體係包含於該位元編碼單元當中。、^ 12. 如申請專利範圍第9項所述之音頻壓縮/解壓縮電路裝置,其 中該位元編碼單元更包含一記憶體,用以儲存該 次一維離散小波轉換的音頻訊號。 13. =申請專利範圍第9項所述之音頻麵/解壓縮電路 盆 人波轉換器’其介於該音頻訊號輸 的一維逆離散小波轉換,藉以丄.ί 14. ίΓί專;項f1G項所述之音頻壓縮/解壓縮電 減少取樣維離散小波轉換器,其介於該 用以執行第」=s 口取才水單70 ’和該緩衝記憶體之間’係 離散小iiT'f離散小波轉換或最後一次的一維逆 & 一一電記憶體的容量需求。 的音頻訊Si工所輸入之-單位時間_ _—3離今,U政小波衛奐之該音頻訊號; 兀扁馬單70 ’將该M筆經過N次一維離散小波轉換的 23 1279693 音頻訊號形成碼塊後以位元平面方式編碼· - 壓縮單元,將飾職元平財式編狀該音頻訊 號進行算術壓縮;以及 -先進先出單元(FIFO),將該壓、缩完成的音頻訊號加以輸 出為位元串流。 16,如申請專利範圍第15項所述之音麵縮電路裝置,其中更 包含-減少取樣單元,用以將該單位時間⑴内的音頻訊號所 經過取樣後的資料’以減少取樣的方式,將取樣的資料依據 設定之取樣率進行減少取樣。 17*如申明專利乾圍弟15項所述之音頻壓縮電路裝置,其中該 缓衝g己憶體係包含於該位元編碼單元當中。 18·如申請專^範圍第15項所述之音頻^縮電路裝置,其中該 位兀編碼單元更包含一記憶體,用以儲存該M筆經過N次 一維離散小波轉換的音頻訊號。 19·如申請,利範圍第15項所述之音頻壓縮電路裝置,其中更 包含一第二一維離散小波轉換器,其介於該音頻訊號輸入與 該緩巧記憶體之間,係用以執行第一次的一維離散小波轉 換’藉以降低該緩衝記憶體的容量需求。 20·如申睛專利範圍第15項或第μ項所述之音頻壓縮電路裝 置’ ΐ中更包含一第二一維離散小波轉換器,其介於該減少 取樣單元和該緩衝記憶體之間,係用以執行第一次的一維離 散小波轉換,藉以降低該緩衝記憶體的容量需求。 21· —種配合申請專利範圍第一項之音頻壓縮電路裝置之音頻 解壓縮電路裝置,包含: 一先進先出單元(FIFO),接收壓縮完成的音頻訊號所形成 之位元; 一要立ji壓縮完成的音頻訊號進行算術解壓 縮; ~ 一位元解碼單元,將該經過算術解壓縮之音頻訊號,解出 24 1279693 頻 以位元平面方式編碼之Μ筆經過N次一維離散小波轉換的音 訊號; 一維逆離散小波轉換器,用以將該Μ筆經過N次一維離散 小波轉換的音頻訊號進行Ν次的一維逆離散小波轉換;以及月 一緩衝記憶體,用以儲存經過^^次一維逆離散小波轉換 音頻訊號,並加以輪出。 22·如申請專利範圍第21項所述之音頻解壓縮電路裝置,其 更包含一增加取樣單元,用以將儲存於該缓衝記憶體當 經過Ν次一維逆離散小波轉換的音頻訊號加以增加取樣,以 輸出該音頻訊號。 Μ 21項所述之音頻解壓縮電路裝置,其中 以緩衝圮憶體係包含於該位元解碼各 、 24·如申請專利範圍镇91 + 該位元解解,路裝置,其中 次-維逆離散小波二的音=號用以齡該Μ筆經過Ν 25. 如申请專利範圍第21項戶 更包含-第二—維逆離解壓縮電路裝置,其中 入/輸出與該㈣之n ’其介於該音頻訊號輪 離散小轉_ 一維逆 26. 如申請專利範圍第21項戋第。思體的各直需求。 置,其=更包含-第二解壓縮電路裝 少取樣單兀和該緩衝記憶體之 /轉換斋,其介於該減 維逆離散小波轉換,藉以降二以執行最後一次的一 ^亥緩衝記憶體的容量需求。 25The bit-level _---the first one has the number of bits in the bit-plane, and the number of non-Willie-bits is set to 0', and the bit plane is performed. coding. 4. The audio signal compression method according to claim 3, wherein the plane adjustment further comprises the step of flattening the bit plane of the number of processing bits to LSB. 5. The audio signal compression method of claim 3, wherein the value of the number of parity bits is obtained experimentally. The sound compression method of the μ method includes the following steps: · 1 ' input bit stream: the audio signal compressed by the audio compression method 21 1279693 method described in the third paragraph of the patent application is in place. Meta-streaming mode input; performing arithmetic decoding; performing bit compression decoding: bit-compressing and decoding the audio signal of the arithmetically decoded audio signal of unit time (1) in a code block manner; performing dequantization; Dimensional discrete wavelet transform ·, and restore signals. 7. The method for decompressing the audio signal according to item 6 of the patent application scope, wherein when the remote user selects the compressed material, the step of decoding the bit surface, that is, the processing bit set according to the adaptive adjustment compression method The number is inversely processed by the bit plane. 8. The audio signal decompression method described in the third paragraph of the patent scope, wherein the second quantization step is scalar quantization, and may take 16 high bits, 12 high bits, 8 high bits, etc. 9. An audio compression/decompression circuit device comprising: a buffered § memory element for storing one unit time (1) input by an external, an audio signal, a one-dimensional discrete wavelet transform or a one-dimensional inverse discrete wavelet Converted audio signal; a one-dimensional discrete wavelet converter, cooperate with the buffer memory to complete N-dimensional one-dimensional discrete wavelet transform, or N-order one-dimensional inverse discrete wavelet transform; vertical two-bit coding unit, The Μ pen is subjected to N-dimensional one-dimensional discrete wavelet transform 曰 汛 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成 形成The audio signal encoded by the method is subjected to arithmetic compression, or the compressed audio signal is subjected to arithmetic decompression; and the compressed audio signal is rounded/translated as a bit stream or a bit string of the compressed audio signal is compressed. The stream is sent to the arithmetic compression unit for decompression. The audio compression/decompression circuit device of claim 9, further comprising: a reduced sampling unit for sampling the audio signal in the unit time (t) Data, in order to reduce the sampling method, reduce the sampling rate of the sampled data; and increase the sampling unit for the N-dimensional one-dimensional inverse discrete wavelet converted audio signal stored in the buffer memory Add a sample to restore to the audio signal. The audio compression/decompression circuit device of claim 9, wherein the buffer memory system is included in the bit coding unit. The audio compression/decompression circuit device of claim 9, wherein the bit coding unit further comprises a memory for storing the audio signal of the one-dimensional discrete wavelet transform. 13. = The audio surface/decompression circuit basin human wave converter described in the scope of claim 9 is a one-dimensional inverse discrete wavelet transform of the audio signal input, whereby 丄.ί 14. ίΓί special; item f1G The audio compression/decompression electric reduction sampling dimension discrete wavelet converter described in the item, which is used to perform the "== s mouth water meter 70' and the buffer memory between the system and the discrete iiT'f Discrete wavelet transform or the last one-dimensional inverse & one-time electrical memory capacity requirements. The audio input Si machine input - unit time _ _ - 3 from today, U Zheng Xiaobo defending the audio signal; 兀 flat horse single 70 'the M pen after N times one-dimensional discrete wavelet transform 23 1279693 audio After the signal is formed into a code block, it is coded in a bit-plane manner. - The compression unit is used to perform arithmetic compression on the audio signal, and the first-in-first-out unit (FIFO) is used to compress the compressed audio. The signal is output as a bit stream. The sound surface reduction circuit device of claim 15, further comprising: a reduction sampling unit for reducing the sampling by the sampled data of the audio signal in the unit time (1), The sampled data is reduced for sampling according to the set sampling rate. 17* The audio compression circuit device of claim 15, wherein the buffering system is included in the bit coding unit. 18. The audio compression circuit device of claim 15, wherein the code encoding unit further comprises a memory for storing the M signal through N times of one-dimensional discrete wavelet transform. The audio compression circuit device of claim 15, further comprising a second one-dimensional discrete wavelet converter interposed between the audio signal input and the buffer memory. Perform the first one-dimensional discrete wavelet transform 'to reduce the capacity requirement of the buffer memory. 20. The audio compression circuit device as described in claim 15 or claim 19, further comprising a second one-dimensional discrete wavelet converter interposed between the reduction sampling unit and the buffer memory Used to perform the first one-dimensional discrete wavelet transform, thereby reducing the capacity requirement of the buffer memory. An audio decompression circuit device for an audio compression circuit device of the first application of the patent application scope includes: a first in first out unit (FIFO), which receives a bit formed by the compressed audio signal; The compressed audio signal is subjected to arithmetic decompression; ~ a one-bit decoding unit that decodes the arithmetically decompressed audio signal to solve the 24 1279693 frequency coded in a bit-plane manner by N-dimensional one-dimensional discrete wavelet transform Audio signal; a one-dimensional inverse discrete wavelet converter for performing one-dimensional inverse discrete wavelet transform of the N-dimensional one-dimensional discrete wavelet converted audio signal; and a monthly buffer memory for storing ^^ One-dimensional inverse discrete wavelet transforms the audio signal and turns it out. The audio decompression circuit device of claim 21, further comprising an additional sampling unit for converting the audio signal stored in the buffer memory by one-dimensional inverse discrete wavelet transform Increase the sampling to output the audio signal. Μ The audio decompression circuit device of claim 21, wherein the buffer memory system is included in the bit decoding each, 24 · as claimed in the patent range 91 + the bit solution, the road device, wherein the sub-dimensional inverse discrete The sound of the second wave of the wavelet is used for the age of the pen. 25. For example, the 21st item of the patent application includes a second-dimensional inverse decompression circuit device, in which the input/output and the (n) n 'between The audio signal wheel discrete small turn _ one-dimensional inverse 26. As claimed in the scope of the 21st item. The direct needs of the body. Set, the = more include - the second decompression circuit is equipped with a small sample unit and the buffer memory / conversion fast, which is between the reduced dimension inverse discrete wavelet transform, thereby reducing the second to perform the last one The capacity requirements of the memory. 25
TW94102404A 2005-01-27 2005-01-27 Method and device of audio compression TWI279693B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW94102404A TWI279693B (en) 2005-01-27 2005-01-27 Method and device of audio compression

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW94102404A TWI279693B (en) 2005-01-27 2005-01-27 Method and device of audio compression

Publications (2)

Publication Number Publication Date
TW200627190A TW200627190A (en) 2006-08-01
TWI279693B true TWI279693B (en) 2007-04-21

Family

ID=38645494

Family Applications (1)

Application Number Title Priority Date Filing Date
TW94102404A TWI279693B (en) 2005-01-27 2005-01-27 Method and device of audio compression

Country Status (1)

Country Link
TW (1) TWI279693B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI586177B (en) * 2011-01-28 2017-06-01 艾艾歐有限公司 Adaptive bit rate control based on scenes

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI410136B (en) * 2008-08-08 2013-09-21 Innolux Corp Data compression method and video processing system and display using thereof

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI586177B (en) * 2011-01-28 2017-06-01 艾艾歐有限公司 Adaptive bit rate control based on scenes

Also Published As

Publication number Publication date
TW200627190A (en) 2006-08-01

Similar Documents

Publication Publication Date Title
CN100442850C (en) Method and system for lossless data encoding and decoding
JP5859504B2 (en) Synthesis filter bank, filtering method and computer program
JP4742087B2 (en) Double transform coding of audio signals
US7689427B2 (en) Methods and apparatus for implementing embedded scalable encoding and decoding of companded and vector quantized audio data
CN107025909B (en) Energy lossless encoding method and device and energy lossless decoding method and device
JP4081447B2 (en) Apparatus and method for encoding time-discrete audio signal and apparatus and method for decoding encoded audio data
US7196641B2 (en) System and method for audio data compression and decompression using discrete wavelet transform (DWT)
WO2023241205A1 (en) Audio processing method and apparatus, and electronic device, computer-readable storage medium and computer program product
JP2012118546A (en) Fast lattice vector quantization
CN115116451B (en) Audio decoding, encoding method, device, electronic device and storage medium
CN104718572A (en) Audio encoding method and device, audio decoding method and device, and multimedia device employing same
CN1905010B (en) Device and method for encoding audio data and device and method for decoding audio data
JP2004531151A (en) Method and apparatus for processing time discrete audio sample values
JP4978539B2 (en) Encoding apparatus, encoding method, and program.
JP4800645B2 (en) Speech coding apparatus and speech coding method
JP3487250B2 (en) Encoded audio signal format converter
TWI279693B (en) Method and device of audio compression
JP2001044847A (en) Lossless encoding method, lossless decoding method, these devices and their respective program recording media
CN117219095A (en) Audio encoding method, audio decoding method, device, equipment and storage medium
WO2012006942A1 (en) Audio data encoding method and device
TWI246256B (en) Apparatus for audio compression using mixed wavelet packets and discrete cosine transformation
JP2002091497A (en) Audio signal encoding method, decoding method, and program storage medium for executing those methods
CN113314131A (en) Multistep audio object coding and decoding method based on two-stage filtering
Auristin et al. New Ieee Standard For Advanced Audio Coding In Lossless Audio Compression: A Literature Review
JP2006337508A (en) Method and circuit for data compression, and circuit for data expansion

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees