TWI456567B - Technique for providing arbitrary shaping of the temporal noise envelope in a spectral-domain coding system without side information - Google Patents
- Publication number
- TWI456567B (application number TW096129984A)
- Authority
- TW
- Taiwan
- Prior art keywords
- frequency domain
- signal
- time domain
- discrete time
- magnitude
- Prior art date
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/03—Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/665—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using psychoacoustic properties of the ear, e.g. masking effect
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Transducers For Ultrasonic Waves (AREA)
- Burglar Alarm Systems (AREA)
Claims (21)
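- A digital audio encoding method for encoding a discrete time-domain signal, the method employing quantization of a frequency-domain representation of the discrete time-domain signal and comprising the steps of: obtaining, in the frequency domain, a measure of the frequency-domain quantization error; filtering, in the frequency domain, the measure of the quantization error to produce a filtered measure of the quantization error; and applying, in the frequency domain, the filtered measure of the quantization error as a feedback signal to the frequency-domain representation of the discrete time-domain signal prior to frequency-domain quantization, whereby the filter parameters of the filtering step affect the temporal shaping of the quantization noise when the quantized frequency-domain representation of the discrete time-domain signal is inverse-transformed from the frequency domain back to the time domain.
- The method of claim 1, wherein the filtering step filters the measure of the frequency-domain quantization error for each of one or more frequency bins or groups of frequency bins to produce a filtered measure of the quantization error, whereby the filtered measure of the quantization error can vary from segment to segment across the full spectrum of the frequency-domain representation of the discrete time-domain signal.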
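- The method of claim 1, wherein the filter parameters are dynamically controllable.
- The method of claim 2, wherein the filter parameters are dynamically controllable.
- The method of claim 3, wherein the filter parameters are dynamically controllable in response to a measure of the discrete time-domain signal.
- The method of claim 4, wherein the filter parameters are dynamically controllable in response to a measure of the discrete time-domain signal.
- The method of claim 5, wherein the measure of the discrete time-domain signal is obtained by a process that includes computing the envelope of the temporal signal, taking its reciprocal, and computing the inverse DFT of the result.
- The method of claim 6, wherein the measure of the discrete time-domain signal is obtained by a process that includes computing the envelope of the temporal signal, taking its reciprocal, and computing the inverse DFT of the result.
- The method of claim 3, wherein the filter parameters are dynamically controllable in response to a measure of a frequency-domain representation of the discrete time-domain signal.
- The method of claim 4, wherein the filter parameters are dynamically controllable in response to a measure of a frequency-domain representation of the discrete time-domain signal.
- The method of claim 9, wherein the measure of the frequency-domain representation of the discrete time-domain signal is obtained by a process that includes a linear predictive coding (LPC) calculation.
- The method of claim 10, wherein the measure of the frequency-domain representation of the discrete time-domain signal is obtained by a process that includes a linear predictive coding (LPC) calculation.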
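- The method of any one of claims 1 to 12, wherein the filter parameters are also responsive to a temporal masking model.
- The method of claim 13, wherein the temporal masking model seeks to provide a selected temporal shaping of the quantization noise, and/or wherein the temporal masking model seeks to move the temporal quantization noise, within a transform block, from relatively quiet segments of the discrete time-domain signal to relatively loud segments.
- The digital audio encoding method of any one of claims 1 to 12 and 14, further comprising: encoding the quantized frequency-domain representation of the discrete time-domain signal to produce an encoded bitstream.
- The digital audio encoding method of claim 13, further comprising: encoding the quantized frequency-domain representation of the discrete time-domain signal to produce an encoded bitstream.
- A computer program stored on a computer-readable medium which, when executed by a computer, causes the computer to perform all steps of the method of any one of claims 1 to 16.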
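- A frequency-domain noise-feedback quantizer for use in a digital audio encoder, comprising: a first combiner that combines a frequency-domain signal derived from a time-domain audio signal with a frequency-domain noise feedback signal to produce a frequency-domain quantizer input signal; a quantizer that quantizes the frequency-domain quantizer input signal to produce a frequency-domain quantizer output signal; a second combiner that combines the frequency-domain quantizer input signal with the frequency-domain quantizer output signal to produce a quantization error signal; and a noise feedback filter that filters the frequency-domain quantization error signal to produce the frequency-domain noise feedback signal.
- The quantizer of claim 18, further comprising: a frequency-domain filter parameter controller that dynamically controls the parameters of the noise feedback filter.
- The quantizer of claim 19, wherein the filter parameter controller controls the noise feedback filter parameters in response to one or more measures of a time-domain audio signal, the time-domain signal being the source of the frequency-domain signal.
- The quantizer of any one of claims 18 to 20, wherein the order of the noise feedback filter is in the range of 10 to 20.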
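The signal flow recited in the quantizer claim (first combiner, quantizer, second combiner, noise feedback filter) can be pictured with a small sketch. The Python below is a minimal illustration under stated assumptions, not the patented implementation: it assumes a real-valued spectrum, a uniform rounding quantizer with step size `step`, and an FIR feedback filter whose taps `coeffs` are supplied by the caller. The function name `noise_feedback_quantize` and the sign convention (error = quantizer input minus quantizer output) are hypothetical choices that the claims do not specify.

```python
import numpy as np


def noise_feedback_quantize(spectrum, coeffs, step):
    """Quantize spectral coefficients bin by bin with error feedback.

    spectrum: real-valued frequency-domain coefficients (assumption).
    coeffs:   FIR taps c_1..c_p of the feedback filter (assumption).
    step:     step size of a uniform rounding quantizer (assumption).
    Returns (quantized spectrum, per-bin quantization error).
    """
    spectrum = np.asarray(spectrum, dtype=float)
    coeffs = np.asarray(coeffs, dtype=float)
    order = len(coeffs)
    quantized = np.zeros_like(spectrum)
    # Leading zeros stand in for the (absent) error history before bin 0.
    errors = np.zeros(len(spectrum) + order)
    for k, x in enumerate(spectrum):
        # Past errors e[k-1], ..., e[k-order], most recent first.
        history = errors[k:k + order][::-1]
        # Noise feedback filter: runs along the frequency axis.
        feedback = np.dot(coeffs, history)
        # First combiner: spectrum plus feedback gives the quantizer input.
        u = x + feedback
        # Quantizer: uniform rounding to the nearest multiple of step.
        q = step * np.round(u / step)
        quantized[k] = q
        # Second combiner: quantizer input minus quantizer output.
        errors[k + order] = u - q
    return quantized, errors[order:]


# Usage example with arbitrary illustrative values.
rng = np.random.default_rng(0)
example_spectrum = rng.normal(size=32)
q, e = noise_feedback_quantize(example_spectrum, [0.9, -0.4], step=0.25)
print(q[:4], e[:4])
```

With this convention the net noise added to each bin is the raw error convolved along frequency with the sequence (-1, c_1, ..., c_p). Since filtering along the frequency axis corresponds, roughly, to weighting along the time axis after the inverse transform, the taps set the temporal envelope of the noise, which is the effect the first claim attributes to the filter parameters. The exact correspondence depends on the transform used, which this sketch leaves open.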
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US83809406P | 2006-08-15 | 2006-08-15 | |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW200818123A (en) | 2008-04-16 |
| TWI456567B (zh) | 2014-10-11 |
Family
ID=38984075
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW096129984A TWI456567B (zh) | 2006-08-15 | 2007-08-14 | Technique for providing arbitrary shaping of the temporal noise envelope in a spectral-domain coding system without side information |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US8706507B2 (zh) |
| EP (1) | EP2054882B1 (zh) |
| JP (1) | JP5096468B2 (zh) |
| CN (1) | CN101501761B (zh) |
| AT (1) | ATE496365T1 (zh) |
| DE (1) | DE602007012116D1 (zh) |
| TW (1) | TWI456567B (zh) |
| WO (1) | WO2008021247A2 (zh) |
Families Citing this family (28)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8335684B2 (en) * | 2006-07-12 | 2012-12-18 | Broadcom Corporation | Interchangeable noise feedback coding and code excited linear prediction encoders |
| CN101501761B (zh) | 2006-08-15 | 2012-02-08 | Dolby Laboratories Licensing Corporation | Arbitrary shaping of the temporal noise envelope without side information |
| AT504164B1 (de) * | 2006-09-15 | 2009-04-15 | Technische Universität Graz | Device for noise suppression in an audio signal |
| US8190440B2 (en) * | 2008-02-29 | 2012-05-29 | Broadcom Corporation | Sub-band codec with native voice activity detection |
| JP4603062B2 (ja) * | 2008-06-26 | 2010-12-22 | Kyocera Corporation | Signal converter, wireless signal transmitting system, and wireless signal receiving system |
| FR2938688A1 (fr) * | 2008-11-18 | 2010-05-21 | France Telecom | Coding with noise shaping in a hierarchical coder |
| PT2491553T (pt) | 2009-10-20 | 2017-01-20 | Fraunhofer Ges Forschung | Audio encoder, audio decoder, method for encoding audio information, method for decoding audio information, and computer program using an iterative interval size reduction |
| PL2524372T3 (pl) | 2010-01-12 | 2015-08-31 | Fraunhofer Ges Forschung | Audio encoder, audio decoder, method for encoding and decoding audio information, and computer program obtaining a context sub-region value based on a norm of previously decoded spectral values |
| US9530419B2 (en) | 2011-05-04 | 2016-12-27 | Nokia Technologies Oy | Encoding of stereophonic signals |
| US8891775B2 (en) * | 2011-05-09 | 2014-11-18 | Dolby International Ab | Method and encoder for processing a digital stereo audio signal |
| US9905236B2 (en) | 2012-03-23 | 2018-02-27 | Dolby Laboratories Licensing Corporation | Enabling sampling rate diversity in a voice communication system |
| RU2625444C2 (ru) * | 2013-04-05 | 2017-07-13 | Dolby International AB | Audio processing system |
| EP2830058A1 (en) | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Frequency-domain audio coding supporting transform length switching |
| EP2887350B1 (en) * | 2013-12-19 | 2016-10-05 | Dolby Laboratories Licensing Corporation | Adaptive quantization noise filtering of decoded audio data |
| CN111312265B (zh) * | 2014-01-15 | 2023-04-28 | Samsung Electronics Co., Ltd. | Apparatus and method for determining a weighting function for quantizing linear predictive coding coefficients |
| MX367544B (es) | 2014-02-14 | 2019-08-27 | Ericsson Telefon Ab L M | Comfort noise generation |
| US9576589B2 (en) * | 2015-02-06 | 2017-02-21 | Knuedge, Inc. | Harmonic feature processing for reducing noise |
| EP3649640A1 (en) * | 2017-07-03 | 2020-05-13 | Dolby International AB | Low complexity dense transient events detection and coding |
| US10044367B1 (en) * | 2017-08-08 | 2018-08-07 | Intel Corporation | Arbitrary noise shaping transmitter with receive band notches |
| EP3483878A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder supporting a set of different loss concealment tools |
| EP3483882A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Controlling bandwidth in encoders and/or decoders |
| EP3483880A1 (en) * | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal noise shaping |
| EP3483883A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio coding and decoding with selective postfiltering |
| EP3483884A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Signal filtering |
| EP3483886A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Selecting pitch lag |
| EP3483879A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analysis/synthesis windowing function for modulated lapped transformation |
| WO2019091576A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits |
| US11295750B2 (en) * | 2018-09-27 | 2022-04-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for noise shaping using subspace projections for low-rate coding of speech and audio |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5487086A (en) * | 1991-09-13 | 1996-01-23 | Comsat Corporation | Transform vector quantization for adaptive predictive coding |
| US6363338B1 (en) * | 1999-04-12 | 2002-03-26 | Dolby Laboratories Licensing Corporation | Quantization in perceptual audio coders with compensation for synthesis filter noise spreading |
| TW200423028A (en) * | 2002-12-23 | 2004-11-01 | Arbitron Inc | Systems and methods for identifying and encoding audio data |
| TWI226602B (en) * | 2001-04-13 | 2005-01-11 | Dolby Lab Licensing Corp | High quality time-scaling and pitch-scaling of audio signals |
| TW200504683A (en) * | 2003-05-08 | 2005-02-01 | Dolby Lab Licensing Corp | Improved audio coding systems and methods using spectral component coupling and spectral component regeneration |
| US20060074693A1 (en) * | 2003-06-30 | 2006-04-06 | Hiroaki Yamashita | Audio coding device with fast algorithm for determining quantization step sizes based on psycho-acoustic model |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH03201715A (ja) * | 1989-12-28 | 1991-09-03 | Sony Corp | Noise shaping circuit |
| JP3010663B2 (ja) * | 1989-12-28 | 2000-02-21 | Sony Corporation | Noise shaping circuit |
| US5206884A (en) * | 1990-10-25 | 1993-04-27 | Comsat | Transform domain quantization technique for adaptive predictive coding |
| US5623577A (en) * | 1993-07-16 | 1997-04-22 | Dolby Laboratories Licensing Corporation | Computationally efficient adaptive bit allocation for encoding method and apparatus with allowance for decoder spectral distortions |
| TW324762B (en) | 1996-07-15 | 1998-01-11 | Tokyo Electric Power Co | Manufacturing method for concrete sections |
| US6415251B1 (en) * | 1997-07-11 | 2002-07-02 | Sony Corporation | Subband coder or decoder band-limiting the overlap region between a processed subband and an adjacent non-processed one |
| US6446037B1 (en) * | 1999-08-09 | 2002-09-03 | Dolby Laboratories Licensing Corporation | Scalable coding method for high quality audio |
| JP3681105B2 (ja) * | 2000-02-24 | 2005-08-10 | Alpine Electronics, Inc. | Data processing system |
| US7171355B1 (en) * | 2000-10-25 | 2007-01-30 | Broadcom Corporation | Method and apparatus for one-stage and two-stage noise feedback coding of speech and audio signals |
| US7512535B2 (en) * | 2001-10-03 | 2009-03-31 | Broadcom Corporation | Adaptive postfiltering methods and systems for decoding speech |
| KR100984637B1 (ko) * | 2002-01-25 | 2010-10-05 | NXP B.V. | Method and apparatus for removing quantization noise |
| JP2006047561A (ja) * | 2004-08-03 | 2006-02-16 | Matsushita Electric Ind Co Ltd | Audio signal encoding device and audio signal decoding device |
| US8332228B2 (en) * | 2005-04-01 | 2012-12-11 | Qualcomm Incorporated | Systems, methods, and apparatus for anti-sparseness filtering |
| CN101501761B (zh) | 2006-08-15 | 2012-02-08 | Dolby Laboratories Licensing Corporation | Arbitrary shaping of the temporal noise envelope without side information |
| TWM324762U (en) | 2007-05-11 | 2008-01-01 | Wan-Chen Jou | Externally connected type condensed water atomizer of air conditioner |
2007
- 2007-08-10: CN application CN200780030179.3A (CN101501761B), status: Active
- 2007-08-10: AT application AT07836718T (ATE496365T1), status: IP Right Cessation
- 2007-08-10: US application US12/310,124 (US8706507B2), status: Active
- 2007-08-10: EP application EP07836718A (EP2054882B1), status: Active
- 2007-08-10: WO application PCT/US2007/017811 (WO2008021247A2), status: Ceased
- 2007-08-10: DE application DE602007012116T (DE602007012116D1), status: Active
- 2007-08-10: JP application JP2009524635A (JP5096468B2), status: Active
- 2007-08-14: TW application TW096129984A (TWI456567B), status: Active
Also Published As
| Publication number | Publication date |
|---|---|
| JP5096468B2 (ja) | 2012-12-12 |
| WO2008021247A9 (en) | 2008-07-10 |
| WO2008021247A2 (en) | 2008-02-21 |
| JP2010500631A (ja) | 2010-01-07 |
| TW200818123A (en) | 2008-04-16 |
| US20100094637A1 (en) | 2010-04-15 |
| DE602007012116D1 (de) | 2011-03-03 |
| EP2054882B1 (en) | 2011-01-19 |
| WO2008021247A3 (en) | 2008-04-17 |
| CN101501761A (zh) | 2009-08-05 |
| ATE496365T1 (de) | 2011-02-15 |
| US8706507B2 (en) | 2014-04-22 |
| EP2054882A2 (en) | 2009-05-06 |
| CN101501761B (zh) | 2012-02-08 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
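| TWI456567B (zh) | Technique for providing arbitrary shaping of the temporal noise envelope in a spectral-domain coding system without side information | |
| RU2012120850A (ru) | Audio encoder and decoder | |
| TWI578308B (zh) | Coding of spectral coefficients of a spectrum of an audio signal | |
| US10600428B2 (en) | Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal | |
| CN1926608A (zh) | Device and method for processing a multi-channel signal | |
| TWI536369B (zh) | Low-frequency emphasis for LPC-based coding in the frequency domain | |
| CN117316168A (zh) | Audio encoder and method for encoding an audio signal | |
| JP7003253B2 (ja) | Controlling bandwidth in encoders and/or decoders | |
| TW201724087A (zh) | Apparatus for coding the envelope of a signal and apparatus for decoding it | |
| JP2021502592A (ja) | Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters | |
| JP6730391B2 (ja) | Method for estimating noise in an audio signal, noise estimator, audio encoder, audio decoder, and system for transmitting audio signals | |
| KR100848370B1 (ko) | Audio coding | |
| CN105122358A (zh) | Apparatus and method for processing an encoded signal, and encoder and method for generating an encoded signal | |
| JP2012519309A (ja) | Quantization for audio coding | |
| JP3630082B2 (ja) | Audio signal encoding method and apparatus therefor | |
| JP2001148632A (ja) | Encoding device, encoding method, and recording medium therefor | |
| RU2019122302A (ru) | Audio encoder and decoder | |
| HK1233759A1 (zh) | Method for estimating noise in an audio signal, noise estimator, audio encoder, audio decoder, and system for transmitting audio signals | |
| JP2005197989A (ja) | Exponentiation circuit, quantization circuit, and method | |
| HK1233759B (zh) | Method for estimating noise in an audio signal, noise estimator, audio encoder, audio decoder, and system for transmitting audio signals | |
| JP2002116799A (ja) | Audio signal encoding device | |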