TWI469136B - 在一頻譜域中用以處理已解碼音訊信號之裝置及方法 - Google Patents
在一頻譜域中用以處理已解碼音訊信號之裝置及方法 Download PDFInfo
- Publication number
- TWI469136B TWI469136B TW101104349A TW101104349A TWI469136B TW I469136 B TWI469136 B TW I469136B TW 101104349 A TW101104349 A TW 101104349A TW 101104349 A TW101104349 A TW 101104349A TW I469136 B TWI469136 B TW I469136B
- Authority
- TW
- Taiwan
- Prior art keywords
- audio signal
- signal
- time
- decoder
- filter
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims description 85
- 238000000034 method Methods 0.000 title claims description 30
- 238000012545 processing Methods 0.000 title claims description 30
- 230000003595 spectral effect Effects 0.000 title claims description 29
- 238000001228 spectrum Methods 0.000 claims description 26
- 238000001914 filtration Methods 0.000 claims description 14
- 230000007774 longterm Effects 0.000 claims description 13
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 10
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 230000005284 excitation Effects 0.000 claims description 6
- 230000015572 biosynthetic process Effects 0.000 claims description 4
- 238000003786 synthesis reaction Methods 0.000 claims description 4
- 230000002238 attenuated effect Effects 0.000 claims 2
- 230000001131 transforming effect Effects 0.000 claims 2
- 230000004044 response Effects 0.000 description 23
- 230000006870 function Effects 0.000 description 11
- 238000012805 post-processing Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 9
- 239000003623 enhancer Substances 0.000 description 8
- 238000005070 sampling Methods 0.000 description 8
- 230000003044 adaptive effect Effects 0.000 description 6
- 230000009897 systematic effect Effects 0.000 description 4
- 238000012952 Resampling Methods 0.000 description 3
- 230000001934 delay Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000006854 communication Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000007176 multidirectional communication Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000000411 transmission spectrum Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/028—Noise substitution, i.e. substituting non-tonal spectral components by noisy source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K11/00—Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
- G10K11/16—Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/03—Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
- G10L19/107—Sparse pulse excitation, e.g. by using algebraic codebook
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/13—Residual excited linear prediction [RELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Algebra (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Mathematical Physics (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
- Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
Description
本發明係有關於音訊處理,及更明確言之,係有關於用於品質提升的已解碼音訊信號之處理。
晚近已經達成有關切換式音訊編解碼器的進一步發展。高品質及低位元率的切換式音訊編解碼器乃統一語音與音訊編碼構思(USAC構思)。常見前處理/後處理包含:MPEG環繞(MPEGs)功能單元其處置立體聲或多聲道處理,及加強SBR(eSBR)單元其處理於輸入信號中較高音頻的參數表示型態。接著有二分支,一個分支包含高階音訊編碼(AAC)工具路徑,及另一個分支包含以線性預測編碼(LP或LPC定義域)為基礎的路徑,其又轉而成為LPC殘差之頻域表示型態或時域表示型態。於量化及算術編碼後,AAC及LPC二者的全部傳輸頻譜係表示於MDCT定義域。時域表示型態使用ACELP激勵編碼方案。編碼器及解碼器之方塊圖係給定於ISO/IEC CD 23003-3之第1.1圖及第1.2圖。
切換式音訊編解碼器之一額外實例為如3GPP TS 26.290 V10.0.0(2011-3)描述的擴充式適應多速率寬帶(AMR-WB+)編解碼器。AMR-WB+音訊編解碼器處理輸入訊框等於於內部取樣頻率Fs
為2048樣本。內部取樣頻率係限於12800至38400 Hz之範圍。2048樣本訊框係分裂成兩個臨界取樣的相等頻率頻帶。如此導致相對應於低頻(LF)頻帶及高頻(HF)頻帶的兩個1024樣本之超訊框。各個超訊框
係劃分為四個256樣本訊框。於內部取樣率取樣係經由使用可變取樣變換方案獲得,該方案係重新取樣輸入信號。然後低頻信號及高頻信號使用兩個不同辦法編碼:低頻信號係使用「核心」編碼器/解碼器基於切換式ACELP及變換編碼激勵(TCX)編碼與解碼。於ACELP模式中,使用標準AMR-WB編解碼器。高頻信號係利用頻寬延長(BWE)方法以相當少的位元(每個訊框16位元)編碼。AMR-WB編碼器包括前處理功能、LPC分析、開放回路搜尋功能、適應性碼簿搜尋功能、創新性碼簿搜尋功能、及記憶體更新。ACELP解碼器包含數項功能,諸如解碼適應性碼簿、解碼增益、解碼創新性碼簿、解碼ISP、長期預測濾波器(LTP濾波器)、組成性激勵功能、四個子訊框之ISP之內插、後處理、合成濾波器、解除強調及升頻取樣方塊來最終獲得語音輸出的低頻帶部分。語音輸出的高頻帶部分係藉使用HB增益指數、VAD旗標、及16 kHz隨機激勵而產生。此外,HB合成濾波器的使用係接著帶通濾波器。進一步細節請參考G.722.2之第3圖。
此一方案於AMR-WB+已藉執行單聲道低帶信號之後處理而予提升。參考第7、8及9圖例示說明於AMR-WB+之功能。第7圖例示說明音準加強器700、低通濾波器702、高通濾波器704、音準追蹤階段706及加法器708。該等方塊係連結如第7圖所示及係饋以解碼信號。
於低頻音準加強中,使用二頻帶分解,及適應性濾波只應用至低頻帶。如此導致總後處理,大部分係鎖定目標
於接近該合成語音信號之第一諧波之頻率。第7圖顯示二頻帶音準加強器之方塊圖。於較高分支中,解碼信號係藉高通濾波器704濾波來產生較高頻帶信號sH
。於較低分支中,解碼信號首先係透過音準加強器700處理,及然後經由低通濾波器702濾波來獲得較低頻帶後處理信號(sLEE
)。後處理解碼信號係經由該較低頻帶後處理信號與該較高頻帶信號相加獲得。音準加強器之目的係減低於該解碼信號中之諧波間雜訊,該項目的係藉第9圖第一行指示的具有轉移函式HE
之時變線性濾波器達成,及藉第9圖第二行之方程式描述。α乃控制諧波間衰減之係數。T為輸入信號(n
)之音準週期,及sLE
(n)為音準加強器之輸出信號。參數T及α係隨著時間改變,且係藉音準追蹤階段706以數值α=1給定,藉第9圖第二行之方程式描述的濾波器增益於頻率1/(2T)、3/(2T)、5/(2T)等亦即於DC(0 Hz)與諧波頻率1/T、3/T、5/T等間之中點係恰為零。當α趨近於零時,如第9圖第二行定義的由濾波器所產生的諧波間之衰減減少。當α為零時,濾波器無效用,且為全通。為了將後處理限於低頻區,加強信號sLE
係經低通濾波來產生信號sLEF
,該信號加至高通濾波信號sH
來獲得後處理合成信號sE
。
相當於第7圖之例示說明的另一組態係例示說明於第8圖,第8圖之組態免除高通濾波的需要。此點係就第9圖針對sE
的第三方程式解說。hLP
(n)為低通濾波器的脈衝響應,及hHP
(n)為互補高通濾波器的脈衝響應。然後,後處理信號sE(n)
係由第9圖的第三方程式給定。如此,後處理係相當於
從合成信號(n
)扣除已定標低通濾波長期誤差信號α.eLT
(n)。長期預測濾波器的轉移函式係給定如第9圖之末行指示。此種交替後處理組態係例示說明於第8圖。數值T係藉於各個子訊框所接收的閉路音準滯後給定(分量音準滯後係捨入至最近的整數)。執行檢查音準加倍的簡單追蹤。若於延遲T/2的標準化音準相關性係大於0.95,則值T/2係用作為用於後處理的新音準滯後。因數α係藉α=0.5gp
給定,限於α大於或等於零及小於或等於0.5。gp
為以0及1為界限的解碼音準增益。於TCX模式中,α值係設定為零。具有25係數的線性相位有限脈衝響應(FIR)低通濾波器係以約500赫茲之截止頻率使用。濾波器延遲為12樣本。上分支須導入相對應於在下分支處理延遲的延遲,來維持在執行減法前兩個分支之信號的時間排齊。於AMR-WB+中Fs=2x核心之取樣率。核心取樣率係等於12800赫茲。故截止頻率係等於500赫茲。業已發現特別係針對低延遲應用,由線性相位FIR低通濾波器所導入的12樣本濾波器延遲促成編碼/解碼方案之總延遲。於編碼/解碼鏈中其它位置有其它系統性延遲來源,FIR濾波器延遲與其它來源累積。
本發明之一目的係提供改良之音訊信號處理構思,該構思係更適用於即時應用或多向通訊景況,諸如行動電話景況。
此項目的係藉如申請專利範圍第1項之處理已解碼音訊信號之設備、或如申請專利範圍第15項之處理已解碼音
訊信號之方法、或如申請專利範圍第16項之電腦程式而予達成。
本發明係基於發現於已解碼信號之低音後濾波中的低通濾波器對總延遲的貢獻成問題而須減少。為了達成此項目的,已濾波音訊信號於時域係未經低通濾波,但於頻譜域經低通濾波,諸如QMF定義域或任何其它頻譜域,例如MDCT定義域、快速傅利葉變換(FFT)定義域等。業已發現從頻譜域變換至頻域,及例如變換至低解析度頻域,諸如QMF定義域可以低延遲執行,欲於頻譜域體現的濾波器之頻率選擇性,可藉只加權來自已濾波音訊信號之頻域表示型態的個別子帶信號而體現。因此頻率選擇特性之此種「影響」係經執行而無任何系統性延遲,原因在於子帶信號的乘法或加權運算不會遭致任何延遲。已濾波音訊信號及原先音訊信號之減法也係在頻譜域執行。又復,較佳係執行例如無論如何皆需要的額外操作,諸如頻譜帶複製解碼或立體聲或多聲道解碼係在一且同一QMF域額外地執行。頻時變換只在解碼鏈的末端執行來將最終產生的音訊信號帶回時域。如此,取決於應用用途,當不再要求於QMF域的額外處理操作時,藉減法器產生的結果音訊信號可就此變換回時域。但當解碼演算法於QMF域有額外處理操作時,則頻譜時間變換器並非連結至減法器輸出,反而係連結至最末頻域處理裝置之輸出。
較佳地,用以濾波已解碼音訊信號之濾波器為長期預測濾波器。又,較佳頻譜表示型態為QMF表示型態,額外
地較佳頻率選擇性為低通特性。
但與長期預測濾波器相異的任何其它濾波器、與QMF表示型態相異的任何其它頻譜表示型態、或與低通特性相異的任何其它頻率選擇性可用來獲得已解碼音訊信號之低延遲後處理。
後文將就附圖描述本發明之較佳實施例,附圖中:第1a圖為依據一實施例用以處理已解碼音訊信號之設備之方塊圖;第1b圖為用以處理已解碼音訊信號之設備之一較佳實施例之方塊圖;第2a圖顯示頻率選擇特性作為低通特性;第2b圖顯示加權係數及相聯結的子帶;第2c圖顯示時/頻變換器及隨後連結的用以施加加權係數至各個個別子帶信號之加權器之串級;第3圖顯示於第8圖例示說明之AMR-WB+中低通濾波器之頻率響應中的脈衝響應;第4圖顯示脈衝響應及頻率響應變換成QMF域;第5圖顯示用於32 QMF子帶實例之加權器的加權因數;第6圖顯示針對16 QMF頻帶之頻率響應及相聯結的16加權因數;第7圖顯示AMR-WB+之低頻音準加強器之方塊圖;第8圖顯示AMR-WB+之體現後處理組態;
第9圖顯示第8圖之體現之推衍;及第10圖顯示依據一實施例之長期預測濾波器之低延遲體現。
第1a圖例示說明用以處理線上已解碼音訊信號100之設備。線上已解碼音訊信號100係輸入濾波器102用以濾波該已解碼音訊信號來獲得線上已濾波音訊信號104。濾波器102係連結至時間頻譜變換器階段106,例示說明為用於已濾波音訊信號之106a及用於線上已解碼音訊信號100之106b兩個個別時間頻譜變換器。時間頻譜變換器階段106係經組配來將該音訊信號及該已濾波音訊信號變換成各自有多個子密碼有效期的相對應頻譜表示型態。於第1a圖中此係以雙線表示,指示方塊106a、106b的輸出包含多個個別子帶信號而非單一信號,如針對方塊106a、106b的輸入例示說明。
處理設備額外包含加權器108,係用以對方塊106a輸出的已濾波音訊信號執行頻率選擇性加權,執行方式係將個別子帶信號乘以個別加權係數來獲得線上已加權已濾波音訊信號110。
此外,設置減法器112。減法器係經組配來執行已加權已濾波音訊信號與由方塊106b所產生的該音訊信號之頻譜表示型態間之逐一子帶減法。
此外,設置頻譜時間變換器114。由方塊114所執行的頻時變換使得藉減法器112所產生的結果音訊信號或從該
結果音訊信號推衍得的信號係變換成時域表示型態而獲得線上已處理已解碼音訊信號116。
雖然第1a圖指示因時頻變換及加權的延遲係顯著低於因FIR濾波的延遲,但此點並非於全部情況下皆屬必要,原因在於其中QMF乃絕對地必要之情況下,可避免FIR濾波的延遲及QMF的延遲累加。因此當針對低音後濾波因時頻變換加權的延遲甚至高於FIR濾波的延遲時,本發明也有用。
第1b圖例示說明USAC解碼器或AMR-WB+解碼器之脈絡的本發明之較佳實施例。第1b圖例示說明之設備包含ACELP解碼器階段120、TCX解碼器階段122及連結點124,於該處連結解碼器120、122之輸出。連結點124始於兩個個別分支。第一分支包含濾波器102,濾波器102較佳地係經組配成藉音準滯後T設定的長期預測濾波器,接著為適應性增益α之放大器129。此外,第一分支包含時間頻譜變換器106a,其較佳係體現為QMF分析濾波器組。又復,第一分支包含加權器108,其係經組配來加權由QMF分析濾波器組106a所產生的子帶信號。
於第二分支中,已解碼音訊信號係藉QMF分析濾波器組106b而變換成頻譜域。
雖然個別QMF方塊106a、106b係例示說明為兩個分開元件,但須注意用於分析已濾波音訊信號及音訊信號,並非必要要求有兩個個別的QMF分析濾波器組。取而代之,當信號係逐一地變換時,單一QMF分析濾波器組及記憶體即足。但用於極低延遲體現,較佳係針對各個信號使用個
別QMF分析濾波器組,讓單一QMF方塊不會形成演算法的瓶頸。
較佳地,變換成頻譜域及變換回時域係藉演算法執行,具有針對正向及反向變換之延遲係小於具有頻率選擇性特性的時域中濾波的延遲。因此,變換須具有總延遲係小於關注的濾波器之延遲。特別有用者為低解析度變換,諸如以QMF為基礎的變換,原因在於低頻率解析度結果導致需要小型變換窗,亦即導致縮小的系統性延遲。較佳應用用途只要求低解析度變換分解該信號成少於40個子帶,諸如32或只有16個子帶。但即便於時頻變換及加權導入比低通濾波器更高的延遲的應用中,由於下述事實而獲得優點,免除了其它處理程序所必然需要的低通濾波器與時間頻譜變換的延遲累加。
但針對由於其它處理操作諸如重新取樣、SBR或MPS而無論如何皆要求時頻變換的應用,與由時頻變換或頻時變換所遭致的延遲無關地,獲得延遲減少,原因在於將濾波器體現「含括」入頻譜域,可完全節省時域濾波器延遲,由於下述事實:執行逐一子帶加權而無任何系統性延遲。
適應性放大器129係藉控制器130控制。控制器130係經組配來當輸入信號為TCX解碼信號時,設定放大器129之增益α為零。典型地,於切換音訊編解碼器諸如USAC或AMR-WB+中,於連結點124的已解碼信號典型地係來自TCX解碼器122或來自ACELP解碼器120。因此有兩個解碼器120、122的已解碼輸出信號之時間多工。控制器130係經
組配來針對目前時間瞬間,決定該輸出信號係來自TCX解碼信號或ACELP解碼信號。當決定有TCX信號時,適應性增益α係設定為零,使得由元件102、109、106a、108所組成的第一分支不具任何意義。此點係由於下述事實,用在AMR-WB+或USAC之特定種類的濾波只要求用在ACELP解碼信號。但當執行諧波濾波或音準加強以外的其它後濾波體現時,則取決於需求,可差異地設定可變增益α。
但當控制器130決定目前可用信號乃ACELP解碼信號時,放大器129之值係設定為α之正確值,典型地為0至0.5。於此種情況下,第一分支為有意義,減法器112之輸出信號實質上係與在連結點124的原先已解碼音訊信號有別。
用在解碼器120及放大器128的音準資訊(音準滯後及增益α)可來自該解碼器及/或專用音準追蹤器。較佳地,資訊係來自該解碼器,及然後透過專用音準追蹤器/該已解碼信號之長期預測分析而重新處理(精製)。
藉減法器112執行每帶或每子帶減法所產生的結果音訊信號並不立刻執行返回時域。取而代之,該信號係前傳至SBR解碼器模組128。模組128係連結至單聲-立體聲或單聲道-多聲道解碼器,諸如MPS解碼器131,於該處MPS表示MPEG環繞。
典型地,頻帶數目係藉頻譜帶寬複製解碼器提升,係藉在方塊128輸出的額外三行132指示。
又復,輸出數目係藉方塊131額外提升。方塊131從在方塊129輸出的單聲道信號產生例如五聲道信號或任何其
它有二或更多聲道的信號。例示說明具有左聲道L、右聲道R、中聲道C、左環繞聲道Ls
及右環繞聲道Rs
的五聲道景況。因此針對各個個別聲道存在有頻譜時間變換器114,換言之,於第1b圖中存在有五倍,來將各個個別聲道信號從頻譜域,於第1b圖實例中為QMF域,變換回於方塊114輸出的時域。再度,並非必要為多個個別頻譜時間變換器。也可有單一頻譜時間變換器,其逐一地處理變換。但當要求極低延遲體現時,較佳係針對各個頻道使用個別頻譜時間變換器。
本發明之優點在於藉低音後濾波器所導入的延遲及更明確言之,由低通濾波器FIR濾波器所導入的延遲減少。因此任一種頻率選擇性濾波就QMF所要求的延遲,或概略言之,就時/頻變換而言不會導入額外延遲。
當無論如何要求QMF或一般而言要求時-頻變換時,本發明特別優異,例如於第1b圖之情況,於該處無論如何SBR功能及MPS功能係在頻譜域執行。於該處要求QMF之替代體現為當以已解碼信號執行重新取樣時的景況,及當為了重新取樣目的而要求具有不同濾波器組聲道數目的QMF分析濾波器組及QMF合成濾波器組時的景況。
此外,由於二信號亦即TCX及ACELP信號現在具有相同延遲,故ACELP與TCX間維持恆定訊框。
帶寬延展解碼器129之功能係以細節描述於ISO/IEC CD 23003-3章節6.5。多聲道解碼器131之功能係以細節描述於ISO/IEC CD 23003-3章節6.11。TCX解碼器及ACELP解碼
器背後的功能係以細節描述於ISO/IEC CD 23003-3區塊6.12至6.17。
隨後,討論第2a至2c圖來例示說明示意實例。第2a圖例示說明示意低通濾波器之經頻率選擇的頻率響應。
第2b圖例示說明針對第2a圖所指子帶數目或子帶的加權指數。於第2a圖之示意情況下,子帶1至6具有等於1之加權係數,亦即無加權,而子帶7至10具有遞減的加權係數,及子帶11至11具有零之加權係數。
時間頻譜變換器諸如106a及隨後連接器加權器108之串級的相對應體現係例示說明於第2c圖。各個子帶1、2、...、14係輸入以W1
、W2
、...W14
指示的個別加權方塊內。
加權器108藉該子帶信號之各次取樣乘以加權係數而施加第2b圖之該表的加權因數至各個個別子帶信號。然後,於加權器的輸出端,存在有已加權子帶信號,然後輸入第1a圖之減法器112,減法器112額外地執行於頻譜域的減法。
第3圖例示說明該AMR-WB+編碼器於第8圖之低通濾波器的脈衝響應及頻率響應。於時域的低通濾波器hLP
(n)係於AMR-WB+藉下列係數定義。
a[13]=[0.088250,0.086410,0.081074,0.072768,0.062294,0.050623,0.038774,0.027692,0.018130,0.010578,0.005221,0.001946,0.000385];hLP
(n)=a(13-n)針對n為1至12 hLP
(n)=a(n-12)針對n為13至25第3圖例示說明的脈衝響應及頻率響應係針對一種情
況,當濾波器係施加至12.8 kHz的時域信號樣本時。則所產生的延遲為12樣本延遲,亦即0.9375毫秒。
第3圖例示說明之濾波器具有於QMF域的頻率響應,於該處各個QMF具有400赫茲解析度。32 QMF頻帶涵蓋於12.8 kHz之信號樣本的帶寬。頻率響應及QMF域係例示說明於第4圖。
具有400赫茲解析度之幅值頻率響應形成當施加低通濾波器於QMF域時的權值。加權器108之權值係用於第5圖摘述之前述參數實例。
此等權值可計算如下:W=abs(DFT(hLP
(n),64)),於該處DFT(x,N)代表信號x之長度N的離散富利葉變換。若x係比N更短,則信號係以N減x個零的大小填塞。DFT之長度N係相對應於兩倍QMF子帶數目。因hLP
(n)乃實際係數信號,W顯示頻率0與尼奎斯特(Nysquist)頻率間的厄爾米辛(Hermitian)對稱及N/2頻率係數。
藉由分析濾波器係數的頻率響應,其係相對應於約2*pi*10/256之截止頻率。此點用來設計濾波器。為了節省若干ROM的耗用及有鑑於定點體現,然後該等係數經量化來以14位元寫成。
然後於QMF域的濾波執行如下:Y=於QMF域之後處理信號
X=於來自核心編碼器的QMF信號中之已解碼信號
E=於TD產生的欲從X移除的諧波間雜訊
Y(k)=X(k)-W(k).E(k),針對k為1至32
第6圖例示說明又一實例,於該處QMF具有800赫茲解析度,故16頻帶涵蓋於12.8 kHz取樣的信號之全帶寬。然後係數W如第6圖指示於線圖下方。濾波係以就第6圖討論之相同方式進行,但k只有1至16。
於16頻帶QMF中的該濾波器之頻率響應係作圖為如第6圖之例示說明。
第10圖例示說明於第1b圖顯示於102的長期預測濾波器之更進一步加強。
更明確言之,針對低延遲體現,第9圖中第三行至末行的該項(n
+T
)有問題。原因在於相對於真實時間n,T樣本係在未來。因此為了解決此種情況,於該處因低延遲體現,尚未能獲得未來數值,故(n
+T
)係以置換,如第10圖指示。然後,長期預測濾波器估算先前技術之長期預測,但使用較少延遲或零延遲。業已發現估算為夠好,相對於減少延遲的增益係比音準加強的些微損耗更優異。
雖然已經以設備脈絡描述若干構面,但顯然此等構面也表示相對應方法的描述,於該處一方塊或一裝置係相對應於一方法步驟或一方法步驟之特徵。同理,以方法步驟之脈絡描述的構面也表示相對應設備之相對應方塊或項或特徵結構之描述。
取決於某些體現要求,本發明之實施例可於硬體或於軟體體現。體現可使用數位儲存媒體執行,例如軟碟、DVD、CD、ROM、PROM、EPROM、EEPROM或快閃記憶
體,具有可電子讀取控制信號儲存於其上,該等信號與(或可與)可程式規劃電腦系統協作,因而執行個別方法。
依據本發明之若干實施例包含具有可電子式讀取控制信號的非過渡資料載體,該等控制信號可與可程式規劃電腦系統協作,因而執行此處所述方法中之一者。
大致言之,本發明之實施例可體現為具有程式代碼的電腦程式產品,該程式代碼係當電腦程式產品在電腦上跑時可執行該等方法中之一者。該程式代碼例如可儲存在機器可讀取載體上。
其它實施例包含儲存在機器可讀取載體上的用以執行此處所述方法中之一者的電腦程式。
換言之,因此,本發明方法之實施例為一種具有一程式代碼之電腦程式,該程式代碼係當該電腦程式於一電腦上跑時用以執行此處所述方法中之一者。
因此,本發明方法之又一實施例為資料載體(或數位儲存媒體或電腦可讀取媒體)包含用以執行此處所述方法中之一者的電腦程式記錄於其上。
因此,本發明方法之又一實施例為表示用以執行此處所述方法中之一者的電腦程式的資料串流或信號序列。資料串流或信號序列例如可經組配來透過資料通訊連結,例如透過網際網路轉移。
又一實施例包含處理構件例如電腦或可程式規劃邏輯裝置,其係經組配來或適用於執行此處所述方法中之一者。
又一實施例包含一電腦,其上安裝有用以執行此處所
述方法中之一者的電腦程式。
於若干實施例中,可程式規劃邏輯裝置(例如可現場程式規劃閘陣列)可用來執行此處描述之方法的部分或全部功能。於若干實施例中,可現場程式規劃閘陣列可與微處理器協作來執行此處所述方法中之一者。大致上該等方法較佳係藉任何硬體裝置執行。
前述實施例係僅供舉例說明本發明之原理。須瞭解此處所述配置及細節之修改及變化將為熟諳技藝人士顯然易知。因此,意圖僅受審查中之專利申請範圍所限而非受藉以描述及解說此處實施例所呈示之特定細節所限。
100‧‧‧線上已解碼音訊信號
102‧‧‧濾波器
104‧‧‧線上已濾波音訊信號
106‧‧‧時間頻譜變換器階段
106a-b‧‧‧時間頻譜變換器、方塊、QMF分析濾波器組
108‧‧‧加權器
110‧‧‧線上已加權已濾波音訊信號
112‧‧‧減法器
114‧‧‧頻譜時間變換器
116‧‧‧線上已處理已解碼音訊信號
120‧‧‧ACELP解碼器階段
122‧‧‧TCX解碼器階段
124‧‧‧連結點
128‧‧‧SBR解碼器模組、方塊
129‧‧‧放大器、方塊、帶寬延展解碼器
130‧‧‧控制器
131‧‧‧MPS解碼器、方塊、多聲道解碼器
132‧‧‧行
700‧‧‧音準加強器
702‧‧‧低通濾波器
704‧‧‧高通濾波器
706‧‧‧音準追蹤階段
708‧‧‧加法器
第1a圖為依據一實施例用以處理已解碼音訊信號之設備之方塊圖;第1b圖為用以處理已解碼音訊信號之設備之一較佳實施例之方塊圖;第2a圖顯示頻率選擇特性作為低通特性;第2b圖顯示加權係數及相聯結的子帶;第2c圖顯示時/頻變換器及隨後連結的用以施加加權係數至各個個別子帶信號之加權器之串級;第3圖顯示於第8圖例示說明之AMR-WB+中低通濾波器之頻率響應中的脈衝響應;第4圖顯示脈衝響應及頻率響應變換成QMF域;第5圖顯示用於32 QMF子帶實例之加權器的加權因數;
第6圖顯示針對16 QMF頻帶之頻率響應及相聯結的16加權因數;第7圖顯示AMR-WB+之低頻音準加強器之方塊圖;第8圖顯示AMR-WB+之體現後處理組態;第9圖顯示第8圖之體現之推衍;及第10圖顯示依據一實施例之長期預測濾波器之低延遲體現。
100‧‧‧音訊信號
102‧‧‧濾波器
104‧‧‧線上已濾波音訊信號
106、106a-b‧‧‧時間頻譜變換器
108‧‧‧加權器
110‧‧‧線上已加權已濾波音訊信號
112‧‧‧減法器
114‧‧‧頻譜時間變換器
116‧‧‧已處理之已解碼音訊信號
Claims (16)
- 一種用以處理已解碼音訊信號之設備,該設備係包含:用以濾波該已解碼音訊信號來獲得一已濾波音訊信號之一濾波器;用以將該已解碼音訊信號及該已濾波音訊信號變換成相對應頻譜表示型態之一時間頻譜變換器階段,各個頻譜表示型態具有多個子頻帶信號;用以執行該已濾波音訊信號之該頻譜表示型態之頻率選擇性加權之一加權器,該加權係藉將子頻帶信號乘以個別加權係數來獲得一已加權已濾波音訊信號;用以執行該已加權已濾波音訊信號與該已解碼音訊信號之該頻譜表示型態間之一逐一子頻帶減法以獲得一結果音訊信號之一減法器;及用以將該結果音訊信號或從該結果音訊信號推衍得的一信號變換成一時域表示型態來獲得一已處理已解碼音訊信號之一頻譜時間變換器。
- 如請求項1之設備,其係進一步包含一帶寬增強解碼器或一單聲-立體聲解碼器或一單聲道-多聲道解碼器來計算從該結果音訊信號推衍得的該信號,其中該頻譜時間變換器係組配來非將該結果音訊信號而是將從該結果音訊信號推衍得的該信號變換到時域,使得藉該帶寬增強解碼器或該單聲-立體聲或單聲道-多聲道解碼器進行的全部處理係於由該時間頻譜變換器階段所定義的相同頻譜域中執行。
- 如請求項1之設備,其中該已解碼音訊信號係為一代數碼簿激勵線性預測(ACELP)已解碼輸出信號,及其中該濾波器係為藉音準資訊控制的一長期預測濾波器。
- 如請求項1之設備,其中該加權器係組配來加權該已濾波音訊信號,使得相較於較高頻子頻帶,較低頻子頻帶係較少衰減或不衰減,致使該頻率選擇性加權將一低通特性加諸給該已濾波音訊信號。
- 如請求項1之設備,其中該時間頻譜變換器階段及該頻譜時間變換器係組配來分別地體現一正交鏡像濾波器組(QMF)分析濾波器組及一QMF合成濾波器組。
- 如請求項1之設備,其中該減法器係組配來從音訊信號之相對應子頻帶信號中扣除該已加權已濾波音訊信號之一子頻帶信號,來獲得該結果音訊信號之一子頻帶,該等子頻帶屬於相同濾波器組聲道。
- 如請求項1之設備,其中該濾波器係組配來執行該音訊信號與時間上位移一音準週期之至少該音訊信號之一加權組合。
- 如請求項7之設備,其中該濾波器係組配來藉由只組合該音訊信號與 存在於較早時間瞬間之該音訊信號而執行該加權組合。
- 如請求項1之設備,其中該頻譜時間變換器相對於該時間頻譜變換器階段具有一不同數目的輸入聲道,致使獲得一樣品率變換,其中當輸入該頻譜時間變換器之輸入聲道數目係高於該時間頻譜變換器階段之輸出聲道數目時獲得一升頻取樣;及其中當輸入該頻譜時間變換器之該輸入聲道數目係小於該時間頻譜變換器階段之輸出聲道數目時獲得一降頻取樣。
- 如請求項1之設備,進一步包含:用以於一第一時間部分提供該已解碼音訊信號之一第一解碼器;用以於一不同的第二時間部分提供又一已解碼音訊信號之一第二解碼器;連結至該第一解碼器及該第二解碼器之一第一處理分支;連結至該第一解碼器及該第二解碼器之一第二處理分支;其中該第二處理分支包含該濾波器及該加權器,及額外地,包含一可控制式增益階段及一控制器,其中該控制器係組配來設定該增益階段之一增益至針對該第一時間部分之一第一值及至針對該第二時間部分之一第二值或設定至零,該第二值係低於該第一值。
- 如請求項1之設備,其係進一步包含一音準追蹤器用以 提供一音準滯後及用以基於該音準滯後作為音準資訊而設定該濾波器。
- 如請求項10之設備,其中該第一解碼器係組配來提供該音準資訊或該音準資訊之一部分用以設定該濾波器。
- 如請求項10之設備,其中該第一處理分支之一輸出及該第二處理分支之一輸出係連結至該減法器之輸入。
- 如請求項1之設備,其中該已解碼音訊信號係由含括於該設備中之一ACELP解碼器提供,及其中該設備進一步包含體現為一變換編碼激勵(TCX)解碼器之又一解碼器。
- 一種處理已解碼音訊信號之方法,該方法係包含:濾波該已解碼音訊信號來獲得一已濾波音訊信號;將該已解碼音訊信號及該已濾波音訊信號變換成相對應頻譜表示型態,各個頻譜表示型態具有多個子頻帶信號;藉將子頻帶信號乘以個別加權係數來執行該已濾波音訊信號之頻率選擇性加權以獲得一已加權已濾波音訊信號;執行該已加權已濾波音訊信號與該已解碼音訊信號之該頻譜表示型態間之一逐一子頻帶減法以獲得一結果音訊信號;及將該結果音訊信號或從該結果音訊信號推衍得的一信號變換成一時域表示型態來獲得一已處理已解碼音訊信號。
- 一種具有程式代碼之電腦程式,當在一電腦上執行時,該程式代碼係用以執行如請求項15之處理已解碼音訊信號之方法。
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161442632P | 2011-02-14 | 2011-02-14 | |
| PCT/EP2012/052292 WO2012110415A1 (en) | 2011-02-14 | 2012-02-10 | Apparatus and method for processing a decoded audio signal in a spectral domain |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201237848A TW201237848A (en) | 2012-09-16 |
| TWI469136B true TWI469136B (zh) | 2015-01-11 |
Family
ID=71943604
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW101104349A TWI469136B (zh) | 2011-02-14 | 2012-02-10 | 在一頻譜域中用以處理已解碼音訊信號之裝置及方法 |
Country Status (18)
| Country | Link |
|---|---|
| US (1) | US9583110B2 (zh) |
| EP (1) | EP2676268B1 (zh) |
| JP (1) | JP5666021B2 (zh) |
| KR (1) | KR101699898B1 (zh) |
| CN (1) | CN103503061B (zh) |
| AR (1) | AR085362A1 (zh) |
| AU (1) | AU2012217269B2 (zh) |
| BR (1) | BR112013020482B1 (zh) |
| CA (1) | CA2827249C (zh) |
| ES (1) | ES2529025T3 (zh) |
| MX (1) | MX2013009344A (zh) |
| MY (1) | MY164797A (zh) |
| PL (1) | PL2676268T3 (zh) |
| RU (1) | RU2560788C2 (zh) |
| SG (1) | SG192746A1 (zh) |
| TW (1) | TWI469136B (zh) |
| WO (1) | WO2012110415A1 (zh) |
| ZA (1) | ZA201306838B (zh) |
Families Citing this family (35)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2908576C (en) | 2008-12-15 | 2018-11-27 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Audio encoder and bandwidth extension decoder |
| JP5712288B2 (ja) | 2011-02-14 | 2015-05-07 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | 重複変換を使用した情報信号表記 |
| CA2827249C (en) * | 2011-02-14 | 2016-08-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
| ES2623291T3 (es) | 2011-02-14 | 2017-07-10 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificación de una porción de una señal de audio utilizando una detección de transitorios y un resultado de calidad |
| PT3239978T (pt) | 2011-02-14 | 2019-04-02 | Fraunhofer Ges Forschung | Codificação e descodificação de posições de pulso de faixas de um sinal de áudio |
| WO2012110447A1 (en) | 2011-02-14 | 2012-08-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for error concealment in low-delay unified speech and audio coding (usac) |
| KR101617816B1 (ko) | 2011-02-14 | 2016-05-03 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 스펙트럼 도메인 잡음 형상화를 사용하는 선형 예측 기반 코딩 방식 |
| EP2720222A1 (en) | 2012-10-10 | 2014-04-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns |
| PL3584791T3 (pl) * | 2012-11-05 | 2024-03-18 | Panasonic Holdings Corporation | Urządzenie do kodowania mowy/dźwięku oraz sposób kodowania mowy/dźwięku |
| WO2014118157A1 (en) | 2013-01-29 | 2014-08-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing an encoded signal and encoder and method for generating an encoded signal |
| AR094845A1 (es) | 2013-02-20 | 2015-09-02 | Fraunhofer Ges Forschung | Aparato y método para codificar o decodificar una señal de audio utilizando una superposición dependiente de la ubicación de un transitorio |
| EP2981958B1 (en) | 2013-04-05 | 2018-03-07 | Dolby International AB | Audio encoder and decoder |
| CN110223702B (zh) * | 2013-05-24 | 2023-04-11 | 杜比国际公司 | 音频解码系统和重构方法 |
| PT3011554T (pt) | 2013-06-21 | 2019-10-24 | Fraunhofer Ges Forschung | Estimação de atraso de tom. |
| EP3582220B1 (en) * | 2013-09-12 | 2021-10-20 | Dolby International AB | Time-alignment of qmf based processing data |
| KR102244613B1 (ko) | 2013-10-28 | 2021-04-26 | 삼성전자주식회사 | Qmf 필터링 방법 및 이를 수행하는 장치 |
| EP2887350B1 (en) | 2013-12-19 | 2016-10-05 | Dolby Laboratories Licensing Corporation | Adaptive quantization noise filtering of decoded audio data |
| EP2980797A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder, method and computer program using a zero-input-response to obtain a smooth transition |
| JP6035270B2 (ja) * | 2014-03-24 | 2016-11-30 | 株式会社Nttドコモ | 音声復号装置、音声符号化装置、音声復号方法、音声符号化方法、音声復号プログラム、および音声符号化プログラム |
| BR112016019838B1 (pt) | 2014-03-31 | 2023-02-23 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Codificador de áudio, decodificador de áudio, método de codificação, método de decodificação e mídia de registro legível por computador não transitória |
| EP2980799A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing an audio signal using a harmonic post-filter |
| EP2980793A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoder, decoder, system and methods for encoding and decoding |
| TWI602172B (zh) | 2014-08-27 | 2017-10-11 | 弗勞恩霍夫爾協會 | 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法 |
| TWI758146B (zh) | 2015-03-13 | 2022-03-11 | 瑞典商杜比國際公司 | 解碼具有增強頻譜帶複製元資料在至少一填充元素中的音訊位元流 |
| EP3079151A1 (en) * | 2015-04-09 | 2016-10-12 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and method for encoding an audio signal |
| CN106157966B (zh) * | 2015-04-15 | 2019-08-13 | 宏碁股份有限公司 | 语音信号处理装置及语音信号处理方法 |
| CN106297814B (zh) * | 2015-06-02 | 2019-08-06 | 宏碁股份有限公司 | 语音信号处理装置及语音信号处理方法 |
| US9613628B2 (en) | 2015-07-01 | 2017-04-04 | Gopro, Inc. | Audio decoder for wind and microphone noise reduction in a microphone array system |
| MY181992A (en) * | 2016-01-22 | 2021-01-18 | Fraunhofer Ges Forschung | Apparatus and method for encoding or decoding a multi-channel signal using spectral-domain resampling |
| US10638227B2 (en) | 2016-12-02 | 2020-04-28 | Dirac Research Ab | Processing of an audio input signal |
| EP3382704A1 (en) | 2017-03-31 | 2018-10-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for determining a predetermined characteristic related to a spectral enhancement processing of an audio signal |
| CN111630594B (zh) * | 2017-12-01 | 2023-08-01 | 日本电信电话株式会社 | 基音增强装置、其方法以及记录介质 |
| EP3671741A1 (en) * | 2018-12-21 | 2020-06-24 | FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. | Audio processor and method for generating a frequency-enhanced audio signal using pulse processing |
| KR102511377B1 (ko) | 2020-03-20 | 2023-03-17 | 돌비 인터네셔널 에이비 | 라우드스피커들을 위한 저음 향상 |
| CN114280571B (zh) * | 2022-03-04 | 2022-07-19 | 北京海兰信数据科技股份有限公司 | 一种雨杂波信号的处理方法、装置及设备 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050131696A1 (en) * | 2001-06-29 | 2005-06-16 | Microsoft Corporation | Frequency domain postfiltering for quality enhancement of coded speech |
| WO2006126844A2 (en) * | 2005-05-26 | 2006-11-30 | Lg Electronics Inc. | Method and apparatus for decoding an audio signal |
| US7519538B2 (en) * | 2003-10-30 | 2009-04-14 | Koninklijke Philips Electronics N.V. | Audio signal encoding or decoding |
| TW201009810A (en) * | 2008-07-11 | 2010-03-01 | Fraunhofer Ges Forschung | Time warp contour calculator, audio signal encoder, encoded audio signal representation, methods and computer program |
| TW201032218A (en) * | 2009-01-28 | 2010-09-01 | Fraunhofer Ges Forschung | Audio encoder, audio decoder, encoded audio information, methods for encoding and decoding an audio signal and computer program |
Family Cites Families (222)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10007A (en) * | 1853-09-13 | Gear op variable cut-ofp valves for steau-ehgietes | ||
| ES2240252T3 (es) | 1991-06-11 | 2005-10-16 | Qualcomm Incorporated | Vocodificador de velocidad variable. |
| US5408580A (en) | 1992-09-21 | 1995-04-18 | Aware, Inc. | Audio compression system employing multi-rate signal analysis |
| SE501340C2 (sv) | 1993-06-11 | 1995-01-23 | Ericsson Telefon Ab L M | Döljande av transmissionsfel i en talavkodare |
| BE1007617A3 (nl) | 1993-10-11 | 1995-08-22 | Philips Electronics Nv | Transmissiesysteem met gebruik van verschillende codeerprincipes. |
| US5657422A (en) | 1994-01-28 | 1997-08-12 | Lucent Technologies Inc. | Voice activity detection driven noise remediator |
| US5784532A (en) | 1994-02-16 | 1998-07-21 | Qualcomm Incorporated | Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system |
| US5684920A (en) | 1994-03-17 | 1997-11-04 | Nippon Telegraph And Telephone | Acoustic signal transform coding method and decoding method having a high efficiency envelope flattening method therein |
| US5568588A (en) | 1994-04-29 | 1996-10-22 | Audiocodes Ltd. | Multi-pulse analysis speech processing System and method |
| CN1090409C (zh) | 1994-10-06 | 2002-09-04 | 皇家菲利浦电子有限公司 | 采用不同编码原理的传送系统 |
| US5537510A (en) | 1994-12-30 | 1996-07-16 | Daewoo Electronics Co., Ltd. | Adaptive digital audio encoding apparatus and a bit allocation method thereof |
| SE506379C3 (sv) | 1995-03-22 | 1998-01-19 | Ericsson Telefon Ab L M | Lpc-talkodare med kombinerad excitation |
| US5727119A (en) | 1995-03-27 | 1998-03-10 | Dolby Laboratories Licensing Corporation | Method and apparatus for efficient implementation of single-sideband filter banks providing accurate measures of spectral magnitude and phase |
| JP3317470B2 (ja) | 1995-03-28 | 2002-08-26 | 日本電信電話株式会社 | 音響信号符号化方法、音響信号復号化方法 |
| US5659622A (en) | 1995-11-13 | 1997-08-19 | Motorola, Inc. | Method and apparatus for suppressing noise in a communication system |
| US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
| US5890106A (en) | 1996-03-19 | 1999-03-30 | Dolby Laboratories Licensing Corporation | Analysis-/synthesis-filtering system with efficient oddly-stacked singleband filter bank using time-domain aliasing cancellation |
| US5848391A (en) | 1996-07-11 | 1998-12-08 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Method subband of coding and decoding audio signals using variable length windows |
| JP3259759B2 (ja) | 1996-07-22 | 2002-02-25 | 日本電気株式会社 | 音声信号伝送方法及び音声符号復号化システム |
| JPH10124092A (ja) | 1996-10-23 | 1998-05-15 | Sony Corp | 音声符号化方法及び装置、並びに可聴信号符号化方法及び装置 |
| US5960389A (en) | 1996-11-15 | 1999-09-28 | Nokia Mobile Phones Limited | Methods for generating comfort noise during discontinuous transmission |
| JPH10214100A (ja) | 1997-01-31 | 1998-08-11 | Sony Corp | 音声合成方法 |
| US6134518A (en) | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
| SE512719C2 (sv) | 1997-06-10 | 2000-05-02 | Lars Gustaf Liljeryd | En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion |
| JP3223966B2 (ja) | 1997-07-25 | 2001-10-29 | 日本電気株式会社 | 音声符号化/復号化装置 |
| US6070137A (en) | 1998-01-07 | 2000-05-30 | Ericsson Inc. | Integrated frequency-domain voice coding using an adaptive spectral enhancement filter |
| ES2247741T3 (es) | 1998-01-22 | 2006-03-01 | Deutsche Telekom Ag | Metodo para conmutacion controlada por señales entre esquemas de codificacion de audio. |
| GB9811019D0 (en) * | 1998-05-21 | 1998-07-22 | Univ Surrey | Speech coders |
| US6173257B1 (en) | 1998-08-24 | 2001-01-09 | Conexant Systems, Inc | Completed fixed codebook for speech encoder |
| US6439967B2 (en) | 1998-09-01 | 2002-08-27 | Micron Technology, Inc. | Microelectronic substrate assembly planarizing machines and methods of mechanical and chemical-mechanical planarization of microelectronic substrate assemblies |
| SE521225C2 (sv) | 1998-09-16 | 2003-10-14 | Ericsson Telefon Ab L M | Förfarande och anordning för CELP-kodning/avkodning |
| US7272556B1 (en) | 1998-09-23 | 2007-09-18 | Lucent Technologies Inc. | Scalable and embedded codec for speech and audio signals |
| US6317117B1 (en) | 1998-09-23 | 2001-11-13 | Eugene Goff | User interface for the control of an audio spectrum filter processor |
| US7124079B1 (en) | 1998-11-23 | 2006-10-17 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech coding with comfort noise variability feature for increased fidelity |
| FI114833B (fi) | 1999-01-08 | 2004-12-31 | Nokia Corp | Menetelmä, puhekooderi ja matkaviestin puheenkoodauskehysten muodostamiseksi |
| DE19921122C1 (de) | 1999-05-07 | 2001-01-25 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Verschleiern eines Fehlers in einem codierten Audiosignal und Verfahren und Vorrichtung zum Decodieren eines codierten Audiosignals |
| WO2000075919A1 (en) | 1999-06-07 | 2000-12-14 | Ericsson, Inc. | Methods and apparatus for generating comfort noise using parametric noise model statistics |
| JP4464484B2 (ja) | 1999-06-15 | 2010-05-19 | パナソニック株式会社 | 雑音信号符号化装置および音声信号符号化装置 |
| US6236960B1 (en) | 1999-08-06 | 2001-05-22 | Motorola, Inc. | Factorial packing method and apparatus for information coding |
| US6636829B1 (en) | 1999-09-22 | 2003-10-21 | Mindspeed Technologies, Inc. | Speech communication system and method for handling lost frames |
| DE60031002T2 (de) | 2000-02-29 | 2007-05-10 | Qualcomm, Inc., San Diego | Multimodaler mischbereich-sprachkodierer mit geschlossener regelschleife |
| US6757654B1 (en) | 2000-05-11 | 2004-06-29 | Telefonaktiebolaget Lm Ericsson | Forward error correction in speech coding |
| JP2002118517A (ja) | 2000-07-31 | 2002-04-19 | Sony Corp | 直交変換装置及び方法、逆直交変換装置及び方法、変換符号化装置及び方法、並びに復号装置及び方法 |
| FR2813722B1 (fr) | 2000-09-05 | 2003-01-24 | France Telecom | Procede et dispositif de dissimulation d'erreurs et systeme de transmission comportant un tel dispositif |
| US6847929B2 (en) | 2000-10-12 | 2005-01-25 | Texas Instruments Incorporated | Algebraic codebook system and method |
| US6636830B1 (en) | 2000-11-22 | 2003-10-21 | Vialta Inc. | System and method for noise reduction using bi-orthogonal modified discrete cosine transform |
| CA2327041A1 (en) | 2000-11-22 | 2002-05-22 | Voiceage Corporation | A method for indexing pulse positions and signs in algebraic codebooks for efficient coding of wideband signals |
| US20040142496A1 (en) | 2001-04-23 | 2004-07-22 | Nicholson Jeremy Kirk | Methods for analysis of spectral data and their applications: atherosclerosis/coronary heart disease |
| US7136418B2 (en) | 2001-05-03 | 2006-11-14 | University Of Washington | Scalable and perceptually ranked signal coding and decoding |
| US7206739B2 (en) | 2001-05-23 | 2007-04-17 | Samsung Electronics Co., Ltd. | Excitation codebook search method in a speech coding system |
| US20020184009A1 (en) | 2001-05-31 | 2002-12-05 | Heikkinen Ari P. | Method and apparatus for improved voicing determination in speech signals containing high levels of jitter |
| US20030120484A1 (en) | 2001-06-12 | 2003-06-26 | David Wong | Method and system for generating colored comfort noise in the absence of silence insertion description packets |
| DE10129240A1 (de) | 2001-06-18 | 2003-01-02 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Verarbeiten von zeitdiskreten Audio-Abtastwerten |
| US6879955B2 (en) | 2001-06-29 | 2005-04-12 | Microsoft Corporation | Signal modification based on continuous time warping for low bit rate CELP coding |
| DE10140507A1 (de) | 2001-08-17 | 2003-02-27 | Philips Corp Intellectual Pty | Verfahren für die algebraische Codebook-Suche eines Sprachsignalkodierers |
| US7711563B2 (en) | 2001-08-17 | 2010-05-04 | Broadcom Corporation | Method and system for frame erasure concealment for predictive speech coding based on extrapolation of speech waveform |
| KR100438175B1 (ko) | 2001-10-23 | 2004-07-01 | 엘지전자 주식회사 | 코드북 검색방법 |
| CA2365203A1 (en) | 2001-12-14 | 2003-06-14 | Voiceage Corporation | A signal modification method for efficient coding of speech signals |
| US6934677B2 (en) | 2001-12-14 | 2005-08-23 | Microsoft Corporation | Quantization matrices based on critical band pattern information for digital audio wherein quantization bands differ from critical bands |
| US7240001B2 (en) | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
| DE10200653B4 (de) | 2002-01-10 | 2004-05-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Skalierbarer Codierer, Verfahren zum Codieren, Decodierer und Verfahren zum Decodieren für einen skalierten Datenstrom |
| CA2388439A1 (en) | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for efficient frame erasure concealment in linear predictive based speech codecs |
| CA2388358A1 (en) | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for multi-rate lattice vector quantization |
| CA2388352A1 (en) * | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for frequency-selective pitch enhancement of synthesized speed |
| US7302387B2 (en) | 2002-06-04 | 2007-11-27 | Texas Instruments Incorporated | Modification of fixed codebook search in G.729 Annex E audio coding |
| US20040010329A1 (en) | 2002-07-09 | 2004-01-15 | Silicon Integrated Systems Corp. | Method for reducing buffer requirements in a digital audio decoder |
| DE10236694A1 (de) | 2002-08-09 | 2004-02-26 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum skalierbaren Codieren und Vorrichtung und Verfahren zum skalierbaren Decodieren |
| US7299190B2 (en) | 2002-09-04 | 2007-11-20 | Microsoft Corporation | Quantization and inverse quantization for audio |
| US7502743B2 (en) | 2002-09-04 | 2009-03-10 | Microsoft Corporation | Multi-channel audio encoding and decoding with multi-channel transform selection |
| AU2003260958A1 (en) | 2002-09-19 | 2004-04-08 | Matsushita Electric Industrial Co., Ltd. | Audio decoding apparatus and method |
| BR0315179A (pt) | 2002-10-11 | 2005-08-23 | Nokia Corp | Método e dispositivo para codificar um sinal de fala amostrado compreendendo quadros de fala |
| US7343283B2 (en) | 2002-10-23 | 2008-03-11 | Motorola, Inc. | Method and apparatus for coding a noise-suppressed audio signal |
| US7363218B2 (en) | 2002-10-25 | 2008-04-22 | Dilithium Networks Pty. Ltd. | Method and apparatus for fast CELP parameter mapping |
| KR100463559B1 (ko) | 2002-11-11 | 2004-12-29 | 한국전자통신연구원 | 대수 코드북을 이용하는 켈프 보코더의 코드북 검색방법 |
| KR100463419B1 (ko) | 2002-11-11 | 2004-12-23 | 한국전자통신연구원 | 적은 복잡도를 가진 고정 코드북 검색방법 및 장치 |
| KR100465316B1 (ko) | 2002-11-18 | 2005-01-13 | 한국전자통신연구원 | 음성 부호화기 및 이를 이용한 음성 부호화 방법 |
| KR20040058855A (ko) | 2002-12-27 | 2004-07-05 | 엘지전자 주식회사 | 음성 변조 장치 및 방법 |
| AU2003208517A1 (en) | 2003-03-11 | 2004-09-30 | Nokia Corporation | Switching between coding schemes |
| US7249014B2 (en) | 2003-03-13 | 2007-07-24 | Intel Corporation | Apparatus, methods and articles incorporating a fast algebraic codebook search technique |
| US20050021338A1 (en) | 2003-03-17 | 2005-01-27 | Dan Graboi | Recognition device and system |
| KR100556831B1 (ko) | 2003-03-25 | 2006-03-10 | 한국전자통신연구원 | 전역 펄스 교체를 통한 고정 코드북 검색 방법 |
| WO2004090870A1 (ja) | 2003-04-04 | 2004-10-21 | Kabushiki Kaisha Toshiba | 広帯域音声を符号化または復号化するための方法及び装置 |
| US7318035B2 (en) | 2003-05-08 | 2008-01-08 | Dolby Laboratories Licensing Corporation | Audio coding systems and methods using spectral component coupling and spectral component regeneration |
| DE10321983A1 (de) | 2003-05-15 | 2004-12-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Einbetten einer binären Nutzinformation in ein Trägersignal |
| JP4719674B2 (ja) | 2003-06-30 | 2011-07-06 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | ノイズの加算によるデコードオーディオの品質の向上 |
| DE10331803A1 (de) | 2003-07-14 | 2005-02-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Umsetzen in eine transformierte Darstellung oder zum inversen Umsetzen der transformierten Darstellung |
| US7565286B2 (en) | 2003-07-17 | 2009-07-21 | Her Majesty The Queen In Right Of Canada, As Represented By The Minister Of Industry, Through The Communications Research Centre Canada | Method for recovery of lost speech data |
| DE10345996A1 (de) | 2003-10-02 | 2005-04-28 | Fraunhofer Ges Forschung | Vorrichtung und Verfahren zum Verarbeiten von wenigstens zwei Eingangswerten |
| DE10345995B4 (de) | 2003-10-02 | 2005-07-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Verarbeiten eines Signals mit einer Sequenz von diskreten Werten |
| US7418396B2 (en) | 2003-10-14 | 2008-08-26 | Broadcom Corporation | Reduced memory implementation technique of filterbank and block switching for real-time audio applications |
| US20050091044A1 (en) | 2003-10-23 | 2005-04-28 | Nokia Corporation | Method and system for pitch contour quantization in audio coding |
| US20050091041A1 (en) | 2003-10-23 | 2005-04-28 | Nokia Corporation | Method and system for speech coding |
| US20080249765A1 (en) | 2004-01-28 | 2008-10-09 | Koninklijke Philips Electronic, N.V. | Audio Signal Decoding Using Complex-Valued Data |
| EP2770694A1 (en) | 2004-02-12 | 2014-08-27 | Core Wireless Licensing S.a.r.l. | Classified media quality of experience |
| DE102004007200B3 (de) | 2004-02-13 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audiocodierung |
| CA2457988A1 (en) | 2004-02-18 | 2005-08-18 | Voiceage Corporation | Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization |
| FI118835B (fi) | 2004-02-23 | 2008-03-31 | Nokia Corp | Koodausmallin valinta |
| FI118834B (fi) | 2004-02-23 | 2008-03-31 | Nokia Corp | Audiosignaalien luokittelu |
| US7809556B2 (en) | 2004-03-05 | 2010-10-05 | Panasonic Corporation | Error conceal device and error conceal method |
| WO2005096274A1 (en) | 2004-04-01 | 2005-10-13 | Beijing Media Works Co., Ltd | An enhanced audio encoding/decoding device and method |
| GB0408856D0 (en) | 2004-04-21 | 2004-05-26 | Nokia Corp | Signal encoding |
| DE602004025517D1 (de) | 2004-05-17 | 2010-03-25 | Nokia Corp | Audiocodierung mit verschiedenen codierungsrahmenlängen |
| JP4168976B2 (ja) | 2004-05-28 | 2008-10-22 | ソニー株式会社 | オーディオ信号符号化装置及び方法 |
| US7649988B2 (en) | 2004-06-15 | 2010-01-19 | Acoustic Technologies, Inc. | Comfort noise generator using modified Doblinger noise estimate |
| US8160274B2 (en) | 2006-02-07 | 2012-04-17 | Bongiovi Acoustics Llc. | System and method for digital signal processing |
| DE102004043521A1 (de) * | 2004-09-08 | 2006-03-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals oder eines Parameterdatensatzes |
| US7630902B2 (en) | 2004-09-17 | 2009-12-08 | Digital Rise Technology Co., Ltd. | Apparatus and methods for digital audio coding using codebook application ranges |
| KR100656788B1 (ko) | 2004-11-26 | 2006-12-12 | 한국전자통신연구원 | 비트율 신축성을 갖는 코드벡터 생성 방법 및 그를 이용한 광대역 보코더 |
| TWI253057B (en) | 2004-12-27 | 2006-04-11 | Quanta Comp Inc | Search system and method thereof for searching code-vector of speech signal in speech encoder |
| CN101120398B (zh) | 2005-01-31 | 2012-05-23 | 斯凯普有限公司 | 通信系统中用于帧连接的方法 |
| US7519535B2 (en) | 2005-01-31 | 2009-04-14 | Qualcomm Incorporated | Frame erasure concealment in voice communications |
| JP4519169B2 (ja) | 2005-02-02 | 2010-08-04 | 富士通株式会社 | 信号処理方法および信号処理装置 |
| US20070147518A1 (en) | 2005-02-18 | 2007-06-28 | Bruno Bessette | Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX |
| US8155965B2 (en) | 2005-03-11 | 2012-04-10 | Qualcomm Incorporated | Time warping frames inside the vocoder by modifying the residual |
| US8332228B2 (en) | 2005-04-01 | 2012-12-11 | Qualcomm Incorporated | Systems, methods, and apparatus for anti-sparseness filtering |
| US7707034B2 (en) * | 2005-05-31 | 2010-04-27 | Microsoft Corporation | Audio codec post-filter |
| RU2296377C2 (ru) | 2005-06-14 | 2007-03-27 | Михаил Николаевич Гусев | Способ анализа и синтеза речи |
| PL1897085T3 (pl) | 2005-06-18 | 2017-10-31 | Nokia Technologies Oy | System i sposób adaptacyjnej transmisji parametrów szumu łagodzącego w czasie nieciągłej transmisji mowy |
| FR2888699A1 (fr) * | 2005-07-13 | 2007-01-19 | France Telecom | Dispositif de codage/decodage hierachique |
| KR100851970B1 (ko) | 2005-07-15 | 2008-08-12 | 삼성전자주식회사 | 오디오 신호의 중요주파수 성분 추출방법 및 장치와 이를이용한 저비트율 오디오 신호 부호화/복호화 방법 및 장치 |
| US7610197B2 (en) | 2005-08-31 | 2009-10-27 | Motorola, Inc. | Method and apparatus for comfort noise generation in speech communication systems |
| RU2312405C2 (ru) | 2005-09-13 | 2007-12-10 | Михаил Николаевич Гусев | Способ осуществления машинной оценки качества звуковых сигналов |
| US20070174047A1 (en) | 2005-10-18 | 2007-07-26 | Anderson Kyle D | Method and apparatus for resynchronizing packetized audio streams |
| US7720677B2 (en) | 2005-11-03 | 2010-05-18 | Coding Technologies Ab | Time warped modified transform coding of audio signals |
| US7536299B2 (en) | 2005-12-19 | 2009-05-19 | Dolby Laboratories Licensing Corporation | Correlating and decorrelating transforms for multiple description coding systems |
| US8255207B2 (en) | 2005-12-28 | 2012-08-28 | Voiceage Corporation | Method and device for efficient frame erasure concealment in speech codecs |
| WO2007080211A1 (en) * | 2006-01-09 | 2007-07-19 | Nokia Corporation | Decoding of binaural audio signals |
| WO2007083934A1 (en) | 2006-01-18 | 2007-07-26 | Lg Electronics Inc. | Apparatus and method for encoding and decoding signal |
| CN101371296B (zh) | 2006-01-18 | 2012-08-29 | Lg电子株式会社 | 用于编码和解码信号的设备和方法 |
| US8032369B2 (en) | 2006-01-20 | 2011-10-04 | Qualcomm Incorporated | Arbitrary average data rates for variable rate coders |
| FR2897733A1 (fr) | 2006-02-20 | 2007-08-24 | France Telecom | Procede de discrimination et d'attenuation fiabilisees des echos d'un signal numerique dans un decodeur et dispositif correspondant |
| FR2897977A1 (fr) | 2006-02-28 | 2007-08-31 | France Telecom | Procede de limitation de gain d'excitation adaptative dans un decodeur audio |
| US20070253577A1 (en) | 2006-05-01 | 2007-11-01 | Himax Technologies Limited | Equalizer bank with interference reduction |
| EP1852848A1 (en) | 2006-05-05 | 2007-11-07 | Deutsche Thomson-Brandt GmbH | Method and apparatus for lossless encoding of a source signal using a lossy encoded data stream and a lossless extension data stream |
| US7873511B2 (en) | 2006-06-30 | 2011-01-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic |
| JP4810335B2 (ja) | 2006-07-06 | 2011-11-09 | 株式会社東芝 | 広帯域オーディオ信号符号化装置および広帯域オーディオ信号復号装置 |
| WO2008007700A1 (en) | 2006-07-12 | 2008-01-17 | Panasonic Corporation | Sound decoding device, sound encoding device, and lost frame compensation method |
| EP2040251B1 (en) | 2006-07-12 | 2019-10-09 | III Holdings 12, LLC | Audio decoding device and audio encoding device |
| US7933770B2 (en) | 2006-07-14 | 2011-04-26 | Siemens Audiologische Technik Gmbh | Method and device for coding audio data based on vector quantisation |
| CN102096937B (zh) | 2006-07-24 | 2014-07-09 | 索尼株式会社 | 毛发运动合成器系统和用于毛发/皮毛流水线的优化技术 |
| US7987089B2 (en) | 2006-07-31 | 2011-07-26 | Qualcomm Incorporated | Systems and methods for modifying a zero pad region of a windowed frame of an audio signal |
| US8024192B2 (en) | 2006-08-15 | 2011-09-20 | Broadcom Corporation | Time-warping of decoded audio signal after packet loss |
| US7877253B2 (en) | 2006-10-06 | 2011-01-25 | Qualcomm Incorporated | Systems, methods, and apparatus for frame erasure recovery |
| DE102006049154B4 (de) | 2006-10-18 | 2009-07-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Kodierung eines Informationssignals |
| US8036903B2 (en) | 2006-10-18 | 2011-10-11 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Analysis filterbank, synthesis filterbank, encoder, de-coder, mixer and conferencing system |
| US8041578B2 (en) | 2006-10-18 | 2011-10-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
| US8417532B2 (en) | 2006-10-18 | 2013-04-09 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
| US8126721B2 (en) | 2006-10-18 | 2012-02-28 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
| BRPI0709310B1 (pt) * | 2006-10-25 | 2019-11-05 | Fraunhofer Ges Zur Foeerderung Der Angewandten Forschung E V | equipamento e método para a geração de valores de sub-banda de áudio e equipamento e método para a geração de amostras de áudio no domínio do tempo |
| DE102006051673A1 (de) | 2006-11-02 | 2008-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Nachbearbeiten von Spektralwerten und Encodierer und Decodierer für Audiosignale |
| MX2009006201A (es) | 2006-12-12 | 2009-06-22 | Fraunhofer Ges Forschung | Codificador, decodificador y metodos para codificar y decodificar segmentos de datos que representan una corriente de datos del dominio temporal. |
| FR2911228A1 (fr) | 2007-01-05 | 2008-07-11 | France Telecom | Codage par transformee, utilisant des fenetres de ponderation et a faible retard. |
| KR101379263B1 (ko) | 2007-01-12 | 2014-03-28 | 삼성전자주식회사 | 대역폭 확장 복호화 방법 및 장치 |
| FR2911426A1 (fr) | 2007-01-15 | 2008-07-18 | France Telecom | Modification d'un signal de parole |
| US7873064B1 (en) | 2007-02-12 | 2011-01-18 | Marvell International Ltd. | Adaptive jitter buffer-packet loss concealment |
| BRPI0808202A8 (pt) | 2007-03-02 | 2016-11-22 | Panasonic Corp | Dispositivo de codificação e método de codificação. |
| JP5596341B2 (ja) | 2007-03-02 | 2014-09-24 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 音声符号化装置および音声符号化方法 |
| JP4708446B2 (ja) | 2007-03-02 | 2011-06-22 | パナソニック株式会社 | 符号化装置、復号装置およびそれらの方法 |
| DE102007013811A1 (de) | 2007-03-22 | 2008-09-25 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Verfahren zur zeitlichen Segmentierung eines Videos in Videobildfolgen und zur Auswahl von Keyframes für das Auffinden von Bildinhalten unter Einbeziehung einer Subshot-Detektion |
| JP2008261904A (ja) | 2007-04-10 | 2008-10-30 | Matsushita Electric Ind Co Ltd | 符号化装置、復号化装置、符号化方法および復号化方法 |
| US8630863B2 (en) | 2007-04-24 | 2014-01-14 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding and decoding audio/speech signal |
| CN101388210B (zh) | 2007-09-15 | 2012-03-07 | 华为技术有限公司 | 编解码方法及编解码器 |
| ES2529292T3 (es) | 2007-04-29 | 2015-02-18 | Huawei Technologies Co., Ltd. | Método de codificación y de decodificación |
| ES2663269T3 (es) | 2007-06-11 | 2018-04-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificador de audio para codificar una señal de audio que tiene una porción similar a un impulso y una porción estacionaria |
| US9653088B2 (en) | 2007-06-13 | 2017-05-16 | Qualcomm Incorporated | Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding |
| KR101513028B1 (ko) | 2007-07-02 | 2015-04-17 | 엘지전자 주식회사 | 방송 수신기 및 방송신호 처리방법 |
| US8185381B2 (en) | 2007-07-19 | 2012-05-22 | Qualcomm Incorporated | Unified filter bank for performing signal conversions |
| CN101110214B (zh) * | 2007-08-10 | 2011-08-17 | 北京理工大学 | 一种基于多描述格型矢量量化技术的语音编码方法 |
| US8428957B2 (en) | 2007-08-24 | 2013-04-23 | Qualcomm Incorporated | Spectral noise shaping in audio coding based on spectral dynamics in frequency sub-bands |
| CN101878504B (zh) | 2007-08-27 | 2013-12-04 | 爱立信电话股份有限公司 | 使用时间分辨率能选择的低复杂性频谱分析/合成 |
| JP4886715B2 (ja) | 2007-08-28 | 2012-02-29 | 日本電信電話株式会社 | 定常率算出装置、雑音レベル推定装置、雑音抑圧装置、それらの方法、プログラム及び記録媒体 |
| JP5264913B2 (ja) | 2007-09-11 | 2013-08-14 | ヴォイスエイジ・コーポレーション | 話声およびオーディオの符号化における、代数符号帳の高速検索のための方法および装置 |
| CN100524462C (zh) | 2007-09-15 | 2009-08-05 | 华为技术有限公司 | 对高带信号进行帧错误隐藏的方法及装置 |
| US8576096B2 (en) | 2007-10-11 | 2013-11-05 | Motorola Mobility Llc | Apparatus and method for low complexity combinatorial coding of signals |
| KR101373004B1 (ko) * | 2007-10-30 | 2014-03-26 | 삼성전자주식회사 | 고주파수 신호 부호화 및 복호화 장치 및 방법 |
| CN101425292B (zh) | 2007-11-02 | 2013-01-02 | 华为技术有限公司 | 一种音频信号的解码方法及装置 |
| DE102007055830A1 (de) | 2007-12-17 | 2009-06-18 | Zf Friedrichshafen Ag | Verfahren und Vorrichtung zum Betrieb eines Hybridantriebes eines Fahrzeuges |
| CN101483043A (zh) | 2008-01-07 | 2009-07-15 | 中兴通讯股份有限公司 | 基于分类和排列组合的码本索引编码方法 |
| CN101488344B (zh) | 2008-01-16 | 2011-09-21 | 华为技术有限公司 | 一种量化噪声泄漏控制方法及装置 |
| DE102008015702B4 (de) | 2008-01-31 | 2010-03-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zur Bandbreitenerweiterung eines Audiosignals |
| ATE528747T1 (de) | 2008-03-04 | 2011-10-15 | Fraunhofer Ges Forschung | Vorrichtung zum mischen mehrerer eingabedatenströme |
| US8000487B2 (en) | 2008-03-06 | 2011-08-16 | Starkey Laboratories, Inc. | Frequency translation by high-frequency spectral envelope warping in hearing assistance devices |
| FR2929466A1 (fr) | 2008-03-28 | 2009-10-02 | France Telecom | Dissimulation d'erreur de transmission dans un signal numerique dans une structure de decodage hierarchique |
| EP2107556A1 (en) | 2008-04-04 | 2009-10-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio transform coding using pitch correction |
| US8423852B2 (en) | 2008-04-15 | 2013-04-16 | Qualcomm Incorporated | Channel decoding-based error detection |
| US8768690B2 (en) | 2008-06-20 | 2014-07-01 | Qualcomm Incorporated | Coding scheme selection for low-bit-rate applications |
| PL2346030T3 (pl) | 2008-07-11 | 2015-03-31 | Fraunhofer Ges Forschung | Koder audio, sposób kodowania sygnału audio oraz program komputerowy |
| AU2009267518B2 (en) | 2008-07-11 | 2012-08-16 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding/decoding an audio signal using an aliasing switch scheme |
| RU2536679C2 (ru) | 2008-07-11 | 2014-12-27 | Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен | Передатчик сигнала активации с деформацией по времени, кодер звукового сигнала, способ преобразования сигнала активации с деформацией по времени, способ кодирования звукового сигнала и компьютерные программы |
| EP2144230A1 (en) | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
| ES2657393T3 (es) | 2008-07-11 | 2018-03-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificador y descodificador de audio para codificar y descodificar muestras de audio |
| ES2683077T3 (es) | 2008-07-11 | 2018-09-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificador y decodificador de audio para codificar y decodificar tramas de una señal de audio muestreada |
| MX2011000375A (es) | 2008-07-11 | 2011-05-19 | Fraunhofer Ges Forschung | Codificador y decodificador de audio para codificar y decodificar tramas de una señal de audio muestreada. |
| US8352279B2 (en) | 2008-09-06 | 2013-01-08 | Huawei Technologies Co., Ltd. | Efficient temporal envelope coding approach by prediction between low band signal and high band signal |
| US8380498B2 (en) | 2008-09-06 | 2013-02-19 | GH Innovation, Inc. | Temporal envelope coding of energy attack signal by using attack point location |
| WO2010031049A1 (en) | 2008-09-15 | 2010-03-18 | GH Innovation, Inc. | Improving celp post-processing for music signals |
| US8798776B2 (en) | 2008-09-30 | 2014-08-05 | Dolby International Ab | Transcoding of audio metadata |
| DE102008042579B4 (de) | 2008-10-02 | 2020-07-23 | Robert Bosch Gmbh | Verfahren zur Fehlerverdeckung bei fehlerhafter Übertragung von Sprachdaten |
| CN102177426B (zh) | 2008-10-08 | 2014-11-05 | 弗兰霍菲尔运输应用研究公司 | 多分辨率切换音频编码/解码方案 |
| KR101315617B1 (ko) | 2008-11-26 | 2013-10-08 | 광운대학교 산학협력단 | 모드 스위칭에 기초하여 윈도우 시퀀스를 처리하는 통합 음성/오디오 부/복호화기 |
| CN101770775B (zh) * | 2008-12-31 | 2011-06-22 | 华为技术有限公司 | 信号处理方法及装置 |
| ES2904373T3 (es) | 2009-01-16 | 2022-04-04 | Dolby Int Ab | Transposición armónica mejorada de producto cruzado |
| US8457975B2 (en) | 2009-01-28 | 2013-06-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, audio encoder, methods for decoding and encoding an audio signal and computer program |
| EP2214165A3 (en) | 2009-01-30 | 2010-09-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus, method and computer program for manipulating an audio signal comprising a transient event |
| EP2398017B1 (en) | 2009-02-16 | 2014-04-23 | Electronics and Telecommunications Research Institute | Encoding/decoding method for audio signals using adaptive sinusoidal coding and apparatus thereof |
| EP2234103B1 (en) | 2009-03-26 | 2011-09-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Device and method for manipulating an audio signal |
| KR20100115215A (ko) | 2009-04-17 | 2010-10-27 | 삼성전자주식회사 | 가변 비트율 오디오 부호화 및 복호화 장치 및 방법 |
| EP3352168B1 (en) | 2009-06-23 | 2020-09-16 | VoiceAge Corporation | Forward time-domain aliasing cancellation with application in weighted or original signal domain |
| JP5267362B2 (ja) | 2009-07-03 | 2013-08-21 | 富士通株式会社 | オーディオ符号化装置、オーディオ符号化方法及びオーディオ符号化用コンピュータプログラムならびに映像伝送装置 |
| CN101958119B (zh) | 2009-07-16 | 2012-02-29 | 中兴通讯股份有限公司 | 一种改进的离散余弦变换域音频丢帧补偿器和补偿方法 |
| US8635357B2 (en) | 2009-09-08 | 2014-01-21 | Google Inc. | Dynamic selection of parameter sets for transcoding media data |
| CA2778373C (en) | 2009-10-20 | 2015-12-01 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio signal encoder, audio signal decoder, method for providing an encoded representation of an audio content, method for providing a decoded representation of an audio content and computer program for use in low delay applications |
| AU2010309838B2 (en) * | 2009-10-20 | 2014-05-08 | Navigate Llc | Audio signal encoder, audio signal decoder, method for encoding or decoding an audio signal using an aliasing-cancellation |
| CN102859589B (zh) | 2009-10-20 | 2014-07-09 | 弗兰霍菲尔运输应用研究公司 | 多模式音频编译码器及其适用的码簿激励线性预测编码 |
| CN102081927B (zh) | 2009-11-27 | 2012-07-18 | 中兴通讯股份有限公司 | 一种可分层音频编码、解码方法及系统 |
| US8428936B2 (en) | 2010-03-05 | 2013-04-23 | Motorola Mobility Llc | Decoder for audio signal including generic audio and speech frames |
| US8423355B2 (en) | 2010-03-05 | 2013-04-16 | Motorola Mobility Llc | Encoder for audio signal including generic audio and speech frames |
| CN103069484B (zh) * | 2010-04-14 | 2014-10-08 | 华为技术有限公司 | 时/频二维后处理 |
| WO2011147950A1 (en) | 2010-05-28 | 2011-12-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low-delay unified speech and audio codec |
| CN103477386B (zh) | 2011-02-14 | 2016-06-01 | 弗劳恩霍夫应用研究促进协会 | 音频编解码器中的噪声产生 |
| CA2827249C (en) * | 2011-02-14 | 2016-08-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
| WO2013075753A1 (en) | 2011-11-25 | 2013-05-30 | Huawei Technologies Co., Ltd. | An apparatus and a method for encoding an input signal |
-
2012
- 2012-02-10 CA CA2827249A patent/CA2827249C/en active Active
- 2012-02-10 MX MX2013009344A patent/MX2013009344A/es active IP Right Grant
- 2012-02-10 WO PCT/EP2012/052292 patent/WO2012110415A1/en not_active Ceased
- 2012-02-10 TW TW101104349A patent/TWI469136B/zh active
- 2012-02-10 ES ES12704258.8T patent/ES2529025T3/es active Active
- 2012-02-10 MY MYPI2013002981A patent/MY164797A/en unknown
- 2012-02-10 RU RU2013142138/08A patent/RU2560788C2/ru active
- 2012-02-10 SG SG2013061361A patent/SG192746A1/en unknown
- 2012-02-10 AR ARP120100444A patent/AR085362A1/es active IP Right Grant
- 2012-02-10 AU AU2012217269A patent/AU2012217269B2/en active Active
- 2012-02-10 EP EP12704258.8A patent/EP2676268B1/en active Active
- 2012-02-10 BR BR112013020482A patent/BR112013020482B1/pt active IP Right Grant
- 2012-02-10 JP JP2013553881A patent/JP5666021B2/ja active Active
- 2012-02-10 CN CN201280015997.7A patent/CN103503061B/zh active Active
- 2012-02-10 PL PL12704258T patent/PL2676268T3/pl unknown
- 2012-02-10 KR KR1020137023820A patent/KR101699898B1/ko active Active
-
2013
- 2013-08-14 US US13/966,570 patent/US9583110B2/en active Active
- 2013-09-11 ZA ZA2013/06838A patent/ZA201306838B/en unknown
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050131696A1 (en) * | 2001-06-29 | 2005-06-16 | Microsoft Corporation | Frequency domain postfiltering for quality enhancement of coded speech |
| US7519538B2 (en) * | 2003-10-30 | 2009-04-14 | Koninklijke Philips Electronics N.V. | Audio signal encoding or decoding |
| WO2006126844A2 (en) * | 2005-05-26 | 2006-11-30 | Lg Electronics Inc. | Method and apparatus for decoding an audio signal |
| TW201009810A (en) * | 2008-07-11 | 2010-03-01 | Fraunhofer Ges Forschung | Time warp contour calculator, audio signal encoder, encoded audio signal representation, methods and computer program |
| TW201032218A (en) * | 2009-01-28 | 2010-09-01 | Fraunhofer Ges Forschung | Audio encoder, audio decoder, encoded audio information, methods for encoding and decoding an audio signal and computer program |
Non-Patent Citations (1)
| Title |
|---|
| LANCIANI C. A. and SCHAFER R. W. "SUBBAND-DOMAIN FILTERING OF MPEG AUDIO SIGNALS," IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PHOENIX, AZ, MARCH 15 - 19, 1999, pp. 917-920 * |
Also Published As
| Publication number | Publication date |
|---|---|
| JP5666021B2 (ja) | 2015-02-04 |
| BR112013020482A2 (pt) | 2018-07-10 |
| US9583110B2 (en) | 2017-02-28 |
| AU2012217269B2 (en) | 2015-10-22 |
| EP2676268A1 (en) | 2013-12-25 |
| EP2676268B1 (en) | 2014-12-03 |
| PL2676268T3 (pl) | 2015-05-29 |
| TW201237848A (en) | 2012-09-16 |
| ZA201306838B (en) | 2014-05-28 |
| MY164797A (en) | 2018-01-30 |
| US20130332151A1 (en) | 2013-12-12 |
| WO2012110415A1 (en) | 2012-08-23 |
| MX2013009344A (es) | 2013-10-01 |
| CA2827249A1 (en) | 2012-08-23 |
| ES2529025T3 (es) | 2015-02-16 |
| CN103503061B (zh) | 2016-02-17 |
| CN103503061A (zh) | 2014-01-08 |
| AR085362A1 (es) | 2013-09-25 |
| JP2014510301A (ja) | 2014-04-24 |
| CA2827249C (en) | 2016-08-23 |
| AU2012217269A1 (en) | 2013-09-05 |
| HK1192048A1 (zh) | 2014-08-08 |
| RU2013142138A (ru) | 2015-03-27 |
| RU2560788C2 (ru) | 2015-08-20 |
| KR20130133843A (ko) | 2013-12-09 |
| BR112013020482B1 (pt) | 2021-02-23 |
| KR101699898B1 (ko) | 2017-01-25 |
| SG192746A1 (en) | 2013-09-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI469136B (zh) | 在一頻譜域中用以處理已解碼音訊信號之裝置及方法 | |
| TWI488177B (zh) | 使用頻譜域雜訊整形之基於線性預測的編碼方案 | |
| JP5165559B2 (ja) | オーディオコーデックポストフィルタ | |
| CN102934163B (zh) | 用于宽带语音编码的系统、方法、设备 | |
| TWI479478B (zh) | 用以使用對齊的預看部分將音訊信號解碼的裝置與方法 | |
| MX2011000366A (es) | Codificador y decodificador de audio para codificar y decodificar muestras de audio. | |
| WO2013061584A1 (ja) | 音信号ハイブリッドデコーダ、音信号ハイブリッドエンコーダ、音信号復号方法、及び音信号符号化方法 | |
| JPH09127985A (ja) | 信号符号化方法及び装置 | |
| JPH09127987A (ja) | 信号符号化方法及び装置 | |
| JP3598111B2 (ja) | 広帯域音声復元装置 | |
| HK1192048B (zh) | 在一频谱域中用以处理已解码音讯信号的装置及方法 | |
| Li et al. | Audio codingwith power spectral density preserving quantization | |
| JP3560964B2 (ja) | 広帯域音声復元装置及び広帯域音声復元方法及び音声伝送システム及び音声伝送方法 | |
| JPH09127994A (ja) | 信号符号化方法及び装置 | |
| JP3598112B2 (ja) | 広帯域音声復元方法及び広帯域音声復元装置 | |
| RU2574849C2 (ru) | Устройство и способ для кодирования и декодирования аудиосигнала с использованием выровненной части опережающего просмотра | |
| JPH09127986A (ja) | 符号化信号の多重化方法及び信号符号化装置 | |
| JP2006065362A (ja) | 広帯域音声復元装置 | |
| JP2004046238A (ja) | 広帯域音声復元装置及び広帯域音声復元方法 | |
| JP2004355018A (ja) | 広帯域音声復元方法及び広帯域音声復元装置 | |
| JP2004341551A (ja) | 広帯域音声復元方法及び広帯域音声復元装置 | |
| JP2005284317A (ja) | 広帯域音声復元方法及び広帯域音声復元装置 | |
| JP2005284314A (ja) | 広帯域音声復元方法及び広帯域音声復元装置 |