WO2008108083A1 - Voice encoding device and voice encoding method - Google Patents

Voice encoding device and voice encoding method

Info

Publication number
WO2008108083A1
WO2008108083A1 PCT/JP2008/000407 JP2008000407W WO2008108083A1 WO 2008108083 A1 WO2008108083 A1 WO 2008108083A1 JP 2008000407 W JP2008000407 W JP 2008000407W WO 2008108083 A1 WO2008108083 A1 WO 2008108083A1
Authority
WO
WIPO (PCT)
Prior art keywords
pitch pulse
pulse
pitch
point
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2008/000407
Other languages
English (en)
French (fr)
Inventor
Hiroyuki Ehara
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Corp
Panasonic Holdings Corp
Original Assignee
Panasonic Corp
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2007-03-02
Filing date
2008-02-29
Publication date
2008-09-12
Application filed by Panasonic Corp, Matsushita Electric Industrial Co Ltd
Priority to US12/528,880 (US8364472B2)
Priority to JP2009502461A (JP5596341B2)
Priority to EP08710510A (EP2128855A1)
Publication of WO2008108083A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005: Correction of errors induced by the transmission channel, if related to the coding algorithm
    • G10L19/04: Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08: Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09: Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • G10L2019/0001: Codebooks
    • G10L2019/0011: Long term prediction filters, i.e. pitch estimation

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

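A voice encoding device capable of detecting the optimal pitch pulse when pitch pulse information is used as redundant information for erasure concealment processing. In this device, a search start point determination unit (121) determines, as the search start point, the point furthest in the past among the points at which a pitch pulse may exist. A pitch pulse candidate selection unit (122) takes the range from the search start point to the point immediately before the first point of the current frame as the search range, and selects positions in this range where the decoded excitation (sound source) vector has a large amplitude as pitch pulse position candidates. A changeover switch (125) switches sequentially among the pitch pulse position candidates input from the pitch pulse candidate selection unit (122) and outputs them to a pulse train generation unit (123) and an error minimization unit (124). The pulse train generation unit (123) generates, as a pulse train, the vector that would be generated in the current frame as the adaptive codebook component from the pitch pulse if a pitch pulse were placed at the candidate position input from the changeover switch (125).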
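The search described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Several things are assumptions made only for illustration: the function names (`select_candidates`, `make_pulse_train`, `find_pitch_pulse`), a constant pitch lag with unit adaptive codebook gain, a squared-error criterion against a current-frame target vector, and a `max_lookback` parameter that stands in for the search start point determination. The abstract itself does not specify the error measure or how the search start point is computed.

```python
def select_candidates(past_excitation, search_start, num_candidates):
    """Pick the largest-magnitude samples of the decoded excitation.

    The search range runs from search_start to the last past sample,
    i.e. the point just before the head of the current frame.
    """
    indices = range(search_start, len(past_excitation))
    ranked = sorted(indices, key=lambda i: abs(past_excitation[i]), reverse=True)
    return ranked[:num_candidates]


def make_pulse_train(past_excitation, pulse_pos, pitch_lag, frame_len):
    """Build the current-frame vector produced by a single past pulse.

    Assumption: the adaptive codebook simply repeats the pulse every
    pitch_lag samples with unit gain.
    """
    past_len = len(past_excitation)
    train = [0.0] * frame_len
    amplitude = past_excitation[pulse_pos]
    n = pulse_pos + pitch_lag
    while n < past_len + frame_len:
        if n >= past_len:  # only samples that fall inside the current frame
            train[n - past_len] = amplitude
        n += pitch_lag
    return train


def squared_error(target, train):
    """Assumed error measure: sum of squared differences."""
    return sum((t - p) ** 2 for t, p in zip(target, train))


def find_pitch_pulse(past_excitation, target, pitch_lag, max_lookback,
                     num_candidates=3):
    """Try each candidate in turn and keep the one with the smallest error."""
    # Hypothetical stand-in for the search start point determination unit.
    search_start = max(0, len(past_excitation) - max_lookback)
    candidates = select_candidates(past_excitation, search_start, num_candidates)
    return min(
        candidates,
        key=lambda pos: squared_error(
            target,
            make_pulse_train(past_excitation, pos, pitch_lag, len(target)),
        ),
    )


# Usage with made-up numbers: 10 past samples, an 8-sample current frame.
past = [0.0, 0.0, 0.2, 0.0, 0.0, 0.9, 0.0, 0.0, -0.8, 0.0]
target = [0.0, 0.0, -0.7, 0.0, 0.0, 0.0, -0.75, 0.0]
best_position = find_pitch_pulse(past, target, pitch_lag=4, max_lookback=8)
print("selected pitch pulse position:", best_position)
```

In this toy input the largest-amplitude sample is not the one selected, because the error is evaluated on the pulse train it would produce in the current frame rather than on amplitude alone.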
PCT/JP2008/000407 2007-03-02 2008-02-29 Voice encoding device and voice encoding method Ceased WO2008108083A1 (ja)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US12/528,880 US8364472B2 (en) 2007-03-02 2008-02-29 Voice encoding device and voice encoding method
JP2009502461A JP5596341B2 (ja) 2007-03-02 2008-02-29 Voice encoding device and voice encoding method
EP08710510A EP2128855A1 (en) 2007-03-02 2008-02-29 Voice encoding device and voice encoding method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007053530 2007-03-02
JP2007-053530 2007-03-02

Publications (1)

Publication Number Publication Date
WO2008108083A1 (ja) 2008-09-12

Family

ID=39737981

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/000407 Ceased WO2008108083A1 (ja) 2007-03-02 2008-02-29 Voice encoding device and voice encoding method

Country Status (4)

Country Link
US (1) US8364472B2 (ja)
EP (1) EP2128855A1 (ja)
JP (1) JP5596341B2 (ja)
WO (1) WO2008108083A1 (ja)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8725501B2 (en) * 2004-07-20 2014-05-13 Panasonic Corporation Audio decoding device and compensation frame generation method
US9082416B2 (en) * 2010-09-16 2015-07-14 Qualcomm Incorporated Estimating a pitch lag
EP2676266B1 (en) 2011-02-14 2015-03-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Linear prediction based coding scheme using spectral domain noise shaping
KR101551046B1 (ko) * 2011-02-14 2015-09-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for error concealment in low-delay unified speech and audio coding
CA2799343C (en) 2011-02-14 2016-06-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Information signal representation using lapped transform
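TWI469136B (zh) 2011-02-14 2015-01-11 Fraunhofer Ges Forschung Apparatus and method for processing a decoded audio signal in a spectral domain
RU2586597C2 (ru) 2011-02-14 2016-06-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding of pulse positions of tracks of an audio signal
CN103493129B (zh) 2011-02-14 2016-08-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for coding a portion of an audio signal using transient detection and a quality result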
US9275644B2 (en) * 2012-01-20 2016-03-01 Qualcomm Incorporated Devices for redundant frame coding and decoding
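CN104751849B (zh) * 2013-12-31 2017-04-19 Huawei Technologies Co., Ltd. Decoding method and device for speech/audio bitstream
CN107369454B (zh) 2014-03-21 2020-10-27 Huawei Technologies Co., Ltd. Decoding method and device for speech/audio bitstream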

Citations (2)

Publication number Priority date Publication date Assignee Title
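WO2005040749A1 (ja) * 2003-10-23 2005-05-06 Matsushita Electric Industrial Co., Ltd. Spectrum encoding device, spectrum decoding device, acoustic signal transmission device, acoustic signal reception device, and methods thereof
JP2005513539A (ja) * 2001-12-14 2005-05-12 Nokia Corporation Signal modification method for efficient coding of speech signals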

Family Cites Families (21)

Publication number Priority date Publication date Assignee Title
JPH04264597A (ja) * 1991-02-20 1992-09-21 Fujitsu Ltd Voice encoding device and voice decoding device
US5265190A (en) * 1991-05-31 1993-11-23 Motorola, Inc. CELP vocoder with efficient adaptive codebook search
EP0657874B1 (en) * 1993-12-10 2001-03-14 Nec Corporation Voice coder and a method for searching codebooks
JP3024467B2 (ja) * 1993-12-10 2000-03-21 NEC Corporation Voice encoding device
US5704003A (en) * 1995-09-19 1997-12-30 Lucent Technologies Inc. RCELP coder
DE19641619C1 (de) * 1996-10-09 1997-06-26 Nokia Mobile Phones Ltd Method for synthesizing a frame of a speech signal
DE69730316T2 (de) * 1996-11-07 2005-09-08 Matsushita Electric Industrial Co., Ltd., Kadoma Sound source generator, speech coder and speech decoder
US6385576B2 (en) * 1997-12-24 2002-05-07 Kabushiki Kaisha Toshiba Speech encoding/decoding method using reduced subframe pulse positions having density related to pitch
US6141638A (en) * 1998-05-28 2000-10-31 Motorola, Inc. Method and apparatus for coding an information signal
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
JP4173940B2 (ja) * 1999-03-05 2008-10-29 Matsushita Electric Industrial Co., Ltd. Voice encoding device and voice encoding method
WO2001052241A1 (fr) * 2000-01-11 2001-07-19 Matsushita Electric Industrial Co., Ltd. Multimode speech coding device and decoding device
US6757654B1 (en) 2000-05-11 2004-06-29 Telefonaktiebolaget Lm Ericsson Forward error correction in speech coding
CA2388439A1 (en) 2002-05-31 2003-11-30 Voiceage Corporation A method and device for efficient frame erasure concealment in linear predictive based speech codecs
JP4331928B2 (ja) 2002-09-11 2009-09-16 Panasonic Corporation Voice encoding device, voice decoding device, and methods thereof
US7047188B2 (en) * 2002-11-08 2006-05-16 Motorola, Inc. Method and apparatus for improvement coding of the subframe gain in a speech coding system
CN1735927B (zh) * 2003-01-09 2011-08-31 爱移通全球有限公司 Method and apparatus for high-quality speech transcoding
US7904292B2 (en) * 2004-09-30 2011-03-08 Panasonic Corporation Scalable encoding device, scalable decoding device, and method thereof
BRPI0607303A2 (pt) 2005-01-26 2009-08-25 Matsushita Electric Industrial Co Ltd Voice encoding device and voice encoding method
US20100049508A1 (en) * 2006-12-14 2010-02-25 Panasonic Corporation Audio encoding device and audio encoding method
JP5230444B2 (ja) * 2006-12-15 2013-07-10 Panasonic Corporation Adaptive sound source vector quantization device and adaptive sound source vector quantization method

Patent Citations (2)

Publication number Priority date Publication date Assignee Title
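JP2005513539A (ja) * 2001-12-14 2005-05-12 Nokia Corporation Signal modification method for efficient coding of speech signals
WO2005040749A1 (ja) * 2003-10-23 2005-05-06 Matsushita Electric Industrial Co., Ltd. Spectrum encoding device, spectrum decoding device, acoustic signal transmission device, acoustic signal reception device, and methods thereof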

Also Published As

Publication number Publication date
JPWO2008108083A1 (ja) 2010-06-10
US20100106488A1 (en) 2010-04-29
JP5596341B2 (ja) 2014-09-24
US8364472B2 (en) 2013-01-29
EP2128855A1 (en) 2009-12-02

Similar Documents

Publication Publication Date Title
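WO2008108083A1 (ja) Voice encoding device and voice encoding method
JP5190363B2 (ja) Voice decoding device, voice encoding device, and lost frame compensation method
WO2010085064A3 (ko) Apparatus and method for motion vector encoding/decoding, and apparatus and method for image encoding/decoding using same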
WO2007011653A3 (en) Selectively using multiple entropy models in adaptive coding and decoding
WO2008108081A1 (ja) Adaptive sound source vector quantization device and adaptive sound source vector quantization method
EP4629107A3 (en) Artificial intelligence-based text-to-speech system and method
WO2011090314A3 (en) Method and apparatus for encoding and decoding motion vector based on reduced motion vector predictor candidates
WO2011031692A3 (en) Speedup techniques for rate distortion optimized quantization
EP4583104A3 (en) Method for encoding a signal
DK1879179T3 (da) Method and device for coding audio data based on vector quantization
RU2011124080A (ru) Parameter decoding device, parameter encoding device, and parameter decoding method
CA2636330A1 (en) Method and apparatus for processing an audio signal
WO2012030193A3 (ko) Method for encoding and decoding video, and apparatus using same
JP2016504637A5 (ja)
MX2025001790A (es) Video encoding/decoding method and device for deriving a weight index for bidirectional prediction of a merge candidate, and method for transmitting a bitstream
WO2010078146A3 (en) Motion estimation techniques
JP2003337600A (ja) Method and apparatus for code conversion between speech coding/decoding schemes, and storage medium therefor
WO2009131406A3 (en) Decoding image
WO2008094821A3 (en) Systems and methods for low-complexity mimo detection using leaf-node prediction via look-up tables
CN103646647B (zh) Spectral parameter substitution method and system for frame error concealment in a hybrid audio decoder
WO2009096721A3 (en) Method and apparatus for encoding and decoding video signal using motion compensation based on affine transformation
US6856955B1 (en) Voice encoding/decoding device
ATE515019T1 (de) Method and device for performing optimized audio coding between two long-term prediction models
BRPI0418389A (pt) Predictive coding scheme
Tian et al. One in a hundred: Selecting the best predicted sequence from numerous candidates for speech recognition

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 08710510; Country of ref document: EP; Kind code of ref document: A1)
ENP Entry into the national phase (Ref document number: 2009502461; Country of ref document: JP; Kind code of ref document: A)
WWE WIPO information: entry into national phase (Ref document number: 2008710510; Country of ref document: EP)
WWE WIPO information: entry into national phase (Ref document number: 12528880; Country of ref document: US)
WWE WIPO information: entry into national phase (Ref document number: 1654/MUMNP/2009; Country of ref document: IN)
NENP Non-entry into the national phase (Ref country code: DE)