[go: up one dir, main page]

CN1653521B - 用于音频代码转换中的自适应码本音调滞后计算的方法 - Google Patents

用于音频代码转换中的自适应码本音调滞后计算的方法 Download PDF

Info

Publication number
CN1653521B
CN1653521B CN038106450A CN03810645A CN1653521B CN 1653521 B CN1653521 B CN 1653521B CN 038106450 A CN038106450 A CN 038106450A CN 03810645 A CN03810645 A CN 03810645A CN 1653521 B CN1653521 B CN 1653521B
Authority
CN
China
Prior art keywords
subframe
tone
input
sluggish
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN038106450A
Other languages
English (en)
Chinese (zh)
Other versions
CN1653521A (zh
Inventor
M·A·加布里
J·W·王
S·乔吉
M·伊布拉西姆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Di Lee Sim For Benefit Of Creditors Ltd
Di Lee Sim Network Inc
Dilithium Networks Inc
Original Assignee
Dilithium Networks Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dilithium Networks Inc filed Critical Dilithium Networks Inc
Publication of CN1653521A publication Critical patent/CN1653521A/zh
Application granted granted Critical
Publication of CN1653521B publication Critical patent/CN1653521B/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/173Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CN038106450A 2002-03-12 2003-03-12 用于音频代码转换中的自适应码本音调滞后计算的方法 Expired - Fee Related CN1653521B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US36440302P 2002-03-12 2002-03-12
US60/364,403 2002-03-12
PCT/US2003/007901 WO2003079330A1 (fr) 2002-03-12 2003-03-12 Procede de calcul du retard du pas de livres de codes adaptatifs dans des transcodeurs audio

Publications (2)

Publication Number Publication Date
CN1653521A CN1653521A (zh) 2005-08-10
CN1653521B true CN1653521B (zh) 2010-05-26

Family

ID=28041908

Family Applications (1)

Application Number Title Priority Date Filing Date
CN038106450A Expired - Fee Related CN1653521B (zh) 2002-03-12 2003-03-12 用于音频代码转换中的自适应码本音调滞后计算的方法

Country Status (7)

Country Link
US (2) US7260524B2 (fr)
EP (1) EP1483758A4 (fr)
JP (1) JP2005520206A (fr)
KR (1) KR20040104508A (fr)
CN (1) CN1653521B (fr)
AU (1) AU2003214182A1 (fr)
WO (1) WO2003079330A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9263051B2 (en) 2009-01-06 2016-02-16 Skype Speech coding by quantizing with random-noise signal
US9530423B2 (en) 2009-01-06 2016-12-27 Skype Speech encoding by determining a quantization gain based on inverse of a pitch correlation

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1483758A4 (fr) * 2002-03-12 2007-04-11 Dilithium Networks Pty Ltd Procede de calcul du retard du pas de livres de codes adaptatifs dans des transcodeurs audio
KR100546758B1 (ko) * 2003-06-30 2006-01-26 한국전자통신연구원 음성의 상호부호화시 전송률 결정 장치 및 방법
US7433815B2 (en) * 2003-09-10 2008-10-07 Dilithium Networks Pty Ltd. Method and apparatus for voice transcoding between variable rate coders
US7519532B2 (en) * 2003-09-29 2009-04-14 Texas Instruments Incorporated Transcoding EVRC to G.729ab
US9058812B2 (en) * 2005-07-27 2015-06-16 Google Technology Holdings LLC Method and system for coding an information signal using pitch delay contour adjustment
US7602745B2 (en) * 2005-12-05 2009-10-13 Intel Corporation Multiple input, multiple output wireless communication system, associated methods and data structures
JP3981399B1 (ja) * 2006-03-10 2007-09-26 松下電器産業株式会社 固定符号帳探索装置および固定符号帳探索方法
KR100900438B1 (ko) * 2006-04-25 2009-06-01 삼성전자주식회사 음성 패킷 복구 장치 및 방법
US8218529B2 (en) * 2006-07-07 2012-07-10 Avaya Canada Corp. Device for and method of terminating a VoIP call
EP1903559A1 (fr) * 2006-09-20 2008-03-26 Deutsche Thomson-Brandt Gmbh Procédé et dispositif de transcodage de signaux audio
GB2466669B (en) * 2009-01-06 2013-03-06 Skype Speech coding
GB2466673B (en) 2009-01-06 2012-11-07 Skype Quantization
GB2466670B (en) 2009-01-06 2012-11-14 Skype Speech encoding
GB2466672B (en) 2009-01-06 2013-03-13 Skype Speech coding
US8243610B2 (en) * 2009-04-21 2012-08-14 Futurewei Technologies, Inc. System and method for precoding codebook adaptation with low feedback overhead
EP2249334A1 (fr) 2009-05-08 2010-11-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Transcodeur de format audio
US8452606B2 (en) 2009-09-29 2013-05-28 Skype Speech encoding using multiple bit rates
US8521541B2 (en) 2010-11-02 2013-08-27 Google Inc. Adaptive audio transcoding
CN104243734B (zh) * 2013-06-18 2019-03-01 深圳市共进电子股份有限公司 音频处理系统和方法
MY177559A (en) 2013-06-21 2020-09-18 Fraunhofer Ges Forschung Apparatus and method for improved concealment of the adaptive codebook in acelp-like concealment employing improved pitch lag estimation
BR112015031606B1 (pt) 2013-06-21 2021-12-14 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Aparelho e método para desvanecimento de sinal aperfeiçoado em diferentes domínios durante ocultação de erros
EP3011555B1 (fr) 2013-06-21 2018-03-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Reconstructrion d'une trame vocale
EP2980799A1 (fr) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Appareil et procédé de traitement d'un signal audio à l'aide d'un post-filtre harmonique

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08146997A (ja) 1994-11-21 1996-06-07 Hitachi Ltd 符号変換装置および符号変換システム
US5995923A (en) * 1997-06-26 1999-11-30 Nortel Networks Corporation Method and apparatus for improving the voice quality of tandemed vocoders
US6115687A (en) * 1996-11-11 2000-09-05 Matsushita Electric Industrial Co., Ltd. Sound reproducing speed converter

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6260009B1 (en) * 1999-02-12 2001-07-10 Qualcomm Incorporated CELP-based to CELP-based vocoder packet translation
WO2001020595A1 (fr) * 1999-09-14 2001-03-22 Fujitsu Limited Codeur/decodeur vocal
US6760698B2 (en) * 2000-09-15 2004-07-06 Mindspeed Technologies Inc. System for coding speech information using an adaptive codebook with enhanced variable resolution scheme
JP2002202799A (ja) * 2000-10-30 2002-07-19 Fujitsu Ltd 音声符号変換装置
JP2002229599A (ja) 2001-02-02 2002-08-16 Nec Corp 音声符号列の変換装置および変換方法
EP1483758A4 (fr) * 2002-03-12 2007-04-11 Dilithium Networks Pty Ltd Procede de calcul du retard du pas de livres de codes adaptatifs dans des transcodeurs audio
JP2004222009A (ja) 2003-01-16 2004-08-05 Nec Corp 異種網接続ゲートウェイおよび異種網間通信課金システム

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08146997A (ja) 1994-11-21 1996-06-07 Hitachi Ltd 符号変換装置および符号変換システム
US6115687A (en) * 1996-11-11 2000-09-05 Matsushita Electric Industrial Co., Ltd. Sound reproducing speed converter
US5995923A (en) * 1997-06-26 1999-11-30 Nortel Networks Corporation Method and apparatus for improving the voice quality of tandemed vocoders

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9263051B2 (en) 2009-01-06 2016-02-16 Skype Speech coding by quantizing with random-noise signal
US9530423B2 (en) 2009-01-06 2016-12-27 Skype Speech encoding by determining a quantization gain based on inverse of a pitch correlation

Also Published As

Publication number Publication date
AU2003214182A1 (en) 2003-09-29
JP2005520206A (ja) 2005-07-07
EP1483758A1 (fr) 2004-12-08
US7260524B2 (en) 2007-08-21
WO2003079330A1 (fr) 2003-09-25
US20080189101A1 (en) 2008-08-07
KR20040104508A (ko) 2004-12-10
CN1653521A (zh) 2005-08-10
US20040002855A1 (en) 2004-01-01
US7996217B2 (en) 2011-08-09
EP1483758A4 (fr) 2007-04-11

Similar Documents

Publication Publication Date Title
CN1653521B (zh) 用于音频代码转换中的自适应码本音调滞后计算的方法
RU2675044C1 (ru) Способ квантования коэффициентов кодирования с линейным предсказанием, способ кодирования звука, способ деквантования коэффициентов кодирования с линейным предсказанием, способ декодирования звука и носитель записи
KR100923896B1 (ko) 분산형 음성 인식 시스템에서 음성 활성을 송신하는 방법및 장치
US6625576B2 (en) Method and apparatus for performing text-to-speech conversion in a client/server environment
KR100837451B1 (ko) 향상된 품질의 음성 변환부호화를 위한 방법 및 장치
CN105244034B (zh) 针对语音信号或音频信号的量化方法以及解码方法和设备
CN1954367B (zh) 支持音频编码器模式间的转换
US6119086A (en) Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens
US20230197061A1 (en) Method and System for Outputting Target Audio, Readable Storage Medium, and Electronic Device
JP2004501391A (ja) 可変レート音声符号器におけるフレーム消去補償方法
AU5958599A (en) Automatic speech/speaker recognition over digital wireless channels
JP4511094B2 (ja) 音声コーダにおける線スペクトル情報量子化方法を交錯するための方法および装置
CN112908293B (zh) 一种基于语义注意力机制的多音字发音纠错方法及装置
CN102934162A (zh) 搜索随后被重放的包括基本层和至少一个增强层分层分级比特流的方法和设备
JP2003036097A (ja) 情報検出装置及び方法、並びに情報検索装置及び方法
US20020128826A1 (en) Speech recognition system and method, and information processing apparatus and method used in that system
US7200557B2 (en) Method of reducing index sizes used to represent spectral content vectors
CN120600029B (zh) 一种智能体对话系统及方法
Garcia et al. Low bit rate compression methods of feature vectors for distributed speech recognition
JP3700310B2 (ja) ベクトル量子化装置及びベクトル量子化方法
Huong et al. A new vocoder based on AMR 7.4 kbit/s mode in speaker dependent coding system
JP4932530B2 (ja) 音響処理装置、音響処理方法、音響処理プログラム、照合処理装置、照合処理方法及び照合処理プログラム
TW541516B (en) Distributed speech recognition using dynamically determined feature vector codebook size
JPH09120300A (ja) ベクトル量子化装置
US7031914B2 (en) Systems and methods for concatenating electronically encoded voice

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: ONMOBILE GLOBAL LTD.

Free format text: FORMER OWNER: DILITHIUM (ASSIGNMENT FOR THE BENEFIT OF CREDITORS) INC.

Effective date: 20130220

Owner name: DILITHIUM (ASSIGNMENT FOR THE BENEFIT OF CREDITORS

Free format text: FORMER OWNER: DILITHIUM NETWORKS INC.

Effective date: 20130220

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20130220

Address after: bangalore

Patentee after: DILITHIUM NETWORKS, Inc.

Address before: California, USA

Patentee before: Di Lee Sim (for the benefit of creditors) Ltd.

Effective date of registration: 20130220

Address after: California, USA

Patentee after: Di Lee Sim (for the benefit of creditors) Ltd.

Address before: California, USA

Patentee before: Di Lee Sim Network Inc.

Effective date of registration: 20130220

Address after: California, USA

Patentee after: Di Lee Sim Network Inc.

Address before: New South Wales

Patentee before: DILITHIUM NETWORKS Pty Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100526

Termination date: 20150312

EXPY Termination of patent right or utility model