
WO2004090864A3 - Method and apparatus for the encoding and decoding of speech - Google Patents

Method and apparatus for the encoding and decoding of speech

Info

Publication number
WO2004090864A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
quantised
pvq
quantisation
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IN2004/000060
Other languages
French (fr)
Other versions
WO2004090864B1 (en)
WO2004090864A2 (en)
Inventor
Preeti Rao
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Indian Institute of Technology Bombay
Original Assignee
Indian Institute of Technology Bombay
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-03-12
Filing date
2004-03-12
Publication date
2005-03-24
Application filed by Indian Institute of Technology Bombay
Publication of WO2004090864A2
Publication of WO2004090864A3
Publication of WO2004090864B1
Legal status: Ceased

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G10L19/038 Vector quantisation, e.g. TwinVQ audio
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07 Line spectrum pair [LSP] vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

Methods and apparatus are described for encoding speech for communication to a decoder that reproduces the speech signal. The speech signal is represented by the parameters of a speech model, and a specific quantisation scheme is used for each parameter, with novel quantisation schemes for the spectral amplitudes. The spectral amplitudes are represented by line spectral frequencies (LSFs) and gain. The LSF vector is split into sub-vectors, which are quantised by SN-PVQ and frame-fill interpolation: the low-frequency sub-vector is quantised by an SN-PVQ scheme, while the high-frequency sub-vector is quantised by SN-PVQ in even-numbered frames and by frame-fill interpolation in odd-numbered frames. Optionally, all LSF sub-vectors can be quantised by SN-PVQ. Further, the gain parameters of two frames are jointly quantised. Together, these techniques yield an encoder and decoder system for speech coding that delivers communication-quality output speech at bit rates below 2 kbps.
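The abstract gives no split point, interpolation weights, or codebooks, so the following Python sketch is only an illustration of the two ideas it names: frame-fill interpolation of the high-frequency LSF sub-vector in an odd-numbered frame, and joint quantisation of the gains of two frames. It assumes that the odd frame is approximated as a weighted mix of the already-quantised sub-vectors of its two even-numbered neighbours, with only the index of the best weight transmitted, and that the gain pair is matched to the nearest entry of a two-dimensional codebook. `SPLIT`, `WEIGHTS`, the sample values, and all function names are hypothetical; the SN-PVQ stage itself is not modelled.

```python
from typing import List, Sequence, Tuple

SPLIT = 4                      # hypothetical: first 4 LSFs form the low-frequency sub-vector
WEIGHTS = (0.25, 0.5, 0.75)    # hypothetical set of frame-fill interpolation weights


def split_lsf(lsf: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Split an LSF vector into low- and high-frequency sub-vectors."""
    return list(lsf[:SPLIT]), list(lsf[SPLIT:])


def interpolate(prev: Sequence[float], nxt: Sequence[float], w: float) -> List[float]:
    """Weighted mix of the neighbouring even frames' sub-vectors."""
    return [(1.0 - w) * p + w * n for p, n in zip(prev, nxt)]


def squared_error(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def frame_fill_index(target: Sequence[float],
                     prev: Sequence[float],
                     nxt: Sequence[float]) -> int:
    """Encoder side: index of the weight that best reproduces the odd frame."""
    return min(range(len(WEIGHTS)),
               key=lambda i: squared_error(target, interpolate(prev, nxt, WEIGHTS[i])))


def joint_gain_index(gain_pair: Sequence[float],
                     codebook: Sequence[Sequence[float]]) -> int:
    """Joint quantisation of two frames' gains: nearest 2-D codebook entry."""
    return min(range(len(codebook)),
               key=lambda i: squared_error(gain_pair, codebook[i]))


# Illustrative values only (LSFs in radians, gains in dB).
prev_high = [2.0, 2.4]     # quantised high sub-vector, preceding even frame
next_high = [2.4, 2.8]     # quantised high sub-vector, following even frame
odd_high = [2.3, 2.7]      # unquantised high sub-vector of the odd frame

idx = frame_fill_index(odd_high, prev_high, next_high)
decoded_high = interpolate(prev_high, next_high, WEIGHTS[idx])   # decoder side

gain_codebook = [(20.0, 20.0), (30.0, 35.0), (40.0, 38.0)]
gain_idx = joint_gain_index((31.0, 33.0), gain_codebook)
```

With these sample values the encoder selects the weight 0.75 (index 2) and the gain codebook entry (30.0, 35.0) (index 1). Under this assumed form of frame-fill, the decoder cannot reconstruct an odd frame until the following even frame has arrived, which costs one frame of look-ahead delay.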

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN273/MUM/2003 2003-03-12
IN273MU2003 2003-03-12

Publications (3)

Publication Number Publication Date
WO2004090864A2 (en) 2004-10-21
WO2004090864A3 (en) 2005-03-24
WO2004090864B1 (en) 2005-05-19

Family

ID=33156203

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2004/000060 Method and apparatus for the encoding and decoding of speech (Ceased; WO2004090864A2 (en)) 2003-03-12 2004-03-12

Country Status (1)

Country Link
WO (1) WO2004090864A2 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100857112B1 (en) 2005-10-05 2008-09-05 엘지전자 주식회사 Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
EP1949061A4 (en) * 2005-10-05 2009-11-25 Lg Electronics Inc Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor
US8199827B2 (en) 2005-10-13 2012-06-12 Lg Electronics Inc. Method of processing a signal and apparatus for processing a signal
AU2006300102B2 (en) * 2005-10-13 2010-09-16 Lg Electronics Inc. Method and apparatus for signal processing
US8179977B2 (en) 2005-10-13 2012-05-15 Lg Electronics Inc. Method of apparatus for processing a signal
US7752053B2 (en) 2006-01-13 2010-07-06 Lg Electronics Inc. Audio signal processing using pilot based coding
BRPI0915450B1 (en) 2008-07-10 2020-03-10 Voiceage Corporation Device and method for inversely quantizing and quantizing lpc filters in a superframe
US8762136B2 (en) 2011-05-03 2014-06-24 Lsi Corporation System and method of speech compression using an inter frame parameter correlation
KR102034419B1 (en) * 2014-07-28 2019-10-18 텔레폰악티에볼라겟엘엠에릭슨(펍) Pyramid vector quantizer shape search
JP7167335B2 (en) * 2018-10-29 2022-11-08 ドルビー・インターナショナル・アーベー Method and Apparatus for Rate-Quality Scalable Coding Using Generative Models
CN113808601B (en) * 2021-11-19 2022-02-22 信瑞递(北京)科技有限公司 Method, device and electronic equipment for generating RDSS short message channel voice code
CN115035885A (en) * 2022-04-15 2022-09-09 科大讯飞股份有限公司 A kind of speech synthesis method, apparatus, equipment and storage medium
CN115050378B (en) * 2022-05-19 2024-06-07 腾讯科技(深圳)有限公司 Audio encoding and decoding method and related products

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001022403A1 (en) * 1999-09-22 2001-03-29 Microsoft Corporation Lpc-harmonic vocoder with superframe structure
WO2002025638A2 (en) * 2000-09-15 2002-03-28 Conexant Systems, Inc. Codebook structure and search for speech coding

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHAMBERLAIN M.W. ET AL.: "A 6000 bps MELP vocoder for use on HF channels", MILITARY COMMUNICATIONS CONFERENCE, vol. 1, 28 October 2001 (2001-10-28) - 31 October 2001 (2001-10-31), pages 447 - 453 *
MOUY B. ET AL.: "NATO STANAG 4479: a standard for an 800 bps vocoder and channel coding in HF-ECCM system", ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 9 May 1995 (1995-05-09) *
WANG T. ET AL.: "A 1200 BPS speech coder based on MELP", 5 June 2000 (2000-06-05) *

Also Published As

Publication number Publication date
WO2004090864B1 (en) 2005-05-19
WO2004090864A2 (en) 2004-10-21

Similar Documents

Publication Publication Date Title
CN1815558B (en) Low bit-rate coding of unvoiced segments of speech
WO2004090864A3 (en) Method and apparatus for the encoding and decoding of speech
US6470313B1 (en) Speech coding
AU7830300A (en) Lpc-harmonic vocoder with superframe structure
CA2179228A1 (en) Method and apparatus for reproducing speech signals and method for transmitting same
CA2388358A1 (en) A method and device for multi-rate lattice vector quantization
ATE368279T1 (en) METHOD AND APPARATUS FOR QUANTIZING THE GAIN FACTOR IN A VARIABLE BIT RATE WIDEBAND VOICE ENCODER
CN100527225C (en) A transcoding scheme between CELP-based speech codes
AU768744B2 (en) Method for quantizing speech coder parameters
CN1954366A (en) Method and apparatus for voice trans-rating in multi-rate voice coders for telecommunications
JPH0850500A (en) Voice encoder and voice decoder as well as voice coding method and voice encoding method
EP1597721B1 (en) 600 bps mixed excitation linear prediction transcoding
CN102855878B (en) Quantification method of pure and impure pitch parameters of narrow-band voice sub-band
CN112614495A (en) Software radio multi-system voice coder-decoder
CN103236262B (en) A transcoding method of code stream of speech coder
JP3537008B2 (en) Speech coding communication system and its transmission / reception device.
EP1204092A3 (en) Speech decoder capable of decoding background noise signal with high quality
CN102903365B (en) Method for refining parameter of narrow band vocoder on decoding end
CA2340160A1 (en) Speech coding with improved background noise reproduction
US5717819A (en) Methods and apparatus for encoding/decoding speech signals at low bit rates
EP1397655A1 (en) Method and device for coding speech in analysis-by-synthesis speech coders
US7584096B2 (en) Method and apparatus for encoding speech
US20040133422A1 (en) Speech compression method and apparatus
CN101887727B (en) Speech coding data conversion system and method from HELP coding to MELP coding
FR2869151B1 (en) METHOD OF QUANTIFYING A VERY LOW SPEECH ENCODER

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
B Later publication of amended claims

Effective date: 20041216

122 EP: PCT application non-entry in European phase