[go: up one dir, main page]

DE69726685T2 - Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung - Google Patents

Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung Download PDF

Info

Publication number
DE69726685T2
DE69726685T2 DE69726685T DE69726685T DE69726685T2 DE 69726685 T2 DE69726685 T2 DE 69726685T2 DE 69726685 T DE69726685 T DE 69726685T DE 69726685 T DE69726685 T DE 69726685T DE 69726685 T2 DE69726685 T2 DE 69726685T2
Authority
DE
Germany
Prior art keywords
speech
analysis
coding
speech coding
speech analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69726685T
Other languages
English (en)
Other versions
DE69726685D1 (de
Inventor
Masayuki Nishiguchi
Jun Matsumoto
Kazuyuki Iijima
Akira Inoue
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Application granted granted Critical
Publication of DE69726685D1 publication Critical patent/DE69726685D1/de
Publication of DE69726685T2 publication Critical patent/DE69726685T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
DE69726685T 1996-10-18 1997-10-17 Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung Expired - Lifetime DE69726685T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP27650196A JP4121578B2 (ja) 1996-10-18 1996-10-18 音声分析方法、音声符号化方法および装置

Publications (2)

Publication Number Publication Date
DE69726685D1 DE69726685D1 (de) 2004-01-22
DE69726685T2 true DE69726685T2 (de) 2004-10-07

Family

ID=17570349

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69726685T Expired - Lifetime DE69726685T2 (de) 1996-10-18 1997-10-17 Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung

Country Status (6)

Country Link
US (1) US6108621A (de)
EP (1) EP0837453B1 (de)
JP (1) JP4121578B2 (de)
KR (1) KR100496670B1 (de)
CN (1) CN1161751C (de)
DE (1) DE69726685T2 (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001500284A (ja) * 1997-07-11 2001-01-09 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 改良した調波音声符号器を備えた送信機
JP4641620B2 (ja) * 1998-05-11 2011-03-02 エヌエックスピー ビー ヴィ ピッチ検出の精密化
US6418407B1 (en) * 1999-09-30 2002-07-09 Motorola, Inc. Method and apparatus for pitch determination of a low bit rate digital voice message
JP3916834B2 (ja) * 2000-03-06 2007-05-23 独立行政法人科学技術振興機構 雑音が付加された周期波形の基本周期あるいは基本周波数の抽出方法
TW525146B (en) * 2000-09-22 2003-03-21 Matsushita Electric Industrial Co Ltd Method and apparatus for shifting pitch of acoustic signals
JP3997522B2 (ja) * 2000-12-14 2007-10-24 ソニー株式会社 符号化装置および方法、復号装置および方法、並びに記録媒体
JP4207568B2 (ja) 2000-12-14 2009-01-14 ソニー株式会社 情報抽出装置および方法、情報合成装置および方法、並びに記録媒体
KR100347188B1 (en) * 2001-08-08 2002-08-03 Amusetec Method and apparatus for judging pitch according to frequency analysis
KR100463417B1 (ko) * 2002-10-10 2004-12-23 한국전자통신연구원 상관함수의 최대값과 그의 후보값의 비를 이용한 피치검출 방법 및 그 장치
JP4381291B2 (ja) * 2004-12-08 2009-12-09 アルパイン株式会社 車載用オーディオ装置
KR20060067016A (ko) 2004-12-14 2006-06-19 엘지전자 주식회사 음성 부호화 장치 및 방법
KR100713366B1 (ko) * 2005-07-11 2007-05-04 삼성전자주식회사 모폴로지를 이용한 오디오 신호의 피치 정보 추출 방법 및그 장치
KR100827153B1 (ko) 2006-04-17 2008-05-02 삼성전자주식회사 음성 신호의 유성음화 비율 검출 장치 및 방법
JPWO2008001779A1 (ja) * 2006-06-27 2009-11-26 国立大学法人豊橋技術科学大学 基本周波数推定法および音響信号推定システム
JP4380669B2 (ja) * 2006-08-07 2009-12-09 カシオ計算機株式会社 音声符号化装置、音声復号装置、音声符号化方法、音声復号方法、及び、プログラム
US8620660B2 (en) * 2010-10-29 2013-12-31 The United States Of America, As Represented By The Secretary Of The Navy Very low bit rate signal coder and decoder
EP2795613B1 (de) 2011-12-21 2017-11-29 Huawei Technologies Co., Ltd. Erkennung und codierung von sehr kurzer längsneigung
CN103426441B (zh) * 2012-05-18 2016-03-02 华为技术有限公司 检测基音周期的正确性的方法和装置
EP2922053B1 (de) * 2012-11-15 2019-08-28 NTT Docomo, Inc. Audiocodierungsvorrichtung, audiocodierungsverfahren, audiocodierungsprogramm, audiodecodierungsvorrichtung, audiodecodierungsverfahren und audiodecodierungsprogramm
EP2980797A1 (de) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audiodecodierer, Verfahren und Computerprogramm mit Zero-Input-Response zur Erzeugung eines sanften Übergangs
EP2980799A1 (de) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zur Verarbeitung eines Audiosignals mit Verwendung einer harmonischen Nachfilterung
JP6759927B2 (ja) * 2016-09-23 2020-09-23 富士通株式会社 発話評価装置、発話評価方法、および発話評価プログラム
JP2022055464A (ja) * 2020-09-29 2022-04-08 Kddi株式会社 音声分析装置、方法及びプログラム
KR102608344B1 (ko) * 2021-02-04 2023-11-29 주식회사 퀀텀에이아이 실시간 End-to-End 방식의 음성 인식 및 음성DNA 생성 시스템
US11545143B2 (en) * 2021-05-18 2023-01-03 Boris Fridman-Mintz Recognition or synthesis of human-uttered harmonic sounds
KR102581221B1 (ko) * 2023-05-10 2023-09-21 주식회사 솔트룩스 재생 중인 응답 발화를 제어 및 사용자 의도를 예측하는 방법, 장치 및 컴퓨터-판독 가능 기록 매체

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3681530A (en) * 1970-06-15 1972-08-01 Gte Sylvania Inc Method and apparatus for signal bandwidth compression utilizing the fourier transform of the logarithm of the frequency spectrum magnitude
US4214125A (en) * 1977-01-21 1980-07-22 Forrest S. Mozer Method and apparatus for speech synthesizing
JPS5921039B2 (ja) * 1981-11-04 1984-05-17 日本電信電話株式会社 適応予測符号化方式
EP0163829B1 (de) * 1984-03-21 1989-08-23 Nippon Telegraph And Telephone Corporation Sprachsignaleverarbeitungssystem
CA1252568A (en) * 1984-12-24 1989-04-11 Kazunori Ozawa Low bit-rate pattern encoding and decoding capable of reducing an information transmission rate
US5115240A (en) * 1989-09-26 1992-05-19 Sony Corporation Method and apparatus for encoding voice signals divided into a plurality of frequency bands
US5127053A (en) * 1990-12-24 1992-06-30 General Electric Company Low-complexity method for improving the performance of autocorrelation-based pitch detectors
JP3277398B2 (ja) * 1992-04-15 2002-04-22 ソニー株式会社 有声音判別方法
CA2105269C (en) * 1992-10-09 1998-08-25 Yair Shoham Time-frequency interpolation with application to low rate speech coding
JP3343965B2 (ja) * 1992-10-31 2002-11-11 ソニー株式会社 音声符号化方法及び復号化方法
JP3137805B2 (ja) * 1993-05-21 2001-02-26 三菱電機株式会社 音声符号化装置、音声復号化装置、音声後処理装置及びこれらの方法
JP3475446B2 (ja) * 1993-07-27 2003-12-08 ソニー株式会社 符号化方法
US5715365A (en) * 1994-04-04 1998-02-03 Digital Voice Systems, Inc. Estimation of excitation parameters
JP3277692B2 (ja) * 1994-06-13 2002-04-22 ソニー株式会社 情報符号化方法、情報復号化方法及び情報記録媒体
JP3557662B2 (ja) * 1994-08-30 2004-08-25 ソニー株式会社 音声符号化方法及び音声復号化方法、並びに音声符号化装置及び音声復号化装置
US5717819A (en) * 1995-04-28 1998-02-10 Motorola, Inc. Methods and apparatus for encoding/decoding speech signals at low bit rates
JPH0990974A (ja) * 1995-09-25 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> 信号処理方法
JP4132109B2 (ja) * 1995-10-26 2008-08-13 ソニー株式会社 音声信号の再生方法及び装置、並びに音声復号化方法及び装置、並びに音声合成方法及び装置
JP3653826B2 (ja) * 1995-10-26 2005-06-02 ソニー株式会社 音声復号化方法及び装置

Also Published As

Publication number Publication date
CN1187665A (zh) 1998-07-15
KR19980032825A (ko) 1998-07-25
EP0837453A2 (de) 1998-04-22
JPH10124094A (ja) 1998-05-15
KR100496670B1 (ko) 2006-01-12
US6108621A (en) 2000-08-22
JP4121578B2 (ja) 2008-07-23
CN1161751C (zh) 2004-08-11
DE69726685D1 (de) 2004-01-22
EP0837453B1 (de) 2003-12-10
EP0837453A3 (de) 1998-12-30

Similar Documents

Publication Publication Date Title
DE69727895D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69726685T2 (de) Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung
DE69631728D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69717899D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69625875D1 (de) Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung
DE69715478D1 (de) Verfahren und Vorrichtung zur CELP Sprachkodierung und -dekodierung
DE69726235D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69518705D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE59707384D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69828141D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69923253D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69806557D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69328450D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69726525D1 (de) Verfahren und Vorrichtung zur Vektorquantisierung und zur Sprachkodierung
DE69831991D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69830017D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69625950D1 (de) Verfahren und Vorrichtung zur Spracherkennung und Übersetzungssystem
DE69931257D1 (de) Verfahren und vorrichtung zur spektralanalyse und kodierer
DE69430082D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69632901D1 (de) Vorrichtung und Verfahren zur Sprachsynthese
DE69731937D1 (de) Verfahren und vorrichtung zur datencodierung
DE69519820D1 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69710525D1 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69431445D1 (de) Verfahren und Vorrichtung zur Sprachkodierung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition