
WO2001048737A3 - Speech recognizer with a lexical tree based n-gram language model - Google Patents


Info

Publication number
WO2001048737A3
Authority
WO
WIPO (PCT)
Prior art keywords
probabilities
lexical tree
estimated probabilities
stored
phonemes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN1999/000217
Other languages
French (fr)
Other versions
WO2001048737A2 (en)
Inventor
Zhiwei Lin
Yonghong Yan
Qingwei Zhao
Baosheng Yuan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp
Priority to AU17676/00A (AU1767600A)
Priority to PCT/CN1999/000217 (WO2001048737A2)
Priority to CN99817058.5A (CN1201286C)
Publication of WO2001048737A2
Anticipated expiration
Publication of WO2001048737A3
Ceased

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197 Probabilistic grammars, e.g. word n-grams
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

In some embodiments, the invention includes a method comprising creating a lexical tree and identifying beginning phonemes in the lexical tree. The method of these embodiments further includes estimating probabilities of words in the lexical tree having particular ones of the beginning phonemes and storing at least some of the estimated probabilities, wherein backoff weights are not stored with the estimated probabilities. The estimated probabilities may be stored in a lookup table. In other embodiments, the invention includes a method of receiving phonemes and identifying them on a lexical tree. The method of these embodiments also includes estimating probabilities of words that include the phonemes through use of estimated probabilities retrieved from storage, wherein the retrieved probabilities do not include backoff weights stored with the estimated probabilities. Again, the estimated probabilities may be stored in a lookup table. The estimated probabilities may be used in establishing a pruning threshold. The methods may be implemented by instructions on a computer readable medium.
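
As a rough illustration of the first method in the abstract, the sketch below builds a small lexical tree, estimates one probability per beginning phoneme, stores those estimates in a lookup table with no backoff weights, and derives a pruning threshold from them. The abstract does not say how the estimate is computed or how the threshold is set. Taking the highest word probability under each beginning phoneme, and pruning relative to the best estimate with a `beam` factor, are assumptions made here. The lexicon, the probabilities, and every function name are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One phoneme node in the lexical (pronunciation prefix) tree."""
    children: dict = field(default_factory=dict)  # phoneme -> Node
    words: list = field(default_factory=list)     # words ending at this node


def build_lexical_tree(lexicon):
    """Insert every pronunciation so words with shared prefixes share nodes."""
    root = Node()
    for word, phonemes in lexicon.items():
        node = root
        for ph in phonemes:
            node = node.children.setdefault(ph, Node())
        node.words.append(word)
    return root


def words_below(node):
    """Yield every word reachable from this node."""
    yield from node.words
    for child in node.children.values():
        yield from words_below(child)


def build_lookup_table(root, word_prob):
    """Map each beginning phoneme to one estimated probability.

    Assumption: the estimate is the highest probability among the words
    that start with the phoneme. Only the probability is stored; no
    backoff weight is kept alongside it.
    """
    return {
        ph: max(word_prob[w] for w in words_below(child))
        for ph, child in root.children.items()
    }


def surviving_phonemes(table, beam):
    """Keep beginning phonemes whose estimate is within `beam` of the best."""
    threshold = max(table.values()) * beam  # assumed pruning rule
    return sorted(ph for ph, p in table.items() if p >= threshold)


# Toy data, invented for illustration only.
LEXICON = {
    "cat": ["k", "ae", "t"],
    "cab": ["k", "ae", "b"],
    "dog": ["d", "ao", "g"],
    "zoo": ["z", "uw"],
}
WORD_PROB = {"cat": 0.4, "cab": 0.1, "dog": 0.45, "zoo": 0.05}

tree = build_lexical_tree(LEXICON)
table = build_lookup_table(tree, WORD_PROB)
kept = surviving_phonemes(table, beam=0.5)
```

With this toy data the table holds one entry per beginning phoneme (`k`, `d`, `z`), and the low-probability `z` branch falls below the threshold. A real recognizer would draw the word probabilities from an n-gram language model conditioned on the word history, which this sketch does not attempt.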

Priority Applications (3)

Application Number Priority Date Filing Date Title
AU17676/00A AU1767600A (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model
PCT/CN1999/000217 WO2001048737A2 (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model
CN99817058.5A CN1201286C (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based N-gram language model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN1999/000217 WO2001048737A2 (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model

Publications (2)

Publication Number Publication Date
WO2001048737A2 WO2001048737A2 (en) 2001-07-05
WO2001048737A3 WO2001048737A3 (en) 2002-11-14

Family

ID=4575158

Family Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN1999/000217 Ceased WO2001048737A2 (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model

Country Status (3)

Country Link
CN (1) CN1201286C (en)
AU (1) AU1767600A (en)
WO (1) WO2001048737A2 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0420464D0 (en) 2004-09-14 2004-10-20 Zentian Ltd A speech recognition circuit and method
CN101271450B (en) * 2007-03-19 2010-09-29 株式会社东芝 Method and device for tailoring language model
GB2453366B (en) * 2007-10-04 2011-04-06 Toshiba Res Europ Ltd Automatic speech recognition method and apparatus
WO2010105427A1 (en) 2009-03-19 2010-09-23 Google Inc. Input method editor
KR101524740B1 (en) * 2009-03-19 2015-06-01 구글 인코포레이티드 Input method editor
US8655647B2 (en) 2010-03-11 2014-02-18 Microsoft Corporation N-gram selection for practical-sized language models
US8589164B1 (en) * 2012-10-18 2013-11-19 Google Inc. Methods and systems for speech recognition processing using search query information
CN111128172B (en) * 2019-12-31 2022-12-16 达闼机器人股份有限公司 Voice recognition method, electronic equipment and storage medium

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0473694A (en) * 1990-07-13 1992-03-09 Nippon Telegr & Teleph Corp <Ntt> Japanese language speech recognizing method
EP0533260A2 (en) * 1991-09-14 1993-03-24 Philips Patentverwaltung GmbH Method and apparatus for recognizing the uttered words in a speech signal
US5502791A (en) * 1992-09-29 1996-03-26 International Business Machines Corporation Speech recognition by concatenating fenonic allophone hidden Markov models in parallel among subwords
JPH08123479A (en) * 1994-10-26 1996-05-17 Atr Onsei Honyaku Tsushin Kenkyusho:Kk Continuous speech recognition device
JPH08221091A (en) * 1995-02-17 1996-08-30 Matsushita Electric Ind Co Ltd Voice recognition device
WO1996027872A1 (en) * 1995-03-07 1996-09-12 British Telecommunications Public Limited Company Speech recognition
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
EP0825586A2 (en) * 1996-08-22 1998-02-25 Dragon Systems Inc. Lexical tree pre-filtering in speech recognition
US5758024A (en) * 1996-06-25 1998-05-26 Microsoft Corporation Method and system for encoding pronunciation prefix trees
US5832428A (en) * 1995-10-04 1998-11-03 Apple Computer, Inc. Search engine for phrase recognition based on prefix/body/suffix architecture
CN1233803A (en) * 1998-04-29 1999-11-03 松下电器产业株式会社 Method and apparatus for generating and scoring pronunciation of spelled words using decision trees
WO1999059141A1 (en) * 1998-05-11 1999-11-18 Siemens Aktiengesellschaft Method and array for introducing temporal correlation in hidden markov models for speech recognition
JPH11344991A (en) * 1998-05-30 1999-12-14 Brother Ind Ltd Voice recognition device and storage medium

Also Published As

Publication number Publication date
CN1201286C (en) 2005-05-11
CN1406374A (en) 2003-03-26
AU1767600A (en) 2001-07-09
WO2001048737A2 (en) 2001-07-05

Similar Documents

Publication Publication Date Title
AU2001274936A1 (en) Creating a unified task dependent language models with information retrieval techniques
WO2004003688A3 (en) A method for comparing a transcribed text file with a previously created file
EP1128361A3 (en) Language models for speech recognition
EP2453436A3 (en) Automatic language model update
EP1220197A3 (en) Speech recognition method and system
CA2508946A1 (en) Method and apparatus for natural language call routing using confidence scores
CA2493640A1 (en) Improvements in or relating to information provision for call centres
EP1265162A3 (en) System and method of storing digital tree data structure
EP1538535A3 (en) Determination of meaning for text input in natural language understanding systems
EP2416262A3 (en) Information retrieval based on historical data
EP1083545A3 (en) Voice recognition of proper names in a navigation apparatus
WO2007035186A3 (en) A method and system for the automatic recognition of deceptive language
CA2488814A1 (en) System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages
ATE386318T1 (en) IMPROVING THE TRANSCRIPTION ACCURACY OF VOICE RECOGNITION SOFTWARE
EP1653444A3 (en) System and method for converting text to speech
IT1279171B1 (en) CONTINUOUS SPEECH RECOGNITION SYSTEM
WO2001048737A3 (en) Speech recognizer with a lexical tree based n-gram language model
US20010023398A1 (en) Pattern matching method and apparatus
WO2001084357A3 (en) Cluster and pruning-based language model compression
CN108304561A (en) A kind of semantic understanding method, equipment and robot based on finite data
EP0949606A3 (en) Method and system for speech recognition based on phonetic transcriptions
Nocera et al. Phoneme lattice based A* search algorithm for speech recognition
EP0984354A3 (en) Method for creating dictation macros
Wang et al. A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues
EP1321862A3 (en) Hash function based transcription database

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 998170585

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 09979628

Country of ref document: US

AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase