[go: up one dir, main page]

WO2001035249A3 - Language input architecture for converting one text form to another text form with modeless entry - Google Patents

Language input architecture for converting one text form to another text form with modeless entry Download PDF

Info

Publication number
WO2001035249A3
WO2001035249A3 PCT/US2000/028418 US0028418W WO0135249A3 WO 2001035249 A3 WO2001035249 A3 WO 2001035249A3 US 0028418 W US0028418 W US 0028418W WO 0135249 A3 WO0135249 A3 WO 0135249A3
Authority
WO
WIPO (PCT)
Prior art keywords
typing
language
string
conversion
input
Prior art date
Application number
PCT/US2000/028418
Other languages
French (fr)
Other versions
WO2001035249A2 (en
Inventor
Kai-Fu Lee
Zheng Chen
Jian Han
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Priority to JP2001536715A priority Critical patent/JP4833476B2/en
Priority to HK03102615.8A priority patent/HK1050578B/en
Priority to AU80209/00A priority patent/AU8020900A/en
Publication of WO2001035249A2 publication Critical patent/WO2001035249A2/en
Publication of WO2001035249A3 publication Critical patent/WO2001035249A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/232Orthographic correction, e.g. spell checking or vowelisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

A language input architecture converts input strings of phonetic text (e.g., Chinese Pinyin) to an output string of language text (e.g., Chinese Hanzi) in a manner that minimizes typographical errors and conversion errors that occur during conversion from the phonetic text to the language text. The language input architecture has a search engine, one or more typing models, a language model, and one or more lexicons for different languages. Each typing model is trained on real data, and learns probabilities of typing errors. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string. The probable typing candidates may be stored in a database. The language model provides probable conversion strings for each of the typing candidates based on probabilities of how likely a probable conversion output string represents the candidate string. The search engine combines the probabilities of the typing and language models to find the most probable conversion string that represents a converted form of the input string. By generating typing candidates and then using the associated conversion strings to replace the input string, the architecture eliminates many common typographical errors. When multiple typing models are employed, the architecture can automatically distinguish among multiple languages without requiring mode switching for entry of the different languages.
PCT/US2000/028418 1999-11-05 2000-10-13 Language input architecture for converting one text form to another text form with modeless entry WO2001035249A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2001536715A JP4833476B2 (en) 1999-11-05 2000-10-13 Language input architecture that converts one text format to the other text format with modeless input
HK03102615.8A HK1050578B (en) 1999-11-05 2000-10-13 Language input architecture for converting one text form to another text form with modeless entry
AU80209/00A AU8020900A (en) 1999-11-05 2000-11-13 Language input architecture for converting one text form to another text form with modeless entry

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US16390299P 1999-11-05 1999-11-05
US60/163,902 1999-11-05
US09/606,807 US7165019B1 (en) 1999-11-05 2000-06-28 Language input architecture for converting one text form to another text form with modeless entry
US09/606,807 2000-06-28

Publications (2)

Publication Number Publication Date
WO2001035249A2 WO2001035249A2 (en) 2001-05-17
WO2001035249A3 true WO2001035249A3 (en) 2001-12-20

Family

ID=26860055

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/028418 WO2001035249A2 (en) 1999-11-05 2000-10-13 Language input architecture for converting one text form to another text form with modeless entry

Country Status (5)

Country Link
US (1) US7165019B1 (en)
JP (1) JP4833476B2 (en)
CN (1) CN100492350C (en)
AU (1) AU8020900A (en)
WO (1) WO2001035249A2 (en)

Families Citing this family (100)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7403888B1 (en) * 1999-11-05 2008-07-22 Microsoft Corporation Language input user interface
JP4543294B2 (en) * 2000-03-14 2010-09-15 ソニー株式会社 Voice recognition apparatus, voice recognition method, and recording medium
US8706747B2 (en) * 2000-07-06 2014-04-22 Google Inc. Systems and methods for searching using queries written in a different character-set and/or language from the target pages
US7277732B2 (en) * 2000-10-13 2007-10-02 Microsoft Corporation Language input system for mobile devices
KR100400694B1 (en) * 2001-07-03 2003-10-08 황재엽 The Chinese code generator for mobile phone
US7734565B2 (en) * 2003-01-18 2010-06-08 Yahoo! Inc. Query string matching method and apparatus
US20050010392A1 (en) * 2003-07-10 2005-01-13 International Business Machines Corporation Traditional Chinese / simplified Chinese character translator
US20050010391A1 (en) * 2003-07-10 2005-01-13 International Business Machines Corporation Chinese character / Pin Yin / English translator
US20050027547A1 (en) * 2003-07-31 2005-02-03 International Business Machines Corporation Chinese / Pin Yin / english dictionary
US8137105B2 (en) 2003-07-31 2012-03-20 International Business Machines Corporation Chinese/English vocabulary learning tool
US8543378B1 (en) * 2003-11-05 2013-09-24 W.W. Grainger, Inc. System and method for discerning a term for an entry having a spelling error
US20050125218A1 (en) * 2003-12-04 2005-06-09 Nitendra Rajput Language modelling for mixed language expressions
US8200475B2 (en) * 2004-02-13 2012-06-12 Microsoft Corporation Phonetic-based text input method
US7478033B2 (en) * 2004-03-16 2009-01-13 Google Inc. Systems and methods for translating Chinese pinyin to Chinese characters
GB0406451D0 (en) * 2004-03-23 2004-04-28 Patel Sanjay Keyboards
US7091885B2 (en) * 2004-06-02 2006-08-15 2012244 Ontario Inc. Handheld electronic device with text disambiguation
US7312726B2 (en) 2004-06-02 2007-12-25 Research In Motion Limited Handheld electronic device with text disambiguation
US7711542B2 (en) 2004-08-31 2010-05-04 Research In Motion Limited System and method for multilanguage text input in a handheld electronic device
US7676357B2 (en) * 2005-02-17 2010-03-09 International Business Machines Corporation Enhanced Chinese character/Pin Yin/English translator
GB0505942D0 (en) 2005-03-23 2005-04-27 Patel Sanjay Human to mobile interfaces
GB0505941D0 (en) 2005-03-23 2005-04-27 Patel Sanjay Human-to-mobile interfaces
US9471566B1 (en) * 2005-04-14 2016-10-18 Oracle America, Inc. Method and apparatus for converting phonetic language input to written language output
US7516062B2 (en) * 2005-04-19 2009-04-07 International Business Machines Corporation Language converter with enhanced search capability
US7506254B2 (en) * 2005-04-21 2009-03-17 Google Inc. Predictive conversion of user input
US8904282B2 (en) * 2005-04-21 2014-12-02 Motorola Mobility Llc Electronic device having capability for interpreting user inputs and method therefor
US7620540B2 (en) * 2005-04-29 2009-11-17 Research In Motion Limited Method for generating text in a handheld electronic device and a handheld electronic device incorporating the same
US20060293890A1 (en) * 2005-06-28 2006-12-28 Avaya Technology Corp. Speech recognition assisted autocompletion of composite characters
US8249873B2 (en) * 2005-08-12 2012-08-21 Avaya Inc. Tonal correction of speech
US7562811B2 (en) 2007-01-18 2009-07-21 Varcode Ltd. System and method for improved quality management in a product logistic chain
JP2009537038A (en) 2006-05-07 2009-10-22 バーコード リミティド System and method for improving quality control in a product logistic chain
CN101131690B (en) * 2006-08-21 2012-07-25 富士施乐株式会社 Method and system for mutual conversion between simplified Chinese characters and traditional Chinese characters
US8626486B2 (en) * 2006-09-05 2014-01-07 Google Inc. Automatic spelling correction for machine translation
US20080221866A1 (en) * 2007-03-06 2008-09-11 Lalitesh Katragadda Machine Learning For Transliteration
CN101271450B (en) * 2007-03-19 2010-09-29 株式会社东芝 Method and device for cutting language model
CN105117376B (en) * 2007-04-10 2018-07-10 谷歌有限责任公司 Multi-mode input method editor
CN104866469B (en) * 2007-04-11 2018-10-02 谷歌有限责任公司 Input Method Editor with secondary language mode
EP2156369B1 (en) 2007-05-06 2015-09-02 Varcode Ltd. A system and method for quality management utilizing barcode indicators
EG25474A (en) * 2007-05-21 2012-01-11 Sherikat Link Letatweer Elbarmaguey At Sae Method for translitering and suggesting arabic replacement for a given user input
CN101802812B (en) 2007-08-01 2015-07-01 金格软件有限公司 Automatic context-sensitive language correction and enhancement using Internet corpora
US8540156B2 (en) 2007-11-14 2013-09-24 Varcode Ltd. System and method for quality management utilizing barcode indicators
US20090300126A1 (en) * 2008-05-30 2009-12-03 International Business Machines Corporation Message Handling
US11704526B2 (en) 2008-06-10 2023-07-18 Varcode Ltd. Barcoded indicators for quality management
JP2010176543A (en) 2009-01-30 2010-08-12 Toshiba Corp Translation device, method and program
CN102439544A (en) * 2009-03-20 2012-05-02 谷歌股份有限公司 Interaction with ime computing device
JP5343744B2 (en) * 2009-07-24 2013-11-13 富士通株式会社 Speech translation apparatus and speech translation method
WO2011050494A1 (en) * 2009-10-29 2011-05-05 Google Inc. Generating input suggestions
US9015036B2 (en) 2010-02-01 2015-04-21 Ginger Software, Inc. Automatic context sensitive language correction using an internet corpus particularly for small keyboard devices
JP2013520878A (en) * 2010-02-18 2013-06-06 スレイマン アルカジ, Configurable multilingual keyboard
US8972930B2 (en) * 2010-06-04 2015-03-03 Microsoft Corporation Generating text manipulation programs using input-output examples
US9613115B2 (en) 2010-07-12 2017-04-04 Microsoft Technology Licensing, Llc Generating programs based on input-output examples using converter modules
US9262397B2 (en) * 2010-10-08 2016-02-16 Microsoft Technology Licensing, Llc General purpose correction of grammatical and word usage errors
WO2013007210A1 (en) * 2011-07-14 2013-01-17 腾讯科技(深圳)有限公司 Character input method, device and system
US8855997B2 (en) 2011-07-28 2014-10-07 Microsoft Corporation Linguistic error detection
US9043198B1 (en) * 2012-04-13 2015-05-26 Google Inc. Text suggestion
US8983211B2 (en) * 2012-05-14 2015-03-17 Xerox Corporation Method for processing optical character recognizer output
US9552335B2 (en) 2012-06-04 2017-01-24 Microsoft Technology Licensing, Llc Expedited techniques for generating string manipulation programs
CN103631802B (en) * 2012-08-24 2015-05-20 腾讯科技(深圳)有限公司 Song information searching method, device and corresponding server
US8807422B2 (en) 2012-10-22 2014-08-19 Varcode Ltd. Tamper-proof quality management barcode indicators
US9600473B2 (en) 2013-02-08 2017-03-21 Machine Zone, Inc. Systems and methods for multi-user multi-lingual communications
US9031829B2 (en) * 2013-02-08 2015-05-12 Machine Zone, Inc. Systems and methods for multi-user multi-lingual communications
US9298703B2 (en) 2013-02-08 2016-03-29 Machine Zone, Inc. Systems and methods for incentivizing user feedback for translation processing
US8990068B2 (en) 2013-02-08 2015-03-24 Machine Zone, Inc. Systems and methods for multi-user multi-lingual communications
US10650103B2 (en) 2013-02-08 2020-05-12 Mz Ip Holdings, Llc Systems and methods for incentivizing user feedback for translation processing
US9231898B2 (en) 2013-02-08 2016-01-05 Machine Zone, Inc. Systems and methods for multi-user multi-lingual communications
US8996352B2 (en) 2013-02-08 2015-03-31 Machine Zone, Inc. Systems and methods for correcting translations in multi-user multi-lingual communications
JP6155821B2 (en) 2013-05-08 2017-07-05 ソニー株式会社 Information processing apparatus, information processing method, and program
CN104298672B (en) * 2013-07-16 2018-09-11 北京搜狗科技发展有限公司 A kind of error correction method and device of input
JP2015022590A (en) * 2013-07-19 2015-02-02 株式会社東芝 Character input apparatus, character input method, and character input program
CN105814556B (en) * 2013-09-26 2019-09-13 谷歌有限责任公司 The input tool of context-sensitive
US20150100537A1 (en) * 2013-10-03 2015-04-09 Microsoft Corporation Emoji for Text Predictions
CN103578464B (en) * 2013-10-18 2017-01-11 威盛电子股份有限公司 Language model building method, speech recognition method and electronic device
CN104750672B (en) * 2013-12-27 2017-11-21 重庆新媒农信科技有限公司 A kind of Chinese vocabulary error correction method and its device being applied in search
CN104808806B (en) * 2014-01-28 2019-10-25 北京三星通信技术研究有限公司 Method and device for realizing Chinese character input according to uncertainty information
CN103885608A (en) 2014-03-19 2014-06-25 百度在线网络技术(北京)有限公司 Input method and system
US9524293B2 (en) * 2014-08-15 2016-12-20 Google Inc. Techniques for automatically swapping languages and/or content for machine translation
US9372848B2 (en) 2014-10-17 2016-06-21 Machine Zone, Inc. Systems and methods for language detection
US10162811B2 (en) 2014-10-17 2018-12-25 Mz Ip Holdings, Llc Systems and methods for language detection
TWI590080B (en) * 2014-11-26 2017-07-01 納寶股份有限公司 Content participation translation apparatus and method
US10229674B2 (en) * 2015-05-15 2019-03-12 Microsoft Technology Licensing, Llc Cross-language speech recognition and translation
US11060924B2 (en) 2015-05-18 2021-07-13 Varcode Ltd. Thermochromic ink indicia for activatable quality labels
CN104991656B (en) * 2015-06-11 2018-12-21 浦江开丰广告设计有限公司 A method of input Chinese phrase
JP6898298B2 (en) 2015-07-07 2021-07-07 バーコード リミティド Electronic quality display index
US10765956B2 (en) 2016-01-07 2020-09-08 Machine Zone Inc. Named entity recognition on chat data
CN106297797B (en) * 2016-07-26 2019-05-31 百度在线网络技术(北京)有限公司 Method for correcting error of voice identification result and device
CN107798386B (en) * 2016-09-01 2022-02-15 微软技术许可有限责任公司 Multi-process collaborative training based on unlabeled data
US10346548B1 (en) * 2016-09-26 2019-07-09 Lilt, Inc. Apparatus and method for prefix-constrained decoding in a neural machine translation system
US11256710B2 (en) 2016-10-20 2022-02-22 Microsoft Technology Licensing, Llc String transformation sub-program suggestion
US11620304B2 (en) 2016-10-20 2023-04-04 Microsoft Technology Licensing, Llc Example management for string transformation
US10846298B2 (en) 2016-10-28 2020-11-24 Microsoft Technology Licensing, Llc Record profiling for dataset sampling
US10275646B2 (en) 2017-08-03 2019-04-30 Gyrfalcon Technology Inc. Motion recognition via a two-dimensional symbol having multiple ideograms contained therein
WO2019060353A1 (en) 2017-09-21 2019-03-28 Mz Ip Holdings, Llc System and method for translating chat messages
US10671353B2 (en) 2018-01-31 2020-06-02 Microsoft Technology Licensing, Llc Programming-by-example using disjunctive programs
CN111198936B (en) * 2018-11-20 2023-09-15 北京嘀嘀无限科技发展有限公司 Voice search method and device, electronic equipment and storage medium
CN111859948B (en) * 2019-04-28 2024-06-11 北京嘀嘀无限科技发展有限公司 Language identification, language model training and character prediction method and device
CN112287100B (en) * 2019-07-12 2024-11-29 阿里巴巴集团控股有限公司 Text recognition method, spelling correction method and voice recognition method
CN110415679B (en) * 2019-07-25 2021-12-17 北京百度网讯科技有限公司 Voice error correction method, device, equipment and storage medium
CN112988955B (en) * 2019-12-02 2024-03-15 卢文祥 Multilingual voice recognition and topic semantic analysis method and device
CN112560855B (en) * 2020-12-18 2022-10-14 平安银行股份有限公司 Image information extraction method and device, electronic equipment and storage medium
CN112735396B (en) * 2021-02-05 2024-10-15 北京小米松果电子有限公司 Speech recognition error correction method, device and storage medium
US12086542B2 (en) * 2021-04-06 2024-09-10 Talent Unlimited Online Services Private Limited System and method for generating contextualized text using a character-based convolutional neural network architecture

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5319552A (en) * 1991-10-14 1994-06-07 Omron Corporation Apparatus and method for selectively converting a phonetic transcription of Chinese into a Chinese character from a plurality of notations
US5535119A (en) * 1992-06-11 1996-07-09 Hitachi, Ltd. Character inputting method allowing input of a plurality of different types of character species, and information processing equipment adopting the same

Family Cites Families (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3435124A (en) * 1966-02-07 1969-03-25 William H Channell Pedestal and underground terminals for buried cable systems
GB2158776A (en) 1984-02-24 1985-11-20 Chang Chi Chen Method of computerised input of Chinese words in keyboards
US5175803A (en) * 1985-06-14 1992-12-29 Yeh Victor C Method and apparatus for data processing and word processing in Chinese using a phonetic Chinese language
US4833610A (en) 1986-12-16 1989-05-23 International Business Machines Corporation Morphological/phonetic method for ranking word similarities
US5218536A (en) 1988-05-25 1993-06-08 Franklin Electronic Publishers, Incorporated Electronic spelling machine having ordered candidate words
JPH02140868A (en) 1988-11-22 1990-05-30 Toshiba Corp Machine translating system
US5258909A (en) 1989-08-31 1993-11-02 International Business Machines Corporation Method and apparatus for "wrong word" spelling error detection and correction
US5278943A (en) 1990-03-23 1994-01-11 Bright Star Technology, Inc. Speech animation and inflection system
US5572423A (en) 1990-06-14 1996-11-05 Lucent Technologies Inc. Method for correcting spelling using error frequencies
JPH0475162A (en) * 1990-07-18 1992-03-10 Toshiba Corp Japanese syllabary/chinese character conversion device
US5270927A (en) * 1990-09-10 1993-12-14 At&T Bell Laboratories Method for conversion of phonetic Chinese to character Chinese
US5267345A (en) 1992-02-10 1993-11-30 International Business Machines Corporation Speech recognition apparatus which predicts word classes from context and words from word classes
JPH05282360A (en) * 1992-03-31 1993-10-29 Hitachi Ltd Multilingual input device
US5675815A (en) 1992-11-09 1997-10-07 Ricoh Company, Ltd. Language conversion system and text creating system using such
US5671426A (en) 1993-06-22 1997-09-23 Kurzweil Applied Intelligence, Inc. Method for organizing incremental search dictionary
DE4323241A1 (en) 1993-07-12 1995-02-02 Ibm Method and computer system for finding incorrect character strings in a text
WO1995017729A1 (en) 1993-12-22 1995-06-29 Taligent, Inc. Input methods framework
US5704007A (en) 1994-03-11 1997-12-30 Apple Computer, Inc. Utilization of multiple voice sources in a speech synthesizer
US5930755A (en) 1994-03-11 1999-07-27 Apple Computer, Inc. Utilization of a recorded sound sample as a voice source in a speech synthesizer
US6154758A (en) 1994-05-13 2000-11-28 Apple Computer, Inc. Text conversion method for computer systems
US5510998A (en) * 1994-06-13 1996-04-23 Cadence Design Systems, Inc. System and method for generating component models
JP2773652B2 (en) 1994-08-04 1998-07-09 日本電気株式会社 Machine translation equipment
AU3734395A (en) 1994-10-03 1996-04-26 Helfgott & Karas, P.C. A database accessing system
SG42314A1 (en) * 1995-01-30 1997-08-15 Mitsubishi Electric Corp Language processing apparatus and method
CA2170669A1 (en) * 1995-03-24 1996-09-25 Fernando Carlos Neves Pereira Grapheme-to phoneme conversion with weighted finite-state transducers
US5893133A (en) 1995-08-16 1999-04-06 International Business Machines Corporation Keyboard for a system and method for processing Chinese language text
US5806021A (en) 1995-10-30 1998-09-08 International Business Machines Corporation Automatic segmentation of continuous text using statistical approaches
JPH09259126A (en) 1996-03-21 1997-10-03 Sharp Corp Data processing device
US5933525A (en) 1996-04-10 1999-08-03 Bbn Corporation Language-independent and segmentation-free optical character recognition system and method
DE69711761T2 (en) 1996-05-29 2002-08-14 Matsushita Electric Industrial Co., Ltd. Arrangement for document conversion
US5907705A (en) 1996-10-31 1999-05-25 Sun Microsystems, Inc. Computer implemented request to integrate (RTI) system for managing change control in software release stream
JP2806452B2 (en) * 1996-12-19 1998-09-30 オムロン株式会社 Kana-kanji conversion device and method, and recording medium
CN1193779A (en) 1997-03-13 1998-09-23 国际商业机器公司 Chinese Sentence Segmentation Method and Its Application in Chinese Error Checking System
CN1161701C (en) * 1997-03-14 2004-08-11 欧姆龙株式会社 Speech recognition device, method and recording medium for storing program of the speech recognition device
US6047300A (en) 1997-05-15 2000-04-04 Microsoft Corporation System and method for automatically correcting a misspelled word
JP3548747B2 (en) * 1997-06-17 2004-07-28 オムロン株式会社 Recording medium and character input device
CA2242065C (en) * 1997-07-03 2004-12-14 Henry C.A. Hyde-Thomson Unified messaging system with automatic language identification for text-to-speech conversion
US5974413A (en) 1997-07-03 1999-10-26 Activeword Systems, Inc. Semantic user interface
US6131102A (en) 1998-06-15 2000-10-10 Microsoft Corporation Method and system for cost computation of spelling suggestions and automatic replacement
US6490563B2 (en) 1998-08-17 2002-12-03 Microsoft Corporation Proofreading with text to speech feedback
US6356866B1 (en) 1998-10-07 2002-03-12 Microsoft Corporation Method for converting a phonetic character string into the text of an Asian language
US6148285A (en) 1998-10-30 2000-11-14 Nortel Networks Corporation Allophonic text-to-speech generator
CN1143232C (en) 1998-11-30 2004-03-24 皇家菲利浦电子有限公司 Automatic segmentation of text
US6573844B1 (en) 2000-01-18 2003-06-03 Microsoft Corporation Predictive keyboard
US6646572B1 (en) 2000-02-18 2003-11-11 Mitsubish Electric Research Laboratories, Inc. Method for designing optimal single pointer predictive keyboards and apparatus therefore

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5319552A (en) * 1991-10-14 1994-06-07 Omron Corporation Apparatus and method for selectively converting a phonetic transcription of Chinese into a Chinese character from a plurality of notations
US5535119A (en) * 1992-06-11 1996-07-09 Hitachi, Ltd. Character inputting method allowing input of a plurality of different types of character species, and information processing equipment adopting the same

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YUAN M. ET AL: "A neural network for disambiguating pinyin Chinese input", PROC. OF THE COMPUTER ASSISTED LANGUAGE INSTRUCTION CONSORTIUM 94 ANNUAL SYMPOSIUM, 14 March 1994 (1994-03-14) - 18 March 1994 (1994-03-18), pages 239 - 243, XP001020457 *

Also Published As

Publication number Publication date
CN1384940A (en) 2002-12-11
AU8020900A (en) 2001-06-06
CN100492350C (en) 2009-05-27
JP2003527676A (en) 2003-09-16
WO2001035249A2 (en) 2001-05-17
US7165019B1 (en) 2007-01-16
JP4833476B2 (en) 2011-12-07
HK1050578A1 (en) 2003-06-27

Similar Documents

Publication Publication Date Title
WO2001035249A3 (en) Language input architecture for converting one text form to another text form with modeless entry
WO2001035250A3 (en) Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors
Chen et al. A new statistical approach to Chinese Pinyin input
KR101263332B1 (en) Automatic translation apparatus by using user interaction in mobile device and its method
KR100656736B1 (en) System and method for disambiguating phonetic input
US7506254B2 (en) Predictive conversion of user input
KR100724141B1 (en) Apparatus for Hangul output and method thereof
US8515733B2 (en) Method, device, computer program and computer program product for processing linguistic data in accordance with a formalized natural language
CN100472411C (en) Method for canceling character string in input method and text input system
Smith Limits on the application of frequency-based language models to OCR
CN101320366A (en) Apparatus, method for machine translation
KR100853173B1 (en) Automatic Speech Interpretation System based on Statistical Automatic Translation Method, Translation Processing Method Applied to It and Training Method
KR102794379B1 (en) Learning data correction method and apparatus thereof using ensemble score
Soumya et al. Development of a POS tagger for Malayalam-an experience
Ma et al. Easy-first chinese pos tagging and dependency parsing
CN102063196A (en) Intelligent Japanese input method capable of spelling by Romaji for mobile phone
CA2496872A1 (en) Phonetic and stroke input methods of chinese characters and phrases
Li et al. The study of comparison and conversion about traditional Mongolian and Cyrillic Mongolian
CN114282530B (en) Complex sentence emotion analysis method based on grammar structure and connection information trigger
Zhang et al. Normalization of homophonic words in chinese microblogs
Islam et al. Design analysis rules to identify proper noun from Bengali sentence for universal networking language
Cubel et al. Finite-state models for computer assisted translation
Ikegami et al. Flick: Japanese input method editor using N-gram and recurrent neural network language model based predictive text input
Li A pinyin input method editor with English-Chinese aided translation function
Sadiqui et al. A new method to construct a statistical model for Arabic language

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 008149062

Country of ref document: CN

ENP Entry into the national phase

Ref country code: JP

Ref document number: 2001 536715

Kind code of ref document: A

Format of ref document f/p: F

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase