MX2016017394A - Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs
- Publication number
- MX2016017394A
- Authority
- MX
- Mexico
- Prior art keywords
- heterographs
- systems
- methods
- word
- words
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/193—Formal grammars, e.g. finite state automata, context free grammars or word networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- User Interface Of Digital Computer (AREA)
- Information Transfer Between Computers (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Machine Translation (AREA)
- Steroid Compounds (AREA)
Abstract
Systems and methods are provided for performing automatic speech recognition (ASR) in the presence of heterographs. Verbal input that includes a plurality of utterances is received from the user. A first of the plurality of utterances is matched to a first word. It is then determined that a second utterance in the plurality of utterances matches a plurality of words that belong to the same heterograph set. Next, it is identified which one of the plurality of words is associated with a context of the first word. Finally, a function is performed based on the first word and the identified one of the plurality of words.
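The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the phoneme keys, the `LEXICON` and `RELATED` tables, and the function name `resolve_heterographs` are all hypothetical and chosen only for illustration. The sketch assumes each utterance has already been reduced to a pronunciation key, and that "context" can be approximated by a simple lookup of related words.

```python
# Hypothetical lexicon: one pronunciation key maps to every spelling that
# shares it. A key with several spellings is a heterograph set.
LEXICON = {
    "D UW K": ["Duke"],
    "V ER S AH S": ["versus"],
    "Y UW K AA N": ["Uconn", "Yukon"],
}

# Hypothetical context table: words considered related to a given word.
RELATED = {
    "Duke": {"Uconn", "basketball"},
    "Yukon": {"Canada", "river"},
}


def resolve_heterographs(pronunciations, lexicon, related):
    """Map each pronunciation to a word, using unambiguous words as context."""
    candidates = [lexicon.get(p, []) for p in pronunciations]

    # Words matched by exactly one spelling serve as the context.
    context = {options[0] for options in candidates if len(options) == 1}

    def score(word):
        # Count context words that are related to this candidate (either way).
        return sum(
            1
            for ctx in context
            if word in related.get(ctx, set()) or ctx in related.get(word, set())
        )

    resolved = []
    for options in candidates:
        if not options:
            resolved.append(None)  # no match in the lexicon
        elif len(options) == 1:
            resolved.append(options[0])
        else:
            # Heterograph set: keep the candidate best supported by context.
            # Ties fall back to the first listed spelling.
            resolved.append(max(options, key=score))
    return resolved


words = resolve_heterographs(["D UW K", "V ER S AH S", "Y UW K AA N"], LEXICON, RELATED)
print(words)
```

In this sketch the third utterance matches both "Uconn" and "Yukon"; because "Uconn" is listed as related to the unambiguous first word "Duke", it is selected. A downstream function, such as a search, would then run on the resolved words.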
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/448,308 US9721564B2 (en) | 2014-07-31 | 2014-07-31 | Systems and methods for performing ASR in the presence of heterographs |
| PCT/US2015/042584 WO2016018981A1 (en) | 2014-07-31 | 2015-07-29 | Systems and methods for performing asr in the presence of heterographs |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| MX2016017394A (es) | 2017-04-27 |
| MX359330B (es) | 2018-09-25 |
Family
ID=53784025
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| MX2016017394A MX359330B (es) | 2014-07-31 | 2015-07-29 | Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs |
Country Status (13)
| Country | Link |
|---|---|
| US (1) | US9721564B2 (es) |
| EP (2) | EP3364408B1 (es) |
| JP (1) | JP6684231B2 (es) |
| KR (3) | KR102438752B1 (es) |
| CN (1) | CN106471571A (es) |
| AU (1) | AU2015296597A1 (es) |
| CA (2) | CA2954197C (es) |
| DK (1) | DK3175442T3 (es) |
| ES (1) | ES2675302T3 (es) |
| GB (1) | GB2530871B (es) |
| MX (1) | MX359330B (es) |
| PT (2) | PT3364408T (es) |
| WO (1) | WO2016018981A1 (es) |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10068023B2 (en) | 2014-12-30 | 2018-09-04 | Rovi Guides, Inc. | Systems and methods for updating links between keywords associated with a trending topic |
| US9854049B2 (en) * | 2015-01-30 | 2017-12-26 | Rovi Guides, Inc. | Systems and methods for resolving ambiguous terms in social chatter based on a user profile |
| US10628009B2 (en) | 2015-06-26 | 2020-04-21 | Rovi Guides, Inc. | Systems and methods for automatic formatting of images for media assets based on user profile |
| US9576578B1 (en) * | 2015-08-12 | 2017-02-21 | Google Inc. | Contextual improvement of voice query recognition |
| US10031967B2 (en) | 2016-02-29 | 2018-07-24 | Rovi Guides, Inc. | Systems and methods for using a trained model for determining whether a query comprising multiple segments relates to an individual query or several queries |
| US10133735B2 (en) | 2016-02-29 | 2018-11-20 | Rovi Guides, Inc. | Systems and methods for training a model to determine whether a query with multiple segments comprises multiple distinct commands or a combined command |
| US20170272825A1 (en) | 2016-03-16 | 2017-09-21 | Rovi Guides, Inc. | System and method for locating content related to a media asset |
| US10169470B2 (en) | 2016-04-11 | 2019-01-01 | Rovi Guides, Inc. | Systems and methods for identifying a meaning of an ambiguous term in a natural language query |
| US10503832B2 (en) * | 2016-07-29 | 2019-12-10 | Rovi Guides, Inc. | Systems and methods for disambiguating a term based on static and temporal knowledge graphs |
| US9959864B1 (en) | 2016-10-27 | 2018-05-01 | Google Llc | Location-based voice query recognition |
| US10097898B2 (en) | 2016-11-21 | 2018-10-09 | Rovi Guides, Inc. | Systems and methods for generating for display recommendations that are temporally relevant to activities of a user and are contextually relevant to a portion of a media asset that the user is consuming |
| US11094317B2 (en) | 2018-07-31 | 2021-08-17 | Samsung Electronics Co., Ltd. | System and method for personalized natural language understanding |
| CN110176237A (zh) * | 2019-07-09 | 2019-08-27 | 北京金山数字娱乐科技有限公司 | A speech recognition method and device |
| US11721322B2 (en) | 2020-02-28 | 2023-08-08 | Rovi Guides, Inc. | Automated word correction in speech recognition systems |
| CN119228271B (zh) * | 2024-11-30 | 2025-02-18 | 西昌学院 | A library inventory management method and system |
Family Cites Families (38)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS60130798A (ja) * | 1983-12-19 | 1985-07-12 | 松下電器産業株式会社 | Speech identification device |
| US4980918A (en) | 1985-05-09 | 1990-12-25 | International Business Machines Corporation | Speech recognition system with efficient storage and rapid assembly of phonological graphs |
| US6239794B1 (en) | 1994-08-31 | 2001-05-29 | E Guide, Inc. | Method and system for simultaneously displaying a television program and information about the program |
| US6388714B1 (en) | 1995-10-02 | 2002-05-14 | Starsight Telecast Inc | Interactive computer system for providing television schedule information |
| US6177931B1 (en) | 1996-12-19 | 2001-01-23 | Index Systems, Inc. | Systems and methods for displaying and recording control interface with television programs, video, advertising information and program scheduling information |
| US5963957A (en) * | 1997-04-28 | 1999-10-05 | Philips Electronics North America Corporation | Bibliographic music data base with normalized musical themes |
| US6182038B1 (en) | 1997-12-01 | 2001-01-30 | Motorola, Inc. | Context dependent phoneme networks for encoding speech information |
| US6564378B1 (en) | 1997-12-08 | 2003-05-13 | United Video Properties, Inc. | Program guide system with browsing display |
| ES2197627T3 (es) | 1998-03-04 | 2004-01-01 | United Video Properties, Inc. | Program guide system with targeted advertising |
| US6236968B1 (en) | 1998-05-14 | 2001-05-22 | International Business Machines Corporation | Sleep prevention dialog based car system |
| CN1867068A (zh) | 1998-07-14 | 2006-11-22 | 联合视频制品公司 | Interactive television program guide system and method thereof |
| EP1213919B1 (en) | 1998-07-17 | 2010-03-10 | United Video Properties, Inc. | Interactive television program guide system having multiple devices within a household |
| AR020608A1 (es) | 1998-07-17 | 2002-05-22 | United Video Properties Inc | A method and an arrangement for providing a user with remote access to an interactive program guide over a remote access link |
| US6269335B1 (en) | 1998-08-14 | 2001-07-31 | International Business Machines Corporation | Apparatus and methods for identifying homophones among words in a speech recognition system |
| US7165098B1 (en) | 1998-11-10 | 2007-01-16 | United Video Properties, Inc. | On-line schedule system with personalization features |
| US6370503B1 (en) | 1999-06-30 | 2002-04-09 | International Business Machines Corp. | Method and apparatus for improving speech recognition accuracy |
| KR101775064B1 (ko) | 2001-02-21 | 2017-09-06 | 로비 가이드스, 인크. | Systems and methods for an interactive program guide with personal video recording features |
| US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
| JP2006085565A (ja) * | 2004-09-17 | 2006-03-30 | Fuji Xerox Co Ltd | Information processing apparatus, information processing method, and computer program |
| US7818179B2 (en) | 2004-11-12 | 2010-10-19 | International Business Machines Corporation | Devices and methods providing automated assistance for verbal communication |
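| WO2006085565A1 (ja) * | 2005-02-08 | 2006-08-17 | Nippon Telegraph And Telephone Corporation | Information communication terminal, information communication system, information communication method, information communication program, and recording medium on which the program is recorded |
| KR100755677B1 (ko) * | 2005-11-02 | 2007-09-05 | 삼성전자주식회사 | Apparatus and method for conversational speech recognition using topic domain detection |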
| US20100153885A1 (en) | 2005-12-29 | 2010-06-17 | Rovi Technologies Corporation | Systems and methods for interacting with advanced displays provided by an interactive media guidance application |
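| JP4734155B2 (ja) * | 2006-03-24 | 2011-07-27 | 株式会社東芝 | Speech recognition apparatus, speech recognition method, and speech recognition program |
| CN101118541B (zh) * | 2006-08-03 | 2011-08-17 | 苗玉水 | Chinese speech recognition method using Chinese phonetic codes |
| JP5121252B2 (ja) | 2007-02-26 | 2013-01-16 | 株式会社東芝 | Apparatus, method, and program for translating speech in a source language into a target language |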
| US20080270110A1 (en) | 2007-04-30 | 2008-10-30 | Yurick Steven J | Automatic speech recognition with textual content input |
| WO2009105639A1 (en) | 2008-02-22 | 2009-08-27 | Vocera Communications, Inc. | System and method for treating homonyms in a speech recognition system |
| CN101655837B (zh) * | 2009-09-08 | 2010-10-13 | 北京邮电大学 | A method for detecting and correcting errors in text after speech recognition |
| US8744860B2 (en) * | 2010-08-02 | 2014-06-03 | At&T Intellectual Property I, L.P. | Apparatus and method for providing messages in a social network |
| EP2721608B1 (en) | 2011-06-19 | 2019-03-13 | MModal IP LLC | Speech recognition using context-aware recognition models |
| WO2013006215A1 (en) * | 2011-07-01 | 2013-01-10 | Nec Corporation | Method and apparatus of confidence measure calculation |
| US8606577B1 (en) | 2012-06-25 | 2013-12-10 | Google Inc. | Visual confirmation of voice recognized text input |
| US8909526B2 (en) | 2012-07-09 | 2014-12-09 | Nuance Communications, Inc. | Detecting potential significant errors in speech recognition results |
| US9588964B2 (en) * | 2012-09-18 | 2017-03-07 | Adobe Systems Incorporated | Natural language vocabulary generation and usage |
| US20140122069A1 (en) | 2012-10-30 | 2014-05-01 | International Business Machines Corporation | Automatic Speech Recognition Accuracy Improvement Through Utilization of Context Analysis |
| US9189742B2 (en) | 2013-11-20 | 2015-11-17 | Justin London | Adaptive virtual intelligent agent |
| US10296160B2 (en) * | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
-
2014
- 2014-07-31 US US14/448,308 patent/US9721564B2/en active Active
-
2015
- 2015-07-29 EP EP18165726.3A patent/EP3364408B1/en active Active
- 2015-07-29 JP JP2016575665A patent/JP6684231B2/ja active Active
- 2015-07-29 WO PCT/US2015/042584 patent/WO2016018981A1/en not_active Ceased
- 2015-07-29 CA CA2954197A patent/CA2954197C/en active Active
- 2015-07-29 EP EP15747723.3A patent/EP3175442B1/en active Active
- 2015-07-29 MX MX2016017394A patent/MX359330B/es active IP Right Grant
- 2015-07-29 PT PT181657263T patent/PT3364408T/pt unknown
- 2015-07-29 KR KR1020167036970A patent/KR102438752B1/ko active Active
- 2015-07-29 CN CN201580035900.2A patent/CN106471571A/zh active Pending
- 2015-07-29 PT PT157477233T patent/PT3175442T/pt unknown
- 2015-07-29 KR KR1020237029548A patent/KR20230130761A/ko not_active Withdrawn
- 2015-07-29 ES ES15747723.3T patent/ES2675302T3/es active Active
- 2015-07-29 AU AU2015296597A patent/AU2015296597A1/en not_active Abandoned
- 2015-07-29 CA CA3187269A patent/CA3187269A1/en active Pending
- 2015-07-29 KR KR1020227029745A patent/KR102574333B1/ko active Active
- 2015-07-29 DK DK15747723.3T patent/DK3175442T3/en active
- 2015-07-30 GB GB1513493.5A patent/GB2530871B/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| MX359330B (es) | 2018-09-25 |
| EP3175442A1 (en) | 2017-06-07 |
| WO2016018981A1 (en) | 2016-02-04 |
| AU2015296597A1 (en) | 2017-01-12 |
| KR102438752B1 (ko) | 2022-08-30 |
| KR102574333B1 (ko) | 2023-09-01 |
| JP2017525993A (ja) | 2017-09-07 |
| CA2954197A1 (en) | 2016-02-04 |
| CN106471571A (zh) | 2017-03-01 |
| US20160035347A1 (en) | 2016-02-04 |
| US9721564B2 (en) | 2017-08-01 |
| GB2530871B (en) | 2018-11-21 |
| ES2675302T3 (es) | 2018-07-10 |
| KR20230130761A (ko) | 2023-09-12 |
| PT3175442T (pt) | 2018-06-19 |
| GB201513493D0 (en) | 2015-09-16 |
| EP3364408B1 (en) | 2021-05-19 |
| EP3364408A1 (en) | 2018-08-22 |
| PT3364408T (pt) | 2021-06-14 |
| DK3175442T3 (en) | 2018-06-18 |
| JP6684231B2 (ja) | 2020-04-22 |
| EP3175442B1 (en) | 2018-06-06 |
| KR20170040134A (ko) | 2017-04-12 |
| GB2530871A (en) | 2016-04-06 |
| CA2954197C (en) | 2023-03-21 |
| KR20220123347A (ko) | 2022-09-06 |
| CA3187269A1 (en) | 2016-02-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| MX2016017394A (es) | | Systems and methods for performing automatic speech recognition (ASR) in the presence of heterographs |
| EP3180785A4 (en) | | Systems and methods for speech transcription |
| EP3114679A4 (en) | | Predicting pronunciation in speech recognition |
| EP3172729A4 (en) | | Text rule based multi-accent speech recognition with single acoustic model and automatic accent detection |
| WO2016044027A8 (en) | | Method and apparatus for performing speaker recognition |
| MX2017003316A (es) | | Disambiguation of keyboard input |
| GB2530131B (en) | | Speech recognition methods, devices, and systems |
| EP3482564A4 (en) | | METHODS AND SYSTEMS FOR GENERATING AND PROVIDING PROGRAM GUIDES AND CONTENT |
| EP3371808B8 (en) | | Speech processing system and method |
| SG10201807147TA (en) | | Verification methods and verification devices |
| SG11201707861UA (en) | | Systems and methods for executing cryptographically secure transactions using voice and natural language processing |
| GB201501383D0 (en) | | Adjusting speech recognition using contextual information |
| GB2540702A (en) | | Nucleic acid processing of a nucleic acid fragment with a triazole linkage |
| EP3767620A3 (en) | | Speech endpointing based on word comparisons |
| AU2015302050B2 (en) | | Ans assessment systems, kits, and methods |
| ZA201805939B (en) | | Systems and methods for searching databases using graphical user interfaces that include concept stacks |
| EP4597490A3 (en) | | Electronic device and voice recognition method thereof |
| EP3193328A4 (en) | | Method and device for performing voice recognition using grammar model |
| WO2014102548A3 (en) | | Search system and corresponding method |
| GB201701141D0 (en) | | Acoustic and domain based speech recognition for vehicles |
| EP3300075A4 (en) | | Speech recognition device and computer program |
| EP3125134A4 (en) | | Speech retrieval device, speech retrieval method, and display device |
| EP3168839A4 (en) | | Voice recognition device and voice recognition system |
| EP3100257A4 (en) | | System and method for learning, composing, and playing music with physical objects |
| EP3211637A4 (en) | | Speech synthesis device and method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | FG | Grant or registration | |