MX2018001996A - Dynamic acoustic model for a vehicle - Google Patents
- Publication number
- MX2018001996A
- Authority
- MX
- Mexico
- Prior art keywords
- acoustic model
- vehicle
- identification information
- dynamic acoustic
- speaker
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/227—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
- User Interface Of Digital Computer (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
- Navigation (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A vehicle speech processor includes a processing device and a data storage medium. The processing device is programmed to receive identification information from a wearable device, identify a speaker from the identification information, identify a dialect associated with the speaker from the identification information, select a predetermined acoustic model, and adjust the predetermined acoustic model based at least in part on the identified dialect.
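The steps named in the abstract (receive identification information, identify the speaker and dialect, select a predetermined model, adjust it) can be pictured with a minimal Python sketch. Everything here is an illustrative assumption and not the patented implementation: the `AcousticModel` class, the lookup tables, the identifier `"wearable-001"`, and the idea of representing a model as one scalar per phoneme are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class AcousticModel:
    # Toy stand-in for an acoustic model: one scalar per phoneme (hypothetical).
    phoneme_means: Dict[str, float]


# Hypothetical data; a real system would read these from its data storage medium.
SPEAKER_PROFILES = {
    "wearable-001": {"speaker": "driver_a", "dialect": "dialect_x"},
}
DIALECT_OFFSETS = {
    "dialect_x": {"r": -0.2, "a": 0.1},
}
PREDETERMINED_MODEL = AcousticModel({"r": 1.0, "a": 2.0, "t": 3.0})


def identify_speaker(identification: str) -> Optional[str]:
    """Look up the speaker tied to the identification information."""
    profile = SPEAKER_PROFILES.get(identification)
    return profile["speaker"] if profile else None


def identify_dialect(identification: str) -> Optional[str]:
    """Look up the dialect tied to the identification information."""
    profile = SPEAKER_PROFILES.get(identification)
    return profile["dialect"] if profile else None


def adjust_model(model: AcousticModel, dialect: Optional[str]) -> AcousticModel:
    """Return a copy of the model shifted by the dialect's per-phoneme offsets."""
    offsets = DIALECT_OFFSETS.get(dialect, {}) if dialect else {}
    adjusted = {
        phoneme: mean + offsets.get(phoneme, 0.0)
        for phoneme, mean in model.phoneme_means.items()
    }
    return AcousticModel(adjusted)


def model_for(identification: str) -> AcousticModel:
    """Select the predetermined model and adjust it for the identified dialect."""
    dialect = identify_dialect(identification)
    return adjust_model(PREDETERMINED_MODEL, dialect)
```

Calling `model_for("wearable-001")` shifts the entries for `"r"` and `"a"` and leaves `"t"` unchanged, while an unknown identifier falls back to an unmodified copy of the predetermined model. Returning a new object rather than mutating the shared default is a design choice of this sketch, so that one speaker's adjustments cannot leak into another's.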
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2015/046473 WO2017034536A1 (en) | 2015-08-24 | 2015-08-24 | Dynamic acoustic model for vehicle |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MX2018001996A (es) | 2018-06-06 |
Family
ID=58100773
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| MX2018001996A (es) | Dynamic acoustic model for a vehicle | 2015-08-24 | 2015-08-24 |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US10593335B2 (es) |
| CN (1) | CN108292507A (es) |
| DE (1) | DE112015006831T5 (es) |
| GB (1) | GB2557132B (es) |
| MX (1) | MX2018001996A (es) |
| RU (1) | RU2704746C2 (es) |
| WO (1) | WO2017034536A1 (es) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR102225404B1 (ko) * | 2014-05-23 | 2021-03-09 | 삼성전자주식회사 | Speech recognition method and apparatus using device information |
| MX2018001996A (es) * | 2015-08-24 | 2018-06-06 | Ford Global Tech Llc | Dynamic acoustic model for a vehicle |
| US10647332B2 (en) * | 2017-09-12 | 2020-05-12 | Harman International Industries, Incorporated | System and method for natural-language vehicle control |
| US11011162B2 (en) | 2018-06-01 | 2021-05-18 | Soundhound, Inc. | Custom acoustic models |
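| KR102718582B1 (ko) * | 2018-10-19 | 2024-10-17 | 삼성전자주식회사 | Apparatus and method for recognizing speech, and apparatus and method for training a speech recognition model |
| KR102871441B1 (ko) * | 2019-06-18 | 2025-10-15 | 엘지전자 주식회사 | System and method for language modeling based on voice information |
| KR102871460B1 (ko) * | 2019-06-18 | 2025-10-15 | 엘지전자 주식회사 | System and method for dialect phoneme adaptation learning |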
| US12067972B2 (en) * | 2020-12-16 | 2024-08-20 | Samsung Electronics Co., Ltd. | Electronic device and operation method thereof |
| US20230197097A1 (en) * | 2021-12-16 | 2023-06-22 | Mediatek Inc. | Sound enhancement method and related communication apparatus |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1187096A1 (en) * | 2000-09-06 | 2002-03-13 | Sony International (Europe) GmbH | Speaker adaptation with speech model pruning |
| US20020143540A1 (en) * | 2001-03-28 | 2002-10-03 | Narendranath Malayath | Voice recognition system using implicit speaker adaptation |
| US8762143B2 (en) * | 2007-05-29 | 2014-06-24 | At&T Intellectual Property Ii, L.P. | Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition |
| US8548807B2 (en) * | 2009-06-09 | 2013-10-01 | At&T Intellectual Property I, L.P. | System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring |
| WO2011016886A1 (en) * | 2009-08-05 | 2011-02-10 | Ford Global Technologies, Llc | System and method for transmitting vehicle information to an occupant communication device |
| US9484027B2 (en) * | 2009-12-10 | 2016-11-01 | General Motors Llc | Using pitch during speech recognition post-processing to improve recognition accuracy |
| US8660842B2 (en) | 2010-03-09 | 2014-02-25 | Honda Motor Co., Ltd. | Enhancing speech recognition using visual information |
| GB2489489B (en) | 2011-03-30 | 2013-08-21 | Toshiba Res Europ Ltd | A speech processing system and method |
| US9082414B2 (en) * | 2011-09-27 | 2015-07-14 | General Motors Llc | Correcting unintelligible synthesized speech |
| WO2013110125A1 (en) * | 2012-01-24 | 2013-08-01 | Auraya Pty Ltd | Voice authentication and speech recognition system and method |
| US9798799B2 (en) * | 2012-11-15 | 2017-10-24 | Sri International | Vehicle personal assistant that interprets spoken natural language input based upon vehicle context |
| US9146899B2 (en) * | 2013-02-07 | 2015-09-29 | Ford Global Technologies, Llc | System and method of arbitrating audio source streamed by mobile applications |
| US20140278395A1 (en) | 2013-03-12 | 2014-09-18 | Motorola Mobility Llc | Method and Apparatus for Determining a Motion Environment Profile to Adapt Voice Recognition Processing |
| US20140379346A1 (en) | 2013-06-21 | 2014-12-25 | Google Inc. | Video analysis based language model adaptation |
| US9336781B2 (en) * | 2013-10-17 | 2016-05-10 | Sri International | Content-aware speaker recognition |
| US9037125B1 (en) | 2014-04-07 | 2015-05-19 | Google Inc. | Detecting driving with a wearable computing device |
| MX2018001996A (es) * | 2015-08-24 | 2018-06-06 | Ford Global Tech Llc | Dynamic acoustic model for a vehicle |
| US10297251B2 (en) * | 2016-01-21 | 2019-05-21 | Ford Global Technologies, Llc | Vehicle having dynamic acoustic model switching to improve noisy speech recognition |

2015
- 2015-08-24: MX, application MX2018001996A, publication MX2018001996A (es), status unknown
- 2015-08-24: DE, application DE112015006831.7T, publication DE112015006831T5 (de), not active (withdrawn)
- 2015-08-24: WO, application PCT/US2015/046473, publication WO2017034536A1 (en), not active (ceased)
- 2015-08-24: US, application US15/747,276, publication US10593335B2 (en), active
- 2015-08-24: GB, application GB1803532.9A, publication GB2557132B (en), not active (expired, fee related)
- 2015-08-24: CN, application CN201580082572.1A, publication CN108292507A (zh), active (pending)
- 2015-08-24: RU, application RU2018106645A, publication RU2704746C2 (ru), active
Also Published As
| Publication number | Publication date |
|---|---|
| RU2704746C2 (ru) | 2019-10-30 |
| US20180286413A1 (en) | 2018-10-04 |
| CN108292507A (zh) | 2018-07-17 |
| DE112015006831T5 (de) | 2018-05-24 |
| GB2557132B (en) | 2021-06-23 |
| US10593335B2 (en) | 2020-03-17 |
| WO2017034536A1 (en) | 2017-03-02 |
| GB201803532D0 (en) | 2018-04-18 |
| RU2018106645A3 (es) | 2019-09-26 |
| RU2018106645A (ru) | 2019-09-26 |
| GB2557132A (en) | 2018-06-13 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| MX2018001996A (es) | Dynamic acoustic model for a vehicle | |
| EP2806425A3 (en) | System and method for speaker verification | |
| EP3751561A3 (en) | Hotword recognition | |
| IL282172A (en) | Systems and methods for training machine models with augmented data | |
| EP3648099A4 (en) | METHOD, DEVICE, APPARATUS, AND MEDIUM OF VOICE RECOGNITION INFORMATION | |
| WO2016033480A3 (en) | Intermediate compression for higher order ambisonic audio data | |
| EP3154055A3 (en) | Dynamic threshold for speaker verification | |
| EP4047497A3 (en) | Speaker verification using co-location information | |
| PH12019500283A1 (en) | Distributed transaction processing and authentication system | |
| EP3779789A4 (en) | CLASSIFICATION MODEL PRODUCTION PROCESS AND DEVICE, AND DATA IDENTIFICATION PROCESS AND DEVICE | |
| KR20180084394A (ko) | Method for detecting the end of an utterance and electronic device implementing the same | |
| EP4283613A3 (en) | Noise mitigation for a voice interface device | |
| MY201873A (en) | Risk address identification method and apparatus, and electronic device | |
| GB2543972A (en) | Systems and methods for equalizing audio for playback on an electronic device | |
| EP2966644A3 (en) | Methods and systems for managing speech recognition in a multi-speech system environment | |
| EP4236332A3 (en) | Techniques and apparatus for editing video | |
| MX2014010795A (es) | Device for extracting information from a dialogue | |
| EP2787449A3 (en) | Text data processing method and corresponding electronic device | |
| EP4243450A3 (en) | Method of calibrating a playback device, corresponding playback device, system and computer readable storage medium | |
| SG10201900178WA (en) | Speech transaction processing | |
| WO2014150214A3 (en) | Questions answering to populate knowledge base | |
| GB2551917A (en) | Privacy-preserving training corpus selection | |
| WO2014210193A3 (en) | Providing information to a user based on determined user activity | |
| SG11202101248SA (en) | Information processing method and device based on blockchain, and computer-readable storage medium | |
| WO2018118492A3 (en) | Linguistic modeling using sets of base phonetics |