MX2018001996A - Dynamic acoustic model for a vehicle - Google Patents

Dynamic acoustic model for a vehicle

Info

Publication number
MX2018001996A
Authority
MX
Mexico
Prior art keywords
acoustic model
vehicle
identification information
dynamic acoustic
speaker
Prior art date
Application number
MX2018001996A
Other languages
English (en)
Inventor
John Simonds Craig
Hassani Ali
A Cuddihy Mark
Mitra Pramita
Melcher David
Steven Strumolo Gary
Original Assignee
Ford Global Tech Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ford Global Tech Llc
Publication of MX2018001996A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/22 Interactive procedures; Man-machine interfaces
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G10L15/07 Adaptation to the speaker
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025 Phonemes, fenemes or fenones being the recognition units
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/227 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • User Interface Of Digital Computer (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)
  • Navigation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A vehicle speech processor includes a processing device and a data storage medium. The processing device is programmed to receive identification information from a wearable device, identify a speaker from the identification information, identify a dialect associated with the speaker from the identification information, select a predetermined acoustic model, and adjust the predetermined acoustic model based at least in part on the identified dialect.
MX2018001996A 2015-08-24 2015-08-24 Dynamic acoustic model for a vehicle MX2018001996A (es)
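
The sequence of steps in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: the `device_id` key, the `SPEAKER_PROFILES` and `DIALECT_SCALES` lookup tables, and the representation of an acoustic model as per-phoneme weights are all hypothetical. The abstract only states that a speaker and a dialect are identified from the wearable device's identification information and that a predetermined model is then adjusted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AcousticModel:
    name: str
    phoneme_weights: dict  # hypothetical parameterization, not from the patent


# Predetermined (default) acoustic model; the values are placeholders.
DEFAULT_MODEL = AcousticModel("default", {"ay": 1.0, "ah": 1.0, "iy": 1.0})

# Hypothetical mapping from a wearable's identifier to a speaker and dialect.
SPEAKER_PROFILES = {
    "wearable-001": {"speaker": "driver_a", "dialect": "dialect_x"},
}

# Hypothetical per-dialect scale factors applied to the default weights.
DIALECT_SCALES = {
    "dialect_x": {"ay": 0.8, "ah": 1.5},
}


def adjust_model_for(identification):
    """Return (speaker, model) for identification info sent by a wearable."""
    # Steps 1-2: receive the identification information and identify the speaker.
    profile = SPEAKER_PROFILES.get(identification.get("device_id"))
    if profile is None:
        # Unknown speaker: fall back to the unadjusted predetermined model.
        return None, DEFAULT_MODEL
    speaker = profile["speaker"]

    # Step 3: identify the dialect associated with that speaker.
    dialect = profile["dialect"]

    # Steps 4-5: select the predetermined model and adjust it for the dialect.
    scales = DIALECT_SCALES.get(dialect, {})
    adjusted = {
        phoneme: weight * scales.get(phoneme, 1.0)
        for phoneme, weight in DEFAULT_MODEL.phoneme_weights.items()
    }
    return speaker, AcousticModel(f"default+{dialect}", adjusted)
```

Calling `adjust_model_for({"device_id": "wearable-001"})` returns the hypothetical speaker name together with a copy of the default model whose weights are scaled for that speaker's dialect. An unrecognized device yields the unmodified default model.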

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2015/046473 WO2017034536A1 (en) 2015-08-24 2015-08-24 Dynamic acoustic model for vehicle

Publications (1)

Publication Number Publication Date
MX2018001996A (es) 2018-06-06

Family

ID=58100773

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2018001996A (es) Dynamic acoustic model for a vehicle 2015-08-24 2015-08-24

Country Status (7)

Country Link
US (1) US10593335B2 (es)
CN (1) CN108292507A (es)
DE (1) DE112015006831T5 (es)
GB (1) GB2557132B (es)
MX (1) MX2018001996A (es)
RU (1) RU2704746C2 (es)
WO (1) WO2017034536A1 (es)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102225404B1 (ko) * 2014-05-23 2021-03-09 삼성전자주식회사 Method and apparatus for speech recognition using device information
MX2018001996A (es) * 2015-08-24 2018-06-06 Ford Global Tech Llc Dynamic acoustic model for a vehicle
US10647332B2 (en) * 2017-09-12 2020-05-12 Harman International Industries, Incorporated System and method for natural-language vehicle control
US11011162B2 (en) 2018-06-01 2021-05-18 Soundhound, Inc. Custom acoustic models
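KR102718582B1 (ko) * 2018-10-19 2024-10-17 삼성전자주식회사 Apparatus and method for recognizing speech, and apparatus and method for training a speech recognition model
KR102871441B1 (ko) * 2019-06-18 2025-10-15 엘지전자 주식회사 Speech information-based language modeling system and method
KR102871460B1 (ko) * 2019-06-18 2025-10-15 엘지전자 주식회사 Dialect phoneme adaptation training system and method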
US12067972B2 (en) * 2020-12-16 2024-08-20 Samsung Electronics Co., Ltd. Electronic device and operation method thereof
US20230197097A1 (en) * 2021-12-16 2023-06-22 Mediatek Inc. Sound enhancement method and related communication apparatus

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1187096A1 (en) * 2000-09-06 2002-03-13 Sony International (Europe) GmbH Speaker adaptation with speech model pruning
US20020143540A1 (en) * 2001-03-28 2002-10-03 Narendranath Malayath Voice recognition system using implicit speaker adaptation
US8762143B2 (en) * 2007-05-29 2014-06-24 At&T Intellectual Property Ii, L.P. Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition
US8548807B2 (en) * 2009-06-09 2013-10-01 At&T Intellectual Property I, L.P. System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring
WO2011016886A1 (en) * 2009-08-05 2011-02-10 Ford Global Technologies, Llc System and method for transmitting vehicle information to an occupant communication device
US9484027B2 (en) * 2009-12-10 2016-11-01 General Motors Llc Using pitch during speech recognition post-processing to improve recognition accuracy
US8660842B2 (en) 2010-03-09 2014-02-25 Honda Motor Co., Ltd. Enhancing speech recognition using visual information
GB2489489B (en) 2011-03-30 2013-08-21 Toshiba Res Europ Ltd A speech processing system and method
US9082414B2 (en) * 2011-09-27 2015-07-14 General Motors Llc Correcting unintelligible synthesized speech
WO2013110125A1 (en) * 2012-01-24 2013-08-01 Auraya Pty Ltd Voice authentication and speech recognition system and method
US9798799B2 (en) * 2012-11-15 2017-10-24 Sri International Vehicle personal assistant that interprets spoken natural language input based upon vehicle context
US9146899B2 (en) * 2013-02-07 2015-09-29 Ford Global Technologies, Llc System and method of arbitrating audio source streamed by mobile applications
US20140278395A1 (en) 2013-03-12 2014-09-18 Motorola Mobility Llc Method and Apparatus for Determining a Motion Environment Profile to Adapt Voice Recognition Processing
US20140379346A1 (en) 2013-06-21 2014-12-25 Google Inc. Video analysis based language model adaptation
US9336781B2 (en) * 2013-10-17 2016-05-10 Sri International Content-aware speaker recognition
US9037125B1 (en) 2014-04-07 2015-05-19 Google Inc. Detecting driving with a wearable computing device
MX2018001996A (es) * 2015-08-24 2018-06-06 Ford Global Tech Llc Dynamic acoustic model for a vehicle
US10297251B2 (en) * 2016-01-21 2019-05-21 Ford Global Technologies, Llc Vehicle having dynamic acoustic model switching to improve noisy speech recognition

Also Published As

Publication number Publication date
RU2704746C2 (ru) 2019-10-30
US20180286413A1 (en) 2018-10-04
CN108292507A (zh) 2018-07-17
DE112015006831T5 (de) 2018-05-24
GB2557132B (en) 2021-06-23
US10593335B2 (en) 2020-03-17
WO2017034536A1 (en) 2017-03-02
GB201803532D0 (en) 2018-04-18
RU2018106645A3 (es) 2019-09-26
RU2018106645A (ru) 2019-09-26
GB2557132A (en) 2018-06-13

Similar Documents

Publication Publication Date Title
MX2018001996A (es) Dynamic acoustic model for a vehicle
EP2806425A3 (en) System and method for speaker verification
EP3751561A3 (en) Hotword recognition
IL282172A (en) Systems and methods for training machine models with augmented data
EP3648099A4 (en) METHOD, DEVICE, APPARATUS, AND MEDIUM OF VOICE RECOGNITION INFORMATION
WO2016033480A3 (en) Intermediate compression for higher order ambisonic audio data
EP3154055A3 (en) Dynamic threshold for speaker verification
EP4047497A3 (en) Speaker verification using co-location information
PH12019500283A1 (en) Distributed transaction processing and authentication system
EP3779789A4 (en) CLASSIFICATION MODEL PRODUCTION PROCESS AND DEVICE, AND DATA IDENTIFICATION PROCESS AND DEVICE
KR20180084394A (ko) Method for detecting utterance completion and electronic device implementing the same
EP4283613A3 (en) Noise mitigation for a voice interface device
MY201873A (en) Risk address identification method and apparatus, and electronic device
GB2543972A (en) Systems and methods for equalizing audio for playback on an electronic device
EP2966644A3 (en) Methods and systems for managing speech recognition in a multi-speech system environment
EP4236332A3 (en) Techniques and apparatus for editing video
MX2014010795A (es) Device for extracting information from a dialog
EP2787449A3 (en) Text data processing method and corresponding electronic device
EP4243450A3 (en) Method of calibrating a playback device, corresponding playback device, system and computer readable storage medium
SG10201900178WA (en) Speech transaction processing
WO2014150214A3 (en) Questions answering to populate knowledge base
GB2551917A (en) Privacy-preserving training corpus selection
WO2014210193A3 (en) Providing information to a user based on determined user activity
SG11202101248SA (en) Information processing method and device based on blockchain, and computer-readable storage medium
WO2018118492A3 (en) Linguistic modeling using sets of base phonetics