Ondas et al., 2017 - Google Patents

Domain-specific language models training methodology for the in-car infotainment

Document ID: 3099912013524682421
Authors: Ondas S; Gurcik M
Publication year: 2017
Publication venue: Intelligent Decision Technologies

Snippet

The paper focuses on a methodology for training small domain-specific language models. The methodology has been applied to create a language model for the demonstration version of an in-vehicle infotainment system speech interface. The proposed …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Taking into account non-speech characteristics
- G10L2015/228—Taking into account non-speech characteristics of application context
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
- G06Q10/105—Human resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce, e.g. shopping or e-commerce
- G06Q30/02—Marketing, e.g. market research and analysis, surveying, promotions, advertising, buyer profiling, customer management or rewards; Price estimation or determination
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances

Similar Documents

Publication	Publication Date	Title
Kraljic et al.	2008	First impressions and last resorts: How listeners adjust to speaker variability
US10770062B2 (en)	2020-09-08	Adjusting a ranking of information content of a software application based on feedback from a user
US8103510B2 (en)	2012-01-24	Device control device, speech recognition device, agent device, on-vehicle device control device, navigation device, audio device, device control method, speech recognition method, agent processing method, on-vehicle device control method, navigation method, and audio device control method, and program
EP3736807B1 (en)	2024-06-05	Apparatus for media entity pronunciation using deep learning
Gnevsheva	2020	The role of style in the ethnolect: Style-shifting in the use of ethnolectal features in first- and second-generation speakers
KR20210067426A (en)	2021-06-08	Voice diary device
Caplan et al.	2021	Now you hear me, later you don’t: The immediacy of linguistic computation and the representation of speech
Hansen Edwards et al.	2021	Social factors and the teaching of pronunciation: What the research tells us
Meer et al.	2022	The Trini Sing-Song: Sociophonetic variation in Trinidadian English prosody and differences to other varieties
Soffer	2020	From textual orality to oral textuality: The case of voice queries
da Silva et al.	2024	How do illiterate people interact with an intelligent voice assistant?
Neustein	2010	Advances in speech recognition: mobile environments, call centers and clinics
MacDonald et al.	2024	Growing up and waking up: A conversation with Ken Wilber about leaving transpersonal to form integral psychology
Bédard et al.	2017	SyllabO+: A new tool to study sublexical phenomena in spoken Quebec French
Reineke et al.	2024	User practices in dealing with trouble in interactions with virtual assistants in German: Repeating, altering and insisting
Cibrian et al.	2025	Limitations in speech recognition for young adults with Down syndrome
US20220012420A1 (en)	2022-01-13	Process, system, and method for collecting, predicting, and instructing the pronunciation of words
Riverin-Coutlée et al.	2023	Using Mahalanobis distances to investigate second dialect acquisition: A study on Quebec French
Hackert et al.	2022	Recent grammatical change in postcolonial Englishes: A real-time study of genitive variation in Caribbean and Indian news writing
Myers	2020	An acoustic study of sandhi vowel hiatus in Luganda
Ramachandran	2018	Predicting user acceptance of Tamil speech to text by native Tamil Brahmans
Mittal et al.	2017	Speaker-independent automatic speech recognition system for mobile phone applications in Punjabi
de Vries et al.	2019	“You Can Do It!”—Crowdsourcing Motivational Speech and Text Messages
Golob et al.	2012	FST-based pronunciation lexicon compression for speech engines