Ion et al., 2020

A dialog manager for micro-worlds

Document ID: 6467772030983757377
Author: Ion R; Badea V; Cioroiu G; Mititelu V; Irimia E; Mitrofan M; Tufis D
Publication year: 2020
Publication venue: Studies in informatics and control

Snippet

The paper describes the micro-world-based dialog manager developed in the ROBIN project. The manager was designed to be loaded onto the Pepper robot, to be used in real-world scenarios, and to interface with real-time automatic speech recognition and synthesis for …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2785—Semantic analysis
- G06F17/279—Discourse representation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Taking into account non-speech characteristics
- G10L2015/228—Taking into account non-speech characteristics of application context
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback

Similar Documents

Publication	Publication Date	Title
AU2021202694B2 (en)	2022-06-02	Facilitating end-to-end communications with automated assistants in multiple languages
US11354521B2 (en)	2022-06-07	Facilitating communications with automated assistants in multiple languages
CN114041283B (en)	2024-06-07	Engage automated assistants using pre- and post-event input streams
Lee et al.	2010	Recent approaches to dialog management for spoken dialog systems
US7860705B2 (en)	2010-12-28	Methods and apparatus for context adaptation of speech-to-speech translation systems
US7552053B2 (en)	2009-06-23	Techniques for aiding speech-to-speech translation
Ashraff	2025	Voice-based interaction with digital services
Ion et al.	2020	A dialog manager for micro-worlds
Trigui et al.	2016	Statistical Approach for Spontaneous Arabic Speech Understanding Based on Stochastic Speech Recognition Module
Sinha et al.	2020	Transforming interactions: mouse-based to voice-based interfaces
Kawahara	2009	New perspectives on spoken language understanding: Does machine need to fully understand speech?
Romero-González et al.	2020	Spoken language understanding for social robotics
Zribi et al.	2022	Toward developing an intelligent personal assistant for Tunisian Arabic
Lin	2024	Reinforcement Learning in spoken language understanding (SLU): giving machines an ear for understanding
Petukhova et al.	2014	Incremental recognition and prediction of dialogue acts
Wu et al.	2019	Chinese spoken dialog system
Awino	2022	Swahili Conversational Ai Voicebot for Customer Support
Stoyanchev	2009	Impact of responsive and directive adaptation on local dialog processing
Cho et al.	2020	Discourse component-based argument extraction of Seoul Korean directives
Saulwick et al.	2012	A Spoken Dialogue System for Command and Control
Yoshino	2014	Spoken Dialogue System for Information Navigation based on Statistical Learning of Semantic and Dialogue Structure
Randhawa et al.	2008	Speech and Language Processing: A Conceptual Framework
Ahmad	2012	Framework for Human Computer Interaction for Learning Dialogue Strategies using Controlled Natural Language in Information Systems
Wang et al.	2009	An Online Algorithm for Applying Reinforcement Learning to Handle Ambiguity in Spoken Dialogues
Stoyanchev	2008	Exploring Adaptation in Dialog Systems