Turunen et al., 2004 - Google Patents

Speech application design and development

Turunen et al., 2004

Document ID: 16045608177094206382
Author: Turunen M; Turunen C
Publication year: 2004
Publication venue: On-line book

External Links

Cited by

Snippet

The use of speech is expected to make the human-computer interaction more natural and efficient than it has been so far. Many successful speech applications have been constructed, but speech is still far from being the most common input and output modalities …

Continue reading at www.researchgate.net (PDF) (other versions)

238000011161 development 0 title abstract description 48

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification

Similar Documents

Publication	Publication Date	Title
Zue et al.	2002	Conversational interfaces: Advances and challenges
Pietquin	2005	A framework for unsupervised learning of dialogue strategies
Cole et al.	1995	The challenge of spoken language systems: Research directions for the nineties
Wilpon et al.	1994	Voice communication between humans and machines
Klemmer et al.	2000	Suede: a wizard of oz prototyping tool for speech user interfaces
Gustafson et al.	1999	The august spoken dialogue system.
Gustafson et al.	2000	AdApt—a multimodal conversational dialogue system in an apartment domain
Delgado et al.	2007	Spoken, multilingual and multimodal dialogue systems: development and assessment
Aylett et al.	2021	Building and designing expressive speech synthesis
Gustafson et al.	2000	Speech technology on trial: Experiences from the August system
El Ouahabi et al.	2019	Toward an automatic speech recognition system for amazigh-tarifit language
Fellbaum et al.	2008	Principles of electronic speech processing with applications for people with disabilities
McTear	2022	Rule-based dialogue systems: Architecture, methods, and tools
Gustafson et al.	1999	Experiences from the development of August-a multi-modal spoken dialogue system
Ifeanyi et al.	2014	Text–To–Speech Synthesis (TTS)
Gilbert et al.	2005	Intelligent virtual agents for contact center automation
Stifelman	1995	A tool to support speech and non-speech audio feedback generation in audio interfaces
Kawahara	2001	Domain-independent spoken dialogue platform using key-phrase spotting based on combined language model
Ward et al.	2003	Hands-free documentation
Jackson	2005	Automatic speech recognition: Human computer interface for kinyarwanda language
Turunen et al.	2004	Speech application design and development
Kumar et al.	2014	Bridging the gap between disabled people and new technology in interactive web application with the help of voice
Apaydin	2002	Networked humanoid animation driven by human voice using extensible 3d (x3d), h-anim and java speech open standards
Vesnicer et al.	2003	A voice-driven web browser for blind people.
Rudnicky	1995	The design of spoken language interfaces