Zairi - Google Patents
Management of the Text and Speech Resources in the Rapid Language Adaptation Toolkit
- Document ID
- 2972600503433499443
- Author
- Zairi A
Snippet
This work describes our extension of the Rapid Language Adaptation Toolkit (RLAT) to share text and speech resources with other users. RLAT aims to significantly reduce the amount of time and effort involved in building speech processing systems for new languages …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/06—Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
- G09B5/065—Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
Similar Documents
Publication | Title |
---|---|
Michaud et al. | Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the Persephone toolkit |
CN102084417B (en) | System and methods for maintaining speech-to-speech translation in the field |
WO2007027817A1 (en) | Incorporation of speech engine training into interactive user tutorial |
Gibbon et al. | Spoken language system and corpus design |
Sangeetha et al. | Speech translation system for english to dravidian languages |
El Ouahabi et al. | Toward an automatic speech recognition system for amazigh-tarifit language |
James et al. | Developing resources for te reo Māori text to speech synthesis system |
Anastasopoulos | Computational tools for endangered language documentation |
Melnik-Leroy et al. | An overview of Lithuanian intonation: a linguistic and modelling perspective |
MacWhinney et al. | Fostering human rights through TalkBank |
Cibrian et al. | Limitations in speech recognition for young adults with down syndrome |
Vitevitch et al. | Speech error and tip of the tongue diary for mobile devices |
Narayanan et al. | Transonics: A speech to speech system for English-Persian interactions |
Trivedi | Fundamentals of Natural Language Processing |
Coto-Solano et al. | Managing data workflows for untrained forced alignment: examples from Costa Rica, Mexico, the Cook Islands, and Vanuatu |
Draxler | Automatic Transcription of Spoken Language Using Publicly Available Web Services |
Cucchiarini et al. | The JASMIN speech corpus: recordings of children, non-natives and elderly people |
Zairi | Management of the Text and Speech Resources in the Rapid Language Adaptation Toolkit |
MacWhinney | Understanding Language Through TalkBank |
Sefara et al. | The development of an automatic pronunciation assistant |
Müller et al. | Segments, letters and gestures: thoughts on doing and teaching phonetics and transcription |
Carson-Berndsen | Multilingual time maps: portable phonotactic models for speech technology |
Kumar et al. | Voice to Text Summarization Using NLP |
Jeevitha et al. | A study on innovative trends in multimedia library using speech enabled softwares |
Baumann et al. | The spoken wikipedia corpus collection |