Zairi - Google Patents

Managemant of the Text and Speech Resources in the Rapid Language Adaptation Toolkit

Zairi

Document ID: 2972600503433499443
Author: Zairi A

External Links

Cited by

Snippet

This work describes our extension of the Rapid Language Adaptation Toolkit (RLAT) to share text and speech resources with other users. RLAT aims to significantly reduce the amount of time and effort involved in building speech processing systems for new languages …

Continue reading at csl.uni-bremen.de (PDF) (other versions)

230000004301 light adaptation 0 title abstract description 9

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/289—Use of machine translation, e.g. multi-lingual retrieval, server side translation for client devices, real-time translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
- G10L15/265—Speech recognisers specially adapted for particular applications
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/06—Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
- G09B5/065—Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems

Similar Documents

Publication	Publication Date	Title
Michaud et al.	2018	Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the Persephone toolkit
CN102084417B (en)	2014-05-21	System and methods for maintaining speech-to-speech translation in the field
WO2007027817A1 (en)	2007-03-08	Incorporation of speech engine training into interactive user tutorial
Gibbon et al.	2013	Spoken language system and corpus design
Sangeetha et al.	2017	Speech translation system for english to dravidian languages
El Ouahabi et al.	2019	Toward an automatic speech recognition system for amazigh-tarifit language
James et al.	2020	Developing resources for te reo Māori text to speech synthesis system
Anastasopoulos	2019	Computational tools for endangered language documentation
Melnik-Leroy et al.	2022	An overview of Lithuanian intonation: a linguistic and modelling perspective
MacWhinney et al.	2018	Fostering human rights through TalkBank
Cibrian et al.	2025	Limitations in speech recognition for young adults with down syndrome
Vitevitch et al.	2015	Speech error and tip of the tongue diary for mobile devices
Narayanan et al.	2003	Transonics: A speech to speech system for English-Persian interactions
Trivedi	2023	Fundamentals of Natural Language Processing
Coto-Solano et al.	2022	Managing data workflows for untrained forced alignment: examples from Costa Rica, Mexico, the Cook Islands, and Vanuatu
Draxler	2022	Automatic Transcription of Spoken Language Using Publicly Available Web Services
Cucchiarini et al.	2012	The JASMIN speech corpus: recordings of children, non-natives and elderly people
Zairi	0	Managemant of the Text and Speech Resources in the Rapid Language Adaptation Toolkit
MacWhinney	2025	Understanding Language Through TalkBank
Sefara et al.	2019	The development of an automatic pronunciation assistant
Müller et al.	2011	Segments, letters and gestures: thoughts on doing and teaching phonetics and transcription
Carson-Berndsen	2002	Multilingual time maps: portable phonotactic models for speech technology
Kumar et al.	2024	Voice to Text Summarization Using NLP
Jeevitha et al.	2018	A study on innovative trends in multimedia library using speech enabled softwares
Baumann et al.	2018	The spoken wikipedia corpus collection