Read et al., 2007 - Google Patents

Automatic pitch accent prediction for text-to-speech synthesis.

Read et al., 2007

Document ID: 3213810039649384236
Author: Read I; Cox S
Publication year: 2007
Publication venue: Interspeech

External Links

Cited by

Snippet

Determining pitch accents in a sentence is a key task for a textto-speech (TTS) system. We describe some methods for pitch accent assignment which make use of features that contain information about a complete phrase or sentence, in contrast to most previous work which …

Continue reading at www.researchgate.net (PDF) (other versions)

230000015572 biosynthetic process 0 title description 5

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser

Similar Documents

Publication	Publication Date	Title
CN108304468B (en)	2021-12-07	Text classification method and text classification device
US9176946B2 (en)	2015-11-03	System and method of extracting clauses for spoken language understanding
KR100509797B1 (en)	2005-08-23	Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word
US9484020B2 (en)	2016-11-01	System and method of extracting clauses for spoken language understanding
US20030046078A1 (en)	2003-03-06	Supervised automatic text generation based on word classes for language modeling
Watts	2013	Unsupervised learning for text-to-speech synthesis
Araujo	2002	Part-of-speech tagging with evolutionary algorithms
Belay et al.	2021	Impacts of homophone normalization on semantic models for amharic
Mishra et al.	2015	Intonational phrase break prediction for text-to-speech synthesis using dependency relations
Read et al.	2007	Automatic pitch accent prediction for text-to-speech synthesis.
Chen et al.	2011	The ustc system for blizzard challenge 2011
Tamiru et al.	2021	Sentence-level automatic speech segmentation for amharic
Sawalha et al.	2012	Prosody prediction for arabic via the open-source boundary-annotated qur’an corpus
Jauk et al.	2018	Expressive speech synthesis using sentiment embeddings
Dwivedi et al.	2023	Develo ping Chunker for Indian Regional Songs using HMM
Cohen	2004	A survey of machine learning methods for predicting prosody in radio speech
Zhao et al.	2015	Active learning for the prediction of prosodic phrase boundaries in chinese speech synthesis systems using conditional random fields
Moisa et al.	2001	Speech synthesis using neural networks trained by an evolutionary algorithm
Yan-qiu et al.	2006	Comparison of approaches for predicting break indices in mandarin speech synthesis
Kallimani	2012	Normalization of non standard words for Kannada speech synthesis
Zervas et al.	2004	Evaluation of corpus based tone prediction in mismatched environments for greek tts synthesis.
Kim et al.	2009	Prediction of Korean prosodic phrase boundary by efficient feature selection in machine learning
Shao et al.	2005	Using different models to label the break indices for mandarin speech synthesis
Kim et al.	2007	Decision‐Tree‐Based Markov Model for Phrase Break Prediction
Pradheeba et al.	2021	Effective Cataloging over Diverse Algorithms for Automatic Text Summarization and Its Survey