Automatic pitch accent prediction for text-to-speech synthesis
Read et al., 2007
- Document ID
- 3213810039649384236
- Author
- Read I
- Cox S
- Publication year
- 2007
- Publication venue
- Interspeech
Snippet
Determining pitch accents in a sentence is a key task for a text-to-speech (TTS) system. We describe some methods for pitch accent assignment which make use of features that contain information about a complete phrase or sentence, in contrast to most previous work which …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G06F17/2715—Statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G06F17/277—Lexical analysis, e.g. tokenisation, collocates
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G06F17/30657—Query processing
- G06F17/30675—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30705—Clustering or classification
- G06F17/3071—Clustering or classification including class or cluster creation or modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2809—Data driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/28—Processing or translating of natural language
- G06F17/2872—Rule based translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
Similar Documents
| Publication | Title |
|---|---|
| CN108304468B (en) | Text classification method and text classification device |
| US9176946B2 (en) | System and method of extracting clauses for spoken language understanding |
| KR100509797B1 (en) | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
| US9484020B2 (en) | System and method of extracting clauses for spoken language understanding |
| US20030046078A1 (en) | Supervised automatic text generation based on word classes for language modeling |
| Watts | Unsupervised learning for text-to-speech synthesis |
| Araujo | Part-of-speech tagging with evolutionary algorithms |
| Belay et al. | Impacts of homophone normalization on semantic models for Amharic |
| Mishra et al. | Intonational phrase break prediction for text-to-speech synthesis using dependency relations |
| Read et al. | Automatic pitch accent prediction for text-to-speech synthesis |
| Chen et al. | The USTC system for Blizzard Challenge 2011 |
| Tamiru et al. | Sentence-level automatic speech segmentation for Amharic |
| Sawalha et al. | Prosody prediction for Arabic via the open-source boundary-annotated Qur’an corpus |
| Jauk et al. | Expressive speech synthesis using sentiment embeddings |
| Dwivedi et al. | Developing Chunker for Indian Regional Songs using HMM |
| Cohen | A survey of machine learning methods for predicting prosody in radio speech |
| Zhao et al. | Active learning for the prediction of prosodic phrase boundaries in Chinese speech synthesis systems using conditional random fields |
| Moisa et al. | Speech synthesis using neural networks trained by an evolutionary algorithm |
| Yan-qiu et al. | Comparison of approaches for predicting break indices in Mandarin speech synthesis |
| Kallimani | Normalization of non standard words for Kannada speech synthesis |
| Zervas et al. | Evaluation of corpus based tone prediction in mismatched environments for Greek TTS synthesis |
| Kim et al. | Prediction of Korean prosodic phrase boundary by efficient feature selection in machine learning |
| Shao et al. | Using different models to label the break indices for Mandarin speech synthesis |
| Kim et al. | Decision-Tree-Based Markov Model for Phrase Break Prediction |
| Pradheeba et al. | Effective Cataloging over Diverse Algorithms for Automatic Text Summarization and Its Survey |