
CN1292400C - Expression figure explanation treatment method for text and voice transfer system - Google Patents

Expression figure explanation treatment method for text and voice transfer system Download PDF

Info

Publication number
CN1292400C
CN1292400C (application CNB2004100781977A / CN200410078197A)
Authority
CN
China
Prior art keywords
expression
explanation
text
voice
character string
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2004100781977A
Other languages
Chinese (zh)
Other versions
CN1655231A (en)
Inventor
姜容成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics China Research and Development Center Co Ltd
Original Assignee
LG Electronics China Research and Development Center Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics China Research and Development Center Co Ltd filed Critical LG Electronics China Research and Development Center Co Ltd
Publication of CN1655231A publication Critical patent/CN1655231A/en
Application granted granted Critical
Publication of CN1292400C publication Critical patent/CN1292400C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • A HUMAN NECESSITIES
    • A23 FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
    • A23L FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES, NOT OTHERWISE PROVIDED FOR; PREPARATION OR TREATMENT THEREOF
    • A23L7/00 Cereal-derived products; Malt products; Preparation or treatment thereof
    • A23L7/10 Cereal-derived products
    • A HUMAN NECESSITIES
    • A23 FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
    • A23P SHAPING OR WORKING OF FOODSTUFFS, NOT FULLY COVERED BY A SINGLE OTHER SUBCLASS
    • A23P30/00 Shaping or working of foodstuffs characterised by the process or apparatus
    • A23P30/20 Extruding

Landscapes

  • Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Food Science & Technology (AREA)
  • Polymers & Plastics (AREA)
  • Manufacturing & Machinery (AREA)
  • Health & Medical Sciences (AREA)
  • Nutrition Science (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention relates to a technique by which a text-to-speech (TTS) system, when converting a character string into a voice signal, outputs a corresponding pronunciation whenever an emoticon is found. The method consists of three steps: a first step of outputting each emoticon contained in the character string with its corresponding pronunciation while performing the linguistic processing needed to convert the character string input to the TTS engine into a voice signal; a second step of determining sound-related information such as pitch and duration in order to fix the rhythm of the sentence of the voice signal, and then adjusting that rhythm appropriately according to the emotion the emoticon expresses; and a third step of outputting the rhythm-adjusted voice signal to the outside.

Description

Emoticon processing method for a text-to-speech conversion system
Technical field
The present invention relates to techniques by which a text-to-speech (TTS: Text To Speech) system handles emoticons (expression icons), and more particularly to an emoticon processing method for a TTS system that, when the TTS engine converts a character string into a voice signal, outputs the corresponding pronunciation whenever an emoticon is found.
Background of invention
A TTS system converts character strings into human-like speech; its basic goal is to let people listen to text made of character strings with their ears rather than read it with their eyes. TTS technology is closer to everyday life than speech recognition technology and can be used in services that convert various kinds of text information into speech. Recently, with the spread of e-mail, one can use a telephone to hear newly received mail read aloud while away from home, a service that also benefits from TTS technology. In addition, TTS can read aloud sentences entered in a word processor or HTML documents displayed on screen by a web browser. For visually impaired people, information on the Internet can be converted into speech and listened to, so they can obtain useful information just as sighted people do. Recently, synthesized speech has moved beyond the mechanical-sounding level of the past to technology that produces speech resembling a human voice, and services using TTS technology show a trend of gradually expanding toward the general public.
Yet the language people use is alive and constantly changing, and as thoughts are now exchanged in writing over various networks, the pace of that change is accelerating day by day.
Recently, in fields such as computer-mediated communication, emoticons are used with increasing frequency. An "emoticon" expresses the user's emotion or intent; the word blends "emotion" and "icon", and emoticons are composed from the various symbols and letters on an ordinary keyboard. For example, a smiling face can be written as :) or :-), which looks like a smile when viewed with the head tilted to the left. Its first use is commonly attributed to Scott Fahlman of Carnegie Mellon University in the 1980s. Emoticons soften computer communication, which easily becomes stiff and rigid, guiding it into a realm full of playfulness and making communication between person and machine more humane.
However, TTS systems under the prior art can only convert ordinary text into speech; emoticons are processed as plain punctuation or meaningless symbols, so such systems have difficulty fully conveying a document's content to the user.
Summary of the invention
Therefore, the object of the present invention is to overcome the deficiencies of the prior art by providing an emoticon processing method for a TTS system.
To achieve the above object, the emoticon processing method of the TTS system of the present invention consists of the following three steps. In the first step, in order to convert the character string input to the TTS engine into a voice signal, linguistic processing is performed, including sentence processing of the text, handling of non-Chinese text, part-of-speech analysis (verbs, adjectives, and so on), syntactic analysis, and conversion to pronunciation symbols, while each emoticon contained in the character string is output with its corresponding pronunciation. In the second step, sound-related information such as pitch and duration is determined in order to fix the rhythm of the sentence to be converted into said voice signal, and the rhythm is then adjusted appropriately according to the emotion the emoticon expresses. In the third step, an actual speech signal is generated from the speech database, D/A-converted, and amplified.
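The three-step flow described above can be sketched as a minimal pipeline. All names below (`EMOTICON_PRONUNCIATIONS`, `linguistic_step`, `prosody_step`, `synthesis_step`) and all numeric values are hypothetical illustrations, not taken from the patent:

```python
# Illustrative sketch of the three-step method: (1) linguistic processing
# that speaks emoticons, (2) prosody chosen per emotion, (3) synthesis.

EMOTICON_PRONUNCIATIONS = {
    "^^": "laughed", ":)": "laughed", "-.-": "expressionless",
}

def linguistic_step(text: str) -> tuple[str, list[str]]:
    """Step 1: replace each known emoticon with its pronunciation."""
    found = []
    for emo, word in EMOTICON_PRONUNCIATIONS.items():
        if emo in text:
            found.append(emo)
            text = text.replace(emo, " " + word + " ")
    return " ".join(text.split()), found

def prosody_step(words: str, emoticons: list[str]) -> dict:
    """Step 2: pick pitch/duration, nudged by the emoticon's emotion."""
    happy = any(EMOTICON_PRONUNCIATIONS[e] == "laughed" for e in emoticons)
    return {"text": words,
            "pitch_hz": 220.0 * (1.1 if happy else 1.0),
            "duration_scale": 0.9 if happy else 1.0}

def synthesis_step(plan: dict) -> bytes:
    """Step 3 stand-in: a real engine would render audio from the plan."""
    return plan["text"].encode("utf-8")  # placeholder for PCM samples

plan = prosody_step(*linguistic_step("See you tomorrow ^^"))
print(plan["text"])  # See you tomorrow laughed
```

The point of the sketch is only the data flow: the emoticon survives as a spoken word, and its emotion biases the prosody plan before synthesis.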
As described in detail above, the present invention has the following effect: when the TTS engine converts a character string into a voice signal and finds an emoticon, it uses an emoticon pronunciation dictionary to output the sound corresponding to that emoticon, so that text containing emoticons is converted into the corresponding sounds and its content is conveyed directly.
Description of drawings
Fig. 1 is a block diagram of a system implementing the emoticon processing method of the present invention;
Fig. 2 is a table of sample pronunciations from the emoticon pronunciation dictionary of Fig. 1.
The symbol description of accompanying drawing major part
1: text input part 2: linguistic processing part
3: prosody processing part 4: voice signal processing part
5: voice signal output part 6: dictionary part
7: emoticon pronunciation dictionary 8: speech database
Embodiment
Referring now to Fig. 1 and Fig. 2, the emoticon processing steps of the present invention are described in detail.
After a character string is input from an external device or internal memory into the text input part 1 of the TTS engine, the linguistic processing part 2, in order to convert it into a voice signal, refers to the various data in the number/abbreviation dictionary, part-of-speech dictionary, and pronunciation dictionary of dictionary part 6 and performs operations such as sentence processing of the text, handling of non-Chinese text, morphological analysis, syntactic analysis, and conversion to pronunciation symbols.
At this point, the linguistic processing part 2 does not reduce an emoticon to a plain symbol; instead, after using the emoticon pronunciation dictionary 7 to recognize each emoticon contained in the character string, it outputs the emoticon with the pronunciation recorded in dictionary 7.
For reference, Fig. 2 shows example pronunciations for the emoticons recorded in the emoticon pronunciation dictionary 7. For instance, emoticons such as ^^, ^-^, :), ^o^, and ^_^ are pronounced "laughed". As another example, emoticons such as -.- and -- are pronounced "expressionless".
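A minimal version of the dictionary lookup that Fig. 2 illustrates might look like the following. The dictionary entries are modeled on the examples above, and the helper name `speak_emoticons` is an assumption for illustration; the patent specifies only that the dictionary maps each emoticon to a pronunciation:

```python
import re

# Sample entries modeled on Fig. 2: several emoticons can map to the
# same spoken form.
EMOTICON_DICT = {
    "^^": "laughed", "^-^": "laughed", ":)": "laughed",
    "^o^": "laughed", "^_^": "laughed",
    "-.-": "expressionless", "--": "expressionless",
}

# Longest-first alternation so that, e.g., "^-^" matches before "^^".
_PATTERN = re.compile("|".join(
    re.escape(e) for e in sorted(EMOTICON_DICT, key=len, reverse=True)))

def speak_emoticons(text: str) -> str:
    """Replace each emoticon with the pronunciation recorded for it,
    instead of dropping it as a meaningless symbol."""
    return _PATTERN.sub(lambda m: EMOTICON_DICT[m.group(0)], text)

print(speak_emoticons("good night ^_^"))  # good night laughed
```

Sorting the alternation longest-first matters because many emoticons are prefixes or substrings of one another.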
Next, the prosody processing part 3 determines pitch, sound length, and so on in order to fix the rhythm with which the sentence is to be spoken. At this point the rhythm is adjusted appropriately according to the emotion expressed by the emoticon.
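One way to picture this adjustment is as emotion-dependent scaling of baseline pitch and duration. The emotion labels and scaling factors below are invented for illustration; the patent does not specify concrete values:

```python
# Hypothetical prosody tweak: scale a baseline pitch and duration by
# the emotion an emoticon expresses.
ADJUSTMENTS = {
    "laughed":        {"pitch": 1.15, "duration": 0.90},  # brighter, quicker
    "expressionless": {"pitch": 0.95, "duration": 1.05},  # flatter, slower
}

def adjust_prosody(base_pitch_hz: float, base_dur_s: float, emotion: str):
    """Return (pitch, duration) nudged by the emotion; unknown emotions
    leave the baseline untouched."""
    adj = ADJUSTMENTS.get(emotion, {"pitch": 1.0, "duration": 1.0})
    return base_pitch_hz * adj["pitch"], base_dur_s * adj["duration"]

pitch, dur = adjust_prosody(200.0, 2.0, "laughed")
```

Falling back to neutral scaling for unknown emotions keeps ordinary text unaffected, which matches the patent's intent that only emoticon-bearing sentences get their rhythm adjusted.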
Then, the voice signal processing part 4 generates an actual speech signal with reference to the speech data in speech database 8, and the voice signal output part 5 performs D/A conversion on the generated signal while amplifying it to a suitable level so that it can be heard.
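The final stage, turning a planned signal into audible output, can be sketched with standard-library tools. A real system would render from a speech database and use hardware D/A conversion and amplification; the tone generator below is only a stand-in:

```python
import math
import struct

def render_tone(pitch_hz: float, duration_s: float,
                sample_rate: int = 16000, level: float = 0.8) -> bytes:
    """Stand-in for parts 4 and 5 of Fig. 1: synthesize samples, scale
    them to a suitable level, and quantize to 16-bit little-endian PCM."""
    n = int(sample_rate * duration_s)
    samples = (level * math.sin(2 * math.pi * pitch_hz * i / sample_rate)
               for i in range(n))
    return struct.pack("<%dh" % n, *(int(s * 32767) for s in samples))

pcm = render_tone(220.0, 0.5)
print(len(pcm))  # 16000 bytes: 8000 samples x 2 bytes each
```

The `level` parameter plays the role of the amplification step: it scales the waveform before quantization, analogous to amplifying the analog signal to a level the listener can hear.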

Claims (5)

1. An emoticon processing method for a text-to-speech conversion system, characterized by comprising the following steps:
a first step of outputting, with its corresponding pronunciation, each emoticon contained in a character string while performing linguistic processing in order to convert the character string input to the text-to-speech engine into a voice signal;
a second step of determining sound-related information in order to fix the rhythm of the sentence to be converted into said voice signal, and then adjusting the rhythm appropriately according to the emotion expressed by said emoticon;
a third step of outputting the rhythm-adjusted voice signal to the outside.
2. The emoticon processing method for a text-to-speech conversion system according to claim 1, characterized in that the character string input to the text-to-speech engine is supplied by an external device or internal memory.
3. The emoticon processing method for a text-to-speech conversion system according to claim 1, characterized in that said first step further comprises:
a step of recognizing each emoticon contained in said character string by means of an emoticon pronunciation dictionary, and then outputting it with the pronunciation recorded in that dictionary.
4. The emoticon processing method for a text-to-speech conversion system according to claim 1, characterized in that said sound-related information includes pitch and duration.
5. The emoticon processing method for a text-to-speech conversion system according to claim 3, characterized in that said emoticon pronunciation dictionary stores the pronunciation corresponding to each emoticon.
CNB2004100781977A 2004-02-10 2004-09-17 Expression figure explanation treatment method for text and voice transfer system Expired - Fee Related CN1292400C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
KR1020040008771 2004-02-10
KR1020040008771A KR20050080671A (en) 2004-02-10 2004-02-10 Emoticon processing method for text to speech system
KR10-2004-0008771 2004-02-10

Publications (2)

Publication Number Publication Date
CN1655231A CN1655231A (en) 2005-08-17
CN1292400C true CN1292400C (en) 2006-12-27

Family

ID=34909935

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100781977A Expired - Fee Related CN1292400C (en) 2004-02-10 2004-09-17 Expression figure explanation treatment method for text and voice transfer system

Country Status (2)

Country Link
KR (1) KR20050080671A (en)
CN (1) CN1292400C (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101217685A (en) * 2007-01-04 2008-07-09 中兴通讯股份有限公司 Method and device for sending MMS via text message
CN101072258B (en) * 2007-03-29 2012-06-06 腾讯科技(深圳)有限公司 Voice on demand method and device
CN104053131A (en) * 2013-03-12 2014-09-17 华为技术有限公司 Text communication information processing method and related equipment
CN103761963A (en) * 2014-02-18 2014-04-30 大陆汽车投资(上海)有限公司 Method for processing text containing emotion information
CN104699675B (en) * 2015-03-18 2018-01-30 北京交通大学 The method and apparatus of translation information
CN105139848B (en) * 2015-07-23 2019-01-04 小米科技有限责任公司 Data transfer device and device
CN106951105A (en) * 2017-03-03 2017-07-14 深圳市联谛信息无障碍有限责任公司 A kind of method that use Barrier-free Service reads emoticon
CN110189742B (en) * 2019-05-30 2021-10-08 芋头科技(杭州)有限公司 Method and related device for determining emotion audio frequency, emotion display and text-to-speech
CN118541750A (en) 2022-01-11 2024-08-23 三星电子株式会社 Method and electronic device for intelligently reading aloud displayed content
WO2023136605A1 (en) * 2022-01-11 2023-07-20 Samsung Electronics Co., Ltd. Method and electronic device for intelligently reading displayed contents

Also Published As

Publication number Publication date
KR20050080671A (en) 2005-08-17
CN1655231A (en) 2005-08-17

Similar Documents

Publication Publication Date Title
US7062437B2 (en) Audio renderings for expressing non-audio nuances
CN108962219B (en) method and device for processing text
CN111223498A (en) Intelligent emotion recognition method and device and computer readable storage medium
Wilpon et al. Voice communication between humans and machines
US9240180B2 (en) System and method for low-latency web-based text-to-speech without plugins
US20060069567A1 (en) Methods, systems, and products for translating text to speech
US20100217591A1 (en) Vowel recognition system and method in speech to text applictions
WO2016159961A1 (en) Voice driven operating system for interfacing with electronic devices
JP2004355629A (en) Semantic object synchronous understanding for highly interactive interface
GB2307619A (en) Internet information access system
CN1292400C (en) Expression figure explanation treatment method for text and voice transfer system
EP1371057A1 (en) Method for enabling the voice interaction with a web page
EP1685556A1 (en) Audio dialogue system and voice browsing method
CN118136047A (en) A speech sentiment analysis method based on semantic intonation
CN104050962B (en) Multifunctional Reader Based on Speech Synthesis Technology
CN115249480A (en) Method and related device for converting speech and text based on Beidou short message
Trivedi Fundamentals of Natural Language Processing
JP6289950B2 (en) Reading apparatus, reading method and program
Kehoe et al. Designing help topics for use with text-to-speech
CN118312040A (en) Virtual digital person interaction method, device, equipment and storage medium
JP2003044072A (en) Voice reading setting device, voice reading device, voice reading setting method, voice reading setting program, and recording medium
Spiliotopoulos et al. Acoustic rendering of data tables using earcons and prosody for document accessibility
Sasmal et al. Acoustic and spectral analysis of adi triphthongs
Shanavas Malayalam Text-to-Speech Conversion: An Assistive Tool for Visually Impaired People.
Jeevitha et al. A study on innovative trends in multimedia library using speech enabled softwares

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20061227

Termination date: 20091019