JP2016114729A

JP2016114729A - Text message voicing device, text message voicing method, and text message voicing program

Info

Publication number: JP2016114729A
Application number: JP2014252654A
Authority: JP
Inventors: 裕生渡邉; Hiroo Watanabe
Original assignee: JVCKenwood Corp
Current assignee: JVCKenwood Corp
Priority date: 2014-12-15
Filing date: 2014-12-15
Publication date: 2016-06-23
Anticipated expiration: 2034-12-15
Also published as: JP6428229B2

Abstract

【課題】同じ話者モデルデータを用いたとしても、作成者が異なる複数のテキストメッセージを聴覚的に異なる状態で音声化することができるテキストメッセージ音声化装置を提供する。【解決手段】メッセージデータ取得部１０２は、テキストメッセージとメッセージ作成者を示す情報とを含むメッセージデータを取得する。記憶部１０３は、テキストメッセージの読み上げ速度と、読み上げ音程と、読み上げ音量と、音声出力定位とのうちの少なくとも１つのパラメータの設定値を、メッセージ作成者に対応させて記憶する。メッセージデータ取得部１０２がメッセージデータを取得したら、制御部１０１は、記憶部１０３より、メッセージ作成者に対応したパラメータの設定値を読み出す。音声変換部１０４は、テキストメッセージを話者モデルデータと、パラメータの設定値とを用いて音声データに変換する。【選択図】図１Provided is a text message speechization device that can synthesize a plurality of text messages of different creators in an audibly different state even when the same speaker model data is used. A message data acquisition unit acquires message data including a text message and information indicating a message creator. The storage unit 103 stores a setting value of at least one parameter among a text message reading speed, a reading pitch, a reading volume, and a voice output localization in association with the message creator. When the message data acquisition unit 102 acquires the message data, the control unit 101 reads the parameter setting value corresponding to the message creator from the storage unit 103. The voice conversion unit 104 converts the text message into voice data using the speaker model data and parameter setting values. [Selection] Figure 1

Description

本発明は、テキストメッセージを音声合成によって音声化するテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムに関する。 The present invention relates to a text message speechization device, a text message speechization method, and a text message speechization program for speechizing a text message by speech synthesis.

テキストメッセージを音声合成によって音声化するテキストメッセージ音声化装置は、カーナビゲーション装置等の各種の装置で用いられている。テキストメッセージを音声合成によって音声化するためには、話者モデルデータが必要である。 A text message speech converting apparatus that synthesizes a text message by speech synthesis is used in various apparatuses such as a car navigation apparatus. To make a text message into speech by speech synthesis, speaker model data is required.

話者モデルデータは、テキストメッセージを音声に変換するために必要な音素、単語、文節等の多数の音声要素データを含むため、データ量が膨大である。また、１つの話者モデルデータを作成するには多大な工数が必要となる。 The speaker model data includes a large amount of data because it includes a large number of speech element data such as phonemes, words, and phrases necessary for converting a text message into speech. Also, a great deal of man-hour is required to create one speaker model data.

特開２０１０−１０２１６３号公報JP 2010-102163 A

複数の作成者それぞれによって作成されたテキストメッセージをテキストメッセージ音声化装置によって音声化する場合がある。このような場合、複数の作成者のテキストメッセージそれぞれに対して用いる話者モデルデータを異ならせれば、複数の作成者の声や話し方が異なって、作成者の違いを認識することができる。 In some cases, a text message created by each of a plurality of creators is voiced by a text message voice device. In such a case, if the speaker model data used for each of the text messages of a plurality of creators is different, the voices and manners of speaking of the creators are different, and the difference between the creators can be recognized.

ところが、上記のように話者モデルデータはデータ量が膨大であり、話者モデルデータを作成するには多大の工数が必要であるから、複数の作成者それぞれに対応させた話者モデルデータを用意し、テキストメッセージ音声化装置に記憶させておくことは困難である。 However, as described above, the speaker model data has an enormous amount of data, and it takes a lot of man-hours to create the speaker model data. It is difficult to prepare and memorize it in the text message voicing apparatus.

本発明はこのような問題点に鑑み、同じ話者モデルデータを用いたとしても、作成者が異なる複数のテキストメッセージを聴覚的に異なる状態で音声化することができるテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムを提供することを目的とする。 In view of such problems, the present invention provides a text message speechization device, a text that can audibly utter a plurality of different text messages in different states even when the same speaker model data is used. An object of the present invention is to provide a message sounding method and a text message sounding program.

本発明は、上述した従来の技術の課題を解決するため、テキストメッセージと、前記テキストメッセージを作成したメッセージ作成者を示す情報とを含むメッセージデータを取得するメッセージデータ取得部と、前記テキストメッセージを所定の話者モデルデータを用いて音声データに変換する音声変換部と、前記テキストメッセージの読み上げ速度と、読み上げ音程と、読み上げ音量と、前記音声データによる音声を複数のスピーカより音声を出力する際のバランスを示す音声出力定位とのうちの少なくとも１つのパラメータの設定値を含む音声変換参照データを、メッセージ作成者ごとに異なるように、前記メッセージ作成者に対応させて記憶する記憶部と、前記メッセージデータ取得部が所定のメッセージデータを取得したとき、前記記憶部より、前記所定のメッセージデータのメッセージ作成者に対応して記憶されている音声変換参照データに含まれるパラメータの設定値を読み出し、前記音声変換部が、前記所定のメッセージデータに含まれているテキストメッセージを、前記話者モデルデータと、前記パラメータの設定値とを用いて音声データに変換するよう、前記音声変換部を制御する制御部とを備えることを特徴とするテキストメッセージ音声化装置を提供する。 In order to solve the above-described problems of the related art, the present invention provides a message data acquisition unit that acquires message data including a text message and information indicating a message creator who created the text message, and the text message A voice conversion unit for converting into voice data using predetermined speaker model data, a reading speed of the text message, a reading pitch, a reading volume, and a voice output from a plurality of speakers; Storage unit that stores voice conversion reference data including a setting value of at least one parameter of the voice output localization indicating the balance of the message corresponding to the message creator so as to be different for each message creator; When the message data acquisition unit acquires predetermined message data, A setting value of a parameter included in the voice conversion reference data stored corresponding to the message creator of the predetermined message data is read from the storage unit, and the voice conversion unit is included in the predetermined message data. A text message voicing apparatus comprising: a control unit that controls the voice conversion unit so as to convert the text message into voice data using the speaker model data and the set value of the parameter I will provide a.

また、本発明は、上述した従来の技術の課題を解決するため、テキストメッセージと、前記テキストメッセージを作成したメッセージ作成者を示す情報とを含む所定のメッセージデータを取得し、前記テキストメッセージの読み上げ速度と、読み上げ音程と、読み上げ音量と、音声を複数のスピーカより音声を出力する際のバランスを示す音声出力定位とのうちの少なくとも１つのパラメータの設定値を含む音声変換参照データが、メッセージ作成者ごとに異なるように、前記メッセージ作成者に対応させて記憶されている記憶部より、前記メッセージ作成者に対応する前記パラメータの設定値を読み出し、前記所定のメッセージデータに含まれているテキストメッセージを、所定の話者モデルデータと、前記記憶部より読み出した前記パラメータの設定値とを用いて音声データに変換することを特徴とするテキストメッセージ音声化方法を提供する。 In addition, in order to solve the above-described problems of the conventional technology, the present invention acquires predetermined message data including a text message and information indicating a message creator who created the text message, and reads out the text message. Voice conversion reference data including a setting value of at least one parameter of speed, reading pitch, reading volume, and voice output localization indicating a balance when voice is outputted from a plurality of speakers is created as a message. The setting value of the parameter corresponding to the message creator is read from the storage unit stored corresponding to the message creator, and the text message included in the predetermined message data is different for each message creator. Are the predetermined speaker model data and the parameters read from the storage unit. Providing a text message voicing method characterized by converting the audio data by using the set value of over data.

さらに、本発明は、上述した従来の技術の課題を解決するため、コンピュータに、テキストメッセージと、前記テキストメッセージを作成したメッセージ作成者を示す情報とを含むメッセージデータを取得したとき、前記テキストメッセージの読み上げ速度と、読み上げ音程と、読み上げ音量と、音声を複数のスピーカより音声を出力する際のバランスを示す音声出力定位とのうちの少なくとも１つのパラメータの設定値を含む音声変換参照データが、メッセージ作成者ごとに異なるように、前記メッセージ作成者に対応させて記憶されている記憶部より、前記パラメータの設定値を読み出すステップと、前記メッセージデータに含まれているテキストメッセージを、所定の話者モデルデータと、前記記憶部より読み出した前記パラメータの設定値とを用いて音声データに変換するステップとを実行させることを特徴とするテキストメッセージ音声化プログラムを提供する。 Furthermore, in order to solve the above-described problems of the prior art, the present invention obtains message data including a text message and information indicating a message creator who created the text message in a computer. Voice conversion reference data including a setting value of at least one parameter of a reading speed, a reading pitch, a reading volume, and a sound output localization indicating a balance when sound is output from a plurality of speakers, The step of reading the set value of the parameter from the storage unit stored in correspondence with the message creator, and the text message included in the message data, as different for each message creator, Model data and the parameters read from the storage unit Providing a text message audio programs, characterized in that and a step of converting the voice data by using the value.

本発明のテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムによれば、同じ話者モデルデータを用いたとしても、作成者が異なる複数のテキストメッセージを聴覚的に異なる状態で音声化することができる。 According to the text message speech device, text message speech method, and text message speech program of the present invention, even if the same speaker model data is used, a plurality of text messages with different creators are aurally different. Can be voiced.

各実施形態のテキストメッセージ音声化装置を示すブロック図である。It is a block diagram which shows the text message audio | voice sound apparatus of each embodiment. 第１実施形態のテキストメッセージ音声化装置における制御部１０１の機能的な内部構成を示すブロック図である。It is a block diagram which shows the functional internal structure of the control part 101 in the text message speech apparatus of 1st Embodiment. 第１実施形態のテキストメッセージ音声化装置の動作、第１実施形態のテキストメッセージ音声化方法及びテキストメッセージ音声化プログラムによる処理を説明するためのフローチャートである。It is a flowchart for demonstrating the operation | movement of the text message audio | voice sound apparatus of 1st Embodiment, the text message audio | voice sound method of 1st Embodiment, and the process by a text message voice sound program. メッセージデータの形式の一例を示す図である。It is a figure which shows an example of the format of message data. 第１実施形態で用いる音声変換参照データの一例を示す図である。It is a figure which shows an example of the audio | voice conversion reference data used by 1st Embodiment. 音声変換参照データの新規作成及び登録の動作を説明するための図である。It is a figure for demonstrating the operation | movement of new production | generation of voice conversion reference data, and registration. 第２〜第４実施形態で用いる音声変換参照データの一例を示す図である。It is a figure which shows an example of the audio | voice conversion reference data used by 2nd-4th embodiment. 第２実施形態のテキストメッセージ音声化装置の動作、第２実施形態のテキストメッセージ音声化方法及びテキストメッセージ音声化プログラムによる処理を説明するためのフローチャートである。It is a flowchart for demonstrating the operation | movement of the text message audio | voice conversion apparatus of 2nd Embodiment, the text message audio conversion method of 2nd Embodiment, and the process by a text message audio conversion program. 第３実施形態のテキストメッセージ音声化装置における制御部１０１の機能的な内部構成を示すブロック図である。It is a block diagram which shows the functional internal structure of the control part 101 in the text message speech apparatus of 3rd Embodiment. 第３実施形態において、テキストメッセージにメッセージ送信者の紹介文を示す文字列を付加する第１の例を説明するための部分的なフローチャートである。In 3rd Embodiment, it is a partial flowchart for demonstrating the 1st example which adds the character string which shows the message sender's introduction sentence to a text message. 第３実施形態において、テキストメッセージにメッセージ送信者の紹介文を示す文字列を付加する第２の例を説明するための部分的なフローチャートである。In 3rd Embodiment, it is a partial flowchart for demonstrating the 2nd example which adds the character string which shows the message sender's introduction sentence to a text message. 第４実施形態のテキストメッセージ音声化装置における制御部１０１の機能的な内部構成を示すブロック図である。It is a block diagram which shows the functional internal structure of the control part 101 in the text message speechization apparatus of 4th Embodiment.

以下、各実施形態のテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムについて、添付図面を参照して説明する。 In the following, a text message voice conversion device, a text message voice conversion method, and a text message voice conversion program of each embodiment will be described with reference to the accompanying drawings.

＜第１実施形態＞
図１において、テキストメッセージ音声化装置１００には、外部システム２００と、音声出力部３００とが接続されている。テキストメッセージ音声化装置１００と外部システム２００とは、インターネットを介して接続されていてもよい。 <First Embodiment>
In FIG. 1, an external system 200 and an audio output unit 300 are connected to the text message audio device 100. The text message audio device 100 and the external system 200 may be connected via the Internet.

外部システム２００は、例えばインスタントメッセンジャ等のメッセージデータを管理するシステムである。 The external system 200 is a system that manages message data such as an instant messenger, for example.

テキストメッセージ音声化装置１００は、テキストメッセージ音声化装置１００の全体を制御する制御部１０１と、制御部１０１とそれぞれ接続されたメッセージデータ取得部１０２と記憶部１０３と音声変換部１０４とを備える。 The text message speech apparatus 100 includes a control unit 101 that controls the entire text message speech apparatus 100, a message data acquisition unit 102, a storage unit 103, and a voice conversion unit 104 connected to the control unit 101, respectively.

メッセージデータ取得部１０２は、外部システム２００からメッセージデータを取得する。メッセージデータ取得部１０２が外部システム２００に対してメッセージデータの送信を要求することにより、メッセージデータ取得部１０２がメッセージデータを取得してもよいし、メッセージデータ取得部１０２が受動的にメッセージデータを取得してもよい。 The message data acquisition unit 102 acquires message data from the external system 200. When the message data acquisition unit 102 requests the external system 200 to transmit the message data, the message data acquisition unit 102 may acquire the message data, or the message data acquisition unit 102 passively receives the message data. You may get it.

記憶部１０３は、音声変換部１０４でメッセージデータを音声変換するために用いる話者モデルデータの識別情報と、後述するパラメータデータとを含む音声変換参照データを、メッセージデータの送信者（作成者）ごとに記憶している。 The storage unit 103 stores voice conversion reference data including identification information of speaker model data used for voice conversion of the message data by the voice conversion unit 104 and parameter data to be described later, as a sender (creator) of the message data. Remember every one.

音声変換部１０４は、少なくとも１つの話者モデルデータを記憶している。記憶部１０３に記憶されている音声変換参照データには、複数の送信者に対して、共通の話者モデルデータが割り当てられていることがある。 The voice conversion unit 104 stores at least one speaker model data. The voice conversion reference data stored in the storage unit 103 may be assigned common speaker model data to a plurality of senders.

図２に示すように、制御部１０１は、機能的な内部構成として、音声変換参照データ生成部1011と、音声変換参照データ書き込み部1012と、音声変換参照データ読み出し部1013とを有する。音声変換参照データ生成部1011と、音声変換参照データ書き込み部1012と、音声変換参照データ読み出し部1013は、ソフトウェアによって構成することができる。 As illustrated in FIG. 2, the control unit 101 includes a voice conversion reference data generation unit 1011, a voice conversion reference data writing unit 1012, and a voice conversion reference data reading unit 1013 as functional internal configurations. The voice conversion reference data generation unit 1011, the voice conversion reference data write unit 1012, and the voice conversion reference data read unit 1013 can be configured by software.

制御部１０１は、メッセージデータ取得部１０２が取得したメッセージデータに含まれるテキストメッセージと、音声変換参照データ読み出し部1013が記憶部１０３より読み出した音声変換参照データとを、所定の形式で音声変換部１０４に供給する。 The control unit 101 converts the text message included in the message data acquired by the message data acquisition unit 102 and the voice conversion reference data read from the storage unit 103 by the voice conversion reference data reading unit 1013 into a voice conversion unit in a predetermined format. 104 is supplied.

音声変換部１０４は、テキストメッセージを、所定の話者モデルデータを用い、さらに、音声変換参照データを参照して音声データに変換する。 The voice conversion unit 104 converts the text message into voice data by using predetermined speaker model data and further referring to the voice conversion reference data.

制御部１０１は、音声データ（音声信号）を音声出力部３００に供給して、音声として出力させる。音声出力部３００は、スピーカと増幅部等で構成される。音声データは、制御部１０１または図示していないＤ／Ａ変換器によってアナログ信号に変換されてスピーカに供給される。 The control unit 101 supplies the audio data (audio signal) to the audio output unit 300 and outputs it as audio. The audio output unit 300 includes a speaker and an amplification unit. The audio data is converted into an analog signal by the control unit 101 or a D / A converter (not shown) and supplied to the speaker.

図３に示すフローチャートを用いて、図１のテキストメッセージ音声化装置の動作、テキストメッセージ音声化装置で実行されるテキストメッセージ音声化方法をさらに説明する。 With reference to the flowchart shown in FIG. 3, the operation of the text message voice generating apparatus of FIG. 1 and the text message voice generating method executed by the text message voice generating apparatus will be further described.

図３において、制御部１０１は、ステップＳ０１にて、メッセージデータ取得部１０２が取得したメッセージデータを取り込む。メッセージデータは、一例として、図４に示すようなデータである。メッセージデータの形式は例えばXML形式である。 In FIG. 3, the control unit 101 captures the message data acquired by the message data acquisition unit 102 in step S01. The message data is data as shown in FIG. 4 as an example. The format of the message data is, for example, an XML format.

図４に示すように、メッセージデータは、messageタグを有し、messageタグは、メッセージに関する各種のタグを格納している。accountタグは、メッセージ送信者（メッセージ作成者）を一意に特定できるアカウント名を格納する。genderタグはメッセージ送信者の性別を格納し、languageタグはメッセージの言語を格納する。textタグは、メッセージ本文であるテキストメッセージを格納する。 As shown in FIG. 4, the message data has a message tag, and the message tag stores various tags related to the message. The account tag stores an account name that can uniquely identify the message sender (message creator). The gender tag stores the gender of the message sender, and the language tag stores the language of the message. The text tag stores a text message that is a message body.

制御部１０１は、ステップＳ０２にて、記憶部１０３に、メッセージ送信者に対応する音声変換参照データが存在するか否かを判定する。 In step S02, the control unit 101 determines whether or not the voice conversion reference data corresponding to the message sender exists in the storage unit 103.

図５は、記憶部１０３に記憶されている音声変換参照データの一例を示している。音声変換参照データは、アカウント名と、使用する話者モデルデータを特定する識別情報と、パラメータデータの設定値とを含む。ここでは、パラメータデータとして、読み上げ速度、読み上げ音声、読み上げ音量、音声出力定位それぞれの設定値が設定されている。パラメータデータは、これらのうちの少なくとも１つの設定値であってもよい。 FIG. 5 shows an example of the voice conversion reference data stored in the storage unit 103. The voice conversion reference data includes an account name, identification information for specifying speaker model data to be used, and a setting value of parameter data. Here, setting values for the reading speed, reading voice, reading volume, and sound output localization are set as parameter data. The parameter data may be a set value of at least one of these.

図５では、読み上げ速度、読み上げ音声、読み上げ音量が全てノーマルなる設定値に設定されており、音声出力定位はＬ５：Ｒ５なる設定値に設定されている。音声出力定位のＬ５：Ｒ５とは、左右２つのスピーカにおける左スピーカと右スピーカとより音声を出力させるバランスを示す。音声出力定位は、前後左右の４つのスピーカより音声を出力させるバランスであってもよく、複数のスピーカより音声を出力させるバランスを示せばよい。 In FIG. 5, the reading speed, the reading voice, and the reading volume are all set to a normal setting value, and the voice output localization is set to a setting value L5: R5. The sound output localization L5: R5 indicates a balance in which sound is output from the left speaker and the right speaker in the two left and right speakers. The sound output localization may be a balance in which sound is output from four speakers, front, rear, left, and right, and may indicate a balance in which sound is output from a plurality of speakers.

制御部１０１は、メッセージデータに含まれるaccountタグに記述されているアカウント名を含む音声変換参照データが存在しているか否かによって、メッセージ送信者に対応する音声変換参照データが存在するか否かを判定することができる。 The control unit 101 determines whether there is voice conversion reference data corresponding to the message sender depending on whether voice conversion reference data including the account name described in the account tag included in the message data exists. Can be determined.

制御部１０１は、メッセージ送信者に対応する音声変換参照データが存在すれば（YES）、処理をステップＳ０５に移行させ、存在しなければ（NO）、処理をステップＳ０３に移行させる。 If the voice conversion reference data corresponding to the message sender exists (YES), the control unit 101 shifts the process to step S05, and if not (NO), shifts the process to step S03.

音声変換参照データ生成部1011は、ステップＳ０３にて、新規の音声変換参照データを、パラメータデータの設定値が重複しないように生成する。話者モデルデータは他の音声変換参照データにおけるそれと重複していてもよい。 In step S03, the voice conversion reference data generation unit 1011 generates new voice conversion reference data so that parameter data setting values do not overlap. The speaker model data may overlap with that in other speech conversion reference data.

音声変換参照データ生成部1011は、音声変換参照データが複数のパラメータの設定値を含む場合には、パラメータデータの設定値の組み合わせパターンが重複しないように新規の音声変換参照データを生成すればよい。 When the voice conversion reference data includes a plurality of parameter setting values, the voice conversion reference data generation unit 1011 may generate new voice conversion reference data so that the combination patterns of the parameter data setting values do not overlap. .

音声変換参照データ生成部1011は、音声変換参照データが１つのパラメータの設定値のみを含む場合には、パラメータの設定値が重複しないように新規の音声変換参照データを生成する。 When the voice conversion reference data includes only one parameter setting value, the voice conversion reference data generation unit 1011 generates new voice conversion reference data so that the parameter setting values do not overlap.

複数のテキストメッセージを聴覚的に異なる状態で音声化するには、パラメータデータの設定値を大きく異ならせるのがよい。そこで、例えば、読み上げ速度、読み上げ音声、読み上げ音量の設定値をそれぞれロー、ノーマル、ハイの３段階に設定したとすると、パラメータデータの設定値の組み合わせパターンは比較的限られたパターンとなる。 In order to make a plurality of text messages audibly sound differently, it is preferable to greatly change the setting values of the parameter data. Therefore, for example, if the setting values of the reading speed, reading voice, and reading volume are set to three levels of low, normal, and high, respectively, the combination pattern of the setting values of the parameter data is a relatively limited pattern.

音声変換参照データ書き込み部1012は、ステップＳ０４にて、新たに生成した音声変換参照データを記憶部１０３に書き込んで登録し、処理をステップＳ０５に移行させる。 In step S04, the voice conversion reference data writing unit 1012 writes and registers the newly generated voice conversion reference data in the storage unit 103, and shifts the processing to step S05.

図６の（ａ）に示すように、記憶部１０３に、アカウント名がaccount0と設定されている図５に示す音声変換参照データが予め登録されているとする。 As shown in FIG. 6A, it is assumed that the voice conversion reference data shown in FIG. 5 in which the account name is set to account0 is registered in the storage unit 103 in advance.

図４に示すアカウント名account1を有するメッセージ送信者がメッセージデータを送信したとする。 Assume that a message sender having the account name account1 shown in FIG. 4 has sent message data.

音声変換参照データ書き込み部1012は、ステップＳ０４にて、記憶部１０３にアカウント名account1に対応する音声変換参照データを登録する。よって、図６の（ｂ）に示すように、記憶部１０３には、既存のアカウント名account0に対応する音声変換参照データと、アカウント名account1に対応する音声変換参照データとが記憶された状態となる。 The voice conversion reference data writing unit 1012 registers the voice conversion reference data corresponding to the account name account1 in the storage unit 103 in step S04. Therefore, as shown in FIG. 6B, the storage unit 103 stores the voice conversion reference data corresponding to the existing account name account0 and the voice conversion reference data corresponding to the account name account1. Become.

図６の（ｂ）に示すように、アカウント名account1に対応する音声変換参照データは、例えば読み上げ速度と読み上げ音程の設定値がハイに設定される。音声変換参照データが複数のパラメータデータを含む場合には、少なくとも１つのパラメータデータの設定値を変更すればよい。アカウント名account1に対応する話者モデルデータは、アカウント名account0に対応する話者モデルデータと同じである。 As shown in FIG. 6B, for the voice conversion reference data corresponding to the account name account1, for example, the reading speed and the reading pitch are set to high. When the voice conversion reference data includes a plurality of parameter data, the setting value of at least one parameter data may be changed. The speaker model data corresponding to the account name account1 is the same as the speaker model data corresponding to the account name account0.

ところで、音声変換部１０４が例えば男性用の話者モデルデータと、女性用の話者モデルデータとを記憶している場合には、音声変換参照データ生成部1011は、音声変換参照データを生成する際に、メッセージデータに含まれるgenderタグに記述されている性別に対応した話者モデルデータを選択すればよい。 By the way, when the voice conversion unit 104 stores, for example, male speaker model data and female speaker model data, the voice conversion reference data generation unit 1011 generates voice conversion reference data. At this time, speaker model data corresponding to the gender described in the gender tag included in the message data may be selected.

図３に戻り、音声変換参照データ読み出し部1013は、ステップＳ０５にて、記憶部１０３からメッセージ送信者に対応する音声変換参照データを読み出して、後述するデータを音声変換部１０４に供給する。 Returning to FIG. 3, in step S <b> 05, the voice conversion reference data reading unit 1013 reads voice conversion reference data corresponding to the message sender from the storage unit 103 and supplies data to be described later to the voice conversion unit 104.

制御部１０１が音声変換部１０４へと供給するデータの形式は例えば次のとおりである。制御部１０１は、textタグに記述されているテキストメッセージと、音声変換参照データに含まれる話者モデルデータを示す識別情報と、音声出力定位以外のパラメータデータとを用いて、例えばSSML形式のデータを生成する。 The format of data supplied from the control unit 101 to the audio conversion unit 104 is, for example, as follows. The control unit 101 uses the text message described in the text tag, the identification information indicating the speaker model data included in the speech conversion reference data, and parameter data other than the speech output localization, for example, data in the SSML format Is generated.

制御部１０１は、SSML形式のデータと音声出力定位のパラメータデータとを音声変換部１０４に供給する。音声出力定位のパラメータデータをSSML形式のデータと別にしているのは、音声出力定位のパラメータデータをSSML形式のデータに記述できないからである。 The control unit 101 supplies SSML format data and audio output localization parameter data to the audio conversion unit 104. The reason why the parameter data for the sound output localization is separated from the data in the SSML format is that the parameter data for the sound output localization cannot be described in the data in the SSML format.

制御部１０１は、ステップＳ０６にて、音声変換部１０４による音声変換処理を実行させる。音声変換部１０４は、入力されたSSML形式のデータに記述されている識別情報の話者モデルデータを用い、SSML形式のデータに記述されているパラメータデータ及び音声出力定位のパラメータデータに従って、テキストメッセージを音声変換する。 In step S06, the control unit 101 causes the voice conversion unit 104 to execute voice conversion processing. The voice conversion unit 104 uses the speaker model data of the identification information described in the input SSML format data, and in accordance with the parameter data described in the SSML format data and the parameter data of the voice output localization, the text message Is converted to speech.

制御部１０１は、ステップＳ０７にて、音声変換部１０４によって変換された音声データに基づく音声を音声出力部３００より出力させて、処理を終了させる。 In step S07, the control unit 101 causes the audio output unit 300 to output audio based on the audio data converted by the audio conversion unit 104, and ends the processing.

制御部１０１をマイクロコンピュータによって構成し、コンピュータプログラム（テキストメッセージ音声化プログラム）によって、マイクロコンピュータに図３に示す処理を実行させるように構成してもよい。 The control unit 101 may be configured by a microcomputer, and may be configured to cause the microcomputer to execute the processing shown in FIG. 3 by a computer program (text message speechization program).

以上のように、第１実施形態によれば、音声変換部１０４によってテキストメッセージを音声変換する際に用いる話者モデルデータが同じであっても、メッセージ作成者が異なる複数のテキストメッセージを聴覚的に異なる状態で音声化することができる。 As described above, according to the first embodiment, even when the speaker model data used when the voice conversion unit 104 converts the text message into voice is the same, a plurality of text messages with different message creators can be heard. Can be voiced in different states.

＜第２実施形態＞
第２実施形態のテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムを、第１実施形態におけるそれとは異なる点を中心に説明する。 Second Embodiment
The text message sounding apparatus, text message sounding method, and text message sounding program of the second embodiment will be described with a focus on differences from the first embodiment.

第２実施形態においては、図７に示すように、音声変換参照データに、その音声変換参照データの最終利用日時の情報を追加している。 In the second embodiment, as shown in FIG. 7, information on the last use date and time of the voice conversion reference data is added to the voice conversion reference data.

図８に示す第２実施形態のフローチャートにおいて、図３に示す第１実施形態のフローチャートと同一のステップには同一の符号を付し、説明を省略する。図８において、制御部１０１は、ステップＳ０２にて、記憶部１０３に、メッセージ送信者に対応する音声変換参照データが存在しなければ（NO）、処理をステップＳ０８に移行させる。 In the flowchart of the second embodiment shown in FIG. 8, the same steps as those in the flowchart of the first embodiment shown in FIG. In FIG. 8, if there is no voice conversion reference data corresponding to the message sender in the storage unit 103 in step S02 (NO), the control unit 101 shifts the process to step S08.

制御部１０１は、ステップＳ０８にて、パラメータデータの設定値の組み合わせパターンが不足しているか否かを判定する。パラメータデータが１つのみであれば、パラメータデータの設定値が不足しているか否かを判定すればよい。 In step S08, the control unit 101 determines whether or not a combination pattern of parameter data setting values is insufficient. If there is only one parameter data, it may be determined whether or not the set value of the parameter data is insufficient.

制御部１０１は、組み合わせパターンが不足していれば（YES）、処理をステップＳ０９に移行させ、組み合わせパターンが不足していなければ（NO）、処理をステップＳ０３に移行させる。 If the combination pattern is insufficient (YES), the control unit 101 shifts the process to step S09. If the combination pattern is not insufficient (NO), the control unit 101 shifts the process to step S03.

音声変換参照データ生成部1011は、ステップＳ０９にて、最終利用日時が最も古い音声変換参照データを削除し、新規の音声変換参照データを生成して、処理をステップＳ０５に移行させる。 In step S09, the voice conversion reference data generation unit 1011 deletes the voice conversion reference data having the oldest last use date and time, generates new voice conversion reference data, and shifts the processing to step S05.

具体的には、音声変換参照データ生成部1011が、最終利用日時が最も古い音声変換参照データのアカウント名を他のアカウント名に書き換えた音声変換参照データを生成し、音声変換参照データ書き込み部1012が記憶部１０３に書き込めばよい。これに伴って、最終利用日時が更新される。 Specifically, the voice conversion reference data generation unit 1011 generates voice conversion reference data in which the account name of the voice conversion reference data with the oldest last use date is replaced with another account name, and the voice conversion reference data writing unit 1012 May be written in the storage unit 103. Along with this, the last use date and time is updated.

ステップＳ０９の処理によって、最終利用日時が最も古い、あるメッセージ送信者に対して設定されている音声変換参照データが消去され、新規のメッセージ送信者に対して、音声変換参照データが設定されることになる。 By the process of step S09, the voice conversion reference data set for a message sender having the oldest last use date and time is deleted, and the voice conversion reference data is set for a new message sender. become.

第２実施形態によれば、第１実施形態と同じ効果に加えて、パラメータデータの設定値の組み合わせパターンが不足した場合でも、新規のメッセージ送信者に対して音声変換参照データを設定することができる。 According to the second embodiment, in addition to the same effects as in the first embodiment, voice conversion reference data can be set for a new message sender even when the combination pattern of parameter data setting values is insufficient. it can.

上記のように、パラメータデータの設定値の組み合わせパターンは限られているので、第２実施形態は有効となる。 As described above, since the combination pattern of the setting values of the parameter data is limited, the second embodiment is effective.

＜第３実施形態＞
第３実施形態のテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムを、第１または第２実施形態におけるそれとは異なる点を中心に説明する。以下の第３実施形態の構成及び動作は、第１実施形態に対して加えてもよいし、第２実施形態に対して加えてもよい。 <Third Embodiment>
The text message speechization apparatus, text message speechization method, and text message speechization program of the third embodiment will be described with a focus on differences from the first or second embodiment. The configuration and operation of the following third embodiment may be added to the first embodiment or may be added to the second embodiment.

図９に示すように、第３実施形態においては、制御部１０１は、機能的な内部構成として、文字列付加部1014をさらに有する。文字列付加部1014は、ソフトウェアによって構成することができる。 As shown in FIG. 9, in the third embodiment, the control unit 101 further includes a character string adding unit 1014 as a functional internal configuration. The character string adding unit 1014 can be configured by software.

図１０において、制御部１０１は、ステップＳ０３にて新規の音声変換参照データを生成して、ステップＳ０４にて音声変換参照データを記憶部１０３に登録する。文字列付加部1014は、ステップＳ１１にて、テキストメッセージの前にメッセージ送信者の紹介文を示す文字列を付加して、処理をステップＳ０５に移行させる。 In FIG. 10, the control unit 101 generates new voice conversion reference data in step S03, and registers the voice conversion reference data in the storage unit 103 in step S04. In step S11, the character string adding unit 1014 adds a character string indicating an introduction sentence of the message sender before the text message, and shifts the processing to step S05.

記憶部１０３には、言語ごとの紹介文のテンプレートが記憶されている。例えば日本語のテンプレートを例にすると、記憶部１０３には、一例として、「こんにちは。？？？です。」というテンプレートが記憶されている。 The storage unit 103 stores templates of introduction sentences for each language. For example, if you as an example the Japanese of the template, in the storage unit 103, as an example, "Hello. It is ???." That the template is stored.

制御部１０１は、languageタグに記述されている言語のテンプレートを読み出し、文字列付加部1014は、テンプレートにaccountタグに記述されているアカウント名を追記してメッセージ送信者の紹介文を生成する。 The control unit 101 reads a language template described in the language tag, and the character string adding unit 1014 adds an account name described in the account tag to the template to generate an introduction sentence of the message sender.

languageタグが日本語であることを示せば、文字列付加部1014は、上記のテンプレートにおける「？？？」の部分をアカウント名に置換した文字列を生成して、メッセージ送信者の紹介文としてテキストメッセージの前に付加する。英語等の他の言語の場合も同様である。 If the language tag indicates that it is in Japanese, the character string adding unit 1014 generates a character string by replacing the “???” part of the above template with the account name, and uses it as an introduction sentence of the message sender. Append before the text message. The same applies to other languages such as English.

文字列付加部1014は、新規の音声変換参照データを生成したタイミング以外でも、テキストメッセージの前にメッセージ送信者の紹介文を付加してもよい。例えば、文字列付加部1014は、音声変換参照データの最終利用日時から所定期間以上経過して、その音声変換参照データを利用するときに、テキストメッセージの前に紹介文を付加する。 The character string adding unit 1014 may add an introduction sentence of the message sender before the text message at a timing other than the timing when the new voice conversion reference data is generated. For example, the character string adding unit 1014 adds an introductory sentence before the text message when the voice conversion reference data is used after a predetermined period has elapsed since the last use date and time of the voice conversion reference data.

音声変換参照データの最終利用日時から所定期間以上経過した場合にテキストメッセージに紹介文を付加する場合には、音声変換参照データの形式を、図７のように、最終利用日時の情報を含む音声変換参照データとする。 When an introductory sentence is added to a text message when a predetermined period or more has passed since the last use date and time of the voice conversion reference data, the voice conversion reference data format is a voice including information on the last use date and time as shown in FIG. This is converted reference data.

図１１に示すように、制御部１０１は、ステップＳ０２，Ｓ０４に続くステップＳ１２にて、最終利用日時から所定期間以上経過したか否かを判定する。制御部１０１は、所定期間以上経過していれば（YES）、処理をステップＳ１３に移行させ、所定期間以上経過していなければ（NO）、処理をステップＳ０５に移行させる。 As illustrated in FIG. 11, the control unit 101 determines whether or not a predetermined period or more has elapsed since the last use date and time in step S12 following steps S02 and S04. The control unit 101 shifts the process to step S13 if the predetermined period or more has elapsed (YES), and shifts the process to step S05 if the predetermined period or longer has not elapsed (NO).

文字列付加部1014は、ステップＳ１３にて、テキストメッセージの前にメッセージ送信者の紹介文を示す文字列を付加して、ステップＳ０５に移行させる。 In step S13, the character string adding unit 1014 adds a character string indicating the message sender's introduction before the text message, and proceeds to step S05.

第３実施形態によれば、第１実施形態または第２実施形態と同じ効果に加えて、次のような効果を奏する。 According to 3rd Embodiment, in addition to the same effect as 1st Embodiment or 2nd Embodiment, there exist the following effects.

第１の例である図１０に示す処理によれば、新しいメッセージ送信者がテキストメッセージを送信してきて、音声変換部１０４がテキストメッセージを音声変換するときに、メッセージ送信者の紹介文を再生することができる。よって、新しいメッセージ送信者が誰であるかを認識することが可能となる。 According to the process shown in FIG. 10, which is the first example, when a new message sender sends a text message and the voice conversion unit 104 converts the text message into voice, the message sender's introduction is reproduced. be able to. Therefore, it is possible to recognize who the new message sender is.

第２の例である図１１に示す処理によれば、音声変換参照データの最終利用日時から所定期間以上経過した状態でその音声変換参照データを利用して、音声変換部１０４がテキストメッセージを音声変換するときに、メッセージ送信者の紹介文を再生することができる。よって、期間が経過することによってメッセージ送信者が誰であるかを忘れていたとしても、メッセージ送信者が誰であるかを認識することが可能となる。 According to the process shown in FIG. 11 as the second example, the voice conversion unit 104 uses the voice conversion reference data in a state in which a predetermined period or more has passed since the last use date and time of the voice conversion reference data to convert the text message into a voice message. When converting, the message sender's introduction can be played. Therefore, even if the message sender is forgotten as the period elapses, it is possible to recognize who the message sender is.

＜第４実施形態＞
第４実施形態のテキストメッセージ音声化装置、テキストメッセージ音声化方法、テキストメッセージ音声化プログラムを、第１〜第３実施形態におけるそれとは異なる点を中心に説明する。以下の第４実施形態の構成及び動作は、第１〜第３実施形態それぞれに対して加えてもよい。 <Fourth embodiment>
The text message sounding apparatus, text message sounding method, and text message sounding program of the fourth embodiment will be described with a focus on differences from the first to third embodiments. The configuration and operation of the following fourth embodiment may be added to each of the first to third embodiments.

図１２に示すように、第４実施形態においては、制御部１０１は、機能的な内部構成として、音声変換参照データ変更部1015をさらに有する。音声変換参照データ変更部1015は、ソフトウェアによって構成することができる。図１２における文字列付加部1014は省略可能である。 As shown in FIG. 12, in the fourth embodiment, the control unit 101 further includes an audio conversion reference data changing unit 1015 as a functional internal configuration. The voice conversion reference data changing unit 1015 can be configured by software. The character string adding unit 1014 in FIG. 12 can be omitted.

第４実施形態においても、音声変換参照データの形式を、図７のように、最終利用日時の情報を含む音声変換参照データとする。 Also in the fourth embodiment, the format of the voice conversion reference data is voice conversion reference data including information on the last use date and time as shown in FIG.

まず、メッセージ送信者Ａが、テキストメッセージを送信してきたとする。記憶部１０３に記憶されている、メッセージ送信者Ａに対して設定されている音声出力定位が左右３：７であったとする。このとき、音声変換部１０４は、音声出力定位を左右３：７として、テキストメッセージを音声変換する。 First, assume that message sender A has sent a text message. It is assumed that the sound output localization set for the message sender A stored in the storage unit 103 is 3: 7 left and right. At this time, the voice conversion unit 104 converts the text message into voice by setting the voice output localization to 3: 7 left and right.

その後、メッセージ送信者Ｂが、テキストメッセージを送信してきたとする。記憶部１０３に記憶されている、メッセージ送信者Ｂに対して設定されている音声出力定位が左右５：５であったとする。このとき、音声変換部１０４は、音声出力定位を左右５：５として、テキストメッセージを音声変換する。 Thereafter, it is assumed that the message sender B transmits a text message. It is assumed that the sound output localization set for the message sender B stored in the storage unit 103 is 5: 5 on the left and right. At this time, the voice conversion unit 104 converts the text message into voice with the voice output localization set to 5: 5 on the left and right.

さらにその後、再びメッセージ送信者Ｂが、テキストメッセージを送信してきたとする。このとき、メッセージ送信者Ｂに対応する音声変換参照データの最終利用日時からの経過期間は、メッセージ送信者Ａに対応する音声変換参照データのそれよりも短くなる。 Furthermore, after that, it is assumed that the message sender B sends a text message again. At this time, the elapsed period from the last use date and time of the voice conversion reference data corresponding to the message sender B is shorter than that of the voice conversion reference data corresponding to the message sender A.

音声変換参照データ変更部1015は、記憶部１０３に記憶されている、メッセージ送信者Ｂに対して設定されている音声出力定位を、左右３：７に変更して、音声変換部１０４は、音声出力定位を左右３：７として、テキストメッセージを音声変換する。 The voice conversion reference data changing unit 1015 changes the voice output localization set for the message sender B stored in the storage unit 103 to 3: 7 left and right, and the voice conversion unit 104 A text message is converted into a voice by setting the output localization to 3: 7 left and right.

併せて、音声変換参照データ変更部1015は、記憶部１０３に記憶されている、メッセージ送信者Ａに対して設定されている音声出力定位が左右３：７を左右５：５に変更する。 In addition, the voice conversion reference data changing unit 1015 changes the voice output localization set for the message sender A stored in the storage unit 103 from left / right 3: 7 to left / right 5: 5.

このように、第４実施形態は、最終利用日時からの経過期間に応じて音声出力定位の設定値を変更する音声変換参照データ変更部を有する。第４実施形態によれば、例えば次のような場合に効果を発揮する。左右のスピーカが車両に搭載されているとする。ここでは、車両は右側にステアリングが配置されているとする。 As described above, the fourth embodiment includes the voice conversion reference data changing unit that changes the setting value of the voice output localization according to the elapsed period from the last use date and time. According to 4th Embodiment, an effect is exhibited, for example in the following cases. Assume that left and right speakers are mounted on a vehicle. Here, it is assumed that the steering wheel is arranged on the right side of the vehicle.

メッセージ送信者Ａが送信したテキストメッセージを音声変換部１０４によって音声データに変換し、音声出力定位を左右３：７として、左右のスピーカで再生すると、運転者は、運転者側に近付いた位置からの音声を聞くことになる。 When the text message transmitted by the message sender A is converted into voice data by the voice conversion unit 104 and the voice output localization is set to 3: 7 left and right and reproduced by the left and right speakers, the driver can move from the position closer to the driver side. You will hear the voice.

次に、メッセージ送信者Ｂが送信したテキストメッセージを音声変換部１０４に音声データに変換して、音声出力定位を左右５：５として、左右のスピーカで再生したとする。この場合、音声は左右のほぼ中央から聞こえることになるので、運転者は、音声出力定位を左右３：７とした場合と比較して、運転者から離れた位置からの音声を聞くことになる。 Next, it is assumed that the text message transmitted by the message sender B is converted into voice data by the voice conversion unit 104 and played back by the left and right speakers with the voice output localization set to 5: 5 on the left and right. In this case, since the sound can be heard from almost the center of the left and right, the driver can hear the sound from a position away from the driver as compared with the case where the sound output localization is set to 3: 7 left and right. .

その後、再びメッセージ送信者Ｂが送信したテキストメッセージを音声変換部１０４に音声データに変換して、音声出力定位を左右３：７として、左右のスピーカで再生すると、運転者は、運転者側に近付いた位置からの音声を聞くことになる。 After that, the text message transmitted again by the message sender B is converted into voice data by the voice conversion unit 104, and the voice output localization is set to 3: 7 left and right and reproduced by the left and right speakers. You will hear the sound from the position you approached.

メッセージ送信者Ｂは、メッセージ送信者Ａよりも直近にメッセージデータを送信している。よって、メッセージ送信者Ｂは、メッセージ送信者Ａよりも、メッセージ受信者とメッセージ送信者とでメッセージデータを送受信する際の話題の中心となっている可能性が高い。 Message sender B is sending message data more recently than message sender A. Therefore, the message sender B is more likely to be the center of the topic when the message data is transmitted and received between the message receiver and the message sender than the message sender A.

運転者は、話題の中心となっている可能性が高いメッセージ送信者による音声を運転者側に近付いた位置から聞き、そうでないメッセージ送信者による音声を運転者から比較的離れた位置から聞くことにより、両者を容易に識別することが可能となる。 The driver listens to the voice of the message sender who is likely to be the center of the topic from a position close to the driver, and listens to the voice of the other message sender from a position relatively far from the driver. Thus, it is possible to easily identify both.

本発明は以上説明した各実施形態に限定されるものではなく、本発明の要旨を逸脱しない範囲において種々変更可能である。図５等に示すように、音声変換参照データは話者モデルデータを特定するための識別情報を含むが、音声変換部１０４が有する話者モデルデータが１つのみであれば、話者モデルデータの識別情報を省略してもよい。 The present invention is not limited to the embodiments described above, and various modifications can be made without departing from the scope of the present invention. As shown in FIG. 5 and the like, the speech conversion reference data includes identification information for specifying the speaker model data, but if the speech conversion unit 104 has only one speaker model data, the speaker model data The identification information may be omitted.

１０１制御部
１０２メッセージデータ取得部
１０３記憶部
１０４音声変換部
1011 音声変換参照データ生成部
1014 文字列付加部
1015 音声変換参照データ変更部 101 Control unit 102 Message data acquisition unit 103 Storage unit 104 Voice conversion unit
1011 Voice conversion reference data generator
1014 String addition part
1015 Voice conversion reference data change part

Claims

A message data acquisition unit for acquiring message data including a text message and information indicating a message creator who created the text message;
A voice conversion unit that converts the text message into voice data using predetermined speaker model data;
A setting value of at least one parameter of a reading speed of the text message, a reading pitch, a reading volume, and an audio output localization indicating a balance when outputting the sound based on the sound data from a plurality of speakers. A storage unit that stores voice conversion reference data including the corresponding message creator so as to be different for each message creator;
When the message data acquisition unit acquires predetermined message data, the setting value of the parameter included in the voice conversion reference data stored corresponding to the message creator of the predetermined message data is read from the storage unit The voice conversion unit controls the voice conversion unit to convert a text message included in the predetermined message data into voice data using the speaker model data and the set value of the parameter. A control unit,
A text message voicing apparatus comprising:

When the message data acquisition unit acquires predetermined message data and the storage unit does not store voice conversion reference data corresponding to the message creator of the predetermined message data,
The controller is
When the voice conversion reference data includes setting values of a plurality of parameters, new voice conversion reference data is generated so that combination patterns of parameter data setting values do not overlap, and the voice conversion reference data includes one parameter. 2. The text message voice generating apparatus according to claim 1, further comprising: a voice conversion reference data generation unit that generates new voice conversion reference data so that parameter setting values do not overlap when only the setting values are included.

The voice conversion reference data includes information indicating the last use date and time using the voice conversion reference data,
When the voice conversion reference data generation unit cannot generate new voice conversion reference data in which the combination pattern of parameter data setting values or parameter setting values do not overlap, based on the information indicating the last use date and time, The text message speech conversion apparatus according to claim 2, wherein the voice conversion reference data having the oldest last use date is deleted to generate new voice conversion reference data.

The controller is
The character string adding unit for adding a character string, which is an introduction sentence of the message creator, to the message data based on information indicating the message creator. The text message voice converting device described in 1.

The voice conversion reference data includes, as the parameter, a setting value of the voice output localization, and information indicating the last use date and time using the voice conversion reference data,
The controller is
The text message voice conversion device according to claim 1, further comprising: a voice conversion reference data changing unit that changes a setting value of the voice output localization according to an elapsed period from the last use date and time.

Obtaining predetermined message data including a text message and information indicating a message creator who created the text message;
A voice conversion reference including a setting value of at least one parameter of the reading speed of the text message, a reading pitch, a reading volume, and a voice output localization indicating a balance when voice is outputted from a plurality of speakers. The setting value of the parameter corresponding to the message creator is read out from the storage unit stored corresponding to the message creator so that the data is different for each message creator,
A text message voice, wherein the text message included in the predetermined message data is converted into voice data using predetermined speaker model data and the set value of the parameter read from the storage unit. Method.

On the computer,
When message data including a text message and information indicating the message creator who created the text message is acquired, the text message reading speed, reading pitch, reading volume, and voice are output from a plurality of speakers. Voice conversion reference data including a setting value of at least one parameter of voice output localization indicating a balance at the time of output is stored in association with the message creator so as to be different for each message creator. Reading the setting value of the parameter from the storage unit;
Converting the text message included in the message data into speech data using predetermined speaker model data and the setting values of the parameters read from the storage unit;
A text message voicing program characterized in that