JP3461055B2

JP3461055B2 - Audio channel selection synthesis method and apparatus for implementing the method

Info

Publication number: JP3461055B2
Application number: JP10263095A
Authority: JP
Inventors: 徹定方; 隆幸沖村
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 1995-04-26
Filing date: 1995-04-26
Publication date: 2003-10-27
Anticipated expiration: 2018-10-27
Also published as: JPH08298635A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、音声チャンネル選択
合成方法およびこの方法を実施する装置に関し、特に、
多チャンネルの音声チャンネルの内から複数の音声チャ
ンネルを選択し、ユーザ毎に各別に音声チャンネルを選
択合成する音声チャンネル選択合成方法およびこの方法
を実施する装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio channel selection / synthesis method and an apparatus for implementing the method, and more particularly,
Select multiple audio channels from the multi-channel audio channels and select audio channels for each user.
The present invention relates to an audio channel selection / synthesis method for selective synthesis and an apparatus for implementing this method.

【０００２】[0002]

【従来の技術】従来、テレビの放送番組、ビデオのコン
テンツにおいて、音声チャンネルは２チャンネルの主音
声および副音声、或は右チャンネルおよび左チャンネル
のステレオである。或は、１チャンネルのモノラルであ
る場合もある。音声チャンネルが２チャンネルである場
合、チャンネルの選択方法としては、主音声或は副音声
の何れか一方を選択する選択方法、或は両音声チャンネ
ルの再生の比率をテレビ受信装置のバランスボリューム
つまみにより調節するという選択方法を採用するしか今
のところ道はない。一般に、多数の音声チャンネルが一
つの番組に提供されているとき、これらの内から複数の
音声チャンネルを選択し、選択された音声チャンネルを
適当な合成比率により合成して再生する効率的な選択方
法は開発されていない。2. Description of the Related Art Conventionally, in television broadcast programs and video contents, audio channels are two-channel main audio and sub-audio, or right-channel and left-channel stereo. Alternatively, it may be one-channel monaural. When there are two audio channels, the selection method of the main audio or the sub audio is selected as the channel selection method, or the reproduction ratio of both audio channels is determined by the balance volume knob of the television receiver. For now, the only option is to adjust. Generally, when a large number of audio channels are provided for one program, an efficient selection method for selecting a plurality of audio channels from these and synthesizing the selected audio channels with an appropriate synthesizing ratio for reproduction. Has not been developed.

【０００３】例えば、野球中継において、スタジアムの
複数の音声チャンネルが同時に提供されている、視聴者
がこれらの音声チャンネルの内から複数の好みの音声チ
ャンネルを選択し、これらを適当な合成比率により合成
する音声チャンネルの選択方法はないのである。更に、
一つの映像画面を複数のユーザが視聴している場合、各
ユーザ毎に各別の音声チャンネルを選択し、再生する効
率的な方法も開発されていない。For example, in a baseball relay, a plurality of audio channels of a stadium are provided at the same time. A viewer selects a plurality of favorite audio channels from these audio channels and synthesizes them with an appropriate synthesis ratio. There is no way to select the audio channel to use. Furthermore,
When a plurality of users are watching one video screen, an efficient method for selecting and reproducing different audio channels for each user has not been developed.

【０００４】[0004]

【発明が解決しようとする課題】この発明は、多数の音
声チャンネルが一つの番組に提供されているとき、これ
らのチャンネルの内から複数の音声チャンネルを選択
し、選択された音声チャンネルを適当な合成比率により
合成し、更に、一つの映像画面を複数のユーザが視聴し
ている場合、各ユーザ毎に各別の音声チャンネルを選択
合成して上述の問題を解消した音声チャンネル選択合成
方法およびこの方法を実施する装置を提供するものであ
る。SUMMARY OF THE INVENTION In the present invention, when a plurality of audio channels are provided for one program, a plurality of audio channels are selected from these channels, and the selected audio channel is appropriately selected. When a plurality of users view one video screen by combining with a combining ratio, a different audio channel is selected and combined for each user to solve the above-mentioned problem, and an audio channel selecting and combining method. An apparatus for performing the method is provided.

【０００５】[0005]

【課題を解決するための手段】画面に映像を表示すると
共に複数の音声チャンネルの内から選択されたチャンネ
ルの音声を任意の比率で合成して再生する音声チャンネ
ル選択合成方法において、音声チャンネル位置を示す音
声チャンネルポイントと画面中の位置とを対応させ、音
声チャンネルポイントを選択するポインタ複数個を具備
して、これらポインタを各別に画面に重畳して表示し、
各別のポインタについて、表示されたポインタ内の基準
位置と選択された音声チャンネルポイントとの間の相対
距離に基づいて音声チャンネルの音声合成比率を決定す
る音声チャンネル選択合成方法を構成した。According to an audio channel selecting / synthesizing method of displaying an image on a screen and synthesizing and reproducing audio of a channel selected from a plurality of audio channels at an arbitrary ratio, Equipped with a plurality of pointers for selecting the audio channel points by associating the indicated audio channel points with the positions on the screen.
Then, display these pointers by superimposing them on the screen separately ,
A voice channel selection / synthesis method for deciding a voice synthesis ratio of a voice channel based on a relative distance between a reference position in the displayed pointer and a selected voice channel point is configured for each different pointer .

【０００６】そして、先の音声チャンネル選択合成方法
において、ポインタの大きさを操作する音声チャンネル
選択合成方法を構成した。また、先の音声チャンネル選
択合成方法において、ポインタ内の基準位置と選択され
た音声チャンネルポイントとの間の相対距離に基づく音
声チャンネルの音声合成比率を変化させる音声チャンネ
ル選択合成方法を構成した。ここで、画面に映像を表示
すると共に複数の音声チャンネルの内から選択されたチ
ャンネルの音声を任意の比率で合成して再生する音声チ
ャンネル選択合成装置において、ポインタ位置を操作す
るユーザ設定情報を発生するポインタ操作部１１５を具
備し、ユーザ設定情報に基づいて画面に表示するポイン
タ位置を示すポインタ表示信号を発生すると共に、設定
されたポインタ位置を示すユーザ操作情報を発生するユ
ーザ操作制御部１０５を具備し、音声チャンネルポイン
トの画面における対応配置関係が内部に生成され、ポイ
ンタの位置に基づいて選択されるべき音声チャンネルお
よび音声合成比率を示す音声合成比率情報を発生する音
声チャンネルポイント選択部１０４を具備し、音声合成
比率情報に基づいて複数提供される音声チャンネルから
任意の音声チャンネルを選択する音声チャンネル選択部
１０１を具備し、音声チャンネル選択部１０１により選
択された音声チャンネルを上記音声合成比率で合成する
音声合成部１０２を具備し、合成音声を発声するスピー
カ１０３を具備し、ここで、上記ポインタ操作部、上記
ユーザ操作制御部、上記音声チャンネルポイント選択
部、上記音声チャンネル選択部、上記音声合成部および
上記スピーカの組を複数組具備し、これらの組を各別に
動作せしめる音声チャンネル選択合成装置を構成した。Then, in the above audio channel selection / synthesis method, an audio channel selection / synthesis method for operating the size of the pointer is constructed. Also, in the previous audio channel selection and synthesis method, it was selected as the reference position in the pointer.
Sound based on the relative distance between the audio channel points
We constructed a voice channel selection synthesis method that changes the voice synthesis ratio of voice channels. Here, a user setting information for operating a pointer position is generated in an audio channel selection / synthesis device that displays an image on a screen and synthesizes and reproduces audio of a channel selected from a plurality of audio channels at an arbitrary ratio. A user operation control unit 105 for generating a pointer display signal indicating a pointer position to be displayed on the screen based on the user setting information and generating user operation information indicating the set pointer position. The audio channel point selection unit 104 is provided which internally generates the corresponding arrangement relationship of the audio channel points on the screen, and generates audio channel ratio information indicating the audio channel and the audio channel ratio to be selected based on the position of the pointer. A voice channel that is provided and is provided based on the voice synthesis ratio information. Comprising a speech channel selection unit 101 for selecting any of the voice channels from Le, the audio channel selected by audio channel selector 101 comprises a speech synthesis unit 102 for synthesizing the above speech synthesis ratio, uttering synthesized speech A speaker 103 is provided, in which the pointer operation unit, the
User operation control unit, above audio channel point selection
Section, the voice channel selection section, the voice synthesis section, and
We have multiple sets of the above speakers, and these sets are for each
A voice channel selection / synthesis device that operates is constructed.

【０００７】そして、先の音声チャンネル選択合成装置
において、ポインタ操作部はポインタの形状を変化させ
る構成を有するものである音声チャンネル選択合成装置
を構成した。また、先の音声チャンネル選択合成装置に
おいて、音声チャンネルポイント選択部は音声合成比率
を規定する音声合成比率フィルタを変化させる構成を有
するものである音声チャンネル選択合成装置を構成し
た。In the above-mentioned audio channel selection / synthesis device, the pointer operation unit has a structure for changing the shape of the pointer, thereby constructing an audio channel selection / synthesis device. Further, in the above-mentioned voice channel selection / synthesis device, the voice channel point selection unit has a configuration for changing the voice synthesis ratio filter that defines the voice synthesis ratio.

【０００８】[0008]

【実施例】この発明の音声チャンネル選択合成方法およ
び装置における構成要素を図１を参照して説明する。図
１において、１０１は複数提供される音声チャンネルか
ら任意の音声チャンネルを選択する音声チャンネル選択
部、１０２は音声チャンネル選択部により選択された音
声チャンネルを任意の合成比率で合成する音声合成部、
１０３は合成された音声を出力するスピーカである。１
０４は音声チャンネルポイント４０１の画面における対
応配置関係が内部に生成され、ポインタの位置に基づい
て選択されるべき音声チャンネルを決定する音声チャン
ネルポイント選択部、１０５は画面上に表示するポイン
タを制御するユーザ操作制御部、１０７は視聴者が操作
したポインタ位置、パラメータを伝えるユーザ操作情報
である。１０６は映像およびポインタを表示するディス
プレイである。１０８はセンタからの音声チャンネルポ
イント配置情報その他のセンタ情報、１１０は音声合成
比率情報、１１１は映像信号、１１２は複数提供される
音声チャンネル、１１３はポインタ表示信号、１１４は
音声信号である。１１５はポインタ操作部であり、１１
６はパラメータを操作するボリューム、１１７はポイン
タの位置を操作するポインタ位置操作キーである。１１
８はユーザがポインタ操作部１１５のボリューム１１６
およびポインタ位置操作キー１１７を操作することによ
り設定されるユーザ設定情報である。EXAMPLES way audio channel selection synthesis of the present invention Oyo
And components of the apparatus will be described with reference to FIG. In FIG. 1, 101 is an audio channel selection unit that selects an arbitrary audio channel from a plurality of provided audio channels, 102 is a voice synthesis unit that synthesizes the audio channels selected by the audio channel selection unit at an arbitrary synthesis ratio,
Reference numeral 103 is a speaker that outputs the synthesized voice. 1
Reference numeral 04 is an audio channel point selection unit that internally generates a corresponding layout relationship of the audio channel points 401 on the screen, and determines the audio channel to be selected based on the position of the pointer, and 105 controls the pointer displayed on the screen. A user operation control unit 107 is user operation information that conveys a pointer position operated by a viewer and parameters. Reference numeral 106 denotes a display that displays an image and a pointer. Reference numeral 108 is audio channel point arrangement information from the center and other center information, 110 is audio synthesis ratio information, 111 is a video signal, 112 is a plurality of audio channels provided, 113 is a pointer display signal, and 114 is an audio signal. Reference numeral 115 is a pointer operation unit,
Reference numeral 6 is a volume for operating parameters, and 117 is a pointer position operation key for operating the position of the pointer. 11
8 is the volume 116 of the pointer operation unit 115 by the user.
And user setting information set by operating the pointer position operation key 117.

【０００９】次に、ポインタについて説明する。図２は
ポインタを説明する図である。図２においては、説明の
都合上、パラメータＲをポインタの大きさに対応するも
のとして説明する。２０１は映像が映し出される画面を
示し、２０２は画面に重畳されるポインタを示す。３０
１はポインタ内の基準位置である中心位置、３０２はパ
ラメータＲにより決まるポインタの大きさである。一例
として、ポインタ２０２の形を円とし、パラメータＲに
より決まるポインタの大きさは円の半径に比例するもの
として説明する。ポインタ２０２の形は四角形その他の
多角形、楕円の如き他の形とすることができる。また、
中心３０１について対称ではない形とすることもでき
る。更に、パラメータＲによりポインタ２０２の形状が
非線形に変化をすることも考えられる。Next, the pointer will be described. FIG. 2 is a diagram for explaining the pointer. In FIG. 2, for convenience of explanation, the parameter R will be described as corresponding to the size of the pointer. Reference numeral 201 denotes a screen on which an image is displayed, and 202 denotes a pointer superimposed on the screen. Thirty
1 is a center position which is a reference position in the pointer, and 302 is a size of the pointer determined by the parameter R. As an example, it is assumed that the shape of the pointer 202 is a circle, and the size of the pointer determined by the parameter R is proportional to the radius of the circle. The shape of the pointer 202 may be a polygon such as a quadrangle or another shape such as an ellipse. Also,
It is also possible that the shape is not symmetrical about the center 301. Furthermore, the shape of the pointer 202 may change non-linearly depending on the parameter R.

【００１０】図３は画面とポインタの関係を説明する図
である。図３において、２０１は映像が映し出される画
面であり、図１に示される映像信号１１１が表示され
る。ここにおいては、一例として、野球中継のスタジア
ムが表示されている。画面にはポインタ２０２も重畳さ
れている。次に、図４を参照して複数提供される音声チ
ャンネルとその画面上における配置位置関係について説
明する。それぞれの音声チャンネルに対応する画面上に
おける対応位置を音声チャンネルポイントと称す。図４
は音声チャンネルポイント選択部１０４の内部に表現さ
れている音声チャンネルポイントの画面２０１における
位置関係を示す。画面２０１において４０１により示さ
れる黒点が音声チャンネルポイントを示す。音声チャン
ネルポイント４０１それぞれの配置位置は、センタ情報
１０８の内の音声チャンネルポイント配置情報として音
声チャンネルポイント選択部１０４に伝えられる。これ
により、それぞれの音声チャンネルポイント４０１の画
面上における対応配置関係が音声チャンネルポイント選
択部１０４の内部に生成される。FIG. 3 is a diagram for explaining the relationship between the screen and the pointer. In FIG. 3, 201 is a screen on which an image is displayed, and the image signal 111 shown in FIG. 1 is displayed. Here, as an example, a baseball relay stadium is displayed. A pointer 202 is also superimposed on the screen. Next, with reference to FIG. 4, a description will be given of a plurality of audio channels provided and a positional relationship on the screen. The corresponding position on the screen corresponding to each audio channel is called an audio channel point. Figure 4
Indicates the positional relationship on the screen 201 of the audio channel points expressed inside the audio channel point selection unit 104. Black dots indicated by 401 on the screen 201 indicate audio channel points. The arrangement position of each audio channel point 401 is transmitted to the audio channel point selection unit 104 as the audio channel point arrangement information in the center information 108. As a result, the corresponding layout relationship of each audio channel point 401 on the screen is generated inside the audio channel point selection unit 104.

【００１１】図４には一例として３５個の音声チャンネ
ルポイント４０１があり、それぞれが等間隔に配置され
ている。これらの音声チャンネルポイント４０１は表示
される映像に対応して不均一に配置することもできる。
ここで、この音声チャンネルポイント４０１とそれぞれ
の音声チャンネルは画面２０１に表示される映像の内容
と対応して配置される。これを音声チャンネルポイント
と画面の内容の関係を示す図１０を参照して説明する。
説明の都合上、図５に示される如く、音声チャンネルポ
イント４０１の位置を横軸ｘおよび縦軸ｙ座標により規
定し、音声チャンネルポイント４０１の位置を座標
（ｘ、ｙ）で示す。左下の音声チャンネルポイント４０
１の座標は（１、５）である。As an example, FIG. 4 shows 35 audio channel points 401, which are arranged at equal intervals. These audio channel points 401 may be arranged non-uniformly corresponding to the displayed video.
Here, the audio channel point 401 and each audio channel are arranged corresponding to the contents of the video displayed on the screen 201. This will be described with reference to FIG. 10 showing the relationship between audio channel points and screen contents.
For convenience of explanation, as shown in FIG. 5, the position of the audio channel point 401 is defined by the horizontal axis x and the vertical axis y coordinate, and the position of the audio channel point 401 is indicated by the coordinate (x, y). Lower left audio channel point 40
The coordinates of 1 are (1, 5).

【００１２】図１０に示される野球場からの中継が画面
２０１に表示されているとき、座標（４、５）の音声チ
ャンネルポイントにはホームベース付近で収録した音声
チャンネルに対応させる。そして、座標（４、１）の音
声チャンネルポイントにはスコアボード付近で収録した
音声の音声チャンネルを対応させる。他の音声チャンネ
ルポイントにも、それぞれ、映像の位置に対応した音声
の音声チャンネルを対応させる。次に、複数提供されて
いる音声チャンネルの内から視聴者が聞きたい音声チャ
ンネルを選択する仕方について説明する。When the relay from the baseball field shown in FIG. 10 is displayed on the screen 201, the voice channel points at coordinates (4, 5) are made to correspond to the voice channels recorded near the home base. Then, the audio channel point of the coordinates (4, 1) is associated with the audio channel of the audio recorded near the scoreboard. The audio channel of the audio corresponding to the position of the video is made to correspond to each of the other audio channel points. Next, a method of selecting an audio channel that the viewer wants to listen from among a plurality of audio channels provided will be described.

【００１３】視聴者は図１のポインタ操作部１１５のポ
インタ位置操作キー１１７を操作して自身が聞きたい位
置にポインタ２０２を移動する。次に、そのポインタ２
０２の大きさを決めるためにポインタボリューム１１６
を操作し、パラメータＲの値を変化させる。ポインタ操
作部１１５により設定されたポインタ２０２の位置情報
とパラメータＲはユーザ設定情報１１８としてユーザ操
作制御部１０５に伝送される。ユーザ操作制御部１０５
は、ポインタの位置情報とパラメータＲにより決まるポ
インタ表示信号１１３を映像およびポインタを表示する
ディスプレイ１０６に伝え、ポインタを画面２０１上に
表示する。The viewer operates the pointer position operation key 117 of the pointer operation unit 115 of FIG. 1 to move the pointer 202 to a position where he / she wants to hear. Then the pointer 2
Pointer volume 116 to determine the size of 02
Is operated to change the value of the parameter R. The position information of the pointer 202 and the parameter R set by the pointer operation unit 115 are transmitted to the user operation control unit 105 as user setting information 118. User operation control unit 105
Transmits the pointer display signal 113 determined by the position information of the pointer and the parameter R to the display 106 for displaying the image and the pointer, and displays the pointer on the screen 201.

【００１４】ユーザ操作情報１０７は、ユーザ操作制御
部１０５から音声チャンネルポイント選択部１０４にも
伝えられる。音声チャンネルポイント選択部１０４は、
ユーザ操作情報１０７に含まれるポインタ位置／大きさ
の情報と、伝送されてくるセンタ情報１０８の内の音声
チャンネルポイント配置情報とによりポインタ２０２と
音声チャンネルポイント４０１が図６に示される如くに
表現される。ここで、ポインタ２０２の位置と大きさに
対応して音声チャンネルポイント４０１が選択される。
この場合、ポインタ２０２の内側にある音声チャンネル
ポイントは座標（２、３）、（３、２）、（３、３）、
（３、４）および（４、３）の５個の点の音声チャンネ
ルポイントが選択されている。The user operation information 107 is also transmitted from the user operation control unit 105 to the audio channel point selection unit 104. The audio channel point selection unit 104
The pointer 202 and the audio channel point 401 are represented as shown in FIG. 6 by the pointer position / size information included in the user operation information 107 and the audio channel point arrangement information in the transmitted center information 108. It Here, the audio channel point 401 is selected according to the position and size of the pointer 202.
In this case, the audio channel points inside the pointer 202 are coordinates (2,3), (3,2), (3,3),
Five audio channel points (3, 4) and (4, 3) are selected.

【００１５】音声チャンネルポイント選択部１０４は、
センタ情報１０８の内の音声合成比率フィルタ情報によ
り、音声合成比率フィルタを生成する。図７は音声合成
比率フィルタを説明する図である。縦軸は合成比率であ
り、横軸はポインタの中心位置３０１と音声チャンネル
ポイント４０１の間の相対距離である。横軸上のＲ、−
ＲはパラメータＲにより決定されたポインタ２０２の大
きさを示す。当該音声チャンネルポイント４０１とポイ
ンタの中心位置３０１との間の相対距離に対応して音声
チャンネルの合成比率を決定する。The audio channel point selection unit 104 is
A voice synthesis ratio filter is generated based on the voice synthesis ratio filter information in the center information 108. FIG. 7 is a diagram for explaining the voice synthesis ratio filter. The vertical axis is the synthesis ratio, and the horizontal axis is the relative distance between the center position 301 of the pointer and the audio channel point 401. R on the horizontal axis,-
R indicates the size of the pointer 202 determined by the parameter R. The synthesis ratio of the audio channel is determined according to the relative distance between the audio channel point 401 and the center position 301 of the pointer.

【００１６】図７と同様の図である図８を参照して合成
比率の算出方法を説明する。図６に示される通りに選択
された計５個の座標（２、３）、（３、２）、（３、
３）、（３、４）および（４、３）の音声チャンネルポ
イントの内の一直線上に存在する座標（２、３）、
（３、３）、（４、３）の３つの音声チャンネルポイン
ト４０１について説明する。図８において横軸上に並ん
だ３個の黒丸は、それぞれ、座標（２、３）、（３、
３）、（４、３）の音声チャンネルポイントを示す。こ
れらの３個の音声チャンネルポイントはすべてポインタ
２０２の内側にあるのでこの特性上でそれぞれのポイン
トは相対位置−Ｒ〜＋Ｒの間に位置しており、この特性
より、それぞれのチャンネルポイントの相対位置に対応
する縦軸の合成比率を決定する。例えば、座標（２、
３）の音声チャンネルポイントの音声チャンネルに対す
る合成比率は、この特性の対応点から０．０５と決定す
る。同様に、座標（３、３）の音声チャンネルポイント
の合成比率は０．９５、座標（４、３）の音声チャンネ
ルポイントの合成比率は０．３０と決定する。A method of calculating the composition ratio will be described with reference to FIG. 8 which is the same as FIG. A total of five coordinates (2, 3), (3, 2), (3, selected as shown in FIG.
3), (3, 4) and coordinates (2, 3) existing on a straight line of the audio channel points of (4, 3),
The three audio channel points 401 (3, 3) and (4, 3) will be described. In FIG. 8, three black circles lined up on the horizontal axis represent coordinates (2, 3), (3,
3) and (4, 3) audio channel points are shown. Since these three audio channel points are all inside the pointer 202, each point is located between the relative positions −R to + R on this characteristic, and from this characteristic, the relative position of each channel point is determined. The composite ratio on the vertical axis corresponding to is determined. For example, coordinates (2,
The synthesis ratio of the audio channel point of 3) to the audio channel is determined to be 0.05 from the corresponding point of this characteristic. Similarly, the synthesis ratio of the audio channel points at coordinates (3, 3) is determined to be 0.95, and the synthesis ratio of the audio channel points at coordinates (4, 3) is determined to be 0.30.

【００１７】ここにおいては、音声合成比率フィルタの
特性として、ポインタ２０２の中心位置３０１と音声チ
ャンネルポイント４０１の間の相対距離に対応する１次
元の特性であるものとして説明したが、画面２０１の平
面（ｘ、ｙ）に対応する２次元の特性を使用することも
できる。上述の如くに決定した音声チャンネルの合成比
率は図９に示される如くに表示される。例えば、ポイン
タＲの外に位置する音声チャンネルポイント４０１に対
応する音声チャンネルは、これを合成しないので合成比
率は「０」とされる。Here, the characteristic of the voice synthesis ratio filter is described as a one-dimensional characteristic corresponding to the relative distance between the center position 301 of the pointer 202 and the voice channel point 401, but the plane of the screen 201 is described. It is also possible to use the two-dimensional characteristic corresponding to (x, y). The synthesis ratio of the audio channel determined as described above is displayed as shown in FIG. For example, since the audio channel corresponding to the audio channel point 401 located outside the pointer R is not synthesized, the synthesis ratio is "0".

【００１８】次に、各々の音声チャンネルの合成比率
は、音声合成比率情報１１０として音声チャンネル選択
部１０１および音声合成部１０２に伝送される。音声チ
ャンネル選択部１０１は、この音声合成比率に基づいて
合成比率が「０」以外の音声チャンネルを選択する。次
に、音声合成部１０２は、選択された音声チャンネルを
それぞれの音声合成比率に対応した比率で混合合成し再
生する。以上の実施例は野球中継その他の実況中継映像
について説明されたが、実況中継映像ではなくして例え
ばドラマの如き意図的に作成された映像、アニメーショ
ン、ＣＧその他の現実世界ではない仮想の世界の映像に
ついても同様に実施することができる。この実施例にお
いては、また、画面全体で一つの映像を映している素材
を使用して説明したが、一画面内に複数の映像が映し出
されているマルチ画面映像を素材にしたものについても
同様に実施することができる。図１４はマルチ画面映像
の一例を示す。図１４のマルチ画面映像は、映像が均等
に配置されるものである、不均一に配置されたマルチ画
面映像についても同様に実施することができる。Next, the synthesis ratio of each voice channel is transmitted to the voice channel selection unit 101 and the voice synthesis unit 102 as the voice synthesis ratio information 110. The voice channel selection unit 101 selects a voice channel with a synthesis ratio other than "0" based on this voice synthesis ratio. Next, the voice synthesizing unit 102 mixes and synthesizes the selected voice channels at a ratio corresponding to each voice synthesizing ratio, and reproduces. Although the above-described embodiments have been described with respect to baseball broadcasts and other live broadcast videos, they are not live broadcast videos but intentionally created images such as dramas, animations, CG, and other non-real world virtual world videos. Can be similarly implemented. In this embodiment, the material in which one image is displayed on the entire screen has been described, but the same applies to a material in which a plurality of images are displayed in one screen as a material. Can be carried out. FIG. 14 shows an example of a multi-screen image. The multi-screen image of FIG. 14 is one in which the images are evenly arranged, and can be similarly applied to the non-uniformly arranged multi-screen image.

【００１９】ここで、先のチャンネル選択に際して、音
声合成比率フィルタの特性を操作するパラメータＦを付
加する。以下、視聴者の操作について説明する。視聴者
は、図１のポインタ操作部１１５においてパラメータＦ
を操作する。操作されたパラメータの情報はユーザ操作
情報１０７として音声チャンネルポイント選択部１０４
に伝送される。音声チャンネルポイント選択部１０４に
おいてはこのパラメータの情報に基づいて音声合成比率
フィルタの特性を変化させる。この変化せしめれた音声
合成比率フィルタを使用し、それぞれの音声チャンネル
の合成比率を決定する。 Here, in selecting the previous channel, the sound
A parameter F for operating the characteristics of the voice synthesis ratio filter is attached.
Add The operation of the viewer will be described below. The viewer uses the parameter F on the pointer operation unit 115 of FIG.
To operate. The information on the operated parameter is used as the user operation information 107 by the audio channel point selection unit 104.
Be transmitted to. The voice channel point selection unit 104 changes the characteristics of the voice synthesis ratio filter based on the information of this parameter. This changed voice
Each audio channel using a synthesis ratio filter
Determine the composition ratio of .

【００２０】図１２および図１３は、パラメータＦによ
り変化する音声合成比率フィルタを示す。図１２に示さ
れる音声合成比率フィルタは、ポインタの中心において
合成比率が最大であり、周辺に近づくにつれて合成比率
が減少する例である。周辺における減少率をパラメータ
Ｆにより変化させている。図１３に示される音声合成比
率フィルタは、合成比率の最大になる位置をパラメータ
Ｆにより変化させる例である。視聴者は、パラメータＲ
およびパラメータＦの一方、或は双方を操作し設定す
る。12 and 13 show a voice synthesis ratio filter which changes according to the parameter F. The voice synthesis ratio filter shown in FIG. 12 is an example in which the synthesis ratio is maximum at the center of the pointer and the synthesis ratio decreases as it approaches the periphery. The reduction rate in the periphery is changed by the parameter F. The voice synthesis ratio filter shown in FIG. 13 is an example in which the position at which the synthesis ratio becomes maximum is changed by the parameter F. The viewer is the parameter R
And one or both of the parameters F are operated and set.

【００２１】図１１を参照してこの発明の実施例を説明
する。図１１は複数の視聴者が同時に同一の画面２０１
を視聴する音声チャンネル合成選択装置を示し、各視聴
者毎にポインタ操作部１１５およびスピーカ１０３を準
備してこれらを視聴者各自に独占させて、各視聴者は同
一の画面２０１を視聴しながら好みの音声を選択視聴す
ることができる。この場合、ポインタ操作部１１５、ユ
ーザ操作制御部１０５、音声チャンネルポイント選択部
１０４、音声チャンネル選択部１０１、音声合成部１０
２およびスピーカ１０３より成る音声チャンネル合成選
択装置を複数組具備し、それぞれのユーザ操作制御部１
０５毎にポインタＲを同一画面２０１上に表示しながら
選択操作する。視聴者は各自のポインタ操作部１１５を
操作して先の実施例と同様に音声チャンネルを選択す
る。視聴者の実行すべき操作手順は先の実施例と同様で
ある。 An embodiment of the present invention will be described with reference to FIG .
To do . In FIG. 11, a plurality of viewers simultaneously display the same screen 201.
Shows a voice channel synthesizing / selecting device for viewing, and prepares the pointer operation unit 115 and the speaker 103 for each viewer, and allows each viewer to monopolize them, and each viewer likes while viewing the same screen 201. You can select and listen to the sound of. In this case, the pointer operation unit 115, the user operation control unit 105, the audio channel point selection unit 104, the audio channel selection unit 101, the audio synthesis unit 10
2 and a plurality of audio channel synthesis / selection devices each including a speaker 103, and each user operation control unit 1
The selection operation is performed while displaying the pointer R for each 05 on the same screen 201. The viewer operates his or her pointer operation unit 115 to select the audio channel as in the previous embodiment. The operation procedure to be executed by the viewer is the same as in the previous embodiment.

【００２２】最後に、他の実施例を説明する。この実施
例は、センタ情報１０８の内に標準的なポインタ位置／
大きさ情報および音声合成率フィルタ情報を示す標準情
報を含める。ユーザ操作制御部１０５から格別のユーザ
操作情報１０７が伝送されない場合、音声チャンネル選
択部１０１は、この標準情報に基づいて音声チャンネル
ポイントを選択し、音声チャンネルを合成する。この様
にすることにより、映像の内容の変化に適合する標準の
ポインタ位置／大きさ情報、音声合成比率フィルタ情報
を形成してセンタ側の推賞する音声合成の状態を提供す
る。Finally, another embodiment will be described. In this embodiment, the standard pointer position /
Standard information indicating size information and voice synthesis rate filter information is included. When the particular user operation information 107 is not transmitted from the user operation control unit 105, the audio channel selection unit 101 selects an audio channel point based on this standard information and synthesizes the audio channel. By doing so, the standard pointer position / size information and voice synthesis ratio filter information suitable for the change of the video content are formed to provide the recommended voice synthesis state on the center side.

【００２３】以上の通り、例えば、野球中継などで、ス
タジアムの複数の音声チャンネルが同時に提供されたと
き、視聴者がその内から幾つかの好みの音声チャンネル
を選択し、それを適当な合成比率で合成し視聴すること
ができる。また、例えば、ホームベース近くのバッタの
周辺の音だけを聞きたい場合には、ポインタをホームベ
ースの近くに移動し、ポインタボリュームでポインタの
大きさを小さくすることにより実現できる。また、画面
全体の音声を満遍なく聞きたい場合にはポインタを画面
中央付近に移動し、ポインタの大きさを画面全体を覆う
ように調節することにより実現できる。また、ポインタ
の中心を主として聞きたい音声の位置に移動し、ポイン
タの大きさを画面全体を覆うようにすることにより、主
として聞きたい音声を大きな合成比率で聞き、画面全体
の音声も聞くことができる。このように、野球中継など
で、この発明を用いることにより、番組としての魅力が
高まる。As described above, for example, when a plurality of audio channels of the stadium are provided at the same time in a baseball relay broadcast, the viewer selects some of the favorite audio channels from the audio channels and selects a desired audio channel from them. It can be combined and viewed with. Further, for example, when it is desired to hear only the sound around the grasshopper near the home base, it can be realized by moving the pointer near the home base and reducing the size of the pointer with the pointer volume. Further, when it is desired to hear the sound of the entire screen evenly, it can be realized by moving the pointer near the center of the screen and adjusting the size of the pointer so as to cover the entire screen. Also, by moving the center of the pointer to the position of the voice that you want to hear and covering the entire screen with the size of the pointer, you can hear the voice that you want to hear with a large synthesis ratio and also hear the voice of the entire screen. it can. As described above, by using the present invention in a baseball relay broadcast or the like, the appeal as a program is enhanced.

【００２４】[0024]

【発明の効果】以上の通りであって、この発明に依れ
ば、視聴者は、多数の音声チャンネルが一つの番組に提
供されているときに映像中の好みの位置の音声チャンネ
ルを簡単に選択し、その音声チャンネルを適当な合成比
率で合成して視聴することができる。そして、ポインタ
を移動させることにより映像の任意の位置の音声を簡単
に選択することができる。また、ポインタのボリューム
を調節することにより映像中の聞きたい音声に対して自
由にズームイン、ズームアウトすることができる。更
に、音声合成比率フィルタの特性を操作することにより
音声合成比率を好みの比率に調整することができる。As described above, according to the present invention, a viewer can easily select an audio channel at a desired position in a video when a large number of audio channels are provided for one program. It is possible to select the audio channel, synthesize the audio channel with an appropriate synthesizing ratio, and view the audio channel. Then, by moving the pointer, it is possible to easily select the sound at an arbitrary position in the video. In addition, by adjusting the volume of the pointer, it is possible to freely zoom in and out on the desired sound in the video. Furthermore, the voice synthesis ratio can be adjusted to a desired ratio by operating the characteristics of the voice synthesis ratio filter.

【００２５】また、複数の視聴者が同時に同一の画面を
視聴している場合、複数のポインタと複数のスピカーを
用意することにより、それぞれの視聴者が、同一の画面
を視聴しているにもかかわらず、それぞれ、視聴者ごと
に好みの音声を視聴することができる。この様に、複数
の視聴者が一つの番組を視聴する時に番組の魅力を高め
ることができる。そして、センタから伝送される標準的
なポインタ位置／大きさ情報および音声合成比率フィル
タ情報を映像の内容に適合させるべく変化することによ
り、視聴者は何等の操作をすることなくして、センタ側
の設定する音声合成の状態を視聴することができる。When a plurality of viewers are watching the same screen at the same time, by preparing a plurality of pointers and a plurality of spikers, each viewer can watch the same screen. Regardless, it is possible to listen to the favorite voice for each viewer. In this way, it is possible to enhance the attractiveness of a program when a plurality of viewers watch one program. Then, by changing the standard pointer position / size information and the voice synthesis ratio filter information transmitted from the center so as to match the contents of the video, the viewer can perform operations on the center side without any operation. The state of voice synthesis to be set can be viewed.

[Brief description of drawings]

【図１】この発明の全体の構成を説明する図。FIG. 1 is a diagram illustrating the overall configuration of the present invention.

【図２】ポインタを説明する図。FIG. 2 is a diagram illustrating a pointer.

【図３】ポインタと画面の関係を説明する図。FIG. 3 is a diagram illustrating a relationship between a pointer and a screen.

【図４】音声チャンネルポイントと画面の関係を説明す
る図。FIG. 4 is a diagram illustrating a relationship between audio channel points and a screen.

【図５】音声チャンネルポイントの位置を説明する図。FIG. 5 is a diagram illustrating positions of audio channel points.

【図６】音声チャンネルポイントとポインタの関係を説
明する図。FIG. 6 is a diagram illustrating the relationship between audio channel points and pointers.

【図７】音声合成比率フィルタを説明する図。FIG. 7 is a diagram illustrating a voice synthesis ratio filter.

【図８】音声合成比率フィルタと音声チャンネルポイン
トの関係を示す図。FIG. 8 is a diagram showing a relationship between a voice synthesis ratio filter and voice channel points.

【図９】音声チャンネルと合成比率の関係を示す図。FIG. 9 is a diagram showing a relationship between audio channels and a synthesis ratio.

【図１０】音声チャンネルポイントと画面の内容の関係
を説明する図。FIG. 10 is a diagram illustrating a relationship between audio channel points and screen contents.

【図１１】複数のポインタを使用する実施例を説明する
図。FIG. 11 is a diagram illustrating an example of using a plurality of pointers.

【図１２】音声合成比率フィルタとパラメータＦの関係
を示す図。FIG. 12 is a diagram showing a relationship between a voice synthesis ratio filter and a parameter F.

【図１３】音声合成比率フィルタとパラメータＦの関係
を示す図。FIG. 13 is a diagram showing a relationship between a voice synthesis ratio filter and a parameter F.

【図１４】マルチ画面を示す図。FIG. 14 is a diagram showing a multi-screen.

[Explanation of symbols]

１０１音声チャンネル選択部１０２音声合成部１０３スピーカ１０４音声チャンネルポイント選択部１０５ユーザ操作制御部１０７ユーザ操作情報１１０音声合成比率情報１１３ポインタ表示信号１１５ポインタ操作部１１８ユーザ設定情報２０１画面４０１音声チャンネルポイント 101 Audio channel selection section 102 voice synthesizer 103 speaker 104 Audio channel point selector 105 user operation control unit 107 user operation information 110 Speech synthesis ratio information 113 Pointer display signal 115 Pointer operation unit 118 user setting information 201 screen 401 audio channel points

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) H04N 5/50 - 5/63 H04N 7/00 - 7/088 H04H 5/00 H04S 1/00 - 7/00 G10H 1/00 - 7/08 ─────────────────────────────────────────────────── ─── Continuation of front page (58) Fields surveyed (Int.Cl. ⁷ , DB name) H04N 5/50-5/63 H04N 7/00-7/088 H04H 5/00 H04S 1/00-7 / 00 G10H 1/00-7/08

Claims

(57) [Claims]

1. A voice channel selection / synthesis method for displaying a video on a screen and synthesizing and playing back voices of a channel selected from a plurality of voice channels at an arbitrary ratio, in a voice channel point indicating a voice channel position. And a position on the screen are associated with each other, and a plurality of pointers for selecting an audio channel point are provided, and these pointers are separately superimposed and displayed on the screen. For each different pointer, the reference within the displayed pointer is displayed. A method for selecting and synthesizing a voice channel, characterized in that a voice synthesizing ratio of a voice channel is determined based on a relative distance between a position and a selected voice channel point.

2. The audio channel selection / synthesis method according to claim 1, wherein the size of the pointer is operated.

3. The audio channel selection / synthesis method according to claim 1, wherein the audio channel is based on the relative distance between the reference position in the pointer and the selected audio channel point. A method for selecting and synthesizing voice channels, characterized by changing the voice synthesis ratio of the voice.

4. A user setting information for operating a pointer position in an audio channel selection synthesizing device for displaying a video on a screen and synthesizing and reproducing audio of a channel selected from a plurality of audio channels at an arbitrary ratio. A user operation control unit for generating a pointer display signal indicating a pointer position to be displayed on the screen based on user setting information and generating user operation information indicating the set pointer position. And a voice channel point selection unit for internally generating a corresponding layout relation of the voice channel points on the screen, and generating voice synthesis ratio information indicating a voice channel and a voice synthesis ratio to be selected based on the position of the pointer. However, it is possible to select from multiple audio channels provided based on the audio synthesis ratio information. A voice channel selecting unit for selecting the voice channel of, a voice synthesizing unit for synthesizing the voice channels selected by the voice channel selecting unit at the above voice synthesizing ratio, and a speaker for uttering a synthesized voice, , The pointer operation unit, the user operation control unit,
A plurality of sets of the audio channel point selection unit, the audio channel selection unit, the audio synthesis unit, and the speaker are provided, and each pointer of the plurality of user operation control units
The audio channel selecting / synthesizing device is characterized in that these groups are operated separately while displaying on the same screen .

5. The audio channel selection / synthesis device according to claim 4, wherein the pointer operation unit has a configuration for changing the shape of the pointer.

6. The voice channel selection / synthesis apparatus according to claim 4, wherein the voice channel point selection unit has a configuration for changing a voice synthesis ratio filter that defines a voice synthesis ratio. An audio channel selection / synthesis device characterized in that