JPH0259000A

JPH0259000A - Sound image static reproducing system

Info

Publication number: JPH0259000A
Application number: JP63210975A
Authority: JP
Inventors: Naofumi Inmaki; 印牧　直文; Fumio Kishino; 岸野　文郎
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: NTT Inc
Priority date: 1988-08-24
Filing date: 1988-08-24
Publication date: 1990-02-28
Anticipated expiration: 2012-08-25
Also published as: JP2645731B2

Abstract

PURPOSE:To improve both the attendant feeling and the directional feeling of sounds by controlling a sound image static for a frequency band from which the perception of an auditory direction can be obtained using plural speakers, reproducing the sounds from the plural speakers, executing no control for the sound image static for the other frequency bands, and reproducing the sounds using a single speaker. CONSTITUTION:Voices related to the second frequency band 22 and the third frequency band 23 to perceive the auditory directional feeling are reproduced using the plural speakers while sound image static control is executed. On the other hand, voices and environmental sounds related to the first frequency band 21 and the fourth frequency band 24 not to perceive the directional feeling are reproduced from the single speaker provided near a television set, etc. Consequently, since the plural speakers are never used when the voices of the first and fourth frequency bands 21 and 24 are to be reproduced, used, the blur of the sounds caused by plural sound sources never occurs, and the accuracy of the sound image static of the second and third frequency bands 22 and 23 to obtain the perception of the direction can be improved. Thus, both the attendant feeling and the directional feeling of the sounds can be improved in a communication conference.

Description

【発明の詳細な説明】「産業上の利用分野」この発明は映像を用いた通信会議（：おける音１象定位
再生システムにおいて、聴覚上方向知覚が生じる周波数
帯域に着目し、その着目周波数帯域の音声：；ついては
音像定位制御を行い、それ以外の周波数帯域の音声Ｃ二
ついては音像定位制御を行わず、その再生位置をモニタ
等の映像部に近接するよう（；配置する音像定位再生方
式に関するものである。[Detailed Description of the Invention] "Industrial Application Field" This invention focuses on the frequency band in which auditory upward direction perception occurs in a sound one-image localization reproduction system in a communication conference using video. Regarding the sound image localization playback method, the sound image localization control is performed for the audio C in the other frequency bands, and the sound image localization control is not performed for the two audio C in the other frequency bands, and the playback position is placed close to the video part of the monitor etc. It is something.

「従来の技術」第５図は従来のテレビ会議用音像定位再生システムの例
を示している。同図Ａは聴取者１１に対してモニタ１２
の前面中央（ニスピー力１３Ｃを設置する方法でモノラ
ル音で再生する。同図Ｂはモニタ１２の前面両サイド（
；スピーカ１３Ｌ、１３Ｒを設置しステレオ音（２チヤ
ネル）を再生する。"Prior Art" FIG. 5 shows an example of a conventional sound image localization reproduction system for video conferences. In the same figure, A shows the monitor 12 for the listener 11.
The front center of the monitor 12 (monaural sound is played by installing the Nispy Power 13C. Figure B shows the front center of the monitor 12 on both sides (
; Install speakers 13L and 13R to reproduce stereo sound (2 channels).

同図Ｃはモニタ１２の前面、中央と両サイド（ニスピー
力１３Ｃ，１３Ｌ、１３Ｒを設置し３チヤネルステレオ
音を再生する。同図りは４チヤネルステレオ音を再生す
る場合で、同図Ｃに対して聴取者１１の後方（ニスピー
力１３Ｂを設置する点が異なる。通信会議における音の
臨場感を向上させる観点からＢ−Ｄのシステム構成が有
効である。Figure C shows the front, center, and both sides of the monitor 12 (Nispee power units 13C, 13L, and 13R are installed to reproduce 3-channel stereo sound. This figure shows the case where 4-channel stereo sound is reproduced. The difference is that the Nispee force 13B is installed behind the listener 11.The system configuration of B-D is effective from the viewpoint of improving the sense of presence of sound in a teleconference.

このうち同図Ｂ（２関しては例えばモニタ１２（＝映さ
れた３人の横ならびの人物１４において聴取者１１が両
サイドの人物の声に比して中央の人物の声をはっきりと
した方向感をもって知覚しにくい（音ｆ象定位感が低い
）という問題点がある。この点に対処する方法として、
同図Ｃ，Ｄに示すよう（２両サイドのスピーカ１３Ｌ、
１３Ｒの間（；センタ用スピーカ１３Ｃを設置し、通信
会議（：おける音の臨場感とともに方向感の再生向上を
図っている。For example, regarding Figure B (2), the listener 11 heard the voice of the person in the center clearly compared to the voices of the people on both sides of the monitor 12 (= three people 14 displayed side by side). There is a problem that it is difficult to perceive with a sense of direction (low sense of sound localization).As a way to deal with this point,
As shown in C and D of the same figure (speakers 13L on both sides,
A center speaker 13C is installed between 13R and 13C to improve the sense of direction as well as the sense of presence of the sound during the communication conference.

左サイド、センタ、右サイドの３種のスピーカを用いる
従来のシステム（図Ｃ，Ｄ）では各スピーカともに全周
波数帯域の音声を対象にして音像定位を行う。この全周
波数帯域を対象にすることが問題となっている。即ち人
間の聴覚能力上、方向知覚が得られる周波数帯域は限ら
れており、その周波数帯域の音声に対して複数のスピー
カを用いて音像定位制御を行うことは、方向感の再生向
上を図る上で効果的である反面、方向知覚が得られない
周波数帯域については、複数のスピーカを用いると点音
源が複数個存在するため、再生音に「ぼけ」が生じ、こ
れによって方向知覚が得られる帯域の音像定位の精度を
低下させるという第１の問題がある。In a conventional system using three types of speakers (left side, center, and right side) (Figures C and D), each speaker performs sound image localization for audio in all frequency bands. Targeting this entire frequency band has become a problem. In other words, due to human auditory ability, the frequency band in which direction perception can be obtained is limited, and performing sound localization control using multiple speakers for sounds in that frequency band is an effective way to improve the reproduction of direction sense. On the other hand, in the frequency band where direction perception cannot be obtained, when multiple speakers are used, there are multiple point sound sources, resulting in "blur" in the reproduced sound, which makes it difficult to obtain direction perception. The first problem is that the accuracy of sound image localization is reduced.

ところでテレビ会議では映し出された人物の口元にその
人物の声、即ち音像が引き寄せられるという相互作用が
知られている。方向知覚が得られない周波数帯域の音声
は本来感覚的には音像定位し難いものであるが、上記映
像（＝よる相互作用を高めることによって見かけ上疑似
的に映像の人物の口元に音像を定位させる方法が考えら
れる。上記映（象による相互作用を高めるためには、テ
レビに映し出された人物の口元（：近い位置に一つの音
源を設置することがより効果的である。ところが従来の
システムでは前述したように方向知覚が得られない周波
数帯域の音声を複数個の音源から再生するため、上記相
互作用を効果的に活用できないという第２の問題がある
。By the way, in a video conference, it is known that there is an interaction in which the voice of the person being projected, that is, the sound image, is attracted to the mouth of the person being projected. Sound in a frequency band where direction perception cannot be obtained is originally difficult to intuitively localize, but by increasing the interaction with the image above, it is possible to artificially localize the sound image to the mouth of the person in the image. In order to enhance the interaction with the elephant, it is more effective to place one sound source close to the mouth of the person shown on the TV.However, in the conventional system, Then, as mentioned above, since the sound in the frequency band in which direction perception cannot be obtained is reproduced from a plurality of sound sources, there is a second problem that the above-mentioned interaction cannot be effectively utilized.

この発明の目的は方向知覚が得られる周波数帯域の音声
については位相差や音圧レベル差を活用して音像な定位
させるとともに、方向知覚が得られない周波数帯域の音
声については、映像による相互作用を活用して疑似的（
２映像部に音像を定位し易くさせることをねらいとし、
上記従来の欠点を除去するため、聴覚上方向知覚が生じ
る周波数帯域とそれ以外の周波数帯域とに分割し、前者
の帯域の音声については複数スピーカを用いて音像定位
の制御を行い、後者の帯域の音声については音像定位の
制御を行わずモニタ等の映ｒ象部に近接した単一のスピ
ーカを用いて再生する音像定位再生方式を提供すること
にある。The purpose of this invention is to use phase differences and sound pressure level differences to localize sounds in frequency bands where direction perception can be obtained, and to localize sound images in frequency bands where direction perception cannot be obtained, and to localize sound images in frequency bands where direction perception cannot be obtained. Using pseudo (
The aim is to make it easier to localize the sound image in the 2-video section,
In order to eliminate the above-mentioned drawbacks of the conventional method, the frequency band is divided into a frequency band where upward auditory perception occurs and other frequency bands, and sound image localization is controlled using multiple speakers for audio in the former band, and sound image localization in the latter band is controlled using multiple speakers. The object of the present invention is to provide a sound image localization reproduction method in which sound is reproduced using a single speaker located close to an image area such as a monitor without controlling sound image localization.

「課題を解決するための手段」この発明によれば音像定位生成の対象となる複数個の入
力チャネルの各オーディオ信号に対して周波数帯域が分
割され、その分割された第１の周波数帯域のオーディオ
信号を再生チャネルに出力する際に音圧レベルが調整さ
れ、上記分割された第２の周波数帯域のオーディオ信号
に対して音像定位を生成するため（−再生チャネル間に
位相差（時間差）と音圧レベル差が生じるように位相と
レベルとが調整され、上記分割された第３の周波数帯域
のオディオ信号に対して音像定位を生成するために再生
チャネル間に音圧レベル差が生じるよう≦；レベルが調
整され、入力チャネルと再生チャネルとの対応データと
音像定位の距離感、方向感等を規定する定位データ等が
設定され、聴覚上の方向知覚が得られる第２と＄３の周
波数帯域に対して複数スピーカを用いて音１象定位が制
御され、方向知覚が得られない第１の周波数帯域に対し
ては音像定位を行わず単一のスピーカを用いて再生され
る。"Means for Solving the Problem" According to the present invention, a frequency band is divided for each audio signal of a plurality of input channels to be generated for sound image localization, and the audio of the divided first frequency band is divided. The sound pressure level is adjusted when the signal is output to the playback channel, and the sound image localization is generated for the audio signal of the divided second frequency band (- phase difference (time difference) and sound The phase and level are adjusted so that a pressure level difference is generated, and a sound pressure level difference is generated between reproduction channels in order to generate sound image localization for the audio signal of the divided third frequency band. The level is adjusted, the correspondence data between the input channel and the playback channel, and the localization data that defines the sense of distance and sense of direction for sound image localization are set, and the second and third frequency bands are used to obtain an auditory sense of direction. For the first frequency band in which direction perception cannot be obtained, the sound image localization is controlled using a plurality of speakers, and the sound image localization is not performed and the sound is reproduced using a single speaker.

周波数分割は次のように行われる。すなわち第１図は周
波数帯域分割（２関するこの発明の特徴を示す例である
。人間の可聴範囲１５は２０　Ｈｚ〜２０．０ＯＯＨｚ
の周波数帯域である。テレビ会議・音声会議等の通信会
議では人間の発声範囲１６である１　００　Ｈｚ〜８．
　ＯＯＯＨｚの周波数帯域に関する音声の音像定位を如
何（：精度良く、かつ忠実性を高めて再生させるかが特
に重視されている。ところが音の受は手である聴取者１
１の方向知覚能力は人間の発声範囲１６（２比してより
小さい周波数帯域であることが知られている。境、中山
著：聴覚と音響心理（日本音響学全編、１９７８　、コ
ロナ社、ｐｐ１８５−１９５）の文献によれば、人間の
聴覚上の方向知覚能力に関しては２種の知覚方法があり
、第１の方法は２つの音の間の位相差（時間差）に基づ
いて方向感を知覚するもので、この知覚範囲１７は２０
０　Ｈｚ　〜１．３００　Ｈｚと言われている。第２の
方法は２つの音の間の音圧レベル差に基づいて方向感を
知覚するもので、この知覚範囲１８は２００　Ｈｚ　〜
５．　ＯＯＯＨｚと言われている。Frequency division is performed as follows. That is, FIG. 1 is an example showing the features of this invention regarding frequency band division (2).The human audible range 15 is 20 Hz to 20.000 Hz.
frequency band. In communication conferences such as video conferences and audio conferences, the range of human speech ranges from 100 Hz to 8.
Particular emphasis is placed on how to localize the sound image of audio in the OOOHz frequency band (: how to reproduce it with high precision and high fidelity.However, listener 1, whose hands are used to receive the sound,
It is known that the direction perception ability of No. 1 is within the human vocal range of 16 (smaller frequency band than No. 2). Sakai, Nakayama: Hearing and Psychoacoustics (Japan Acoustics Complete Edition, 1978, Coronasha, pp185) According to the literature (195), there are two types of perceptual methods regarding the human auditory ability to perceive direction.The first method is to perceive a sense of direction based on the phase difference (time difference) between two sounds. This perception range 17 is 20
It is said to be between 0 Hz and 1.300 Hz. The second method is to perceive a sense of direction based on the sound pressure level difference between two sounds, and this perception range 18 is from 200 Hz to
5. It is said to be OOOOHz.

図中で示すように音像定位制御の観点から周波数帯域を
分割・整理すると人間の可聴範囲１５は方向感を知覚し
ない音声と環境音とに関する低域の第１の周波数帯域２
１と、位相差（時間差）と音圧レベル差とを混在して方
向感を知覚する第２の周波数帯域２２と、音圧レベル差
だけで方向感を知覚する第３の周波数帯域２３と、方向
感を知覚しない音声と環境音とに関する高域の第４の周
波数帯域２４とに大別される。As shown in the figure, when frequency bands are divided and organized from the perspective of sound image localization control, the human audible range 15 is a low-frequency first frequency band 2 related to voices that do not perceive a sense of direction and environmental sounds.
1, a second frequency band 22 in which a sense of direction is perceived by mixing a phase difference (time difference) and a sound pressure level difference, and a third frequency band 23 in which a sense of direction is perceived only by a sound pressure level difference. It is broadly divided into a high-frequency fourth frequency band 24 related to sounds that do not give a sense of direction and environmental sounds.

この発明は聴覚上方向感を知覚する第２の周波数帯域２
２と第３の周波数帯域２３とに関する音声を複数スピー
カを用いて音像定位制御を行いつつ再生し、他方方向感
を知覚しない第１の周波数帯域２１と第４の周波数帯域
２４Ｃ二関する音声・環境音をテレビ等に近接する単一
スピーカから再生し、映像による相互作用によって疑似
的に映像部に音像を定位し易くさせる。This invention provides a second frequency band 2 for perceiving an auditory sense of direction.
2 and the third frequency band 23 are reproduced while performing sound image localization control using a plurality of speakers, and on the other hand, the sound and environment related to the first frequency band 21 and the fourth frequency band 24C are reproduced without perceiving a sense of direction. Sound is reproduced from a single speaker close to a television or the like, and the sound image is simulated to be easily localized in the video section through interaction with the video.

「実施例」第２図はこの発明の特徴を示す再生システム例である。"Example" FIG. 2 is an example of a playback system showing the features of this invention.

モニタ１２の周囲に右側、左側、上側、下側にそれぞれ
スピーカキャビネット２５が配置されていて２チヤネル
ステレオ音を用いた音像定位再生システムの例を示して
いる。第１の周波数帯域２１の音は下側スピーカ２６か
ら、第４の周波数帯域２４の音は上側スピーカ２７から
単一スピーカで再生される。第２の周波数帯域２２の音
は左右のスピーカ２８Ｌとスピーカ２８Ｒを用いて位相
差と音圧レベル差を調整して音像を定位させる。また第
３の周波数帯域２３の音は左右のスピーカ２９Ｌとスピ
ーカ２９Ｒを用いて、音圧レベル差を調整して音像を定
位させる。Speaker cabinets 25 are arranged around the monitor 12 on the right side, left side, upper side, and lower side, respectively, showing an example of a sound image localization reproduction system using two-channel stereo sound. The sound of the first frequency band 21 is reproduced from the lower speaker 26, and the sound of the fourth frequency band 24 is reproduced from the upper speaker 27 using a single speaker. The sound image of the second frequency band 22 is localized by adjusting the phase difference and the sound pressure level difference using the left and right speakers 28L and 28R. Further, the sound in the third frequency band 23 is localized using the left and right speakers 29L and 29R to adjust the sound pressure level difference.

第３図はこの発明の実施例の構成を示すブロック図であ
って、２６は第１の周波数帯域用スピーカ、２　ｓＬ、
　２８Ｒは第２の周波数帯域用スピーカ（２８Ｌ：第１
再生チヤネル用、２８Ｒ＝第２再生チヤネルＦＦ＋　’
、　Ｎ　２９Ｌ　＋　２９Ｒは第３の周波数帯域用スピ
ーカ（２ｇ７．：第１再生チヤネル用、２９Ｒ：第２再
生チヤネル用）、２７は第４の周波数帯域用スピーカ、
７０は入力端子、１００は低域音圧レベル調整部、２０
０は位相差・音圧レベル差混在音像定位調整部、３００
は音圧レベル差音像定位調整部、４００は高域音圧レベ
ル調整部、５００は周波数帯域分割部、６００は再生チ
ャネル用音像定位初期設定部、７００は制御部である。FIG. 3 is a block diagram showing the configuration of an embodiment of the present invention, in which 26 is a first frequency band speaker, 2 sL,
28R is the second frequency band speaker (28L: first
For playback channel, 28R = 2nd playback channel FF+'
, N 29L + 29R is the third frequency band speaker (2g7.: for the first reproduction channel, 29R: for the second reproduction channel), 27 is the fourth frequency band speaker,
70 is an input terminal, 100 is a low frequency sound pressure level adjustment section, 20
0 is a phase difference/sound pressure level difference mixed sound image localization adjustment unit, 300
400 is a high frequency sound pressure level adjustment section; 500 is a frequency band division section; 600 is a reproduction channel sound image localization initial setting section; 700 is a control section.

これを動作するＣ：は、制御部７００の指令により、入
力端子７０から転送されてくるｍ個の入力チャネルのオ
ーディオ信号に対して周波数帯域分割部５００は再生チ
ャネル用音像定位初期設定部６００から指定される第１
から第４までの周波数帯域２１〜２４の各帯域設定デー
タに基づいて各チャネルのオーディオ信号について帯域
分割し、第１の周波数帯域２１のオーディオ信号を低域
音圧レベル調整部１００へ、第２の周波数帯域２２のオ
ーディオ信号を位相差・音圧レベル差混在音像定位調整
部２００へ、第３の周波数帯域２３のオーディオ信号を
音圧レベル差音像定位調整部３００へ、第４の周波数帯
域２４のオーディオ信号を高域音圧レベル調整部４００
へ転送する。In C:, which operates this, the frequency band dividing unit 500 selects the audio image localization initial setting unit 600 for the reproduction channel for the audio signals of m input channels transferred from the input terminal 70 according to a command from the control unit 700. The first specified
The audio signal of each channel is divided into bands based on the band setting data of the frequency bands 21 to 24 from 1 to 4, and the audio signal of the first frequency band 21 is sent to the low frequency sound pressure level adjustment section 100, The audio signal in the frequency band 22 is sent to the phase difference/sound pressure level difference mixed sound image localization adjustment section 200, the audio signal in the third frequency band 23 is sent to the sound pressure level difference sound image localization adjustment section 300, and the audio signal in the fourth frequency band 24 is sent to the sound image localization adjustment section 300. The high frequency sound pressure level adjustment section 400 adjusts the audio signal of
Transfer to.

低域音圧レベル調整部１００は再生チャネル用音障定位
初期設定部６００からの指定＆：従ってｍチャネルのオ
ーディオ信号の加算を行った加算オーディオ信号又はｍ
チャネルの中より選択抽出した選択オーディオ信号に対
して音圧レベルを調整し、第１の周波数帯域用スピーカ
２６より再生させる。The low-frequency sound pressure level adjustment unit 100 receives the specification from the reproduction channel sound disturbance localization initial setting unit 600 &: Therefore, the added audio signal obtained by adding the audio signals of m channels or m
The sound pressure level of the selected audio signal selected and extracted from the channels is adjusted, and the sound pressure level is adjusted and the audio signal is reproduced from the first frequency band speaker 26.

位相差・音圧レベル差混在音像定位調整部２００は再生
チャネル用音像定位初期設定部６００からの指定に従っ
て、ｍ個のチャネルのオーディオ信号を加算又は選択し
てｎ個の再生チャネルに変換する際に、各再生チャネル
のオーディオ信号に対して位相、音圧レベルの調整を行
うとともにその再生チャネル間の位相差、音圧レベル差
を調整してｎ個の第２の周波数帯域用スピーカ２８Ｌ、
２８Ｒより再生させる。When the phase difference/sound pressure level difference mixed sound image localization adjustment section 200 adds or selects the audio signals of m channels and converts them into n playback channels, according to the specification from the reproduction channel sound image localization initial setting section 600. The phase and sound pressure level of the audio signal of each reproduction channel are adjusted, and the phase difference and sound pressure level difference between the reproduction channels are adjusted to produce n second frequency band speakers 28L,
Play from 28R.

音圧レベル差音像定位調整部３００は再生チャネル用音
像定位初期設定部６００からの指定に従って、ｍ個のチ
ャネルのオーディオ信号を加算又は選択してｎ個の再生
チャネルに変換する際に、各再生チャネルのオーディオ
信号に対して音圧レベルの調整を行うとともＣ二、その
再生チャネル間の音圧レベル差を調整してｎ個の第３の
周波数帯域用スピーカ２９Ｌ　、　２９Ｒより再生させ
る。The sound pressure level difference sound image localization adjustment unit 300 adds or selects the audio signals of m channels and converts them into n playback channels according to the specifications from the playback channel sound image localization initial setting unit 600. The sound pressure level of the audio signal of the channel is adjusted, and the sound pressure level difference between the reproduction channels is adjusted, and the audio signal is reproduced from the n third frequency band speakers 29L and 29R.

高域音圧レベル調整部４００は再生チャネル用音像定位
初期設定部６００からの指定に従って、ｍ個のチャネル
のオーディオ信号の加算を行った加算オーディオ信号又
はｍ個のチャネルの中より選択抽出した選択オーディオ
信号に対して音圧レベルを調整し、第４の周波数帯域用
スピーカ２７より再生させる。The high-frequency sound pressure level adjustment unit 400 generates a summed audio signal obtained by adding audio signals of m channels or a selection selected and extracted from m channels according to the specification from the reproduction channel sound image localization initial setting unit 600. The sound pressure level of the audio signal is adjusted, and the audio signal is reproduced from the fourth frequency band speaker 27.

再生チャネル用音隊定位初期設定部６００は、前述した
ように分割帯域の具体的設定データ、第１又は第４の周
波数帯域に関する音圧レベル値、へカチャネルｍ個に対
する再生チャネルのｎの値、第２又は第３の周波数帯域
に関する位相値、音圧レベル値等を記憶し、これらの値
を各ブロック部の初期設定に際して各ブロックへ転送す
る。As described above, the playback channel sound corps localization initial setting unit 600 includes specific setting data of the divided bands, sound pressure level values regarding the first or fourth frequency band, the value of n of the playback channel for m Heka channels, Phase values, sound pressure level values, etc. regarding the second or third frequency band are stored, and these values are transferred to each block upon initial setting of each block section.

第４図はこの発明の多地点間テレビ会議システムの応用
例である。ｍ個の地点から別々の人物映像信号と音声信
号が通信網６０を介して、受信端末の地点８０にｍ個の
信号として転送される。受信端末ではｍ人の人物映像を
テレビ画面に映し出すとともに音像定位制御を行う。こ
の制御【＝際しては第１と第４の周波数帯域２１．２４
の音声は単一スビーカ２６．２７より再生され、第２と
第４の周波数帯域２２．２３の音声はチャネル毎にペア
にして複数のスピーカ（２８ＬＦ　、２９ＬＦ）、（２
８ＲＦ、２９ＲＦ）、（２８ＬＢ、２９ＬＢ）、（２８
ＲＢ、２９ＲＢ）より再生される。FIG. 4 shows an application example of the multipoint video conference system of the present invention. Separate person video signals and audio signals are transferred from m points via the communication network 60 to the receiving terminal point 80 as m signals. The receiving terminal displays images of m people on a television screen and performs sound image localization control. This control [= in some cases, the first and fourth frequency bands 21.24
The audio of the second and fourth frequency bands 22.23 is played back by a single speaker 26.27, and the audio of the second and fourth frequency bands 22.23 is played back by multiple speakers (28LF, 29LF), (2
8RF, 29RF), (28LB, 29LB), (28
RB, 29RB).

尚ｍ＝ｎの場合で、２チヤネル、３チヤネル、４チヤネ
ル等の既存のステレオ音を第１〜第４の周波数帯域に分
割して、位相、音圧レベルの調整無しでそれぞれのスピ
ーカから再生されるテレビ会議システム等もこの発明に
含まれる。In the case of m=n, the existing stereo sound of 2 channels, 3 channels, 4 channels, etc. can be divided into the 1st to 4th frequency bands and played from each speaker without adjusting the phase or sound pressure level. This invention also includes a video conference system and the like.

「発明の効果」以上説明したよう（−１この発明（；よる音像定位再生
方式（：よれば聴覚上方向知覚が生じる周波数帯域とそ
れ以外の周波数帯域とに分割し、前者の帯域の音声につ
いては複数スピーカを用いて音像定位の制御を行い、後
者の帯域の音声については音像定位の制御を行わずテレ
ビ等の映像部に近接した単一のスピーカを用いて再生す
ることから、方向知覚が得られない第１と第４の周波数
帯域の音声を再生する際、複数スピーカを用いないため
複数の音源による音の「ぼけ」が生じず、方向知覚が得
られる第２と第３の周波数帯域の音像定位の精度が向上
できるという利点がある。また方向知覚が得られない第
１と第４の周波数帯域の音声に関して単一スピーカで再
生しテレビに近接させるため、映像による相互作用によ
って見かけ上疑似的に映像部に音像が定位し易いという
利点がある。"Effects of the Invention" As explained above (-1) This invention (;According to the sound image localization reproduction method (:), the sound image localization reproduction method (:) divides the frequency band into a frequency band in which upward auditory perception occurs and the other frequency bands, and the sound in the former band is The sound image localization is controlled using multiple speakers, and the sound in the latter band is played back using a single speaker located close to the video part of the TV or other device without controlling the sound image localization, which results in poor direction perception. When reproducing audio in the first and fourth frequency bands that cannot be obtained, multiple speakers are not used, so the "blurring" of the sound due to multiple sound sources does not occur, and the second and third frequency bands allow direction perception to be obtained. This has the advantage that the accuracy of sound image localization can be improved.Also, since the audio in the first and fourth frequency bands, where direction perception cannot be obtained, is played back by a single speaker and placed close to the TV, the interaction with the video makes the appearance appear different. This has the advantage that it is easy to localize the sound image in the video area in a pseudo manner.

[Brief explanation of the drawing]

第１図は周波数帯域分割（：関するこの発明の特徴を示
す図、第２図はこの発明の特徴を示す再生システム例を
示すブロック図、第３図はこの発明の実施例の構成を示
すブロック図、第４因はこの発明の多地点間テレビ会議
システｌの応用例を示す図、第５図は従来のテレビ会議用音像定位再生システムの各種
例を示す図である。Fig. 1 is a diagram showing the features of this invention regarding frequency band division (: Fig. 2 is a block diagram showing an example of a reproduction system showing the features of this invention, and Fig. 3 is a block diagram showing the configuration of an embodiment of this invention. Figure 4 is a diagram showing an application example of the multipoint video conference system 1 of the present invention, and Figure 5 is a diagram showing various examples of a conventional sound image localization reproduction system for video conferences.

Claims

[Claims]

(1) A means for dividing a frequency band for each audio signal of a plurality of input channels to be subjected to sound image localization generation, and a means for outputting the divided first frequency band audio signal to a reproduction channel. means for adjusting the sound pressure level; and means for adjusting the audio signal of the divided second frequency band so that a phase difference (time difference) and a sound pressure level difference are generated between the reproduction channels in order to generate sound image localization. means for adjusting the phase and level; and means for adjusting the level so that a sound pressure level difference is generated between the reproduction channels in order to generate sound image localization for the audio signal of the divided third frequency band. , comprising means for setting correspondence data between the input channel and the reproduction channel and localization data that defines the sense of distance, sense of direction, etc. of sound image localization, and the second and second channels are provided with means for setting localization data, etc. that define the sense of distance, sense of direction, etc. of sound image localization, and the second and second The sound image localization is controlled using multiple speakers for the frequency band No. 3, and the above-mentioned No.
A sound image localization reproduction method characterized by using a single speaker for reproduction without performing sound image localization for the frequency band.