[go: up one dir, main page]

JPH0259000A - Sound image static reproducing system - Google Patents

Sound image static reproducing system

Info

Publication number
JPH0259000A
JPH0259000A JP63210975A JP21097588A JPH0259000A JP H0259000 A JPH0259000 A JP H0259000A JP 63210975 A JP63210975 A JP 63210975A JP 21097588 A JP21097588 A JP 21097588A JP H0259000 A JPH0259000 A JP H0259000A
Authority
JP
Japan
Prior art keywords
frequency band
sound
sound image
image localization
pressure level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP63210975A
Other languages
Japanese (ja)
Other versions
JP2645731B2 (en
Inventor
Naofumi Inmaki
印牧 直文
Fumio Kishino
岸野 文郎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NTT Inc
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP63210975A priority Critical patent/JP2645731B2/en
Publication of JPH0259000A publication Critical patent/JPH0259000A/en
Application granted granted Critical
Publication of JP2645731B2 publication Critical patent/JP2645731B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Stereophonic System (AREA)

Abstract

PURPOSE:To improve both the attendant feeling and the directional feeling of sounds by controlling a sound image static for a frequency band from which the perception of an auditory direction can be obtained using plural speakers, reproducing the sounds from the plural speakers, executing no control for the sound image static for the other frequency bands, and reproducing the sounds using a single speaker. CONSTITUTION:Voices related to the second frequency band 22 and the third frequency band 23 to perceive the auditory directional feeling are reproduced using the plural speakers while sound image static control is executed. On the other hand, voices and environmental sounds related to the first frequency band 21 and the fourth frequency band 24 not to perceive the directional feeling are reproduced from the single speaker provided near a television set, etc. Consequently, since the plural speakers are never used when the voices of the first and fourth frequency bands 21 and 24 are to be reproduced, used, the blur of the sounds caused by plural sound sources never occurs, and the accuracy of the sound image static of the second and third frequency bands 22 and 23 to obtain the perception of the direction can be improved. Thus, both the attendant feeling and the directional feeling of the sounds can be improved in a communication conference.

Description

【発明の詳細な説明】 「産業上の利用分野」 この発明は映像を用いた通信会議(:おける音1象定位
再生システムにおいて、聴覚上方向知覚が生じる周波数
帯域に着目し、その着目周波数帯域の音声:;ついては
音像定位制御を行い、それ以外の周波数帯域の音声C二
ついては音像定位制御を行わず、その再生位置をモニタ
等の映像部に近接するよう(;配置する音像定位再生方
式に関するものである。
[Detailed Description of the Invention] "Industrial Application Field" This invention focuses on the frequency band in which auditory upward direction perception occurs in a sound one-image localization reproduction system in a communication conference using video. Regarding the sound image localization playback method, the sound image localization control is performed for the audio C in the other frequency bands, and the sound image localization control is not performed for the two audio C in the other frequency bands, and the playback position is placed close to the video part of the monitor etc. It is something.

「従来の技術」 第5図は従来のテレビ会議用音像定位再生システムの例
を示している。同図Aは聴取者11に対してモニタ12
の前面中央(ニスピー力13Cを設置する方法でモノラ
ル音で再生する。同図Bはモニタ12の前面両サイド(
;スピーカ13L、13Rを設置しステレオ音(2チヤ
ネル)を再生する。
"Prior Art" FIG. 5 shows an example of a conventional sound image localization reproduction system for video conferences. In the same figure, A shows the monitor 12 for the listener 11.
The front center of the monitor 12 (monaural sound is played by installing the Nispy Power 13C. Figure B shows the front center of the monitor 12 on both sides (
; Install speakers 13L and 13R to reproduce stereo sound (2 channels).

同図Cはモニタ12の前面、中央と両サイド(ニスピー
力13C,13L、13Rを設置し3チヤネルステレオ
音を再生する。同図りは4チヤネルステレオ音を再生す
る場合で、同図Cに対して聴取者11の後方(ニスピー
力13Bを設置する点が異なる。通信会議における音の
臨場感を向上させる観点からB−Dのシステム構成が有
効である。
Figure C shows the front, center, and both sides of the monitor 12 (Nispee power units 13C, 13L, and 13R are installed to reproduce 3-channel stereo sound. This figure shows the case where 4-channel stereo sound is reproduced. The difference is that the Nispee force 13B is installed behind the listener 11.The system configuration of B-D is effective from the viewpoint of improving the sense of presence of sound in a teleconference.

このうち同図B(2関しては例えばモニタ12(=映さ
れた3人の横ならびの人物14において聴取者11が両
サイドの人物の声に比して中央の人物の声をはっきりと
した方向感をもって知覚しにくい(音f象定位感が低い
)という問題点がある。この点に対処する方法として、
同図C,Dに示すよう(2両サイドのスピーカ13L、
13Rの間(;センタ用スピーカ13Cを設置し、通信
会議(:おける音の臨場感とともに方向感の再生向上を
図っている。
For example, regarding Figure B (2), the listener 11 heard the voice of the person in the center clearly compared to the voices of the people on both sides of the monitor 12 (= three people 14 displayed side by side). There is a problem that it is difficult to perceive with a sense of direction (low sense of sound localization).As a way to deal with this point,
As shown in C and D of the same figure (speakers 13L on both sides,
A center speaker 13C is installed between 13R and 13C to improve the sense of direction as well as the sense of presence of the sound during the communication conference.

左サイド、センタ、右サイドの3種のスピーカを用いる
従来のシステム(図C,D)では各スピーカともに全周
波数帯域の音声を対象にして音像定位を行う。この全周
波数帯域を対象にすることが問題となっている。即ち人
間の聴覚能力上、方向知覚が得られる周波数帯域は限ら
れており、その周波数帯域の音声に対して複数のスピー
カを用いて音像定位制御を行うことは、方向感の再生向
上を図る上で効果的である反面、方向知覚が得られない
周波数帯域については、複数のスピーカを用いると点音
源が複数個存在するため、再生音に「ぼけ」が生じ、こ
れによって方向知覚が得られる帯域の音像定位の精度を
低下させるという第1の問題がある。
In a conventional system using three types of speakers (left side, center, and right side) (Figures C and D), each speaker performs sound image localization for audio in all frequency bands. Targeting this entire frequency band has become a problem. In other words, due to human auditory ability, the frequency band in which direction perception can be obtained is limited, and performing sound localization control using multiple speakers for sounds in that frequency band is an effective way to improve the reproduction of direction sense. On the other hand, in the frequency band where direction perception cannot be obtained, when multiple speakers are used, there are multiple point sound sources, resulting in "blur" in the reproduced sound, which makes it difficult to obtain direction perception. The first problem is that the accuracy of sound image localization is reduced.

ところでテレビ会議では映し出された人物の口元にその
人物の声、即ち音像が引き寄せられるという相互作用が
知られている。方向知覚が得られない周波数帯域の音声
は本来感覚的には音像定位し難いものであるが、上記映
像(=よる相互作用を高めることによって見かけ上疑似
的に映像の人物の口元に音像を定位させる方法が考えら
れる。上記映(象による相互作用を高めるためには、テ
レビに映し出された人物の口元(:近い位置に一つの音
源を設置することがより効果的である。ところが従来の
システムでは前述したように方向知覚が得られない周波
数帯域の音声を複数個の音源から再生するため、上記相
互作用を効果的に活用できないという第2の問題がある
By the way, in a video conference, it is known that there is an interaction in which the voice of the person being projected, that is, the sound image, is attracted to the mouth of the person being projected. Sound in a frequency band where direction perception cannot be obtained is originally difficult to intuitively localize, but by increasing the interaction with the image above, it is possible to artificially localize the sound image to the mouth of the person in the image. In order to enhance the interaction with the elephant, it is more effective to place one sound source close to the mouth of the person shown on the TV.However, in the conventional system, Then, as mentioned above, since the sound in the frequency band in which direction perception cannot be obtained is reproduced from a plurality of sound sources, there is a second problem that the above-mentioned interaction cannot be effectively utilized.

この発明の目的は方向知覚が得られる周波数帯域の音声
については位相差や音圧レベル差を活用して音像な定位
させるとともに、方向知覚が得られない周波数帯域の音
声については、映像による相互作用を活用して疑似的(
2映像部に音像を定位し易くさせることをねらいとし、
上記従来の欠点を除去するため、聴覚上方向知覚が生じ
る周波数帯域とそれ以外の周波数帯域とに分割し、前者
の帯域の音声については複数スピーカを用いて音像定位
の制御を行い、後者の帯域の音声については音像定位の
制御を行わずモニタ等の映r象部に近接した単一のスピ
ーカを用いて再生する音像定位再生方式を提供すること
にある。
The purpose of this invention is to use phase differences and sound pressure level differences to localize sounds in frequency bands where direction perception can be obtained, and to localize sound images in frequency bands where direction perception cannot be obtained, and to localize sound images in frequency bands where direction perception cannot be obtained. Using pseudo (
The aim is to make it easier to localize the sound image in the 2-video section,
In order to eliminate the above-mentioned drawbacks of the conventional method, the frequency band is divided into a frequency band where upward auditory perception occurs and other frequency bands, and sound image localization is controlled using multiple speakers for audio in the former band, and sound image localization in the latter band is controlled using multiple speakers. The object of the present invention is to provide a sound image localization reproduction method in which sound is reproduced using a single speaker located close to an image area such as a monitor without controlling sound image localization.

「課題を解決するための手段」 この発明によれば音像定位生成の対象となる複数個の入
力チャネルの各オーディオ信号に対して周波数帯域が分
割され、その分割された第1の周波数帯域のオーディオ
信号を再生チャネルに出力する際に音圧レベルが調整さ
れ、上記分割された第2の周波数帯域のオーディオ信号
に対して音像定位を生成するため(−再生チャネル間に
位相差(時間差)と音圧レベル差が生じるように位相と
レベルとが調整され、上記分割された第3の周波数帯域
のオディオ信号に対して音像定位を生成するために再生
チャネル間に音圧レベル差が生じるよう≦;レベルが調
整され、入力チャネルと再生チャネルとの対応データと
音像定位の距離感、方向感等を規定する定位データ等が
設定され、聴覚上の方向知覚が得られる第2と$3の周
波数帯域に対して複数スピーカを用いて音1象定位が制
御され、方向知覚が得られない第1の周波数帯域に対し
ては音像定位を行わず単一のスピーカを用いて再生され
る。
"Means for Solving the Problem" According to the present invention, a frequency band is divided for each audio signal of a plurality of input channels to be generated for sound image localization, and the audio of the divided first frequency band is divided. The sound pressure level is adjusted when the signal is output to the playback channel, and the sound image localization is generated for the audio signal of the divided second frequency band (- phase difference (time difference) and sound The phase and level are adjusted so that a pressure level difference is generated, and a sound pressure level difference is generated between reproduction channels in order to generate sound image localization for the audio signal of the divided third frequency band. The level is adjusted, the correspondence data between the input channel and the playback channel, and the localization data that defines the sense of distance and sense of direction for sound image localization are set, and the second and third frequency bands are used to obtain an auditory sense of direction. For the first frequency band in which direction perception cannot be obtained, the sound image localization is controlled using a plurality of speakers, and the sound image localization is not performed and the sound is reproduced using a single speaker.

周波数分割は次のように行われる。すなわち第1図は周
波数帯域分割(2関するこの発明の特徴を示す例である
。人間の可聴範囲15は20 Hz〜20.0OOHz
の周波数帯域である。テレビ会議・音声会議等の通信会
議では人間の発声範囲16である1 00 Hz〜8.
 OOOHzの周波数帯域に関する音声の音像定位を如
何(:精度良く、かつ忠実性を高めて再生させるかが特
に重視されている。ところが音の受は手である聴取者1
1の方向知覚能力は人間の発声範囲16(2比してより
小さい周波数帯域であることが知られている。境、中山
著:聴覚と音響心理(日本音響学全編、1978 、コ
ロナ社、pp185−195)の文献によれば、人間の
聴覚上の方向知覚能力に関しては2種の知覚方法があり
、第1の方法は2つの音の間の位相差(時間差)に基づ
いて方向感を知覚するもので、この知覚範囲17は20
0 Hz 〜1.300 Hzと言われている。第2の
方法は2つの音の間の音圧レベル差に基づいて方向感を
知覚するもので、この知覚範囲18は200 Hz 〜
5. OOOHzと言われている。
Frequency division is performed as follows. That is, FIG. 1 is an example showing the features of this invention regarding frequency band division (2).The human audible range 15 is 20 Hz to 20.000 Hz.
frequency band. In communication conferences such as video conferences and audio conferences, the range of human speech ranges from 100 Hz to 8.
Particular emphasis is placed on how to localize the sound image of audio in the OOOHz frequency band (: how to reproduce it with high precision and high fidelity.However, listener 1, whose hands are used to receive the sound,
It is known that the direction perception ability of No. 1 is within the human vocal range of 16 (smaller frequency band than No. 2). Sakai, Nakayama: Hearing and Psychoacoustics (Japan Acoustics Complete Edition, 1978, Coronasha, pp185) According to the literature (195), there are two types of perceptual methods regarding the human auditory ability to perceive direction.The first method is to perceive a sense of direction based on the phase difference (time difference) between two sounds. This perception range 17 is 20
It is said to be between 0 Hz and 1.300 Hz. The second method is to perceive a sense of direction based on the sound pressure level difference between two sounds, and this perception range 18 is from 200 Hz to
5. It is said to be OOOOHz.

図中で示すように音像定位制御の観点から周波数帯域を
分割・整理すると人間の可聴範囲15は方向感を知覚し
ない音声と環境音とに関する低域の第1の周波数帯域2
1と、位相差(時間差)と音圧レベル差とを混在して方
向感を知覚する第2の周波数帯域22と、音圧レベル差
だけで方向感を知覚する第3の周波数帯域23と、方向
感を知覚しない音声と環境音とに関する高域の第4の周
波数帯域24とに大別される。
As shown in the figure, when frequency bands are divided and organized from the perspective of sound image localization control, the human audible range 15 is a low-frequency first frequency band 2 related to voices that do not perceive a sense of direction and environmental sounds.
1, a second frequency band 22 in which a sense of direction is perceived by mixing a phase difference (time difference) and a sound pressure level difference, and a third frequency band 23 in which a sense of direction is perceived only by a sound pressure level difference. It is broadly divided into a high-frequency fourth frequency band 24 related to sounds that do not give a sense of direction and environmental sounds.

この発明は聴覚上方向感を知覚する第2の周波数帯域2
2と第3の周波数帯域23とに関する音声を複数スピー
カを用いて音像定位制御を行いつつ再生し、他方方向感
を知覚しない第1の周波数帯域21と第4の周波数帯域
24C二関する音声・環境音をテレビ等に近接する単一
スピーカから再生し、映像による相互作用によって疑似
的に映像部に音像を定位し易くさせる。
This invention provides a second frequency band 2 for perceiving an auditory sense of direction.
2 and the third frequency band 23 are reproduced while performing sound image localization control using a plurality of speakers, and on the other hand, the sound and environment related to the first frequency band 21 and the fourth frequency band 24C are reproduced without perceiving a sense of direction. Sound is reproduced from a single speaker close to a television or the like, and the sound image is simulated to be easily localized in the video section through interaction with the video.

「実施例」 第2図はこの発明の特徴を示す再生システム例である。"Example" FIG. 2 is an example of a playback system showing the features of this invention.

モニタ12の周囲に右側、左側、上側、下側にそれぞれ
スピーカキャビネット25が配置されていて2チヤネル
ステレオ音を用いた音像定位再生システムの例を示して
いる。第1の周波数帯域21の音は下側スピーカ26か
ら、第4の周波数帯域24の音は上側スピーカ27から
単一スピーカで再生される。第2の周波数帯域22の音
は左右のスピーカ28Lとスピーカ28Rを用いて位相
差と音圧レベル差を調整して音像を定位させる。また第
3の周波数帯域23の音は左右のスピーカ29Lとスピ
ーカ29Rを用いて、音圧レベル差を調整して音像を定
位させる。
Speaker cabinets 25 are arranged around the monitor 12 on the right side, left side, upper side, and lower side, respectively, showing an example of a sound image localization reproduction system using two-channel stereo sound. The sound of the first frequency band 21 is reproduced from the lower speaker 26, and the sound of the fourth frequency band 24 is reproduced from the upper speaker 27 using a single speaker. The sound image of the second frequency band 22 is localized by adjusting the phase difference and the sound pressure level difference using the left and right speakers 28L and 28R. Further, the sound in the third frequency band 23 is localized using the left and right speakers 29L and 29R to adjust the sound pressure level difference.

第3図はこの発明の実施例の構成を示すブロック図であ
って、26は第1の周波数帯域用スピーカ、2 sL、
 28Rは第2の周波数帯域用スピーカ(28L:第1
再生チヤネル用、28R=第2再生チヤネルFF+ ’
、 N 29L + 29Rは第3の周波数帯域用スピ
ーカ(2g7.:第1再生チヤネル用、29R:第2再
生チヤネル用)、27は第4の周波数帯域用スピーカ、
70は入力端子、100は低域音圧レベル調整部、20
0は位相差・音圧レベル差混在音像定位調整部、300
は音圧レベル差音像定位調整部、400は高域音圧レベ
ル調整部、500は周波数帯域分割部、600は再生チ
ャネル用音像定位初期設定部、700は制御部である。
FIG. 3 is a block diagram showing the configuration of an embodiment of the present invention, in which 26 is a first frequency band speaker, 2 sL,
28R is the second frequency band speaker (28L: first
For playback channel, 28R = 2nd playback channel FF+'
, N 29L + 29R is the third frequency band speaker (2g7.: for the first reproduction channel, 29R: for the second reproduction channel), 27 is the fourth frequency band speaker,
70 is an input terminal, 100 is a low frequency sound pressure level adjustment section, 20
0 is a phase difference/sound pressure level difference mixed sound image localization adjustment unit, 300
400 is a high frequency sound pressure level adjustment section; 500 is a frequency band division section; 600 is a reproduction channel sound image localization initial setting section; 700 is a control section.

これを動作するC:は、制御部700の指令により、入
力端子70から転送されてくるm個の入力チャネルのオ
ーディオ信号に対して周波数帯域分割部500は再生チ
ャネル用音像定位初期設定部600から指定される第1
から第4までの周波数帯域21〜24の各帯域設定デー
タに基づいて各チャネルのオーディオ信号について帯域
分割し、第1の周波数帯域21のオーディオ信号を低域
音圧レベル調整部100へ、第2の周波数帯域22のオ
ーディオ信号を位相差・音圧レベル差混在音像定位調整
部200へ、第3の周波数帯域23のオーディオ信号を
音圧レベル差音像定位調整部300へ、第4の周波数帯
域24のオーディオ信号を高域音圧レベル調整部400
へ転送する。
In C:, which operates this, the frequency band dividing unit 500 selects the audio image localization initial setting unit 600 for the reproduction channel for the audio signals of m input channels transferred from the input terminal 70 according to a command from the control unit 700. The first specified
The audio signal of each channel is divided into bands based on the band setting data of the frequency bands 21 to 24 from 1 to 4, and the audio signal of the first frequency band 21 is sent to the low frequency sound pressure level adjustment section 100, The audio signal in the frequency band 22 is sent to the phase difference/sound pressure level difference mixed sound image localization adjustment section 200, the audio signal in the third frequency band 23 is sent to the sound pressure level difference sound image localization adjustment section 300, and the audio signal in the fourth frequency band 24 is sent to the sound image localization adjustment section 300. The high frequency sound pressure level adjustment section 400 adjusts the audio signal of
Transfer to.

低域音圧レベル調整部100は再生チャネル用音障定位
初期設定部600からの指定&:従ってmチャネルのオ
ーディオ信号の加算を行った加算オーディオ信号又はm
チャネルの中より選択抽出した選択オーディオ信号に対
して音圧レベルを調整し、第1の周波数帯域用スピーカ
26より再生させる。
The low-frequency sound pressure level adjustment unit 100 receives the specification from the reproduction channel sound disturbance localization initial setting unit 600 &: Therefore, the added audio signal obtained by adding the audio signals of m channels or m
The sound pressure level of the selected audio signal selected and extracted from the channels is adjusted, and the sound pressure level is adjusted and the audio signal is reproduced from the first frequency band speaker 26.

位相差・音圧レベル差混在音像定位調整部200は再生
チャネル用音像定位初期設定部600からの指定に従っ
て、m個のチャネルのオーディオ信号を加算又は選択し
てn個の再生チャネルに変換する際に、各再生チャネル
のオーディオ信号に対して位相、音圧レベルの調整を行
うとともにその再生チャネル間の位相差、音圧レベル差
を調整してn個の第2の周波数帯域用スピーカ28L、
28Rより再生させる。
When the phase difference/sound pressure level difference mixed sound image localization adjustment section 200 adds or selects the audio signals of m channels and converts them into n playback channels, according to the specification from the reproduction channel sound image localization initial setting section 600. The phase and sound pressure level of the audio signal of each reproduction channel are adjusted, and the phase difference and sound pressure level difference between the reproduction channels are adjusted to produce n second frequency band speakers 28L,
Play from 28R.

音圧レベル差音像定位調整部300は再生チャネル用音
像定位初期設定部600からの指定に従って、m個のチ
ャネルのオーディオ信号を加算又は選択してn個の再生
チャネルに変換する際に、各再生チャネルのオーディオ
信号に対して音圧レベルの調整を行うとともC二、その
再生チャネル間の音圧レベル差を調整してn個の第3の
周波数帯域用スピーカ29L 、 29Rより再生させ
る。
The sound pressure level difference sound image localization adjustment unit 300 adds or selects the audio signals of m channels and converts them into n playback channels according to the specifications from the playback channel sound image localization initial setting unit 600. The sound pressure level of the audio signal of the channel is adjusted, and the sound pressure level difference between the reproduction channels is adjusted, and the audio signal is reproduced from the n third frequency band speakers 29L and 29R.

高域音圧レベル調整部400は再生チャネル用音像定位
初期設定部600からの指定に従って、m個のチャネル
のオーディオ信号の加算を行った加算オーディオ信号又
はm個のチャネルの中より選択抽出した選択オーディオ
信号に対して音圧レベルを調整し、第4の周波数帯域用
スピーカ27より再生させる。
The high-frequency sound pressure level adjustment unit 400 generates a summed audio signal obtained by adding audio signals of m channels or a selection selected and extracted from m channels according to the specification from the reproduction channel sound image localization initial setting unit 600. The sound pressure level of the audio signal is adjusted, and the audio signal is reproduced from the fourth frequency band speaker 27.

再生チャネル用音隊定位初期設定部600は、前述した
ように分割帯域の具体的設定データ、第1又は第4の周
波数帯域に関する音圧レベル値、へカチャネルm個に対
する再生チャネルのnの値、第2又は第3の周波数帯域
に関する位相値、音圧レベル値等を記憶し、これらの値
を各ブロック部の初期設定に際して各ブロックへ転送す
る。
As described above, the playback channel sound corps localization initial setting unit 600 includes specific setting data of the divided bands, sound pressure level values regarding the first or fourth frequency band, the value of n of the playback channel for m Heka channels, Phase values, sound pressure level values, etc. regarding the second or third frequency band are stored, and these values are transferred to each block upon initial setting of each block section.

第4図はこの発明の多地点間テレビ会議システムの応用
例である。m個の地点から別々の人物映像信号と音声信
号が通信網60を介して、受信端末の地点80にm個の
信号として転送される。受信端末ではm人の人物映像を
テレビ画面に映し出すとともに音像定位制御を行う。こ
の制御【=際しては第1と第4の周波数帯域21.24
の音声は単一スビーカ26.27より再生され、第2と
第4の周波数帯域22.23の音声はチャネル毎にペア
にして複数のスピーカ(28LF 、29LF)、(2
8RF、29RF)、(28LB、29LB)、(28
RB、29RB)より再生される。
FIG. 4 shows an application example of the multipoint video conference system of the present invention. Separate person video signals and audio signals are transferred from m points via the communication network 60 to the receiving terminal point 80 as m signals. The receiving terminal displays images of m people on a television screen and performs sound image localization control. This control [= in some cases, the first and fourth frequency bands 21.24
The audio of the second and fourth frequency bands 22.23 is played back by a single speaker 26.27, and the audio of the second and fourth frequency bands 22.23 is played back by multiple speakers (28LF, 29LF), (2
8RF, 29RF), (28LB, 29LB), (28
RB, 29RB).

尚m=nの場合で、2チヤネル、3チヤネル、4チヤネ
ル等の既存のステレオ音を第1〜第4の周波数帯域に分
割して、位相、音圧レベルの調整無しでそれぞれのスピ
ーカから再生されるテレビ会議システム等もこの発明に
含まれる。
In the case of m=n, the existing stereo sound of 2 channels, 3 channels, 4 channels, etc. can be divided into the 1st to 4th frequency bands and played from each speaker without adjusting the phase or sound pressure level. This invention also includes a video conference system and the like.

「発明の効果」 以上説明したよう(−1この発明(;よる音像定位再生
方式(:よれば聴覚上方向知覚が生じる周波数帯域とそ
れ以外の周波数帯域とに分割し、前者の帯域の音声につ
いては複数スピーカを用いて音像定位の制御を行い、後
者の帯域の音声については音像定位の制御を行わずテレ
ビ等の映像部に近接した単一のスピーカを用いて再生す
ることから、方向知覚が得られない第1と第4の周波数
帯域の音声を再生する際、複数スピーカを用いないため
複数の音源による音の「ぼけ」が生じず、方向知覚が得
られる第2と第3の周波数帯域の音像定位の精度が向上
できるという利点がある。また方向知覚が得られない第
1と第4の周波数帯域の音声に関して単一スピーカで再
生しテレビに近接させるため、映像による相互作用によ
って見かけ上疑似的に映像部に音像が定位し易いという
利点がある。
"Effects of the Invention" As explained above (-1) This invention (;According to the sound image localization reproduction method (:), the sound image localization reproduction method (:) divides the frequency band into a frequency band in which upward auditory perception occurs and the other frequency bands, and the sound in the former band is The sound image localization is controlled using multiple speakers, and the sound in the latter band is played back using a single speaker located close to the video part of the TV or other device without controlling the sound image localization, which results in poor direction perception. When reproducing audio in the first and fourth frequency bands that cannot be obtained, multiple speakers are not used, so the "blurring" of the sound due to multiple sound sources does not occur, and the second and third frequency bands allow direction perception to be obtained. This has the advantage that the accuracy of sound image localization can be improved.Also, since the audio in the first and fourth frequency bands, where direction perception cannot be obtained, is played back by a single speaker and placed close to the TV, the interaction with the video makes the appearance appear different. This has the advantage that it is easy to localize the sound image in the video area in a pseudo manner.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は周波数帯域分割(:関するこの発明の特徴を示
す図、第2図はこの発明の特徴を示す再生システム例を
示すブロック図、第3図はこの発明の実施例の構成を示
すブロック図、第4因はこの発明の多地点間テレビ会議
システlの応用例を示す図、 第 5図は従来のテレビ会議用音像定位再生システムの各種
例を示す図である。
Fig. 1 is a diagram showing the features of this invention regarding frequency band division (: Fig. 2 is a block diagram showing an example of a reproduction system showing the features of this invention, and Fig. 3 is a block diagram showing the configuration of an embodiment of this invention. Figure 4 is a diagram showing an application example of the multipoint video conference system 1 of the present invention, and Figure 5 is a diagram showing various examples of a conventional sound image localization reproduction system for video conferences.

Claims (1)

【特許請求の範囲】[Claims] (1)音像定位生成の対象となる複数個の入力チャネル
の各オーディオ信号に対して周波数帯域を分割する手段
と、 その分割された第1の周波数帯域のオーディオ信号を再
生チャネルに出力する際に音圧レベルを調整する手段と
、 上記分割された第2の周波数帯域のオーディオ信号に対
して、音像定位を生成するために再生チャネル間に位相
差(時間差)と音圧レベル差が生じるように位相とレベ
ルとを調整する手段と、 上記分割された第3の周波数帯域のオーディオ信号に対
して音像定位を生成するために再生チャネル間に音圧レ
ベル差が生じるようにレベルを調整する手段と、 上記入力チャネルと上記再生チャネルとの対応データと
音像定位の距離感、方向感等を規定する定位データ等を
設定する手段とを具備し、聴覚上の方向知覚が得られる
上記第2と第3の周波数帯域に対して複数スピーカを用
いて音像定位を制御し、方向知覚が得られない上記第1
の周波数帯域に対しては音像定位を行わず単一のスピー
カを用いて再生することを特徴とする音像定位再生方式
(1) A means for dividing a frequency band for each audio signal of a plurality of input channels to be subjected to sound image localization generation, and a means for outputting the divided first frequency band audio signal to a reproduction channel. means for adjusting the sound pressure level; and means for adjusting the audio signal of the divided second frequency band so that a phase difference (time difference) and a sound pressure level difference are generated between the reproduction channels in order to generate sound image localization. means for adjusting the phase and level; and means for adjusting the level so that a sound pressure level difference is generated between the reproduction channels in order to generate sound image localization for the audio signal of the divided third frequency band. , comprising means for setting correspondence data between the input channel and the reproduction channel and localization data that defines the sense of distance, sense of direction, etc. of sound image localization, and the second and second channels are provided with means for setting localization data, etc. that define the sense of distance, sense of direction, etc. of sound image localization, and the second and second The sound image localization is controlled using multiple speakers for the frequency band No. 3, and the above-mentioned No.
A sound image localization reproduction method characterized by using a single speaker for reproduction without performing sound image localization for the frequency band.
JP63210975A 1988-08-24 1988-08-24 Sound image localization reproduction method Expired - Lifetime JP2645731B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63210975A JP2645731B2 (en) 1988-08-24 1988-08-24 Sound image localization reproduction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63210975A JP2645731B2 (en) 1988-08-24 1988-08-24 Sound image localization reproduction method

Publications (2)

Publication Number Publication Date
JPH0259000A true JPH0259000A (en) 1990-02-28
JP2645731B2 JP2645731B2 (en) 1997-08-25

Family

ID=16598228

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63210975A Expired - Lifetime JP2645731B2 (en) 1988-08-24 1988-08-24 Sound image localization reproduction method

Country Status (1)

Country Link
JP (1) JP2645731B2 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004350173A (en) * 2003-05-26 2004-12-09 Nippon Hoso Kyokai <Nhk> Sound image reproducing device and three-dimensional sound image reproducing device
JP2008060725A (en) * 2006-08-29 2008-03-13 Nippon Telegr & Teleph Corp <Ntt> Sound image localization emphasizing reproduction method, apparatus thereof, program thereof, and storage medium thereof
JP2008522551A (en) * 2004-11-30 2008-06-26 アギア システムズ インコーポレーテッド Parametric coding of spatial audio using cues based on transmitted channels
WO2009060498A1 (en) * 2007-11-06 2009-05-14 Fujitsu Limited Inter-multipoint voice conversation apparatus
JP2009177827A (en) * 2005-09-07 2009-08-06 Polycom Inc Spatial correlation audio in multipoint video conferencing
WO2010071008A1 (en) 2008-12-16 2010-06-24 ソニー株式会社 Audio outputting apparatus, video/audio reproducing apparatus and audio outputting method
EP2282556A2 (en) 2009-07-30 2011-02-09 Sony Corporation Display device and audio output device
JPWO2009051132A1 (en) * 2007-10-19 2011-03-03 日本電気株式会社 Signal processing system, apparatus, method thereof and program thereof
JP2012238967A (en) * 2011-05-10 2012-12-06 Nippon Hoso Kyokai <Nhk> Sound image localization improving device
JP2013048317A (en) * 2011-08-29 2013-03-07 Nippon Hoso Kyokai <Nhk> Sound image localization device and program thereof
JP2013521725A (en) * 2010-03-23 2013-06-10 ドルビー ラボラトリーズ ライセンシング コーポレイション Technology to perceive sound localization
US10158958B2 (en) 2010-03-23 2018-12-18 Dolby Laboratories Licensing Corporation Techniques for localized perceptual audio
US10200804B2 (en) 2015-02-25 2019-02-05 Dolby Laboratories Licensing Corporation Video content assisted audio object extraction
JP2021507284A (en) * 2017-12-15 2021-02-22 ブームクラウド 360 インコーポレイテッド Subband spatial processing and crosstalk cancellation system for conferences

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3314730B2 (en) 1998-08-05 2002-08-12 ヤマハ株式会社 Audio playback device and communication conference device

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004350173A (en) * 2003-05-26 2004-12-09 Nippon Hoso Kyokai <Nhk> Sound image reproducing device and three-dimensional sound image reproducing device
JP2008522551A (en) * 2004-11-30 2008-06-26 アギア システムズ インコーポレーテッド Parametric coding of spatial audio using cues based on transmitted channels
JP4856653B2 (en) * 2004-11-30 2012-01-18 アギア システムズ インコーポレーテッド Parametric coding of spatial audio using cues based on transmitted channels
JP2009177827A (en) * 2005-09-07 2009-08-06 Polycom Inc Spatial correlation audio in multipoint video conferencing
JP2012213214A (en) * 2005-09-07 2012-11-01 Polycom Inc Spatially correlated audio in multipoint videoconference
JP2008060725A (en) * 2006-08-29 2008-03-13 Nippon Telegr & Teleph Corp <Ntt> Sound image localization emphasizing reproduction method, apparatus thereof, program thereof, and storage medium thereof
US8892432B2 (en) 2007-10-19 2014-11-18 Nec Corporation Signal processing system, apparatus and method used on the system, and program thereof
JPWO2009051132A1 (en) * 2007-10-19 2011-03-03 日本電気株式会社 Signal processing system, apparatus, method thereof and program thereof
WO2009060498A1 (en) * 2007-11-06 2009-05-14 Fujitsu Limited Inter-multipoint voice conversation apparatus
US8264981B2 (en) 2007-11-06 2012-09-11 Fujitsu Limited Inter-multipoint voice conversation apparatus
WO2010071008A1 (en) 2008-12-16 2010-06-24 ソニー株式会社 Audio outputting apparatus, video/audio reproducing apparatus and audio outputting method
US8630428B2 (en) 2009-07-30 2014-01-14 Thomson Licensing Display device and audio output device
EP2282556A2 (en) 2009-07-30 2011-02-09 Sony Corporation Display device and audio output device
RU2559748C2 (en) * 2009-07-30 2015-08-10 Сони Корпорейшн Display and audio output device
US10158958B2 (en) 2010-03-23 2018-12-18 Dolby Laboratories Licensing Corporation Techniques for localized perceptual audio
US10939219B2 (en) 2010-03-23 2021-03-02 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for audio reproduction
US12273695B2 (en) 2010-03-23 2025-04-08 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for audio reproduction
JP2013521725A (en) * 2010-03-23 2013-06-10 ドルビー ラボラトリーズ ライセンシング コーポレイション Technology to perceive sound localization
US9172901B2 (en) 2010-03-23 2015-10-27 Dolby Laboratories Licensing Corporation Techniques for localized perceptual audio
US9544527B2 (en) 2010-03-23 2017-01-10 Dolby Laboratories Licensing Corporation Techniques for localized perceptual audio
US11350231B2 (en) 2010-03-23 2022-05-31 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for audio reproduction
US8755543B2 (en) 2010-03-23 2014-06-17 Dolby Laboratories Licensing Corporation Techniques for localized perceptual audio
US10499175B2 (en) 2010-03-23 2019-12-03 Dolby Laboratories Licensing Corporation Methods, apparatus and systems for audio reproduction
JP2012238967A (en) * 2011-05-10 2012-12-06 Nippon Hoso Kyokai <Nhk> Sound image localization improving device
JP2013048317A (en) * 2011-08-29 2013-03-07 Nippon Hoso Kyokai <Nhk> Sound image localization device and program thereof
US10200804B2 (en) 2015-02-25 2019-02-05 Dolby Laboratories Licensing Corporation Video content assisted audio object extraction
JP2021507284A (en) * 2017-12-15 2021-02-22 ブームクラウド 360 インコーポレイテッド Subband spatial processing and crosstalk cancellation system for conferences
US11252508B2 (en) 2017-12-15 2022-02-15 Boomcloud 360 Inc. Subband spatial processing and crosstalk cancellation system for conferencing
US11736863B2 (en) 2017-12-15 2023-08-22 Boomcloud 360, Inc. Subband spatial processing and crosstalk cancellation system for conferencing
US12309561B2 (en) 2017-12-15 2025-05-20 Boomcloud 360, Inc. Subband spatial processing and crosstalk processing system for conferencing

Also Published As

Publication number Publication date
JP2645731B2 (en) 1997-08-25

Similar Documents

Publication Publication Date Title
US7200236B1 (en) Multi-channel audio enhancement system for use in recording playback and methods for providing same
US8509454B2 (en) Focusing on a portion of an audio scene for an audio signal
US7382885B1 (en) Multi-channel audio reproduction apparatus and method for loudspeaker sound reproduction using position adjustable virtual sound images
KR100626233B1 (en) Equalisation of the output in a stereo widening network
US7158642B2 (en) Method and apparatus for producing a phantom three-dimensional sound space with recorded sound
US5530760A (en) Apparatus and method for adjusting levels between channels of a sound system
US5546465A (en) Audio playback apparatus and method
JP3657120B2 (en) Processing method for localizing audio signals for left and right ear audio signals
CN101356573A (en) Control over decoding of binaural audio signals
JPH0259000A (en) Sound image static reproducing system
US20040013271A1 (en) Method and system for recording and reproduction of binaural sound
JP5363567B2 (en) Sound playback device
Begault Auditory and non-auditory factors that potentially influence virtual acoustic imagery
JPH10336798A (en) Sound field correction circuit
US12395806B2 (en) Object-based audio spatializer
US7599498B2 (en) Apparatus and method for producing 3D sound
US20190246230A1 (en) Virtual localization of sound
JP2523367B2 (en) Audio playback method
JP2005286828A (en) Audio reproducing apparatus
KR101526014B1 (en) Multi-channel surround speaker system
JPH1118199A (en) Acoustic processor
JP2003164000A (en) Speaker device
Toole Perspectives on Sound Production, Recording and Reproduction
Matthews Simulation and testing of a multichannel system for 3D sound localization
CN112255791A (en) VR glasses and method for watching VR video

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090509

Year of fee payment: 12

EXPY Cancellation because of completion of term
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20090509

Year of fee payment: 12