[go: up one dir, main page]

CN103037301A - Convenient adjustment method for restoring range information of acoustic images - Google Patents

Convenient adjustment method for restoring range information of acoustic images Download PDF

Info

Publication number
CN103037301A
CN103037301A CN2012105547152A CN201210554715A CN103037301A CN 103037301 A CN103037301 A CN 103037301A CN 2012105547152 A CN2012105547152 A CN 2012105547152A CN 201210554715 A CN201210554715 A CN 201210554715A CN 103037301 A CN103037301 A CN 103037301A
Authority
CN
China
Prior art keywords
speaker system
channel
channel speaker
distance
initial
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012105547152A
Other languages
Chinese (zh)
Other versions
CN103037301B (en
Inventor
胡瑞敏
王松
曾敏
涂卫平
王晓晨
杨玉红
张茂胜
杨乘
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Booslink Suzhou Information Technology Co ltd
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN201210554715.2A priority Critical patent/CN103037301B/en
Publication of CN103037301A publication Critical patent/CN103037301A/en
Application granted granted Critical
Publication of CN103037301B publication Critical patent/CN103037301B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Stereophonic System (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

一种声像距离信息恢复便携调整方法,将m声道扬声器系统下混到初始n声道扬声器系统(m>nm≥3);将从m声道扬声器系统信号中获取的相关参数无损传送至初始n声道扬声器系统,进行初始n声道扬声器系统中相关参数测量;建立将初始n声道扬声器系统变化到新的n声道扬声器系统的模型,保证新的n声道扬声器系统的重建声像与听音点之间距离和m声道扬声器系统的原始声源与听音点之间的距离不变;以调整声道数最小为准则选取目标函数;求解模型;根据模型的解调整扬声器信号。本发明保证新的n声道系统中重建声像与听音点之间的距离与m声道扬声器系统的原始声源与听音点之间的距离一致,并且可以方便快捷地配置扬声器信号。

A portable adjustment method for audio image distance information recovery, downmixing the m -channel speaker system to the original n -channel speaker system ( m > n , m ≥ 3); lossless related parameters obtained from the m -channel speaker system signal Transfer to the initial n- channel speaker system to measure relevant parameters in the initial n -channel speaker system; establish a model for changing the initial n- channel speaker system to a new n -channel speaker system to ensure the accuracy of the new n -channel speaker system The distance between the reconstructed sound image and the listening point and the distance between the original sound source and the listening point of the m -channel loudspeaker system remain unchanged; the objective function is selected based on the minimum number of adjustment channels; the model is solved; according to the solution of the model Adjust the speaker signal. The invention ensures that the distance between the reconstructed sound image and the listening point in the new n -channel system is consistent with the distance between the original sound source and the listening point of the m- channel loudspeaker system, and the loudspeaker signal can be configured conveniently and quickly.

Description

Portable adjustment method for restoring acoustic image distance information
Technical Field
The invention belongs to the field of acoustics, and particularly relates to a portable adjustment method for restoring acoustic image distance information.
Background
5.1 multichannel systems were once a very popular home cinema sound system. However, with the development of 3D video technology, higher requirements are put on audio technology, and now multichannel audio research is focused on more advanced systems with more channels, which can provide people with better immersion. For example, the 22.2 multichannel system of japan broadcasting association laboratories has been used for ultra high definition television distribution. This advanced multi-channel system requires that the speakers be placed according to their own unique speaker placement method to produce the best sound results. Although 24 speakers can be placed in the theater in an optimal way, placement is cumbersome for home use. "downmix" is a good way of reducing the loudspeaker channels in a multi-channel system. Downmix from 5.1 to two-channel stereo or mono has been standardized in ITU-R Recommendation and is used for some television receivers. Although this downmixing method is very efficient, it is not applicable to any number of speaker configurations. In order to make the down-mixing between multiple systems feasible, a new sound field reconstruction or transformation technique is urgently needed. AkioAndo of the Japan broadcast Association laboratory in 2011 proposes a new downmix method, which utilizes a transformation matrix to transform loudspeaker signals of an original sound field into loudspeaker signals of a reconstructed sound field and ensures that the physical properties of sound of the original sound field and the reconstructed sound field at a listening point are consistent. The solution to this problem is frequency dependent, indicating that this transformation does not change the timbre. The physical property utilized by the user is unchanged, namely the sound pressure and the particle velocity direction of the sound at the listening point are unchanged before and after the change. Akio Ando also uses his method to reduce the 22.2 multi-channel system to 10.2 and 8.2 multi-channel systems. After the solution is solved by using the Akio Ando, people can find that the energy of the loudspeaker signal in the original sound field is not equal to the energy of the loudspeaker signal in the reconstructed sound field and contradicts with the law of energy conservation, which indicates that the distance information is damaged in the downmixing method of the Akio Ando. Therefore, the first problem to be solved at present is how to keep the physical properties of the sound at the receiving point unchanged and keep the distance information of the sound source in the original sound field from being damaged in the process of multichannel downmix.
Disclosure of Invention
The invention provides a portable adjustment method for restoring sound image distance information in a sound reproduction stage, aiming at the defects of the prior art.
The invention provides a portable adjustment method for restoring sound image distance information, which is characterized by comprising the following steps:
step 1, downmixing an m-channel loudspeaker system to obtain an n-channel loudspeaker system, recording the n-channel loudspeaker system obtained by downmixing as an initial n-channel loudspeaker system, wherein m is greater than n, and m is greater than or equal to 3;
step 2, obtaining and measuring relevant parameters, including obtaining the distance d from the original sound source of the m-channel loudspeaker system to the listening point0Obtaining the sound pressure P of the listening point from the signal of the m-channel loudspeaker system0And sound pressure P of left and right earsL0、PR0And the particle velocity V at the listening point0And transmitted to the initial n-channel speaker system, where d0、PL0、PR0、V0All transmission is lossless; measuring the distance d of each loudspeaker from the listening point in the initial n-channel loudspeaker systemjMeasuring the distance d of each speaker from the left ear point in the initial n-channel speaker systemLjMeasuring the initial n channel speakerDistance d of each loudspeaker in the system from the right ear pointRjJ =1,2, … n; measuring the radius h of the human head;
step 3, converting the initial n-channel loudspeaker system signal into a new n-channel loudspeaker system signal to obtain an equation as follows;
P 0 = 100 π Σ j = 1 n ω j W j d j 2 P L 0 = 100 π Σ j = 1 n ω j W j d Lj 2 P R 0 = 100 π Σ j = 1 n ω j W j d Rj 2 2 d 2 = d L 2 + d R 2 - 2 h 2 V 0 = G Σ j = 1 n e - ik | 0 → - ρ j | | 0 → - ρ j | ω j S j ( ω ) d 0 = d
Figure BDA00002616080900022
representing the position vector of the listening point, and the polar coordinates are (0, 0, 0);
ρjrepresenting an initial n-channel loudspeaker systemThe distance between the position of the jth loudspeaker and the origin;
d represents the distance between the sound image generated by all the loudspeakers in the new n-channel loudspeaker system and the listening point;
d0representing the distance between the original sound source of the m-channel loudspeaker system and a listening point;
Sj(ω) is the signal of the jth speaker in the initial n-channel speaker system;
π is the circumference ratio;
the coefficient G is an acoustic constant of the audio signal propagating in air;
V0representing the particle velocity at the listening point in an m-channel loudspeaker system;
dLrepresenting the distance of the sound image generated by all the loudspeakers in the new n-channel loudspeaker system to the left ear;
dRrepresenting the distance of the sound image produced by all the loudspeakers in the new n-channel loudspeaker system to the right ear;
djrepresenting the distance of the jth loudspeaker in the initial n-channel loudspeaker system from the listening point;
dLjrepresenting the distance of the jth speaker to the left ear in the initial n-channel speaker system;
dRjrepresenting the distance from the jth speaker to the right ear in the initial n-channel speaker system;
Wjrepresenting the acoustic power of the jth loudspeaker in the initial n-channel loudspeaker system at the sound production point;
ωjrepresenting a weighting factor adjusted for a jth speaker signal in the initial n-channel speaker system;
P0representing sound pressure at a listening point in an m-channel speaker system;
PL0representing the sound pressure at the left ear point in an m-channel speaker system;
PR0representing the sound pressure at the right ear point in the m-channel loudspeaker system;
step 4, setting a target function, adding the following mapping function formula as a constraint condition into the equation obtained in the step 3 to obtain an optimized model M
f ( ω j ) = 1 , ( ω j ) ≠ 1 0 , ω j = 1
The expression of the objective function is M = min Σ i = 1 n f ( ω j ) ;
Step 5, solving the minimum value of the optimization model MObtaining the weight factor omega of each loudspeaker signal adjustmentj,j=1,2,…n;
Step 6, obtaining the weight factor omega according to the solution in the step 5jJ =1,2, … n, and the signals of the speakers in the initial n-channel speaker system are adjusted to obtain a new n-channel speaker system.
By adopting the loudspeaker placement structure and the signal distribution method, the sound pressure of the sound at the positions of the left and right ears L, R is ensured to be unchanged and the particle speed at the listening positions is ensured to be unchanged at the listening positions in the reconstructed sound field under the condition of adopting the same number of loudspeakers, and meanwhile, the distance information between the sound and the listening positions can be recovered, and the signals in the minimum number of sound channels are adjusted, so that the sound field can be reconstructed better and more conveniently.
Drawings
FIG. 1 is an overall framework flow diagram of an embodiment of the invention
Fig. 2 is a diagram of the placement of speakers in a 22.2 multi-channel system according to an embodiment of the present invention.
Fig. 3 is a diagram of a 10.2 multi-channel system speaker selection method according to an embodiment of the invention.
Fig. 4 is a diagram of the placement positions of speakers of a 10.2 multi-channel system according to an embodiment of the present invention.
Fig. 5 is a schematic diagram of a 10.2 multi-channel system speaker of an embodiment of the present invention generating sound pressures at the left and right ears and sound images at the left and right ears.
Fig. 6 is a diagram showing the relationship between the sound image-to-listening point distance, the sound image-to-left and right ear distances, and the radius of the human head in a 10.2 multi-channel system according to an embodiment of the present invention.
Detailed Description
The technical scheme of the invention is explained in detail in the following by combining the drawings and the embodiment.
As shown in fig. 2 and 4, the embodiment takes m =22 and n =10, and the specific example of the embodiment is to use a 22.2 channel system to convert to a 10.2 channel system to describe the content of the invention. 22 loudspeakers in the 22.2 sound channel system are placed on the same spherical surface, 10 loudspeakers in the 10.2 sound channel system are placed on the same spherical surface, and the radius of the external spherical surface is 2 meters. In this embodiment, the 2 sound channels are two low-frequency sound effect sound channels, signals of the two low-frequency sound effect sound channels are not processed, and positions of the two low-frequency sound effect sound channels are not changed. The overall framework flow diagram of this embodiment is shown in fig. 1.
The embodiment adopts the sound image distance information recovery portable adjusting method, which comprises the following steps:
step 1, downmixing an m-channel loudspeaker system to obtain an n-channel loudspeaker system, recording the n-channel loudspeaker system obtained by downmixing as an initial n-channel loudspeaker system, wherein m is greater than n, and m is greater than or equal to 3.
The embodiment downmixes the 22.2 channel speaker system to the original 10.2 channel speaker system (denoted as the original 10.2 channel speaker system). The method adopted in the process is to keep the physical properties (sound pressure size and particle velocity direction) of the sound of the original sound field and the reconstructed sound field at the listening point (namely the center point of the human head) unchanged.
The implementation of step 1 comprises the following sub-steps,
step 1.1, a three-dimensional rectangular coordinate system XYZ is established, the origin is O, all the coordinates related to the invention are polar coordinates and are generally marked as P (sigma, theta, phi), wherein sigma, theta and phi respectively refer to the distance between the P point and the origin O, the included angle between the projection of the connecting line of the P point and the origin O on the X axis and the X axis, and the included angle between the connecting line of the P point and the origin O and the XOY plane, and the coordinates mentioned herein are explained with reference to the description. The 22 loudspeakers in the 22.2-channel loudspeaker system are arranged on the same spherical surface, see fig. 2, the specific position of each loudspeaker is fixed, and the coordinate xi of each loudspeaker is recordedj=(ρjjj) (j =1,2, … 22), and the listening point positions in this embodiment are all the sameAt the origin O;
as shown in fig. 2, in the embodiment, the center coordinates of the sphere are three-dimensional origin coordinates (0, 0, 0) and also coordinates at a listening point, and the coordinates of the 22 speakers are (2, 0 °,0 °), (2, 30 °,0 °), (2, 60 °,0 °), (2, 90 °,0 °), (2, 120 °,0 °), (2, 150 °,0 °), (2, 180 °,0 °), (2, 225 °,0 °), (2, 270 °,0 °), (2, 315 °,0 °), (2, 0 °, 48 °), (2, 45 °, 48 °), (2, 90 °, 48 °), (2, 135 °, 48 °), (2, 180 °, 48 °), (2, 225 °, 48 °), (2, 270 °, 48 °), (2, 0 °, 90 °), (2, 45 °, 30 °), (2, 90 °, -30 °), (2 °,135 °, -30 °) and
step 1.2, selecting the placing positions of 10 loudspeakers in an initial 10.2 loudspeaker sound channel system. The selection requirements are as follows: (1) selecting 1 speaker at a time in the 22.2 channel speaker system, the ray from the origin O to the direction of the point where each speaker of the 22.2 channel speaker system is located must be contained inside or on the side of the spherical triangle formed by the three speakers in the initial 10.2 channel speaker system, (2) while ensuring that the area of the spherical triangle formed by the three selected speakers in the initial 10.2 channel speaker system is the smallest of all alternatives. The positions of the 10 speakers in the initial 10.2 channel speaker system may be determined based on the positions of the 22 speakers in the 22.2 channel speaker system in accordance with the two-point requirements described above. After the position selection of the 10 speakers of the initial 10.2 channel speaker system is completed, the coordinates of the 10 speakers are recorded and recorded as
Figure BDA00002616080900051
(j=1,2,…10)。
The method of selecting speakers of the initial 10.2 channel speaker system in this embodiment is shown in fig. 4, and the coordinates of the 10 speakers in the initial 10.2 channel system are recorded as (2, 0 °,0 °), (2, 60 °,0 °), (2, 120 °,0 °), (2, 180 °,0 °), (2, 270 °,0 °), (2, 0 °, 48 °), (2, 90 °, 48 °), (2, 180 °, 48 °), (2, 0 °, 90 °), (2, 90 °, and-30 °), respectively.
Step 1.3, the signals distributed by 10 loudspeakers in the initial 10.2-channel loudspeaker system are solved. First, the signal distribution coefficients for the three loudspeakers in the initial 10.2 channel system are found. In the actual operation process, the basis is to ensure that the sound pressure and the particle velocity direction of the sound generated by one loudspeaker in the 22.2-channel loudspeaker system at the listening point and the corresponding three loudspeakers in the initial 10.2-channel loudspeaker system at the listening point are unchanged, and one loudspeaker in the 22.2-channel system is placed at the zeta position, wherein the zeta position is set1、ζ2、ζ3The three loudspeakers with the smallest area in the initial 10.2 channel loudspeaker system containing the zeta point are respectively arranged, see fig. 3, the spherical radius is r, and the signal of the virtual loudspeaker at the zeta point is distributed to the zeta point1、ζ2、ζ3The distribution factors of the loudspeakers are respectively omega1、ω2、ω3
The formula is as follows:
Figure BDA00002616080900052
wherein i is an imaginary unit, and e represents a mathematical constant, also called euler number;
ρ1、ρ2、ρ3and rho is zeta1、ζ2、ζ3Distance at ζ and origin O;
k is the number of waves,
Figure BDA00002616080900061
f is the frequency of the sound, c is the speed of sound propagation in air;
θ1、θ2、θ3and theta is zeta1、ζ2、ζ3The line connecting zeta with the origin OThe included angle between the projection on the XOZ plane and the X axis;
Figure BDA00002616080900062
are each ζ1、ζ2、ζ3Angle between the line connecting ζ and origin O and XOY plane.
According to the analogy of the method, the distribution coefficient of each loudspeaker in the 22.2 sound channel loudspeaker system to 3 corresponding loudspeakers in the initial 10.2 sound channel loudspeaker system is obtained, then the signals repeatedly distributed by each loudspeaker in the 10 loudspeakers in the initial 10.2 sound channel loudspeaker system are superposed, and the signals of each loudspeaker in the initial 10.2 sound channel loudspeaker system are obtained, and 2 low-frequency sound effect loudspeaker signals are unchanged.
The distribution coefficients and the speaker signal calculations in this embodiment can be calculated by substituting the associated coordinates into the formula according to formula (1) and the above description. The present embodiment only describes that one method is selected for implementation of downmixing an m-channel speaker system to an initial n-channel speaker system, and the method for implementing the downmixing is not limited to the specific description of the embodiment.
And 2, acquiring and measuring the related parameters.
Keeping the positions of the loudspeakers unchanged, and adjusting the signals of the loudspeakers in the initial 10.2-channel loudspeaker system to obtain a new 10.2-channel loudspeaker system.
In the step, the distance between the original sound source and the listening point can be acquired from a signal acquisition sound field of the 22.2-channel loudspeaker system, and the distance between the original sound source and the listening point of the 22.2-channel loudspeaker system can also be acquired in other modes; obtaining sound pressure P of listening point from 22.2 sound channel loudspeaker system signal0And sound pressure P of left and right earsL0、PR0And the particle velocity V at the listening point0As a signal to an initial 10.2 channel loudspeaker system, where d0,PL0,PR0,V0All without loss of transmission, whichIn d0Can be measured or obtained by other means, PL0,PR0,V0May be calculated or measured. Measuring the distance d of each loudspeaker in an initial 10.2 channel loudspeaker system from the listening pointj(j =1,2, … 10), the distance d of each speaker from the left ear point in the initial 10.2 channel speaker system was measuredLj(j =1,2, … 10), the distance d of each speaker from the right ear point in the initial 10.2 channel speaker system was measuredRj(j =1,2, … 10), the radius of the head h is measured.
And 3, converting the initial n-channel loudspeaker system signal into a new n-channel loudspeaker system signal to obtain an equation.
In this embodiment the original 10.2 channel loudspeaker system signals are transformed to new 10.2 channel loudspeaker system signals. In the transformation process, the positions of the speakers of the new 10.2 channel speaker system and the old 10.2 channel speaker system are not changed, the sound pressures of the sound field of the 22.2 channel system and the sound of the reconstructed sound field at the listening point, the left ear and the right ear are kept unchanged, the particle velocity at the listening point is unchanged, and 2 low-frequency sound effect channels are not processed, as shown in fig. 4 and 5. 22.2 original sound source to listening point distance d of sound track loudspeaker system0And the reconstructed sound image in the new 10.2 channel loudspeaker system is at the same distance from the listening point.
Namely, the positions of the speakers of the new and old n-channel speaker systems are not changed in the transformation process, but the sound pressure of the sound field of the 22.2-channel speaker system and the sound of the reconstructed sound field and the particle speed at the listening point are not changed at three points of the listening point and L, R, and the distance d between the original sound source of the 22.2-channel speaker system and the listening point0And adjusting the loudspeaker signals on the premise of the same distance from the sound image reconstructed in the new n-channel system to the listening point.
The equation that can be derived from these requirements is:
P 0 = P ′ P L 0 = P L 0 ′ P R 0 = P R 0 ′ V 0 = V ′ d 0 = d - - - ( 2 )
wherein,
p' is the sound pressure generated by all the loudspeakers in the new 10.2 channel loudspeaker system at the listening point;
P′L0the sound pressure generated at the left ear for all speakers in the new 10.2 channel speaker system;
P′R0the sound pressure generated at the right ear for all speakers in the new 10.2 channel speaker system;
v' is the particle velocity produced at the listening point by all loudspeakers in the new 10.2 channel loudspeaker system;
d is the distance between the sound image generated by all the loudspeakers in the new 10.2-channel loudspeaker system and the listening point;
P0representing sound pressure at a listening point in a 22.2 channel speaker system;
PL0representing the sound pressure at the left ear point in a 22.2 channel speaker system;
PR0representing the sound pressure at the right ear point in a 22.2 channel loudspeaker system;
d0representing the distance between the original sound source of a 22.2 channel loudspeaker system and a listening point;
V0representing the particle velocity at the listening point in a 22.2 channel speaker system.
One loudspeaker provides a single point source whose sound pressure in the free field varies with the distance of the point from the source (assuming the environment is a free field, i.e. there is no reflection of sound):
P j = 100 π W j d j 2
(3)
wherein,
Pjthe sound pressure of a measuring point;
djthe distance between a point sound source and a measuring point is measured;
Wjis the sound power of the pointing sound source at the sound production point;
and pi means a circumferential ratio.
The variation of sound pressure of n point sound sources in a free field along with the distance between a measuring point and the sound source is as follows:
P j = 100 π Σ j = 1 n W j d j 2 - - - ( 4 )
wherein WjRefers to the sound power of the j-th point sound source at the sound production point, j =1,2, … n. As in fig. 5, ω123Is the signal distribution coefficient, W, of the three loudspeakers1,W2,W3Refers to the acoustic power of the three speakers.
n coordinates are
Figure BDA00002616080900082
j is 1,2, … n, inListening point
Figure BDA00002616080900083
The particle velocity is:
V = G Σ j = 1 n e - ik | r → - ζ j | | r → - ζ j | S j ( ω ) - - - ( 5 )
wherein,
Sj(ω) is the signal in the loudspeaker;
i is an imaginary unit;
k is the number of waves,
Figure BDA00002616080900085
f is the frequency;
c is the speed of sound propagation in air;
the coefficient G is an acoustic constant of the audio signal propagating in the air.
As shown in fig. 6, due to the particularity of selecting two points L and R, the present implementation may also add a constraint condition to the model, i.e. the distance d between the sound image generated by all the speakers in the new 10.2 channel speaker system and the listening point, and the distance d between the sound image and the left and right earsL、dRThe relationship between the human head radius h is:
2 d 2 = d L 2 + d R 2 - 2 h 2 - - - ( 6 )
by collating the above equations (2), (3), (4), (5) and (6) and simplifying, the following equations can be obtained:
P 0 = 100 π Σ j = 1 10 ω j W j d j 2 P L 0 = 100 π Σ j = 1 10 ω j W j d Lj 2 P R 0 = 100 π Σ j = 1 10 ω j W j d Rj 2 2 d 2 = d L 2 + d R 2 - 2 h 2 V 0 = G Σ j = 1 10 e - ik | 0 → - ρ j | | 0 → - ρ j | ω j S j ( ω ) d 0 = d - - - ( 7 )
Figure BDA00002616080900093
representing the position vector of the listening point, and the polar coordinates are (0, 0, 0);
ρjrepresenting the distance between the position of the jth loudspeaker in the initial 10.2-channel loudspeaker system and the origin;
d represents the distance of the sound image produced by all the loudspeakers in the new 10.2 channel loudspeaker system from the listening point;
dLrepresenting the distance of the sound image produced by all the loudspeakers in the new 10.2 channel loudspeaker system to the left ear;
dRshow newDistance of sound image generated by all speakers in the 10.2 channel speaker system to the right ear;
djrepresents the distance from the listening point of the jth loudspeaker in the initial 10.2-channel loudspeaker system, j being 1,2,3 … 10;
dLjrepresents the distance from the jth speaker to the left ear in the initial 10.2 channel speaker system, j =1,2,3 … 10;
dRjrepresents the distance from the jth loudspeaker to the right ear in the initial 10.2-channel loudspeaker system, j being 1,2,3 … 10;
Wjj =1,2,3 … 10, which represents the sound power of the jth speaker in the initial 10.2-channel speaker system at the sound generating point, and can be calculated by the signal input to the jth speaker in the initial 10.2-channel speaker system and the inherent power of the speaker;
ωja signal distribution coefficient representing the jth speaker, i.e. a weight factor adjusted to each speaker signal in the initial 10.2-channel speaker system, j =1,2,3 … 10;
P0representing sound pressure at a listening point in a 22.2 channel speaker system;
PL0representing the sound pressure at the left ear point in a 22.2 channel speaker system;
PR0representing the sound pressure at the right ear point in a 22.2 channel loudspeaker system;
d0representing the distance between the original sound source of a 22.2 channel loudspeaker system and a listening point;
Sj(ω) is the signal of the jth speaker in the initial 10.2 channel speaker system;
π is the circumference ratio;
the coefficient G is an acoustic constant of the audio signal propagating in air;
V0representing 22.2 channel speaker systemsThe velocity of the particles at the listening point in the system.
And 4, selecting a target function. Because the number of the channels adopted in the multi-channel system is more, the placement is troublesome, the signal is convenient to adjust, and the use is convenient, the objective function adopted by the invention is the minimum number of the channels needing to be adjusted. Introducing a weighting factor omega for the adjustmentjA mapping function of:
f ( ω j ) = 1 , ω j ≠ 1 0 , ω j = 1 - - - ( 8 )
the mapping function is added as a constraint to equation (7). The target function expression is:
Figure BDA00002616080900102
step 5, solving the minimum value of the optimization model M to obtain the weight factor omega of each loudspeaker signal adjustmentj. In this embodiment, the minimum value of the optimization model M after the objective function is added in step 4 is solved by using the optimization software Lingo, and the weight factor ω adjusted by each speaker signal is obtainedj,j=1,2,3…10。
And 6, adjusting the loudspeaker signal. In this embodiment, the weight factor ω obtained by solving in step 5 is usedj(j =1,2,3 … 10) adjusting the signals of the speakers in the original 10.2 channel system so that the distances between the reconstructed sound image and the listening point and the distances between the original sound source and the listening point of the 22.2 channel speaker system are equal, the channels needing to be adjusted are the least, and the aim is to adjust the multi-channel system as conveniently and quickly as possible besides ensuring that the sound pressures at the listening point, the L and the R point are constant and the particle speed at the center point is constant.
The specific embodiments described herein are merely illustrative of the spirit of the invention. Various modifications or additions may be made to the described embodiments or alternatives may be employed by those skilled in the art without departing from the spirit or ambit of the invention as defined in the appended claims.

Claims (1)

1.一种声像距离信息恢复便携调整方法,其特征在于,包括步骤:1. A portable adjustment method for audio-visual distance information recovery, characterized in that, comprising steps: 步骤1,将m声道扬声器系统下混得到n声道扬声器系统,将下混所得n声道扬声器系统记为初始n声道扬声器系统,m>n,且m≥3;Step 1, down-mixing the m-channel speaker system to obtain an n-channel speaker system, and recording the down-mixed n-channel speaker system as the initial n-channel speaker system, m>n, and m≥3; 步骤2,相关参数的获取与测量,包括获取m声道扬声器系统的原始声源到听音点之间的距离d0、从m声道扬声器系统信号中获取听音点声压P0和左、右耳声压PL0、PR0和听音点处的粒子速度V0,并传送至初始n声道扬声器系统,其中d0、PL0、PR0、V0均无损传输;测量初始n声道扬声器系统中每个扬声器距离听音点的距离dj,测量初始n声道扬声器系统中每个扬声器距离左耳点的距离dLj,测量初始n声道扬声器系统中每个扬声器距离右耳点的距离dRj,j=1,2,…n;测量人头半径h;Step 2, acquisition and measurement of relevant parameters, including obtaining the distance d 0 from the original sound source of the m-channel speaker system to the listening point, obtaining the sound pressure P 0 of the listening point and the left , right ear sound pressure P L0 , P R0 and particle velocity V 0 at the listening point, and transmit to the initial n-channel speaker system, where d 0 , P L0 , P R0 , V 0 are all lossless transmission; measure the initial n The distance d j between each speaker in the channel speaker system and the listening point, measure the distance d Lj of each speaker in the initial n-channel speaker system from the left ear point, and measure the distance of each speaker in the initial n-channel speaker system to the right The distance d Rj of the ear point, j=1,2,...n; measure the radius h of the human head; 步骤3,将初始n声道扬声器系统信号变换到新的n声道扬声器系统信号,得到方程如下,Step 3, transform the initial n-channel speaker system signal into a new n-channel speaker system signal, and obtain the equation as follows, P 0 = 100 π Σ j = 1 n ω j W j d j 2 P L 0 = 100 π Σ j = 1 n ω j W j d Lj 2 P R 0 = 100 π Σ j = 1 n ω j W j d Rj 2 2 d 2 = d L 2 + d R 2 - 2 h 2 V 0 = G Σ j = 1 n e - ik | 0 → - ρ j | | 0 → - ρ j | ω j S j ( ω ) d 0 = d
Figure FDA00002616080800012
表示听音点所在位置向量,极坐标为(0,0,0);
P 0 = 100 π Σ j = 1 no ω j W j d j 2 P L 0 = 100 π Σ j = 1 no ω j W j d Lj 2 P R 0 = 100 π Σ j = 1 no ω j W j d Rj 2 2 d 2 = d L 2 + d R 2 - 2 h 2 V 0 = G Σ j = 1 no e - ik | 0 &Right Arrow; - ρ j | | 0 &Right Arrow; - ρ j | ω j S j ( ω ) d 0 = d
Figure FDA00002616080800012
Indicates the location vector of the listening point, and the polar coordinates are (0,0,0);
ρj表示初始n声道扬声器系统中第j个扬声器所在位置和原点之间的距离;ρ j represents the distance between the position of the jth speaker and the origin in the initial n-channel speaker system; d表示新的n声道扬声器系统中所有扬声器产生的声像与听音点处的距离;d represents the distance between the sound image produced by all the speakers in the new n-channel speaker system and the listening point; d0表示m声道扬声器系统的原始声源到听音点之间的距离;d 0 represents the distance between the original sound source of the m-channel speaker system and the listening point; Sj(ω)为初始n声道扬声器系统中第j个扬声器的信号;S j (ω) is the signal of the jth speaker in the initial n-channel speaker system; π是指圆周率;π refers to pi; 系数G是音频信号在空气中传播的声学常量;The coefficient G is the acoustic constant of the audio signal propagating in the air; V0表示m声道扬声器系统中听音点处粒子速度;V 0 represents the particle velocity at the listening point in the m-channel loudspeaker system; dL表示新的n声道扬声器系统中所有扬声器产生的声像到左耳的距离;d L represents the distance from the sound image produced by all the speakers in the new n-channel speaker system to the left ear; DR表示新的n声道扬声器系统中所有扬声器产生的声像到右耳的距离;D R represents the distance from the sound image produced by all the speakers in the new n-channel speaker system to the right ear; dj表示初始n声道扬声器系统中第j个扬声器与听音点处的距离;d j represents the distance between the jth speaker and the listening point in the initial n-channel speaker system; dLj表示初始n声道扬声器系统中第j个扬声器到左耳的距离;d Lj represents the distance from the jth speaker to the left ear in the initial n-channel speaker system; dRj表示初始n声道扬声器系统中第j个扬声器到右耳的距离;d Rj represents the distance from the jth speaker to the right ear in the initial n-channel speaker system; Wj表示初始n声道扬声器系统中第j个扬声器在发声点的声功率;W j represents the sound power of the jth speaker at the sound point in the initial n-channel speaker system; ωj表示对初始n声道扬声器系统中第j个扬声器信号调整的权重因子;ω j represents the weighting factor adjusted to the jth speaker signal in the initial n-channel speaker system; P0表示m声道扬声器系统中听音点处声压;P 0 represents the sound pressure at the listening point in the m-channel speaker system; PL0表示m声道扬声器系统中左耳点处声压;P L0 represents the sound pressure at the left ear point in the m-channel speaker system; PR0表示m声道扬声器系统中右耳点处声压;P R0 represents the sound pressure at the right ear point in the m-channel speaker system; 步骤4,设定目标函数,将以下映射函数式作为一个约束条件加入步骤3所得方程,得到最优化模型MStep 4, set the objective function, add the following mapping function as a constraint to the equation obtained in step 3, and obtain the optimal model M ff (( ωω jj )) == 11 ,, (( ωω jj )) ≠≠ 11 00 ,, ωω jj == 11 目标函数表达式为 M = min Σ i = 1 n f ( ω j ) ; The expression of the objective function is m = min Σ i = 1 no f ( ω j ) ; 步骤5,求解最优化模型M的最小值,获得各扬声器信号调整的权重因子ωj,j=1,2,…n;Step 5, solving the minimum value of the optimization model M, and obtaining the weight factor ω j for each loudspeaker signal adjustment, j=1,2,...n; 步骤6,根据步骤5中求解得到的权重因子ωj,j=1,2,…n,调整初始n声道扬声器系统中各扬声器的信号,得到新的n声道扬声器系统。Step 6. Adjust the signals of each speaker in the initial n-channel speaker system according to the weight factor ω j obtained in step 5, j=1, 2,...n, to obtain a new n-channel speaker system.
CN201210554715.2A 2012-12-19 2012-12-19 Convenient adjustment method for restoring range information of acoustic images Expired - Fee Related CN103037301B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210554715.2A CN103037301B (en) 2012-12-19 2012-12-19 Convenient adjustment method for restoring range information of acoustic images

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210554715.2A CN103037301B (en) 2012-12-19 2012-12-19 Convenient adjustment method for restoring range information of acoustic images

Publications (2)

Publication Number Publication Date
CN103037301A true CN103037301A (en) 2013-04-10
CN103037301B CN103037301B (en) 2014-11-05

Family

ID=48023702

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210554715.2A Expired - Fee Related CN103037301B (en) 2012-12-19 2012-12-19 Convenient adjustment method for restoring range information of acoustic images

Country Status (1)

Country Link
CN (1) CN103037301B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103826194A (en) * 2014-02-28 2014-05-28 武汉大学 Method and device for reconstructing sound source direction and distance in multi-channel system
CN106454685A (en) * 2016-11-25 2017-02-22 武汉大学 Sound field reconstruction method and system
CN108966114A (en) * 2018-07-13 2018-12-07 武汉轻工大学 Sound field rebuilding method, audio frequency apparatus, storage medium and device
CN109474882A (en) * 2018-12-04 2019-03-15 武汉轻工大学 Sound field reconstruction method, device, storage medium and device based on listening point tracking
CN111464932A (en) * 2020-04-07 2020-07-28 武汉轻工大学 Sound field reconstruction method, device, device and storage medium based on multiple listening points

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050220308A1 (en) * 2004-03-31 2005-10-06 Yamaha Corporation Apparatus for creating sound image of moving sound source
CN101208988A (en) * 2005-05-31 2008-06-25 大不列颠投资有限公司 Small audio reproduction system with large perceived acoustic scale and sound image
WO2008111362A1 (en) * 2007-03-15 2008-09-18 Oki Electric Industry Co., Ltd. Sound image localizing device, method, and program
CN101529930A (en) * 2006-10-19 2009-09-09 松下电器产业株式会社 Sound image localization apparatus, sound image localization system, sound image localization method, program, and integrated circuit
CN101784004A (en) * 2008-12-16 2010-07-21 索尼株式会社 Information processing system and information processing method
CN102802111A (en) * 2012-07-19 2012-11-28 新奥特(北京)视频技术有限公司 Method and system for outputting surround sound

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050220308A1 (en) * 2004-03-31 2005-10-06 Yamaha Corporation Apparatus for creating sound image of moving sound source
CN101208988A (en) * 2005-05-31 2008-06-25 大不列颠投资有限公司 Small audio reproduction system with large perceived acoustic scale and sound image
CN101529930A (en) * 2006-10-19 2009-09-09 松下电器产业株式会社 Sound image localization apparatus, sound image localization system, sound image localization method, program, and integrated circuit
WO2008111362A1 (en) * 2007-03-15 2008-09-18 Oki Electric Industry Co., Ltd. Sound image localizing device, method, and program
CN101784004A (en) * 2008-12-16 2010-07-21 索尼株式会社 Information processing system and information processing method
CN102802111A (en) * 2012-07-19 2012-11-28 新奥特(北京)视频技术有限公司 Method and system for outputting surround sound

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
SHIGEYOSHI AMANO;TAKESHI YAMADA; SHOJI MAKINO;NOBUHIKO KITAWAKI: "A Study on Sound Image Control Method for Operational Support of Touch Panel Display", 《2011年亚太信号与信息处理协会年会论文集》 *
YOSHINORI TAKAHASHI,AKIO ANDO: "Down-mixing of multi-channel audio for sound field reproduction on spatial covariance", 《APPLIED ACOUSTICS》 *
汤永清,方勇,黄青华: "控制声像距离的二维FIR滤波器设计", 《信号处理》 *
石蓓,谢菠荪: "环绕声重放中通路信号相关性与听觉空间印象", 《声学学报》 *
谢菠荪: "《头相关传输函数与虚拟听觉》", 31 January 2008, 北京:国防工业出版社 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103826194A (en) * 2014-02-28 2014-05-28 武汉大学 Method and device for reconstructing sound source direction and distance in multi-channel system
CN106454685A (en) * 2016-11-25 2017-02-22 武汉大学 Sound field reconstruction method and system
CN106454685B (en) * 2016-11-25 2018-03-27 武汉大学 A kind of sound field rebuilding method and system
CN108966114A (en) * 2018-07-13 2018-12-07 武汉轻工大学 Sound field rebuilding method, audio frequency apparatus, storage medium and device
CN109474882A (en) * 2018-12-04 2019-03-15 武汉轻工大学 Sound field reconstruction method, device, storage medium and device based on listening point tracking
CN111464932A (en) * 2020-04-07 2020-07-28 武汉轻工大学 Sound field reconstruction method, device, device and storage medium based on multiple listening points

Also Published As

Publication number Publication date
CN103037301B (en) 2014-11-05

Similar Documents

Publication Publication Date Title
CN110089134B (en) Method, system and computer readable medium for reproducing spatially distributed sound
JP7038725B2 (en) Audio signal processing method and equipment
JP4343845B2 (en) Audio data processing method and sound collector for realizing the method
US10142761B2 (en) Structural modeling of the head related impulse response
CN101884065B (en) Spatial audio analysis and synthesis for binaural reproduction and format conversion
CN104904240B (en) Device and method for generating multiple parametric audio streams and device and method for generating multiple loudspeaker signals
CN102812731B (en) Method and device for reproducing three-dimensional sound
CN102790931B (en) Distance sense synthetic method in three-dimensional sound field synthesis
CN106454685B (en) A kind of sound field rebuilding method and system
CN103888889B (en) A kind of multichannel conversion method based on spheric harmonic expansion
US20170339504A1 (en) Impedance matching filters and equalization for headphone surround rendering
CN104956695A (en) Determining renderers for spherical harmonic coefficients
CN103037301B (en) Convenient adjustment method for restoring range information of acoustic images
CN103021414B (en) Method for distance modulation of three-dimensional audio system
CN105120418B (en) Double-sound-channel 3D audio generation device and method
CN114450977A (en) Apparatus, method or computer program for processing a representation of a sound field in the spatial transform domain
CN103826194A (en) Method and device for reconstructing sound source direction and distance in multi-channel system
Lee et al. HRTF measurement for accurate sound localization cues
CN103052018B (en) Audio-visual distance information recovery method
Jakka Binaural to multichannel audio upmix
CN104023304A (en) Method for simplifying five-loudspeaker system to four-loudspeaker system
CN107347173A (en) The implementation method of multi-path surround sound dynamic ears playback system based on mobile phone
CN104936089B (en) A kind of multi-channel system compressing method
Koyama Boundary integral approach to sound field transform and reproduction
Zhou et al. Range extrapolation of head-related transfer function using improved higher order ambisonics

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20210716

Address after: 215000 unit 01, 5 / F, building a, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province

Patentee after: BOOSLINK SUZHOU INFORMATION TECHNOLOGY Co.,Ltd.

Address before: 430072 Hubei Province, Wuhan city Wuchang District of Wuhan University Luojiashan

Patentee before: WUHAN University

TR01 Transfer of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20141105

CF01 Termination of patent right due to non-payment of annual fee