Disclosure of Invention
The invention provides a portable adjustment method for restoring sound image distance information in a sound reproduction stage, aiming at the defects of the prior art.
The invention provides a portable adjustment method for restoring sound image distance information, which is characterized by comprising the following steps:
step 1, downmixing an m-channel loudspeaker system to obtain an n-channel loudspeaker system, recording the n-channel loudspeaker system obtained by downmixing as an initial n-channel loudspeaker system, wherein m is greater than n, and m is greater than or equal to 3;
step 2, obtaining and measuring relevant parameters, including obtaining the distance d from the original sound source of the m-channel loudspeaker system to the listening point0Obtaining the sound pressure P of the listening point from the signal of the m-channel loudspeaker system0And sound pressure P of left and right earsL0、PR0And the particle velocity V at the listening point0And transmitted to the initial n-channel speaker system, where d0、PL0、PR0、V0All transmission is lossless; measuring the distance d of each loudspeaker from the listening point in the initial n-channel loudspeaker systemjMeasuring the distance d of each speaker from the left ear point in the initial n-channel speaker systemLjMeasuring the initial n channel speakerDistance d of each loudspeaker in the system from the right ear pointRjJ =1,2, … n; measuring the radius h of the human head;
step 3, converting the initial n-channel loudspeaker system signal into a new n-channel loudspeaker system signal to obtain an equation as follows;
representing the position vector of the listening point, and the polar coordinates are (0, 0, 0);
ρjrepresenting an initial n-channel loudspeaker systemThe distance between the position of the jth loudspeaker and the origin;
d represents the distance between the sound image generated by all the loudspeakers in the new n-channel loudspeaker system and the listening point;
d0representing the distance between the original sound source of the m-channel loudspeaker system and a listening point;
Sj(ω) is the signal of the jth speaker in the initial n-channel speaker system;
π is the circumference ratio;
the coefficient G is an acoustic constant of the audio signal propagating in air;
V0representing the particle velocity at the listening point in an m-channel loudspeaker system;
dLrepresenting the distance of the sound image generated by all the loudspeakers in the new n-channel loudspeaker system to the left ear;
dRrepresenting the distance of the sound image produced by all the loudspeakers in the new n-channel loudspeaker system to the right ear;
djrepresenting the distance of the jth loudspeaker in the initial n-channel loudspeaker system from the listening point;
dLjrepresenting the distance of the jth speaker to the left ear in the initial n-channel speaker system;
dRjrepresenting the distance from the jth speaker to the right ear in the initial n-channel speaker system;
Wjrepresenting the acoustic power of the jth loudspeaker in the initial n-channel loudspeaker system at the sound production point;
ωjrepresenting a weighting factor adjusted for a jth speaker signal in the initial n-channel speaker system;
P0representing sound pressure at a listening point in an m-channel speaker system;
PL0representing the sound pressure at the left ear point in an m-channel speaker system;
PR0representing the sound pressure at the right ear point in the m-channel loudspeaker system;
step 4, setting a target function, adding the following mapping function formula as a constraint condition into the equation obtained in the step 3 to obtain an optimized model M
The expression of the objective function is
Step 5, solving the minimum value of the optimization model MObtaining the weight factor omega of each loudspeaker signal adjustmentj,j=1,2,…n;
Step 6, obtaining the weight factor omega according to the solution in the step 5jJ =1,2, … n, and the signals of the speakers in the initial n-channel speaker system are adjusted to obtain a new n-channel speaker system.
By adopting the loudspeaker placement structure and the signal distribution method, the sound pressure of the sound at the positions of the left and right ears L, R is ensured to be unchanged and the particle speed at the listening positions is ensured to be unchanged at the listening positions in the reconstructed sound field under the condition of adopting the same number of loudspeakers, and meanwhile, the distance information between the sound and the listening positions can be recovered, and the signals in the minimum number of sound channels are adjusted, so that the sound field can be reconstructed better and more conveniently.
Detailed Description
The technical scheme of the invention is explained in detail in the following by combining the drawings and the embodiment.
As shown in fig. 2 and 4, the embodiment takes m =22 and n =10, and the specific example of the embodiment is to use a 22.2 channel system to convert to a 10.2 channel system to describe the content of the invention. 22 loudspeakers in the 22.2 sound channel system are placed on the same spherical surface, 10 loudspeakers in the 10.2 sound channel system are placed on the same spherical surface, and the radius of the external spherical surface is 2 meters. In this embodiment, the 2 sound channels are two low-frequency sound effect sound channels, signals of the two low-frequency sound effect sound channels are not processed, and positions of the two low-frequency sound effect sound channels are not changed. The overall framework flow diagram of this embodiment is shown in fig. 1.
The embodiment adopts the sound image distance information recovery portable adjusting method, which comprises the following steps:
step 1, downmixing an m-channel loudspeaker system to obtain an n-channel loudspeaker system, recording the n-channel loudspeaker system obtained by downmixing as an initial n-channel loudspeaker system, wherein m is greater than n, and m is greater than or equal to 3.
The embodiment downmixes the 22.2 channel speaker system to the original 10.2 channel speaker system (denoted as the original 10.2 channel speaker system). The method adopted in the process is to keep the physical properties (sound pressure size and particle velocity direction) of the sound of the original sound field and the reconstructed sound field at the listening point (namely the center point of the human head) unchanged.
The implementation of step 1 comprises the following sub-steps,
step 1.1, a three-dimensional rectangular coordinate system XYZ is established, the origin is O, all the coordinates related to the invention are polar coordinates and are generally marked as P (sigma, theta, phi), wherein sigma, theta and phi respectively refer to the distance between the P point and the origin O, the included angle between the projection of the connecting line of the P point and the origin O on the X axis and the X axis, and the included angle between the connecting line of the P point and the origin O and the XOY plane, and the coordinates mentioned herein are explained with reference to the description. The 22 loudspeakers in the 22.2-channel loudspeaker system are arranged on the same spherical surface, see fig. 2, the specific position of each loudspeaker is fixed, and the coordinate xi of each loudspeaker is recordedj=(ρj,θj,φj) (j =1,2, … 22), and the listening point positions in this embodiment are all the sameAt the origin O;
as shown in fig. 2, in the embodiment, the center coordinates of the sphere are three-dimensional origin coordinates (0, 0, 0) and also coordinates at a listening point, and the coordinates of the 22 speakers are (2, 0 °,0 °), (2, 30 °,0 °), (2, 60 °,0 °), (2, 90 °,0 °), (2, 120 °,0 °), (2, 150 °,0 °), (2, 180 °,0 °), (2, 225 °,0 °), (2, 270 °,0 °), (2, 315 °,0 °), (2, 0 °, 48 °), (2, 45 °, 48 °), (2, 90 °, 48 °), (2, 135 °, 48 °), (2, 180 °, 48 °), (2, 225 °, 48 °), (2, 270 °, 48 °), (2, 0 °, 90 °), (2, 45 °, 30 °), (2, 90 °, -30 °), (2 °,135 °, -30 °) and
step 1.2, selecting the placing positions of 10 loudspeakers in an initial 10.2 loudspeaker sound channel system. The selection requirements are as follows: (1) selecting 1 speaker at a time in the 22.2 channel speaker system, the ray from the origin O to the direction of the point where each speaker of the 22.2 channel speaker system is located must be contained inside or on the side of the spherical triangle formed by the three speakers in the initial 10.2 channel speaker system, (2) while ensuring that the area of the spherical triangle formed by the three selected speakers in the initial 10.2 channel speaker system is the smallest of all alternatives. The positions of the 10 speakers in the initial 10.2 channel speaker system may be determined based on the positions of the 22 speakers in the 22.2 channel speaker system in accordance with the two-point requirements described above. After the position selection of the 10 speakers of the initial 10.2 channel speaker system is completed, the coordinates of the 10 speakers are recorded and recorded as

(j=1,2,…10)。
The method of selecting speakers of the initial 10.2 channel speaker system in this embodiment is shown in fig. 4, and the coordinates of the 10 speakers in the initial 10.2 channel system are recorded as (2, 0 °,0 °), (2, 60 °,0 °), (2, 120 °,0 °), (2, 180 °,0 °), (2, 270 °,0 °), (2, 0 °, 48 °), (2, 90 °, 48 °), (2, 180 °, 48 °), (2, 0 °, 90 °), (2, 90 °, and-30 °), respectively.
Step 1.3, the signals distributed by 10 loudspeakers in the initial 10.2-channel loudspeaker system are solved. First, the signal distribution coefficients for the three loudspeakers in the initial 10.2 channel system are found. In the actual operation process, the basis is to ensure that the sound pressure and the particle velocity direction of the sound generated by one loudspeaker in the 22.2-channel loudspeaker system at the listening point and the corresponding three loudspeakers in the initial 10.2-channel loudspeaker system at the listening point are unchanged, and one loudspeaker in the 22.2-channel system is placed at the zeta position, wherein the zeta position is set1、ζ2、ζ3The three loudspeakers with the smallest area in the initial 10.2 channel loudspeaker system containing the zeta point are respectively arranged, see fig. 3, the spherical radius is r, and the signal of the virtual loudspeaker at the zeta point is distributed to the zeta point1、ζ2、ζ3The distribution factors of the loudspeakers are respectively omega1、ω2、ω3:
The formula is as follows:
wherein i is an imaginary unit, and e represents a mathematical constant, also called euler number;
ρ1、ρ2、ρ3and rho is zeta1、ζ2、ζ3Distance at ζ and origin O;
k is the number of waves,
f is the frequency of the sound, c is the speed of sound propagation in air;
θ1、θ2、θ3and theta is zeta1、ζ2、ζ3The line connecting zeta with the origin OThe included angle between the projection on the XOZ plane and the X axis;
are each ζ
1、ζ
2、ζ
3Angle between the line connecting ζ and origin O and XOY plane.
According to the analogy of the method, the distribution coefficient of each loudspeaker in the 22.2 sound channel loudspeaker system to 3 corresponding loudspeakers in the initial 10.2 sound channel loudspeaker system is obtained, then the signals repeatedly distributed by each loudspeaker in the 10 loudspeakers in the initial 10.2 sound channel loudspeaker system are superposed, and the signals of each loudspeaker in the initial 10.2 sound channel loudspeaker system are obtained, and 2 low-frequency sound effect loudspeaker signals are unchanged.
The distribution coefficients and the speaker signal calculations in this embodiment can be calculated by substituting the associated coordinates into the formula according to formula (1) and the above description. The present embodiment only describes that one method is selected for implementation of downmixing an m-channel speaker system to an initial n-channel speaker system, and the method for implementing the downmixing is not limited to the specific description of the embodiment.
And 2, acquiring and measuring the related parameters.
Keeping the positions of the loudspeakers unchanged, and adjusting the signals of the loudspeakers in the initial 10.2-channel loudspeaker system to obtain a new 10.2-channel loudspeaker system.
In the step, the distance between the original sound source and the listening point can be acquired from a signal acquisition sound field of the 22.2-channel loudspeaker system, and the distance between the original sound source and the listening point of the 22.2-channel loudspeaker system can also be acquired in other modes; obtaining sound pressure P of listening point from 22.2 sound channel loudspeaker system signal0And sound pressure P of left and right earsL0、PR0And the particle velocity V at the listening point0As a signal to an initial 10.2 channel loudspeaker system, where d0,PL0,PR0,V0All without loss of transmission, whichIn d0Can be measured or obtained by other means, PL0,PR0,V0May be calculated or measured. Measuring the distance d of each loudspeaker in an initial 10.2 channel loudspeaker system from the listening pointj(j =1,2, … 10), the distance d of each speaker from the left ear point in the initial 10.2 channel speaker system was measuredLj(j =1,2, … 10), the distance d of each speaker from the right ear point in the initial 10.2 channel speaker system was measuredRj(j =1,2, … 10), the radius of the head h is measured.
And 3, converting the initial n-channel loudspeaker system signal into a new n-channel loudspeaker system signal to obtain an equation.
In this embodiment the original 10.2 channel loudspeaker system signals are transformed to new 10.2 channel loudspeaker system signals. In the transformation process, the positions of the speakers of the new 10.2 channel speaker system and the old 10.2 channel speaker system are not changed, the sound pressures of the sound field of the 22.2 channel system and the sound of the reconstructed sound field at the listening point, the left ear and the right ear are kept unchanged, the particle velocity at the listening point is unchanged, and 2 low-frequency sound effect channels are not processed, as shown in fig. 4 and 5. 22.2 original sound source to listening point distance d of sound track loudspeaker system0And the reconstructed sound image in the new 10.2 channel loudspeaker system is at the same distance from the listening point.
Namely, the positions of the speakers of the new and old n-channel speaker systems are not changed in the transformation process, but the sound pressure of the sound field of the 22.2-channel speaker system and the sound of the reconstructed sound field and the particle speed at the listening point are not changed at three points of the listening point and L, R, and the distance d between the original sound source of the 22.2-channel speaker system and the listening point0And adjusting the loudspeaker signals on the premise of the same distance from the sound image reconstructed in the new n-channel system to the listening point.
The equation that can be derived from these requirements is:
wherein,
p' is the sound pressure generated by all the loudspeakers in the new 10.2 channel loudspeaker system at the listening point;
P′L0the sound pressure generated at the left ear for all speakers in the new 10.2 channel speaker system;
P′R0the sound pressure generated at the right ear for all speakers in the new 10.2 channel speaker system;
v' is the particle velocity produced at the listening point by all loudspeakers in the new 10.2 channel loudspeaker system;
d is the distance between the sound image generated by all the loudspeakers in the new 10.2-channel loudspeaker system and the listening point;
P0representing sound pressure at a listening point in a 22.2 channel speaker system;
PL0representing the sound pressure at the left ear point in a 22.2 channel speaker system;
PR0representing the sound pressure at the right ear point in a 22.2 channel loudspeaker system;
d0representing the distance between the original sound source of a 22.2 channel loudspeaker system and a listening point;
V0representing the particle velocity at the listening point in a 22.2 channel speaker system.
One loudspeaker provides a single point source whose sound pressure in the free field varies with the distance of the point from the source (assuming the environment is a free field, i.e. there is no reflection of sound):
(3)
wherein,
Pjthe sound pressure of a measuring point;
djthe distance between a point sound source and a measuring point is measured;
Wjis the sound power of the pointing sound source at the sound production point;
and pi means a circumferential ratio.
The variation of sound pressure of n point sound sources in a free field along with the distance between a measuring point and the sound source is as follows:
wherein WjRefers to the sound power of the j-th point sound source at the sound production point, j =1,2, … n. As in fig. 5, ω1,ω2,ω3Is the signal distribution coefficient, W, of the three loudspeakers1,W2,W3Refers to the acoustic power of the three speakers.
n coordinates are
j is 1,2, … n, inListening point
The particle velocity is:
wherein,
Sj(ω) is the signal in the loudspeaker;
i is an imaginary unit;
k is the number of waves,
f is the frequency;
c is the speed of sound propagation in air;
the coefficient G is an acoustic constant of the audio signal propagating in the air.
As shown in fig. 6, due to the particularity of selecting two points L and R, the present implementation may also add a constraint condition to the model, i.e. the distance d between the sound image generated by all the speakers in the new 10.2 channel speaker system and the listening point, and the distance d between the sound image and the left and right earsL、dRThe relationship between the human head radius h is:
by collating the above equations (2), (3), (4), (5) and (6) and simplifying, the following equations can be obtained:
representing the position vector of the listening point, and the polar coordinates are (0, 0, 0);
ρjrepresenting the distance between the position of the jth loudspeaker in the initial 10.2-channel loudspeaker system and the origin;
d represents the distance of the sound image produced by all the loudspeakers in the new 10.2 channel loudspeaker system from the listening point;
dLrepresenting the distance of the sound image produced by all the loudspeakers in the new 10.2 channel loudspeaker system to the left ear;
dRshow newDistance of sound image generated by all speakers in the 10.2 channel speaker system to the right ear;
djrepresents the distance from the listening point of the jth loudspeaker in the initial 10.2-channel loudspeaker system, j being 1,2,3 … 10;
dLjrepresents the distance from the jth speaker to the left ear in the initial 10.2 channel speaker system, j =1,2,3 … 10;
dRjrepresents the distance from the jth loudspeaker to the right ear in the initial 10.2-channel loudspeaker system, j being 1,2,3 … 10;
Wjj =1,2,3 … 10, which represents the sound power of the jth speaker in the initial 10.2-channel speaker system at the sound generating point, and can be calculated by the signal input to the jth speaker in the initial 10.2-channel speaker system and the inherent power of the speaker;
ωja signal distribution coefficient representing the jth speaker, i.e. a weight factor adjusted to each speaker signal in the initial 10.2-channel speaker system, j =1,2,3 … 10;
P0representing sound pressure at a listening point in a 22.2 channel speaker system;
PL0representing the sound pressure at the left ear point in a 22.2 channel speaker system;
PR0representing the sound pressure at the right ear point in a 22.2 channel loudspeaker system;
d0representing the distance between the original sound source of a 22.2 channel loudspeaker system and a listening point;
Sj(ω) is the signal of the jth speaker in the initial 10.2 channel speaker system;
π is the circumference ratio;
the coefficient G is an acoustic constant of the audio signal propagating in air;
V0representing 22.2 channel speaker systemsThe velocity of the particles at the listening point in the system.
And 4, selecting a target function. Because the number of the channels adopted in the multi-channel system is more, the placement is troublesome, the signal is convenient to adjust, and the use is convenient, the objective function adopted by the invention is the minimum number of the channels needing to be adjusted. Introducing a weighting factor omega for the adjustmentjA mapping function of:
the mapping function is added as a constraint to equation (7). The target function expression is:
step 5, solving the minimum value of the optimization model M to obtain the weight factor omega of each loudspeaker signal adjustmentj. In this embodiment, the minimum value of the optimization model M after the objective function is added in step 4 is solved by using the optimization software Lingo, and the weight factor ω adjusted by each speaker signal is obtainedj,j=1,2,3…10。
And 6, adjusting the loudspeaker signal. In this embodiment, the weight factor ω obtained by solving in step 5 is usedj(j =1,2,3 … 10) adjusting the signals of the speakers in the original 10.2 channel system so that the distances between the reconstructed sound image and the listening point and the distances between the original sound source and the listening point of the 22.2 channel speaker system are equal, the channels needing to be adjusted are the least, and the aim is to adjust the multi-channel system as conveniently and quickly as possible besides ensuring that the sound pressures at the listening point, the L and the R point are constant and the particle speed at the center point is constant.
The specific embodiments described herein are merely illustrative of the spirit of the invention. Various modifications or additions may be made to the described embodiments or alternatives may be employed by those skilled in the art without departing from the spirit or ambit of the invention as defined in the appended claims.