CN109005496A - A kind of HRTF middle vertical plane orientation Enhancement Method - Google Patents
A kind of HRTF middle vertical plane orientation Enhancement Method Download PDFInfo
- Publication number
- CN109005496A CN109005496A CN201810834254.1A CN201810834254A CN109005496A CN 109005496 A CN109005496 A CN 109005496A CN 201810834254 A CN201810834254 A CN 201810834254A CN 109005496 A CN109005496 A CN 109005496A
- Authority
- CN
- China
- Prior art keywords
- azimuth
- itd
- hrtf
- angle
- enhanced
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H04S1/005—For headphones
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
Abstract
The present invention provides a kind of HRTF middle vertical plane orientation Enhancement Methods, it is related to sound source field, the present invention is after knowing front or dead astern orientation and closest azimuthal ITD, ITD when head and front or dead astern deviation are 1 ° is obtained using interpolation method, original HRTF is transformed using the ITD, new HRTF is obtained, the HRTF that the present invention synthesizes does not have difference on frequency spectrum with original HRTF, but may make hearer to enhance orientation locating effect using the HRTF.The present invention compensates the ITD in confusing orientation on the basis of not changing original signal frequency spectrum, and the HRTF obtained using the compensation and dry audio signal carry out convolution, and the aspect sense of synthesis is good, can effectively promote human ear front and back position Auditory Perception.
Description
Technical Field
The invention relates to the field of sound sources, in particular to an HRTF (head related transfer function) azimuth enhancement method.
Background
Early studies thought that the most important basis for auditory localization was the difference in sound signals between the two ears, which is also the basic idea of duplex theory. If the distances from the sound source to the ears are different, the direct sound waves emitted from the sound source generally have a sequential order when reaching the ears. For example, when the sound source is closer to the right side of the head, the direct sound wave is transmitted to the right ear first, and then to the left ear through diffraction of the head. This process creates a small time difference between the arrival of sound waves at both ears, called Interaural Time Difference (ITD), and the sound waves are attenuated when they are transmitted to the left ear due to the obstruction of the head, and the sound intensity heard at the ear close to the sound source by 1 is greater than that heard at the ear far away, creating an interaural intensity difference (I0 LD). The ITDs and ILDs are also commonly referred to as binaural features.
On the middle vertical plane, the time difference ITD when the sound source reaches two ears and the sound level difference of the two ears are the same, and at the moment, the method for positioning by applying the duplex theory is not feasible any more. The document "Improvement of front-back positioning of 3D sound-based generation in the 7th International Conference on Advanced Communication Technology,2005, ICACT2005" provides a method for enhancing the front and back positions to achieve positioning of the front and back, which uses a spectral difference method to achieve front and back positioning. However, this method changes the frequency spectrum of the original HRTF signal. By analyzing the correlation between the HRTF and the physiological parameters, the physiological parameters influencing the HRTF at 0 degrees and 0 degrees are different from the physiological parameters influencing the HRTF at 180 degrees and 0 degrees, and by analyzing the correlation between the orientation enhancement by using spectral subtraction and the physiological parameters, the related physiological parameters of the HRTF are changed compared with the HRTF without orientation enhancement. This is not in accordance with the actual situation. This method is possible in the sense of auditory perception, but it is not clearly explained in a physical sense.
Disclosure of Invention
To overcome the deficiencies of the prior art, the present invention primarily provides a new method for implementing azimuth enhancement.
The technical scheme adopted by the invention for solving the technical problem comprises the following steps:
the method comprises the following steps: respectively obtaining the azimuth angle (theta) needing to be enhanced by using a 10% rising edge method0Phi) and adjacent azimuth angle (theta) at the same pitch angle as the phi)1Phi) ITD; wherein (theta)0Phi) denotes pitch angle phi and azimuth angle theta0When enhanced orientation is desired, (θ)1Phi) represents the pitch angle phi and the azimuth angle theta0Azimuth θ in nearest neighbor test database1To obtain:
ITDlead(θ,φ)=tL,η-tR,η(1)
wherein the ITDlead(θ, φ) represents a binaural time difference at the azimuth angle (θ, φ); t is tL,η,tR,ηrepresenting the starting time at which the HRIRs of the left and right ears first exceed the HRIR maximum amplitude percentage η, respectively;
step two: obtaining the ITD under the condition that the azimuth angle is 1 DEG deviation from the azimuth angle needing to be enhanced through an interpolation method:
wherein,to require enhanced orientation (theta)0Phi) ITD;is the azimuth (theta) most adjacent to the enhanced azimuth1Phi) ITD;representing azimuth angle theta0+1, the pitch angle is the binaural time difference at phi, theta0+1 represents the azimuth angle theta0An azimuth of 1 ° greater;
step three: angle of azimuth (theta)0Phi) is compensated, compensated (theta)0Phi) binaural time difference of
Step four: to (theta)0Phi), the HRTF at phi) is modified as follows:
if the nearest neighbor orientation (theta) is selected1Phi) is located at (theta)0On the left side of phi), then at the azimuth angle (theta)0Phi), zero padding is performed on the left side of the left ear HRIR; meanwhile, in order to ensure the length of the left ear and the right ear to be consistent, zero filling is carried out on the right side of the HRIR of the right ear, and the HRIR after modification is
Wherein,indicating a modified azimuth angle of (theta)0Phi) right ear head related transfer function;
indicating a modified azimuth angle of (theta)0Phi) left ear head related transfer function;to representAn all-zero array of rows and columns; r represents the right ear, L represents the left ear;
if the selected nearest azimuth is located at the right side of the azimuth to be enhanced, the zero padding mode is as follows:
will be modifiedSet to the tested direction as theta0And the head-related transfer function with the pitching angle phi is used for carrying out convolution on the head-related transfer function and the dry signal to obtain an audio signal with good azimuth sense, and the technology can realize the azimuth theta0And the pitch angle is the azimuth enhancement at phi.
The invention has the advantages that on the basis of not changing the original signal frequency spectrum, ITD of the confusable azimuth is compensated, the HRTF obtained by the compensation is convoluted with the dry audio signal, the synthesized signal azimuth sense is good, and the auditory sense of the front and back azimuths of human ears can be effectively improved.
Drawings
FIG. 1 is a flow chart of the present invention for vertical plane orientation enhancement.
FIG. 2 is a diagram of the frequency domain of the original signal in an embodiment of the present invention.
Fig. 3 is a frequency domain diagram after applying azimuth enhancement in an embodiment of the present invention.
Detailed Description
The present invention will be further explained with reference to the drawings and examples, taking the azimuth angle (0 ° ) as an example.
To overcome the deficiencies of the prior art, the present invention is directed to implementing azimuth enhancement by attempting to invent a new method. The invention can help to realize the direction enhancement at the front and back positions by slightly rotating the head. Because the position that the human ear can recognize is 3.6 degrees in the horizontal position, the human ear can hardly clearly distinguish (0 degrees ) from (0 degrees, 1 degrees); meanwhile, the HRTF at the (0 ° ) bearing reflects the auditory perception of the human ear on the front sound, but the auditory perception itself affects the sound, so that the sound is in the center of the head. The invention discloses a method for reconstructing an original HRTF by knowing the ITD of an azimuth right in front or behind and the nearest azimuth angle, applying an interpolation method to obtain the ITD when the deviation of the head from the azimuth right in front or behind is 1 degree, and applying the ITD to reconstruct the original HRTF to obtain a new HRTF. The synthesized HRTF has no difference with the original HRTF in frequency spectrum, but the application of the HRTF can enhance the azimuth positioning effect of a listener.
Although the orientation enhancement algorithm obtained by the spectral difference method can enhance the orientation feeling of a listener, the application of the method to obtain the front and back orientation spectrums actually changes the physiological factors influencing the frequency spectrums, which are two completely different HRTFs from the HRTF obtained by measurement. The invention provides a new orientation enhancement algorithm on the basis of not changing the HRTF frequency spectrum, so that a listener can accurately distinguish signal sources.
The technical scheme adopted by the invention for solving the technical problem comprises the following steps:
the method comprises the following steps: respectively obtaining the azimuth angle (theta) needing to be enhanced by using a 10% rising edge method0Phi) and adjacent azimuth angle (theta) at the same pitch angle as the phi)1Phi) ITD; wherein (theta)0Phi) denotes pitch angle phi and azimuth angle theta0When enhanced orientation is desired, (θ)1Phi) represents the pitch angle phi and the azimuth angle theta0Azimuth θ in nearest neighbor test database1To obtain:
ITDlead(θ,φ)=tL,η-tR,η(1)
wherein the ITDlead(θ, φ) represents a binaural time difference at the azimuth angle (θ, φ); t is tL,η,tR,ηrespectively representing the starting time when the HRIR of the left ear and the right ear exceeds the maximum HRIR amplitude percentage eta for the first time, wherein eta is 10 percent.
Step two: obtaining the ITD under the condition that the azimuth angle is 1 DEG deviation from the azimuth angle needing to be enhanced through an interpolation method:
wherein,to require enhanced orientation (theta)0Phi) ITD;is the azimuth (theta) most adjacent to the enhanced azimuth1Phi) ITD;representing azimuth angle theta0+1, the pitch angle is the binaural time difference at phi, theta0+1 represents the azimuth angle theta0An azimuth of 1 ° greater;
step three: angle of azimuth (theta)0,φ) Compensating for the ITD of (theta)0Phi) binaural time difference of
Step four: to (theta)0Phi), the HRTF at phi) is modified as follows:
if the nearest neighbor orientation (theta) is selected1Phi) is located at (theta)0On the left side of phi), then at the azimuth angle (theta)0Phi), zero padding is performed on the left side of the left ear HRIR; meanwhile, in order to ensure the length of the left ear and the right ear to be consistent, zero filling is carried out on the right side of the HRIR of the right ear, and the HRIR after modification is
Wherein,indicating a modified azimuth angle of (theta)0Phi) right ear head related transfer function;
indicating a modified azimuth angle of (theta)0Phi) left ear head related transfer function;to representAn all-zero array of rows and columns; r represents the right ear, L represents the left ear;
if the selected nearest azimuth is located at the right side of the azimuth to be enhanced, the zero padding mode is as follows:
will be modifiedSet to the tested direction as theta0And the head-related transfer function with the pitching angle phi is used for carrying out convolution on the head-related transfer function and the dry signal to obtain an audio signal with good azimuth sense, and the technology can realize the azimuth theta0And the pitch angle is the azimuth enhancement at phi.
The examples are as follows:
the method comprises the following steps: referring to fig. 1, the ITDs at the azimuth (0 ° ) and the azimuth (5 °,0 °) adjacent to the same pitch angle are obtained by applying the 10% rising edge method:
ITDlead(θ,φ)=tL,10%-tR,10%(5)
wherein the ITDlead(θ, φ) represents a binaural time difference at the azimuth angle (θ, φ); t is tL,10%、tR,10%Representing the starting time when the HRIRs of the left and right ears first exceeded 10% of their maximum amplitude, respectively.
Step two: the ITD at an azimuth offset of 1 ° from the azimuth to be enhanced, i.e., (1 °,0 °) is obtained by an interpolation method.
Wherein the ITD(0°,0°)For ITD at orientations (0 deg. ) requiring enhancement, ITD(5°,0°)To be most oriented with the enhancementThe ITD of the adjacent orientation, the closest orientation of the database used in the present invention is (5 °,0 °).
Step three: the ITD at azimuth angle (0 ° ) is compensated. After compensation (0 DEG ) the binaural time difference is
Step four: and (5) reconstructing the HRTF at the position of (0 degrees and 0 degrees).
If the closest azimuth (5 DEG, 0 DEG) is selected to be located on the right side of (0 DEG ), the azimuth angle (theta)0Phi) zero padding is performed on the left side of the right ear HRIR; and zero filling is carried out on the right side of the right ear HRIR for ensuring the length consistency of the left ear and the right ear.To rebuild the finished HRIR.
Wherein,representing the right ear head related transfer function at the position angle (0 DEG, phi DEG) after the transformation;
representing the related transfer function of the left ear head at the position of the transformed azimuth angle (0 degrees and 0 degrees);to representRow, 1 column of all-zero arrays; (0 ° ) is the azimuth angle that needs to be enhanced, R for the right ear and L for the left ear.
Compared with the HRTF obtained by measurement, the HRTF at the position of (0 degrees and 0 degrees) reconstructed by applying the method has better direction sense.
As can be seen from fig. 2 and fig. 3, after the method proposed by the present invention is applied to the azimuth enhancement, the frequency domain graph after the azimuth enhancement is not changed from the frequency domain graph of the original signal.
Claims (1)
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810834254.1A CN109005496A (en) | 2018-07-26 | 2018-07-26 | A kind of HRTF middle vertical plane orientation Enhancement Method |
| CN201910201711.8A CN109769195B (en) | 2018-07-26 | 2019-03-18 | A method for enhancing vertical plane orientation in HRTF |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201810834254.1A CN109005496A (en) | 2018-07-26 | 2018-07-26 | A kind of HRTF middle vertical plane orientation Enhancement Method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN109005496A true CN109005496A (en) | 2018-12-14 |
Family
ID=64596349
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201810834254.1A Withdrawn CN109005496A (en) | 2018-07-26 | 2018-07-26 | A kind of HRTF middle vertical plane orientation Enhancement Method |
| CN201910201711.8A Active CN109769195B (en) | 2018-07-26 | 2019-03-18 | A method for enhancing vertical plane orientation in HRTF |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910201711.8A Active CN109769195B (en) | 2018-07-26 | 2019-03-18 | A method for enhancing vertical plane orientation in HRTF |
Country Status (1)
| Country | Link |
|---|---|
| CN (2) | CN109005496A (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113079452A (en) * | 2021-03-30 | 2021-07-06 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio processing method, audio direction information generating method, electronic device, and medium |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110493701B (en) * | 2019-07-16 | 2020-10-27 | 西北工业大学 | HRTF (head related transfer function) personalization method based on sparse principal component analysis |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1236652C (en) * | 2002-07-02 | 2006-01-11 | 矽统科技股份有限公司 | Stereo sound effect production method |
| CN101960866B (en) * | 2007-03-01 | 2013-09-25 | 杰里·马哈布比 | Audio Spatialization and Environment Simulation |
| CN103634733B (en) * | 2008-07-31 | 2016-05-25 | 弗劳恩霍夫应用研究促进协会 | The signal of binaural signal generates |
| AU2013235068B2 (en) * | 2012-03-23 | 2015-11-12 | Dolby Laboratories Licensing Corporation | Method and system for head-related transfer function generation by linear mixing of head-related transfer functions |
| US10142761B2 (en) * | 2014-03-06 | 2018-11-27 | Dolby Laboratories Licensing Corporation | Structural modeling of the head related impulse response |
| KR102529121B1 (en) * | 2014-03-28 | 2023-05-04 | 삼성전자주식회사 | Method and apparatus for rendering acoustic signal, and computer-readable recording medium |
| CA3041710C (en) * | 2014-06-26 | 2021-06-01 | Samsung Electronics Co., Ltd. | Method and device for rendering acoustic signal, and computer-readable recording medium |
| CN104410939B (en) * | 2014-10-16 | 2017-12-29 | 华为技术有限公司 | Acoustic image direction feeling treating method and apparatus |
-
2018
- 2018-07-26 CN CN201810834254.1A patent/CN109005496A/en not_active Withdrawn
-
2019
- 2019-03-18 CN CN201910201711.8A patent/CN109769195B/en active Active
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN113079452A (en) * | 2021-03-30 | 2021-07-06 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio processing method, audio direction information generating method, electronic device, and medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109769195B (en) | 2020-04-03 |
| CN109769195A (en) | 2019-05-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Bernschütz et al. | Binaural reproduction of plane waves with reduced modal order | |
| AU2020200448B2 (en) | Headtracking for parametric binaural output system and method | |
| Pörschmann et al. | Directional equalization of sparse head-related transfer function sets for spatial upsampling | |
| US9167344B2 (en) | Spectrally uncolored optimal crosstalk cancellation for audio through loudspeakers | |
| US20090067636A1 (en) | Optimization of Binaural Sound Spatialization Based on Multichannel Encoding | |
| Duda | Elevation dependence of the interaural transfer function | |
| KR20100081999A (en) | A method for headphone reproduction, a headphone reproduction system, a computer program product | |
| CN102100089A (en) | Device operating according to angle and method for obtaining pseudo-stereo audio signal | |
| CN109005496A (en) | A kind of HRTF middle vertical plane orientation Enhancement Method | |
| Deppisch et al. | Blind identification of binaural room impulse responses from smart glasses | |
| JP6964703B2 (en) | Head tracking for parametric binaural output systems and methods | |
| Goetze et al. | Direction of arrival estimation based on the dual delay line approach for binaural hearing aid microphone arrays | |
| EP2612437A1 (en) | Spectrally uncolored optimal croostalk cancellation for audio through loudspeakers | |
| Bai et al. | An integrated analysis-synthesis array system for spatial sound fields | |
| Hammond et al. | Robust median-plane binaural sound source localization | |
| Shigetani et al. | Accuracy of binaural signal in Higher-Order Ambisonics reproduction with different decoding approaches | |
| Ko et al. | DNN-based HRTF individualization for accurate spectral cues using a compact PRTF | |
| Nelson et al. | Binaural hearing and systems for sound reproduction | |
| Ziegelwanger et al. | Modeling the broadband time-of-arrival of the head-related transfer functions for binaural audio | |
| Keyrouz | Humanoid binaural hearing: Adaptive approach | |
| Hughet | Binaural Hearing Effects of Mapping Microphone Array's Responses to a Listener's Head-Related Transfer Functions | |
| Yang et al. | The performance of time reversal beam-forming by linear array | |
| Nelson et al. | Systems for virtual sound imaging | |
| Bai et al. | Sound field reproduction based on the Basis Function Method and Equivalent Source Method | |
| HEARING | Fakheredine Keyrouz Notre Dame University-Louaize Zouk Mosbeh, Lebanon email: fkeyrouz@ ndu. edu. lb |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| WW01 | Invention patent application withdrawn after publication |
Application publication date: 20181214 |
|
| WW01 | Invention patent application withdrawn after publication |