RU2018121757A

RU2018121757A - SYSTEM AND METHOD OF HEADING MOTION OF THE HEAD FOR OBTAINING A PARAMETRIC BINAURAL OUTPUT SIGNAL

Info

Publication number: RU2018121757A
Application number: RU2018121757A
Authority: RU
Inventors: Дирк Ерун БРЕБАРТ; Дэвид Мэттью КУПЕР; Марк Ф. ДЭВИС; Дэвид С. МАКГРАТ; Кристофер ЧЕРЛИНГ; Харальд МУНДТ; Ронда Дж. УИЛСОН
Original assignee: Долби Лэборетериз Лайсенсинг Корпорейшн; Долби Интернэшнл Аб
Priority date: 2015-11-17
Filing date: 2016-11-17
Publication date: 2019-12-19
Also published as: RU2018121757A3; JP2020110007A; RU2020116816A; IL274432B; RU2722391C2; IL274432A; MX2018006075A; ES2779603T3; JP6964703B2; MX384922B

Claims

1. A method of encoding an input audio signal based on a channel or an object for reproduction, which method comprises the steps of

(a) perform initial rendering of an input audio signal based on a channel or an object into an initial output representation;

(b) determining an estimate of the dominant audio component from an input audio signal based on a channel or an object, and determining the sequence of weight components of the dominant audio component to map the initial output representation to the dominant audio component so as to enable the use of the weights of the dominant audio component and the initial output representation to determine the estimate dominant component;

(c) determine an estimate of the direction or position of the dominant audio component; and

(d) encode the initial output representation, the weights of the dominant audio component, the direction or position of the dominant audio component as an encoded signal for playback.

2. The method of claim 1, further comprising determining an estimate of the residual mix, which is the initial output representation minus the rendering of either the dominant audio component or its estimation.

3. The method of claim 1, further comprising generating an anechoic binaural mix of the input audio signal based on the channel or object, and determining an estimate of the residual mix, the estimate of the residual mix being the anechoic binaural mix minus either rendering the dominant audio component, or its assessment.

4. The method of claim 2 or 3, further comprising determining a sequence of residual matrix coefficients for mapping the initial output representation into an estimate of the residual mix.

5. The method according to any one of the preceding paragraphs, in which the initial output presentation comprises a presentation by means of headphones or a speaker.

6. The method according to any one of the preceding paragraphs, in which the input audio signal based on a channel or an object is divided into elements of the division of time and frequency, and said coding step is repeated with respect to a sequence of time steps and a sequence of frequency bands.

7. The method according to any one of the preceding paragraphs, in which the initial output representation comprises a mix of stereo speakers.

8. A method for decoding an encoded audio signal, wherein the encoded audio signal includes:

initial output presentation;

weighting factors of the dominant audio component and the direction of the dominant audio component;

wherein the method comprises the steps in which:

(a) using weights of the dominant audio component and the initial output representation to determine the estimated dominant component;

(b) rendering the estimated dominant component with binauralization at a spatial location relative to the target listener in accordance with the direction of the dominant audio component to form a rendered binauralized estimated dominant component;

(c) reconstructing an estimate of the residual component from the initial output representation; and

(d) combine the rendered binauralized estimated dominant component and the residual component estimate to form an output spatially oriented encoded audio signal.

9. The method of claim 8, wherein the encoded audio signal further includes a sequence of residual matrix coefficients representing the residual audio signal, and step (c) further comprises the step of

(c1) apply said residual matrix coefficients to the initial output representation to reconstruct the estimate of the residual component.

10. The method of claim 8, wherein the residual component estimate is reconstructed by subtracting the rendered binauralized estimated dominant component from the initial output representation.

11. The method according to any one of paragraphs. 8-10, in which step (b) includes an initial rotation of the estimated dominant component in accordance with an input signal tracking the head movement indicating the orientation of the head of the target listener.

12. A method of decoding and reproducing an audio stream for a listener using headphones, which method comprises the steps of

(a) receiving a data stream comprising a first audio presentation and additional audio conversion data;

(b) receiving head orientation data representing the orientation of the listener;

(c) creating one or more auxiliary signals based on the first audio presentation and received transform data;

(d) creating a second audio presentation consisting of a combination of the first audio presentation and an auxiliary signal (s), where one or more auxiliary signals are modified in response to the head orientation data; and

(e) outputting a second audio presentation as an output audio stream.

13. The method according to p. 12, in which said modification of the auxiliary signals consists of modeling the acoustic path from the position of the sound source to the ears of the listener.

14. The method of claim 12 or 13, wherein said transform data consists of matrix coefficients and at least one of a position of the sound source and a direction of the sound source.

15. The method according to any one of paragraphs. 12-14, in which the conversion process is applied as a function of time or frequency.

16. The method according to any one of paragraphs. 12-15, wherein said auxiliary signals represent at least one dominant component.

17. The method according to any one of paragraphs. 12-16, in which the position or direction of the sound source, adopted as part of the conversion data, is rotated in response to head orientation data.

18. The method according to p. 17, in which the maximum amount of rotation is limited to less than 360 degrees in azimuth or elevation.

19. The method according to any one of paragraphs. 12-17, in which the secondary presentation is obtained from the first presentation by matrixing in the transform area or filter block.

20. The method according to any one of paragraphs. 12-19, in which the transform data further comprises additional matrix coefficients, and step (d) further comprises modifying the first audio presentation in response to these additional matrix coefficients, before combining the first audio presentation and the auxiliary audio signal (s).

21. Equipment containing one or more devices configured to implement the method according to any one of paragraphs. 1-20.

22. Machine-readable medium containing a program consisting of instructions that, when executed by one or more processors, instruct one or more devices to perform the method according to any one of claims. 1-20.