Cobos et al., 2022 - Google Patents

An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction

Cobos et al., 2022

Document ID: 12760258977094512988
Author: Cobos M; Ahrens J; Kowalczyk K; Politis A
Publication year: 2022
Publication venue: EURASIP Journal on Audio, Speech, and Music Processing

External Links

Cited by

Snippet

The domain of spatial audio comprises methods for capturing, processing, and reproducing audio content that contains spatial information. Data-based methods are those that operate directly on the spatial information carried by audio signals. This is in contrast to model-based …

Continue reading at link.springer.com (HTML) (other versions)

238000010801 machine learning 0 title abstract description 56

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/15—Aspects of sound capture and related signal processing for recording or reproduction
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/308—Electronic adaptation dependent on speaker or headphone connection
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels, e.g. Dolby Digital, Digital Theatre Systems [DTS]
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
- H04S1/007—Two-channel systems in which the audio signals are in digital form
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation

Similar Documents

Publication	Publication Date	Title
Cobos et al.	2022	An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction
JP7342091B2 (en)	2023-09-11	Method and apparatus for encoding and decoding a series of frames of an ambisonics representation of a two-dimensional or three-dimensional sound field
JP7181371B2 (en)	2022-11-30	Rendering method, rendering device and recording medium
Rafaely et al.	2022	Spatial audio signal processing for binaural reproduction of recorded acoustic scenes–review and challenges
US11863962B2 (en)	2024-01-02	Concept for generating an enhanced sound-field description or a modified sound field description using a multi-layer description
Morgado et al.	2018	Self-supervised generation of spatial audio for 360 video
RU2759160C2 (en)	2021-11-09	Apparatus, method, and computer program for encoding, decoding, processing a scene, and other procedures related to dirac-based spatial audio encoding
US8817991B2 (en)	2014-08-26	Advanced encoding of multi-channel digital audio signals
US8284946B2 (en)	2012-10-09	Binaural decoder to output spatial stereo sound and a decoding method thereof
US8379868B2 (en)	2013-02-19	Spatial audio coding based on universal spatial cues
US9313599B2 (en)	2016-04-12	Apparatus and method for multi-channel signal playback
EA034936B1 (en)	2020-04-08	AUDIO CODING AND DECODING USING REPRESENT CONVERSION PARAMETERS
GB2485979A (en)	2012-06-06	Spatial audio coding
He	2017	Spatial audio reproduction with primary ambient extraction
He	2016	Literature review on spatial audio
WO2022050087A1 (en)	2022-03-10	Signal processing device and method, learning device and method, and program
Zhu et al.	2025	ASAudio: A Survey of Advanced Spatial Audio Research
McCormack	2017	Real-time microphone array processing for sound-field analysis and perceptually motivated reproduction
Cheng	2011	Spatial squeezing techniques for low bit-rate multichannel audio coding
Väljamäe	2003	A feasibility study regarding implementation of holographic audio rendering techniques over broadcast networks