[go: up one dir, main page]

WO2005069275A1 - Systems and methods for automatically equalizing audio signals - Google Patents

Systems and methods for automatically equalizing audio signals Download PDF

Info

Publication number
WO2005069275A1
WO2005069275A1 PCT/IB2005/050012 IB2005050012W WO2005069275A1 WO 2005069275 A1 WO2005069275 A1 WO 2005069275A1 IB 2005050012 W IB2005050012 W IB 2005050012W WO 2005069275 A1 WO2005069275 A1 WO 2005069275A1
Authority
WO
WIPO (PCT)
Prior art keywords
spectral energy
energy values
frequency
frequency bands
audio signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB2005/050012
Other languages
French (fr)
Inventor
Steven Van De Par
Jeroen Breebaart
Martin Mckinney
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Priority to EP05702547A priority Critical patent/EP1704559A1/en
Publication of WO2005069275A1 publication Critical patent/WO2005069275A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/04Circuits for transducers, loudspeakers or microphones for correcting frequency response
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility

Definitions

  • the present invention generally relates to audio processing applications, and more particularly, to systems and methods for automatically equalizing audio signals based on information contained in the audio signal.
  • High quality audio systems typically utilize an equalizer to amplify or attenuate selected frequency regions in order to enhance the perceived quality of the audio signal during playback.
  • These equalizers may include a set of adjustable controls that allow the user to manually adjust the degree of amplification or attenuation of each frequency region until the desired result is obtained.
  • Other approaches utilize a set of predetennined equalizer settings that optimize playback for different types of music, such as pop, jazz or classical music.
  • the user can then select the appropriate predetermined equalizer setting depending on the type of music being played.
  • these approaches can enhance the perceived quality of the audio signal, these approaches can produce sub-optimal results, especially for poor quality audio recordings with atypical spectral properties.
  • the user typically must engage in a trial and error process in order to determine the optimal settings for each frequency region.
  • Systems employing predetermined equalizer settings similarly require the user to switch between multiple equalizer settings in order to determine the setting that achieves the best results.
  • spectral energy values for a plurality of frequency bands are measured by, for example, measuring frequency components of at least a portion of the audio signal and equalizing the frequency components using previous or initial equalizer settings associated with the frequency bands.
  • spreaded energy values for each frequency band may be determined based on the measured spectral energy values and a spreading function, where the spreading function defines the spread of spectral energy across the frequency bands similar to the way it occurs within the auditory system.
  • the ratio between the unspreaded and spreaded energy in each frequency band is assumed to represent a measure of the audibility of the audio signal within that frequency band.
  • this measure may be used to update equalizer settings for each frequency band by, for example, taking the previous or initial equalizer setting (in decibels) and subtracting a normalized ratio of the measured spectral energy value and the spreaded energy value for that frequency band, where the normalized ratio may be determined by subtracting the above described ratio for that frequency band by an average ratio of unspreaded and spreaded energy values across all frequency bands under consideration to avoid an overall reduction in level.
  • the accumulation of adjustments to the equalizer settings results in slightly attenuating frequency bands that are clearly audible and slightly amplifying frequency bands that are masked by other frequency bands. This process can enhance the perceptual quality of the audio signal by optimizing the information transmitted to the listener.
  • Other embodiments provide mechanisms for updating the equalizer setting so as to avoid switching artifacts and for further enhancing the perceptual quality of the audio signal. For example, one embodiment multiplies the ratio of the spreaded and unspreaded energy values by a constant that is less than 1 in order to reduce the degree to which the equalizer settings can be adjusted in any iteration, thereby reducing switching artifacts. Other embodiments halve the equalizer settings used to amplify the audio signal in the decibel domain in order to provide a compromise between maximizing information transfer and maintaining faithfulness to the original recording. Still further embodiments may configure the device to apply a graded equalization that allows for more adjustments in midrange frequency bands than in the far ends of the spectrum.
  • An alternative or complementary embodiment for deriving a measure of audibility of the signal comprises directly comparing the ratio of the unspreaded energy and predetermined target value (which may be frequency dependent). Equalizer settings may then be adjusted until the equalized energy distribution matches the target spectral energy function stored in memory.
  • the embodiment can be used with many of the embodiments described above, such as embodiments directed to reducing switching artifacts, graded equalization for high, low and midrange frequency bands, and maintaining faithfulness to the original recording.
  • This embodiment also has advantages in that it similarly does not require any additional information about the signal (other than the signal itself) and can adapt the equalizer settings to the spectral characteristics of the signal.
  • embodiments of the present invention can significantly enhance perceived quality without the need for the user to engage in a difficult trial and error process.
  • embodiments of the present can also achieve better equalization results than conventional approaches, especially for poor quality recordings with atypical spectral properties.
  • Figure 1 illustrates an exemplary block diagram of an audio playback system in accordance with one embodiment of the present invention
  • Figure 2 illustrates an exemplary method in flowchart form for automatically equalizing audio signals in accordance with one embodiment of the present invention
  • Figure 3 illustrates an exemplary platform that may be used in accordance with embodiments of the present invention.
  • DETAILED EMBODIMENTS Embodiments of the present invention provide systems and methods for automatically equalizing audio signals. The following description is presented to enable a person skilled in the art to make and use the invention. Descriptions of specific applications are provided only as examples.
  • an audio playback device such as a CD player, MP3 player or digital power amplifier, may be configured with a single button (one key equalizer) for initiating automatic equalization.
  • the device When the button is pressed, the device automatically derives the proper equalizer settings that will enhance the perceived quality of the audio signal.
  • these equalizer settings are derived based on the spectro-temporal properties of the input signal only, thereby avoiding the need for the user to engage in a difficult and time-consuming trial and error process or provide additional information about the audio signal.
  • the audio playback device may exploit psycho-acoustic principles.
  • the audio signal may be analyzed to determine which parts of the audio spectrum that, at any moment in time, are clearly audible and which parts are masked by other spectral parts of the audio signal.
  • the parts of the spectrum that are masked are slightly amplified, while the parts of the spectrum that are clearly audible are slightly attenuated. If the equalizer adjustments are accumulated over several tens of seconds, the equalizer settings will vary relatively slowly which helps avoid switching artifacts. Furthermore, if the measurements of masking versus non-masking are based on the equalized signals, then the accumulation of adjustments will eventually reach an asymptote at some point, thereby producing steady state equalization settings.
  • the philosophy for using masking information to derive equalizer settings is based on maximizing the total information transmission to the listener.
  • an exemplary block diagram of an audio playback system in accordance with one embodiment of the present invention is illustrated generally at 100.
  • the input signal to be equalized is applied to a segmentation windowing unit 110, which segments the input signal into a number of segments of, for example 10-50 ms, and applies an analysis window, such as a Hanning window, to each segment.
  • an analysis window such as a Hanning window
  • the resulting signal in each segment is then transformed to the frequency domain using an FFT and prefiltering unit 120.
  • the FFT and prefiltering unit 120 also prefilters the frequency components with the reciprocal of an equal loudness curve to compensate for the non-equal perception of loudness at different frequencies. This process allows equalization to take into consideration the inherent perceptual characteristics of the human ear.
  • the output of the FFT and prefiltering unit 120 is then applied to an equalizing unit
  • each prefiltered frequency coefficient is amplified or attenuates each prefiltered frequency coefficient according to the current equalizer setting associated with each frequency band, eq(i) (which for the first iteration are set flat).
  • the equalized frequency components are then applied to an energy calculation unit 140 that determines the signal energy, en(i), within each frequency band.
  • the width of each frequency band described above is preferably proportional to the equivalent rectangular bandwidth (ERB) scale. These ERB frequency bands generally provide a better estimate of auditory filter bandwidth than classical Bark scale models.
  • a spreading unit 150 uses a spreading function to determine spreaded energy values for each frequency band, sen(i).
  • each spreaded energy value models the masking that occurs at a given frequency band number as a result of spectral energy at different frequency bands.
  • a signal-to-masking ratio unit 160 estimates the amount of masking at each frequency band by calculating the ratio between the spreaded and unspreaded energy values.
  • eq(i) is the equalizer setting in decibels of the i-th frequency band
  • smr(i) is the signal-to -masking ratio for the i-th frequency band and is equal to en(i)/sen(i)
  • mean(smr) is the average signal-to-masking ratio across all frequency bands under consideration
  • c is a constant for determining how fast the algorithm responds to changes in the signal-to- masking ratio. If c is set to a value of approximately 0.05, the algorithm gives a relatively slow adjustment of equalizer settings, but is sufficiently fast to adjust in a matter of tens of seconds. This relatively slow adjustment of equalizer settings helps avoid switching artifacts.
  • the update unit 170 updates the equalizer settings within the equalizing unit 130. These updated equalizer settings are then used by the equalizing unit 130 for the next iteration of the algorithm.
  • the update unit 170 also applies the updated equalizer settings to an equalizing postfilter 180 that amplifies or attenuates the frequency bands of the input signal according to the associated equalizer values.
  • embodiments of the present invention preferably adjust the equalizer settings provided by the update unit 170 by a compensation factor A.
  • This compensation factor is preferably equal to 0.5 such that the equalizer settings used by the equalizing postfilter 180 to amplify or attenuate the input signal are halved in the decibel domain. The equalizing that is obtained in this way will vary rather slowly and can be applied on a moment-to-moment basis without introducing artifacts.
  • an alternative embodiment may be configured to scan the entire musical recording or the entire CD using the processes described above and obtain an average value of the equalizer settings, eq(i). These average equalizer settings would then be used by the equalizing postfilter 180 for the entire recording or CD under consideration.
  • other embodiments may replace the button for initiating automatic equalization with a knob that is configured to adjust the equalization strength.
  • the knob may be configured to increase or decrease the responsiveness of equalization (e.g., by adjusting the value of c in equation 2 above) and/or increase or decrease the level of information transfer relative to faithfulness to the original recording (e.g., by adjusting the value of the compensation factor A used by the equalizing postfilter 180).
  • Still other embodiments may configure the update unit 170 (or equalizing postfilter 180) to apply a graded equalization that allows for more adjustments in equalizer settings within midrange frequency bands than in the far ends of the spectrum.
  • a graded equalization that allows for more adjustments in equalizer settings within midrange frequency bands than in the far ends of the spectrum.
  • n is the frequency band number
  • A(n) is the compensation factor, A, for frequency band n that is used by the update unit 170 (or equalizing and postfilter unit 180) to determine the equalizer settings used to amplify or attenuate the frequency bands of the audio signal
  • N is the total number of frequency bands.
  • A(n) is (almost) zero. Accordingly, most of the equalizing effect occurs at the midranges and less equalizing is present towards the lower or higher frequencies.
  • the update unit 170 may be configured to derive updated equalizer settings by directly comparing a ratio of the unspreaded energy, en(i), and a predetermined target spectral energy distribution (which may be frequency dependent). Equalizer settings may be adjusted by the update unit 170 until the equalized energy distribution matches the target spectral energy function stored in memory.
  • This embodiment can be used with many of the embodiments described above, such as embodiments directed to reducing switching artifacts, graded equalization at high, low and midrange frequency bands, and maintaining faithfulness to the original recording. This embodiment also has advantages in that it similarly does not require any additional information about the signal (other than the signal itself) and can adapt the equalizer settings to the spectral characteristics of the signal.
  • an exemplary method in flowchart form for automatically equalizing audio signals in accordance with one embodiment of the present invention is illustrated generally at 200.
  • the exemplary method begins at step 210 by equalizing frequency components using the current equalizer settings associated with each frequency band.
  • the equalized frequency components are then used to determine the energy within each frequency band at step 220.
  • the spreaded energy values may be determined at step 230 using, for example, a spreading function as described in equation 1 above.
  • a signal-to-masking ratio for each frequency band may then be determined at step 240 by dividing the measured energy value associated with the frequency band by the spreaded energy value associated with the frequency band.
  • the equalizer setting for each frequency band may then be updated at step 250 by taking the previous equalizer setting and subtracting a normalized signal-to-masking ratio (e.g., the signal-to-masking ratio for the frequency band minus the average signal-to-masking ratio across all frequency bands under consideration) multiplied by a responsiveness factor, c, as described in equation 2 above.
  • a normalized signal-to-masking ratio e.g., the signal-to-masking ratio for the frequency band minus the average signal-to-masking ratio across all frequency bands under consideration
  • a responsiveness factor, c e.g., the signal-to-masking ratio for the frequency band minus the average signal-to-masking ratio across all frequency bands under consideration
  • step 260 may multiply the updated equalizer settings determined in step 250 by a compensation factor A to maintain a certain degree of faithfulness to the original recording.
  • the exemplary platform includes a microprocessor 310 operably coupled to a memory system 320 via a system bus 330.
  • the memory system 320 may comprise a random access memory, a hard drive, floppy drive, a compact disk, or other computer readable medium, that stores computer instructions for an equalization module 340.
  • the exemplary system also includes an I/O interface 350 that is coupled to the microprocessor 310 and the memory system 320 in order to enable the system to input audio signals and output equalizer settings.
  • the processor 310 uses the equalization module 340 to process audio signals received from the I/O interface 350 and to determine the proper equalizer settings. These equalizer settings may then be output to the I/O interface 350 for use in amplifying the audio signal. If the equalization module 340 is implemented in accordance with the principles described above, the exemplary system may be configured to automatically enhance the perceived quality of the audio signal by automatically deriving the appropriate equalizer settings.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A system and method for automatically equalizing audio signals uses an energy calculation unit (140) to calculate spectral energy values for a plurality of frequency bands based on equalized frequency components. A spreading unit (150) then determines spreaded energy values for each frequency band based on the measured spectral energy values and a spreading function, where the spreading function defines the spread of spectral energy across the frequency bands. An update unit (160, 170) then updates equalizer settings for each frequency band by taking a previous (or initial) equalizer setting and subtracting a normalized ratio of the measured spectral energy value and the spreaded energy value for that frequency band. In this manner, the accumulation of adjustments to the equalizer settings results in slightly attenuating frequency bands that are clearly audible and slightly amplifying frequency bands that are masked by other frequency bands.

Description

SYSTEMS AND METHODS FOR AUTOMATICALLY EQUALIZING AUDIO SIGNALS The present invention generally relates to audio processing applications, and more particularly, to systems and methods for automatically equalizing audio signals based on information contained in the audio signal. High quality audio systems typically utilize an equalizer to amplify or attenuate selected frequency regions in order to enhance the perceived quality of the audio signal during playback. These equalizers may include a set of adjustable controls that allow the user to manually adjust the degree of amplification or attenuation of each frequency region until the desired result is obtained. Other approaches utilize a set of predetennined equalizer settings that optimize playback for different types of music, such as pop, jazz or classical music. The user can then select the appropriate predetermined equalizer setting depending on the type of music being played. Although these approaches can enhance the perceived quality of the audio signal, these approaches can produce sub-optimal results, especially for poor quality audio recordings with atypical spectral properties. For systems employing adjustable controls, for example, the user typically must engage in a trial and error process in order to determine the optimal settings for each frequency region. Systems employing predetermined equalizer settings similarly require the user to switch between multiple equalizer settings in order to determine the setting that achieves the best results. These types of trial and error processes can lead to a frustrating listening experience for the user and may still produce sub-optimal results depending on the technical capabilities of the user, the characteristics of the music involved and the quality of the audio recording. Therefore, in light of the foregoing problems, there is a need for systems and methods for automatically equalizing audio signals. These systems and methods would preferably perform equalization based only on information contained in the audio signal, thereby avoiding the need for the user to provide additional information about the audio recording. Embodiments of the present invention alleviate many of the foregoing problems by providing systems and method for automatically equalizing audio signals. In one embodiment, spectral energy values for a plurality of frequency bands are measured by, for example, measuring frequency components of at least a portion of the audio signal and equalizing the frequency components using previous or initial equalizer settings associated with the frequency bands. Based on these measurements, spreaded energy values for each frequency band may be determined based on the measured spectral energy values and a spreading function, where the spreading function defines the spread of spectral energy across the frequency bands similar to the way it occurs within the auditory system. The ratio between the unspreaded and spreaded energy in each frequency band is assumed to represent a measure of the audibility of the audio signal within that frequency band. Accordingly, this measure may be used to update equalizer settings for each frequency band by, for example, taking the previous or initial equalizer setting (in decibels) and subtracting a normalized ratio of the measured spectral energy value and the spreaded energy value for that frequency band, where the normalized ratio may be determined by subtracting the above described ratio for that frequency band by an average ratio of unspreaded and spreaded energy values across all frequency bands under consideration to avoid an overall reduction in level. In this manner, the accumulation of adjustments to the equalizer settings results in slightly attenuating frequency bands that are clearly audible and slightly amplifying frequency bands that are masked by other frequency bands. This process can enhance the perceptual quality of the audio signal by optimizing the information transmitted to the listener. Other embodiments provide mechanisms for updating the equalizer setting so as to avoid switching artifacts and for further enhancing the perceptual quality of the audio signal. For example, one embodiment multiplies the ratio of the spreaded and unspreaded energy values by a constant that is less than 1 in order to reduce the degree to which the equalizer settings can be adjusted in any iteration, thereby reducing switching artifacts. Other embodiments halve the equalizer settings used to amplify the audio signal in the decibel domain in order to provide a compromise between maximizing information transfer and maintaining faithfulness to the original recording. Still further embodiments may configure the device to apply a graded equalization that allows for more adjustments in midrange frequency bands than in the far ends of the spectrum. By placing limits on the level of amplification at the far ends of the spectrum, these embodiments may reduce the level of noise or distortion perceived by the user, since very little signal is generally present in these outer portions of the frequency spectrum. An alternative or complementary embodiment for deriving a measure of audibility of the signal comprises directly comparing the ratio of the unspreaded energy and predetermined target value (which may be frequency dependent). Equalizer settings may then be adjusted until the equalized energy distribution matches the target spectral energy function stored in memory. The embodiment can be used with many of the embodiments described above, such as embodiments directed to reducing switching artifacts, graded equalization for high, low and midrange frequency bands, and maintaining faithfulness to the original recording. This embodiment also has advantages in that it similarly does not require any additional information about the signal (other than the signal itself) and can adapt the equalizer settings to the spectral characteristics of the signal. By automatically equalizing the audio signal based on the audio signal itself, embodiments of the present invention can significantly enhance perceived quality without the need for the user to engage in a difficult trial and error process. Furthermore, by deriving equalizer settings based on the spectro-temporal properties of the audio signal and adjusting the derived settings to maintain faithfulness to original recording, embodiments of the present can also achieve better equalization results than conventional approaches, especially for poor quality recordings with atypical spectral properties. These and other features and advantages of the present invention will become more apparent to those skilled in the art from the following detailed description in conjunction with the appended drawings in which: Figure 1 illustrates an exemplary block diagram of an audio playback system in accordance with one embodiment of the present invention; Figure 2 illustrates an exemplary method in flowchart form for automatically equalizing audio signals in accordance with one embodiment of the present invention; and Figure 3 illustrates an exemplary platform that may be used in accordance with embodiments of the present invention. DETAILED EMBODIMENTS Embodiments of the present invention provide systems and methods for automatically equalizing audio signals. The following description is presented to enable a person skilled in the art to make and use the invention. Descriptions of specific applications are provided only as examples. Various modifications, substitutions and variations of the preferred embodiment will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments and applications without departing from the scope of the invention. Thus, the present invention is not intended to be limited to the described and illustrated embodiments, and should be accorded the widest scope consistent with the principles and features disclosed herein. Embodiments of the present invention may be used to enhance the perceived quality of audio signals during playback by automatically equalizing the audio signal. In one embodiment, an audio playback device, such as a CD player, MP3 player or digital power amplifier, may be configured with a single button (one key equalizer) for initiating automatic equalization. When the button is pressed, the device automatically derives the proper equalizer settings that will enhance the perceived quality of the audio signal. Preferably, these equalizer settings are derived based on the spectro-temporal properties of the input signal only, thereby avoiding the need for the user to engage in a difficult and time-consuming trial and error process or provide additional information about the audio signal. In order to automatically derive the proper equalizer settings, the audio playback device may exploit psycho-acoustic principles. In this context, the audio signal may be analyzed to determine which parts of the audio spectrum that, at any moment in time, are clearly audible and which parts are masked by other spectral parts of the audio signal. Depending on these measurements, the parts of the spectrum that are masked are slightly amplified, while the parts of the spectrum that are clearly audible are slightly attenuated. If the equalizer adjustments are accumulated over several tens of seconds, the equalizer settings will vary relatively slowly which helps avoid switching artifacts. Furthermore, if the measurements of masking versus non-masking are based on the equalized signals, then the accumulation of adjustments will eventually reach an asymptote at some point, thereby producing steady state equalization settings. The philosophy for using masking information to derive equalizer settings is based on maximizing the total information transmission to the listener. In other words, parts of the spectrum that are consistently (or very often) masked are increased in level until they are not more often masked than other parts of the spectrum. However, it should be noted that simply selecting equalizer settings that maximize total information transmission can lead to undesirable results. From research on visual quality perception of color and contrast, for example, it was learned that optimum quality is reached when a compromise is made between maximum information transmission and faithfulness to a (visual) reproduction. For the particular problem of enhancing audio quality, there is generally not any additional information about the most faithful equalizer setting. Accordingly, for embodiments of the present invention, it is assumed that the audio signal itself is already set such that it achieves the most faithful reproduction of the original. Therefore, in order to get the best possible audio quality that optimizes information transfer and achieves a faithful reproduction, a compromise is made between a flat equalizer setting and an adjusted equalizer setting based on masking information of the audio signal. A particularly good compromise is reached when the equalizer adjustments resulting from maximizing the information transmission are halved in the decibel domain. By providing a single button for automatically equalizing the audio signal, embodiments of the present invention can significantly enhance perceived quality without the need for the user to engage in a difficult trial and error process. Furthermore, by deriving equalizer settings based on the spectro-temporal properties of the audio signal and adjusting the derived settings to maintain faithfulness to original recording, embodiments of the present can achieve better equalization results than conventional approaches, especially for poor quality recordings with atypical spectral properties. These and other advantages will readily apparent when viewed in conjunction with the following detailed embodiments. Referring to Figure 1, an exemplary block diagram of an audio playback system in accordance with one embodiment of the present invention is illustrated generally at 100. As illustrated, the input signal to be equalized is applied to a segmentation windowing unit 110, which segments the input signal into a number of segments of, for example 10-50 ms, and applies an analysis window, such as a Hanning window, to each segment. The resulting signal in each segment is then transformed to the frequency domain using an FFT and prefiltering unit 120. The FFT and prefiltering unit 120 also prefilters the frequency components with the reciprocal of an equal loudness curve to compensate for the non-equal perception of loudness at different frequencies. This process allows equalization to take into consideration the inherent perceptual characteristics of the human ear. The output of the FFT and prefiltering unit 120 is then applied to an equalizing unit
140 that amplifies or attenuates each prefiltered frequency coefficient according to the current equalizer setting associated with each frequency band, eq(i) (which for the first iteration are set flat). The equalized frequency components are then applied to an energy calculation unit 140 that determines the signal energy, en(i), within each frequency band. It should be noted that the width of each frequency band described above is preferably proportional to the equivalent rectangular bandwidth (ERB) scale. These ERB frequency bands generally provide a better estimate of auditory filter bandwidth than classical Bark scale models. Once the frequency band energies are determined, a spreading unit 150 uses a spreading function to determine spreaded energy values for each frequency band, sen(i). In this context, each spreaded energy value models the masking that occurs at a given frequency band number as a result of spectral energy at different frequency bands. For example, the spreaded energy values may be determined in accordance with the following formula: sen(i) = Σ gfm(i,j)en(j) (1) j where sen(i) is the spreaded (masking) energy found in the i-th frequency band; en(j) is the spectral energy in the j-th frequency band; and gfm(i,j) is a spreading matrix that defines how the spectral energy distribution en(j) is spread across auditory filters. Based on the foregoing measurements of en(i) and sen(i), a signal-to-masking ratio unit 160 estimates the amount of masking at each frequency band by calculating the ratio between the spreaded and unspreaded energy values. An update unit 170 then uses the signal-to-masking ratio to update the equalization settings for each frequency band according to the following formula: eq(i) = eq(i) - c*(smr(i) - mean(smr)) (2)
where eq(i) is the equalizer setting in decibels of the i-th frequency band; smr(i) is the signal-to -masking ratio for the i-th frequency band and is equal to en(i)/sen(i); mean(smr) is the average signal-to-masking ratio across all frequency bands under consideration; and c is a constant for determining how fast the algorithm responds to changes in the signal-to- masking ratio. If c is set to a value of approximately 0.05, the algorithm gives a relatively slow adjustment of equalizer settings, but is sufficiently fast to adjust in a matter of tens of seconds. This relatively slow adjustment of equalizer settings helps avoid switching artifacts. Furthermore, by subtracting the signal-to-masking ratio for each frequency band by the average signal-to-masking ratio across all frequency bands under consideration, this process effectively normalizes the signal-to-masking ratio about the average so that some frequency bands would be amplified and some frequency bands would be attenuated, thereby avoiding an overall reduction in level. Once the updated equalizer settings are determined, the update unit 170 then updates the equalizer settings within the equalizing unit 130. These updated equalizer settings are then used by the equalizing unit 130 for the next iteration of the algorithm. The update unit 170 also applies the updated equalizer settings to an equalizing postfilter 180 that amplifies or attenuates the frequency bands of the input signal according to the associated equalizer values. As noted above, using the updated equalizer settings provided by the update unit 170 may cause the output signal to depart from a faithful reproduction of the original signal, thereby causing annoying distortion. Accordingly, in order to provide a compromise between maximum information transfer and faithfulness of reproduction, embodiments of the present invention preferably adjust the equalizer settings provided by the update unit 170 by a compensation factor A. This compensation factor is preferably equal to 0.5 such that the equalizer settings used by the equalizing postfilter 180 to amplify or attenuate the input signal are halved in the decibel domain. The equalizing that is obtained in this way will vary rather slowly and can be applied on a moment-to-moment basis without introducing artifacts. An alternative embodiment, however, may be configured to scan the entire musical recording or the entire CD using the processes described above and obtain an average value of the equalizer settings, eq(i). These average equalizer settings would then be used by the equalizing postfilter 180 for the entire recording or CD under consideration. In order to further enhance the perceptual quality of the audio signal, other embodiments may replace the button for initiating automatic equalization with a knob that is configured to adjust the equalization strength. For example, the knob may be configured to increase or decrease the responsiveness of equalization (e.g., by adjusting the value of c in equation 2 above) and/or increase or decrease the level of information transfer relative to faithfulness to the original recording (e.g., by adjusting the value of the compensation factor A used by the equalizing postfilter 180). Still other embodiments may configure the update unit 170 (or equalizing postfilter 180) to apply a graded equalization that allows for more adjustments in equalizer settings within midrange frequency bands than in the far ends of the spectrum. One exemplary grading function that works well is provided below:
A(n) = 0.25 - 0.25 * cos(2 * pi * n/N) (3)
where n is the frequency band number; A(n) is the compensation factor, A, for frequency band n that is used by the update unit 170 (or equalizing and postfilter unit 180) to determine the equalizer settings used to amplify or attenuate the frequency bands of the audio signal; and N is the total number of frequency bands. Notably, for n=N and n=l, A(n) is (almost) zero. Accordingly, most of the equalizing effect occurs at the midranges and less equalizing is present towards the lower or higher frequencies. By placing limits on the level of amplification at the far ends of the spectrum, these embodiments may reduce the level of noise or distortion perceived by the user, since very little signal is generally present in these outer portions of the frequency spectrum. In an alternative or complementary embodiment, the update unit 170 may be configured to derive updated equalizer settings by directly comparing a ratio of the unspreaded energy, en(i), and a predetermined target spectral energy distribution (which may be frequency dependent). Equalizer settings may be adjusted by the update unit 170 until the equalized energy distribution matches the target spectral energy function stored in memory. This embodiment can be used with many of the embodiments described above, such as embodiments directed to reducing switching artifacts, graded equalization at high, low and midrange frequency bands, and maintaining faithfulness to the original recording. This embodiment also has advantages in that it similarly does not require any additional information about the signal (other than the signal itself) and can adapt the equalizer settings to the spectral characteristics of the signal. Referring to Figure 2, an exemplary method in flowchart form for automatically equalizing audio signals in accordance with one embodiment of the present invention is illustrated generally at 200. As illustrated, the exemplary method begins at step 210 by equalizing frequency components using the current equalizer settings associated with each frequency band. The equalized frequency components are then used to determine the energy within each frequency band at step 220. Based on these determined energy values, the spreaded energy values may be determined at step 230 using, for example, a spreading function as described in equation 1 above. A signal-to-masking ratio for each frequency band may then be determined at step 240 by dividing the measured energy value associated with the frequency band by the spreaded energy value associated with the frequency band. The equalizer setting for each frequency band may then be updated at step 250 by taking the previous equalizer setting and subtracting a normalized signal-to-masking ratio (e.g., the signal-to-masking ratio for the frequency band minus the average signal-to-masking ratio across all frequency bands under consideration) multiplied by a responsiveness factor, c, as described in equation 2 above. These updated equalizer settings may then be used to update the equalizer settings used at step 210 for the next iteration of the exemplary method. The updated equalizer settings may also be used at step 260 to set the equalizer settings used to amplify or attenuate corresponding frequency bands of the input signal. In this regard, step 260 may multiply the updated equalizer settings determined in step 250 by a compensation factor A to maintain a certain degree of faithfulness to the original recording. Referring to Figure 3, an exemplary platform that may be used in accordance with embodiments of the present invention is illustrated generally at 300. As illustrated, the exemplary platform includes a microprocessor 310 operably coupled to a memory system 320 via a system bus 330. The memory system 320 may comprise a random access memory, a hard drive, floppy drive, a compact disk, or other computer readable medium, that stores computer instructions for an equalization module 340. The exemplary system also includes an I/O interface 350 that is coupled to the microprocessor 310 and the memory system 320 in order to enable the system to input audio signals and output equalizer settings. In operation, if the user presses an equalization button for initiating automatic equalization of the audio signal, the processor 310 uses the equalization module 340 to process audio signals received from the I/O interface 350 and to determine the proper equalizer settings. These equalizer settings may then be output to the I/O interface 350 for use in amplifying the audio signal. If the equalization module 340 is implemented in accordance with the principles described above, the exemplary system may be configured to automatically enhance the perceived quality of the audio signal by automatically deriving the appropriate equalizer settings. While the present invention has been described with reference to exemplary embodiments, it will be readily apparent to those skilled in the art that the invention is not limited to the disclosed and illustrated embodiments but, on the contrary, is intended to cover numerous other modifications, substitutions and variations and broad equivalent arrangements that are included within the scope of the following claims.

Claims

CLAIMS What is claimed is: 1. A method for automatically equalizing an audio signal, the method comprising: measuring spectral energy values for a plurality of frequency bands; determining spreaded energy values for each frequency band based on the measured spectral energy values and a spreading function, the spreading function defining a spread of spectral energy across frequency bands; and updating equalizer settings for each frequency band based on the measured spectral energy values and the spreaded spectral energy values. 2. The method of claim 1 , wherein the step of measuring comprises: measuring frequency components for at least a portion of the audio signal; and equalizing each frequency component using the equalizer setting associated with the corresponding frequency band. 3. The method of claim 2, further comprising prefiltering the measured frequency components with an inverse of an equal loudness curve prior to equalizing each frequency component. 4. The method of claim 1, wherein the step of updating comprises updating the equalizer settings based on a ratio of the measured spectral energy values and the spreaded spectral energy values. 5. The method of claim 1 , further comprising using the updated equalizer settings to amplify corresponding frequency bands of the audio signal. 6. The method of claim 5, wherein the step of using comprises using a fraction of the updated equalizer settings in the decibel domain to amplifying the audio signal. 7. The method of claim 6, wherein a smaller fraction of the updated equalizer settings are used for low and high frequency bands than for midrange frequency bands. 8. The method of claim 1, wherein the method is performed in response to actuation of a button for controlling initiation of automatic equalization. 9. The method of claim 1, wherein the method is performed in response to actuation of a knob for controlling equalization strength. 10. A system for automatically equalizing an audio signal, the system comprising: means (120, 130, 140) for measuring spectral energy values for a plurality of frequency bands; means (150) for determining spreaded energy values for each frequency band based on the measured spectral energy values and a spreading function, the spreading function defining a spread of spectral energy across frequency bands; and means (160, 170) for updating equalizer settings for each frequency band based on the measured spectral energy values and the spreaded spectral energy values. 11. The system of claim 10, wherein the means (120, 130, 140) for measuring comprises means (120) for measuring frequency components for at least a portion of the audio signal; and means (130) for equalizing the frequency components using the equalizer setting associated with the corresponding frequency band. 12. The system of claim 11, further comprising means (120) for prefiltering the measured frequency components with an inverse of an equal loudness curve prior to equalizing each frequency component. 13. The system of claim 10, wherein the means (160, 170) for updating comprises means (160) for determining the equalizer settings based on a ratio of the measured spectral energy values and the spreaded spectral energy values. 14. The system of claim 10, further comprising means (180) for using the updated equalizer settings to amplify corresponding frequency bands of the audio signal. 15. The system of claim 14, wherein the means (180) for using comprises means (180) for using a fraction of the updated equalizer settings in the decibel domain to amplify the audio signal. 16. The system of claim 15, wherein a smaller fraction of the updated equalizer settings are used for low and high frequency bands than for midrange frequency bands. 17. The system of claim 10, further comprising a button for initiating automatic equalization. 18. The system of claim 10, further comprising a knob for controlling equalization strength. 19. A method for automatically equalizing an audio signal, the method comprising: measuring spectral energy values for a plurality of frequency bands; comparing the measured energy values with a predetermined spectral energy distribution; and updating equalizer settings for each frequency band based on the comparison between the measured energy values and the predetermined spectral energy distribution. 20. The method of claim 19, wherein the step of measuring comprises: measuring frequency components for at least a portion of the audio signal; and equalizing each frequency component using the equalizer setting associated with the corresponding frequency band.
PCT/IB2005/050012 2004-01-06 2005-01-04 Systems and methods for automatically equalizing audio signals Ceased WO2005069275A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP05702547A EP1704559A1 (en) 2004-01-06 2005-01-04 Systems and methods for automatically equalizing audio signals

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US53462704P 2004-01-06 2004-01-06
US60/534,627 2004-01-06

Publications (1)

Publication Number Publication Date
WO2005069275A1 true WO2005069275A1 (en) 2005-07-28

Family

ID=34794300

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2005/050012 Ceased WO2005069275A1 (en) 2004-01-06 2005-01-04 Systems and methods for automatically equalizing audio signals

Country Status (2)

Country Link
EP (1) EP1704559A1 (en)
WO (1) WO2005069275A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8538749B2 (en) 2008-07-18 2013-09-17 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for enhanced intelligibility
US8831936B2 (en) 2008-05-29 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US9053697B2 (en) 2010-06-01 2015-06-09 Qualcomm Incorporated Systems, methods, devices, apparatus, and computer program products for audio equalization
US9202456B2 (en) 2009-04-23 2015-12-01 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation
US10121491B2 (en) 2014-12-22 2018-11-06 Nokia Technologies Oy Intelligent volume control interface

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0554962A1 (en) * 1992-02-07 1993-08-11 Novanex Automation N.V. Tone control circuitry having a frequency characteristic dependent on the input signal level
DE19832472A1 (en) * 1998-07-20 2000-01-27 Bosch Gmbh Robert Device and method for influencing an audio signal according to ambient noise amplifies or dampens audio signal frequencies selectively by mixing signals to suit ambient noise.
EP1170727A2 (en) * 2000-07-05 2002-01-09 Nec Corporation Audio encoder using psychoacoustic bit allocation
US20020196951A1 (en) * 2001-06-26 2002-12-26 Kuo-Liang Tsai System for automatically performing a frequency response equalization tuning on speaker of electronic device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0554962A1 (en) * 1992-02-07 1993-08-11 Novanex Automation N.V. Tone control circuitry having a frequency characteristic dependent on the input signal level
DE19832472A1 (en) * 1998-07-20 2000-01-27 Bosch Gmbh Robert Device and method for influencing an audio signal according to ambient noise amplifies or dampens audio signal frequencies selectively by mixing signals to suit ambient noise.
EP1170727A2 (en) * 2000-07-05 2002-01-09 Nec Corporation Audio encoder using psychoacoustic bit allocation
US20020196951A1 (en) * 2001-06-26 2002-12-26 Kuo-Liang Tsai System for automatically performing a frequency response equalization tuning on speaker of electronic device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PAR VAN DE S ET AL: "A new psychoacoustical masking model for audio coding applications", 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS. (ICASSP). ORLANDO, FL, MAY 13 - 17, 2002, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), NEW YORK, NY : IEEE, US, vol. VOL. 3 OF 4, 13 May 2002 (2002-05-13), pages II - 1805, XP002256785, ISBN: 0-7803-7402-9 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8831936B2 (en) 2008-05-29 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US8538749B2 (en) 2008-07-18 2013-09-17 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for enhanced intelligibility
US9202456B2 (en) 2009-04-23 2015-12-01 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation
US9053697B2 (en) 2010-06-01 2015-06-09 Qualcomm Incorporated Systems, methods, devices, apparatus, and computer program products for audio equalization
US10121491B2 (en) 2014-12-22 2018-11-06 Nokia Technologies Oy Intelligent volume control interface

Also Published As

Publication number Publication date
EP1704559A1 (en) 2006-09-27

Similar Documents

Publication Publication Date Title
US11296668B2 (en) Methods and apparatus for adjusting a level of an audio signal
US8199933B2 (en) Calculating and adjusting the perceived loudness and/or the perceived spectral balance of an audio signal
EP2002429B1 (en) Controlling a perceived loudness characteristic of an audio signal
EP1704559A1 (en) Systems and methods for automatically equalizing audio signals

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2005702547

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

WWP Wipo information: published in national office

Ref document number: 2005702547

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2005702547

Country of ref document: EP