CN1121684C - System for adaptively filtering audio signals to enhance speech intelligibility in noisy environmental conditions - Google Patents
System for adaptively filtering audio signals to enhance speech intelligibility in noisy environmental conditions Download PDFInfo
- Publication number
- CN1121684C CN1121684C CN96198008A CN96198008A CN1121684C CN 1121684 C CN1121684 C CN 1121684C CN 96198008 A CN96198008 A CN 96198008A CN 96198008 A CN96198008 A CN 96198008A CN 1121684 C CN1121684 C CN 1121684C
- Authority
- CN
- China
- Prior art keywords
- noise
- speech
- filter
- frame
- filter circuit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02168—Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
- G10L2025/786—Adaptive threshold
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Noise Elimination (AREA)
- Filters That Use Time-Delay Elements (AREA)
- Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
- Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
Abstract
Description
技术领域Technical field
该发明涉及噪声削减系统,特别涉及一种用于便携数字无线电话的自适应语音清晰度增强系统以及用于选择性地改变一帧数字信号的方法和装置。The invention relates to noise reduction systems, and more particularly to an adaptive speech intelligibility enhancement system for portable digital radiotelephones and a method and apparatus for selectively altering a frame of digital signals.
背景技术 Background technique
在美国以及世界其他地区的商业运做中,蜂窝电话工业已经取得了显著的进展。在主要都市地区,对蜂窝服务的需求正在超过现有系统的容量。假设这种趋势继续下去,蜂窝无线电通讯将触及甚至最小的农村市场。因此,以合理花费保持高质量服务的同时蜂窝容量必须加大。朝向提高容量的重量一步是蜂窝系统从模拟传送向数字传送的转换。这种转换也是重要的,这是因为第一代个人通讯网(PCNs)将可能由使用下一代数字蜂窝基础结构的蜂窝载体来提供,其中,第一代个人通讯网采用了便于携带及便于在家中、办公室、街上、汽车里等地打电话或接电话的低花费、衣袋大小的无绳电话。The cellular telephone industry has made remarkable strides in commercial operations in the United States and other parts of the world. In major metropolitan areas, the demand for cellular service is exceeding the capacity of existing systems. Assuming this trend continues, cellular radio communications will reach even the smallest rural markets. Therefore, cellular capacity must be increased while maintaining a high quality of service at a reasonable cost. A major step towards increasing capacity is the conversion of cellular systems from analog to digital transmission. This transition is also important because first-generation Personal Communications Networks (PCNs) will likely be provided by cellular carriers using next-generation digital cellular Low-cost, pocket-sized cordless phone for making and receiving calls in the office, on the street, in the car, and more.
数字通讯系统利用了强有力的数字信号处理技术。数字信号处理通常指的是数字化信号的数学或其他方面的处理。例如,将模拟信号转换(数字化)为数字形式之后,可能利用数字信号处理器(DSP)中的简单数学例程来滤波、放大并衰减该数字信号。数字信号处理器一般被制造为高速集成电路,使得数据处理操作基本上实时进行。数字信号处理器也可以被用来降低数字化语音的比特传输率,其结果表现为发送无线电信号的谱占有率降低以及系统容量扩大。例如如果使用14比特线形脉冲码调制(PCM)来数字化语音信号,并以8KHZ的采样率采样,会产生112K比特/秒的串行比特率。此外,通过从数学上利用冗余的特性及其它人类语音的可预测特性,声音编码技术可以用来将112K比特/秒的串行比特率压缩到7.95K比特/秒以获得14∶1的比特传输率削减。传输率削减意味着更多可得的带宽。Digital communication systems utilize powerful digital signal processing techniques. Digital signal processing generally refers to the mathematical or other manipulation of digitized signals. For example, after converting (digitizing) an analog signal to digital form, the digital signal may be filtered, amplified, and attenuated using simple mathematical routines in a digital signal processor (DSP). Digital signal processors are typically fabricated as high-speed integrated circuits such that data processing operations are performed substantially in real time. Digital signal processors can also be used to reduce the bit rate of digitized speech. The result is a reduction in the spectrum occupancy of the transmitted radio signal and an increase in system capacity. For example, if 14-bit linear pulse code modulation (PCM) is used to digitize the voice signal and sampled at a sampling rate of 8KHZ, a serial bit rate of 112K bits/second will be generated. In addition, by mathematically exploiting the properties of redundancy and other predictable properties of human speech, vocoder techniques can be used to compress the serial bit rate of 112Kbit/s to 7.95Kbit/s to obtain a 14:1 bit Transmission rate reduction. Reduced transfer rates mean more available bandwidth.
在美国,被TIA采用用做第二代蜂窝电话系统(i.e.IS-54)的数字化标准的一种流行的语音压缩技术是矢量源码书激励的线形预测编码(VSELP)。不幸的是,当包括语音且混有高电平环境噪声(尤其是″有色噪声″)的音频信号被利用VSELP编码/压缩时,结果里可能包括部分不希望的音频信号特性。例如,如果在噪声环境(例如,其中移动车辆的车内)中使用数字移动电话,环境噪声和希望的语音都被使用VSELP编码算法压缩,并被发送到基站,在基站中压缩后的信号被解码并被重构为可听的语音。当背景噪声被重构为模拟形式时,噪声的不被希望的可听失真,以及偶然发生于语音中的这些情况被引入。这种失真对于一般听众来说是非常讨厌的。In the United States, a popular speech compression technique adopted by TIA as the digitization standard for the second generation cellular telephone system (i.e. IS-54) is Vector Source Book Excited Linear Predictive Coding (VSELP). Unfortunately, when an audio signal comprising speech mixed with high levels of ambient noise (especially "colored noise") is encoded/compressed using VSELP, the result may include some undesired characteristics of the audio signal. For example, if a digital mobile phone is used in a noisy environment (e.g., inside a vehicle in which it is moving), both the ambient noise and the desired speech are compressed using the VSELP coding algorithm and sent to a base station where the compressed signal is decoded and reconstructed into audible speech. Undesirable audible distortions of the noise, as well as those that happen to occur in speech, are introduced when the background noise is reconstructed into analog form. This distortion is very annoying to the average listener.
这种失真大部分由使用移动电话的环境造成。移动电话一般用于车辆内部,在那里常常有汽车引擎产生的环境噪声及周围车流造成的噪声。这种车辆内部的环境噪声通常集中在低音频范围,并且噪声幅度会由于车辆的速度和加速以及周围车流量大小程度这样的因素而变化。这种低频噪声还具有这样的趋势:严重降低来自汽车中讲话人的语音清晰度。在采用VSELP声音编码器的通讯系统中,这种由低频噪声造成的语音清晰度的降低可能会尤其显著,但是这种现象也可能发生于不包括VSELP声音编码器的通讯系统中。Much of this distortion is caused by the environment in which the mobile phone is used. Mobile phones are typically used inside vehicles, where there is often ambient noise from the car engine and noise from surrounding traffic. Such ambient noise inside a vehicle is typically concentrated in the low frequency range, and the noise amplitude can vary due to factors such as the speed and acceleration of the vehicle and the amount of surrounding traffic. This low frequency noise also has a tendency to severely degrade speech intelligibility from people speaking in the car. This reduction in speech intelligibility due to low-frequency noise may be particularly pronounced in communication systems employing VSELP vocoders, but it may also occur in communication systems that do not include VSELP vocoders.
环境噪声对移动电话的影响也可能会因使用移动电话的方式而起作用。尤其是移动电话可以用于免提方式,即电话使用者对着放在托架内的移动电话讲话。这使得移动电话使用者的手可以腾出来驾驶,但也增加了使用者讲出的话到达移动电话麦克风输入端之前必须走过的距离。这种使用者与移动电话之间加大了的距离,加上变化的环境噪声,会导致噪声成为输入到移动电话的音频信号总功率谱能量的一个显著部分。The effect of ambient noise on a mobile phone may also play a role in how the mobile phone is used. In particular the mobile phone can be used in hands-free mode, ie the user of the phone speaks into the mobile phone placed in the cradle. This frees the mobile phone user's hands for driving, but also increases the distance that the user's spoken words must travel before reaching the mobile phone's microphone input. This increased distance between the user and the mobile phone, combined with varying ambient noise, can cause noise to be a significant portion of the total power spectral energy of the audio signal input to the mobile phone.
现有技术的描述包括在EP 0 645 756,EP 0 558 312,EP 0 665530,DE 4 012 349,美国专利No.4,811,404,4,461,025,及5,251,263中,所有这些都描述了滤除不希望信号成分的方式。Descriptions of the prior art include EP 0 645 756, EP 0 558 312, EP 0 665530, DE 4 012 349, U.S. Patent Nos. 4,811,404, 4,461,025, and 5,251,263, all of which describe methods for filtering out unwanted signal components Way.
理论上来说,可利用数字信号处理器来实现各种数字信号处理算法以滤除VSELP编码的背景噪声。然而,这些解决方法常常需要以每秒执行几百万条指令(MIPS)来计算的很大的数字信号处理开销,这耗费了可贵的处理时间,内存空间及功率耗费。然而,在便携无线电话中,这些信号处理资源中的每一个都是有限的。因此,对于最小化VSLEP编码的背景噪声以及其他类型的背景噪声,简单地增加DSP的处理负担并不是一个最佳解决方案。Theoretically, a digital signal processor can be used to implement various digital signal processing algorithms to filter out the background noise of the VSELP code. However, these solutions often require significant digital signal processing overhead measured in millions of instructions per second (MIPS), which consumes valuable processing time, memory space, and power consumption. However, in portable radiotelephones, each of these signal processing resources is limited. Therefore, simply increasing the processing burden on the DSP is not an optimal solution for minimizing the background noise of VSLEP encoding as well as other types of background noise.
发明概要Summary of the invention
该发明给出了一种自适应降噪系统,该系统在使对编码语音质量的任何负面影响最小化及对数字信号处理资源的任何增加消耗最小化的同时,削减了不希望的编码背景噪声作用。该发明的方法和系统增加了数字化音频信号中语音的清晰度,其方法是将数字化音频信号帧通过一个滤波器电路。该滤波器电路起可调节高通滤波器的作用,它滤除一部分低音频范围的数字化信号,通过落在高频范围内的数字化信号部分。因为车辆内的噪声趋于集中在低音频范围,并且只有一小部分语音清晰值落在该低频范围,因此滤波电路在只滤除掉语音的不重要部分的同时,滤除了数字化音频信号中的大部分噪声。这使得与除去的语音能量部分相比,相对更大一部分的噪声能量被除去。通过自适应地调整并选择滤波器电路的频率响应曲线,滤除掉的语音量被限制,并对无线电输出的语音清晰度具有最小的影响。This invention presents an adaptive noise reduction system that cuts unwanted coding background noise while minimizing any negative impact on the quality of coded speech and any increased consumption of digital signal processing resources effect. The inventive method and system increase the intelligibility of speech in a digitized audio signal by passing frames of the digitized audio signal through a filter circuit. The filter circuit acts as an adjustable high-pass filter that rejects a portion of the digitized signal in the low frequency range and passes a portion of the digitized signal that falls in the high frequency range. Because the noise in the vehicle tends to be concentrated in the low frequency range, and only a small part of the speech intelligibility value falls in this low frequency range, the filter circuit filters out the noise in the digitized audio signal while only filtering out the unimportant part of the speech. most of the noise. This results in a relatively larger fraction of noise energy being removed than the fraction of speech energy being removed. By adaptively adjusting and selecting the frequency response curve of the filter circuit, the amount of speech filtered out is limited with minimal impact on speech intelligibility at the radio output.
滤波器控制电路被用来调整滤波器电路,使之以某一噪声估计值和/或谱包络函数的形式来显示不同的频率响应曲线,其中的噪声估计值和/或谱包络对应于音频信号中的噪声。噪声估计值和/或谱包络在逐帧的基础上针对数字信号做调整并以语音检测函数的形式来调整。如果没有检测到语音,那麽为当前帧修正噪声估计值和/或谱包络。如果检测到语音,就不调整噪声估计值和/或谱包络。The filter control circuit is used to adjust the filter circuit to exhibit different frequency response curves in the form of a noise estimate and/or spectral envelope function corresponding to Noise in an audio signal. The noise estimate and/or spectral envelope is adjusted to the digital signal on a frame-by-frame basis and in the form of a speech detection function. If no speech is detected, the noise estimate and/or spectral envelope are revised for the current frame. If speech is detected, the noise estimate and/or spectral envelope are not adjusted.
在第一实施方案中,滤波器电路针对数字化的音频信号帧计算噪声估计值。该噪声估计值对应于数字化音频信号帧中的背景噪声量。当语音低频范围中的背景噪声对语音的相对量增加时,噪声估计值增加。当语音低频范围中的背景噪声对语音的相对量增加时,滤波器控制电路使用噪声估计值来调整滤波器电路以滤除更大部分的低频范围语音。当不存在背景噪声时,没有语音信号被滤除。当存在更高的噪声电平时,更大部分的噪声和语音信息被抽取。因为噪声趋向于集中在低频范围内并且只有相对较小部分语音清晰值落在该低频范围内,当噪声估计值增加时,通过加大正在被滤除的低频能量部分,音频信号的整体清晰度可以被提高。In a first embodiment, a filter circuit calculates a noise estimate for a frame of a digitized audio signal. The noise estimate corresponds to the amount of background noise in the frame of the digitized audio signal. The noise estimate increases as the relative amount of background noise in the low frequency range of speech to speech increases. As the relative amount of background noise to speech in the low frequency range of speech increases, the filter control circuit uses the noise estimate to adjust the filter circuit to filter out a greater portion of the low frequency range speech. When no background noise is present, no speech signal is filtered out. When a higher noise level is present, a larger portion of noise and speech information is extracted. Because noise tends to be concentrated in the low frequency range and only a relatively small portion of the speech intelligibility value falls within this low frequency range, as the noise estimate increases, the overall intelligibility of the audio signal is improved by increasing the portion of low frequency energy that is being filtered out can be improved.
在第二实施方案中,一个修改后的滤波器控制电路被用来调整滤波器电路,使之以某一噪声包络函数的形式来显示出不同的频率响应曲线,其中的噪声包络为音频信号中选出频率范围上噪声估计值的噪声包络。该滤波控制电路包括一个谱分析器,该分析器以检测语音函数的形式来确定一个噪声包络估计值。为当前帧确定一个噪声包络估计值并将该估计值与参考噪声包络相比较。基于该比较,滤波器电路被自适应地调整用来从当前帧中提取不同数量的低频能量。In a second embodiment, a modified filter control circuit is used to adjust the filter circuit to exhibit different frequency response curves as a function of a noise envelope, where the noise envelope is the audio The noise envelope of the noise estimate over the selected frequency range in the signal. The filter control circuit includes a spectral analyzer that determines a noise profile estimate as a function of the detected speech. An estimate of the noise profile is determined for the current frame and compared to the reference noise profile. Based on this comparison, the filter circuit is adaptively adjusted to extract different amounts of low frequency energy from the current frame.
根据该发明的自适应削减系统可以被很好地应用于无线电通讯系统,在该无线电通讯系统中,便携/移动无线电收发器之间及无线电收发器与固定电话线用户之间通过RF信道通讯。每一个收发器包括一个天线,一个用于将通过天线在RF信道上接收到的无线电信号转换成模拟音频信号的接收器,及一个发送器。发送器包括一个编-解码器(codec)用来把将要被发送的模拟音频信号数字化为数字化语音信息帧,该语音信息既包括语音也包括背景噪声。数字信号处理器在背景噪声估计值及当前帧中语音检测的基础上处理当前帧来最小化背景噪声。调制器将处理后的数字化语音信息帧调制到RF载波上用于后续通过天线的发送。The adaptive clipping system according to the invention can be advantageously applied to radio communication systems in which portable/mobile radio transceivers communicate with each other and between radio transceivers and fixed telephone line subscribers via RF channels. Each transceiver includes an antenna, a receiver for converting radio signals received on an RF channel through the antenna into analog audio signals, and a transmitter. The transmitter includes a coder-decoder (codec) for digitizing the analog audio signal to be transmitted into frames of digitized speech information, the speech information including both speech and background noise. The digital signal processor processes the current frame to minimize the background noise based on the background noise estimate and speech detection in the current frame. The modulator modulates the processed digitized speech information frame onto an RF carrier for subsequent transmission through the antenna.
附图简要描述Brief description of the drawings
根据下面书写的描述并结合附图,对于该技术领域的普通技术人员来说,该发明的所有特征及优点将很容易明白。All features and advantages of this invention will become readily apparent to those of ordinary skill in the art from the following written description when taken in conjunction with the accompanying drawings.
图1是该发明的一个通用功能方框图。Figure 1 is a general functional block diagram of the invention.
图2图解说明了用于蜂窝无线电通讯的美国数字标准IS-54的帧和位置结构;Figure 2 illustrates the frame and position structure of the US Digital Standard IS-54 for cellular radio communications;
图3是利用数字信号处理器实现的该发明第一优选实施方案的方框图;Figure 3 is a block diagram of a first preferred embodiment of the invention implemented using a digital signal processor;
图4是该发明的一个示范实施方案的功能方框图,该实施方案应用于无线电通讯系统中多个便携无线电收发器中一个。Fig. 4 is a functional block diagram of an exemplary embodiment of the invention as applied to one of a plurality of portable radio transceivers in a radio communication system.
图5A和5B为一个流程图,它图解说明了在实现该发明第一优选实施方案过程中,数字信号处理器执行的功能/操作。5A and 5B are a flowchart illustrating the functions/operations performed by the digital signal processor in implementing the first preferred embodiment of the invention.
图6A是图解说明根据该发明第一优选实施方案的滤波电路的衰减-频率特性的第一示例图。Fig. 6A is a first exemplary diagram illustrating the attenuation-frequency characteristic of the filter circuit according to the first preferred embodiment of the present invention.
图6B是图解说明根据该发明第一优选实施方案的滤波电路的衰减-频率特性的第二示例图。Fig. 6B is a second exemplary diagram illustrating the attenuation-frequency characteristic of the filter circuit according to the first preferred embodiment of the present invention.
图7是可被该发明第一优选实施方案中滤波控制器电路存取的一个示例查询表。Figure 7 is an exemplary look-up table accessible by the filter controller circuit of the first preferred embodiment of the invention.
图8A和8B图解说明了示例输入音频信号的幅度-频率特性。8A and 8B illustrate amplitude-frequency characteristics of an example input audio signal.
图9A和9B分别图解说明了图8A和8B中输入音频信号被该发明的滤波器电路滤波之后的幅度-频率特性;Figures 9A and 9B illustrate the amplitude-frequency characteristics of the input audio signal in Figures 8A and 8B after being filtered by the inventive filter circuit, respectively;
图10是利用数字信号处理器实现的该发明第二优选实施方案的方框图;Figure 10 is a block diagram of a second preferred embodiment of the invention implemented using a digital signal processor;
图11为一个流程图,对应于图5B的流程图,它图解说明了在实现该发明第二优选实施方案过程中数字信号处理器执行的功能/操作。Fig. 11 is a flowchart, corresponding to the flowchart of Fig. 5B, which illustrates the functions/operations performed by the digital signal processor in implementing the second preferred embodiment of the invention.
图12是可被该发明第二优选实施方案中滤波器控制电路存取的一个示例查询表。Figure 12 is an exemplary look-up table accessible by the filter control circuit in the second preferred embodiment of the invention.
附图详细描述Detailed description of the drawings
在下面的描述中,出于解释而不是限制的目的,为了给出对该发明的全面理解,具体的细节如特殊电路、电路元件、技术、流程图等等被陈述。然而,该技术领域的技术人员会明白,该发明可以实践于偏离这些具体细节的其他实施方案中。在其他例子中,众所周知的方法、设备、和电路的详细描述都被略去以便不会以不必要的细节而模糊了对该发明的描述。In the following description, for purposes of explanation rather than limitation, specific details are set forth such as particular circuits, circuit elements, techniques, flowcharts, etc., in order to give a thorough understanding of the invention. It will be apparent, however, to those skilled in the art that the invention may be practiced in other embodiments that depart from these specific details. In other instances, detailed descriptions of well-known methods, devices, and circuits are omitted so as not to obscure the description of the invention with unnecessary detail.
图1是根据该发明的自适应降噪系统100的通用方框图。自适应降噪系统100包括一个连接到滤波器电路115的滤波器控制电路105。滤波器控制电路105为数字化音频信号的当前帧产生一个滤波器控制信号。该滤波器控制信号被输出到滤波器电路115,滤波器电路115根据滤波器控制信号而调节以显示出一个高通频率响应曲线,该曲线是在滤波器控制信号的基础上选择出的。调节后的滤波电路115滤波数字化音频信号的当前帧。滤波信号被声音编码器120处理来产生表示数字化音频信号的编码信号。FIG. 1 is a generalized block diagram of an adaptive
在蜂窝无线电通讯系统中应用于便携/移动无线电话收发器的该发明的一个示范方案中,图2图解说明了IS-54标准为数字蜂窝无线电通讯采用的时分多址存取(TDMA)的帧结构。一″帧″是一个20毫秒的时间段,它包括一个发送块TX,一个接收块RX及一个信号强度测量块用于移动台辅助切换(MAHO)。图2中表示出的两个连续帧在40毫秒的时间段内被发送。数字化的语音和背景噪声信息如同下面进一步描述的,在逐帧的基础上被处理和滤波。In an exemplary implementation of the invention as applied to a portable/mobile radiotelephone transceiver in a cellular radio communication system, Figure 2 illustrates the Time Division Multiple Access (TDMA) frame adopted by the IS-54 standard for digital cellular radio communication structure. A "frame" is a 20 millisecond time period that includes a transmit block TX, a receive block RX and a signal strength measurement block for Mobile Assisted Handover (MAHO). The two consecutive frames shown in Figure 2 are sent over a period of 40 milliseconds. The digitized speech and background noise information is processed and filtered on a frame-by-frame basis as described further below.
最好的是,图1中的滤波器控制电路105,滤波器电路115及声音编码器120的功能用一个高速数字信号处理器来实现。一种适当的数字信号处理器是可以从TI公司获得的TMS320C53 DSP。该TMS320C53 DSP在一个单集成片上包括一个16比特微处理器,用于存储如将要被处理的语音帧这样的数据的片上RAM,用于存储各种数据处理算法的ROM,其中的算法包括VSELP语音压缩算法,以及下面将要描述的用来完成由滤波器控制电路105和滤波器电路115执行的功能块的其它算法。Preferably, the functions of the
该发明的第一实施方案在图3中被表示出。在第一实施方案中,滤波器电路115以背景噪声估计值函数的形式来调整。该背景噪声估计值由滤波器控制电路来确定。脉冲码调制的音频信息帧被顺序存储在DSP的片上RAM中。可以使用其他的数字化技术来数字化音频信息。每一个PCM数字帧从DSP片上RAM中被取出并被帧能量估计器210处理,然后暂存在临时帧存储器220中。由帧能量估计器210确定的当前帧的能量被提供给噪声估计器230及语音检测器240功能块。当帧能量估计值超过前一噪声估计值与一个语音阈值的和时,语音检测器240表示语音存在于当前帧中。如果语音检测器确定没有语音存在,那么数字信号处理器200以当前噪声估计值和当前帧能量函数的形式来计算一个修正的噪声估计值。A first embodiment of the invention is represented in FIG. 3 . In a first embodiment, the
修正后的噪声估计值被输出到滤波器选择器235。滤波器选择器235基于噪声估计值产生一个滤波器控制信号。在优选实施方案中,滤波器选择器235在产生滤波器控制信号过程中读取查询表。查询表包括一系列的滤波器控制值,每一个控制值都与一个噪声估计值或噪声估计值的范围相匹配。在修正后噪声估计值的基础上,查询表中的滤波器控制值被选择出,该滤波器控制值由一个滤波器控制信号表示,该控制信号为滤波器电路115而输出到滤波器组265。为了稳定该过程,并避免在不同滤波器之间的连续切换,为新滤波器的选择设置了一个N帧的转换时间。新滤波器只能每N帧选择一次,其中N是一个大于1,并最好大于10的整数。The corrected noise estimate is output to the
滤波器电路115根据滤波器控制信号而调整以显示出对应于输入滤波器控制信号和噪声估计值的高通频率响应曲线。现有技术中众所周知的各种不同类型的滤波器电路可以被用来根据滤波器控制信号显示选中的频率响应曲线。这些现有技术的滤波器包括IIR滤波器,如巴特沃斯,契比雪夫或椭圆滤波器,由于较低的处理要求,也可以使用FIR滤波器,但优选IIR滤波器。The
滤波后的信号被声音编码器120处理,120被用来压缩滤波后信号的比特率。在优选实施方案中,声音编码器120使用矢量源码书激励线性预测编码(VSELP)技术来编码音频信号。其他的声音编码技术和算法也可以被使用,例如码激励线性预测(CELP)编码,残留脉冲激励线性预测(RPE-LTP)编码,改进的多带激励(IMBE)编码。通过在声音编码之前根据该发明滤波音频信号帧,背景噪声被最小化,该处理基本上削减了当语音重构时语音中的任何不希望的噪声影响。它也防止了语音被″淹没″在低频噪声中。The filtered signal is processed by a
结合图3描述的数字信号处理器200可以被用于例如无线电通讯系统中使用的数字便携/移动无线电话的收发器这样的装置中。图4图解说明了一个这样的数字无线电收发器,它可以用于蜂窝无线电通讯网络中。The
包括语音和背景噪声的音频信号从麦克风400输入到编-解码器402,402最好是一个专用集成电路(ASIC)。在麦克风400处检测到的带限音频信号被编解码器402以每秒8000个样本的采样率采样并被分块成帧。根据上述,每个20毫秒帧包括160个语音样本。这些样本被量化并被转换成例如14比特线性PCM这样的编码数字格式。一旦当前帧的数字化语音的160个样本被存储到发送DSP200中的片上RAM中,发送DSP200就象上面结合图3描述的一样,根据VSELP算法,执行信道编码功能,帧能量估计,噪声估计,语音检测,FFT,滤波功能和数字语音编码/压缩。An audio signal including speech and background noise is input from a microphone 400 to a codec 402, which is preferably an application specific integrated circuit (ASIC). The band-limited audio signal detected at microphone 400 is sampled by codec 402 at a sampling rate of 8000 samples per second and blocked into frames. According to the above, each 20 millisecond frame includes 160 speech samples. These samples are quantized and converted into a coded digital format such as 14-bit linear PCM. Once the 160 samples of the digitized speech of the current frame are stored in the on-chip RAM in the transmitting
监测微处理器432控制着图4中表示出的收发器中所有元件的整个操作。由发送DSP200产生的滤波后的PCM数据流被提供用来正交调制和发送。到此为止,基于得自DSP 200的滤波后PCM数据流,ASIC门阵列404产生同相(I)信息信道和正交(Q)信息信道。I和Q比特流被相匹配的低通滤波器406和408处理并被传送到平衡调制器410中的IQ混合器中。参考震荡器412和乘法器414给出一个发送中间频率(IF)。I信号与同相IF混合,Q信号与正交IF混合(即同相IF被相移器416滞后90度)。混合的I和Q信号被相加,并被“上”转换到由信道合成器430选出的RF信道频率上,然后通过双工机420和天线422在选出的无线电频率信道上发送。Supervisory microprocessor 432 controls the overall operation of all components in the transceiver shown in FIG. The filtered PCM data stream generated by transmit
在接收边,通过天线422和双工机420接收的信号被从混合器424中的选中接收信道频率上向下转换为第一IF频率,其中的第一IF频率使用了由信道合成器430在参考震荡器428的输出基础上合成的本地震荡器信号。第一IF混合器424的输出被滤波并且其频率被向下转换为第二IF,该转换在信道合成器430和解调器426的另一个输出的基础上进行。然后,接收门阵列434将第二IF信号转换成一系列的相位样本和一系列的频率样本。接收DSP436对接收到的信号执行解调、滤波、增益/衰减、信道解码和语音扩张。然后,处理后的语音数据被送到编解码器402并被转换为基带音频信号用于驱动扬声器438。On the receive side, the signal received via antenna 422 and duplexer 420 is down-converted from the selected receive channel frequency in mixer 424 to a first IF frequency using The output of reference oscillator 428 is based on the synthesized local oscillator signal. The output of the first IF mixer 424 is filtered and its frequency down-converted to the second IF, which conversion is performed on the basis of the other output of the channel combiner 430 and demodulator 426 . Receive gate array 434 then converts the second IF signal into a series of phase samples and a series of frequency samples. Receive DSP 436 performs demodulation, filtering, gain/attenuation, channel decoding and speech expansion on the received signal. The processed speech data is then sent to the codec 402 and converted to a baseband audio signal for driving the speaker 438 .
现在将结合图5A、5B中的流程图描述数字信号处理器200为实现滤波器控制电路105,滤波器电路115和声音编码器120的功能而执行的操作。帧能量估计器210确定每一帧音频信号的能量。通过计算一帧中每一个PCM样本平方值的和(步骤505),帧能量估计器210确定当前帧的能量。因为对于每秒8000样本的采样率来说,每一个20毫秒长的帧有160个样本,那麽就有160个PCM样本平方被相加。按照数学方式表达,帧能量估计根据下面等式1来确定:等式1The operations performed by the
为当前帧计算的帧能量值被存储在DSP200的片上RAM202中(步骤510)。The frame energy value calculated for the current frame is stored in on-chip RAM 202 of DSP 200 (step 510).
语音检测器240的功能包括从DSP200的片上RAM202中取出一个由噪声估计器230先前确定的噪声估计值(步骤515)。当然,当收发器最初上电时,没有噪声估计值存在。判断框520预计到这种情况并在步骤525给出一个噪声估计值。为了象下面将要描述的,迫使对噪声估计值的修正,最好安排一个任意高的值作为噪声估计值,例如在正常语音电平之上的20dB。由帧能量估计器210确定的帧能量被从DSP210的片上RAM202中取出(方框530)。在方框535中确定帧能量估计值是否超过了检出的噪声估计值加上一个预定语音阈值的和,如下面等式2表示的:The
帧能量估计值>(噪声估计值+语音阈值) (等式2)Frame energy estimate > (noise estimate + speech threshold) (Equation 2)
语音阈值可以是一个固定值,该固定值由经验确定大于一般背景噪声的短时能量方差,并且可以被设置为例如9dB。另外,语音阈值可以被自适应地修改来反映变化的语音条件,例如,当讲话者进入一个更嘈杂或更安静的环境时的语音条件。如果帧能量估计值超过了等式2中的和,那麽在方框570中设置一个标志位表示语音存在。如果语音检测器240检测到语音存在,那麽噪声估计器230被越过,为以前的数字化音频帧计算的噪声估计值被检索出并被用做当前噪声估计值。相反的,如果帧能量估计小于等式2中的和,在方框540将语音标志清零。The speech threshold may be a fixed value empirically determined to be greater than the short-term energy variance of general background noise, and may be set to, for example, 9dB. Additionally, speech thresholds can be adaptively modified to reflect changing speech conditions, for example, when a speaker enters a noisier or quieter environment. If the frame energy estimate exceeds the sum in
也可以使用其他检测当前帧中语音的系统。例如,欧洲电讯标准机构(ETSI)已经开发了一种用于全球定位系统GSM中声音活动检测(VAD)的标准。并在ETSI参考文献:RE/SMG-020632P中被描述,该文献以参考的方式在此引用。Other systems that detect speech in the current frame can also be used. For example, the European Telecommunications Standards Institute (ETSI) has developed a standard for Voice Activity Detection (VAD) in the Global Positioning System GSM. and is described in ETSI reference: RE/SMG-020632P, which is hereby incorporated by reference.
如果语音不存在,噪声估计器230中的噪声估计值修正例程被执行。在没有语音存在的时间里,噪声估计值实质上是帧能量的一个在线平均。如上面描述的,如果最初的启动噪声估计值被选择的足够高,那麽语音没有被检测到,并且语音标志被因此清零以迫使对噪声估计值的修正。If speech is not present, the noise estimate modification routine in
在由噪声估计器230执行的噪声估计例程中,在方框545中确定了一个差值/误差(Δ),根据下面等式,该误差为帧能量估计器210产生的帧噪声能量与噪声估计器230以前计算的噪声估计值之间的差值:In the noise estimation routine performed by the
Δ=当前帧能量-以前噪声估计(等式3)Δ = current frame energy - previous noise estimate (Equation 3)
判断框550确定是否Δ超过了0。如果Δ是负的,如同高噪声估计值时发生的一样,那么根据下面等式噪声估计值在方框560中被重新计算:
噪声估计=以前噪声估计+Δ/2(等式4)Noise Estimate = Previous Noise Estimate + Δ/2 (Equation 4)
由于Δ是负的,这导致噪声估计值向下校正。这里选择了相对较大的步长Δ/2来快速校正以降低噪声电平。然而,如果帧能量超过了噪声估计值,给出一个大于0的Δ,则在方框555中噪声被根据下面等式修正:Since Δ is negative, this causes the noise estimate to be corrected downward. Here a relatively large step size Δ/2 is chosen for fast correction to reduce the noise level. However, if the frame energy exceeds the noise estimate, giving a Δ greater than 0, then in
噪声估计=以前噪声估计值+Δ/256(等式5)Noise Estimate = Previous Noise Estimate + Δ/256 (Equation 5)
由于Δ是正的,噪声估计值一定会增加。然而,这里选取了一个更小的步长Δ/256(与Δ/2相比)来逐渐加大噪声估计值并对瞬时噪声做出实质性的消除。Since Δ is positive, the noise estimate must increase. However, here a smaller step size Δ/256 (compared to Δ/2) is chosen to gradually increase the noise estimate and substantially eliminate the instantaneous noise.
为当前帧计算的噪声估计值被输出到滤波器选择器235。在第一优选实施方案中,滤波器选择器235读取查询表并利用当前噪声估计值来选取一个滤波器控制值(步骤572)。然后滤波器电路115(步骤574)以选出滤波器控制值函数的形式调整来显示出一个频率响应曲线,该响应曲线意在当噪声估计值和背景噪声增加时加大滤除的噪声量。然后,存储在DSP RAM中的PCM样本通过调整后的滤波器电路265来滤波PCM样本以除去噪声(步骤576)。滤波后的PCM样本之后被声音编码器120处理(步骤578),然后,编码后的样本被输出到RF发送电路(步骤580)。The noise estimate calculated for the current frame is output to the
图6A和6B给出了几个关于滤波器电路如何调整来为输入到滤波器电路115的不同滤波器控制信号显示不同频率响应曲线F1-F4的例子。如图6A所示,滤波器电路115可以被选择来显示一系列不同的频率响应曲线,并且频率响应曲线F1-F4分别具有截止频率F1c-F4c。在优选实施方案中,滤波器电路115的截止频率的范围可以是300HZ到800HZ。当噪声估计值增加时,滤波器电路115被设计为显示具有更高截止频率的频率响应曲线。这种更高的截止频率导致更大部分落在语音低频范围内的帧能量被滤波器电路115抽取。6A and 6B give several examples of how the filter circuit can be adjusted to exhibit different frequency response curves F1 - F4 for different filter control signals input to the
同样的,如图6B所示,滤波器电路115可以被选择来显示一系列的不同频率响应曲线F1-F4,并且每一个频率响应曲线具有不同的坡度和相同的截止频率。频率响应曲线F1-F4的截止频率在上面提及的范围内。当噪声估计值增加时,滤波器电路115被调整来显示具有更陡坡度的频率响应曲线。这种更陡的坡度导致更大部分落在语音更低频率范围内的帧能量被滤波器电路115抽取。Likewise, as shown in FIG. 6B, the
滤波器电路115以某一噪声估计值函数的形式来滤波当前帧,其中的噪声估计值是为当前帧计算的。当前帧被滤波使得噪声被削减而通过了语音的主要部分。未被滤除且通过的语音主要部分给出了可识别的语音输出,语音信号质量只有很小降低。不同截止频率和不同坡度的组合可以被用来自适应地抽取落在语音低频范围内的帧能量的选出部分。
图7描述了由滤波器选择器235读取的一个示例查询表,以便替滤波器电路115从滤波器响应曲线F1-F4中选出一个。该查询表包括一系列可能的噪声估计值N1-Nn及滤波器控制值F1-Fn,这些值对应于滤波器电路115显示的可能响应曲线。噪声估计值N1-Nn中的每一个可以表示一个范围的噪声估计值,并且每一个都与一个特定的滤波器控制值F1-F4相匹配。滤波器控制电路105产生一个滤波器控制信号,其方法是计算一个噪声估计值并从查询表中检出与之相关的滤波器控制值。FIG. 7 depicts an example look-up table read by
图8A&B和9A&B表示了两帧音频信号的每一帧是如何被自适应滤波以给出一个输出到RF发送器的改进音频信号的。图8A和8B表示了分别包括语音分量s1,s2和噪声分量n1,n2的音频信号的一个第一帧和一个第二帧。如图所示,两帧中的噪声能量n1和n2都集中在低音频范围内。而语音能量s1和s2却集中在较高音频范围内。图9A给出了滤波后第一帧的噪声信号n1和语音信号s1。图9B给出了滤波后第二帧的噪声信号n2和语音信号s2。Figures 8A&B and 9A&B illustrate how each of the two frames of the audio signal is adaptively filtered to give an improved audio signal output to the RF transmitter. 8A and 8B show a first frame and a second frame of an audio signal including speech components s1, s2 and noise components n1, n2, respectively. As shown, the noise energies n1 and n2 in both frames are concentrated in the low audio frequency range. The speech energies s1 and s2 are concentrated in the higher audio frequency range. FIG. 9A shows the noise signal n1 and the speech signal s1 of the first frame after filtering. FIG. 9B shows the noise signal n2 and the speech signal s2 of the second frame after filtering.
如同所讨论的,自适应音频降噪系统100被设计用于计算第一帧和第二帧之间噪声电平的差值,其方法是基于当前帧的计算后噪声估计值来调整滤波器控制电路105。例如,滤波器控制电路105计算出噪声估计N1和谱包络s1并且为第一帧选出一个滤波器控制值F1。在优选实施方案中,基于滤波器控制值F1,滤波器电路115被调整并象图6A所示的,显示出一个具有截止频率F1c的频率响应曲线。然后,第一帧通过该调整后的滤波器电路115。滤波器电路115被选中使得大部分的噪声n1和仅一小部分的语音s1落在频率响应曲线F1的截止频率F1c之下。这导致噪声n1被有效的滤除并且只有一部分相对不重要的语音s1被滤除。滤波后的第一帧音频信号表示在图9A中。As discussed, the adaptive audio
在图8B中显示的第二帧中,存在更高的背景噪声,并假设语音没有被检测到,那麽滤波器控制电路105会计算出一个更高的噪声估计值n2。基于该更高噪声估计值,为第二帧确定一个更高的相应滤波器控制值F2。在第一优选实施方案中,根据更高的滤波器控制值F2来调整滤波器电路115以象图6A表示的那样显示一个具有更高截止频率F2c的频率响应曲线。然后,音频信号的后续帧通过了调整后的滤波器电路115。因为对于后续帧来说,频率响应曲线F2的截止频率F2c更高,所以大部分的噪声n2和语音s2都被滤除了。(但是),语音s2被滤除的部分与该帧包括的清晰度信息相比仍然相对不显著,因此这对语音仅有很小的影响。滤除更大部分的语音s2的缺点被第二帧中噪声n2去除量增加的优点所抵消。被滤除掉的语音谱部分并不显著作用于语音的清晰度。第二帧中滤波后的音频信号在图9B中表示出。In the second frame shown in FIG. 8B , there is higher background noise, and assuming speech is not detected, the
图10-12中给出了自适应降噪系统100的第二优选实施方案。在第二优选实施方案中,滤波器控制电路105以噪声包络估计值函数的形式来调整滤波器电路115。噪声包络估计值是针对每一帧计算的并与参考噪声包络估计值比较。基于该比较,滤波器电路115被自适应地调整来从当前帧中提取不同数量的低频能量。A second preferred embodiment of an adaptive
参考图10,给出了按照第二优选实施方案配置的DSP 200。如图所示,除了参考第一优选实施方案描述的帧能量估计器210,噪声估计器230,语音检测器240和滤波器选择器235之外,滤波器控制电路105还包括谱分析器270。如同第一实施方案所描述的和流程图5A和5B所表示的,滤波器控制电路105为接收到的帧确定噪声估计值并检测语音的存在。在为当前帧检测语音时,谱分析器270修正噪声包络估计值,并在调整滤波器电路115中使用该值。Referring to Figure 10, a
参考图11,给出了修正噪声包络估计值和调整滤波器电路115的步骤。图11给出了谱分析器270执行的步骤,这些步骤在以前第一优选实施方案的流程图5A和5B中描述的整个过程中被引用。Referring to FIG. 11 , the steps of modifying the noise profile estimate and adjusting the
如果当前帧中没有检测到语音,谱分析器270首先为当前帧确定一个噪声包络(步骤600)。为当前帧确定的噪声包络中包括不同频率(即频率点)上的能量计算值,这些频率位于为当前帧选出的语音低频范围中。在优选实施方案中,选中的频率范围大约为300到800HZ。当前帧的噪声包络可以通过利用具有N个频率点的快速傅立叶变换(FFT)处理当前帧来确定。利用FFT处理数字信号在现有技术领域内是众所周知的,其优越处在于当FFT局限于相对较少的频率点,例如32点时,它需要很少的处理功率。具有N个频率点的FFT在N个不同的频率处产生能量计算。落在选中的频率范围内的频率点的能量计算值形成了当前帧的噪声包络。If no speech is detected in the current frame, spectrum analyzer 270 first determines a noise profile for the current frame (step 600). The noise profile determined for the current frame includes calculated energy values at different frequencies (ie, frequency bins) that are located in the speech low frequency range selected for the current frame. In a preferred embodiment, the selected frequency range is approximately 300 to 800 Hz. The noise profile of the current frame can be determined by processing the current frame with a Fast Fourier Transform (FFT) with N frequency bins. Using FFT to process digital signals is well known in the art. It has the advantage that when FFT is limited to relatively few frequency points, eg 32 points, it requires little processing power. An FFT with N frequency bins produces energy calculations at N different frequencies. The calculated energies of frequency points falling within the selected frequency range form the noise envelope for the current frame.
为了确定当前帧的噪声包络估计值(步骤604),将当前帧的噪声包络与为音频信号的以前帧而确定的噪声包络估计值做平均。当没有以前的噪声包络估计值可以得到时,例如初始化之后,可以使用存储的初始噪声包络估计值。噪声包络估计值包括位于连续低频(即,在选中的频率范围内,e1是最高频率的噪声能量估计值而en是最低频率的噪声能量估计值)上的噪声能量估计值ei(其中i=1,2…n)。在优选实施方案中,每个噪声能量估计值ei对应于在某一特定频率上能量计算值的平均值,该特定频率是在大量的其中没有语音被检测到的连续帧上的选中频率范围内的一个频率点。通过使用大量的帧来确定噪声包络估计值,滤波器电路115在一个更为渐进的基础上被调整。在另一实施方案中,噪声包络估计值可以等于当前帧的噪声包络。To determine a noise profile estimate for the current frame (step 604), the noise profile for the current frame is averaged with noise profile estimates determined for previous frames of the audio signal. The stored initial noise profile estimate may be used when no previous noise profile estimate is available, eg after initialization. The noise envelope estimate consists of noise energy estimates e i ( where i=1, 2...n). In a preferred embodiment, each noise energy estimate ei corresponds to the average value of the energy calculations at a specific frequency, which is a selected frequency range over a large number of consecutive frames in which no speech is detected A frequency point within . By using a large number of frames to determine the noise profile estimate, the
然后,噪声包络估计值的能量估计值ei与参考噪声包络相比(步骤604)。参考噪声包络包括参考能量阈值eri(其中i=1,2…n),这些阈值位于对应于噪声包络估计值的噪声能量估计值ei的频率点上。参考能量阈值eri可以按经验确定。按照从最高频率能量估计值e1到最低频率能量估计值en的顺序,噪声能量估计值ei被连续与对应的参考能量阈值eri相比较。Then, the energy estimate ei of the noise profile estimate is compared with the reference noise profile (step 604). The reference noise profile includes reference energy thresholds e ri (where i=1, 2...n) at frequency bins corresponding to noise energy estimates ei of the noise profile estimates. The reference energy threshold e ri can be empirically determined. In order from the highest frequency energy estimate e 1 to the lowest frequency energy estimate e n , the noise energy estimates e i are successively compared with corresponding reference energy thresholds e ri .
更具体的,噪声能量估计e1首先与参考噪声阈值er1相比较。如果e1大于参考噪声阈值er1,那麽比较值C1被选中并输入到滤波器选择器235。如果噪声估计值e1小于参考噪声阈值erl,那麽噪声能量估计值e2(该值为在低于e1的频率处得到的噪声能量估计值)与参考阈值er2相比较。如果噪声能量估计值e2大于参考噪声阈值er2,那麽比较值C2被选中并输入到滤波器选择器235。到比较值Ci(其中i=1,2…n)被选中为止,比较过程一直继续。More specifically, the noise energy estimate e 1 is first compared with a reference noise threshold e r1 . If e 1 is greater than the reference noise threshold e r1 , the comparison value C1 is selected and input to the
滤波器电路235使用确定的比较值Ci来确定一个滤波器控制值。该滤波器控制值从例如图12中给出的查询表中选择。查询表包括一系列的比较值Ci和相应的滤波器控制值Fi。滤波器电路115以选中滤波器控制值函数的形式来调整。滤波器电路115被调整来显示一个频率响应曲线以便从当前帧中提取低频能量。当连续更高频率上的噪声能量估计值超过他们对应的参考能量阈值时,滤波器电路115被调整来提取更多的低频能量。图6A和6B给出了选中滤波器控制值的示例频率响应曲线。The
噪声包络估计值的使用帮助提高了自适应地调整滤波器电路来提取低频能量的能力,其所采取方式有助于改善语音总体质量。由于汽车环境不是使用移动无线电通讯装置的唯一环境。因此,某一环境中的噪声包络可能倾向于更高频率。当低频中的噪声能量很小时,谱分析器270可以被选择性地被禁止。而且,当噪声频率谱的很大一部分位于低频时,那麽即使某些处理功率被牺牲掉,也要应用更陡的滤波坡度。这种额外处理要求仍然是很小的。The use of the noise envelope estimate helps improve the ability to adaptively adjust the filter circuit to extract low frequency energy in a way that helps improve the overall quality of speech. Since the automotive environment is not the only environment in which mobile radios are used. Therefore, the noise profile in an environment may be skewed toward higher frequencies. Spectrum analyzer 270 can be selectively disabled when the noise energy in low frequencies is small. Also, when a significant portion of the noise frequency spectrum is at low frequencies, then steeper filtering slopes are applied even though some processing power is sacrificed. This additional processing requirement is still minimal.
根据上面描述显而易见,该发明的自适应噪声滤波系统被简单地实现。并且DSP的计算量没有显著增加。削减噪声更复杂的方法,例如″谱削减″要求几个涉及计算的MIPS和存储数据和程序代码用的大量存储器。通过比较,该发明可以通过使用仅仅一小部分″谱削减″算法所要求的MIPS和存储器而实现,其中的谱削减算法同时也引入了更多的语音失真。容量降低的存储器减小了DSP集成电路的大小;降低的MIPS减少了功率消耗。这些特性对于电池供电的便携/移动无线电话都是很理想的。As apparent from the above description, the adaptive noise filtering system of the invention is simply implemented. And the calculation amount of DSP does not increase significantly. More complex methods of noise reduction, such as "spectral reduction" require several MIPS involved in the computation and a large amount of memory for storing data and program code. By comparison, the invention can be implemented using only a fraction of the MIPS and memory required by the "spectral clipping" algorithm, which also introduces more speech distortion. Reduced memory reduces the size of the DSP integrated circuit; reduced MIPS reduces power consumption. These features are ideal for battery-operated portable/mobile radiotelephones.
参考其优选实施方案,尽管该发明已经被特别表示并描述出,但它不仅仅限于这些实施方案。例如,尽管DSP被描述为执行帧能量估计器210,噪声估计器230,语音检测器240,滤波器选择器235和滤波器电路265的功能,这些功能可以通过使用其它的数字和/或模拟元件来实现。此外,在滤波器电路115以噪声估计和噪声包络估计这两者函数的形式来调整时,自适应滤波系统100也可以被实现。While the invention has been particularly shown and described with reference to its preferred embodiments, it is not limited thereto. For example, although the DSP is described as performing the functions of
Claims (10)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US52800595A | 1995-09-14 | 1995-09-14 | |
| US08/528,005 | 1995-09-14 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN1201547A CN1201547A (en) | 1998-12-09 |
| CN1121684C true CN1121684C (en) | 2003-09-17 |
Family
ID=24103874
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN96198008A Expired - Fee Related CN1121684C (en) | 1995-09-14 | 1996-09-13 | System for adaptively filtering audio signals to enhance speech intelligibility in noisy environmental conditions |
Country Status (15)
| Country | Link |
|---|---|
| EP (1) | EP0852052B1 (en) |
| JP (1) | JPH11514453A (en) |
| KR (1) | KR100423029B1 (en) |
| CN (1) | CN1121684C (en) |
| AU (1) | AU724111B2 (en) |
| BR (1) | BR9610290A (en) |
| CA (1) | CA2231107A1 (en) |
| DE (1) | DE69613380D1 (en) |
| EE (1) | EE03456B1 (en) |
| MX (1) | MX9801857A (en) |
| NO (1) | NO981074L (en) |
| PL (1) | PL185513B1 (en) |
| RU (1) | RU2163032C2 (en) |
| TR (1) | TR199800475T1 (en) |
| WO (1) | WO1997010586A1 (en) |
Families Citing this family (182)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE19747885B4 (en) | 1997-10-30 | 2009-04-23 | Harman Becker Automotive Systems Gmbh | Method for reducing interference of acoustic signals by means of the adaptive filter method of spectral subtraction |
| EP0954849B1 (en) * | 1997-10-31 | 2003-05-28 | Koninklijke Philips Electronics N.V. | A method and apparatus for audio representation of speech that has been encoded according to the lpc principle, through adding noise to constituent signals therein |
| KR20000074236A (en) * | 1999-05-19 | 2000-12-15 | 정몽규 | Auto audio volume control means |
| US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
| JP2001318694A (en) * | 2000-05-10 | 2001-11-16 | Toshiba Corp | Signal processing device, signal processing method and recording medium |
| US6983242B1 (en) * | 2000-08-21 | 2006-01-03 | Mindspeed Technologies, Inc. | Method for robust classification in speech coding |
| KR20030010432A (en) * | 2001-07-28 | 2003-02-05 | 주식회사 엑스텔테크놀러지 | Apparatus for speech recognition in noisy environment |
| RU2221284C2 (en) * | 2002-02-26 | 2004-01-10 | Дочернее государственное унитарное научно-исследовательское предприятие "АРГУС" | Coded speech transmission and reception method |
| IL148592A0 (en) | 2002-03-10 | 2002-09-12 | Ycd Multimedia Ltd | Dynamic normalizing |
| CN100477705C (en) * | 2002-07-01 | 2009-04-08 | 皇家飞利浦电子股份有限公司 | Audio enhancement system, system provided with such system, and distortion signal enhancement method |
| KR100978015B1 (en) * | 2002-07-01 | 2010-08-25 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Fixed Spectrum Power Dependent Audio Enhancement System |
| CN1640191B (en) * | 2002-07-12 | 2011-07-20 | 唯听助听器公司 | Hearing aids and ways to improve speech clarity |
| US7242763B2 (en) | 2002-11-26 | 2007-07-10 | Lucent Technologies Inc. | Systems and methods for far-end noise reduction and near-end noise compensation in a mixed time-frequency domain compander to improve signal quality in communications systems |
| KR20040058855A (en) * | 2002-12-27 | 2004-07-05 | 엘지전자 주식회사 | voice modification device and the method |
| DE10305369B4 (en) * | 2003-02-10 | 2005-05-19 | Siemens Ag | User-adaptive method for noise modeling |
| EP2249586A3 (en) * | 2003-03-03 | 2012-06-20 | Phonak AG | Method for manufacturing acoustical devices and for reducing wind disturbances |
| US7127076B2 (en) | 2003-03-03 | 2006-10-24 | Phonak Ag | Method for manufacturing acoustical devices and for reducing especially wind disturbances |
| US7499686B2 (en) * | 2004-02-24 | 2009-03-03 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement on a mobile device |
| CN101873266B (en) | 2004-08-30 | 2015-11-25 | 高通股份有限公司 | For the adaptive de-jitter buffer of voice IP transmission |
| KR100640865B1 (en) | 2004-09-07 | 2006-11-02 | 엘지전자 주식회사 | Method and device to improve voice quality |
| US8085678B2 (en) | 2004-10-13 | 2011-12-27 | Qualcomm Incorporated | Media (voice) playback (de-jitter) buffer adjustments based on air interface |
| WO2006075563A1 (en) * | 2005-01-11 | 2006-07-20 | Nec Corporation | Audio encoding device, audio encoding method, and audio encoding program |
| US8155965B2 (en) * | 2005-03-11 | 2012-04-10 | Qualcomm Incorporated | Time warping frames inside the vocoder by modifying the residual |
| US7680656B2 (en) * | 2005-06-28 | 2010-03-16 | Microsoft Corporation | Multi-sensory speech enhancement using a speech-state model |
| CN101233561B (en) * | 2005-08-02 | 2011-07-13 | 皇家飞利浦电子股份有限公司 | Enhancing speech intelligibility in mobile communication devices by controlling the operation of the vibrator according to the background noise |
| GB2429139B (en) * | 2005-08-10 | 2010-06-16 | Zarlink Semiconductor Inc | A low complexity noise reduction method |
| US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
| KR100667852B1 (en) * | 2006-01-13 | 2007-01-11 | 삼성전자주식회사 | Noise canceller and method for portable recorder equipment |
| RU2402872C2 (en) * | 2006-01-27 | 2010-10-27 | Коудинг Текнолоджиз Аб | Efficient filtering with complex modulated filterbank |
| RU2417460C2 (en) * | 2006-06-05 | 2011-04-27 | Эксаудио Аб | Blind signal extraction |
| US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
| US20080152167A1 (en) * | 2006-12-22 | 2008-06-26 | Step Communications Corporation | Near-field vector signal enhancement |
| KR101414233B1 (en) | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving intelligibility of speech signal |
| KR100883896B1 (en) * | 2007-01-19 | 2009-02-17 | 엘지전자 주식회사 | Voice Clarity Improvement Device and Method |
| US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
| KR100876794B1 (en) | 2007-04-03 | 2009-01-09 | 삼성전자주식회사 | Apparatus and method for improving speech intelligibility in a mobile terminal |
| RU2343563C1 (en) * | 2007-05-21 | 2009-01-10 | Федеральное государственное унитарное предприятие "ПЕНЗЕНСКИЙ НАУЧНО-ИССЛЕДОВАТЕЛЬСКИЙ ЭЛЕКТРОТЕХНИЧЕСКИЙ ИНСТИТУТ" (ФГУП "ПНИЭИ") | Way of transfer and reception of coded voice signals |
| RU2374763C2 (en) * | 2007-08-07 | 2009-11-27 | Государственное образовательное учреждение высшего профессионального образования "Уральский государственный университет путей сообщения" (УрГУПС) | Method for transfer of voice signals |
| RU2469423C2 (en) * | 2007-09-12 | 2012-12-10 | Долби Лэборетериз Лайсенсинг Корпорейшн | Speech enhancement with voice clarity |
| CN101904098B (en) * | 2007-12-20 | 2014-10-22 | 艾利森电话股份有限公司 | Noise suppression method and apparatus |
| JP5086442B2 (en) | 2007-12-20 | 2012-11-28 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | Noise suppression method and apparatus |
| US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
| CN101221767B (en) * | 2008-01-23 | 2012-05-30 | 晨星半导体股份有限公司 | Human voice speech enhancement device and method applied thereto |
| US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
| MY159890A (en) | 2008-04-18 | 2017-02-15 | Dolby Laboratories Licensing Corp | Method and apparatus for maintaining speech audibiliy in multi-channel audio with minimal impact on surround experience |
| US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
| US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
| WO2010067118A1 (en) | 2008-12-11 | 2010-06-17 | Novauris Technologies Limited | Speech recognition involving a mobile device |
| DE102009011583A1 (en) | 2009-03-06 | 2010-09-09 | Krones Ag | Method and device for producing and filling thin-walled beverage containers |
| US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
| US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
| US20120311585A1 (en) | 2011-06-03 | 2012-12-06 | Apple Inc. | Organizing task items that represent tasks to perform |
| US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
| US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
| US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
| US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
| US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
| US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
| DE112011100329T5 (en) | 2010-01-25 | 2012-10-31 | Andrew Peter Nelson Jerram | Apparatus, methods and systems for a digital conversation management platform |
| US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
| CN102202038B (en) * | 2010-03-24 | 2015-05-06 | 华为技术有限公司 | Method and system for realizing voice energy display, conference server and terminal |
| JP5867389B2 (en) * | 2010-05-24 | 2016-02-24 | 日本電気株式会社 | Signal processing method, information processing apparatus, and signal processing program |
| CN101859569B (en) * | 2010-05-27 | 2012-08-15 | 上海朗谷电子科技有限公司 | Method for lowering noise of digital audio-frequency signal |
| US8639516B2 (en) | 2010-06-04 | 2014-01-28 | Apple Inc. | User-specific noise suppression for voice quality improvements |
| US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
| CN102128976B (en) * | 2011-01-07 | 2013-05-15 | 钜泉光电科技(上海)股份有限公司 | Energy pulse output method and device of electric energy meter and electric energy meter |
| US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
| US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
| US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
| EP2575128A3 (en) * | 2011-09-30 | 2013-08-14 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
| US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
| US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
| US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
| US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
| CN102737646A (en) * | 2012-06-21 | 2012-10-17 | 佛山市瀚芯电子科技有限公司 | Real-time dynamic voice noise reduction method for single microphone |
| US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
| US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
| US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
| JP2016508007A (en) | 2013-02-07 | 2016-03-10 | アップル インコーポレイテッド | Voice trigger for digital assistant |
| US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
| US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
| WO2014144579A1 (en) | 2013-03-15 | 2014-09-18 | Apple Inc. | System and method for updating an adaptive speech recognition model |
| AU2014233517B2 (en) | 2013-03-15 | 2017-05-25 | Apple Inc. | Training an at least partial voice command system |
| CN104095640A (en) * | 2013-04-03 | 2014-10-15 | 达尔生技股份有限公司 | Blood oxygen saturation detection method and device |
| WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
| US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
| WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
| WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
| HK1223708A1 (en) | 2013-06-09 | 2017-08-04 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
| US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
| WO2014200731A1 (en) | 2013-06-13 | 2014-12-18 | Apple Inc. | System and method for emergency calls initiated by voice command |
| EP2816557B1 (en) * | 2013-06-20 | 2015-11-04 | Harman Becker Automotive Systems GmbH | Identifying spurious signals in audio signals |
| US9697831B2 (en) | 2013-06-26 | 2017-07-04 | Cirrus Logic, Inc. | Speech recognition |
| US10791216B2 (en) | 2013-08-06 | 2020-09-29 | Apple Inc. | Auto-activating smart responses based on activities from remote devices |
| US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
| US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
| US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
| US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
| US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
| US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
| US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
| US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
| US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
| US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
| US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
| US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
| US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
| TWI566107B (en) | 2014-05-30 | 2017-01-11 | 蘋果公司 | Method for processing a multi-part voice command, non-transitory computer readable storage medium and electronic device |
| US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
| US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
| EP2980801A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method for estimating noise in an audio signal, noise estimator, audio encoder, audio decoder, and system for transmitting audio signals |
| US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
| US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
| US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
| US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
| US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
| US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
| US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
| US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
| US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
| US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
| RU2589298C1 (en) * | 2014-12-29 | 2016-07-10 | Александр Юрьевич Бредихин | Method of increasing legible and informative audio signals in the noise situation |
| US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
| US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
| US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
| US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
| US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
| US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
| US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
| US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
| US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
| US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
| US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
| US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
| US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
| US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
| US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
| US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
| US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
| US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
| US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
| WO2017081092A1 (en) * | 2015-11-09 | 2017-05-18 | Nextlink Ipr Ab | Method of and system for noise suppression |
| US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
| US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
| CN105869650B (en) * | 2015-12-28 | 2020-03-06 | 乐融致新电子科技(天津)有限公司 | Digital audio data playing method and device |
| US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
| CN106060717A (en) * | 2016-05-26 | 2016-10-26 | 广东睿盟计算机科技有限公司 | High-definition dynamic noise-reduction pickup |
| US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
| US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
| US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
| US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
| DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
| US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
| US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
| US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
| US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
| US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
| DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
| DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
| DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
| DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
| US9748929B1 (en) * | 2016-10-24 | 2017-08-29 | Analog Devices, Inc. | Envelope-dependent order-varying filter control |
| US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
| CN107039044B (en) * | 2017-03-08 | 2020-04-21 | Oppo广东移动通信有限公司 | A kind of voice signal processing method and mobile terminal |
| DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
| DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
| US10157627B1 (en) * | 2017-06-02 | 2018-12-18 | Bose Corporation | Dynamic spectral filtering |
| WO2019187841A1 (en) * | 2018-03-30 | 2019-10-03 | パナソニックIpマネジメント株式会社 | Noise reduction device |
| RU2680735C1 (en) * | 2018-10-15 | 2019-02-26 | Акционерное общество "Концерн "Созвездие" | Method of separation of speech and pauses by analysis of the values of phases of frequency components of noise and signal |
| WO2020107269A1 (en) * | 2018-11-28 | 2020-06-04 | 深圳市汇顶科技股份有限公司 | Self-adaptive speech enhancement method, and electronic device |
| US11438452B1 (en) | 2019-08-09 | 2022-09-06 | Apple Inc. | Propagating context information in a privacy preserving manner |
| US11501758B2 (en) | 2019-09-27 | 2022-11-15 | Apple Inc. | Environment aware voice-assistant devices, and related systems and methods |
| CN119107930A (en) | 2019-09-27 | 2024-12-10 | 苹果公司 | Environmentally aware voice assistant device and related systems and methods |
| WO2021179045A1 (en) * | 2020-03-13 | 2021-09-16 | University Of South Australia | A data processing method |
| CN111370033B (en) * | 2020-03-13 | 2023-09-22 | 北京字节跳动网络技术有限公司 | Keyboard sound processing method and device, terminal equipment and storage medium |
| CN111402916B (en) * | 2020-03-24 | 2023-08-04 | 青岛罗博智慧教育技术有限公司 | Voice enhancement system, method and handwriting board |
| CN114093391B (en) * | 2020-07-29 | 2025-02-25 | 华为技术有限公司 | A method and device for filtering abnormal signals |
| CN111916106B (en) * | 2020-08-17 | 2021-06-15 | 牡丹江医学院 | A Method to Improve Pronunciation Quality in English Teaching |
| CN112927715B (en) * | 2021-02-26 | 2024-06-14 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio processing method, device and computer readable storage medium |
| CN114550740B (en) * | 2022-04-26 | 2022-07-15 | 天津市北海通信技术有限公司 | Voice definition algorithm under noise and train audio playing method and system thereof |
| CN116092526B (en) * | 2023-01-10 | 2025-12-19 | 浙江大华技术股份有限公司 | Signal detection method and device |
| CN118411998B (en) * | 2024-07-02 | 2024-09-24 | 杭州知聊信息技术有限公司 | Audio noise processing method and system based on big data |
Family Cites Families (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4461025A (en) * | 1982-06-22 | 1984-07-17 | Audiological Engineering Corporation | Automatic background noise suppressor |
| US4630305A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic gain selector for a noise suppression system |
| US4811404A (en) * | 1987-10-01 | 1989-03-07 | Motorola, Inc. | Noise suppression system |
| DE4012349A1 (en) * | 1989-04-19 | 1990-10-25 | Ricoh Kk | Noise elimination device for speech recognition system - uses spectral subtraction of sampled noise values from sampled speech values |
| RU2010439C1 (en) * | 1991-07-04 | 1994-03-30 | Войсковая Часть 25871 | Device for transmission of data over vocoder path |
| JP3065739B2 (en) * | 1991-10-14 | 2000-07-17 | 三菱電機株式会社 | Voice section detection device |
| US5412735A (en) * | 1992-02-27 | 1995-05-02 | Central Institute For The Deaf | Adaptive noise reduction circuit for a sound reproduction system |
| JPH05259928A (en) * | 1992-03-09 | 1993-10-08 | Oki Electric Ind Co Ltd | Method and device for canceling adaptive control noise |
| US5251263A (en) * | 1992-05-22 | 1993-10-05 | Andrea Electronics Corporation | Adaptive noise cancellation and speech enhancement system and apparatus therefor |
| JPH0695693A (en) * | 1992-09-09 | 1994-04-08 | Fujitsu Ten Ltd | Noise reducing circuit for voice recognition device |
| JP3270866B2 (en) * | 1993-03-23 | 2002-04-02 | ソニー株式会社 | Noise removal method and noise removal device |
| US5485522A (en) * | 1993-09-29 | 1996-01-16 | Ericsson Ge Mobile Communications, Inc. | System for adaptively reducing noise in speech signals |
| US5657422A (en) * | 1994-01-28 | 1997-08-12 | Lucent Technologies Inc. | Voice activity detection driven noise remediator |
-
1996
- 1996-09-13 WO PCT/US1996/014665 patent/WO1997010586A1/en not_active Ceased
- 1996-09-13 PL PL96325532A patent/PL185513B1/en not_active IP Right Cessation
- 1996-09-13 EP EP96931552A patent/EP0852052B1/en not_active Expired - Lifetime
- 1996-09-13 EE EE9800068A patent/EE03456B1/en not_active IP Right Cessation
- 1996-09-13 AU AU70784/96A patent/AU724111B2/en not_active Ceased
- 1996-09-13 BR BR9610290A patent/BR9610290A/en not_active IP Right Cessation
- 1996-09-13 TR TR1998/00475T patent/TR199800475T1/en unknown
- 1996-09-13 RU RU98107313/09A patent/RU2163032C2/en not_active IP Right Cessation
- 1996-09-13 CN CN96198008A patent/CN1121684C/en not_active Expired - Fee Related
- 1996-09-13 JP JP9512112A patent/JPH11514453A/en not_active Ceased
- 1996-09-13 CA CA002231107A patent/CA2231107A1/en not_active Abandoned
- 1996-09-13 KR KR10-1998-0701913A patent/KR100423029B1/en not_active Expired - Fee Related
- 1996-09-13 DE DE69613380T patent/DE69613380D1/en not_active Expired - Lifetime
-
1998
- 1998-03-09 MX MX9801857A patent/MX9801857A/en unknown
- 1998-03-11 NO NO981074A patent/NO981074L/en not_active Application Discontinuation
Also Published As
| Publication number | Publication date |
|---|---|
| JPH11514453A (en) | 1999-12-07 |
| BR9610290A (en) | 1999-03-16 |
| DE69613380D1 (en) | 2001-07-19 |
| NO981074L (en) | 1998-05-13 |
| CN1201547A (en) | 1998-12-09 |
| RU2163032C2 (en) | 2001-02-10 |
| NO981074D0 (en) | 1998-03-11 |
| AU724111B2 (en) | 2000-09-14 |
| WO1997010586A1 (en) | 1997-03-20 |
| EP0852052A1 (en) | 1998-07-08 |
| TR199800475T1 (en) | 1998-06-22 |
| MX9801857A (en) | 1998-11-29 |
| CA2231107A1 (en) | 1997-03-20 |
| PL185513B1 (en) | 2003-05-30 |
| PL325532A1 (en) | 1998-08-03 |
| AU7078496A (en) | 1997-04-01 |
| KR100423029B1 (en) | 2004-07-01 |
| EE03456B1 (en) | 2001-06-15 |
| KR19990044659A (en) | 1999-06-25 |
| EE9800068A (en) | 1998-08-17 |
| EP0852052B1 (en) | 2001-06-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN1121684C (en) | System for adaptively filtering audio signals to enhance speech intelligibility in noisy environmental conditions | |
| EP0645756B1 (en) | System for adaptively reducing noise in speech signals | |
| RU2151430C1 (en) | Noise simulator, which is controlled by voice detection | |
| EP3281199B1 (en) | Audio bandwidth selection | |
| US7058574B2 (en) | Signal processing apparatus and mobile radio communication terminal | |
| KR20010078401A (en) | Complex signal activity detection for improved speech/noise classification of an audio signal | |
| CN104981870B (en) | Sound enhancing devices | |
| US7653539B2 (en) | Communication device, signal encoding/decoding method | |
| US5710862A (en) | Method and apparatus for reducing an undesirable characteristic of a spectral estimate of a noise signal between occurrences of voice signals | |
| JP2003510643A (en) | Processing circuit for correcting audio signal, receiver, communication system, portable device, and method therefor | |
| JP2002162982A (en) | Voice / silence determination device and voice / silence determination method | |
| CN1672193A (en) | Speech communication unit and method for error mitigation of speech frames | |
| WO2001041334A1 (en) | Method and apparatus for suppressing acoustic background noise in a communication system | |
| HK1058427B (en) | Method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder | |
| HK1047817A1 (en) | Spectral magnitude quantization for a speech coder | |
| HK1058427A1 (en) | Method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder | |
| HK1047817B (en) | Spectral magnitude quantization for a speech coder |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| C19 | Lapse of patent right due to non-payment of the annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |