MX2016011750A - Metodo y aparato para la deteccion de señales de audio. - Google Patents
Metodo y aparato para la deteccion de señales de audio.Info
- Publication number
- MX2016011750A MX2016011750A MX2016011750A MX2016011750A MX2016011750A MX 2016011750 A MX2016011750 A MX 2016011750A MX 2016011750 A MX2016011750 A MX 2016011750A MX 2016011750 A MX2016011750 A MX 2016011750A MX 2016011750 A MX2016011750 A MX 2016011750A
- Authority
- MX
- Mexico
- Prior art keywords
- audio signal
- ssnr
- enhanced
- signal
- voice
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 6
- 238000000034 method Methods 0.000 title abstract 3
- 238000001514 detection method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Telephone Function (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Noise Elimination (AREA)
- Telephonic Communication Services (AREA)
- Signal Processing Not Specific To The Method Of Recording And Reproducing (AREA)
- User Interface Of Digital Computer (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Las modalidades de la presente invención proporcionan un método y un aparato para detectar una señal de audio, en donde el método incluye: determinar una señal de audio de entrada como una señal de audio a-ser-determinada; determinar una relación de señal a ruido segmental (SSNR) mejorada de la señal de audio, en donde la SSNR mejorada es mayor que una SSNR de referencia; y comparar la SSNR mejorada con un umbral de decisión de detección de actividad de voz (VAD) para determinar si la señal de audio es una señal activa. De acuerdo con el método y el aparato proporcionado en las modalidades de la presente invención, una voz activa y una voz inactiva se puede distinguir con precisión.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410090386.XA CN104916292B (zh) | 2014-03-12 | 2014-03-12 | 检测音频信号的方法和装置 |
| PCT/CN2014/092694 WO2015135344A1 (zh) | 2014-03-12 | 2014-12-01 | 检测音频信号的方法和装置 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| MX2016011750A true MX2016011750A (es) | 2016-12-12 |
| MX355828B MX355828B (es) | 2018-05-02 |
Family
ID=54070889
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| MX2016011750A MX355828B (es) | 2014-03-12 | 2014-12-01 | Método y aparato para la detección de señales de audio. |
Country Status (14)
| Country | Link |
|---|---|
| US (3) | US10304478B2 (es) |
| EP (2) | EP3118852B1 (es) |
| JP (2) | JP6493889B2 (es) |
| KR (2) | KR101884220B1 (es) |
| CN (3) | CN107293287B (es) |
| AU (1) | AU2014386442B9 (es) |
| CA (1) | CA2940487C (es) |
| ES (2) | ES2787894T3 (es) |
| MX (1) | MX355828B (es) |
| MY (1) | MY193521A (es) |
| PT (2) | PT3118852T (es) |
| RU (1) | RU2666337C2 (es) |
| SG (1) | SG11201607052SA (es) |
| WO (1) | WO2015135344A1 (es) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107293287B (zh) * | 2014-03-12 | 2021-10-26 | 华为技术有限公司 | 检测音频信号的方法和装置 |
| AU2016402256B2 (en) * | 2016-04-29 | 2019-04-18 | Honor Device Co., Ltd. | Voice input exception determining method, apparatus, terminal, and storage medium |
| CN107040359B (zh) * | 2017-05-08 | 2021-01-19 | 海能达通信股份有限公司 | 一种语音呼叫过程中携带随路信令的方法、装置及设备 |
| CN107393558B (zh) * | 2017-07-14 | 2020-09-11 | 深圳永顺智信息科技有限公司 | 语音活动检测方法及装置 |
| CN107393553B (zh) * | 2017-07-14 | 2020-12-22 | 深圳永顺智信息科技有限公司 | 用于语音活动检测的听觉特征提取方法 |
| CN107393550B (zh) * | 2017-07-14 | 2021-03-19 | 深圳永顺智信息科技有限公司 | 语音处理方法及装置 |
| CN107393559B (zh) * | 2017-07-14 | 2021-05-18 | 深圳永顺智信息科技有限公司 | 检校语音检测结果的方法及装置 |
| US11783809B2 (en) * | 2020-10-08 | 2023-10-10 | Qualcomm Incorporated | User voice activity detection using dynamic classifier |
| WO2023085749A1 (ko) | 2021-11-09 | 2023-05-19 | 삼성전자주식회사 | 빔포밍을 제어하는 전자 장치 및 이의 동작 방법 |
| US20240304203A1 (en) * | 2023-03-06 | 2024-09-12 | Nvidia Corporation | Noise reduction using voice activity detection in audio processing systems and applications |
Family Cites Families (54)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS59182498A (ja) * | 1983-04-01 | 1984-10-17 | 日本電気株式会社 | 音声検出回路 |
| JPS63259596A (ja) * | 1987-04-16 | 1988-10-26 | 株式会社日立製作所 | 音声区間検出方式 |
| PL174216B1 (pl) * | 1993-11-30 | 1998-06-30 | At And T Corp | Sposób redukcji w czasie rzeczywistym szumu transmisji mowy |
| FI100840B (fi) * | 1995-12-12 | 1998-02-27 | Nokia Mobile Phones Ltd | Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin |
| US5991718A (en) * | 1998-02-27 | 1999-11-23 | At&T Corp. | System and method for noise threshold adaptation for voice activity detection in nonstationary noise environments |
| US6466906B2 (en) * | 1999-01-06 | 2002-10-15 | Dspc Technologies Ltd. | Noise padding and normalization in dynamic time warping |
| US6453291B1 (en) * | 1999-02-04 | 2002-09-17 | Motorola, Inc. | Apparatus and method for voice activity detection in a communication system |
| US6324509B1 (en) | 1999-02-08 | 2001-11-27 | Qualcomm Incorporated | Method and apparatus for accurate endpointing of speech in the presence of noise |
| JP2001236085A (ja) * | 2000-02-25 | 2001-08-31 | Matsushita Electric Ind Co Ltd | 音声区間検出装置、定常雑音区間検出装置、非定常雑音区間検出装置、及び雑音区間検出装置 |
| JP3588030B2 (ja) * | 2000-03-16 | 2004-11-10 | 三菱電機株式会社 | 音声区間判定装置及び音声区間判定方法 |
| US6898566B1 (en) * | 2000-08-16 | 2005-05-24 | Mindspeed Technologies, Inc. | Using signal to noise ratio of a speech signal to adjust thresholds for extracting speech parameters for coding the speech signal |
| CN1175398C (zh) * | 2000-11-18 | 2004-11-10 | 中兴通讯股份有限公司 | 一种从噪声环境中识别出语音和音乐的声音活动检测方法 |
| EP1376539B8 (en) * | 2001-03-28 | 2010-12-15 | Mitsubishi Denki Kabushiki Kaisha | Noise suppressor |
| US7941313B2 (en) * | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system |
| US7203643B2 (en) | 2001-06-14 | 2007-04-10 | Qualcomm Incorporated | Method and apparatus for transmitting speech activity in distributed voice recognition systems |
| US6937980B2 (en) * | 2001-10-02 | 2005-08-30 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech recognition using microphone antenna array |
| JP4281349B2 (ja) * | 2001-12-25 | 2009-06-17 | パナソニック株式会社 | 電話装置 |
| US7024353B2 (en) * | 2002-08-09 | 2006-04-04 | Motorola, Inc. | Distributed speech recognition with back-end voice activity detection apparatus and method |
| US7146315B2 (en) * | 2002-08-30 | 2006-12-05 | Siemens Corporate Research, Inc. | Multichannel voice detection in adverse environments |
| US7162420B2 (en) * | 2002-12-10 | 2007-01-09 | Liberato Technologies, Llc | System and method for noise reduction having first and second adaptive filters |
| JP4490090B2 (ja) * | 2003-12-25 | 2010-06-23 | 株式会社エヌ・ティ・ティ・ドコモ | 有音無音判定装置および有音無音判定方法 |
| CA2454296A1 (en) | 2003-12-29 | 2005-06-29 | Nokia Corporation | Method and device for speech enhancement in the presence of background noise |
| US8340309B2 (en) * | 2004-08-06 | 2012-12-25 | Aliphcom, Inc. | Noise suppressing multi-microphone headset |
| CN100369113C (zh) * | 2004-12-31 | 2008-02-13 | 中国科学院自动化研究所 | 利用增益自适应提高语音识别率的方法 |
| US8175877B2 (en) * | 2005-02-02 | 2012-05-08 | At&T Intellectual Property Ii, L.P. | Method and apparatus for predicting word accuracy in automatic speech recognition systems |
| ES2525427T3 (es) | 2006-02-10 | 2014-12-22 | Telefonaktiebolaget L M Ericsson (Publ) | Un detector de voz y un método para suprimir sub-bandas en un detector de voz |
| US8032370B2 (en) * | 2006-05-09 | 2011-10-04 | Nokia Corporation | Method, apparatus, system and software product for adaptation of voice activity detection parameters based on the quality of the coding modes |
| US8311814B2 (en) * | 2006-09-19 | 2012-11-13 | Avaya Inc. | Efficient voice activity detector to detect fixed power signals |
| CN101197130B (zh) * | 2006-12-07 | 2011-05-18 | 华为技术有限公司 | 声音活动检测方法和声音活动检测器 |
| US8326620B2 (en) * | 2008-04-30 | 2012-12-04 | Qnx Software Systems Limited | Robust downlink speech and noise detector |
| US7769585B2 (en) * | 2007-04-05 | 2010-08-03 | Avidyne Corporation | System and method of voice activity detection in noisy environments |
| CN101320559B (zh) * | 2007-06-07 | 2011-05-18 | 华为技术有限公司 | 一种声音激活检测装置及方法 |
| US8954324B2 (en) * | 2007-09-28 | 2015-02-10 | Qualcomm Incorporated | Multiple microphone voice activity detector |
| KR101335417B1 (ko) | 2008-03-31 | 2013-12-05 | (주)트란소노 | 노이지 음성 신호의 처리 방법과 이를 위한 장치 및 컴퓨터판독 가능한 기록매체 |
| US8768690B2 (en) * | 2008-06-20 | 2014-07-01 | Qualcomm Incorporated | Coding scheme selection for low-bit-rate applications |
| WO2010091339A1 (en) | 2009-02-06 | 2010-08-12 | University Of Ottawa | Method and system for noise reduction for speech enhancement in hearing aid |
| JP5337530B2 (ja) * | 2009-02-25 | 2013-11-06 | 京セラ株式会社 | 無線基地局および無線通信方法 |
| KR20110001130A (ko) * | 2009-06-29 | 2011-01-06 | 삼성전자주식회사 | 가중 선형 예측 변환을 이용한 오디오 신호 부호화 및 복호화 장치 및 그 방법 |
| CN102044242B (zh) | 2009-10-15 | 2012-01-25 | 华为技术有限公司 | 语音激活检测方法、装置和电子设备 |
| CN102044243B (zh) * | 2009-10-15 | 2012-08-29 | 华为技术有限公司 | 语音激活检测方法与装置、编码器 |
| KR20120091068A (ko) | 2009-10-19 | 2012-08-17 | 텔레폰악티에볼라겟엘엠에릭슨(펍) | 음성 활성 검출을 위한 검출기 및 방법 |
| JP2013508773A (ja) * | 2009-10-19 | 2013-03-07 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | 音声エンコーダの方法およびボイス活動検出器 |
| US8898058B2 (en) * | 2010-10-25 | 2014-11-25 | Qualcomm Incorporated | Systems, methods, and apparatus for voice activity detection |
| ES2740173T3 (es) | 2010-12-24 | 2020-02-05 | Huawei Tech Co Ltd | Un método y un aparato para realizar una detección de actividad de voz |
| EP2619753B1 (en) * | 2010-12-24 | 2014-05-21 | Huawei Technologies Co., Ltd. | Method and apparatus for adaptively detecting voice activity in input audio signal |
| WO2012083552A1 (en) * | 2010-12-24 | 2012-06-28 | Huawei Technologies Co., Ltd. | Method and apparatus for voice activity detection |
| US9099098B2 (en) * | 2012-01-20 | 2015-08-04 | Qualcomm Incorporated | Voice activity detection in presence of background noise |
| DE112012005855B4 (de) * | 2012-02-10 | 2021-07-08 | Mitsubishi Electric Corporation | Störungsunterdrückungsvorrichtung |
| JP5862349B2 (ja) * | 2012-02-16 | 2016-02-16 | 株式会社Jvcケンウッド | ノイズ低減装置、音声入力装置、無線通信装置、およびノイズ低減方法 |
| CN103325380B (zh) * | 2012-03-23 | 2017-09-12 | 杜比实验室特许公司 | 用于信号增强的增益后处理 |
| US20130282373A1 (en) | 2012-04-23 | 2013-10-24 | Qualcomm Incorporated | Systems and methods for audio signal processing |
| US9524735B2 (en) * | 2014-01-31 | 2016-12-20 | Apple Inc. | Threshold adaptation in two-channel noise estimation and voice activity detection |
| CN107293287B (zh) * | 2014-03-12 | 2021-10-26 | 华为技术有限公司 | 检测音频信号的方法和装置 |
| US9775113B2 (en) * | 2014-12-11 | 2017-09-26 | Mediatek Inc. | Voice wakeup detecting device with digital microphone and associated method |
-
2014
- 2014-03-12 CN CN201710312455.0A patent/CN107293287B/zh active Active
- 2014-03-12 CN CN201710313043.9A patent/CN107086043B/zh active Active
- 2014-03-12 CN CN201410090386.XA patent/CN104916292B/zh active Active
- 2014-12-01 JP JP2016556770A patent/JP6493889B2/ja active Active
- 2014-12-01 MX MX2016011750A patent/MX355828B/es active IP Right Grant
- 2014-12-01 RU RU2016139717A patent/RU2666337C2/ru active
- 2014-12-01 ES ES14885786T patent/ES2787894T3/es active Active
- 2014-12-01 MY MYPI2016703030A patent/MY193521A/en unknown
- 2014-12-01 PT PT148857865T patent/PT3118852T/pt unknown
- 2014-12-01 SG SG11201607052SA patent/SG11201607052SA/en unknown
- 2014-12-01 PT PT191976604T patent/PT3660845T/pt unknown
- 2014-12-01 KR KR1020167025280A patent/KR101884220B1/ko active Active
- 2014-12-01 EP EP14885786.5A patent/EP3118852B1/en active Active
- 2014-12-01 CA CA2940487A patent/CA2940487C/en active Active
- 2014-12-01 ES ES19197660T patent/ES2926360T3/es active Active
- 2014-12-01 WO PCT/CN2014/092694 patent/WO2015135344A1/zh not_active Ceased
- 2014-12-01 EP EP19197660.4A patent/EP3660845B1/en active Active
- 2014-12-01 KR KR1020187021506A patent/KR102005009B1/ko active Active
- 2014-12-01 AU AU2014386442A patent/AU2014386442B9/en active Active
-
2016
- 2016-09-12 US US15/262,263 patent/US10304478B2/en active Active
-
2018
- 2018-11-30 JP JP2018225323A patent/JP6793706B2/ja active Active
-
2019
- 2019-04-23 US US16/391,893 patent/US10818313B2/en active Active
-
2020
- 2020-06-15 US US16/901,846 patent/US11417353B2/en active Active
Also Published As
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| MX355828B (es) | Método y aparato para la detección de señales de audio. | |
| GB2573424A (en) | Low-power, always-listening, voice-command detection and capture | |
| MX2016013630A (es) | Deteccion de conversacion. | |
| WO2014107469A3 (en) | Mobile device speaker control | |
| IN2014CH00781A (es) | ||
| NZ606249A (en) | Methods and devices with leak detection | |
| WO2013162995A3 (en) | Systems and methods for audio signal processing | |
| IN2014CH00810A (es) | ||
| WO2014035680A3 (en) | Systems and methods for a wearable touch-sensitive device | |
| MX2017003151A (es) | Region inactiva para una superficie tactil con base en la informacion contextual. | |
| JP2016521021A5 (es) | ||
| MX2015012466A (es) | Aparato y método contra la elusión en el uso del sistema de prueba de sobriedad. | |
| MX352737B (es) | Sistema y metodo para determinar la posicion angular de un rodillo giratorio. | |
| IN2014MU00871A (es) | ||
| EP4570924A3 (en) | Chromophore-based characterization and detection methods | |
| TW201614544A (en) | Apparatus and method for detecting fault injection | |
| MY182238A (en) | Route guidance device and route guidance method | |
| PH12019502482A1 (en) | Wireless communication method and device | |
| MX2016012359A (es) | Dispositivo para reproducir sonido. | |
| TW201612549A (en) | Apparatus, system and method for space status detection based on an acoustic signal | |
| MX348707B (es) | Accion activada por movimiento para dispositivo movil. | |
| MX2015012443A (es) | Sistemas y metodos para detectar un atributo de documento utilizando acustica. | |
| MX345267B (es) | Sistema y metodo para determinar una propiedad de un objeto, y una valvula. | |
| GB201206977D0 (en) | An enzyme detection device | |
| EP4534688A3 (en) | Probe and method for detecting transcript resulting from fusion gene and/or exon skipping |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FG | Grant or registration |