MX2017016084A - Reconocimiento de voz sin interrumpir la reproduccion de audio. - Google Patents
Reconocimiento de voz sin interrumpir la reproduccion de audio.Info
- Publication number
- MX2017016084A MX2017016084A MX2017016084A MX2017016084A MX2017016084A MX 2017016084 A MX2017016084 A MX 2017016084A MX 2017016084 A MX2017016084 A MX 2017016084A MX 2017016084 A MX2017016084 A MX 2017016084A MX 2017016084 A MX2017016084 A MX 2017016084A
- Authority
- MX
- Mexico
- Prior art keywords
- audio
- component
- speech recognition
- capture
- filter
- Prior art date
Links
- 238000001914 filtration Methods 0.000 abstract 2
- 238000009877 rendering Methods 0.000 abstract 2
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Se describen en la presente sistemas, métodos y dispositivos para capturar una entrada de voz de un usuario. Un sistema incluye un componente de audio de reproducción, un componente de traducción de audio, un componente de captura, un componente de filtrado y un componente de reconocimiento de voz. El componente de audio reproducido está configurado para almacenar datos de audio para la generación de sonido. El componente de traducción de audio está configurado para reproducir los datos de audio en uno o más parlantes. El componente de captura está configurado para capturar el audio (audio capturado) mediante el uso de un micrófono. El componente de filtrado está configurado para filtrar el audio capturado para generar audio filtrado, donde el filtrado incluye el uso de los datos de audio almacenados para retirar el audio correspondiente a los datos de audio del audio capturado. El componente de reconocimiento de voz está configurado para generar texto o comandos basados en el audio filtrado.
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/377,600 US20180166073A1 (en) | 2016-12-13 | 2016-12-13 | Speech Recognition Without Interrupting The Playback Audio |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| MX2017016084A true MX2017016084A (es) | 2018-11-09 |
Family
ID=60950167
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| MX2017016084A MX2017016084A (es) | 2016-12-13 | 2017-12-11 | Reconocimiento de voz sin interrumpir la reproduccion de audio. |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20180166073A1 (es) |
| CN (1) | CN108231071A (es) |
| DE (1) | DE102017129484A1 (es) |
| GB (1) | GB2559460A (es) |
| MX (1) | MX2017016084A (es) |
| RU (1) | RU2017143129A (es) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3570279A1 (en) * | 2018-05-17 | 2019-11-20 | MediaTek Inc. | Audio output monitoring for failure detection of warning sound playback |
| US20200211540A1 (en) * | 2018-12-27 | 2020-07-02 | Microsoft Technology Licensing, Llc | Context-based speech synthesis |
| CN109743436B (zh) * | 2018-12-29 | 2020-08-28 | 苏州思必驰信息科技有限公司 | 用于语音对话的通讯补偿方法、装置、设备和存储介质 |
| US10867615B2 (en) * | 2019-01-25 | 2020-12-15 | Comcast Cable Communications, Llc | Voice recognition with timing information for noise cancellation |
| JP7110496B2 (ja) | 2019-01-29 | 2022-08-01 | グーグル エルエルシー | ワイヤレススピーカーにおいて、再生を検出するため、かつ/または不整合な再生に適応するための構造化オーディオ出力の使用 |
| US12332937B2 (en) | 2019-07-31 | 2025-06-17 | Adeia Guides Inc. | Systems and methods for managing voice queries using pronunciation information |
| US11410656B2 (en) * | 2019-07-31 | 2022-08-09 | Rovi Guides, Inc. | Systems and methods for managing voice queries using pronunciation information |
| US11494434B2 (en) | 2019-07-31 | 2022-11-08 | Rovi Guides, Inc. | Systems and methods for managing voice queries using pronunciation information |
| CN111210820B (zh) * | 2020-01-21 | 2022-11-18 | 达闼机器人股份有限公司 | 机器人的控制方法、装置、电子设备以及存储介质 |
Family Cites Families (31)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6001131A (en) * | 1995-02-24 | 1999-12-14 | Nynex Science & Technology, Inc. | Automatic target noise cancellation for speech enhancement |
| US5708704A (en) * | 1995-04-07 | 1998-01-13 | Texas Instruments Incorporated | Speech recognition method and system with improved voice-activated prompt interrupt capability |
| US5848163A (en) * | 1996-02-02 | 1998-12-08 | International Business Machines Corporation | Method and apparatus for suppressing background music or noise from the speech input of a speech recognizer |
| DE19814971A1 (de) * | 1998-04-03 | 1999-10-07 | Daimlerchrysler Aerospace Ag | Verfahren zur Störbefreiung eines Mikrophonsignals |
| US6246986B1 (en) * | 1998-12-31 | 2001-06-12 | At&T Corp. | User barge-in enablement in large vocabulary speech recognition systems |
| US7136458B1 (en) * | 1999-12-23 | 2006-11-14 | Bellsouth Intellectual Property Corporation | Voice recognition for filtering and announcing message |
| US6725193B1 (en) * | 2000-09-13 | 2004-04-20 | Telefonaktiebolaget Lm Ericsson | Cancellation of loudspeaker words in speech recognition |
| WO2002052546A1 (en) * | 2000-12-27 | 2002-07-04 | Intel Corporation | Voice barge-in in telephony speech recognition |
| DE10163214A1 (de) * | 2001-12-21 | 2003-07-10 | Philips Intellectual Property | Verfahren und Steuersystem zur Sprachsteuerung eines Gerätes |
| US7328159B2 (en) * | 2002-01-15 | 2008-02-05 | Qualcomm Inc. | Interactive speech recognition apparatus and method with conditioned voice prompts |
| JP4209247B2 (ja) * | 2003-05-02 | 2009-01-14 | アルパイン株式会社 | 音声認識装置および方法 |
| US8244536B2 (en) * | 2003-08-27 | 2012-08-14 | General Motors Llc | Algorithm for intelligent speech recognition |
| US7099821B2 (en) * | 2003-09-12 | 2006-08-29 | Softmax, Inc. | Separation of target acoustic signals in a multi-transducer arrangement |
| JP4333369B2 (ja) * | 2004-01-07 | 2009-09-16 | 株式会社デンソー | 雑音除去装置、及び音声認識装置、並びにカーナビゲーション装置 |
| JP4283212B2 (ja) * | 2004-12-10 | 2009-06-24 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 雑音除去装置、雑音除去プログラム、及び雑音除去方法 |
| US7813498B2 (en) * | 2007-07-27 | 2010-10-12 | Fortemedia, Inc. | Full-duplex communication device and method of acoustic echo cancellation therein |
| ATE508452T1 (de) * | 2007-11-12 | 2011-05-15 | Harman Becker Automotive Sys | Unterscheidung zwischen vordergrundsprache und hintergrundgeräuschen |
| KR101233271B1 (ko) * | 2008-12-12 | 2013-02-14 | 신호준 | 신호 분리 방법, 상기 신호 분리 방법을 이용한 통신 시스템 및 음성인식시스템 |
| US8364298B2 (en) * | 2009-07-29 | 2013-01-29 | International Business Machines Corporation | Filtering application sounds |
| US8311838B2 (en) * | 2010-01-13 | 2012-11-13 | Apple Inc. | Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts |
| US9111536B2 (en) * | 2011-03-07 | 2015-08-18 | Texas Instruments Incorporated | Method and system to play background music along with voice on a CDMA network |
| US8762151B2 (en) * | 2011-06-16 | 2014-06-24 | General Motors Llc | Speech recognition for premature enunciation |
| KR101641448B1 (ko) * | 2012-03-16 | 2016-07-20 | 뉘앙스 커뮤니케이션즈, 인코포레이티드 | 사용자 전용 자동 음성 인식 |
| US8781821B2 (en) * | 2012-04-30 | 2014-07-15 | Zanavox | Voiced interval command interpretation |
| EP2896194B1 (en) * | 2012-09-14 | 2018-05-09 | Google LLC | Handling concurrent speech |
| TWI557722B (zh) * | 2012-11-15 | 2016-11-11 | 緯創資通股份有限公司 | 語音干擾的濾除方法、系統,與電腦可讀記錄媒體 |
| KR101428245B1 (ko) * | 2012-12-05 | 2014-08-07 | 현대자동차주식회사 | 음성 인식 장치 및 방법 |
| US9767819B2 (en) * | 2013-04-11 | 2017-09-19 | Nuance Communications, Inc. | System for automatic speech recognition and audio entertainment |
| CN105138110A (zh) * | 2014-05-29 | 2015-12-09 | 中兴通讯股份有限公司 | 语音交互方法及装置 |
| US9947318B2 (en) * | 2014-10-03 | 2018-04-17 | 2236008 Ontario Inc. | System and method for processing an audio signal captured from a microphone |
| EP3206204A1 (en) * | 2016-02-09 | 2017-08-16 | Nxp B.V. | System for processing audio |
-
2016
- 2016-12-13 US US15/377,600 patent/US20180166073A1/en not_active Abandoned
-
2017
- 2017-12-04 GB GB1720160.9A patent/GB2559460A/en not_active Withdrawn
- 2017-12-08 CN CN201711292146.8A patent/CN108231071A/zh active Pending
- 2017-12-11 MX MX2017016084A patent/MX2017016084A/es unknown
- 2017-12-11 DE DE102017129484.8A patent/DE102017129484A1/de not_active Withdrawn
- 2017-12-11 RU RU2017143129A patent/RU2017143129A/ru not_active Application Discontinuation
Also Published As
| Publication number | Publication date |
|---|---|
| US20180166073A1 (en) | 2018-06-14 |
| CN108231071A (zh) | 2018-06-29 |
| GB2559460A (en) | 2018-08-08 |
| GB201720160D0 (en) | 2018-01-17 |
| DE102017129484A1 (de) | 2018-06-14 |
| RU2017143129A (ru) | 2019-06-11 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| MX2017016084A (es) | Reconocimiento de voz sin interrumpir la reproduccion de audio. | |
| EP4456566A3 (en) | Linear filtering for noise-suppressed speech detection | |
| MX2016005224A (es) | Metodo y dispositivo para lograr el registro de audio objetivo y aparato electronico. | |
| EP4394768A3 (en) | Vehicle-based media system with audio ad and visual content synchronization feature | |
| WO2016009444A3 (en) | Music performance system and method thereof | |
| WO2011130083A3 (en) | Camera-assisted noise cancellation and speech recognition | |
| EP2863392A3 (en) | Noise reduction in multi-microphone systems | |
| GB2543972A (en) | Systems and methods for equalizing audio for playback on an electronic device | |
| EP3751561A3 (en) | Hotword recognition | |
| MX2023006478A (es) | Aparato y metodo para proporcionar zonas individuales de sonido. | |
| MX2020006207A (es) | Modo de privacidad para un dispositivo de audio inalambrico. | |
| WO2013060574A3 (de) | Geräuschunterdrückungssystem und verfahren zur geräuschunterdrückung | |
| MX377073B (es) | Dispositivo para la reproducción del habla configurado para enmascarar el habla reproducida en una zona de habla enmascarada | |
| MX2010008372A (es) | Aparato y metodo para calcular coeficientes de filtro para supresion de eco. | |
| EP4604583A3 (en) | Method and apparatus for rendering acoustic signal, and computer-readable recording medium | |
| WO2014168934A3 (en) | Systems and methods for generating a digital output signal in a digital microphone system | |
| WO2010004056A3 (en) | Method and system for speech enhancement in a room | |
| MX2024010034A (es) | Procesamiento de audio en servicios de audio inmersivo. | |
| WO2012145709A3 (en) | A method for encoding multiple microphone signals into a source-separable audio signal for network transmission and an apparatus for directed source separation | |
| MY189000A (en) | Audio processing device and method, and program therefor | |
| WO2018063917A3 (en) | Adaptive electronic hearing protection device | |
| PL3216236T3 (pl) | Urządzenie i sposób generowania sygnałów wyjściowych w oparciu o sygnał źródła audio, system odtwarzania dźwięku i sygnał głośników | |
| NZ778334A (en) | Audio-based access control | |
| RU2015145733A (ru) | Оборудование для записи и воспроизведения звуковых сигналов | |
| CN108028982A (zh) | 电子设备及其音频处理方法 |