[go: up one dir, main page]

WO2008108239A1 - 音声認識システム、方法およびプログラム - Google Patents

音声認識システム、方法およびプログラム Download PDF

Info

Publication number
WO2008108239A1
WO2008108239A1 PCT/JP2008/053368 JP2008053368W WO2008108239A1 WO 2008108239 A1 WO2008108239 A1 WO 2008108239A1 JP 2008053368 W JP2008053368 W JP 2008053368W WO 2008108239 A1 WO2008108239 A1 WO 2008108239A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
voice recognition
recognition system
program
detection means
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2008/053368
Other languages
English (en)
French (fr)
Inventor
Toru Iwasawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to US12/528,767 priority Critical patent/US8417518B2/en
Priority to JP2009502537A priority patent/JP5229217B2/ja
Publication of WO2008108239A1 publication Critical patent/WO2008108239A1/ja
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/285Memory allocation or algorithm optimisation to reduce hardware requirements

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Manipulator (AREA)
  • Telephone Function (AREA)
  • Selective Calling Equipment (AREA)
  • Telephonic Communication Services (AREA)

Abstract

 音声認識システムであって、音声入力素子からの入力信号を取得して出力する音声入力部1と、前記入力信号中の発声区間を検出する音声検出手段2と、前記発声区間に対する音声認識を行う音声認識手段3と、音声検出手段2における検出頻度があらかじめ定めた条件を満たした場合には音声入力部1または音声検出手段2の少なくともいずれか一方に対して制御信号を送出して前記検出頻度を抑制する制御手段4とを備える。
PCT/JP2008/053368 2007-02-27 2008-02-27 音声認識システム、方法およびプログラム Ceased WO2008108239A1 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US12/528,767 US8417518B2 (en) 2007-02-27 2008-02-27 Voice recognition system, method, and program
JP2009502537A JP5229217B2 (ja) 2007-02-27 2008-02-27 音声認識システム、方法およびプログラム

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007047217 2007-02-27
JP2007-047217 2007-02-27

Publications (1)

Publication Number Publication Date
WO2008108239A1 true WO2008108239A1 (ja) 2008-09-12

Family

ID=39738124

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/053368 Ceased WO2008108239A1 (ja) 2007-02-27 2008-02-27 音声認識システム、方法およびプログラム

Country Status (3)

Country Link
US (1) US8417518B2 (ja)
JP (1) JP5229217B2 (ja)
WO (1) WO2008108239A1 (ja)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8374854B2 (en) * 2008-03-28 2013-02-12 Southern Methodist University Spatio-temporal speech enhancement technique based on generalized eigenvalue decomposition
US9123340B2 (en) 2013-03-01 2015-09-01 Google Inc. Detecting the end of a user question
US9892729B2 (en) 2013-05-07 2018-02-13 Qualcomm Incorporated Method and apparatus for controlling voice activation
US9240182B2 (en) 2013-09-17 2016-01-19 Qualcomm Incorporated Method and apparatus for adjusting detection threshold for activating voice assistant function

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07312795A (ja) * 1994-05-18 1995-11-28 Nagano Japan Radio Co 音声信号処理方法及び装置
JPH10288994A (ja) * 1997-04-15 1998-10-27 Mitsubishi Electric Corp 雑音レベル推定方法、音声区間検出方法、音声認識方法、音声区間検出装置及び音声認識装置
JP2004271736A (ja) * 2003-03-06 2004-09-30 Sony Corp 情報検出装置及び方法、並びにプログラム
JP2005165021A (ja) * 2003-12-03 2005-06-23 Fujitsu Ltd 雑音低減装置、および低減方法
JP2006039447A (ja) * 2004-07-30 2006-02-09 Nissan Motor Co Ltd 音声入力装置
JP2006038894A (ja) * 2004-07-22 2006-02-09 Sony Corp ロボット制御装置および方法、記録媒体、並びにプログラム

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3370423D1 (en) * 1983-06-07 1987-04-23 Ibm Process for activity detection in a voice transmission system
US5210366A (en) * 1991-06-10 1993-05-11 Sykes Jr Richard O Method and device for detecting and separating voices in a complex musical composition
JPH05249987A (ja) 1992-03-09 1993-09-28 Matsushita Electric Ind Co Ltd 音声検出方法および音声検出装置
KR960700602A (ko) * 1993-01-14 1996-01-20 세이버리 그레도빌레 전화 네트워크 수행 모니터 방법 및 시스템(telephone network performance monitoring method and system)
US5572591A (en) * 1993-03-09 1996-11-05 Matsushita Electric Industrial Co., Ltd. Sound field controller
JP2955247B2 (ja) * 1997-03-14 1999-10-04 日本放送協会 話速変換方法およびその装置
US5867574A (en) * 1997-05-19 1999-02-02 Lucent Technologies Inc. Voice activity detection system and method
US5991718A (en) * 1998-02-27 1999-11-23 At&T Corp. System and method for noise threshold adaptation for voice activity detection in nonstationary noise environments
US6539355B1 (en) * 1998-10-15 2003-03-25 Sony Corporation Signal band expanding method and apparatus and signal synthesis method and apparatus
US6256606B1 (en) * 1998-11-30 2001-07-03 Conexant Systems, Inc. Silence description coding for multi-rate speech codecs
US6862567B1 (en) * 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters
GB2391373A (en) * 2002-07-31 2004-02-04 David Toms A system for the automatic detection of a fraudulent transaction
US7024353B2 (en) * 2002-08-09 2006-04-04 Motorola, Inc. Distributed speech recognition with back-end voice activity detection apparatus and method
KR100463657B1 (ko) * 2002-11-30 2004-12-29 삼성전자주식회사 음성구간 검출 장치 및 방법
JP2005242182A (ja) 2004-02-27 2005-09-08 Toshiba Corp 音声検出装置、音声認識装置、音声検出方法および音声認識方法
KR100657912B1 (ko) * 2004-11-18 2006-12-14 삼성전자주식회사 잡음 제거 방법 및 장치
EP1681670A1 (en) * 2005-01-14 2006-07-19 Dialog Semiconductor GmbH Voice activation
US7464029B2 (en) * 2005-07-22 2008-12-09 Qualcomm Incorporated Robust separation of speech signals in a noisy environment
US8559646B2 (en) * 2006-12-14 2013-10-15 William G. Gardner Spatial audio teleconferencing

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07312795A (ja) * 1994-05-18 1995-11-28 Nagano Japan Radio Co 音声信号処理方法及び装置
JPH10288994A (ja) * 1997-04-15 1998-10-27 Mitsubishi Electric Corp 雑音レベル推定方法、音声区間検出方法、音声認識方法、音声区間検出装置及び音声認識装置
JP2004271736A (ja) * 2003-03-06 2004-09-30 Sony Corp 情報検出装置及び方法、並びにプログラム
JP2005165021A (ja) * 2003-12-03 2005-06-23 Fujitsu Ltd 雑音低減装置、および低減方法
JP2006038894A (ja) * 2004-07-22 2006-02-09 Sony Corp ロボット制御装置および方法、記録媒体、並びにプログラム
JP2006039447A (ja) * 2004-07-30 2006-02-09 Nissan Motor Co Ltd 音声入力装置

Also Published As

Publication number Publication date
JP5229217B2 (ja) 2013-07-03
JPWO2008108239A1 (ja) 2010-06-10
US20100106495A1 (en) 2010-04-29
US8417518B2 (en) 2013-04-09

Similar Documents

Publication Publication Date Title
ATE457511T1 (de) Sprechererkennung
WO2010129056A3 (en) System and method for speech processing and speech to text
EP4462424A3 (en) Method and apparatus for activating application by speech input
WO2010013754A1 (ja) オーディオ信号処理装置、オーディオ信号処理システム、およびオーディオ信号処理方法
WO2009051132A1 (ja) 信号処理システムと、その装置、方法及びそのプログラム
WO2007044763A3 (en) System and method for detecting fraudulent transactions
WO2016028628A3 (en) System and method for speech validation
WO2009141828A3 (en) A method and a system for processing signals
WO2010090427A3 (ko) 오디오 신호의 부호화 및 복호화 방법 및 그 장치
WO2010087614A3 (ko) 오디오 신호의 부호화 및 복호화 방법 및 그 장치
EP2339576A3 (en) Multi-modal input on an electronic device
WO2009004750A1 (ja) 音声認識装置
WO2006126843A3 (en) Method and apparatus for decoding audio signal
DK2200342T3 (da) Høreapparat styret ved hjælp af et signal fra en hjernepotentialsvingning
WO2007049863A3 (en) Removing time delays in signal paths
WO2010117712A3 (en) Systems and methods for measuring speech intelligibility
WO2010117666A3 (en) Methods and apparatus to limit a change of a drive value in an electro-pneumatic controller
WO2009147062A3 (de) Detektionssystem zur annäherungserkennung
TN2009000546A1 (en) Method for electronically analysing a dialogue and corresponding systems
WO2008126916A1 (ja) 音源分離装置および音源分離方法
WO2008066623A3 (en) Biometric garment and method of operation
WO2010067976A3 (ko) 신호 분리 방법, 상기 신호 분리 방법을 이용한 통신 시스템 및 음성인식시스템
WO2009143225A3 (en) Multiple-mode location determining methods and systems
EP1647972A3 (de) Verbesserung der Verständlichkeit von Sprache enthaltenden Audiosignalen
WO2011083979A3 (en) An apparatus for processing an audio signal and method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08720916

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2009502537

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 12528767

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08720916

Country of ref document: EP

Kind code of ref document: A1