[go: up one dir, main page]

WO2009028647A1 - 非対話型学習装置及び対話型学習装置 - Google Patents

非対話型学習装置及び対話型学習装置 Download PDF

Info

Publication number
WO2009028647A1
WO2009028647A1 PCT/JP2008/065498 JP2008065498W WO2009028647A1 WO 2009028647 A1 WO2009028647 A1 WO 2009028647A1 JP 2008065498 W JP2008065498 W JP 2008065498W WO 2009028647 A1 WO2009028647 A1 WO 2009028647A1
Authority
WO
WIPO (PCT)
Prior art keywords
dialogue
speech
mode
learning device
expert
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2008/065498
Other languages
English (en)
French (fr)
Inventor
Naoto Iwahashi
Noriyuki Kimura
Mikio Nakano
Kotaro Funakoshi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Honda Motor Co Ltd
National Institute of Information and Communications Technology
Original Assignee
Honda Motor Co Ltd
National Institute of Information and Communications Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Honda Motor Co Ltd, National Institute of Information and Communications Technology filed Critical Honda Motor Co Ltd
Priority to JP2009530194A priority Critical patent/JP5386692B2/ja
Priority to US12/675,381 priority patent/US8868410B2/en
Publication of WO2009028647A1 publication Critical patent/WO2009028647A1/ja
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/242Dictionaries
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Manipulator (AREA)

Abstract

 対話対象と対話を行う対話型学習装置において、音声を入力する音声入力装置(10)と、入力された音声を認識する音声認識部(20)と、音声認識結果に応じて対話行動を制御する対話行動制御部(30)と、を備え、対話行動制御部(30)が、発話内容の記憶及びマッチングを行うトピック認識エキスパート(34)と、モードの変更を管理するモード切換エキスパート(35)とを有し、モード切換エキスパート(35)が対話対象の発話に応じてモードの切換えを行い、第1のモードでは発話された複数の単語をトピックとして記録するとともに、記録された複数のトピックに対して、第2のモードで発話された内容をマッチングして、最も尤度の高いトピックを選択する。
PCT/JP2008/065498 2007-08-31 2008-08-29 非対話型学習装置及び対話型学習装置 Ceased WO2009028647A1 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2009530194A JP5386692B2 (ja) 2007-08-31 2008-08-29 対話型学習装置
US12/675,381 US8868410B2 (en) 2007-08-31 2008-08-29 Non-dialogue-based and dialogue-based learning apparatus by substituting for uttered words undefined in a dictionary with word-graphs comprising of words defined in the dictionary

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US96960207P 2007-08-31 2007-08-31
US60/969602 2007-08-31

Publications (1)

Publication Number Publication Date
WO2009028647A1 true WO2009028647A1 (ja) 2009-03-05

Family

ID=40387358

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/065498 Ceased WO2009028647A1 (ja) 2007-08-31 2008-08-29 非対話型学習装置及び対話型学習装置

Country Status (3)

Country Link
US (1) US8868410B2 (ja)
JP (1) JP5386692B2 (ja)
WO (1) WO2009028647A1 (ja)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010282199A (ja) * 2009-06-02 2010-12-16 Honda Motor Co Ltd 語彙獲得装置、マルチ対話行動システム及び語彙獲得プログラム
JP2012042952A (ja) * 2010-08-12 2012-03-01 Honda Motor Co Ltd 対話処理装置、対話処理方法、及び対話処理プログラム
WO2017199545A1 (ja) * 2016-05-18 2017-11-23 シャープ株式会社 応答制御装置、制御プログラム、情報処理方法、および通信システム
WO2021192794A1 (ja) 2020-03-25 2021-09-30 ソニーグループ株式会社 情報処理装置及び情報処理方法

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2962048A1 (fr) * 2010-07-02 2012-01-06 Aldebaran Robotics S A Robot humanoide joueur, methode et systeme d'utilisation dudit robot
WO2012132388A1 (ja) * 2011-03-28 2012-10-04 日本電気株式会社 テキスト分析装置、問題言動抽出方法および問題言動抽出プログラム
US9026446B2 (en) * 2011-06-10 2015-05-05 Morgan Fiumi System for generating captions for live video broadcasts
US20130257753A1 (en) * 2012-04-03 2013-10-03 Anirudh Sharma Modeling Actions Based on Speech and Touch Inputs
CN103065630B (zh) * 2012-12-28 2015-01-07 科大讯飞股份有限公司 用户个性化信息语音识别方法及系统
CN103077165A (zh) * 2012-12-31 2013-05-01 威盛电子股份有限公司 自然语言对话方法及其系统
US20150058006A1 (en) * 2013-08-23 2015-02-26 Xerox Corporation Phonetic alignment for user-agent dialogue recognition
TWI536366B (zh) * 2014-03-18 2016-06-01 財團法人工業技術研究院 新增口說語彙的語音辨識系統與方法及電腦可讀取媒體
EP2933067B1 (en) * 2014-04-17 2019-09-18 Softbank Robotics Europe Method of performing multi-modal dialogue between a humanoid robot and user, computer program product and humanoid robot for implementing said method
JP6667855B2 (ja) * 2016-05-20 2020-03-18 日本電信電話株式会社 取得方法、生成方法、それらのシステム、及びプログラム
WO2018117094A1 (ja) * 2016-12-20 2018-06-28 日本電信電話株式会社 音声認識結果リランキング装置、音声認識結果リランキング方法、プログラム
CN108108652B (zh) * 2017-03-29 2021-11-26 广东工业大学 一种基于字典学习的跨视角人体行为识别方法及装置
CN108235697B (zh) * 2017-09-12 2020-03-31 深圳前海达闼云端智能科技有限公司 一种机器人动态学习方法、系统、机器人以及云端服务器
US20190129591A1 (en) * 2017-10-26 2019-05-02 International Business Machines Corporation Dynamic system and method for content and topic based synchronization during presentations
CN107908801A (zh) * 2017-12-25 2018-04-13 广东小天才科技有限公司 一种基于语音的题目搜索方法及电子设备
KR102228866B1 (ko) * 2018-10-18 2021-03-17 엘지전자 주식회사 로봇 및 그의 제어 방법
EP4521393A4 (en) * 2022-11-11 2025-07-16 Samsung Electronics Co Ltd ELECTRONIC DEVICE FOR PERFORMING VOICE RECOGNITION AND CONTROL METHOD THEREOF

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003088209A1 (fr) * 2002-04-12 2003-10-23 Mitsubishi Denki Kabushiki Kaisha Systeme de navigation de voiture et dispositif de reconnaissance vocale de ce systeme
JP2004334591A (ja) * 2003-05-08 2004-11-25 Casio Comput Co Ltd 会話システム及び会話処理プログラム
JP2005257917A (ja) * 2004-03-10 2005-09-22 Nippon Telegr & Teleph Corp <Ntt> 音声解釈方法、音声解釈装置、音声解釈プログラム
JP2006243673A (ja) * 2005-03-07 2006-09-14 Canon Inc データ検索装置および方法

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5131043A (en) * 1983-09-05 1992-07-14 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for speech recognition wherein decisions are made based on phonemes
JPH03123257A (ja) 1989-10-06 1991-05-27 Nec Corp 留守番電話機
US5454062A (en) * 1991-03-27 1995-09-26 Audio Navigation Systems, Inc. Method for recognizing spoken words
US5390278A (en) * 1991-10-08 1995-02-14 Bell Canada Phoneme based speech recognition
DE19533541C1 (de) * 1995-09-11 1997-03-27 Daimler Benz Aerospace Ag Verfahren zur automatischen Steuerung eines oder mehrerer Geräte durch Sprachkommandos oder per Sprachdialog im Echtzeitbetrieb und Vorrichtung zum Ausführen des Verfahrens
JPH11224265A (ja) * 1998-02-06 1999-08-17 Pioneer Electron Corp 情報検索装置及び情報検索方法並びに情報検索プログラムを記録した記録媒体
US6965863B1 (en) * 1998-11-12 2005-11-15 Microsoft Corporation Speech recognition user interface
JP2000259645A (ja) 1999-03-05 2000-09-22 Fuji Xerox Co Ltd 音声処理装置及び音声データ検索装置
GB0011798D0 (en) * 2000-05-16 2000-07-05 Canon Kk Database annotation and retrieval
JP2002281145A (ja) 2001-03-15 2002-09-27 Denso Corp 電話番号入力装置
JP2002358095A (ja) * 2001-03-30 2002-12-13 Sony Corp 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体
JP4072718B2 (ja) * 2002-11-21 2008-04-09 ソニー株式会社 音声処理装置および方法、記録媒体並びにプログラム
JP4631251B2 (ja) 2003-05-06 2011-02-16 日本電気株式会社 メディア検索装置およびメディア検索プログラム
US20060069564A1 (en) * 2004-09-10 2006-03-30 Rightnow Technologies, Inc. Method of weighting speech recognition grammar responses using knowledge base usage data
JP2007013931A (ja) 2005-05-30 2007-01-18 Denso Corp 車載通信装置および車載通信装置用プログラム
US7538667B2 (en) * 2006-10-24 2009-05-26 Webtech Wireless Inc. Dynamically configurable wireless device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003088209A1 (fr) * 2002-04-12 2003-10-23 Mitsubishi Denki Kabushiki Kaisha Systeme de navigation de voiture et dispositif de reconnaissance vocale de ce systeme
JP2004334591A (ja) * 2003-05-08 2004-11-25 Casio Comput Co Ltd 会話システム及び会話処理プログラム
JP2005257917A (ja) * 2004-03-10 2005-09-22 Nippon Telegr & Teleph Corp <Ntt> 音声解釈方法、音声解釈装置、音声解釈プログラム
JP2006243673A (ja) * 2005-03-07 2006-09-14 Canon Inc データ検索装置および方法

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KATSUSHI ASAMI ET AL.: "Onsei interface no tameno hatsuwa o tan'i to shita wadai oyobi hatsuwa koi type no suitei", THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS,INFORMATION AND COMMUNICATION ENGINEERS, vol. J87-D-II, no. 2, 1 February 2004 (2004-02-01), pages 436 - 446 *
MAKOTO IWAYAMA,TAKENOBU TOKUNAGA: "A probabilistic model for text categorization:based on a single random variable with multiple values", THE PROCEEDINGS OF THE FOURTH CONFERENCE ON APPLIED NATURAL LANGUAGE PROCESSING, 13 October 1994 (1994-10-13), pages 162 - 167 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010282199A (ja) * 2009-06-02 2010-12-16 Honda Motor Co Ltd 語彙獲得装置、マルチ対話行動システム及び語彙獲得プログラム
JP2012042952A (ja) * 2010-08-12 2012-03-01 Honda Motor Co Ltd 対話処理装置、対話処理方法、及び対話処理プログラム
WO2017199545A1 (ja) * 2016-05-18 2017-11-23 シャープ株式会社 応答制御装置、制御プログラム、情報処理方法、および通信システム
WO2021192794A1 (ja) 2020-03-25 2021-09-30 ソニーグループ株式会社 情報処理装置及び情報処理方法

Also Published As

Publication number Publication date
US20100250241A1 (en) 2010-09-30
JP5386692B2 (ja) 2014-01-15
JPWO2009028647A1 (ja) 2010-12-02
US8868410B2 (en) 2014-10-21

Similar Documents

Publication Publication Date Title
WO2009028647A1 (ja) 非対話型学習装置及び対話型学習装置
WO2007095591A3 (en) Voice command interface device
WO2008144638A3 (en) Systems and methods of a structured grammar for a speech recognition command system
WO2008005711A3 (en) Non-enrolled continuous dictation
WO2008060834A3 (en) Method and system for a user interface using higher order commands
WO2006069381A3 (en) Turn-taking confidence
ATE489807T1 (de) Sprachfernbedienung
WO2008002365A3 (en) Speech recognition system and method with biometric user identification
WO2009051791A3 (en) Method and system for capturing voice files and rendering them searchable by keyword or phrase
WO2008108232A1 (ja) 音声認識装置、音声認識方法及び音声認識プログラム
WO2007139624A3 (en) Replacing text representing a concept with an alternate written form of the concept
WO2006039566A3 (en) Topical sentiments in electronically stored communications
WO2008067562A3 (en) Multimodal speech recognition system
WO2008115285A3 (en) Content selection using speech recognition
WO2002054033A3 (en) Hierarchical language models for speech recognition
EP1962475A3 (en) Voice interface to NFC applications
WO2007127411A3 (en) Methods and systems for opening and funding a financial account online
WO2010107202A3 (en) A refrigerator and method for controlling same
WO2007150005A3 (en) Automatic decision support
WO2010107196A3 (en) Refrigerator and method for controlling the same
WO2003005258A3 (en) Method of providing an account information and method of and device for transcribing of dictations
ATE445896T1 (de) Spracherkennungsverfahren das variationsinferenz mit veränderlichen zustandsraummodellen benuzt
WO2005100174A3 (en) Carton and carton blank with reinforced handle structure
WO2009127696A3 (de) Kältegerät mit schublade
MX2008002500A (es) Incorporacion de entrenamiento de voz en tutorial de usuario interactivo.

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08828455

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 12675381

Country of ref document: US

Ref document number: 2009530194

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08828455

Country of ref document: EP

Kind code of ref document: A1