WO2015146017A1 - 音声検索装置、音声検索方法、および表示装置 - Google Patents
音声検索装置、音声検索方法、および表示装置 Download PDFInfo
- Publication number
- WO2015146017A1 WO2015146017A1 PCT/JP2015/001257 JP2015001257W WO2015146017A1 WO 2015146017 A1 WO2015146017 A1 WO 2015146017A1 JP 2015001257 W JP2015001257 W JP 2015001257W WO 2015146017 A1 WO2015146017 A1 WO 2015146017A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- search
- url
- unit
- character string
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9566—URL specific, e.g. using aliases, detecting broken or misspelled links
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Definitions
- Patent Document 2 discloses an information search method for performing information search using a table (correspondence table) that associates keywords with categories.
- a table correlates keywords with categories.
- this information retrieval method when the voice uttered by the user can be classified into any of the categories registered in the table, the voice uttered by the user is recognized by voice, and the keyword included in the resulting character string is used. Search within a category is possible.
- the voice search device includes a web page display control unit, a voice recognition unit, a matching unit, and a URL creation unit.
- the web page display control unit controls the web page displayed on the display unit.
- the voice recognition unit recognizes the input voice, converts it into character string information, and outputs it.
- the matching unit refers to the URL list in which a character string representing a part of the URL is registered for each search category, and performs a matching process between the URL of the web page displayed on the display unit and the character string of the URL list.
- a character string of the URL list corresponding to the URL is specified.
- the URL creation unit creates a URL based on the character string specified by the matching unit and the character string information.
- FIG. 1 is a diagram schematically showing a speech recognition system 11 in the first embodiment.
- a voice search device is built in the television 10 which is an example of a display device.
- a voice recognition device included in a voice search device built in the television 10 recognizes a voice uttered by a user and controls the television 10 according to a result of the voice recognition.
- FIG. 2 is a block diagram illustrating a configuration example of the speech recognition system 11 according to the first embodiment.
- the remote controller 20 and the portable terminal 30 are omitted.
- the television 10 includes a voice search device 200, a tuner 190, a display control unit 108, a microphone 130, a display unit 140, a storage unit 181, an operation reception unit 110, and a transmission / reception unit 150.
- the television 10 may include a wireless communication unit (not shown) that performs wireless communication with the remote controller 20 or the mobile terminal 30.
- the remote controller 20 shown in FIG. 1 is a controller for the user 700 to remotely operate the television 10.
- the remote controller 20 includes a microphone and an input unit in addition to a general configuration necessary for remote operation of the television 10.
- the microphone is configured to collect a sound uttered by the user 700 and output a sound signal.
- the input unit is configured to accept an input operation manually performed by the user 700 and output an input signal corresponding to the input operation.
- the input unit is, for example, a touch pad, but may be a keyboard, a button, or the like.
- An audio signal generated by the sound collected by the microphone or an input signal generated when the user 700 performs an input operation on the input unit is wirelessly transmitted to the television 10 by, for example, infrared rays or radio waves.
- the mobile terminal 30 shown in FIG. 1 is a smartphone, for example, and can operate software for remotely operating the television 10. Therefore, in the speech recognition system 11 in the present embodiment, the mobile terminal 30 on which the software is operating can be used for remote operation of the television 10.
- the portable terminal 30 has a microphone and an input unit. Similar to the microphone provided in the remote controller 20, the microphone built in the portable terminal 30 is configured to collect the voice uttered by the user 700 and output the voice signal.
- the input unit is configured to accept an input operation manually performed by the user 700 and output an input signal corresponding to the input operation.
- the input unit is, for example, a touch panel, but may be a keyboard, a button, or the like.
- the voice recognition unit 50 is a server (for example, a server on the cloud) connected to the television 10 via the network 40.
- the voice recognition unit 50 receives voice information transmitted from the television 10 and converts the received voice information into a character string.
- the character string may be a plurality of characters or a single character.
- the voice recognition unit 50 transmits character string information (text information) indicating the converted character string to the television 10 via the network 40 as a result of the voice recognition.
- the search site 70 is a website server that is connected to the network 40 and provides a search service on the network 40.
- the search site 70 may have one server or a plurality of servers.
- the server of the search site 70 is configured to provide the following services.
- a user 700 performs a user operation for the purpose of search on a terminal (for example, the television 10) connected to the network 40, a query string based on the user operation and a character string indicating a path are displayed on the web browser.
- URL bar Uniform Resource Locator bar
- the URL bar includes information on a character string based on a user operation and information for designating a specific server (for example, a server that performs a search).
- the server of the search site 70 designated by the URL bar receives the character string information of the URL bar, and performs a search based on the character string information. Then, the server returns information indicating the search result (information indicating the web page obtained as a result of the search) to the terminal.
- the web browser control unit 170 of the television 10 is configured to perform the above-described exchange with each server of the search site 70.
- the URL server 60 is a server connected to the television 10 via the network 40.
- the URL server 60 stores a search URL list.
- the search URL list is a list in which character string templates (hereinafter referred to as “search URL templates”) set in the URL bar of the web browser are associated with search target categories (hereinafter referred to as “search categories”). It is a table (list).
- the search category is a search range when searching with free words.
- This search URL list includes a communication method (for example, “http” (Hyper Text Transfer Protocol)), each server name of the search site 70 (for example, “search.co.jp”, etc.), a search category (for example, “images”). And the like are registered as a search URL template. Details of the search URL list will be described later.
- the search URL list of the URL server 60 is configured to be updatable.
- the URL server 60 can update the search URL list regularly or irregularly by collecting information from the network 40 or the like.
- the television 10 is configured to be able to acquire a search URL list from the URL server 60 via the network 40. Accordingly, the television 10 can appropriately update the search URL list D500 stored in the storage unit 180 of the television 10.
- the voice recognition device 100 of the television 10 includes a voice acquisition unit 101, a voice processing unit 102, a recognition result acquisition unit 103, an intention interpretation processing unit 104, a reserved word processing unit 106, and a storage unit 180.
- the reserved word table D400 includes “image”, “video”, “program”, “Web”, “news”, “map”, “restaurant”, “gas station” as reserved words. , “Forward”, “return”, “search”, etc. are registered, and “search”, “command”, etc. are registered as attributes of reserved words.
- words used for instructions to the television 10 hereinafter referred to as “instruction information” are classified as “commands”, and words used for specifying a search category are classified as “search”. Yes.
- the audio processing unit 102 collates the audio information with the reserved word table D400, and the reserved word included in the audio information is “image”.
- Reserved word information indicating that the attribute is “search” is output to the recognition result acquisition unit 103.
- the reserved word processing unit 106 When the reserved word processing unit 106 acquires information from the intention interpretation processing unit 104, the reserved word processing unit 106 is configured to confirm whether or not the information includes predetermined information.
- the predetermined information is information indicating “reserved word whose attribute is“ search ””.
- the web browser control unit 170 in the present embodiment switches the subsequent processing depending on whether or not the information output from the intention interpretation processing unit 104 includes a reserved word whose attribute is “search”. It is configured. Specifically, when the reserved word processing unit 106 determines that the reserved word whose attribute is “search” is included in the information output from the intention interpretation processing unit 104, the search by the web browser control unit 170.
- the URL creation unit 171 executes processing.
- the search URL creation unit 171 is an example of a “URL creation unit”. If the information output from the intention interpretation processing unit 104 includes a free word and a reserved word whose attribute is “search”, the search URL creating unit 171 stores the information in the storage unit 180 based on the reserved word.
- the stored search URL list D500 is referred to.
- the search URL matching processing unit 172 is an example of a “matching unit”.
- the search URL matching processing unit 172 is configured to execute the following processing when the information output from the reserved word processing unit 106 does not include a reserved word whose attribute is “search”.
- the search URL matching processing unit 172 sequentially reads the search URL templates from the search URL list D500, and checks whether the character string of the read search URL template is included in the URL acquired from the URL bar display control unit 173. Specifically, it is checked whether or not there is a character string that matches the character string of the read search URL template in the character string of the URL acquired from the URL bar display control unit 173. If there is a search URL template that is determined to have a matching character string in the search URL list D500, the search URL matching processing unit 172 determines that the matching process has been successful. Then, the search URL matching processing unit 172 reads the search URL template, and adds the free word character string output from the reserved word processing unit 106 to the end of the character string of the read search URL template. In this way, the search URL matching processing unit 172 creates a new URL (that is, a search URL) used for the search process.
- a new URL that is, a search URL
- the reserved word recognition processing and the free word recognition processing are performed as the speech recognition processing, and the processing based on the results is performed by the television 10.
- search command a command associated with a search application (hereinafter referred to as “search application”) set in advance
- search application a search application
- the television 10 may use the search application. It is configured to perform a free word search.
- the voice recognition unit 50 receives all voices (voice information) acquired by the voice acquisition unit 101 from the television 10, performs dictation on all the voice information, and results thereof. Are transmitted to the television 10.
- the voice processing unit 102 of the television 10 may be configured to transmit voice information other than the reserved words recognized by the reserved word table D400 to the voice recognition unit 50.
- the Internet search application of the web browser shown as an example in FIG. 5 may be activated based on the above-described reserved word recognition processing, or may be activated based on a user operation using the remote controller 20 or the like.
- the television 10 Upon receiving the character string information from the voice recognition unit 50, the television 10 operates the recognition result acquisition unit 103, the intention interpretation processing unit 104, the reserved word processing unit 106, the display control unit 108, and the like based on the character string information.
- the character string corresponding to the character string information is displayed in the input field 203. For example, when the television 10 receives character string information indicating a character string “ABC” from the voice recognition unit 50, the television 10 displays a character string “ABC” in the input field 203.
- the voice information output from the voice acquisition unit 101 includes voice information based on the word “search” issued by the user 700, and the reserved word table D 400 has the attribute “command” as a reserved word “ If “search” is registered, the voice processing unit 102 determines that the voice information includes the word “search” which is a reserved word having the attribute “command”.
- the voice recognition unit 50 compares the database in which free words and character strings are associated with the received voice information. If a free word registered in the database is included in the received voice information, a character string (including one character) corresponding to the free word is selected.
- the free words in the voice recognition unit 50 may include reserved words. In this way, the voice recognition unit 50 converts the received voice information into a character string. For example, if the voice recognition unit 50 receives voice information based on the voice “search for ABC image” uttered by the user 700, the voice recognition unit 50 converts the voice information into “ABC”, “NO”, “ It is converted into character strings “image”, “to”, and “search”.
- the voice recognition unit 50 transmits character string information (text information) representing each converted character string to the television 10 via the network 40 (step S802).
- the recognition result acquisition unit 103 of the television 10 includes reserved word information (the reserved word information includes reserved words and attributes) output as a result of the voice recognition from the voice processing unit 102, and the voice recognition unit 50.
- the character string information transmitted as a result of voice recognition from (the character string information includes free words and reserved words) is acquired and output to the intention interpretation processing unit 104.
- the intention interpretation processing unit 104 removes a reserved word from character string information, for example, by collating character string information with reserved word information. Note that the intention interpretation processing unit 104 compares the reserved word table D400 shown in FIG. 3 with the character string information, and if the reserved word registered in the reserved word table D400 is included in the character string information. The word may be removed from the character string information as a reserved word.
- the intention interpretation processing unit 104 selects free words by removing character strings such as prepositions and particles that do not correspond to free words from the remaining character string information. Therefore, the free word in the intention interpretation processing unit 104 does not include a reserved word.
- step S702 If it is determined in step S702 that the information output from the intention interpretation processing unit 104 includes a reserved word whose attribute is “search” (Yes), the process proceeds to step S704. Then, a search URL based on the reserved words and free words included in the information output from the intention interpretation processing unit 104 is created by the search URL creation unit 171 of the web browser control unit 170 (step S704).
- step S702 For example, if the words “ABC” and “image” are included in the information output from the intention interpretation processing unit 104, “image” is stored in the reserved word table D400 as a reserved word whose attribute is “search”. Since it is registered, “Yes” is determined in step S702.
- step S702 If it is determined in step S702 that the information output from the intention interpretation processing unit 104 does not include a reserved word whose attribute is “search” (No), the process proceeds to step S703. Then, the search URL matching processing unit 172 of the web browser control unit 170 executes processing for determining a search category.
- search category determination processing the processing performed in step S703 is also referred to as “search category determination processing”.
- the search category indicates a search target range, and corresponds to a reserved word in the search URL list D500 of FIG. 4 in the present embodiment.
- step S703 search category determination processing is performed. Specifically, the search URL matching processing unit 172 matches the search URL list D500 with the URL managed by the URL bar display control unit 173 (URL related to the web page displayed on the display unit 140). (Character string matching processing) is performed, and a search category is determined based on the result (step S703).
- the search URL matching processing unit 172 determines whether the character string A includes a character string that matches the character string B (step S904).
- the search URL matching processing unit 172 compares the variable n with the number of search categories (number of search URL templates) registered in the search URL list D500 (step S907).
- step S907 when it is determined that the variable n is larger than the number of search categories registered in the search URL list D500 (Yes), the search URL matching processing unit 172 determines a search category set in advance (hereinafter, referred to as “search category”). “Default search category”) is set as a return value (step S908).
- search category determination process described above will be described with a specific example.
- the continuation of the above-described specific example when “No” is determined in step S702 will be described as a specific example of the search category determination process.
- the map web page is displayed on the web browser by the search performed by the user 700 immediately before. Therefore, the search URL matching processing unit 172 sets the character string of the URL for displaying the web page of the map as the character string A in step S902.
- co. jp / maps? q Is set to the character string B, the matching process is successful. That is, it is determined that the character string that matches the character string B is included in the character string A. Therefore, in step S905, the search URL matching processing unit 172 sets the search category “map” as a return value, and outputs the return value to the search URL creation unit 171.
- the user 700 immediately utters or inputs “search a map with ABC”, searches for the free word “ABC” in the search category “map”, and then newly adds the free word “XYZ”.
- search category “map” is searched with “”, there is no need to say “search map with XYZ”.
- the user 700 only has to say “XYZ”.
- the search URL is created so that the search for the free word “XYZ” is performed in the same search category “map” as the search category searched immediately before. That is, the user 700 can execute the same search process as when the user “utters a map using XYZ” by simply speaking “XYZ”.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
Description
以下、図1~図9を用いて、実施の形態1を説明する。なお、本実施の形態では、音声検索装置を備えた表示装置の一例としてテレビジョン受像機(テレビ)10を挙げているが、表示装置は何らテレビ10に限定されるものではない。例えば、PCやタブレット端末等であってもよい。
図1は、実施の形態1における音声認識システム11を概略的に示す図である。本実施の形態では、表示装置の一例であるテレビ10に音声検索装置が内蔵されている。テレビ10に内蔵された音声検索装置が有する音声認識装置は、ユーザが発した音声を認識し、その音声認識の結果に応じてテレビ10の制御を行う。
http://search.co.jp/images?q=
である。この検索URLテンプレートにおいて、「http」は通信方法を示し、「search.co.jp」は検索を依頼する検索サイト70のサーバ名を示し、「images」は検索されるカテゴリが「画像」であることを示している。また、「?q=」の後にはフリーワードが付加される。このように、検索URLリストD500には、予約語毎に、互いに異なる検索URLテンプレートが登録されている。
http://search.co.jp/images?q=ABC
という文字列が作成される。この文字列が、検索処理に用いる新たなURL(すなわち、検索URL)となる。
http://search.co.jp/maps?q=ABC
あるいは、表示部140に、フリーワード「ABC」でのレストラン検索のウェブページが表示されているときには、以下のURLが、URLバー表示制御部173で管理されている。
http://web-navi.co.jp/search?category=restaurant&query=ABC
そして、検索URLマッチング処理部172は、URLバー表示制御部173から取得したURLと、記憶部180に記憶されている検索URLリストD500の検索URLテンプレートとの照合(すなわち、マッチング処理)を行う。
次に、本実施の形態におけるテレビ10の音声検索装置200の動作について説明する。
http://search.co.jp/images?q=ABC
という文字列の検索URLが作成される。
http://search.co.jp/web?q=
である。n=2のときの検索カテゴリは、検索URLリストD500の2行目の「画像」であり、その検索URLテンプレートは、
http://search.co.jp/images?q=
である。n=9のときの検索カテゴリは、検索URLリストD500の9行目の「ガソリンスタンド」であり、その検索URLテンプレートは、
http://web-navi.co.jp/search?category=gas-station&query=
である。このように、検索URLマッチング処理部172は、変数nにもとづき、検索URLリストD500から検索URLテンプレートを読み出す。以下、変数nに対応する検索URLテンプレートの文字列を文字列Bとする。
http://search.co.jp/images?q=
であり、文字列Aが
http://search.co.jp/images?q=ABC
であれば、文字列Bに合致する文字列が文字列Aの中に含まれているので、ステップS904では「Yes」の判定がなされる。
http://search.co.jp/maps?q=
である。仮に、フリーワード「ABC」で検索カテゴリ「地図」が検索されていれば、文字列Aに設定される文字列は、例えば、
http://search.co.jp/maps?q=ABC
となる。
http://search.co.jp/maps?q=
が文字列Bに設定されたときに、マッチング処理が成功する。すなわち、文字列Bに合致する文字列が文字列Aに含まれている、と判断される。したがって、検索URLマッチング処理部172は、ステップS905において、検索カテゴリ「地図」を返り値に設定し、その返り値を検索URL作成部171に出力する。
http://search.co.jp/maps?q=
そして、検索URL作成部171は、読み出した検索URLテンプレートと、意図解釈処理部104から出力される情報に含まれるフリーワード「XYZ」と、にもとづき以下の検索URLを作成する。
http://search.co.jp/maps?q=XYZ
URLバー表示制御部173は、ステップS705において、検索URL作成部171で作成された上述の検索URLを、ウェブブラウザのURLバー301に入力する。この検索の結果として検索サイト70から得られるウェブページの情報(すなわち、検索カテゴリ「地図」をフリーワード「XYZ」で検索した結果)は、ウェブページ表示制御部174によってウェブブラウザに読み込まれる。そして、そのウェブページの情報は、表示制御部108により、表示部140に表示中のウェブブラウザ上に表示される。
以上のように、本実施の形態において、音声検索装置は、ウェブページ表示制御部と、音声認識部と、マッチング部と、URL作成部と、を備える。ウェブページ表示制御部は、表示部に表示するウェブページを制御する。音声認識部は、入力された音声を認識して文字列情報に変換して出力する。マッチング部は、URLの一部分を表す文字列が検索カテゴリ毎に登録されたURLリストを参照し、表示部に表示しているウェブページのURLとURLリストの文字列とのマッチング処理を行い、そのURLに対応するURLリストの文字列を特定する。URL作成部は、マッチング部が特定する文字列と、文字列情報と、にもとづきURLを作成する。
以上のように、本出願において開示する技術の例示として、実施の形態1を説明した。しかしながら、本開示における技術は、これに限定されず、変更、置き換え、付加、省略等を行った実施の形態にも適用できる。また、上記実施の形態1で説明した各構成要素を組み合わせて、新たな実施の形態とすることも可能である。
http://web-navi.co.jp/search?category=gas-station&query=
この検索URLテンプレートからは、URLスキームの区切りの「:」やクエリストリングの区切りの「?」を利用して、ドメイン名「web-navi.co.jp」と、クエリストリング「category=gas-station」と、を抽出することができる。したがって、検索URLマッチング処理部172は、このドメイン名の文字列が、URLバー表示制御部173が管理するURLのドメイン名の文字列の中に含まれ、かつ、このクエリストリングの文字列が、URLバー表示制御部173が管理するURLのクエリストリングの文字列に含まれていたときに、マッチング処理に成功したという判断を下す。なお、検索URLマッチング処理部172は、ドメイン名とクエリストリングを用いた上述のマッチング処理以外にも、ドメイン名とパスを用いたマッチング処理や、バスとクエリストリングを用いたマッチング処理を行うように構成することも可能である。
11 音声認識システム
20 リモートコントローラ
30 携帯端末
40 ネットワーク
50 音声認識部
60 URLサーバ
70 検索サイト
100 音声認識装置
101 音声取得部
102 音声処理部
103 認識結果取得部
104 意図解釈処理部
106 予約語処理部
108 表示制御部
110 操作受付部
130 マイク
140 表示部
150 送受信部
170 ウェブブラウザ制御部
171 検索URL作成部
172 検索URLマッチング処理部
173 URLバー表示制御部
174 ウェブページ表示制御部
180,181 記憶部
190 チューナ
200 音声検索装置
201 音声認識アイコン
202 インジケータ
203 入力欄
301 URLバー
302 検索バー
303 ウェブページ
304 ボタン
305 検索用文字列入力エリア
306 検索結果
307 ログイン用文字列入力エリア
308 広告
309 検索用文字列入力エリア
700 ユーザ
D400 予約語テーブル
D500 検索URLリスト
Claims (3)
- 表示部に表示するウェブページを制御するウェブページ表示制御部と、
入力された音声を認識して文字列情報に変換して出力する音声認識部と、
URLの一部分を表す文字列が検索カテゴリ毎に登録されたURLリストを参照し、前記表示部に表示しているウェブページのURLと前記URLリストの前記文字列とのマッチング処理を行い、前記URLに対応する前記URLリストの前記文字列を特定するマッチング部と、
前記マッチング部が特定する前記文字列と、前記文字列情報と、にもとづきURLを作成するURL作成部と、
を備えた音声検索装置。 - 表示部にウェブページを表示するステップと、
入力された音声を認識して文字列情報に変換するステップと、
URLの一部分を表す文字列が検索カテゴリ毎に登録されたURLリストを参照し、前記表示部に表示しているウェブページのURLと前記URLリストの前記文字列とのマッチング処理を行い、前記URLに対応する前記URLリストの前記文字列を特定するステップと、
特定された前記文字列と、前記文字列情報と、にもとづきURLを作成するステップと、
を備えた音声検索方法。 - 表示部と、
前記表示部に表示するウェブページを制御するウェブページ表示制御部と、
入力された音声を認識して文字列情報に変換して出力する音声認識部と、
URLの一部分を表す文字列が検索カテゴリ毎に登録されたURLリストを参照し、前記表示部に表示しているウェブページのURLと前記URLリストの前記文字列とのマッチング処理を行い、前記URLに対応する前記URLリストの前記文字列を特定するマッチング部と、
前記マッチング部が特定する前記文字列と、前記文字列情報と、にもとづきURLを作成するURL作成部と、
を備えた表示装置。
Priority Applications (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP15769934.9A EP3125134B1 (en) | 2014-03-28 | 2015-03-09 | Speech retrieval device, speech retrieval method, and display device |
| CN201580006319.8A CN105940399B (zh) | 2014-03-28 | 2015-03-09 | 声音检索装置、声音检索方法以及显示装置 |
| US15/110,733 US9928030B2 (en) | 2014-03-28 | 2015-03-09 | Speech retrieval device, speech retrieval method, and display device |
| JP2016509973A JP6375521B2 (ja) | 2014-03-28 | 2015-03-09 | 音声検索装置、音声検索方法、および表示装置 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2014-068183 | 2014-03-28 | ||
| JP2014068183 | 2014-03-28 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2015146017A1 true WO2015146017A1 (ja) | 2015-10-01 |
Family
ID=54194571
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2015/001257 Ceased WO2015146017A1 (ja) | 2014-03-28 | 2015-03-09 | 音声検索装置、音声検索方法、および表示装置 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US9928030B2 (ja) |
| EP (1) | EP3125134B1 (ja) |
| JP (1) | JP6375521B2 (ja) |
| CN (1) | CN105940399B (ja) |
| WO (1) | WO2015146017A1 (ja) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017134935A1 (ja) * | 2016-02-03 | 2017-08-10 | ソニー株式会社 | 情報処理装置、情報処理方法、及びプログラム |
| WO2017195388A1 (ja) * | 2016-05-12 | 2017-11-16 | ソニー株式会社 | 情報処理装置、情報処理方法およびプログラム |
| CN107993650A (zh) * | 2017-11-30 | 2018-05-04 | 百度在线网络技术(北京)有限公司 | 用于生成信息的方法和装置 |
| JP2020087262A (ja) * | 2018-11-30 | 2020-06-04 | 株式会社Nttぷらら | 情報提示システム、情報提示装置、情報提示方法及びコンピュータプログラム |
| WO2020129695A1 (ja) * | 2018-12-21 | 2020-06-25 | ソニー株式会社 | 情報処理装置、制御方法、情報処理端末、情報処理方法 |
| JP2022184102A (ja) * | 2021-05-31 | 2022-12-13 | サウンドハウンド,インコーポレイテッド | 情報提供方法、データベースの生成方法、およびプログラム |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7516190B2 (en) * | 2000-02-04 | 2009-04-07 | Parus Holdings, Inc. | Personal voice-based information retrieval system |
| EP3896596A1 (en) * | 2012-12-28 | 2021-10-20 | Saturn Licensing LLC | Information processing device, information processing method and program |
| US10149002B1 (en) * | 2016-03-21 | 2018-12-04 | Tribune Broadcasting Company, Llc | Systems and methods for retrieving content files |
| US20180018965A1 (en) * | 2016-07-12 | 2018-01-18 | Bose Corporation | Combining Gesture and Voice User Interfaces |
| US11907316B2 (en) * | 2016-08-15 | 2024-02-20 | Richard S. Brown | Processor-implemented method, computing system and computer program for invoking a search |
| KR102542766B1 (ko) * | 2016-11-17 | 2023-06-14 | 엘지전자 주식회사 | 디스플레이 장치 및 그의 동작 방법 |
| CN108986800B (zh) * | 2017-05-31 | 2022-08-09 | 上海智觅智能科技有限公司 | 一种基于多处理器协调链表逻辑实时离线语音交互系统及方法 |
| KR102661340B1 (ko) * | 2018-09-21 | 2024-04-30 | 삼성전자주식회사 | 전자 장치 및 이의 제어 방법 |
| CN110085232A (zh) * | 2019-03-13 | 2019-08-02 | 佛山市云米电器科技有限公司 | 一种基于语音的红外控制系统及其方法 |
| US11605378B2 (en) * | 2019-07-01 | 2023-03-14 | Lg Electronics Inc. | Intelligent gateway device and system including the same |
| US11645692B2 (en) * | 2019-11-20 | 2023-05-09 | Microsoft Technology Licensing, Llc | Generating training data for a computer-implemented ranker |
| CN114556353A (zh) * | 2019-12-16 | 2022-05-27 | 深圳市欢太科技有限公司 | 数据处理方法、装置、电子设备和存储介质 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001134580A (ja) * | 1999-11-02 | 2001-05-18 | Nippon Telegr & Teleph Corp <Ntt> | データベース管理システム,データベース管理システムにおける問い合わせ処理方法およびこの方法を実現するためのプログラムを格納した記録媒体 |
| JP2002055872A (ja) * | 2000-07-27 | 2002-02-20 | Internatl Business Mach Corp <Ibm> | ウェブコンテンツを簡略化するための方法、システムおよび媒体 |
| JP2006285855A (ja) * | 2005-04-04 | 2006-10-19 | Ntt Docomo Inc | 検索サーバ |
| JP2009009170A (ja) * | 2005-10-24 | 2009-01-15 | Advanced Media Inc | 情報検索システム及びサーバ装置 |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6192343B1 (en) | 1998-12-17 | 2001-02-20 | International Business Machines Corporation | Speech command input recognition system for interactive computer display with term weighting means used in interpreting potential commands from relevant speech terms |
| US8275617B1 (en) | 1998-12-17 | 2012-09-25 | Nuance Communications, Inc. | Speech command input recognition system for interactive computer display with interpretation of ancillary relevant speech query terms into commands |
| US6937984B1 (en) | 1998-12-17 | 2005-08-30 | International Business Machines Corporation | Speech command input recognition system for interactive computer display with speech controlled display of recognized commands |
| US7206747B1 (en) | 1998-12-16 | 2007-04-17 | International Business Machines Corporation | Speech command input recognition system for interactive computer display with means for concurrent and modeless distinguishing between speech commands and speech queries for locating commands |
| CN1176432C (zh) * | 1999-07-28 | 2004-11-17 | 国际商业机器公司 | 提供本国语言查询服务的方法和系统 |
| JP2001075968A (ja) | 1999-08-31 | 2001-03-23 | Toshiba Corp | 情報検索方法及びそれを記録した記録媒体 |
| CA2346231A1 (en) | 2000-05-08 | 2001-11-08 | Internet Number Corporation | Method and system for accessing information on a network using message aliasing functions having shadow callback functions |
| US6934675B2 (en) * | 2001-06-14 | 2005-08-23 | Stephen C. Glinski | Methods and systems for enabling speech-based internet searches |
| US8751240B2 (en) * | 2005-05-13 | 2014-06-10 | At&T Intellectual Property Ii, L.P. | Apparatus and method for forming search engine queries based on spoken utterances |
| US7672931B2 (en) * | 2005-06-30 | 2010-03-02 | Microsoft Corporation | Searching for content using voice search queries |
| US20140372119A1 (en) * | 2008-09-26 | 2014-12-18 | Google, Inc. | Compounded Text Segmentation |
| US8234111B2 (en) * | 2010-06-14 | 2012-07-31 | Google Inc. | Speech and noise models for speech recognition |
| CN101989299A (zh) * | 2010-08-30 | 2011-03-23 | 无锡滨达工业创意设计有限公司 | 基于互联网的语音搜索方法 |
| WO2012033820A2 (en) * | 2010-09-08 | 2012-03-15 | Nuance Communications, Inc. | Internet search related methods and apparatus |
| EP2518722A3 (en) * | 2011-04-28 | 2013-08-28 | Samsung Electronics Co., Ltd. | Method for providing link list and display apparatus applying the same |
| CN102629246B (zh) * | 2012-02-10 | 2017-06-27 | 百纳(武汉)信息技术有限公司 | 识别浏览器语音命令的服务器及浏览器语音命令识别方法 |
| CN103020165B (zh) * | 2012-11-26 | 2016-06-22 | 北京奇虎科技有限公司 | 可进行语音识别处理的浏览器及处理方法 |
| CN102968992B (zh) * | 2012-11-26 | 2014-11-05 | 北京奇虎科技有限公司 | 用于浏览器的语音识别处理方法和浏览器 |
| WO2014197336A1 (en) * | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
| US9646606B2 (en) * | 2013-07-03 | 2017-05-09 | Google Inc. | Speech recognition using domain knowledge |
| US9710566B2 (en) * | 2013-12-16 | 2017-07-18 | Google Inc. | User interface for webpage permission requests |
-
2015
- 2015-03-09 JP JP2016509973A patent/JP6375521B2/ja active Active
- 2015-03-09 CN CN201580006319.8A patent/CN105940399B/zh active Active
- 2015-03-09 US US15/110,733 patent/US9928030B2/en active Active
- 2015-03-09 EP EP15769934.9A patent/EP3125134B1/en active Active
- 2015-03-09 WO PCT/JP2015/001257 patent/WO2015146017A1/ja not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001134580A (ja) * | 1999-11-02 | 2001-05-18 | Nippon Telegr & Teleph Corp <Ntt> | データベース管理システム,データベース管理システムにおける問い合わせ処理方法およびこの方法を実現するためのプログラムを格納した記録媒体 |
| JP2002055872A (ja) * | 2000-07-27 | 2002-02-20 | Internatl Business Mach Corp <Ibm> | ウェブコンテンツを簡略化するための方法、システムおよび媒体 |
| JP2006285855A (ja) * | 2005-04-04 | 2006-10-19 | Ntt Docomo Inc | 検索サーバ |
| JP2009009170A (ja) * | 2005-10-24 | 2009-01-15 | Advanced Media Inc | 情報検索システム及びサーバ装置 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3125134A4 * |
Cited By (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017134935A1 (ja) * | 2016-02-03 | 2017-08-10 | ソニー株式会社 | 情報処理装置、情報処理方法、及びプログラム |
| WO2017195388A1 (ja) * | 2016-05-12 | 2017-11-16 | ソニー株式会社 | 情報処理装置、情報処理方法およびプログラム |
| US11195517B2 (en) | 2016-05-12 | 2021-12-07 | Sony Corporation | Information processing apparatus and information processing method |
| CN107993650A (zh) * | 2017-11-30 | 2018-05-04 | 百度在线网络技术(北京)有限公司 | 用于生成信息的方法和装置 |
| JP2020087262A (ja) * | 2018-11-30 | 2020-06-04 | 株式会社Nttぷらら | 情報提示システム、情報提示装置、情報提示方法及びコンピュータプログラム |
| WO2020129695A1 (ja) * | 2018-12-21 | 2020-06-25 | ソニー株式会社 | 情報処理装置、制御方法、情報処理端末、情報処理方法 |
| US12506920B2 (en) | 2018-12-21 | 2025-12-23 | Saturn Licensing Llc | Information processing apparatus, control method, information processing terminal, and information processing method |
| JP2022184102A (ja) * | 2021-05-31 | 2022-12-13 | サウンドハウンド,インコーポレイテッド | 情報提供方法、データベースの生成方法、およびプログラム |
| JP7705739B2 (ja) | 2021-05-31 | 2025-07-10 | サウンドハウンド,インコーポレイテッド | 情報提供方法、データベースの生成方法、およびプログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| US9928030B2 (en) | 2018-03-27 |
| CN105940399A (zh) | 2016-09-14 |
| CN105940399B (zh) | 2019-04-26 |
| JPWO2015146017A1 (ja) | 2017-04-13 |
| US20160328206A1 (en) | 2016-11-10 |
| JP6375521B2 (ja) | 2018-08-22 |
| EP3125134B1 (en) | 2018-08-15 |
| EP3125134A4 (en) | 2017-03-01 |
| EP3125134A1 (en) | 2017-02-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6375521B2 (ja) | 音声検索装置、音声検索方法、および表示装置 | |
| KR102304052B1 (ko) | 디스플레이 장치 및 그의 동작 방법 | |
| CN105659318B (zh) | 声音识别处理装置、声音识别处理方法以及显示装置 | |
| JP6603754B2 (ja) | 情報処理装置 | |
| US9230559B2 (en) | Server and method of controlling the same | |
| KR102210933B1 (ko) | 음성 신호에 따라 컨텐츠 정보를 검색하여 제공하는 디스플레이 장치, 서버 장치 및 이들을 포함하는 음성 입력 시스템과, 그 방법들 | |
| KR20140089836A (ko) | 대화형 서버, 디스플레이 장치 및 그 제어 방법 | |
| KR102775800B1 (ko) | 사용자의 발화를 기반으로 컨텐츠를 제공하는 장치 및 시스템 | |
| US20180276186A1 (en) | Computing device and corresponding method for generating data representing text | |
| CN109792502B (zh) | 信息处理设备、信息处理方法、存储介质和信息处理系统 | |
| US11706482B2 (en) | Display device | |
| KR20120083104A (ko) | 멀티미디어 장치의 음성인식을 통한 텍스트 입력 방법 및 그에 따른 멀티미디어 장치 | |
| KR102613623B1 (ko) | 미디어에 포함된 정보를 제공하는 장치 및 그 방법 | |
| US20180011834A1 (en) | Computing device and corresponding method for generating data representing text | |
| KR20160050723A (ko) | 문장 및 명령어의 출력 방법 및 그 장치 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15769934 Country of ref document: EP Kind code of ref document: A1 |
|
| REEP | Request for entry into the european phase |
Ref document number: 2015769934 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2015769934 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2016509973 Country of ref document: JP Kind code of ref document: A |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 15110733 Country of ref document: US |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |