CN103117058B - Based on Multi-voice engine switch system and the method for intelligent television platform - Google Patents

Based on Multi-voice engine switch system and the method for intelligent television platform

Info

Publication number
CN103117058B
Authority
CN
China
Prior art keywords
voice
engine
speech
module
response time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210558320.XA
Other languages
Chinese (zh)
Other versions
CN103117058A (en)
Inventor
陈冠霖
赵波
刘贤洪
杨金峰
毕端
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan Changhong Electric Co Ltd
Original Assignee
Sichuan Changhong Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan Changhong Electric Co Ltd filed Critical Sichuan Changhong Electric Co Ltd
Priority to CN201210558320.XA priority Critical patent/CN103117058B/en
Publication of CN103117058A publication Critical patent/CN103117058A/en
Application granted granted Critical
Publication of CN103117058B publication Critical patent/CN103117058B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention relates to smart TV software platforms and discloses a multi-speech-engine switching method based on a smart TV platform. The method automatically finds the speech engine with the highest current recognition efficiency and switches to it, improving the user's voice interaction experience. In summary: when the user runs a speech application and uses the speech recognition function, the speech engine selection module obtains the captured speech data through the speech application interface and sends it to every speech engine module; it then records and compares the response time each speech engine module takes to return its recognition result, and switches to the speech engine module with the shortest response time. The invention also discloses a corresponding switching system, suitable for implementing fast speech recognition on smart TVs.

Description

Multi-speech-engine switching system and method based on a smart TV platform

Technical Field

The invention relates to smart TV software platforms, and in particular to a multi-speech-engine switching system and method based on a smart TV platform.

Background Art

As TV terminals have become smarter and more networked, the content available on smart TVs has grown enormously and their functions have diversified, so controlling a TV has become more frequent and more complex. Applying speech recognition to smart TVs greatly simplifies user operation and improves the user experience. Because speech recognition consumes substantial system resources, smart TVs today generally implement it by connecting over the network to a cloud server.

On the server, the speech recognition engine that implements the recognition function consists of a speech detection module, a feature extraction module, and a recognition search module.

The speech detection module detects and preprocesses the speech signal. The TV feeds the raw speech data it has captured into this module, where the data is converted into a standard format (for example, 8 kHz, 16-bit); at the same time, an efficient signal detection algorithm determines the start and end points of the speech.

The feature extraction module receives the detected speech data stream and extracts from it a stream of feature vectors. Speech features are the information, extracted from the speech signal with digital signal processing techniques, that best reflects its essential properties. In this module the speech signal undergoes pre-emphasis, framing, windowing, frequency-domain transformation, cepstral transformation, differencing, and similar processing, finally yielding feature vectors of a few tens of dimensions.

The recognition search module matches the features of the unknown speech signal against the engine's acoustic model library, dictionary, and recognition grammar to find the word sequence that best fits those features. The process can be summarized as follows. By looking up the dictionary, a sentence can be decomposed from a word sequence into a phoneme sequence. Combining the phoneme sequence with the acoustic model yields the corresponding sequence of acoustic model units. The feature vectors of the original speech are then matched against the acoustic model unit sequences of all candidate sentences, a matching probability is computed for each, and the unit sequence with the maximum posterior probability is selected. The word sequence corresponding to that unit sequence is the text the engine returns to the TV.
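The first three feature extraction steps named above (pre-emphasis, framing, windowing) can be sketched as follows. This is only an illustration under stated assumptions: the patent gives no parameters, so the pre-emphasis coefficient 0.97, the 25 ms frame with a 10 ms hop, and the Hamming window are common textbook choices rather than values from the source, and the function names are hypothetical.

```python
import math

def pre_emphasize(samples, alpha=0.97):
    """Boost high frequencies: y[n] = x[n] - alpha * x[n-1] (alpha is an assumed value)."""
    if not samples:
        return []
    return [samples[0]] + [samples[n] - alpha * samples[n - 1]
                           for n in range(1, len(samples))]

def frame_signal(samples, frame_len, hop):
    """Split the signal into overlapping frames; a trailing partial frame is dropped."""
    return [samples[start:start + frame_len]
            for start in range(0, len(samples) - frame_len + 1, hop)]

def hamming(frame_len):
    """Hamming window coefficients (an assumed window type)."""
    if frame_len == 1:
        return [1.0]
    return [0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
            for n in range(frame_len)]

def window_frames(frames):
    """Multiply every frame by the window, sample by sample."""
    if not frames:
        return []
    window = hamming(len(frames[0]))
    return [[s * w for s, w in zip(frame, window)] for frame in frames]

# At the 8 kHz rate mentioned in the text, 25 ms = 200 samples and 10 ms = 80 samples.
SAMPLE_RATE = 8000
signal = [math.sin(2 * math.pi * 440 * n / SAMPLE_RATE) for n in range(800)]
frames = window_frames(frame_signal(pre_emphasize(signal), frame_len=200, hop=80))
print(len(frames), len(frames[0]))  # 8 frames of 200 samples each
```

The later stages (frequency-domain and cepstral transforms, differencing) would operate on each windowed frame and are omitted here.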

However, the server hosts multiple speech recognition engines. Relying on a single fixed engine does nothing to improve recognition efficiency on the smart TV and leads to a poor voice interaction experience. How to find the currently most efficient engine among several speech recognition engines and switch to it is therefore an urgent problem in voice interaction applications.

Summary of the Invention

The technical problem to be solved by the invention is to provide a multi-speech-engine switching system and method based on a smart TV platform that automatically finds the speech engine with the highest current recognition efficiency and switches to it, improving the user's voice interaction experience.

The solution adopted by the invention is a multi-speech-engine switching system based on a smart TV platform, comprising a speech engine selection module and at least two speech engine modules. All speech engine modules are encapsulated by a unified speech engine interface and are connected to the speech engine selection module through that interface. The speech engine selection module is connected to the speech application through a speech application interface.
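The layering described above might be modelled as in the sketch below. All class and method names (`SpeechEngine`, `EngineA`, `EngineB`, `EngineSelector`, `recognize`) are hypothetical, the two engines are stand-ins that do no real recognition, and defaulting to the first engine before any comparison has run is an assumption made only to keep the example small.

```python
from abc import ABC, abstractmethod

class SpeechEngine(ABC):
    """Unified speech engine interface: every engine is wrapped to look the same."""
    name: str

    @abstractmethod
    def recognize(self, audio: bytes) -> str:
        """Return the recognized text for the given audio."""

class EngineA(SpeechEngine):
    name = "engine-a"

    def recognize(self, audio: bytes) -> str:
        return f"[engine-a] {len(audio)} bytes"  # placeholder result

class EngineB(SpeechEngine):
    name = "engine-b"

    def recognize(self, audio: bytes) -> str:
        return f"[engine-b] {len(audio)} bytes"  # placeholder result

class EngineSelector:
    """Sits between the speech application and the engines."""

    def __init__(self, engines):
        if len(engines) < 2:
            raise ValueError("the system requires at least two speech engine modules")
        self._engines = list(engines)
        self._active = self._engines[0]  # assumption: default before any comparison

    def recognize(self, audio: bytes) -> str:
        """Speech application interface: the only method the application calls."""
        return self._active.recognize(audio)

selector = EngineSelector([EngineA(), EngineB()])
print(selector.recognize(b"abcd"))  # [engine-a] 4 bytes
```

Because the application only ever sees `EngineSelector.recognize`, engines can be added or swapped behind the unified interface without touching application code.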

Further, the speech engine module is configured to obtain from the speech engine interface the speech data sent by the speech engine selection module, recognize the speech data, and return the recognition result to the speech engine selection module. The speech engine selection module is configured, when the speech application uses the speech recognition function, to obtain the captured speech data through the speech application interface, send it to every speech engine module through the speech engine interface, receive the recognition results returned by all speech engine modules, record and compare the response time each speech engine module takes to return its result, and switch to the speech engine module with the shortest response time, so that the speech application calls the speech engine module with the highest recognition efficiency.

Further, switching to the speech engine module with the shortest response time means that the speech engine selection module connects, through the speech engine interface, to the speech engine module with the shortest response time and at the same time disconnects from the other speech engine modules.
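The switching rule (connect to the fastest engine, disconnect from all others) can be pictured as an update to a connection table. The table representation and the name `switch_to` are illustrative assumptions; the patent does not say how connections are tracked.

```python
from typing import Dict

def switch_to(connections: Dict[str, bool], fastest: str) -> Dict[str, bool]:
    """Return a new connection table in which only `fastest` stays connected."""
    if fastest not in connections:
        raise KeyError(f"unknown engine: {fastest}")
    return {name: name == fastest for name in connections}

# During the comparison phase the selector is connected to every engine.
connections = {"engine_a": True, "engine_b": True, "engine_c": True}
connections = switch_to(connections, "engine_b")
print(connections)  # {'engine_a': False, 'engine_b': True, 'engine_c': False}
```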

In addition, the invention provides a corresponding multi-speech-engine switching method based on a smart TV platform, comprising:

a. When the user runs a speech application and uses the speech recognition function, the speech engine selection module obtains the captured speech data through the speech application interface;

b. The speech engine selection module sends the speech data to every speech engine module through the speech engine interface;

c. Each speech engine module recognizes the speech data and returns its recognition result to the speech engine selection module;

d. The speech engine selection module records and compares the response time each speech engine module takes to return its recognition result, and switches to the speech engine module with the shortest response time.

Further, in step d, switching to the speech engine module with the shortest response time means that the speech engine selection module connects, through the speech engine interface, to the speech engine module with the shortest response time and at the same time disconnects from the other speech engine modules.
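Steps b through d amount to timing each engine on the same input and taking the minimum. The sketch below measures the engines one after another for simplicity, which is an assumption (the source does not specify sequential dispatch); the engines are simulated with `time.sleep`, and `make_engine` and `pick_fastest` are hypothetical names.

```python
import time

def make_engine(label, delay_s):
    """Build a stand-in engine that 'recognizes' after a fixed delay."""
    def recognize(audio):
        time.sleep(delay_s)  # simulates network and processing latency
        return f"{label}: recognized {len(audio)} bytes"
    return recognize

def pick_fastest(engines, audio):
    """Send the same audio to every engine and return (fastest name, response times)."""
    response_times = {}
    for name, recognize in engines.items():      # step b: send to each engine
        start = time.perf_counter()
        recognize(audio)                         # step c: engine returns its result
        response_times[name] = time.perf_counter() - start  # step d: record time
    fastest = min(response_times, key=response_times.get)   # step d: compare
    return fastest, response_times

engines = {
    "slow_engine": make_engine("slow_engine", 0.05),
    "fast_engine": make_engine("fast_engine", 0.001),
}
fastest, times = pick_fastest(engines, b"\x00" * 320)
print(fastest)
```

Note that response time is the only criterion here, as in the source; recognition accuracy is not compared.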

The beneficial effects of the invention are as follows. By comparing the response time (that is, the recognition speed) with which each speech engine module returns its recognition result and switching to the module with the shortest response time, the speech application can call the speech engine module with the highest recognition efficiency, which improves the overall efficiency of speech recognition. Moreover, because the connection between the speech application and the speech engine selection module (the speech application interface) stays unchanged, the speech application does not need to know which speech engine module is in use when a switch occurs, which preserves the stability and continuity of speech recognition.

Brief Description of the Drawings

Fig. 1 is an architecture diagram of the multi-speech-engine switching system based on a smart TV platform according to the invention;

Fig. 2 is a flowchart of the multi-speech-engine switching method based on a smart TV platform according to the invention.

Detailed Description

The principle of the invention is as follows. Because the speech engine modules in the system differ in performance, some process speech data faster than others. A speech engine selection module can therefore record and compare the response time each speech engine module takes to process the speech data, identify the module with the shortest processing time and fastest response, and switch the connection to that module. Because the application interface between the speech engine selection module and the speech application never changes, introducing the selection module also addresses system stability.

Referring to Fig. 1, the multi-speech-engine switching system based on a smart TV platform comprises a speech engine selection module and a plurality of speech engine modules. All speech engine modules are encapsulated by a unified speech engine interface and are connected to the speech engine selection module through that interface. The speech engine selection module is connected to the speech application through a speech application interface.

The speech engine module is configured to obtain from the speech engine interface the speech data sent by the speech engine selection module, recognize the speech data, and return the recognition result to the speech engine selection module. The speech engine selection module is configured, when the speech application uses the speech recognition function, to obtain the captured speech data through the speech application interface, send it to every speech engine module through the speech engine interface, receive the recognition results returned by all speech engine modules, record and compare the response time each speech engine module takes to return its result, and switch to the speech engine module with the shortest response time, so that the speech application calls the speech engine module with the highest recognition efficiency.

Fig. 2 shows the flow of the switching method, which comprises the following steps:

a. When the user runs a speech application and uses the speech recognition function, the speech engine selection module obtains the captured speech data through the speech application interface; the speech data comes from the audio signal captured by the smart TV's voice capture device;

b. The speech engine selection module sends the speech data to every speech engine module through the speech engine interface; because the engines are encapsulated by a unified speech engine interface, every speech engine module receives the same speech data at the same time;

c. Each speech engine module recognizes the speech data and returns its recognition result to the speech engine selection module;

d. The speech engine selection module records and compares the response time each speech engine module takes to return its recognition result, and switches to the speech engine module with the shortest response time: the speech engine selection module connects, through the speech engine interface, to the speech engine module with the shortest response time and at the same time disconnects from the other speech engine modules. From then on, the speech application obtains fast speech recognition by calling the speech engine module with the shortest response time, improving the user's voice interaction experience.
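A minimal end-to-end sketch of this flow, with the engines queried in parallel, might look like the following. It is not the patented implementation: the thread pool, the choice to run the comparison only on the first request, and the names `FakeEngine` and `RacingSelector` are all assumptions, and the engines are simulated with fixed delays.

```python
import time
from concurrent.futures import ThreadPoolExecutor

class FakeEngine:
    """Stand-in for a cloud speech engine with a fixed simulated latency."""

    def __init__(self, name, delay_s):
        self.name = name
        self.delay_s = delay_s
        self.calls = 0

    def recognize(self, audio):
        self.calls += 1
        time.sleep(self.delay_s)
        return f"{self.name}:{len(audio)}"

class RacingSelector:
    """Compares all engines on the first request, then routes to the fastest one."""

    def __init__(self, engines):
        self.engines = list(engines)
        self.active = None  # no engine selected until the first comparison

    @staticmethod
    def _timed(engine, audio):
        start = time.perf_counter()
        text = engine.recognize(audio)
        return engine, text, time.perf_counter() - start

    def recognize(self, audio):
        if self.active is not None:
            # After switching, only the selected engine is called.
            return self.active.recognize(audio)
        # Step b: every engine receives the same audio at (nearly) the same time.
        with ThreadPoolExecutor(max_workers=len(self.engines)) as pool:
            results = list(pool.map(lambda e: self._timed(e, audio), self.engines))
        # Step d: keep the engine with the shortest response time.
        engine, text, _ = min(results, key=lambda r: r[2])
        self.active = engine
        return text

slow = FakeEngine("slow", 0.05)
fast = FakeEngine("fast", 0.005)
selector = RacingSelector([slow, fast])
print(selector.recognize(b"hello"))  # first call: both engines run, fastest wins
print(selector.recognize(b"hi"))     # later calls go only to the selected engine
```

One design consequence worth noting: because the comparison runs once, the choice is not revisited if network conditions change later. The source does not say when or whether the comparison is repeated.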

Claims (2)

1. A multi-speech-engine switching system based on a smart TV platform, characterized in that it comprises: a speech engine selection module and at least two speech engine modules; all speech engine modules are encapsulated by a unified speech engine interface and are connected to the speech engine selection module through the speech engine interface; the speech engine selection module is connected to a speech application through a speech application interface;

the speech engine module is configured to obtain from the speech engine interface the speech data sent by the speech engine selection module, recognize the speech data, and return the recognition result to the speech engine selection module; the speech engine selection module is configured, when the speech application uses the speech recognition function, to obtain the captured speech data through the speech application interface, send the speech data to every speech engine module through the speech engine interface, receive the recognition results returned by all speech engine modules, record and compare the response time each speech engine module takes to return its recognition result, and switch to the speech engine module with the shortest response time, so that the speech application calls the speech engine module with the highest recognition efficiency;

switching to the speech engine module with the shortest response time means that the speech engine selection module connects, through the speech engine interface, to the speech engine module with the shortest response time and at the same time disconnects from the other speech engine modules.

2. A multi-speech-engine switching method based on a smart TV platform, applied in the system according to claim 1, characterized in that it comprises:

a. when the user runs a speech application and uses the speech recognition function, the speech engine selection module obtains the captured speech data through the speech application interface;

b. the speech engine selection module sends the speech data to every speech engine module through the speech engine interface;

c. each speech engine module recognizes the speech data and returns its recognition result to the speech engine selection module;

d. the speech engine selection module records and compares the response time each speech engine module takes to return its recognition result, and switches to the speech engine module with the shortest response time;

in step d, switching to the speech engine module with the shortest response time means that the speech engine selection module connects, through the speech engine interface, to the speech engine module with the shortest response time and at the same time disconnects from the other speech engine modules.
CN201210558320.XA 2012-12-20 2012-12-20 Based on Multi-voice engine switch system and the method for intelligent television platform Expired - Fee Related CN103117058B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210558320.XA CN103117058B (en) 2012-12-20 2012-12-20 Based on Multi-voice engine switch system and the method for intelligent television platform

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210558320.XA CN103117058B (en) 2012-12-20 2012-12-20 Based on Multi-voice engine switch system and the method for intelligent television platform

Publications (2)

Publication Number Publication Date
CN103117058A CN103117058A (en) 2013-05-22
CN103117058B true CN103117058B (en) 2015-12-09

Family

ID=48415416

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210558320.XA Expired - Fee Related CN103117058B (en) 2012-12-20 2012-12-20 Based on Multi-voice engine switch system and the method for intelligent television platform

Country Status (1)

Country Link
CN (1) CN103117058B (en)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103336687B (en) * 2013-06-17 2016-09-14 深圳市金立通信设备有限公司 The changing method of a kind of application interface and terminal
CN103714814A (en) * 2013-12-11 2014-04-09 四川长虹电器股份有限公司 Voice introducing method of voice recognition engine
CN104795069B (en) * 2014-01-21 2020-06-05 腾讯科技(深圳)有限公司 Speech recognition method and server
CN105609102B (en) * 2014-11-21 2021-03-16 中兴通讯股份有限公司 A kind of speech engine parameter configuration method and device
CN107018228B (en) * 2016-01-28 2020-03-31 中兴通讯股份有限公司 Voice control system, voice processing method and terminal equipment
CN107526512B (en) * 2017-08-31 2020-11-20 联想(北京)有限公司 Switching method and system for electronic equipment
CN107657031A (en) * 2017-09-28 2018-02-02 四川长虹电器股份有限公司 Method based on android system management intelligent sound box voice technical ability
CN109036427B (en) * 2018-09-25 2021-01-26 苏宁智能终端有限公司 Method and system for dynamically configuring voice recognition service
CN111179934A (en) * 2018-11-12 2020-05-19 奇酷互联网络科技(深圳)有限公司 Method of selecting a speech engine, mobile terminal and computer-readable storage medium
CN109410926A (en) * 2018-11-27 2019-03-01 恒大法拉第未来智能汽车(广东)有限公司 Speech semantic recognition method and system
CN109493862B (en) * 2018-12-24 2021-11-09 深圳Tcl新技术有限公司 Terminal, voice server determination method, and computer-readable storage medium
CN109949816A (en) * 2019-02-14 2019-06-28 安徽云之迹信息技术有限公司 Robot voice processing method and processing device, cloud server
CN109947651B (en) * 2019-03-21 2022-08-02 上海智臻智能网络科技股份有限公司 Artificial intelligence engine optimization method and device
CN110708365A (en) * 2019-09-23 2020-01-17 杭州迪普科技股份有限公司 Data receiver selection method and device
CN113450785B (en) * 2020-03-09 2023-12-19 上海擎感智能科技有限公司 Implementation methods, systems, media and cloud servers for in-vehicle voice processing
CN113593535B (en) * 2021-06-30 2024-05-24 青岛海尔科技有限公司 Voice data processing method and device, storage medium, and electronic device
CN114446279A (en) * 2022-02-18 2022-05-06 青岛海尔科技有限公司 Voice recognition method, voice recognition device, storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1323435A (en) * 1998-10-02 2001-11-21 国际商业机器公司 System and method for providing network coordinated conversational services
CN1429019A (en) * 2001-12-18 2003-07-09 松下电器产业株式会社 TV set with sound discrimination function and its control method
CN1633679A (en) * 2001-12-29 2005-06-29 摩托罗拉公司 Method and apparatus for multi-level distributed speech recognition
CN1723487A (en) * 2002-12-13 2006-01-18 摩托罗拉公司 Method and apparatus for selective speech recognition

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6480819B1 (en) * 1999-02-25 2002-11-12 Matsushita Electric Industrial Co., Ltd. Automatic search of audio channels by matching viewer-spoken words against closed-caption/audio content for interactive television

Also Published As

Publication number Publication date
CN103117058A (en) 2013-05-22

Similar Documents

Publication Publication Date Title
CN103117058B (en) Based on Multi-voice engine switch system and the method for intelligent television platform
JP7242520B2 (en) visually aided speech processing
CN110473546B (en) Media file recommendation method and device
CN112434139B (en) Information interaction method, device, electronic device and storage medium
CN112382287B (en) Voice interaction method, device, electronic equipment and storage medium
CN103730116B (en) Intelligent watch realizes the system and method that intelligent home device controls
CN111462733B (en) Multimodal speech recognition model training method, device, equipment and storage medium
CN103188407B (en) The processing method of interactive voice response IVR, terminal, testing server and system
US20190355354A1 (en) Method, apparatus and system for speech interaction
CN110457256A (en) Date storage method, device, computer equipment and storage medium
JP6681450B2 (en) Information processing method and device
CN113889113A (en) Sentence dividing method and device, storage medium and electronic equipment
JP6901798B2 (en) Audio fingerprinting based on audio energy characteristics
WO2020238209A1 (en) Audio processing method, system and related device
CN112053692B (en) Speech recognition processing method, device and storage medium
CN102625007A (en) A method for controlling home equipment based on voice recognition
CN102847325B (en) Toy control method and system based on voice interaction of mobile communication terminal
JP6783339B2 (en) Methods and devices for processing audio
CN106782561A (en) Audio recognition method and system
JP2022050309A (en) Information processing method, device, system, electronic device, storage medium, and computer program
CN119479620B (en) Streaming voice interaction method and related device, equipment and storage medium
CN112148754A (en) Song identification method and device
CN101527755A (en) Voice interactive method based on VoiceXML movable termination and movable termination
CN105094028B (en) Abnormal state prompt method and server of sweeping robot
CN108337357B (en) Audio playback method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20151209

CF01 Termination of patent right due to non-payment of annual fee