CN103092330B - Electronic device and voice recognition method thereof - Google Patents
Electronic device and voice recognition method thereof Download PDFInfo
- Publication number
- CN103092330B CN103092330B CN201110343163.6A CN201110343163A CN103092330B CN 103092330 B CN103092330 B CN 103092330B CN 201110343163 A CN201110343163 A CN 201110343163A CN 103092330 B CN103092330 B CN 103092330B
- Authority
- CN
- China
- Prior art keywords
- values
- value
- comparison result
- signal
- numerical
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Telephone Function (AREA)
Abstract
一种语音辨识方法,应用于一电子装置。该方法包括以下步骤:取得声音信号;转换声音信号为数字信号;自数字信号中取得多个数值,其中各数值代表对应的声波振幅值;依序比对各数值相对于至少一相邻数值的大小,以取得一组比对结果值;判断该组比对结果值是否对应至少一操作指令的其中之一;若是,执行对应的操作指令。通过本发明的技术方案,使用者所发出的声音信息经过本发明的信号转换及比对分析后,能够准确地判断此声音信息是否对应现有的操作指令,以确保能执行正确的操作指令,并且于使用上更加人性化。
A speech recognition method is applied to an electronic device. The method includes the following steps: obtaining a sound signal; converting the sound signal into a digital signal; obtaining multiple values from the digital signal, wherein each value represents a corresponding sound wave amplitude value; sequentially comparing the size of each value relative to at least one adjacent value to obtain a set of comparison result values; determining whether the set of comparison result values corresponds to at least one of the operation instructions; if so, executing the corresponding operation instruction. Through the technical solution of the present invention, after the sound information emitted by the user is converted and compared and analyzed by the signal of the present invention, it can be accurately determined whether the sound information corresponds to the existing operation instruction, so as to ensure that the correct operation instruction can be executed, and it is more user-friendly in use.
Description
技术领域 technical field
本发明关于一种语音辨识方法,特别涉及一种应用于电子装置以便执行语音控制的语音辨识方法;此外本发明也包括应用该方法的电子装置。The present invention relates to a voice recognition method, in particular to a voice recognition method applied to an electronic device for performing voice control; in addition, the present invention also includes an electronic device applying the method.
背景技术 Background technique
随着语音辨识技术的日益精进,其应用的范围也越来越广泛。以电子装置为例,当结合语音辨识技术后,可提供使用者通过所发出的声音指令,来针对电子装置进行操控的功能;如此不仅简化了电子装置的操作,免除以往使用者必须通过各类遥控器输入指令,或是直接以手动触控电子装置的对应功能键所造成的不便,并且也提高了使用者使用上的便利性。With the increasing sophistication of speech recognition technology, the scope of its application is also becoming wider and wider. Taking electronic devices as an example, when combined with voice recognition technology, it can provide users with the function of controlling electronic devices through voice commands; this not only simplifies the operation of electronic devices, but also eliminates the need for users to go through various methods in the past. The inconvenience caused by the remote controller inputting commands or directly manually touching the corresponding function keys of the electronic device also improves the user's convenience in use.
目前普遍采用的语音辨识技术,大多是直接将声音信号转换为电信号,并通过模拟数字转换器将声音信号波形数值化,以利于进行声音辨识及判断。然而由于不同的译码器所产生的声音波形不尽相同,如何能针对现有波形进行有效分析,以减少声音辨识时的误判并提高辨识度,实为一值得研究的课题。Most of the voice recognition technologies commonly used at present are to directly convert the voice signal into an electrical signal, and digitize the waveform of the voice signal through an analog-to-digital converter, so as to facilitate voice recognition and judgment. However, because the sound waveforms produced by different decoders are not the same, how to effectively analyze the existing waveforms to reduce misjudgment and improve recognition during sound recognition is a subject worth studying.
发明内容 Contents of the invention
本发明的主要目的的提供一种应用声音飞数值化转换以判断语音指令的语音辨识方法,以解决现有技术采用的语音辨识技术声音辨识时的误判较高的问题。The main purpose of the present invention is to provide a speech recognition method that applies the conversion of sound to numerical value to judge the speech command, so as to solve the problem of high misjudgment in the speech recognition technology adopted in the prior art.
为达到上述的目的,本发明采用以下技术方案:In order to achieve the above-mentioned purpose, the present invention adopts the following technical solutions:
一种语音辨识方法,应用于一电子装置,该方法包括以下步骤:A voice recognition method applied to an electronic device, the method includes the following steps:
取得一声音信号;obtain an audio signal;
转换该声音信号为一数字信号;converting the sound signal into a digital signal;
自该数字信号中取得多个数值,其中各该数值代表对应的一声波振幅值;Obtaining a plurality of numerical values from the digital signal, wherein each numerical value represents a corresponding sound wave amplitude value;
依序比对该多个数值中各该数值相对于至少一相邻数值的大小,以取得一组比对结果值;Sequentially comparing the magnitude of each of the multiple values relative to at least one adjacent value to obtain a set of comparison results;
判断该组比对结果值是否对应至少一操作指令的其中之一;以及judging whether the group of comparison result values corresponds to at least one of the operation instructions; and
当判断该组比对结果值对应该至少一操作指令的其中之一时,执行对应的该操作指令。When it is determined that the group of comparison result values corresponds to one of the at least one operation instruction, the corresponding operation instruction is executed.
较佳地,该声音信号通过傅立叶转换而形成该数字信号。Preferably, the sound signal is transformed into the digital signal by Fourier transformation.
较佳地,通过过滤掉该数字信号中低于一标准值的信号值,以取得该多个数值。Preferably, the multiple values are obtained by filtering out signal values lower than a standard value in the digital signal.
较佳地,还包括以下步骤:检测是否接收一启动信号,以判断是否开始取得该声音信号。Preferably, the method further includes the following step: detecting whether a start signal is received, so as to judge whether to start to acquire the sound signal.
较佳地,于判断该组比对结果值是否对应至少一操作指令的其中之一的步骤中,通过比对该至少一操作指令所对应的至少一组参考值与该组比对结果值是否相符或数值差异量在一定范围内,以判断该组比对结果值是否对应该至少一操作指令的其中之一。Preferably, in the step of judging whether the group of comparison result values corresponds to at least one of the operation instructions, by comparing at least one group of reference values corresponding to the at least one operation instruction with the group of comparison result values The coincidence or numerical difference is within a certain range to determine whether the set of comparison result values corresponds to one of the at least one operation instruction.
较佳地,将各该数值均与同一侧的一单一相邻数值加以比对,依据各该数值相对于该单一相邻数值的大小状态,以取得对应的该组比对结果值。Preferably, each of the numerical values is compared with a single adjacent numerical value on the same side, and the corresponding group of comparison result values is obtained according to the magnitude of each of the numerical values relative to the single adjacent numerical value.
较佳地,将各该数值同时与位于该数值两侧的该两相邻数值加以比对,依据各该数值相对于该二相邻数值的大小状态,以取得对应的该组比对结果值。Preferably, each of the numerical values is compared with the two adjacent numerical values located on both sides of the numerical value at the same time, and the corresponding group of comparison result values is obtained according to the magnitude of each of the numerical values relative to the two adjacent numerical values .
较佳地,还包括以下步骤:当判断该组比对结果值未对应该至少一操作指令的其中之一时,检测是否接收一设定信号;若是,设定该声音信号以对应一新操作指令。Preferably, the following steps are further included: when it is judged that the group of comparison result values does not correspond to one of the at least one operation instruction, detecting whether a setting signal is received; if so, setting the sound signal to correspond to a new operation instruction .
本发明还提供一种电子装置,包括:The present invention also provides an electronic device, comprising:
一声音接收模块,用以接收一声音信号;以及a sound receiving module for receiving a sound signal; and
一处理模块,与该声音接收模块电性连接,该处理模块包括:A processing module is electrically connected to the sound receiving module, and the processing module includes:
一转换单元,用以转换该声音信号为一数字信号,并自该数字信号中取得多个数值,其中各该数值代表对应的一声波振幅值;以及a conversion unit, used to convert the sound signal into a digital signal, and obtain a plurality of values from the digital signal, wherein each value represents a corresponding sound wave amplitude value; and
一比对编码单元,用以依序比对该多个数值中各该数值相对于至少一相邻数值的大小,以取得一组比对结果值;A comparison coding unit, used to sequentially compare the magnitude of each of the plurality of values relative to at least one adjacent value, so as to obtain a set of comparison result values;
通过该处理模块判断该组比对结果值是否对应至少一操作指令的其中之一,以决定是否执行对应的该操作指令。The processing module judges whether the group of comparison result values corresponds to one of at least one operation instruction, so as to determine whether to execute the corresponding operation instruction.
较佳地,该处理模块通过傅立叶转换将该声音信号转换成该数字信号。Preferably, the processing module converts the sound signal into the digital signal through Fourier transform.
较佳地,该转换单元过滤掉该数字信号中低于一标准值的信号值,以取得该多个数值。Preferably, the conversion unit filters out signal values lower than a standard value in the digital signal to obtain the plurality of values.
较佳地,还包括一启动模块,用以产生一启动信号至该处理模块,以通知该声音接收模块开始取得该声音信号。Preferably, an activation module is also included, configured to generate an activation signal to the processing module to notify the audio receiving module to start acquiring the audio signal.
较佳地,还包括一存储模块,用以存储该至少一操作指令所对应的至少一组参考值,其中当该处理模块比对该至少一组参考值的其中之一与该组比对结果值相符或数值差异量在一定范围内时,则判断该组比对结果值对应该至少一操作指令的其中之一。Preferably, it also includes a storage module for storing at least one set of reference values corresponding to the at least one operation instruction, wherein when the processing module compares one of the at least one set of reference values with the set of comparison results When the values match or the numerical difference is within a certain range, it is determined that the group of comparison result values corresponds to one of the at least one operation instruction.
较佳地,将各该数值均与同一侧的一单一该相邻数值加以比对,依据各该数值相对于该单一相邻数值的大小状态,以取得对应的该组比对结果值。Preferably, each of the numerical values is compared with a single adjacent numerical value on the same side, and the corresponding group of comparison result values is obtained according to the magnitude of each of the numerical values relative to the single adjacent numerical value.
较佳地,将各该数值同时与位于该数值两侧的该二相邻数值加以比对,依据各该数值相对于该二相邻数值的大小状态,以取得对应的该组比对结果值。Preferably, each of the numerical values is compared with the two adjacent numerical values located on both sides of the numerical value at the same time, and the corresponding group of comparison result values is obtained according to the magnitude of each of the numerical values relative to the two adjacent numerical values .
进一步地,当该处理模块判断该组比对结果值未对应该至少一操作指令的其中之一时,该处理模块检测是否接收一设定信号,以设定该声音信号以对应一新操作指令。Further, when the processing module determines that the group of comparison result values does not correspond to one of the at least one operation instruction, the processing module detects whether a setting signal is received, so as to set the sound signal to correspond to a new operation instruction.
借此设计,使用者所发出的声音信息经过本发明的信号转换及比对分析后,能够准确地判断此声音信息是否对应现有的操作指令,以确保能执行正确的操作指令,并且于使用上更加人性化。With this design, after the signal conversion and comparison analysis of the present invention, the voice information sent by the user can accurately judge whether the voice information corresponds to the existing operation command, so as to ensure that the correct operation command can be executed, and it can be used when using more humane.
附图说明 Description of drawings
图1是本发明的电子装置的示意图;1 is a schematic diagram of an electronic device of the present invention;
图2(a)是本发明的电子装置所接收的声音信号的波形图;Fig. 2 (a) is the waveform diagram of the sound signal received by the electronic device of the present invention;
图2(b)本发明的电子装置所接收的声音信号经转换后所形成的波形图;Fig. 2 (b) is the waveform diagram formed after the sound signal received by the electronic device of the present invention is converted;
图2(c)是本发明的电子装置将比对结果值与参考值相比对的示意图;Fig. 2(c) is a schematic diagram of comparing the comparison result value with the reference value by the electronic device of the present invention;
图3是本发明的语音辨识方法的第一实施例的流程图;Fig. 3 is the flowchart of the first embodiment of the speech recognition method of the present invention;
图4是本发明的语音辨识方法的第二实施例的流程图;Fig. 4 is the flowchart of the second embodiment of the speech recognition method of the present invention;
具体实施方式 Detailed ways
为能让审查员能更了解本发明的技术内容,特举出较佳实施例说明如下。In order to enable the examiners to better understand the technical content of the present invention, preferred embodiments are given and described as follows.
请先参考图1是本发明的电子装置1的示意图。在本发明的一实施例中,电子装置1是一遥控器以针对家用电器进行远程控制,而依据应用类型不同,电子装置1也可为一家用电器(例如电视或影音播放机等)、一可携式电子产品(例如计算机或手机等)或其它可应用声控方式操作的电子装置,但不以本实施例为限。Please refer to FIG. 1 , which is a schematic diagram of an electronic device 1 of the present invention. In an embodiment of the present invention, the electronic device 1 is a remote controller for remote control of household appliances, and depending on the type of application, the electronic device 1 can also be a household appliance (such as a TV or an audio-visual player, etc.), a Portable electronic products (such as computers or mobile phones, etc.) or other electronic devices that can be operated by voice control, but not limited to this embodiment.
如图1所示,本发明的电子装置1包括声音接收模块10、处理模块20及存储模块30。声音接收模块10用以接收使用者所发出的声音信号,在本发明的一实施例中,声音接收模块10为一声音转化器或一麦克风,但也可采用其它具有类似收音功能的装置。As shown in FIG. 1 , the electronic device 1 of the present invention includes a sound receiving module 10 , a processing module 20 and a storage module 30 . The sound receiving module 10 is used to receive the sound signal from the user. In one embodiment of the present invention, the sound receiving module 10 is a sound converter or a microphone, but other devices with similar sound receiving functions can also be used.
处理模块20与声音接收模块10电性连接,处理模块20包括转换单元22及比对编码单元24。转换单元22用以转换已接收的声音信号为数字信号,其中转换单元22通过傅立叶转换方式将声音信号转换成对应的数字信号,例如数字模拟转换器。The processing module 20 is electrically connected to the sound receiving module 10 , and the processing module 20 includes a conversion unit 22 and a comparison and encoding unit 24 . The conversion unit 22 is used for converting the received audio signal into a digital signal, wherein the conversion unit 22 converts the audio signal into a corresponding digital signal by means of Fourier transform, such as a digital-to-analog converter.
请一并参考图1及图2(a)、(b)及(c)。图2(a)是本发明的电子装置1所接收的声音信号的波形图,图2(b)是本发明的电子装置1所接收的声音信号经转换后所形成的波形图,图2(c是本发明的电子装置1将比对结果值与参考值相比对的示意图。如图2(a)所示,本发明的电子装置1的声音接收模块10在接收到使用者所发出的声音信号后,即可取得如图所示声波的时间-振幅波形图,由于此声音信号过于杂乱而难以辨识,因此需要经过信号的转换以便提高辨识效果。如图2(b)所示,经傅立叶转换后所产生声波的频率-振幅波形图,将使得数字信号会呈现多个高低不一的振幅峰值,本发明的技术即是针对此振幅峰值的数值处理以达到语音辨识的效果。Please refer to Figure 1 and Figure 2 (a), (b) and (c) together. Fig. 2 (a) is the waveform diagram of the sound signal received by the electronic device 1 of the present invention, Fig. 2 (b) is the waveform diagram formed after the conversion of the sound signal received by the electronic device 1 of the present invention, Fig. 2 ( C is the schematic diagram that the electronic device 1 of the present invention compares the comparison result value with the reference value.As shown in Figure 2 (a), the sound receiving module 10 of the electronic device 1 of the present invention receives the user's sent After the sound signal, the time-amplitude waveform diagram of the sound wave can be obtained as shown in the figure. Since the sound signal is too messy and difficult to identify, it needs to undergo signal conversion in order to improve the identification effect. As shown in Figure 2 (b), after The frequency-amplitude waveform diagram of the sound wave generated after Fourier transform will cause the digital signal to present multiple amplitude peaks with different heights. The technology of the present invention is aimed at the numerical processing of these amplitude peaks to achieve the effect of speech recognition.
如图2(b)所示,转换单元22自经转换后的数字信号中取得多个数值,而在欲取得这些数值的过程中,为避免大量振幅峰值较低的数值影响判断的准确度,因此转换单元22先依已设定的一振幅数值做为标准值(如图中虚线所示),并针对数字信号中所取得的多个峰值加以过滤,删除掉低于前述标准值的峰值以避免噪声干扰,最后取得余留下来的多个数值;其中各数值代表对应的声波振幅值。在本发明的一实施例中,转换单元22自经转换后的数字信号中取得M个数值,且M为大于1的整数。As shown in FIG. 2( b ), the conversion unit 22 obtains a plurality of values from the converted digital signal, and in the process of obtaining these values, in order to avoid a large number of values with lower peak amplitudes from affecting the accuracy of judgment, Therefore, the conversion unit 22 first takes a set amplitude value as a standard value (as shown by the dotted line in the figure), and filters the multiple peaks obtained in the digital signal, and deletes the peaks lower than the aforementioned standard value to obtain Avoid noise interference, and finally obtain the remaining multiple values; where each value represents the corresponding sound wave amplitude value. In an embodiment of the present invention, the conversion unit 22 obtains M values from the converted digital signal, and M is an integer greater than 1.
比对编码单元24用以对多个数值中的各数值,依序比对各数值相对于至少一相邻数值的大小,以取得一组比对结果值。此处所述依序比对各数值相对于至少一相邻数值的设计,是指将多个数值中的各个数值按照前后顺序各别与其相邻的左右数值进行比对,例如将各数值依序均与同一侧(左侧或右侧)的单一相邻数值两两比对,或是依序将各数值同时与左右两侧的相邻数值一齐比对;每个数值经前述比对后可产生一个比对结果,而在多个数值均经过比对后,即取得由多个比对结果组成的一组比对结果值。在本发明的一实施例中,假设比对编码单元24针对M个数值中的第N个数值相对于其左侧相邻的第(N-1)个数值依序进行两两比对,以取得一组比对结果值,其中N为整数且2≤N≤M。如图2(b)所示,举例来说,若转换单元22取得5个数值(此时M等于5,且N为2至5之间的任意数),比对编码单元24会依序比对第2个数值对第1个数值的大小,接着比对第3个数值对第2个数值的大小...依此类推,最后比对第5个数值对第4个数值的大小,进而得到一组比对结果值。The comparing and encoding unit 24 is used for sequentially comparing the magnitude of each numerical value relative to at least one adjacent numerical value for each numerical value among the plurality of numerical values, so as to obtain a set of comparison result values. The design of sequentially comparing each numerical value with respect to at least one adjacent numerical value mentioned here refers to comparing each numerical value among the multiple numerical values with its adjacent left and right numerical values in sequence, for example, comparing each numerical value according to The sequence is compared with a single adjacent value on the same side (left or right), or each value is compared with the adjacent values on the left and right sides at the same time; each value after the aforementioned comparison One comparison result can be generated, and after multiple values are compared, a set of comparison result values composed of multiple comparison results is obtained. In one embodiment of the present invention, it is assumed that the comparison coding unit 24 performs a pairwise comparison of the Nth value among the M values relative to the (N-1)th value adjacent to its left side, so as to Obtain a set of comparison result values, where N is an integer and 2≤N≤M. As shown in Figure 2(b), for example, if the conversion unit 22 obtains 5 values (at this time M is equal to 5, and N is any number between 2 and 5), the comparison coding unit 24 will sequentially compare For the size of the second value to the first value, then compare the size of the third value to the second value...and so on, and finally compare the size of the fifth value to the fourth value, and then Get a set of comparison result values.
其中对应前述数值大小的比较结果,可定义其结果分别对应不同的比对结果值。当比对编码单元24判断第N个数值大于第(N-1)个数值时,记录第N个数值为一第一比对结果值;当判断第N个数值等于第(N-1)个数值时,记录第N个数值为一第二比对结果值;当判断第N个数值小于第(N-1)个数值时,记录第N个数值为一第三比对结果值。延续前例,假设定义第一比对结果值为1,第二比对结果值为2,第三比对结果值为0;在取得5个数值的状况下,当第2个数值大于第1个数值时,记录第2个数值的比对结果值为1;当第3个数值大于第2个数值时,记录第3个数值的比对结果值为1;当第4个数值等于第3个数值时,记录第4个数值的比对结果值为0;最后当第5个数值小于第4个数值时,记录第5个数值的比对结果值为2。因此,比对编码单元24经数值依序两两比对后,会取得由前述四个比对结果所组成的一组比对结果值为”1120”。通过所获得的此组比对结果值,即可作为后续指令比对的判断来源。The comparison results corresponding to the aforementioned numerical values may be defined to correspond to different comparison result values. When the comparison encoding unit 24 judges that the N numerical value is greater than the (N-1) numerical value, the N numerical value is recorded as a first comparison result value; when the N numerical value is judged to be equal to the (N-1) For numerical values, record the Nth numerical value as a second comparison result value; when it is judged that the Nth numerical value is smaller than the (N-1)th numerical value, record the Nth numerical value as a third comparison result value. Continuing the previous example, assume that the first comparison result is defined as 1, the second comparison result is 2, and the third comparison result is 0; in the case of obtaining 5 values, when the second value is greater than the first value, record the comparison result value of the second value as 1; when the third value is greater than the second value, record the comparison result value of the third value as 1; when the fourth value is equal to the third value, record the comparison result value of the 4th value as 0; finally, when the 5th value is smaller than the 4th value, record the comparison result value of the 5th value as 2. Therefore, the comparing and encoding unit 24 will obtain a set of comparison results consisting of the aforementioned four comparison results as “1120” after performing pairwise comparison of values. The obtained comparison result value can be used as a judgment source for subsequent instruction comparison.
在本发明的另一实施例中,若是在各数值同时与左右两侧的相邻数值一齐比对的设计下,假设比对编码单元24针对M个数值中的第N个数值相对于其左侧相邻的第(N-1)个数值及右侧相邻的第(N+1)个数值依序进行一齐比对,以取得一组比对结果值,其中N为整数且2≤N≤M-1。举例来说,若转换单元22取得如图2(b)所示的5个数值(此时M等于5,且N为2至4之间的任意数),比对编码单元24会依序比对第2个数值相对第1个数值及第3个数值的大小,接着比对第3个数值相对第2个数值及第4个数值的大小,最后比对第4个数值相对第3个数值及第5个数值的大小,进而得到一组比对结果值。In another embodiment of the present invention, if each numerical value is compared with adjacent numerical values on the left and right sides at the same time, it is assumed that the comparison coding unit 24 compares the Nth numerical value among the M numerical values with respect to its left The (N-1)th values adjacent to the side and the (N+1)th values adjacent to the right are sequentially compared to obtain a set of comparison results, where N is an integer and 2≤N ≤M-1. For example, if the conversion unit 22 obtains 5 values as shown in FIG. Compare the size of the 2nd value with respect to the 1st value and the 3rd value, then compare the size of the 3rd value with respect to the 2nd value and the 4th value, finally compare the 4th value with respect to the 3rd value and the size of the fifth value to obtain a set of comparison result values.
其中依据前述数值大小的比较结果,可分别定义出对应九种不同结果的比对结果值,其略分为三个数值相等(1种)、任两个数值相等且搭配另一数值的大小变化(6种),三个数值均不相等(2种)等态样,也可使用数字1至9来表示各个态样所对应的比对结果值。因此若依图2(b)所取得的5个数值来判断,假设定义三个数值均不相等且第2个数值大于第1个数值但小于第3个数值的比对结果值为8,第3个数值等于第4个数值且大于第2个数值的比对结果值为2,而第4个数值等于第3个数值且大于第5个数值的比对结果值为5。因此,比对编码单元24经各数值与两侧相邻数值依序比对后,会取得由前述三个比对结果所组成的一组比对结果值为”825”,以此类推。前述各比对结果值可依不同的辨识设计而改变,本发明不以此为限。Among them, according to the comparison results of the aforementioned numerical values, the comparison result values corresponding to nine different results can be respectively defined, which can be roughly divided into three values that are equal (one type), any two values that are equal and matched with another value. (6 types), the three values are not equal (2 types), etc., and numbers 1 to 9 can also be used to represent the comparison result values corresponding to each type. Therefore, if judged according to the 5 numerical values obtained in Figure 2(b), assuming that the three numerical values are not equal and the second numerical value is greater than the first numerical value but less than the third numerical value, the comparison result value is 8. The comparison result value of 3 values equal to the 4th value and greater than the 2nd value is 2, and the comparison result value of the 4th value equal to the 3rd value and greater than the 5th value is 5. Therefore, after the comparing and encoding unit 24 sequentially compares each value with the adjacent values on both sides, it will obtain a set of comparison results consisting of the aforementioned three comparison results as "825", and so on. The values of the aforementioned comparison results may vary according to different identification designs, and the present invention is not limited thereto.
须注意的是,本发明的电子装置1的处理模块20可为一处理器或其与软件的结合,但处理模块20也可为单纯的硬件、软件、硬体或前述至少两者以上的组合,本发明不以此为限。It should be noted that the processing module 20 of the electronic device 1 of the present invention can be a processor or its combination with software, but the processing module 20 can also be pure hardware, software, hardware or a combination of at least two or more of the foregoing. , the present invention is not limited thereto.
存储模块30用以存储前述该组比对结果值及至少一操作指令所对应的至少一组参考值。其中使用者可预先设定并记录至少一声音指令以对应至少一操作指令,例如预录”关机”或”开机”的声音指令以对应本发明的电子装置1的关机或开机操作。而这些声音指令一样可通过前述针对声音信号的处理方式,进行信号转换及比对分析,以取得其对应的至少一操作指令的至少一组参考值。例如经信号转换比对后,”关机”的操作指令对应于一组参考值”2101”,而”开机”的操作指令则对应于另一组参考值”1121”。这些参考值也会被存储于存储模块30中以供后续比对使用。The storage module 30 is used for storing at least one set of reference values corresponding to the aforementioned set of comparison result values and at least one operation instruction. The user can pre-set and record at least one voice command to correspond to at least one operation command, for example, a pre-recorded voice command of “power off” or “power on” corresponds to the power off or power on operation of the electronic device 1 of the present invention. These voice commands can also be converted, compared and analyzed through the above-mentioned processing method for voice signals, so as to obtain at least one set of reference values corresponding to at least one operation command. For example, after the signal conversion and comparison, the operation command of "power off" corresponds to a set of reference values "2101", while the operation command of "power on" corresponds to another set of reference values "1121". These reference values are also stored in the storage module 30 for subsequent comparison.
借此,处理模块20依据所取得的该组比对结果值判断其是否对应于前述任一组参考值,以确认该组比对结果值是否对应任一操作指令,进而执行所对应的操作指令。其中当处理模块20比对任一组参考值与该组比对结果值相符或数值差异量在一定范围内时,则判断该组比对结果值对应该至少一操作指令的其中之一,进而执行对应的操作指令。如图2(c)所示,延续前例,假设当处理模块20判断任一组参考值与该组比对结果值数值完全相同或两者数值差异量为1个数值以内,当已取得的该组比对结果值为”1120”分别与代表关机的该组参考值”2101”及代表开机的该组参考值”1121”加以比对后,可得知该组比对结果值”1120”与代表开机的该组参考值”1121”较为相近且仅有1个数值的差异,因此处理模块20即可判断该组比对结果值对应开机的操作指令,借此依据该声音信息执行电子装置1的开机操作。须注意的是,前述设定的比对数值差异量也可依辨识设计需求而改变,本发明不以此为限。在本发明的一实施例中,存储模块20可为内存或其它可供存储的装置。Thereby, the processing module 20 judges whether it corresponds to any one of the aforementioned reference values according to the obtained group of comparison result values, so as to confirm whether the group of comparison result values corresponds to any operation instruction, and then executes the corresponding operation instruction . When the processing module 20 compares any set of reference values with the set of comparison result values, or when the numerical difference is within a certain range, it is determined that the set of comparison result values corresponds to at least one of the operation instructions, and then Execute the corresponding operation command. As shown in Figure 2(c), continuing the previous example, suppose that when the processing module 20 judges that any set of reference values is completely the same as the set of comparison result values or the difference between the two values is within 1 value, when the acquired After the group comparison result value "1120" is compared with the group reference value "2101" representing power off and the group reference value "1121" representing power on, it can be known that the group comparison result value "1120" and The set of reference values "1121" representing power-on are relatively similar with only one value difference, so the processing module 20 can judge that the set of comparison result values corresponds to the operation command of power-on, so as to execute the electronic device 1 according to the voice information. start-up operation. It should be noted that the above-mentioned difference in comparison value can also be changed according to the identification design requirements, and the present invention is not limited thereto. In an embodiment of the present invention, the storage module 20 may be a memory or other storage devices.
此外,当处理模块20依据前述判断机制判断该组比对结果值并未对应任一操作指令时,表示使用者所发出的声音信号与任一操作指令均不相符,此时处理模块20可通过其它器件或模块发出提示信息,例如预设的声响或闪光等,用以告知使用者。此时使用者可通过电子装置1的特定按键输入一设定信号,当处理模块20检测已接收此设定信号时,即可依据此设定信号将所接收的声音信号赋予新的对应设定,使其对应一新操作指令。借此,当语音辨识效果不理想而无法分辨使用者时声音时,使用者能自行重新录制及设定目前所发出的声音信息及其对应的操作指令,提高使用上的便利性。In addition, when the processing module 20 judges that the group of comparison result values does not correspond to any operation instruction according to the above-mentioned judgment mechanism, it means that the sound signal issued by the user does not match any operation instruction. At this time, the processing module 20 can pass Other devices or modules send out prompt information, such as preset sounds or flashes, to inform users. At this point, the user can input a setting signal through a specific button on the electronic device 1, and when the processing module 20 detects that the setting signal has been received, the received sound signal can be assigned a new corresponding setting according to the setting signal. , so that it corresponds to a new operation instruction. In this way, when the voice recognition effect is not satisfactory and the voice of the user cannot be distinguished, the user can re-record and set the currently issued voice information and the corresponding operation instructions, thereby improving the convenience of use.
本发明的电子装置1还包括启动模块40,用以产生一启动信号至处理模块20,以通知声音接收模块10开始取得使用者所发出的声音信号。在本发明的一实施例中,启动模块40可为一输入按键或以其它具类似功能的器件所取代,不以本实施例为限。The electronic device 1 of the present invention further includes an activation module 40 for generating an activation signal to the processing module 20 to notify the sound receiving module 10 to start acquiring the audio signal from the user. In an embodiment of the present invention, the activation module 40 may be an input button or replaced by other devices with similar functions, which is not limited to this embodiment.
请参考图3是本发明的语音辨识方法的第一实施例的流程图。须注意的是,以下虽以图1所示的电子装置1为例说明本发明的语音辨识方法,但本发明并不以适用于电子装置1为限,任何其它具类似架构的电子装置也可适用本发明的语音辨识方法。如图3所示,本发明的语音辨识方法包括步骤S301至步骤S306。以下将详细说明该方法的各个步骤。Please refer to FIG. 3 , which is a flow chart of the first embodiment of the speech recognition method of the present invention. It should be noted that, although the electronic device 1 shown in FIG. 1 is used as an example below to illustrate the voice recognition method of the present invention, the present invention is not limited to be applicable to the electronic device 1, and any other electronic device with a similar structure can also be used. The speech recognition method of the present invention is applicable. As shown in FIG. 3 , the speech recognition method of the present invention includes steps S301 to S306. Each step of the method will be described in detail below.
步骤S301:取得一声音信号。电子装置1通过声音接收模块10来取得使用者所发出声音信号,以进行后续处理。Step S301: Obtain an audio signal. The electronic device 1 obtains the sound signal from the user through the sound receiving module 10 for subsequent processing.
步骤S302:转换声音信号为数字信号。通过处理模块20的转换单元22针对声音信号进行傅立叶转换,已取得对应的数字信号。Step S302: converting the sound signal into a digital signal. The conversion unit 22 of the processing module 20 performs Fourier transformation on the sound signal to obtain a corresponding digital signal.
步骤S303:自数字信号中取得多个数值。转换单元22自转换后的数字信号中,先过滤掉低于一标准值的振幅峰值,以减少其所造成类似噪声的误判影响,接着自剩余振幅峰值中取得多个数值,其中各数值代表对应的声波振幅值。Step S303: Obtain a plurality of values from the digital signal. The conversion unit 22 first filters out the amplitude peak value lower than a standard value from the converted digital signal, so as to reduce the misjudgment effect similar to noise caused by it, and then obtains a plurality of values from the remaining amplitude peak value, wherein each value represents The corresponding sound wave amplitude value.
步骤S304:依序比对各个数值相对于至少一相邻数值的大小,以取得一组比对结果值。处理模块20的比对编码单元24依据前述所取得的多个数值,针对各个数值与至少一相邻数值相互比对,并依比对结果不同而赋予不同的比对结果值,借此以取得一组比对结果值。Step S304: Sequentially compare the magnitude of each numerical value relative to at least one adjacent numerical value to obtain a set of comparison result values. The comparison encoding unit 24 of the processing module 20 compares each numerical value with at least one adjacent numerical value according to the multiple numerical values obtained above, and assigns different comparison result values according to the comparison results, thereby obtaining A set of comparison result values.
步骤S305:判断该组比对结果值是否对应至少一操作指令的其中之一。处理模块20依据所取得的该组比对结果值与至少一操作指令所对应的至少一组参考值相比对,判断任一组参考值与该组比对结果值是否相符或数值差异量在一定范围内。Step S305: Determine whether the group of comparison result values corresponds to one of at least one operation instruction. The processing module 20 compares the obtained set of comparison result values with at least one set of reference values corresponding to at least one operation instruction, and judges whether any set of reference values is consistent with the set of comparison result values or whether the numerical difference is within within a certain range.
步骤S306:执行对应的操作指令。当处理模块20判断有任一组参考值与该组比对结果值是否相符或数值差异量在一定范围内时,即表示该组比对结果值所对应的声音信号代表该组参考值所对应的操作指令,因此处理模块20可执行对应的操作指令。Step S306: Execute the corresponding operation instruction. When the processing module 20 judges whether any set of reference values is consistent with the set of comparison result values or whether the numerical difference is within a certain range, it means that the sound signal corresponding to the set of comparison result values represents the value corresponding to the set of reference values. The operation instruction, so the processing module 20 can execute the corresponding operation instruction.
借此设计,将使用者所发出的声音信息通过信号转换、过滤、比对分析等处理,来判别此声音信息是否与现存的操作指令相符,以决定是否执行对应的操作指令。With this design, the voice information sent by the user is processed through signal conversion, filtering, comparison analysis, etc. to judge whether the voice information is consistent with the existing operation instructions, so as to determine whether to execute the corresponding operation instructions.
请参考图4是本发明的语音辨识方法的第二实施例的流程图。如图4所示,其中步骤S401至步骤S406与前述步骤S301至步骤S306相同,而本实施例还包括步骤S407、S408及至步骤S409。Please refer to FIG. 4 , which is a flowchart of a second embodiment of the speech recognition method of the present invention. As shown in FIG. 4 , the steps S401 to S406 are the same as the aforementioned steps S301 to S306 , and this embodiment further includes steps S407 , S408 to S409 .
于步骤S401之前还包括步骤S407:检测是否接收一启动信号。当使用者欲使用电子装置1执行本发明的语音辨识方法时,先通过处理模块20检测是否接收使用者通过启动模块40所输入的启动信号,以决定是否启动声音撷取的相关功能。当检测到此启动信号时,处理模块20即通知声音接收模块10开始进行声音撷取。Step S407 is also included before step S401: detecting whether a start signal is received. When the user intends to use the electronic device 1 to execute the voice recognition method of the present invention, the processing module 20 first detects whether the activation signal input by the user through the activation module 40 is received, so as to determine whether to activate the related function of voice capture. When the activation signal is detected, the processing module 20 notifies the sound receiving module 10 to start sound capture.
于步骤S405之后还包括步骤S408:检测是否接收一设定信号。当通过步骤S405判断该组比对结果值并未对应任一操作指令时,处理模块20检测是否接收到使用者所输入的设定信号,以判断是否针对该组比对结果值所对应的声音信号重新进行操作指令的定义。After the step S405, a step S408 is further included: detecting whether a setting signal is received. When it is determined through step S405 that the group of comparison result values does not correspond to any operation command, the processing module 20 detects whether a setting signal input by the user is received to determine whether the sound corresponding to the group of comparison result values The signal redefines the definition of the operation instruction.
于步骤S408之后还包括步骤S409:设定声音信号以对应一新操作指令。当处理模块20检测到使用者所输入的设定信号时,则依据此设定信号将声音信息重新定义以对应新的操作指令。After the step S408, a step S409 is further included: setting the sound signal to correspond to a new operation command. When the processing module 20 detects the setting signal input by the user, it redefines the sound information according to the setting signal to correspond to the new operation instruction.
通过本发明的设计,除了应用于遥控器或类似装置上,让使用者可通过语音输入方式通过遥控器发出对应的操作指令,以针对遥控器的对应装置来进行操作外,本发明也可直接应用于待操作的电子装置本体上,或是通过电子装置本体间接控制与该电子装置连接的周边装置,其实施状态依设计需求不同而改变,达到任意装置均可通过语音操控的效果。Through the design of the present invention, in addition to being applied to remote controls or similar devices, users can issue corresponding operating instructions through the remote control through voice input to operate on the corresponding devices of the remote control, and the present invention can also directly It is applied to the main body of the electronic device to be operated, or indirectly controls the peripheral devices connected to the electronic device through the main body of the electronic device, and its implementation status changes according to different design requirements, so that any device can be controlled by voice.
综上所陈,本发明无论就目的、手段及功效,均显示其迥异于现有技术。但是须注意,上述实施例仅为例示性说明本发明的原理及其功效,而非用于限制本发明的范围。任何熟于此项技艺的人士均可在不违背本发明的技术原理及精神下,对实施例作修改与变化。In summary, the present invention is quite different from the prior art in terms of purpose, means and effects. However, it should be noted that the above-mentioned embodiments are only illustrative to illustrate the principles and effects of the present invention, and are not intended to limit the scope of the present invention. Any person familiar with the art can modify and change the embodiments without violating the technical principle and spirit of the present invention.
Claims (16)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201110343163.6A CN103092330B (en) | 2011-10-27 | 2011-10-27 | Electronic device and voice recognition method thereof |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201110343163.6A CN103092330B (en) | 2011-10-27 | 2011-10-27 | Electronic device and voice recognition method thereof |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103092330A CN103092330A (en) | 2013-05-08 |
| CN103092330B true CN103092330B (en) | 2015-11-25 |
Family
ID=48205006
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201110343163.6A Expired - Fee Related CN103092330B (en) | 2011-10-27 | 2011-10-27 | Electronic device and voice recognition method thereof |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN103092330B (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104113144B (en) * | 2014-06-27 | 2017-12-05 | 青岛众海汇智能源科技有限责任公司 | Support the wireless feeder equipment and its method of supplying power to of multi-load |
| CN106296890B (en) * | 2016-07-22 | 2019-06-04 | 北京小米移动软件有限公司 | Unlocking method, device and mobile terminal of mobile terminal |
| CN109346081A (en) * | 2018-12-20 | 2019-02-15 | 广州河东科技有限公司 | Voice control method, device, equipment and storage medium |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101114448A (en) * | 2006-07-28 | 2008-01-30 | 佛山市顺德区顺达电脑厂有限公司 | Voice identification system |
| CN100514446C (en) * | 2004-09-16 | 2009-07-15 | 北京中科信利技术有限公司 | Pronunciation evaluating method based on voice identification and voice analysis |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7206420B2 (en) * | 1999-11-29 | 2007-04-17 | Syfx Tekworks | Softclip method and apparatus |
| US8155965B2 (en) * | 2005-03-11 | 2012-04-10 | Qualcomm Incorporated | Time warping frames inside the vocoder by modifying the residual |
-
2011
- 2011-10-27 CN CN201110343163.6A patent/CN103092330B/en not_active Expired - Fee Related
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN100514446C (en) * | 2004-09-16 | 2009-07-15 | 北京中科信利技术有限公司 | Pronunciation evaluating method based on voice identification and voice analysis |
| CN101114448A (en) * | 2006-07-28 | 2008-01-30 | 佛山市顺德区顺达电脑厂有限公司 | Voice identification system |
Also Published As
| Publication number | Publication date |
|---|---|
| CN103092330A (en) | 2013-05-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10049665B2 (en) | Voice recognition method and apparatus using video recognition | |
| US9354842B2 (en) | Apparatus and method of controlling voice input in electronic device supporting voice recognition | |
| JP6012877B2 (en) | Voice control system and method for multimedia device and computer storage medium | |
| US20070274591A1 (en) | Input apparatus and input method thereof | |
| CN104580699B (en) | Acoustic control intelligent terminal method and device when a kind of standby | |
| US10551973B2 (en) | Method of controlling a mobile device | |
| CN102591448A (en) | Information processing apparatus, information processing method, and computer-readable storage medium | |
| CN110825446B (en) | Parameter configuration method and device, storage medium and electronic equipment | |
| CN104750254B (en) | A kind of information processing method and device, electronic equipment | |
| US20120197420A1 (en) | Signal processing device, signal processing method, and program | |
| US20150310878A1 (en) | Method and apparatus for determining emotion information from user voice | |
| CN111816192A (en) | Voice equipment and control method, device and equipment thereof | |
| CN103198829A (en) | Method, device and equipment of reducing interior noise and improving voice recognition rate | |
| CN103092330B (en) | Electronic device and voice recognition method thereof | |
| CN105159492A (en) | Music play control method and terminal device | |
| CN111785267A (en) | A kind of interactive control method, device and computer readable storage medium | |
| CN203167075U (en) | Mobile terminal | |
| EP3282680B1 (en) | Blowing action-based method for operating mobile terminal and mobile terminal | |
| CN102608931B (en) | System and method for controlling electronic equipment | |
| CN103945152A (en) | Television set and method for voice control over television set | |
| TWI539442B (en) | Electronic apparatus and voice recognition method thereof | |
| CN103943103B (en) | Electronic device and voice processing method thereof | |
| CN110321020B (en) | Piezoelectric sensor signal noise removing method and device | |
| CN114839891B (en) | Intelligent nightstand control method, computer and storage medium based on internet of things | |
| CN114171039B (en) | Signal processing method and device and electronic equipment |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20151125 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |