TWI633484B - Activation assisting apparatus, speech operation system and method thereof - Google Patents
Activation assisting apparatus, speech operation system and method thereof Download PDFInfo
- Publication number
- TWI633484B TWI633484B TW102121753A TW102121753A TWI633484B TW I633484 B TWI633484 B TW I633484B TW 102121753 A TW102121753 A TW 102121753A TW 102121753 A TW102121753 A TW 102121753A TW I633484 B TWI633484 B TW I633484B
- Authority
- TW
- Taiwan
- Prior art keywords
- wireless transmission
- mobile terminal
- terminal device
- transmission module
- voice
- Prior art date
Links
- 230000004913 activation Effects 0.000 title claims description 56
- 238000000034 method Methods 0.000 title claims description 36
- 230000005540 biological transmission Effects 0.000 claims abstract description 138
- 238000004891 communication Methods 0.000 claims abstract description 70
- 230000001960 triggered effect Effects 0.000 claims abstract description 14
- 230000006870 function Effects 0.000 claims description 52
- 239000011521 glass Substances 0.000 claims description 4
- 238000012545 processing Methods 0.000 description 75
- 230000003321 amplification Effects 0.000 description 37
- 238000003199 nucleic acid amplification method Methods 0.000 description 37
- 238000005070 sampling Methods 0.000 description 23
- 230000004044 response Effects 0.000 description 19
- 230000000875 corresponding effect Effects 0.000 description 16
- 230000003993 interaction Effects 0.000 description 11
- 238000013461 design Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 9
- 230000005236 sound signal Effects 0.000 description 8
- 230000009471 action Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000010295 mobile communication Methods 0.000 description 5
- 230000002787 reinforcement Effects 0.000 description 5
- 230000011218 segmentation Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 238000010411 cooking Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Landscapes
- Mobile Radio Communication Systems (AREA)
- Telephone Function (AREA)
Abstract
一種輔助啟動裝置,用以輔助開啟一行動終端裝置的一語音系統。上述行動終端裝置包括一第一無線傳輸模組,以及耦接第一無線傳輸模組的語音系統。輔助啟動裝置包括一第二無線傳輸模組及一觸發模組。第二無線傳輸模組與第一無線傳輸模組匹配,觸發模組耦接第二無線傳輸模組。當觸發模組被觸發時,第二無線傳輸模組透過一無線通訊協定與行動終端裝置的第一無線傳輸模組連結,以啟動行動終端裝置的語音系統。 An auxiliary starting device for assisting in turning on a voice system of a mobile terminal device. The mobile terminal device includes a first wireless transmission module and a voice system coupled to the first wireless transmission module. The auxiliary starting device comprises a second wireless transmission module and a trigger module. The second wireless transmission module is matched with the first wireless transmission module, and the trigger module is coupled to the second wireless transmission module. When the trigger module is triggered, the second wireless transmission module is coupled to the first wireless transmission module of the mobile terminal device via a wireless communication protocol to activate the voice system of the mobile terminal device.
Description
本發明是有關於一種開啟語音功能的技術,且特別是有關於一種輔助啟動裝置、語音操控系統及其方法。 The present invention relates to a technique for turning on a voice function, and more particularly to an auxiliary activation device, a voice control system, and a method thereof.
隨著科技的發展,具有語音系統之行動終端裝置已日漸普及。上述的語音系統是透過語音理解技術,讓使用者與行動終端裝置進行溝通。舉例來說,使用者只要對上述的行動終端裝置講出某項要求,例如想要查車次、查天氣或是欲撥打電話等,系統便會依據使用者的語音信號,採取對應的動作。上述的動作可能是以語音方式回答使用者問題或是依照使用者指令去驅使行動終端裝置的系統進行動作。 With the development of technology, mobile terminal devices with voice systems have become increasingly popular. The above voice system is a voice understanding technology that allows the user to communicate with the mobile terminal device. For example, if the user speaks a certain request to the mobile terminal device, for example, if he wants to check the number of times, check the weather, or want to make a call, the system will take corresponding actions according to the user's voice signal. The above actions may be to answer the user's question by voice or to drive the system of the mobile terminal device to operate according to the user's instruction.
然而,在語音系統的技術發展過程中,卻面臨一些問題亟待解決。例如:語音結合雲端伺服器之資料安全性、語音系統啟動的便捷性等問題。 However, in the process of technological development of the voice system, there are some problems to be solved. For example: voice combined with the data security of the cloud server, the convenience of the voice system startup and so on.
以語音結合雲端伺服器之資料安全性來說,目前是以語音交互系統結合雲端技術的概念,將複雜而需要強大運算能力支 援的語音處理過程交由雲端伺服器來執行。雖然這樣的方式可大幅降低行動終端裝置所需配置硬體的成本。但是,對於需要透過通訊錄進行通話、傳簡訊等動作來說,由於需藉由上傳通訊錄至雲端伺服器中以找尋通話或傳簡訊的對象,因此通訊錄的保密將是一個重要的議題。雖然雲端伺服器可以採用加密連線,並且採取即用即傳、不保存的方式,還是難以消除使用者對上述作法的擔憂。 In terms of voice and cloud server security, the concept of voice interaction system combined with cloud technology is complex and requires powerful computing power. The voice processing process is performed by the cloud server. Although such an approach can significantly reduce the cost of configuring the hardware required for the mobile terminal device. However, for the need to make calls and send text messages through the address book, the confidentiality of the address book will be an important issue because it is necessary to upload the address book to the cloud server to find the object of the call or the text message. Although the cloud server can use encrypted connection and adopts the method of ready-to-use and non-storage, it is difficult to eliminate the user's concerns about the above-mentioned practices.
另一方面,以語音系統啟動的便捷性來說,目前大都是 觸發行動終端裝置的螢幕其所顯示的應用程式來啟動,或者透過行動終端裝置所設置的實體按鍵來啟動。上述的設計皆須透過行動終端裝置本身來啟動,但是在某些場合,上述的設計卻是相當的不便。比如說:在行車期間,而行動終端裝置被放置於口袋或是提袋中,或者在廚房做菜時,需要撥打位於客廳的行動電話,以詢問友人食譜細節等使用者無法立即觸及行動終端裝置,但需使語音系統開啟的情況。 On the other hand, in terms of the convenience of voice system startup, most of the current The application displayed by the screen of the mobile terminal device is activated or activated by a physical button set by the mobile terminal device. The above design must be initiated by the mobile terminal device itself, but in some cases, the above design is quite inconvenient. For example, during driving, when the mobile terminal device is placed in a pocket or a bag, or when cooking in the kitchen, it is necessary to dial a mobile phone located in the living room to ask the user for details of the recipe, etc., and the user cannot immediately touch the mobile terminal device. , but the voice system needs to be turned on.
此外,行動終端裝置中的擴音功能同樣也有類似的問 題。雖然目前使用者可以透過手指操作行動電話,或是用單手持握行動電話以將行動電話貼近耳朵以啟動擴音功能。但是,當使用者無法立即觸及行動終端裝置,但需使擴音功能時,目前需透過行動終端裝置本身來啟動的設計仍將造成使用者的不便。 In addition, the sound reinforcement function in the mobile terminal device also has similar questions. question. Although the user can currently operate the mobile phone through a finger, or hold the mobile phone with a single hand to bring the mobile phone close to the ear to activate the sound amplification function. However, when the user cannot immediately access the mobile terminal device, but needs to make the sound amplification function, the design that needs to be activated by the mobile terminal device itself still causes inconvenience to the user.
因此,如何改進上述的這些缺點,成為亟待解決的議題。 Therefore, how to improve these shortcomings has become an urgent issue to be solved.
本發明提供一種輔助啟動裝置、語音操控系統及其方法,其可讓使用者透過輔助啟動裝置而無線地啟動行動通訊裝置所提供的語音對話功能,以增進使用上的便利性。 The invention provides an auxiliary starting device, a voice control system and a method thereof, which enable a user to wirelessly activate a voice dialogue function provided by the mobile communication device through the auxiliary starting device to improve the convenience in use.
本發明提出一種語音操控系統,此語音操控系統包括一輔助啟動裝置與一行動終端裝置。上述輔助啟動裝置用以輔助開啟行動終端裝置的一語音系統,且輔助啟動裝置包括一第一無線傳輸模組。行動終端裝置包括一第二無線傳輸模組與上述的語音系統。第二無線傳輸模組與第一無線傳輸模組匹配,語音系統耦接第二無線傳輸模組。當輔助啟動裝置被觸發時,第一無線傳輸模組透過一無線通訊協定與行動終端裝置的第二無線傳輸模組連結,以啟動行動終端裝置的語音系統。 The invention provides a voice control system comprising an auxiliary starting device and a mobile terminal device. The auxiliary activation device is configured to assist in opening a voice system of the mobile terminal device, and the auxiliary activation device includes a first wireless transmission module. The mobile terminal device includes a second wireless transmission module and the voice system described above. The second wireless transmission module is matched with the first wireless transmission module, and the voice system is coupled to the second wireless transmission module. When the auxiliary activation device is triggered, the first wireless transmission module is coupled to the second wireless transmission module of the mobile terminal device via a wireless communication protocol to activate the voice system of the mobile terminal device.
本發明另提出一種輔助啟動裝置,用以輔助開啟一行動終端裝置的一語音系統。上述行動終端裝置包括一第一無線傳輸模組,以及耦接第一無線傳輸模組的語音系統。輔助啟動裝置包括一第二無線傳輸模組及一觸發模組。第二無線傳輸模組與第一無線傳輸模組匹配,觸發模組耦接第二無線傳輸模組。當觸發模組被觸發時,第二無線傳輸模組透過一無線通訊協定與行動終端裝置的第一無線傳輸模組連結,以啟動行動終端裝置的語音系統。 The invention further provides an auxiliary starting device for assisting in opening a voice system of a mobile terminal device. The mobile terminal device includes a first wireless transmission module and a voice system coupled to the first wireless transmission module. The auxiliary starting device comprises a second wireless transmission module and a trigger module. The second wireless transmission module is matched with the first wireless transmission module, and the trigger module is coupled to the second wireless transmission module. When the trigger module is triggered, the second wireless transmission module is coupled to the first wireless transmission module of the mobile terminal device via a wireless communication protocol to activate the voice system of the mobile terminal device.
本發明又提出一種語音操控的方法,適用於一行動終端裝置與一輔助啟動裝置。上述輔助啟動裝置用以輔助開啟行動終端裝置的語音系統。輔助啟動裝置及行動終端裝置分別具有相互 匹配的第一無線傳輸模組與第二無線傳輸模組。上述語音操控的方法係第一無線傳輸模組接收一觸發信號。接著,啟動第一無線傳輸模組,並發送一無線傳輸信號。然後,第二無線傳輸模組接收無線傳輸信號,以啟動語音系統。 The invention further proposes a voice manipulation method suitable for a mobile terminal device and an auxiliary activation device. The auxiliary starting device is used to assist in turning on the voice system of the mobile terminal device. The auxiliary starting device and the mobile terminal device respectively have mutual Matching the first wireless transmission module and the second wireless transmission module. The above voice control method is that the first wireless transmission module receives a trigger signal. Then, the first wireless transmission module is activated and a wireless transmission signal is transmitted. Then, the second wireless transmission module receives the wireless transmission signal to activate the voice system.
基於上述,當行動通訊裝置接收來自輔助啟動裝置的無線傳輸信號以啟動語音系統時,行動通訊裝置便可開始接收使用者的語音信號,以透過語音理解模組來解析所述語音信號。如此一來,使用者透過輔助啟動裝置而無線地啟動行動通訊裝置所提供的語音對話功能,藉以增進使用上的便利性。 Based on the above, when the mobile communication device receives the wireless transmission signal from the auxiliary activation device to activate the voice system, the mobile communication device can start receiving the user's voice signal to parse the voice signal through the voice understanding module. In this way, the user wirelessly activates the voice conversation function provided by the mobile communication device through the auxiliary activation device, thereby improving the convenience of use.
為讓本發明之上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明如下。 The above described features and advantages of the present invention will be more apparent from the following description.
100、200‧‧‧語音操控系統 100,200‧‧‧ voice control system
110‧‧‧輔助啟動裝置 110‧‧‧Auxiliary starter
112、122‧‧‧無線傳輸模組 112, 122‧‧‧Wireless transmission module
114‧‧‧觸發模組 114‧‧‧Trigger module
116‧‧‧無線充電電池 116‧‧‧Wireless rechargeable battery
1162‧‧‧電池單元 1162‧‧‧ battery unit
1164‧‧‧無線充電模組 1164‧‧‧Wireless charging module
120、220、420‧‧‧行動終端裝置 120, 220, 420‧‧‧ mobile terminal devices
121、426‧‧‧語音系統 121, 426‧‧ ‧ voice system
124、610‧‧‧語音取樣模組 124, 610‧‧‧Voice sampling module
127‧‧‧語音輸出介面 127‧‧‧Voice output interface
128、424‧‧‧通訊模組 128, 424‧‧‧Communication Module
130、410‧‧‧(雲端)伺服器 130, 410‧‧‧ (cloud) server
132‧‧‧語音理解模組 132‧‧‧Voice Understanding Module
1322‧‧‧語音辨識模組 1322‧‧‧Voice recognition module
1324‧‧‧語音處理模組 1324‧‧‧Voice Processing Module
400‧‧‧語音交互系統 400‧‧‧Voice Interactive System
412、422、660‧‧‧處理單元 412, 422, 660‧‧ ‧ processing unit
414‧‧‧傳輸模組 414‧‧‧Transmission module
428‧‧‧儲存單元 428‧‧‧ storage unit
429‧‧‧通訊錄 429‧‧‧Contacts
330‧‧‧顯示單元 330‧‧‧Display unit
620‧‧‧輸入單元 620‧‧‧ input unit
630‧‧‧撥接單元 630‧‧‧Dial-up unit
640‧‧‧聽筒 640‧‧‧ earpiece
650‧‧‧擴音設備 650‧‧‧Audio equipment
670‧‧‧耳機 670‧‧‧ headphones
S302~S312、S501~S519、S710~S770‧‧‧步驟 S302~S312, S501~S519, S710~S770‧‧‧ steps
DRC‧‧‧通話接收資料 DRC‧‧‧Call receiving data
DTC‧‧‧通話傳送資料 DTC‧‧‧Call transmission data
SAI‧‧‧輸入音頻信號 SAI‧‧‧ input audio signal
SAO‧‧‧輸出音頻信號 SAO‧‧‧ output audio signal
SIO‧‧‧輸入操作信號 SIO‧‧‧ input operation signal
圖1是依照本發明一實施例所繪示之語音操控系統的方塊圖。 1 is a block diagram of a voice control system in accordance with an embodiment of the invention.
圖2是依照本發明另一實施例所繪示之語音操控系統的方塊圖。 2 is a block diagram of a voice manipulation system in accordance with another embodiment of the present invention.
圖3是依照本發明一實施例所繪示之語音操控方法的流程圖。 FIG. 3 is a flow chart of a voice control method according to an embodiment of the invention.
圖4是依照本發明一實施例之語音交互系統的方塊圖。 4 is a block diagram of a voice interaction system in accordance with an embodiment of the present invention.
圖5是依照本發明一實施例之用於語音交互系統的語音通信 流程的示意圖。 FIG. 5 is a voice communication for a voice interaction system according to an embodiment of the invention Schematic diagram of the process.
圖6為依據本發明一實施例的行動終端裝置的系統示意圖。 FIG. 6 is a schematic diagram of a system of a mobile terminal device according to an embodiment of the invention.
圖7為依據本發明一實施例的行動終端裝置的通話擴音功能的自動啟動方法的流程圖。 FIG. 7 is a flowchart of a method for automatically starting a call amplification function of a mobile terminal device according to an embodiment of the present invention.
雖然現今的行動終端裝置已可提供語音系統,以讓使用者發出語音來和行動終端裝置溝通,但使用者在啟動此語音系統時,仍必須透過行動終端裝置本身來啟動。因此在使用者無法立即觸及行動終端裝置,但需使語音系統開啟的情況,往往無法滿足使用者立即的需求。為此,本發明提出一種輔助語音系統開啟的裝置及其對應的方法,讓使用者能夠更便捷地開啟語音系統。為了使本發明之內容更為明瞭,以下特舉實施例作為本發明確實能夠據以實施的範例。 Although the mobile terminal device of the present invention can provide a voice system for the user to make a voice to communicate with the mobile terminal device, the user must still activate the mobile terminal device itself when the voice system is activated. Therefore, when the user cannot immediately touch the mobile terminal device, but the voice system needs to be turned on, the user's immediate needs are often not met. To this end, the present invention proposes an apparatus for assisting the activation of a voice system and a corresponding method thereof, so that the user can turn on the voice system more conveniently. In order to clarify the content of the present invention, the following specific examples are given as examples in which the present invention can be implemented.
圖1是依照本發明一實施例所繪示之語音操控系統的方塊圖。請參照圖1,語音操控系統100包括輔助啟動裝置110、行動終端裝置120以及伺服器130。在本實施例中,輔助啟動裝置110會透過無線傳輸信號,來啟動行動終端裝置120的語音系統,使得行動終端裝置120根據語音信號與伺服器130進行溝通。 1 is a block diagram of a voice control system in accordance with an embodiment of the invention. Referring to FIG. 1 , the voice control system 100 includes an auxiliary activation device 110 , a mobile terminal device 120 , and a server 130 . In the present embodiment, the auxiliary activation device 110 activates the voice system of the mobile terminal device 120 by wirelessly transmitting signals, so that the mobile terminal device 120 communicates with the server 130 according to the voice signal.
詳細而言,輔助啟動裝置110包括第一無線傳輸模組112以及觸發模組114,其中觸發模組114耦接於第一無線傳輸模組112。第一無線傳輸模組112例如是支援無線相容認證(Wireless fidelity,Wi-Fi)、全球互通微波存取(Worldwide Interoperability for Microwave Access,WiMAX)、藍芽(Bluetooth)、超寬頻(ultra-wideband,UWB)或射頻識別(Radio-frequency identification,RFID)等通訊協定的裝置,其可發出無線傳輸信號,以和另一無線傳輸模組彼此對應而建立無線連結。觸發模組114例如為按鈕、按鍵等。在本實施例中,當使用者按壓此觸發模組114產生一觸發信號後,第一無線傳輸模組112接收此觸發信號而啟動,此時第一無線傳輸模組112會發出無線傳輸信號,並透過第一無線傳輸模組112傳送此無線傳輸信號至行動終端裝置120。在一實施例中,上述的輔助啟動裝置110可為一藍牙耳機。 In detail, the auxiliary activation device 110 includes a first wireless transmission module 112 and a trigger module 114 , wherein the trigger module 114 is coupled to the first wireless transmission module 112 . The first wireless transmission module 112 supports, for example, wireless compatible authentication (Wireless) Fidelity, Wi-Fi, Worldwide Interoperability for Microwave Access (WiMAX), Bluetooth, Ultra-wideband (UWB) or Radio-frequency Identification (RFID) A protocol device that can transmit a wireless transmission signal to establish a wireless connection with another wireless transmission module. The trigger module 114 is, for example, a button, a button, or the like. In this embodiment, after the user presses the trigger module 114 to generate a trigger signal, the first wireless transmission module 112 receives the trigger signal and starts, and the first wireless transmission module 112 sends a wireless transmission signal. And transmitting the wireless transmission signal to the mobile terminal device 120 through the first wireless transmission module 112. In an embodiment, the auxiliary activation device 110 may be a Bluetooth headset.
值得注意的是,雖然目前有些免持的耳機/麥克風亦具有啟動行動終端裝置120某些功能的設計,但本發明的另一實施例中,輔助啟動裝置110可以不同於上述的耳機/麥克風。上述的耳機/麥克風藉由與行動終端裝置的連線,以取代行動終端裝置120上的耳機/麥克風而進行聽/通話,啟動功能為附加設計,但本案之輔助啟動裝置110”僅”用於開啟行動終端裝置120中的語音系統,並不具有聽/通話的功能,故內部的電路設計可簡化,成本也較低。換言之,相對於上述的免持耳機/麥克風而言,輔助啟動裝置110是另外裝置,即使用者可能同時具備免持的耳機/麥克風以及本案的輔助啟動裝置110。 It should be noted that although some hands-free headsets/microphones currently have a design to activate certain functions of the mobile terminal device 120, in another embodiment of the present invention, the auxiliary activation device 110 may be different from the earphone/microphone described above. The above-mentioned earphone/microphone performs listening/talking by replacing the earphone/microphone on the mobile terminal device 120 by connecting with the mobile terminal device, and the startup function is an additional design, but the auxiliary activation device 110" of the present invention is only used for Turning on the voice system in the mobile terminal device 120 does not have the function of listening/talking, so the internal circuit design can be simplified and the cost is low. In other words, the auxiliary starting device 110 is another device with respect to the above-mentioned hands-free headset/microphone, that is, the user may have both the hands-free earphone/microphone and the auxiliary starting device 110 of the present case.
此外,上述的輔助啟動裝置110的形體可以是使用者隨手可及的用品,例如戒指、手錶、耳環、項鍊、眼鏡等裝飾品, 即各種隨身可攜式物品,或者是安裝構件,例如為配置於方向盤上的行車配件,不限於上述。也就是說,輔助啟動裝置110為”生活化”的裝置,透過內部系統的設置,讓使用者能夠輕易地觸碰到觸發模組114,以開啟語音系統。舉例來說,當輔助啟動裝置110的形體為戒指時,使用者可輕易地移動手指來按壓戒指的觸發模組114使其被觸發。另一方面,當輔助啟動裝置110的形體為配置於行車配件的裝置時,使用者亦能夠在行車期間輕易地觸發行車配件裝置的觸發模組114。此外,相較於配戴耳機/麥克風進行聽/通話的不舒適感,使用本案之輔助啟動裝置110可以將行動終端裝置120中的語音系統開啟,甚至進而開啟擴音功能(後將詳述),使得使用者在不需配戴耳機/麥克風,仍可直接透過行動終端裝置120進行聽/通話。另外,對於使用者而言,這些”生活化”的輔助啟動裝置110為原本就會配戴或使用的物品,故在使用上不會有不習慣或是不舒適感的問題,即不需要花時間適應。舉例來說,當使用者在廚房做菜時,需要撥打放置於客廳的行動電話時,假設其配戴具有戒指、項鍊或手錶形體之本發明的輔助啟動裝置110,就可以輕觸戒指、項鍊或手錶以開啟語音系統以詢問友人食譜細節。雖然目前部份具有啟動功能的耳機/麥克風亦可以達到上述的目的,但是在每次做菜的過程中,並非每次都需要撥打電話請教友人,故對於使用者來說,隨時配戴耳機/麥克風做菜,以備隨時操控行動終端裝置可說是相當的不方便。 In addition, the shape of the above-mentioned auxiliary starting device 110 may be an item that the user can easily access, such as a ring, a watch, an earring, a necklace, a glasses, and the like. That is, various portable items, or mounting members, such as a traveling accessory disposed on the steering wheel, are not limited to the above. That is to say, the auxiliary activation device 110 is a "living" device, and through the setting of the internal system, the user can easily touch the trigger module 114 to turn on the voice system. For example, when the shape of the auxiliary activation device 110 is a ring, the user can easily move the finger to press the trigger module 114 of the ring to be triggered. On the other hand, when the shape of the auxiliary starting device 110 is a device disposed on the driving accessory, the user can also easily trigger the triggering module 114 of the driving accessory device during driving. In addition, the auxiliary activation device 110 of the present invention can turn on the voice system in the mobile terminal device 120, and even turn on the amplification function (described later), compared to the discomfort of listening/talking with the earphone/microphone. Therefore, the user can still listen/talk directly through the mobile terminal device 120 without wearing the earphone/microphone. In addition, for the user, these "living" auxiliary starting devices 110 are items that would otherwise be worn or used, so there is no problem of uncomfortable or uncomfortable use, that is, no flowers are needed. Time to adapt. For example, when a user is cooking in a kitchen and needs to dial a mobile phone placed in a living room, assuming that it is wearing the auxiliary activation device 110 of the present invention having a ring, a necklace or a watch shape, the ring and the necklace can be tapped. Or watch to turn on the voice system to ask for friend recipe details. Although some earphones/microphones with start-up function can achieve the above purposes, in the process of cooking, not every time you need to call a friend, so for the user, wear headphones at any time. It is quite inconvenient to cook the microphone in order to control the mobile terminal device at any time.
在其他實施例中,輔助啟動裝置110還可配置有無線充 電電池116,用以驅動第一無線傳輸模組112。進一步而言,無線充電電池116包括電池單元1162以及無線充電模組1164,其中無線充電模組1164耦接於電池單元1162。在此,無線充電模組1164可接收來自一無線供電裝置(未繪示)所供應的能量,並將此能量轉換為電力來對電池單元1162充電。如此一來,輔助啟動裝置110的第一無線傳輸模組112可便利地透過無線充電電池116來進行充電。 In other embodiments, the auxiliary activation device 110 can also be configured with a wireless charger. The electric battery 116 is configured to drive the first wireless transmission module 112. Further, the wireless charging battery 116 includes a battery unit 1162 and a wireless charging module 1164. The wireless charging module 1164 is coupled to the battery unit 1162. Here, the wireless charging module 1164 can receive energy supplied from a wireless power supply device (not shown) and convert the energy into power to charge the battery unit 1162. In this way, the first wireless transmission module 112 of the auxiliary activation device 110 can be conveniently charged through the wireless rechargeable battery 116.
另一方面,行動終端裝置120例如為行動電話(Cell phone)、個人數位助理(Personal Digital Assistant,PDA)手機、智慧型手機(Smart phone),或是安裝有通訊軟體的掌上型電腦(Pocket PC)、平板型電腦(Tablet PC)或筆記型電腦等等。行動終端裝置120可以是任何具備通訊功能的可攜式(Portable)行動裝置,在此並不限制其範圍。此外,行動終端裝置120可使用Android作業系統、Microsoft作業系統、Android作業系統、Linux作業系統等等,不限於上述。 On the other hand, the mobile terminal device 120 is, for example, a Cell phone, a Personal Digital Assistant (PDA) mobile phone, a smart phone, or a Pocket PC equipped with a communication software (Pocket PC). ), Tablet PC or laptop, and more. The mobile terminal device 120 can be any portable mobile device with communication function, and the scope is not limited herein. Further, the mobile terminal device 120 may use an Android operating system, a Microsoft operating system, an Android operating system, a Linux operating system, etc., without being limited to the above.
行動終端裝置120包括第二無線傳輸模組122,第二無線傳輸模組122能與輔助啟動裝置110的第一無線傳輸模組112相匹配,並採用相對應的無線通訊協定(例如無線相容認證、全球互通微波存取、藍芽、超寬頻通訊協定或射頻識別等通訊協定),藉以與第一無線傳輸模組112建立無線連結。值得注意的是,在此所述的”第一”無線傳輸模組112、”第二”無線傳輸模組122係用以說明無線傳輸模組配置於不同的裝置,並非用以限定本發 明。 The mobile terminal device 120 includes a second wireless transmission module 122, and the second wireless transmission module 122 can be matched with the first wireless transmission module 112 of the auxiliary activation device 110, and adopts a corresponding wireless communication protocol (for example, wireless compatibility). A communication protocol such as authentication, global interoperability microwave access, Bluetooth, ultra-wideband communication protocol or radio frequency identification is established to establish a wireless connection with the first wireless transmission module 112. It should be noted that the "first" wireless transmission module 112 and the "second" wireless transmission module 122 are used to describe that the wireless transmission module is configured in different devices, and is not intended to limit the present invention. Bright.
在其他實施例中,行動終端裝置120還包括語音系統121,此語音系統121耦接於第二無線傳輸模組122,故使用者觸發輔助啟動裝置110的觸發模組114後,能透過第一無線傳輸模組112與第二無線傳輸模組122無線地啟動語音系統121。在一實施例中,此語音系統121可包括語音取樣模組124以及語音輸出介面127。語音取樣模組124用以接收來自使用者的語音信號,此語音取樣模組124例如為麥克風(Microphone)等接收音訊的裝置。上述的語音輸出介面127例如為喇叭或耳機等。 In other embodiments, the mobile terminal device 120 further includes a voice system 121. The voice system 121 is coupled to the second wireless transmission module 122. Therefore, after the user triggers the trigger module 114 of the auxiliary activation device 110, the user can transmit the first The wireless transmission module 112 and the second wireless transmission module 122 wirelessly activate the voice system 121. In an embodiment, the voice system 121 can include a voice sampling module 124 and a voice output interface 127. The voice sampling module 124 is configured to receive a voice signal from a user. The voice sampling module 124 is, for example, a device for receiving audio such as a microphone. The above-described voice output interface 127 is, for example, a speaker or an earphone.
另外,行動終端裝置120還可配置有通訊模組128。通訊模組128例如是能傳遞與接收無線訊號的元件,如射頻收發器。進一步而言,通訊模組128能夠讓使用者透過行動終端裝置120接聽或撥打電話或使用電信業者所提供的其他服務。在本實施例中,通訊模組128可透過網際網路接收來自伺服器130的應答資訊,並依據此應答資訊建立行動終端裝置120與至少一電子裝置之間的通話連線,其中所述電子裝置例如為另一行動終端裝置(未繪示)。 In addition, the mobile terminal device 120 may also be configured with a communication module 128. The communication module 128 is, for example, an element capable of transmitting and receiving wireless signals, such as a radio frequency transceiver. Further, the communication module 128 enables the user to answer or make a call through the mobile terminal device 120 or use other services provided by the carrier. In this embodiment, the communication module 128 can receive the response information from the server 130 through the Internet, and establish a call connection between the mobile terminal device 120 and the at least one electronic device according to the response information, wherein the electronic device The device is, for example, another mobile terminal device (not shown).
伺服器130例如為網路伺服器或雲端伺服器等,其具有語音理解模組132。在本實施例中,語音理解模組132包括語音辨識模組1322以及語音處理模組1324,其中語音處理模組1324耦接於語音辨識模組1322。在此,語音辨識模組1322會接收從語音取樣模組124傳來的語音信號,以將語音信號轉換成多個分段語 義(例如詞彙或字句等)。語音處理模組1324則可依據這些分段語義而解析出這些分段語義所代表的意指(例如意圖、時間、地點等),進而判斷出上述語音信號中所表示的意思。此外,語音處理模組1324還會根據所解析的結果產生對應的應答資訊。在本實施例中,語音理解模組132可由一個或數個邏輯閘組合而成的硬體電路來實作,亦可以是以電腦程式碼來實作。值得一提的是,在另一實施例中,語音理解模組132可配置於行動終端裝置220中,如圖2所示之語音操控系統200。 The server 130 is, for example, a web server or a cloud server, and has a speech understanding module 132. In this embodiment, the voice recognition module 132 includes a voice recognition module 1322 and a voice processing module 1324. The voice processing module 1324 is coupled to the voice recognition module 1322. Here, the speech recognition module 1322 receives the speech signal transmitted from the speech sampling module 124 to convert the speech signal into a plurality of segmentation languages. Meaning (such as words or words, etc.). The speech processing module 1324 can parse the meanings (such as intent, time, location, etc.) represented by the segmentation semantics according to the segmentation semantics, and thereby determine the meaning represented in the speech signal. In addition, the voice processing module 1324 also generates corresponding response information according to the parsed result. In this embodiment, the speech understanding module 132 can be implemented by a hardware circuit composed of one or several logic gates, or can be implemented by a computer program code. It is worth mentioning that in another embodiment, the voice understanding module 132 can be configured in the mobile terminal device 220, such as the voice control system 200 shown in FIG.
以下即搭配上述語音操控系統100來說明語音操控的方法。圖3是依照本發明一實施例所繪示之語音操控方法的流程圖。請同時參照圖1及圖3,於步驟302中,輔助啟動裝置110發送無線傳輸信號至行動終端裝置120。詳細的說明是,當輔助啟動裝置110的第一無線傳輸模組112因接收到一觸發信號被觸發時,此輔助啟動裝置110會發送無線傳輸信號至行動終端裝置120。具體而言,當輔助啟動裝置110中的觸發模組114被使用者按壓時,此時觸發模組114會因觸發信號被觸發,而使第一無線傳輸模組112發送無線傳輸信號至行動終端裝置120的第二無線傳輸模組122,藉以使得第一無線傳輸模組112透過無線通訊協定與第二無線傳輸模組122連結。上述的輔助啟動裝置110僅用於開啟行動終端裝置120中的語音系統,並不具有聽/通話的功能,故內部的電路設計可簡化,成本也較低。換言之,相對於一般行動終端裝置120所附加的免持耳機/麥克風而言,輔助啟動裝置110是另一 裝置,即使用者可能同時具備免持的耳機/麥克風以及本案的輔助啟動裝置110。 The following is a description of the voice manipulation method in conjunction with the voice control system 100 described above. FIG. 3 is a flow chart of a voice control method according to an embodiment of the invention. Referring to FIG. 1 and FIG. 3 simultaneously, in step 302, the auxiliary activation device 110 transmits a wireless transmission signal to the mobile terminal device 120. The detailed description is that when the first wireless transmission module 112 of the auxiliary activation device 110 is triggered by receiving a trigger signal, the auxiliary activation device 110 transmits a wireless transmission signal to the mobile terminal device 120. Specifically, when the trigger module 114 in the auxiliary activation device 110 is pressed by the user, the trigger module 114 is triggered by the trigger signal, and the first wireless transmission module 112 sends the wireless transmission signal to the mobile terminal. The second wireless transmission module 122 of the device 120 is configured such that the first wireless transmission module 112 is coupled to the second wireless transmission module 122 via a wireless communication protocol. The above-mentioned auxiliary starting device 110 is only used to activate the voice system in the mobile terminal device 120, and does not have the function of listening/talking, so the internal circuit design can be simplified and the cost is low. In other words, the auxiliary starting device 110 is another with respect to the hands-free headset/microphone attached to the general mobile terminal device 120. The device, that is, the user may have both a hands-free headset/microphone and an auxiliary activation device 110 of the present invention.
值得一提的是,上述的輔助啟動裝置110的形體可以是 使用者隨手可及的用品,例如戒指、手錶、耳環、項鍊、眼鏡等各種隨身可攜式物品,或者是安裝構件,例如為配置於方向盤上的行車配件,不限於上述。也就是說,輔助啟動裝置110為”生活化”的裝置,透過內部系統的設置,讓使用者能夠輕易地觸碰到觸發模組114,以開啟語音系統121。因此,使用本案之輔助啟動裝置110可以將行動終端裝置120中的語音系統121開啟,甚至進而開啟擴音功能(後將詳述),使得使用者在不需配戴耳機/麥克風,仍可直接透過行動終端裝置120進行聽/通話。此外,對於使用者而言,這些”生活化”的輔助啟動裝置110為原本就會配戴或使用的物品,故在使用上不會有不習慣或是不舒適感的問題。 It is worth mentioning that the shape of the auxiliary starting device 110 described above may be The portable items that are accessible to the user, such as rings, watches, earrings, necklaces, glasses, and the like, or the mounting members, such as the traveling accessories disposed on the steering wheel, are not limited to the above. That is to say, the auxiliary activation device 110 is a "living" device, and through the setting of the internal system, the user can easily touch the trigger module 114 to turn on the voice system 121. Therefore, the auxiliary activation device 110 of the present invention can turn on the voice system 121 in the mobile terminal device 120, and even turn on the sound amplification function (which will be described in detail later), so that the user can directly directly without using the earphone/microphone. Listening/talking is performed through the mobile terminal device 120. In addition, for the user, these "living" auxiliary starting devices 110 are items that would otherwise be worn or used, so there is no problem of uncomfortable or uncomfortable use.
此外,第一無線傳輸模組112與第二無線傳輸模組122皆可處於睡眠模式或工作模式。其中,睡眠模式指的是無線傳輸模組為關閉狀態,亦即無線傳輸模組不會接收/偵測無線傳輸信號,而無法與其它無線傳輸模組連結。工作模式指的是無線傳輸模組為開啟狀態,亦即無線傳輸模組可不斷地偵測無線傳輸信號,或隨時發送無線傳輸信號,而能夠與其它無線傳輸模組連結。在此,當觸發模組114被觸發時,倘若第一無線傳輸模組112處於睡眠模式,則觸發模組114會喚醒第一無線傳輸模組112,使第一無線傳輸模組112進入工作模式,並使第一無線傳輸模組112 發送無線傳輸信號至第二無線傳輸模組122,而讓第一無線傳輸模組112透過無線通訊協定與行動終端裝置120的第二無線傳輸模組122連結。 In addition, both the first wireless transmission module 112 and the second wireless transmission module 122 can be in a sleep mode or an operating mode. The sleep mode refers to the wireless transmission module being in a closed state, that is, the wireless transmission module does not receive/detect wireless transmission signals, and cannot be connected with other wireless transmission modules. The working mode refers to the wireless transmission module being turned on, that is, the wireless transmission module can continuously detect wireless transmission signals, or can transmit wireless transmission signals at any time, and can be connected with other wireless transmission modules. Here, when the trigger module 114 is triggered, if the first wireless transmission module 112 is in the sleep mode, the trigger module 114 wakes up the first wireless transmission module 112, and causes the first wireless transmission module 112 to enter the working mode. And causing the first wireless transmission module 112 The wireless transmission module is sent to the second wireless transmission module 122, and the first wireless transmission module 112 is coupled to the second wireless transmission module 122 of the mobile terminal device 120 via the wireless communication protocol.
另一方面,為了避免第一無線傳輸模組112持續維持在 工作模式而消耗過多的電力,在第一無線傳輸模組112進入工作模式後的預設時間(例如為5分鐘)內,倘若觸發模組114未再被觸發,則第一無線傳輸模組112會自工作模式進入睡眠模式,並停止與行動終端裝置120的第二無線傳輸模組120連結。 On the other hand, in order to avoid the first wireless transmission module 112 being continuously maintained The working mode consumes too much power. Within a preset time (for example, 5 minutes) after the first wireless transmission module 112 enters the working mode, if the triggering module 114 is not triggered again, the first wireless transmission module 112 The sleep mode is entered from the work mode, and the second wireless transmission module 120 of the mobile terminal device 120 is stopped.
之後,於步驟304中,行動終端裝置120的第二無線傳 輸模組122會接收無線傳輸信號,以啟動語音系統121。接著,於步驟S306,當第二無線傳輸模組122偵測到無線傳輸信號時,行動終端裝置120可啟動語音系統121,而語音系統的121取樣模組124可開始接收語音信號,例如「今天溫度幾度?」、「打電話給老王。」、「請查詢電話號碼。」等等。 Thereafter, in step 304, the second wireless transmission of the mobile terminal device 120 The transmission module 122 receives the wireless transmission signal to activate the voice system 121. Next, in step S306, when the second wireless transmission module 122 detects the wireless transmission signal, the mobile terminal device 120 can activate the voice system 121, and the 121 sampling module 124 of the voice system can start receiving the voice signal, for example, "Today How many degrees?", "Call to Pharaoh.", "Please check the phone number."
於步驟S308,語音取樣模組124會將上述語音信號傳送 至伺服器130中的語音理解模組132,以透過語音理解模組132解析語音信號以及產生應答資訊。進一步而言,語音理解模組132中的語音辨識模組1322會接收來自語音取樣模組124的語音信號,並將語音信號分割成多個分段語義,而語音處理模組1324則會對上述分段語義進行語音理解,以產生用以回應語音信號的應答資訊。 In step S308, the voice sampling module 124 transmits the voice signal. The speech understanding module 132 in the server 130 analyzes the speech signal and generates response information through the speech understanding module 132. Further, the speech recognition module 1322 in the speech understanding module 132 receives the speech signal from the speech sampling module 124 and divides the speech signal into a plurality of segmentation semantics, and the speech processing module 1324 The segmentation semantics perform speech understanding to generate response information for responding to the speech signal.
在本發明之另一實施例中,行動終端裝置120更可接收 語音處理模組1324所產生的應答資訊,據以透過語音輸出介面127輸出應答資訊中的內容或執行應答資訊所下達的操作。於步驟S310,行動終端裝置120的會接收語音理解模組132所產生的應答資訊,並依據應答資訊中的內容(例如詞彙或字句等)產生語音應答。並且,於步驟S312,語音輸出介面127會接收並輸出此語音應答。 In another embodiment of the present invention, the mobile terminal device 120 is more receivable. The response information generated by the voice processing module 1324 is used to output the content of the response information or perform the operation of the response information through the voice output interface 127. In step S310, the mobile terminal device 120 receives the response information generated by the speech understanding module 132, and generates a speech response according to the content (such as a vocabulary or a sentence) in the response information. And, in step S312, the voice output interface 127 receives and outputs the voice response.
舉例而言,當使用者按壓輔助啟動裝置110中的觸發模 組114時,第一無線傳輸模組112則會發送無線傳輸信號至第二無線傳輸模組122,使得行動終端裝置120啟動語音系統121的語音取樣模組124。在此,假設來自使用者的語音信號為一詢問句,例如「今天溫度幾度?」,則語音取樣模組124便會接收並將此語音信號傳送至伺服器130中的語音理解模組132進行解析,且語音理解模組132可將解析所產生的應答資訊傳送回行動終端裝置120。假設語音理解模組132所產生的應答資訊中的內容為「30℃」,則語音輸出介面127能將此語音應播報給使用者。 For example, when the user presses the trigger mode in the auxiliary activation device 110 In the group 114, the first wireless transmission module 112 transmits a wireless transmission signal to the second wireless transmission module 122, so that the mobile terminal device 120 activates the voice sampling module 124 of the voice system 121. Here, assuming that the voice signal from the user is a query sentence, such as "Today's temperature is a few degrees?", the voice sampling module 124 receives and transmits the voice signal to the voice understanding module 132 in the server 130. The speech understanding module 132 can transmit the response information generated by the parsing back to the mobile terminal device 120. Assuming that the content of the response information generated by the speech understanding module 132 is "30 ° C", the speech output interface 127 can broadcast the speech to the user.
在另一實施例中,假設來自使用者的語音信號為一命令 句,例如「打電話給老王。」,則語音理解模組132中可辨識出此命令句為「撥電話給老王的請求」。此外,語音理解模組132會再產生新的應答資訊,例如「請確認是否撥給老王」,並將此新的應答資訊傳送至行動終端裝置120。在此,此新的應答資訊透過語音輸出介面127播報於使用者。更進一步地說,當使用者的應答為「是」之類的肯定答案時,類似地,語音取樣模組124可接收並 傳送此語音信號至伺服器130,以讓語音理解模組132進行解析。語音理解模組132解析結束後,便會在應答資訊記錄有一撥號指令資訊,並傳送至行動終端裝置120。此時,通訊模組128則會依據電話資料庫所記錄的聯絡人資訊,查詢出「老王」的電話號碼,以建立行動終端裝置120與另一電子裝置之間的通話連線,亦即撥號給「老王」。 In another embodiment, assume that the voice signal from the user is a command For example, "call to Pharaoh.", the speech understanding module 132 can recognize the command sentence as "a request to dial a phone to Pharaoh." In addition, the voice understanding module 132 will generate new response information, such as "Please confirm whether to dial to Pharaoh", and transmit the new response information to the mobile terminal device 120. Here, the new response message is broadcast to the user via the voice output interface 127. Further, when the user's response is a positive answer such as "Yes", similarly, the voice sampling module 124 can receive and The voice signal is transmitted to the server 130 for the voice understanding module 132 to parse. After the speech understanding module 132 finishes parsing, a dialing command information is recorded in the response information and transmitted to the mobile terminal device 120. At this time, the communication module 128 queries the phone number of the "Pharaoh" according to the contact information recorded in the phone database to establish a call connection between the mobile terminal device 120 and another electronic device, that is, Dial to "Pharaoh."
在其他實施例中,除上述的語音操控系統100外,亦可利用語音操控系統200或其他類似的系統,進行上述的操作方法,並不以上述的實施例為限。 In other embodiments, in addition to the voice control system 100 described above, the voice operation system 200 or other similar system may be used to perform the above operation method, which is not limited to the above embodiments.
綜上所述,在本實施例之語音操控系統與方法中,輔助啟動裝置能夠無線地開啟行動終端裝置的語音功能。而且,此輔助啟動裝置的形體可以是使用者隨手可及的”生活化”的用品,例如戒指、手錶、耳環、項鍊、眼鏡等裝飾品,即各種隨身可攜式物品,或者是安裝構件,例如為配置於方向盤上的行車配件,不限於上述。如此一來,相較於目前另外配戴免持耳機/麥克風的不舒適感,使用本案之輔助啟動裝置110來開啟行動終端裝置120中的語音系統將更為便利。 In summary, in the voice control system and method of the embodiment, the auxiliary activation device can wirelessly activate the voice function of the mobile terminal device. Moreover, the shape of the auxiliary starting device may be a "living" item accessible by the user, such as a ring, a watch, an earring, a necklace, an eyeglass, etc., that is, various portable items, or a mounting member. For example, the traveling accessory disposed on the steering wheel is not limited to the above. As a result, it is more convenient to use the auxiliary activation device 110 of the present invention to turn on the voice system in the mobile terminal device 120 compared to the current discomfort of wearing the hands-free headset/microphone.
值得注意的是,上述具有語音理解模組的伺服器130可能為網路伺服器或雲端伺服器,而雲端伺服器可能會涉及到使用者的隱私權的問題。例如,使用者需上傳完整的通訊錄至雲端伺服器,才能完成如撥打電話、發簡訊等與通訊錄相關的操作。即使雲端伺服器採用加密連線,並且即用即傳不保存,還是難以消 除使用者的擔優。據此,以下提供另一種語音操控的方法及其對應的語音交互系統,行動終端裝置可在不上傳完整通訊錄的情況下,與雲端伺服器來執行語音交互服務。為了使本發明之內容更為明瞭,以下特舉實施例作為本發明確實能夠據以實施的範例。 It should be noted that the server 130 with the voice understanding module may be a network server or a cloud server, and the cloud server may involve the privacy of the user. For example, the user needs to upload a complete address book to the cloud server in order to complete operations related to the address book such as making a call, sending a text message, and the like. Even if the cloud server uses encrypted connection, and it is not saved, it is difficult to eliminate. In addition to the user's superiority. Accordingly, the following provides another voice control method and a corresponding voice interaction system thereof, and the mobile terminal device can perform a voice interaction service with the cloud server without uploading the complete address book. In order to clarify the content of the present invention, the following specific examples are given as examples in which the present invention can be implemented.
圖4是依照本發明一實施例之語音交互系統的方塊圖。請參照圖4,語音交互系統400可包括雲端伺服器410以及行動終端裝置420,雲端伺服器410以及行動終端裝置420可相互連線。語音交互系統400是透過雲端伺服器410來進行語音交互服務。即,由具有強大運算能力的雲端伺服器410來處理語音識別,藉此降低行動終端裝置420的資料處理負載,還可提升語音識別的準確性及識別速度。 4 is a block diagram of a voice interaction system in accordance with an embodiment of the present invention. Referring to FIG. 4, the voice interaction system 400 can include a cloud server 410 and a mobile terminal device 420. The cloud server 410 and the mobile terminal device 420 can be connected to each other. The voice interaction system 400 is a voice interaction service through the cloud server 410. That is, the voice recognition is processed by the cloud server 410 having powerful computing capability, thereby reducing the data processing load of the mobile terminal device 420, and improving the accuracy and recognition speed of the voice recognition.
在行動終端裝置420中,包括處理單元422、通訊模組424、語音系統426、儲存單元428。在一實施例中,行動終端裝置420還配置有一顯示單元430。其中,處理單元422耦接至通訊模組424、語音系統426、儲存單元428以及顯示單元430。儲存單元428中更儲存有一通訊錄429。 The mobile terminal device 420 includes a processing unit 422, a communication module 424, a voice system 426, and a storage unit 428. In an embodiment, the mobile terminal device 420 is further configured with a display unit 430. The processing unit 422 is coupled to the communication module 424, the voice system 426, the storage unit 428, and the display unit 430. An address book 429 is further stored in the storage unit 428.
上述處理單元422為具備運算能力的硬體(例如晶片組、處理器等),用以控制行動終端裝置420的整體運作。處理單元422例如是中央處理單元(Central Processing Unit,CPU),或是其他可程式化之微處理器(Microprocessor)、數位信號處理器(Digital Signal Processor,DSP)、可程式化控制器、特殊應用積體電路(Application Specific Integrated Circuits,ASIC)、可程式化邏輯 裝置(Programmable Logic Device,PLD)或其他類似裝置。 The processing unit 422 is a hardware (for example, a chipset, a processor, or the like) having computing power for controlling the overall operation of the mobile terminal device 420. The processing unit 422 is, for example, a central processing unit (CPU), or other programmable microprocessor (Microprocessor), a digital signal processor (DSP), a programmable controller, and a special application. Application Specific Integrated Circuits (ASIC), programmable logic Programmable Logic Device (PLD) or other similar device.
上述通訊模組424例如為網路卡,其可以是經由有線傳輸或無線傳輸與雲端伺服器410進行溝通。而上述語音系統426至少包括麥克風等收音器,以將聲音轉換為電子信號。上述儲存單元428例如為隨機存取記憶體(Random Access Memory,RAM)、唯讀記憶體(Read-Only Memory,ROM)、快閃記憶體(Flash memory)或磁碟儲存裝置(Magnetic disk storage device)等。上述顯示單元430例如為液晶顯示器(Liquid Crystal Display,LCD)或是具有觸控模組的觸控螢幕(touch screen)等。 The communication module 424 is, for example, a network card, which can communicate with the cloud server 410 via wired transmission or wireless transmission. The speech system 426 described above includes at least a microphone such as a microphone to convert the sound into an electrical signal. The storage unit 428 is, for example, a random access memory (RAM), a read-only memory (ROM), a flash memory, or a magnetic disk storage device. )Wait. The display unit 430 is, for example, a liquid crystal display (LCD) or a touch screen with a touch module.
另一方面,雲端伺服器410為具有強大運算能力的實體主機,或者可以是由一群實體主機組成的一個超級虛擬電腦,藉以來執行大型任務。在此,雲端伺服器410包括處理單元412及通訊模組414。在此,雲端伺服器410的通訊模組414,耦接至其處理單元412。通訊模組414用以與行動終端裝置420的通訊模組424進行溝通。通訊模組414,例如為網路卡,其可以是經由有線傳輸或無線傳輸與行動終端裝置420進行溝通。 On the other hand, the cloud server 410 is a physical host with powerful computing power, or can be a super virtual computer composed of a group of physical hosts, so as to perform large tasks. Here, the cloud server 410 includes a processing unit 412 and a communication module 414. Here, the communication module 414 of the cloud server 410 is coupled to its processing unit 412. The communication module 414 is configured to communicate with the communication module 424 of the mobile terminal device 420. The communication module 414 is, for example, a network card, which can communicate with the mobile terminal device 420 via wired transmission or wireless transmission.
另外,雲端伺服器410中的處理單元412為具有更強大的運算能力,例如為多核心的CPU、或者由多個CPU所組成CPU陣列。雲端伺服器410的處理單元412例如至少包括如圖1所示的語音理解模組132。處理單元412可透過語音理解模組來對自行動終端裝置420所接收的語音信號進行解析。而雲端伺服器410透過通訊模組414將解析的結果傳送至行動終端裝置420,使得行 動終端裝置420得以依據結果來執行對應的動作。 In addition, the processing unit 412 in the cloud server 410 is a CPU array having more powerful computing capabilities, such as a multi-core CPU or a plurality of CPUs. The processing unit 412 of the cloud server 410 includes, for example, at least the speech understanding module 132 as shown in FIG. The processing unit 412 can parse the voice signal received from the mobile terminal device 420 through the voice understanding module. The cloud server 410 transmits the parsed result to the mobile terminal device 420 through the communication module 414, so that the line The mobile terminal device 420 can perform a corresponding action depending on the result.
以下即搭配上述圖4來說明於語音交互系統的語音交換流程。 The voice exchange process in the voice interaction system will be described below with reference to FIG. 4 described above.
圖5是依照本發明一實施例之用於語音交互系統的語音通信流程的示意圖。請同時參照圖4及圖5,在步驟S501中,於行動終端裝置420中,透過語音系統426接收第一語音信號,並且在步驟S503中,透過通訊模組424將第一語音信號傳送至雲端伺服器410。在此,行動終端裝置420例如是透過語音系統426中的麥克風等元件而自使用者接收第一語音信號。舉例來說,假設行動終端裝置420為手機,使用者對著手機說出“打電話給老王”,則語音系統426在接收此語音信號”打電話給老王”後,會透過通訊模組424將此語音信號”打電話給老王”傳送至雲端伺服器410。在一實施例中,上述的語音系統426可藉由圖1~圖3所示之輔助啟動裝置進行啟動。 FIG. 5 is a schematic diagram of a voice communication flow for a voice interaction system according to an embodiment of the invention. Referring to FIG. 4 and FIG. 5 simultaneously, in step S501, the first voice signal is received by the voice system 426 in the mobile terminal device 420, and the first voice signal is transmitted to the cloud through the communication module 424 in step S503. Server 410. Here, the mobile terminal device 420 receives the first voice signal from the user, for example, through a component such as a microphone in the voice system 426. For example, if the mobile terminal device 420 is a mobile phone, and the user speaks "call to Pharaoh" to the mobile phone, the voice system 426 will receive the voice signal "calling the Pharaoh" and then pass the communication module. 424 transmits the voice signal "call to Lao Wang" to the cloud server 410. In an embodiment, the voice system 426 can be activated by the auxiliary activation device shown in FIGS. 1 to 3.
接著,在步驟S505中,於雲端伺服器410中,處理單元412利用語音理解模組來解析第一語音信號,並且,在步驟S507中,處理單元412將由第一語音信號所獲得的通信目標,透過通訊模組414傳送至行動終端裝置420。以第一語音信號的內容“打電話給老王”為例,雲端伺服器410的處理單元412可利用語音理解模組來解析第一語音信號,藉此獲得通信指令與通信目標。即,語音理解模組可解析出第一語音信號包括“打電話”與“老王”,據此,雲端伺服器410的處理單元412便能夠判斷出通信 指令為撥號指令,以及通信目標為“老王”,並透過通訊模組414傳送至行動終端裝置420。 Next, in step S505, in the cloud server 410, the processing unit 412 parses the first voice signal using the voice understanding module, and in step S507, the processing unit 412 will obtain the communication target obtained by the first voice signal, It is transmitted to the mobile terminal device 420 through the communication module 414. Taking the content of the first voice signal "calling Pharaoh" as an example, the processing unit 412 of the cloud server 410 can use the voice understanding module to parse the first voice signal, thereby obtaining the communication command and the communication target. That is, the voice understanding module can parse out the first voice signal including "calling" and "pharaoh", according to which the processing unit 412 of the cloud server 410 can determine the communication. The command is a dialing command, and the communication target is "Pharaoh" and transmitted to the mobile terminal device 420 through the communication module 414.
然後,在步驟S509中,於行動終端裝置420中,行動終 端裝置420的處理單元422依據通信目標搜尋儲存單元428中的通訊錄429,並獲得符合通信目標的選擇列表。例如,行動終端裝置420的處理單元422在搜尋通訊錄的過程中,找到多筆具有“王”的聯絡人資訊,因而產生選擇列表,並顯示於顯示單元430中,以供使用者進行選擇。 Then, in step S509, in the mobile terminal device 420, the action ends The processing unit 422 of the end device 420 searches the address book 429 in the storage unit 428 according to the communication target, and obtains a selection list conforming to the communication target. For example, the processing unit 422 of the mobile terminal device 420 finds a plurality of contact information with "king" in the process of searching for the address book, thereby generating a selection list and displaying it in the display unit 430 for the user to make a selection.
舉例來說,選擇列表例如底下表1所示,在通訊錄中搜 尋符合通信目標“老王”的聯絡人資訊。在此例中,假設找到4筆符合的聯絡人資訊,並且將聯絡人資訊中的聯絡人名稱,即“王聰明”、“王五”、“王安石”以及“王維”,寫入至選擇列表中。 For example, the selection list is shown in the following table, for example, in the address book. Find contact information that meets the communication target "Pharaoh". In this example, assume that 4 matching contact information is found, and the contact names in the contact information, namely "Wang Cong", "Wang Wu", "Wang Anshi" and "Wang Wei", are written to the selection list. in.
而倘若使用者對著行動終端裝置420說話,如步驟S511所示,行動終端裝置420會透過語音系統426而接收到第二語音信號。而在行動終端裝置420接收到第二語音信號的同時,在步驟S513中,行動終端裝置420會將第二語音信號與選擇列表透過 通訊模組424同時傳送至雲端伺服器410。例如:使用者在觀看到選擇列表之後而對著行動終端裝置420說出“第1筆”或“王聰明”等內容,而形成第二語音信號時,行動終端裝置420便會將第二語音信號與選擇列表一起傳送至雲端伺服器410。 If the user speaks to the mobile terminal device 420, the mobile terminal device 420 receives the second voice signal through the voice system 426 as shown in step S511. While the mobile terminal device 420 receives the second voice signal, the mobile terminal device 420 transmits the second voice signal and the selection list in step S513. The communication module 424 is simultaneously transmitted to the cloud server 410. For example, after the user views the selection list and speaks the content of "1st stroke" or "Wang smart" to the mobile terminal device 420, and forms a second voice signal, the mobile terminal device 420 will use the second voice. The signal is transmitted to the cloud server 410 along with the selection list.
另外,使用者亦可隨意說出其他內容,也就是說,不管 使用者說出的內容為何,只要行動終端裝置420接收到第二語音信號,便會同時將第二語音信號與選擇列表傳送至雲端伺服器410。 In addition, the user can also freely say other content, that is, regardless of What is said by the user is that as long as the mobile terminal device 420 receives the second voice signal, the second voice signal and the selection list are simultaneously transmitted to the cloud server 410.
值得一提的是,在本案中,並未將“完整”的通訊錄上傳 至雲端伺服器410,而只將符合通信目標以“選擇列表”的形式,上傳至雲端伺服器410以進行第二次語音信號分析。換言之,只有“部份”的聯絡人資料會被上傳。在一實施例中,行動終端裝置420上傳至雲端伺服器410的選擇列表中可以只包括聯絡人名稱,而不包括電話號碼或其他資訊。所上傳之選擇列表的內容可依使用者的需求而進行設定。 It is worth mentioning that in this case, the "complete" address book was not uploaded. To the cloud server 410, only the communication target is uploaded to the cloud server 410 in the form of a "selection list" for the second voice signal analysis. In other words, only "partial" contact information will be uploaded. In an embodiment, the mobile terminal device 420 uploads to the selection list of the cloud server 410 to include only the contact name, not the phone number or other information. The content of the uploaded selection list can be set according to the needs of the user.
此外,值得注意的是,在本案中,第二語音信號與選擇 列表同時傳送至雲端伺服器410,相較於目前不需上傳通訊錄的通信方法係需分次解析每一個語音信號及每一個列表,即一步驟僅包含一項資訊,本案的語音交換方法更為快速。 In addition, it is worth noting that in this case, the second voice signal and selection The list is simultaneously transmitted to the cloud server 410. Compared with the current communication method that does not need to upload the address book, each voice signal and each list need to be parsed in stages, that is, only one piece of information is included in one step, and the voice exchange method in this case is more For the quick.
接著,於雲端伺服器410中,處理單元412會利用語音 理解模組來解析第二語音信號,如步驟S515所示。例如,利用語音理解模組解析出第二語音信號所包括的內容為“第3個”,則雲 端伺服器410的處理單元412便可進一步去比對自行動終端裝置420所接收的選擇列表中的第3個聯絡人資訊。以表1為例,第3個聯絡人資訊即為“王安石”。 Then, in the cloud server 410, the processing unit 412 utilizes voice. The module is understood to parse the second speech signal as shown in step S515. For example, using the voice understanding module to parse out that the content included in the second voice signal is “the third one”, then the cloud The processing unit 412 of the end server 410 can further compare the third contact information in the selection list received from the mobile terminal device 420. Taking Table 1 as an example, the third contact information is “Wang Anshi”.
值得注意的是,透過如圖1所示的語音理解模組132的 設計,使用者不需完整講出選擇列表的內容作為第二語音信號,如”第1筆王聰明”,僅需講出部份選擇列表的內容,如“第1筆”或“王聰明”作為第二語音信號,並同時搭配選擇列表上傳至雲端伺服器的語音理解模組132,即可解析出選擇目標。換言之,選擇列表內容包含多個項目資訊,且每一個項目資訊至少具有編號及對應此編號的內容(如:姓名、電話號碼等),而第二語音信號來自於對應此編號的部份內容或編號。 It is worth noting that through the speech understanding module 132 as shown in FIG. Design, the user does not need to completely tell the content of the selection list as the second voice signal, such as "the first pen king smart", just need to tell the contents of some selection lists, such as "1st pen" or "Wang smart" As the second voice signal, and simultaneously with the selection list uploaded to the voice understanding module 132 of the cloud server, the selected target can be parsed. In other words, the selection list content includes a plurality of item information, and each item information has at least a number and a content corresponding to the number (eg, name, phone number, etc.), and the second voice signal is from a part of the content corresponding to the number or Numbering.
之後,在步驟S517中,雲端伺服器410透過其通訊模組 414將通信指令與選擇目標傳送至行動終端裝置420。而在其他實施例中,雲端伺服器410亦可在步驟S505解析完第一語音信號之後,即先傳送通信指令至行動終端裝置420儲存,之後再傳送選擇目標,在此並不限定通信指令的傳送時間點。 Thereafter, in step S517, the cloud server 410 transmits through its communication module. 414 transmits the communication command and the selection target to the mobile terminal device 420. In other embodiments, after the first voice signal is parsed in step S505, the cloud server 410 may first transmit the communication command to the mobile terminal device 420 for storage, and then transmit the selection target, where the communication command is not limited. Transfer time point.
在行動終端裝置420接收到通信指令與選擇目標之後, 在步驟S519中,行動終端裝置420透過其處理單元422對選擇目標,執行通信指令對應的通信動作。上述通信指令例如為撥號指令或傳訊指令等需使用該通訊錄內容的指令,而通信指令是由雲端伺服器410基於第一語音信號而獲得。例如,假設第一語音信號的內容為“打電話給老王”,則雲端伺服器410由“打電話” 而判斷出通信指令為撥號指令。又例如,假設第一語音信號的內容為“傳簡訊給老王”,則雲端伺服器410由“傳簡訊”而判斷出通信指令為傳訊指令。另外,上述選擇目標則是由雲端伺服器410基於第二語音信號以及選擇列表而獲得。以上述表1所示的選擇列表為例,假設第二語音信號的內容為“第3個”,則雲端伺服器410便可判斷出選擇目標為“王安石”。例如,撥打電話給選擇目標,或是啟動一傳訊介面,以傳送簡訊給選擇目標。 After the mobile terminal device 420 receives the communication command and the selection target, In step S519, the mobile terminal device 420 transmits a communication operation corresponding to the communication command to the selection target through its processing unit 422. The communication command is, for example, a command for using the address of the address book, such as a dialing instruction or a communication command, and the communication command is obtained by the cloud server 410 based on the first voice signal. For example, if the content of the first voice signal is "call to Pharaoh", the cloud server 410 is called "calling". And it is judged that the communication command is a dialing instruction. For another example, if the content of the first voice signal is “send a message to Pharaoh”, the cloud server 410 determines that the communication command is a communication command by “transmitting the message”. In addition, the above selection target is obtained by the cloud server 410 based on the second voice signal and the selection list. Taking the selection list shown in Table 1 as an example, if the content of the second voice signal is “3rd”, the cloud server 410 can determine that the selection target is “Wang Anshi”. For example, make a call to select a destination, or launch a messaging interface to send a newsletter to a selected destination.
值得注意的是,行動終端裝置420在上述步驟S509所獲 得的選擇列表中可以只包括聯絡人名稱,而不包括電話號碼或其他資訊。因此,當行動終端裝置420自雲端伺服器410接收到通信指令與選擇目標時,行動終端裝置420的處理單元422會自通訊錄中取出對應選擇目標的電話號碼,並依據電話號碼來執行通信指令對應的通信動作。 It is to be noted that the mobile terminal device 420 obtains the above step S509. The list of choices you can include only the contact name, not the phone number or other information. Therefore, when the mobile terminal device 420 receives the communication command and the selection target from the cloud server 410, the processing unit 422 of the mobile terminal device 420 extracts the phone number corresponding to the selected target from the address book, and executes the communication command according to the phone number. Corresponding communication action.
另外,在其他實施例中,行動終端裝置420在上述步驟 S509所獲得的選擇列表中亦可同時包括聯絡人名稱與電話號碼,或者更可包括其他資訊。因此,在步驟S515中,雲端伺服器410的處理單元412便能夠基於第二語音信號以及選擇列表,而獲得選擇目標的電話號碼,並且在步驟S517中,將通信指令與電話號碼傳送至行動終端裝置420。據此,在步驟S519中,行動終端裝置420依據電話號碼來執行通信指令對應的通信動作。 In addition, in other embodiments, the mobile terminal device 420 is in the above steps. The selection list obtained by S509 may also include the contact name and phone number, or may include other information. Therefore, in step S515, the processing unit 412 of the cloud server 410 can obtain the telephone number of the selection target based on the second voice signal and the selection list, and in step S517, transmit the communication instruction and the telephone number to the mobile terminal. Device 420. According to this, in step S519, the mobile terminal device 420 executes the communication operation corresponding to the communication command in accordance with the telephone number.
綜上所述,本案利用同時上傳第一語音所產生的選擇列 表、第二語音信號所產生的選擇目標的方式至具有強大運算能力 的雲端伺服器來執行語音理解程序,且此選擇列表僅包含部份的通訊錄。因此,本案的語音操控系統可同時保有較高的處理效能及較佳的安全性。 In summary, this case utilizes the selection column generated by simultaneously uploading the first voice. The manner in which the table and the second voice signal are generated to select a target to have powerful computing power The cloud server performs the speech understanding program, and this selection list only contains part of the address book. Therefore, the voice control system of the present case can maintain high processing performance and better security at the same time.
另一方面,值得注意的是,雖然上述的輔助啟動裝置解 決了使用者無法立即觸及行動終端裝置,但需使用語音系統問題,使得使用者可以藉由語音理解技術,讓使用者與行動終端裝置進行問答。然而,對於需要擴音功能開啟的情況,目前仍需透過行動終端裝置本身來啟動擴音功能,當使用者無法立即觸及行動終端裝置,但需使擴音功能時,目前需透過行動終端裝置本身來啟動的設計仍將造成使用者的不便。為此,本發明提出一種開啟擴音功能的方法及其對應的裝置,讓使用者能夠更便捷地開啟擴音功能。為了使本發明之內容更為明瞭,以下特舉實施例作為本發明確實能夠據以實施的範例。 On the other hand, it is worth noting that although the above auxiliary starter solution It is decided that the user cannot immediately touch the mobile terminal device, but the voice system problem is required, so that the user can perform the question and answer with the mobile terminal device through the voice understanding technology. However, in the case where the amplification function is required to be turned on, it is still necessary to activate the sound reinforcement function through the mobile terminal device itself. When the user cannot immediately touch the mobile terminal device, but needs to make the sound amplification function, it is currently required to pass through the mobile terminal device itself. The design to be launched will still cause inconvenience to the user. To this end, the present invention provides a method for turning on the sound amplification function and a corresponding device thereof, so that the user can turn on the sound reinforcement function more conveniently. In order to clarify the content of the present invention, the following specific examples are given as examples in which the present invention can be implemented.
圖6為依據本發明一實施例的行動終端裝置的系統示意 圖。請參照圖6,在本實施例中,行動終端裝置600包括語音系統、輸入單元620、撥接單元630、聽筒640、擴音設備650及處理單元660。在本發明的另一實施例中,行動終端裝置600更可包括耳機670。行動終端裝置600可以是行動電話或其他類似的電子裝置,其類似於圖1的行動終端裝置120,其詳細內容可參照前述內容,於此不再贅述。處理單元660耦接語音取樣模組610、輸入單元620、撥接單元630、聽筒640、擴音設備650、耳機670。語音系統包括語音取樣模組610,此語音取樣模組610將聲音轉換為輸 入語音信號SAI,上述的語音取樣模組610可以是麥克風或類似的電子元件。換言之,語音取樣模組610可視為語音系統的一部份,而此所述的語音系統類似於圖1的語音系統121,其詳細內容可參照前述內容,於此不再贅述。輸入單元620對應使用者的操作提供輸入操作信號SIO,且輸入單元620可以是鍵盤、觸控面板或類似的電子元件。撥接單元630用以受控於處理單元660執行撥接功能。聽筒640、擴音設備650、耳機670用以將處理單元660提供的輸出語音信號SAO轉換為聲音,故可視為聲音輸出介面。上述的擴音設備650例如是揚聲器等。上述的耳機670可以是有線耳機及無線耳機的至少其中之一。 FIG. 6 is a schematic diagram of a system of a mobile terminal device according to an embodiment of the present invention; Figure. Referring to FIG. 6, in the embodiment, the mobile terminal device 600 includes a voice system, an input unit 620, a dialing unit 630, an earpiece 640, a sound amplification device 650, and a processing unit 660. In another embodiment of the present invention, the mobile terminal device 600 may further include an earphone 670. The mobile terminal device 600 can be a mobile phone or other similar electronic device, which is similar to the mobile terminal device 120 of FIG. 1 . For details, refer to the foregoing content, and details are not described herein again. The processing unit 660 is coupled to the voice sampling module 610, the input unit 620, the dialing unit 630, the earpiece 640, the sound amplification device 650, and the earphone 670. The voice system includes a voice sampling module 610, and the voice sampling module 610 converts the voice into a voice. The voice sampling module 610 can be a microphone or similar electronic component. In other words, the voice sampling module 610 can be regarded as a part of the voice system, and the voice system is similar to the voice system 121 of FIG. 1 . For details, refer to the foregoing content, and details are not described herein. The input unit 620 provides an input operation signal SIO corresponding to the user's operation, and the input unit 620 may be a keyboard, a touch panel, or the like. The dialing unit 630 is configured to control the processing unit 660 to perform a dialing function. The earpiece 640, the sound amplification device 650, and the earphone 670 are used to convert the output speech signal SAO provided by the processing unit 660 into a sound, so that it can be regarded as a sound output interface. The above-described sound amplification device 650 is, for example, a speaker or the like. The earphone 670 described above may be at least one of a wired earphone and a wireless earphone.
由上可知,語音功能的開啟可以透過按壓行動通訊裝置 的實體按鍵、操控螢幕或是利用本發明之輔助啟動裝置。在假設語音功能已開啟的情況下,當使用者對著行動終端裝置600講話時,透過語音取樣模組610可將聲音轉換為輸入語音信號SAI,處理單元660可依照輸入語音信號SAI,針對通訊錄中的聯絡人名稱或電話號碼等資訊進行內容匹配時,當通訊錄中的資訊與輸入語音信號SAI相符時,處理單元660則可開啟撥接單元630的撥接功能及擴音設備650,以便接通後,使用者可與聯絡人的通話。詳細的說明是,處理單元660會將輸入語音信號SAI轉換為一輸入字串,並且將輸入字串與通訊錄中的多個聯絡人名稱、多個電話號碼等資訊比較。當輸入字串符合這些聯絡人名稱、這些電話號碼等資訊的其中之一時,處理單元660開啟撥接單元630 的撥接功能。相反地,當輸入字串不符合這些聯絡人名稱及這些電話號碼時,處理單元660不開啟撥接單元630的撥接功能。 As can be seen from the above, the voice function can be turned on by pressing the mobile communication device. The physical button, the manipulation screen or the auxiliary activation device of the present invention. When the voice function is turned on, when the user speaks to the mobile terminal device 600, the voice sampling module 610 can convert the voice into the input voice signal SAI, and the processing unit 660 can respond to the input voice signal SAI. When the information such as the contact name or the phone number is recorded for content matching, when the information in the address book matches the input voice signal SAI, the processing unit 660 can enable the dialing function of the dialing unit 630 and the sound amplification device 650. In order to be connected, the user can talk to the contact person. In detail, the processing unit 660 converts the input speech signal SAI into an input string and compares the input string with information such as a plurality of contact names, a plurality of telephone numbers, and the like in the address book. When the input string matches one of the contact name, the phone number, and the like, the processing unit 660 opens the dialing unit 630. Dial-up feature. Conversely, when the input string does not match the contact names and the phone numbers, the processing unit 660 does not turn on the dialing function of the dialing unit 630.
換言之,本實施例中,當處理單元660確認輸入語音信 號SAI與通訊錄中的內容匹配時,處理單元660會提供啟動信號,以便自動開啟行動終端裝置100的通話擴音功能。詳言之,處理單元660會自動提供啟動信號至擴音設備650,並且將輸入語音信號SAI轉換為通話傳送資料DTC,並透過撥接單元630傳送通話傳送資料DTC至聯絡人(另一行動終端裝置,未繪示)。同時,處理單元660會透過撥接單元630接收通話接收資料DRC,並依據通話接收資料DRC提供輸出音頻信號SAO至擴音設備650,以將輸出音頻信號SAO轉換為聲音,並以擴音的方式將聲音輸出。 In other words, in this embodiment, when the processing unit 660 confirms the input voice message When the number SAI matches the content in the address book, the processing unit 660 provides a start signal to automatically turn on the call amplification function of the mobile terminal device 100. In detail, the processing unit 660 automatically provides an activation signal to the sound amplification device 650, and converts the input voice signal SAI into a call transmission data DTC, and transmits the call transmission data DTC to the contact person through the dial-up unit 630 (another mobile terminal) Device, not shown). At the same time, the processing unit 660 receives the call receiving data DRC through the dialing unit 630, and provides the output audio signal SAO to the sound amplifying device 650 according to the call receiving data DRC, to convert the output audio signal SAO into sound, and expands the sound. Output the sound.
值得一提的是,以目前啟動擴音功能的方式來說,仍是 採用透過行動終端裝置本身來啟動的方式進行,但當使用者無法立即觸及行動終端裝置,卻需使用擴音功能時,目前的設計將造成使用者的不便。所以,在本實施例中,在語音系統開啟的情況下,可以透過語音撥接的動作,進一步開啟擴音功能,方便使用者進行通話。 It is worth mentioning that, in the current way of starting the sound reinforcement function, it is still It is implemented by means of the mobile terminal device itself, but when the user cannot immediately access the mobile terminal device but needs to use the sound amplification function, the current design will cause inconvenience to the user. Therefore, in the embodiment, when the voice system is turned on, the voice amplification function can be further activated through the voice dialing action, which is convenient for the user to make a call.
在又一實施例中,當擴音設備650與耳機670皆與行動終端裝置600連線的情況下(即擴音設備650與耳機670皆耦接處理單元),若提供至處理單元660為輸入語音信號SAI,處理單元660可依使用者的設定,使耳機670通話為第一優先的通話方式(預設值),擴音設備650為第二優先的通話方式。或者,將擴音設備 650設為第一優先的通話方式(預設值),耳機670通話設為第二優先的通話方式。設定上述通話方式的順序,是因為使用者可能無法立即觸及行動終端裝置600,故使用擴音設備650與耳機670來進行通話。 In another embodiment, when both the sound amplification device 650 and the earphone 670 are connected to the mobile terminal device 600 (ie, the sound amplification device 650 and the earphone 670 are coupled to the processing unit), if the processing unit 660 is provided as an input. The voice signal SAI, the processing unit 660 can make the earphone 670 talk to the first priority call mode (preset value) according to the user's setting, and the sound amplification device 650 is the second priority call mode. Or, will be amplifying equipment The 650 is set as the first priority call mode (preset value), and the headset 670 call is set as the second priority call mode. The order of the above-described call mode is set because the user may not be able to touch the mobile terminal device 600 immediately, so that the speaker device 670 is used to make a call using the sound amplifying device 650.
此外,在另一實施例中,當使用者透過輸入單元620提 供輸入操作信號SIO時,表示使用者並沒有無法立即觸及行動終端裝置的問題,故在處理單元660依據輸入操作信號SIO進行通訊錄資料匹配後,透過處理單元660、撥接單元630可將輸出音頻信號SAO傳送至擴音設備650、聽筒640或耳機670等聲音輸出介面,其端視使用者預設的輸出介面(預設值)而定。 In addition, in another embodiment, when the user sends through the input unit 620 When the operation signal SIO is input, it indicates that the user does not have the problem that the mobile terminal device cannot be touched immediately. Therefore, after the processing unit 660 performs the matching of the address data according to the input operation signal SIO, the processing unit 660 and the dialing unit 630 can output the output. The audio signal SAO is transmitted to a sound output interface such as the sound amplification device 650, the earpiece 640 or the earphone 670, and the end is determined by the user's preset output interface (preset value).
舉例來說,當使用者對著行動終端裝置說”打電話給老 王”,此時語音取樣模組610接收此聲音後,將其轉成輸入語音信號SAI,而此輸入語音信號SAI透過語音理解模組的解析,得到通信指令(例如:打電話)與通信目標(例如:老王),並進而得到選擇目標(例如:王安石)。由於是來自“語音”所解析的通信指令,故處理單元660自動提供啟動信號而開啟擴音設備650,以利後續之擴音通話。也就是說,當撥接單元完成撥接後,使用者可利用擴音設備直接與老王對話。或者,在另一例子中,當使用者對著行動終端裝置說“接電話”,此時語音取樣模組610接收此聲音後,將其轉成輸入語音信號SAI,而此輸入語音信號SAI透過語音理解模組的解析,得到通信指令(如:接電話)。由於是來自“語音”所解析的通信指令,故處理單元660自動提供啟動信號而開啟擴 音設備650,以利使用者可利用擴音設備直接與老王對話。關於上述語音理解模組的配置方式與相關細節已描述於前面的實施例,於此不再贅述。另外,關於通訊目標以及最後所得到的選擇目標,其實施方式可以採取前述利用雲端伺服器的方法或其他類似的方法,於此不再贅述。當然,如上所述,當擴音設備650與耳機670並存的情況下,處理單元660可依使用者的設定,使耳機670通話為第一優先的通話方式,擴音設備650為第二優先的通話方式。 For example, when the user speaks to the mobile terminal device, "call the old Wang, at this time, the voice sampling module 610 receives the sound and converts it into an input voice signal SAI, and the input voice signal SAI is analyzed by the voice understanding module to obtain a communication command (for example, a call) and a communication target. (for example: Pharaoh), and then get the selected target (for example: Wang Anshi). Because it is a communication command parsed from "speech", the processing unit 660 automatically provides a start signal to turn on the sound amplifying device 650, so as to facilitate subsequent expansion. That is, after the dial-up unit completes the dial-up, the user can directly talk to the Lao Wang using the sound-amplifying device. Or, in another example, when the user speaks to the mobile terminal device, "answer the call". At this time, the voice sampling module 610 receives the sound and converts it into an input voice signal SAI, and the input voice signal SAI is parsed through the voice understanding module to obtain a communication command (eg, answering a call). The voice" parses the communication command, so the processing unit 660 automatically provides the start signal and turns on the expansion The audio device 650 is provided so that the user can directly talk to the Lao Wang using the sound amplifying device. The configuration manners and related details of the voice understanding module described above have been described in the foregoing embodiments, and details are not described herein again. In addition, regarding the communication target and the final selection target, the implementation manner may adopt the foregoing method using the cloud server or other similar methods, and details are not described herein again. Of course, as described above, when the sound reinforcement device 650 and the earphone 670 coexist, the processing unit 660 can make the earphone 670 talk as the first priority call mode according to the user's setting, and the sound amplification device 650 is the second priority. Call method.
在另一個例子中,若使用者透過類似圖4的顯示單元 430,以利用按鍵或是觸控選擇通訊錄中的“王安石”時,由於是透過輸入單元620提供輸入操作信號SIO時,處理單元660會依據輸入操作信號SIO進行通訊錄資料匹配,並透過處理單元660、撥接單元630及使用者之設定,將輸出音頻信號SAO傳送至擴音設備650、聽筒640或耳機670等聲音輸出介面,使得使用者可與王安石對話。 In another example, if the user transmits a display unit similar to that of FIG. 430, when using the button or the touch to select "Wang Anshi" in the address book, since the input operation signal SIO is provided through the input unit 620, the processing unit 660 performs the matching of the address book according to the input operation signal SIO, and through the processing The setting of the unit 660, the dialing unit 630 and the user transmits the output audio signal SAO to the sound output interface such as the sound amplification device 650, the earpiece 640 or the earphone 670, so that the user can talk to Wang Anshi.
依據上述,可彙整出一行動終端裝置的一種通話擴音功 能的自動啟動方法。圖7為依據本發明一實施例的行動終端裝置的通話擴音功能的自動啟動方法的流程圖。請同時參照圖7,在本實施例中,判斷行動終端裝置600的處理單元660是否將開啟撥接功能(步驟S710)。換言之,來自輸入單元620的輸入操作信號SIO、或語音取樣模組610的輸入語音信號SAI未必與撥接有關,其有可能是進行其他的操作。比如:啟用行動終端裝置中的計算機功能、或是利用語音系統詢問天氣等。當處理單元660依據輸 入信號判斷將開啟撥接單元630的撥接功能時,亦即輸入信號與一撥接動作有關,步驟S710的判斷結果為“是”,則執行步驟S720;反之,當處理單元660依據輸入信號判斷將不會撥接功能時,亦即步驟S710的判斷結果為“否”,則結束此通話擴音功能的自動啟動方法。 According to the above, a call amplification function of a mobile terminal device can be integrated The automatic start method can be. FIG. 7 is a flowchart of a method for automatically starting a call amplification function of a mobile terminal device according to an embodiment of the present invention. Referring to FIG. 7, at the same time, in the present embodiment, it is determined whether the processing unit 660 of the mobile terminal device 600 will turn on the dialing function (step S710). In other words, the input operation signal SIO from the input unit 620 or the input speech signal SAI of the speech sampling module 610 is not necessarily related to dialing, and it is possible to perform other operations. For example, enabling computer functions in a mobile terminal device, or using a voice system to ask for weather, and the like. When the processing unit 660 is based on the input When the input signal determines that the dialing function of the dialing unit 630 is to be enabled, that is, the input signal is related to a dialing action, if the answer of step S710 is YES, step S720 is performed; otherwise, when the processing unit 660 is based on the input signal When it is judged that the dialing function will not be dialed, that is, if the result of the determination in step S710 is "NO", the automatic starting method of the call sounding function is ended.
接著,在步驟S720中,判斷處理單元660是否接收用以 開啟撥接功能的輸入語音信號SAI。當處理單元660接收來自語音取樣模組610的用以開啟撥接功能的輸入語音信號SAI時,亦即步驟S720的判斷結果為“是”,會檢測處理單元660是否與耳機670連接(步驟S730)。當處理單元660與耳機670連接時,亦即步驟S730的判斷結果為“是”,處理單元660自動提供啟動信號以啟動耳機,並輸出音頻信號SAO至耳機670(步驟S740);反之,當處理單元660未與耳機670連接時,亦即步驟S730的判斷結果為“否”,處理單元660自動提供啟動信號以啟動擴音設備650,並輸出語音信號SAO至行動終端裝置600的擴音設備650,以開啟行動終端裝置600的通話擴音功能(步驟S750)。值得一提的是,當處理單元660接收用以開啟撥接功能的輸入語音信號時,上述的步驟730~步驟750是在使用者將耳機670設定為優先的聲音輸出介面(假設擴音設備650與耳機670皆連線)的情況下進行。在其他實施例中,使用者也可以將擴音設備650設定為優先的聲音輸出介面。當然,在耳機670與擴音設備650僅有其中之一連線時,則可設定已連線的設備作為優先的聲音輸出介面。上述的實 施步驟為熟知技術者可依其需求作對應的變動。 Next, in step S720, it is determined whether the processing unit 660 receives the Turn on the input voice signal SAI of the dial-up function. When the processing unit 660 receives the input voice signal SAI from the voice sampling module 610 to enable the dialing function, that is, the determination result of step S720 is YES, it is detected whether the processing unit 660 is connected to the earphone 670 (step S730). ). When the processing unit 660 is connected to the earphone 670, that is, the determination result of step S730 is YES, the processing unit 660 automatically provides an activation signal to activate the earphone, and outputs the audio signal SAO to the earphone 670 (step S740); When the unit 660 is not connected to the earphone 670, that is, the determination result of step S730 is "NO", the processing unit 660 automatically provides an activation signal to activate the sound amplification device 650, and outputs the voice signal SAO to the sound amplification device 650 of the mobile terminal device 600. To activate the call amplification function of the mobile terminal device 600 (step S750). It is worth mentioning that when the processing unit 660 receives the input voice signal for opening the dialing function, the above steps 730 to 750 are performed by the user setting the earphone 670 as a priority sound output interface (assuming the sound amplification device 650) This is done in the case where the headphones 670 are connected. In other embodiments, the user can also set the sound amplifying device 650 as a priority sound output interface. Of course, when only one of the earphones 670 and the sound amplification device 650 is connected, the connected device can be set as the priority sound output interface. The above The steps are those skilled in the art, and the corresponding changes can be made according to their needs.
另一方面,當處理單元660並未接收來自語音取樣模組610的用以開啟撥接功能的輸入語音信號SAI時,亦即步驟S720的判斷結果為“否”,會接著檢測處理單元660是否與耳機670連接(步驟S760)。詳言之,處理單元660未接收來自語音取樣模組610的輸入語音信號SAI,但處理單元又將開啟撥接功能,表示處理單元660接收來自輸入單元620的輸入操作信號SIO,且此輸入操作信號SIO與一撥接動作有關。當處理單元660與耳機670連接時,亦即步驟S760的判斷結果為“是”,處理單元660會自動提供啟動信號以啟動耳機670,並輸出語音信號SAO至耳機670(步驟S740)。反之,當處理單元660未與耳機670連接時,亦即步驟S760的判斷結果為“否”,處理單元660依據一預設值提供輸出語音信號SAO至擴音設備及聽筒的其中之一(步驟S770)。其中,上述步驟的順序係做為說明之用,本發明實施例不以此為限。值得一提的是,當步驟760判斷為“是”,則將提供輸出音頻信號SAO至耳機670,上述狀況為使用者將耳機670設定為優先的聲音輸出介面(假設聽筒640、擴音設備650、耳機670皆連線)的狀況。在其他實施例中,使用者也可以將聽筒640或擴音設備650設定為優先的聲音輸出介面。當然,在聽筒640、擴音設備650、耳機670設備僅有其中之一連線時,則可設定已連線的設備作為優先的聲音輸出介面。上述的實施步驟為熟知技術者可依其需求作對應的變動。 On the other hand, when the processing unit 660 does not receive the input voice signal SAI from the voice sampling module 610 to enable the dialing function, that is, the determination result of step S720 is "NO", it is next detected whether the processing unit 660 is The headset 670 is connected (step S760). In detail, the processing unit 660 does not receive the input voice signal SAI from the voice sampling module 610, but the processing unit will turn on the dialing function again, indicating that the processing unit 660 receives the input operation signal SIO from the input unit 620, and the input operation The signal SIO is related to a dialing action. When the processing unit 660 is connected to the earphone 670, that is, the determination result of step S760 is YES, the processing unit 660 automatically provides an activation signal to activate the earphone 670, and outputs the voice signal SAO to the earphone 670 (step S740). On the other hand, when the processing unit 660 is not connected to the earphone 670, that is, the determination result of the step S760 is "NO", the processing unit 660 provides the output voice signal SAO to one of the sound amplification device and the earpiece according to a preset value (step S770). The order of the above steps is for illustrative purposes, and the embodiment of the present invention is not limited thereto. It is worth mentioning that when the determination in step 760 is YES, the output audio signal SAO will be provided to the earphone 670. The above situation is that the user sets the earphone 670 as a priority sound output interface (assuming the earpiece 640, the sound amplifying device 650) The condition that the earphones 670 are connected. In other embodiments, the user can also set the earpiece 640 or the sound amplifying device 650 as a priority sound output interface. Of course, when only one of the earpiece 640, the sound amplification device 650, and the earphone 670 device is connected, the connected device can be set as the priority sound output interface. The above implementation steps are suitable for those skilled in the art to make corresponding changes according to their needs.
綜上所述,本發明實施例的行動終端裝置及其通話擴音功能的自動啟動方法,當處理單元接收用以開啟撥接功能的輸入語音信號時,除開啟撥接功能之外,更可自動開啟擴音功能,以將輸出語音信號至擴音設備。如此一來,當使用者無法立即觸及行動終端裝置,但需使擴音功能時,可透過語音系統來啟動擴音功能,以提高行動終端的使用便利性。 In summary, the mobile terminal device and the automatic activation method of the call amplification function according to the embodiment of the present invention, when the processing unit receives the input voice signal for opening the dial-up function, in addition to the dial-up function, The sound amplification function is automatically turned on to output a voice signal to the sound amplification device. In this way, when the user cannot immediately touch the mobile terminal device, but needs to make the sound amplification function, the sound amplification function can be activated through the voice system to improve the convenience of using the mobile terminal.
雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明的精神和範圍內,當可作些許的更動與潤飾,故本發明的保護範圍當視後附的申請專利範圍所界定者為準。 Although the present invention has been disclosed in the above embodiments, it is not intended to limit the present invention, and any one of ordinary skill in the art can make some changes and refinements without departing from the spirit and scope of the present invention. The scope of the invention is defined by the scope of the appended claims.
Claims (23)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN 201210593061 CN103077716A (en) | 2012-12-31 | 2012-12-31 | Auxiliary starting device, voice control system and method thereof |
| ??201210593061.4 | 2012-12-31 | ||
| CN2013101832176A CN103280086A (en) | 2012-12-31 | 2013-05-17 | Auxiliary starting device, voice control system and method thereof |
| ??201310183217.6 | 2013-05-17 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201426531A TW201426531A (en) | 2014-07-01 |
| TWI633484B true TWI633484B (en) | 2018-08-21 |
Family
ID=48154224
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW102121753A TWI633484B (en) | 2012-12-31 | 2013-06-19 | Activation assisting apparatus, speech operation system and method thereof |
Country Status (2)
| Country | Link |
|---|---|
| CN (3) | CN103077716A (en) |
| TW (1) | TWI633484B (en) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103728906B (en) * | 2014-01-13 | 2017-02-01 | 江苏惠通集团有限责任公司 | Intelligent home control device and method |
| CN103838159B (en) * | 2014-03-17 | 2017-03-01 | 联想(北京)有限公司 | A kind of control method, device and electronic equipment |
| CN104134442A (en) * | 2014-08-15 | 2014-11-05 | 广东欧珀移动通信有限公司 | Method and device for starting voice service |
| WO2018023417A1 (en) * | 2016-08-02 | 2018-02-08 | 张阳 | Information pushing method for use when making telephone call, and glasses |
| CN106714023B (en) * | 2016-12-27 | 2019-03-15 | 广东小天才科技有限公司 | Bone conduction earphone-based voice awakening method and system and bone conduction earphone |
| TWI712944B (en) * | 2019-11-28 | 2020-12-11 | 睿捷國際股份有限公司 | Sound-based equipment surveillance method |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TW201041364A (en) * | 2009-05-08 | 2010-11-16 | Htc Corp | Mobile device, power saving method and computer executable medium |
| US20100330908A1 (en) * | 2009-06-25 | 2010-12-30 | Blueant Wireless Pty Limited | Telecommunications device with voice-controlled functions |
| CN102006373A (en) * | 2010-11-24 | 2011-04-06 | 深圳市子栋科技有限公司 | Vehicle-mounted service system and method based on voice command control |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2004036939A1 (en) * | 2002-10-18 | 2004-04-29 | Institute Of Acoustics Chinese Academy Of Sciences | Portable digital mobile communication apparatus, method for controlling speech and system |
| CN1831937A (en) * | 2005-03-08 | 2006-09-13 | 台达电子工业股份有限公司 | Method and device for speech recognition and language understanding analysis |
| CN101923539B (en) * | 2009-06-11 | 2014-02-12 | 珠海市智汽电子科技有限公司 | Man-machine conversation system based on natural language |
| CN101951553B (en) * | 2010-08-17 | 2012-10-10 | 深圳市车音网科技有限公司 | Navigation method and system based on speech command |
| CN102779509B (en) * | 2011-05-11 | 2014-12-03 | 联想(北京)有限公司 | Voice processing equipment and voice processing method |
| CN102821424B (en) * | 2011-06-09 | 2015-02-18 | 中磊电子股份有限公司 | Method for assisting mobile data distribution, communication device, and mobile device |
-
2012
- 2012-12-31 CN CN 201210593061 patent/CN103077716A/en active Pending
-
2013
- 2013-05-17 CN CN2013101832176A patent/CN103280086A/en active Pending
- 2013-05-17 CN CN201711423107.7A patent/CN108124043A/en active Pending
- 2013-06-19 TW TW102121753A patent/TWI633484B/en active
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TW201041364A (en) * | 2009-05-08 | 2010-11-16 | Htc Corp | Mobile device, power saving method and computer executable medium |
| US20100330908A1 (en) * | 2009-06-25 | 2010-12-30 | Blueant Wireless Pty Limited | Telecommunications device with voice-controlled functions |
| CN102006373A (en) * | 2010-11-24 | 2011-04-06 | 深圳市子栋科技有限公司 | Vehicle-mounted service system and method based on voice command control |
Also Published As
| Publication number | Publication date |
|---|---|
| TW201426531A (en) | 2014-07-01 |
| CN103077716A (en) | 2013-05-01 |
| CN108124043A (en) | 2018-06-05 |
| CN103280086A (en) | 2013-09-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI497408B (en) | Voice interaction system, mobile terminal apparatus and method of voice communication | |
| CN107277754B (en) | A method of bluetooth connection and bluetooth peripheral device | |
| CN110413134B (en) | Wearing state detection method and related equipment | |
| TWI633484B (en) | Activation assisting apparatus, speech operation system and method thereof | |
| CN102231865B (en) | A bluetooth headset | |
| WO2021013156A1 (en) | Bluetooth switching method and bluetooth device | |
| EP3735646B1 (en) | Using auxiliary device case for translation | |
| CN108391206A (en) | Signal processing method, device, terminal, earphone and readable storage medium storing program for executing | |
| WO2020010579A1 (en) | Smart watch having voice interaction function-enabled earphones | |
| CN107506353B (en) | Translation box and translation system | |
| CN106817540A (en) | A camera control method and device | |
| US20240386893A1 (en) | Smart glasses, system and control method based on generative artificial intelligence large language models | |
| WO2021233398A1 (en) | Wireless audio system, wireless communication method, and device | |
| CN108763913A (en) | Data processing method, device, terminal, earphone and readable storage medium | |
| US8934886B2 (en) | Mobile apparatus and method of voice communication | |
| CN113301544B (en) | Method and equipment for voice intercommunication between audio equipment | |
| US20170118586A1 (en) | Voice data transmission processing method, terminal and computer storage medium | |
| WO2017127965A1 (en) | Communication earphone | |
| CN103517170A (en) | Remote-control earphone with built-in cellular telephone module | |
| CN108923810A (en) | Translation method and related equipment | |
| US12200789B2 (en) | Electronic device for transmitting data packets in Bluetooth network environment, and method therefor | |
| TWI506618B (en) | Mobile terminal device and method for automatically opening sound output interface of the mobile terminal device | |
| WO2020133564A1 (en) | Mobile terminal casing, mobile terminal and smart home network system | |
| CN104219357A (en) | Voice instruction network telephone and operation method thereof | |
| TW200930003A (en) | Portable apparatus and voice recognition method thereof |