JP2015012530A

JP2015012530A - Electronic apparatus, control method therefor, and program

Info

Publication number: JP2015012530A
Application number: JP2013137949A
Authority: JP
Inventors: Daisuke Sugii; Yasuharu Onishi; Toshiharu Aihara; Atsushi Kuroda
Original assignee: NEC Casio Mobile Communications Ltd
Current assignee: NEC Casio Mobile Communications Ltd
Priority date: 2013-07-01
Filing date: 2013-07-01
Publication date: 2015-01-19

Abstract

PROBLEM TO BE SOLVED: To contribute to appropriate control of an output sound signal on the basis of natural motions of a user. SOLUTION: An electronic apparatus includes: a sound input unit that receives a sound signal; a feature amount extraction unit that extracts a signal change feature amount on the basis of the sound signal; a correlation value calculation unit that calculates a correlation value between a pre-registered first signal change feature amount and a second signal change feature amount extracted from the sound signal; and an output sound control unit that controls an output sound signal on the basis of the calculated correlation value.

Description

The present invention relates to an electronic device, a control method therefor, and a program, and more particularly to an electronic device including a microphone, a control method therefor, and a program.

During a telephone call or the like, a caller who does not want the other party to hear something may cover the input sound hole of the microphone with a finger.

Patent Document 1 discloses a mobile phone device that includes a contact detection unit for detecting finger contact and that mutes the audio input from the microphone when the contact detection unit detects finger contact. In particular, Patent Document 1 discloses a mobile phone device in which the contact detection unit is placed near the microphone so that the audio is muted when a finger touches the microphone.

JP 2007-081460 A (Japanese Unexamined Patent Application Publication No. 2007-081460)

The disclosure of the above prior art document is incorporated herein by reference. The following analysis is made from the viewpoint of the present invention.

When the caller covers the input sound hole of the microphone with a finger, the friction noise between the finger and the input sound hole may be transmitted to the other party. In addition, even with the input sound hole covered, audio may still reach the other party through the gap between the finger and the input sound hole.

With the technique disclosed in Patent Document 1, the audio input from the microphone may be muted even when an object other than a finger touches the contact detection unit. For example, a caller may speak with the mouth touching the microphone to make the voice easier for the other party to hear. With the technique of Patent Document 1, if the mouth touches the contact detection unit, the audio may be muted at an inappropriate time that the user did not intend.

Accordingly, an object of the present invention is to provide an electronic device, a control method therefor, and a program that contribute to appropriately controlling an output audio signal based on a user's natural actions.

According to a first aspect of the present invention, there is provided an electronic device comprising: an audio input unit that inputs an audio signal; a feature amount extraction unit that extracts a signal change feature amount based on the audio signal; a correlation value calculation unit that calculates a correlation value between a first signal change feature amount registered in advance and a second signal change feature amount extracted from the audio signal; and an output audio control unit that controls an output audio signal based on the correlation value.

According to a second aspect of the present invention, there is provided a control method for an electronic device, comprising: a step of inputting an audio signal; a feature amount extraction step of extracting a signal change feature amount based on the audio signal; a step of calculating a correlation value between a first signal change feature amount registered in advance and a second signal change feature amount extracted from the audio signal; and an output audio control step of controlling an output audio signal based on the correlation value.
Note that this method is tied to a specific machine, namely an electronic device that controls an audio signal.

According to a third aspect of the present invention, there is provided a program to be executed by a computer that controls an electronic device, the program causing the computer to execute: a process of inputting an audio signal; a feature amount extraction process of extracting a signal change feature amount based on the audio signal; a process of calculating a correlation value between a first signal change feature amount registered in advance and a second signal change feature amount extracted from the audio signal; and an output audio control process of controlling an output audio signal based on the correlation value.
The program can be recorded on a computer-readable storage medium. The storage medium may be a non-transient medium such as a semiconductor memory, a hard disk, a magnetic recording medium, or an optical recording medium. The present invention can also be embodied as a computer program product.

According to each aspect of the present invention, there are provided an electronic device, a control method therefor, and a program that contribute to appropriately controlling an output audio signal based on a user's natural actions.

A diagram for explaining the outline of one embodiment.
A diagram showing an example of the overall configuration of the electronic device 1 according to the first embodiment.
A block diagram showing an example of the internal configuration of the electronic device 1 according to the first embodiment.
A flowchart showing an example of the process of muting an input audio signal.
A block diagram showing an example of the internal configuration of the electronic device 1a according to the third embodiment.
A flowchart showing an example of the process of muting an input audio signal.

First, an outline of one embodiment is described with reference to FIG. 1. The drawing reference signs in this outline are attached to the elements for convenience, as an aid to understanding; the outline is not intended to be limiting in any way.

As described above, an electronic device that contributes to appropriately controlling an output audio signal based on a user's natural actions is desired.

Accordingly, as an example, the electronic device 100 shown in FIG. 1 is provided. FIG. 1(a) is a block diagram showing an example of the internal configuration of the electronic device 100. FIG. 1(b) is a flowchart showing an example of the processing of the electronic device 100. The electronic device 100 includes an audio input unit 101, a feature amount extraction unit 102, a correlation value calculation unit 103, and an output audio control unit 104.

First, the audio input unit 101 inputs an audio signal (step S1001). The feature amount extraction unit 102 then extracts a signal change feature amount based on the audio signal (step S1002). A signal change feature amount is a feature amount that represents a change in the audio signal. For example, it may be the amount of change in the energy of the audio signal in the frequency domain.

Next, the correlation value calculation unit 103 calculates a correlation value between the first signal change feature amount registered in advance and the second signal change feature amount extracted from the audio signal (step S1003). The output audio control unit 104 then controls the output audio signal based on the correlation value (step S1004). For example, the output audio control unit 104 may mute the output audio signal when the correlation value exceeds a predetermined threshold.

For example, suppose that the feature amount extraction unit 102 has extracted, as the first signal change feature amount, a feature amount relating to the change in the audio signal that occurs when the input sound hole of the microphone (corresponding to the audio input unit 101) is covered with a finger. Suppose further that, after this first signal change feature amount has been registered, the audio input unit 101 inputs a new audio signal, and the feature amount extraction unit 102 extracts, as the second signal change feature amount, a feature amount relating to the change in that audio signal over a predetermined time.

In that case, the correlation value calculation unit 103 calculates the correlation value between the second signal change feature amount (the change in the input audio signal over the predetermined time) and the first signal change feature amount (the change in the audio signal when the input sound hole of the microphone was covered). The output audio control unit 104 then controls the output audio signal based on the calculated correlation value. For example, when the correlation value exceeds a predetermined threshold, the output audio control unit 104 may determine that the audio signal was input with the input sound hole of the microphone covered and, in that case, may mute the output audio signal. The electronic device 100 thus contributes to appropriately controlling the output audio signal based on the user's natural actions.

[First Embodiment]
The first embodiment is described in more detail with reference to the drawings.

FIG. 2 is a diagram showing an example of the overall configuration of the electronic device 1 according to this embodiment. The electronic device 1 includes a microphone 11, a receiver 12, a speaker 13, an operation unit 14, and a display unit 15. Note that FIG. 2 is not intended to limit the electronic device 1 to the form shown there. For example, the electronic device 1 may be a mobile phone, a smartphone, a game machine, a tablet PC, a PDA (Personal Digital Assistant), or the like. It is sufficient that the electronic device 1 includes the microphone 11 and outputs an audio signal input from the microphone 11.

The microphone 11 corresponds to the audio input unit 101 described above and inputs an audio signal. That is, the electronic device 1 takes in external sound through the microphone 11. The microphone 11 is used, for example, for transmitting the user's voice during a call.

The receiver 12 outputs audio such as the received voice during call processing. The user holds an ear against (or close to) the receiver 12 and listens to the audio it outputs.

The speaker 13 outputs audio such as a ringtone.

The operation unit 14 consists of keys, buttons, and the like that accept user operations. For example, the user starts and ends call processing via the operation unit 14.

The display unit 15 displays information related to the operation of the electronic device 1. For example, while a call application is running (that is, during a call), the display unit 15 may display information such as the telephone number of the other party and whether mute is active. The display unit 15 may be a liquid crystal panel, an organic EL (electroluminescence) panel, or the like.

FIG. 3 is a block diagram showing an example of the internal configuration of the electronic device 1 according to this embodiment. The electronic device 1 includes the microphone 11, the receiver 12, the speaker 13, the operation unit 14, the display unit 15, a microphone amplifier 16, an A/D (analog-to-digital) converter 17, a storage unit 18, a control unit 20, an encoding/decoding unit 30, a communication unit 40, and a speaker amplifier 50. The control unit 20 includes a feature amount extraction unit 21, a correlation value calculation unit 22, and an output audio control unit 23. For simplicity, FIG. 3 mainly shows the modules relevant to this embodiment.

The microphone 11 includes an input sound hole (not shown) and inputs an audio signal through it. Specifically, the microphone 11 inputs an analog audio signal through the input sound hole and outputs that analog audio signal to the microphone amplifier 16.

The microphone amplifier 16 amplifies the analog audio signal output from the microphone 11 and outputs the amplified analog audio signal to the A/D converter 17.

The A/D converter 17 converts the analog audio signal input from the microphone amplifier 16 into a digital audio signal and outputs the digital audio signal to the control unit 20. Specifically, the A/D converter 17 outputs the digital audio signal to the feature amount extraction unit 21.

The storage unit 18 stores information necessary for the operation of the electronic device 1. For example, the storage unit 18 stores the first signal change feature amount.

The control unit 20 controls the electronic device 1 as a whole and controls each unit shown in FIG. 3. The control unit 20 can also be realized by a computer program that causes a computer mounted on the electronic device 1 to execute the processing of the electronic device 1 using its hardware.

The control unit 20 also controls two operation modes: one for learning the first signal change feature amount and one for muting the input audio signal. In the following description, the former is called the learning mode and the latter the output audio control mode.

The feature amount extraction unit 21 extracts a signal change feature amount based on the audio signal. Specifically, the feature amount extraction unit 21 extracts the signal change feature amount based on the digital audio signal input from the A/D converter 17.

More specifically, the feature amount extraction unit 21 extracts, as the signal change feature amount, the amount of change in the energy of the audio signal in a predetermined band. The feature amount extraction unit 21 preferably extracts the signal change feature amount from an audio signal spanning a predetermined time (for example, about 10 to 100 milliseconds), because calculating the amount of change in the energy of the audio signal requires a certain duration of signal.
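The extraction described above could be sketched as follows. This is only an illustrative sketch, not the method of the specification: the function names `band_energy` and `energy_change_db`, the use of a direct DFT, the decibel scale, and the choice of one value per band are all assumptions made for the example.

```python
import cmath
import math

def band_energy(samples, rate, lo_hz, hi_hz):
    """Sum of squared DFT magnitudes over bins whose frequency lies in [lo_hz, hi_hz]."""
    n = len(samples)
    total = 0.0
    for k in range(n // 2 + 1):
        freq = k * rate / n
        if lo_hz <= freq <= hi_hz:
            # Direct DFT of bin k; adequate for short frames (10 to 100 ms).
            coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                        for i, x in enumerate(samples))
            total += abs(coeff) ** 2
    return total

def energy_change_db(prev_frame, cur_frame, rate, bands):
    """Per-band energy change from prev_frame to cur_frame, in dB (assumed scale)."""
    eps = 1e-12  # avoids log(0) and division by zero on silent frames
    return [
        10.0 * math.log10((band_energy(cur_frame, rate, lo, hi) + eps)
                          / (band_energy(prev_frame, rate, lo, hi) + eps))
        for lo, hi in bands
    ]
```

For instance, with 20 ms frames at an assumed 8 kHz sampling rate, a tenfold drop in amplitude between two consecutive frames shows up as a change of about -20 dB in every band that contains signal energy.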

For example, when the operation mode is the learning mode, the user may cover the input sound hole of the microphone 11 with a finger or the like. The feature amount extraction unit 21 then preferably extracts, as the first signal change feature amount, the amount of change in the energy of the audio signal in a predetermined band on a transition from the state in which the input sound hole is open (hereinafter, the sound-hole open state) to the state in which the input sound hole is covered (hereinafter, the sound-hole closed state).

On a transition from the sound-hole open state to the sound-hole closed state, the energy of the background noise contained in the audio signal is attenuated. Background noise is often so-called white noise. Consequently, on such a transition, the energy of the audio signal is often attenuated across the entire frequency range, from low to high frequencies. White noise here means a signal whose energy variation is confined to a predetermined range across the entire frequency range, from low to high frequencies.

Accordingly, the feature amount extraction unit 21 calculates the amount of change in the energy of the digital audio signal, in a predetermined band spanning low to high frequencies, on a transition from the sound-hole open state to the sound-hole closed state. The feature amount extraction unit 21 extracts the calculated amount of change in energy as the first signal change feature amount. When the operation mode is the learning mode, the feature amount extraction unit 21 outputs the extracted first signal change feature amount to the storage unit 18.

On the other hand, when the operation mode is the output audio control mode, the feature amount extraction unit 21 extracts, as the second signal change feature amount, the amount of change in the energy of the audio signal in a predetermined band over a predetermined time. Specifically, in the output audio control mode, the feature amount extraction unit 21 may calculate the amount of change in the energy of the digital audio signal, in a predetermined band spanning low to high frequencies, over the predetermined time, and may extract the calculated amount of change in energy as the second signal change feature amount. In the output audio control mode, the feature amount extraction unit 21 outputs the extracted second signal change feature amount to the correlation value calculation unit 22.

The correlation value calculation unit 22 calculates a correlation value between the first signal change feature amount registered in advance and the second signal change feature amount extracted from the audio signal. Specifically, the correlation value calculation unit 22 calculates the correlation value between the first signal change feature amount stored in the storage unit 18 and the second signal change feature amount output by the feature amount extraction unit 21.
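The specification does not state how the correlation value is computed. As one possible reading, the sketch below uses a normalized inner product (cosine similarity) of two feature vectors; the function name `correlation` and the vector representation of the feature amounts are assumptions for illustration only.

```python
import math

def correlation(first, second):
    """Normalized inner product of two equal-length feature vectors, in [-1, 1].

    `first` stands in for the registered (first) signal change feature amount,
    `second` for the newly extracted (second) one. Both are assumed to be
    lists of per-band energy changes.
    """
    if len(first) != len(second) or not first:
        raise ValueError("feature vectors must be non-empty and of equal length")
    norm = math.sqrt(sum(a * a for a in first) * sum(b * b for b in second))
    if norm == 0.0:
        return 0.0  # an all-zero vector (no change at all) is treated as "no match"
    return sum(a * b for a, b in zip(first, second)) / norm
```

A normalized inner product is chosen here rather than a mean-removed (Pearson) coefficient because covering the sound hole is described as attenuating energy across the whole frequency range; a roughly uniform drop across bands would leave a mean-removed measure close to undefined.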

The output audio control unit 23 controls the output audio signal. Specifically, the output audio control unit 23 determines whether to mute the input audio signal based on the correlation value calculated by the correlation value calculation unit 22. More specifically, the output audio control unit 23 mutes the input audio signal when the correlation value calculated by the correlation value calculation unit 22 exceeds a predetermined threshold.

In addition, while the input audio signal is muted, the output audio control unit 23 may cancel the mute when the correlation value becomes equal to or less than the predetermined threshold. The output audio control unit 23 then outputs the input digital audio signal to the encoding/decoding unit 30.

The encoding/decoding unit 30 performs encoding or decoding of audio signals. Specifically, the encoding/decoding unit 30 encodes the digital audio signal input from the output audio control unit 23 and outputs the encoded audio signal to the communication unit 40.

The communication unit 40 transmits and receives audio signals via a communication network. The communication network may be any of various types, such as a public telephone network, a mobile phone network, the Internet, or a LAN (Local Area Network); its details do not matter. The communication method may be wired or wireless.

When an audio signal is input from the encoding/decoding unit 30 to the communication unit 40, the communication unit 40 transmits the audio signal to the other party via the communication network. The communication unit 40 also receives encoded digital audio signals via the communication network and outputs each received digital audio signal to the encoding/decoding unit 30. The encoding/decoding unit 30 converts the digital audio signal input from the communication unit 40 into an analog audio signal and outputs the analog audio signal to the receiver 12.

When an analog audio signal is input from the encoding/decoding unit 30, the speaker amplifier 50 amplifies it and outputs the amplified analog audio signal to the speaker 13.

Next, the operation of the electronic device 1 is described.

FIG. 4 is a flowchart showing an example of the process of muting an input audio signal.

In step S1, the control unit 20 determines whether the communication unit 40 has started call processing. If the communication unit 40 has started call processing (Yes branch of step S1), the process proceeds to step S2. If it has not (No branch of step S1), the control unit 20 repeats the determination of step S1.

In step S2, the control unit 20 determines whether an analog audio signal has been input from the microphone 11. If an analog audio signal has been input from the microphone 11 (Yes branch of step S2), the process proceeds to step S4. If not (No branch of step S2), the process proceeds to step S3.

In step S3, the control unit 20 determines whether the communication unit 40 has ended call processing. If the communication unit 40 has ended call processing (Yes branch of step S3), the electronic device 1 ends the process of controlling the output audio signal. If it has not (No branch of step S3), the process returns to step S2 and continues.

When an analog audio signal has been input from the microphone 11 (Yes branch of step S2), the microphone amplifier 16 amplifies the analog audio signal (step S4). Then, in step S5, the A/D converter 17 converts the amplified analog audio signal into a digital audio signal.

In step S6, the feature amount extraction unit 21 extracts a signal change feature amount from the digital audio signal.

In step S7, the correlation value calculation unit 22 calculates the correlation value between the first signal change feature amount registered in advance and the second signal change feature amount extracted from the digital audio signal.

In step S8, the output audio control unit 23 determines whether the correlation value between the first signal change feature amount registered in advance and the second signal change feature amount extracted from the digital audio signal exceeds a predetermined threshold.

If the correlation value exceeds the predetermined threshold (Yes branch of step S8), the output audio control unit 23 mutes the input audio signal (step S9). The process then returns to step S2 and continues.

If the correlation value does not exceed the predetermined threshold (No branch of step S8), the output audio control unit 23 does not mute the input audio signal and maintains the call state (step S10). The process then returns to step S2 and continues.
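The flow of steps S2 to S10 can be sketched as a simple loop. This is a hedged illustration, not the implementation of the specification: the function `run_call`, its parameters, the use of `None` for "no microphone input", the end of the iterable standing in for the end of the call (step S3), and muting by substituting zero-valued samples are all assumptions.

```python
def run_call(frames, extract, correlate, registered, threshold):
    """Process digitized frames for one call and return what would be transmitted.

    frames     : iterable of sample lists, or None when no input arrived (S2 "No");
                 exhausting the iterable models the end of the call (S3 "Yes").
    extract    : callable mapping a frame to a feature amount (S6).
    correlate  : callable mapping (registered, feature) to a correlation value (S7).
    registered : the pre-registered first signal change feature amount.
    threshold  : the predetermined threshold used in S8.
    """
    sent = []
    for frame in frames:                        # S2: was audio input?
        if frame is None:                       # S2 "No": fall through to the S3 check
            continue
        # S4 (amplification) and S5 (A/D conversion) are assumed already done.
        feature = extract(frame)                # S6
        value = correlate(registered, feature)  # S7
        if value > threshold:                   # S8
            sent.append([0] * len(frame))       # S9: mute (silence is sent instead)
        else:
            sent.append(list(frame))            # S10: keep the call audio as is
    return sent
```

Passing the extraction and correlation steps in as callables keeps the sketch independent of any particular feature or correlation definition.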

As described above, the electronic device 1 according to this embodiment determines whether to mute the input audio signal based on the amount of change in the audio signal over a predetermined time. For example, during a telephone call, a user may cover the input sound hole of the microphone with a finger so that the audio picked up by the microphone does not reach the other party. The electronic device 1 according to this embodiment therefore determines, based on a pre-registered amount of change in the audio signal, whether the input sound hole is covered by the user's finger or the like. When it determines that the input sound hole is covered, the electronic device 1 mutes the input audio signal. The electronic device 1 according to this embodiment thus contributes to appropriately controlling the output audio signal based on the user's natural actions.

[Second Embodiment]
The second embodiment will be described in detail.

The present embodiment reduces the possibility that the input audio signal is muted when the user does not intend it. In the description of this embodiment, explanations that overlap with the above embodiment are omitted. In addition, components identical to those of the above embodiment are denoted by the same reference numerals, and their description is omitted.

The internal configuration of the electronic device 1 according to the present embodiment is as shown in FIG. 3, so a detailed description is omitted.

Even while the user is covering the input sound hole of the microphone 11 with a finger or the like, the correlation value calculated by the correlation value calculation unit 22 often exceeds the predetermined threshold repeatedly. This is because a gap arises between the finger and the input sound hole when the user covers the input sound hole of the microphone 11 with a finger or the like.

Therefore, the output audio control unit 23 according to the present embodiment determines whether the correlation value calculated by the correlation value calculation unit 22 exceeds the predetermined threshold more than a predetermined number of times within a predetermined time. If it does, the output audio control unit 23 according to the present embodiment mutes the input audio signal.

On the other hand, the energy of the audio signal also changes when the user traces a finger across the input sound hole of the microphone 11. The correlation value between the signal change feature amount in that case and the pre-registered first signal change feature amount may exceed the predetermined threshold. However, when the user traces across the input sound hole of the microphone 11, the number of times the correlation value calculated by the correlation value calculation unit 22 exceeds the predetermined threshold is limited. That is, when the user traces across the input sound hole of the microphone 11, the correlation value calculated by the correlation value calculation unit 22 does not exceed the predetermined threshold repeatedly.

The electronic device 1 according to the present embodiment can therefore distinguish between the case where the input sound hole of the microphone 11 is blocked by a finger or the like and the case where the input sound hole of the microphone 11 is traced. As a result, the electronic device 1 according to the present embodiment can avoid muting the input audio signal when the user traces across the input sound hole of the microphone 11. The electronic device 1 according to the present embodiment thus contributes to reducing the possibility that the input audio signal is muted when the user does not intend it. In other words, the electronic device 1 according to the present embodiment contributes still further to appropriately controlling the output audio signal based on the user's natural actions.
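The counting rule of this embodiment can be sketched as a sliding-window counter. The class name `ExceedanceCounter`, its parameters, and the use of timestamps in seconds are assumptions made for illustration; the text only speaks of a predetermined threshold, a predetermined number of times, and a predetermined time.

```python
from collections import deque


class ExceedanceCounter:
    """Counts how often the correlation value exceeds a threshold in a time window."""

    def __init__(self, threshold, max_count, window_seconds):
        self.threshold = threshold            # predetermined threshold
        self.max_count = max_count            # predetermined number of times
        self.window_seconds = window_seconds  # predetermined time
        self._times = deque()                 # timestamps of recent exceedances

    def update(self, correlation, now):
        """Record one correlation value observed at time `now`; return True to mute."""
        if correlation > self.threshold:
            self._times.append(now)
        # Discard exceedances that fell out of the time window.
        while self._times and now - self._times[0] > self.window_seconds:
            self._times.popleft()
        # Mute only when the count exceeds the predetermined number of times.
        return len(self._times) > self.max_count
```

With `max_count=2` and a one-second window, a single exceedance such as one caused by tracing across the sound hole does not trigger muting, whereas three exceedances within the window do.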

[Third Embodiment]
The third embodiment will be described in detail.

The present embodiment mutes the input audio signal while taking into account whether the user is close to the electronic device. In the description of this embodiment, explanations that overlap with the above embodiments are omitted. In addition, components identical to those of the above embodiments are denoted by the same reference numerals, and their description is omitted.

FIG. 5 is a block diagram illustrating an example of the internal configuration of the electronic device 1a according to the present embodiment. The electronic device 1a shown in FIG. 5 differs from the electronic device 1 shown in FIG. 3 in that it includes a proximity sensor (object detection unit) 60.

The proximity sensor 60 detects an object at a distance within a predetermined range. Specifically, the proximity sensor 60 measures the distance between the electronic device 1a and an object located within the predetermined range. Various distance measurement methods exist, such as methods using infrared light and methods using ultrasonic waves, and the specific method is not limited. The proximity sensor 60 then outputs an output signal containing the distance measurement result to the encoding/combining unit 30.

The electronic device 1a may also have a so-called hands-free function (also referred to as an on-hook function). When the control unit 20 executes the hands-free function, the speaker 13 may output the audio signal received by the communication unit 40. That is, when the control unit 20 executes the hands-free function, the encoding/combining unit 30 may output the combined audio signal to the speaker amplifier 50.

Next, the operation of the electronic device 1a according to the present embodiment will be described.

FIG. 6 is a flowchart illustrating an example of a process in which the electronic device 1a according to the present embodiment mutes the input audio signal.

Here, assume that the correlation value calculation unit 22 has calculated the correlation value between the pre-registered first signal change feature amount and the second signal change feature amount extracted from the digital audio signal (step S7 shown in FIG. 4). In that case, in step S101, the output audio control unit 23 determines whether the correlation value between the pre-registered first signal change feature amount and the second signal change feature amount extracted from the digital audio signal exceeds a predetermined threshold.

If the correlation value between the first signal change feature amount and the second signal change feature amount exceeds the predetermined threshold (Yes branch of step S101), the process proceeds to step S102. On the other hand, if the correlation value between the first signal change feature amount and the second signal change feature amount does not exceed the predetermined threshold (No branch of step S101), the process proceeds to step S105.

In step S102, the control unit 20 determines whether the device is in a hands-free call state. If it is in a hands-free call state (Yes branch of step S102), the process proceeds to step S105. On the other hand, if it is not in a hands-free call state (No branch of step S102), the process proceeds to step S103.

In step S103, the control unit 20 determines whether the proximity sensor 60 is operating. Specifically, the control unit 20 determines whether the proximity sensor 60 has detected an object at a distance within the predetermined range.

If the proximity sensor 60 is operating (Yes branch of step S103), the output audio control unit 23 mutes the input audio signal (step S104). The process then returns to step S2 shown in FIG. 4 and continues. On the other hand, if the proximity sensor 60 is not operating (No branch of step S103), the process proceeds to step S105.

In step S105, the output audio control unit 23 does not mute the input audio signal and maintains the call state. The process then returns to step S2 shown in FIG. 4 and continues.
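The branching of steps S101 to S105 can be summarized as a single decision function. The following is a minimal sketch; the function name `decide_mute` and its boolean parameters are hypothetical stand-ins for the determinations made by the control unit 20 and the output audio control unit 23.

```python
def decide_mute(correlation, threshold, hands_free, object_detected):
    """Return True to mute (step S104), False to keep the call state (step S105)."""
    # S101: the correlation value must exceed the predetermined threshold.
    if correlation <= threshold:
        return False
    # S102: never mute during a hands-free call.
    if hands_free:
        return False
    # S103: the proximity sensor must have detected an object within range.
    if not object_detected:
        return False
    return True
```

For example, `decide_mute(0.9, 0.8, hands_free=False, object_detected=True)` returns `True`, while changing any one of the three conditions yields `False`.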

Note that the control unit 20 may check, at the start of a call, whether the call is set to be hands-free. In addition, if hands-free processing is started as an interrupt while mute processing is in progress, the output audio control unit 23 may perform control to cancel the muting of the output audio signal.

As described above, the electronic device 1a according to the present embodiment can mute the output audio signal when the electronic device 1a is operating. For example, depending on the surrounding environment, the energy or other properties of the audio signal may change even when the input sound hole is not blocked. The electronic device 1a according to the present embodiment therefore determines whether the user is close to the electronic device 1a, and mutes the input audio signal when the proximity sensor 60 is operating. The electronic device 1a according to the present embodiment thus contributes still further to appropriately controlling the output audio signal based on the user's natural actions.

Some or all of the above embodiments can also be described as in the following supplementary notes, but are not limited to them.

(Supplementary note 1) As set forth for the electronic device according to the first aspect above.

(Supplementary note 2) The electronic device according to supplementary note 1, wherein the output audio control unit mutes the audio signal when the correlation value exceeds a predetermined threshold.

(Supplementary note 3) The electronic device according to supplementary note 2, further comprising an object detection unit that detects an object at a distance within a predetermined range, wherein the output audio control unit mutes the audio signal when the object detection unit detects the object and the correlation value exceeds the predetermined threshold.

(Supplementary note 4) The electronic device according to supplementary note 2 or 3, wherein the output audio control unit mutes the audio signal when the correlation value exceeds the predetermined threshold more than a predetermined number of times within a predetermined time.

(Supplementary note 5) The electronic device according to any one of supplementary notes 1 to 4, wherein the audio input unit includes an input sound hole, the audio input unit inputs the audio signal through the input sound hole, and the feature amount extraction unit extracts the first signal change feature amount based on the audio signal obtained when a transition occurs from a sound-hole open state, in which the input sound hole is open, to a sound-hole closed state, in which the input sound hole is blocked.

(Supplementary note 6) The electronic device according to supplementary note 5, wherein the feature amount extraction unit extracts, as the first signal change feature amount, the amount of change in energy of the audio signal in a predetermined band when the transition occurs from the sound-hole open state to the sound-hole closed state.

(Supplementary note 7) The electronic device according to any one of supplementary notes 1 to 6, wherein the feature amount extraction unit extracts, as the second signal change feature amount, the amount of change in energy of the audio signal over a predetermined time.

(Supplementary note 8) As set forth for the method for controlling an electronic device according to the second aspect above.

(Supplementary note 9) The method for controlling an electronic device according to supplementary note 8, wherein, in the output audio control step, the audio signal is muted when the correlation value exceeds a predetermined threshold.

(Supplementary note 10) The method for controlling an electronic device according to supplementary note 9, further comprising a step of detecting an object at a distance within a predetermined range, wherein, in the output audio control step, the audio signal is muted when an object at a distance within the predetermined range is detected and the correlation value exceeds the predetermined threshold.

(Supplementary note 11) The method for controlling an electronic device according to supplementary note 9 or 10, wherein, in the output audio control step, the audio signal is muted when the correlation value exceeds the predetermined threshold more than a predetermined number of times within a predetermined time.

(Supplementary note 12) The method for controlling an electronic device according to any one of supplementary notes 8 to 11, wherein, in the feature amount extraction step, the amount of change in energy of the audio signal over a predetermined time is extracted as the second signal change feature amount.

(Supplementary note 13) As set forth for the program according to the third aspect above.

(Supplementary note 14) The program according to supplementary note 13, wherein, in the output audio control process, the audio signal is muted when the correlation value exceeds a predetermined threshold.

(Supplementary note 15) The program according to supplementary note 14, which further executes a process of detecting an object at a distance within a predetermined range, wherein, in the output audio control process, the audio signal is muted when an object at a distance within the predetermined range is detected and the correlation value exceeds the predetermined threshold.

(Supplementary note 16) The program according to supplementary note 14 or 15, wherein, in the output audio control process, the audio signal is muted when the correlation value exceeds the predetermined threshold more than a predetermined number of times within a predetermined time.

(Supplementary note 17) The program according to any one of supplementary notes 14 to 16, wherein, in the feature amount extraction process, the amount of change in energy of the audio signal over a predetermined time is extracted as the second signal change feature amount.

The disclosure of the patent document cited above is incorporated herein by reference. Within the scope of the entire disclosure of the present invention (including the claims), the embodiments and examples can be changed and adjusted based on its basic technical concept. In addition, various combinations and selections of the various disclosed elements (including each element of each claim, each element of each embodiment or example, and each element of each drawing) are possible within the scope of the claims of the present invention. That is, the present invention naturally includes various variations and modifications that those skilled in the art could make in accordance with the entire disclosure, including the claims, and the technical concept. In particular, for any numerical range described in this document, any numerical value or sub-range contained within that range should be construed as being specifically described even where not otherwise stated.

1, 1a, 100 Electronic device
11 Microphone
12 Receiver
13 Speaker
14 Operation unit
15 Display unit
16 Microphone amplifier
17 A/D converter
18 Storage unit
20 Control unit
21, 102 Feature amount extraction unit
22, 103 Correlation value calculation unit
23, 104 Output audio control unit
30 Encoding/combining unit
40 Communication unit
50 Speaker amplifier
60 Proximity sensor (object detection unit)
101 Audio input unit

Claims

An electronic device comprising:
an audio input unit that inputs an audio signal;
a feature amount extraction unit that extracts a signal change feature amount based on the audio signal;
a correlation value calculation unit that calculates a correlation value between a pre-registered first signal change feature amount and a second signal change feature amount extracted from the audio signal; and
an output audio control unit that controls an output audio signal based on the correlation value.

The electronic device according to claim 1, wherein the output audio control unit mutes the audio signal when the correlation value exceeds a predetermined threshold.

The electronic device according to claim 2, further comprising an object detection unit that detects an object at a distance within a predetermined range,
wherein the output audio control unit mutes the audio signal when the object detection unit detects the object and the correlation value exceeds the predetermined threshold.

The electronic device according to claim 2, wherein the output audio control unit mutes the audio signal when the correlation value exceeds the predetermined threshold more than a predetermined number of times within a predetermined time.

The electronic device according to claim 1, wherein the audio input unit includes an input sound hole,
the audio input unit inputs the audio signal through the input sound hole, and
the feature amount extraction unit extracts the first signal change feature amount based on the audio signal obtained when a transition occurs from a sound-hole open state, in which the input sound hole is open, to a sound-hole closed state, in which the input sound hole is blocked.

The electronic device according to claim 5, wherein the feature amount extraction unit extracts, as the first signal change feature amount, the amount of change in energy of the audio signal in a predetermined band when the transition occurs from the sound-hole open state to the sound-hole closed state.

The electronic device according to any one of claims 1 to 6, wherein the feature amount extraction unit extracts, as the second signal change feature amount, the amount of change in energy of the audio signal over a predetermined time.

A method for controlling an electronic device, comprising:
a step of inputting an audio signal;
a feature amount extraction step of extracting a signal change feature amount based on the audio signal;
a step of calculating a correlation value between a pre-registered first signal change feature amount and a second signal change feature amount extracted from the audio signal; and
an output audio control step of controlling an output audio signal based on the correlation value.

A program that causes a computer controlling an electronic device to execute:
a process of inputting an audio signal;
a feature amount extraction process of extracting a signal change feature amount based on the audio signal;
a process of calculating a correlation value between a pre-registered first signal change feature amount and a second signal change feature amount extracted from the audio signal; and
an output audio control process of controlling an output audio signal based on the correlation value.