WO2001057852A1

WO2001057852A1 - Robot hearing system

Info

Publication number: WO2001057852A1
Application number: PCT/JP2001/000647
Authority: WO
Inventors: Kazuhiro Nakadai; Hiroshi Okuno; Hiroaki Kitano; Tatsuya Matsui
Original assignee: Japan Science and Technology Corp
Current assignee: Japan Science and Technology Agency
Priority date: 2000-01-31
Filing date: 2001-01-31
Publication date: 2001-08-09
Anticipated expiration: 2002-07-31

Abstract

A hearing system (10) of a robot, especially the one of human or animal type, capable of collecting sound from an external target and actively hearing it without being influenced by the noise produced by, e.g. a drive mechanism in the robot. The system comprises a pair of external microphones (16) provided at the ear parts outside the sound-insulating shell (13), a pair of internal microphones (17) for collecting noise provided inside the shell, noise removers (23, 24) for removing noise signals from an inside noise source from the acoustic signals from the external microphones, pitch extracting sections (25, 26) for extracting acoustic data (DL, DR) about the time, frequency and power from the left and right acoustic signals from the noise removers by frequency analysis, a left/right channel relating section (27) for determining the direction from which sound comes on the basis of the harmonic structure involving the pitches of the left/right acoustic data extracted by the pitch extracting section, and a sound source separating section (28) for separating acoustic data into acoustic data sets of respective sound sources according to the information on the sound direction determination.

Description

明細書ロボット聴覚システム技術分野 Description Robot Auditory system Technical field

本発明はロボット、特に人型または動物型ロボッ卜における聴覚システムに関する。背景技術 The present invention relates to a hearing system in a robot, particularly in a humanoid or animal robot. Background art

近年、このような人型または動物型ロボットにおいては、視覚，聴覚の能動知覚が注目されてきている。 In recent years, attention has been paid to active perception of sight and hearing in such humanoid or animal-like robots.

能動知覚とは、ロボット視覚やロボット聴覚等の知覚を担当する知覚装置を、知覚すべき目標に追従するように、これらの知覚装置を支持する、例えば頭部を駆動機構により姿勢制御するものである。 Active perception is a system that supports a sensory device that is in charge of perception such as robot vision and robot hearing so that it follows the target to be perceived. is there.

ここで、能動視覚に関しては、少なくとも知覚装置であるカメラ力駆動機構による姿勢制御によってカメラの光軸方向が目標に向かって保持され、目標に対して自動的にフォーカシングゃズ一ムィン，ズームァゥト等が行なわれることにより、カメラが目標を撮像するようになっている。このような能動視覚については、種々の研究が行なわれている。 Here, regarding active vision, the optical axis direction of the camera is held toward the target by at least the attitude control by the camera power drive mechanism, which is a perception device, and the target is automatically focused on, for example, focusing, zooming, zooming, etc. Is performed, so that the camera captures the target. Various studies have been conducted on such active vision.

これに対して、能動聴覚に関しては、少なくとも知覚装置であるマイクが、駆動機構による姿勢制御によって、その指向性が目標に向かって保持され、目標からの音がマイクによって集音される。この際、能動聴覚の不利な点として、駆動機構が作動している間はマイクが駆動機構の作動音を拾ってしまうため、目標からの音に比較的大きなノィズが混入してしまい、目標からの音を認識できなくなつてしまうことがある。 On the other hand, with regard to active hearing, at least the microphone, which is a sensory device, maintains its directivity toward the target by controlling the attitude of the drive mechanism, and the microphone collects sound from the target. At this time, the disadvantage of active hearing is that the microphone picks up the operating sound of the drive mechanism while the drive mechanism is operating, so that relatively loud noise is mixed in with the sound from the target. It may not be possible to recognize the sound from the sound.

しかしながら、駆動機構を停止させた状態での聴覚研究では、特に目標が移動しているような場合、目標の移動に追従しながらの所謂能動聴覚を行なうことが困難であった。 However, in auditory research with the drive mechanism stopped, it was difficult to perform so-called active hearing while following the movement of the target, especially when the target is moving.

さらに、上述した駆動機構だけでなく、ロボット内部で発生する各種動作音及び定常的に発生するノイズも聴覚装置としてのマイクが集音してしまうので、同様に完全な能動聴覚を得ることが困難であった。 Furthermore, not only the above-mentioned drive mechanism, but also various operation sounds generated inside the robot Similarly, it is also difficult to obtain perfect active hearing, because microphones as hearing devices also collect noise that is constantly generated.

本発明の目的は、このような従来の技術における欠点を解決するものであり、駆動機構等のロボット内部で発生するノイズに影響されることなく、外部の目標からの音を集音して能動知覚を行なうことができるようにしたロボット、特に人型または動物型ロボットの聴覚システムを提供することにある。発明の開示 An object of the present invention is to solve the above-mentioned drawbacks in the conventional technology, and to collect sound from an external target without being affected by noise generated inside the robot such as a driving mechanism, and to actively control the sound. It is an object of the present invention to provide an auditory system for a robot capable of performing perception, particularly a humanoid or animal robot. Disclosure of the invention

この目的を達成するために、本発明によるロボット聴覚システムは、内部にノィズ発生源を備えたロボッ卜において、少なくともロボッ卜の一部を覆う防音性の外装と、上記外装の外側に設けられ主として外部の音を集音する少なくとも一対の外部マイクと、上記外装の内側に設けられ主として内部のノィズ発生源からのノイズを集音する少なくとも一対の内部マイクと、上記外部マイク及び内部マィクからの左右の信号に基づいてそれぞれ外部マイクからの音響信号から内部のノィズ発生源からのノィズ信号をキャンセルするノィズ除去部と、上記ノィズ除去部からの左右の音響信号からそれそれ周波数解析を行なって、時間，周波数及びパワーに関する音響データ及びピッチ情報を取り出すピッチ抽出部と、上記ビツチ抽出部で取り出された左右の音響データ及びピッチ情報に基づいて当該音響デ一夕が示すピッチを伴う調波構造から音の方向付けを行なう左右チャンネル対応部と、上記左右チャンネル対応部で取り出された音の方向付け情報に基づいて上記音響データから音源毎の音響データに分離する音源分離部と、を含んでいることを特徴とする。 In order to achieve this object, a robot hearing system according to the present invention includes a soundproof exterior covering at least a part of the robot, and a robot provided with a noise source inside, and provided outside the exterior. At least one pair of external microphones that mainly collect external sound, at least one pair of internal microphones that are provided inside the exterior and mainly collect noise from internal noise sources, and the external microphone and the internal microphone. Noise canceling unit that cancels the noise signal from the internal noise source from the audio signal from the external microphone based on the left and right signals from the microphone, respectively, and the frequency from the left and right audio signals from the noise removing unit. A pitch extraction unit that performs analysis to extract acoustic data and pitch information relating to time, frequency, and power; A left and right channel corresponding section for directing sound from a harmonic structure with a pitch indicated by the sound data based on the left and right sound data and pitch information extracted by the left and right channel sections; And a sound source separation unit that separates the sound data into sound data for each sound source based on the sound direction information.

また、上記目的を達成するために、本発明による人型または動物型ロボットの聴覚システムは、内部に駆動機構等のノイズ発生源を備えた人型または動物型のロボットにおいて、少なくともロボットの頭部を覆う防音性の外装と、上記外装の外側にて両側の耳に対応する両耳位置に設けられ主として外部の音を集音する少なくとも一対の外部マイクと、上記外装の内側に設けられた主として内部のノィズ発生源からのノイズを集音する少なくとも一対の内部マイクと、上記外部マィク及び内部マイクからの信号に基づいて外部マイクからの音響信号から内部のノィズ発生源からのノィズ信号をキャンセルするノィズ除去部と、上記ノィズ除去部からの左右の音響信号からそれそれ周波数解析を行なって時間，周波数及びパワーに関する音響データ及びピッチ情報を取り出すピッチ抽出部と、上記ピッチ抽出部で取り出された左右の音響データピッチ情報に基づいて当該音響デ一夕のピッチを伴う調波構造から音の方向付けを行なう左右チャンネル対応部と、上記左右チャンネル対応部で取り出された音の方向付け情報に基づいて上記音響デ一夕から音源毎の音響データに分離する音源分離部と、を含んでいることを特徴としている。 In order to achieve the above object, a human- or animal-type robot auditory system according to the present invention provides a human- or animal-type robot having a noise generating source such as a drive mechanism therein, at least a robot head. A soundproof exterior covering the outside, at least one pair of external microphones provided at both ear positions corresponding to both ears outside the exterior and mainly collecting external sounds, and mainly provided inside the exterior. At least a pair of internal microphones that collect noise from an internal noise source; and an internal microphone based on signals from the external microphone and the internal microphone based on signals from the external microphone and the internal microphone. A noise removing unit for canceling a noise signal from a noise generating source; a pitch extracting unit for performing frequency analysis on left and right acoustic signals from the noise removing unit to extract acoustic data and pitch information relating to time, frequency and power. A left / right channel corresponding unit for directing a sound from a harmonic structure with a pitch of the sound data based on the left / right sound data pitch information extracted by the pitch extracting unit; And a sound source separation unit that separates the sound data into sound data for each sound source based on the sound orientation information extracted by the sound source.

上記構成によれば、外部マイクが主として外部の目標からの音を集音し、内部マイクが主としてロボット内部の駆動機構等のノィズ発生源からのノイズを集音する。その際、外部マイクが集音した音響信号には、ロボット内部のノイズ発生源からのノイズ信号が混入しているが、この混入したノイズ信号は、ノイズ除去部における演算処理によって内部マイクが集音したノィズ信号によりキャンセルされることにより、著しく低減される。 According to the above configuration, the external microphone mainly collects sound from an external target, and the internal microphone mainly collects noise from a noise generating source such as a drive mechanism inside the robot. At that time, the noise signal from the noise source inside the robot is mixed in the acoustic signal collected by the external microphone, and this mixed noise signal is collected by the internal microphone by arithmetic processing in the noise removal unit. It is significantly reduced by being canceled by the noise signal.

そして、ノイズがキャンセルされた音響信号から、ピッチ抽出部により周波数解析によって音響信号を取り出して、この音響信号から左右チャンネル対応部により音の方向付けを行なった後、この音の方向付けに基づいて、音源分離部によつて各音源毎の音響データに分離する。 Then, from the acoustic signal from which the noise has been canceled, the acoustic signal is extracted by frequency analysis by the pitch extraction unit, and the sound direction is determined from the acoustic signal by the left and right channel corresponding units. Then, the sound source is separated into sound data for each sound source by a sound source separation unit.

したがって、ノイズ除去部での演算処理によって、ロボット内部の駆動機構等のノイズ発生源からのノイズが容易に著しく低減されることにより、外部マイクからの音響信号の S /N比が大幅に改善されるので、音源毎の音響デ一夕の分離をより一層良好に行なうことができる。 Therefore, the noise reduction from the noise source such as the drive mechanism inside the robot is easily and significantly reduced by the arithmetic processing in the noise elimination section, and the S / N ratio of the acoustic signal from the external microphone is greatly improved. Therefore, the sound data can be more properly separated for each sound source.

本発明によるロボット聴覚システムは、好ましくは、さらに、ロボットが視覚 , 触覚等の知覚システムを備えており、上記左右チャンネル対応部がこれらの知覚システムからの情報を参照して音の方向付けを行なう。 In the robot hearing system according to the present invention, preferably, the robot further includes a perception system such as visual and tactile sensation, and the left and right channel corresponding units refer to information from these perception systems to determine the direction of sound. Do.

この構成によれば、例えば視覚装置からの目標に関する視覚情報に基づいて左右チャンネル対応部がより明確な方向付けを行なうことができる。 According to this configuration, for example, the left and right channel corresponding sections can perform clearer orientation based on visual information on a target from the visual device.

本発明によるロボット聴覚システムは、好ましくは、上記左右チャンネル対応部が上記知覚システムに対して音の方向付けに関する情報を出力する。この構成によれば、例えば視覚装置に対して目標に関する聴覚による方向付けの情報を出力するので、視覚装置がより一層正確な方向付けを行なうことができる。 In the robot auditory system according to the present invention, preferably, the left and right channel corresponding sections output information relating to the direction of sound to the sensory system. According to this configuration, for example, the information on the orientation of the target by hearing is output to the visual device, so that the visual device can perform more accurate orientation.

さらに、本発明によるロボット聴覚システムは、好ましくは、上記内部マイクが、外装の内側において少なくとも横方向に移動可能に配設されていることを特徴とする。 Furthermore, the robot hearing system according to the present invention is preferably characterized in that the internal microphone is disposed inside the exterior so as to be movable at least in a lateral direction.

この構成によれば、ロボッ卜内部のノイズ発生源からのノイズをより良好に集音するように内部マイクを横方向、前後方向及び/又は上下方向に移動させて、各内部マイクと対となるべき対応する外部マイクとの距離を適宜に調整することができ、外部マイクが集音する音響信号から最も最適な状態でノイズをキャンセルすることができる。 According to this configuration, the internal microphones are moved in the horizontal direction, the front-rear direction, and / or the vertical direction so as to more appropriately collect noise from a noise source inside the robot, and are paired with each internal microphone. The distance to the corresponding external microphone can be appropriately adjusted, and the noise can be canceled in the most optimal state from the acoustic signal collected by the external microphone.

本発明によるロボット聴覚システムは、好ましくは、上記内部マイクが、外装の内側にて横方向に延びる支持部材に対して摺動可能に支持されている。 In the robot hearing system according to the present invention, preferably, the internal microphone is slidably supported by a support member extending in a lateral direction inside the exterior.

本発明によるロボット聴覚システムは、好ましくは、上記支持部材が横方向に延びるスロットを備えており、上記内部マイクが上記支持部材のスロッ卜に沿つて摺動可能に支持された可動部材に固定されている。 In the robot hearing system according to the present invention, preferably, the support member includes a slot extending in a lateral direction, and the internal microphone is fixed to a movable member slidably supported along the slot of the support member. Have been.

本発明によるロボット聴覚システムは、好ましくは、上記可動部材が、スロットに挿通された固定ネジにより支持部材に対して螺着される。 In the robot hearing system according to the present invention, preferably, the movable member is screwed to the support member by a fixing screw inserted into the slot.

本発明によるロボット聴覚システムは、好ましくは、上記内部マイクがさらに前後方向に移動可能に配設されている。 In the robot hearing system according to the present invention, preferably, the internal microphone is further disposed so as to be movable in the front-rear direction.

本発明によるロボット聴覚システムは、好ましくは、上記内部マイクが外装の内側にて横方向に延びる第一の支持部材に対して摺動可能に支持されていて、この第一の支持部材が外装の内側にて前後方向に延びる第二の支持部材に対して摺動可能に支持されている。 In the robot hearing system according to the present invention, preferably, the internal microphone is slidably supported on a first support member extending in the lateral direction inside the exterior, and the first support member is provided on the exterior. It is supported slidably with respect to a second support member extending in the front-rear direction on the inside.

本発明によるロボット聴覚システムは、好ましくは、上記内部マイクがさらに上下方向に移動可能に配設されている。図面の簡単な説明 In the robot hearing system according to the present invention, preferably, the internal microphone is further disposed so as to be vertically movable. BRIEF DESCRIPTION OF THE FIGURES

本発明は、以下の詳細な説明及び本発明の実施例を示す添付図面に基づいて、より良く理解されるものとなろう。なお、添付図面に示す実施例は、本発明を特定することを意図するものではなく、単に説明及び理解を容易にするためのものである。 The present invention is based on the following detailed description and the accompanying drawings, which illustrate embodiments of the present invention. It will be better understood. The embodiments shown in the attached drawings are not intended to specify the present invention, but merely to facilitate explanation and understanding.

図中、 In the figure,

図 1は、本発明によるロボット聴覚装置の第一の実施形態を組み込んだ人型口ボッ卜の外観を示す正面図である。 FIG. 1 is a front view showing the appearance of a humanoid mouthboat incorporating the first embodiment of the robot hearing device according to the present invention.

図 2は、図 1の人型ロボットの側面図である。 FIG. 2 is a side view of the humanoid robot of FIG.

図 3は、図 1の人型ロボッ卜における頭部の構成を示す概略拡大図である。図 4は、図 1の人型ロボッ卜におけるロボット聴覚システムの電気的構成を示すブロック図である。 FIG. 3 is a schematic enlarged view showing the configuration of the head in the humanoid robot of FIG. FIG. 4 is a block diagram showing an electrical configuration of the robot auditory system in the humanoid robot of FIG.

図 5は、図 1の人型ロボッ卜における内部マイクの取付構造を示す概略正面図である。 FIG. 5 is a schematic front view showing a mounting structure of an internal microphone in the humanoid robot of FIG.

図 6は、図 5の内部マイクの取付構造の要部を示す拡大正面図である。 FIG. 6 is an enlarged front view showing a main part of the mounting structure of the internal microphone in FIG.

図 7は、図 5の内部マイクの取付構造の要部を示す拡大側面図である。 FIG. 7 is an enlarged side view showing a main part of the mounting structure of the internal microphone in FIG.

図 8は、図 5の内部マイクの取付構造の要部を示す拡大斜視図である。 FIG. 8 is an enlarged perspective view showing a main part of the mounting structure of the internal microphone in FIG.

図 9は、図 4のロボット聴覚システムにおけるピッチ抽出部により取り出された音響デー夕、ピッチ情報の一例を示すグラフである。 FIG. 9 is a graph showing an example of acoustic data and pitch information extracted by the pitch extraction unit in the robot hearing system of FIG.

図 1 0は、図 5の内部マイクの取付構造の変形例の要部を示す拡大斜視図である。発明を実施するための最良の形態 FIG. 10 is an enlarged perspective view showing a main part of a modified example of the mounting structure of the internal microphone in FIG. BEST MODE FOR CARRYING OUT THE INVENTION

以下、本発明のロボット聴覚システムの最良の第一の実施例を図面を参照して詳細に説明する。 Hereinafter, a first preferred embodiment of the robot hearing system of the present invention will be described in detail with reference to the drawings.

図 1から図 4は本発明によるロボット聴覚システムの一実施形態を備えた実験用の人型ロボッ卜の全体構成を示している。 FIGS. 1 to 4 show the overall configuration of an experimental humanoid robot provided with an embodiment of the robot hearing system according to the present invention.

図 1において、人型ロボット 1 0は、 4 D O F (自由度）のロボットとして構成されている。人型ロボット 1 0は、ベース 1 1と、ベース 1 1上にて一軸（垂直軸）周りに回動可能に支持された胴体部 1 2と、胴体部 1 2上にて三軸方向（垂直軸，左右方向の水平軸及び前後方向の水平軸）の周りに揺動可能に支持された頭部 13と、を含んでいる。 In FIG. 1, the humanoid robot 10 is configured as a 4-DOF (degree of freedom) robot. The humanoid robot 10 includes a base 11, a body 12 supported rotatably around one axis (vertical axis) on the base 11, and a three-axis direction on the body 12 ( (Vertical axis, horizontal axis in the horizontal direction, and horizontal axis in the front-rear direction). The head 13 and

上記べ一ス 11は、固定配置されていてもよく、ロボットの脚部として動作可能としてもよい。また、ベース 11は移動可能な台車等の上に載置されていてもよい。 The base 11 may be fixedly arranged, and may be operable as a leg of a robot. Further, the base 11 may be placed on a movable cart or the like.

上記胴体部 12は、ベース 11に対して垂直軸の周りに、図 1にて矢印 Aで示すように回動可能に支持されており、図示しなレ駆動手段によって回転駆動される。さらに、胴体部 12は、全体が防音性の外装によって覆われている。 The body portion 12 is rotatably supported about a vertical axis with respect to the base 11 as shown by an arrow A in FIG. 1, and is rotationally driven by a driving means (not shown). Further, the body 12 is entirely covered with a soundproof exterior.

上記頭部 13は胴体部 12に対して連結部材 13 aを介して支持されており、この連結部材 13 aに対して前後方向の水平軸の周りに、図 1にて矢印 Bで示すように揺動可能に、また左右方向の水平軸の周りに、図 2にて矢印 Cで示すように揺動可能に支持されている。この上記連結部材 13 aは、胴体部 12に対してさらに前後方向の水平軸の周りに、図 1にて矢印 Dで示すように揺動可能に支持されている。 The head 13 is supported on the body 12 via a connecting member 13a, and around the horizontal axis in the front-rear direction with respect to the connecting member 13a, as shown by an arrow B in FIG. It is swingably supported around a horizontal axis in the left-right direction as shown by arrow C in FIG. The connecting member 13a is swingably supported relative to the body 12 around a horizontal axis in the front-rear direction, as indicated by an arrow D in FIG.

そして、上記矢印 A， B， C， D方向の回転駆動は、それそれ図示しない駆動手段によって行なわれる。 The rotation in the directions of the arrows A, B, C, and D is performed by driving means (not shown).

ここで、上記頭部 13は、図 3に示すように全体が防音性の外装 14により覆われている。さらに、上記頭部 13は、前側にロボット視覚を担当する視覚装置としてのカメラ 15を、また両側にロボット聴覚を担当する聴覚装置としての一対の外部マイク 16 (16a, 16b) を備えている。 Here, the head 13 is entirely covered with a soundproof exterior 14, as shown in FIG. Further, the head 13 has a camera 15 on the front side as a visual device in charge of robot vision, and a pair of external microphones 16 (16a, 16b) on both sides as a hearing device for robot hearing. .

さらに、上記頭部 13は、図 3に示すように、外装 14の内側にて左右に離隔して配設された一対の内部マイク 17 (17a, 17b) を備えている。 Further, as shown in FIG. 3, the head 13 includes a pair of internal microphones 17 (17a, 17b) which are disposed inside the exterior 14 so as to be separated from each other.

上記外装 14は、例えばウレタン樹脂等の吸音性の合成樹脂から構成されている。上記外装 14は、頭部 13の内部をほぼ完全に密閉することにより頭部 13 の内部の遮音を行なうように構成されている。なお、胴体部 12の外装も、同様にして吸音性の合成樹脂から構成されている。 The exterior 14 is made of a sound absorbing synthetic resin such as a urethane resin. The exterior 14 is configured so that the inside of the head 13 is sound-insulated by almost completely sealing the inside of the head 13. The exterior of the body 12 is similarly made of a sound-absorbing synthetic resin.

上記カメラ 15は公知の構成であって、例えば所謂パン，チルト，ズームの 3 DOF (自由度）を有する市販のカメラを使用することができる。 The camera 15 has a known configuration. For example, a commercially available camera having 3 DOFs (degrees of freedom) of so-called pan, tilt, and zoom can be used.

上記外部マイク 16は、それぞれ頭部 13の側面において、前方に向かって指向性を有するように取り付けられている。ここで、外部マイク 1 6の左右の各外部マイク 16 a， 1 6 bは、それそれ図 1及び図 2に示すように、外装 14の両側にて前方に向いた段部 14 a, 14b にて内側に取り付けられ、段部 14 a, 14 bに設けられた貫通穴を通して前方の音を集音すると共に、外装 14の内部の音を拾わないように適宜の手段により遮音されている。 The external microphones 16 are attached to the sides of the head 13 so as to have directionality toward the front. Here, the left and right external microphones 16a and 16b on the left and right of the external microphone 16 are respectively attached to stepped portions 14a and 14b facing forward on both sides of the exterior 14, as shown in FIGS. It is mounted on the inside, and through the through holes provided in the steps 14a, 14b, collects the sound in front and is sound-insulated by appropriate means so as not to pick up the sound inside the exterior 14.

これにより外部マイク 1 6 a, 16 bは、所謂バイノーラルマイクとして構成されている。なお、外部マイク 16 a, 16 bの取付位置の近傍において、外装 14は人間の外耳形状に形成されていてもよい。 Thus, the external microphones 16a and 16b are configured as so-called binaural microphones. In the vicinity of the mounting position of the external microphones 16a and 16b, the exterior 14 may be formed in the shape of a human outer ear.

上記内部マイク 17は、それぞれ外装 14の内側において、上述した各外部マイク 16 a， 16 bの近傍に、図示の場合には、前記カメラ 1 5の両端付近の上方にて、後述するように横方向に移動可能に配設されている。 The internal microphones 17 are respectively located inside the exterior 14 in the vicinity of the external microphones 16a and 16b described above, and in the illustrated case, above the both ends of the camera 15 as described later. Is arranged so as to be movable in the lateral direction.

なお、上記内部マイク 17は、外装 14の内側にて任意の位置に備えられていてもよい。 Note that the internal microphone 17 may be provided at an arbitrary position inside the exterior 14.

図 4は上記外部マイク 16及び内部マイク 17を含む音響処理のための聴覚システムの電気的構成を示している。 FIG. 4 shows an electrical configuration of a hearing system for sound processing including the external microphone 16 and the internal microphone 17.

図 4において、聴覚システム 20は、各外部マイク 16 a， 16 b及び内部マイク 17 a, 17 bからの音響信号をそれぞれ増幅するアンプ 2 1 a, 2 l b, 2 1 c, 2 I dと、これらの増幅信号を AD変換によりデジタル音響信号 SOL ， SOR, S I L, S I Rに変換する ADコンバータ 22 a， 22 b, 22 c, 22 と、これらのデジタル音響信号が入力される左右のノイズ除去回路 23, 24と、ノイズ除去回路 23, 24からのデジタル音響信号 SR, SLが入力されるピッチ抽出部 25, 26と、ピッチ抽出部 25, 26からの音響データが入力される左右チャンネル対応部 27と、左右チャンネル対応部 27からのデータが入力される音源分離部 28と、から構成されている。 In FIG. 4, a hearing system 20 includes amplifiers 21 a, 2 lb, 21 c, and 2 Id that amplify acoustic signals from the external microphones 16 a and 16 b and the internal microphones 17 a and 17 b, respectively. AD converters 22a, 22b, 22c, 22 that convert these amplified signals into digital audio signals SOL, SOR, SIL, SIR by AD conversion, and left and right noise removal circuits to which these digital audio signals are input 23, 24, pitch extraction units 25, 26 to which digital audio signals SR, SL from noise elimination circuits 23, 24 are input, and left and right channel corresponding units to which audio data from pitch extraction units 25, 26 are input 27, and a sound source separation unit 28 to which data from the left and right channel corresponding units 27 is input.

上記 ADコンバータ 22 a乃至 22 dは、例えば量子化ビット数 16または 2 4で、 48kHzサンプリングした信号を取り出すように構成されている。そして、左側の外部マイク 16 aからのデジ夕ル音響信号 SOL及び左側の内部マイク 1 Ί aからのデジタル音響信号 S I Lは、ノイズ除去回路 23に入力される。また、右側の外部マイク 16 bからのデジタル音響信号 S OR及び左側の内部マイク 1 7 bからのデジタル音響信号 S I Rは、ノイズ除去回路 2 4に入力される。 The AD converters 22a to 22d are configured to take out a signal sampled at 48 kHz with a quantization bit number of 16 or 24, for example. Then, the digital audio signal SOL from the left external microphone 16 a and the digital audio signal SIL from the left internal microphone 1 a are input to the noise removal circuit 23. The digital audio signal SOR from the external microphone 16b on the right and the left The digital acoustic signal SIR from the internal microphone 17 b is input to the noise elimination circuit 24.

これらのノイズ除去回路 2 3 , 2 4は同一の構成であり、それそれ外部マイク 1 6からの音響信号から内部マイク 1 7のノイズ信号によりノイズキャンセルするように構成されている。 These noise elimination circuits 23 and 24 have the same configuration, and are configured to cancel noise from the acoustic signal from the external microphone 16 by the noise signal from the internal microphone 17.

即ち、ノイズ除去回路 2 3は、外部マイク 1 6 aからのデジタル音響信号 S O Lを、内部マイク 1 7 aが集音したロボット内部のノイズ発生源からのノイズ信号 S I Lに基づいて、例えば最も簡便には、外部マイク 1 6 aからのデジタル音響信号 S O Lから、内部マイク 1 Ί aからの音響信号 S I Lを減算する等の適宜の演算処理によるノイズキャンセル処理によって、外部マイク 1 6 aからの音響信号 S 0 Lに混入したロボット内部の各駆動機構等のノィズ発生源からのノイズを除去して、左の音響信号 S Lを生成する。 That is, the noise elimination circuit 23 converts the digital sound signal SOL from the external microphone 16a into a noise signal SIL from the noise source inside the robot collected by the internal microphone 17a. The noise from the external microphone 16a is reduced by appropriate noise reduction processing such as subtracting the acoustic signal SIL from the internal microphone 16a from the digital acoustic signal SOL from the external microphone 16a. The left acoustic signal SL is generated by removing noise from noise sources such as each drive mechanism inside the robot mixed in the signal S 0 L.

また、ノイズ除去回路 2 4は、外部マイク 1 6 bからのデジタル音響信号 S O Rを、内部マイク 1 7 bが集音したロボット内部のノイズ発生源からのノイズ信号 S I Rに基づいて、例えば最も簡便には、外部マイク 1 6 bからのデジタル音響信号 S O Rから、内部マイク 1 Ί bからの音響信号 S I Rを減算する等の適宜の演算処理によるノイズキャンセル処理によって、外部マイク 1 6 bからの音響信号 S I Rに混入したロボット内部の各駆動機構等のノィズ発生源からのノイズを除去して、右の音響信号 S Rを生成する。 In addition, the noise removal circuit 24 is based on the digital audio signal SOR from the external microphone 16 b and the noise signal SIR from the noise source inside the robot collected by the internal microphone 17 b, The noise from the external microphone 16b is reduced by appropriate noise cancellation processing such as subtracting the acoustic signal SIR from the internal microphone 1 内部 b from the digital sound signal SOR from the external microphone 16b. The noise from the noise source such as each drive mechanism inside the robot mixed into the signal SIR is removed, and the right acoustic signal SR is generated.

上記ピッチ抽出部 2 5， 2 6は同一の構成であり、ノイズ除去回路 2 3 , 2 4 からの左右の音響信号 S L， S Rをそれそれ周波数解析して、時間，周波数及びパワーの三軸から成る音響データ、並びにピッチ情報を取り出すように構成されている。即ち、ピッチ抽出部 2 5は、ノイズ除去回路 2 3からの左の音響信号 S Lを周波数解析することにより、時間とパワーから成る二軸の音響信号 S から時間，周波数及びパワーの三軸から成るスぺクトログラムと呼ばれる左の音響デ一夕 D Lを取り出す。 The pitch extraction units 25 and 26 have the same configuration. The left and right sound signals SL and SR from the noise elimination circuits 23 and 24 are subjected to frequency analysis, and the three axes of time, frequency and power are analyzed. It is configured to retrieve the acoustic data and pitch information. That is, the pitch extraction unit 25 performs frequency analysis on the left acoustic signal SL from the noise elimination circuit 23 to obtain three axes of time, frequency and power from the two-axis acoustic signal S composed of time and power. Take out the left acoustic data DL called a spectrogram.

また、ピッチ抽出部 2 6も同様にして、ノイズ除去回路 2 4からの右の音響信号 S Rを周波数解析することにより、時間とパワーから成る二軸の音響信号 S R から時間，周波数及びパワーの三軸から成る右の音響デ一夕 D R、ピッチ情報を取り出す。 Similarly, the pitch extraction unit 26 performs frequency analysis on the right acoustic signal SR from the noise elimination circuit 24 to obtain the time, frequency, and power from the two-axis acoustic signal SR composed of time and power. Three-axis right sound data DR, pitch information Take out.

ここで、上記周波数解析は、例えば 20m秒の窓長で、 7. 5 m秒づつずらしながら、 FFT (高速フーリエ変換）を行なうことにより行なわれる。 Here, the above-mentioned frequency analysis is performed by performing FFT (Fast Fourier Transform) with a window length of, for example, 20 ms and shifting by 7.5 ms.

なお、このような周波数解析は、 F FTに限らず一般的な種々の方法で行なわれるようにしてもよい。 Such frequency analysis is not limited to FFT and may be performed by various general methods.

このようにして得られた音響デ一夕 DLにおいては、音声や音楽における各音はスぺクトログラム上にてピークの時間方向への連なりとして示されると共に、一般的に調波構造を有しており、整数倍の周波数値に規則的にピークを有するようになっている。 In the sound data DL obtained in this manner, each sound in voice and music is shown as a series of peaks in the time direction on a spectrogram, and generally has a harmonic structure. And has a peak regularly at an integer multiple of the frequency value.

上記左右チャンネル対応部 27は、左右のピッチ抽出部 25, 26からの音響デ一夕 DL， DRにてそれそれピークにより示される調波構造のピッチ情報に関してその位相差，時間差等に基づいて、同じ音から派生するピッチを左右チャンネルで対応付けることにより音の方向付けを行なう。 The left and right channel corresponding units 27 are used to determine the phase difference, time difference, etc. of the pitch information of the harmonic structure indicated by the peak in the sound data DL and DR from the left and right pitch extraction units 25 and 26, respectively. Based on this, the direction of the sound is determined by associating the pitches derived from the same sound with the left and right channels.

図示の場合、左右チャンネル対応部 27は、聴覚システム 20以外にロボット 10に備えられた図示しない他の知覚システム 30から目標に関する情報、具体的には例えば視覚システムによる目標の位置，方向，形状，移動の有無等に関する情報や、触覚システムによる目標の柔軟度，振動の有無，触感等の情報が入力されることにより、上述した音源からの音の方向付けがより一層明確に行なわれるように構成されている。 In the case shown in the figure, the left and right channel corresponding units 27 provide information on the target from other perceptual systems 30 (not shown) provided in the robot 10 in addition to the auditory system 20, specifically, for example, the position, direction, shape, By inputting information on the presence / absence of movement and information on the target flexibility, presence / absence of vibration, tactile sensation, etc. by the haptic system, the direction of the sound from the sound source described above can be more clearly determined It is configured as follows.

上記音源分離部 28は、公知の構成により、左右チャンネル対応部 27からの方向付けの情報及び音響デ一夕 DL, DRが入力されることにより、例えばディクションパスフィル夕を使用する等の手法によって、方向付けの情報に基づいて音響データ DL, DRから音源を同定すると共に、各音源毎に音響データを分離するように構成されている。 The sound source separation unit 28 receives the direction information and the sound data DL and DR from the left and right channel corresponding units 27 according to a known configuration. Thus, the sound source is identified from the sound data DL and DR based on the information of the direction, and the sound data is separated for each sound source.

ここで、上記内部マイク 17は、具体的には、図 5〜図 8に示すように、カメラ 15に対して取り付けられている。 Here, the internal microphone 17 is specifically attached to the camera 15 as shown in FIGS.

先ず、図 5において、各内部マイク 17 a， 17bは、それそれ可動部材としての取付金具 40, 41に対して固定されている。 First, in FIG. 5, each of the internal microphones 17a and 17b is fixed to mounting brackets 40 and 41 as movable members, respectively.

この取付金具 40, 41は、それそれ上方が開放したほぼ U字形状に形成されており、上端が締付ネジ 40 a， 41 aで互いに近接されることにより内部マイク 17 a, 17 bを挟持して固定保持する。 These mounting brackets 40, 41 are formed in a substantially U-shape with the upper part opened. The upper ends of the inner microphones 17a and 17b are fixedly held by being brought close to each other with the tightening screws 40a and 41a.

また、上記取付金具 40， 41は、それそれ支持部材 42， 43に対して後述するように横方向に摺動可能に取り付けられている。 The mounting brackets 40 and 41 are respectively slidably mounted on the support members 42 and 43 in a lateral direction as described later.

上記支持部材 42, 43は、それぞれカメラ 15の上面に対してブラケット 4 4により固定されている。なお、図示の場合、支持部材 42, 43は、ブラケット 44と一体に形成されていると共に、ブラケット 44はカメラ 15が取り付けられる取付台 15 aに対して螺着されている。 The support members 42 and 43 are fixed to the upper surface of the camera 15 by brackets 44, respectively. In the illustrated case, the support members 42 and 43 are formed integrally with the bracket 44, and the bracket 44 is screwed to a mounting base 15a on which the camera 15 is mounted.

ここで、各支持部材 42， 43は、それそれ横方向に延びるスロット 42 a, 43 aを備えている。 Here, each support member 42, 43 is provided with a slot 42a, 43a extending in the lateral direction, respectively.

そして、上記取付金具 40, 41はそれぞれこれらのスロット 42 a, 43 a に挿通された固定ネジ 45， 46を受容するネジ孔（図示せず）を有しており、これらの固定ネジ 45, 46が締め付けられることで支持部材 42, 43に対して固定保持される。 The mounting brackets 40 and 41 have screw holes (not shown) for receiving the fixing screws 45 and 46 inserted into the slots 42a and 43a, respectively. Are fixed and held on the support members 42 and 43 by being tightened.

本発明実施形態による人型ロボット 10は以上のように構成されており、外部マイク 16 a， 16 bが集音しょうとする目標からの音は、以下のようにして集音され、ノイズキャンセルされることにより音源に知覚される。 The humanoid robot 10 according to the embodiment of the present invention is configured as described above, and the sound from the target to be collected by the external microphones 16a and 16b is collected as follows, and the noise is canceled. Is perceived by the sound source.

先ず、外部マイク 16 a， 16b力主として目標からの外部の音を集音してそれそれアナログ音響信号を出力する。ここで、外部マイク 16 a, 16 bは口ボット内部からのノイズも集音するが、外装 14自体が頭部 13の内部を密閉しており、また外部マイク 16 a， 16 bが頭部 13の内部に対して遮音されていることにより、アナログ音響信号に混入するノイズは、比較的低いレベルに抑えられている。 First, external microphones 16a, 16b mainly collect external sounds from the target and output analog sound signals. Here, the external microphones 16a and 16b also collect noise from the inside of the mouth bot, but the exterior 14 itself seals the inside of the head 13 and the external microphones 16a and 16b The noise that is mixed into the analog sound signal is suppressed to a relatively low level because the sound is shielded from the inside.

これに対して、内部マイク 17 a， 17 bは、主としてロボット内部からのノィズ、例えば前述した各駆動機構の作動音や冷却ファンの作動音等のノィズ発生源からのノイズを集音する。ここで、内部マイク 17 a, 17bは、外部からの音も集音するが、外装 14が内部を密閉していることから、混入するノイズのレベルは比較的低く抑えられている。 On the other hand, the internal microphones 17a and 17b mainly collect noise from the inside of the robot, for example, noise from noise generating sources such as the operating noise of each drive mechanism and the operating noise of the cooling fan. . Here, the internal microphones 17a and 17b also collect sound from outside, but since the exterior 14 hermetically seals the inside, the level of mixed noise is kept relatively low.

このようにして集音された外部マイク 1 6 a, 16 bからのアナログ音響信号及び内部マイク 1 7 a， 17 bからのアナログ音響信号は、それそれアンプ 21 a乃至 21 dにより増幅された後、 ADコンバータ 22 a乃至 22 dによりデジタル音響信号 SOL, SOR, S I L, S IRに AD変換されて、ノイズ除去回路 23及び 24に入力される。 Analog audio signals from external microphones 16a and 16b collected in this way Analog audio signals from the internal microphones 17a and 17b are amplified by amplifiers 21a to 21d, respectively, and then digital audio signals SOL, SOR, SIL, SIR by AD converters 22a to 22d. The signal is AD-converted and input to the noise elimination circuits 23 and 24.

ノイズ除去回路 23, 24は、それそれ外部マイク 16 a, 16 bからの音響信号 SOL, SORから内部マイク 17 a, 17 bからの音響信号 S I L , S I Rを減算する等の演算処理を行なうことにより、外部マイク 16 a, 16 bからの音響信号 SOL, SORから、ロボット内部のノイズ発生源からのノイズ信号を除去する。これにより、ノイズ除去回路 23， 24はそれそれノイズを除去した真の音響信号 SL， SRを外部に出力する。 The noise elimination circuits 23 and 24 perform arithmetic processing such as subtracting the audio signals SIL and SIR from the internal microphones 17a and 17b from the audio signals SOL and SOR from the external microphones 16a and 16b, respectively. The noise signal from the noise source inside the robot is removed from the sound signals SOL and SOR from the external microphones 16a and 16b. As a result, the noise elimination circuits 23 and 24 respectively output the true acoustic signals SL and SR from which noise has been eliminated to the outside.

そして、ピッチ抽出部 25, 26力それそれ音響信号 SL, SRに基づいて周波数解析により音響デ一夕 DL， DRを生成する。 Then, based on the sound signals SL and SR, the pitch extraction units 25 and 26 generate sound data DL and DR by frequency analysis.

ここで、音響データは、例えば図 9に示すようなスペクトログラムとして示される。図 9の音響データにおいて、横軸を時刻 t (秒）、縦軸を周波数（Hz) とし、濃度をパワーとして、例えば時刻 t 1から時刻 t 2まで断続的に連なった周波数成分が示されている。この周波数成分は、図示の場合、 1000 Hz, 2 000 H z , 3000Hz, · · ·のピークの連なりを有しており、これらのピ —クがそれそれ 1000Hzの整数倍であることから、一般的には、これらの周波数成分が唯一つの音を表わし、あるいはそれそれ一つの音を表わすものと解され得る。この場合には、これらの周波数成分が、同時刻に開始し且つ同時刻に終了していることから、これらの周波数成分が唯一つの音の調波構造を示していると判断される。従って、ピッチ抽出部 25, 26は、最も低い周波数である基本周波数 100 OH zをピッチとする。 Here, the acoustic data is shown as, for example, a spectrogram as shown in FIG. In the acoustic data of Fig. 9, the horizontal axis represents time t (seconds), the vertical axis represents frequency (Hz), and the density represents power, for example, frequency components intermittently connected from time t1 to time t2. I have. In the illustrated case, this frequency component has a series of peaks of 1000 Hz, 2000 Hz, 3000 Hz, and so on. Since these peaks are each an integral multiple of 1000 Hz, Typically, these frequency components can be understood to represent only one sound, or each one. In this case, since these frequency components start at the same time and end at the same time, it is determined that these frequency components indicate the harmonic structure of only one sound. Therefore, the pitch extraction units 25 and 26 set the fundamental frequency 100 OHz, which is the lowest frequency, as the pitch.

このようにして、ピッチ抽出部 25, 26は、音響信号 SL， SRに含まれる全ての音に関して当該音に関するピッチを抽出し、このピッチに対応する当該音の調波構造，開始時 t 1, 終了時 t 2と共に、音響データ DL， DR, ピッチ情報として左右チャンネル対応部 27に出力する。 In this way, the pitch extraction units 25 and 26 extract the pitches of all the sounds included in the sound signals SL and SR, and extract the pitches of the sounds corresponding to the pitches, At the end, at time t2, the sound data DL, DR, and pitch information are output to the left and right channel corresponding unit 27.

続いて、左右チャンネル対応部 27が、これらの音響デ一夕 DL, DRに基づいて音源毎の音の方向付けを行なう。この場合、左右チャンネル対応部 27は、例えば各ピッチ抽出部 25 , 26で抽出された音響デ一夕 DL, DRに基づいて、左右のチャンネルの調波構造を比較して最も近いピッチを対応付ける。その際、左右のチャンネルのピッチを一対一で比較するだけでなく、一方のチャンネルの複数のピッチを他方のチャンネルの一つのピッチと比較する等により、より正確な対応付けを行なうことが好ましレ、。そして、左右チャンネル対応部 27は、対応付けられたピッチの位相を比較することにより音源の方向を容易に演算して方向付けることができる。 Subsequently, the left and right channel corresponding unit 27 determines the direction of the sound for each sound source based on the sound data DL and DR. In this case, the left and right channel correspondence unit 27 compares the harmonic structures of the left and right channels based on the sound data DL and DR extracted by the pitch extraction units 25 and 26, and associates the closest pitch . In that case, it is preferable to make more accurate correspondence by comparing not only the pitch of the left and right channels one-to-one but also a plurality of pitches of one channel with one pitch of the other channel. Les ,. Then, the left and right channel corresponding section 27 can easily calculate and direct the direction of the sound source by comparing the phases of the pitches associated with each other.

これにより、音源分離部 28が、左右チャンネル対応部 27からの音の方向付け情報に基づいて、音響データ DL, DRから音源毎の音に関する音響デ一夕を抽出して、音源毎の音に分離する。このようにして、聴覚システム 20は音源毎の音の分離によつて音響認識を行ない、能動聴覚を行なうことができる。 Accordingly, the sound source separation unit 28 extracts the sound data for each sound source from the sound data DL and DR based on the sound direction information from the left and right channel corresponding units 27, and outputs the sound data for each sound source. To separate. In this way, the hearing system 20 can perform sound recognition by separating sounds for each sound source and perform active hearing.

ここで、内部マイク 17 a, 17bは、それそれ取付金具 40, 41に固定保持されると共に、これらの取付金具 40, 41が支持部材 42， 43で固定保持されることにより、カメラ 15に対して固定保持されている。 Here, the internal microphones 17a, 17b are fixedly held by the mounting brackets 40, 41, respectively, and the mounting brackets 40, 41 are fixedly held by the support members 42, 43, so that the internal microphones 17a, 17b are fixed to the camera 15. Fixedly held.

さらに、取付金具 40, 41は、それそれ支持部材 42， 43のスロット 42 a, 43 aに挿通された固定ネジ 45 , 46により支持部材 42， 43に対して螺着されている。 Further, the mounting brackets 40, 41 are screwed to the support members 42, 43 by fixing screws 45, 46 inserted into the slots 42a, 43a of the support members 42, 43, respectively.

その際、取付金具 40, 41そして内部マイク 17 a, 17bは、固定ネジ 4 5 , 46が緩められた状態にて、固定ネジ 45, 46がそれぞれ支持部材 42, 43のスロット 42 a, 43 a内を摺動することにより、横方向の位置が適宜に調整され、調整後に固定ネジ 45, 46が締め付けられる。 At this time, the mounting brackets 40, 41 and the internal microphones 17a, 17b are fixed to the mounting members 45, 46 with the fixing screws 45, 46 loosened, and the fixing screws 45, 46 are inserted into the slots 42a, 43a of the support members 42, 43, respectively. By sliding inside, the horizontal position is adjusted appropriately, and after adjustment, the fixing screws 45 and 46 are tightened.

これにより、内部マイク 17 a， 17bは、頭部 13の外装 14の内側にて横方向に移動調整されることにより、所望の位置に設定され得る。即ち、内部マイク 17 a, 17 bは、それそれ対応する外部マイク 16 a, 1 6 bに対して適宜の距離の位置に調整されることによって、前述したノイズ除去回路 23, 24によるノイズキャンセルを最も効果的に行なうことができる。 Thus, the internal microphones 17a and 17b can be set at desired positions by being moved and adjusted in the lateral direction inside the exterior 14 of the head 13. That is, the internal microphones 17a and 17b are adjusted to positions at appropriate distances with respect to the corresponding external microphones 16a and 16b, so that the noise generated by the above-described noise removing circuits 23 and 24 is adjusted. Cancellation can be performed most effectively.

このようにして、本発明実施形態による人型ロボット 10によれば、内部マイク 17 a， 17bを横方向に適宜に調整した状態にて、ノイズ除去回路 23, 2 4により、外部マイク 16 a, 16 bからの音響信号 SOL， SORから、内部マイク 17 a, 17 bからの音響信号 S I L， S I Rに基づいてノイズキャンセルを行なうことにより、各駆動機構によって外部マイク 16 a, 16 bの指向性の方向を目標に向けながら、 S/N比の良好な音響信号を得て、各音源からの音を分離してより正確な音声認識を行なうことができる。 Thus, according to the humanoid robot 10 according to the embodiment of the present invention, the noise reduction circuits 23 and 24 allow the external microphones 16 and 17 to be appropriately adjusted in the horizontal direction while the internal microphones 17 a and 17 b are appropriately adjusted. a, 16 b Sound signal from SOL, SOR, internal By performing noise cancellation based on the acoustic signals SIL and SIR from the microphones 17a and 17b, the S / N ratio can be increased while the directivity of the external microphones 16a and 16b is directed to the target by each drive mechanism. A good acoustic signal can be obtained, and the sound from each sound source can be separated to perform more accurate speech recognition.

従って、例えば目標が移動している場合であっても、各駆動機構によって外部マイク 16 a， 16 bの指向性の方向を常に目標に対して追従させながら、目標の音響認識を行なうことができる。 Therefore, even when the target is moving, for example, the acoustic recognition of the target can be performed while the directionality of the directivity of the external microphones 16a and 16b always follows the target by each drive mechanism. .

このとき、左右チャンネル対応部 27が、例えば他の知覚システム 30として視覚システムからの視覚による目標の方向付け情報を参照して、音の方向付けを行なうことにより、より正確な音の方向付けを行なうことができる。 At this time, the left and right channel corresponding unit 27 refers to, for example, the visual target direction information from the visual system as the other perceptual system 30 to determine the direction of the sound, thereby providing more accurate sound direction. Can do it.

また、他の知覚システム 30として視覚システムが使用される場合、左右チヤンネル対応部 27が視覚システムに対して音の方向付け情報を出力するようにしてもよい。これにより、視覚システム 30が、画像認識により目標の方向付けを行なう際に、聴覚システム 20からの音に関する方向付け情報を参照することによって、例えば目標が移動して障害物の裏に隠れてしまった場合でも、目標からの音を参照して、目標の方向付けをより一層正確に行なうことができる。 When a visual system is used as the other perceptual system 30, the left and right channel corresponding units 27 may output sound direction information to the visual system. This allows the visual system 30 to refer to sound-directing information from the auditory system 20 when directing a target by image recognition, for example, by moving the target and hiding behind obstacles. Even if you do, you can refer to the sound from the target and more accurately direct the target.

図 10は上述した内部マイク 17 a, 17bの取付構造の変形例を示している。図 10において、内部マイク 17 a， 17 bを横方向に移動可能に支持する支持部材 42， 43が、さらにカメラ 15の上面に対して図示しない適宜の手段で固定された第二の支持部材 47， 48に対して、前後方向に摺動可能に取り付けられている。上記第二の支持部材 47, 48はそれぞれ前後方向に延びるスロット 47 a， 48 aを備えている。 FIG. 10 shows a modification of the mounting structure of the internal microphones 17a and 17b described above. In FIG. 10, supporting members 42 and 43 for supporting the internal microphones 17a and 17b movably in the lateral direction are further fixed to an upper surface of the camera 15 by appropriate means (not shown). It is attached to 47 and 48 so that it can slide in the front-rear direction. The second support members 47, 48 have slots 47a, 48a extending in the front-rear direction, respectively.

そして、上記支持部材 42, 43の両端に形成された垂直に延びる折曲部 42 b， 42 c (43b， 43 c) が、それそれこれらのスロット 47 a, 48 aに挿通された固定ネジ 49, 50を受容するネジ孔（図示せず）を有しており、これらの固定ネジ 49, 50が締め付けられることにより、第二の支持部材 47, 48に対して固定保持される。 The vertically extending bent portions 42b, 42c (43b, 43c) formed at both ends of the support members 42, 43 are respectively provided with fixing screws 49 inserted into these slots 47a, 48a. Screw holes (not shown) for receiving the fixing members 49, 50, and these fixing screws 49, 50 are fixed to the second support members 47, 48 by being tightened.

このような構成の内部マイク 17 a, 17 bの取付構造によれば、内部マイク 17 a, 17bはそれそれ取付金具 40， 41に固定保持されると共に、これらの取付金具 40, 41が支持部材 42, 43及び第二の支持部材 47, 48で固定保持されることにより、カメラ 15に対して固定保持されている。 According to the mounting structure of the internal microphones 17a and 17b having such a configuration, the internal microphones 17a and 17b are fixedly held by the mounting brackets 40 and 41, respectively. The mounting brackets 40 and 41 are fixedly held by the support members 42 and 43 and the second support members 47 and 48, thereby being fixedly held to the camera 15.

さらに、取付金具 40, 41は、それそれ支持部材 42， 43のスロット 42 a, 43 aに挿通された固定ネジ 45 , 46で支持部材 42， 43に対して螺着されており、また支持部材 42, 43は、それそれ第二の支持部材 47, 48のスロット 47 a， 48 aに揷通された固定ネジ 49 , 50で第二の支持部材 47 , 48に対して螺着されている。 Further, the mounting brackets 40, 41 are screwed to the support members 42, 43 by fixing screws 45, 46 inserted into the slots 42a, 43a of the support members 42, 43, respectively. 42, 43 are respectively screwed to the second support members 47, 48 with fixing screws 49, 50 passed through the slots 47a, 48a of the second support members 47, 48. .

その際、取付金具 40， 41そして内部マイク 17 a, 17 bは、固定ネジ 4 5, 46が緩められた状態にて、固定ネジ 45, 46がそれそれ支持部材 42， 43のスロット 42 a, 43 a内を摺動することにより横方向の位置が適宜に調整され、調整後に固定ネジ 45， 46が締め付けられる。 At that time, the mounting brackets 40, 41 and the internal microphones 17a, 17b are fixed to the fixing screws 45, 46 while the fixing screws 45, 46 are loosened. The position in the horizontal direction is adjusted as appropriate by sliding in 43a, and after adjustment, the fixing screws 45 and 46 are tightened.

また、上記支持部材 42, 43は、固定ネジ 49， 50が緩められた状態にて、固定ネジ 49, 50がそれそれ第二の支持部材 47， 48のスロット 47 a, 48 a内を摺動することにより前後方向の位置が適宜に調整され、調整後に固定ネジ 49， 50が締め付けられる。 The fixing screws 49 and 50 slide in the slots 47 a and 48 a of the second supporting members 47 and 48, respectively, with the fixing screws 49 and 50 loosened. By doing so, the position in the front-rear direction is appropriately adjusted, and after the adjustment, the fixing screws 49 and 50 are tightened.

これにより、内部マイク 17 a， 17bは、頭部 13の外装 14の内側にて横方向及び前後方向に移動調整されることにより、所望の位置に設定され得る。即ち、内部マイク 17a, 17 bは、それぞれ対応する外部マイク 16 a， 16b に対して適宜の距離の位置に調整されることによって、前述したノイズ除去回路 23, 24によるノイズキャンセルをより一層効果的に行なうことができる。上述した実施形態において、人型ロボット 10は、 4DOF (自由度）を有するように構成されているが、これに限らず、任意の動作を行なうように構成されたロボットに本発明によるロボット聴覚システムを組み込むことも可能である。また、上述した実施形態においては、本発明によるロボット聴覚システムを人型ロボット 10に組み込んだ場合について説明したが、これに限らず、犬型等の各種動物型ロボットや、その他の形式のロボットに組み込むことも可能であることは明らかである。 Thus, the internal microphones 17a and 17b can be set at desired positions by being moved and adjusted in the lateral and front-back directions inside the exterior 14 of the head 13. That is, the internal microphones 17a and 17b are adjusted to positions at appropriate distances with respect to the corresponding external microphones 16a and 16b, respectively, so that the noise cancellation by the noise removing circuits 23 and 24 described above is more effectively achieved. Can be done In the above-described embodiment, the humanoid robot 10 is configured to have 4 DOF (degree of freedom). However, the present invention is not limited to this. The robot according to the present invention may be a robot configured to perform an arbitrary operation. It is also possible to incorporate a hearing system. Further, in the above-described embodiment, the case where the robot hearing system according to the present invention is incorporated in the humanoid robot 10 has been described. However, the present invention is not limited to this. Obviously, it can be built into a robot.

さらに、上述した実施形態において、内部マイク 17は一対のマイク 17 a， 17 bから構成されているが、二対以上のマイクから構成されていてもよい。 Further, in the above-described embodiment, the internal microphone 17 is constituted by a pair of microphones 17a and 17b, but may be constituted by two or more pairs of microphones.

4 さらにまた、上述した実施形態においては、外部マイク 1 6は、一対のマイク 1 6 a , 1 6 bから構成されているが、二対以上のマイクから構成されていてもよい。 Four Furthermore, in the above-described embodiment, the external microphone 16 is configured by a pair of microphones 16a and 16b, but may be configured by two or more pairs of microphones.

また、上述した実施形態では、内部マイク 1 7 a， 1 7 bは、支持部材 4 2， 4 3のスロット 4 2 a , 4 3 aに沿って横方向に移動調整可能に構成されているが、これに限らず他の手段によって横方向に移動調整可能に構成されていてもよい。 In the above-described embodiment, the internal microphones 17a and 17b are configured to be movable and adjustable in the horizontal direction along the slots 42a and 43a of the support members 42 and 43. However, the present invention is not limited to this, and it may be configured so that the movement can be adjusted in the horizontal direction by other means.

取付金具 4 0 , 4 1は、駆動手段によって支持部材 4 2 , 4 3そしてカメラ 1 5に対して横方向に移動調整可能に構成されていてもよい。この場合、人型ロボット 1 0の頭部 1 3から外装 1 4を外すことなく、組み立てた状態にて内部マイク 1 7 a， 1 7 bの横方向の位置調整を行なうことができるので、ノイズキャンセルをより効果的に、そしてより正確に行なうことができる。 The mounting brackets 40 and 41 may be configured to be movable and adjustable in the lateral direction with respect to the support members 42 and 43 and the camera 15 by driving means. In this case, it is possible to adjust the horizontal position of the internal microphones 17a and 17b in the assembled state without removing the exterior 14 from the head 13 of the humanoid robot 10. Therefore, noise cancellation can be performed more effectively and more accurately.

さらに、上述の実施形態においては、内部マイク 1 7 a , 1 7 bは、横方向にあるいは横方向及び前後方向に移動可能に支持されているが、これに限らず、さらに上下方向に移動可能に支持されていてもよい。この場合、内部マイク 1 7 a ， 1 7 bは、公知の構成の三軸方向に移動調整可能な機構により支持される。以上述べたように、本発明によれば、外部マイクが主として外部の目標からの音を集音し、内部マイクが主としてロボット内部の駆動機構等のノイズ発生源からのノイズを集音する。その際、外部マイクが集音した音響信号には、ロボット内部のノィズ発生源からのノィズ信号が混入しているが、この混入したノィズ信号は、ノィズ除去部における演算処理によって内部マイクが集音したノィズ信号にてキャンセルされることにより著しく低減される。 Further, in the above-described embodiment, the internal microphones 17a and 17b are supported so as to be movable in the horizontal direction or in the horizontal direction and the front-back direction. It may be movably supported. In this case, the internal microphones 17a and 17b are supported by a mechanism having a known configuration that can be adjusted in three axes. As described above, according to the present invention, the external microphone mainly collects sound from an external target, and the internal microphone mainly collects noise from a noise generating source such as a drive mechanism inside the robot. At this time, the noise signal from the noise source inside the robot is mixed in the acoustic signal collected by the external microphone, and this mixed noise signal is collected by the internal microphone by arithmetic processing in the noise removing unit. The noise is significantly reduced by being canceled by the noise signal.

したがって、外部マイクからの音響信号は、ノイズ除去部での演算処理によつてロボット内部の駆動機構等のノィズ発生源からのノィズが容易に著しく低減されて、その S /N比が大幅に改善されるので、音源毎の音響データの分離をより一層良好に行なうことができる。 Therefore, in the acoustic signal from the external microphone, the noise from the noise source such as the drive mechanism inside the robot is easily and significantly reduced by the arithmetic processing in the noise elimination unit, and the S / N ratio is greatly increased. To separate sound data for each sound source. It can be performed better.

このようにして、本発明によれば、駆動機構等のロボット内部で発生するノィズに影響されることなく、外部の目標からの音を集音して能動知覚を行なうことができるようにした、極めて優れたロボット聴覚システムが提供される。 As described above, according to the present invention, it is possible to perform active perception by collecting sound from an external target without being affected by noise generated inside the robot such as a drive mechanism. An extremely good robot hearing system is provided.

さらに、上記内部マイクが外装の内側にて横方向に移動可能に配設されていることにより、ロボット内部のノィズ発生源からのノイズをより良好に集音するように内部マイクを横方向に移動させて、各内部マイクと対となるべき対応する外部マイクとの距離を適宜に調整することができ、外部マイクが集音する音響信号からより良好にノイズをキャンセルすることができる。 In addition, since the above-mentioned internal microphone is arranged to be able to move in the horizontal direction inside the exterior, the internal microphone is moved in the horizontal direction so that noise from the noise source inside the robot is better collected. Thus, the distance between each internal microphone and the corresponding external microphone to be paired can be appropriately adjusted, and noise can be better canceled from the acoustic signal collected by the external microphone.

なお、本発明は例示的な実施例について説明したものであり、本発明の要旨及び範囲を逸脱することなく、実施例での種々の変更、省略、追加が可能であることは当業者において自明である。したがって、本発明は実施例に限定されるものではなく、請求の範囲に記載された要素に基づいて規定される範囲及び及びその均等範囲を包含するものとして理解されなければならない。産業上の利用可能性 It should be noted that the present invention has been described with reference to exemplary embodiments, and it is understood by those skilled in the art that various changes, omissions, and additions can be made in the embodiments without departing from the spirit and scope of the present invention. It is obvious. Therefore, the present invention is not limited to the embodiments, but should be understood as including the range defined based on the elements described in the claims and the equivalents thereof. Industrial applicability

以上のように、本発明によるロボット聴覚システムは、ロボット、特に人型または動物型ロボットにおける能動聴覚のための聴覚システムとして極めて有用である。 As described above, the robot hearing system according to the present invention is extremely useful as a hearing system for active hearing in a robot, particularly in a humanoid or animal robot.

Claims

Scope of request For robots with noise sources inside,

A soundproof exterior covering at least a part of the robot,

At least one pair of external microphones provided outside the exterior and mainly collecting external sound;

At least one pair of internal microphones provided inside the exterior and collecting noise mainly from internal noise sources;

A noise removing unit that cancels a noise signal from an internal noise source from an acoustic signal from the external microphone based on the left and right signals from the external microphone and the internal microphone, respectively;

A pitch extraction unit that performs frequency analysis on each of the left and right audio signals from the noise removal unit and extracts audio data and pitch information relating to time, frequency, and power;

A left and right channel corresponding unit for directing sound from a harmonic structure with a pitch indicated by the sound data based on the left and right sound data and the pitch information extracted by the pitch extracting unit;

A sound source separation unit that separates the sound data into sound data for each sound source based on the sound direction information extracted by the left and right channel corresponding units. . In a human or animal robot with a noise source such as a drive mechanism inside,

A soundproof exterior covering at least the head of the robot,

At least one pair of external microphones provided at both ear positions corresponding to both ears on the outside of the exterior, and mainly collecting external sound;

Sound from the external microphone based on the signals from the external microphone and the internal microphone A noise removing unit for canceling a noise signal from an internal noise source from the signal,

A pitch extracting unit that performs frequency analysis from the left and right acoustic signals from the noise removing unit and extracts acoustic data and pitch information relating to time, frequency, and power;

A left and right channel corresponding unit for directing a sound from a harmonic structure with a pitch of the sound data based on the left and right sound data and the bitch information extracted by the pitch extracting unit;

A sound source separation unit configured to separate the sound data into sound data for each sound source based on the sound direction information extracted by the left and right channel corresponding units. Further, the robot is provided with a perception system such as visual and tactile senses, and the left and right channel corresponding units refer to information from these perception systems to direct sound. 3. The robot hearing system according to paragraph 1 or 2. 4. The robot hearing system according to claim 3, wherein the left and right channel corresponding unit outputs information relating to sound direction to the perception system. The robot hearing system according to any one of claims 1 to 4, wherein the internal microphone is disposed inside the exterior so as to be movable at least in a lateral direction. . The robot listening according to claim 5, wherein the internal microphone is slidably supported by a support member extending in a lateral direction inside the exterior. A member comprising a laterally extending slot, wherein the internal microphone comprises: 7. The robot hearing device according to claim 6, wherein the robot hearing device is fixed to a movable member slidably supported along a slot of the support member. 8. The Robo's soto hearing device according to claim 7, wherein the movable member is screwed to a support member by a fixing screw inserted into the slot. The robot hearing device according to claim 5, wherein the internal microphone is further disposed so as to be movable in the front-back direction. 0. The internal microphone is slidably supported on a first support member extending in the lateral direction inside the exterior, and the first support member moves forward and backward inside the exterior. 10. The robot hearing device according to claim 9, wherein the robot hearing device is slidably supported by a second support member that extends. 1. The robot according to any one of claims 5, 9, and 10, wherein the internal microphone is further disposed so as to be movable in a vertical direction.