[go: up one dir, main page]

WO2001057852A1 - Robot hearing system - Google Patents

Robot hearing system Download PDF

Info

Publication number
WO2001057852A1
WO2001057852A1 PCT/JP2001/000647 JP0100647W WO0157852A1 WO 2001057852 A1 WO2001057852 A1 WO 2001057852A1 JP 0100647 W JP0100647 W JP 0100647W WO 0157852 A1 WO0157852 A1 WO 0157852A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
noise
robot
internal
microphone
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2001/000647
Other languages
French (fr)
Japanese (ja)
Inventor
Kazuhiro Nakadai
Hiroshi Okuno
Hiroaki Kitano
Tatsuya Matsui
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Japan Science and Technology Agency
Original Assignee
Japan Science and Technology Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2000022677A external-priority patent/JP2001215989A/en
Priority claimed from JP2000022679A external-priority patent/JP2001215991A/en
Application filed by Japan Science and Technology Corp filed Critical Japan Science and Technology Corp
Publication of WO2001057852A1 publication Critical patent/WO2001057852A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J13/00Controls for manipulators
    • B25J13/003Controls for manipulators by means of an audio-responsive input
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02163Only one microphone
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02165Two microphones, one receiving mainly the noise signal and the other one mainly the speech signal

Definitions

  • the present invention relates to a hearing system in a robot, particularly in a humanoid or animal robot. Background art
  • Active perception is a system that supports a sensory device that is in charge of perception such as robot vision and robot hearing so that it follows the target to be perceived. is there.
  • the optical axis direction of the camera is held toward the target by at least the attitude control by the camera power drive mechanism, which is a perception device, and the target is automatically focused on, for example, focusing, zooming, zooming, etc. Is performed, so that the camera captures the target.
  • the camera power drive mechanism which is a perception device
  • the target is automatically focused on, for example, focusing, zooming, zooming, etc. Is performed, so that the camera captures the target.
  • the microphone which is a sensory device, maintains its directivity toward the target by controlling the attitude of the drive mechanism, and the microphone collects sound from the target.
  • the disadvantage of active hearing is that the microphone picks up the operating sound of the drive mechanism while the drive mechanism is operating, so that relatively loud noise is mixed in with the sound from the target. It may not be possible to recognize the sound from the sound.
  • An object of the present invention is to solve the above-mentioned drawbacks in the conventional technology, and to collect sound from an external target without being affected by noise generated inside the robot such as a driving mechanism, and to actively control the sound. It is an object of the present invention to provide an auditory system for a robot capable of performing perception, particularly a humanoid or animal robot. Disclosure of the invention
  • a robot hearing system includes a soundproof exterior covering at least a part of the robot, and a robot provided with a noise source inside, and provided outside the exterior. At least one pair of external microphones that mainly collect external sound, at least one pair of internal microphones that are provided inside the exterior and mainly collect noise from internal noise sources, and the external microphone and the internal microphone. Noise canceling unit that cancels the noise signal from the internal noise source from the audio signal from the external microphone based on the left and right signals from the microphone, respectively, and the frequency from the left and right audio signals from the noise removing unit.
  • a pitch extraction unit that performs analysis to extract acoustic data and pitch information relating to time, frequency, and power;
  • a left and right channel corresponding section for directing sound from a harmonic structure with a pitch indicated by the sound data based on the left and right sound data and pitch information extracted by the left and right channel sections;
  • a sound source separation unit that separates the sound data into sound data for each sound source based on the sound direction information.
  • a human- or animal-type robot auditory system provides a human- or animal-type robot having a noise generating source such as a drive mechanism therein, at least a robot head.
  • a noise removing unit for canceling a noise signal from a noise generating source; a pitch extracting unit for performing frequency analysis on left and right acoustic signals from the noise removing unit to extract acoustic data and pitch information relating to time, frequency and power.
  • a left / right channel corresponding unit for directing a sound from a harmonic structure with a pitch of the sound data based on the left / right sound data pitch information extracted by the pitch extracting unit;
  • a sound source separation unit that separates the sound data into sound data for each sound source based on the sound orientation information extracted by the sound source.
  • the external microphone mainly collects sound from an external target
  • the internal microphone mainly collects noise from a noise generating source such as a drive mechanism inside the robot.
  • the noise signal from the noise source inside the robot is mixed in the acoustic signal collected by the external microphone, and this mixed noise signal is collected by the internal microphone by arithmetic processing in the noise removal unit. It is significantly reduced by being canceled by the noise signal.
  • the acoustic signal is extracted by frequency analysis by the pitch extraction unit, and the sound direction is determined from the acoustic signal by the left and right channel corresponding units. Then, the sound source is separated into sound data for each sound source by a sound source separation unit.
  • the noise reduction from the noise source such as the drive mechanism inside the robot is easily and significantly reduced by the arithmetic processing in the noise elimination section, and the S / N ratio of the acoustic signal from the external microphone is greatly improved. Therefore, the sound data can be more properly separated for each sound source.
  • the robot further includes a perception system such as visual and tactile sensation, and the left and right channel corresponding units refer to information from these perception systems to determine the direction of sound. Do.
  • a perception system such as visual and tactile sensation
  • the left and right channel corresponding units refer to information from these perception systems to determine the direction of sound. Do.
  • the left and right channel corresponding sections can perform clearer orientation based on visual information on a target from the visual device.
  • the left and right channel corresponding sections output information relating to the direction of sound to the sensory system.
  • the information on the orientation of the target by hearing is output to the visual device, so that the visual device can perform more accurate orientation.
  • the robot hearing system according to the present invention is preferably characterized in that the internal microphone is disposed inside the exterior so as to be movable at least in a lateral direction.
  • the internal microphones are moved in the horizontal direction, the front-rear direction, and / or the vertical direction so as to more appropriately collect noise from a noise source inside the robot, and are paired with each internal microphone.
  • the distance to the corresponding external microphone can be appropriately adjusted, and the noise can be canceled in the most optimal state from the acoustic signal collected by the external microphone.
  • the internal microphone is slidably supported by a support member extending in a lateral direction inside the exterior.
  • the support member includes a slot extending in a lateral direction, and the internal microphone is fixed to a movable member slidably supported along the slot of the support member. Have been.
  • the movable member is screwed to the support member by a fixing screw inserted into the slot.
  • the internal microphone is further disposed so as to be movable in the front-rear direction.
  • the internal microphone is slidably supported on a first support member extending in the lateral direction inside the exterior, and the first support member is provided on the exterior. It is supported slidably with respect to a second support member extending in the front-rear direction on the inside.
  • the internal microphone is further disposed so as to be vertically movable.
  • FIG. 1 is a front view showing the appearance of a humanoid mouthboat incorporating the first embodiment of the robot hearing device according to the present invention.
  • FIG. 2 is a side view of the humanoid robot of FIG.
  • FIG. 3 is a schematic enlarged view showing the configuration of the head in the humanoid robot of FIG.
  • FIG. 4 is a block diagram showing an electrical configuration of the robot auditory system in the humanoid robot of FIG.
  • FIG. 5 is a schematic front view showing a mounting structure of an internal microphone in the humanoid robot of FIG.
  • FIG. 6 is an enlarged front view showing a main part of the mounting structure of the internal microphone in FIG.
  • FIG. 7 is an enlarged side view showing a main part of the mounting structure of the internal microphone in FIG.
  • FIG. 8 is an enlarged perspective view showing a main part of the mounting structure of the internal microphone in FIG.
  • FIG. 9 is a graph showing an example of acoustic data and pitch information extracted by the pitch extraction unit in the robot hearing system of FIG.
  • FIG. 10 is an enlarged perspective view showing a main part of a modified example of the mounting structure of the internal microphone in FIG. BEST MODE FOR CARRYING OUT THE INVENTION
  • FIGS. 1 to 4 show the overall configuration of an experimental humanoid robot provided with an embodiment of the robot hearing system according to the present invention.
  • the humanoid robot 10 is configured as a 4-DOF (degree of freedom) robot.
  • the humanoid robot 10 includes a base 11, a body 12 supported rotatably around one axis (vertical axis) on the base 11, and a three-axis direction on the body 12 ( (Vertical axis, horizontal axis in the horizontal direction, and horizontal axis in the front-rear direction).
  • the base 11 may be fixedly arranged, and may be operable as a leg of a robot. Further, the base 11 may be placed on a movable cart or the like.
  • the body portion 12 is rotatably supported about a vertical axis with respect to the base 11 as shown by an arrow A in FIG. 1, and is rotationally driven by a driving means (not shown). Further, the body 12 is entirely covered with a soundproof exterior.
  • the head 13 is supported on the body 12 via a connecting member 13a, and around the horizontal axis in the front-rear direction with respect to the connecting member 13a, as shown by an arrow B in FIG. It is swingably supported around a horizontal axis in the left-right direction as shown by arrow C in FIG.
  • the connecting member 13a is swingably supported relative to the body 12 around a horizontal axis in the front-rear direction, as indicated by an arrow D in FIG.
  • the head 13 is entirely covered with a soundproof exterior 14, as shown in FIG. Further, the head 13 has a camera 15 on the front side as a visual device in charge of robot vision, and a pair of external microphones 16 (16a, 16b) on both sides as a hearing device for robot hearing. .
  • the head 13 includes a pair of internal microphones 17 (17a, 17b) which are disposed inside the exterior 14 so as to be separated from each other.
  • the exterior 14 is made of a sound absorbing synthetic resin such as a urethane resin.
  • the exterior 14 is configured so that the inside of the head 13 is sound-insulated by almost completely sealing the inside of the head 13.
  • the exterior of the body 12 is similarly made of a sound-absorbing synthetic resin.
  • the camera 15 has a known configuration.
  • a commercially available camera having 3 DOFs (degrees of freedom) of so-called pan, tilt, and zoom can be used.
  • the external microphones 16 are attached to the sides of the head 13 so as to have directionality toward the front.
  • the left and right external microphones 16a and 16b on the left and right of the external microphone 16 are respectively attached to stepped portions 14a and 14b facing forward on both sides of the exterior 14, as shown in FIGS. It is mounted on the inside, and through the through holes provided in the steps 14a, 14b, collects the sound in front and is sound-insulated by appropriate means so as not to pick up the sound inside the exterior 14.
  • the external microphones 16a and 16b are configured as so-called binaural microphones.
  • the exterior 14 may be formed in the shape of a human outer ear.
  • the internal microphones 17 are respectively located inside the exterior 14 in the vicinity of the external microphones 16a and 16b described above, and in the illustrated case, above the both ends of the camera 15 as described later. Is arranged so as to be movable in the lateral direction.
  • the internal microphone 17 may be provided at an arbitrary position inside the exterior 14.
  • FIG. 4 shows an electrical configuration of a hearing system for sound processing including the external microphone 16 and the internal microphone 17.
  • a hearing system 20 includes amplifiers 21 a, 2 lb, 21 c, and 2 Id that amplify acoustic signals from the external microphones 16 a and 16 b and the internal microphones 17 a and 17 b, respectively.
  • AD converters 22a, 22b, 22c, 22 that convert these amplified signals into digital audio signals SOL, SOR, SIL, SIR by AD conversion, and left and right noise removal circuits to which these digital audio signals are input 23, 24, pitch extraction units 25, 26 to which digital audio signals SR, SL from noise elimination circuits 23, 24 are input, and left and right channel corresponding units to which audio data from pitch extraction units 25, 26 are input 27, and a sound source separation unit 28 to which data from the left and right channel corresponding units 27 is input.
  • the AD converters 22a to 22d are configured to take out a signal sampled at 48 kHz with a quantization bit number of 16 or 24, for example. Then, the digital audio signal SOL from the left external microphone 16 a and the digital audio signal SIL from the left internal microphone 1 a are input to the noise removal circuit 23. The digital audio signal SOR from the external microphone 16b on the right and the left The digital acoustic signal SIR from the internal microphone 17 b is input to the noise elimination circuit 24.
  • noise elimination circuits 23 and 24 have the same configuration, and are configured to cancel noise from the acoustic signal from the external microphone 16 by the noise signal from the internal microphone 17.
  • the noise elimination circuit 23 converts the digital sound signal SOL from the external microphone 16a into a noise signal SIL from the noise source inside the robot collected by the internal microphone 17a.
  • the noise from the external microphone 16a is reduced by appropriate noise reduction processing such as subtracting the acoustic signal SIL from the internal microphone 16a from the digital acoustic signal SOL from the external microphone 16a.
  • the left acoustic signal SL is generated by removing noise from noise sources such as each drive mechanism inside the robot mixed in the signal S 0 L.
  • the noise removal circuit 24 is based on the digital audio signal SOR from the external microphone 16 b and the noise signal SIR from the noise source inside the robot collected by the internal microphone 17 b,
  • the noise from the external microphone 16b is reduced by appropriate noise cancellation processing such as subtracting the acoustic signal SIR from the internal microphone 1 ⁇ b from the digital sound signal SOR from the external microphone 16b.
  • the noise from the noise source such as each drive mechanism inside the robot mixed into the signal SIR is removed, and the right acoustic signal SR is generated.
  • the pitch extraction units 25 and 26 have the same configuration.
  • the left and right sound signals SL and SR from the noise elimination circuits 23 and 24 are subjected to frequency analysis, and the three axes of time, frequency and power are analyzed. It is configured to retrieve the acoustic data and pitch information. That is, the pitch extraction unit 25 performs frequency analysis on the left acoustic signal SL from the noise elimination circuit 23 to obtain three axes of time, frequency and power from the two-axis acoustic signal S composed of time and power. Take out the left acoustic data DL called a spectrogram.
  • the pitch extraction unit 26 performs frequency analysis on the right acoustic signal SR from the noise elimination circuit 24 to obtain the time, frequency, and power from the two-axis acoustic signal SR composed of time and power.
  • Three-axis right sound data DR, pitch information Take out.
  • the above-mentioned frequency analysis is performed by performing FFT (Fast Fourier Transform) with a window length of, for example, 20 ms and shifting by 7.5 ms.
  • FFT Fast Fourier Transform
  • Such frequency analysis is not limited to FFT and may be performed by various general methods.
  • each sound in voice and music is shown as a series of peaks in the time direction on a spectrogram, and generally has a harmonic structure. And has a peak regularly at an integer multiple of the frequency value.
  • the left and right channel corresponding units 27 are used to determine the phase difference, time difference, etc. of the pitch information of the harmonic structure indicated by the peak in the sound data DL and DR from the left and right pitch extraction units 25 and 26, respectively. Based on this, the direction of the sound is determined by associating the pitches derived from the same sound with the left and right channels.
  • the left and right channel corresponding units 27 provide information on the target from other perceptual systems 30 (not shown) provided in the robot 10 in addition to the auditory system 20, specifically, for example, the position, direction, shape,
  • the haptic system By inputting information on the presence / absence of movement and information on the target flexibility, presence / absence of vibration, tactile sensation, etc. by the haptic system, the direction of the sound from the sound source described above can be more clearly determined It is configured as follows.
  • the sound source separation unit 28 receives the direction information and the sound data DL and DR from the left and right channel corresponding units 27 according to a known configuration. Thus, the sound source is identified from the sound data DL and DR based on the information of the direction, and the sound data is separated for each sound source.
  • the internal microphone 17 is specifically attached to the camera 15 as shown in FIGS.
  • each of the internal microphones 17a and 17b is fixed to mounting brackets 40 and 41 as movable members, respectively.
  • These mounting brackets 40, 41 are formed in a substantially U-shape with the upper part opened.
  • the upper ends of the inner microphones 17a and 17b are fixedly held by being brought close to each other with the tightening screws 40a and 41a.
  • the mounting brackets 40 and 41 are respectively slidably mounted on the support members 42 and 43 in a lateral direction as described later.
  • the support members 42 and 43 are fixed to the upper surface of the camera 15 by brackets 44, respectively.
  • the support members 42 and 43 are formed integrally with the bracket 44, and the bracket 44 is screwed to a mounting base 15a on which the camera 15 is mounted.
  • each support member 42, 43 is provided with a slot 42a, 43a extending in the lateral direction, respectively.
  • the mounting brackets 40 and 41 have screw holes (not shown) for receiving the fixing screws 45 and 46 inserted into the slots 42a and 43a, respectively. Are fixed and held on the support members 42 and 43 by being tightened.
  • the humanoid robot 10 is configured as described above, and the sound from the target to be collected by the external microphones 16a and 16b is collected as follows, and the noise is canceled. Is perceived by the sound source.
  • external microphones 16a, 16b mainly collect external sounds from the target and output analog sound signals.
  • the external microphones 16a and 16b also collect noise from the inside of the mouth bot, but the exterior 14 itself seals the inside of the head 13 and the external microphones 16a and 16b
  • the noise that is mixed into the analog sound signal is suppressed to a relatively low level because the sound is shielded from the inside.
  • the internal microphones 17a and 17b mainly collect noise from the inside of the robot, for example, noise from noise generating sources such as the operating noise of each drive mechanism and the operating noise of the cooling fan.
  • the internal microphones 17a and 17b also collect sound from outside, but since the exterior 14 hermetically seals the inside, the level of mixed noise is kept relatively low.
  • the noise elimination circuits 23 and 24 perform arithmetic processing such as subtracting the audio signals SIL and SIR from the internal microphones 17a and 17b from the audio signals SOL and SOR from the external microphones 16a and 16b, respectively.
  • the noise signal from the noise source inside the robot is removed from the sound signals SOL and SOR from the external microphones 16a and 16b.
  • the noise elimination circuits 23 and 24 respectively output the true acoustic signals SL and SR from which noise has been eliminated to the outside.
  • the pitch extraction units 25 and 26 generate sound data DL and DR by frequency analysis.
  • the acoustic data is shown as, for example, a spectrogram as shown in FIG.
  • the horizontal axis represents time t (seconds)
  • the vertical axis represents frequency (Hz)
  • the density represents power, for example, frequency components intermittently connected from time t1 to time t2.
  • this frequency component has a series of peaks of 1000 Hz, 2000 Hz, 3000 Hz, and so on. Since these peaks are each an integral multiple of 1000 Hz, Typically, these frequency components can be understood to represent only one sound, or each one. In this case, since these frequency components start at the same time and end at the same time, it is determined that these frequency components indicate the harmonic structure of only one sound. Therefore, the pitch extraction units 25 and 26 set the fundamental frequency 100 OHz, which is the lowest frequency, as the pitch.
  • the pitch extraction units 25 and 26 extract the pitches of all the sounds included in the sound signals SL and SR, and extract the pitches of the sounds corresponding to the pitches,
  • the sound data DL, DR, and pitch information are output to the left and right channel corresponding unit 27.
  • the left and right channel corresponding unit 27 determines the direction of the sound for each sound source based on the sound data DL and DR.
  • the left and right channel correspondence unit 27 compares the harmonic structures of the left and right channels based on the sound data DL and DR extracted by the pitch extraction units 25 and 26, and associates the closest pitch .
  • it is preferable to make more accurate correspondence by comparing not only the pitch of the left and right channels one-to-one but also a plurality of pitches of one channel with one pitch of the other channel. Les ,.
  • the left and right channel corresponding section 27 can easily calculate and direct the direction of the sound source by comparing the phases of the pitches associated with each other.
  • the sound source separation unit 28 extracts the sound data for each sound source from the sound data DL and DR based on the sound direction information from the left and right channel corresponding units 27, and outputs the sound data for each sound source. To separate. In this way, the hearing system 20 can perform sound recognition by separating sounds for each sound source and perform active hearing.
  • the internal microphones 17a, 17b are fixedly held by the mounting brackets 40, 41, respectively, and the mounting brackets 40, 41 are fixedly held by the support members 42, 43, so that the internal microphones 17a, 17b are fixed to the camera 15. Fixedly held.
  • mounting brackets 40, 41 are screwed to the support members 42, 43 by fixing screws 45, 46 inserted into the slots 42a, 43a of the support members 42, 43, respectively.
  • the mounting brackets 40, 41 and the internal microphones 17a, 17b are fixed to the mounting members 45, 46 with the fixing screws 45, 46 loosened, and the fixing screws 45, 46 are inserted into the slots 42a, 43a of the support members 42, 43, respectively.
  • the fixing screws 45 and 46 are tightened.
  • the internal microphones 17a and 17b can be set at desired positions by being moved and adjusted in the lateral direction inside the exterior 14 of the head 13. That is, the internal microphones 17a and 17b are adjusted to positions at appropriate distances with respect to the corresponding external microphones 16a and 16b, so that the noise generated by the above-described noise removing circuits 23 and 24 is adjusted. Cancellation can be performed most effectively.
  • the noise reduction circuits 23 and 24 allow the external microphones 16 and 17 to be appropriately adjusted in the horizontal direction while the internal microphones 17 a and 17 b are appropriately adjusted.
  • a, 16 b Sound signal from SOL, SOR, internal By performing noise cancellation based on the acoustic signals SIL and SIR from the microphones 17a and 17b, the S / N ratio can be increased while the directivity of the external microphones 16a and 16b is directed to the target by each drive mechanism. A good acoustic signal can be obtained, and the sound from each sound source can be separated to perform more accurate speech recognition.
  • the acoustic recognition of the target can be performed while the directionality of the directivity of the external microphones 16a and 16b always follows the target by each drive mechanism. .
  • the left and right channel corresponding unit 27 refers to, for example, the visual target direction information from the visual system as the other perceptual system 30 to determine the direction of the sound, thereby providing more accurate sound direction. Can do it.
  • the left and right channel corresponding units 27 may output sound direction information to the visual system.
  • FIG. 10 shows a modification of the mounting structure of the internal microphones 17a and 17b described above.
  • supporting members 42 and 43 for supporting the internal microphones 17a and 17b movably in the lateral direction are further fixed to an upper surface of the camera 15 by appropriate means (not shown). It is attached to 47 and 48 so that it can slide in the front-rear direction.
  • the second support members 47, 48 have slots 47a, 48a extending in the front-rear direction, respectively.
  • the vertically extending bent portions 42b, 42c (43b, 43c) formed at both ends of the support members 42, 43 are respectively provided with fixing screws 49 inserted into these slots 47a, 48a. Screw holes (not shown) for receiving the fixing members 49, 50, and these fixing screws 49, 50 are fixed to the second support members 47, 48 by being tightened.
  • the internal microphones 17a and 17b are fixedly held by the mounting brackets 40 and 41, respectively.
  • the mounting brackets 40 and 41 are fixedly held by the support members 42 and 43 and the second support members 47 and 48, thereby being fixedly held to the camera 15.
  • mounting brackets 40, 41 are screwed to the support members 42, 43 by fixing screws 45, 46 inserted into the slots 42a, 43a of the support members 42, 43, respectively.
  • 42, 43 are respectively screwed to the second support members 47, 48 with fixing screws 49, 50 passed through the slots 47a, 48a of the second support members 47, 48. .
  • the mounting brackets 40, 41 and the internal microphones 17a, 17b are fixed to the fixing screws 45, 46 while the fixing screws 45, 46 are loosened.
  • the position in the horizontal direction is adjusted as appropriate by sliding in 43a, and after adjustment, the fixing screws 45 and 46 are tightened.
  • the fixing screws 49 and 50 slide in the slots 47 a and 48 a of the second supporting members 47 and 48, respectively, with the fixing screws 49 and 50 loosened. By doing so, the position in the front-rear direction is appropriately adjusted, and after the adjustment, the fixing screws 49 and 50 are tightened.
  • the internal microphones 17a and 17b can be set at desired positions by being moved and adjusted in the lateral and front-back directions inside the exterior 14 of the head 13. That is, the internal microphones 17a and 17b are adjusted to positions at appropriate distances with respect to the corresponding external microphones 16a and 16b, respectively, so that the noise cancellation by the noise removing circuits 23 and 24 described above is more effectively achieved.
  • the humanoid robot 10 is configured to have 4 DOF (degree of freedom).
  • the robot according to the present invention may be a robot configured to perform an arbitrary operation. It is also possible to incorporate a hearing system. Further, in the above-described embodiment, the case where the robot hearing system according to the present invention is incorporated in the humanoid robot 10 has been described. However, the present invention is not limited to this. Obviously, it can be built into a robot.
  • the internal microphone 17 is constituted by a pair of microphones 17a and 17b, but may be constituted by two or more pairs of microphones.
  • the external microphone 16 is configured by a pair of microphones 16a and 16b, but may be configured by two or more pairs of microphones.
  • the internal microphones 17a and 17b are configured to be movable and adjustable in the horizontal direction along the slots 42a and 43a of the support members 42 and 43.
  • the present invention is not limited to this, and it may be configured so that the movement can be adjusted in the horizontal direction by other means.
  • the mounting brackets 40 and 41 may be configured to be movable and adjustable in the lateral direction with respect to the support members 42 and 43 and the camera 15 by driving means. In this case, it is possible to adjust the horizontal position of the internal microphones 17a and 17b in the assembled state without removing the exterior 14 from the head 13 of the humanoid robot 10. Therefore, noise cancellation can be performed more effectively and more accurately.
  • the internal microphones 17a and 17b are supported so as to be movable in the horizontal direction or in the horizontal direction and the front-back direction. It may be movably supported. In this case, the internal microphones 17a and 17b are supported by a mechanism having a known configuration that can be adjusted in three axes.
  • the external microphone mainly collects sound from an external target
  • the internal microphone mainly collects noise from a noise generating source such as a drive mechanism inside the robot.
  • the noise signal from the noise source inside the robot is mixed in the acoustic signal collected by the external microphone, and this mixed noise signal is collected by the internal microphone by arithmetic processing in the noise removing unit. The noise is significantly reduced by being canceled by the noise signal.
  • the acoustic signal is extracted by frequency analysis by the pitch extraction unit, and the sound direction is determined from the acoustic signal by the left and right channel corresponding units. Then, the sound source is separated into sound data for each sound source by a sound source separation unit.
  • the noise from the noise source such as the drive mechanism inside the robot is easily and significantly reduced by the arithmetic processing in the noise elimination unit, and the S / N ratio is greatly increased.
  • the S / N ratio is greatly increased.
  • the internal microphone is arranged to be able to move in the horizontal direction inside the exterior, the internal microphone is moved in the horizontal direction so that noise from the noise source inside the robot is better collected.
  • the distance between each internal microphone and the corresponding external microphone to be paired can be appropriately adjusted, and noise can be better canceled from the acoustic signal collected by the external microphone.
  • the robot hearing system according to the present invention is extremely useful as a hearing system for active hearing in a robot, particularly in a humanoid or animal robot.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Robotics (AREA)
  • Mechanical Engineering (AREA)
  • Manipulator (AREA)

Abstract

A hearing system (10) of a robot, especially the one of human or animal type, capable of collecting sound from an external target and actively hearing it without being influenced by the noise produced by, e.g. a drive mechanism in the robot. The system comprises a pair of external microphones (16) provided at the ear parts outside the sound-insulating shell (13), a pair of internal microphones (17) for collecting noise provided inside the shell, noise removers (23, 24) for removing noise signals from an inside noise source from the acoustic signals from the external microphones, pitch extracting sections (25, 26) for extracting acoustic data (DL, DR) about the time, frequency and power from the left and right acoustic signals from the noise removers by frequency analysis, a left/right channel relating section (27) for determining the direction from which sound comes on the basis of the harmonic structure involving the pitches of the left/right acoustic data extracted by the pitch extracting section, and a sound source separating section (28) for separating acoustic data into acoustic data sets of respective sound sources according to the information on the sound direction determination.

Description

明 細 書 ロ ボ ッ ト 聴覚シス テ ム 技術分野  Description Robot Auditory system Technical field

本発明はロボット、 特に人型または動物型ロボッ卜における聴覚システムに関 する。 背景技術  The present invention relates to a hearing system in a robot, particularly in a humanoid or animal robot. Background art

近年、 このような人型または動物型ロボットにおいては、 視覚, 聴覚の能動知 覚が注目されてきている。  In recent years, attention has been paid to active perception of sight and hearing in such humanoid or animal-like robots.

能動知覚とは、 ロボット視覚やロボット聴覚等の知覚を担当する知覚装置を、 知覚すべき目標に追従するように、 これらの知覚装置を支持する、 例えば頭部を 駆動機構により姿勢制御するものである。  Active perception is a system that supports a sensory device that is in charge of perception such as robot vision and robot hearing so that it follows the target to be perceived. is there.

ここで、 能動視覚に関しては、 少なくとも知覚装置であるカメラ力 駆動機構 による姿勢制御によってカメラの光軸方向が目標に向かって保持され、 目標に対 して自動的にフォーカシングゃズ一ムィン, ズームァゥト等が行なわれることに より、 カメラが目標を撮像するようになっている。 このような能動視覚について は、 種々の研究が行なわれている。  Here, regarding active vision, the optical axis direction of the camera is held toward the target by at least the attitude control by the camera power drive mechanism, which is a perception device, and the target is automatically focused on, for example, focusing, zooming, zooming, etc. Is performed, so that the camera captures the target. Various studies have been conducted on such active vision.

これに対して、 能動聴覚に関しては、 少なくとも知覚装置であるマイクが、 駆 動機構による姿勢制御によって、 その指向性が目標に向かって保持され、 目標か らの音がマイクによって集音される。 この際、 能動聴覚の不利な点として、 駆動 機構が作動している間はマイクが駆動機構の作動音を拾ってしまうため、 目標か らの音に比較的大きなノィズが混入してしまい、 目標からの音を認識できなくな つてしまうことがある。  On the other hand, with regard to active hearing, at least the microphone, which is a sensory device, maintains its directivity toward the target by controlling the attitude of the drive mechanism, and the microphone collects sound from the target. At this time, the disadvantage of active hearing is that the microphone picks up the operating sound of the drive mechanism while the drive mechanism is operating, so that relatively loud noise is mixed in with the sound from the target. It may not be possible to recognize the sound from the sound.

しかしながら、 駆動機構を停止させた状態での聴覚研究では、 特に目標が移動 しているような場合、 目標の移動に追従しながらの所謂能動聴覚を行なうことが 困難であった。  However, in auditory research with the drive mechanism stopped, it was difficult to perform so-called active hearing while following the movement of the target, especially when the target is moving.

さらに、 上述した駆動機構だけでなく、 ロボット内部で発生する各種動作音及 び定常的に発生するノイズも聴覚装置としてのマイクが集音してしまうので、 同 様に完全な能動聴覚を得ることが困難であった。 Furthermore, not only the above-mentioned drive mechanism, but also various operation sounds generated inside the robot Similarly, it is also difficult to obtain perfect active hearing, because microphones as hearing devices also collect noise that is constantly generated.

本発明の目的は、 このような従来の技術における欠点を解決するものであり、 駆動機構等のロボット内部で発生するノイズに影響されることなく、 外部の目標 からの音を集音して能動知覚を行なうことができるようにしたロボット、 特に人 型または動物型ロボットの聴覚システムを提供することにある。 発明の開示  An object of the present invention is to solve the above-mentioned drawbacks in the conventional technology, and to collect sound from an external target without being affected by noise generated inside the robot such as a driving mechanism, and to actively control the sound. It is an object of the present invention to provide an auditory system for a robot capable of performing perception, particularly a humanoid or animal robot. Disclosure of the invention

この目的を達成するために、 本発明によるロボット聴覚システムは、 内部にノ ィズ発生源を備えたロボッ卜において、 少なくともロボッ卜の一部を覆う防音性 の外装と、 上記外装の外側に設けられ主として外部の音を集音する少なくとも一 対の外部マイクと、 上記外装の内側に設けられ主として内部のノィズ発生源から のノイズを集音する少なくとも一対の内部マイクと、 上記外部マイク及び内部マ ィクからの左右の信号に基づいてそれぞれ外部マイクからの音響信号から内部の ノィズ発生源からのノィズ信号をキャンセルするノィズ除去部と、 上記ノィズ除 去部からの左右の音響信号からそれそれ周波数解析を行なって、 時間, 周波数及 びパワーに関する音響データ及びピッチ情報を取り出すピッチ抽出部と、 上記ビ ツチ抽出部で取り出された左右の音響データ及びピッチ情報に基づいて当該音響 デ一夕が示すピッチを伴う調波構造から音の方向付けを行なう左右チャンネル対 応部と、 上記左右チャンネル対応部で取り出された音の方向付け情報に基づいて 上記音響データから音源毎の音響データに分離する音源分離部と、 を含んでいる ことを特徴とする。  In order to achieve this object, a robot hearing system according to the present invention includes a soundproof exterior covering at least a part of the robot, and a robot provided with a noise source inside, and provided outside the exterior. At least one pair of external microphones that mainly collect external sound, at least one pair of internal microphones that are provided inside the exterior and mainly collect noise from internal noise sources, and the external microphone and the internal microphone. Noise canceling unit that cancels the noise signal from the internal noise source from the audio signal from the external microphone based on the left and right signals from the microphone, respectively, and the frequency from the left and right audio signals from the noise removing unit. A pitch extraction unit that performs analysis to extract acoustic data and pitch information relating to time, frequency, and power; A left and right channel corresponding section for directing sound from a harmonic structure with a pitch indicated by the sound data based on the left and right sound data and pitch information extracted by the left and right channel sections; And a sound source separation unit that separates the sound data into sound data for each sound source based on the sound direction information.

また、 上記目的を達成するために、 本発明による人型または動物型ロボットの 聴覚システムは、 内部に駆動機構等のノイズ発生源を備えた人型または動物型の ロボットにおいて、 少なくともロボットの頭部を覆う防音性の外装と、 上記外装 の外側にて両側の耳に対応する両耳位置に設けられ主として外部の音を集音する 少なくとも一対の外部マイクと、 上記外装の内側に設けられた主として内部のノ ィズ発生源からのノイズを集音する少なくとも一対の内部マイクと、 上記外部マ ィク及び内部マイクからの信号に基づいて外部マイクからの音響信号から内部の ノィズ発生源からのノィズ信号をキャンセルするノィズ除去部と、 上記ノィズ除 去部からの左右の音響信号からそれそれ周波数解析を行なって時間, 周波数及び パワーに関する音響データ及びピッチ情報を取り出すピッチ抽出部と、 上記ピッ チ抽出部で取り出された左右の音響データピッチ情報に基づいて当該音響デ一夕 のピッチを伴う調波構造から音の方向付けを行なう左右チャンネル対応部と、 上 記左右チャンネル対応部で取り出された音の方向付け情報に基づいて上記音響デ 一夕から音源毎の音響データに分離する音源分離部と、 を含んでいることを特徴 としている。 In order to achieve the above object, a human- or animal-type robot auditory system according to the present invention provides a human- or animal-type robot having a noise generating source such as a drive mechanism therein, at least a robot head. A soundproof exterior covering the outside, at least one pair of external microphones provided at both ear positions corresponding to both ears outside the exterior and mainly collecting external sounds, and mainly provided inside the exterior. At least a pair of internal microphones that collect noise from an internal noise source; and an internal microphone based on signals from the external microphone and the internal microphone based on signals from the external microphone and the internal microphone. A noise removing unit for canceling a noise signal from a noise generating source; a pitch extracting unit for performing frequency analysis on left and right acoustic signals from the noise removing unit to extract acoustic data and pitch information relating to time, frequency and power. A left / right channel corresponding unit for directing a sound from a harmonic structure with a pitch of the sound data based on the left / right sound data pitch information extracted by the pitch extracting unit; And a sound source separation unit that separates the sound data into sound data for each sound source based on the sound orientation information extracted by the sound source.

上記構成によれば、 外部マイクが主として外部の目標からの音を集音し、 内部 マイクが主としてロボット内部の駆動機構等のノィズ発生源からのノイズを集音 する。 その際、 外部マイクが集音した音響信号には、 ロボット内部のノイズ発生 源からのノイズ信号が混入しているが、 この混入したノイズ信号は、 ノイズ除去 部における演算処理によって内部マイクが集音したノィズ信号によりキャンセル されることにより、 著しく低減される。  According to the above configuration, the external microphone mainly collects sound from an external target, and the internal microphone mainly collects noise from a noise generating source such as a drive mechanism inside the robot. At that time, the noise signal from the noise source inside the robot is mixed in the acoustic signal collected by the external microphone, and this mixed noise signal is collected by the internal microphone by arithmetic processing in the noise removal unit. It is significantly reduced by being canceled by the noise signal.

そして、 ノイズがキャンセルされた音響信号から、 ピッチ抽出部により周波数 解析によって音響信号を取り出して、 この音響信号から左右チャンネル対応部に より音の方向付けを行なった後、 この音の方向付けに基づいて、 音源分離部によ つて各音源毎の音響データに分離する。  Then, from the acoustic signal from which the noise has been canceled, the acoustic signal is extracted by frequency analysis by the pitch extraction unit, and the sound direction is determined from the acoustic signal by the left and right channel corresponding units. Then, the sound source is separated into sound data for each sound source by a sound source separation unit.

したがって、 ノイズ除去部での演算処理によって、 ロボット内部の駆動機構等 のノイズ発生源からのノイズが容易に著しく低減されることにより、 外部マイク からの音響信号の S /N比が大幅に改善されるので、 音源毎の音響デ一夕の分離 をより一層良好に行なうことができる。  Therefore, the noise reduction from the noise source such as the drive mechanism inside the robot is easily and significantly reduced by the arithmetic processing in the noise elimination section, and the S / N ratio of the acoustic signal from the external microphone is greatly improved. Therefore, the sound data can be more properly separated for each sound source.

本発明によるロボット聴覚システムは、 好ましくは、 さらに、 ロボットが視覚 , 触覚等の知覚システムを備えており、 上記左右チャンネル対応部がこれらの知 覚システムからの情報を参照して音の方向付けを行なう。  In the robot hearing system according to the present invention, preferably, the robot further includes a perception system such as visual and tactile sensation, and the left and right channel corresponding units refer to information from these perception systems to determine the direction of sound. Do.

この構成によれば、 例えば視覚装置からの目標に関する視覚情報に基づいて左 右チャンネル対応部がより明確な方向付けを行なうことができる。  According to this configuration, for example, the left and right channel corresponding sections can perform clearer orientation based on visual information on a target from the visual device.

本発明によるロボット聴覚システムは、 好ましくは、 上記左右チャンネル対応 部が上記知覚システムに対して音の方向付けに関する情報を出力する。 この構成によれば、 例えば視覚装置に対して目標に関する聴覚による方向付け の情報を出力するので、 視覚装置がより一層正確な方向付けを行なうことができ る。 In the robot auditory system according to the present invention, preferably, the left and right channel corresponding sections output information relating to the direction of sound to the sensory system. According to this configuration, for example, the information on the orientation of the target by hearing is output to the visual device, so that the visual device can perform more accurate orientation.

さらに、 本発明によるロボッ ト聴覚システムは、 好ましくは、 上記内部マイク が、 外装の内側において少なくとも横方向に移動可能に配設されていることを特 徴とする。  Furthermore, the robot hearing system according to the present invention is preferably characterized in that the internal microphone is disposed inside the exterior so as to be movable at least in a lateral direction.

この構成によれば、 ロボッ卜内部のノイズ発生源からのノイズをより良好に集 音するように内部マイクを横方向、 前後方向及び/又は上下方向に移動させて、 各内部マイクと対となるべき対応する外部マイクとの距離を適宜に調整すること ができ、 外部マイクが集音する音響信号から最も最適な状態でノイズをキャンセ ルすることができる。  According to this configuration, the internal microphones are moved in the horizontal direction, the front-rear direction, and / or the vertical direction so as to more appropriately collect noise from a noise source inside the robot, and are paired with each internal microphone. The distance to the corresponding external microphone can be appropriately adjusted, and the noise can be canceled in the most optimal state from the acoustic signal collected by the external microphone.

本発明によるロボット聴覚システムは、 好ましくは、 上記内部マイクが、 外装 の内側にて横方向に延びる支持部材に対して摺動可能に支持されている。  In the robot hearing system according to the present invention, preferably, the internal microphone is slidably supported by a support member extending in a lateral direction inside the exterior.

本発明によるロボット聴覚システムは、 好ましくは、 上記支持部材が横方向に 延びるスロットを備えており、 上記内部マイクが上記支持部材のスロッ卜に沿つ て摺動可能に支持された可動部材に固定されている。  In the robot hearing system according to the present invention, preferably, the support member includes a slot extending in a lateral direction, and the internal microphone is fixed to a movable member slidably supported along the slot of the support member. Have been.

本発明によるロボット聴覚システムは、 好ましくは、 上記可動部材が、 スロッ トに挿通された固定ネジにより支持部材に対して螺着される。  In the robot hearing system according to the present invention, preferably, the movable member is screwed to the support member by a fixing screw inserted into the slot.

本発明によるロボット聴覚システムは、 好ましくは、 上記内部マイクがさらに 前後方向に移動可能に配設されている。  In the robot hearing system according to the present invention, preferably, the internal microphone is further disposed so as to be movable in the front-rear direction.

本発明によるロボット聴覚システムは、 好ましくは、 上記内部マイクが外装の 内側にて横方向に延びる第一の支持部材に対して摺動可能に支持されていて、 こ の第一の支持部材が外装の内側にて前後方向に延びる第二の支持部材に対して摺 動可能に支持されている。  In the robot hearing system according to the present invention, preferably, the internal microphone is slidably supported on a first support member extending in the lateral direction inside the exterior, and the first support member is provided on the exterior. It is supported slidably with respect to a second support member extending in the front-rear direction on the inside.

本発明によるロボット聴覚システムは、 好ましくは、 上記内部マイクがさらに 上下方向に移動可能に配設されている。 図面の簡単な説明  In the robot hearing system according to the present invention, preferably, the internal microphone is further disposed so as to be vertically movable. BRIEF DESCRIPTION OF THE FIGURES

本発明は、 以下の詳細な説明及び本発明の実施例を示す添付図面に基づいて、 より良く理解されるものとなろう。 なお、 添付図面に示す実施例は、 本発明を特 定することを意図するものではなく、 単に説明及び理解を容易にするためのもの である。 The present invention is based on the following detailed description and the accompanying drawings, which illustrate embodiments of the present invention. It will be better understood. The embodiments shown in the attached drawings are not intended to specify the present invention, but merely to facilitate explanation and understanding.

図中、  In the figure,

図 1は、 本発明によるロボット聴覚装置の第一の実施形態を組み込んだ人型口 ボッ卜の外観を示す正面図である。  FIG. 1 is a front view showing the appearance of a humanoid mouthboat incorporating the first embodiment of the robot hearing device according to the present invention.

図 2は、 図 1の人型ロボットの側面図である。  FIG. 2 is a side view of the humanoid robot of FIG.

図 3は、 図 1の人型ロボッ卜における頭部の構成を示す概略拡大図である。 図 4は、 図 1の人型ロボッ卜におけるロボット聴覚システムの電気的構成を示 すブロック図である。  FIG. 3 is a schematic enlarged view showing the configuration of the head in the humanoid robot of FIG. FIG. 4 is a block diagram showing an electrical configuration of the robot auditory system in the humanoid robot of FIG.

図 5は、 図 1の人型ロボッ卜における内部マイクの取付構造を示す概略正面図 である。  FIG. 5 is a schematic front view showing a mounting structure of an internal microphone in the humanoid robot of FIG.

図 6は、 図 5の内部マイクの取付構造の要部を示す拡大正面図である。  FIG. 6 is an enlarged front view showing a main part of the mounting structure of the internal microphone in FIG.

図 7は、 図 5の内部マイクの取付構造の要部を示す拡大側面図である。  FIG. 7 is an enlarged side view showing a main part of the mounting structure of the internal microphone in FIG.

図 8は、 図 5の内部マイクの取付構造の要部を示す拡大斜視図である。  FIG. 8 is an enlarged perspective view showing a main part of the mounting structure of the internal microphone in FIG.

図 9は、 図 4のロボット聴覚システムにおけるピッチ抽出部により取り出され た音響デー夕、 ピッチ情報の一例を示すグラフである。  FIG. 9 is a graph showing an example of acoustic data and pitch information extracted by the pitch extraction unit in the robot hearing system of FIG.

図 1 0は、 図 5の内部マイクの取付構造の変形例の要部を示す拡大斜視図であ る。 発明を実施するための最良の形態  FIG. 10 is an enlarged perspective view showing a main part of a modified example of the mounting structure of the internal microphone in FIG. BEST MODE FOR CARRYING OUT THE INVENTION

以下、 本発明のロボット聴覚システムの最良の第一の実施例を図面を参照して 詳細に説明する。  Hereinafter, a first preferred embodiment of the robot hearing system of the present invention will be described in detail with reference to the drawings.

図 1から図 4は本発明によるロボット聴覚システムの一実施形態を備えた実験 用の人型ロボッ卜の全体構成を示している。  FIGS. 1 to 4 show the overall configuration of an experimental humanoid robot provided with an embodiment of the robot hearing system according to the present invention.

図 1において、 人型ロボット 1 0は、 4 D O F (自由度) のロボットとして構 成されている。 人型ロボット 1 0は、 ベース 1 1と、 ベース 1 1上にて一軸 (垂 直軸) 周りに回動可能に支持された胴体部 1 2と、 胴体部 1 2上にて三軸方向 ( 垂直軸, 左右方向の水平軸及び前後方向の水平軸) の周りに揺動可能に支持され た頭部 13と、 を含んでいる。 In FIG. 1, the humanoid robot 10 is configured as a 4-DOF (degree of freedom) robot. The humanoid robot 10 includes a base 11, a body 12 supported rotatably around one axis (vertical axis) on the base 11, and a three-axis direction on the body 12 ( (Vertical axis, horizontal axis in the horizontal direction, and horizontal axis in the front-rear direction). The head 13 and

上記べ一ス 11は、 固定配置されていてもよく、 ロボットの脚部として動作可 能としてもよい。 また、 ベース 11は移動可能な台車等の上に載置されていても よい。  The base 11 may be fixedly arranged, and may be operable as a leg of a robot. Further, the base 11 may be placed on a movable cart or the like.

上記胴体部 12は、 ベース 11に対して垂直軸の周りに、 図 1にて矢印 Aで示 すように回動可能に支持されており、 図示しなレ 駆動手段によって回転駆動され る。 さらに、 胴体部 12は、 全体が防音性の外装によって覆われている。  The body portion 12 is rotatably supported about a vertical axis with respect to the base 11 as shown by an arrow A in FIG. 1, and is rotationally driven by a driving means (not shown). Further, the body 12 is entirely covered with a soundproof exterior.

上記頭部 13は胴体部 12に対して連結部材 13 aを介して支持されており、 この連結部材 13 aに対して前後方向の水平軸の周りに、 図 1にて矢印 Bで示す ように揺動可能に、 また左右方向の水平軸の周りに、 図 2にて矢印 Cで示すよう に揺動可能に支持されている。 この上記連結部材 13 aは、 胴体部 12に対して さらに前後方向の水平軸の周りに、 図 1にて矢印 Dで示すように揺動可能に支持 されている。  The head 13 is supported on the body 12 via a connecting member 13a, and around the horizontal axis in the front-rear direction with respect to the connecting member 13a, as shown by an arrow B in FIG. It is swingably supported around a horizontal axis in the left-right direction as shown by arrow C in FIG. The connecting member 13a is swingably supported relative to the body 12 around a horizontal axis in the front-rear direction, as indicated by an arrow D in FIG.

そして、 上記矢印 A, B, C, D方向の回転駆動は、 それそれ図示しない駆動 手段によって行なわれる。  The rotation in the directions of the arrows A, B, C, and D is performed by driving means (not shown).

ここで、 上記頭部 13は、 図 3に示すように全体が防音性の外装 14により覆 われている。 さらに、 上記頭部 13は、 前側にロボット視覚を担当する視覚装置 としてのカメラ 15を、 また両側にロボット聴覚を担当する聴覚装置としての一 対の外部マイク 16 (16a, 16b) を備えている。  Here, the head 13 is entirely covered with a soundproof exterior 14, as shown in FIG. Further, the head 13 has a camera 15 on the front side as a visual device in charge of robot vision, and a pair of external microphones 16 (16a, 16b) on both sides as a hearing device for robot hearing. .

さらに、 上記頭部 13は、 図 3に示すように、 外装 14の内側にて左右に離隔 して配設された一対の内部マイク 17 (17a, 17b) を備えている。  Further, as shown in FIG. 3, the head 13 includes a pair of internal microphones 17 (17a, 17b) which are disposed inside the exterior 14 so as to be separated from each other.

上記外装 14は、 例えばウレタン樹脂等の吸音性の合成樹脂から構成されてい る。 上記外装 14は、 頭部 13の内部をほぼ完全に密閉することにより頭部 13 の内部の遮音を行なうように構成されている。 なお、 胴体部 12の外装も、 同様 にして吸音性の合成樹脂から構成されている。  The exterior 14 is made of a sound absorbing synthetic resin such as a urethane resin. The exterior 14 is configured so that the inside of the head 13 is sound-insulated by almost completely sealing the inside of the head 13. The exterior of the body 12 is similarly made of a sound-absorbing synthetic resin.

上記カメラ 15は公知の構成であって、 例えば所謂パン, チルト, ズームの 3 DOF (自由度) を有する市販のカメラを使用することができる。  The camera 15 has a known configuration. For example, a commercially available camera having 3 DOFs (degrees of freedom) of so-called pan, tilt, and zoom can be used.

上記外部マイク 16は、 それぞれ頭部 13の側面において、 前方に向かって指 向性を有するように取り付けられている。 ここで、 外部マイク 1 6の左右の各外部マイク 16 a, 1 6 bは、 それそれ図 1及び図 2に示すように、 外装 14の両側にて前方に向いた段部 14 a, 14b にて内側に取り付けられ、 段部 14 a, 14 bに設けられた貫通穴を通して前方 の音を集音すると共に、 外装 14の内部の音を拾わないように適宜の手段により 遮音されている。 The external microphones 16 are attached to the sides of the head 13 so as to have directionality toward the front. Here, the left and right external microphones 16a and 16b on the left and right of the external microphone 16 are respectively attached to stepped portions 14a and 14b facing forward on both sides of the exterior 14, as shown in FIGS. It is mounted on the inside, and through the through holes provided in the steps 14a, 14b, collects the sound in front and is sound-insulated by appropriate means so as not to pick up the sound inside the exterior 14.

これにより外部マイク 1 6 a, 16 bは、 所謂バイノーラルマイクとして構成 されている。 なお、 外部マイク 16 a, 16 bの取付位置の近傍において、 外装 14は人間の外耳形状に形成されていてもよい。  Thus, the external microphones 16a and 16b are configured as so-called binaural microphones. In the vicinity of the mounting position of the external microphones 16a and 16b, the exterior 14 may be formed in the shape of a human outer ear.

上記内部マイク 17は、 それぞれ外装 14の内側において、 上述した各外部マ イク 16 a, 16 bの近傍に、 図示の場合には、 前記カメラ 1 5の両端付近の上 方にて、 後述するように横方向に移動可能に配設されている。  The internal microphones 17 are respectively located inside the exterior 14 in the vicinity of the external microphones 16a and 16b described above, and in the illustrated case, above the both ends of the camera 15 as described later. Is arranged so as to be movable in the lateral direction.

なお、 上記内部マイク 17は、 外装 14の内側にて任意の位置に備えられてい てもよい。  Note that the internal microphone 17 may be provided at an arbitrary position inside the exterior 14.

図 4は上記外部マイク 16及び内部マイク 17を含む音響処理のための聴覚シ ステムの電気的構成を示している。  FIG. 4 shows an electrical configuration of a hearing system for sound processing including the external microphone 16 and the internal microphone 17.

図 4において、 聴覚システム 20は、 各外部マイク 16 a, 16 b及び内部マ イク 17 a, 17 bからの音響信号をそれぞれ増幅するアンプ 2 1 a, 2 l b, 2 1 c, 2 I dと、 これらの増幅信号を AD変換によりデジタル音響信号 SOL , SOR, S I L, S I Rに変換する ADコンバータ 22 a, 22 b, 22 c, 22 と、 これらのデジタル音響信号が入力される左右のノイズ除去回路 23, 24と、 ノイズ除去回路 23, 24からのデジタル音響信号 SR, SLが入力さ れるピッチ抽出部 25, 26と、 ピッチ抽出部 25, 26からの音響データが入 力される左右チャンネル対応部 27と、 左右チャンネル対応部 27からのデータ が入力される音源分離部 28と、 から構成されている。  In FIG. 4, a hearing system 20 includes amplifiers 21 a, 2 lb, 21 c, and 2 Id that amplify acoustic signals from the external microphones 16 a and 16 b and the internal microphones 17 a and 17 b, respectively. AD converters 22a, 22b, 22c, 22 that convert these amplified signals into digital audio signals SOL, SOR, SIL, SIR by AD conversion, and left and right noise removal circuits to which these digital audio signals are input 23, 24, pitch extraction units 25, 26 to which digital audio signals SR, SL from noise elimination circuits 23, 24 are input, and left and right channel corresponding units to which audio data from pitch extraction units 25, 26 are input 27, and a sound source separation unit 28 to which data from the left and right channel corresponding units 27 is input.

上記 ADコンバータ 22 a乃至 22 dは、 例えば量子化ビット数 16または 2 4で、 48kHzサンプリングした信号を取り出すように構成されている。 そして、 左側の外部マイク 16 aからのデジ夕ル音響信号 SOL及び左側の内 部マイク 1 Ί aからのデジタル音響信号 S I Lは、 ノイズ除去回路 23に入力さ れる。 また、 右側の外部マイク 16 bからのデジタル音響信号 S OR及び左側の 内部マイク 1 7 bからのデジタル音響信号 S I Rは、 ノイズ除去回路 2 4に入力 される。 The AD converters 22a to 22d are configured to take out a signal sampled at 48 kHz with a quantization bit number of 16 or 24, for example. Then, the digital audio signal SOL from the left external microphone 16 a and the digital audio signal SIL from the left internal microphone 1 a are input to the noise removal circuit 23. The digital audio signal SOR from the external microphone 16b on the right and the left The digital acoustic signal SIR from the internal microphone 17 b is input to the noise elimination circuit 24.

これらのノイズ除去回路 2 3 , 2 4は同一の構成であり、 それそれ外部マイク 1 6からの音響信号から内部マイク 1 7のノイズ信号によりノイズキャンセルす るように構成されている。  These noise elimination circuits 23 and 24 have the same configuration, and are configured to cancel noise from the acoustic signal from the external microphone 16 by the noise signal from the internal microphone 17.

即ち、 ノイズ除去回路 2 3は、 外部マイク 1 6 aからのデジタル音響信号 S O Lを、 内部マイク 1 7 aが集音したロボット内部のノイズ発生源からのノイズ信 号 S I Lに基づいて、 例えば最も簡便には、 外部マイク 1 6 aからのデジタル音 響信号 S O Lから、 内部マイク 1 Ί aからの音響信号 S I Lを減算する等の適宜 の演算処理によるノイズキャンセル処理によって、 外部マイク 1 6 aからの音響 信号 S 0 Lに混入したロボット内部の各駆動機構等のノィズ発生源からのノイズ を除去して、 左の音響信号 S Lを生成する。  That is, the noise elimination circuit 23 converts the digital sound signal SOL from the external microphone 16a into a noise signal SIL from the noise source inside the robot collected by the internal microphone 17a. The noise from the external microphone 16a is reduced by appropriate noise reduction processing such as subtracting the acoustic signal SIL from the internal microphone 16a from the digital acoustic signal SOL from the external microphone 16a. The left acoustic signal SL is generated by removing noise from noise sources such as each drive mechanism inside the robot mixed in the signal S 0 L.

また、 ノイズ除去回路 2 4は、 外部マイク 1 6 bからのデジタル音響信号 S O Rを、 内部マイク 1 7 bが集音したロボット内部のノイズ発生源からのノイズ信 号 S I Rに基づいて、 例えば最も簡便には、 外部マイク 1 6 bからのデジタル音 響信号 S O Rから、 内部マイク 1 Ί bからの音響信号 S I Rを減算する等の適宜 の演算処理によるノイズキャンセル処理によって、 外部マイク 1 6 bからの音響 信号 S I Rに混入したロボット内部の各駆動機構等のノィズ発生源からのノイズ を除去して、 右の音響信号 S Rを生成する。  In addition, the noise removal circuit 24 is based on the digital audio signal SOR from the external microphone 16 b and the noise signal SIR from the noise source inside the robot collected by the internal microphone 17 b, The noise from the external microphone 16b is reduced by appropriate noise cancellation processing such as subtracting the acoustic signal SIR from the internal microphone 1 内部 b from the digital sound signal SOR from the external microphone 16b. The noise from the noise source such as each drive mechanism inside the robot mixed into the signal SIR is removed, and the right acoustic signal SR is generated.

上記ピッチ抽出部 2 5, 2 6は同一の構成であり、 ノイズ除去回路 2 3 , 2 4 からの左右の音響信号 S L, S Rをそれそれ周波数解析して、 時間, 周波数及び パワーの三軸から成る音響データ、 並びにピッチ情報を取り出すように構成され ている。 即ち、 ピッチ抽出部 2 5は、 ノイズ除去回路 2 3からの左の音響信号 S Lを周波数解析することにより、 時間とパワーから成る二軸の音響信号 S から 時間, 周波数及びパワーの三軸から成るスぺクトログラムと呼ばれる左の音響デ 一夕 D Lを取り出す。  The pitch extraction units 25 and 26 have the same configuration. The left and right sound signals SL and SR from the noise elimination circuits 23 and 24 are subjected to frequency analysis, and the three axes of time, frequency and power are analyzed. It is configured to retrieve the acoustic data and pitch information. That is, the pitch extraction unit 25 performs frequency analysis on the left acoustic signal SL from the noise elimination circuit 23 to obtain three axes of time, frequency and power from the two-axis acoustic signal S composed of time and power. Take out the left acoustic data DL called a spectrogram.

また、 ピッチ抽出部 2 6も同様にして、 ノイズ除去回路 2 4からの右の音響信 号 S Rを周波数解析することにより、 時間とパワーから成る二軸の音響信号 S R から時間, 周波数及びパワーの三軸から成る右の音響デ一夕 D R、 ピッチ情報を 取り出す。 Similarly, the pitch extraction unit 26 performs frequency analysis on the right acoustic signal SR from the noise elimination circuit 24 to obtain the time, frequency, and power from the two-axis acoustic signal SR composed of time and power. Three-axis right sound data DR, pitch information Take out.

ここで、 上記周波数解析は、 例えば 20m秒の窓長で、 7. 5 m秒づつずらし ながら、 FFT (高速フーリエ変換) を行なうことにより行なわれる。  Here, the above-mentioned frequency analysis is performed by performing FFT (Fast Fourier Transform) with a window length of, for example, 20 ms and shifting by 7.5 ms.

なお、 このような周波数解析は、 F FTに限らず一般的な種々の方法で行なわ れるようにしてもよい。  Such frequency analysis is not limited to FFT and may be performed by various general methods.

このようにして得られた音響デ一夕 DLにおいては、 音声や音楽における各音 はスぺクトログラム上にてピークの時間方向への連なりとして示されると共に、 一般的に調波構造を有しており、 整数倍の周波数値に規則的にピークを有するよ うになっている。  In the sound data DL obtained in this manner, each sound in voice and music is shown as a series of peaks in the time direction on a spectrogram, and generally has a harmonic structure. And has a peak regularly at an integer multiple of the frequency value.

上記左右チャンネル対応部 27は、 左右のピッチ抽出部 25, 26からの音響 デ一夕 DL, DRにてそれそれピークにより示される調波構造のピッチ情報に関 してその位相差, 時間差等に基づいて、 同じ音から派生するピッチを左右チャン ネルで対応付けることにより音の方向付けを行なう。  The left and right channel corresponding units 27 are used to determine the phase difference, time difference, etc. of the pitch information of the harmonic structure indicated by the peak in the sound data DL and DR from the left and right pitch extraction units 25 and 26, respectively. Based on this, the direction of the sound is determined by associating the pitches derived from the same sound with the left and right channels.

図示の場合、 左右チャンネル対応部 27は、 聴覚システム 20以外にロボット 10に備えられた図示しない他の知覚システム 30から目標に関する情報、 具体 的には例えば視覚システムによる目標の位置, 方向, 形状, 移動の有無等に関す る情報や、 触覚システムによる目標の柔軟度, 振動の有無, 触感等の情報が入力 されることにより、 上述した音源からの音の方向付けがより一層明確に行なわれ るように構成されている。  In the case shown in the figure, the left and right channel corresponding units 27 provide information on the target from other perceptual systems 30 (not shown) provided in the robot 10 in addition to the auditory system 20, specifically, for example, the position, direction, shape, By inputting information on the presence / absence of movement and information on the target flexibility, presence / absence of vibration, tactile sensation, etc. by the haptic system, the direction of the sound from the sound source described above can be more clearly determined It is configured as follows.

上記音源分離部 28は、 公知の構成により、 左右チャンネル対応部 27からの 方向付けの情報及び音響デ一夕 DL, DRが入力されることにより、 例えばディ クションパスフィル夕を使用する等の手法によって、 方向付けの情報に基づいて 音響データ DL, DRから音源を同定すると共に、 各音源毎に音響データを分離 するように構成されている。  The sound source separation unit 28 receives the direction information and the sound data DL and DR from the left and right channel corresponding units 27 according to a known configuration. Thus, the sound source is identified from the sound data DL and DR based on the information of the direction, and the sound data is separated for each sound source.

ここで、 上記内部マイク 17は、 具体的には、 図 5〜図 8に示すように、 カメ ラ 15に対して取り付けられている。  Here, the internal microphone 17 is specifically attached to the camera 15 as shown in FIGS.

先ず、 図 5において、 各内部マイク 17 a, 17bは、 それそれ可動部材とし ての取付金具 40, 41に対して固定されている。  First, in FIG. 5, each of the internal microphones 17a and 17b is fixed to mounting brackets 40 and 41 as movable members, respectively.

この取付金具 40, 41は、 それそれ上方が開放したほぼ U字形状に形成され ており、 上端が締付ネジ 40 a, 41 aで互いに近接されることにより内部マイ ク 17 a, 17 bを挟持して固定保持する。 These mounting brackets 40, 41 are formed in a substantially U-shape with the upper part opened. The upper ends of the inner microphones 17a and 17b are fixedly held by being brought close to each other with the tightening screws 40a and 41a.

また、 上記取付金具 40, 41は、 それそれ支持部材 42, 43に対して後述 するように横方向に摺動可能に取り付けられている。  The mounting brackets 40 and 41 are respectively slidably mounted on the support members 42 and 43 in a lateral direction as described later.

上記支持部材 42, 43は、 それぞれカメラ 15の上面に対してブラケット 4 4により固定されている。 なお、 図示の場合、 支持部材 42, 43は、 ブラケッ ト 44と一体に形成されていると共に、 ブラケット 44はカメラ 15が取り付け られる取付台 15 aに対して螺着されている。  The support members 42 and 43 are fixed to the upper surface of the camera 15 by brackets 44, respectively. In the illustrated case, the support members 42 and 43 are formed integrally with the bracket 44, and the bracket 44 is screwed to a mounting base 15a on which the camera 15 is mounted.

ここで、 各支持部材 42, 43は、 それそれ横方向に延びるスロット 42 a, 43 aを備えている。  Here, each support member 42, 43 is provided with a slot 42a, 43a extending in the lateral direction, respectively.

そして、 上記取付金具 40, 41はそれぞれこれらのスロット 42 a, 43 a に挿通された固定ネジ 45, 46を受容するネジ孔 (図示せず) を有しており、 これらの固定ネジ 45, 46が締め付けられることで支持部材 42, 43に対し て固定保持される。  The mounting brackets 40 and 41 have screw holes (not shown) for receiving the fixing screws 45 and 46 inserted into the slots 42a and 43a, respectively. Are fixed and held on the support members 42 and 43 by being tightened.

本発明実施形態による人型ロボット 10は以上のように構成されており、 外部 マイク 16 a, 16 bが集音しょうとする目標からの音は、 以下のようにして集 音され、 ノイズキャンセルされることにより音源に知覚される。  The humanoid robot 10 according to the embodiment of the present invention is configured as described above, and the sound from the target to be collected by the external microphones 16a and 16b is collected as follows, and the noise is canceled. Is perceived by the sound source.

先ず、 外部マイク 16 a, 16b力 主として目標からの外部の音を集音して それそれアナログ音響信号を出力する。 ここで、 外部マイク 16 a, 16 bは口 ボット内部からのノイズも集音するが、 外装 14自体が頭部 13の内部を密閉し ており、 また外部マイク 16 a, 16 bが頭部 13の内部に対して遮音されてい ることにより、 アナログ音響信号に混入するノイズは、 比較的低いレベルに抑え られている。  First, external microphones 16a, 16b mainly collect external sounds from the target and output analog sound signals. Here, the external microphones 16a and 16b also collect noise from the inside of the mouth bot, but the exterior 14 itself seals the inside of the head 13 and the external microphones 16a and 16b The noise that is mixed into the analog sound signal is suppressed to a relatively low level because the sound is shielded from the inside.

これに対して、 内部マイク 17 a, 17 bは、 主としてロボット内部からのノ ィズ、 例えば前述した各駆動機構の作動音や冷却ファンの作動音等のノィズ発生 源からのノイズを集音する。 ここで、 内部マイク 17 a, 17bは、 外部からの 音も集音するが、 外装 14が内部を密閉していることから、 混入するノイズのレ ベルは比較的低く抑えられている。  On the other hand, the internal microphones 17a and 17b mainly collect noise from the inside of the robot, for example, noise from noise generating sources such as the operating noise of each drive mechanism and the operating noise of the cooling fan. . Here, the internal microphones 17a and 17b also collect sound from outside, but since the exterior 14 hermetically seals the inside, the level of mixed noise is kept relatively low.

このようにして集音された外部マイク 1 6 a, 16 bからのアナログ音響信号 及び内部マイク 1 7 a, 17 bからのアナログ音響信号は、 それそれアンプ 21 a乃至 21 dにより増幅された後、 ADコンバータ 22 a乃至 22 dによりデジ タル音響信号 SOL, SOR, S I L, S IRに AD変換されて、 ノイズ除去回 路 23及び 24に入力される。 Analog audio signals from external microphones 16a and 16b collected in this way Analog audio signals from the internal microphones 17a and 17b are amplified by amplifiers 21a to 21d, respectively, and then digital audio signals SOL, SOR, SIL, SIR by AD converters 22a to 22d. The signal is AD-converted and input to the noise elimination circuits 23 and 24.

ノイズ除去回路 23, 24は、 それそれ外部マイク 16 a, 16 bからの音響 信号 SOL, SORから内部マイク 17 a, 17 bからの音響信号 S I L , S I Rを減算する等の演算処理を行なうことにより、 外部マイク 16 a, 16 bから の音響信号 SOL, SORから、 ロボット内部のノイズ発生源からのノイズ信号 を除去する。 これにより、 ノイズ除去回路 23, 24はそれそれノイズを除去し た真の音響信号 SL, SRを外部に出力する。  The noise elimination circuits 23 and 24 perform arithmetic processing such as subtracting the audio signals SIL and SIR from the internal microphones 17a and 17b from the audio signals SOL and SOR from the external microphones 16a and 16b, respectively. The noise signal from the noise source inside the robot is removed from the sound signals SOL and SOR from the external microphones 16a and 16b. As a result, the noise elimination circuits 23 and 24 respectively output the true acoustic signals SL and SR from which noise has been eliminated to the outside.

そして、 ピッチ抽出部 25, 26力 それそれ音響信号 SL, SRに基づいて 周波数解析により音響デ一夕 DL, DRを生成する。  Then, based on the sound signals SL and SR, the pitch extraction units 25 and 26 generate sound data DL and DR by frequency analysis.

ここで、 音響データは、 例えば図 9に示すようなスペクトログラムとして示さ れる。 図 9の音響データにおいて、 横軸を時刻 t (秒) 、 縦軸を周波数 (Hz) とし、 濃度をパワーとして、 例えば時刻 t 1から時刻 t 2まで断続的に連なった 周波数成分が示されている。 この周波数成分は、 図示の場合、 1000 Hz, 2 000 H z , 3000Hz, · · ·のピークの連なりを有しており、 これらのピ —クがそれそれ 1000Hzの整数倍であることから、 一般的には、 これらの周 波数成分が唯一つの音を表わし、 あるいはそれそれ一つの音を表わすものと解さ れ得る。 この場合には、 これらの周波数成分が、 同時刻に開始し且つ同時刻に終 了していることから、 これらの周波数成分が唯一つの音の調波構造を示している と判断される。 従って、 ピッチ抽出部 25, 26は、 最も低い周波数である基本 周波数 100 OH zをピッチとする。  Here, the acoustic data is shown as, for example, a spectrogram as shown in FIG. In the acoustic data of Fig. 9, the horizontal axis represents time t (seconds), the vertical axis represents frequency (Hz), and the density represents power, for example, frequency components intermittently connected from time t1 to time t2. I have. In the illustrated case, this frequency component has a series of peaks of 1000 Hz, 2000 Hz, 3000 Hz, and so on. Since these peaks are each an integral multiple of 1000 Hz, Typically, these frequency components can be understood to represent only one sound, or each one. In this case, since these frequency components start at the same time and end at the same time, it is determined that these frequency components indicate the harmonic structure of only one sound. Therefore, the pitch extraction units 25 and 26 set the fundamental frequency 100 OHz, which is the lowest frequency, as the pitch.

このようにして、 ピッチ抽出部 25, 26は、 音響信号 SL, SRに含まれる 全ての音に関して当該音に関するピッチを抽出し、 このピッチに対応する当該音 の調波構造, 開始時 t 1, 終了時 t 2と共に、 音響データ DL, DR, ピッチ情 報として左右チャンネル対応部 27に出力する。  In this way, the pitch extraction units 25 and 26 extract the pitches of all the sounds included in the sound signals SL and SR, and extract the pitches of the sounds corresponding to the pitches, At the end, at time t2, the sound data DL, DR, and pitch information are output to the left and right channel corresponding unit 27.

続いて、 左右チャンネル対応部 27が、 これらの音響デ一夕 DL, DRに基づ いて音源毎の音の方向付けを行なう。 この場合、 左右チャンネル対応部 27は、 例えば各ピッチ抽出部 25 , 26で 抽出された音響デ一夕 DL, DRに基づいて、 左右のチャンネルの調波構造を比 較して最も近いピッチを対応付ける。 その際、 左右のチャンネルのピッチを一対 一で比較するだけでなく、 一方のチャンネルの複数のピッチを他方のチャンネル の一つのピッチと比較する等により、 より正確な対応付けを行なうことが好まし レ、。 そして、 左右チャンネル対応部 27は、 対応付けられたピッチの位相を比較 することにより音源の方向を容易に演算して方向付けることができる。 Subsequently, the left and right channel corresponding unit 27 determines the direction of the sound for each sound source based on the sound data DL and DR. In this case, the left and right channel correspondence unit 27 compares the harmonic structures of the left and right channels based on the sound data DL and DR extracted by the pitch extraction units 25 and 26, and associates the closest pitch . In that case, it is preferable to make more accurate correspondence by comparing not only the pitch of the left and right channels one-to-one but also a plurality of pitches of one channel with one pitch of the other channel. Les ,. Then, the left and right channel corresponding section 27 can easily calculate and direct the direction of the sound source by comparing the phases of the pitches associated with each other.

これにより、 音源分離部 28が、 左右チャンネル対応部 27からの音の方向付 け情報に基づいて、 音響データ DL, DRから音源毎の音に関する音響デ一夕を 抽出して、 音源毎の音に分離する。 このようにして、 聴覚システム 20は音源毎 の音の分離によつて音響認識を行ない、 能動聴覚を行なうことができる。  Accordingly, the sound source separation unit 28 extracts the sound data for each sound source from the sound data DL and DR based on the sound direction information from the left and right channel corresponding units 27, and outputs the sound data for each sound source. To separate. In this way, the hearing system 20 can perform sound recognition by separating sounds for each sound source and perform active hearing.

ここで、 内部マイク 17 a, 17bは、 それそれ取付金具 40, 41に固定保 持されると共に、 これらの取付金具 40, 41が支持部材 42, 43で固定保持 されることにより、 カメラ 15に対して固定保持されている。  Here, the internal microphones 17a, 17b are fixedly held by the mounting brackets 40, 41, respectively, and the mounting brackets 40, 41 are fixedly held by the support members 42, 43, so that the internal microphones 17a, 17b are fixed to the camera 15. Fixedly held.

さらに、 取付金具 40, 41は、 それそれ支持部材 42, 43のスロット 42 a, 43 aに挿通された固定ネジ 45 , 46により支持部材 42, 43に対して 螺着されている。  Further, the mounting brackets 40, 41 are screwed to the support members 42, 43 by fixing screws 45, 46 inserted into the slots 42a, 43a of the support members 42, 43, respectively.

その際、 取付金具 40, 41そして内部マイク 17 a, 17bは、 固定ネジ 4 5 , 46が緩められた状態にて、 固定ネジ 45, 46がそれぞれ支持部材 42, 43のスロット 42 a, 43 a内を摺動することにより、 横方向の位置が適宜に 調整され、 調整後に固定ネジ 45, 46が締め付けられる。  At this time, the mounting brackets 40, 41 and the internal microphones 17a, 17b are fixed to the mounting members 45, 46 with the fixing screws 45, 46 loosened, and the fixing screws 45, 46 are inserted into the slots 42a, 43a of the support members 42, 43, respectively. By sliding inside, the horizontal position is adjusted appropriately, and after adjustment, the fixing screws 45 and 46 are tightened.

これにより、 内部マイク 17 a, 17bは、 頭部 13の外装 14の内側にて横 方向に移動調整されることにより、 所望の位置に設定され得る。 即ち、 内部マイ ク 17 a, 17 bは、 それそれ対応する外部マイク 16 a, 1 6 bに対して適宜 の距離の位置に調整されることによって、 前述したノイズ除去回路 23, 24に よるノイズキャンセルを最も効果的に行なうことができる。  Thus, the internal microphones 17a and 17b can be set at desired positions by being moved and adjusted in the lateral direction inside the exterior 14 of the head 13. That is, the internal microphones 17a and 17b are adjusted to positions at appropriate distances with respect to the corresponding external microphones 16a and 16b, so that the noise generated by the above-described noise removing circuits 23 and 24 is adjusted. Cancellation can be performed most effectively.

このようにして、 本発明実施形態による人型ロボッ ト 10によれば、 内部マイ ク 17 a, 17bを横方向に適宜に調整した状態にて、 ノイズ除去回路 23, 2 4により、 外部マイク 16 a, 16 bからの音響信号 SOL, SORから、 内部 マイク 17 a, 17 bからの音響信号 S I L, S I Rに基づいてノイズキャンセ ルを行なうことにより、 各駆動機構によって外部マイク 16 a, 16 bの指向性 の方向を目標に向けながら、 S/N比の良好な音響信号を得て、 各音源からの音 を分離してより正確な音声認識を行なうことができる。 Thus, according to the humanoid robot 10 according to the embodiment of the present invention, the noise reduction circuits 23 and 24 allow the external microphones 16 and 17 to be appropriately adjusted in the horizontal direction while the internal microphones 17 a and 17 b are appropriately adjusted. a, 16 b Sound signal from SOL, SOR, internal By performing noise cancellation based on the acoustic signals SIL and SIR from the microphones 17a and 17b, the S / N ratio can be increased while the directivity of the external microphones 16a and 16b is directed to the target by each drive mechanism. A good acoustic signal can be obtained, and the sound from each sound source can be separated to perform more accurate speech recognition.

従って、 例えば目標が移動している場合であっても、 各駆動機構によって外部 マイク 16 a, 16 bの指向性の方向を常に目標に対して追従させながら、 目標 の音響認識を行なうことができる。  Therefore, even when the target is moving, for example, the acoustic recognition of the target can be performed while the directionality of the directivity of the external microphones 16a and 16b always follows the target by each drive mechanism. .

このとき、 左右チャンネル対応部 27が、 例えば他の知覚システム 30として 視覚システムからの視覚による目標の方向付け情報を参照して、 音の方向付けを 行なうことにより、 より正確な音の方向付けを行なうことができる。  At this time, the left and right channel corresponding unit 27 refers to, for example, the visual target direction information from the visual system as the other perceptual system 30 to determine the direction of the sound, thereby providing more accurate sound direction. Can do it.

また、 他の知覚システム 30として視覚システムが使用される場合、 左右チヤ ンネル対応部 27が視覚システムに対して音の方向付け情報を出力するようにし てもよい。 これにより、 視覚システム 30が、 画像認識により目標の方向付けを 行なう際に、 聴覚システム 20からの音に関する方向付け情報を参照することに よって、 例えば目標が移動して障害物の裏に隠れてしまった場合でも、 目標から の音を参照して、 目標の方向付けをより一層正確に行なうことができる。  When a visual system is used as the other perceptual system 30, the left and right channel corresponding units 27 may output sound direction information to the visual system. This allows the visual system 30 to refer to sound-directing information from the auditory system 20 when directing a target by image recognition, for example, by moving the target and hiding behind obstacles. Even if you do, you can refer to the sound from the target and more accurately direct the target.

図 10は上述した内部マイク 17 a, 17bの取付構造の変形例を示している 。 図 10において、 内部マイク 17 a, 17 bを横方向に移動可能に支持する支 持部材 42, 43が、 さらにカメラ 15の上面に対して図示しない適宜の手段で 固定された第二の支持部材 47, 48に対して、 前後方向に摺動可能に取り付け られている。 上記第二の支持部材 47, 48はそれぞれ前後方向に延びるスロッ ト 47 a, 48 aを備えている。  FIG. 10 shows a modification of the mounting structure of the internal microphones 17a and 17b described above. In FIG. 10, supporting members 42 and 43 for supporting the internal microphones 17a and 17b movably in the lateral direction are further fixed to an upper surface of the camera 15 by appropriate means (not shown). It is attached to 47 and 48 so that it can slide in the front-rear direction. The second support members 47, 48 have slots 47a, 48a extending in the front-rear direction, respectively.

そして、 上記支持部材 42, 43の両端に形成された垂直に延びる折曲部 42 b, 42 c (43b, 43 c) が、 それそれこれらのスロット 47 a, 48 aに 挿通された固定ネジ 49, 50を受容するネジ孔 (図示せず) を有しており、 こ れらの固定ネジ 49, 50が締め付けられることにより、 第二の支持部材 47, 48に対して固定保持される。  The vertically extending bent portions 42b, 42c (43b, 43c) formed at both ends of the support members 42, 43 are respectively provided with fixing screws 49 inserted into these slots 47a, 48a. Screw holes (not shown) for receiving the fixing members 49, 50, and these fixing screws 49, 50 are fixed to the second support members 47, 48 by being tightened.

このような構成の内部マイク 17 a, 17 bの取付構造によれば、 内部マイク 17 a, 17bはそれそれ取付金具 40, 41に固定保持されると共に、 これら の取付金具 40, 41が支持部材 42, 43及び第二の支持部材 47, 48で固 定保持されることにより、 カメラ 15に対して固定保持されている。 According to the mounting structure of the internal microphones 17a and 17b having such a configuration, the internal microphones 17a and 17b are fixedly held by the mounting brackets 40 and 41, respectively. The mounting brackets 40 and 41 are fixedly held by the support members 42 and 43 and the second support members 47 and 48, thereby being fixedly held to the camera 15.

さらに、 取付金具 40, 41は、 それそれ支持部材 42, 43のスロット 42 a, 43 aに挿通された固定ネジ 45 , 46で支持部材 42, 43に対して螺着 されており、 また支持部材 42, 43は、 それそれ第二の支持部材 47, 48の スロッ ト 47 a, 48 aに揷通された固定ネジ 49 , 50で第二の支持部材 47 , 48に対して螺着されている。  Further, the mounting brackets 40, 41 are screwed to the support members 42, 43 by fixing screws 45, 46 inserted into the slots 42a, 43a of the support members 42, 43, respectively. 42, 43 are respectively screwed to the second support members 47, 48 with fixing screws 49, 50 passed through the slots 47a, 48a of the second support members 47, 48. .

その際、 取付金具 40, 41そして内部マイク 17 a, 17 bは、 固定ネジ 4 5, 46が緩められた状態にて、 固定ネジ 45, 46がそれそれ支持部材 42, 43のスロット 42 a, 43 a内を摺動することにより横方向の位置が適宜に調 整され、 調整後に固定ネジ 45, 46が締め付けられる。  At that time, the mounting brackets 40, 41 and the internal microphones 17a, 17b are fixed to the fixing screws 45, 46 while the fixing screws 45, 46 are loosened. The position in the horizontal direction is adjusted as appropriate by sliding in 43a, and after adjustment, the fixing screws 45 and 46 are tightened.

また、 上記支持部材 42, 43は、 固定ネジ 49, 50が緩められた状態にて 、 固定ネジ 49, 50がそれそれ第二の支持部材 47, 48のスロット 47 a, 48 a内を摺動することにより前後方向の位置が適宜に調整され、 調整後に固定 ネジ 49, 50が締め付けられる。  The fixing screws 49 and 50 slide in the slots 47 a and 48 a of the second supporting members 47 and 48, respectively, with the fixing screws 49 and 50 loosened. By doing so, the position in the front-rear direction is appropriately adjusted, and after the adjustment, the fixing screws 49 and 50 are tightened.

これにより、 内部マイク 17 a, 17bは、 頭部 13の外装 14の内側にて横 方向及び前後方向に移動調整されることにより、 所望の位置に設定され得る。 即 ち、 内部マイク 17a, 17 bは、 それぞれ対応する外部マイク 16 a, 16b に対して適宜の距離の位置に調整されることによって、 前述したノイズ除去回路 23, 24によるノイズキャンセルをより一層効果的に行なうことができる。 上述した実施形態において、 人型ロボット 10は、 4DOF (自由度) を有す るように構成されているが、 これに限らず、 任意の動作を行なうように構成され たロボットに本発明によるロボット聴覚システムを組み込むことも可能である。 また、 上述した実施形態においては、 本発明によるロボット聴覚システムを人型 ロボッ ト 10に組み込んだ場合について説明したが、 これに限らず、 犬型等の各 種動物型ロボットや、 その他の形式のロボットに組み込むことも可能であること は明らかである。  Thus, the internal microphones 17a and 17b can be set at desired positions by being moved and adjusted in the lateral and front-back directions inside the exterior 14 of the head 13. That is, the internal microphones 17a and 17b are adjusted to positions at appropriate distances with respect to the corresponding external microphones 16a and 16b, respectively, so that the noise cancellation by the noise removing circuits 23 and 24 described above is more effectively achieved. Can be done In the above-described embodiment, the humanoid robot 10 is configured to have 4 DOF (degree of freedom). However, the present invention is not limited to this. The robot according to the present invention may be a robot configured to perform an arbitrary operation. It is also possible to incorporate a hearing system. Further, in the above-described embodiment, the case where the robot hearing system according to the present invention is incorporated in the humanoid robot 10 has been described. However, the present invention is not limited to this. Obviously, it can be built into a robot.

さらに、 上述した実施形態において、 内部マイク 17は一対のマイク 17 a, 17 bから構成されているが、 二対以上のマイクから構成されていてもよい。  Further, in the above-described embodiment, the internal microphone 17 is constituted by a pair of microphones 17a and 17b, but may be constituted by two or more pairs of microphones.

4 さらにまた、 上述した実施形態においては、 外部マイク 1 6は、 一対のマイク 1 6 a , 1 6 bから構成されているが、 二対以上のマイクから構成されていても よい。 Four Furthermore, in the above-described embodiment, the external microphone 16 is configured by a pair of microphones 16a and 16b, but may be configured by two or more pairs of microphones.

また、 上述した実施形態では、 内部マイク 1 7 a, 1 7 bは、 支持部材 4 2, 4 3のスロット 4 2 a , 4 3 aに沿って横方向に移動調整可能に構成されている が、 これに限らず他の手段によって横方向に移動調整可能に構成されていてもよ い。  In the above-described embodiment, the internal microphones 17a and 17b are configured to be movable and adjustable in the horizontal direction along the slots 42a and 43a of the support members 42 and 43. However, the present invention is not limited to this, and it may be configured so that the movement can be adjusted in the horizontal direction by other means.

取付金具 4 0 , 4 1は、 駆動手段によって支持部材 4 2 , 4 3そしてカメラ 1 5に対して横方向に移動調整可能に構成されていてもよい。 この場合、 人型ロボ ット 1 0の頭部 1 3から外装 1 4を外すことなく、 組み立てた状態にて内部マイ ク 1 7 a, 1 7 bの横方向の位置調整を行なうことができるので、 ノイズキャン セルをより効果的に、 そしてより正確に行なうことができる。  The mounting brackets 40 and 41 may be configured to be movable and adjustable in the lateral direction with respect to the support members 42 and 43 and the camera 15 by driving means. In this case, it is possible to adjust the horizontal position of the internal microphones 17a and 17b in the assembled state without removing the exterior 14 from the head 13 of the humanoid robot 10. Therefore, noise cancellation can be performed more effectively and more accurately.

さらに、 上述の実施形態においては、 内部マイク 1 7 a , 1 7 bは、 横方向に あるいは横方向及び前後方向に移動可能に支持されているが、 これに限らず、 さ らに上下方向に移動可能に支持されていてもよい。 この場合、 内部マイク 1 7 a , 1 7 bは、 公知の構成の三軸方向に移動調整可能な機構により支持される。 以上述べたように、 本発明によれば、 外部マイクが主として外部の目標からの 音を集音し、 内部マイクが主としてロボット内部の駆動機構等のノイズ発生源か らのノイズを集音する。 その際、 外部マイクが集音した音響信号には、 ロボット 内部のノィズ発生源からのノィズ信号が混入しているが、 この混入したノィズ信 号は、 ノィズ除去部における演算処理によって内部マイクが集音したノィズ信号 にてキャンセルされることにより著しく低減される。  Further, in the above-described embodiment, the internal microphones 17a and 17b are supported so as to be movable in the horizontal direction or in the horizontal direction and the front-back direction. It may be movably supported. In this case, the internal microphones 17a and 17b are supported by a mechanism having a known configuration that can be adjusted in three axes. As described above, according to the present invention, the external microphone mainly collects sound from an external target, and the internal microphone mainly collects noise from a noise generating source such as a drive mechanism inside the robot. At this time, the noise signal from the noise source inside the robot is mixed in the acoustic signal collected by the external microphone, and this mixed noise signal is collected by the internal microphone by arithmetic processing in the noise removing unit. The noise is significantly reduced by being canceled by the noise signal.

そして、 ノイズがキャンセルされた音響信号から、 ピッチ抽出部により周波数 解析によって音響信号を取り出して、 この音響信号から左右チャンネル対応部に より音の方向付けを行なった後、 この音の方向付けに基づいて、 音源分離部によ つて各音源毎の音響データに分離する。  Then, from the acoustic signal from which the noise has been canceled, the acoustic signal is extracted by frequency analysis by the pitch extraction unit, and the sound direction is determined from the acoustic signal by the left and right channel corresponding units. Then, the sound source is separated into sound data for each sound source by a sound source separation unit.

したがって、 外部マイクからの音響信号は、 ノイズ除去部での演算処理によつ てロボット内部の駆動機構等のノィズ発生源からのノィズが容易に著しく低減さ れて、 その S /N比が大幅に改善されるので、 音源毎の音響データの分離をより 一層良好に行なうことができる。 Therefore, in the acoustic signal from the external microphone, the noise from the noise source such as the drive mechanism inside the robot is easily and significantly reduced by the arithmetic processing in the noise elimination unit, and the S / N ratio is greatly increased. To separate sound data for each sound source. It can be performed better.

このようにして、 本発明によれば、 駆動機構等のロボット内部で発生するノィ ズに影響されることなく、 外部の目標からの音を集音して能動知覚を行なうこと ができるようにした、 極めて優れたロボット聴覚システムが提供される。  As described above, according to the present invention, it is possible to perform active perception by collecting sound from an external target without being affected by noise generated inside the robot such as a drive mechanism. An extremely good robot hearing system is provided.

さらに、 上記内部マイクが外装の内側にて横方向に移動可能に配設されている ことにより、 ロボット内部のノィズ発生源からのノイズをより良好に集音するよ うに内部マイクを横方向に移動させて、 各内部マイクと対となるべき対応する外 部マイクとの距離を適宜に調整することができ、 外部マイクが集音する音響信号 からより良好にノイズをキャンセルすることができる。  In addition, since the above-mentioned internal microphone is arranged to be able to move in the horizontal direction inside the exterior, the internal microphone is moved in the horizontal direction so that noise from the noise source inside the robot is better collected. Thus, the distance between each internal microphone and the corresponding external microphone to be paired can be appropriately adjusted, and noise can be better canceled from the acoustic signal collected by the external microphone.

なお、 本発明は例示的な実施例について説明したものであり、 本発明の要旨及 び範囲を逸脱することなく、 実施例での種々の変更、 省略、 追加が可能であるこ とは当業者において自明である。 したがって、 本発明は実施例に限定されるもの ではなく、 請求の範囲に記載された要素に基づいて規定される範囲及び及びその 均等範囲を包含するものとして理解されなければならない。 産業上の利用可能性  It should be noted that the present invention has been described with reference to exemplary embodiments, and it is understood by those skilled in the art that various changes, omissions, and additions can be made in the embodiments without departing from the spirit and scope of the present invention. It is obvious. Therefore, the present invention is not limited to the embodiments, but should be understood as including the range defined based on the elements described in the claims and the equivalents thereof. Industrial applicability

以上のように、 本発明によるロボット聴覚システムは、 ロボット、 特に人型ま たは動物型ロボッ トにおける能動聴覚のための聴覚システムとして極めて有用で ある。  As described above, the robot hearing system according to the present invention is extremely useful as a hearing system for active hearing in a robot, particularly in a humanoid or animal robot.

Claims

請 求 の 範 囲 . 内部にノイズ発生源を備えたロボッ トにおいて、 Scope of request For robots with noise sources inside, 少なくともロボッ 卜の一部を覆う防音性の外装と、  A soundproof exterior covering at least a part of the robot, 上記外装の外側に設けられ、 主として外部の音を集音する少なくとも一対の 外部マイクと、  At least one pair of external microphones provided outside the exterior and mainly collecting external sound; 上記外装の内側に設けられ、 主として内部のノイズ発生源からのノイズを集 音する少なくとも一対の内部マイクと、  At least one pair of internal microphones provided inside the exterior and collecting noise mainly from internal noise sources; 上記外部マイク及び内部マイクからの左右の信号に基づいて、 それぞれ外部 マイクからの音響信号から内部のノィズ発生源からのノィズ信号をキャンセル するノイズ除去部と、  A noise removing unit that cancels a noise signal from an internal noise source from an acoustic signal from the external microphone based on the left and right signals from the external microphone and the internal microphone, respectively; 上記ノイズ除去部からの左右の音響信号からそれそれ周波数解析を行って、 時間, 周波数及びパワーに関する音響データ及びピッチ情報を取り出すピッチ 抽出部と、  A pitch extraction unit that performs frequency analysis on each of the left and right audio signals from the noise removal unit and extracts audio data and pitch information relating to time, frequency, and power; 上記ピッチ抽出部で取り出された左右の音響データ及びピッチ情報に基づい て、 当該音響デ一夕が示すピッチを伴う調波構造から音の方向付けを行なう左 右チャンネル対応部と、  A left and right channel corresponding unit for directing sound from a harmonic structure with a pitch indicated by the sound data based on the left and right sound data and the pitch information extracted by the pitch extracting unit; 上記左右チヤンネル対応部で取り出された音の方向付け情報に基づいて、 上 記音響デー夕から音源毎の音響データに分離する音源分離部と、 を含んでいる ことを特徴とする、 ロボット聴覚システム。 . 内部に駆動機構等のノイズ発生源を備えた人型または動物型のロボッ トにお いて、  A sound source separation unit that separates the sound data into sound data for each sound source based on the sound direction information extracted by the left and right channel corresponding units. . In a human or animal robot with a noise source such as a drive mechanism inside, 少なくともロボッ卜の頭部を覆う防音性の外装と、  A soundproof exterior covering at least the head of the robot, 上記外装の外側にて両側の耳に対応する両耳位置に設けられ、 主として外部 の音を集音する少なくとも一対の外部マイクと、  At least one pair of external microphones provided at both ear positions corresponding to both ears on the outside of the exterior, and mainly collecting external sound; 上記外装の内側に設けられた、 主として内部のノィズ発生源からのノイズを 集音する少なくとも一対の内部マイクと、  At least one pair of internal microphones provided inside the exterior and collecting noise mainly from internal noise sources; 上記外部マイク及び内部マイクからの信号に基づいて外部マイクからの音響 信号から内部のノィズ発生源からのノィズ信号をキャンセルするノィズ除去部 と、 Sound from the external microphone based on the signals from the external microphone and the internal microphone A noise removing unit for canceling a noise signal from an internal noise source from the signal, 上記ノイズ除去部からの左右の音響信号からそれそれ周波数解析を行って、 時間, 周波数及びパワーに関する音響データ及びピッチ情報を取り出すピッチ 抽出部と、  A pitch extracting unit that performs frequency analysis from the left and right acoustic signals from the noise removing unit and extracts acoustic data and pitch information relating to time, frequency, and power; 上記ピッチ抽出部で取り出された左右の音響データ及びビッチ情報に基づ ヽ て、 当該音響データのピッチを伴う調波構造から音の方向付けを行なう左右チ ヤンネル対応部と、  A left and right channel corresponding unit for directing a sound from a harmonic structure with a pitch of the sound data based on the left and right sound data and the bitch information extracted by the pitch extracting unit; 上記左右チャンネル対応部で取り出された音の方向付け情報に基づいて、 上 記音響データから音源毎の音響データに分離する音源分離部と、 を含んでいる ことを特徴とする、 ロボット聴覚システム。 . さらに、 ロボットが視覚, 触覚等の知覚システムを備えており、 前記左右チ ヤンネル対応部が、 これらの知覚システムからの情報を参照して音の方向付け を行なうことを特徴とする、 請求の範囲第 1項又は第 2項に記載のロボット聴 覚システム。 . 前記左右チャンネル対応部が、 前記知覚システムに対して音の方向付けに関 する情報を出力することを特徴とする、 請求の範囲第 3項に記載のロボット聴 覚システム。 . 前記内部マイクが、 外装の内側にて少なくとも横方向に移動可能に配設され ていることを特徴とする、 請求の範囲第 1項から第 4項の何れかに記載のロボ ヅ ト聴覚システム。 . 前記内部マイクが、 外装の内側にて横方向に延びる支持部材に対して摺動可 能に支持されていることを特徴とする、 請求の範囲第 5項に記載のロボッ卜聴 . 前記支持部材が横方向に延びるスロットを備えており、 前記内部マイクが、 上記支持部材のスロッ卜に沿って摺動可能に支持された可動部材に固定されて いることを特徴とする、 請求の範囲第 6項に記載のロボット聴覚装置。 . 前記可動部材がスロッ卜に挿通された固定ネジにより支持部材に対して螺着 されることを特徴とする、 請求の範囲第 7項に記載のロボ'ソト聴覚装置。 . 前記内部マイクが、 さらに前後方向に移動可能に配設されていることを特徴 とする、 請求の範囲第 5項に記載のロボット聴覚装置。 0 . 前記内部マイクが、 外装の内側にて横方向に延びる第一の支持部材に対し て摺動可能に支持されていて、 この第一の支持部材が、 外装の内側にて前後方 向に延びる第二の支持部材に対して摺動可能に支持されていることを特徴とす る、 請求の範囲第 9項に記載のロボット聴覚装置。 1 . 前記内部マイクが、 さらに上下方向に移動可能に配設されていることを特 徴とする、 請求の範囲第 5項、 第 9項及び第 1 0項の何れかに記載のロボット A sound source separation unit configured to separate the sound data into sound data for each sound source based on the sound direction information extracted by the left and right channel corresponding units. Further, the robot is provided with a perception system such as visual and tactile senses, and the left and right channel corresponding units refer to information from these perception systems to direct sound. 3. The robot hearing system according to paragraph 1 or 2. 4. The robot hearing system according to claim 3, wherein the left and right channel corresponding unit outputs information relating to sound direction to the perception system. The robot hearing system according to any one of claims 1 to 4, wherein the internal microphone is disposed inside the exterior so as to be movable at least in a lateral direction. . The robot listening according to claim 5, wherein the internal microphone is slidably supported by a support member extending in a lateral direction inside the exterior. A member comprising a laterally extending slot, wherein the internal microphone comprises: 7. The robot hearing device according to claim 6, wherein the robot hearing device is fixed to a movable member slidably supported along a slot of the support member. 8. The Robo's soto hearing device according to claim 7, wherein the movable member is screwed to a support member by a fixing screw inserted into the slot. The robot hearing device according to claim 5, wherein the internal microphone is further disposed so as to be movable in the front-back direction. 0. The internal microphone is slidably supported on a first support member extending in the lateral direction inside the exterior, and the first support member moves forward and backward inside the exterior. 10. The robot hearing device according to claim 9, wherein the robot hearing device is slidably supported by a second support member that extends. 1. The robot according to any one of claims 5, 9, and 10, wherein the internal microphone is further disposed so as to be movable in a vertical direction.
PCT/JP2001/000647 2000-01-31 2001-01-31 Robot hearing system Ceased WO2001057852A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2000022677A JP2001215989A (en) 2000-01-31 2000-01-31 Robot hearing system
JP2000-022679 2000-01-31
JP2000-022677 2000-01-31
JP2000022679A JP2001215991A (en) 2000-01-31 2000-01-31 Robot hearing device

Publications (1)

Publication Number Publication Date
WO2001057852A1 true WO2001057852A1 (en) 2001-08-09

Family

ID=26584532

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2001/000647 Ceased WO2001057852A1 (en) 2000-01-31 2001-01-31 Robot hearing system

Country Status (1)

Country Link
WO (1) WO2001057852A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12183341B2 (en) 2008-09-22 2024-12-31 St Casestech, Llc Personalized sound management and method
US12249326B2 (en) 2007-04-13 2025-03-11 St Case1Tech, Llc Method and device for voice operated control

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
H.G. OKUNO ET AL.: "Research issues of humanois audition", JSAI TECHNICAL REPORT, PROCEEDINGS OF THE SEVENTH MEETING OF SPECIAL INTEREST GROUP ON AI CHALLENGES, SIG-CHALLENGE-9907-10, JAPANESE SPCOETY FOR ARTIFICIAL INTELLIGENCE, 2 November 1999 (1999-11-02), pages 61 - 65, XP002938058 *
S. NAKAMURA ET AL.: "Speech recognition with source localization by microphone array", THE HEISEI 7 SPRING MEETING OF THE ACOUSTICAL SOCIETY OF JAPAN, THE ACOUSTICAL SOCIETY OF JAPAN 1.5.8, vol. 1, 14 March 1995 (1995-03-14), pages 15 - 16, XP002938060 *
T. KIKUCHI ET AL.: "Microphone array system with DOA estimation by using harmonic structure of speech signals", IEICE TECHNICAL REPORT (DIGITAL SHINGO SHORI), DSP98-164, THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATIONS ENGINEERS, vol. 98, no. 534, 22 January 1999 (1999-01-22), pages 23 - 28, XP002938059 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12249326B2 (en) 2007-04-13 2025-03-11 St Case1Tech, Llc Method and device for voice operated control
US12183341B2 (en) 2008-09-22 2024-12-31 St Casestech, Llc Personalized sound management and method
US12374332B2 (en) 2008-09-22 2025-07-29 ST Fam Tech, LLC Personalized sound management and method

Similar Documents

Publication Publication Date Title
JP3780516B2 (en) Robot hearing device and robot hearing system
JPWO2001095314A1 (en) Robot hearing device and robot hearing system
DK1530402T4 (en) Method of fitting a hearing aid taking into account the position of the head and a corresponding hearing aid
JP5401760B2 (en) Headphone device, audio reproduction system, and audio reproduction method
EP1715717B1 (en) Moving object equipped with ultra-directional speaker
US20140270316A1 (en) Sound Induction Ear Speaker for Eye Glasses
CN106416292A (en) Methods, circuits, devices, systems, and related computer-executable code for acquiring acoustic signals
AU2003290190A1 (en) Method for processing audio data and sound acquisition device therefor
JP5259622B2 (en) Sound collection device, sound collection method, sound collection program, and integrated circuit
CN103918283A (en) Audio devices having electroactive polymer actuators
JP2008064892A (en) Speech recognition method and speech recognition apparatus using the same
JP7734346B2 (en) Ear-worn device and playback method
JP2001215990A (en) Robot hearing device
EP1113701A2 (en) Differential-pressure microphone
JP2010124435A (en) Device for assisting conversation in vehicle
JP5205993B2 (en) Hearing monitor apparatus and method for vehicles
JP2001215989A (en) Robot hearing system
WO2001057852A1 (en) Robot hearing system
US7016505B1 (en) Robot acoustic device
JP2007180896A (en) Voice signal processor and voice signal processing method
WO2001057853A1 (en) Robot auditory device
JP2001215991A (en) Robot hearing device
JP2011101407A (en) Robot, and sound collection apparatus
AU2021101324A4 (en) A Wearable Sound Locator Device for Hearing Impaired and Deaf People
JP4660740B2 (en) Voice input device for electric wheelchair

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WA Withdrawal of international application