WO2024154626A1 - Electronic apparatus and program

Info

Publication number: WO2024154626A1
Application number: PCT/JP2024/000328
Authority: WO
Inventors: 遥矢 ▲高▼瀬
Original assignee: 京セラ株式会社
Priority date: 2023-01-16
Filing date: 2024-01-10
Publication date: 2024-07-25

Abstract

An electronic apparatus according to the present invention comprises an acquisition unit that acquires information about one or more interlocutor candidates, a detection unit that detects the one or more interlocutor candidates on the basis of the information about the one or more interlocutor candidates, and a control unit that performs prescribed processing for one or more interlocutors selected from the one or more interlocutor candidates but does not perform the prescribed processing for interlocutor candidates not selected as interlocutors.

Description

Electronic devices and programs

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to Japanese Patent Application No. 2023-4631, filed in Japan on January 16, 2023, the entire disclosure of which is incorporated herein by reference.

The present disclosure relates to an electronic device and a program.

In recent years, so-called remote conferences, such as web conferences and video conferences, have become widespread. A remote conference uses electronic devices (or a system including electronic devices) that enable communication among participants in multiple locations. For example, assume that a conference is held in an office and that at least one participant joins the conference remotely from his or her home at a distant location. In this case, audio and/or video of the conference in the office is acquired by, for example, an electronic device installed in the office and transmitted to, for example, an electronic device installed in the participant's home. Likewise, audio and/or video at the participant's home is acquired by, for example, the electronic device installed in the home and transmitted to, for example, the electronic device installed in the office. Such electronic devices allow the participants to communicate with one another without all gathering in the same place.

Various technologies that can be applied to remote conferences such as those described above have been proposed. For example, the robot device disclosed in Patent Document 1 finds, in image information, the face of a person it has learned in advance and, upon determining that the person has called out to it, turns to face the direction of the sound source. Patent Document 2 proposes a technology that generates behavior patterns according to the relative distance from the user and generates different behavior patterns for each artificial creature model even when the same external information is given. Patent Document 3 discloses a robot that uses RFID technology to recognize people in its vicinity or surroundings and communicates with the nearest person. Patent Document 4 discloses a communication robot that approaches nearby people and requests permission for personal identification and, if permission is granted, identifies each person individually and extracts and presents information of common interest.

Patent Document 1: JP 2003-266351 A
Patent Document 2: JP 2004-66367 A
Patent Document 3: JP 2004-216513 A
Patent Document 4: JP 2007-222968 A

An electronic device (first electronic device) according to an embodiment includes:
an acquisition unit that acquires information about at least one interlocutor candidate;
a detection unit that detects the at least one interlocutor candidate based on the information about the at least one interlocutor candidate; and
a control unit that executes a predetermined process for at least one interlocutor selected from the at least one interlocutor candidate, and does not execute the predetermined process for any interlocutor candidate not selected as an interlocutor.

An electronic device (second electronic device) according to an embodiment includes:
an acquisition unit that acquires information about at least one interlocutor candidate;
a selection unit that selects at least one interlocutor from the at least one interlocutor candidate; and
a control unit that executes a predetermined process for the at least one interlocutor, and does not execute the predetermined process for any interlocutor candidate not selected as an interlocutor.

An electronic device (third electronic device) according to an embodiment includes:
an acquisition unit that acquires information about at least one interlocutor candidate from a first electronic device;
a detection unit that detects the at least one interlocutor candidate based on the information about the at least one interlocutor candidate; and
a control unit that controls at least one of the first electronic device and a second electronic device so that a predetermined process is executed for at least one interlocutor selected by the second electronic device from the at least one interlocutor candidate, and is not executed for any interlocutor candidate not selected as an interlocutor.

A program according to an embodiment causes a computer to execute:
a step of acquiring information about at least one interlocutor candidate;
a step of detecting the at least one interlocutor candidate based on the information about the at least one interlocutor candidate; and
a step of executing a predetermined process for at least one interlocutor selected from the at least one interlocutor candidate, and not executing the predetermined process for any interlocutor candidate not selected as an interlocutor.
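As a non-limiting illustration, the three steps of the program may be sketched in Python as follows. The names `Candidate`, `acquire`, `detect`, and `execute_for_selected`, and the assumption that each non-empty observation corresponds to one person, are hypothetical and are not taken from the disclosure; the predetermined process is represented by an arbitrary callback.

```python
from dataclasses import dataclass
from typing import Callable, Iterable


@dataclass(frozen=True)
class Candidate:
    """A detected interlocutor candidate (hypothetical record type)."""
    name: str


def acquire(raw_observations: Iterable[str]) -> list[str]:
    """Step 1: acquire information about interlocutor candidates."""
    return list(raw_observations)


def detect(information: list[str]) -> list[Candidate]:
    """Step 2: detect candidates based on the acquired information."""
    # Assumption: each non-empty observation corresponds to one person.
    return [Candidate(name) for name in information if name]


def execute_for_selected(
    candidates: list[Candidate],
    selected: set[str],
    process: Callable[[Candidate], None],
) -> None:
    """Step 3: run the process only for candidates selected as interlocutors."""
    for candidate in candidates:
        if candidate.name in selected:
            process(candidate)
        # Candidates that were not selected are deliberately skipped.


processed: list[str] = []
info = acquire(["Ma", "Mb", "", "Me"])
found = detect(info)
execute_for_selected(found, {"Ma", "Mb"}, lambda c: processed.append(c.name))
print(processed)
```

In this sketch, candidate Me is detected but receives no processing because Me was not selected as an interlocutor.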

FIG. 1 is a diagram illustrating an example of a usage mode of a system according to an embodiment.
FIG. 2 is a functional block diagram schematically illustrating a configuration of a first electronic device according to an embodiment.
FIG. 3 is a diagram illustrating an example of driving by a drive unit of the first electronic device according to an embodiment.
FIG. 4 is a functional block diagram schematically illustrating a configuration of a second electronic device according to an embodiment.
FIG. 5 is a functional block diagram schematically illustrating a configuration of a third electronic device according to an embodiment.
FIG. 6 is a sequence diagram illustrating a basic operation of a system according to an embodiment.
FIG. 7 is a diagram illustrating an example of the operation of a system according to an embodiment.
FIG. 8 is a diagram illustrating an example of the operation of a system according to an embodiment.
FIG. 9 is a diagram illustrating an example of the operation of a system according to an embodiment.
FIG. 10 is a diagram illustrating an example of the operation of a system according to an embodiment.
FIG. 11 is a sequence diagram illustrating a basic operation of a system according to another embodiment.

In the present disclosure, an "electronic device" may be a device driven by power supplied from, for example, a power grid or a battery. An "information processing device" may be one form of electronic device. An "information processing device" may also be any device in which a computer executes a program to perform a predetermined process, such as a personal computer (PC), a notebook PC, a server, or a smartphone. A "system" may include, for example, at least an electronic device and/or an information processing device. A "user" may be a person (typically a human) who uses or may use an electronic device and/or an information processing device according to an embodiment, as well as a person who uses or may use a system including such an electronic device and/or information processing device. A "user" may also be a person who can benefit from an electronic device, an information processing device, and/or a system according to an embodiment. In the present disclosure, a conference in which at least one participant takes part by communication from a location different from that of the other participants, such as a web conference or a video conference, is collectively referred to as a "remote conference".

Further functional improvements are desired for electronic devices that enable communication, for example in remote conferences, in order to make communication smoother. An object of the present disclosure is to provide an electronic device and a program that facilitate communication. According to an embodiment, an electronic device and a program that facilitate communication can be provided. A system including an electronic device according to an embodiment is described in detail below with reference to the drawings.

FIG. 1 is a diagram illustrating an example of a usage mode of a system according to an embodiment. The following description assumes a situation in which, as shown in FIG. 1, interlocutor Mg remotely participates from his or her home RL in a conference held in a conference room MR. The conference room MR may be a closed space or may be an open space.

As shown in FIG. 1, at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf in the conference room MR may participate in the conference as an interlocutor. Here, it is assumed that the interlocutor candidates Ma, Mb, Mc, and Md are seated around a desk in the conference room MR. It is also assumed that the interlocutor candidates Me and Mf are standing at a place slightly away from the desk (for example, farther from the desk than the interlocutor candidates Ma, Mb, Mc, and Md). The interlocutor candidates who may become participants in the conference in the conference room MR are not limited to the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf, and may include further interlocutor candidates. The interlocutor candidates Me and Mf may be seated. The number of interlocutor candidates in the conference room MR may be any number of at least one. In addition, interlocutors other than interlocutor Mg may also participate in the conference remotely from their respective homes.

In the present disclosure, an "interlocutor" may be a person who is expected to take part in a dialogue and/or a person who is permitted to take part in a dialogue, for example in a situation such as the remote conference shown in FIG. 1. That is, an "interlocutor" may be a person who participates in the conference in such a situation. An "interlocutor candidate" may be a person who has not yet been permitted or set as an interlocutor but who can become one. That is, an "interlocutor candidate" may be a person who may or can participate in the conference in such a situation but has not yet done so. In short, in an embodiment, an "interlocutor" may be permitted and/or set from among the "interlocutor candidates". Here, a "dialogue" may include, for example, a dialogue between interlocutor Mg and at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf.
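The relationship between interlocutor candidates and interlocutors defined above may be modeled, purely as an illustrative sketch, as a role that changes when a candidate is permitted or set as an interlocutor. The class names `Role` and `Conference` and the method names are hypothetical and do not appear in the disclosure.

```python
from enum import Enum


class Role(Enum):
    CANDIDATE = "interlocutor candidate"
    INTERLOCUTOR = "interlocutor"


class Conference:
    """Tracks who is still a candidate and who has become an interlocutor."""

    def __init__(self, candidate_names):
        # Everyone starts as a candidate: able to join, but not yet joined.
        self._roles = {name: Role.CANDIDATE for name in candidate_names}

    def select(self, name):
        """Permit or set a candidate as an interlocutor."""
        if name not in self._roles:
            raise KeyError(f"{name} is not an interlocutor candidate")
        self._roles[name] = Role.INTERLOCUTOR

    def interlocutors(self):
        return sorted(n for n, r in self._roles.items() if r is Role.INTERLOCUTOR)

    def remaining_candidates(self):
        return sorted(n for n, r in self._roles.items() if r is Role.CANDIDATE)


conference = Conference(["Ma", "Mb", "Mc", "Md", "Me", "Mf"])
conference.select("Ma")
conference.select("Mc")
print(conference.interlocutors())
print(conference.remaining_candidates())
```

The sketch enforces that an interlocutor can only be chosen from among the existing candidates, which mirrors the statement that an interlocutor is permitted and/or set from among the interlocutor candidates.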

As shown in FIG. 1, the system according to an embodiment may include, for example, a first electronic device 1, a second electronic device 100, and a third electronic device 300. In FIG. 1, the first electronic device 1, the second electronic device 100, and the third electronic device 300 are each shown only in schematic form. The system according to an embodiment may omit at least one of the first electronic device 1, the second electronic device 100, and the third electronic device 300, and may include devices other than these electronic devices.

The first electronic device 1 according to an embodiment may be installed in the conference room MR. The second electronic device 100 according to an embodiment may be installed in the home RL of interlocutor Mg. The first electronic device 1 and the second electronic device 100 may be configured to be able to communicate with each other. The home RL of interlocutor Mg may be at a location different from that of the conference room MR. The home RL may be far from the conference room MR or may be near it (for example, a room adjacent to the conference room MR). Furthermore, the location of the home RL of interlocutor Mg may be within the conference room MR.

As shown in FIG. 1, the first electronic device 1 according to an embodiment may be connected to the second electronic device 100 according to an embodiment via, for example, a network N. Also as shown in FIG. 1, the third electronic device 300 according to an embodiment may be connected to at least one of the first electronic device 1 and the second electronic device 100 via, for example, the network N. The first electronic device 1 may be connected to the second electronic device 100 by at least one of a wireless connection and a wired connection. The third electronic device 300 may be connected to at least one of the first electronic device 1 and the second electronic device 100 by at least one of a wireless connection and a wired connection. In FIG. 1, dashed lines indicate that the first electronic device 1, the second electronic device 100, and the third electronic device 300 are connected wirelessly and/or by wire via the network N. In an embodiment, the first electronic device 1 and the second electronic device 100 may be included in a remote conference system according to an embodiment. The third electronic device 300 may also be included in the remote conference system according to an embodiment.

In the present disclosure, the network N shown in FIG. 1 may include, as appropriate, devices such as various electronic devices and/or servers. The network N may also include, as appropriate, devices such as base stations and/or repeaters. In the present disclosure, when, for example, the first electronic device 1 and the second electronic device 100 "communicate", they may communicate directly. Alternatively, they may communicate via at least one of another device such as the third electronic device 300, a repeater, and/or a base station. More specifically, when the first electronic device 1 and the second electronic device 100 "communicate", the communication unit of the first electronic device 1 and the communication unit of the second electronic device 100 may communicate with each other.

The above wording applies with the same meaning not only when the first electronic device 1 and the second electronic device 100 "communicate", but also when one "transmits" information to the other and/or when one "receives" information transmitted by the other. Furthermore, the above wording applies with the same meaning not only to the first electronic device 1 and the second electronic device 100, but also when any electronic device, including for example the third electronic device 300, communicates with any other electronic device.
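The two meanings of "communicate" described above, direct communication and communication via an intermediate device, may be illustrated by the following minimal sketch. The function `send` and the device labels are hypothetical; the sketch only enumerates hop-by-hop transfers along an assumed route and does not perform any actual network communication.

```python
def send(payload: str, route: list[str]) -> list[tuple[str, str, str]]:
    """Return the hop-by-hop transfers (sender, receiver, payload) along a route."""
    if len(route) < 2:
        raise ValueError("a route needs at least a sender and a receiver")
    # Pair each device on the route with the next one to form the hops.
    return [(src, dst, payload) for src, dst in zip(route, route[1:])]


# Direct communication between the first and second electronic devices.
direct = send("audio", ["first_device_1", "second_device_100"])

# Communication relayed by another device such as the third electronic device.
relayed = send(
    "audio", ["first_device_1", "third_device_300", "second_device_100"]
)

print(direct)
print(relayed)
```

In both cases the payload leaves the first electronic device and arrives at the second electronic device; only the number of hops differs.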

The first electronic device 1 according to an embodiment may be arranged in the conference room MR, for example as shown in FIG. 1. In this case, the first electronic device 1 may be arranged at a position where it can acquire the audio and/or video of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf. As described later, the first electronic device 1 also outputs the audio and/or video of interlocutor Mg. The first electronic device 1 may therefore be arranged so that the audio and/or video of interlocutor Mg that it outputs reaches at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf. As described later, the first electronic device 1 according to an embodiment may also acquire gaze information of an interlocutor candidate or interlocutor, such as the line of sight of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf, the direction of that line of sight, and/or the movement of that line of sight. The acquisition of gaze information by the first electronic device 1 is described further below.

The second electronic device 100 according to an embodiment may be arranged in the home RL of interlocutor Mg, for example in the manner shown in FIG. 1. In this case, the second electronic device 100 may be arranged at a position where it can acquire the audio and/or video of interlocutor Mg. The second electronic device 100 may acquire the audio and/or video of interlocutor Mg by means of a microphone or headset and/or a camera connected to the second electronic device 100. As described later, the second electronic device 100 according to an embodiment may also acquire gaze information of interlocutor Mg, such as the line of sight of interlocutor Mg, the direction of that line of sight, and/or the movement of that line of sight. The acquisition of gaze information by the second electronic device 100 is described further below.

As described later, the second electronic device 100 also outputs the audio and/or video of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf in the conference room MR. The second electronic device 100 may therefore be arranged so that the audio and/or video that it outputs reaches interlocutor Mg. The audio output from the second electronic device 100 may be delivered to the ears of interlocutor Mg via, for example, headphones, earphones, a speaker, or a headset. The video output from the second electronic device 100 may be presented so as to be visually recognized by interlocutor Mg via, for example, a display.

The third electronic device 300 may be a device, such as a server, that relays between the first electronic device 1 and the second electronic device 100. The system according to an embodiment need not include the third electronic device 300.

FIG. 1 shows merely one example of a usage mode of the first electronic device 1, the second electronic device 100, and the third electronic device 300 according to an embodiment. The first electronic device 1, the second electronic device 100, and the third electronic device 300 according to an embodiment may be used in various other modes.

The remote conference system including the first electronic device 1 and the second electronic device 100 shown in FIG. 1 allows interlocutor Mg, while staying at home RL, to behave as if he or she were participating in the conference held in the conference room MR. The remote conference system also allows at least one interlocutor among the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf to feel as if interlocutor Mg were actually present at the conference held in the conference room MR. That is, in the remote conference system including the first electronic device 1 and the second electronic device 100, the first electronic device 1 arranged in the conference room MR can serve as a kind of avatar of interlocutor Mg. In this case, the first electronic device 1 may function as a physical avatar (such as a telepresence robot or a communication robot) in which the first electronic device 1 itself stands in for interlocutor Mg. Alternatively, the first electronic device 1 may function as a virtual avatar that displays an image of interlocutor Mg or, for example, a character-like image representing interlocutor Mg. The image of interlocutor Mg, or the character-like image of interlocutor Mg, may be presented by, for example, a display provided in the first electronic device 1 itself, an external display, or a 3D hologram projected by the first electronic device 1.

Next, the functional configurations of the first electronic device 1, the second electronic device 100, and the third electronic device 300 according to an embodiment are described in turn.

FIG. 2 is a block diagram schematically illustrating the functional configuration of the first electronic device 1 shown in FIG. 1. An example of the configuration of the first electronic device 1 according to an embodiment is described below. As shown in FIG. 1, the first electronic device 1 may be a device used in the conference room MR by, for example, at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf. The second electronic device 100, described later, has a function of outputting to the first electronic device 1 the audio, video, and/or gaze information of interlocutor Mg that the second electronic device 100 acquires when interlocutor Mg speaks. The first electronic device 1, in turn, has a function of outputting to the second electronic device 100 the audio and/or video that it acquires when at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf speaks. The first electronic device 1 allows at least one interlocutor among the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf in the conference room MR to hold a remote conference or video conference with interlocutor Mg even when interlocutor Mg is at a distant location. The first electronic device 1 is therefore also referred to, as appropriate, as an electronic device "used locally".

The first electronic device 1 according to an embodiment may be configured to reproduce the gaze direction of interlocutor Mg. That is, the first electronic device 1 may be able to perform an operation that simulates the gaze direction of interlocutor Mg. Specifically, the first electronic device 1 can let the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf and others in the conference room MR recognize in which direction interlocutor Mg is looking. For example, the first electronic device 1 can let people around it in the conference room MR recognize whether interlocutor Mg is looking toward interlocutor Ma, toward interlocutor Mb, or toward none of the other interlocutors.
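One conceivable way to decide which person, if any, a reproduced gaze direction points at is sketched below. This is an assumption made for illustration only: the representation of gaze and seating positions as bearings in degrees, the tolerance value, and the names `angular_difference` and `gaze_target` are all hypothetical and are not described in this part of the disclosure.

```python
def angular_difference(a: float, b: float) -> float:
    """Smallest absolute difference between two bearings, in degrees."""
    return abs((a - b + 180.0) % 360.0 - 180.0)


def gaze_target(gaze_bearing, bearings, tolerance=15.0):
    """Return the person nearest to the gaze bearing, or None if nobody is close.

    bearings maps a person's name to the bearing (degrees) at which that
    person is seen from the first electronic device (assumed representation).
    """
    best_name, best_diff = None, tolerance
    for name, bearing in bearings.items():
        diff = angular_difference(gaze_bearing, bearing)
        if diff <= best_diff:
            best_name, best_diff = name, diff
    return best_name


# Hypothetical seating: Ma at 30 degrees, Mb at 90 degrees.
seats = {"Ma": 30.0, "Mb": 90.0}
print(gaze_target(35.0, seats))   # gaze falls near Ma
print(gaze_target(85.0, seats))   # gaze falls near Mb
print(gaze_target(200.0, seats))  # gaze falls near nobody
```

Returning `None` corresponds to the case in which interlocutor Mg is looking toward none of the other interlocutors.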

Various devices are conceivable as the first electronic device 1 according to an embodiment; for example, it may be a purpose-built device. For example, the first electronic device 1 may have a housing whose exterior bears an illustration of a human or the like, or may have a doll-like or robot-like shape that imitates at least part of a human or the like. The first electronic device 1 may also be a general-purpose device such as a smartphone, tablet, phablet, notebook computer (notebook PC or laptop), or desktop computer. The first electronic device 1 may draw an image of at least part of a human, a robot, or the like on, for example, the display of a notebook PC. The first electronic device 1 may also project at least part of a human, a robot, or the like as a 3D hologram. For example, when the first electronic device 1 has a robot-like shape, the gaze direction of interlocutor Mg may be simulated by movement of the robot's eyes and/or head. When the first electronic device 1 includes a display that draws an image of a robot, the gaze direction of interlocutor Mg may be simulated by movement of the eyes and/or head of the drawn robot image.

　図２に示すように、一実施形態に係る第１電子機器１は、制御部１０、記憶部２０、通信部３０、撮像部４０、音声入力部５０、音声出力部６０、表示部７０、駆動部８０、入力部９０、及び視線情報取得部９２などを備えてよい。また、制御部１０は、例えば、取得部１２、検出部１４、選出部１６、及び特定部１８などを含んでもよい。一実施形態において、第１電子機器１は、図２に示す機能部の少なくとも一部を備えなくてもよいし、図２に示す機能部以外の構成要素を備えてもよい。 As shown in FIG. 2, the first electronic device 1 according to one embodiment may include a control unit 10, a storage unit 20, a communication unit 30, an imaging unit 40, an audio input unit 50, an audio output unit 60, a display unit 70, a drive unit 80, an input unit 90, a gaze information acquisition unit 92, and the like. The control unit 10 may also include, for example, an acquisition unit 12, a detection unit 14, a selection unit 16, and an identification unit 18. In one embodiment, the first electronic device 1 may omit at least some of the functional units shown in FIG. 2, or may include components other than the functional units shown in FIG. 2.

　制御部１０は、第１電子機器１を構成する各機能部をはじめとして、第１電子機器１の全体を制御及び／又は管理する機能を有してよい。制御部１０は、種々の機能を実行するための制御及び処理能力を提供するために、例えばＣＰＵ（Central Processing Unit）又はＤＳＰ（Digital Signal Processor）のような、少なくとも１つのプロセッサを含んでよい。制御部１０は、まとめて１つのプロセッサで実現してもよいし、いくつかのプロセッサで実現してもよいし、それぞれ個別のプロセッサで実現してもよい。プロセッサは、単一の集積回路（ＩＣ；Integrated Circuit）として実現されてよい。プロセッサは、複数の通信可能に接続された集積回路及びディスクリート回路として実現されてよい。プロセッサは、他の種々の既知の技術に基づいて実現されてよい。 The control unit 10 may have the function of controlling and/or managing the entire first electronic device 1, including each functional unit constituting the first electronic device 1. The control unit 10 may include at least one processor, such as a CPU (Central Processing Unit) or a DSP (Digital Signal Processor), to provide control and processing power for executing various functions. The control unit 10 may be realized as a single processor, or as several processors, or as individual processors. The processor may be realized as a single integrated circuit (IC). The processor may be realized as multiple communicatively connected integrated circuits and discrete circuits. The processor may be realized based on various other known technologies.

　制御部１０は、１以上のプロセッサ及びメモリを含んでもよい。プロセッサは、特定のプログラムを読み込ませて特定の機能を実行する汎用のプロセッサ、及び特定の処理に特化した専用のプロセッサを含んでよい。専用のプロセッサは、特定用途向けＩＣ（ＡＳＩＣ；Application Specific Integrated Circuit）を含んでよい。プロセッサは、プログラマブルロジックデバイス（ＰＬＤ；Programmable Logic Device）を含んでよい。ＰＬＤは、ＦＰＧＡ（Field-Programmable Gate Array）を含んでよい。制御部１０は、１つ又は複数のプロセッサが協働するＳｏＣ（System-on-a-Chip）、及びＳｉＰ（System In a Package）のいずれかであってもよい。制御部１０は、第１電子機器１の各構成要素の動作を制御する。 The control unit 10 may include one or more processors and memories. The processor may include a general-purpose processor that loads a specific program to execute a specific function, and a dedicated processor specialized for a specific process. The dedicated processor may include an application specific integrated circuit (ASIC). The processor may include a programmable logic device (PLD). The PLD may include a field-programmable gate array (FPGA). The control unit 10 may be either a system-on-a-chip (SoC) or a system in a package (SiP) in which one or more processors work together. The control unit 10 controls the operation of each component of the first electronic device 1.

　制御部１０は、例えば、ソフトウェア及びハードウェア資源の少なくとも一方を含んで構成されてよい。また、一実施形態に係る第１電子機器１において、制御部１０は、ソフトウェアとハードウェア資源とが協働した具体的手段によって構成されてもよい。また、一実施形態に係る第１電子機器１において、他の機能部の少なくともいずれかも、ソフトウェアとハードウェア資源とが協働した具体的手段によって構成されてもよい。 The control unit 10 may be configured to include, for example, at least one of software and hardware resources. Furthermore, in the first electronic device 1 according to one embodiment, the control unit 10 may be configured by specific means in which software and hardware resources work together. Furthermore, in the first electronic device 1 according to one embodiment, at least one of the other functional units may also be configured by specific means in which software and hardware resources work together.

　一実施形態に係る第１電子機器１において、制御部１０が行う制御などの動作については、さらに後述する。また、制御部１０の取得部１２は、各種の取得処理を行うことができる。検出部１４は、各種の検出処理を行うことができる。選出部１６は、各種の選出処理を行うことができる。特定部１８は、各種の特定処理を行うことができる。これらの各機能部が行う動作についても、さらに後述する。 Operations such as control performed by the control unit 10 in the first electronic device 1 according to one embodiment will be described further below. The acquisition unit 12 of the control unit 10 can perform various acquisition processes. The detection unit 14 can perform various detection processes. The selection unit 16 can perform various selection processes. The identification unit 18 can perform various identification processes. The operations performed by each of these functional units will also be described further below.

　記憶部２０は、各種の情報を記憶するメモリとしての機能を有してよい。記憶部２０は、例えば制御部１０において実行されるプログラム、及び、制御部１０において実行された処理の結果などを記憶してよい。また、記憶部２０は、制御部１０のワークメモリとして機能してもよい。図２に示すように、記憶部２０は、制御部１０に有線及び／又は無線で接続されてよい。記憶部２０は、例えば、ＲＡＭ（Random Access Memory）及びＲＯＭ（Read Only Memory）の少なくとも一方を含んでもよい。記憶部２０は、例えば半導体メモリ等により構成することができるが、これに限定されず、任意の記憶装置とすることができる。例えば、記憶部２０は、一実施形態に係る第１電子機器１に挿入されたメモリカードのような記憶媒体としてもよい。また、記憶部２０は、制御部１０として用いられるＣＰＵの内部メモリであってもよいし、制御部１０に別体として接続されるものとしてもよい。 The storage unit 20 may function as a memory that stores various information. The storage unit 20 may store, for example, a program executed in the control unit 10 and the results of processing executed in the control unit 10. The storage unit 20 may also function as a work memory for the control unit 10. As shown in FIG. 2, the storage unit 20 may be connected to the control unit 10 by wire and/or wirelessly. The storage unit 20 may include, for example, at least one of a RAM (Random Access Memory) and a ROM (Read Only Memory). The storage unit 20 may be configured, for example, by a semiconductor memory or the like, but is not limited to this, and may be any storage device. For example, the storage unit 20 may be a storage medium such as a memory card inserted into the first electronic device 1 according to one embodiment. The storage unit 20 may also be an internal memory of a CPU used as the control unit 10, or may be connected to the control unit 10 as a separate unit.

　通信部３０は、例えば外部の機器などと無線及び／又は有線により通信するためのインタフェースの機能を有する。一実施形態の通信部３０によって行われる通信方式は、無線通信規格としてよい。例えば、無線通信規格は、２Ｇ、３Ｇ、４Ｇ、及び５Ｇ等のセルラーフォンの通信規格を含む。例えば、セルラーフォンの通信規格は、ＬＴＥ（Long Term Evolution）、Ｗ－ＣＤＭＡ（Wideband Code Division Multiple Access）、ＣＤＭＡ２０００、ＰＤＣ（Personal Digital Cellular）、ＧＳＭ（登録商標）（Global System for Mobile communications）、及びＰＨＳ（Personal Handy-phone System）等を含む。例えば、無線通信規格は、ＷｉＭＡＸ（Worldwide Interoperability for Microwave Access）、ＩＥＥＥ８０２．１１、ＷｉＦｉ、Ｂｌｕｅｔｏｏｔｈ（登録商標）、ＩｒＤＡ（Infrared Data Association）、及びＮＦＣ（Near Field Communication）等を含む。通信部３０は、例えばＩＴＵ－Ｔ(International Telecommunication Union Telecommunication Standardization Sector)において通信方式が標準化されたモデムを含んでよい。通信部３０は、上記の通信規格の１つ又は複数をサポートすることができる。 The communication unit 30 has an interface function for wireless and/or wired communication with, for example, an external device. The communication method performed by the communication unit 30 in one embodiment may be a wireless communication standard. For example, the wireless communication standard includes cellular phone communication standards such as 2G, 3G, 4G, and 5G. For example, the cellular phone communication standards include LTE (Long Term Evolution), W-CDMA (Wideband Code Division Multiple Access), CDMA2000, PDC (Personal Digital Cellular), GSM (Registered Trademark) (Global System for Mobile communications), and PHS (Personal Handy-phone System), etc. For example, wireless communication standards include WiMAX (Worldwide Interoperability for Microwave Access), IEEE 802.11, WiFi, Bluetooth (registered trademark), IrDA (Infrared Data Association), and NFC (Near Field Communication). The communication unit 30 may include, for example, a modem whose communication method is standardized by ITU-T (International Telecommunication Union Telecommunication Standardization Sector). The communication unit 30 can support one or more of the above communication standards.

　通信部３０は、例えば電波を送受信するアンテナ及び適当なＲＦ部などを含めて構成してよい。通信部３０は、例えばアンテナを介して、例えば他の電子機器の通信部と無線通信してもよい。通信部３０は、第１電子機器１から他の機器に任意の情報を送信する機能、及び／又は、第１電子機器１において他の機器から任意の情報を受信する機能を備えてよい。例えば、通信部３０は、図１に示した第２電子機器１００と無線通信してよい。この場合、通信部３０は、第２電子機器１００の通信部１３０（後述）と無線通信してよい。このように、一実施形態において、通信部３０は、第２電子機器１００と通信する機能を有する。また、例えば、通信部３０は、図１に示した第３電子機器３００と無線通信してよい。この場合、通信部３０は、第３電子機器３００の通信部３３０（後述）と無線通信してよい。このように、一実施形態において、通信部３０は、第３電子機器３００と通信する機能を有してよい。また、通信部３０は、外部に有線接続するためのコネクタなどのようなインタフェースとして構成してもよい。通信部３０は、無線通信を行うための既知の技術により構成することができるため、より詳細なハードウェアなどの説明は省略する。 The communication unit 30 may be configured to include, for example, an antenna for transmitting and receiving radio waves and an appropriate RF unit. The communication unit 30 may wirelessly communicate with, for example, a communication unit of another electronic device via an antenna. The communication unit 30 may have a function of transmitting any information from the first electronic device 1 to another device, and/or a function of receiving any information from another device in the first electronic device 1. For example, the communication unit 30 may wirelessly communicate with the second electronic device 100 shown in FIG. 1. In this case, the communication unit 30 may wirelessly communicate with a communication unit 130 (described later) of the second electronic device 100. Thus, in one embodiment, the communication unit 30 has a function of communicating with the second electronic device 100. Also, for example, the communication unit 30 may wirelessly communicate with the third electronic device 300 shown in FIG. 1. In this case, the communication unit 30 may wirelessly communicate with a communication unit 330 (described later) of the third electronic device 300. Thus, in one embodiment, the communication unit 30 may have a function of communicating with the third electronic device 300. The communication unit 30 may also be configured as an interface such as a connector for wired connection to the outside. The communication unit 30 can be configured using known technology for wireless communication, so a detailed description of the hardware and the like is omitted.

　図２に示すように、通信部３０は、制御部１０に有線及び／又は無線で接続されてよい。通信部３０が受信する各種の情報は、例えば記憶部２０及び／又は制御部１０に供給されてよい。通信部３０が受信する各種の情報は、例えば制御部１０に内蔵されたメモリに記憶してもよい。また、通信部３０は、例えば制御部１０による処理結果、及び／又は、記憶部２０に記憶された情報などを外部に送信してもよい。 As shown in FIG. 2, the communication unit 30 may be connected to the control unit 10 via a wired and/or wireless connection. Various information received by the communication unit 30 may be supplied to, for example, the storage unit 20 and/or the control unit 10. Various information received by the communication unit 30 may be stored in, for example, a memory built into the control unit 10. Furthermore, the communication unit 30 may transmit, for example, the results of processing by the control unit 10 and/or information stored in the storage unit 20 to the outside.

　撮像部４０は、例えばデジタルカメラのような、電子的に画像を撮像するイメージセンサを含んで構成されてよい。撮像部４０は、ＣＣＤ（Charge Coupled Device Image Sensor）又はＣＭＯＳ（Complementary Metal Oxide Semiconductor）センサ等のように、光電変換を行う撮像素子を含んで構成されてよい。撮像部４０は、例えば第１電子機器１の周囲の画像を撮像することができる。撮像部４０は、例えば図１に示す会議室ＭＲ内の様子を撮像してよい。一実施形態において、撮像部４０は、例えば図１に示す会議室ＭＲにおいて行われる会議の対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどを撮像してよい。 The imaging unit 40 may be configured to include an image sensor that captures images electronically, as in a digital camera. The imaging unit 40 may be configured to include an imaging element that performs photoelectric conversion, such as a CCD (Charge Coupled Device Image Sensor) or a CMOS (Complementary Metal Oxide Semiconductor) sensor. The imaging unit 40 can capture, for example, an image of the surroundings of the first electronic device 1. The imaging unit 40 may capture, for example, an image of the inside of the conference room MR shown in FIG. 1. In one embodiment, the imaging unit 40 may capture images of, for example, interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf taking part in a conference held in the conference room MR shown in FIG. 1.

　撮像部４０は、特定の方向を中心とした所定の範囲の画角を有する映像を撮像するように構成されてよい。例えば、一実施形態に係る撮像部４０は、図１において、対話者候補Ｍｂを中心とする映像であって、対話者候補Ｍａ及び／又は対話者候補Ｍｄなどが画角に含まれない映像を撮像してもよい。また、撮像部４０は、例えば水平方向などの全方位（例えば３６０度）の映像を同時に撮像するように構成されてもよい。例えば、一実施形態に係る撮像部４０は、図１において、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどがいずれも含まれる全方位映像を撮像してもよい。 The imaging unit 40 may be configured to capture an image having a predetermined range of angle of view centered on a specific direction. For example, the imaging unit 40 according to one embodiment may capture an image centered on interlocutor candidate Mb in FIG. 1, and in which interlocutor candidate Ma and/or interlocutor candidate Md are not included in the angle of view. The imaging unit 40 may also be configured to simultaneously capture images in all directions (e.g., 360 degrees), such as the horizontal direction. For example, the imaging unit 40 according to one embodiment may capture an omnidirectional image in FIG. 1 that includes interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf.

　撮像部４０は、撮像した画像を信号に変換して、制御部１０に送信してよい。このため、撮像部４０は、制御部１０に有線及び／又は無線で接続されてよい。また、撮像部４０によって撮像された画像に基づく信号は、記憶部２０、及び／又は表示部７０など、第１電子機器１の任意の機能部に供給されてもよい。撮像部４０は、図１に示す会議室ＭＲ内の様子を撮像するものであれば、デジタルカメラのような撮像デバイスに限定されず、任意のデバイスとしてよい。 The imaging unit 40 may convert the captured image into a signal and transmit it to the control unit 10. For this reason, the imaging unit 40 may be connected to the control unit 10 via a wired and/or wireless connection. Furthermore, a signal based on the image captured by the imaging unit 40 may be supplied to any functional unit of the first electronic device 1, such as the storage unit 20 and/or the display unit 70. The imaging unit 40 is not limited to an imaging device such as a digital camera, and may be any device that captures an image of the inside of the conference room MR shown in FIG. 1.

　一実施形態において、撮像部４０は、例えば会議室ＭＲ内の様子を所定時間ごと（例えば秒間１５フレームなど）の静止画として撮像してもよい。また、一実施形態において、撮像部４０は、例えば会議室ＭＲ内の様子を連続した動画として撮像してもよい。さらに、撮像部４０は、定点カメラを含んで構成してもよいし、可動式のカメラを含んで構成してもよい。 In one embodiment, the imaging unit 40 may capture images of the state inside the conference room MR as still images at predetermined time intervals (e.g., 15 frames per second). Also, in one embodiment, the imaging unit 40 may capture images of the state inside the conference room MR as a continuous video. Furthermore, the imaging unit 40 may be configured to include a fixed camera, or may be configured to include a movable camera.

　音声入力部５０は、人が発する声を含む、第１電子機器１の周囲の音又は音声を検出（取得）する。例えば、音声入力部５０は、音又は音声を空気振動として例えばダイヤフラムなどで検出したものを電気信号に変換するものとしてよい。具体的には、音声入力部５０は、任意のマイク（マイクロフォン）のような音を電気信号に変換する音響機器を含んで構成されてよい。一実施形態において、音声入力部５０は、例えば図１に示した会議室ＭＲにおける対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかの音声を検出（取得）してよい。音声入力部５０によって検出された音声（電気信号）は、例えば制御部１０に入力されてよい。このため、音声入力部５０は、制御部１０に有線及び／又は無線で接続されてよい。 The audio input unit 50 detects (acquires) sounds or voices around the first electronic device 1, including human voices. For example, the audio input unit 50 may detect sounds or voices as air vibrations with, for example, a diaphragm, and convert them into an electrical signal. Specifically, the audio input unit 50 may include an acoustic device that converts sound into an electrical signal, such as any microphone. In one embodiment, the audio input unit 50 may detect (acquire) the voice of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf in the conference room MR shown in FIG. 1, for example. The voice (electrical signal) detected by the audio input unit 50 may be input to the control unit 10, for example. For this reason, the audio input unit 50 may be connected to the control unit 10 by wire and/or wirelessly.

　一実施形態において、音声入力部５０は、例えば、ステレオマイクロホン又はマイクロホンアレイなどを含んで構成されてもよい。ステレオマイクロホン又はマイクロホンアレイのように複数チャンネルを含む音声入力部５０によれば、音源の方向及び／又は音源の位置などを特定（又は推定）することができる。このような音声入力部５０によれば、例えば会議室ＭＲにおいて検出される音が、音声入力部５０を備える第１電子機器１を基準として、どの方向及び／又は位置に存在する音源から発された音なのか、特定（又は推定）することができる。 In one embodiment, the audio input unit 50 may be configured to include, for example, a stereo microphone or a microphone array. An audio input unit 50 with multiple channels, such as a stereo microphone or a microphone array, makes it possible to identify (or estimate) the direction and/or position of a sound source. With such an audio input unit 50, it is possible to identify (or estimate) the direction and/or position, relative to the first electronic device 1 equipped with the audio input unit 50, of the source of a sound detected in, for example, the conference room MR.
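One common way to estimate a sound source direction from two channels is to measure the time difference of arrival (TDOA) between them. The following is a minimal sketch of that idea, not the method of the disclosure: it assumes a two-microphone (stereo) arrangement, a far-field source, and a sound speed of 343 m/s, and the names `best_lag` and `arrival_angle_deg` are hypothetical.

```python
import math
from typing import Sequence

SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at room temperature


def best_lag(left: Sequence[float], right: Sequence[float], max_lag: int) -> int:
    """Return the lag (in samples) maximising sum(left[n] * right[n + lag]).

    A positive lag means the right channel is delayed relative to the left,
    i.e. the sound reached the left microphone first.
    """
    best, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n, sample in enumerate(left):
            m = n + lag
            if 0 <= m < len(right):
                score += sample * right[m]
        if score > best_score:
            best, best_score = lag, score
    return best


def arrival_angle_deg(lag_samples: int, sample_rate_hz: float,
                      mic_spacing_m: float) -> float:
    """Convert a lag into an angle from straight ahead (positive = toward left mic)."""
    delay_s = lag_samples / sample_rate_hz
    ratio = SPEED_OF_SOUND_M_S * delay_s / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # guard against measurement noise
    return math.degrees(math.asin(ratio))


# Toy signals: the same impulse arrives two samples later on the right channel.
left = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0]
right = [0.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0]
lag = best_lag(left, right, max_lag=3)
angle = arrival_angle_deg(lag, sample_rate_hz=48000.0, mic_spacing_m=0.1)
print(lag, round(angle, 1))
```

A microphone array with more than two channels could repeat this pairwise estimate to resolve position as well as direction; that extension is not shown here.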

　音声入力部５０は、取得した音又は音声を電気信号に変換して、制御部１０に供給してよい。また、音声入力部５０は、音又は音声が変換された電気信号（音声信号）を、記憶部２０など、第１電子機器１の機能部に供給してもよい。音声入力部５０は、図１に示す会議室ＭＲ内の音又は音声を検出（取得）するものであれば、任意のデバイスとしてよい。 The audio input unit 50 may convert the acquired sound or voice into an electrical signal and supply it to the control unit 10. The audio input unit 50 may also supply the electrical signal (audio signal) into which the sound or voice has been converted to a functional unit of the first electronic device 1, such as the storage unit 20. The audio input unit 50 may be any device that detects (acquires) sound or voice within the conference room MR shown in FIG. 1.

　音声出力部６０は、制御部１０から供給される音又は音声の電気信号（音声信号）を音に変換することにより、当該音声信号を音又は音声として出力する。音声出力部６０は、制御部１０に有線及び／又は無線で接続されてよい。音声出力部６０は、任意のスピーカ（ラウドスピーカ）などの音を出力する機能を有するデバイスを含めて構成されてよい。一実施形態において、音声出力部６０は、特定の方向に音を伝達する指向性スピーカを含んで構成されてもよい。また、音声出力部６０は、音の指向性を変更可能に構成されていてもよい。音声出力部６０は、電気信号（音声信号）を適宜増幅する増幅器又は増幅回路などを含んでもよい。 The audio output unit 60 converts an electrical signal (audio signal) of sound or voice supplied from the control unit 10 into sound, and outputs the audio signal as sound or voice. The audio output unit 60 may be connected to the control unit 10 by wire and/or wirelessly. The audio output unit 60 may be configured to include a device having a function of outputting sound, such as an arbitrary speaker (loudspeaker). In one embodiment, the audio output unit 60 may be configured to include a directional speaker that transmits sound in a specific direction. The audio output unit 60 may also be configured to be able to change the directionality of the sound. The audio output unit 60 may include an amplifier or an amplification circuit that appropriately amplifies the electrical signal (audio signal).

　一実施形態において、音声出力部６０は、通信部３０が第２電子機器１００から受信する音声信号を増幅してよい。ここで、第２電子機器１００から受信する音声信号とは、例えば、発話している（発話中の）発話者（例えば図１に示した対話者Ｍｇ）の第２電子機器１００から通信部３０が受信する、当該発話者の音声信号としてよい。すなわち、音声出力部６０は、発話者（例えば図１に示した対話者Ｍｇ）の音声信号を、当該発話者の音声として出力してよい。 In one embodiment, the audio output unit 60 may amplify the audio signal that the communication unit 30 receives from the second electronic device 100. Here, the audio signal received from the second electronic device 100 may be, for example, the audio signal of a speaker (e.g., interlocutor Mg shown in FIG. 1) who is speaking (currently speaking) that is received by the communication unit 30 from the second electronic device 100 of that speaker. In other words, the audio output unit 60 may output the audio signal of a speaker (e.g., interlocutor Mg shown in FIG. 1) as the voice of that speaker.

　表示部７０は、例えば、液晶ディスプレイ（Liquid Crystal Display：ＬＣＤ）、有機ＥＬディスプレイ（Organic Electro-Luminescence panel）、又は無機ＥＬディスプレイ（Inorganic Electro-Luminescence panel）等の任意の表示デバイスとしてよい。また、表示部７０は、例えば、３Ｄホログラムを投影するプロジェクタなどであってもよい。表示部７０は、文字、図形、又は記号等の各種の情報を表示してよい。また、表示部７０は、例えば第１電子機器１の操作をユーザに促すために、種々のＧＵＩを構成するオブジェクト及び／又はアイコン画像などを表示してもよい。 The display unit 70 may be any display device, such as a Liquid Crystal Display (LCD), an Organic Electro-Luminescence panel, or an Inorganic Electro-Luminescence panel. The display unit 70 may also be, for example, a projector that projects a 3D hologram. The display unit 70 may display various types of information, such as characters, figures, or symbols. The display unit 70 may also display objects and/or icon images that constitute various GUIs, for example, to prompt the user to operate the first electronic device 1.

　表示部７０は、例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかの指又はスタイラスの接触による入力を検出するタッチパネルの機能を備えたタッチスクリーンディスプレイとしてもよい。 The display unit 70 may be, for example, a touch screen display equipped with a touch panel function that detects input by contact with a finger or stylus of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf.

　表示部７０において表示を行うために必要な各種データは、例えば制御部１０又は記憶部２０などから供給されてよい。このため、表示部７０は、制御部１０などに有線及び／又は無線で接続されてよい。また、表示部７０は、例えばＬＣＤなどを含む場合、適宜、バックライトなどを含んで構成されてもよい。 Various data necessary for display on the display unit 70 may be supplied from, for example, the control unit 10 or the storage unit 20. For this reason, the display unit 70 may be connected to the control unit 10 or the like by wire and/or wirelessly. Furthermore, when the display unit 70 includes, for example, an LCD, it may be configured to include a backlight or the like as appropriate.

　一実施形態において、表示部７０は、第２電子機器１００から送信される映像信号に基づく映像を表示してよい。後述のように、第２電子機器１００は、例えば図１に示した対話者Ｍｇの音声、映像、及び／又は視線の情報を取得して、第１電子機器１に出力する。そこで、第１電子機器１の制御部１０は、第２電子機器１００から取得した情報に基づく映像及び／又は画像などを表示部７０に表示してよい。例えば、表示部７０は、制御部１０から入力される対話者Ｍｇの映像及び／又は視線の情報に基づいて、対話者Ｍｇの視線の向きを表現した映像を表示してもよい。第１電子機器１の表示部７０に対話者Ｍｇの視線の向きを表現した映像が表示されることにより、例えば図１に示す対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどは、会議室ＭＲから離れた場所にいる対話者Ｍｇの視線の様子を、視覚的に知ることができる。 In one embodiment, the display unit 70 may display an image based on a video signal transmitted from the second electronic device 100. As described later, the second electronic device 100 acquires, for example, the voice, video, and/or gaze information of the interlocutor Mg shown in FIG. 1 and outputs it to the first electronic device 1. The control unit 10 of the first electronic device 1 may then display, on the display unit 70, a video and/or image based on the information acquired from the second electronic device 100. For example, the display unit 70 may display an image representing the gaze direction of the interlocutor Mg based on the video and/or gaze information of the interlocutor Mg input from the control unit 10. By displaying an image representing the gaze direction of the interlocutor Mg on the display unit 70 of the first electronic device 1, for example, interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf shown in FIG. 1 can visually know the gaze state of the interlocutor Mg who is located away from the conference room MR.

　表示部７０は、例えば第２電子機器１００によって撮像された対話者Ｍｇの映像をそのまま表示してもよい。一方、表示部７０は、例えば対話者Ｍｇの視線の向きを表現したキャラクタの画像（例えばアバター又はロボットの視線など）を表示してもよい。表示部７０は、第２電子機器１００のユーザの視線の向きを、映像によって表現してよい。また、表示部７０は、第２電子機器１００のユーザの視線の向き及び／又は視線の動きなどを、映像によって表現してもよい。このように、一実施形態に係る第１電子機器１は、第２電子機器１００のユーザの視線及び／又は当該視線の向きを映像によって表現する表示部７０を備えてもよい。 The display unit 70 may directly display an image of the interlocutor Mg captured by the second electronic device 100, for example. On the other hand, the display unit 70 may display an image of a character (e.g., the gaze of an avatar or robot) that represents the direction of the gaze of the interlocutor Mg, for example. The display unit 70 may represent the gaze direction of the user of the second electronic device 100 by an image. The display unit 70 may also represent the gaze direction and/or gaze movement of the user of the second electronic device 100 by an image. In this way, the first electronic device 1 according to one embodiment may include a display unit 70 that represents the gaze and/or gaze direction of the user of the second electronic device 100 by an image.

　駆動部８０は、第１電子機器１における所定の可動部を駆動する。駆動部８０は、第１電子機器１における任意の可動部を駆動するサーボモータなどの動力源を含んで構成されてよい。駆動部８０は、制御部１０の制御によって、第１電子機器１における任意の可動部を駆動してよい。このため、駆動部８０は、制御部１０に有線及び／又は無線で接続されてよい。 The drive unit 80 drives a predetermined movable part of the first electronic device 1. The drive unit 80 may be configured to include a power source, such as a servo motor, that drives any movable part of the first electronic device 1. The drive unit 80 may drive any movable part of the first electronic device 1 under the control of the control unit 10. For this reason, the drive unit 80 may be connected to the control unit 10 by wire and/or wirelessly.

　一実施形態において、駆動部８０は、例えば第１電子機器１の筐体の少なくとも一部を駆動してよい。また、駆動部８０は、例えば第１電子機器１が人間などの少なくとも一部を模した人形のような形状又はロボットのような形状を有する場合、人形又はロボットの少なくとも一部を駆動してもよい。特に、駆動部８０は、第１電子機器１が人間の顔の少なくとも一部を模したような形状又はロボットの顔のような形状を有する場合、対話者Ｍｇの視線、視線の向き、及び／又は、視線の動きなどを、人形又はロボットの物理的な構成（形態）及び／又は動きによって表現してよい。 In one embodiment, the drive unit 80 may drive, for example, at least a part of the housing of the first electronic device 1. Furthermore, when the first electronic device 1 has, for example, a doll-like shape imitating at least a part of a human or the like, or a robot-like shape, the drive unit 80 may drive at least a part of the doll or robot. In particular, when the first electronic device 1 has a shape imitating at least a part of a human face or a shape like a robot face, the drive unit 80 may express the gaze, gaze direction, and/or gaze movement of interlocutor Mg by the physical configuration (form) and/or movement of the doll or robot.

　後述のように、第２電子機器１００は、例えば図１に示した対話者Ｍｇの音声、映像、及び／又は視線の情報を（視線情報取得部１９２によって）取得して、第１電子機器１に出力する。そこで、駆動部８０は、第１電子機器１から入力される対話者Ｍｇの映像及び／又は視線の情報に基づいて、対話者Ｍｇの映像の視線を、物理的な構成（形態）及び／又は動きによって表現してもよい。第１電子機器１の駆動部８０が対話者Ｍｇの視線を表現することにより、例えば図１に示す対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどは、会議室ＭＲから離れた場所にいる対話者Ｍｇの視線の様子を、視覚的に知ることができる。 As described below, the second electronic device 100 acquires, for example, the voice, video, and/or gaze information of interlocutor Mg shown in FIG. 1 (by the gaze information acquisition unit 192) and outputs it to the first electronic device 1. The drive unit 80 may then express the gaze of interlocutor Mg in the video by a physical configuration (form) and/or movement, based on the video and/or gaze information of interlocutor Mg input from the first electronic device 1. Because the drive unit 80 of the first electronic device 1 expresses the gaze of interlocutor Mg, interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf shown in FIG. 1, for example, can see where interlocutor Mg, who is in a location away from the conference room MR, is looking.

　駆動部８０は、例えば第２電子機器１００によって撮像された対話者Ｍｇの視線の向き及び／又は動きを、そのまま再現してもよい。一方、駆動部８０は、例えば対話者Ｍｇの視線の向き及び／又は動きを、第１電子機器１が有する人形又はロボットの形状によって表現してもよい。駆動部８０は、第２電子機器１００のユーザの視線、当該視線の向き、及び／又は、当該視線の動きなどを、物理的な構成（形態）及び／又は動きによって表現してもよい。一例として、第１電子機器１がロボットの形状を有する場合に、ロボットの目を動かす、及び／又は首を動かす等によって、第２電子機器１００のユーザの視線の向き及び／又は視線の動きを表現してよい。このように、一実施形態に係る第１電子機器１は、第２電子機器１００のユーザの視線及び／又は当該視線の向きを機械的構造の駆動によって表現する駆動部８０を備えてもよい。 The drive unit 80 may reproduce as is, for example, the direction and/or movement of the gaze of interlocutor Mg captured by the second electronic device 100. Alternatively, the drive unit 80 may express, for example, the direction and/or movement of the gaze of interlocutor Mg through the doll or robot shape of the first electronic device 1. The drive unit 80 may express the gaze, the direction of the gaze, and/or the movement of the gaze of the user of the second electronic device 100 by a physical configuration (form) and/or movement. As an example, when the first electronic device 1 has the shape of a robot, the direction and/or movement of the gaze of the user of the second electronic device 100 may be expressed by moving the eyes and/or the neck of the robot. In this way, the first electronic device 1 according to one embodiment may include a drive unit 80 that expresses the gaze and/or the gaze direction of the user of the second electronic device 100 by driving a mechanical structure.

　図３は、一実施形態に係る第１電子機器１における駆動部８０による動作の例を説明する図である。 FIG. 3 is a diagram illustrating an example of the operation of the drive unit 80 in the first electronic device 1 according to one embodiment.

　図３に示すように、一実施形態において、駆動部８０は、人形又はロボットの形状を有する第１電子機器１における駆動軸α、β、γ、δ、ε、及びζの少なくともいずれかを中心とする駆動を実現してよい。例えば、駆動部８０は、第１電子機器１における駆動軸αを中心とする駆動を行うことにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の否定的な動作（首を左右に振る動作）を表現してよい。また、例えば、駆動部８０は、第１電子機器１における駆動軸βを中心とする駆動を行うことにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の肯定的な動作（頷く動作）を表現してよい。また、例えば、駆動部８０は、第１電子機器１における駆動軸γを中心とする駆動を行うことにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）が態度を決めかねるような動作（首をかしげる動作）を表現してよい。また、例えば、駆動部８０は、第１電子機器１における駆動軸δを中心とする駆動を行うことにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の否定的な動作又は拒絶を示す動作（身体を左右に振る動作）を表現してよい。また、例えば、駆動部８０は、第１電子機器１における駆動軸εを中心とする駆動を行うことにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）が礼儀を示す動作（お辞儀をする動作）を表現してよい。また、例えば、駆動部８０は、第１電子機器１における駆動軸ζを中心とする駆動を行うことにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の動作を表現してもよい。 As shown in FIG. 3, in one embodiment, the drive unit 80 may realize driving about at least one of the drive axes α, β, γ, δ, ε, and ζ in the first electronic device 1 having the shape of a doll or robot. For example, the drive unit 80 may express a negative gesture (shaking the head from side to side) of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving about the drive axis α in the first electronic device 1. Also, for example, the drive unit 80 may express an affirmative gesture (nodding) of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving about the drive axis β in the first electronic device 1. Also, for example, the drive unit 80 may express a gesture of indecision (tilting the head) of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving about the drive axis γ in the first electronic device 1. Also, for example, the drive unit 80 may express a negative gesture or a gesture of rejection (shaking the body from side to side) of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving about the drive axis δ in the first electronic device 1. Also, for example, the drive unit 80 may express a gesture of courtesy (bowing) of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving about the drive axis ε in the first electronic device 1. Also, for example, the drive unit 80 may express a movement of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving about the drive axis ζ in the first electronic device 1.
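The correspondence between gestures and drive axes described above can be summarised as a lookup table. The sketch below is illustrative only: the gesture labels, the axis names spelled out in Latin letters, the keyframe format, and the function `plan_motion` are assumptions made for this example, and axis ζ is left unmapped because the description does not tie it to a specific gesture.

```python
from typing import Dict, List, Tuple

# Gesture-to-axis mapping taken from the description of FIG. 3.
GESTURE_TO_AXIS: Dict[str, str] = {
    "shake_head": "alpha",   # negative gesture
    "nod": "beta",           # affirmative gesture
    "tilt_head": "gamma",    # indecision
    "shake_body": "delta",   # negative gesture or rejection
    "bow": "epsilon",        # courtesy
}

# Gestures modelled here as a back-and-forth swing (an assumption).
OSCILLATING = {"shake_head", "nod", "shake_body"}


def plan_motion(gesture: str, amplitude_deg: float = 20.0,
                cycles: int = 2) -> List[Tuple[str, float]]:
    """Return (axis, target angle) keyframes for a gesture, ending at rest (0 degrees)."""
    axis = GESTURE_TO_AXIS[gesture]  # raises KeyError for an unknown gesture
    if gesture in OSCILLATING:
        angles = [amplitude_deg, -amplitude_deg] * cycles
    else:
        angles = [amplitude_deg]
    angles.append(0.0)  # return to the neutral pose
    return [(axis, angle) for angle in angles]


print(plan_motion("bow", amplitude_deg=30.0))
print(plan_motion("shake_head", amplitude_deg=10.0, cycles=1))
```

In a real device the keyframes would be handed to whatever servo controller the drive unit uses; that interface is outside the scope of this sketch.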

　また、一実施形態において、駆動部８０は、図３に示す第１電子機器１の顔部分Ｆｃにおける目Ｅ１及び／又は目Ｅ２の動き、すなわち第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線を表現してもよい。この場合、駆動部８０は、第１電子機器１の顔部分Ｆｃにおける目Ｅ１及び目Ｅ２の少なくとも一方を駆動することにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線を表現してよい。一実施形態において、駆動部８０は、第１電子機器１の顔部分Ｆｃにおける目Ｅ１及び目Ｅ２の少なくとも一方の動きを駆動することにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線を表現してよい。具体的には、駆動部８０は、例えば、第１電子機器１の顔部分Ｆｃにおける目Ｅ１及び目Ｅ２の少なくとも一方を、図３に示す矢印のいずれかの方向に動かすようにして、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線を表現してよい。駆動部８０が第１電子機器１の顔部分Ｆｃにおける目Ｅ１及び目Ｅ２の少なくとも一方を動かす方向は、図３に示す矢印のいずれかの方向に限定されない。例えば、駆動部８０は、第１電子機器１の顔部分Ｆｃにおける目Ｅ１及び目Ｅ２の少なくとも一方を、図３に示す矢印のいずれかの方向以外の斜めの方向などに動かしてもよい。 In addition, in one embodiment, the drive unit 80 may express movement of the eye E1 and/or the eye E2 in the face portion Fc of the first electronic device 1 shown in FIG. 3, that is, the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg). In this case, the drive unit 80 may express the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving at least one of the eye E1 and the eye E2 in the face portion Fc of the first electronic device 1. In one embodiment, the drive unit 80 may express the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg) by driving the movement of at least one of the eye E1 and the eye E2 in the face portion Fc of the first electronic device 1. Specifically, the drive unit 80 may express the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg) by, for example, moving at least one of the eye E1 and the eye E2 in the face portion Fc of the first electronic device 1 in any of the directions of the arrows shown in FIG. 3. The direction in which the drive unit 80 moves at least one of the eyes E1 and E2 in the face portion Fc of the first electronic device 1 is not limited to the directions of the arrows shown in FIG. 3. For example, the drive unit 80 may move at least one of the eyes E1 and E2 in the face portion Fc of the first electronic device 1 in a diagonal direction or the like other than the directions of the arrows shown in FIG. 3.
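To illustrate how eye movement need not be limited to the arrow directions of FIG. 3, the following sketch converts a gaze direction into a two-dimensional eye offset, so that diagonal directions fall out naturally. The yaw/pitch representation of the gaze, the 30-degree mechanical limit, and the function `eye_offset` are assumptions for this example and do not come from the disclosure.

```python
import math
from typing import Tuple


def eye_offset(yaw_deg: float, pitch_deg: float,
               max_angle_deg: float = 30.0) -> Tuple[float, float]:
    """Map a gaze direction to a normalised eye offset (x, y).

    x is positive to the right and y is positive upward; both lie in [-1, 1],
    and the combined offset is kept inside the unit circle so that a diagonal
    gaze does not push the eye beyond its travel.
    """
    x = max(-1.0, min(1.0, yaw_deg / max_angle_deg))
    y = max(-1.0, min(1.0, pitch_deg / max_angle_deg))
    norm = math.hypot(x, y)
    if norm > 1.0:
        x, y = x / norm, y / norm
    return x, y


print(eye_offset(15.0, 0.0))    # purely horizontal gaze
print(eye_offset(30.0, 30.0))   # diagonal gaze, limited to the unit circle
```

The same offset could drive either a mechanical eye or a drawn eye on a display, since it is expressed in normalised units rather than servo angles or pixels.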

　一実施形態において、表示部７０は、例えば図３に示す顔部分Ｆｃにおける目Ｅ１及び／又は目Ｅ２を表示することにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線を表現してもよい。一実施形態において、表示部７０及び駆動部８０の少なくとも一方は、第１電子機器１の目Ｅ１及び目Ｅ２の少なくとも一方を表現することにより、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線を表現してよい。 In one embodiment, the display unit 70 may express the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg) by displaying, for example, the eye E1 and/or the eye E2 in the face portion Fc shown in FIG. 3. In one embodiment, at least one of the display unit 70 and the drive unit 80 may express the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg) by representing at least one of the eye E1 and the eye E2 of the first electronic device 1.

　上述のように、表示部７０による表示、及び／又は、駆動部８０の駆動により、例えば対話者Ｍｇのような人間の感情及び／又は行動を表す種々の動作を表現することができる。表示部７０による表示、及び／又は、駆動部８０の駆動により、例えば対話者Ｍｇのような人間の感情及び／又は行動を表す動作は、公知の種々の技術を用いてよい。このため、表示部７０による表示、及び／又は、駆動部８０の駆動により、例えば対話者Ｍｇのような人間の感情及び／又は行動を表す動作については、より詳細な説明は省略する。一実施形態に係る第１電子機器１は、表示部７０による表示、及び／又は、駆動部８０の駆動により、対話者Ｍｇの感情及び／又は行動を表す各種の動作を行うことができる。 As described above, various operations expressing the emotions and/or behavior of a human being, such as the interlocutor Mg, can be expressed through display by the display unit 70 and/or driving by the drive unit 80. Various known technologies may be used for these operations. For this reason, a more detailed description of these operations is omitted. The first electronic device 1 according to one embodiment can perform various operations expressing the emotions and/or behavior of the interlocutor Mg through display by the display unit 70 and/or driving by the drive unit 80.

　図２に示す入力部９０は、第１電子機器１のユーザによる入力を検出するための任意のデバイスを含んで構成されてよい。例えば、入力部９０は、各種スイッチ、各種スライダ、各種フェーダ、ジョイスティック、パッド、キーボード、マウス、トラックボール、及びタッチパネルなどの少なくともいずれかを含んで構成されてよい。入力部９０は、公知の種々の技術を用いてよいため、より詳細なハードウェアなどの説明は省略する。一実施形態において、入力部９０は、例えば図１に示す対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどによる入力を検出してよい。 The input unit 90 shown in FIG. 2 may be configured to include any device for detecting input by a user of the first electronic device 1. For example, the input unit 90 may be configured to include at least one of various switches, various sliders, various faders, a joystick, a pad, a keyboard, a mouse, a trackball, and a touch panel. The input unit 90 may use various known technologies, so a more detailed description of the hardware, etc. will be omitted. In one embodiment, the input unit 90 may detect input by interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf shown in FIG. 1, for example.

　視線情報取得部９２は、第１電子機器１のユーザ（例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくとも１人）の視線の情報を取得する。視線情報取得部９２は、第１電子機器１のユーザの視線、当該視線の向き、及び／又は、当該視線の動きなど、第１電子機器１のユーザの視線の情報を取得してよい。視線情報取得部９２は、例えばアイトラッカーなどのように、第１電子機器１のユーザ（例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくとも１人）の視線の動きを追尾する機能を備えてよい。視線情報取得部９２は、第１電子機器１のユーザの視線、当該視線の向き、及び／又は、当該視線の動きなど、第１電子機器１のユーザの視線の情報を取得することができる任意の部材としてよい。 The gaze information acquisition unit 92 acquires gaze information of the user of the first electronic device 1 (e.g., at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf). The gaze information acquisition unit 92 may acquire gaze information of the user of the first electronic device 1, such as the gaze of the user of the first electronic device 1, the direction of the gaze, and/or the movement of the gaze. The gaze information acquisition unit 92 may have a function of tracking the movement of the gaze of the user of the first electronic device 1 (e.g., at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf), such as an eye tracker. The gaze information acquisition unit 92 may be any component capable of acquiring gaze information of the user of the first electronic device 1, such as the gaze of the user of the first electronic device 1, the direction of the gaze, and/or the movement of the gaze.

　一実施形態に係る第１電子機器１は、撮像部４０によって撮像される第１電子機器１のユーザ（例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくとも１人）の目の動きに基づいて、当該ユーザの視線の情報を取得してもよい。この場合、第１電子機器１は、視線情報取得部９２を備えなくてもよいし、撮像部４０が視線情報取得部９２の機能を兼ねてもよい。視線情報取得部９２によって取得された視線情報は、例えば制御部１０に入力されてよい。このため、視線情報取得部９２は、制御部１０に有線及び／又は無線で接続されてよい。 The first electronic device 1 according to one embodiment may acquire gaze information of a user of the first electronic device 1 (e.g., at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf) based on the eye movement of the user captured by the imaging unit 40. In this case, the first electronic device 1 need not include the gaze information acquisition unit 92, or the imaging unit 40 may also serve as the gaze information acquisition unit 92. The gaze information acquired by the gaze information acquisition unit 92 may be input to the control unit 10, for example. For this reason, the gaze information acquisition unit 92 may be connected to the control unit 10 via a wired and/or wireless connection.

　一実施形態において、第１電子機器１は、上述のように、専用に設計された機器としてもよい。一方、一実施形態において、第１電子機器１は、図２に示す機能部のうち、例えば音声出力部６０、駆動部８０、入力部９０、及び視線情報取得部９２の少なくともいずれかを備えてもよい。この場合、第１電子機器１は、図２に示す他の機能部の機能の少なくとも一部を補うために、他の電子機器に接続されてもよい。ここで、他の電子機器とは、例えば、汎用のスマートフォン、タブレット、ファブレット、ノートパソコン（ノートＰＣ若しくはラップトップ）、又はコンピュータ（デスクトップ）などの機器としてもよい。 In one embodiment, the first electronic device 1 may be a dedicated device as described above. Meanwhile, in one embodiment, the first electronic device 1 may include at least one of the functional units shown in FIG. 2, such as the audio output unit 60, the drive unit 80, the input unit 90, and the gaze information acquisition unit 92. In this case, the first electronic device 1 may be connected to another electronic device to supplement at least a part of the functions of the other functional units shown in FIG. 2. Here, the other electronic device may be, for example, a general-purpose smartphone, tablet, phablet, notebook computer (notebook PC or laptop), or computer (desktop).

　図３に示した第１電子機器１における表示部７０による表示、及び／又は、駆動部８０の駆動により、対話者Ｍｇのような人間の感情及び／又は行動を表す種々の動作を表現する態様は、あくまでも想定され得る例示としてよい。一実施形態に係る第１電子機器１は、種々の構成及び／又は動作態様によって、対話者Ｍｇのような人間の感情及び／又は行動を表す種々の動作を表現してよい。 The manner in which various actions expressing the emotions and/or behavior of a human being such as interlocutor Mg are expressed by the display unit 70 and/or the drive unit 80 in the first electronic device 1 shown in FIG. 3 may be merely considered as examples that can be envisioned. The first electronic device 1 according to one embodiment may express various actions expressing the emotions and/or behavior of a human being such as interlocutor Mg by using various configurations and/or operating modes.

　図４は、図１に示した第２電子機器１００の構成を概略的に示すブロック図である。以下、一実施形態に係る第２電子機器１００の構成の一例について説明する。第２電子機器１００は、図１に示したように、例えば対話者Ｍｇが、自宅ＲＬにおいて使用する機器としてよい。上述した第１電子機器１は、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆのうち少なくとも１人などが発話する際に、第１電子機器１が取得した当該対話者候補のうち少なくとも１人などの音声及び／又は映像を、第２電子機器１００に出力する機能を有する。そして、第１電子機器１は、対話者Ｍｇの視線を表現することができる。また、第２電子機器１００は、対話者Ｍｇが発話する際に、第２電子機器１００が取得した対話者Ｍｇの音声及び／又は映像を、第１電子機器１に出力する機能を有する。さらに、第２電子機器１００は、第２電子機器１００が取得した対話者Ｍｇの視線の情報を、第１電子機器１に出力する機能を有する。第２電子機器１００により、対話者Ｍｇは、会議室ＭＲから離れた場所においても、リモート会議又はビデオ会議を行うことができる。したがって、第２電子機器１００は、適宜、「リモートで使用される」電子機器とも記す。 FIG. 4 is a block diagram showing a schematic configuration of the second electronic device 100 shown in FIG. 1. An example of the configuration of the second electronic device 100 according to an embodiment will be described below. As shown in FIG. 1, the second electronic device 100 may be, for example, a device used by the interlocutor Mg at his/her home RL. The above-mentioned first electronic device 1 has a function of outputting the voice and/or image of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf acquired by the first electronic device 1 to the second electronic device 100 when at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf speaks. The first electronic device 1 can express the gaze of the interlocutor Mg. In addition, the second electronic device 100 has a function of outputting the voice and/or image of the interlocutor Mg acquired by the second electronic device 100 to the first electronic device 1 when the interlocutor Mg speaks. Furthermore, the second electronic device 100 has a function of outputting the gaze information of the interlocutor Mg acquired by the second electronic device 100 to the first electronic device 1. The second electronic device 100 allows the interlocutor Mg to hold a remote conference or a video conference even when the interlocutor Mg is in a location away from the conference room MR. 
Therefore, the second electronic device 100 is also referred to as an electronic device "used remotely" as appropriate.

　図４に示すように、一実施形態に係る第２電子機器１００は、制御部１１０、記憶部１２０、通信部１３０、撮像部１４０、音声入力部１５０、音声出力部１６０、表示部１７０、入力部１９０、及び視線情報取得部１９２などを備えてよい。また、制御部１１０は、例えば、取得部１１２、検出部１１４、選出部１１６、及び特定部１１８などを含んでもよい。一実施形態において、第２電子機器１００は、図４に示す機能部の少なくとも一部を備えなくてもよいし、図４に示す機能部以外の構成要素を備えてもよい。 As shown in FIG. 4, the second electronic device 100 according to one embodiment may include a control unit 110, a memory unit 120, a communication unit 130, an imaging unit 140, an audio input unit 150, an audio output unit 160, a display unit 170, an input unit 190, and a gaze information acquisition unit 192. The control unit 110 may also include, for example, an acquisition unit 112, a detection unit 114, a selection unit 116, and an identification unit 118. In one embodiment, the second electronic device 100 may not include at least some of the functional units shown in FIG. 4, or may include components other than the functional units shown in FIG. 4.

　制御部１１０は、第２電子機器１００を構成する各機能部をはじめとして、第２電子機器１００の全体を制御及び／又は管理する機能を有してよい。制御部１１０は、基本的に、例えば図２に示した制御部１０と同様の思想に基づく構成としてよい。また、制御部１１０の取得部１１２、検出部１１４、選出部１１６、及び特定部１１８についても、それぞれ、例えば図２に示した制御部１０の取得部１２、検出部１４、選出部１６、及び特定部１８と同様の思想に基づく構成としてよい。 The control unit 110 may have the function of controlling and/or managing the entire second electronic device 100, including each functional unit constituting the second electronic device 100. The control unit 110 may basically be configured based on the same concept as the control unit 10 shown in FIG. 2, for example. The acquisition unit 112, detection unit 114, selection unit 116, and identification unit 118 of the control unit 110 may also be configured based on the same concept as the acquisition unit 12, detection unit 14, selection unit 16, and identification unit 18 of the control unit 10 shown in FIG. 2, for example.

　記憶部１２０は、各種の情報を記憶するメモリとしての機能を有してよい。記憶部１２０は、例えば制御部１１０において実行されるプログラム、及び、制御部１１０において実行された処理の結果などを記憶してよい。また、記憶部１２０は、制御部１１０のワークメモリとして機能してもよい。図４に示すように、記憶部１２０は、制御部１１０に有線及び／又は無線で接続されてよい。記憶部１２０は、基本的に、例えば図２に示した記憶部２０と同様の思想に基づく構成としてよい。 The storage unit 120 may function as a memory that stores various types of information. The storage unit 120 may store, for example, programs executed in the control unit 110 and results of processing executed in the control unit 110. The storage unit 120 may also function as a work memory for the control unit 110. As shown in FIG. 4, the storage unit 120 may be connected to the control unit 110 via a wired and/or wireless connection. The storage unit 120 may basically be configured based on the same concept as the storage unit 20 shown in FIG. 2, for example.

　通信部１３０は、無線及び／又は有線により通信するためのインタフェースの機能を有する。通信部１３０は、例えばアンテナを介して、例えば他の電子機器の通信部と無線通信してもよい。例えば、通信部１３０は、図１に示した第１電子機器１と無線通信してよい。この場合、通信部１３０は、第１電子機器１の通信部３０と無線通信してよい。このように、一実施形態において、通信部１３０は、第１電子機器１と通信する機能を有する。また、例えば、通信部１３０は、図１に示した第３電子機器３００と無線通信してよい。この場合、通信部１３０は、第３電子機器３００の通信部３３０（後述）と無線通信してよい。このように、一実施形態において、通信部１３０は、第３電子機器３００と通信する機能を有してよい。図４に示すように、通信部１３０は、制御部１１０に有線及び／又は無線で接続されてよい。通信部１３０は、基本的に、例えば図２に示した通信部３０と同様の思想に基づく構成としてよい。 The communication unit 130 has an interface function for wireless and/or wired communication. The communication unit 130 may wirelessly communicate with, for example, a communication unit of another electronic device, for example, via an antenna. For example, the communication unit 130 may wirelessly communicate with the first electronic device 1 shown in FIG. 1. In this case, the communication unit 130 may wirelessly communicate with the communication unit 30 of the first electronic device 1. In this way, in one embodiment, the communication unit 130 has a function of communicating with the first electronic device 1. Also, for example, the communication unit 130 may wirelessly communicate with the third electronic device 300 shown in FIG. 1. In this case, the communication unit 130 may wirelessly communicate with the communication unit 330 (described later) of the third electronic device 300. In this way, in one embodiment, the communication unit 130 may have a function of communicating with the third electronic device 300. As shown in FIG. 4, the communication unit 130 may be connected to the control unit 110 in a wired and/or wireless manner. The communication unit 130 may basically have a configuration based on the same idea as the communication unit 30 shown in FIG. 2, for example.

　撮像部１４０は、例えばデジタルカメラのような、電子的に画像を撮像するイメージセンサを含んで構成されてよい。撮像部１４０は、例えば図１に示す自宅ＲＬ内の様子を撮像してよい。一実施形態において、撮像部１４０は、例えば図１に示す自宅ＲＬから会議に参加する対話者Ｍｇなどを撮像してよい。撮像部１４０は、撮像した画像を信号に変換して、制御部１１０に送信してよい。このため、撮像部１４０は、制御部１１０に有線及び／又は無線で接続されてよい。撮像部１４０は、基本的に、例えば図２に示した撮像部４０と同様の思想に基づく構成としてよい。 The imaging unit 140 may be configured to include an image sensor that electronically captures images, such as a digital camera. The imaging unit 140 may capture images of the interior of the home RL shown in FIG. 1, for example. In one embodiment, the imaging unit 140 may capture images of the interlocutor Mg who participates in the conference from the home RL shown in FIG. 1, for example. The imaging unit 140 may convert the captured image into a signal and transmit it to the control unit 110. For this reason, the imaging unit 140 may be connected to the control unit 110 by wire and/or wirelessly. The imaging unit 140 may basically be configured based on the same concept as the imaging unit 40 shown in FIG. 2, for example.

　音声入力部１５０は、人が発する声を含む、第２電子機器１００の周囲の音又は音声を検出（取得）する。例えば、音声入力部１５０は、音又は音声を空気振動として例えばダイヤフラムなどで検出したものを電気信号に変換するものとしてよい。具体的には、音声入力部１５０は、任意のマイク（マイクロフォン）のような音を電気信号に変換する音響機器を含んで構成されてよい。一実施形態において、音声入力部１５０は、例えば図１に示した自宅ＲＬにおける対話者Ｍｇの音声を検出（取得）してよい。音声入力部１５０によって検出された音声（電気信号）は、例えば制御部１１０に入力されてよい。このため、音声入力部１５０は、制御部１１０に有線及び／又は無線で接続されてよい。音声入力部１５０は、基本的に、例えば図２に示した音声入力部５０と同様の思想に基づく構成としてよい。 The voice input unit 150 detects (acquires) sounds or voices around the second electronic device 100, including human voices. For example, the voice input unit 150 may convert sounds or voices detected as air vibrations, for example, by a diaphragm, into an electrical signal. Specifically, the voice input unit 150 may include an acoustic device that converts sounds into an electrical signal, such as an arbitrary microphone. In one embodiment, the voice input unit 150 may detect (acquire) the voice of the interlocutor Mg in the home RL shown in FIG. 1, for example. The voice (electrical signal) detected by the voice input unit 150 may be input to the control unit 110, for example. For this reason, the voice input unit 150 may be connected to the control unit 110 by wire and/or wirelessly. The voice input unit 150 may basically be configured based on the same concept as the voice input unit 50 shown in FIG. 2, for example.

　音声出力部１６０は、制御部１１０から供給される電気信号（音声信号）を音に変換することにより、当該音声信号を音又は音声として出力する。音声出力部１６０は、制御部１１０に有線及び／又は無線で接続されてよい。音声出力部１６０は、任意のスピーカ（ラウドスピーカ）などの音を出力する機能を有するデバイスを含めて構成されてよい。一実施形態において、音声出力部１６０は、第１電子機器１の音声入力部５０が検出した音声を出力してよい。ここで、第１電子機器１の音声入力部５０が検出した音声とは、図１に示した会議室ＭＲにおける対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかの音声としてよい。音声出力部１６０は、基本的に、例えば図２に示した音声出力部６０と同様の思想に基づく構成としてよい。 The audio output unit 160 converts an electric signal (audio signal) supplied from the control unit 110 into sound, and outputs the audio signal as sound or voice. The audio output unit 160 may be connected to the control unit 110 by wire and/or wirelessly. The audio output unit 160 may be configured to include a device having a function of outputting sound, such as an arbitrary speaker (loudspeaker). In one embodiment, the audio output unit 160 may output a voice detected by the audio input unit 50 of the first electronic device 1. Here, the voice detected by the audio input unit 50 of the first electronic device 1 may be at least one of the voices of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf in the conference room MR shown in FIG. 1. The audio output unit 160 may basically be configured based on the same idea as the audio output unit 60 shown in FIG. 2, for example.

　表示部１７０は、例えば、液晶ディスプレイ（Liquid Crystal Display：ＬＣＤ）、有機ＥＬディスプレイ（Organic Electro-Luminescence panel）、又は無機ＥＬディスプレイ（Inorganic Electro-Luminescence panel）等の任意の表示デバイスとしてよい。表示部１７０は、基本的に、例えば図２に示した表示部７０と同様の思想に基づく構成としてよい。表示部１７０において表示を行うために必要な各種データは、例えば制御部１１０又は記憶部１２０などから供給されてよい。このため、表示部１７０は、制御部１１０などに有線及び／又は無線で接続されてよい。 The display unit 170 may be any display device, such as a Liquid Crystal Display (LCD), an Organic Electro-Luminescence panel, or an Inorganic Electro-Luminescence panel. The display unit 170 may basically be configured based on the same concept as the display unit 70 shown in FIG. 2, for example. Various data required for display on the display unit 170 may be supplied from, for example, the control unit 110 or the memory unit 120. For this reason, the display unit 170 may be connected to the control unit 110, etc., via a wired and/or wireless connection.

　表示部１７０は、例えば対話者Ｍｇの指又はスタイラスの接触による入力を検出するタッチパネルの機能を備えたタッチスクリーンディスプレイとしてもよい。 The display unit 170 may be, for example, a touch screen display equipped with a touch panel function that detects input by contact with the interlocutor Mg's finger or stylus.

　一実施形態において、表示部１７０は、第１電子機器１から送信される映像信号に基づく映像を表示してよい。表示部１７０は、第１電子機器１から送信される映像信号に基づく映像として、第１電子機器１（の撮像部４０）によって撮像された例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどの映像を表示してもよい。第２電子機器１００の表示部１７０に対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどの映像が表示されることにより、例えば図１に示す対話者Ｍｇは、自宅ＲＬから離れた会議室ＭＲにいる当該対話者候補の様子を視覚的に知ることができる。 In one embodiment, the display unit 170 may display an image based on the video signal transmitted from the first electronic device 1. The display unit 170 may display images of interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf, etc., captured by the first electronic device 1 (its imaging unit 40), as an image based on the video signal transmitted from the first electronic device 1. By displaying images of interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf, etc. on the display unit 170 of the second electronic device 100, for example, interlocutor Mg shown in FIG. 1 can visually know the state of the interlocutor candidates in a conference room MR away from their home RL.

　表示部１７０は、例えば第１電子機器１によって撮像された対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどの映像をそのまま表示してもよい。一方、表示部１７０は、例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどをキャラクタ化したような画像（例えばアバター）を表示してもよい。 The display unit 170 may directly display images of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf, for example, captured by the first electronic device 1. On the other hand, the display unit 170 may display images (e.g., avatars) that characterize the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf, for example.

　入力部１９０は、第２電子機器１００のユーザによる入力を検出するための任意のデバイスを含んで構成されてよい。例えば、入力部１９０は、各種スイッチ、各種スライダ、各種フェーダ、ジョイスティック、パッド、キーボード、マウス、トラックボール、及びタッチパネルなどの少なくともいずれかを含んで構成されてよい。一実施形態において、入力部１９０は、例えば図１に示す対話者Ｍｇによる入力を検出してよい。 The input unit 190 may be configured to include any device for detecting an input by a user of the second electronic device 100. For example, the input unit 190 may be configured to include at least one of various switches, various sliders, various faders, a joystick, a pad, a keyboard, a mouse, a trackball, and a touch panel. In one embodiment, the input unit 190 may detect, for example, an input by the interlocutor Mg shown in FIG. 1.

　視線情報取得部１９２は、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線の情報を取得する。視線情報取得部１９２は、第２電子機器１００のユーザの視線、当該視線の向き、及び／又は、当該視線の動きなど、第２電子機器１００のユーザの視線の情報を取得してよい。視線情報取得部１９２は、例えばアイトラッカーなどのように、第２電子機器１００のユーザ（例えば対話者Ｍｇ）の視線の動きを追尾する機能を備えてよい。視線情報取得部１９２は、第２電子機器１００のユーザの視線、当該視線の向き、及び／又は、当該視線の動きなど、第２電子機器１００のユーザの視線の情報を取得することができる任意の部材としてよい。 The gaze information acquisition unit 192 acquires gaze information of the user of the second electronic device 100 (e.g., interlocutor Mg). The gaze information acquisition unit 192 may acquire gaze information of the user of the second electronic device 100, such as the gaze of the user of the second electronic device 100, the direction of the gaze, and/or the movement of the gaze. The gaze information acquisition unit 192 may have a function of tracking the movement of the gaze of the user of the second electronic device 100 (e.g., interlocutor Mg), such as an eye tracker. The gaze information acquisition unit 192 may be any component capable of acquiring gaze information of the user of the second electronic device 100, such as the gaze of the user of the second electronic device 100, the direction of the gaze, and/or the movement of the gaze.

　一実施形態に係る第２電子機器１００は、撮像部１４０によって撮像される第２電子機器１００のユーザ（例えば対話者Ｍｇ）の目の動きに基づいて、当該ユーザの視線の情報を取得してもよい。この場合、第２電子機器１００は、視線情報取得部１９２を備えなくてもよいし、撮像部１４０が視線情報取得部１９２を兼ねてもよい。視線情報取得部１９２によって取得された視線情報は、例えば制御部１１０に入力されてよい。このため、視線情報取得部１９２は、制御部１１０に有線及び／又は無線で接続されてよい。 The second electronic device 100 according to one embodiment may acquire gaze information of a user (e.g., interlocutor Mg) of the second electronic device 100 based on the eye movement of the user captured by the imaging unit 140. In this case, the second electronic device 100 may not include a gaze information acquisition unit 192, or the imaging unit 140 may also function as the gaze information acquisition unit 192. The gaze information acquired by the gaze information acquisition unit 192 may be input to the control unit 110, for example. For this reason, the gaze information acquisition unit 192 may be connected to the control unit 110 via a wired and/or wireless connection.

　一実施形態において、第２電子機器１００は、上述のように、専用に設計された機器としてもよい。一方、一実施形態において、第２電子機器１００は、例えば図４に示す機能部のうち一部を備えてもよい。この場合、第２電子機器１００は、図４に示す他の機能部の機能の少なくとも一部を補うために、他の電子機器に接続されてもよい。ここで、他の電子機器とは、例えば、汎用のスマートフォン、タブレット、ファブレット、ノートパソコン（ノートＰＣ若しくはラップトップ）、又はコンピュータ（デスクトップ）などの機器としてもよい。 In one embodiment, the second electronic device 100 may be a dedicated device as described above. Meanwhile, in one embodiment, the second electronic device 100 may include some of the functional units shown in FIG. 4, for example. In this case, the second electronic device 100 may be connected to another electronic device to supplement at least some of the functions of the other functional units shown in FIG. 4. Here, the other electronic device may be, for example, a general-purpose smartphone, tablet, phablet, notebook computer (notebook PC or laptop), or computer (desktop), etc.

　特に、スマートフォン又はノートパソコンなどは、図４に示す機能部のうち比較的多くの機能部を備えていることが多い。このため、一実施形態において、第２電子機器１００は、スマートフォン又はノートパソコンなどとしてもよい。この場合、第２電子機器１００は、スマートフォン又はノートパソコンなどにおいて、第１電子機器１と連携するためのアプリケーション（プログラム）をインストールしたものとしてもよい。 In particular, a smartphone or a laptop computer often has a relatively large number of the functional units shown in FIG. 4. For this reason, in one embodiment, the second electronic device 100 may be a smartphone or a laptop computer. In this case, the second electronic device 100 may be a smartphone or a laptop computer on which an application (program) for linking with the first electronic device 1 is installed.

　図５は、図１に示した第３電子機器３００の構成を概略的に示すブロック図である。以下、一実施形態に係る第３電子機器３００の構成の一例について説明する。第３電子機器３００は、図１に示したように、例えば対話者Ｍｇの自宅ＲＬ及び会議室ＭＲとは異なる場所に設置されてよい。また、第３電子機器３００は、例えば対話者Ｍｇの自宅ＲＬ又はその付近に設置されてもよいし、会議室ＭＲ又はその付近に設置されてもよい。 FIG. 5 is a block diagram showing a schematic configuration of the third electronic device 300 shown in FIG. 1. An example of the configuration of the third electronic device 300 according to an embodiment will be described below. The third electronic device 300 may be installed in a location other than the home RL and the conference room MR of the interlocutor Mg, as shown in FIG. 1. The third electronic device 300 may be installed in or near the home RL of the interlocutor Mg, or in or near the conference room MR.

　第１電子機器１は、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆなどが発話する際に、第１電子機器１が取得した当該対話者候補などの音声及び／又は映像のデータを、第３電子機器３００に送信する機能を有する。第３電子機器３００は、第１電子機器１から受信した音声及び／又は映像のデータを第２電子機器１００に送信してよい。また、第２電子機器１００は、対話者Ｍｇが発話する際に、第２電子機器１００が取得した対話者Ｍｇの音声及び／又は映像のデータを、第３電子機器３００に送信する機能を有する。第３電子機器３００は、第２電子機器１００から受信した音声及び／又は映像のデータを第１電子機器１に送信してよい。このように、第３電子機器３００は、第１電子機器１と第２電子機器１００とを中継する機能を備えてよい。第３電子機器３００は、適宜、「サーバ」とも記す。 The first electronic device 1 has a function of transmitting audio and/or video data of the interlocutor candidates Ma, Mb, Mc, Md, Me, Mf, etc. acquired by the first electronic device 1 to the third electronic device 300 when the interlocutor candidates Ma, Mb, Mc, Md, Me, Mf, etc. speak. The third electronic device 300 may transmit the audio and/or video data received from the first electronic device 1 to the second electronic device 100. The second electronic device 100 also has a function of transmitting audio and/or video data of the interlocutor Mg acquired by the second electronic device 100 to the third electronic device 300 when the interlocutor Mg speaks. The third electronic device 300 may transmit the audio and/or video data received from the second electronic device 100 to the first electronic device 1. In this way, the third electronic device 300 may have a function of relaying between the first electronic device 1 and the second electronic device 100. The third electronic device 300 is also referred to as a "server" as appropriate.
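The relay function of the third electronic device 300 can be illustrated with a minimal sketch. It is an assumption-laden illustration, not the implementation of the disclosure: the device identifiers `"device_1"` and `"device_100"`, the classes `Packet` and `RelayServer`, and the in-memory `outbox` standing in for the communication unit 330 are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Packet:
    """Hypothetical unit of audio and/or video data sent to the server."""
    sender: str    # "device_1" (local) or "device_100" (remote); assumed IDs
    kind: str      # "audio" or "video"
    payload: bytes


@dataclass
class RelayServer:
    """Sketch of a server that forwards data between the two devices."""
    # Each sender is mapped to the device on the other side of the relay.
    routes: dict = field(default_factory=lambda: {
        "device_1": "device_100",
        "device_100": "device_1",
    })
    # In-memory queues stand in for actual network transmission.
    outbox: dict = field(default_factory=dict)

    def relay(self, packet: Packet) -> str:
        """Queue the packet for the opposite device and return its ID."""
        destination = self.routes[packet.sender]
        self.outbox.setdefault(destination, []).append(packet)
        return destination
```

In this sketch the server only forwards data unchanged; any processing performed by the control unit 310 would take place before the packet is queued.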

　図５に示すように、一実施形態に係る第３電子機器３００は、制御部３１０、記憶部３２０、及び通信部３３０を備えてよい。また、制御部３１０は、例えば、特定部３１２及び推定部３１４を含んでよい。一実施形態において、第３電子機器３００は、図５に示す機能部の少なくとも一部を備えなくてもよいし、図に示す機能部以外の構成要素を備えてもよい。 As shown in FIG. 5, the third electronic device 300 according to one embodiment may include a control unit 310, a storage unit 320, and a communication unit 330. The control unit 310 may also include, for example, an identification unit 312 and an estimation unit 314. In one embodiment, the third electronic device 300 may not include at least some of the functional units shown in FIG. 5, or may include components other than the functional units shown in the figure.

　制御部３１０は、第３電子機器３００を構成する各機能部をはじめとして、第３電子機器３００の全体を制御及び／又は管理する機能を有してよい。制御部３１０は、基本的に、例えば図２に示した制御部１０又は図４に示した制御部１１０と同様の思想に基づく構成としてよい。また、制御部３１０の取得部３１２、検出部３１４、選出部３１６、及び特定部３１８についても、それぞれ、例えば図２に示した制御部１０の取得部１２、検出部１４、選出部１６、及び特定部１８と同様の思想に基づく構成としてよい。制御部３１０の取得部３１２、検出部３１４、選出部３１６、及び特定部３１８は、それぞれ、例えば図４に示した取得部１１２、検出部１１４、選出部１１６、及び特定部１１８と同様の思想に基づく構成としてもよい。 The control unit 310 may have a function of controlling and/or managing the entire third electronic device 300, including each functional unit constituting the third electronic device 300. The control unit 310 may basically be configured based on the same concept as the control unit 10 shown in FIG. 2 or the control unit 110 shown in FIG. 4. The acquisition unit 312, detection unit 314, selection unit 316, and identification unit 318 of the control unit 310 may also be configured based on the same concept as the acquisition unit 12, detection unit 14, selection unit 16, and identification unit 18 of the control unit 10 shown in FIG. 2. The acquisition unit 312, detection unit 314, selection unit 316, and identification unit 318 of the control unit 310 may also be configured based on the same concept as the acquisition unit 112, detection unit 114, selection unit 116, and identification unit 118 shown in FIG. 4.

　記憶部３２０は、各種の情報を記憶するメモリとしての機能を有してよい。記憶部３２０は、例えば制御部３１０において実行されるプログラム、及び、制御部３１０において実行された処理の結果などを記憶してよい。また、記憶部３２０は、制御部３１０のワークメモリとして機能してもよい。図５に示すように、記憶部３２０は、制御部３１０に有線及び／又は無線で接続されてよい。記憶部３２０は、基本的に、例えば図２に示した記憶部２０又は図４に示した記憶部１２０と同様の思想に基づく構成としてよい。 The storage unit 320 may function as a memory that stores various types of information. The storage unit 320 may store, for example, programs executed in the control unit 310 and results of processing executed in the control unit 310. The storage unit 320 may also function as a work memory for the control unit 310. As shown in FIG. 5, the storage unit 320 may be connected to the control unit 310 by wire and/or wirelessly. The storage unit 320 may basically be configured based on the same concept as, for example, the storage unit 20 shown in FIG. 2 or the storage unit 120 shown in FIG. 4.

　通信部３３０は、無線及び／又は有線により通信するためのインタフェースの機能を有する。通信部３３０は、例えばアンテナを介して、例えば他の電子機器の通信部と無線通信してもよい。例えば、通信部３３０は、図１に示した第１電子機器１と無線通信してよい。この場合、通信部３３０は、第１電子機器１の通信部３０と無線通信してよい。このように、一実施形態において、通信部３３０は、第１電子機器１と通信する機能を有する。また、例えば、通信部３３０は、図１に示した第２電子機器１００と無線通信してよい。この場合、通信部３３０は、第２電子機器１００の通信部１３０と無線通信してよい。このように、一実施形態において、通信部３３０は、第２電子機器１００と通信する機能を有してよい。図５に示すように、通信部３３０は、制御部３１０に有線及び／又は無線で接続されてよい。通信部３３０は、基本的に、例えば図２に示した通信部３０又は図４に示した通信部１３０と同様の思想に基づく構成としてよい。 The communication unit 330 has an interface function for wireless and/or wired communication. The communication unit 330 may wirelessly communicate with, for example, a communication unit of another electronic device, for example, via an antenna. For example, the communication unit 330 may wirelessly communicate with the first electronic device 1 shown in FIG. 1. In this case, the communication unit 330 may wirelessly communicate with the communication unit 30 of the first electronic device 1. Thus, in one embodiment, the communication unit 330 has a function of communicating with the first electronic device 1. Also, for example, the communication unit 330 may wirelessly communicate with the second electronic device 100 shown in FIG. 1. In this case, the communication unit 330 may wirelessly communicate with the communication unit 130 of the second electronic device 100. Thus, in one embodiment, the communication unit 330 may have a function of communicating with the second electronic device 100. As shown in FIG. 5, the communication unit 330 may be connected to the control unit 310 in a wired and/or wireless manner. The communication unit 330 may basically be configured based on the same concept as, for example, the communication unit 30 shown in FIG. 2 or the communication unit 130 shown in FIG. 4.

　一実施形態において、第３電子機器３００は、例えば専用に設計された機器としてもよい。一方、一実施形態において、第３電子機器３００は、例えば図５に示す機能部のうち一部を備えてもよい。この場合、第３電子機器３００は、図５に示す他の機能部の機能の少なくとも一部を補うために、他の電子機器に接続されてもよい。ここで、他の電子機器とは、例えば、汎用のコンピュータ又はサーバなどの機器としてもよい。一実施形態において、第３電子機器３００は、例えば中継サーバ、ウェブサーバ、又はアプリケーションサーバなどとしてもよい。 In one embodiment, the third electronic device 300 may be, for example, a specially designed device. On the other hand, in one embodiment, the third electronic device 300 may include, for example, some of the functional units shown in FIG. 5. In this case, the third electronic device 300 may be connected to other electronic devices to supplement at least some of the functions of the other functional units shown in FIG. 5. Here, the other electronic devices may be, for example, devices such as a general-purpose computer or server. In one embodiment, the third electronic device 300 may be, for example, a relay server, a web server, or an application server.

　次に、一実施形態に係る第１電子機器１及び第２電子機器１００の基本的な動作について説明する。以下、図１に示すように、会議室ＭＲにおいて実施されるリモート会議に、対話者Ｍｇが自宅ＲＬから参加する状況を想定して説明する。 Next, the basic operation of the first electronic device 1 and the second electronic device 100 according to one embodiment will be described. The following description assumes a situation in which, as shown in FIG. 1, the interlocutor Mg participates from his/her home RL in a remote conference held in the conference room MR.

　すなわち、一実施形態に係る第１電子機器１は、会議室ＭＲに設置され、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかの映像及び／又は音声を取得する。第１電子機器１によって取得された映像及び／又は音声は、対話者Ｍｇの自宅ＲＬに設置された第２電子機器１００に送信される。第２電子機器１００は、第１電子機器１が取得する対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかの映像及び／又は音声を出力する。これにより、対話者Ｍｇは、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかの映像及び／又は音声を認識することができる。 That is, the first electronic device 1 according to one embodiment is installed in the conference room MR and acquires video and/or audio of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf. The video and/or audio acquired by the first electronic device 1 is transmitted to the second electronic device 100 installed in the interlocutor Mg's home RL. The second electronic device 100 outputs the video and/or audio of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf acquired by the first electronic device 1. This allows the interlocutor Mg to recognize the video and/or audio of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf.

　一方、一実施形態に係る第２電子機器１００は、対話者Ｍｇの自宅ＲＬに設置され、対話者Ｍｇの音声を取得する。また、第２電子機器１００は、対話者Ｍｇの視線の情報を取得する。第２電子機器１００によって取得された音声及び／又は視線の情報は、会議室ＭＲに設置された第１電子機器１に送信される。第１電子機器１は、第２電子機器１００から受信する対話者Ｍｇの音声を出力する。これにより、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかは、対話者Ｍｇの音声を聞くことができる。また、第１電子機器１は、第２電子機器１００から受信する対話者Ｍｇの視線の情報に基づいて、対話者Ｍｇの視線を表現する。これにより、対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれかは、対話者Ｍｇの視線の様子を視認することができる。さらに、一実施形態に係る第２電子機器１００は、対話者Ｍｇの映像を取得してもよい。第２電子機器１００によって取得された映像は、会議室ＭＲに設置された第１電子機器１に送信されてよい。この場合、第１電子機器１は、第２電子機器１００から受信する対話者Ｍｇの映像を出力してもよい。 On the other hand, the second electronic device 100 according to one embodiment is installed in the home RL of the interlocutor Mg and acquires the voice of the interlocutor Mg. The second electronic device 100 also acquires information on the line of sight of the interlocutor Mg. The voice and/or line of sight information acquired by the second electronic device 100 is transmitted to the first electronic device 1 installed in the conference room MR. The first electronic device 1 outputs the voice of the interlocutor Mg received from the second electronic device 100. As a result, at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf can hear the voice of the interlocutor Mg. The first electronic device 1 also expresses the line of sight of the interlocutor Mg based on the line of sight information of the interlocutor Mg received from the second electronic device 100. As a result, at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf can visually recognize the state of the line of sight of the interlocutor Mg. Furthermore, the second electronic device 100 according to one embodiment may acquire an image of the interlocutor Mg. The video captured by the second electronic device 100 may be transmitted to the first electronic device 1 installed in the conference room MR. In this case, the first electronic device 1 may output the video of the interlocutor Mg received from the second electronic device 100.
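The handling, on the first electronic device 1 side, of the data received from the second electronic device 100 can be illustrated with a minimal sketch. It assumes, purely for illustration, that the voice, the gaze information, and the optional video arrive together in one record. The names `RemoteFrame` and `handle_remote_frame` and the action labels are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class RemoteFrame:
    """Hypothetical record of data received from the second electronic device."""
    voice: Optional[bytes] = None                 # voice of the interlocutor Mg
    gaze: Optional[Tuple[float, float]] = None    # gaze information
    video: Optional[bytes] = None                 # video, optional


def handle_remote_frame(frame: RemoteFrame) -> List[str]:
    """Return the actions the first electronic device would take.

    Each field is optional, reflecting that voice, gaze information, and
    video may each be present or absent in what is received.
    """
    actions = []
    if frame.voice is not None:
        actions.append("output_voice")    # e.g. via the audio output unit 60
    if frame.gaze is not None:
        actions.append("express_gaze")    # e.g. via display unit 70 / drive unit 80
    if frame.video is not None:
        actions.append("output_video")
    return actions
```

For example, a frame carrying only voice and gaze information yields the voice output followed by the gaze expression, with no video output.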

　図６は、上述のような一実施形態に係るシステムの基本的な動作について説明するシーケンス図である。図６は、第１電子機器１、第２電子機器１００、及び第３電子機器３００の相互間で行われるデータなどのやり取りを示す図である。以下、図６を参照して、一実施形態に係るシステムを用いてリモート会議又はビデオ会議が行われる際の基本的な動作について説明する。 FIG. 6 is a sequence diagram explaining the basic operation of the system according to the embodiment described above. FIG. 6 is a diagram showing the exchange of data and the like between the first electronic device 1, the second electronic device 100, and the third electronic device 300. Below, the basic operation when a remote conference or video conference is held using the system according to the embodiment will be explained with reference to FIG. 6.

　図６に示す動作において、ローカルで使用される第１電子機器１は、第１ユーザによって使用されるものとしてよい。ここで、第１ユーザとは、例えば図１に示した対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくとも１人（以下、ローカルのユーザとも記す）としてよい。また、リモートで使用される第２電子機器１００は、第２ユーザによって使用されるものとしてよい。ここで、第２ユーザとは、例えば図１に示した対話者Ｍｇ（以下、リモートのユーザとも記す）としてよい。以下、第１電子機器１が実行する動作は、より詳細には、例えば第１電子機器１の制御部１０が実行するものとしてよい。本明細書において、第１電子機器１の制御部１０が実行する動作を、第１電子機器１が実行する動作として記すことがある。同様に、第２電子機器１００が実行する動作は、より詳細には、例えば第２電子機器１００の制御部１１０が実行するものとしてよい。本明細書において、第２電子機器１００の制御部１１０が実行する動作を、第２電子機器１００が実行する動作として記すことがある。また、第３電子機器３００が実行する動作は、より詳細には、例えば第３電子機器３００の制御部３１０が実行するものとしてよい。本明細書において、第３電子機器３００の制御部３１０が実行する動作を、第３電子機器３００が実行する動作として記すことがある。 In the operation shown in FIG. 6, the first electronic device 1 used locally may be used by the first user. Here, the first user may be, for example, at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf shown in FIG. 1 (hereinafter also referred to as the local user). The second electronic device 100 used remotely may be used by the second user. Here, the second user may be, for example, the interlocutor Mg shown in FIG. 1 (hereinafter also referred to as the remote user). Hereinafter, the operation performed by the first electronic device 1 may be, in more detail, performed by, for example, the control unit 10 of the first electronic device 1. In this specification, the operation performed by the control unit 10 of the first electronic device 1 may be described as the operation performed by the first electronic device 1. Similarly, the operation performed by the second electronic device 100 may be, in more detail, performed by, for example, the control unit 110 of the second electronic device 100. In this specification, the operation performed by the control unit 110 of the second electronic device 100 may be referred to as the operation performed by the second electronic device 100. In addition, the operation performed by the third electronic device 300 may be more specifically, for example, performed by the control unit 310 of the third electronic device 300. 
In this specification, the operation performed by the control unit 310 of the third electronic device 300 may be referred to as the operation performed by the third electronic device 300.

　図６に示す動作は、例えば図１に示したようなリモート会議の開始時などに開始するものとしてよい。あるいは、図６に示す動作は、例えば第１電子機器１及び／又は第２電子機器１００の起動時などに開始するものとしてもよい。 The operation shown in FIG. 6 may be initiated, for example, at the start of a remote conference as shown in FIG. 1. Alternatively, the operation shown in FIG. 6 may be initiated, for example, at the start of the first electronic device 1 and/or the second electronic device 100.

　図６に示す動作が開始すると、第１電子機器１は、少なくとも１人の対話者候補に関する情報を取得する（ステップＳ１１）。ステップＳ１１において、第１電子機器１は、第１ユーザ（例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくともいずれか）の映像及び音声の少なくとも一方を取得してよい）。具体的には、ステップＳ１１において、第１電子機器１の取得部１２は、撮像部４０によって第１ユーザの映像を撮像し、音声入力部５０によって第１ユーザの音声を取得（又は検出）してよい。ステップＳ１１において、第１電子機器１は、例えば図１に示した対話者候補Ｍａ，Ｍｂ，Ｍｃ，Ｍｄ，Ｍｅ，及びＭｆの少なくとも１人の映像及び／又は音声の情報を取得してよい。このように、第１電子機器１において、取得部１２は、少なくとも１人の対話者候補に関する情報を取得してよい。 When the operation shown in FIG. 6 starts, the first electronic device 1 acquires information about at least one interlocutor candidate (step S11). In step S11, the first electronic device 1 may acquire at least one of the video and audio of the first user (e.g., at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf). Specifically, in step S11, the acquisition unit 12 of the first electronic device 1 may capture video of the first user with the imaging unit 40 and acquire (or detect) audio of the first user with the audio input unit 50. In step S11, the first electronic device 1 may acquire, for example, video and/or audio information of at least one of the interlocutor candidates Ma, Mb, Mc, Md, Me, and Mf shown in FIG. 1. In this way, in the first electronic device 1, the acquisition unit 12 may acquire information about at least one interlocutor candidate.

　次に、第１電子機器１は、対話者候補に関する情報（例えば映像情報及び／又は音声情報）を、第３電子機器３００に送信する（ステップＳ１２）。具体的には、ステップＳ１２において、第１電子機器１は、映像及び／又は音声のデータを、通信部３０から、第３電子機器３００の通信部３３０に送信する。また、ステップＳ１２において、第３電子機器３００は、第１電子機器１の通信部３０から送信される映像及び／又は音声のデータを、通信部３３０によって受信（取得）してよい。 Next, the first electronic device 1 transmits information (e.g., video information and/or audio information) about the candidate interlocutor to the third electronic device 300 (step S12). Specifically, in step S12, the first electronic device 1 transmits video and/or audio data from the communication unit 30 to the communication unit 330 of the third electronic device 300. Also, in step S12, the third electronic device 300 may receive (acquire) the video and/or audio data transmitted from the communication unit 30 of the first electronic device 1 via the communication unit 330.

　ステップＳ１２において、第１電子機器１の制御部１０は、第１ユーザの映像及び音声の少なくとも一方をエンコードしてから送信してもよい。ここで、エンコードとは、映像及び／又は音声のデータを所定の規則に従って圧縮し、暗号化を含む目的に応じた形式に変換するものとしてよい。第１電子機器１は、ソフトウェアエンコード又はハードウェアエンコードなど、公知の種々のエンコードを行ってよい。この場合、第３電子機器３００は、通信部３０から受信するエンコードされた映像及び／又は音声のデータをデコードしてよい。ここで、デコードとは、エンコードされた映像及び／又は音声のデータの形式を、元の形式に戻すものとしてよい。第３電子機器３００は、ソフトウェアエンコード又はハードウェアエンコードなど、公知の種々のデコードを行ってよい。 In step S12, the control unit 10 of the first electronic device 1 may encode at least one of the video and audio of the first user before transmitting it. Here, encoding may mean compressing the video and/or audio data according to a predetermined rule and converting it into a format suited to the purpose, which may include encryption. The first electronic device 1 may perform any of various known encoding methods, such as software encoding or hardware encoding. In this case, the third electronic device 300 may decode the encoded video and/or audio data received from the communication unit 30. Here, decoding may mean returning the encoded video and/or audio data to its original format. The third electronic device 300 may perform any of various known decoding methods, such as software decoding or hardware decoding.
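The encode-then-decode round trip described for step S12 can be sketched as follows. This is only an illustration under stated assumptions: the description names no particular codec, so the standard-library `zlib` compressor stands in for a real audio/video encoder, and the function names `encode` and `decode` and the sample `frame` are hypothetical.

```python
import zlib


def encode(raw: bytes) -> bytes:
    """Sender side (first electronic device 1): compress raw media bytes.

    zlib is an assumption standing in for an actual audio/video codec.
    """
    return zlib.compress(raw)


def decode(encoded: bytes) -> bytes:
    """Receiver side (third electronic device 300): restore the original bytes."""
    return zlib.decompress(encoded)


# Hypothetical sample payload; real input would be captured video/audio data.
frame = b"\x00" * 1024 + b"audio-or-video-sample"
packet = encode(frame)      # what would be sent in step S12
restored = decode(packet)   # what the receiver recovers
```

The point of the sketch is only that decoding returns the data to its original form; a lossy media codec would not reproduce the input byte for byte.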

　次に、第３電子機器３００は、ステップＳ１２において取得した少なくとも１人の対話者候補に関する情報に基づいて、少なくとも１人の対話者候補を検出する（ステップＳ１３）。ここで、第３電子機器３００の検出部１１４は、ステップＳ１２において取得した少なくとも１人の対話者候補に関する映像及び／又は音声の情報に基づいて、少なくとも１人の対話者候補を検出してよい。例えば、第３電子機器３００の検出部１１４は、少なくとも１人の対話者候補に関する映像に基づく人物認識又は顔認識などによって、少なくとも１人の対話者候補を検出してよい。また、第３電子機器３００の検出部１１４は、少なくとも１人の対話者候補に関する音声認識などによって、少なくとも１人の対話者候補を検出してよい。 Next, the third electronic device 300 detects at least one interlocutor candidate based on the information about the at least one interlocutor candidate acquired in step S12 (step S13). Here, the detection unit 314 of the third electronic device 300 may detect at least one interlocutor candidate based on the video and/or audio information about the at least one interlocutor candidate acquired in step S12. For example, the detection unit 314 of the third electronic device 300 may detect at least one interlocutor candidate by person recognition or face recognition based on video of the at least one interlocutor candidate. The detection unit 314 of the third electronic device 300 may also detect at least one interlocutor candidate by voice recognition or the like for the at least one interlocutor candidate.

　例えば、ステップＳ１３において、第３電子機器３００は、第１電子機器１から所定の距離に存在する対話者候補を検出してよい。例えば、第３電子機器３００は、図１に示す第１電子機器１から２ｍ以内に存在する人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄを対話者候補として検出してもよい。この場合、は、図１に示す人物Ｍｅ及びＭｆは、対話者候補として検出されないものとしてよい。 For example, in step S13, the third electronic device 300 may detect interlocutor candidates that are present at a predetermined distance from the first electronic device 1. For example, the third electronic device 300 may detect, as interlocutor candidates, persons Ma, Mb, Mc, and Md that are present within 2 m of the first electronic device 1 shown in FIG. 1. In this case, persons Me and Mf shown in FIG. 1 may not be detected as interlocutor candidates.
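The distance-based detection in this example can be sketched as a simple filter. The 2 m threshold comes from the text; the `Person` type, the coordinate layout (first electronic device 1 at the origin), and every position value below are hypothetical and chosen only so that Ma to Md fall inside the range and Me and Mf fall outside it.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Person:
    label: str
    x: float  # metres from the first electronic device 1 (assumed at the origin)
    y: float


def detect_by_distance(people, max_distance_m=2.0):
    """Return labels of people within max_distance_m of the device."""
    return [p.label for p in people if math.hypot(p.x, p.y) <= max_distance_m]


# Hypothetical positions in conference room MR.
room = [
    Person("Ma", 0.5, 1.0),
    Person("Mb", -1.0, 1.0),
    Person("Mc", 1.2, -0.8),
    Person("Md", -0.6, -1.5),
    Person("Me", 3.0, 2.5),
    Person("Mf", -3.5, 1.0),
]
candidates = detect_by_distance(room)
```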

　ステップＳ１３において、第３電子機器３００は、種々の条件に基づいて、対話者候補を検出してよい。例えば、第３電子機器３００は、第１電子機器１の視線情報取得部９２によって取得される周囲の人物の視線情報に基づいて、所定以上の強度を示す視線情報が検出される人物を、対話者候補として検出してもよい。すなわち、第３電子機器３００は、第１電子機器１の周囲で第１電子機器１を比較的注視している人物を、対話者候補として検出してもよい。また、第３電子機器３００は、第１電子機器１の撮像部４０及び／又は視線情報取得部９２によって取得される周囲の人物の顔の向き及び／又は視線の方向に基づいて、第１電子機器１の方を向いている人物を、対話者候補として検出してもよい。すなわち、第３電子機器３００は、第１電子機器１の周囲で第１電子機器１の方を向いている人物、及び／又は、第１電子機器１に視線を向けている人を、対話者候補として検出してもよい。また、例えば、第３電子機器３００は、第１電子機器１の撮像部４０（及び／又は視線情報取得部９２）並びに音声入力部５０によって取得される情報に基づいて、対話者候補として検出してもよい。すなわち、第３電子機器３００は、例えば、第１電子機器１を比較的注視し又は第１電子機器１の方を向いていて、かつ、第１電子機器１に対して言葉を発している人物を、対話者候補として検出してもよい。また、第３電子機器３００は、第１電子機器１の撮像部４０によって取得される周囲の人物の位置に基づいて、対話者候補を検出してもよい。例えば、第３電子機器３００は、第１電子機器１から所定距離範囲内（例えば５メートル範囲内）にいる人物を、対話者候補として検出してもよい。また、例えば、第３電子機器３００は、上述した視線及び／又は顔の向き等から対話者候補として検出された人物のうち所定距離内にいる人物を対話者候補として検出してもよい。また、第３電子機器３００は、第１電子機器１の撮像部４０によって取得される周囲の人物の顔画像を認識することにより、対話者候補を検出してもよい。例えば、第３電子機器３００は、予め会議の参加者として登録された人物を顔認識によって特定し、当該人物を対話者候補として検出してもよい。また、第３電子機器３００は、第１電子機器１の撮像部４０によって取得される周囲の人物の動作によって、対話者候補を検出してもよい。例えば、第３電子機器３００は、対話者として認識されることを希望する所定の動作（例えば、挙手、手を振る等）を行う人物を対話者候補として検出してもよい。また、第３電子機器３００は、第１電子機器１の撮像部４０によって取得される周囲の人物の発話内容によって、対話者候補を検出してもよい。例えば、第３電子機器３００は、対話者として認識されることを希望する所定の発話内容（例えば、「こんにちは」、「（第１電子機器１のユーザ名）さん」、「おーい」、「聞こえますか」、「すみません」等）を発した人物を対話者候補として検出してもよい。 In step S13, the third electronic device 300 may detect interlocutor candidates based on various conditions. For example, the third electronic device 300 may detect, as interlocutor candidates, a person whose gaze information indicating a predetermined intensity or higher is detected based on the gaze information of surrounding people acquired by the gaze information acquisition unit 92 of the first electronic device 1. That is, the third electronic device 300 may detect, as interlocutor candidates, a person who is relatively gazing at the first electronic device 1 around the first electronic device 1. 
Also, the third electronic device 300 may detect, as an interlocutor candidate, a person who is facing the first electronic device 1, based on the face orientation and/or gaze direction of surrounding people acquired by the imaging unit 40 and/or the gaze information acquisition unit 92 of the first electronic device 1. That is, the third electronic device 300 may detect, as an interlocutor candidate, a person near the first electronic device 1 who is facing it and/or a person whose gaze is directed at it. Also, for example, the third electronic device 300 may detect an interlocutor candidate based on information acquired by the imaging unit 40 (and/or the gaze information acquisition unit 92) and the voice input unit 50 of the first electronic device 1. That is, the third electronic device 300 may detect, as an interlocutor candidate, for example, a person who is relatively gazing at or facing the first electronic device 1 and who is also speaking to it. Also, the third electronic device 300 may detect an interlocutor candidate based on the positions of surrounding people acquired by the imaging unit 40 of the first electronic device 1. For example, the third electronic device 300 may detect a person within a predetermined distance range (for example, within 5 meters) of the first electronic device 1 as an interlocutor candidate. Also, for example, among the persons detected as interlocutor candidates based on the above-mentioned gaze and/or face orientation, the third electronic device 300 may detect a person who is within a predetermined distance as an interlocutor candidate. The third electronic device 300 may also detect interlocutor candidates by recognizing face images of surrounding people acquired by the imaging unit 40 of the first electronic device 1. 
For example, the third electronic device 300 may identify, by face recognition, a person registered in advance as a participant of the conference and detect that person as an interlocutor candidate. The third electronic device 300 may also detect interlocutor candidates based on the actions of surrounding people acquired by the imaging unit 40 of the first electronic device 1. For example, the third electronic device 300 may detect, as an interlocutor candidate, a person who performs a predetermined action (e.g., raising a hand, waving, etc.) indicating a wish to be recognized as an interlocutor. The third electronic device 300 may also detect interlocutor candidates based on the utterance content of surrounding people acquired by the imaging unit 40 of the first electronic device 1. For example, the third electronic device 300 may detect, as an interlocutor candidate, a person who utters predetermined content (e.g., "Hello," "(user name of the first electronic device 1)," "Hey," "Can you hear me," "Excuse me," etc.) indicating a wish to be recognized as an interlocutor.
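The conditions listed for step S13 can be combined in many ways; the following is one minimal sketch that reports which conditions a person satisfies and treats anyone satisfying at least one as a candidate. The `Observation` fields, the 0.6 gaze threshold, the gesture names, and the "any condition suffices" policy are all assumptions for illustration; only the 5 m range and the example phrases come from the text.

```python
from dataclasses import dataclass
from typing import Optional

# Example phrases from the description, lower-cased for matching.
GREETINGS = ("hello", "hey", "can you hear me", "excuse me")
# Hypothetical gesture labels for "raising a hand" and "waving".
GESTURES = {"raise_hand", "wave"}


@dataclass
class Observation:
    label: str
    gaze_intensity: float = 0.0          # assumed 0..1 score
    facing_device: bool = False
    distance_m: Optional[float] = None   # None when position is unknown
    registered: bool = False             # pre-registered participant (face recognition)
    gesture: Optional[str] = None
    utterance: str = ""


def detection_reasons(obs, gaze_threshold=0.6, max_distance_m=5.0):
    """List the detection conditions that this observation satisfies."""
    reasons = []
    if obs.gaze_intensity >= gaze_threshold:
        reasons.append("gaze")
    if obs.facing_device:
        reasons.append("facing")
    if obs.distance_m is not None and obs.distance_m <= max_distance_m:
        reasons.append("within_range")
    if obs.registered:
        reasons.append("registered")
    if obs.gesture in GESTURES:
        reasons.append("gesture")
    if any(g in obs.utterance.lower() for g in GREETINGS):
        reasons.append("utterance")
    return reasons


def is_candidate(obs):
    """Assumed policy: satisfying any one condition is enough."""
    return bool(detection_reasons(obs))
```

A stricter policy, such as requiring both "gaze" and "within_range" as the text also allows, would replace `is_candidate` with a check on the specific reasons returned.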

　このように、第３電子機器３００の取得部３１２は、少なくとも１人の対話者候補に関する情報を第１電子機器１から取得してよい。また、第３電子機器３００検出部３１４は、少なくとも１人の対話者候補に関する情報に基づいて、少なくとも１人の対話者候補を検出してよい。 In this way, the acquisition unit 312 of the third electronic device 300 may acquire information about at least one interlocutor candidate from the first electronic device 1. Furthermore, the detection unit 314 of the third electronic device 300 may detect at least one interlocutor candidate based on the information about the at least one interlocutor candidate.

　次に、第３電子機器３００は、検出された対話者候補に関する情報を、第２電子機器１００に送信する（ステップＳ１４）。例えば、第３電子機器３００は、検出された対話者候補として、対話者候補Ｍａ，Ｍｂ，Ｍｃ，及びＭｄに関する情報を第２電子機器１００に送信してよい。具体的には、ステップＳ１４において、第３電子機器３００は、検出された対話者候補に関する情報を、通信部３３０から、第２電子機器１００の通信部１３０に送信してよい。また、ステップＳ１４において、第２電子機器１００は、第３電子機器３００の通信部３３０から送信される対話者候補に関する情報を、通信部１３０によって受信してよい。 Next, the third electronic device 300 transmits information about the detected interlocutor candidates to the second electronic device 100 (step S14). For example, the third electronic device 300 may transmit information about interlocutor candidates Ma, Mb, Mc, and Md as the detected interlocutor candidates to the second electronic device 100. Specifically, in step S14, the third electronic device 300 may transmit information about the detected interlocutor candidates from the communication unit 330 to the communication unit 130 of the second electronic device 100. Also, in step S14, the second electronic device 100 may receive the information about the interlocutor candidates transmitted from the communication unit 330 of the third electronic device 300 via the communication unit 130.

　第２電子機器１００は、対話者候補に関する情報を受信したら、当該対話者候補に関する情報を、第２ユーザ（例えば対話者Ｍｇ）に提示してもよい。この場合、第２電子機器１００は、表示部１７０及び音声出力部１６０の少なくとも一方から、例えば対話者候補Ｍａ，Ｍｂ，Ｍｃ，及びＭｄの映像及び音声の少なくとも一方を、第２ユーザ（例えば対話者Ｍｇ）に提示してよい。 When the second electronic device 100 receives information about the interlocutor candidates, it may present the information about the interlocutor candidates to the second user (e.g., interlocutor Mg). In this case, the second electronic device 100 may present at least one of the video and audio of the interlocutor candidates Ma, Mb, Mc, and Md to the second user (e.g., interlocutor Mg) from at least one of the display unit 170 and the audio output unit 160.

　例えば、図１に示した会議室ＭＲにいる各人物が第１電子機器１の撮像部４０によって撮像される場合、第２電子機器１００は、図７に示すように撮像された各人物の映像を表示部１７０に表示してもよい。図７に示すように、表示部１７０は、人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄが会議室ＭＲのデスクの周囲に着席している様子を表示している。また、図７に示すように、表示部１７０は、人物Ｍｅ及びＭｆが会議室ＭＲのデスクから少し離れた場所で立っている様子を表示している。ここで、上述のように、第３電子機器３００は、人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄを対話者候補として検出し、図１に示す人物Ｍｅ及びＭｆを対話者候補として検出しなかったものとする。この場合、第２電子機器１００は、表示部１７０に表示する映像によって、対話者候補として検出された人物と、対話者候補としてされなかった人物とを区別可能に表示する。一例として、第２電子機器１００は、図８に示すように、人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄの周囲又は近傍などに例えばオブジェクトＯｂ１を表示することにより、人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄが対話者候補として検出されていることを示してよい。図８において、人物Ｍｅ及びＭｆの周囲又は近傍などにオブジェクトＯｂ１が表示されないことにより、人物Ｍｅ及びＭｆが対話者候補として検出されなかったことを示している。 For example, when each person in the conference room MR shown in FIG. 1 is imaged by the imaging unit 40 of the first electronic device 1, the second electronic device 100 may display the image of each person on the display unit 170 as shown in FIG. 7. As shown in FIG. 7, the display unit 170 displays the persons Ma, Mb, Mc, and Md seated around the desk in the conference room MR. Also, as shown in FIG. 7, the display unit 170 displays the persons Me and Mf standing at a location a little away from the desk in the conference room MR. Here, as described above, it is assumed that the third electronic device 300 detects the persons Ma, Mb, Mc, and Md as interlocutor candidates, and does not detect the persons Me and Mf shown in FIG. 1 as interlocutor candidates. In this case, the second electronic device 100 displays the persons detected as interlocutor candidates and the persons not detected as interlocutor candidates in a manner that allows them to be distinguished from each other by the image displayed on the display unit 170. As an example, the second electronic device 100 may indicate that the persons Ma, Mb, Mc, and Md have been detected as interlocutor candidates by displaying, for example, an object Ob1 around or near the persons Ma, Mb, Mc, and Md, as shown in FIG. 8. In FIG. 
8, the object Ob1 is not displayed around or near the persons Me and Mf, indicating that the persons Me and Mf have not been detected as interlocutor candidates.

　図８においては、人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄの周囲又は近傍などに例えばオブジェクトＯｂ１を表示することにより、当該人物が対話者候補として検出されていることを示した。一実施形態において、第２電子機器１００は、例えば音声ガイドによって、人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄが対話者候補として検出されていることを示してもよい。また、一実施形態において、第２電子機器１００は、例えば音声ガイドによって、人物Ｍｅ及びＭｆが対話者候補として検出されなかったことを示してもよい。具体的には、音声ガイドとして、対話者候補として検出された人物、又は対話者候補として検出されなかった人物の名前などを読み上げても良い。 In FIG. 8, for example, object Ob1 is displayed around or near persons Ma, Mb, Mc, and Md to indicate that the persons have been detected as interlocutor candidates. In one embodiment, the second electronic device 100 may indicate, for example, by audio guidance, that persons Ma, Mb, Mc, and Md have been detected as interlocutor candidates. In another embodiment, the second electronic device 100 may indicate, for example, by audio guidance, that persons Me and Mf have not been detected as interlocutor candidates. Specifically, the audio guidance may read out the names of persons detected as interlocutor candidates or persons not detected as interlocutor candidates.

　ステップＳ１４の次に、第２電子機器１００の選出部１１６は、少なくとも１人の対話者候補から、少なくとも１人の対話者を選出する（ステップＳ１５）。ステップＳ１５において、第２電子機器１００の選出部１１６は、少なくとも１人の対話者を、第２電子機器１００のユーザによる入力に基づいて選出してもよい。すなわち、第２電子機器１００のユーザは、検出された対話者候補の中から、対話者を選出する入力を行うことができる。このように、第２電子機器１００において、取得部１１２は、少なくとも１人の対話者候補に関する情報を取得してよい。第２電子機器１００において、選出部１１６は、検出された少なくとも１人の対話者候補から、少なくとも１人の対話者を選出してよい。選出部１１６は、少なくとも１人の対話者を、第２電子機器１００のユーザによる入力に基づいて選出してよい。 After step S14, the selection unit 116 of the second electronic device 100 selects at least one interlocutor from the at least one interlocutor candidate (step S15). In step S15, the selection unit 116 of the second electronic device 100 may select at least one interlocutor based on an input by the user of the second electronic device 100. That is, the user of the second electronic device 100 can perform an input to select an interlocutor from the detected interlocutor candidates. In this way, in the second electronic device 100, the acquisition unit 112 may acquire information on at least one interlocutor candidate. In the second electronic device 100, the selection unit 116 may select at least one interlocutor from the at least one interlocutor candidate detected. The selection unit 116 may select at least one interlocutor based on an input by the user of the second electronic device 100.

　第２電子機器１００において対話者が選出される態様は、各種想定することができる。例えば、第２電子機器１００は、図９に示すように、表示部１７０において対話者候補を示すオブジェクトＯｂ１の近傍などにコンテキストメニューＣｍを表示することで、ユーザによる入力を促してもよい。例えば、第２電子機器１００は、例えば対話者候補Ｍｂの近傍にポインタＰｔが移動されることにより、又は対話者候補Ｍｂの近傍においてクリックが入力されることにより、対話者候補ＭｂについてコンテキストメニューＣｍが表示されるようにしてよい。ここで、第２電子機器１００は、対話者候補として検出された人物を全員、一旦対話者として設定してもよい。この場合、第２電子機器１００は、その後、ユーザの入力によって対話者として不要な人物を対話者から除外させてもよい。 An interlocutor may be selected in the second electronic device 100 in various ways. For example, as shown in FIG. 9, the second electronic device 100 may prompt the user for input by displaying a context menu Cm near an object Ob1 indicating an interlocutor candidate on the display unit 170. For example, the second electronic device 100 may display the context menu Cm for the interlocutor candidate Mb when the pointer Pt is moved near the interlocutor candidate Mb or when a click is input near the interlocutor candidate Mb. Here, the second electronic device 100 may first provisionally set all persons detected as interlocutor candidates as interlocutors. In this case, the second electronic device 100 may subsequently remove persons who are not needed as interlocutors, based on user input.

　第２電子機器１００は、図９に示すコンテキストメニューＣｍにおいて、ユーザによる入力に基づいて「追加」が選択されることにより、対話者候補Ｍｂを対話者として選出してよい。一方、第２電子機器１００は、図９に示すコンテキストメニューＣｍにおいて、ユーザによる入力に基づいて「除外」が選択されることにより、対話者又は対話者候補から人物Ｍｂが除外されるようにしてもよい。また、第２電子機器１００は、図９に示すコンテキストメニューＣｍにおいて、ユーザによる入力に基づいて「保留」が選択されることにより、人物Ｍｂが対話者として選出されることを保留するようにしてもよい。 The second electronic device 100 may select the candidate interlocutor Mb as an interlocutor by selecting "Add" based on user input in the context menu Cm shown in FIG. 9. On the other hand, the second electronic device 100 may remove person Mb from interlocutors or interlocutor candidates by selecting "Remove" based on user input in the context menu Cm shown in FIG. 9. Also, the second electronic device 100 may suspend the selection of person Mb as an interlocutor by selecting "Suspend" based on user input in the context menu Cm shown in FIG. 9.
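The three context-menu choices can be modeled as a small state update. The menu labels "Add", "Remove", and "Suspend" follow the text; the `Status` enum, its member names, and the function `apply_menu_choice` are hypothetical names introduced for this sketch.

```python
from enum import Enum


class Status(Enum):
    CANDIDATE = "candidate"
    INTERLOCUTOR = "interlocutor"
    EXCLUDED = "excluded"
    ON_HOLD = "on_hold"


# Menu label -> resulting status for the person the menu was opened on.
MENU_ACTIONS = {
    "Add": Status.INTERLOCUTOR,
    "Remove": Status.EXCLUDED,
    "Suspend": Status.ON_HOLD,
}


def apply_menu_choice(statuses, person, choice):
    """Return a new status mapping after the user picks a menu item."""
    if person not in statuses:
        raise KeyError(f"unknown person: {person}")
    if choice not in MENU_ACTIONS:
        raise ValueError(f"unknown menu choice: {choice}")
    updated = dict(statuses)  # leave the caller's mapping untouched
    updated[person] = MENU_ACTIONS[choice]
    return updated


statuses = {p: Status.CANDIDATE for p in ("Ma", "Mb", "Mc", "Md")}
after_add = apply_menu_choice(statuses, "Mb", "Add")
```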

　図９においては、第２電子機器１００の入力部１９０に対するユーザによる入力（例えばマウス操作及び／又はクリックなど）を検出することにより、対話者が選出される例を示した。ここで、対話者候補から対話者が選出される態様は、各種想定することができる。例えば、第２電子機器１００は、撮像部１４０（及び／又は視線情報取得部１９２）によってユーザが所定時間（例えば３秒など）注視していると判定される対話者候補を、対話者として自動的に選出してもよい。また、第２電子機器１００は、ある人物に対するユーザの所定時間の注視及び／又は入力部１９０に対する入力に加えて、さらに所定のコマンドの検出に基づいて、対話者候補を対話者として選出してもよい。ここで、所定のコマンドとは、例えば、音声入力部１５０に対するユーザの「追加」という発声などとしてもよい。例えば、第２電子機器１００は、音声入力部１５０に対してユーザが入力する人物の名前によって特定される対話者候補を対話者として選出してもよい。また、第２電子機器１００は、例えば対話者候補の１人を表示部１７０において強調表示するなど、他の対話者候補とは異なる表示態様で表示して、音声出力部１６０から「この人物を発話者に追加しますか？」などとユーザに問いかけてもよい。この場合、第２電子機器１００は、音声入力部１６０によってユーザの「はい」又は「追加」などの発声を検出することにより、当該人物を対話者として選出してもよい。 FIG. 9 shows an example in which an interlocutor is selected by detecting a user's input (e.g., a mouse operation and/or a click) to the input unit 190 of the second electronic device 100. Here, an interlocutor may be selected from the interlocutor candidates in various ways. For example, the second electronic device 100 may automatically select, as an interlocutor, an interlocutor candidate that the imaging unit 140 (and/or the gaze information acquisition unit 192) determines the user has gazed at for a predetermined time (e.g., 3 seconds). The second electronic device 100 may also select an interlocutor candidate as an interlocutor based on the detection of a predetermined command, in addition to the user's gaze on a certain person for a predetermined time and/or input to the input unit 190. Here, the predetermined command may be, for example, the user's utterance of "add" to the voice input unit 150. For example, the second electronic device 100 may select, as an interlocutor, an interlocutor candidate identified by a person's name that the user inputs to the voice input unit 150. 
Furthermore, the second electronic device 100 may display one of the interlocutor candidates in a display mode different from the other interlocutor candidates, for example by highlighting the person on the display unit 170, and may ask the user from the voice output unit 160, for example, "Do you want to add this person to the speakers?" In this case, the second electronic device 100 may select the person as an interlocutor by detecting the user's utterance of "yes" or "add" with the voice input unit 150.
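Automatic selection by gaze dwell time can be sketched as below. The 3-second threshold is the example from the text; the input format (a time-ordered list of `(timestamp_seconds, gazed_label_or_None)` samples), the function name `select_by_dwell`, and the sample data are assumptions, since the description does not specify how the gaze information acquisition unit 192 reports its measurements.

```python
def select_by_dwell(samples, candidates, dwell_s=3.0):
    """Select candidates the user has gazed at continuously for dwell_s seconds.

    samples: time-ordered (timestamp_seconds, label or None) pairs (assumed format).
    candidates: set of labels eligible for selection.
    """
    selected = []
    current, start = None, None
    for t, target in samples:
        if target != current:
            # Gaze moved to a different target: restart the dwell timer.
            current, start = target, t
        if current in candidates and t - start >= dwell_s and current not in selected:
            selected.append(current)
    return selected


# Hypothetical gaze trace: a short glance at Mb, then a sustained look at Mc.
trace = [(0.0, "Mb"), (1.0, "Mb"), (2.0, "Mc"), (3.0, "Mc"),
         (4.0, "Mc"), (5.0, "Mc"), (6.0, None)]
picked = select_by_dwell(trace, {"Ma", "Mb", "Mc", "Md"})
```

Here Mb is looked at for only about one second and is not selected, while Mc reaches the three-second dwell at the 5.0 s sample.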

　同様に、対話者又は対話者候補から人物が除外される態様も、各種想定することができる。例えば、第２電子機器１００は、ある人物に対するユーザの所定時間の注視及び／又は入力部１９０に対する入力に加えて、さらに所定のコマンドの検出に基づいて、当該人物を対話者又は対話者候補から除外してよい。ここで、所定のコマンドとは、例えば、音声入力部１５０に対するユーザの「除外」又は「削除」という発声などとしてもよい。また、第２電子機器１００は、例えば対話者候補の１人を表示部１７０において強調表示するなど、他の対話者候補とは異なる表示態様で表示して、音声出力部１６０から「この人物を発話者から除外しますか？」などとユーザに問いかけてもよい。この場合、第２電子機器１００は、音声入力部１６０によってユーザの「はい」又は「除外」などの発声を検出することにより、当該人物を対話者又は対話者候補から除外してよい。さらに、例えば、ある人物に対するユーザの注視がほとんどない場合、及び／又は、音声入力部１６０に対する音声入力がほとんどない場合、例えば所定時間の経過に基づいて、当該人物を対話者又は対話者候補から除外してもよい。例えば、第２電子機器１００は、第１電子機器１から所定距離以上離れた人物を、対話者又は対話者候補から除外してもよい。例えば、第２電子機器１００は、所定の基準に照らして不適切な言動をとる人物（例えば暴言を吐く、等）を、対話者又は対話者候補から除外してもよい。 Similarly, a person may be excluded from the interlocutors or interlocutor candidates in various ways. For example, the second electronic device 100 may exclude a person from the interlocutors or interlocutor candidates based on the detection of a predetermined command, in addition to the user's gaze on that person for a predetermined time and/or input to the input unit 190. Here, the predetermined command may be, for example, the user's utterance of "exclude" or "delete" to the voice input unit 150. In addition, the second electronic device 100 may display one of the interlocutor candidates in a display mode different from the other interlocutor candidates, such as by highlighting the person on the display unit 170, and ask the user from the voice output unit 160, "Do you want to exclude this person from the speakers?" In this case, the second electronic device 100 may exclude the person from the interlocutors or interlocutor candidates by detecting the user's utterance of "yes" or "exclude" with the voice input unit 150. Furthermore, for example, if the user hardly gazes at a certain person and/or if there is hardly any voice input to the voice input unit 150, the person may be excluded from the interlocutors or interlocutor candidates, for example, based on the passage of a predetermined time. 
For example, the second electronic device 100 may exclude a person who is a predetermined distance or more away from the first electronic device 1 from interlocutors or interlocutor candidates. For example, the second electronic device 100 may exclude a person who behaves inappropriately in light of a predetermined standard (for example, who uses abusive language, etc.) from interlocutors or interlocutor candidates.
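The automatic exclusion conditions in this paragraph (inactivity over time, distance from the first electronic device 1, and inappropriate behavior) can be sketched as a single rule check. The description gives no concrete limits, so the 300-second idle limit and 5 m distance limit are placeholder assumptions, as are the `PersonState` fields and the order in which the rules are evaluated.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PersonState:
    label: str
    seconds_without_attention: float   # time with almost no gaze or voice input
    distance_m: float                  # distance from the first electronic device 1
    flagged_inappropriate: bool = False


def exclusion_reason(state, idle_limit_s=300.0, max_distance_m=5.0) -> Optional[str]:
    """Return why a person should be excluded, or None to keep them.

    The limits are hypothetical stand-ins for the "predetermined" values.
    """
    if state.flagged_inappropriate:
        return "inappropriate_behavior"
    if state.distance_m >= max_distance_m:
        return "too_far"
    if state.seconds_without_attention >= idle_limit_s:
        return "inactive"
    return None
```

How a person comes to be flagged as behaving inappropriately is outside this sketch; the text only says it is judged against a predetermined standard.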

　同様に、対話者候補から選出される対話者が保留される態様も、各種想定することができる。例えば、第２電子機器１００は、例えば対話者候補の１人を表示部１７０において強調表示するなど、他の対話者候補とは異なる表示態様で表示して、音声出力部１６０から「この人物を発話者に追加しますか？」などとユーザに問いかけてもよい。この場合、第２電子機器１００は、音声入力部１６０によってユーザの「いいえ」又は「保留」などの発声を検出することにより、当該人物を対話者に追加するのを保留してよい。さらに、例えば、ある人物に対するユーザの注視がほとんどない場合、及び／又は、音声入力部１６０に対する音声入力がほとんどない場合、例えば所定時間の経過に基づいて、当該人物を対話者に追加するのを保留してもよい。 Similarly, the selection of an interlocutor from among the interlocutor candidates may be put on hold in various ways. For example, the second electronic device 100 may display one of the interlocutor candidates in a display mode different from the other interlocutor candidates, such as by highlighting the candidate on the display unit 170, and ask the user from the voice output unit 160, "Do you want to add this person to the speakers?" In this case, the second electronic device 100 may put adding the person to the interlocutors on hold by detecting the user's utterance of "no" or "hold" with the voice input unit 150. Furthermore, for example, when the user hardly gazes at a certain person and/or when there is hardly any voice input to the voice input unit 150, the second electronic device 100 may put adding the person to the interlocutors on hold, for example, based on the passage of a predetermined time.

　ステップＳ１５において対話者候補から対話者が選出されたら、第２電子機器１００は、選出された対話者を示す情報を、第３電子機器３００に送信する（ステップＳ１６）。選出された対話者を示す情報を受信すると、第３電子機器３００の特定部３１８は、選出された対話者を示す情報に基づいて、対話者候補から対話者を特定する（ステップＳ１７）。例えば、ステップＳ１３において対話者候補として検出された人物Ｍａ，Ｍｂ，Ｍｃ，及びＭｄのうち、第２電子機器１００のユーザによる入力に基づいて、人物Ｍｃ及びＭｄが対話者として選出されたとする。この場合、ステップＳ１６において、第２電子機器１００は、選出された対話者が人物Ｍｃ及びＭｄである旨を示す情報を、第３電子機器３００に送信する。そして、ステップＳ１７において、第３電子機器３００は、対話者候補Ｍａ，Ｍｂ，Ｍｃ，及びＭｄの中から、対話者Ｍｃ及びＭｄを特定する。 When an interlocutor is selected from the interlocutor candidates in step S15, the second electronic device 100 transmits information indicating the selected interlocutor to the third electronic device 300 (step S16). Upon receiving the information indicating the selected interlocutor, the identification unit 318 of the third electronic device 300 identifies the interlocutor from the interlocutor candidates based on the information indicating the selected interlocutor (step S17). For example, among the persons Ma, Mb, Mc, and Md detected as interlocutor candidates in step S13, persons Mc and Md are selected as interlocutors based on an input by the user of the second electronic device 100. In this case, in step S16, the second electronic device 100 transmits information indicating that the selected interlocutors are persons Mc and Md to the third electronic device 300. Then, in step S17, the third electronic device 300 identifies interlocutors Mc and Md from the interlocutor candidates Ma, Mb, Mc, and Md.
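Steps S16 and S17 amount to sending a list of selected persons and partitioning the candidate list accordingly. A minimal sketch, assuming persons are identified by plain string labels and that a selection naming someone who was never detected as a candidate should be rejected (the description does not say how such a case is handled):

```python
def identify_interlocutors(candidates, selected):
    """Split candidates into (interlocutors, non-selected candidates).

    candidates: labels detected in step S13, in detection order.
    selected: labels received from the second electronic device 100 in step S16.
    """
    unknown = [s for s in selected if s not in candidates]
    if unknown:
        # Assumed policy: a selection must refer only to detected candidates.
        raise ValueError(f"not detected as candidates: {unknown}")
    chosen = set(selected)
    interlocutors = [c for c in candidates if c in chosen]
    others = [c for c in candidates if c not in chosen]
    return interlocutors, others


interlocutors, others = identify_interlocutors(
    ["Ma", "Mb", "Mc", "Md"],  # candidates from step S13
    ["Mc", "Md"],              # selection sent in step S16
)
```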

　ステップＳ１７において対話者が特定されると、第３電子機器３００は、選出された対話者と、対話者候補のうち選出された対話者以外の人物とで、異なる処理を実行することができる。「対話者候補のうち選出された対話者以外の人物」とは、対話者に選出されない対話者候補としてよい。すなわち、第３電子機器３００は、選出された対話者のみに特定の処理を実行することができる。したがって、以後、第３電子機器３００は、選出された対話者に対して、選出された対話者との対話に関する所定の処理を行う（ステップＳ１８）。ステップＳ１８において行う動作を、「対話者に対して行う所定の処理」又は「対話者に対する所定の処理」とも記す。ステップＳ１８以後、第３電子機器３００は、第１電子機器１及び／又は第２電子機器２００が、選出された対話者と、対話者候補のうち選出された対話者以外の人物とで、異なる処理を実行するように制御を行うことができる。そこで、第３電子機器３００は、選出された対話者に対する所定の処理に基づいて、第２電子機器１００を制御してよい（ステップＳ１９）。また、第３電子機器３００は、選出された対話者に対する所定の処理に基づいて、第１電子機器１を制御してもよい（ステップＳ２０）。このように、制御部３１０は、少なくとも１人の対話者候補から第２電子機器１００によって選出された少なくとも１人の対話者との対話に対して所定の処理を実行するように、第１電子機器１及び第２電子機器２００の少なくとも一方を制御してよい。また、制御部３１０は、対話者に選出されない対話者候補に対しては所定の処理を実行しないように、第１電子機器１及び第２電子機器２００の少なくとも一方を制御してよい。 When the interlocutor is identified in step S17, the third electronic device 300 can execute different processes for the selected interlocutor and for the persons among the interlocutor candidates other than the selected interlocutor. "A person other than the selected interlocutor among the interlocutor candidates" may be an interlocutor candidate who is not selected as an interlocutor. In other words, the third electronic device 300 can execute a specific process only for the selected interlocutor. Therefore, from this point on, the third electronic device 300 performs, for the selected interlocutor, a predetermined process related to the dialogue with the selected interlocutor (step S18). The operation performed in step S18 is also referred to as "a predetermined process performed on the interlocutor" or "a predetermined process for the interlocutor". After step S18, the third electronic device 300 can control the first electronic device 1 and/or the second electronic device 100 so that they execute different processes for the selected interlocutor and for the persons among the interlocutor candidates other than the selected interlocutor. Accordingly, the third electronic device 300 may control the second electronic device 100 based on the predetermined process for the selected interlocutor (step S19). The third electronic device 300 may also control the first electronic device 1 based on the predetermined process for the selected interlocutor (step S20). In this way, the control unit 310 may control at least one of the first electronic device 1 and the second electronic device 100 to execute the predetermined process for a dialogue with at least one interlocutor selected by the second electronic device 100 from the at least one interlocutor candidate. The control unit 310 may also control at least one of the first electronic device 1 and the second electronic device 100 not to execute the predetermined process for an interlocutor candidate not selected as an interlocutor.

　このようにして、第１電子機器１の制御部１０は、少なくとも１人の対話者候補から選出された少なくとも１人の対話者との対話に関する所定の処理を実行する。この場合、制御部１０は、少なくとも１人の対話者として、他の電子機器（第２電子機器１００）によって選出された対話者との対話に関する所定の処理を実行してもよい。また、第２電子機器１００の制御部１１０は、少なくとも１人の対話者との対話に関する所定の処理を実行する。 In this way, the control unit 10 of the first electronic device 1 executes a predetermined process related to a dialogue with at least one interlocutor selected from at least one interlocutor candidate. In this case, the control unit 10 may execute a predetermined process related to a dialogue with an interlocutor selected by another electronic device (the second electronic device 100) as at least one interlocutor. Also, the control unit 110 of the second electronic device 100 executes a predetermined process related to a dialogue with at least one interlocutor.

Various kinds of processing can be envisaged as the predetermined process executed in steps S18 to S20 described above. For example, assume that interlocutors Mc and Md have been selected from interlocutor candidates Ma, Mb, Mc, and Md, as described above. In this case, the third electronic device 300 may control the second electronic device 100 so that interlocutors Mc and Md are displayed on the display unit 170 in a display mode different from that of the other interlocutor candidates. For example, as shown in FIG. 10, the second electronic device 100 may indicate that persons Mc and Md have been selected as interlocutors from interlocutor candidates Ma, Mb, Mc, and Md by attaching to interlocutors Mc and Md an object Ob1 or the like that differs from what is attached to the others. In this case, interlocutors Mc and Md may also be displayed in a display mode different from that of the other interlocutor candidates Ma, Mb, and so on, for example by highlighting interlocutors Mc and Md on the display unit 170. This allows the user of the second electronic device 100 to see, by looking at the display unit 170, that persons Mc and Md have been selected as interlocutors.

Furthermore, for example, when persons Mc and Md have been selected as interlocutors from interlocutor candidates Ma, Mb, Mc, and Md, the second electronic device 100 may, as the predetermined process, enlarge interlocutors Mc and Md and display them on the display unit 170. In this case, the second electronic device 100 may, for example, crop out only the images of interlocutors Mc and Md and display them on the display unit 170. The second electronic device 100 may also display only interlocutors Mc and Md on, for example, a sub-display installed separately from the display unit 170. As another example, when persons Mc and Md have been selected as interlocutors, the second electronic device 100 may blur the interlocutor candidates other than persons Mc and Md when displaying them on the display unit 170, or may remove those candidates from the display of the display unit 170 altogether. In this manner, the second electronic device 100 may include a display unit 170 that displays information about the at least one interlocutor candidate. The control unit 110 of the second electronic device 100 may execute, as the predetermined process related to the dialogue with the interlocutor, a process of displaying information about the interlocutor in a manner different from information about the interlocutor candidates other than the interlocutor.
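
The display variations described above (highlighting or enlarging the interlocutors, blurring or removing the other candidates) can be summarized as a per-person display plan. The following Python sketch is illustrative only: `DisplayMode`, `plan_display`, and `visible` are hypothetical names, and actual rendering on the display unit 170 is out of scope.

```python
from enum import Enum


class DisplayMode(Enum):
    """Hypothetical display modes mirroring the examples in the text."""
    NORMAL = "normal"
    HIGHLIGHT = "highlight"
    ENLARGE = "enlarge"
    BLUR = "blur"
    HIDDEN = "hidden"


def plan_display(candidates, interlocutors,
                 interlocutor_mode=DisplayMode.HIGHLIGHT,
                 others_mode=DisplayMode.BLUR):
    """Assign one display mode to interlocutors and another to everyone else."""
    return {
        person: interlocutor_mode if person in interlocutors else others_mode
        for person in candidates
    }


def visible(plan):
    """People who still appear on screen under the given plan."""
    return [person for person, mode in plan.items() if mode is not DisplayMode.HIDDEN]


candidates = ["Ma", "Mb", "Mc", "Md"]
blur_plan = plan_display(candidates, {"Mc", "Md"})
hide_plan = plan_display(candidates, {"Mc", "Md"},
                         interlocutor_mode=DisplayMode.ENLARGE,
                         others_mode=DisplayMode.HIDDEN)
```

Keeping the plan separate from the rendering makes it easy to switch between, say, blurring and removing non-selected candidates without touching the selection logic.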

Furthermore, for example, when persons Mc and Md have been selected as interlocutors from interlocutor candidates Ma, Mb, Mc, and Md, the second electronic device 100 may, as the predetermined process, indicate on the display unit 170 that interlocutor Mc or Md is speaking, based on the speech of that interlocutor. In this case, when a person other than interlocutor Mc or Md speaks, the second electronic device 100 may refrain from indicating on the display unit 170 that the person is speaking.
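
As a small illustration of this filtering, the Python sketch below turns a sequence of detected speakers into "is speaking" notices for selected interlocutors only. The function name `speaking_indicators` and the notice text are assumptions made for the example, not part of the disclosure.

```python
def speaking_indicators(speech_events, interlocutors):
    """Return 'is speaking' notices only for speakers selected as interlocutors.

    speech_events: person IDs in the order their speech was detected.
    interlocutors: set of person IDs selected as interlocutors.
    """
    return [
        f"{speaker} is speaking"
        for speaker in speech_events
        if speaker in interlocutors  # speech by other people is ignored
    ]


notices = speaking_indicators(["Ma", "Mc", "Mb", "Md"], {"Mc", "Md"})
```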

Furthermore, for example, when persons Mc and Md have been selected as interlocutors from interlocutor candidates Ma, Mb, Mc, and Md, the second electronic device 100 may, as the predetermined process, output from the audio output unit 160 audio indicating that interlocutors Mc and Md have been selected. In this manner, the second electronic device 100 may include an audio output unit 160 that outputs information about the at least one interlocutor candidate as audio. The control unit 110 of the second electronic device 100 may execute, as the predetermined process related to the dialogue with the interlocutor, a process of outputting information about the interlocutor as audio that differs from the audio used for information about the interlocutor candidates other than the interlocutor.

Also, for example, when persons Mc and Md have been selected as interlocutors from interlocutor candidates Ma, Mb, Mc, and Md, the third electronic device 300 may control the first electronic device 1. For example, when the first electronic device 1 includes a structure shaped like the face of a doll or a robot, the first electronic device 1 may, as the predetermined process, cause the doll or robot to perform a predetermined action toward person Mc or Md. Specifically, when person Mg is speaking to, or looking toward, person Mc or Md, the first electronic device 1 may drive the driving unit 80 so that the first electronic device 1 turns its face, or directs its line of sight, toward interlocutors Mc and Md. In this case, even if person Mg speaks to or looks toward a person other than person Mc or Md, the first electronic device 1 may be configured not to turn its face or direct its line of sight toward interlocutor Mc or Md. In this way, the control unit 310 of the third electronic device 300 may execute, as the predetermined process related to the dialogue with the interlocutor, a process of controlling another electronic device (the first electronic device 1) so that it performs, toward the interlocutor, an operation different from the operation performed toward the interlocutor candidates other than the interlocutor. Likewise, the control unit 310 of the third electronic device 300 may execute, as the predetermined process, a process of controlling another electronic device (the second electronic device 100) so that it performs, toward the interlocutor, an operation different from the operation performed toward the interlocutor candidates other than the interlocutor.
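
One way to picture the face-turning behavior is as a function that yields a yaw angle for the driving unit 80 only when the person being addressed is a selected interlocutor. The Python sketch below is a simplification under several assumptions: positions are 2D coordinates in metres with the first electronic device at the origin and the x-axis pointing straight ahead, and `head_yaw_command` is a hypothetical name.

```python
import math


def head_yaw_command(addressed, interlocutors, positions):
    """Return a yaw angle in degrees toward the addressed person, or None.

    addressed: ID of the person that person Mg is speaking to or looking at.
    interlocutors: set of IDs selected as interlocutors.
    positions: mapping of ID -> (x, y) relative to the device (assumed frame).
    """
    if addressed not in interlocutors:
        # Not a selected interlocutor: issue no drive command.
        return None
    x, y = positions[addressed]
    return math.degrees(math.atan2(y, x))


positions = {"Ma": (1.0, -1.0), "Mc": (1.0, 1.0), "Md": (2.0, 0.0)}
yaw_mc = head_yaw_command("Mc", {"Mc", "Md"}, positions)
yaw_md = head_yaw_command("Md", {"Mc", "Md"}, positions)
yaw_ma = head_yaw_command("Ma", {"Mc", "Md"}, positions)
```

Returning `None` rather than a default angle keeps the "do nothing for non-selected candidates" case explicit for whatever code drives the motor.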

The operation shown in FIG. 6 may be executed repeatedly from the start at appropriate timing. For example, suppose that, in the situation shown in FIG. 10, interlocutor Mg has been conversing with interlocutors Mc and Md, and that person Me and/or Mf shown in FIG. 10 then approaches the first electronic device 1. Suppose the operation shown in FIG. 6 starts in this situation. In this case, for example in step S13, person Me and/or Mf is detected as an interlocutor candidate if that person is within a predetermined distance from the first electronic device 1. In step S14, the third electronic device 300 then transmits information about the new interlocutor candidate Me and/or Mf to the second electronic device 100. Accordingly, in step S15, the user of the second electronic device 100 can choose whether or not to select the new interlocutor candidate Me and/or Mf as an interlocutor. In this case, the second electronic device 100 may notify its user of the presence of the new interlocutor candidate Me and/or Mf. For example, the second electronic device 100 may output a predetermined voice or sound from the audio output unit 160, or may show a highlight or a pop-up on the display unit 170, to announce the presence of the new interlocutor candidate Me and/or Mf.
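
The repeated detection described here can be sketched as a distance test followed by a comparison with the previous candidate set. In the Python example below, the 2.0 m threshold, the coordinate values, and the function names are all illustrative assumptions; the disclosure only speaks of "a predetermined distance".

```python
import math


def detect_candidates(positions, max_distance_m):
    """IDs of people within max_distance_m of the device at the origin (cf. step S13)."""
    return {
        person for person, (x, y) in positions.items()
        if math.hypot(x, y) <= max_distance_m
    }


def new_candidates(previous, current):
    """Candidates that appeared since the last run, in a stable order."""
    return sorted(current - previous)


THRESHOLD_M = 2.0  # assumed value for the "predetermined distance"

before = detect_candidates(
    {"Mc": (1.0, 0.5), "Md": (1.2, -0.4), "Me": (5.0, 0.0), "Mf": (6.0, 1.0)},
    THRESHOLD_M,
)
after = detect_candidates(
    {"Mc": (1.0, 0.5), "Md": (1.2, -0.4), "Me": (1.5, 0.0), "Mf": (6.0, 1.0)},
    THRESHOLD_M,
)
to_notify = new_candidates(before, after)
```

The resulting list is what the second electronic device would announce to its user, for example through a sound or a pop-up.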

According to the system of one embodiment, in communication between an electronic device used remotely and an electronic device used locally, the user can choose the interlocutors with whom he or she intends to communicate. In general, humans perform turn-taking almost unconsciously, based on cultural background and the like. Also, in general, humans can concentrate their conscious resources on turn-taking with their interlocutors by mutually recognizing the one or more interlocutors with whom they share turn-taking. Without a human element, however, it is difficult to distinguish whether a detected person is an interlocutor. As a result, physical and/or processing resources may be spent on a person whom a human would not regard as an interlocutor, because that person is judged to be a speaker. Presenting the user with the results of processing performed on a person who is not an interlocutor may then hinder turn-taking between the user and the interlocutor. In contrast, according to the system of the embodiment described above, the user selects interlocutors from the interlocutor candidates presented by the system, so that physical and/or processing resources can be concentrated only on the persons the user recognizes as interlocutors. The system of one embodiment therefore allows communication to proceed smoothly.

Although the embodiments of the present disclosure have been described based on the drawings and examples, it should be noted that those skilled in the art can easily make various variations or modifications based on the present disclosure. It should therefore be noted that these variations or modifications are included in the scope of the present disclosure. For example, the functions included in each component or each step can be rearranged so long as no logical inconsistency arises, and a plurality of components or steps can be combined into one or divided. Although the embodiments of the present disclosure have been described mainly in terms of devices, the embodiments can also be realized as a method including the steps executed by each component of a device. The embodiments of the present disclosure can also be realized as a method or a program executed by a processor or the like included in a device, or as a storage medium or recording medium on which the program is recorded. It should be understood that these are also included in the scope of the present disclosure.

The above-described embodiments are not limited to implementation as a system. For example, the above-described embodiments may be implemented as a method of controlling the system, or as a program executed in the system. The above-described embodiments may also be implemented as a device such as at least one of the first electronic device 1, the second electronic device 100, and the third electronic device 300, or as a method of controlling such a device. Furthermore, the above-described embodiments may be implemented as a program executed by a device such as at least one of the first electronic device 1, the second electronic device 100, and the third electronic device 300, or as a storage medium or recording medium on which the program is recorded.

In the operation shown in FIG. 6, after step S14, the second electronic device 100 selected an interlocutor from the interlocutor candidates (step S15). An example was also described in which, when selecting an interlocutor from the interlocutor candidates, the second electronic device 100 prompts the user to choose among "add", "remove", and "reserve" for an interlocutor. Alternatively, as shown in FIG. 11, after step S14, the second electronic device 100 may select a "non-interlocutor" from the interlocutor candidates (step S21) and transmit information indicating the "non-interlocutor" to the third electronic device 300 (step S22). In this case, in step S23, the third electronic device 300 may identify the "non-interlocutor" among the interlocutor candidates. Thereafter, the predetermined process related to the dialogue with the interlocutor may be executed for the interlocutor without being executed for the "non-interlocutor".
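
The variation of FIG. 11 amounts to selecting by exclusion. The short Python sketch below illustrates one reading, in which every candidate not marked as a "non-interlocutor" is treated as an interlocutor; the function name `split_by_exclusion` is hypothetical, and the disclosure does not state that the remaining candidates must all be treated this way.

```python
def split_by_exclusion(candidates, non_interlocutors):
    """Split candidates into (interlocutors, excluded) from an exclusion set.

    Assumption for this sketch: everyone not explicitly excluded in step S21
    is handled as an interlocutor in the subsequent processing.
    """
    interlocutors = [c for c in candidates if c not in non_interlocutors]
    excluded = [c for c in candidates if c in non_interlocutors]
    return interlocutors, excluded


interlocutors, excluded = split_by_exclusion(["Ma", "Mb", "Mc", "Md"], {"Ma", "Mb"})
```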

In the above-described embodiment, a configuration was described in which the third electronic device 300 relays the exchange between the first electronic device 1 and the second electronic device 100. However, at least one of the first electronic device 1 and the second electronic device 100 may execute some or all of the functions executed by the third electronic device 300. For example, the detection unit 14 of the first electronic device 1 may detect at least one interlocutor candidate based on information about the at least one interlocutor candidate. The control unit 10 of the first electronic device 1 may transmit information about the at least one interlocutor candidate detected by the detection unit 14 to another electronic device (for example, the second electronic device 100). The identification unit 18 of the first electronic device 1 may identify, among the at least one interlocutor candidate detected by the detection unit 14, the interlocutor selected by the other electronic device (for example, the second electronic device 100) as the at least one interlocutor. The control unit 10 of the first electronic device 1 may transmit information about the interlocutor identified by the identification unit 18 as the at least one interlocutor to the other electronic device (for example, the second electronic device 100). The control unit 10 of the first electronic device 1 may execute a predetermined process related to the dialogue with the interlocutor identified by the identification unit 18 as the at least one interlocutor. Furthermore, the detection unit 14 of the first electronic device 1 may detect the at least one interlocutor candidate based on information about the line of sight of the at least one interlocutor candidate acquired by the line-of-sight information acquisition unit 92.

The selection unit 116 of the second electronic device 100 may select at least one non-interlocutor from the at least one interlocutor candidate based on input by the user. In this case, the control unit 110 of the second electronic device 100 may refrain from executing the predetermined process for the at least one non-interlocutor. The second electronic device 100 may also include a line-of-sight information acquisition unit 192 that acquires information about the line of sight of the user of the second electronic device 100. In this case, the selection unit 116 of the second electronic device 100 may select the at least one interlocutor based on the information about the user's line of sight.

The first electronic device 1 according to the above-described embodiment has been described as being operated by the user of the second electronic device 100. For example, the system according to the above-described embodiment has been described for the case in which the user of the second electronic device 100 selects an interlocutor from the interlocutor candidates. However, the first electronic device 1 may operate at least partially autonomously, without some or all of the operations by the user of the second electronic device 100. For example, in one embodiment, the first electronic device 1 (or the third electronic device 300) may autonomously select an interlocutor from the interlocutor candidates, for example according to a predetermined algorithm.
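
The disclosure does not specify the "predetermined algorithm". Purely as an illustration, the Python sketch below assumes a simple rule that combines two cues mentioned elsewhere in the description: a candidate is selected if they are within a distance threshold and their line of sight is directed at the device. The `Candidate` class, its fields, and the 2.0 m default are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """Hypothetical per-candidate observations available to the device."""
    person_id: str
    distance_m: float
    looking_at_device: bool


def select_autonomously(candidates, max_distance_m=2.0):
    """Assumed rule: select candidates who are close and looking at the device."""
    return [
        c.person_id
        for c in candidates
        if c.looking_at_device and c.distance_m <= max_distance_m
    ]


observed = [
    Candidate("Ma", 1.8, False),
    Candidate("Mb", 3.5, True),
    Candidate("Mc", 1.0, True),
    Candidate("Md", 1.4, True),
]
auto_selected = select_autonomously(observed)
```

Any other rule, such as one based on speech activity, could be substituted without changing how the selected interlocutors are handled afterwards.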

1 First electronic device
10 Control unit
12 Acquisition unit
14 Detection unit
16 Selection unit
18 Identification unit
20 Memory unit
30 Communication unit
40 Imaging unit
50 Audio input unit
60 Audio output unit
70 Display unit
80 Driving unit
90 Input unit
92 Line-of-sight information acquisition unit
100 Second electronic device
110 Control unit
112 Acquisition unit
114 Detection unit
116 Selection unit
118 Identification unit
120 Memory unit
130 Communication unit
140 Imaging unit
150 Audio input unit
160 Audio output unit
170 Display unit
190 Input unit
192 Line-of-sight information acquisition unit
300 Third electronic device
310 Control unit
312 Acquisition unit
314 Detection unit
316 Selection unit
318 Identification unit
320 Memory unit
330 Communication unit
N Network

Claims

An electronic device comprising:
an acquisition unit that acquires information about at least one interlocutor candidate;
a detection unit that detects the at least one interlocutor candidate based on the information about the at least one interlocutor candidate; and
a control unit that executes a predetermined process for at least one interlocutor selected from the at least one interlocutor candidate, and does not execute the predetermined process for an interlocutor candidate who is not selected as the interlocutor.

The electronic device according to claim 1, wherein the control unit executes the predetermined process for an interlocutor selected by another electronic device as the at least one interlocutor.

The electronic device according to claim 1, wherein the detection unit detects a person within a predetermined distance range from the electronic device as the interlocutor candidate.

The electronic device according to claim 1, further comprising a line-of-sight information acquisition unit that acquires information about a line of sight of the at least one interlocutor candidate,
wherein the detection unit detects the at least one interlocutor candidate based on the information about the line of sight of the at least one interlocutor candidate acquired by the line-of-sight information acquisition unit.

The electronic device according to claim 1, further comprising a structure having a shape of a face of a doll or a robot,
wherein the control unit performs, as the predetermined process, an operation of directing the face or a line of sight of the structure toward the interlocutor and not directing the face or the line of sight of the structure toward an interlocutor candidate who is not selected as the interlocutor.

An electronic device comprising:
an acquisition unit that acquires information about at least one interlocutor candidate;
a selection unit that selects at least one interlocutor from the at least one interlocutor candidate; and
a control unit that executes a predetermined process for the at least one interlocutor and does not execute the predetermined process for an interlocutor candidate who is not selected as the interlocutor.

The electronic device according to claim 6, wherein the selection unit selects the at least one interlocutor based on an input by a user.

The electronic device according to claim 6, wherein the selection unit selects at least one non-interlocutor from the at least one interlocutor candidate based on an input by a user, and
the control unit does not execute the predetermined process for the at least one non-interlocutor.

The electronic device according to claim 6, further comprising a line-of-sight information acquisition unit that acquires information about a line of sight of a user of the electronic device,
wherein the selection unit selects the at least one interlocutor based on the information about the line of sight of the user.

The electronic device according to claim 6, further comprising a display unit that displays information about the at least one interlocutor candidate,
wherein the control unit executes, as the predetermined process, a process of displaying information about the at least one interlocutor in a manner different from information about an interlocutor candidate who is not selected as the interlocutor.

The electronic device according to claim 6, further comprising an audio output unit that outputs information about the at least one interlocutor candidate as audio,
wherein the control unit executes, as the predetermined process, a process of outputting information about the at least one interlocutor as audio different from that used for information about an interlocutor candidate who is not selected as the interlocutor.

The electronic device according to claim 6, wherein the control unit executes, as the predetermined process, a process of controlling another electronic device so that the other electronic device performs, toward the at least one interlocutor, an operation different from the operation performed toward an interlocutor candidate who is not selected as the interlocutor.

An electronic device comprising:
an acquisition unit that acquires information about at least one interlocutor candidate from a first electronic device;
a detection unit that detects the at least one interlocutor candidate based on the information about the at least one interlocutor candidate; and
a control unit that controls at least one of the first electronic device and a second electronic device so as to execute a predetermined process for at least one interlocutor selected by the second electronic device from the at least one interlocutor candidate, and not to execute the predetermined process for an interlocutor candidate who is not selected as the interlocutor.

A program that causes a computer to execute:
acquiring information about at least one interlocutor candidate;
detecting the at least one interlocutor candidate based on the information about the at least one interlocutor candidate; and
executing a predetermined process for at least one interlocutor selected from the at least one interlocutor candidate, and not executing the predetermined process for an interlocutor candidate who is not selected as the interlocutor.