CN1983308A

CN1983308A - Learning apparatus and method

Info

Publication number: CN1983308A
Application number: CN 200610063917
Authority: CN
Inventors: 泽田务
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2005-10-31
Filing date: 2006-10-31
Publication date: 2007-06-20

Abstract

A learning device for establishing a network structure of a Bayesian network based on learning data. In Bayesian networks, the relationship between cause and effect among multiple nodes is represented by a directed graph. The learning device includes a storage part that stores learning data and a learning part that builds a network structure based on the learning data. The learning section prepares an initial individual group formed by individuals each having a genotype in which an order among nodes and a relationship between a cause and an effect are specified, repeatedly performs crossover processing and/or mutation processing in the initial individual group based on a genetic algorithm, Calculate the estimated value of each individual based on the learning data, find the best one among the individuals, and use the phenotype of the best individual as the network structure.

Description

Learning equipment and methods

技术领域technical field

本发明涉及一种基于通过学习获得的数据(在下文中称为“学习数据”)建立贝叶斯(Bayesian)网络的网络结构的学习设备以及建立该网络结构的方法。The present invention relates to a learning device that builds a network structure of a Bayesian network based on data obtained through learning (hereinafter referred to as "learning data") and a method of building the network structure.

背景技术Background technique

最近几年中，信息处理技术应用的范围已经扩展，并且能够根据各种环境和各种用户操作的信息处理机制已经变得重要。就是说，具有不确定性的处理对象(例如，可能预先没有被假定的对象、或可能没有被完全观测的对象)已经变得日益重要。因此，可以要求即使在仅不确定信息可用的情况下处理智能信息来尽可能精确地理解环境并且执行适当处理的机制。In recent years, the range of application of information processing technology has expanded, and an information processing mechanism capable of operating according to various environments and various users has become important. That is, dealing with objects with uncertainty (eg, objects that may not have been assumed in advance, or objects that may not have been fully observed) has become increasingly important. Therefore, mechanisms that process intelligent information to understand the environment as precisely as possible and perform appropriate processing may be required even when only uncertain information is available.

由于这些需要，使用网络结构描述感兴趣对象、并由观测到的事件概率性地预测将要得知的对象的概率模型已经引起了注意。借助有向图在表示变量的节点之间指示起因和结果之间关系(联系)的贝叶斯网络是一种已知的典型概率模型。Because of these needs, probabilistic models that describe objects of interest using network structures and probabilistically predict objects to be learned from observed events have attracted attention. A Bayesian network in which a relationship (connection) between a cause and an effect is indicated between nodes representing variables by means of a directed graph is a known typical probability model.

[非专利文献1]Cooper，G，和Herskovits，E.，“A Bayesian method for theinduction of probabilistic networks from Data(一种从数据归纳概率网络的贝叶斯方法)”，Machine Learning(机器学习)，Vol.9，pp.309-347，1992。[Non-Patent Document 1] Cooper, G, and Herskovits, E., "A Bayesian method for the induction of probabilistic networks from Data (a Bayesian method for induction of probabilistic networks from data)", Machine Learning, Vol.9, pp.309-347, 1992.

[非专利文献2]Hongjun Zhou，Shigeyuki Sakane，“Sensor Planning forMobile Robot Localization Using Structure Learing and Inference of BayesianNetwork(使用贝叶斯网络的结构学习和推理的移动机器人定位的传感器设计)”，Journal of the Japan Robotics Society(日本机器人学会学报)，Vol.22，No.2，pp.245-255，2004。[Non-Patent Document 2] Hongjun Zhou, Shigeyuki Sakane, "Sensor Planning for Mobile Robot Localization Using Structure Learning and Inference of BayesianNetwork (Sensor Design for Mobile Robot Localization Using Structure Learning and Inference of Bayesian Network)", Journal of the Japan Robotics Society (Journal of Robotics Society of Japan), Vol.22, No.2, pp.245-255, 2004.

发明内容Contents of the invention

为了将贝叶斯网络应用于处理真实对象中，建立合适的模型是重要的。In order to apply Bayesian networks to real objects, it is important to build appropriate models.

在实际应用的许多现有实施例中，通过利用在要处理的领域中精通的专家的知识和经验来建立模型。需要一种基于学习数据建立贝叶斯网络的网络结构的方法。然而，基于学习数据建立网络结构是一个NP-hard(nondeterministic polynomial-hard，非确定性的多项式地难)问题。而且，可能需要确保网络结构的有向非循环性。因此，不容易建立最佳网络结构。In many existing embodiments of practical applications, the models are built by utilizing the knowledge and experience of experts versed in the field to be dealt with. What is needed is a method for building the network structure of a Bayesian network based on learning data. However, building a network structure based on learning data is an NP-hard (nondeterministic polynomial-hard, nondeterministic polynomial-hard) problem. Also, it may be necessary to ensure directed acyclicity of the network structure. Therefore, it is not easy to establish an optimal network structure.

因此，为了实际建立一个网络结构，已经提出使用试探法的K2算法(参见非专利文献1)。K2算法包括步骤：1)选择对于每个节点都能是亲节点的受限制的候选；2)选择一个子节点，一个接一个地添加候选亲节点，并且建立网络结构；3)如果并且只有估计值变大时，采用该节点作为亲节点；以及4)如果没有任何其它能够作为亲节点添加的节点，或者如果不能通过节点的添加使估计值增加时，处理转向其它子节点。对每个子节点执行上述步骤1)到步骤4)。因此，能够建立准最佳网络结构。在上述步骤1)，因为以下原因，限制能变为每个节点的亲节点的候选。在预先设计节点之间的次序的情况下，限制寻找网络结构的范围，因此减少计算量。而且，确保网络结构的非循环性。Therefore, in order to actually build a network structure, a K2 algorithm using a heuristic has been proposed (see Non-Patent Document 1). The K2 algorithm includes steps: 1) Selecting restricted candidates that can be parent nodes for each node; 2) Selecting a child node, adding candidate parent nodes one by one, and building a network structure; 3) If and only estimating When the value becomes large, adopt this node as a parent node; and 4) If there is no other node that can be added as a parent node, or if the estimated value cannot be increased by the addition of a node, processing goes to other child nodes. Execute the above steps 1) to 4) for each child node. Therefore, a quasi-optimal network structure can be established. In the above step 1), candidates that can become parent nodes of each node are restricted for the following reason. In the case of pre-designing the order among the nodes, the scope of finding the network structure is limited, thus reducing the amount of computation. Moreover, the acyclicity of the network structure is ensured.

虽然这个K2算法能够在实际中建立网络结构，但也存在限制，即节点之间的次序可能基于设计者的现有知识而预先设计。Although this K2 algorithm is able to build a network structure in practice, there is also a limitation that the order between nodes may be pre-designed based on the designer's existing knowledge.

另一方面，也已经提出一种在使用遗传算法确定节点之间的次序后使用K2算法确定节点之间的连接的方法(参见非专利文献2)。On the other hand, a method of determining connections between nodes using a K2 algorithm after determining the order between nodes using a genetic algorithm has also been proposed (see Non-Patent Document 2).

然而，以这些相关技术算法，根据设计者设计的次序，或者根据使用遗传算法确定的次序，通过以自下而上的方式确定节点之间的连接来建立网络结构。因此，这些算法不适用于网络结构的附加学习。另外，除了在要处理的领域中精通的专家之外，许多人也具有一些关于连接的知识。以相关技术算法，在网络结构中反映关于连接的现有知识已经是困难的。However, with these related art algorithms, a network structure is established by determining connections between nodes in a bottom-up manner according to an order designed by a designer, or according to an order determined using a genetic algorithm. Therefore, these algorithms are not suitable for additive learning of network structures. In addition, many people have some knowledge about connections in addition to being well-versed experts in the field to be dealt with. With related art algorithms, it has been difficult to reflect existing knowledge about connections in the network structure.

考虑到在相关技术方面的这样的实际情况，当存在NP-hard问题时，希望提供一种能够基于学习数据建立贝叶斯网络的网络结构(即，次序和连接)的学习设备和方法，该学习设备和方法能够反映关于网络结构中次序和连接的全部或者某些知识，并且允许网络结构的附加学习。In view of such actual conditions in related technologies, when there is an NP-hard problem, it is desirable to provide a learning device and method capable of establishing a network structure (that is, order and connection) of a Bayesian network based on learning data, which The learning device and method can reflect all or some knowledge about the order and connections in the network structure and allow additional learning of the network structure.

根据本发明的一个实施例，学习设备基于学习数据建立贝叶斯网络的网络结构，在贝叶斯网络中以有向图形式代表多个节点之间的起因和结果之间的关系。学习设备包括(a)存储学习数据的存储装置和(b)用于基于学习数据建立网络结构的学习装置。学习装置准备由每个均具有基因型的个体组成的初期个体群，在该基因型中规定了节点之间的次序以及起因和结果之间的关系，基于遗传算法在初期个体群上重复地执行交叉和/或突变，基于学习数据为每个个体计算估计值，寻找最佳个体，并将最佳个体的表现型(phenotype)作为前述网络结构。According to an embodiment of the present invention, the learning device establishes the network structure of the Bayesian network based on the learning data, and in the Bayesian network, the relationship between the cause and the result among multiple nodes is represented in the form of a directed graph. The learning device includes (a) storage means for storing learning data and (b) learning means for building a network structure based on the learning data. The learning device prepares an initial individual group consisting of individuals each having a genotype in which the order between nodes and the relationship between cause and effect are specified, and is repeatedly executed on the initial individual group based on a genetic algorithm Crossover and/or mutation, calculate an estimated value for each individual based on the learning data, find the best individual, and use the phenotype of the best individual as the aforementioned network structure.

在根据上述实施例的学习设备中，基因型能够根据规定的次序将安排在第一方向上的多个节点作为亲节点，将根据规定的次序安排在垂直于第一方向的第二方向上的多个节点作为子节点，并且通过在每个基因位置上的等位基因来规定相应节点之间的起因和结果关系的存在或不存在，在该基因位置(gene loci)上亲节点和子节点相互对应。In the learning device according to the above-mentioned embodiment, the genotype can have a plurality of nodes arranged in the first direction according to the prescribed order as parent nodes, and a plurality of nodes arranged in the second direction perpendicular to the first direction according to the prescribed order can be used as parent nodes. Multiple nodes act as child nodes, and the presence or absence of causal and consequential relationships between corresponding nodes is specified by alleles at each gene loci where parent and child nodes interact with each other. correspond.

本发明的另一个实施例是建立贝叶斯网络的网络结构的学习方法，其中贝叶斯网络以有向图形式表示多个节点之间的起因和结果之间关系。此方法开始于准备由每个均具有基因型的个体组成的初期个体群，在该基因型中规定了节点之间的次序以及起因和结果之间的关系。基于遗传算法重复地执行初期个体群的交叉和/或突变。基于学习数据为每个个体计算估计值，并寻找最佳个体。将最佳个体的表现型作为前述网络结构。Another embodiment of the present invention is a learning method for establishing a network structure of a Bayesian network, wherein the Bayesian network represents the relationship between causes and results among multiple nodes in the form of a directed graph. The method begins by preparing an initial population of individuals each having a genotype in which the order between nodes and the relationship between cause and effect are specified. Crossover and/or mutation of the initial population of individuals is repeatedly performed based on a genetic algorithm. Estimates are calculated for each individual based on the learning data and the best individual is found. The phenotype of the best individual is used as the aforementioned network structure.

根据本发明的实施例的学习设备和方法，当存在NP-hard问题时能够有效地建立准最佳网络结构。而且，在初期个体群中能够反映出设计者关于网络结构(次序和连接)的全部或者一些知识。此外，使能网络结构的附加学习。According to the learning device and method of the embodiments of the present invention, a quasi-optimal network structure can be effectively established when there is an NP-hard problem. Moreover, all or some of the designer's knowledge about the network structure (order and connection) can be reflected in the initial individual group. Furthermore, additional learning of the network structure is enabled.

附图说明Description of drawings

图1为根据本发明的一个实施例的学习设备的原理图。FIG. 1 is a schematic diagram of a learning device according to an embodiment of the present invention.

图2是示出了当建立贝叶斯网络模型时所使用的一组学习数据条目的例子的表。FIG. 2 is a table showing an example of a set of learning data items used when building a Bayesian network model.

图3A和3B是示出了一个二维基因型及其基因型的例子的示意图。3A and 3B are diagrams showing a two-dimensional genotype and an example of its genotype.

图4为说明使用遗传算法寻找最佳个体的过程的流程图。Fig. 4 is a flow chart illustrating the process of finding the best individual using the genetic algorithm.

图5A和5B为说明用于BD度量(metric)的计算公式的示意图。5A and 5B are diagrams illustrating calculation formulas for a BD metric.

图6A和6B是示出了在经典遗传算法中次序交叉和次序突变期间产生的致命(lethal)基因的例子的示意图。6A and 6B are schematic diagrams showing examples of lethal genes generated during sequential crossover and sequential mutation in a classical genetic algorithm.

图7A和7B是示出了在亲个体的次序相同时连接交叉的例子的示意图。7A and 7B are schematic diagrams showing an example of connecting crossovers when the order of parents is the same.

图8A和8B是示出了在亲个体的次序不同时次序交叉的例子的示意图。8A and 8B are diagrams showing an example of order crossing when the orders of parent individuals are different.

图9A和9B是示出了用于连接要突变的处理的例子的示意图。9A and 9B are schematic diagrams showing examples of processing for linking to be mutated.

图10A和10B是示出了用于次序突变的处理的例子的示意图。10A and 10B are schematic diagrams showing an example of processing for sequential mutation.

图11A、11B和11C是示出了在次序突变处理之后执行用于连接突变的处理的例子的示意图。11A , 11B and 11C are schematic diagrams showing an example in which processing for connection mutation is performed after sequential mutation processing.

图12A和12B是示出了作为次序交叉处理的结果、具有一些子节点的具有起因和结果关系的亲节点的数目超出上限的例子的示意图。12A and 12B are diagrams showing an example in which the number of parent nodes having a cause-and-effect relationship with some child nodes exceeds an upper limit as a result of order crossover processing.

图13A和13B是示出了调整基因以防止具有起因和结果关系的亲节点的数目超过上限的例子的示意图。13A and 13B are diagrams showing an example of adjusting genes to prevent the number of parent nodes having a cause-and-effect relationship from exceeding an upper limit.

图14示出了四个有向非循环图和在N′_ijk＝1条件下计算的估计值的示意图。Fig. 14 shows a schematic diagram of four directed acyclic graphs and estimated values calculated under the condition of N' _ijk =1.

图15A和15B是说明创建当计算图14的估计值时所用的学习数据的过程的示意图。15A and 15B are diagrams illustrating a process of creating learning data used when calculating the estimated value of FIG. 14 .

图16是示出了四个有向非循环图和通过一个实施例的过程计算的估计值的示意图。Figure 16 is a schematic diagram showing four directed acyclic graphs and estimates computed by the process of one embodiment.

图17为说明用于获得具体学习数据的过程的示意图。Fig. 17 is a schematic diagram illustrating a procedure for obtaining specific learning data.

图18为说明用于获得具体学习数据的过程的示意图。Fig. 18 is a schematic diagram illustrating a procedure for obtaining specific learning data.

图19为说明用于获得具体学习数据的过程的示意图。Fig. 19 is a schematic diagram illustrating a procedure for obtaining specific learning data.

图20为说明用于获得具体学习数据的过程的示意图。Fig. 20 is a schematic diagram illustrating a procedure for obtaining specific learning data.

图21是示出了基于学习数据通过K2算法获得的网络结构的示意图。FIG. 21 is a schematic diagram showing a network structure obtained by the K2 algorithm based on learning data.

图22示出了使用图21的网络结构作为初期结构进行附加学习时的第二十代网络结构的示意图。FIG. 22 shows a schematic diagram of the network structure of the twentieth generation when additional learning is performed using the network structure of FIG. 21 as the initial structure.

图23示出了使用图21的网络结构作为初期结构进行附加学习时的第四十代网络结构的示意图。FIG. 23 shows a schematic diagram of the fortieth generation network structure when additional learning is performed using the network structure in FIG. 21 as the initial structure.

图24示出了使用图21的网络结构作为初期结构进行附加学习时的第六十代网络结构的示意图。FIG. 24 shows a schematic diagram of the network structure of the 60th generation when additional learning is performed using the network structure of FIG. 21 as the initial structure.

图25示出了使用图21的网络结构作为初期结构进行附加学习时的第八十代网络结构的示意图。FIG. 25 shows a schematic diagram of the eightieth generation network structure when additional learning is performed using the network structure in FIG. 21 as the initial structure.

具体实施方式Detailed ways

在下文中将结合附图具体说明本发明的具体实施例。在一个实施例中，本发明应用于一种基于关于学习的数据(学习数据)来构建贝叶斯网络的网络结构的学习设备。Specific embodiments of the present invention will be described in detail below with reference to the accompanying drawings. In one embodiment, the present invention is applied to a learning device that constructs a network structure of a Bayesian network based on data on learning (learning data).

首先，根据本发明实施例的学习设备的结构在图1中示意出来。如图1中所示，根据本实施例的学习设备1包括学习数据存储部分10、学习部分11和模型存储部分12。First, the structure of a learning device according to an embodiment of the present invention is schematically shown in FIG. 1 . As shown in FIG. 1 , the learning device 1 according to the present embodiment includes a learning data storage section 10 , a learning section 11 and a model storage section 12 .

在建立贝叶斯网络模型时使用的学习数据被存储到学习数据存储部分10。离散数据的全集的一个实施例在图2中示出，其中具有从X₀到X₄的五个节点。在图2中，以X_i ^jk的形式表示学习数据的每个条目，其中i表示节点ID，j表示事件ID(即，从首先获得的学习数据条目计数的数据条目号)，以及k表示状态ID(即，在每个节点的状态)。那就是说，X_i ^jk代表在节点X_i获得的学习数据条目的状态，并且第j个数据条目由状态ID＝k表示。Learning data used in building the Bayesian network model is stored in the learning data storage section 10 . One embodiment of a corpus of discrete data is shown in Figure 2, with five nodes from _X0 to _X4 . In Fig. 2, each entry of learning data is represented in the form _Xijk , where i denotes the node ID, j denotes the event ID (i.e., the data entry number counted from the first obtained learning ^data entry), and k denotes the state ID (i.e., in the state of each node). That is, X _i ^jk represents the state of the learning data item obtained at node _Xi , and the j-th data item is represented by state ID=k.

基于存储于学习数据存储部分10中的学习数据，学习部分11构建贝叶斯网络模型。具体地，学习部分11通过使用遗传算法来同时确定在构成贝叶斯网络的网络结构的节点间的次序以及连接二者。当存在NP-hard问题时，遗传算法的使用使得有效地构建准最佳网络结构成为可能。通过学习部分11建立的模型存储于模型存储部分12中。Based on the learning data stored in the learning data storage section 10, the learning section 11 constructs a Bayesian network model. Specifically, the learning section 11 simultaneously determines the order among the nodes constituting the network structure of the Bayesian network and connects both by using a genetic algorithm. When there are NP-hard problems, the use of genetic algorithms makes it possible to efficiently construct quasi-optimal network structures. The model established by the learning section 11 is stored in the model storage section 12 .

由学习部分11用于构建网络结构的处理下面具体说明。接下来，为了简单，假定具有5个节点，从X₀到X₄。The processing by the learning section 11 for constructing the network structure will be specifically described below. Next, for simplicity, assume that there are 5 nodes, from X ₀ to X ₄ .

根据本发明，根据如图3A中所示的二维基因型，学习部分11表示在贝叶斯网络的网络结构即遗传算法中使用的个体。在图3A中，在行和列中的X₀，X₁，X₂，X₃和X₄表示节点之间的次序。在行和列中的次序一直是同样的。位于对角线部分以上的三角形区域中的基因位置具有等位基因“0”和“1”，该等位基因表示从亲节点到子节点的连接。“0”表示在亲节点和子节点之间没有起因和结果关系。“1”表示在亲节点和子节点之间存在起因和结果关系。每个对角线部分相应于一个自循环。位于对角线部分以下的三角形区域中的基因位置具有等位基因“0”和“1”，该等位基因表示从子节点到亲节点的连接。为保证网络结构的非循环性，假定位于对角线部分以下的基因没有特性(trait)的表现。因此，图3A中所示的具有二维基因型的个体具有如图3B中所示的表现型。According to the present invention, the learning section 11 represents individuals used in the network structure of the Bayesian network, that is, the genetic algorithm, based on the two-dimensional genotype as shown in FIG. 3A. In FIG. 3A, X ₀ , X ₁ , X ₂ , X ₃ and X ₄ in rows and columns indicate the order among nodes. The order in rows and columns is always the same. Gene positions located in the triangular region above the diagonal part have alleles '0' and '1', which represent connections from parent nodes to child nodes. "0" indicates that there is no cause and effect relationship between parent and child nodes. "1" indicates that a cause and effect relationship exists between a parent node and a child node. Each diagonal section corresponds to a self-loop. Gene positions located in the triangular region below the diagonal part have alleles "0" and "1", which represent connections from child nodes to parent nodes. In order to ensure the acyclicity of the network structure, it is assumed that the genes located below the diagonal part have no trait expression. Thus, an individual with a two-dimensional genotype as shown in Figure 3A has a phenotype as shown in Figure 3B.

学习部分11把许多具有这样的二维基因型的个体作为初期个体群。使用遗传算法在初期个体群中寻找最佳个体。个体的表现型被作为准最佳网络结构。The learning section 11 regards many individuals having such two-dimensional genotypes as an initial individual group. Use genetic algorithm to find the best individual in the initial individual group. Individual phenotypes were taken as quasi-optimal network structures.

在图4的流程图中将说明使用遗传算法寻找最佳个体的过程。The process of finding the best individual using the genetic algorithm will be described in the flowchart of FIG. 4 .

首先，在步骤S1，学习部分11创建初期个体群。在这个时候，学习部分11可以随机创建初期个体群。在具有关于网络结构(次序和连接)知识的设计者的情况下，通过将表现型转换为二维基因型并执行突变处理来创建初期个体群。后面的方法允许设计者的关于网络结构的知识全面或者部分地反映在初期个体群中。而且，学习部分11可以从由学习结果表示的个体中创建初期个体群。在这种情况下，使能网络结构的附加学习。First, at step S1, the learning section 11 creates an initial group of individuals. At this time, the learning section 11 may randomly create an initial individual group. In the case of a designer with knowledge about the network structure (order and connection), an initial population of individuals is created by converting phenotypes into two-dimensional genotypes and performing mutation processing. The latter approach allows the designer's knowledge about the network structure to be fully or partially reflected in the initial population of individuals. Also, the learning section 11 may create an initial individual group from the individuals represented by the learning results. In this case, additional learning of the network structure is enabled.

然后，在步骤S2，学习部分11基于存储于学习数据存储部分10中的学习数据来计算每个个体的估计值(遗传算法中的拟合)。具体地，根据下面的公式(1)计算出BD度量(P(D|B_s))，并且计算的值的对数作为估计值。Then, at step S2 , the learning section 11 calculates an estimated value for each individual based on the learning data stored in the learning data storage section 10 (fitting in the genetic algorithm). Specifically, the BD measure (P(D|B _s )) was calculated according to the following formula (1), and the logarithm of the calculated value was taken as an estimated value.

$P P ((D D. | | {B B}_{s the s})) = = {Π Π}_{i i = = 00}^{n no - - 11} {Π Π}_{j j = = 00}^{{q q}_{i i} - - 11} \frac{Γ Γ (({N N}^{' '}_{ij ij}))}{Γ Γ (({N N}^{' '}_{ij ij} + + {N N}_{ij ij}))} {Π Π}_{k k = = 00}^{{r r}_{i i} - - 11} \frac{Γ Γ (({N N}^{' '}_{ijk ijk} + + {N N}_{ijk ijk}))}{Γ Γ (({N N}^{' '}_{ijk ijk}))} - - - - - - ((11))$

在这个公式(1)中，D为存储在学习数据存储部分10中的学习数据，B_s表示在贝叶斯网络的网络结构即遗传算法中使用的个体，P(D|B_s)为在B_s条件下D的概率，Γ为伽马(gamma)函数并且由Γ(n)＝(n-1)！给出。从(n-1)！＝n！/n，认为0！＝1！/1＝1。因此，为了方便，引入关系0！＝1。而且，如图5A中所示，令n为节点的数目。令X_i为第i个节点。令v_ik为X_i能够假定的第k个值。令r_i为X_i能够假定的值(状态的数目)的数目。如图5B中所示，令Π_i为X_i的亲节点模式(pattern)。令W_ij为Π_i的第j个模式(能够假定的值)。q_i为Π_i的模式的数目。N_ijk为学习数据D中数据条目的数目，该学习数据D使X_i具有值v_ik并且Π_i具有值w_ij。根据下面公式(2)计算出N_ij。N′_ijk和N′_ij与设计者的先前知识有关并且能够被以类似于N_ijk和N_ij的方式处理。它们的具体内容将在下文中叙述。In this formula (1), D is the learning data stored in the learning data storage part 10, B _s represents the individual used in the network structure of the Bayesian network, that is, the genetic algorithm, and P(D|B _s ) is the The probability of D under the condition of B _s , Γ is a gamma (gamma) function and by Γ(n)=(n-1)! give. From (n-1)! =n! /n, think 0! = 1! /1=1. Therefore, for convenience, the relation 0! =1. Also, as shown in FIG. 5A, let n be the number of nodes. Let _Xi be the ith node. Let v _ik be the kth value that _Xi can assume. Let _ri be the number of values (number of states) that _Xi can assume. As shown in FIG. 5B, let Π _i be the parent pattern of X _i . Let W _ij be the jth pattern (value that can be assumed) of Π _i . q _i is the number of patterns of Π _i . N _ijk is the number of data _entries in the learning data D such that Xi has the value vi _ik and Π _i has the value w _ij . N _ij is calculated according to the following formula (2). N' _ijk and N' _ij are related to the prior knowledge of the designer and can be handled in a similar manner to N _ijk and N _ij . Their specific contents will be described below.

${N N}_{ij ij} = = {Σ Σ}_{k k = = 00}^{{r r}_{i i} - - 11} {N N}_{ijk ijk} - - - - - - ((22))$

实际上，存储于学习数据存储部分10中的学习数据可能具有丢失数据条目或者可能是连续数量而不是离散数据。例如，在Richard E.Neapolitan所著“LEARNING BAYESIAN NETWORKS(学习贝叶斯网络)”，ISBN0-13-012534-2中描述了一种应对丢失数据条目或连续数量的方法。Actually, the learning data stored in the learning data storage section 10 may have missing data entries or may be continuous quantities instead of discrete data. For example, in Richard E. Neapolitan, "LEARNING BAYESIAN NETWORKS", ISBN 0-13-012534-2, describes a method for dealing with missing data entries or consecutive quantities.

接下来，在步骤S3，学习部分11作出关于结束条件是否满足的决定。Next, at step S3, the learning section 11 makes a decision as to whether or not the end condition is satisfied.

实际上，结束条件的一个例子是代(generation)的数目已经超过了阈值。另外一个例子是估计值变化率减小到阈值以下。如果结束条件不满足，则控制进行到S4。如果结束条件是满足的，则选择产生最高估计值的个体，并且处理结束。In fact, an example of an end condition is that the number of generations has exceeded a threshold. Another example is when the rate of change of the estimated value decreases below a threshold. If the end condition is not satisfied, control goes to S4. If the end condition is satisfied, the individual producing the highest estimate is selected and the process ends.

接下来，在步骤S4，学习部分11基于估计值从现在的个体群中选择下一个体群。就是说，当许可重叠时，学习部分基于估计值从现在的个体群中选择希望的个体的数目。诸如旋转轮选择(roulette wheel selection)、联赛选择(tournament selection)、和精华保留(elite reservation)的在遗传算法中通常使用的方法能被用作选择方法。然而，作为估计值的BD度量的对数为负值，并且因此难以直接利用如旋转轮选择的对估计值执行具有概率比例的选择的方法。因此，使用玻尔兹曼分布(Boltzmann distribution将估计值)预先转换为正值。Next, at step S4, the learning section 11 selects the next group of individuals from the current group of individuals based on the estimated value. That is, when the permits overlap, the learning section selects the desired number of individuals from the current individual population based on the estimated value. Methods generally used in genetic algorithms such as roulette wheel selection, tournament selection, and elite reservation can be used as the selection method. However, the logarithm of the BD metric as an estimated value is negative, and thus it is difficult to directly utilize a method of performing a selection with a probability ratio on an estimated value such as a rotary wheel selection. Therefore, the estimated values are pre-converted to positive values using a Boltzmann distribution.

接下来，在步骤S5和S6，学习部分11根据希望的交叉概率来执行包含在现在的个体群中的个体交叉的处理。而且，学习部分根据希望的突变率执行突变处理。在这个交叉处理中，从两个亲个体创建两个子个体。在突变处理中，从一个亲个体创建一个子个体。同时，所创建的子个体可以取代亲个体。可以使子个体和亲个体共存。Next, in steps S5 and S6, the learning section 11 executes a process of intersecting individuals contained in the current individual group in accordance with a desired intersecting probability. Also, the learning section performs mutation processing according to a desired mutation rate. In this intersection process, two child individuals are created from two parent individuals. In mutation processing, a child individual is created from a parent individual. At the same time, the created child individual can replace the parent individual. It is possible to make child individuals and parent individuals coexist.

特别地，在次序交叉的处理和次序突变的处理中，使用遗传算法的经典过程情况下，如图6A和6B所示，很容易产生致命基因。例如，如图6A所示，在具有次序X₀、X₁、X₂、X₃和X₄的个体与具有次序X₃、X₁、X₀、X₄和X₂的个体使用在第三和第四节点之间的点作为交叉点而交叉的情况下，在同一个体中存在具有相同节点ID的节点，导致致命基因的产生。而且，如图6B所示，在执行在具有次序X₀、X₁、X₂、X₃和X₄的个体的位置X₂处产生突变的处理从而产生X₄的情况下，在同一个体中出现了具有相同节点ID的节点。这样引起致命基因。在以这种方式很容易产生致命基因的情况下，学习效率低。因此，防止致命基因产生的框架是必须的。In particular, in the processing of sequential crossover and processing of sequential mutation, in the case of the classical process using the genetic algorithm, as shown in FIGS. 6A and 6B , it is easy to generate a lethal gene. For example, as shown in FIG. 6A , when individuals with the order X ₀ , X ₁ , X ₂ , X ₃ , and X ₄ and individuals with the order X ₃ , X ₁ , X ₀ , X ₄ , and X ₂ are used in the third In the case where a point between the node and the fourth node is intersected as an intersection point, a node having the same node ID exists in the same individual, resulting in the generation of a lethal gene. Moreover, as shown in FIG. 6B , in the case of performing a process of generating a mutation at position X ₂ of an individual having the order X ₀ , X ₁ , X ₂ , X ₃ and X ₄ to generate X ₄ , in the same individual A node with the same node ID appeared. This causes the lethal gene. In the case where lethal genes are easily produced in this way, the learning efficiency is low. Therefore, a framework to prevent the production of lethal genes is necessary.

当使用遗传算法建立贝叶斯网络的网络结构时，次序交叉或次序突变本质上等同于旅行推销员(travelling salesman)问题，并且已经提出了各种方法(见，由P.Larranaga、C.Kuijpers、R.Murga和Y.Yurramendi所著“以遗传算法寻找最佳次序来学习贝叶斯网络结构”，IEEE Transactions on Systems，Man and Cybemetics(IEEE系统、人、和控制论学报)，26(4)，pp.487-493，1996)。When using genetic algorithms to build the network structure of Bayesian networks, order crossover or order mutation is essentially equivalent to the traveling salesman problem, and various methods have been proposed (see, by P.Larranaga, C.Kuijpers , R.Murga and Y.Yurramendi, "Learning Bayesian Network Structure Using Genetic Algorithms to Find the Best Order", IEEE Transactions on Systems, Man and Cybemetics (IEEE Systems, People, and Cybernetics Transactions), 26(4 ), pp.487-493, 1996).

接下来，首先叙述当发生具体例子时在步骤S5的交叉处理。Next, first, the intersection processing at step S5 when a specific example occurs will be described.

在图7A和7B中示出了在亲个体具有相同次序的情况下交叉处理的例子。在这个交叉处理中，仅处理连接。如图7A所示，关于具有次序X₀、X₁、X₂、X3和X₄的两个亲个体，第三和第四节点之间的点作为交叉点。后面的基因进行交换。结果，获得了如图7B中所示的子个体。如从图7B中能够看到的，亲个体的连接遗传地传给子个体。An example of crossover processing where the parent individuals have the same order is shown in Figures 7A and 7B. In this cross processing, only joins are processed. As shown in FIG. 7A , with respect to two parent individuals having the order X ₀ , X ₁ , X ₂ , X3 and X ₄ , the point between the third and fourth nodes serves as the intersection point. Subsequent genes are exchanged. As a result, child individuals as shown in Fig. 7B are obtained. As can be seen from Figure 7B, the connection of the parent individuals is inherited genetically to the daughter individuals.

在图8A和8B中示出了在亲个体具有不同次序的情况下次序交叉处理的例子。对于次序交叉的处理，例如，可以使用PMX(部分映射交叉)。在此PMX中，1)随机选择两个交叉点，2)在两个交叉点之间的节点互换，3)如果每个节点在个体内都没有被使用(3-1)，那么该节点被无损地(intact)使用，如果该节点已经被使用(3-2)，那么它与这样的节点交换，从该节点将要映射未交换的节点，如果该节点也已经被使用了(3-3)，则该节点与这样的节点交换，从该节点将要映射未交换的节点。此时，已交换节点继承与亲节点或亲节点本身的子节点的连接。如图8A所示，关于具有次序X₀、X₁、X₂、X₃和X₄的亲个体和具有次序X₂、X₀、X₄、X₃和X₁的另一个亲个体，如果分别在第二和第三节点之间和在第四和第五节点之间的交叉点之间的节点根据PMX方法互换，则获得如图8B所示的子个体。如能从图8B看出的，亲个体的次序和连接已经遗传地传给子个体。An example of sequence crossover processing where parent individuals have different orders is shown in FIGS. 8A and 8B . For processing of order intersection, for example, PMX (Partial Mapping Intersection) can be used. In this PMX, 1) two intersections are randomly selected, 2) nodes are swapped between two intersections, 3) if each node is not used within an individual (3-1), then the node is used intact, if the node is already used (3-2), then it is swapped with the node from which the unswapped node is to be mapped, if the node is also already used (3-3 ), then the node is swapped with the node from which the unswapped node is to be mapped. At this point, the switched node inherits the connection to the parent node or to the child nodes of the parent node itself. As shown in Figure 8A, regarding a parent individual with the order X ₀ , X ₁ , X ₂ , X ₃ and X ₄ and another parent individual with the order X ₂ , X ₀ , X ₄ , X ₃ and X ₁ , if Nodes respectively between the second and third nodes and between the intersection points between the fourth and fifth nodes are exchanged according to the PMX method, and then a sub-individual as shown in FIG. 8B is obtained. As can be seen from Figure 8B, the order and linkage of the parent individuals has been inherited genetically to the daughter individuals.

在亲个体具有与图7A中所示相同的次序的情况下，如果根据PMX过程执行次序交叉，那么获得如图7B中的相同的子个体。那就是，图7A和7B中所示的连接交叉的处理是次序交叉处理中的具体情况(即，亲个体具有相同的次序)。仅当执行用于次序交叉的处理时，也执行用于连接交叉的处理。In the case of parent individuals having the same order as shown in FIG. 7A, if order crossing is performed according to the PMX process, then the same child individuals as in FIG. 7B are obtained. That is, the processing of connection intersections shown in FIGS. 7A and 7B is a specific case in the processing of order intersections (ie, parents have the same order). Only when processing for sequence crossing is performed, processing for connection crossing is also performed.

接下来，采用具体例子描述在步骤S6中的突变处理。Next, the mutation processing in step S6 is described using a specific example.

图9A和9B中示出了用于连接突变的处理的例子。通过将位于任意基因位置的基因反转到等位基因来实现此用于连接突变的处理。如图9A中所示，关于具有次序X₀、X₁、X₂、X₃和X₄的亲个体，如果在亲节点为X₃且子节点为X₁的情况下的基因位置处的基因“0”反转到等位基因“1”，并且如果在亲节点为X₄且子节点为X₀的情况下的基因位置处的基因“1”反转到等位基因“0”，那么获得如图9B中所示的子个体。An example of processing for linking mutations is shown in Figures 9A and 9B. This processing for junctional mutations is achieved by inverting genes at arbitrary gene positions to alleles. As shown in Figure 9A, with respect to parent individuals with the order X ₀ , X ₁ , X ₂ , X ₃ and X ₄ , if the gene at the gene position in the case where the parent node is X ₃ and the child node is X ₁ "0" is inverted to allele "1", and if the gene "1" at the gene position in the case where the parent node is X ₄ and the child node is X ₀ is inverted to allele "0", then A child individual as shown in Figure 9B is obtained.

图10A和10B中示出了用于次序突变处理的例子。例如，反转突变(IVM)可以用于次序突变的处理。此反转突变(IVM)包括1)随机选择多个相继连续的节点并且去除它们，并且2)反转去除的节点的次序并且将次序插入到随机位置。如图10A中所示，关于具有次序X₀、X₁、X₂、X₃和X₄的亲个体，选择并去除两个相继连续的节点X₂和X₃。它们的次序被反转。如果然后在X₄之后插入次序，那么获得如图10B中所示的子个体。An example for sequential mutation processing is shown in Figures 10A and 10B. For example, inversion mutagenesis (IVM) can be used for the treatment of sequential mutations. This inversion mutation (IVM) involves 1) randomly selecting a number of consecutive consecutive nodes and removing them, and 2) reversing the order of the removed nodes and inserting the order at random positions. As shown in FIG. 10A , with respect to parent individuals having the order X ₀ , X ₁ , X ₂ , X ₃ and X ₄ , two consecutive nodes X ₂ and X ₃ are selected and removed. Their order is reversed. If the sequence is then inserted after _X4 , a child individual as shown in Figure 10B is obtained.

由于图9A和9B中所示的用于连接突变的处理和图10A和10B中所示的用于次序突变的处理的相互独立的，这两种类型的处理都能够执行。然而，根据首先执行的一种处理类型，获得的子个体不同。图11A-11C中示出了在执行次序突变的处理之后执行连接突变的处理的例子。如图11A中所示，关于具有次序X₀、X₁、X₂、X₃和X₄的亲个体，选择并去除两个相继连续的节点X₂和X₃。它们的次序被反转并且然后在X₄之后插入。那就是，执行用于次序突变的处理。结果，获得如图11B中所示的个体。而且，关于这个个体，在亲节点为X₃且子节点为X₁情况下的基因位置处的基因“0”反转到等位基因“1”。在亲节点为X₄且子节点为X₀的情况下的基因位置处的基因“1”反转到等位基因“0”。那就是，执行用于连接突变的处理。结果，获得了如图11C所示的个体。Since the processing for link mutation shown in FIGS. 9A and 9B and the processing for sequential mutation shown in FIGS. 10A and 10B are independent of each other, both types of processing can be performed. However, the obtained sub-individuals differ depending on which type of processing is performed first. An example of the process of performing link mutation after the process of performing sequential mutation is shown in FIGS. 11A-11C . As shown in FIG. 11A , with respect to parent individuals with the order X ₀ , X ₁ , X ₂ , X ₃ and X ₄ , two consecutive nodes X ₂ and X ₃ are selected and removed. Their order is reversed and then inserted after _X4 . That is, processing for sequential mutation is performed. As a result, individuals as shown in Fig. 11B were obtained. Also, with respect to this individual, the gene "0" at the gene position in the case where the parent node is _X3 and the child node is _X1 is reversed to the allele "1". Gene "1" at a gene position where parent node is _X4 and child node is _X0 is inverted to allele "0". That is, processing for connection mutation is performed. As a result, individuals as shown in Fig. 11C were obtained.

回头参考附图4，在步骤S7限制亲节点的数目，然后处理跳回步骤S2。那就是，对于每个个体的每个子节点，关于具有与其自身的起因和结果关系的亲节点(FanIn)的数目，预先设置上限(MaxFanIn)。如果作为在步骤S5和S6的用于交叉的处理和用于突变的处理的结果，与任意子节点具有起因和结果关系的亲节点的数目超出了上限，则基因因此调整为保持关系FanIn≤MaxFanIn。如图12A、12B、13A和13B中所示的例子，其中亲节点的数目以这种方式得到限制。如图12A所示，关于具有次序X₀、X₁、X₂、X₃和X₄的亲个体和另一个具有次序X₂、X₀、X₄、X₃和X₁的亲个体，如果分别在位于第二和第三节点之间和位于第四和第五节点之间的交叉点之间的节点根据PMX过程互换，则获得如图12B中所示的子个体。在这个图中，关于在左侧的子个体，与子节点X₀具有起因和结果关系的亲节点(FanIn)的数目为4，超过了上限(MaxFanIn)3。因此，通过在图13A中所示的个体中选择亲节点为X₃且子节点为X₀的基因位置处的基因“1”，并将所选择的基因“1”反转到等位基因“0”，来产生如图13B中所示的个体，以实现FanIn≤MaxFanIn的关系。Referring back to FIG. 4, the number of parent nodes is limited in step S7, and then the process jumps back to step S2. That is, for each child node of each individual, an upper limit (MaxFanIn) is set in advance with respect to the number of parent nodes (FanIn) having a cause-and-effect relationship with itself. If, as a result of the processing for crossover and processing for mutation at steps S5 and S6, the number of parent nodes having causal and consequential relationships with any child node exceeds the upper limit, the gene is thus adjusted to maintain the relationship FanIn≤MaxFanIn . Examples are shown in Figures 12A, 12B, 13A and 13B, where the number of parent nodes is limited in this way. As shown in Figure 12A, regarding a parent individual with the order X ₀ , X ₁ , X ₂ , X ₃ and X ₄ and another parent individual with the order X ₂ , X ₀ , X ₄ , X ₃ and X ₁ , if Nodes respectively located between the intersections between the second and third nodes and between the fourth and fifth nodes are exchanged according to the PMX process, and then sub-individuals as shown in FIG. 12B are obtained. In this figure, regarding the child individual on the left, the number of parent nodes (FanIn) having a cause-and-effect relationship with the child node X ₀ is 4, which exceeds the upper limit (MaxFanIn)3. Therefore, by selecting the gene "1" at the gene position where the parent node is X ₃ and the child node is X ₀ in the individual shown in Figure 13A, and inverting the selected gene "1" to the allele "0", to generate individuals as shown in Fig. 13B, so as to realize the relationship of FanIn≤MaxFanIn.

当基因被反转为它们的等位基因时，就实现了FanIn≤MaxFanIn的关系，所反转的基因可以是随机选择的。选择也可以使得个体的估计值变得最大。在后面的情况中，可能需要计算关于具有其亲节点数目超过上限的子节点的个体的估计值。关于这个个体，可以不需要在步骤S2计算该估计值。能够利用在步骤S7中计算的估计值。When genes are inverted to their alleles, the FanIn≤MaxFanIn relationship is achieved, and the inverted genes can be randomly selected. Selection can also be made to maximize individual estimates. In the latter case, it may be necessary to compute estimates about individuals with child nodes whose number of parents exceeds an upper limit. With respect to this individual, it may not be necessary to calculate the estimated value at step S2. The estimated value calculated in step S7 can be utilized.

这样，根据本实施例的学习设备1，根据二维基因型，通过表示贝叶斯网络的网络结构(次序和连接)(即，在遗传算法中使用的个体)，将一定数目的具有二维基因型的个体作为初期个体群，使用遗传算法寻找初期个体群中的最佳个体，并将个体的表现型作为贝叶斯网络的网络结构，可以建立对NP-hard问题有效的准最佳网络结构。Thus, according to the learning device 1 of the present embodiment, a certain number of individuals with two-dimensional The genotype individuals are used as the initial individual group, and the genetic algorithm is used to find the best individual in the initial individual group, and the individual phenotype is used as the network structure of the Bayesian network, and a quasi-optimal network that is effective for NP-hard problems can be established structure.

而且，通过将表现型转换为二维基因型，并且在设计者具有关于网络结构(次序和连接)的知识的情况下执行突变处理，学习设备1使得可以全部或部分反映设计者关于在初期个体群中网络结构的知识。在希望在一些节点固定次序和连接的情况下，不同于固定次序和连接的具有二维基因型的个体被认为是致命基因并在上面的步骤S4从选择对象中去除。Furthermore, by converting the phenotype into a two-dimensional genotype and performing mutation processing in the case that the designer has knowledge about the network structure (order and connection), the learning device 1 makes it possible to fully or partially reflect the designer's knowledge about the individual at the initial stage. Knowledge of the network structure in the group. In the case where it is desired to fix the order and connection at some nodes, individuals with two-dimensional genotypes different from the fixed order and connection are regarded as lethal genes and removed from the selection objects in the above step S4.

此外，通过从作为学习结果而获得的个体中创建初期个体群，学习设备1能够进行网络结构的附加学习。Furthermore, the learning device 1 can perform additional learning of the network structure by creating an initial individual group from individuals obtained as a result of learning.

在图4中的流程说明图的叙述中，学习设备11执行交叉处理和突变处理。也可以执行两种类型的处理中的仅一种。In the description of the flowchart explanatory diagram in FIG. 4 , the learning device 11 executes crossover processing and mutation processing. It is also possible to perform only one of the two types of processing.

如公式(1)中所示，BD度量主要由(i)由网络结构和学习数据确定的N_ijk和(ii)由设计者的现有知识确定的N′_ijk组成。通常，在设计者的现有知识能针对关于某节点X_i和诸如p(v_ik，w_ij)的它们的亲节点的所有i′s和j′s而定义的情况下，则根据随后的公式(3)计算出N′_ijk。在公式(3)中，N′为等同样本大小(equivalent sample size)，并且是假定多少数目的样本用于代表从现有知识中获得的信息的参数。As shown in Equation (1), the BD metric mainly consists of (i) N _ijk determined by the network structure and learning data and (ii) N′ _ijk determined by the designer's prior knowledge. In general, where the designer's prior knowledge can be defined for all i's and j's about some node _Xi and their parent nodes such as p(vi _ik , w _ij ), then according to the following Formula (3) calculates N' _ijk . In formula (3), N' is an equivalent sample size, and is a parameter assuming how many samples are used to represent information obtained from existing knowledge.

N′_ijk＝p(v_ik，w_ij)×N′ (3)N' _ijk = p( _vik , w _ij )×N' (3)

在设计者具有关于网络结构的现有知识的情况下，设计者的现有知识能够通过把以这种方式计算出的N′_ijk代入上述公式(1)中而反映。In the case where the designer has prior knowledge about the network structure, the designer's prior knowledge can be reflected by substituting N' _ijk calculated in this way into the above formula (1).

同时，在设计者不具有这样的现有知识的情况下，一般在N′_ijk＝1的条件下计算BD度量。在N′_ijk＝1的条件下计算的BD度量特别被称为K2度量。Meanwhile, in the case where the designer does not have such prior knowledge, the BD metric is generally calculated under the condition of N' _ijk =1. The BD metric calculated under the condition of N' _ijk =1 is especially called K2 metric.

然而，在以此方式假定N′_ijk＝1的情况下，如果有向非循环图(DAG)属于导致相同推论结果的相同的马尔可夫(Morkov)等同类，那么有向非循环图的计算出的BD度量值也可能不同。However, assuming N′ _ijk = 1 in this way, if the directed acyclic graph (DAG) belongs to the same Markovian (Morkov) equivalent class leading to the same inference result, then the calculation of the directed acyclic graph The resulting BD metric may also be different.

作为一个例子，现在考虑如图14中所示的阴天、洒水装置、下雨和湿草四个节点组成的网络结构。As an example, now consider the network structure composed of four nodes of cloudy sky, sprinkler, rain and wet grass as shown in Fig.14.

在图14中所示的有向非循环图(DAG)中，G1到G3具有相同的链接并且具有相同的非耦接的头头相遇(uncoupled head-to-head meeting)(洒水装置→湿草←下雨)，并且因此，这三个图能够以相同的DAG模式gp表示出。然而，G4具有与G1到G3相同的链接，但是具有另外的非耦接的头头相遇(洒水装置→阴天←下雨)并且因此不能以DAG模式gp表示。在图14中还示出了当向四个DAG给出一些学习数据时，在N′_ijk＝1条件下计算出的估计值(BD度量的对数)。In the directed acyclic graph (DAG) shown in Figure 14, G1 to G3 have the same link and have the same uncoupled head-to-head meeting (sprinkler → wet grass ← rain), and thus, these three graphs can be represented by the same DAG schema gp. However, G4 has the same linkage as G1 to G3, but has an additional uncoupled head-to-head encounter (sprinkler → cloudy → rainy) and thus cannot be represented in the DAG schema gp. Also shown in FIG. 14 are estimated values (logarithms of the BD metric) calculated under the condition of N' _ijk =1 when some learning data are given to the four DAGs.

如图15A中所示，以在下面使用具有条件概率表(CPT)的DAG的方式来创建学习数据。那就是，在作为亲子关系的最高等级节点的阴天上，基于条件概率表，首先概率性确定该节点是真还是假。现在假定阴天＝真。在作为阴天的子节点的洒水装置和下雨中的每个，然后基于在亲条件下的条件概率表，概率性地作出关于该节点是真还是假的结论。在此假定洒水装置＝假并且下雨＝真。则在作为洒水装置和下雨的子节点的湿草，基于在亲条件下的条件概率表，概率性地确定该节点是真还是假。这样，则创建了关于一个事件的学习数据。如图15B中所示，类似地创建了关于1000个事件的学习数据。As shown in FIG. 15A , the learning data is created in such a way that a DAG with a conditional probability table (CPT) is used below. That is, on the cloudy sky which is the highest-level node of the parent-child relationship, based on the conditional probability table, it is first probabilistically determined whether the node is true or false. Now assume Cloudy=True. In each of sprinklers and rain as child nodes of cloudy, a conclusion is then made probabilistically as to whether that node is true or false based on the conditional probability table in the parent condition. Here it is assumed that Sprinklers=False and Raining=True. Then at Wet Grass, which is a child node of Sprinkler and Rain, probabilistically determine whether the node is true or false based on the conditional probability table under the parent condition. In this way, learning data on an event is created. As shown in Fig. 15B, learning data on 1000 events was similarly created.

如图14中所示，其中作出N′_ijk＝1的假定，G1和G3的估计值与G2的估计值不同。在以此方式假定N′_ijk＝1的情况下，应该产生相同估计值、并且属于相同Markov等式类的DAG，即能够以相同DAG模式表示的DAG，可以产生不同的BD度量的计算值。As shown in Fig. 14, where the assumption of N' _ijk = 1 is made, the estimated values of G1 and G3 are different from the estimated value of G2. Assuming N' _ijk =1 in this way, DAGs that should produce the same estimated value and belong to the same class of Markov equations, ie DAGs that can be represented in the same DAG mode, can produce different calculated values of the BD metric.

因此，当如上所述将BD度量的对数作为估计值并且基于该估计值寻找最佳网络结构时，假定N′_ijk＝1是不合适的。Therefore, when taking the logarithm of the BD metric as an estimated value and finding an optimal network structure based on the estimated value as described above, it is inappropriate to assume N' _ijk =1.

因此，在本实施例中，如下确定N′_ijk，使属于相同Markov等式类的DAG产生相同的BD度量的计算值。Therefore, in this embodiment, N' _ijk is determined such that DAGs belonging to the same Markov equation class produce the same calculated value of the BD metric.

首先，在第一方法中，在节点X_i的状态的数目设为r_i。根据下面的公式(4)计算所有的它们各自的连接概率分布p(X₀，X₁，......，X_n-1)。First, in the first method, the number of states at node _Xi is set to r _i . All their respective connection probability distributions p(X ₀ , X ₁ , . . . , X _n-1 ) are calculated according to the following formula (4).

$p p (({x x}_{00},, {x x}_{11},, {Λx Λx}_{n no - - 11})) = = 11 / / {Π Π}_{i i = = 00}^{n no - - 11} {r r}_{i i} - - - - - - ((44))$

同时发生频率c(X₀，X₁，......，X_n-1)全部设为1，使现有知识的影响最小化。在这种状态下，由下面给出的公式(5)确定N′_ijk。The simultaneous occurrence frequencies c(X ₀ , X ₁ , . . . , X _n-1 ) are all set to 1 to minimize the influence of existing knowledge. In this state, N' _ijk is determined by the formula (5) given below.

${N N}_{ijk ijk}^{''} = = (({Π Π}_{i i = = 00}^{n no - - 11} {r r}_{i i})) / / {r r}_{i i} {q q}_{i i} - - - - - - ((55))$

在这个第一方法中，N′_ijk的值随着节点n的数目的增大和状态r_i的数目的增大而增大。因此，存在学习数据的影响N_ijk变得比现有知识的影响N′_ijk的小的可能性。因此，在第二方法中，N′_ijk＝0的关系被引入以消除现有知识的影响。In this first method, the value of N' _ijk increases as the number of nodes n increases and the number of states r _i increases. Therefore, there is a possibility that the influence N _ijk of the learning data becomes smaller than the influence N′ _ijk of the existing knowledge. Therefore, in the second method, the relation of N' _ijk =0 is introduced to eliminate the influence of existing knowledge.

关于如图14中所示的四个DAG，图16示出由第一和第二方法确定的N′_ijk来计算估计值。如图16中所示，在由第一和第二方法确定N′_ijk的情况下，G1到G3的估计值全部相同。Regarding the four DAGs as shown in FIG. 14 , FIG. 16 shows N′ _ijk determined by the first and second methods to calculate estimated values. As shown in FIG. 16, in the case where N' _ijk is determined by the first and second methods, the estimated values of G1 to G3 are all the same.

下面描述一种具体实施例。在这个实施例中，假定通过安装到电视接收器(下文中缩写为TV接收器)的摄像机来观察用户从而推断用户行为的贝叶斯网络模型。根据以前准备的学习数据来构建网络结构。A specific embodiment is described below. In this embodiment, a Bayesian network model in which user behavior is inferred by observing the user through a camera mounted to a television receiver (hereinafter abbreviated as TV receiver) is assumed. Build the network structure based on the previously prepared learning data.

学习数据已经以下面所述的方式准备好。The learning data has been prepared in the manner described below.

首先，在TV接收器前进行操作的使用者的照片由摄像机提取。从输入的图像，如图17-20中所示已经认知四种类型的参数((1)FaceDir：(FaceDirection)脸部的方向；(2)FacePlace：脸部的位置；(3)FaceSize：脸部的大小；和(4)OptiFlow(OpticalFlowDirection)：用户的运动)。关于脸部的方向(FaceDir)，如图17中所示，每个输入图像在上下方向被分成3段并且在左右方向被分为5段。假设用户的脸部在中心位置。总共存在16个状态。那就是，该脸部定向于15个区域中的任意一个。在第16状态，在输入的图像中不存在用户的脸部。First, a photo of a user operating in front of a TV receiver is captured by a video camera. From the input image, four types of parameters have been recognized as shown in Figures 17-20 ((1) FaceDir: (FaceDirection) the direction of the face; (2) FacePlace: the position of the face; (3) FaceSize: the size of the face; and (4) OptiFlow (OpticalFlowDirection): user's motion). Regarding the direction of the face (FaceDir), as shown in FIG. 17 , each input image is divided into 3 segments in the up-down direction and into 5 segments in the left-right direction. Assume the user's face is at the center. There are 16 states in total. That is, the face is oriented in any one of 15 regions. In state 16, the user's face does not exist in the input image.

关于脸部的位置(FacePlace)，分类由所有学习数据指示的关于脸部位置的信息，例如，如图18中所示采用矢量量化过程。总共产生10个状态。那就是，判定用户的脸部存在于9个区域中的任意一个。在第10状态，用户的脸部不存在于输入图像中。Regarding the position of the face (FacePlace), the information on the face position indicated by all the learning data is classified, for example, employing a vector quantization process as shown in FIG. 18 . A total of 10 states are generated. That is, it is determined that the user's face exists in any one of the nine areas. In state 10, the user's face does not exist in the input image.

关于脸部的大小(FaceSize)，总共产生5个状态。那就是，判定用户脸部的大小最接近于如图19中所示的4个大小中的任意一个。在第5状态，判定用户的脸部不存在于输入图像中。关于用户的运动(OptiFlow)，总共产生9个状态。那就是，判定用户的运动方向最接近于如图20中所示的8个方向中的任意一个。在第9状态，在输入图像中没有发现任何运动。Regarding the size of the face (FaceSize), a total of 5 states are generated. That is, it is determined that the size of the user's face is closest to any one of the 4 sizes shown in FIG. 19 . In the fifth state, it is determined that the user's face does not exist in the input image. With respect to the user's motion (OptiFlow), a total of 9 states are generated. That is, it is determined that the user's motion direction is closest to any one of the eight directions as shown in FIG. 20 . In state 9, no motion is found in the input image.

以下面的四种方式来标记认知的结果：The results of cognition are marked in the following four ways:

(1)Channel(通信信道)：用户是否面对TV接收器。(1) Channel (communication channel): whether the user is facing the TV receiver.

(2)ComSignal(通信信号)：用户是否正在操作TV接收器。(2) ComSignal (communication signal): Whether the user is operating the TV receiver.

(3)UserGoalTV：用户是否意识到TV接收器。(3) UserGoalTV: Whether the user is aware of the TV receiver.

(4)UserPresence：用户是否出现在TV接收器之前。所有的标记结果均由二元化形式是或否来表示。(4) UserPresence: whether the user appears before the TV receiver. All flagged outcomes are represented as binary yes or no.

而且，为了处理动态事件，讨论上述认知结果以及标记操作发生的时序。在某时刻发生的数据条目加后缀“＿t＿0”。一个时间单位(tick)之前发生的数据条目加后缀“_t_1”。两个时间单位之前发生的数据条目加后缀“_t_2”。表示的一个例子为“FacePlace_ t_0”。Furthermore, in order to deal with dynamic events, the aforementioned cognitive results are discussed as well as the timing at which marking operations occur. Data entries that occur at a certain time are suffixed with "_t_0". Data entries that occurred one time unit (tick) ago are suffixed with "_t_1". Data entries that occurred two time units ago are suffixed with "_t_2". An example of a representation is "FacePlace_t_0".

在四种认知结果和四种标记每个用于3个时间单位的情况下，节点的数目是24。假定时间单位间隔为1秒，从大约90分钟的运动图片序列(30帧/秒)已经准备关于165,000个事件的学习数据。The number of nodes is 24 in the case of four cognitive outcomes and four markers each for 3 time units. Assuming that the time unit interval is 1 second, learning data on 165,000 events has been prepared from about 90 minutes of moving picture sequences (30 frames/second).

基于如图21中所示的学习数据，使用K2算法建立网络结构。在这个时候，在节点之间的次序设置如下：Based on the learning data as shown in FIG. 21, the network structure is established using the K2 algorithm. At this time, the order among the nodes is set as follows:

FacePlace_t_0，FaceSize_t_0，FaceDir_t_0，OptiFlow_t_0，Channel_t_0，ComSignal_t_0，UserGoalTV_t_0，UserPresence_t_0，FacePlace_t_1FaceSize_t_1，FaceDir_t_1，OptiFlow_t_1，Channel_t_1，ComSignal_t_1，UserGoalTV_t_1，UserPresence_t_1，FacePlace_t_2，FaceSize_t_2，FaceDir_t_2，OptiFlow_t_2，Channel_t_2，ComSignal_t_2，UserGoalTV_t_2，UserPresence_t_2。FacePlace_t_0，FaceSize_t_0，FaceDir_t_0，OptiFlow_t_0，Channel_t_0，ComSignal_t_0，UserGoalTV_t_0，UserPresence_t_0，FacePlace_t_1FaceSize_t_1，FaceDir_t_1，OptiFlow_t_1，Channel_t_1，ComSignal_t_1，UserGoalTV_t_1，UserPresence_t_1，FacePlace_t_2，FaceSize_t_2，FaceDir_t_2，OptiFlow_t_2，Channel_t_2，ComSignal_t_2，UserGoalTV_t_2，UserPresence_t_2。

在本实施例中，使用图21中所示的网络结构作为初始结构、并且基于与前述学习数据相同的学习数据，进行网络结构的附加学习。图22-25中所示为在学习处理期间的网络结构变化，分别示出了第20代、第40代、第60代和第80代的网络结构。从图21-25能够看出，随着重复代改变，精华个体的估计值(BD度量的对数)增加。从第80代到第200代估计值不改变。因此，可以说在大约第80代达到收敛并且建立准最佳网络结构。在节点之间的最终次序如下：In this embodiment, additional learning of the network structure is performed using the network structure shown in FIG. 21 as the initial structure and based on the same learning data as the aforementioned learning data. The network structure changes during the learning process are shown in Figs. 22-25, showing the network structures of the 20th generation, 40th generation, 60th generation and 80th generation, respectively. From Figures 21-25, it can be seen that the estimated value (logarithm of the BD metric) of elite individuals increases as the iteration changes. Estimates do not change from generation 80 to generation 200. Therefore, it can be said that convergence is reached at about the 80th generation and a quasi-optimal network structure is established. The final order among the nodes is as follows:

FaceDir_t_0，FaceSize_t_0，FacePlace_t_0，Channel_t_0，OptiFlow_t_0，UserPresence_t_0，FaceDir_t_1，UserGoalTV_t_0，FaceSize_t_1，FacePlace_t_1，ComSignal_t_1，Channel_t_2，Channel_t_1，ComSignal_t_0，OptiFlow_t_1，FaceSize_t_2，FaceDir_t_2，FacePlace_t_2，ComSignal_t_2，OptiFlow_t_2，UserGoalTV_t_1，UserGoalTV_t_2，UserPresence_t_1，UserPresence_t_2。FaceDir_t_0，FaceSize_t_0，FacePlace_t_0，Channel_t_0，OptiFlow_t_0，UserPresence_t_0，FaceDir_t_1，UserGoalTV_t_0，FaceSize_t_1，FacePlace_t_1，ComSignal_t_1，Channel_t_2，Channel_t_1，ComSignal_t_0，OptiFlow_t_1，FaceSize_t_2，FaceDir_t_2，FacePlace_t_2，ComSignal_t_2，OptiFlow_t_2，UserGoalTV_t_1，UserGoalTV_t_2，UserPresence_t_1，UserPresence_t_2。

尽管到目前为止已经描述了实施本发明的最佳模式，但本发明不限于上述的实施例。当然，在不偏离权利要求所描绘的本发明的范围的情况下，能够作出各种修改。Although the best mode for carrying out the invention has been described so far, the invention is not limited to the above-described embodiments. Various modifications can, of course, be made without departing from the scope of the invention as outlined in the claims.

相关申请的交叉参考Cross References to Related Applications

本发明包含分别于2005年10月31日和2006年4月19日向日本专利局提交的日本专利申请JP2005-317031和JP2006-116038的相关主题，在此并入上述申请的全文作为引用。The present application contains subject matter related to Japanese Patent Applications JP2005-317031 and JP2006-116038 filed in the Japan Patent Office on Oct. 31, 2005 and April 19, 2006, respectively, the entire contents of which are hereby incorporated by reference.

Claims

1. facility for study is used for setting up based on learning data the network structure of Bayesian network, represents cause between a plurality of nodes and the relation between the result by digraph in Bayesian network, and described facility for study comprises:

Memory storage, the storage learning data; And

Learning device is used for setting up network structure based on learning data;

Wherein, described learning device prepares all have the initial stage groups of individuals that genotypic individuality is formed by each, order between the node and the relation between cause and the result in this genotype, have been stipulated, repeatedly carry out intersection and/or sudden change processing on the groups of individuals in the early stage based on genetic algorithm, calculate each individual estimated value based on learning data, seek best body one by one, and with the phenotype of this optimized individual as network structure.

2. facility for study as claimed in claim 1, wherein, described genotype order according to the rules will be arranged in a plurality of nodes on the first direction as close node, with according to the rules sequencing perpendicular to a plurality of nodes on the second direction of first direction as child node, and the existence by coming cause between the regulation respective nodes and result relation at the allele on each gene location or do not exist, each close node and each child node are corresponding mutually on this gene location.

3. facility for study as claimed in claim 2, wherein, child node have for or the gene at gene location place that is higher than the order of close node order show the performance do not have feature.

4. facility for study as claimed in claim 2, wherein, have under the situation that outnumbers given number of close node of cause and result relation with the anyon node, gene at least one gene location that described learning device will be associated with this child node is reversed to its allele, makes the number that has a close node of cause and result relation with child node become littler than given number.

5. facility for study as claimed in claim 1, wherein, whole or some existing knowledge of described initial stage groups of individuals reflection deviser.

6. facility for study as claimed in claim 1, wherein, described initial stage groups of individuals is based on the individuality that obtains as learning outcome.

7. facility for study as claimed in claim 1, wherein, described estimated value is the logarithm of BD tolerance, and wherein, i node in n node is as child node X _iThe time, child node X _iCan be from v _I0To v _Iri-1Suppose r _iThe value, and with child node X _iModel number with the value that can suppose of all close nodes of cause and result relation is q _iIf, child node X _iValue be v _IkAnd the generation data are wherein with child node X _iThe value of all close nodes with cause and result relation is supposed the j pattern, then according to following formula calculate the times N of before obtaining learning data, expecting ' _Ijk:

{N'}_{ijk} = (Π_{i = 0}^{n - 1} r_{i}) / r_{i} q_{i} .

8. facility for study as claimed in claim 1, wherein, described estimated value is the logarithm of BD tolerance, and wherein, the i node in n node is as child node X _iThe time, child node X _iCan be from v _I0To v _Iri-1Suppose r _iThe value, and with child node X _iThe number of pattern with the value that can suppose of all close nodes of cause and result relation is q _iIf, child node X _iValue be v _IkAnd the generation data, wherein with child node X _iThe value of all close nodes with cause and result relation is assumed to the j pattern, then before learning data obtains the times N of expection ' _IjkBe set to equal zero (N ' _Ijk=0).

9. set up the learning method of the network structure of Bayesian network based on learning data for one kind, in described Bayesian network, be illustrated between cause between a plurality of nodes and the result by digraph and concern that this learning method comprises step:

Preparation all has the initial stage groups of individuals that genotypic individuality is formed by each, has stipulated the relation between the order between a plurality of nodes and cause and result in described genotype;

Repeat the processing that intersects and/or suddenly change on the groups of individuals in the early stage based on genetic algorithm;

Calculate each individual estimated value based on learning data;

One of the best in the searching individuality; And

With the phenotype of this optimized individual as network structure.

10. facility for study is used for setting up based on learning data the network structure of Bayesian network, in described Bayesian network, is illustrated in cause between a plurality of nodes and the relation between the result by digraph, and described facility for study comprises:

Storage area, the storage learning data; And

The study part is operable as based on learning data and sets up network structure;

Wherein, described study part prepares all have the initial stage groups of individuals that genotypic individuality is formed by each, order between the node and the relation between cause and the result in described genotype, have been stipulated, repeat intersection and/or sudden change processing on the groups of individuals in the early stage based on genetic algorithm, calculate each individual estimated value based on learning data, seek one of the best in the individuality, and with the phenotype of optimized individual as network structure.