CN109918646B

CN109918646B - Method, system and device for judging causality of text

Info

Publication number: CN109918646B
Application number: CN201910089352.1A
Authority: CN
Inventors: 向露; 刘洋; 张家俊; 周玉; 宗成庆
Original assignee: Institute of Automation of Chinese Academy of Science
Current assignee: Institute of Automation of Chinese Academy of Science
Priority date: 2019-01-30
Filing date: 2019-01-30
Publication date: 2020-08-11
Anticipated expiration: 2039-01-30
Also published as: CN109918646A

Abstract

The invention belongs to the technical field of natural language processing, and in particular relates to a method, system and device for judging causality of texts, and aims to solve the problem of judging causality of texts in robot interaction. The method of the invention includes: based on the language activation model, obtaining the registration event with the highest matching degree for each target text in the input target text pair; based on the registration event corresponding to each target text, according to the stored registration event sequence in each scene , calculate the correlation of the two registered events; based on the target text pair and the correlation of the two registered events, calculate the causal relationship of the target text pair. The present invention can accurately judge the causal relationship of the input target text pair.

Description

Method, system and device for judging causality of text

技术领域technical field

本发明属于自然语言处理技术领域，具体涉及一种篇章因果关系判断方法、系统、装置。The invention belongs to the technical field of natural language processing, and in particular relates to a method, system and device for judging causal relationship of a text.

背景技术Background technique

篇章与人们的日常交流几乎是密不可分的。人们更是会频繁使用篇章关系来表述和传达上下文之间的语义关系(例如：因果关系，递进关系，转折关系等)。同时，随着机器人逐步在人们的日常生活中扮演一个越来越重要的角色，如何让机器人理解人们的日常表达中的篇章关系成为了一个不可回避的问题。但是，这个问题的难点在于并没有一个很好的理论以及模型框架，而现有的方法又局限在文本层面并且很难获取文本内在的语义这导致现有的方法依然存在很多问题。Chapters are almost inseparable from people's daily communication. People often use discourse relations to express and convey the semantic relations between contexts (for example: causal relations, progressive relations, transition relations, etc.). At the same time, as robots gradually play an increasingly important role in people's daily life, how to make robots understand the textual relationship in people's daily expressions has become an unavoidable problem. However, the difficulty of this problem is that there is no good theory and model framework, and the existing methods are limited to the text level and it is difficult to obtain the intrinsic semantics of the text, which leads to many problems in the existing methods.

最近，在人工智能领域有研究人员提出了一种成为情景化语言学习的思路，这种思路认为“语言的语义主要来自于环境”，根据这种思路若希望机器能够理解语言，这台机器应该是可感知的，具有交互能力的，其中篇章因果关系的判断在机器人的交互中具有重要作用。Recently, some researchers in the field of artificial intelligence have proposed an idea to become contextualized language learning. This idea believes that "the semantics of language mainly comes from the environment". According to this idea, if you want a machine to understand language, the machine should It is perceptible and interactive, and the judgment of textual causality plays an important role in the interaction of robots.

发明内容SUMMARY OF THE INVENTION

为了解决现有技术中的上述问题，即为了解决机器人交互中的篇章因果关系判断问题，本发明的第一方面，提出了一种篇章因果关系判断方法，该方法包括：In order to solve the above problems in the prior art, that is, in order to solve the problem of judging the causal relationship of texts in robot interaction, the first aspect of the present invention proposes a method for judging the causal relationship of texts, and the method includes:

步骤S10，基于语言激活模型，对输入的目标文本对中每条目标文本分别获取匹配度最高的注册事件；所述目标文本对为输入两条目标文本；Step S10, based on the language activation model, obtain the registration event with the highest matching degree for each target text in the input target text pair; the target text pair is two input target texts;

步骤S20，基于每条目标文本对应的注册事件，依据所存储的各场景中注册事件序列，计算两个注册事件的相关性；所述场景中注册事件序列为基于场景结构化的经验信息构建的具有布尔时序特征的注册事件序列；Step S20, based on the registration event corresponding to each target text, according to the stored registration event sequence in each scene, calculate the correlation of the two registered events; the registered event sequence in the scene is constructed based on the empirical information of the scene structure. A sequence of registered events with Boolean timing characteristics;

步骤S30，基于所述目标文本对、步骤S20得到的两个注册事件的相关性，计算所述目标文本对的因果关系；Step S30, based on the target text pair and the correlation of the two registration events obtained in step S20, calculate the causal relationship of the target text pair;

其中，所述语言激活模型通过经验-语言激活训练语料对机器翻译模型进行训练得到；所述经验-语言激活训练语料基于机器人经验中的注册事件构建。Wherein, the language activation model is obtained by training the machine translation model through experience-language activation training corpus; the experience-language activation training corpus is constructed based on registration events in robot experience.

在一些优选实施方式中，步骤S30中“计算所述目标文本对的因果关系”，其方法为：In some preferred embodiments, in step S30, "calculate the causal relationship of the target text pair", and the method is:

f_r＝softmax(tanh(W_c*[s₁；s₂]+W_t*fea_t+b))f _r =softmax(tanh(W _c *[s ₁ ; s ₂ ]+W _t *fea _t +b))

其中，f_r为因果关系概率值，包括因果关系概率、非因果关系概率；s₁、s₂是目标文本对中两个目标文本通过文本编码模型获得的句子向量；W_c为预设的文本向量的参数矩阵；W_t为预设的布尔时序特征的参数矩阵；b为预设的偏置量。Among them, f _r is the causal relationship probability value, including causal relationship probability and non-causal relationship probability; s ₁ and s ₂ are the sentence vectors obtained by the text encoding model of the two target texts in the target text pair; W _c is the preset text The parameter matrix of the vector; W _t is the parameter matrix of the preset Boolean timing characteristics; b is the preset offset.

在一些优选实施方式中，步骤S20“计算两个注册事件的相关性”，其方法为：In some preferred embodiments, step S20 "calculates the correlation of two registration events", and the method is as follows:

其中，fea_t为所计算的相关性；e₁、e₂分别为目标文本对中第一目标文本、第二目标文本匹配到的注册事件；P(e₁)为所存储全部场景结构化的经验信息中e₁出现的概率；P(e₂|e₁)为出现e₁的结构化的经验信息中e₂的出现概率。Among them, fea _t is the calculated correlation; e ₁ , e ₂ are the registration events matched by the first target text and the second target text in the target text pair, respectively; P(e ₁ ) is the stored structure of all scenes The probability of e ₁ appearing in empirical information; P(e ₂ |e ₁ ) is the appearance probability of e ₂ in the structured empirical information of e ₁ .

在一些优选实施方式中，所述注册事件，In some preferred embodiments, the registration event,

e_i＝{obj_i}，i∈R_i e _i ={obj _i }, i∈R _i

其中，e_i为第i个注册事件；obj_i为该注册事件中处于激活状态的对象；R_i为该注册事件中的对象符号在语言激活模型全部对象的编号。Among them, e _i is the ith registration event; obj _i is the object in the active state in the registration event; R _i is the number of all objects in the language activation model of the object symbol in the registration event.

在一些优选实施方式中，所述经验-语言激活训练语料，其表达式为In some preferred embodiments, the experience-linguistic activation training corpus is expressed as

E_j＝{obj_i}：{LS_i}，i∈R_i E _j = {obj _i }: {LS _i }, i∈R _i

其中，E_j为第j个经验-语言激活训练语料；LS_i为对象obj_i对应的语言字符。Among them, E _j is the jth experience-language activation training corpus; LS _i is the language character corresponding to the object obj _i .

在一些优选实施方式中，步骤S20所述各场景中注册事件序列，其获取方法包括：In some preferred embodiments, the sequence of events registered in each scenario described in step S20, the acquisition method thereof includes:

步骤A10，在机器人的工作环境中，通过机器人自身的感知装置获取的机器人与对象交互得到的对象信息；所述对象信息包括机器人与对象交互得到的对象属性、对象状态信息；Step A10, in the working environment of the robot, the object information obtained by the interaction between the robot and the object obtained by the robot's own sensing device; the object information includes the object attribute and object state information obtained by the interaction between the robot and the object;

步骤A20，对步骤A10中获取的对象信息进行结构化处理，并按照时间顺序进行组织和存储，作为机器人的运行工作环境得到的结构化的经验信息。In step A20, the object information obtained in step A10 is subjected to structured processing, and is organized and stored in chronological order, as structured experience information obtained from the operating working environment of the robot.

步骤A30，去除所述结构化的经验信息中处于未激活状态的对象，得到对应场景中注册事件序列。Step A30, remove the objects in the inactive state in the structured experience information, and obtain the registration event sequence in the corresponding scene.

在一些优选实施方式中，所述结构化的经验信息，其表达式为In some preferred embodiments, the structured empirical information is expressed as

E_0-t＝{f₀,f₁,f₂,…,f_t}E _0-t ={f ₀ ,f ₁ ,f ₂ ,...,f _t }

f_t＝{obj₁:a₁；…obj_i:a_i；…obj_n:a_n；}f _t ={obj ₁ :a ₁ ;...obj _i :a _i ;...obj _n :a _n ;}

其中，E_0-t为从0时刻到t时刻的所有经验；f_t为第t时刻输入的所有对象信息；obj_i为第i个对象，a_i为对象obj_i的激活状态，a_i＝1时表示第i个对象被激活，a_i＝0时表示第i个对象未被激活，i∈[1,n]。Among them, E _0-t is all experience from time 0 to time t; f _t is all object information input at time t; obj _i is the ith object, a _i is the activation state of object obj _i , a _i = When it is 1, it means that the ith object is activated, and when a _i =0, it means that the ith object is not activated, i∈[1,n].

本发明的第二方面，提出了一种篇章因果关系判断系统，该系统包括注册事件匹配模块、相关性计算模块、因果关系计算模块；In a second aspect of the present invention, a system for judging a causal relationship of a text is proposed, which includes a registration event matching module, a correlation computing module, and a causal relationship computing module;

所述注册事件匹配模块，配置为基于语言激活模型，对输入的目标文本对中每条目标文本分别获取匹配度最高的注册事件；所述目标文本对为输入两条目标文本；The registration event matching module is configured to obtain the registration event with the highest matching degree for each target text in the input target text pair based on the language activation model; the target text pair is two input target texts;

所述相关性计算模块，配置为基于每条目标文本对应的注册事件，依据所存储的各场景中注册事件序列，计算两个注册事件的相关性；所述场景中注册事件序列为基于场景结构化的经验信息构建的具有布尔时序特征的注册事件序列；The correlation calculation module is configured to calculate the correlation of two registered events based on the registered event corresponding to each target text and according to the stored registered event sequence in each scene; the registered event sequence in the scene is based on the scene structure A registration event sequence with Boolean timing characteristics constructed from the transformed empirical information;

所述因果关系计算模块，配置为基于所述目标文本对、步骤S20得到的两个注册事件的相关性，计算所述目标文本对的因果关系；The causal relationship calculation module is configured to calculate the causal relationship of the target text pair based on the target text pair and the correlation of the two registered events obtained in step S20;

本发明第三方面，提出了一种存储装置，其中存储有多条程序，所述程序适于由处理器加载并执行以实现上述篇章因果关系判断方法。In a third aspect of the present invention, a storage device is provided, in which a plurality of programs are stored, and the programs are adapted to be loaded and executed by a processor to realize the above-mentioned method for judging the causality of a chapter.

本发明的第四方面，提出了一种处理装置，包括处理器、存储装置；处理器，适于执行各条程序；存储装置，适于存储多条程序；所述程序适于由处理器加载并执行以实现上述篇章因果关系判断方法。In a fourth aspect of the present invention, a processing device is provided, including a processor and a storage device; the processor is adapted to execute various programs; the storage device is adapted to store multiple programs; the programs are adapted to be loaded by the processor And execute it to realize the above article causality judgment method.

本发明的有益效果：Beneficial effects of the present invention:

通过本发明中场景的注册事件序列进行相关性计算，并进一步结合目标文本对的注册事件，进行目标文本对的因果关系计算，可以得到准确的因果关系判断结果。An accurate causal relationship judgment result can be obtained by performing correlation calculation through the registration event sequence of the scene in the present invention, and further combining the registration events of the target text pair to perform the causal relationship calculation of the target text pair.

附图说明Description of drawings

通过阅读参照以下附图所作的对非限制性实施例所作的详细描述，本申请的其它特征、目的和优点将会变得更明显：Other features, objects and advantages of the present application will become more apparent by reading the detailed description of non-limiting embodiments made with reference to the following drawings:

图1是本发明一种实施例的篇章因果关系判断方法流程示意图；1 is a schematic flowchart of a method for judging a causal relationship of a text according to an embodiment of the present invention;

图2是本发明一种实施例的篇章因果关系判断示例。FIG. 2 is an example of judging the causal relationship of a text according to an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚，下面将结合附图对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings. Obviously, the described embodiments are part of the embodiments of the present invention, not All examples. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

下面结合附图和实施例对本申请作进一步的详细说明。可以理解的是，此处所描述的具体实施例仅仅用于解释相关发明，而非对该发明的限定。另外还需要说明的是，为了便于描述，附图中仅示出了与有关发明相关的部分。The present application will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the related invention, but not to limit the invention. In addition, it should be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

需要说明的是，在不冲突的情况下，本申请中的实施例及实施例中的特征可以相互组合。It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict.

本发明的基本思想是利用机器人工作平台的信号输入，针对机器人构建固化网络，再利用深度神经网络技术实现对文本语义更好地建模以达到最后因果关系判断的目的。The basic idea of the invention is to use the signal input of the robot working platform to build a solidified network for the robot, and then use the deep neural network technology to achieve better modeling of text semantics to achieve the purpose of final causal relationship judgment.

本发明的一种篇章因果关系判断方法，如图1所示，该方法包括：A method for judging the causal relationship of a text of the present invention, as shown in Figure 1, the method includes:

为了更清晰地对本发明篇章因果关系判断方法进行说明，下面分别从三个方面进行性展开详述：1、各场景中注册事件序列的构建；2、语言激活模型的构建；3、篇章因果关系判断。In order to more clearly describe the method for judging the causality of the text of the present invention, the following is a detailed description from three aspects: 1. The construction of the registered event sequence in each scene; 2. The construction of the language activation model; 3. The textual causality judge.

1、各场景中注册事件序列的构建1. Construction of registered event sequences in each scene

步骤A10，在机器人的工作环境中，通过机器人自身的感知装置获取的机器人与对象交互得到的对象信息；所述对象信息包括机器人与对象交互得到的对象属性、对象状态信息。In step A10, in the working environment of the robot, the object information obtained by the interaction between the robot and the object obtained by the robot's own perception device; the object information includes object attributes and object state information obtained by the interaction between the robot and the object.

将机器人放置于工作环境中，利用机器人的感知装置接收环境中的信号(即机器人与对象交互得到的对象信息)。通过机器人在工作环境中的不断交互动作，不断更新的获取的信息。例如，在场景中，有大量的对象与机器人进行交互。这些对象本身就具有属性，同时这些属性具有对应的状态。特别是，机器人与对象进行交互之后，对象的状态就会发生变化。机器人则需要感知这些属性以及状态来获得这些对象的信息。如机器人面的对象“牛”的属性和状态被显示表现出来，该对象记为obj2，name＝cow)。The robot is placed in the working environment, and the sensing device of the robot is used to receive the signals in the environment (that is, the object information obtained by the interaction between the robot and the object). The information obtained is continuously updated through the continuous interaction of the robot in the working environment. For example, in a scene, there are a large number of objects interacting with the robot. These objects themselves have properties, and these properties have corresponding states. In particular, after the robot interacts with the object, the state of the object changes. Robots need to perceive these properties and states to obtain information about these objects. For example, the attributes and states of the object "cow" on the robot face are displayed, and the object is denoted as obj2, with name=cow).

将来自步骤A10的机器人与对象交互得到的对象信息存储在机器人的长期记忆中。其具体的实施方法为：机器人将这些收集到的对象属性状态信息存储在长期记忆中。在长期记忆中，我们为不同的对象的属性以及状态保留了相应的布尔型结点。The object information obtained from the interaction between the robot and the object from step A10 is stored in the long-term memory of the robot. The specific implementation method is as follows: the robot stores the collected object attribute state information in the long-term memory. In long-term memory, we keep corresponding boolean nodes for the properties and states of different objects.

结构化过程将这些信号进行结构化处理并保存；具体实施方案为：在机器人内部为这些信息布尔型结点保留结构点，以保存信息。并且，这些信息将按照时间顺序进行组织并保存下来。也就是说，令一个时刻t的输入为f_t，那么结构化的经验信息的表达式为：In the structuring process, these signals are processed and stored in a structured manner; the specific implementation is as follows: structure points are reserved for these information Boolean nodes inside the robot, so as to save the information. And, the information will be organized and saved in chronological order. That is to say, let the input of a time t be f _t , then the expression of structured empirical information is:

E_0-t＝{f₀,f₁,f₂,…,f_t}E _0-t ={f ₀ ,f ₁ ,f ₂ ,...,f _t }

其中，E_0-t为从0时刻到t时刻的所有经验；f_t为第t时刻输入的所有对象信息(属性，状态)。Among them, E _0-t is all experience from time 0 to time t; f _t is all the object information (attribute, state) input at time t.

同时，每一个时刻的经验中，包含了所有对象的激活信息(通过布尔值表示)，其形式表达式子为：At the same time, the experience at each moment contains the activation information of all objects (represented by Boolean values), and its formal expression is:

f_t＝{obj₁:a₁；…obj_i:a_i；…obj_n:a_n}f _t ={obj ₁ :a ₁ ;...obj _i :a _i ;...obj _n :a _n }

其中，obj_i为第i个对象，a_i为对象obj_i的激活状态，a_i＝1时表示第i个对象被激活，a_i＝0时表示第i个对象未被激活，i∈[1,n]。Among them, obj _i is the ith object, a _i is the activation state of the object obj _i , when a _i = 1, it means that the ith object is activated, when a _i = 0, it means that the ith object is not activated, i∈[ 1,n].

例如，第5时刻下f₅＝{obj₁:0；obj₂:1；obj₃:1；obj₄:1}，则表示该时刻所有对象中obj₂、obj₃、obj₄处于激活状态，obj₁处于非激活状态。For example, at the fifth moment f ₅ ={obj ₁ :0;obj ₂ :1;obj ₃ :1;obj ₄ :1}, it means that obj ₂ , obj ₃ , and obj ₄ in all objects at this moment are in the active state, obj ₁ is inactive.

例如，上述的f₅，其对应的该场景中注册序列为T₅＝{obj₂；obj₃；obj₄；}。For example, for the above f ₅ , the corresponding registration sequence in the scene is T ₅ ={obj ₂ ; obj ₃ ; obj ₄ ;}.

2、语言激活模型的构建。2. The construction of language activation model.

(1)构造机器人经验中的注册事件。(1) Construct the registration event in the robot experience.

根据机器人的经验，这些对象表现为一个布尔值的状态，如果机器人在工作环境中接受到了相应的信号，那么相应的对象就会被激活。当属于一个事件的对象都被激活的情况下，这个事件就是一个被激活的状态。其中，注册事件表达式为：According to the experience of the robot, these objects represent a state of a boolean value, and if the robot receives the corresponding signal in the working environment, the corresponding object will be activated. An event is an activated state when all objects belonging to it are activated. Among them, the registration event expression is:

e_i＝{obj_i}，i∈R_i e _i ={obj _i }, i∈R _i

例如，基于上述的f₅得到的注册事件为e₅＝{obj₂；obj₃；obj₄}，此时R_i＝[2,3,4]。当e₅被激活时，可以表示为e₅:1＝{obj₂:1；obj₃:1；obj₄:1；}。For example, the registration event obtained based on the above f ₅ is e ₅ ={obj ₂ ; obj ₃ ; obj ₄ }, at this time R _i =[2,3,4]. When _e5 is activated, it can be expressed as e5:1={obj2: ₁ ;obj3: ₁ ; _obj4 : ₁ ;}.

(2)基于所构造的注册事件，构造经验-语言激活训练语料。(2) Based on the constructed registration events, construct an experience-language activation training corpus.

该步骤可以看做一个符号翻译过程，因此，训练语料形式为平行语料。具体将其操作为：另经验内部对象固化为一个符号序列，例如，obj₁₅＝Obj_15。当文本提及的内容与这个符号有关时，这个符号以及相关的概念则会被激活。例如，在我们实际的实验中，obj₁₅(即Obj_15)表示的就是中文中的“牛”，以及英语中的“cow”。该步骤构建一个这样的语言符号对应经验内部符号的平行语料，如下所示：This step can be regarded as a symbol translation process, therefore, the training corpus is in the form of parallel corpus. Specifically, the operation is as follows: another internal object of experience is solidified into a sequence of symbols, for example, obj ₁₅ =Obj_15. When the text refers to the symbol, the symbol and related concepts are activated. For example, in our actual experiment, obj ₁₅ (ie Obj_15) means "cow" in Chinese and "cow" in English. This step constructs a parallel corpus of such linguistic symbols corresponding to empirical internal symbols, as follows:

E_j＝{obj_i}：{LS_i}，i∈R_i E _j = {obj _i }: {LS _i }, i∈R _i

其中，E_j为第j个经验-语言激活训练语料；LS_i为对象ob_j对应的语言字符。Among them, E _j is the jth experience-language activation training corpus; LS _i is the language character corresponding to the object obj _j .

例如，若对象obj₂、obj₅、obj₁₅分别对应的语言字符为Adam、attack、cow，则该训练语聊可以表示为For example, if the language characters corresponding to the objects obj ₂ , obj ₅ , and obj ₁₅ are Adam, attack, and cow, respectively, the training language chat can be expressed as

obj₂,obj₅,obj₁₅∶Adam attack cowobj ₂ , obj ₅ , obj ₁₅ : Adam attack cow

(3)通过经验-语言激活训练语料对机器翻译模型进行训练得到语言激活模型。(3) The language activation model is obtained by training the machine translation model through the experience-language activation training corpus.

根据经验-语言激活训练语料，训练基于机器翻译模型。本实施例中采用了序列到序列的神经网络机器翻译模型，当然，还可以采用其他机器翻译模型。According to the experience-language activation training corpus, the training is based on the machine translation model. In this embodiment, a sequence-to-sequence neural network machine translation model is used, of course, other machine translation models may also be used.

3、篇章因果关系判断3. Discourse causality judgment

步骤S10，基于语言激活模型，对输入的目标文本对中每条目标文本分别获取匹配度最高的注册事件；所述目标文本对为输入两条目标文本。Step S10, based on the language activation model, obtain the registration event with the highest matching degree for each target text in the input target text pair; the target text pair is two input target texts.

上面已经详细描述了语言激活模型的构建过程，此处可以直接通过改模型获取对应的注册事件，下面可以对该模型处理文本的关键过程进行展示。The construction process of the language activation model has been described in detail above. Here, the corresponding registration event can be obtained directly by changing the model. The key process of text processing by the model can be shown below.

(1)通过语言激活模型将输入的文本翻译成注册事件的符号序列，即机器人内部的注册事件中所包含的对象符号。其公式如下所示：(1) Translate the input text into the symbol sequence of the registration event through the language activation model, that is, the object symbol contained in the registration event inside the robot. Its formula is as follows:

y₁,y₂,y₃,…,y_j＝P(y_i|x₁,x₂,x₃,…,x_n)y ₁ , y ₂ , y ₃ ,...,y _j =P(y _i |x ₁ ,x ₂ ,x ₃ ,...,x _n )

其中，y₁,y₂,y₃,…,y_j为对象符号序列；x₁,x₂,x₃,…,x_n为语言符号序列；P(y_i|x₁,x₂,x₃,…,x_n)为根据语言符号序列对对象符号进行解码(即翻译)。Among them, y ₁ , y ₂ , y ₃ ,…,y _j is the object symbol sequence; x ₁ ,x ₂ ,x ₃ ,…,x _n is the language symbol sequence; P(y _i |x ₁ ,x ₂ ,x ₃ ,...,x _n ) is to decode (ie translate) the object symbols according to the language symbol sequence.

在一些实施例中，可以为语言端和经验符号端分别构造了两个词向量集合Emb_l和Emb_e。In some embodiments, two word vector sets _Embl and _Embe may be constructed for the linguistic side and the empirical symbol side, respectively.

(2)语言激活模型根据输入的目标文本激活机器人相应的经验信息。根据翻译得到的对象符号序列，对所有机器人内部所保存的注册事件进行匹配，挑选匹配度最高的事件作为被激活的事件。其中，可以采用一个长短时记忆模型LSTM来作为符号序列向量生成的工具来生成对象符号的序列向量，如下式所示(2) The language activation model activates the corresponding experience information of the robot according to the input target text. According to the translated object symbol sequence, all registered events stored in the robot are matched, and the event with the highest matching degree is selected as the activated event. Among them, a long and short-term memory model LSTM can be used as a tool for generating symbol sequence vectors to generate sequence vectors of object symbols, as shown in the following formula

LSTM(Emb_e[obj₁],Emb_e[obj₂],…,Emb_e[ob_j])＝v_e LSTM(Emb _e [obj ₁ ],Emb _e [obj ₂ ],...,Emb _e [obj 1 ]) ₌ _ve

然后，根据向量的余弦相似度来获取最高匹配度的事件。Then, based on the cosine similarity of the vectors, the event with the highest matching degree is obtained.

其中，v_e为目标文本翻译后的对象符号序列通过LSTM网络和Emb_e得到的向量；v_i为语言激活模型中第i个注册事件通过LSTM网络和Emb_e得到的向量；sim(v_i,v_e)为这两个向量之间的余弦相似度。Among them, v _e is the vector obtained by the LSTM network and Emb _e of the object symbol sequence after translation of the target text; v _i is the vector obtained by the _ith registration event in the language activation model through the LSTM network and Emb _e ; sim(vi , v _e ) is the cosine similarity between these two vectors.

选择余弦相似度最高的注册事件作为激活事件，即输入目标文本对应的注册事件。The registration event with the highest cosine similarity is selected as the activation event, that is, the registration event corresponding to the input target text.

步骤S20，基于每条目标文本对应的注册事件，依据所存储的各场景中注册事件序列，计算两个注册事件的相关性；所述场景中注册事件序列为基于场景结构化的经验信息构建的具有布尔时序特征的注册事件序列。Step S20, based on the registration event corresponding to each target text, according to the stored registration event sequence in each scene, calculate the correlation of the two registered events; the registered event sequence in the scene is constructed based on the empirical information of the scene structure. A sequence of registered events with Boolean timing characteristics.

去除所述结构化的经验信息中处于未激活状态的对象，得到对应场景中注册事件序列Remove the objects in the inactive state in the structured experience information, and obtain the registered event sequence in the corresponding scene

在进行因果关系判断的时候，每一次输入的句子为两个，因此，我们需要一次性寻找两个注册事件。同时根据得到的事件e₁、e₂，需要在机器人所存储的各场景中注册事件序列中得到这两个事件的相应的布尔时序特征，从而计算两个注册事件的相关性。When judging causality, there are two sentences entered each time. Therefore, we need to find two registration events at one time. At the same time, according to the obtained events e ₁ , e ₂ , it is necessary to obtain the corresponding Boolean timing characteristics of the two events in the sequence of registered events in each scene stored by the robot, so as to calculate the correlation of the two registered events.

两个注册事件的相关性的计算方法为：The correlation of two registered events is calculated as:

其中，fea_t为所计算的相关性；e₁、e₂分别为目标文本对中第一目标文本、第二目标文本匹配到的注册事件；P(e₁)为所存储全部场景结构化的经验信息中e₁出现的概率；P(e₂|e₁)为出现e₁的结构化的经验信息中e₂的出现概率。Among them, fea _t is the calculated correlation; e ₁ , e ₂ are the registration events matched by the first target text and the second target text in the target text pair, respectively; P(e ₁ ) is the structure of all the stored scenes The probability of e ₁ appearing in empirical information; P(e ₂ |e ₁ ) is the appearance probability of e ₂ in the structured empirical information of e ₁ .

步骤S30，基于所述目标文本对、步骤S20得到的两个注册事件的相关性，计算所述目标文本对的因果关系。Step S30: Calculate the causal relationship of the target text pair based on the target text pair and the correlation between the two registered events obtained in step S20.

计算所述目标文本对的因果关系的方法为：The method for calculating the causal relationship of the target text pair is:

其中，f_r为因果关系概率值，包括因果关系概率Cause、非因果关系概率Non-Cause；s₁、s₂是目标文本对中两个目标文本通过文本编码模型获得的句子向量；W_c为预设的文本向量的参数矩阵；W_t为预设的布尔时序特征的参数矩阵；b为预设的偏置量。Among them, f _r is the causal relationship probability value, including the causal relationship probability Cause and the non-causal relationship probability Non-Cause; s ₁ and s ₂ are the sentence vectors obtained by the two target texts in the target text pair through the text encoding model; W _c is The parameter matrix of the preset text vector; W _t is the parameter matrix of the preset Boolean time series feature; b is the preset offset.

本实施例中采用softmax的分类模型，softmax函数会输出二类分类中两种分类的概率值，取最大的作为输出。比如：softmax会输出一个概率值Cause:0.7、Non-Cause:0.3，这时我们选Cause作为输出。In this embodiment, the classification model of softmax is used, and the softmax function will output the probability values of the two classifications in the two-class classification, and take the largest one as the output. For example: softmax will output a probability value Cause: 0.7, Non-Cause: 0.3, then we choose Cause as the output.

文本编码模型可以采用LSTM(长短时记忆模型)、BOW(基于词向量的平均值词袋模型)、CNN(卷积神经网络模型)，还可以采用其他模型。The text encoding model can use LSTM (Long Short Term Memory Model), BOW (Bag of Words Model Based on Word Vectors), CNN (Convolutional Neural Network Model), and other models.

如图2所示，该示例输入的目标文本对为S1：Adam attacks the cow with sword,S2:and Adam gains some beef。在Part1中通过语言激活模型获取所述的目标文本对的注册事件e1:Adam attacks cow sword，e2：Adam gains some beef。在Part2中，基于各场景中机器人接受到的感知信号序列(S加数字表述的是第i个场景，每一个场景由很多连续的时刻组成，例如T9-T10-T11等，每一个时刻中包含了大量的事件)，寻找到e1在场景S87、S72中出现、e2在场景S54、S87、S72中出现，并在图中通过箭头定位到出现的那个时刻(即事件定位到机器人记忆中的感知信号序列中)，e1定位到场景S87的时刻T0、S72的时刻T1,e2定位到场景S54的时刻T11、S87的时刻T2、S72的时刻T3，并计算两个注册事件的相关性；在Part3分别对目标文本S1、S2进行文本编码得到句子向量；在Part4基于Part2得到的相关性、Part3得到的两个句子向量进行因果判断，并输出判断结果。As shown in Figure 2, the target text pair input in this example is S1: Adam attacks the cow with sword, S2: and Adam gains some beef. In Part1, the registration events e1: Adam attacks cow sword, e2: Adam gains some beef are obtained through the language activation model. In Part2, based on the perception signal sequence received by the robot in each scene (S plus a number represents the ith scene, each scene consists of many consecutive moments, such as T9-T10-T11, etc., each moment contains A large number of events), find that e1 appears in scenes S87, S72, and e2 appears in scenes S54, S87, S72, and locate the moment of appearance by arrows in the figure (that is, the event locates the perception in the robot's memory In the signal sequence), e1 is located at time T0 of scene S87, time T1 of S72, e2 is located to time T11 of scene S54, time T2 of S87, time T3 of S72, and calculate the correlation of the two registered events; in Part3 Perform text encoding on the target texts S1 and S2 respectively to obtain sentence vectors; in Part4, perform causal judgment based on the correlation obtained in Part2 and the two sentence vectors obtained in Part3, and output the judgment result.

本发明代码实现可以采用python编程语言以及TensorFlow深度学习框架实现完成，开发平台是Ubuntu Linux 16.04。同时，本发明也可以依赖微软发布的通用人工智能平台Malmo作为机器人的工作环境。由于所写程序没有用到任何平台相关的代码，因此所述的系统实现也可以运行于Windows操作系统上。The code realization of the present invention can be realized by using the python programming language and the TensorFlow deep learning framework, and the development platform is Ubuntu Linux 16.04. At the same time, the present invention can also rely on the general artificial intelligence platform Malmo released by Microsoft as the working environment of the robot. Since the written program does not use any platform-dependent code, the described system implementation can also run on the Windows operating system.

本发明一种实施例的篇章因果关系判断系统，该系统包括注册事件匹配模块、相关性计算模块、因果关系计算模块；A text causality judgment system according to an embodiment of the present invention includes a registration event matching module, a correlation calculation module, and a causal relationship calculation module;

所属技术领域的技术人员可以清楚地了解到，为描述的方便和简洁，上述描述的系统的具体工作过程及有关说明，可以参考前述方法实施例中的对应过程，在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, for the specific working process and related description of the system described above, reference may be made to the corresponding process in the foregoing method embodiments, which will not be repeated here.

需要说明的是，上述实施例提供的篇章因果关系判断系统，仅以上述各功能模块的划分进行举例说明，在实际应用中，可以根据需要而将上述功能分配由不同的功能模块来完成，即将本发明实施例中的模块或者步骤再分解或者组合，例如，上述实施例的模块可以合并为一个模块，也可以进一步拆分成多个子模块，以完成以上描述的全部或者部分功能。对于本发明实施例中涉及的模块、步骤的名称，仅仅是为了区分各个模块或者步骤，不视为对本发明的不当限定。It should be noted that, the system for judging the causality of the text provided by the above-mentioned embodiment is only illustrated by the division of the above-mentioned functional modules. The modules or steps in the embodiments of the present invention are further decomposed or combined. For example, the modules in the above embodiments may be combined into one module, or may be further split into multiple sub-modules to complete all or part of the functions described above. The names of the modules and steps involved in the embodiments of the present invention are only for distinguishing each module or step, and should not be regarded as an improper limitation of the present invention.

本发明一种实施例的存储装置，其中存储有多条程序，所述程序适于由处理器加载并执行以实现上述篇章因果关系判断方法。In the storage device according to an embodiment of the present invention, a plurality of programs are stored therein, and the programs are adapted to be loaded and executed by a processor to implement the above-mentioned method for judging the causal relationship of a chapter.

本发明一种实施例的处理装置，包括处理器、存储装置；处理器，适于执行各条程序；存储装置，适于存储多条程序；所述程序适于由处理器加载并执行以实现上述篇章因果关系判断方法。A processing device according to an embodiment of the present invention includes a processor and a storage device; the processor is adapted to execute various programs; the storage device is adapted to store multiple programs; the programs are adapted to be loaded and executed by the processor to realize The above chapter causality judgment method.

所属技术领域的技术人员可以清楚地了解到，为描述的方便和简洁，上述描述的存储装置、处理装置的具体工作过程及有关说明，可以参考前述方法实施例中的对应过程，在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working process and relevant description of the storage device and processing device described above can refer to the corresponding process in the foregoing method embodiments, which is not repeated here. Repeat.

实验验证：Experimental verification:

在实验中所构造的训练语料中，每一个例子都是由两个句子组成，每一个句子描述的都是一个单独的事件。通过在这个语料上进行训练，将本发明方法与传统的基于纯文本的方法进行对比。表格1为本实验所构造的篇章因果关系的语料基本统计情况。In the training corpus constructed in the experiments, each example consists of two sentences, each describing a separate event. By training on this corpus, the method of the present invention is compared with traditional plain text-based methods. Table 1 shows the basic statistics of the corpus of textual causality constructed in this experiment.

附表1Schedule 1

因果关系causation 非因果关系non-causal relationship 总数total 认同度Acceptance 数量quantity 16191619 32333233 48524852 98.2％98.2%

附表2给出了本发明与几种现有篇章因果关系判别方法的结果。其中，在表格的上半部分表达的是目前主流的三种处理句子信息的神经网络方法(BOW，CNN，LSTM)。实验将两个句子作为输入，考察目标模型是否有能力判别这个句对是否是因果关系。可以看到本发明(Our model)在数据集上的表现要明显好于传统文本方法，其中表2中Model+BOW、Model+CNN、Model+LSTM分别表示本发明方法的步骤S30中采用BOW、CNN、LSTM文本编码模型。Attached table 2 presents the results of the present invention and several existing methods for discriminating the causal relationship of the chapters. Among them, the upper part of the table expresses the three current mainstream neural network methods (BOW, CNN, LSTM) for processing sentence information. The experiment takes two sentences as input and examines whether the target model has the ability to discriminate whether the sentence pair is a causal relationship. It can be seen that the performance of the present invention (Our model) on the data set is significantly better than that of the traditional text method, wherein Model+BOW, Model+CNN, Model+LSTM in Table 2 respectively indicate that BOW, CNN, LSTM text encoding model.

与传统文本方法具有的明显区别是，本发明使用了从机器人平台中收集到的经验信息。这个过程与人类非常类似，因此本发明的关键在于提出了一种区别于传统思路的新的方法，它强调了与文本相关的经验的重要性。A distinct difference from traditional textual methods is that the present invention uses empirical information gathered from a robotic platform. This process is very similar to that of humans, so the key to the present invention is to propose a new method that is different from traditional thinking, which emphasizes the importance of text-related experience.

附表2Schedule 2

本领域技术人员应该能够意识到，结合本文中所公开的实施例描述的各示例的模块、方法步骤，能够以电子硬件、计算机软件或者二者的结合来实现，软件模块、方法步骤对应的程序可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。为了清楚地说明电子硬件和软件的可互换性，在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以电子硬件还是软件方式来执行，取决于技术方案的特定应用和设计约束条件。本领域技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能，但是这种实现不应认为超出本发明的范围。Those skilled in the art should be aware that the modules and method steps of each example described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software or a combination of the two, and the programs corresponding to the software modules and method steps Can be placed in random access memory (RAM), internal memory, read only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or as known in the art in any other form of storage medium. In order to clearly illustrate the interchangeability of electronic hardware and software, the components and steps of each example have been described generally in terms of functionality in the foregoing description. Whether these functions are performed in electronic hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may use different methods of implementing the described functionality for each particular application, but such implementations should not be considered beyond the scope of the present invention.

术语“第一”、“第二”等是用于区别类似的对象，而不是用于描述或表示特定的顺序或先后次序。The terms "first," "second," etc. are used to distinguish between similar objects, and are not used to describe or indicate a particular order or sequence.

术语“包括”或者任何其它类似用语旨在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者设备/装置不仅包括那些要素，而且还包括没有明确列出的其它要素，或者还包括这些过程、方法、物品或者设备/装置所固有的要素。The term "comprising" or any other similar term is intended to encompass a non-exclusive inclusion such that a process, method, article or device/means comprising a list of elements includes not only those elements but also other elements not expressly listed, or Also included are elements inherent to these processes, methods, articles or devices/devices.

至此，已经结合附图所示的优选实施方式描述了本发明的技术方案，但是，本领域技术人员容易理解的是，本发明的保护范围显然不局限于这些具体实施方式。在不偏离本发明的原理的前提下，本领域技术人员可以对相关技术特征作出等同的更改或替换，这些更改或替换之后的技术方案都将落入本发明的保护范围之内。So far, the technical solutions of the present invention have been described with reference to the preferred embodiments shown in the accompanying drawings, however, those skilled in the art can easily understand that the protection scope of the present invention is obviously not limited to these specific embodiments. Without departing from the principle of the present invention, those skilled in the art can make equivalent changes or substitutions to the relevant technical features, and the technical solutions after these changes or substitutions will fall within the protection scope of the present invention.

Claims

1. a method for judging causal relationship of a chapter, is characterized in that, the method comprises:

Step S10, based on the language activation model, obtain the registration event with the highest matching degree for each target text in the input target text pair; the target text pair is two input target texts;

Step S20, based on the registration event corresponding to each target text, according to the stored registration event sequence in each scene, calculate the correlation of the two registered events; the registered event sequence in the scene is constructed based on the empirical information of the scene structure. A sequence of registered events with Boolean timing characteristics;

Step S30, based on the target text pair and the correlation of the two registration events obtained in step S20, calculate the causal relationship of the target text pair;

Wherein, the language activation model is obtained by training the machine translation model through experience-language activation training corpus; the experience-language activation training corpus is constructed based on registration events in robot experience.

2. The method for judging a causal relationship of a text according to claim 1, wherein in step S30, "calculate the causal relationship of the target text pair", and its method is:

f _r =softmax(tanh(W _c *[s ₁ ; s ₂ ]+W _t *fea _t +b))

Among them, f _r is the causal relationship probability value, including causal relationship probability and non-causal relationship probability; s ₁ and s ₂ are the sentence vectors obtained by the text encoding model of the two target texts in the target text pair; W _c is the preset text The parameter matrix of the vector; W _t is the parameter matrix of the preset Boolean timing characteristics; b is the preset offset.

3. The method for judging the causal relationship of a text according to claim 1, wherein the step S20 "calculates the correlation of two registered events", and its method is:

Among them, fea _t is the calculated correlation; e ₁ , e ₂ are the registration events matched by the first target text and the second target text in the target text pair, respectively; P(e ₁ ) is the structure of all the stored scenes The probability of e ₁ appearing in empirical information; P(e ₂ |e ₁ ) is the appearance probability of e ₂ in the structured empirical information of e ₁ .

4. The method for judging a causal relationship of a text according to claim 1, wherein the registration event,

e _i ={obj _i }, i∈R _i

Among them, e _i is the ith registration event; obj _i is the object in the active state in the registration event; R _i is the number of all objects in the language activation model of the object symbol in the registration event.

5. The method for judging a causal relationship of a text according to claim 4, wherein the experience-language activation training corpus is expressed as

E _j = {obj _i }: {LS _i }, i∈R _i

Among them, E _j is the jth experience-language activation training corpus; LS _i is the language character corresponding to the object obj _i .

6. The method for judging a causal relationship of a text according to claim 1, wherein the sequence of events registered in each scenario described in step S20, the acquisition method thereof comprises:

Step A10, in the working environment of the robot, the object information obtained by the interaction between the robot and the object obtained by the robot's own sensing device; the object information includes the object attribute and object state information obtained by the interaction between the robot and the object;

In step A20, the object information obtained in step A10 is subjected to structured processing, and is organized and stored in chronological order, as structured experience information obtained by the operating working environment of the robot;

Step A30, remove the objects in the inactive state in the structured experience information, and obtain the registration event sequence in the corresponding scene.

7. The method for judging a causal relationship of a discourse according to claim 6, wherein the structured empirical information is expressed as

E _0-t ={f ₀ , f ₁ , f ₂ , . . . , f _t }

f _t = {obj ₁ : a ₁ ; ...obj _i : a _i ; ... obj _n : a _n ;}

Among them, E _0-t is all experience from time 0 to time t; f _t is all object information input at time t; obj _i is the ith object, a _i is the activation state of object obj _i , a _i = When it is 1, it means that the ith object is activated, and when a _i =0, it means that the ith object is not activated, i∈[1,n].

8. A textual causality judgment system, characterized in that the system comprises a registration event matching module, a correlation calculation module, and a causal relationship calculation module;

The registration event matching module is configured to obtain the registration event with the highest matching degree for each target text in the input target text pair based on the language activation model; the target text pair is two input target texts;

The correlation calculation module is configured to calculate the correlation of two registered events based on the registered event corresponding to each target text and according to the stored registered event sequence in each scene; the registered event sequence in the scene is based on the scene structure A registration event sequence with Boolean timing characteristics constructed from the transformed empirical information;

The causal relationship calculation module is configured to calculate the causal relationship of the target text pair based on the target text pair and the correlation between the two registered events obtained by the correlation calculation module;

9 . A storage device, wherein a plurality of programs are stored, wherein the programs are adapted to be loaded and executed by a processor to implement the method for judging the causality of a chapter according to claims 1-7 .

10. A processing device, comprising a processor and a storage device; the processor is adapted to execute various programs; the storage device is adapted to store a plurality of programs; characterized in that the programs are adapted to be loaded and executed by the processor to The method for judging the causal relationship of the chapters described in claims 1-7 is realized.