[go: up one dir, main page]

CN105930494A - Multimode matching model based complex event detection method - Google Patents

Multimode matching model based complex event detection method Download PDF

Info

Publication number
CN105930494A
CN105930494A CN201610296398.7A CN201610296398A CN105930494A CN 105930494 A CN105930494 A CN 105930494A CN 201610296398 A CN201610296398 A CN 201610296398A CN 105930494 A CN105930494 A CN 105930494A
Authority
CN
China
Prior art keywords
mode
detection
function
matching model
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610296398.7A
Other languages
Chinese (zh)
Inventor
王建华
王涛
程良伦
彭孝东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China Agricultural University
Guangdong University of Technology
Original Assignee
South China Agricultural University
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China Agricultural University, Guangdong University of Technology filed Critical South China Agricultural University
Priority to CN201610296398.7A priority Critical patent/CN105930494A/en
Publication of CN105930494A publication Critical patent/CN105930494A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24568Data stream processing; Continuous queries

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a multimode matching model based complex event detection method. A plurality of complex event detection modes are fused to construct a finite state automata, so that the storage and search of numerous redundant automata states and transfer boundaries are greatly reduced, the repeated data operation matching and computing operations are avoided, the detection and matching of the complex event detection modes can be finished by scanning data streams once, and the efficiency of detecting complex events in the massive data streams is improved. According to the method, the shared detection of the complex events in the massive data streams is realized, a current automata-based complex event mode detection method is improved, and an existing complex event detection technology is extended for finishing detection of the complex events in the massive data streams more efficiently.

Description

一种基于多模式匹配模型的复杂事件检测方法 A Complex Event Detection Method Based on Multi-pattern Matching Model

技术领域 technical field

本发明涉及海量数据流处理技术领域,更具体地,涉及海量数据流处理中一种基于多模式匹配模型的复杂事件检测方法。 The present invention relates to the technical field of massive data stream processing, and more specifically, to a complex event detection method based on a multi-pattern matching model in massive data stream processing.

背景技术 Background technique

物联网技术将网络、嵌入式、RFID、传感器及执行器等电子信息技术与传统生产技术相融合,实现涵盖产品设计、生产与服务等环节的全流程的数据感知、传输、计算、控制和服务等,提高产品技术附加值,增强生产与服务过程的管控能力,催生新的现代生产模式的重要手段。 The Internet of Things technology integrates electronic information technologies such as network, embedded, RFID, sensors and actuators with traditional production technologies to realize data perception, transmission, calculation, control and services covering the whole process of product design, production and service. It is an important means to increase the added value of product technology, enhance the management and control capabilities of production and service processes, and give birth to a new modern production model.

在生产环境中,随着生产规模的日益大型化,生产流程的日渐复杂化,生产过程时空分布性与生产环境多源干忧性使得许多大量诸如RFID标签,传感器节点等感知设备被部署到生产现场去监测现场情况而产生各种海量生产数据流。由于这些海量生产数据流存在:1)数据量十分巨大,呈海量性,每秒可以达到TB级甚至PB级规模;2) 数据内容多源,存在人,物料,设备,生产工艺过程,产品,服务等多种数据内容;3) 数据结构复杂,结构化数据,半结构化数据和非结构化数据共存;4) 数据更新速度快,数据每分每秒钟都在产生和更新;5) 数据响应要求高,数据需要实时快速处理响应等数据特征,导致了物联网中海量数据流处理面临着:海量多源动态数据流难以及时处理的重要问题。由于现有数据处理方法难于完全支撑物联网海量数据流的实时高效处理,难于从上述海量数据流中快速找出所需信息并及时做出反应,因而影响生产企业对生产进程调度和决策作用。由于复杂事件检测技术其能够利用事件属性之间的关联,通过匹配规则或代数操作不断过滤连续到达的海量生产数据流,快速找出企业所需要的符合某种关联约束的事件序列,因而近年在各类生产行业中来得到日益广泛的关注。 In the production environment, with the increasing scale of production and the increasing complexity of the production process, the spatiotemporal distribution of the production process and the multi-source interference of the production environment have led to the deployment of a large number of sensing devices such as RFID tags and sensor nodes in the production environment. On-site monitoring of on-site conditions generates a variety of massive production data streams. Due to the existence of these massive production data streams: 1) The amount of data is huge and massive, reaching TB or even PB levels per second; 2) The data content has multiple sources, including people, materials, equipment, production processes, products, 3) The data structure is complex, structured data, semi-structured data and unstructured data coexist; 4) The data update speed is fast, and the data is generated and updated every minute and every second; 5) The data Response requirements are high, and data needs to be processed in real time and quickly to respond to other data characteristics, resulting in the processing of massive data streams in the Internet of Things: the important problem that massive multi-source dynamic data streams are difficult to process in a timely manner. Since the existing data processing methods are difficult to fully support the real-time and efficient processing of the massive data streams of the Internet of Things, it is difficult to quickly find the required information from the massive data streams and respond in a timely manner, thus affecting the scheduling and decision-making of production enterprises. Because the complex event detection technology can use the association between event attributes, continuously filter the massive production data stream arriving continuously through matching rules or algebraic operations, and quickly find out the event sequence that the enterprise needs that meets a certain association constraint, it has become popular in recent years. It has received increasing attention in various production industries.

当前,关于数据流中复杂事件检测方法的研究,主要开展有基于自动机,基于Petri网,基于匹配树和基于有向图等方面的复杂事件检测方法以及它们的一些改进方法,如基于时间Petri网复杂事件检测方法、基于压缩组合物的树复杂事件检测方法、基于改进图的复杂事件检测方法、基于下推自动机结构的复杂事件检测方法、基于匹配树模型匹配结果共享的复杂事件检测方法等方法。但由于现有的上述检测方法只能孤立的对数据流中单个复杂事件进行检测与查询处理,而无法实现对数据流中的多个复杂事件查询进行共享检测。而在现实生活中,通常面临着在一个数据流上查询和检测多个复杂事件问题。如果直接利用上述检测方法去实现数据流中多个复杂事件检测时,将会出现存在大量冗余的自动机状态和转移边,大量数据重复存储、查找和计算操作,从而出现检测时间长,消耗内存大,检测效率低的问题,难于实现海量数据流中事件多模式的实时检测功能。 At present, the research on complex event detection methods in data streams mainly includes automata-based, Petri net-based, matching tree-based and directed graph-based complex event detection methods and some of their improved methods, such as time-based Petri Network complex event detection method, tree complex event detection method based on compressed composition, complex event detection method based on improved graph, complex event detection method based on pushdown automata structure, complex event detection method based on matching tree model matching result sharing and other methods. However, because the above-mentioned existing detection methods can only detect and query a single complex event in a data stream in isolation, they cannot implement shared detection of multiple complex event queries in a data stream. In real life, we usually face the problem of querying and detecting multiple complex events on a data stream. If the above detection method is directly used to detect multiple complex events in the data stream, there will be a large number of redundant automaton states and transition edges, and a large amount of data will be repeatedly stored, searched and calculated, resulting in long detection time and consumption. The problem of large memory and low detection efficiency makes it difficult to realize the real-time detection function of multi-mode events in massive data streams.

发明内容 Contents of the invention

本发明针对当前复杂事件检测方法在实现数据流上多个复杂事件检测时出现检测时间长,消耗内存大,检测效率低的问题,本发明面向海量数据流,提出了一种基于多模式匹配模型的复杂事件检测方法,本发明方法改进了常规的自动机(NFA)序列扫描和序列过程,扩展了现有复杂事件检测技术,大大提高了海量数据流中复杂事件检测能力。 The present invention aims at the problems of long detection time, large memory consumption and low detection efficiency when the current complex event detection method realizes the detection of multiple complex events on the data stream. The present invention faces massive data streams and proposes a multi-pattern matching model The complex event detection method, the method of the invention improves the conventional automatic machine (NFA) sequence scanning and sequence process, expands the existing complex event detection technology, and greatly improves the complex event detection ability in massive data streams.

为解决上述技术问题,本发明的技术方案如下: In order to solve the problems of the technologies described above, the technical solution of the present invention is as follows:

一种基于多模式匹配模型的复杂事件检测方法,包括以下步骤: A complex event detection method based on a multi-pattern matching model, comprising the following steps:

S1:构建多模式有限状态自动机匹配模型,具体通过如下小步骤实现: S1: Construct a multi-mode finite state automaton matching model, which is realized through the following small steps:

S1.1:读取各个单模式事件匹配表达式; S1.1: Read each single-mode event matching expression;

S1.2:根据读取的各个单模式事件匹配表达式构建各自对应的单模式有限状态自动机匹配模型; S1.2: Construct the corresponding single-mode finite state automata matching models according to the read single-mode event matching expressions;

S1.3:根据构建的单模式有限状态自动机匹配模型构建状态转移函数,状态转移函数的功能是实现有限自动机的状态转移路径; S1.3: Construct a state transition function according to the constructed single-mode finite state automaton matching model, and the function of the state transition function is to realize the state transition path of the finite automaton;

S1.4:根据构建的转移函数执行情况构建失败转移函数,失败转移函数(failure)的功能是实现用于当发生失配时,指向下一个状态以继续比较; S1.4: Build a failure transition function based on the execution of the constructed transition function. The function of the failure transition function is to point to the next state to continue the comparison when a mismatch occurs;

S1.5:根据构建的转移函数和失败函执行情况构建输出转移函数,输出转移函数(output)用于当某个模式串获得匹配时,输出或记录一条匹配成功信息,并进行下一个状态继续比较; S1.5: Build an output transfer function based on the constructed transfer function and the execution of the failure letter. The output transfer function (output) is used to output or record a match success message when a certain pattern string is matched, and continue to the next state Compare;

S1.6: 输出多模式有限状态自动机匹配模型; S1.6: Output the multi-mode finite state automaton matching model;

S2:多模式有限状态自动机匹配模型匹配,具体通过如下小步骤实现: S2: Multi-mode finite state automaton matching model matching, specifically through the following small steps:

S2.1:从数据流逐一读入检测数据; S2.1: Read the detection data one by one from the data stream;

S2.2:根据读取的检测数据确定多模式状态转移方向; S2.2: Determine the multi-mode state transfer direction according to the read detection data;

S2.3:根据状态转移路径确定是否执行多模式失败转移功能; S2.3: Determine whether to execute the multi-mode failover function according to the state transition path;

S2.4:根据多模式状态转移方向或多模式匹配失败转移功能确定是否执行多模式匹配输出函数; S2.4: Determine whether to execute the multi-mode matching output function according to the multi-mode state transition direction or the multi-mode matching failure transfer function;

S2.5:判断检测任务是否完成,如果完成,跳转到S2.6执行;否则跳转到S2.1执行; S2.5: Judging whether the detection task is completed, if it is completed, jump to S2.6 for execution; otherwise, jump to S2.1 for execution;

S2.6:输出检测结果。 S2.6: output the detection result.

进一步的,步骤S2.2中,所述检测数据为原子事件。 Further, in step S2.2, the detection data is an atomic event.

在步骤S1中,首先需要根据单模式匹配表达式构建对应单模式有限状态自动机匹配模型;然后再利用构建对应单模式有限状态自动机匹配模型依次构建三个转移函数:状态转移函数、失败转移函数和输出转移函数;最后利用上述三个转移函数相互配合作用,实现将单模式匹配模型融合成融合多模式匹配模型; In step S1, it is first necessary to construct a corresponding single-mode finite state automaton matching model according to the single-mode matching expression; then use the construction of the corresponding single-mode finite state function and output transfer function; finally, using the above three transfer functions to cooperate with each other, the single-mode matching model is fused into a fusion multi-mode matching model;

在步骤S2中,首先需要使用上述生成多模式有限状态自动机匹配模型,逐一从海量数据流中读入原子事件,运用状态转移函数、失败转移函数进行状态转移与匹配;最后使用输出转移函数输出每一次比较匹配结果,最终完成海量数据流的多模式事件匹配; In step S2, it is first necessary to use the above-mentioned multi-mode finite state automaton matching model to read in atomic events one by one from the massive data stream, and use the state transition function and failure transition function to perform state transition and matching; finally use the output transition function to output Comparing the matching results each time, and finally completing the multi-mode event matching of massive data streams;

与现有技术相比,本发明技术方案的有益效果是:本发明公开一种基于多模式匹配模型的复杂事件检测方法,将多个复杂事件检测模式融合构建成一个有限状态自动机,大大减少了许多冗余的自动机状态和转移边存储与查找,避免重复数据操作匹配和计算操作,实现扫描一次数据流即可完成多个复杂事件检测模式的检测与匹配,提高了数据流上复杂事件检测效率。本方法实现了物联网海量数据流中多个复杂事件共享检测,改进了当前基于自动机的复杂事件模式检测方法,对现有的复杂事件检测技术进行扩展,使其能够比较高效地在海量数据上完成多个复杂事件的检测。 Compared with the prior art, the beneficial effect of the technical solution of the present invention is: the present invention discloses a complex event detection method based on a multi-pattern matching model, which fuses multiple complex event detection patterns into a finite state automaton, greatly reducing Many redundant automaton states and transition side storage and lookup avoid repeated data operation matching and calculation operations, and realize the detection and matching of multiple complex event detection modes by scanning the data stream once, which improves the complexity of complex events on the data stream. detection efficiency. This method realizes the shared detection of multiple complex events in the massive data stream of the Internet of Things, improves the current complex event pattern detection method based on automaton, and expands the existing complex event detection technology, so that it can more efficiently detect massive data. detection of multiple complex events.

附图说明 Description of drawings

图1是本发明方法所提的多模式匹配模型构建过程图。 Fig. 1 is a diagram of the construction process of the multi-pattern matching model proposed by the method of the present invention.

图2是检测模式表达式为SEQ(A,B,C, D)的NFA模型图。 Figure 2 is a diagram of the NFA model whose detection mode expression is SEQ(A, B, C, D).

图3是检测模式表达式为SEQ(A,F,E)的NFA模型图。 Fig. 3 is a diagram of the NFA model whose detection mode expression is SEQ(A, F, E).

图4是检测模式表达式为SEQ(M,E,C, A)的NFA模型图。 Figure 4 is a diagram of the NFA model whose detection mode expression is SEQ(M, E, C, A).

图5是本发明方法所提的基于多模式匹配模型。 Fig. 5 is a multi-pattern matching model proposed by the method of the present invention.

图6是本发明方法所提的多模式匹配的复杂事件检测过程。 Fig. 6 is a complex event detection process of multi-pattern matching proposed by the method of the present invention.

图7是本发明与单模式检测方法在检测时间方面比较示意图。 Fig. 7 is a schematic diagram comparing detection time between the present invention and the single-mode detection method.

图8是本发明与单模式检测方法在内存消耗方面比较示意图 Fig. 8 is a schematic diagram comparing memory consumption between the present invention and the single-mode detection method

图9是本发明与单模式检测方法在吞吐量方面比较示意图。 FIG. 9 is a schematic diagram of the comparison between the present invention and the single-mode detection method in terms of throughput.

具体实施方式 detailed description

附图仅用于示例性说明,不能理解为对本专利的限制;下面结合附图和实施例对本发明的技术方案做进一步的说明。 The accompanying drawings are for illustrative purposes only, and should not be construed as limiting the patent; the technical solutions of the present invention will be further described below in conjunction with the accompanying drawings and embodiments.

实施例1 Example 1

一种基于多模式匹配模型的复杂事件检测方法,包括以下步骤: A complex event detection method based on a multi-pattern matching model, comprising the following steps:

S1:构建多模式有限状态自动机匹配模型,具体通过如下小步骤实现: S1: Construct a multi-mode finite state automaton matching model, which is realized through the following small steps:

S1.1:读取各个单模式事件匹配表达式; S1.1: Read each single-mode event matching expression;

S1.2:根据读取的各个单模式事件匹配表达式构建各自对应的单模式有限状态自动机匹配模型; S1.2: Construct the corresponding single-mode finite state automata matching models according to the read single-mode event matching expressions;

S1.3:根据构建的单模式有限状态自动机匹配模型构建转移函数,转移函数的功能是实现有限自动机的状态转移路径; S1.3: Construct a transition function according to the constructed single-mode finite state automata matching model, and the function of the transition function is to realize the state transition path of the finite automaton;

S1.4:根据构建的转移函数执行情况构建失败函数,失败函数的功能是实现用于当发生失配时,指向下一个状态以继续比较; S1.4: Construct a failure function according to the execution of the constructed transition function. The function of the failure function is to point to the next state to continue the comparison when a mismatch occurs;

S1.5:根据构建的状态转移函数和失败转移函数执行情况构建输出函数,输出函数用于当某个模式串获得匹配时,输出或记录一条匹配成功信息; S1.5: Construct an output function according to the execution status of the constructed state transition function and failure transition function. The output function is used to output or record a matching success message when a certain pattern string is matched;

S1.6: 输出多模式有限状态自动机匹配模型; S1.6: Output the multi-mode finite state automaton matching model;

S2:多模式有限状态自动机匹配模型匹配,具体通过如下小步骤实现: S2: Multi-mode finite state automaton matching model matching, specifically through the following small steps:

S2.1:从数据流逐一读入检测数据,所述检测数据为原子事件; S2.1: Read the detection data one by one from the data stream, and the detection data is an atomic event;

S2.2:根据读取的检测数据确定多模式状态转移方向; S2.2: Determine the multi-mode state transfer direction according to the read detection data;

S2.3:根据状态转移路径确定是否执行多模式失败转移功能; S2.3: Determine whether to execute the multi-mode failover function according to the state transition path;

S2.4:根据多模式状态转移方向或多模式失败转移功能确定是否执行多模式输出转移函数; S2.4: Determine whether to execute the multi-mode output transition function according to the multi-mode state transition direction or the multi-mode failure transition function;

S2.5:判断检测任务是否完成,如果完成,跳转到S2.6执行;否则跳转到S2.1执行; S2.5: Judging whether the detection task is completed, if it is completed, jump to S2.6 for execution; otherwise, jump to S2.1 for execution;

S2.6:输出检测结果。 S2.6: output the detection result.

本实施例对一种基于多模式匹配模型的复杂事件检测方法的具体检测过程进行详细的说明。在本实例中,利用数据发生器模块去模拟产生从各类生产行业海量数据流。通过控制数据发生器模块参数生成事件类型的规格,事件流的概率分布等以实现实验要求的需要。本实施例的实验工具为:Visual C++ 6. 0,测试指标为:检测时间,内存消耗和吞吐量三方面,实验比较方法为:单模式检测(Singpattern detection)方法,即用现有基于自动机结构复杂事件检测方法分别实现复杂事件检测表达式。检测表达式分别为SEQ(A,B,C, D),SEQ(A,F,E)和SEQ(M,E,C, A)。本实例以检测上述三个检测表达式为例,说明本发明所提基于多模式匹配模型的复杂事件检测方法具体应用过程。 This embodiment describes in detail the specific detection process of a complex event detection method based on a multi-pattern matching model. In this example, the data generator module is used to simulate and generate massive data streams from various production industries. By controlling the parameters of the data generator module to generate the specification of the event type, the probability distribution of the event flow, etc., the requirements of the experiment can be realized. The experimental tool of the present embodiment is: Visual C++ 6.0, and the test index is: detection time, three aspects of memory consumption and throughput, and the experimental comparison method is: single pattern detection (Singpattern detection) method, promptly uses existing automaton-based Structural complex event detection methods implement complex event detection expressions respectively. The detection expressions are SEQ(A,B,C,D), SEQ(A,F,E) and SEQ(M,E,C,A), respectively. This example takes the detection of the above three detection expressions as an example to illustrate the specific application process of the complex event detection method based on the multi-pattern matching model proposed in the present invention.

图1是本发明方法所提的多模式匹配模型构建过程图。它主要包含了:读单模型匹配表达式,构建单模式匹配模型,创建多模式状态转移函数,创建多模式失败转移函数,创建多模式输出转移函数和多模式匹配模型输出等五大部分功能。其实现的主要功能是把多个单模式事件匹配模型融合构建成一个多模式事件匹配模型,以实现多个模式检测表达式串之间共享,消除它们之间存在的许多冗余的自动机状态和转移边。图2是由检测模式表达式为SEQ(A,B,C, D)生成的NFA模型图;图3是由检测模式表达式为SEQ(A,F,E) 生成的NFA模型图;图4是由检测模式表达式为SEQ(M,E,C, A) 生成的NFA模型图。图5是本发明方法的多模式匹配模型,它是由检测模式表达式为SEQ(A,B,C, D),SEQ(A,F,E)和SEQ(M,E,C, A)共同融合构建而成。 Fig. 1 is a diagram of the construction process of the multi-pattern matching model proposed by the method of the present invention. It mainly includes five major functions: reading a single model matching expression, building a single model matching model, creating a multi-mode state transition function, creating a multi-mode failure transition function, creating a multi-mode output transfer function and multi-mode matching model output. The main function it realizes is to fuse multiple single-mode event matching models into a multi-mode event matching model, so as to realize the sharing among multiple mode detection expression strings and eliminate many redundant automaton states between them and transfer sides. Figure 2 is the NFA model diagram generated by the detection pattern expression as SEQ (A, B, C, D); Fig. 3 is the NFA model diagram generated by the detection pattern expression as SEQ (A, F, E); Fig. 4 is the NFA model graph generated by the detection pattern expression as SEQ(M, E, C, A). Fig. 5 is the multi-pattern matching model of the method of the present invention, and it is expressed as SEQ (A, B, C, D), SEQ (A, F, E) and SEQ (M, E, C, A) by detection mode expression built together.

本发明方法所提的多模式匹配的复杂事件检测过程如图6所示, 它主要包含了:从数据流逐一读入数据,多模式状态转移, 多模式失败转移, 多模式输出转移和多模式匹配结果输出部分功能。其主要功能是使用新融合而成的多模式事件匹配模型实现对数据流的复杂事件进行检测,并输出检测结果。 The multi-mode matching complex event detection process proposed by the method of the present invention is shown in Figure 6, which mainly includes: reading data from the data stream one by one, multi-mode state transition, multi-mode failure transition, multi-mode output transition and multi-mode The matching results output part of the function. Its main function is to use the newly fused multi-mode event matching model to detect complex events in the data stream and output the detection results.

图7本发明与单模式检测方法在检测时间方面比较示意图。从图7可以看到,在相同测试条件下,相比单模式检测方法,本发明方法可以极大地提高检测时间,提高事件检测效率。分析其主要原因在于本发明方法消除了许多单模式检测方法中由于无法共享而存在冗余的自动机状态和转移边, 减少了许多重复数据的存储、查找和计算操作,因而节省了许多检测时间。 Fig. 7 is a schematic diagram comparing detection time between the present invention and the single-mode detection method. It can be seen from FIG. 7 that under the same test conditions, compared with the single-mode detection method, the method of the present invention can greatly increase the detection time and improve the event detection efficiency. The main reason for the analysis is that the method of the present invention eliminates the redundant automaton states and transition edges that cannot be shared in many single-mode detection methods, reduces the storage, search and calculation operations of many repeated data, and thus saves a lot of detection time .

图8是本发明与现有单模式检测方法在内存使用消耗方面比较示意图。从图8可以看到,在相同测试条件下,本发明方法在内存使用消耗方面优于单模式检测方法。分析其主要原因在于,在相同测试条件下,本发明方法中使用基于多模式事件模型去检测海量数据流中相关事件,消除了许多单模式检测方法中存在冗余的自动机状态和转移边, 减少了许多重复数据的存储、查找和计算操作, 因而节省了许多内存使用消耗。 FIG. 8 is a schematic diagram of comparing memory usage and consumption between the present invention and the existing single-mode detection method. It can be seen from FIG. 8 that under the same test conditions, the method of the present invention is superior to the single-mode detection method in terms of memory usage and consumption. The main reason for the analysis is that under the same test conditions, the method of the present invention uses a multi-mode event model to detect related events in massive data streams, eliminating redundant automaton states and transition edges in many single-mode detection methods. It reduces the storage, lookup and calculation operations of many duplicate data, thus saving a lot of memory usage consumption.

图9是本发明与现有单模式检测方法在事件吞吐量方面比较示意图。从图9可以看到,在相同测试条件下,本发明方法在事件吞吐量方面也优于单模式检测方法。分析其主要原因在于本发明中多模式事件匹配模型使用。在本发明中,我们使用多模式事件模型去查找相关事件,实现数据流中相关事件的快速查找,计算和匹配操作,减少单模式检测方法中许多冗余的自动机状态和转移边查找和计算,减少了许多数据的重复匹配和计算操作,进而提高了系统事件处理速度。 FIG. 9 is a schematic diagram comparing the event throughput between the present invention and the existing single-mode detection method. It can be seen from FIG. 9 that under the same test conditions, the method of the present invention is also superior to the single-mode detection method in terms of event throughput. The main reason for the analysis lies in the use of the multi-mode event matching model in the present invention. In this invention, we use the multi-mode event model to find related events, realize fast search, calculation and matching operations of related events in the data stream, and reduce many redundant automaton states and transition edge search and calculation in the single-mode detection method , reducing the repeated matching and calculation operations of many data, thereby improving the processing speed of system events.

显然,本发明的上述实施例仅仅是为清楚地说明本发明所作的举例,而并非是对本发明的实施方式的限定。对于所属领域的普通技术人员来说,在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等,均应包含在本发明权利要求的保护范围之内。 Apparently, the above-mentioned embodiments of the present invention are only examples for clearly illustrating the present invention, rather than limiting the implementation of the present invention. For those of ordinary skill in the art, other changes or changes in different forms can be made on the basis of the above description. It is not necessary and impossible to exhaustively list all the implementation manners here. All modifications, equivalent replacements and improvements made within the spirit and principles of the present invention shall be included within the protection scope of the claims of the present invention.

Claims (2)

1. a complex events detecting methods based on multi-mode matching model, it is characterised in that said method comprising the steps of:
S1: build multi-mode finite-state automata Matching Model, realizes especially by following little step:
S1.1: read each monotype event matches expression formula;
S1.2: build each self-corresponding monotype finite-state automata Matching Model according to each monotype event matches expression formula read;
S1.3: building transfer function according to the monotype finite-state automata Matching Model built, the function of transfer function is to realize the state transition path of finite automata;
S1.4: build unsuccessfully function according to the transfer function implementation status built, the function of failure function is to realize, for when there is mismatch, pointing to next state to continue to compare;
S1.5: build output function according to the transfer function built and failure letter implementation status, output function for when certain pattern string obtains coupling, exports or records one the match is successful information;
S1.6: output multi-mode finite-state automata Matching Model;
S2: multi-mode finite-state automata Matching Model is mated, and realizes especially by following little step:
S2.1: read in detection data one by one from data stream;
S2.2: the detection data according to reading determine multi-mode state shift direction;
S2.3: determine whether to perform multi-mode matching failure forwarding function according to state transition path;
S2.4: determine whether to perform multi-mode matching output function according to multi-mode state shift direction or multi-mode matching failure forwarding function;
S2.5: judge whether Detection task completes, if completing, jumping to S2.6 and performing;Otherwise jump to S2.1 perform;
S2.6: output detections result.
Complex events detecting methods based on multi-mode matching model the most according to claim 1, it is characterised in that in step S2.2, described detection data are atomic event.
CN201610296398.7A 2016-05-06 2016-05-06 Multimode matching model based complex event detection method Pending CN105930494A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610296398.7A CN105930494A (en) 2016-05-06 2016-05-06 Multimode matching model based complex event detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610296398.7A CN105930494A (en) 2016-05-06 2016-05-06 Multimode matching model based complex event detection method

Publications (1)

Publication Number Publication Date
CN105930494A true CN105930494A (en) 2016-09-07

Family

ID=56834527

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610296398.7A Pending CN105930494A (en) 2016-05-06 2016-05-06 Multimode matching model based complex event detection method

Country Status (1)

Country Link
CN (1) CN105930494A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112613317A (en) * 2020-12-30 2021-04-06 中国农业银行股份有限公司 Text data cleaning method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100057735A1 (en) * 2008-08-29 2010-03-04 Oracle International Corporation Framework for supporting regular expression-based pattern matching in data streams
CN101753316A (en) * 2008-12-02 2010-06-23 北京启明星辰信息技术股份有限公司 Method and system for intelligently extracting features
CN104361058A (en) * 2014-10-31 2015-02-18 广东工业大学 Hash structure complex event detection method for mass data flow

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100057735A1 (en) * 2008-08-29 2010-03-04 Oracle International Corporation Framework for supporting regular expression-based pattern matching in data streams
CN101753316A (en) * 2008-12-02 2010-06-23 北京启明星辰信息技术股份有限公司 Method and system for intelligently extracting features
CN104361058A (en) * 2014-10-31 2015-02-18 广东工业大学 Hash structure complex event detection method for mass data flow

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘春晖 等: "一种改进的AC多模式匹配算法", 《计算机工程》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112613317A (en) * 2020-12-30 2021-04-06 中国农业银行股份有限公司 Text data cleaning method and device
CN112613317B (en) * 2020-12-30 2023-12-08 中国农业银行股份有限公司 Text data cleaning method and device

Similar Documents

Publication Publication Date Title
CN109991951B (en) Multi-source fault detection and diagnosis method and device
Christou et al. Predictive and explainable machine learning for industrial internet of things applications
CN117235639A (en) A log anomaly detection auxiliary decision-making method and system based on knowledge graph and reinforcement learning
CN104506340A (en) Creation method of decision tree in industrial Ethernet fault diagnosis method
CN118277813B (en) A method for tracing the abnormality of UAV flight trajectory based on fusion clustering algorithm
CN103197168A (en) Fault diagnosis control method in power system based on event set causal chain
CN117354251A (en) Automatic extraction method for electric power Internet of things terminal characteristics
Angelopoulos et al. Impact of classifiers to drift detection method: a comparison
CN105786635B (en) A kind of Complex event processing system and method towards Fault-Sensitive point dynamic detection
Jia et al. Air: Adaptive incremental embedding updating for dynamic knowledge graphs
CN104361058A (en) Hash structure complex event detection method for mass data flow
CN105930494A (en) Multimode matching model based complex event detection method
WO2021171647A1 (en) Operating state classification system, and operating state classification method
Jiang et al. Fault Diagnosis of Gearbox in Multiple Conditions Based on Fine‐Grained Classification CNN Algorithm
Scholz et al. Anomaly Detection in Manufacturing
CN106383863A (en) Isomorphic sub-graph query optimization method
CN117094634A (en) Supply chain inventory collaboration method, system, medium and equipment
Song et al. Robustness analysis of smart manufacturing systems against resource failures: A two-layered network perspective
Wang et al. Predictive maintenance and sensitivity analysis for equipment with multiple quality states
CN120455497B (en) Industrial equipment interconnection method based on industrial control platform
Ghosh et al. PLAT: An automated fault and behavioural anomaly detection tool for PLC controlled manufacturing systems
CN102541855A (en) Information data processing system and method
CN106250549A (en) A kind of Frequent Pattern Mining method based on internal memory
Sun et al. Workload‐Aware Performance Tuning for Multimodel Databases Based on Deep Reinforcement Learning
Huaji et al. Research on the ontology-based complex event processing engine of RFID technology for agricultural products

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160907

RJ01 Rejection of invention patent application after publication