[go: up one dir, main page]

CN107238407A - Project of South-to-North water diversion secure data abnormal patterns find method and system - Google Patents

Project of South-to-North water diversion secure data abnormal patterns find method and system Download PDF

Info

Publication number
CN107238407A
CN107238407A CN201710305503.3A CN201710305503A CN107238407A CN 107238407 A CN107238407 A CN 107238407A CN 201710305503 A CN201710305503 A CN 201710305503A CN 107238407 A CN107238407 A CN 107238407A
Authority
CN
China
Prior art keywords
sensor
safety
data
threshold
sensors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710305503.3A
Other languages
Chinese (zh)
Other versions
CN107238407B (en
Inventor
刘扬
闫新庆
刘雪梅
杨彬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
North China University of Water Resources and Electric Power
Original Assignee
North China University of Water Resources and Electric Power
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by North China University of Water Resources and Electric Power filed Critical North China University of Water Resources and Electric Power
Priority to CN201710305503.3A priority Critical patent/CN107238407B/en
Publication of CN107238407A publication Critical patent/CN107238407A/en
Application granted granted Critical
Publication of CN107238407B publication Critical patent/CN107238407B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01DMEASURING NOT SPECIALLY ADAPTED FOR A SPECIFIC VARIABLE; ARRANGEMENTS FOR MEASURING TWO OR MORE VARIABLES NOT COVERED IN A SINGLE OTHER SUBCLASS; TARIFF METERING APPARATUS; MEASURING OR TESTING NOT OTHERWISE PROVIDED FOR
    • G01D21/00Measuring or testing not otherwise provided for
    • G01D21/02Measuring two or more variables by means not covered by a single other subclass
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Public Health (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Health & Medical Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Water Supply & Treatment (AREA)
  • Theoretical Computer Science (AREA)
  • Alarm Systems (AREA)

Abstract

本发明涉及安全监测技术领域,特别是一种南水北调工程安全数据异常模式发现方法及系统,围绕传感器安全阈值确定、监测数据预测和异常模式发现机制构建三个方面,展开基于“自助法”的传感器安全阈值计算、基于机器学习和多模型方法的监测数据预测、基于智能信息处理的异常模式发现机制构建的研究,为南水北调工程安全异常模式发现提供科学的理论依据和辅助决策,包括:将传感器进行分组;确定每个传感器的安全阈值;确定每组传感器的既定阀值;判断任一传感器收到数据是否超出该传感器的安全阈值;判断该传感器所在组内所发生异常的传感器比率是否超过既定阀值。

The present invention relates to the technical field of safety monitoring, in particular to a method and system for discovering abnormal patterns of safety data in the South-to-North Water Diversion Project. It focuses on sensor safety threshold determination, monitoring data prediction, and abnormal pattern discovery mechanism construction, and develops sensors based on the "self-help method". The research on safety threshold calculation, monitoring data prediction based on machine learning and multi-model methods, and abnormal pattern discovery mechanism based on intelligent information processing provides scientific theoretical basis and auxiliary decision-making for the South-to-North Water Diversion Project safety abnormal pattern discovery, including: Grouping; determining the safety threshold of each sensor; determining the established threshold of each group of sensors; judging whether the data received by any sensor exceeds the safety threshold of the sensor; judging whether the ratio of abnormal sensors in the group where the sensor is located exceeds the established threshold value.

Description

南水北调工程安全数据异常模式发现方法及系统Method and system for discovering abnormal patterns in safety data of South-to-North Water Diversion Project

技术领域technical field

本发明涉及安全监测技术领域,特别是一种南水北调工程安全数据异常模式发现方法及系统。The invention relates to the technical field of safety monitoring, in particular to a method and system for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project.

背景技术Background technique

南水北调中线干线工程是缓解京、津、冀、豫等北部地区水资源短缺紧张,优化我国水资源配置的一项战略性基础设施工程。该工程南起湖北丹江口水库,直达北京的团城湖和天津市外环河,工程全长1432km,是世界上最大的跨流域调水工程。南水北调中线干线工程沿线地质及气象条件的复杂性使南水北调工程安全受到严峻挑战。为保障工程安全,目前南水北调工程沿线已布设传感器六万多个,用于实时采集工程安全数据,达到全时段、全方位了解工程安全的目的。然而,传感器采集的海量数据同样存在数据量大但信息量小的问题,如何从海量数据中分析挖掘出有价值信息,并预测数据变化趋势,对保证南水北调中线干线工程安全具有重要意义。The South-to-North Water Diversion Project is a strategic infrastructure project to alleviate the shortage of water resources in Beijing, Tianjin, Hebei, Henan and other northern regions and to optimize the allocation of water resources in my country. The project starts from the Danjiangkou Reservoir in Hubei in the south and goes directly to Tuancheng Lake in Beijing and the Outer Ring River in Tianjin. The project has a total length of 1432km and is the largest cross-basin water diversion project in the world. The complexity of geological and meteorological conditions along the main line of the South-to-North Water Diversion Project poses severe challenges to the safety of the South-to-North Water Diversion Project. In order to ensure the safety of the project, more than 60,000 sensors have been deployed along the route of the South-to-North Water Diversion Project to collect project safety data in real time, so as to achieve full-time and comprehensive understanding of project safety. However, the mass data collected by sensors also has the problem of large amount of data but small amount of information. How to analyze and mine valuable information from the massive data and predict the trend of data changes is of great significance to ensure the safety of the South-to-North Water Diversion Central Route Project.

目前,在利用多传感器对南水北调中线干线进行工程安全数据采集、分析、预警和决策的工作中还存在一些问题,包括:At present, there are still some problems in the work of using multi-sensors for engineering safety data collection, analysis, early warning and decision-making on the main line of the South-to-North Water Diversion Middle Line, including:

1) 在传感和通信系统中,工作环境的复杂性导致监测数据存在大量缺失及异常现象;1) In the sensing and communication system, the complexity of the working environment leads to a large number of missing and abnormal phenomena in the monitoring data;

2) 传统的基于水利工程机理分析的模型能够针对简单状况下的工程实体进行机理分析,但在分析处理复杂问题时有一定的局限性。同时,机理模型在使用过程中通常需要对数据有一定的要求或假设,而模型本身也必须有明确的数学形式。但真实世界数据的分布通常较为复杂,很难做出任何假定。2) The traditional model based on hydraulic engineering mechanism analysis can analyze the mechanism of engineering entities under simple conditions, but it has certain limitations when analyzing and dealing with complex problems. At the same time, the use of the mechanism model usually requires certain requirements or assumptions on the data, and the model itself must have a clear mathematical form. But the distribution of real-world data is often complex and it is difficult to make any assumptions.

3) 传统数理统计方法通过对工程安全数据的分析,能够得到工程安全数据的产生机理,但对监测数据本身没有预测能力。3) The traditional mathematical statistical method can obtain the generation mechanism of engineering safety data through the analysis of engineering safety data, but it has no predictive ability for monitoring data itself.

从以上存在的实际问题出发,本发明选取南水北调中线干线中的典型建筑物为研究对象,针对同一建筑物的多类型传感器探测数据,针对多类型,多维度南水北调工程安全监测数据,进行数据预处理、数据建模、数据预测和工程安全数据预警等问题进行研究。Proceeding from the above practical problems, the present invention selects typical buildings in the main line of the South-to-North Water Diversion Middle Line as the research object, and performs data preprocessing for multi-type sensor detection data of the same building, and multi-type, multi-dimensional South-to-North Water Diversion Project safety monitoring data , data modeling, data prediction and engineering safety data early warning and other issues.

发明内容Contents of the invention

鉴于此,本发明提供一种南水北调工程安全数据异常模式发现方法及系统,围绕传感器安全阈值确定、监测数据预测和异常模式发现机制构建三个方面,展开基于“自助法”的传感器安全阈值计算、基于机器学习和多模型方法的监测数据预测、基于智能信息处理的异常模式发现机制构建的研究,为南水北调工程安全异常模式发现提供科学的理论依据和辅助决策。In view of this, the present invention provides a method and system for discovering abnormal patterns of safety data in the South-to-North Water Diversion Project, focusing on the three aspects of sensor safety threshold determination, monitoring data prediction and abnormal pattern discovery mechanism construction, and launching sensor safety threshold calculation based on "self-help method", The research on monitoring data prediction based on machine learning and multi-model methods, and the construction of anomaly pattern discovery mechanism based on intelligent information processing provides a scientific theoretical basis and auxiliary decision-making for the discovery of safety anomalies in the South-to-North Water Diversion Project.

为了达到上述目的,本发明是通过以下技术方案实现的:In order to achieve the above object, the present invention is achieved through the following technical solutions:

本发明提供一种南水北调工程安全数据异常模式发现方法,包括: 将传感器进行分组; 确定每个传感器的安全阈值; 确定每组传感器的既定阀值; 判断任一传感器收到数据是否超出该传感器的安全阈值:若不超过安全阈值,则记录该数据;若超过安全阈值,则判断该传感器为发生异常,并进行下一步; 判断该传感器所在组内所发生异常的传感器比率是否超过既定阀值:若超过既定阀值,则进行报警;若不超过既定阀值,则对该传感器数据进行预测,并重新确定该传感器的安全阈值。The present invention provides a method for discovering abnormal patterns of safety data in the South-to-North Water Diversion Project, including: grouping sensors; determining the safety threshold of each sensor; determining the predetermined threshold of each group of sensors; judging whether the data received by any sensor exceeds the Safety threshold: If it does not exceed the safety threshold, record the data; if it exceeds the safety threshold, judge that the sensor is abnormal, and proceed to the next step; judge whether the ratio of abnormal sensors in the group where the sensor is located exceeds the predetermined threshold: If it exceeds the predetermined threshold, an alarm will be issued; if it does not exceed the predetermined threshold, the sensor data will be predicted, and the safety threshold of the sensor will be re-determined.

进一步地,将传感器进行分组,包括: 根据传感器的时空序列对传感器进行自动聚类分组。Further, grouping the sensors includes: automatically clustering and grouping the sensors according to the time-space sequence of the sensors.

进一步地,确定每个传感器的安全阈值,包括: 对每个传感器的数据样本进行抽样; 对抽样样本进行计算得到各个传感器的置信区间,即传感器的安全阈值。Further, determining the safety threshold of each sensor includes: sampling the data samples of each sensor; and calculating the sampling samples to obtain the confidence interval of each sensor, that is, the safety threshold of the sensor.

进一步地,确定每组传感器的既定阀值,包括: 预先设定每组传感器中发生异常的传感器的比例上限,超出上限即超出既定阀值。Further, determining the predetermined threshold of each group of sensors includes: presetting the upper limit of the proportion of abnormal sensors in each group of sensors, exceeding the upper limit means exceeding the predetermined threshold.

进一步地,重新确定该传感器的安全阈值,包括: 建立该传感器与该传感器所在组传感器的数据的非线性回归模型; 利用非线性回归模型对该传感器的数据进行预测;根据预测结果重新确定该传感器的安全阈值。Further, re-determining the safety threshold of the sensor includes: establishing a nonlinear regression model of the sensor and the sensor data of the sensor group; using the nonlinear regression model to predict the data of the sensor; re-determining the sensor according to the prediction result safety threshold.

进一步地,重新确定该传感器的安全阈值之后,还包括: 根据重新确定的传感器安全阈值对该传感器所收到的数据进行判断:若不超过新的安全阈值,则记录该数据;若超过新的安全阈值,则判断该传感器为发生异常,并在该传感器所在组内进行判断是否超出既定阀值。Further, after re-determining the safety threshold of the sensor, it also includes: judging the data received by the sensor according to the re-determined sensor safety threshold: if it does not exceed the new safety threshold, record the data; if it exceeds the new If the safety threshold is determined, it is judged that the sensor is abnormal, and it is judged in the group where the sensor is located whether it exceeds the predetermined threshold.

本发明还提供一种南水北调工程安全数据异常模式发现系统,包括: 多个传感器,用于收集工程安全数据; 传感器分组模块,用于将传感器进行分组; 安全阈值确定模块,用于确定每个传感器的安全阈值; 既定阀值确定模块,用于确定每组传感器的既定阀值; 第一判断模块,用于判断传感器收到数据是否超出该传感器的安全阈值; 第二判断模块,用于判断每组传感器内所发生异常的传感器比率是否超过既定阀值。The present invention also provides a system for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project, including: a plurality of sensors for collecting engineering safety data; a sensor grouping module for grouping sensors; a safety threshold determination module for determining each sensor The safety threshold of the sensor; The established threshold determination module is used to determine the predetermined threshold of each group of sensors; The first judgment module is used to judge whether the data received by the sensor exceeds the safety threshold of the sensor; The second judgment module is used to judge whether each Whether the ratio of abnormal sensors in the sensor group exceeds the predetermined threshold.

进一步地,所述传感器分组模块根据传感器的时空序列对传感器进行自动聚类分组。Further, the sensor grouping module automatically clusters and groups the sensors according to the time-space sequence of the sensors.

进一步地,还包括: 安全阈值重新确定模块,用于利用非线性回归模型对该传感器的数据进行预测,并根据预测结果重新确定该传感器的安全阈值。Further, it also includes: a module for re-determining the safety threshold, configured to use a nonlinear regression model to predict the data of the sensor, and re-determine the safety threshold of the sensor according to the prediction result.

本发明提供一种南水北调工程安全数据异常模式发现方法,具有如下有益效果:The invention provides a method for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project, which has the following beneficial effects:

本发明方法将机器学习、复杂系统参数估计、时-空序列挖掘方法交叉聚集于“时空序列智能聚类”、“时空序列参数预测”、“时空序列异常模式发现”等复杂前沿科学问题,最终完成南水北调工程安全异常模式预警。既体现了现代机器学习方法与复杂系统参数估计的深度交叉和融合,更是试图通过现代计算机智能方法解决时空序列的精确预测与序列异常模式发现的大胆尝试,本发明方法的优点主要体现在以下几个方面:The method of the present invention integrates machine learning, complex system parameter estimation, and time-space sequence mining methods into complex frontier scientific problems such as "intelligent clustering of time-space sequences", "prediction of time-space sequence parameters", and "discovery of abnormal patterns of time-space sequences". The early warning of the abnormal safety mode of the South-to-North Water Diversion Project was completed. It not only embodies the deep intersection and integration of modern machine learning methods and complex system parameter estimation, but also a bold attempt to solve the precise prediction of time-space sequences and the discovery of sequence abnormal patterns through modern computer intelligence methods. The advantages of the method of the present invention are mainly reflected in the following several aspects:

1.基于时空序列挖掘理论,研究时空序列自适应分割方法和序列聚类方法,完成监测传感器的自动分组;1. Based on the theory of time-space sequence mining, research the adaptive segmentation method and sequence clustering method of time-space sequence, and complete the automatic grouping of monitoring sensors;

2.摆脱传统机理模型在数据处理之前需要建立多个假设和精确数据模型的约束,拓宽水利信息数据处理方法。仅关注原始监测数据本身的数据特点,利用高维数据回归分析理论和现代机器学习方法,构建时空序列参数间的关联模型,达到对时空序列参数精确预测的目标;2. Get rid of the traditional mechanism model that needs to establish multiple assumptions and precise data models before data processing, and broaden the data processing methods of water conservancy information. Only focus on the data characteristics of the original monitoring data itself, and use high-dimensional data regression analysis theory and modern machine learning methods to build a correlation model between time-space sequence parameters to achieve the goal of accurate prediction of time-space sequence parameters;

3.深化现代智能信息处理方法融合复杂系统参数估计技术,构建时空序列异常模式发现方法,完成对南水北调工程安全及时预警的目标。3. Deepen modern intelligent information processing methods and integrate complex system parameter estimation technology, construct a method for discovering abnormal patterns of time-space series, and complete the goal of timely early warning of the safety of the South-to-North Water Diversion Project.

南水北调工程安全数据异常模式发现系统的有益效果与南水北调工程安全数据异常模式发现方法类似,不再赘述。The beneficial effects of the system for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project are similar to those of the method for discovering the abnormal patterns of safety data of the South-to-North Water Diversion Project, and will not be repeated here.

附图说明Description of drawings

图1为本发明实施例所提供的南水北调工程安全数据异常模式发现方法的流程示意图;Fig. 1 is a schematic flow diagram of a method for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project provided by an embodiment of the present invention;

图2为本发明实施例所提供的南水北调工程安全数据异常模式发现系统的结构框架图。Fig. 2 is a structural frame diagram of a system for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project provided by an embodiment of the present invention.

具体实施方式detailed description

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

以下结合具体情况说明本发明的示例性实施例:Exemplary embodiments of the present invention are described below in conjunction with specific circumstances:

本发明提供一种南水北调工程安全数据异常模式发现方法,主要包括以下步骤:The present invention provides a method for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project, which mainly includes the following steps:

将传感器进行分组;Group sensors into groups;

确定每个传感器的安全阈值;Determine safety thresholds for each sensor;

确定每组传感器的既定阀值;Determine the established threshold for each group of sensors;

判断任一传感器收到数据是否超出该传感器的安全阈值:若不超过安全阈值,则记录该数据;若超过安全阈值,则判断该传感器为发生异常,并进行下一步;Judging whether the data received by any sensor exceeds the safety threshold of the sensor: if it does not exceed the safety threshold, record the data; if it exceeds the safety threshold, judge that the sensor is abnormal and proceed to the next step;

判断该传感器所在组内所发生异常的传感器比率是否超过既定阀值:若超过既定阀值,则进行报警;若不超过既定阀值,则对该传感器数据进行预测,并重新确定该传感器的安全阈值。Judging whether the ratio of abnormal sensors in the group where the sensor is located exceeds the predetermined threshold: if it exceeds the predetermined threshold, an alarm will be issued; if it does not exceed the predetermined threshold, the sensor data will be predicted and the safety of the sensor will be re-determined threshold.

请参考图1,图1为本发明实施例所提供的南水北调工程安全数据异常模式发现方法的流程示意图;本实施例提供一种南水北调工程安全数据异常模式发现方法,具体包括以下步骤:Please refer to Fig. 1, Fig. 1 is a schematic flow chart of the method for discovering the abnormal mode of safety data of the South-to-North Water Diversion Project provided by the embodiment of the present invention; this embodiment provides a method for discovering the abnormal mode of safety data of the South-to-North Water Diversion Project, which specifically includes the following steps:

步骤S101、根据传感器的时空序列对传感器进行自动聚类分组。基于时空序列挖掘理论,研究时空序列自适应分割方法和序列聚类方法,对监测传感器进行自动分组。Step S101, performing automatic clustering and grouping of sensors according to their spatio-temporal sequences. Based on the theory of spatio-temporal sequence mining, the self-adaptive segmentation method and sequence clustering method of time-space sequence are studied, and the monitoring sensors are automatically grouped.

步骤S102、确定每个传感器的安全阈值。对每个传感器的数据样本进行抽样;对抽样样本进行计算得到各个传感器的置信区间,即传感器的安全阈值。Step S102, determining the safety threshold of each sensor. The data samples of each sensor are sampled; the sampled samples are calculated to obtain the confidence interval of each sensor, that is, the safety threshold of the sensor.

在本实施例中,针对南水北调工程安全监测数据,利用“自助法”对数据样本进行有放回的随机抽样100次,每次抽样样本规模为单个传感器监测总数据量的60%,然后对100个抽样样本进行计算得到各个传感器的置信区间,建立传感器安全阈值区间,为下一步的数据异常模式发现提供判断依据。In this embodiment, for the safety monitoring data of the South-to-North Water Diversion Project, the "self-service method" is used to randomly sample the data samples 100 times with replacement, and the sample size of each sampling is 60% of the total data volume monitored by a single sensor, and then 100 The confidence interval of each sensor is calculated by sampling samples, and the sensor safety threshold interval is established to provide a judgment basis for the next step of data anomaly pattern discovery.

作为一种可实施方式,在计算置信区间时,置信度选取为95%。As an implementable manner, when calculating the confidence interval, the confidence degree is selected as 95%.

步骤S103、确定每组传感器的既定阀值。预先设定每组传感器中发生异常的传感器的比例上限,超出上限即超出既定阀值。Step S103, determining a predetermined threshold value of each group of sensors. The upper limit of the proportion of abnormal sensors in each group of sensors is set in advance, exceeding the upper limit means exceeding the established threshold.

步骤S104、判断任一传感器收到数据是否超出该传感器的安全阈值:若不超过安全阈值,则记录该数据,即按正常数据处理;若超过安全阈值,则判断该传感器为发生异常,并进行下一步;Step S104, judging whether the data received by any sensor exceeds the safety threshold of the sensor: if it does not exceed the safety threshold, record the data, that is, process it as normal data; if it exceeds the safety threshold, judge that the sensor is abnormal, and perform Next step;

步骤S105、判断该传感器所在组内所发生异常的传感器比率是否超过既定阀值:若超过既定阀值,则进行报警;若不超过既定阀值,则进行下一步。Step S105 , judging whether the ratio of abnormal sensors in the sensor group exceeds a predetermined threshold: if it exceeds the predetermined threshold, an alarm is issued; if it does not exceed the predetermined threshold, the next step is performed.

步骤S106、建立该传感器与该传感器所在组传感器的数据的非线性回归模型;Step S106, establishing a nonlinear regression model of the sensor and the sensor data of the sensor group;

步骤S107、利用非线性回归模型对该传感器的数据进行预测;Step S107, using a nonlinear regression model to predict the data of the sensor;

步骤S108、根据预测结果重新确定该传感器的安全阈值。Step S108, redetermine the safety threshold of the sensor according to the prediction result.

步骤S109、根据重新确定的传感器安全阈值对该传感器所收到的数据进行判断:若不超过新的安全阈值,则记录该数据;若超过新的安全阈值,则判断该传感器为发生异常,并在该传感器所在组内进行判断是否超出既定阀值。Step S109, judge the data received by the sensor according to the re-determined sensor safety threshold: if it does not exceed the new safety threshold, record the data; if it exceeds the new safety threshold, judge that the sensor is abnormal, and In the group where the sensor is located, it is judged whether the predetermined threshold is exceeded.

利用“自助法”得到各个传感器的安全阈值;当有新的数据来临,首先判断是否超出了传感器安全阈值,如果没有,则按照正常数据处理,如果超出了安全阈值,则选择与当前传感器聚类为同组的传感器组进行考察;对于相关传感器组,如果该时间段内,该组中发生异常的传感器比率大于既定阀值,则进行工程级报警;如果该事件段内相关传感器组中发生异常的传感器比率小于既定阀值,则利用机器学习方法建立当前传感器与相关传感器组之间的非线性回归模型,并利用传感器组的监测数据对当前传感器数据进行预测,并生成该时刻下的数据安全空间;在新的安全区间下,如果该传感器的当前监测值存在于新的安全区间中,则作为正常数据处理,否则进行传感器级别报警。Use the "self-help method" to get the safety threshold of each sensor; when new data comes, first judge whether it exceeds the sensor safety threshold, if not, then process it according to the normal data, if it exceeds the safety threshold, choose to cluster with the current sensor Investigate the sensor groups of the same group; for the relevant sensor group, if the ratio of abnormal sensors in the group is greater than the predetermined threshold within this time period, an engineering-level alarm will be issued; if an abnormality occurs in the relevant sensor group within the event period If the sensor ratio is less than the predetermined threshold, the machine learning method is used to establish a nonlinear regression model between the current sensor and the relevant sensor group, and the monitoring data of the sensor group is used to predict the current sensor data and generate a data security at that moment. Space; under the new safety interval, if the current monitoring value of the sensor exists in the new safety interval, it will be treated as normal data, otherwise, a sensor-level alarm will be issued.

本发明实施例还提供一种南水北调工程安全数据异常模式发现系统,包括:The embodiment of the present invention also provides a system for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project, including:

多个传感器,用于收集工程安全数据;Multiple sensors to collect engineering safety data;

传感器分组模块,用于将传感器进行分组;The sensor grouping module is used to group the sensors;

安全阈值确定模块,用于确定每个传感器的安全阈值;a safety threshold determination module, configured to determine a safety threshold for each sensor;

既定阀值确定模块,用于确定每组传感器的既定阀值;An established threshold determination module, configured to determine an established threshold for each group of sensors;

第一判断模块,用于判断传感器收到数据是否超出该传感器的安全阈值;The first judging module is used to judge whether the data received by the sensor exceeds the safety threshold of the sensor;

第二判断模块,用于判断每组传感器内所发生异常的传感器比率是否超过既定阀值。The second judging module is used to judge whether the ratio of abnormal sensors in each group of sensors exceeds a predetermined threshold.

进一步地,所述传感器分组模块根据传感器的时空序列对传感器进行自动聚类分组。Further, the sensor grouping module automatically clusters and groups the sensors according to the time-space sequence of the sensors.

进一步地,还包括:Further, it also includes:

安全阈值重新确定模块,用于利用非线性回归模型对该传感器的数据进行预测,并根据预测结果重新确定该传感器的安全阈值。The safety threshold re-determining module is used for predicting the data of the sensor by using the nonlinear regression model, and re-determining the safety threshold of the sensor according to the prediction result.

最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。Finally, it should also be noted that in this text, relational terms such as first and second etc. are only used to distinguish one entity or operation from another, and do not necessarily require or imply that these entities or operations, any such actual relationship or order exists. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.

以上对本发明所提供的具体实施方式进行了详细介绍,本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。The specific implementation methods provided by the present invention have been introduced in detail above, and the principles and implementation modes of the present invention have been explained by using specific examples in this paper. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present invention At the same time, for those skilled in the art, according to the idea of the present invention, there will be changes in the specific implementation and application scope. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (9)

1.一种南水北调工程安全数据异常模式发现方法,其特征在于,包括:1. A method for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project, characterized in that it comprises: 将传感器进行分组;Group sensors into groups; 确定每个传感器的安全阈值;Determine safety thresholds for each sensor; 确定每组传感器的既定阀值;Determine the established threshold for each group of sensors; 判断任一传感器收到数据是否超出该传感器的安全阈值:若不超过安全阈值,则记录该数据;若超过安全阈值,则判断该传感器为发生异常,并进行下一步;Judging whether the data received by any sensor exceeds the safety threshold of the sensor: if it does not exceed the safety threshold, record the data; if it exceeds the safety threshold, judge that the sensor is abnormal and proceed to the next step; 判断该传感器所在组内所发生异常的传感器比率是否超过既定阀值:若超过既定阀值,则进行报警;若不超过既定阀值,则对该传感器数据进行预测,并重新确定该传感器的安全阈值。Judging whether the ratio of abnormal sensors in the group where the sensor is located exceeds the predetermined threshold: if it exceeds the predetermined threshold, an alarm will be issued; if it does not exceed the predetermined threshold, the sensor data will be predicted and the safety of the sensor will be re-determined threshold. 2.根据权利要求1所述的南水北调工程安全数据异常模式发现方法,其特征在于,将传感器进行分组,包括:2. The South-to-North Water Diversion Project safety data anomaly pattern discovery method according to claim 1, is characterized in that, grouping sensors includes: 根据传感器的时空序列对传感器进行自动聚类分组。Automatically cluster and group sensors based on their spatiotemporal sequences. 3.根据权利要求1所述的南水北调工程安全数据异常模式发现方法,其特征在于,确定每个传感器的安全阈值,包括:3. The South-to-North Water Diversion Project safety data anomaly pattern discovery method according to claim 1, is characterized in that, determining the safety threshold of each sensor comprises: 对每个传感器的数据样本进行抽样;Sample data samples from each sensor; 对抽样样本进行计算得到各个传感器的置信区间,即传感器的安全阈值。The confidence interval of each sensor is obtained by calculating the sampling samples, that is, the safety threshold of the sensor. 4.根据权利要求1所述的南水北调工程安全数据异常模式发现方法,其特征在于,确定每组传感器的既定阀值,包括:4. The South-to-North Water Diversion Project safety data abnormal pattern discovery method according to claim 1, characterized in that determining the predetermined threshold of each group of sensors includes: 预先设定每组传感器中发生异常的传感器的比例上限,超出上限即超出既定阀值。The upper limit of the proportion of abnormal sensors in each group of sensors is set in advance, exceeding the upper limit means exceeding the established threshold. 5.根据权利要求1所述的南水北调工程安全数据异常模式发现方法,其特征在于,重新确定该传感器的安全阈值,包括:5. The South-to-North Water Diversion Project safety data abnormal pattern discovery method according to claim 1, characterized in that re-determining the safety threshold of the sensor comprises: 建立该传感器与该传感器所在组传感器的数据的非线性回归模型;Establish a non-linear regression model of the data of the sensor and the sensor group where the sensor is located; 利用非线性回归模型对该传感器的数据进行预测;Using a nonlinear regression model to predict the data of the sensor; 根据预测结果重新确定该传感器的安全阈值。Re-determine the sensor's safety threshold based on the predicted results. 6.根据权利要求1所述的南水北调工程安全数据异常模式发现方法,其特征在于,重新确定该传感器的安全阈值之后,还包括:6. The South-to-North Water Diversion Project safety data abnormal pattern discovery method according to claim 1, characterized in that after re-determining the safety threshold of the sensor, it also includes: 根据重新确定的传感器安全阈值对该传感器所收到的数据进行判断:若不超过新的安全阈值,则记录该数据;若超过新的安全阈值,则判断该传感器为发生异常,并在该传感器所在组内进行判断是否超出既定阀值。Judgment is made on the data received by the sensor according to the re-determined sensor safety threshold: if it does not exceed the new safety threshold, record the data; if it exceeds the new safety threshold, it is judged that the sensor is abnormal, and It is judged within the group whether it exceeds the established threshold. 7.一种南水北调工程安全数据异常模式发现系统,其特征在于,包括:7. A system for discovering abnormal patterns in safety data of the South-to-North Water Diversion Project, characterized in that it includes: 多个传感器,用于收集工程安全数据;Multiple sensors to collect engineering safety data; 传感器分组模块,用于将传感器进行分组;The sensor grouping module is used to group the sensors; 安全阈值确定模块,用于确定每个传感器的安全阈值;a safety threshold determination module, configured to determine a safety threshold for each sensor; 既定阀值确定模块,用于确定每组传感器的既定阀值;An established threshold determination module, configured to determine an established threshold for each group of sensors; 第一判断模块,用于判断传感器收到数据是否超出该传感器的安全阈值;The first judging module is used to judge whether the data received by the sensor exceeds the safety threshold of the sensor; 第二判断模块,用于判断每组传感器内所发生异常的传感器比率是否超过既定阀值。The second judging module is used to judge whether the ratio of abnormal sensors in each group of sensors exceeds a predetermined threshold. 8.根据权利要求7所述的南水北调工程安全数据异常模式发现系统,其特征在于,所述传感器分组模块根据传感器的时空序列对传感器进行自动聚类分组。8. The system for discovering abnormal patterns of safety data of the South-to-North Water Diversion Project according to claim 7, wherein the sensor grouping module performs automatic clustering and grouping of the sensors according to the time-space sequence of the sensors. 9.根据权利要求7所述的南水北调工程安全数据异常模式发现系统,其特征在于,还包括:9. The South-to-North Water Diversion Project safety data abnormal pattern discovery system according to claim 7, is characterized in that, also comprises: 安全阈值重新确定模块,用于利用非线性回归模型对该传感器的数据进行预测,并根据预测结果重新确定该传感器的安全阈值。The safety threshold re-determining module is used for predicting the data of the sensor by using the nonlinear regression model, and re-determining the safety threshold of the sensor according to the prediction result.
CN201710305503.3A 2017-05-03 2017-05-03 Project of South-to-North water diversion secure data abnormal patterns find method and system Active CN107238407B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710305503.3A CN107238407B (en) 2017-05-03 2017-05-03 Project of South-to-North water diversion secure data abnormal patterns find method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710305503.3A CN107238407B (en) 2017-05-03 2017-05-03 Project of South-to-North water diversion secure data abnormal patterns find method and system

Publications (2)

Publication Number Publication Date
CN107238407A true CN107238407A (en) 2017-10-10
CN107238407B CN107238407B (en) 2019-10-08

Family

ID=59984116

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710305503.3A Active CN107238407B (en) 2017-05-03 2017-05-03 Project of South-to-North water diversion secure data abnormal patterns find method and system

Country Status (1)

Country Link
CN (1) CN107238407B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220081880A1 (en) * 2019-03-29 2022-03-17 Yvon Ilaka Mupende Device for determining the actual state and/or the remaining service life of a construction, materials-handling and/or conveyor machine

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5328026B2 (en) * 2009-03-25 2013-10-30 能美防災株式会社 Fire receiver
CN105636094A (en) * 2016-03-16 2016-06-01 中国地质大学(武汉) Wireless sensor network early warning method and system based on clustering compressed sensing
CN105871634A (en) * 2016-06-01 2016-08-17 北京蓝海讯通科技股份有限公司 Method and application for detecting cluster anomalies and cluster managing system
CN106067032A (en) * 2016-05-26 2016-11-02 哈尔滨工业大学 Evaluation method for sensor selection methods for data anomaly monitoring

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5328026B2 (en) * 2009-03-25 2013-10-30 能美防災株式会社 Fire receiver
CN105636094A (en) * 2016-03-16 2016-06-01 中国地质大学(武汉) Wireless sensor network early warning method and system based on clustering compressed sensing
CN106067032A (en) * 2016-05-26 2016-11-02 哈尔滨工业大学 Evaluation method for sensor selection methods for data anomaly monitoring
CN105871634A (en) * 2016-06-01 2016-08-17 北京蓝海讯通科技股份有限公司 Method and application for detecting cluster anomalies and cluster managing system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王晶: "无线传感器网络多维数据异常值检测", 《信息科技辑》 *
胡石: "基于神经网络的无线传感器网络异常数据检测方法", 《计算机科学》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220081880A1 (en) * 2019-03-29 2022-03-17 Yvon Ilaka Mupende Device for determining the actual state and/or the remaining service life of a construction, materials-handling and/or conveyor machine

Also Published As

Publication number Publication date
CN107238407B (en) 2019-10-08

Similar Documents

Publication Publication Date Title
CN115578015B (en) Sewage treatment whole process supervision method, system and storage medium based on Internet of things
CN111898669B (en) Abnormal event early warning system of direct-current submerged arc furnace based on machine learning
CN101726568A (en) Early-warning prediction method for water environment safety
CN103439933B (en) A kind of production run adaptive process monitoring system and method applying OCSVM
CN105608842B (en) A kind of damaged online monitoring alarm device of nuclear reactor fuel
CN106372799B (en) A Grid Security Risk Prediction Method
CN105873105A (en) Method for anomaly detection and positioning of mobile communication network based on network experience quality
CN116304766A (en) A fast assessment method for switchgear status based on multi-sensors
CN110851422A (en) Data anomaly monitoring model construction method based on machine learning
CN110795888A (en) Petroleum drilling risk prediction method
CN104633455A (en) Intelligent security city pipe network real-time monitoring system and method
CN118280070A (en) Artificial intelligence-based geological disaster monitoring, predicting and early warning method
CN114124420A (en) Network flow abnormity detection method based on deep neural network
CN108268893A (en) A kind of chemical industrial park method for early warning and device based on machine learning
Ak et al. Data analytics and uncertainty quantification for energy prediction in manufacturing
CN117408519A (en) Chemical reaction process risk early warning method and device based on deep learning algorithm
CN120085120A (en) A switch cabinet latent fault detection method and system
CN107238407B (en) Project of South-to-North water diversion secure data abnormal patterns find method and system
CN113328986A (en) Network flow abnormity detection method based on combination of convolutional neural network and LSTM
CN111461461B (en) Hydraulic engineering abnormity detection method and system
CN118193954B (en) A method and system for detecting abnormal data in distribution network based on edge computing
CN119030139A (en) Power operation and maintenance management system based on power information acquisition equipment operation status monitoring
CN111160490A (en) Deep learning dangerous rock deformation prediction method and device based on multiple time sequences
CN116704729A (en) Industrial kiln early warning system and method based on big data analysis
CN115988034A (en) Intelligent monitoring method and system for pipeline safety state and service platform

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant