CN108132423B - A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability - Google Patents
A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability Download PDFInfo
- Publication number
- CN108132423B CN108132423B CN201711340903.4A CN201711340903A CN108132423B CN 108132423 B CN108132423 B CN 108132423B CN 201711340903 A CN201711340903 A CN 201711340903A CN 108132423 B CN108132423 B CN 108132423B
- Authority
- CN
- China
- Prior art keywords
- data
- state
- monitoring
- matrix
- equipment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 134
- 230000007704 transition Effects 0.000 title claims abstract description 105
- 238000000034 method Methods 0.000 title claims abstract description 36
- 239000011159 matrix material Substances 0.000 claims abstract description 100
- 238000009826 distribution Methods 0.000 claims abstract description 66
- 238000012546 transfer Methods 0.000 claims abstract description 35
- 230000008859 change Effects 0.000 claims abstract description 29
- 230000008569 process Effects 0.000 claims abstract description 16
- 238000005259 measurement Methods 0.000 claims abstract description 7
- 238000004891 communication Methods 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 18
- 230000009466 transformation Effects 0.000 claims description 15
- 230000004807 localization Effects 0.000 claims description 12
- 208000024891 symptom Diseases 0.000 claims description 6
- 238000005070 sampling Methods 0.000 claims description 5
- 230000002159 abnormal effect Effects 0.000 claims description 4
- 238000001514 detection method Methods 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000013501 data transformation Methods 0.000 claims description 3
- 102100029469 WD repeat and HMG-box DNA-binding protein 1 Human genes 0.000 claims 1
- 101710097421 WD repeat and HMG-box DNA-binding protein 1 Proteins 0.000 claims 1
- 238000000638 solvent extraction Methods 0.000 claims 1
- 230000007480 spreading Effects 0.000 claims 1
- 238000007405 data analysis Methods 0.000 abstract description 2
- 238000004458 analytical method Methods 0.000 abstract 1
- 238000013480 data collection Methods 0.000 description 11
- 238000011160 research Methods 0.000 description 4
- 230000005856 abnormality Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000013524 data verification Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012850 discrimination method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01R—MEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
- G01R31/00—Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
- G01R31/08—Locating faults in cables, transmission lines, or networks
- G01R31/088—Aspects of digital computing
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Supply And Distribution Of Alternating Current (AREA)
- Remote Monitoring And Control Of Power-Distribution Networks (AREA)
Abstract
本发明涉及一种基于状态转移概率电力系统监测数据失真的快速定位方法,本方法重点关注监测数据的状态变化概率,通过多次转移后的概率分布的特征与设备运行时数据正常变化的阈值进行对比,导出失真定位矩阵,快速定位失真数据的位置。整体分为四个步骤1.监测数据属性实体划分,2.电力设备的监测数据转移概率,电力设备的多次监测转移矩阵,4.失真数据定位与子系统失真程度度量。本发明数据采集阶能将各种数据格式或数据结构统一成状态转移概率,因此规避了多源异构数据中不同数据格式对数据分析造成的影响,降低了分析系统的复杂度。
The invention relates to a fast positioning method for monitoring data distortion in a power system based on the state transition probability. The method focuses on the state change probability of the monitoring data, and uses the characteristics of the probability distribution after multiple transfers and the threshold of normal data changes during equipment operation. Compare and export the distortion location matrix to quickly locate the location of the distortion data. The whole process is divided into four steps: 1. Monitoring data attribute entity division, 2. Power equipment monitoring data transfer probability, power equipment multiple monitoring transfer matrix, 4. Distortion data location and subsystem distortion degree measurement. The data acquisition stage of the present invention can unify various data formats or data structures into state transition probabilities, thereby avoiding the influence of different data formats in multi-source heterogeneous data on data analysis, and reducing the complexity of the analysis system.
Description
技术领域technical field
本发明属于电力通信数据与大数据技术相融合的应用研究,通过将检测数据的状态变化概率话,将电力通信系统中各数据的格式统一成数据状态的转移概率分布,再通过切普曼-柯尔莫哥洛夫方程对初始转移分布进行多次转移,得到多次转移的理论分布,通过各个数据的变化阈值限定,计算出能直接定位失真数据的失真定位矩阵。The present invention belongs to the applied research on the fusion of electric power communication data and big data technology. By analyzing the state change probability of the detection data, the format of each data in the electric power communication system is unified into the transition probability distribution of the data state, and then through the Chapman- The Kolmogorov equation performs multiple transfers on the initial transfer distribution to obtain the theoretical distribution of multiple transfers, and calculates the distortion location matrix that can directly locate the distorted data by limiting the change threshold of each data.
背景技术Background technique
电力通信系统是一个广义的概念,泛指与电力电网相关各子系统以及他们产生的数据信息,随着我国电力电网的不断发展,用电需求的不断扩大,电力通信系统中产生的数据也日益庞大起来,同时数据产生的速度也越来越快,不同子系统间的数据结构也有很大差异,电力通信系统产生的数据成为了典型的大数据。The power communication system is a broad concept, which generally refers to the subsystems related to the power grid and the data information generated by them. With the continuous development of my country's power grid and the continuous expansion of electricity demand, the data generated in the power communication system is also increasing. At the same time, the speed of data generation is getting faster and faster, and the data structure between different subsystems is also very different. The data generated by the power communication system has become a typical big data.
电力通信系统是保障电力系统正常运行的重要系统,通过各类传感器对设备进行监测,为设备故障提供决策,为设备维修提供依据。大型的电力通信系统产生海量的监测数据,这些数据在采集、录入、传输、交换与储存过程中不可避免的会出现数据失真现象。现实中,这些失真数据已经成为定位与分析电力设备故障的重要阻碍因素。提高电力通信系统的数据质量是完善电力电网系统的重要环节。国内外专家对电力系统中失真数据的检测提出了多种解决方法,某研究能量管理系统中出现数据失真的原因,然后从原因着手,解决数据失真的问题。某研究是从数据平台着手试图提高数据质量。某研究从插值拟合的角度来预测数据质量.某研究基于公共信息模型(Common Information Model,CIM)的高速模型交换格式CIM/E文本为载体的不同系统间的数据校验技术,采用改进的多源数据筛选较优质量数据的手段,以及根据主站状态估计对现场数据进行反馈的方法,提高了电网调度系统的整体数据质量。The power communication system is an important system to ensure the normal operation of the power system. It monitors equipment through various sensors, provides decision-making for equipment failures, and provides a basis for equipment maintenance. Large-scale power communication systems generate massive amounts of monitoring data, and data distortion will inevitably occur during the collection, entry, transmission, exchange and storage of these data. In reality, these distorted data have become an important obstacle to locate and analyze power equipment faults. Improving the data quality of the power communication system is an important part of perfecting the power grid system. Experts at home and abroad have proposed a variety of solutions to the detection of distorted data in power systems. A researcher studies the causes of data distortion in energy management systems, and then starts from the reasons to solve the problem of data distortion. A certain research started from the data platform to try to improve the data quality. A research predicts data quality from the perspective of interpolation fitting. A research based on the common information model (Common Information Model, CIM) high-speed model exchange format CIM/E text as the data verification technology between different systems, using the improved The means of multi-source data screening for better quality data, and the method of feedbacking field data according to the state estimation of the master station improve the overall data quality of the power grid dispatching system.
以上基于电力设备状态估计的不良数据检测在对待局系统局部数据的质量提高时有一定效果,但是对于整个电力通信系统产生的多源异构大数据仍不具备良好的适用性,并且针对每一种数据失真建立相应知识库的成本是相对较高的。本发明提出基于状态转移概率电力系统监测数据失真的快速定位,不关注电力监测数据的多源异构格式,转而重点关注数据的状态变化,重点考察监测数据变化的概率与设备实际变化是否一致,通过国家电网真实数据集验证算法,实验结果表明该方法适用于大数据环境下的电力通信系统产生的多源异构数据失真的快速定位。The above bad data detection based on the state estimation of power equipment has a certain effect in improving the quality of local data in the off-site system, but it still does not have good applicability to the multi-source heterogeneous big data generated by the entire power communication system, and is aimed at each The cost of establishing a corresponding knowledge base for such data distortion is relatively high. The present invention proposes the rapid positioning of power system monitoring data distortion based on the state transition probability, does not pay attention to the multi-source heterogeneous format of the power monitoring data, and instead focuses on the state change of the data, focusing on whether the probability of the monitoring data change is consistent with the actual change of the equipment , the algorithm is verified by the real data set of the State Grid, and the experimental results show that the method is suitable for the rapid location of multi-source heterogeneous data distortion generated by the power communication system in the big data environment.
发明内容Contents of the invention
电力通信系统是保障电力系统正常稳定运行的必要信息传输系统,通过各类传感器对电气设备进行实时监测,及时上报异常数据保障设备稳定运行,为设备故障提供决策支持,为设备维修提供定位依据。目前,随着电网系统的规模越来越大以及各类电力设备监测传感器的种类逐渐增加,电力通信系统每天都在产生海量的监测数据,这些数据在采集、录入、传输、交换与储存过程中不可避免的会出现数据失真现象。实际上,在大数据时代,此类失真数据已经成为定位与分析电力设备故障的重要阻碍因素之一。电力通信系统中出现的数据失真主要包括以下两个方面:The power communication system is a necessary information transmission system to ensure the normal and stable operation of the power system. It monitors electrical equipment in real time through various sensors, reports abnormal data in time to ensure stable operation of equipment, provides decision support for equipment failures, and provides positioning basis for equipment maintenance. At present, with the increasing scale of the power grid system and the increasing types of monitoring sensors for various power equipment, the power communication system generates a large amount of monitoring data every day. These data are collected, entered, transmitted, exchanged and stored. Data distortion is inevitable. In fact, in the era of big data, such distorted data has become one of the important obstacles to locating and analyzing power equipment faults. Data distortion in power communication systems mainly includes the following two aspects:
1.违反监测数据一致性,监测数据一致是指,系统实际记录到的数据是否满足一定的函数依赖或逻辑关系,是否有超出属性定义域的数据。1. Violation of monitoring data consistency. Monitoring data consistency refers to whether the data actually recorded by the system satisfies a certain functional dependency or logical relationship, and whether there is data beyond the domain of the attribute definition.
2.违反监测数据完整性,监测数据完整性是电力信息系统实际录入的数据是存在缺失,是否完全记录了按设计要求记录的全部数据。2. Violation of the integrity of the monitoring data. The integrity of the monitoring data refers to whether the data actually entered in the power information system is missing, and whether all the data recorded according to the design requirements are completely recorded.
针对目前电力通信系统中出现的数据质量偏低问题,本发明旨在建立一种自动对电力数据进行失真识别、定位与失真程度度量的随机过程快速判别方法,本方法的关注重点是电力数据状态的变化与实际设备运行状态变化的一致性。将电力通信系统中各数据的格式统一成数据状态的转移概率分布,再通过切普曼-柯尔莫哥洛夫方程对初始转移分布进行多次转移,得到多次转移的理论分布,通过各个数据的变化阈值限定,计算出能直接定位失真数据的失真定位矩阵。Aiming at the problem of low data quality in the current power communication system, the present invention aims to establish a random process rapid discrimination method for automatic distortion identification, location and distortion degree measurement of power data. The focus of this method is the status of power data The consistency between the change and the actual equipment running state change. Unify the format of each data in the power communication system into the transition probability distribution of the data state, and then perform multiple transfers on the initial transfer distribution through the Chapman-Kolmogorov equation to obtain the theoretical distribution of multiple transfers. The data change threshold is limited, and the distortion location matrix that can directly locate the distortion data is calculated.
为完成以上目标,本发明整体包含四个步骤,整体流程图见附图1For accomplishing above object, the present invention comprises four steps as a whole, and overall flow chart is shown in accompanying drawing 1
步骤1监测数据属性实体划分Step 1 Monitor data attribute entity division
监测数据采集本质上是指电力电网系统中,各类传感器对设备进行监测并将监测数据传送到指定位置储存的过程。在不同的子系统中监测数据具有多种储存机制,本步骤的目的是使得采集到的数据按照数据源的实体设备划分为实体数据集合。本步骤分为3个子步骤:Monitoring data collection essentially refers to the process in which various sensors monitor equipment in the power grid system and transmit the monitoring data to a designated location for storage. There are multiple storage mechanisms for monitoring data in different subsystems. The purpose of this step is to divide the collected data into entity data sets according to the entity equipment of the data source. This step is divided into 3 sub-steps:
步骤1.1原始监测数据采集Step 1.1 Raw monitoring data collection
定义1原始数据集合Definition 1 Raw data set
D={d11,d12,...,d21,d22,...,dn1,dn2,...,dnk}D={d 11 ,d 12 ,...,d 21 ,d 22 ,...,d n1 ,d n2 ,...,d nk }
其中,D表示系统中采集到的各类原始数据的属性集合,dij表示第i设备的第k属性,在实际信息采集中,由于电网各子系统的数据采集方式不尽相同,所以将各子系统采集后的数据汇总后往往是杂乱的数据集合。Among them, D represents the attribute collection of various raw data collected in the system, and d ij represents the k-th attribute of the i-th device. In actual information collection, because the data collection methods of each subsystem of the power grid are different, so each After the data collected by the subsystem is summarized, it is often a messy data collection.
步骤1.2监测数据按实体来源地址分类Step 1.2 Monitoring data is classified by entity source address
在数据采集过程,将数据按照来源索引字段进行分类,此步骤分为两种形式。In the data collection process, the data is classified according to the source index field. This step is divided into two forms.
形式一:已采集数据的分类Form 1: Classification of collected data
对于形如定义1中已采集到的数据,需要按照数据属性中的数据源索引字段分类,将来源于相同实体的监测数据分为一组For the collected data in the form of definition 1, it needs to be classified according to the data source index field in the data attribute, and the monitoring data from the same entity should be divided into one group
形式二:按实体采集监测数据Form 2: Collect monitoring data by entity
对于可按实体输出监测数据的子系统,直接采集其监测数据,并在数据中注明实体的唯一属性。For subsystems that can output monitoring data by entity, directly collect the monitoring data, and indicate the unique attribute of the entity in the data.
定义2设备数据向量Definition 2 device data vector
di=(di1,di2,...,dik)d i =(d i1 ,d i2 ,...,d ik )
其中,di表示来自于第i(1≤i≤n)设备的监测数据向量,dij表示第i设备的第j,(1≤j≤k)属性。这样按任意实体设备i的数据源归类的数据以向量的形式被di记录Among them, d i represents the monitoring data vector from the i-th (1≤i≤n) device, and d ij represents the j-th (1≤j≤k) attribute of the i-th device. In this way, the data classified according to the data source of any physical device i is recorded by d i in the form of vector
步骤1.3构造属性扩展矩阵Step 1.3 Construct attribute expansion matrix
在电力通信系统中,电力电网包含各种子系统,各类设备通过协同工作来支撑一个子系统的正常运行,一套子系统的数据变化可以通过属性扩展矩阵来形式化。In the power communication system, the power grid contains various subsystems, and all kinds of equipment work together to support the normal operation of a subsystem. The data changes of a set of subsystems can be formalized through the attribute expansion matrix.
定义3属性扩展矩阵Definition 3 attribute expansion matrix
其中,M(t)表示t时刻的属性扩展矩阵,dij(t)表示第i(1≤i≤n)设备的第j,(1≤j≤ki≤k)属性在时刻t时的监测值。上式中定义了设备拥有的属性数量的上界k,其中k=max(k1,k2,...)表示子系统中拥有最多属性设备的属性个数,它规定了扩展矩阵的列数。电力子系统中不同设备的属性个数是可以不同的,这样扩展矩阵M(t)中的多数行向量没有定义的属性值,称此类没有定义的属性为扩展属性,他们的作用在于保持扩展矩阵的矩形结构以便于接下来的数学处理。扩展矩阵M(t)完整包含了电力子系统中在时刻t的属性值。Among them, M(t) represents the attribute expansion matrix at time t, d ij (t) represents the jth of the i (1≤i≤n) device, and (1≤j≤k i ≤k) attributes at time t monitoring value. The upper bound k of the number of attributes owned by the equipment is defined in the above formula, where k=max(k 1 ,k 2 ,...) indicates the number of attributes of the equipment with the most attributes in the subsystem, and it specifies the columns of the extended matrix number. The number of attributes of different devices in the power subsystem can be different, so most of the row vectors in the extended matrix M(t) have no defined attribute values, and such undefined attributes are called extended attributes, and their role is to maintain the extended The rectangular structure of the matrix is convenient for subsequent mathematical processing. The extended matrix M(t) completely contains the attribute values of the power subsystem at time t.
步骤2电力设备的监测数据转移概率Step 2. Monitoring data transfer probability of power equipment
本步骤分为3个子步骤This step is divided into 3 sub-steps
步骤2.1数据的状态划分Step 2.1 State division of data
电力设备各个属性的值都有一定范围正常域,当某一属性值超出其正常域时,称该属性出现异常值。对于离散型属性值,其定义域是可数的离散的点。对于连续型属性值,其定义域是连续的区间。根据具体的电力设备,可以将监测到的不同的数据值根据其定义域,划分到不同的状态中。当设备处于正常状态时,监测数据的状态称为稳定态。对于离散型数据值,可根据具体数据属性特点将不同的点归为一类,组成一个状态。也可以直接将每一个点视为一个状态。对于连续型数据值,可以将连续数值区间按具体特征划分为片段,每个片段为一个状态。The value of each attribute of electrical equipment has a certain range of normal range. When a certain attribute value exceeds its normal range, it is said that the attribute has an abnormal value. For discrete attribute values, its definition domain is countable discrete points. For continuous attribute values, its definition domain is a continuous interval. According to specific electrical equipment, different monitored data values can be divided into different states according to their definition domains. When the equipment is in a normal state, the state of the monitoring data is called steady state. For discrete data values, different points can be classified into one category according to specific data attribute characteristics to form a state. You can also directly treat each point as a state. For continuous data values, the continuous value interval can be divided into segments according to specific characteristics, and each segment is a state.
该步骤的作用是将设备的数据以离散的状态进行描述,以监测数据在状态中的转移来刻画数据的变化。The function of this step is to describe the data of the device in a discrete state, to monitor the transfer of data in the state to describe the change of the data.
步骤2.2设备的状态Step 2.2 Status of the device
电力设备可运行在不同的状态中,不同的设备状态代表了设备运行的阶段特征。例如,变压器的运行状态可以分为正常运行,高温运行,设备异常。不同的设备拥有不同的运行状态,而电力数据不一致的本质表现是实际设备的运行状态与监测数据状态的不一致,即监测数据不能真实反映设备实际运行情况,设备的运行状态多种多样,而数据状态的种类更多。通常一个设备状态对应于一系列数据状态的特定组合。在过去,找到其中不一致的对应关系是复杂的,而本发明采取的技术方式是,不关注设备与数据的状态本身,而是通过考察这些状态的变化,定位到不一致的对应关系。Electric equipment can operate in different states, and different equipment states represent the stage characteristics of equipment operation. For example, the operating status of a transformer can be divided into normal operation, high temperature operation, and equipment abnormality. Different equipment has different operating states, and the essence of power data inconsistency is the inconsistency between the actual operating state of the equipment and the status of the monitoring data, that is, the monitoring data cannot truly reflect the actual operating status of the equipment, and the operating status of the equipment is varied, while the data There are more types of states. Usually a device state corresponds to a specific combination of a series of data states. In the past, it was complicated to find the inconsistent correspondence, but the technical method adopted by the present invention is not to pay attention to the state of the equipment and data itself, but to locate the inconsistent correspondence by examining the changes of these states.
当电力系统中的设备稳定运行时,设备的各类监测数据值也应该稳定在一定范围中,同时其变动规则也具有稳定性。当设备运行出现状态改变时,一部分监测数据就会更大的概率偏离原来的状态,进入新的状态,从而打破之前这种稳定规则。When the equipment in the power system is running stably, the values of various monitoring data of the equipment should also be stable within a certain range, and its change rules are also stable. When the state of the equipment changes, a part of the monitoring data will deviate from the original state with a greater probability and enter a new state, thus breaking the previous stable rules.
该步骤建立了以概率方式来描述数据状态转移的通道,即数据状态的转移可以按照概率分布的数学形式描述。This step establishes a channel to describe the data state transition in a probabilistic manner, that is, the data state transition can be described in the mathematical form of probability distribution.
步骤2.3统计历史监测数据的状态转移频率Step 2.3 Statistical state transition frequency of historical monitoring data
定义4数据状态转移频率Definition 4 Data State Transition Frequency
其中,fij表示对设备监测Ni+Nj次后数据从状态i转移到状态j的频率,Ni表示处于状态i的次数,Nj表示处于状态j的次数。Among them, f ij represents the frequency of data transfer from state i to state j after N i + N j times of equipment monitoring, N i represents the number of times in state i, and N j represents the number of times in state j.
定律1伯努利大数定律Law 1 Bernoulli's law of large numbers
其中,P表示概率,ε表示一个正数,伯努利大数定律表明,通过大量搜集电力设备监测数据的状态转移数据,计算得出的状态转移的频率会依概率收敛。该定律表明可以通过多次监测当期数据或直接统计历史数据的方式来估算电力设备的状态转移概率。Among them, P represents probability, and ε represents a positive number. Bernoulli's law of large numbers shows that the frequency of state transition calculated by collecting a large number of state transition data of power equipment monitoring data will converge according to probability. This law shows that the state transition probability of power equipment can be estimated by monitoring current data multiple times or directly counting historical data.
步骤3电力设备的多次监测转移矩阵Step 3 Multi-monitoring transfer matrix of power equipment
本步骤分为4个子步骤This step is divided into 4 sub-steps
步骤3.1数据转移概率的马尔可夫性与时齐性Step 3.1 Markov property and time homogeneity of data transition probability
由于每次监测数据本质是在离散的时间点上的采样,同时依步骤2.2的状态划分方法,监测数据也被划分为离散的状态,所以电力监测数据的状态转移本质上是一个时间与状态都离散的随机过程。由于每次监测,数据所处的状态只与上一次所处的状态的相关(即以上一次所在状态为初始态进行概率转移),所以此随机过程具有马尔科夫性,本质是马尔科夫链。对数据的每一次监测采样获得的结果与第一次监测所处的状态是无关的,这说明数据的状态转移概率亦具有时齐性。此步骤建立了状态转移概率与马尔科夫链的联系,在理论上说明设备监测数据的转移概率可以构建具有马尔科夫性的转移矩阵。Since each monitoring data is essentially sampled at a discrete time point, and the monitoring data is also divided into discrete states according to the state division method in step 2.2, the state transition of power monitoring data is essentially a time and state transition. Discrete random process. Since each monitoring, the state of the data is only related to the previous state (that is, the previous state is the initial state for probability transfer), so this random process has Markov properties, and its essence is a Markov chain . The result of each monitoring sampling of the data has nothing to do with the state of the first monitoring, which shows that the state transition probability of the data is also time-homogeneous. This step establishes the connection between the state transition probability and the Markov chain, which theoretically shows that the transition probability of equipment monitoring data can construct a transition matrix with Markov properties.
步骤3.2电力数据的多次转移矩阵Step 3.2 Multiple transfer matrix of power data
定义6单次状态转移矩阵Define 6 single state transition matrix
其中,E表示电力数据的单次状态转移矩阵,其元素pij表示数据从状态i经历一次监测时间间隔后转移到状态j的概率。其概率值可以通过步骤2.3中所述方法统计而来。单次状态转移矩阵E完整描述了任一电力数据经历一次监测后所有状态转移的概率分布情况。根据切普曼-柯尔莫哥洛夫方程Among them, E represents a single state transition matrix of electric power data, and its element p ij represents the probability of data transitioning from state i to state j after a monitoring time interval. Its probability value can be calculated by the method described in step 2.3. The single state transition matrix E fully describes the probability distribution of all state transitions after any power data undergoes one monitoring. According to the Chapman-Kolmogorov equation
可以得到电力数据经过任意次状态转移后的转移矩阵,The transition matrix of the power data after any number of state transitions can be obtained,
由切普曼-柯尔莫哥洛夫方程可知,只需通过步骤3.2方法构造电力数据的单次状态转移矩阵,就能直接计算出任意m+n次后的状态转移矩阵,E(n+m)中i行j列的元素表示数据以初始状态i被监测,而后经历m+n次监测后,数据处于状态j的概率。通过这种计算方法得出来的多次转移概率的实际意义是指当电力设备处于稳定状态时,数据应该达到的理论状态转移概率。From the Chapman-Kolmogorov equation, we can directly calculate the state transition matrix after any m+n times, E (n+ The element in row i and column j in m) represents the probability that the data is monitored in the initial state i, and then after m+n times of monitoring, the data is in state j. The actual meaning of the multiple transition probability obtained by this calculation method refers to the theoretical state transition probability that the data should reach when the power equipment is in a steady state.
步骤3.4次转移后所处的理论概率分布Step 3. Theoretical probability distribution after 4 transitions
定义7电力数据初始分布Definition 7 Initial distribution of power data
其中,φ(0)表监测示数据的初始分布向量,元素(0≤j≤n)表示数据在初始时刻0所处的各个状态的概率。根据切普曼-柯尔莫哥洛夫方程,可以得到任意m+n时刻的理论概率分布。Among them, φ(0) represents the initial distribution vector of monitoring data, and the element (0≤j≤n) represents the probability of each state of the data at the initial time 0. According to the Chapman-Kolmogorov equation, the theoretical probability distribution at any m+n time can be obtained.
定理1多次转移状态的理论概率分布Theorem 1 Theoretical probability distribution of multiple transition states
φ(m+n)=φ(0)Em+n φ(m+n)=φ(0)E m+n
其中,φ(m+n)表示计算出的电力数据经历m+n次转移后所处的理论概率分布Among them, φ(m+n) represents the theoretical probability distribution of the calculated power data after m+n transfers
步骤4失真数据定位与子系统失真程度度量Step 4 Distortion data location and measurement of subsystem distortion degree
设备在一定时间段内处于稳定状态是指设备在该时间段内未发生状态的改变,对应的监测数据的状态概率分布反映出了设备在该状态下应有的数据状态概率模式特征。一旦电力通信系统检测到当前电力设备的状态不符合相应的状态转移概率的分布特征时,则可以推断系统可能出现了监测数据失真。监测周期内,如果设备运行状态未发生变化,而相应数据状态发生变化,称这种情况为第一类失真。如果设备运行状态发生变化,而数据状态未发生相应变化,称这种情况为第二类失真。本步骤分为5个子步骤The equipment is in a stable state within a certain period of time, which means that the equipment does not change its state within this period of time, and the corresponding state probability distribution of monitoring data reflects the characteristics of the data state probability mode that the equipment should have in this state. Once the power communication system detects that the current state of the power equipment does not conform to the distribution characteristics of the corresponding state transition probability, it can be inferred that the monitoring data may be distorted in the system. During the monitoring period, if the operating state of the equipment does not change, but the corresponding data state changes, this situation is called the first type of distortion. If the operating state of the device changes without a corresponding change in the state of the data, this is called Type 2 distortion. This step is divided into 5 sub-steps
步骤4.1电力数据状态转移分布的偏离度Step 4.1 Deviation degree of power data state transition distribution
定义9电力数据状态转移分布的偏离度Definition 9 Deviation Degree of Power Data State Transition Distribution
δ>v(m+n)=||φ′(m+n)-φ(m+n)||2 δ>v(m+n)=||φ′(m+n)-φ(m+n)|| 2
其中v(m+n)表示电力数据状态转移分布的偏离度,φ′(m+n)是电力设备在经历m+n次监测后实际的统计出的转移概率分布,φ(m+n)是电力设备经历m+n次监测后的理论概率分布,通过向量的2-范数度量这两种分布的偏离度。δ>0是一个可以根据实际情况定义的偏离阈值,它由子系统中设备的实际固有的状态转移特征决定。Among them, v(m+n) represents the deviation degree of the power data state transition distribution, φ′(m+n) is the actual statistical transition probability distribution of the power equipment after m+n times of monitoring, φ(m+n) is the theoretical probability distribution of power equipment after m+n times of monitoring, and the deviation between the two distributions is measured by the 2-norm of the vector. δ>0 is a deviation threshold that can be defined according to the actual situation, and it is determined by the actual inherent state transition characteristics of the equipment in the subsystem.
当δ>v(m+n)时,说明设备数据的实际状态转移分布偏离程度在合理范围内。When δ>v(m+n), it indicates that the deviation degree of the actual state transition distribution of the equipment data is within a reasonable range.
当δ≤v(m+n)时,说明设备数据的实际转移分布偏离度超出阈值。When δ≤v(m+n), it indicates that the deviation degree of the actual transfer distribution of the device data exceeds the threshold.
当设备稳定运行时,出现δ≤v(m+n),表明设备数据出现第一类失真征兆,因为数据在某些时刻以较大概率出现在了不常出现的状态中。当设备运行过程中出现变化时,出现δ>v(m+n),表明设备出现第二类失真征兆,因为随着设备的运行状态的变化,监测数据理应发生相应状态变化,而实际状态分布并未发生较大变化。When the device is running stably, δ≤v(m+n) appears, indicating that the device data has the first type of distortion symptoms, because the data appears in an infrequent state with a high probability at certain moments. When there is a change during the operation of the equipment, δ>v(m+n) appears, indicating that the equipment has the second type of distortion symptoms, because as the operating state of the equipment changes, the monitoring data should undergo corresponding state changes, while the actual state distribution No major changes occurred.
步骤4.2子系统中失真数据的定位Step 4.2 Localization of distorted data in subsystems
定义10子系统的监测数据偏离矩阵Definition 10 Subsystem monitoring data deviation matrix
电网子系统是包含系统相应设备的有机整体,通过偏离矩阵可以快速定位子系统中失真的数据。The grid subsystem is an organic whole including the corresponding equipment of the system, and the distorted data in the subsystem can be quickly located through the deviation matrix.
其中,V(t)表示子系统的监测数据经历t个标准周期后的偏离矩阵,元素vij(t)表示经历t个标准周期后子系统中第i个设备的第j个属性的数据状态转移分布的偏离度。根据步骤2.1中扩展矩阵的性质,如果矩阵相应位置对应的设备不存在该属性,则命令元素值为-1。Among them, V(t) represents the deviation matrix of the monitoring data of the subsystem after t standard periods, and the element v ij (t) represents the data status of the jth attribute of the i-th device in the subsystem after t standard periods Skewness of the transfer distribution. According to the properties of the extended matrix in step 2.1, if the attribute does not exist in the device corresponding to the corresponding position of the matrix, the value of the command element is -1.
步骤4.3进行失真定位,基于以下公式:Step 4.3 performs distortion localization, based on the following formula:
定义11失真定位矩阵Definition 11 Distortion Localization Matrix
其中,表示一个n×k的失真定位矩阵,元素只由数值0、1和-1组成,F表示矩阵0-1变换函数,它的功能是将矩阵中元素进行0-1变换,Δ(t)表示偏离阈矩阵,其元素δij(t)表示对应于偏离度vij(t)的偏离阈值,函数f(zij∈V(t)-Δ(t))表示对其自变量进行0-1变换,其定义如下定义11元素0-1变换函数in, Represents an n×k distortion positioning matrix, the elements are only composed of values 0, 1 and -1, F represents the matrix 0-1 transformation function, its function is to perform 0-1 transformation on the elements in the matrix, Δ(t) represents Deviation threshold matrix, its element δ ij (t) represents the deviation threshold corresponding to the degree of deviation v ij (t), and the function f(z ij ∈ V(t)-Δ(t)) represents the 0-1 Transformation, which is defined as follows: define an 11-element 0-1 transformation function
失真定位矩阵的实际意义在于把所有超出阈值的元素标记为1而未超过阈值的标记为0,元素在矩阵中的位置又一一对应于数据在子系统中的位置。这样一个失真定位矩阵就包含了一个子系统中失真数据的变化状态与位置信息。The actual significance of the distortion localization matrix is to mark all elements that exceed the threshold as 1 and those that do not exceed the threshold as 0, and the positions of the elements in the matrix correspond to the positions of the data in the subsystem one by one. Such a distortion localization matrix contains the change state and position information of the distortion data in a subsystem.
步骤4.4缺失数据的处理Step 4.4 Handling of missing data
数据缺失是也数据失真的一种表现,无论设备真实的运行状态如何,只要监测数据出现数据空值,则认为是数据失真,此时执行缺失数据变换Missing data is also a manifestation of data distortion. Regardless of the actual operating status of the equipment, as long as there is a data null value in the monitoring data, it is considered to be data distortion. At this time, the missing data transformation is performed.
其中,g(dij)表示缺失变换函数,它将M(t)中所有为空值的元素映射为1,并进入当空值数据倍缺失变换函数映射为中的1元素时,表明其失真的属性。在完整执行一次失真定位后,需要将本轮的原始数据作为历史数据对状态转移频率进行更新。这样就能保证步骤3.2中构造的单次状态转移矩阵的概率元素具有实时准确性。Among them, g(d ij ) represents the missing transformation function, which maps all null elements in M(t) to 1, and enters When the null value data is missing, the transformation function is mapped to When the 1 element in , indicates its distortion property. After a complete distortion location is performed, the original data of this round needs to be used as historical data to update the state transition frequency. In this way, the probability elements of the single state transition matrix constructed in step 3.2 can be guaranteed to have real-time accuracy.
步骤4.5数据失真定位,与电力子系统失真程度度量。Step 4.5 Data distortion location, and power subsystem distortion degree measurement.
遍历失真定位矩阵一次,记录每一等于1的元素所在矩阵中位置一一对应了一个失真数据所在位置,其行标代表设备号。列表代该设备的属性号。同时取出元素占所有元素的比例定量反映了电力子系统失真程度。Traverse the distortion location matrix once, and record the location of each element equal to 1 in the matrix corresponding to the location of a distortion data, and its row label represents the device number. The list represents the property number of the device. The ratio of extracted elements to all elements quantitatively reflects the degree of distortion of the power subsystem.
需要注意的是:步骤3.1中涉及到马氏性证明It should be noted that: step 3.1 involves the Markov property proof
证明:电力数据状态转移过程{Xn,n=0,1,2,…}是马尔科夫链Proof: the power data state transition process {X n ,n=0,1,2,…} is a Markov chain
由于n是有限可列的,并且及状态i,j,i0,i1,…,in-1总存在条件概率P(Xn+1=j|X0=i0,X1=i1,…,Xn-1=in-1,Xn=i)使得P(Xn+1=j|X0=i0,X1=i1,…,Xn-1=in-1,Xn=i)=P(Xn+1=j|Xn=i)Since n is finitely listable, and And state i,j,i 0 ,i 1 ,…,i n-1 total existence conditional probability P(X n+1 =j|X 0 =i 0 ,X 1 =i 1 ,…,X n-1 = i n-1 ,X n =i) such that P(X n+1 =j|X 0 =i 0 ,X 1 =i 1 ,...,X n-1 =i n-1 ,X n =i)= P(X n+1 =j|X n =i)
即采样过程满足马氏性,{Xn,n=0,1,2,…}为一个马尔可夫链。That is, the sampling process satisfies the Markov property, and {X n , n=0, 1, 2,...} is a Markov chain.
因此,本发明具有如下优点:1、本发明数据采集阶能将各种数据格式或数据结构统一成状态转移概率,因此规避了多源异构数据中不同数据格式对数据分析造成的影响,降低了分析系统的复杂度。2、数据的失真判别是可以实时动态进行的,理论上只要给定某一时刻的数据状态初始分布,就能计算出此时的数据失真情况。同时失真的数据的又会返回更新历史数据。3、数据失真判别与定位具有同时性,当数据失真被判定后,失真的位置也同时根据失真定位矩阵被确定,可以缩短设备故障的抢修时间。Therefore, the present invention has the following advantages: 1. The data collection stage of the present invention can unify various data formats or data structures into a state transition probability, thus avoiding the impact of different data formats in multi-source heterogeneous data on data analysis, reducing analyzed the complexity of the system. 2. Data distortion discrimination can be performed dynamically in real time. In theory, as long as the initial distribution of the data state at a certain moment is given, the data distortion at that time can be calculated. At the same time, the distorted data will return to update the historical data. 3. Data distortion discrimination and positioning are simultaneous. When data distortion is determined, the location of the distortion is also determined according to the distortion positioning matrix, which can shorten the repair time of equipment failure.
附图说明Description of drawings
附图1是本发明的整体流程示意图。Accompanying drawing 1 is the overall flow diagram of the present invention.
具体实施方式Detailed ways
步骤1监测数据属性实体划分Step 1 Monitor data attribute entity division
监测数据采集本质上是指电力电网系统中,各类传感器对设备进行监测并将监测数据传送到指定位置储存的过程。在不同的子系统中监测数据具有多种储存机制,本步骤的目的是使得采集到的数据按照数据源的实体设备划分为实体数据集合。本步骤分为3个子步骤Monitoring data collection essentially refers to the process in which various sensors monitor equipment in the power grid system and transmit the monitoring data to a designated location for storage. There are multiple storage mechanisms for monitoring data in different subsystems. The purpose of this step is to divide the collected data into entity data sets according to the entity equipment of the data source. This step is divided into 3 sub-steps
步骤1.1原始监测数据采集Step 1.1 Raw monitoring data collection
定义1原始数据集合Definition 1 Raw data set
D={d11,d12,...,d21,d22,...,dn1,dn2,...,dnk}D={d 11 ,d 12 ,...,d 21 ,d 22 ,...,d n1 ,d n2 ,...,d nk }
其中,D表示系统中采集到的各类原始数据的属性集合,dij表示第i设备的第k属性,在实际信息采集中,由于电网各子系统的数据采集方式不尽相同,所以将各子系统采集后的数据汇总后往往是杂乱的数据集合。Among them, D represents the attribute collection of various raw data collected in the system, and d ij represents the k-th attribute of the i-th device. In actual information collection, because the data collection methods of each subsystem of the power grid are different, so each After the data collected by the subsystem is summarized, it is often a messy data collection.
步骤1.2监测数据按实体来源地址分类Step 1.2 Monitoring data is classified by entity source address
在数据采集过程,将数据按照来源索引字段进行分类,此步骤分为两种形式。形式一:已采集数据的分类In the data collection process, the data is classified according to the source index field. This step is divided into two forms. Form 1: Classification of collected data
对于形如定义1中已采集到的数据,需要按照数据属性中的数据源索引字段分类,将来源于相同实体的监测数据分为一组For the collected data in the form of definition 1, it needs to be classified according to the data source index field in the data attribute, and the monitoring data from the same entity should be divided into one group
形式二:按实体采集监测数据Form 2: Collect monitoring data by entity
对于可按实体输出监测数据的子系统,直接采集其监测数据,并在数据中注明实体的唯一属性。For subsystems that can output monitoring data by entity, directly collect the monitoring data, and indicate the unique attribute of the entity in the data.
定义2设备数据向量Definition 2 device data vector
di=(di1,di2,...,dik)d i =(d i1 ,d i2 ,...,d ik )
其中,di表示来自于第i(1≤i≤n)设备的监测数据向量,dij表示第i设备的第j,(1≤j≤k)属性。这样按任意实体i设备的数据源归类的数据以向量的形式被di记录Among them, d i represents the monitoring data vector from the i-th (1≤i≤n) device, and d ij represents the j-th (1≤j≤k) attribute of the i-th device. In this way, the data classified according to the data source of any entity i device is recorded by d i in the form of vector
步骤1.3构造属性扩展矩阵Step 1.3 Construct attribute expansion matrix
在电力通信系统中,电力电网包含各种子系统,各类设备通过协同工作来支撑一个子系统的正常运行,一套子系统的数据变化可以通过属性扩展矩阵来形式化。In the power communication system, the power grid contains various subsystems, and all kinds of equipment work together to support the normal operation of a subsystem. The data changes of a set of subsystems can be formalized through the attribute expansion matrix.
定义3属性扩展矩阵Definition 3 attribute expansion matrix
其中,M(t)表示t时刻的属性扩展矩阵,dij(t)表示第i(1≤i≤n)设备的第j,(1≤j≤ki≤k)属性在时刻t时的监测值。上式中定义了设备拥有的属性数量的上界k,其中k=max(k1,k2,...)表示子系统中拥有最多属性设备的属性个数,它规定了扩展矩阵的列数。电力子系统中不同设备的属性个数是可以不同的,这样扩展矩阵M(t)中的多数行向量没有定义的属性值,称此类没有定义的属性为扩展属性,他们的作用在于保持扩展矩阵的矩形结构以便于接下来的数学处理。扩展矩阵M(t)完整包含了电力子系统中在时刻t的属性值。Among them, M(t) represents the attribute expansion matrix at time t, d ij (t) represents the jth of the i (1≤i≤n) device, and (1≤j≤k i ≤k) attributes at time t monitoring value. The upper bound k of the number of attributes owned by the equipment is defined in the above formula, where k=max(k 1 ,k 2 ,...) indicates the number of attributes of the equipment with the most attributes in the subsystem, and it specifies the columns of the extended matrix number. The number of attributes of different devices in the power subsystem can be different, so most of the row vectors in the extended matrix M(t) have no defined attribute values, and such undefined attributes are called extended attributes, and their role is to maintain the extended The rectangular structure of the matrix is convenient for subsequent mathematical processing. The extended matrix M(t) completely contains the attribute values of the power subsystem at time t.
步骤2电力设备的监测数据转移概率Step 2. Monitoring data transfer probability of power equipment
本步骤分为3个子步骤This step is divided into 3 sub-steps
步骤2.1数据的状态划分Step 2.1 State division of data
电力设备各个属性的值都有一定范围正常域,当某一属性值超出其正常域时,称该属性出现异常值。对于离散型属性值,其定义域是可数的离散的点。对于连续型属性值,其定义域是连续的区间。根据具体的电力设备,可以将监测到的不同的数据值根据其定义域,划分到不同的状态中。当设备处于正常状态时,监测数据的状态称为稳定态。对于离散型数据值,可根据具体数据属性特点将不同的点归为一类,组成一个状态。也可以直接将每一个点视为一个状态。对于连续型数据值,可以将连续数值区间按具体特征划分为片段,每个片段为一个状态。The value of each attribute of electrical equipment has a certain range of normal range. When a certain attribute value exceeds its normal range, it is said that the attribute has an abnormal value. For discrete attribute values, its definition domain is countable discrete points. For continuous attribute values, its definition domain is a continuous interval. According to specific electrical equipment, different monitored data values can be divided into different states according to their definition domains. When the equipment is in a normal state, the state of the monitoring data is called steady state. For discrete data values, different points can be classified into one category according to specific data attribute characteristics to form a state. You can also directly treat each point as a state. For continuous data values, the continuous value interval can be divided into segments according to specific characteristics, and each segment is a state.
该步骤的作用是将设备的数据以离散的状态进行描述,以监测数据在状态中的转移来刻画数据的变化。The function of this step is to describe the data of the device in a discrete state, to monitor the transfer of data in the state to describe the change of the data.
步骤2.2设备的状态Step 2.2 Status of the device
电力设备可运行在不同的状态中,不同的设备状态代表了设备运行的阶段特征。例如,变压器的运行状态可以分为正常运行,高温运行,设备异常。不同的设备拥有不同的运行状态,而电力数据不一致的本质表现是实际设备的运行状态与监测数据状态的不一致,即监测数据不能真实反映设备实际运行情况,设备的运行状态多种多样,而数据状态的种类更多。通常一个设备状态对应于一系列数据状态的特定组合。在过去,找到其中不一致的对应关系是复杂的,而本发明采取的技术方式是,不关注设备与数据的状态本身,而是通过考察这些状态的变化,定位到不一致的对应关系。Electric equipment can operate in different states, and different equipment states represent the stage characteristics of equipment operation. For example, the operating status of a transformer can be divided into normal operation, high temperature operation, and equipment abnormality. Different equipment has different operating states, and the essence of power data inconsistency is the inconsistency between the actual operating state of the equipment and the status of the monitoring data, that is, the monitoring data cannot truly reflect the actual operating status of the equipment, and the operating status of the equipment is varied, while the data There are more types of states. Usually a device state corresponds to a specific combination of a series of data states. In the past, it was complicated to find the inconsistent correspondence, but the technical method adopted by the present invention is not to pay attention to the state of the equipment and data itself, but to locate the inconsistent correspondence by examining the changes of these states.
当电力系统中的设备稳定运行时,设备的各类监测数据值也应该稳定在一定范围中,同时其变动规则也具有稳定性。当设备运行出现状态改变时,一部分监测数据就会更大的概率偏离原来的状态,进入新的状态,从而打破之前这种稳定规则。When the equipment in the power system is running stably, the values of various monitoring data of the equipment should also be stable within a certain range, and its change rules are also stable. When the state of the equipment changes, a part of the monitoring data will deviate from the original state with a greater probability and enter a new state, thus breaking the previous stable rules.
该步骤建立了以概率方式来描述数据状态转移的通道,即数据状态的转移可以按照概率分布的数学形式描述。This step establishes a channel to describe the data state transition in a probabilistic manner, that is, the data state transition can be described in the mathematical form of probability distribution.
步骤2.3统计历史监测数据的状态转移频率Step 2.3 Statistical state transition frequency of historical monitoring data
定义4数据状态转移频率Definition 4 Data State Transition Frequency
其中,fij表示对设备监测Ni+Nj次后数据从状态i转移到状态j的频率,Ni表示处于状态i的次数,Nj表示处于状态j的次数。Among them, f ij represents the frequency of data transfer from state i to state j after N i + N j times of equipment monitoring, N i represents the number of times in state i, and N j represents the number of times in state j.
定律1伯努利大数定律Law 1 Bernoulli's law of large numbers
其中,P表示概率,ε表示一个正数,伯努利大数定律表明,通过大量搜集电力设备监测数据的状态转移数据,计算得出的状态转移的频率会依概率收敛。该定律表明可以通过多次监测当期数据或直接统计历史数据的方式来估算电力设备的状态转移概率。Among them, P represents probability, and ε represents a positive number. Bernoulli's law of large numbers shows that the frequency of state transition calculated by collecting a large number of state transition data of power equipment monitoring data will converge according to probability. This law shows that the state transition probability of power equipment can be estimated by monitoring current data multiple times or directly counting historical data.
步骤3电力设备的多次监测转移矩阵Step 3 Multi-monitoring transfer matrix of power equipment
本步骤分为4个子步骤This step is divided into 4 sub-steps
步骤3.1数据转移概率的马尔可夫性与时齐性Step 3.1 Markov property and time homogeneity of data transition probability
由于每次监测数据本质是在离散的时间点上的采样,同时依步骤2.2的状态划分方法,监测数据也被划分为离散的状态,所以电力监测数据的状态转移本质上是一个时间与状态都离散的随机过程。由于每次监测,数据所处的状态只与上一次所处的状态的相关(即以上一次所在状态为初始态进行概率转移),所以此随机过程具有马尔科夫性,本质是马尔科夫链。对数据的每一次监测采样获得的结果与第一次监测所处的状态是无关的,这说明数据的状态转移概率亦具有时齐性。此步骤建立了状态转移概率与马尔科夫链的联系,在理论上说明设备监测数据的转移概率可以构建具有马尔科夫性的转移矩阵。Since each monitoring data is essentially sampled at a discrete time point, and the monitoring data is also divided into discrete states according to the state division method in step 2.2, the state transition of power monitoring data is essentially a time and state transition. Discrete random process. Since each monitoring, the state of the data is only related to the previous state (that is, the previous state is the initial state for probability transfer), so this random process has Markov properties, and its essence is a Markov chain . The result of each monitoring sampling of the data has nothing to do with the state of the first monitoring, which shows that the state transition probability of the data is also time-homogeneous. This step establishes the connection between the state transition probability and the Markov chain, which theoretically shows that the transition probability of equipment monitoring data can construct a transition matrix with Markov properties.
步骤3.2电力数据的多次转移矩阵Step 3.2 Multiple transfer matrix of power data
定义6单次状态转移矩阵Define 6 single state transition matrix
其中,E表示电力数据的单次状态转移矩阵,其元素pij表示数据从状态i经历一次监测时间间隔后转移到状态j的概率。其概率值可以通过步骤2.3中所述方法统计而来。单次状态转移矩阵E完整描述了任一电力数据经历一次监测后所有状态转移的概率分布情况。根据切普曼-柯尔莫哥洛夫方程Among them, E represents a single state transition matrix of electric power data, and its element p ij represents the probability of data transitioning from state i to state j after a monitoring time interval. Its probability value can be calculated by the method described in step 2.3. The single state transition matrix E fully describes the probability distribution of all state transitions after any power data undergoes one monitoring. According to the Chapman-Kolmogorov equation
可以得到电力数据经过任意次状态转移后的转移矩阵,The transition matrix of the power data after any number of state transitions can be obtained,
由切普曼-柯尔莫哥洛夫方程可知,只需通过步骤3.2方法构造电力数据的单次状态转移矩阵,就能直接计算出任意m+n次后的状态转移矩阵,E(n+m)中i行j列的元素表示数据以初始状态i被监测,而后经历m+n次监测后,数据处于状态j的概率。通过这种计算方法得出来的多次转移概率的实际意义是指当电力设备处于稳定状态时,数据应该达到的理论状态转移概率。From the Chapman-Kolmogorov equation, we can directly calculate the state transition matrix after any m+n times, E (n+ The element in row i and column j in m) represents the probability that the data is monitored in the initial state i, and then after m+n times of monitoring, the data is in state j. The actual meaning of the multiple transition probability obtained by this calculation method refers to the theoretical state transition probability that the data should reach when the power equipment is in a steady state.
步骤3.4次转移后所处的理论概率分布Step 3. Theoretical probability distribution after 4 transitions
定义7电力数据初始分布Definition 7 Initial distribution of power data
其中,φ(0)表监测示数据的初始分布向量,元素(0≤j≤n)表示数据在初始时刻0所处的各个状态的概率。根据切普曼-柯尔莫哥洛夫方程,可以得到任意m+n时刻的理论概率分布。Among them, φ(0) represents the initial distribution vector of monitoring data, and the element (0≤j≤n) represents the probability of each state of the data at the initial time 0. According to the Chapman-Kolmogorov equation, the theoretical probability distribution at any m+n time can be obtained.
定理1多次转移状态的理论概率分布Theorem 1 Theoretical probability distribution of multiple transition states
φ(m+n)=φ(0)Em+n φ(m+n)=φ(0)E m+n
其中,φ(m+n)表示计算出的电力数据经历m+n次转移后所处的理论概率分布。Among them, φ(m+n) represents the theoretical probability distribution of the calculated power data after m+n transfers.
步骤4失真数据定位Step 4 Distortion data localization
设备在一定时间段内处于稳定状态是指设备在该时间段内未发生状态的改变,对应的监测数据的状态概率分布反映出了设备在该状态下应有的数据状态概率模式特征。一旦电力通信系统检测到当前电力设备的状态不符合相应的状态转移概率的分布特征时,则可以推断系统可能出现了监测数据失真。监测周期内,如果设备运行状态未发生变化,而相应数据状态发生变化,称这种情况为第一类失真。如果设备运行状态发生变化,而数据状态未发生相应变化,称这种情况为第二类失真。本步骤分为5个子步骤The equipment is in a stable state within a certain period of time, which means that the equipment does not change its state within this period of time, and the corresponding state probability distribution of monitoring data reflects the characteristics of the data state probability mode that the equipment should have in this state. Once the power communication system detects that the current state of the power equipment does not conform to the distribution characteristics of the corresponding state transition probability, it can be inferred that the monitoring data may be distorted in the system. During the monitoring period, if the operating state of the equipment does not change, but the corresponding data state changes, this situation is called the first type of distortion. If the operating state of the device changes without a corresponding change in the state of the data, this is called Type 2 distortion. This step is divided into 5 sub-steps
步骤4.1电力数据状态转移分布的偏离度Step 4.1 Deviation degree of power data state transition distribution
定义9电力数据状态转移分布的偏离度Definition 9 Deviation Degree of Power Data State Transition Distribution
δ>v(m+n)=||φ′(m+n)-φ(m+n)||2 δ>v(m+n)=||φ′(m+n)-φ(m+n)|| 2
其中v(m+n)表示电力数据状态转移分布的偏离度,φ′(m+n)是电力设备在经历m+n次监测后实际的统计出的转移概率分布,φ(m+n)是电力设备经历m+n次监测后的理论概率分布,通过向量的2-范数度量这两种分布的偏离度。δ>0是一个可以根据实际情况定义的偏离阈值,当δ>v(m+n)时,说明设备数据的实际状态转移分布偏离程度在合理范围内,当δ≤v(m+n)时,说明设备数据的实际转移分布偏离度超出阈值。Among them, v(m+n) represents the deviation degree of the power data state transition distribution, φ′(m+n) is the actual statistical transition probability distribution of the power equipment after m+n times of monitoring, φ(m+n) is the theoretical probability distribution of power equipment after m+n times of monitoring, and the deviation between the two distributions is measured by the 2-norm of the vector. δ>0 is a deviation threshold that can be defined according to the actual situation. When δ>v(m+n), it means that the deviation degree of the actual state transition distribution of the equipment data is within a reasonable range. When δ≤v(m+n) , indicating that the deviation degree of the actual transfer distribution of the device data exceeds the threshold.
当设备稳定运行时,出现δ≤v(m+n),表明设备数据出现第一类失真征兆,因为数据在某些时刻以较大概率出现在了不常出现的状态中。当设备运行过程中出现变化时,出现δ>v(m+n),表明设备出现第二类失真征兆,因为随着设备的运行状态的变化,监测数据理应发生相应状态变化,而实际状态分布并未发生较大变化。When the device is running stably, δ≤v(m+n) appears, indicating that the device data has the first type of distortion symptoms, because the data appears in an infrequent state with a high probability at certain moments. When there is a change during the operation of the equipment, δ>v(m+n) appears, indicating that the equipment has the second type of distortion symptoms, because as the operating state of the equipment changes, the monitoring data should undergo corresponding state changes, while the actual state distribution No major changes occurred.
步骤4.2子系统中失真数据的定位Step 4.2 Localization of distorted data in subsystems
定义10子系统的监测数据偏离矩阵Definition 10 Subsystem monitoring data deviation matrix
电网子系统是包含系统相应设备的有机整体,通过偏离矩阵可以快速定位子系统中失真的数据。The grid subsystem is an organic whole including the corresponding equipment of the system, and the distorted data in the subsystem can be quickly located through the deviation matrix.
其中,V(t)表示子系统的监测数据经历t个标准周期后的偏离矩阵,元素vij(t)表示经历t个标准周期后子系统中第i个设备的第j个属性的数据状态转移分布的偏离度。根据步骤2.1中扩展矩阵的性质,如果矩阵相应位置对应的设备不存在该属性,则命令元素值为-1。Among them, V(t) represents the deviation matrix of the monitoring data of the subsystem after t standard periods, and the element v ij (t) represents the data status of the jth attribute of the i-th device in the subsystem after t standard periods Skewness of the transfer distribution. According to the properties of the extended matrix in step 2.1, if the attribute does not exist in the device corresponding to the corresponding position of the matrix, the value of the command element is -1.
步骤4.3Step 4.3
定义11失真定位矩阵Definition 11 Distortion Localization Matrix
其中,表示一个n×k的失真定位矩阵,元素只由数值0、1和-1组成,F表示矩阵0-1变换函数,它的功能是将矩阵中元素进行0-1变换,Δ(t)表示偏离阈矩阵,其元素δij(t)表示对应于偏离度vij(t)的偏离阈值,函数f(zij∈V(t)-Δ(t))表示对其自变量进行0-1变换,其定义如下定义11元素0-1变换函数in, Represents an n×k distortion positioning matrix, the elements are only composed of values 0, 1 and -1, F represents the matrix 0-1 transformation function, its function is to perform 0-1 transformation on the elements in the matrix, Δ(t) represents Deviation threshold matrix, its element δ ij (t) represents the deviation threshold corresponding to the degree of deviation v ij (t), and the function f(z ij ∈ V(t)-Δ(t)) represents the 0-1 Transformation, which is defined as follows: define an 11-element 0-1 transformation function
失真定位矩阵的实际意义在于把所有超出阈值的元素标记为1而未超过阈值的标记为0,元素在矩阵中的位置又一一对应于数据在子系统中的位置。这样一个失真定位矩阵就包含了一个子系统中失真数据的变化状态与位置信息。The actual significance of the distortion localization matrix is to mark all elements that exceed the threshold as 1 and those that do not exceed the threshold as 0, and the positions of the elements in the matrix correspond to the positions of the data in the subsystem one by one. Such a distortion localization matrix contains the change state and position information of the distortion data in a subsystem.
步骤4.4缺失数据的处理Step 4.4 Handling of missing data
数据缺失是也数据失真的一种表现,无论设备真实的运行状态如何,只要监测数据出现数据空值,则认为是数据失真,此时执行缺失数据变换Missing data is also a manifestation of data distortion. Regardless of the actual operating status of the equipment, as long as there is a data null value in the monitoring data, it is considered to be data distortion. At this time, the missing data transformation is performed.
其中,g(dij)表示缺失变换函数,它将M(t)中所有为空值的元素映射为1,并进入当空值数据倍缺失变换函数映射为中的1元素时,表明其失真的属性。在完整执行一次失真定位后,需要将本轮的原始数据作为历史数据对状态转移频率进行更新。这样就能保证步骤3.2中构造的单次状态转移矩阵的概率元素具有实时准确性。Among them, g(d ij ) represents the missing transformation function, which maps all null elements in M(t) to 1, and enters When the null value data is missing, the transformation function is mapped to When the 1 element in , indicates its distortion property. After a complete distortion location is performed, the original data of this round needs to be used as historical data to update the state transition frequency. In this way, the probability elements of the single state transition matrix constructed in step 3.2 can be guaranteed to have real-time accuracy.
步骤4.5数据失真定位,与电力子系统失真程度度量。Step 4.5 Data distortion location, and power subsystem distortion degree measurement.
遍历失真定位矩阵一次,记录每一等于1的元素所在矩阵中位置一一对应了一个失真数据所在位置,其行标代表设备号。列表代该设备的属性号。同时取出元素占所有元素的比例定量反映了电力子系统失真程度。Traverse the distortion location matrix once, and record the location of each element equal to 1 in the matrix corresponding to the location of a distortion data, and its row label represents the device number. The list represents the property number of the device. The ratio of extracted elements to all elements quantitatively reflects the degree of distortion of the power subsystem.
本文中所描述的具体实施例仅仅是对本发明精神作举例说明。本发明所属技术领域的技术人员可以对所描述的具体实施例做各种各样的修改或补充或采用类似的方式替代,但并不会偏离本发明的精神或者超越所附权利要求书所定义的范围。The specific embodiments described herein are merely illustrative of the spirit of the invention. Those skilled in the art to which the present invention belongs can make various modifications or supplements to the described specific embodiments or adopt similar methods to replace them, but they will not deviate from the spirit of the present invention or go beyond the definition of the appended claims range.
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711340903.4A CN108132423B (en) | 2017-12-14 | 2017-12-14 | A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711340903.4A CN108132423B (en) | 2017-12-14 | 2017-12-14 | A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108132423A CN108132423A (en) | 2018-06-08 |
CN108132423B true CN108132423B (en) | 2019-11-22 |
Family
ID=62389414
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711340903.4A Expired - Fee Related CN108132423B (en) | 2017-12-14 | 2017-12-14 | A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108132423B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111008238B (en) * | 2019-11-15 | 2023-10-10 | 武汉楚誉科技股份有限公司 | Key mode automatic positioning and early warning method based on associated evolution big data |
CN112256801B (en) * | 2020-10-10 | 2024-04-09 | 深圳力维智联技术有限公司 | Method, system and storage medium for extracting key entity in entity relation diagram |
CN113515158B (en) * | 2021-09-13 | 2021-12-24 | 常州旭泰克系统科技有限公司 | Equipment state monitoring method based on probability hybrid finite state machine |
CN114328150B (en) * | 2021-12-21 | 2025-02-25 | 奇安信科技集团股份有限公司 | Industrial safety control method, electronic equipment, and storage medium |
CN114756722B (en) * | 2022-04-25 | 2022-10-04 | 珠海市鸿瑞信息技术股份有限公司 | Multi-data centralized management control system and method |
CN118199080B (en) * | 2024-05-17 | 2024-08-09 | 深圳市安科讯电子制造有限公司 | Multi-energy power supply combined type communication power supply voltage stabilizing protection method |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101674482A (en) * | 2009-09-25 | 2010-03-17 | 上海大学 | Method for optimized dispatching of extension type video flow in partially observational Markovian decision process |
KR101171027B1 (en) * | 2011-09-20 | 2012-08-08 | 김일동 | An on-line monitoring system for power facilities and its method using circuit parameter measuring |
US8745641B1 (en) * | 2011-07-14 | 2014-06-03 | Google Inc. | Automatic verification and anomaly detection in a representational state transfer (REST) application programming interface |
CN103888849A (en) * | 2014-04-11 | 2014-06-25 | 北京工业大学 | Computing and wireless resource cooperative dispatching method in mobile cloud video transmission |
CN106529715A (en) * | 2016-10-25 | 2017-03-22 | 哈尔滨工业大学 | Aeroengine maintenance strategy optimization method based on Markoff decision process |
-
2017
- 2017-12-14 CN CN201711340903.4A patent/CN108132423B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101674482A (en) * | 2009-09-25 | 2010-03-17 | 上海大学 | Method for optimized dispatching of extension type video flow in partially observational Markovian decision process |
US8745641B1 (en) * | 2011-07-14 | 2014-06-03 | Google Inc. | Automatic verification and anomaly detection in a representational state transfer (REST) application programming interface |
KR101171027B1 (en) * | 2011-09-20 | 2012-08-08 | 김일동 | An on-line monitoring system for power facilities and its method using circuit parameter measuring |
CN103888849A (en) * | 2014-04-11 | 2014-06-25 | 北京工业大学 | Computing and wireless resource cooperative dispatching method in mobile cloud video transmission |
CN106529715A (en) * | 2016-10-25 | 2017-03-22 | 哈尔滨工业大学 | Aeroengine maintenance strategy optimization method based on Markoff decision process |
Non-Patent Citations (2)
Title |
---|
基于模拟退火算法和隐马尔可夫模型的文本信息抽取;王宁,李石君;《微电子学与计算机》;20141231;第31卷(第12期);第52-56页 * |
大系统马尔可夫模型状态转移概率矩阵的快速形成方法;刘艳丽,余贻鑫;《天津大学学报》;20130930;第46卷(第9期);第791-798页 * |
Also Published As
Publication number | Publication date |
---|---|
CN108132423A (en) | 2018-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108132423B (en) | A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability | |
CN113935562B (en) | Intelligent grading and automatic early warning method for health condition of power equipment | |
CN103455563B (en) | A kind of data digging method being applicable to intelligent substation integrated monitoring system | |
CN107561997B (en) | A kind of power equipment state monitoring method based on big data decision tree | |
CN118655413A (en) | Integrated method and device for wide-area fault monitoring and precise positioning of distribution network | |
CN106649840A (en) | Method suitable for power data quality assessment and rule check | |
CN105701596A (en) | Method for lean distribution network emergency maintenance and management system based on big data technology | |
CN103728507A (en) | Grid fault diagnosis method based on data mining | |
CN103245881A (en) | Power distribution network fault analyzing method and device based on tidal current distribution characteristics | |
CN111913935A (en) | A SCADA system of wind farm centralized control center based on CIM model | |
Suo et al. | Computer Assistance Analysis of Power Grid Relay Protection Based on Data Mining. | |
CN119231744A (en) | Wind farm fault analysis method and system based on multi-dimensional information association | |
CN118504393A (en) | Autoformer prediction-based power distribution network digital twin body construction method and system | |
CN104317778A (en) | Massive monitoring data based substation equipment fault diagnosis method | |
Tong et al. | Online monitoring data processing method of transformer oil chromatogram based on association rules | |
CN119863234A (en) | Machine room equipment remote operation and maintenance management method and system based on Internet of things technology | |
CN108399579B (en) | Intelligent Analysis System for Monitoring Data Signals of Substation Equipment | |
CN110968703B (en) | Method and system for constructing abnormal metering point knowledge base based on LSTM end-to-end extraction algorithm | |
CN119337748A (en) | Virtual simulation training system for ground photovoltaic power station based on real-time operation data | |
Liu et al. | The design and implementation of the enterprise level data platform and big data driven applications and analytics | |
CN107818523A (en) | Power communication system data true value based on unstable frequency distribution and frequency factor study differentiates and estimating method | |
Xiaodong et al. | Artificial Intelligence in power multimodal data analysis | |
Li et al. | Research on intelligent algorithm-based power system fault prediction and diagnosis technology | |
Paeizi et al. | Data analytics applications in digital energy system operation | |
Khalyasmaa et al. | Fuzzy inference algorithms for power equipment state assessment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20191122 |
|
CF01 | Termination of patent right due to non-payment of annual fee |