CN108132423B

CN108132423B - A Fast Locating Method for Distortion of Power System Monitoring Data Based on State Transition Probability

Info

Publication number: CN108132423B
Application number: CN201711340903.4A
Authority: CN
Inventors: 李石君; 梁杰; 余放; 汪毅能; 杨济海
Original assignee: Wuhan University WHU
Current assignee: Wuhan University WHU
Priority date: 2017-12-14
Filing date: 2017-12-14
Publication date: 2019-11-22
Anticipated expiration: 2037-12-14
Also published as: CN108132423A

Abstract

The invention relates to a fast positioning method for monitoring data distortion in a power system based on the state transition probability. The method focuses on the state change probability of the monitoring data, and uses the characteristics of the probability distribution after multiple transfers and the threshold of normal data changes during equipment operation. Compare and export the distortion location matrix to quickly locate the location of the distortion data. The whole process is divided into four steps: 1. Monitoring data attribute entity division, 2. Power equipment monitoring data transfer probability, power equipment multiple monitoring transfer matrix, 4. Distortion data location and subsystem distortion degree measurement. The data acquisition stage of the present invention can unify various data formats or data structures into state transition probabilities, thereby avoiding the influence of different data formats in multi-source heterogeneous data on data analysis, and reducing the complexity of the analysis system.

Description

A Fast Location of Distortion in Power System Monitoring Data Based on State Transition Probability method

技术领域technical field

本发明属于电力通信数据与大数据技术相融合的应用研究，通过将检测数据的状态变化概率话，将电力通信系统中各数据的格式统一成数据状态的转移概率分布，再通过切普曼-柯尔莫哥洛夫方程对初始转移分布进行多次转移，得到多次转移的理论分布，通过各个数据的变化阈值限定，计算出能直接定位失真数据的失真定位矩阵。The present invention belongs to the applied research on the fusion of electric power communication data and big data technology. By analyzing the state change probability of the detection data, the format of each data in the electric power communication system is unified into the transition probability distribution of the data state, and then through the Chapman- The Kolmogorov equation performs multiple transfers on the initial transfer distribution to obtain the theoretical distribution of multiple transfers, and calculates the distortion location matrix that can directly locate the distorted data by limiting the change threshold of each data.

背景技术Background technique

电力通信系统是一个广义的概念，泛指与电力电网相关各子系统以及他们产生的数据信息，随着我国电力电网的不断发展，用电需求的不断扩大，电力通信系统中产生的数据也日益庞大起来，同时数据产生的速度也越来越快，不同子系统间的数据结构也有很大差异，电力通信系统产生的数据成为了典型的大数据。The power communication system is a broad concept, which generally refers to the subsystems related to the power grid and the data information generated by them. With the continuous development of my country's power grid and the continuous expansion of electricity demand, the data generated in the power communication system is also increasing. At the same time, the speed of data generation is getting faster and faster, and the data structure between different subsystems is also very different. The data generated by the power communication system has become a typical big data.

电力通信系统是保障电力系统正常运行的重要系统，通过各类传感器对设备进行监测，为设备故障提供决策，为设备维修提供依据。大型的电力通信系统产生海量的监测数据，这些数据在采集、录入、传输、交换与储存过程中不可避免的会出现数据失真现象。现实中，这些失真数据已经成为定位与分析电力设备故障的重要阻碍因素。提高电力通信系统的数据质量是完善电力电网系统的重要环节。国内外专家对电力系统中失真数据的检测提出了多种解决方法，某研究能量管理系统中出现数据失真的原因，然后从原因着手，解决数据失真的问题。某研究是从数据平台着手试图提高数据质量。某研究从插值拟合的角度来预测数据质量.某研究基于公共信息模型(Common Information Model,CIM)的高速模型交换格式CIM/E文本为载体的不同系统间的数据校验技术,采用改进的多源数据筛选较优质量数据的手段,以及根据主站状态估计对现场数据进行反馈的方法,提高了电网调度系统的整体数据质量。The power communication system is an important system to ensure the normal operation of the power system. It monitors equipment through various sensors, provides decision-making for equipment failures, and provides a basis for equipment maintenance. Large-scale power communication systems generate massive amounts of monitoring data, and data distortion will inevitably occur during the collection, entry, transmission, exchange and storage of these data. In reality, these distorted data have become an important obstacle to locate and analyze power equipment faults. Improving the data quality of the power communication system is an important part of perfecting the power grid system. Experts at home and abroad have proposed a variety of solutions to the detection of distorted data in power systems. A researcher studies the causes of data distortion in energy management systems, and then starts from the reasons to solve the problem of data distortion. A certain research started from the data platform to try to improve the data quality. A research predicts data quality from the perspective of interpolation fitting. A research based on the common information model (Common Information Model, CIM) high-speed model exchange format CIM/E text as the data verification technology between different systems, using the improved The means of multi-source data screening for better quality data, and the method of feedbacking field data according to the state estimation of the master station improve the overall data quality of the power grid dispatching system.

以上基于电力设备状态估计的不良数据检测在对待局系统局部数据的质量提高时有一定效果，但是对于整个电力通信系统产生的多源异构大数据仍不具备良好的适用性，并且针对每一种数据失真建立相应知识库的成本是相对较高的。本发明提出基于状态转移概率电力系统监测数据失真的快速定位，不关注电力监测数据的多源异构格式，转而重点关注数据的状态变化，重点考察监测数据变化的概率与设备实际变化是否一致，通过国家电网真实数据集验证算法，实验结果表明该方法适用于大数据环境下的电力通信系统产生的多源异构数据失真的快速定位。The above bad data detection based on the state estimation of power equipment has a certain effect in improving the quality of local data in the off-site system, but it still does not have good applicability to the multi-source heterogeneous big data generated by the entire power communication system, and is aimed at each The cost of establishing a corresponding knowledge base for such data distortion is relatively high. The present invention proposes the rapid positioning of power system monitoring data distortion based on the state transition probability, does not pay attention to the multi-source heterogeneous format of the power monitoring data, and instead focuses on the state change of the data, focusing on whether the probability of the monitoring data change is consistent with the actual change of the equipment , the algorithm is verified by the real data set of the State Grid, and the experimental results show that the method is suitable for the rapid location of multi-source heterogeneous data distortion generated by the power communication system in the big data environment.

发明内容Contents of the invention

电力通信系统是保障电力系统正常稳定运行的必要信息传输系统，通过各类传感器对电气设备进行实时监测，及时上报异常数据保障设备稳定运行，为设备故障提供决策支持，为设备维修提供定位依据。目前，随着电网系统的规模越来越大以及各类电力设备监测传感器的种类逐渐增加，电力通信系统每天都在产生海量的监测数据，这些数据在采集、录入、传输、交换与储存过程中不可避免的会出现数据失真现象。实际上，在大数据时代，此类失真数据已经成为定位与分析电力设备故障的重要阻碍因素之一。电力通信系统中出现的数据失真主要包括以下两个方面：The power communication system is a necessary information transmission system to ensure the normal and stable operation of the power system. It monitors electrical equipment in real time through various sensors, reports abnormal data in time to ensure stable operation of equipment, provides decision support for equipment failures, and provides positioning basis for equipment maintenance. At present, with the increasing scale of the power grid system and the increasing types of monitoring sensors for various power equipment, the power communication system generates a large amount of monitoring data every day. These data are collected, entered, transmitted, exchanged and stored. Data distortion is inevitable. In fact, in the era of big data, such distorted data has become one of the important obstacles to locating and analyzing power equipment faults. Data distortion in power communication systems mainly includes the following two aspects:

1.违反监测数据一致性，监测数据一致是指，系统实际记录到的数据是否满足一定的函数依赖或逻辑关系，是否有超出属性定义域的数据。1. Violation of monitoring data consistency. Monitoring data consistency refers to whether the data actually recorded by the system satisfies a certain functional dependency or logical relationship, and whether there is data beyond the domain of the attribute definition.

2.违反监测数据完整性，监测数据完整性是电力信息系统实际录入的数据是存在缺失，是否完全记录了按设计要求记录的全部数据。2. Violation of the integrity of the monitoring data. The integrity of the monitoring data refers to whether the data actually entered in the power information system is missing, and whether all the data recorded according to the design requirements are completely recorded.

针对目前电力通信系统中出现的数据质量偏低问题，本发明旨在建立一种自动对电力数据进行失真识别、定位与失真程度度量的随机过程快速判别方法，本方法的关注重点是电力数据状态的变化与实际设备运行状态变化的一致性。将电力通信系统中各数据的格式统一成数据状态的转移概率分布，再通过切普曼-柯尔莫哥洛夫方程对初始转移分布进行多次转移，得到多次转移的理论分布，通过各个数据的变化阈值限定，计算出能直接定位失真数据的失真定位矩阵。Aiming at the problem of low data quality in the current power communication system, the present invention aims to establish a random process rapid discrimination method for automatic distortion identification, location and distortion degree measurement of power data. The focus of this method is the status of power data The consistency between the change and the actual equipment running state change. Unify the format of each data in the power communication system into the transition probability distribution of the data state, and then perform multiple transfers on the initial transfer distribution through the Chapman-Kolmogorov equation to obtain the theoretical distribution of multiple transfers. The data change threshold is limited, and the distortion location matrix that can directly locate the distortion data is calculated.

为完成以上目标，本发明整体包含四个步骤，整体流程图见附图1For accomplishing above object, the present invention comprises four steps as a whole, and overall flow chart is shown in accompanying drawing 1

步骤1监测数据属性实体划分Step 1 Monitor data attribute entity division

监测数据采集本质上是指电力电网系统中，各类传感器对设备进行监测并将监测数据传送到指定位置储存的过程。在不同的子系统中监测数据具有多种储存机制，本步骤的目的是使得采集到的数据按照数据源的实体设备划分为实体数据集合。本步骤分为3个子步骤：Monitoring data collection essentially refers to the process in which various sensors monitor equipment in the power grid system and transmit the monitoring data to a designated location for storage. There are multiple storage mechanisms for monitoring data in different subsystems. The purpose of this step is to divide the collected data into entity data sets according to the entity equipment of the data source. This step is divided into 3 sub-steps:

步骤1.1原始监测数据采集Step 1.1 Raw monitoring data collection

定义1原始数据集合Definition 1 Raw data set

D＝{d₁₁,d₁₂,...,d₂₁,d₂₂,...,d_n1,d_n2,...,d_nk}D＝{d ₁₁ ,d ₁₂ ,...,d ₂₁ ,d ₂₂ ,...,d _n1 ,d _n2 ,...,d _nk }

其中，D表示系统中采集到的各类原始数据的属性集合，d_ij表示第i设备的第k属性，在实际信息采集中，由于电网各子系统的数据采集方式不尽相同，所以将各子系统采集后的数据汇总后往往是杂乱的数据集合。Among them, D represents the attribute collection of various raw data collected in the system, and d _ij represents the k-th attribute of the i-th device. In actual information collection, because the data collection methods of each subsystem of the power grid are different, so each After the data collected by the subsystem is summarized, it is often a messy data collection.

步骤1.2监测数据按实体来源地址分类Step 1.2 Monitoring data is classified by entity source address

在数据采集过程，将数据按照来源索引字段进行分类，此步骤分为两种形式。In the data collection process, the data is classified according to the source index field. This step is divided into two forms.

形式一：已采集数据的分类Form 1: Classification of collected data

对于形如定义1中已采集到的数据，需要按照数据属性中的数据源索引字段分类，将来源于相同实体的监测数据分为一组For the collected data in the form of definition 1, it needs to be classified according to the data source index field in the data attribute, and the monitoring data from the same entity should be divided into one group

形式二：按实体采集监测数据Form 2: Collect monitoring data by entity

对于可按实体输出监测数据的子系统，直接采集其监测数据，并在数据中注明实体的唯一属性。For subsystems that can output monitoring data by entity, directly collect the monitoring data, and indicate the unique attribute of the entity in the data.

定义2设备数据向量Definition 2 device data vector

d_i＝(d_i1,d_i2,...,d_ik)d _i ＝(d _i1 ,d _i2 ,...,d _ik )

其中，d_i表示来自于第i(1≤i≤n)设备的监测数据向量，d_ij表示第i设备的第j，(1≤j≤k)属性。这样按任意实体设备i的数据源归类的数据以向量的形式被d_i记录Among them, d _i represents the monitoring data vector from the i-th (1≤i≤n) device, and d _ij represents the j-th (1≤j≤k) attribute of the i-th device. In this way, the data classified according to the data source of any physical device i is recorded by d _i in the form of vector

步骤1.3构造属性扩展矩阵Step 1.3 Construct attribute expansion matrix

在电力通信系统中，电力电网包含各种子系统，各类设备通过协同工作来支撑一个子系统的正常运行，一套子系统的数据变化可以通过属性扩展矩阵来形式化。In the power communication system, the power grid contains various subsystems, and all kinds of equipment work together to support the normal operation of a subsystem. The data changes of a set of subsystems can be formalized through the attribute expansion matrix.

定义3属性扩展矩阵Definition 3 attribute expansion matrix

其中，M(t)表示t时刻的属性扩展矩阵，d_ij(t)表示第i(1≤i≤n)设备的第j，(1≤j≤k_i≤k)属性在时刻t时的监测值。上式中定义了设备拥有的属性数量的上界k，其中k＝max(k₁,k₂,...)表示子系统中拥有最多属性设备的属性个数，它规定了扩展矩阵的列数。电力子系统中不同设备的属性个数是可以不同的，这样扩展矩阵M(t)中的多数行向量没有定义的属性值，称此类没有定义的属性为扩展属性，他们的作用在于保持扩展矩阵的矩形结构以便于接下来的数学处理。扩展矩阵M(t)完整包含了电力子系统中在时刻t的属性值。Among them, M(t) represents the attribute expansion matrix at time t, d _ij (t) represents the jth of the i (1≤i≤n) device, and (1≤j≤k _i ≤k) attributes at time t monitoring value. The upper bound k of the number of attributes owned by the equipment is defined in the above formula, where k=max(k ₁ ,k ₂ ,...) indicates the number of attributes of the equipment with the most attributes in the subsystem, and it specifies the columns of the extended matrix number. The number of attributes of different devices in the power subsystem can be different, so most of the row vectors in the extended matrix M(t) have no defined attribute values, and such undefined attributes are called extended attributes, and their role is to maintain the extended The rectangular structure of the matrix is convenient for subsequent mathematical processing. The extended matrix M(t) completely contains the attribute values of the power subsystem at time t.

步骤2电力设备的监测数据转移概率Step 2. Monitoring data transfer probability of power equipment

本步骤分为3个子步骤This step is divided into 3 sub-steps

步骤2.1数据的状态划分Step 2.1 State division of data

电力设备各个属性的值都有一定范围正常域，当某一属性值超出其正常域时，称该属性出现异常值。对于离散型属性值，其定义域是可数的离散的点。对于连续型属性值，其定义域是连续的区间。根据具体的电力设备，可以将监测到的不同的数据值根据其定义域，划分到不同的状态中。当设备处于正常状态时，监测数据的状态称为稳定态。对于离散型数据值，可根据具体数据属性特点将不同的点归为一类，组成一个状态。也可以直接将每一个点视为一个状态。对于连续型数据值，可以将连续数值区间按具体特征划分为片段，每个片段为一个状态。The value of each attribute of electrical equipment has a certain range of normal range. When a certain attribute value exceeds its normal range, it is said that the attribute has an abnormal value. For discrete attribute values, its definition domain is countable discrete points. For continuous attribute values, its definition domain is a continuous interval. According to specific electrical equipment, different monitored data values can be divided into different states according to their definition domains. When the equipment is in a normal state, the state of the monitoring data is called steady state. For discrete data values, different points can be classified into one category according to specific data attribute characteristics to form a state. You can also directly treat each point as a state. For continuous data values, the continuous value interval can be divided into segments according to specific characteristics, and each segment is a state.

该步骤的作用是将设备的数据以离散的状态进行描述，以监测数据在状态中的转移来刻画数据的变化。The function of this step is to describe the data of the device in a discrete state, to monitor the transfer of data in the state to describe the change of the data.

步骤2.2设备的状态Step 2.2 Status of the device

电力设备可运行在不同的状态中，不同的设备状态代表了设备运行的阶段特征。例如，变压器的运行状态可以分为正常运行，高温运行，设备异常。不同的设备拥有不同的运行状态，而电力数据不一致的本质表现是实际设备的运行状态与监测数据状态的不一致，即监测数据不能真实反映设备实际运行情况，设备的运行状态多种多样，而数据状态的种类更多。通常一个设备状态对应于一系列数据状态的特定组合。在过去，找到其中不一致的对应关系是复杂的，而本发明采取的技术方式是，不关注设备与数据的状态本身，而是通过考察这些状态的变化，定位到不一致的对应关系。Electric equipment can operate in different states, and different equipment states represent the stage characteristics of equipment operation. For example, the operating status of a transformer can be divided into normal operation, high temperature operation, and equipment abnormality. Different equipment has different operating states, and the essence of power data inconsistency is the inconsistency between the actual operating state of the equipment and the status of the monitoring data, that is, the monitoring data cannot truly reflect the actual operating status of the equipment, and the operating status of the equipment is varied, while the data There are more types of states. Usually a device state corresponds to a specific combination of a series of data states. In the past, it was complicated to find the inconsistent correspondence, but the technical method adopted by the present invention is not to pay attention to the state of the equipment and data itself, but to locate the inconsistent correspondence by examining the changes of these states.

当电力系统中的设备稳定运行时，设备的各类监测数据值也应该稳定在一定范围中，同时其变动规则也具有稳定性。当设备运行出现状态改变时，一部分监测数据就会更大的概率偏离原来的状态，进入新的状态，从而打破之前这种稳定规则。When the equipment in the power system is running stably, the values of various monitoring data of the equipment should also be stable within a certain range, and its change rules are also stable. When the state of the equipment changes, a part of the monitoring data will deviate from the original state with a greater probability and enter a new state, thus breaking the previous stable rules.

该步骤建立了以概率方式来描述数据状态转移的通道，即数据状态的转移可以按照概率分布的数学形式描述。This step establishes a channel to describe the data state transition in a probabilistic manner, that is, the data state transition can be described in the mathematical form of probability distribution.

步骤2.3统计历史监测数据的状态转移频率Step 2.3 Statistical state transition frequency of historical monitoring data

定义4数据状态转移频率Definition 4 Data State Transition Frequency

其中，f_ij表示对设备监测N_i+N_j次后数据从状态i转移到状态j的频率，N_i表示处于状态i的次数，N_j表示处于状态j的次数。Among them, f _ij represents the frequency of data transfer from state i to state j after N _i + N _j times of equipment monitoring, N _i represents the number of times in state i, and N _j represents the number of times in state j.

定律1伯努利大数定律Law 1 Bernoulli's law of large numbers

其中，P表示概率，ε表示一个正数，伯努利大数定律表明，通过大量搜集电力设备监测数据的状态转移数据，计算得出的状态转移的频率会依概率收敛。该定律表明可以通过多次监测当期数据或直接统计历史数据的方式来估算电力设备的状态转移概率。Among them, P represents probability, and ε represents a positive number. Bernoulli's law of large numbers shows that the frequency of state transition calculated by collecting a large number of state transition data of power equipment monitoring data will converge according to probability. This law shows that the state transition probability of power equipment can be estimated by monitoring current data multiple times or directly counting historical data.

步骤3电力设备的多次监测转移矩阵Step 3 Multi-monitoring transfer matrix of power equipment

本步骤分为4个子步骤This step is divided into 4 sub-steps

步骤3.1数据转移概率的马尔可夫性与时齐性Step 3.1 Markov property and time homogeneity of data transition probability

由于每次监测数据本质是在离散的时间点上的采样，同时依步骤2.2的状态划分方法，监测数据也被划分为离散的状态，所以电力监测数据的状态转移本质上是一个时间与状态都离散的随机过程。由于每次监测，数据所处的状态只与上一次所处的状态的相关(即以上一次所在状态为初始态进行概率转移)，所以此随机过程具有马尔科夫性，本质是马尔科夫链。对数据的每一次监测采样获得的结果与第一次监测所处的状态是无关的，这说明数据的状态转移概率亦具有时齐性。此步骤建立了状态转移概率与马尔科夫链的联系，在理论上说明设备监测数据的转移概率可以构建具有马尔科夫性的转移矩阵。Since each monitoring data is essentially sampled at a discrete time point, and the monitoring data is also divided into discrete states according to the state division method in step 2.2, the state transition of power monitoring data is essentially a time and state transition. Discrete random process. Since each monitoring, the state of the data is only related to the previous state (that is, the previous state is the initial state for probability transfer), so this random process has Markov properties, and its essence is a Markov chain . The result of each monitoring sampling of the data has nothing to do with the state of the first monitoring, which shows that the state transition probability of the data is also time-homogeneous. This step establishes the connection between the state transition probability and the Markov chain, which theoretically shows that the transition probability of equipment monitoring data can construct a transition matrix with Markov properties.

步骤3.2电力数据的多次转移矩阵Step 3.2 Multiple transfer matrix of power data

定义6单次状态转移矩阵Define 6 single state transition matrix

其中，E表示电力数据的单次状态转移矩阵，其元素p_ij表示数据从状态i经历一次监测时间间隔后转移到状态j的概率。其概率值可以通过步骤2.3中所述方法统计而来。单次状态转移矩阵E完整描述了任一电力数据经历一次监测后所有状态转移的概率分布情况。根据切普曼-柯尔莫哥洛夫方程Among them, E represents a single state transition matrix of electric power data, and its element p _ij represents the probability of data transitioning from state i to state j after a monitoring time interval. Its probability value can be calculated by the method described in step 2.3. The single state transition matrix E fully describes the probability distribution of all state transitions after any power data undergoes one monitoring. According to the Chapman-Kolmogorov equation

可以得到电力数据经过任意次状态转移后的转移矩阵，The transition matrix of the power data after any number of state transitions can be obtained,

由切普曼-柯尔莫哥洛夫方程可知，只需通过步骤3.2方法构造电力数据的单次状态转移矩阵，就能直接计算出任意m+n次后的状态转移矩阵，E^(n+m)中i行j列的元素表示数据以初始状态i被监测，而后经历m+n次监测后，数据处于状态j的概率。通过这种计算方法得出来的多次转移概率的实际意义是指当电力设备处于稳定状态时，数据应该达到的理论状态转移概率。From the Chapman-Kolmogorov equation, we can directly calculate the state transition matrix after any m+n times, E ⁽ⁿ⁺ The element in row i and column j in ^m) represents the probability that the data is monitored in the initial state i, and then after m+n times of monitoring, the data is in state j. The actual meaning of the multiple transition probability obtained by this calculation method refers to the theoretical state transition probability that the data should reach when the power equipment is in a steady state.

步骤3.4次转移后所处的理论概率分布Step 3. Theoretical probability distribution after 4 transitions

定义7电力数据初始分布Definition 7 Initial distribution of power data

其中，φ(0)表监测示数据的初始分布向量，元素(0≤j≤n)表示数据在初始时刻0所处的各个状态的概率。根据切普曼-柯尔莫哥洛夫方程，可以得到任意m+n时刻的理论概率分布。Among them, φ(0) represents the initial distribution vector of monitoring data, and the element (0≤j≤n) represents the probability of each state of the data at the initial time 0. According to the Chapman-Kolmogorov equation, the theoretical probability distribution at any m+n time can be obtained.

定理1多次转移状态的理论概率分布Theorem 1 Theoretical probability distribution of multiple transition states

φ(m+n)＝φ(0)E^m+n φ(m+n)=φ(0)E ^m+n

其中，φ(m+n)表示计算出的电力数据经历m+n次转移后所处的理论概率分布Among them, φ(m+n) represents the theoretical probability distribution of the calculated power data after m+n transfers

步骤4失真数据定位与子系统失真程度度量Step 4 Distortion data location and measurement of subsystem distortion degree

设备在一定时间段内处于稳定状态是指设备在该时间段内未发生状态的改变，对应的监测数据的状态概率分布反映出了设备在该状态下应有的数据状态概率模式特征。一旦电力通信系统检测到当前电力设备的状态不符合相应的状态转移概率的分布特征时，则可以推断系统可能出现了监测数据失真。监测周期内，如果设备运行状态未发生变化，而相应数据状态发生变化，称这种情况为第一类失真。如果设备运行状态发生变化，而数据状态未发生相应变化，称这种情况为第二类失真。本步骤分为5个子步骤The equipment is in a stable state within a certain period of time, which means that the equipment does not change its state within this period of time, and the corresponding state probability distribution of monitoring data reflects the characteristics of the data state probability mode that the equipment should have in this state. Once the power communication system detects that the current state of the power equipment does not conform to the distribution characteristics of the corresponding state transition probability, it can be inferred that the monitoring data may be distorted in the system. During the monitoring period, if the operating state of the equipment does not change, but the corresponding data state changes, this situation is called the first type of distortion. If the operating state of the device changes without a corresponding change in the state of the data, this is called Type 2 distortion. This step is divided into 5 sub-steps

步骤4.1电力数据状态转移分布的偏离度Step 4.1 Deviation degree of power data state transition distribution

定义9电力数据状态转移分布的偏离度Definition 9 Deviation Degree of Power Data State Transition Distribution

δ＞v(m+n)＝||φ′(m+n)-φ(m+n)||₂ δ＞v(m+n)＝||φ′(m+n)-φ(m+n)|| ₂

其中v(m+n)表示电力数据状态转移分布的偏离度，φ′(m+n)是电力设备在经历m+n次监测后实际的统计出的转移概率分布，φ(m+n)是电力设备经历m+n次监测后的理论概率分布，通过向量的2-范数度量这两种分布的偏离度。δ＞0是一个可以根据实际情况定义的偏离阈值，它由子系统中设备的实际固有的状态转移特征决定。Among them, v(m+n) represents the deviation degree of the power data state transition distribution, φ′(m+n) is the actual statistical transition probability distribution of the power equipment after m+n times of monitoring, φ(m+n) is the theoretical probability distribution of power equipment after m+n times of monitoring, and the deviation between the two distributions is measured by the 2-norm of the vector. δ>0 is a deviation threshold that can be defined according to the actual situation, and it is determined by the actual inherent state transition characteristics of the equipment in the subsystem.

当δ＞v(m+n)时，说明设备数据的实际状态转移分布偏离程度在合理范围内。When δ>v(m+n), it indicates that the deviation degree of the actual state transition distribution of the equipment data is within a reasonable range.

当δ≤v(m+n)时，说明设备数据的实际转移分布偏离度超出阈值。When δ≤v(m+n), it indicates that the deviation degree of the actual transfer distribution of the device data exceeds the threshold.

当设备稳定运行时，出现δ≤v(m+n)，表明设备数据出现第一类失真征兆，因为数据在某些时刻以较大概率出现在了不常出现的状态中。当设备运行过程中出现变化时，出现δ＞v(m+n)，表明设备出现第二类失真征兆，因为随着设备的运行状态的变化，监测数据理应发生相应状态变化，而实际状态分布并未发生较大变化。When the device is running stably, δ≤v(m+n) appears, indicating that the device data has the first type of distortion symptoms, because the data appears in an infrequent state with a high probability at certain moments. When there is a change during the operation of the equipment, δ>v(m+n) appears, indicating that the equipment has the second type of distortion symptoms, because as the operating state of the equipment changes, the monitoring data should undergo corresponding state changes, while the actual state distribution No major changes occurred.

步骤4.2子系统中失真数据的定位Step 4.2 Localization of distorted data in subsystems

定义10子系统的监测数据偏离矩阵Definition 10 Subsystem monitoring data deviation matrix

电网子系统是包含系统相应设备的有机整体，通过偏离矩阵可以快速定位子系统中失真的数据。The grid subsystem is an organic whole including the corresponding equipment of the system, and the distorted data in the subsystem can be quickly located through the deviation matrix.

其中，V(t)表示子系统的监测数据经历t个标准周期后的偏离矩阵，元素v_ij(t)表示经历t个标准周期后子系统中第i个设备的第j个属性的数据状态转移分布的偏离度。根据步骤2.1中扩展矩阵的性质，如果矩阵相应位置对应的设备不存在该属性，则命令元素值为-1。Among them, V(t) represents the deviation matrix of the monitoring data of the subsystem after t standard periods, and the element v _ij (t) represents the data status of the jth attribute of the i-th device in the subsystem after t standard periods Skewness of the transfer distribution. According to the properties of the extended matrix in step 2.1, if the attribute does not exist in the device corresponding to the corresponding position of the matrix, the value of the command element is -1.

步骤4.3进行失真定位，基于以下公式：Step 4.3 performs distortion localization, based on the following formula:

定义11失真定位矩阵Definition 11 Distortion Localization Matrix

其中，表示一个n×k的失真定位矩阵，元素只由数值0、1和-1组成，F表示矩阵0-1变换函数，它的功能是将矩阵中元素进行0-1变换，Δ(t)表示偏离阈矩阵，其元素δ_ij(t)表示对应于偏离度v_ij(t)的偏离阈值，函数f(z_ij∈V(t)-Δ(t))表示对其自变量进行0-1变换，其定义如下定义11元素0-1变换函数in, Represents an n×k distortion positioning matrix, the elements are only composed of values 0, 1 and -1, F represents the matrix 0-1 transformation function, its function is to perform 0-1 transformation on the elements in the matrix, Δ(t) represents Deviation threshold matrix, its element δ _ij (t) represents the deviation threshold corresponding to the degree of deviation v _ij (t), and the function f(z _ij ∈ V(t)-Δ(t)) represents the 0-1 Transformation, which is defined as follows: define an 11-element 0-1 transformation function

失真定位矩阵的实际意义在于把所有超出阈值的元素标记为1而未超过阈值的标记为0，元素在矩阵中的位置又一一对应于数据在子系统中的位置。这样一个失真定位矩阵就包含了一个子系统中失真数据的变化状态与位置信息。The actual significance of the distortion localization matrix is to mark all elements that exceed the threshold as 1 and those that do not exceed the threshold as 0, and the positions of the elements in the matrix correspond to the positions of the data in the subsystem one by one. Such a distortion localization matrix contains the change state and position information of the distortion data in a subsystem.

步骤4.4缺失数据的处理Step 4.4 Handling of missing data

数据缺失是也数据失真的一种表现，无论设备真实的运行状态如何，只要监测数据出现数据空值，则认为是数据失真，此时执行缺失数据变换Missing data is also a manifestation of data distortion. Regardless of the actual operating status of the equipment, as long as there is a data null value in the monitoring data, it is considered to be data distortion. At this time, the missing data transformation is performed.

其中，g(d_ij)表示缺失变换函数，它将M(t)中所有为空值的元素映射为1，并进入当空值数据倍缺失变换函数映射为中的1元素时，表明其失真的属性。在完整执行一次失真定位后，需要将本轮的原始数据作为历史数据对状态转移频率进行更新。这样就能保证步骤3.2中构造的单次状态转移矩阵的概率元素具有实时准确性。Among them, g(d _ij ) represents the missing transformation function, which maps all null elements in M(t) to 1, and enters When the null value data is missing, the transformation function is mapped to When the 1 element in , indicates its distortion property. After a complete distortion location is performed, the original data of this round needs to be used as historical data to update the state transition frequency. In this way, the probability elements of the single state transition matrix constructed in step 3.2 can be guaranteed to have real-time accuracy.

步骤4.5数据失真定位，与电力子系统失真程度度量。Step 4.5 Data distortion location, and power subsystem distortion degree measurement.

遍历失真定位矩阵一次，记录每一等于1的元素所在矩阵中位置一一对应了一个失真数据所在位置，其行标代表设备号。列表代该设备的属性号。同时取出元素占所有元素的比例定量反映了电力子系统失真程度。Traverse the distortion location matrix once, and record the location of each element equal to 1 in the matrix corresponding to the location of a distortion data, and its row label represents the device number. The list represents the property number of the device. The ratio of extracted elements to all elements quantitatively reflects the degree of distortion of the power subsystem.

需要注意的是：步骤3.1中涉及到马氏性证明It should be noted that: step 3.1 involves the Markov property proof

证明：电力数据状态转移过程{X_n,n＝0,1,2,…}是马尔科夫链Proof: the power data state transition process {X _n ,n=0,1,2,…} is a Markov chain

由于n是有限可列的，并且及状态i,j,i₀,i₁,…,i_n-1总存在条件概率P(X_n+1＝j|X₀＝i₀,X₁＝i₁,…,X_n-1＝i_n-1,X_n＝i)使得P(X_n+1＝j|X₀＝i₀,X₁＝i₁,…,X_n-1＝i_n-1,X_n＝i)＝P(X_n+1＝j|X_n＝i)Since n is finitely listable, and And state i,j,i ₀ ,i ₁ ,…,i _n-1 total existence conditional probability P(X _n+1 =j|X ₀ =i ₀ ,X ₁ =i ₁ ,…,X _n-1 = i _n-1 ,X _n =i) such that P(X _n+1 =j|X ₀ =i ₀ ,X ₁ =i ₁ ,...,X _n-1 =i _n-1 ,X _n =i)= P(X _n+1 ＝j|X _n ＝i)

即采样过程满足马氏性，{X_n,n＝0,1,2,…}为一个马尔可夫链。That is, the sampling process satisfies the Markov property, and {X _n , n=0, 1, 2,...} is a Markov chain.

因此，本发明具有如下优点：1、本发明数据采集阶能将各种数据格式或数据结构统一成状态转移概率，因此规避了多源异构数据中不同数据格式对数据分析造成的影响，降低了分析系统的复杂度。2、数据的失真判别是可以实时动态进行的，理论上只要给定某一时刻的数据状态初始分布，就能计算出此时的数据失真情况。同时失真的数据的又会返回更新历史数据。3、数据失真判别与定位具有同时性，当数据失真被判定后，失真的位置也同时根据失真定位矩阵被确定，可以缩短设备故障的抢修时间。Therefore, the present invention has the following advantages: 1. The data collection stage of the present invention can unify various data formats or data structures into a state transition probability, thus avoiding the impact of different data formats in multi-source heterogeneous data on data analysis, reducing analyzed the complexity of the system. 2. Data distortion discrimination can be performed dynamically in real time. In theory, as long as the initial distribution of the data state at a certain moment is given, the data distortion at that time can be calculated. At the same time, the distorted data will return to update the historical data. 3. Data distortion discrimination and positioning are simultaneous. When data distortion is determined, the location of the distortion is also determined according to the distortion positioning matrix, which can shorten the repair time of equipment failure.

附图说明Description of drawings

附图1是本发明的整体流程示意图。Accompanying drawing 1 is the overall flow diagram of the present invention.

具体实施方式Detailed ways

监测数据采集本质上是指电力电网系统中，各类传感器对设备进行监测并将监测数据传送到指定位置储存的过程。在不同的子系统中监测数据具有多种储存机制，本步骤的目的是使得采集到的数据按照数据源的实体设备划分为实体数据集合。本步骤分为3个子步骤Monitoring data collection essentially refers to the process in which various sensors monitor equipment in the power grid system and transmit the monitoring data to a designated location for storage. There are multiple storage mechanisms for monitoring data in different subsystems. The purpose of this step is to divide the collected data into entity data sets according to the entity equipment of the data source. This step is divided into 3 sub-steps

步骤1.1原始监测数据采集Step 1.1 Raw monitoring data collection

定义1原始数据集合Definition 1 Raw data set

在数据采集过程，将数据按照来源索引字段进行分类，此步骤分为两种形式。形式一：已采集数据的分类In the data collection process, the data is classified according to the source index field. This step is divided into two forms. Form 1: Classification of collected data

定义2设备数据向量Definition 2 device data vector

d_i＝(d_i1,d_i2,...,d_ik)d _i ＝(d _i1 ,d _i2 ,...,d _ik )

其中，d_i表示来自于第i(1≤i≤n)设备的监测数据向量，d_ij表示第i设备的第j，(1≤j≤k)属性。这样按任意实体i设备的数据源归类的数据以向量的形式被d_i记录Among them, d _i represents the monitoring data vector from the i-th (1≤i≤n) device, and d _ij represents the j-th (1≤j≤k) attribute of the i-th device. In this way, the data classified according to the data source of any entity i device is recorded by d _i in the form of vector

步骤1.3构造属性扩展矩阵Step 1.3 Construct attribute expansion matrix

定义3属性扩展矩阵Definition 3 attribute expansion matrix

本步骤分为3个子步骤This step is divided into 3 sub-steps

步骤2.1数据的状态划分Step 2.1 State division of data

步骤2.2设备的状态Step 2.2 Status of the device

定义4数据状态转移频率Definition 4 Data State Transition Frequency

定律1伯努利大数定律Law 1 Bernoulli's law of large numbers

本步骤分为4个子步骤This step is divided into 4 sub-steps

定义6单次状态转移矩阵Define 6 single state transition matrix

定义7电力数据初始分布Definition 7 Initial distribution of power data

φ(m+n)＝φ(0)E^m+n φ(m+n)=φ(0)E ^m+n

其中，φ(m+n)表示计算出的电力数据经历m+n次转移后所处的理论概率分布。Among them, φ(m+n) represents the theoretical probability distribution of the calculated power data after m+n transfers.

步骤4失真数据定位Step 4 Distortion data localization

其中v(m+n)表示电力数据状态转移分布的偏离度，φ′(m+n)是电力设备在经历m+n次监测后实际的统计出的转移概率分布，φ(m+n)是电力设备经历m+n次监测后的理论概率分布，通过向量的2-范数度量这两种分布的偏离度。δ＞0是一个可以根据实际情况定义的偏离阈值，当δ＞v(m+n)时，说明设备数据的实际状态转移分布偏离程度在合理范围内，当δ≤v(m+n)时，说明设备数据的实际转移分布偏离度超出阈值。Among them, v(m+n) represents the deviation degree of the power data state transition distribution, φ′(m+n) is the actual statistical transition probability distribution of the power equipment after m+n times of monitoring, φ(m+n) is the theoretical probability distribution of power equipment after m+n times of monitoring, and the deviation between the two distributions is measured by the 2-norm of the vector. δ>0 is a deviation threshold that can be defined according to the actual situation. When δ>v(m+n), it means that the deviation degree of the actual state transition distribution of the equipment data is within a reasonable range. When δ≤v(m+n) , indicating that the deviation degree of the actual transfer distribution of the device data exceeds the threshold.

步骤4.3Step 4.3

定义11失真定位矩阵Definition 11 Distortion Localization Matrix

步骤4.4缺失数据的处理Step 4.4 Handling of missing data

本文中所描述的具体实施例仅仅是对本发明精神作举例说明。本发明所属技术领域的技术人员可以对所描述的具体实施例做各种各样的修改或补充或采用类似的方式替代，但并不会偏离本发明的精神或者超越所附权利要求书所定义的范围。The specific embodiments described herein are merely illustrative of the spirit of the invention. Those skilled in the art to which the present invention belongs can make various modifications or supplements to the described specific embodiments or adopt similar methods to replace them, but they will not deviate from the spirit of the present invention or go beyond the definition of the appended claims range.

Claims

1. A rapid positioning method for monitoring data distortion of an electric power system based on state transition probability is characterized by comprising the following steps:

step 1, attribute entity division of monitoring data, acquisition of original monitoring data of equipment, and division of the acquired original monitoring data into entity data sets according to entity equipment of a data source, specifically comprising:

step 1.1, collecting original monitoring data, and classifying the collected original detection data into a set according to the following definitions:

definition 1, raw data set

D＝{d₁₁,d₁₂,...,d₂₁,d₂₂,...,d_n1,d_n2,...,d_nk}

Wherein D represents the attribute set of various types of raw data collected in the system, D_ijThe j attribute of the ith equipment is represented, and in the actual information acquisition, because the data acquisition modes of all subsystems of the power grid are different, the data acquired by all subsystems are gathered and are often a disordered data set;

step 1.2, classifying the acquired original monitoring data according to the entity source address, classifying the data according to the source index field in the data acquisition process, and selecting the following form for classification according to the type of the data:

classification type one: for classification of collected data, for the collected data in definition 1, it is necessary to classify the collected data according to the data source index field in the data attribute, and classify the monitoring data from the same entity into a group

Classification type two: classifying the collected monitoring data according to the entity, directly collecting the monitoring data of a subsystem which can output the monitoring data according to the entity, and indicating the unique attribute of the entity in the data;

defining 2 device data vectors

d_i＝(d_i1,d_i2,...,d_ik)

Wherein d is_iRepresenting the vector of monitored data from the ith (1. ltoreq. i.ltoreq.n) device, d_ijJ (1 is not less than j not more than k) attribute of the ith device; data sorted by data source of arbitrary entity device i is represented in vector form by d_iRecording

Step 1.3, constructing an attribute expansion matrix: in an electric power communication system, an electric power grid comprises various subsystems, various devices support the normal operation of one subsystem through cooperative work, and data change of one subsystem can be formalized through an attribute expansion matrix, wherein the attribute expansion matrix is defined based on the following definition

Defining 3-Attribute extension matrices

Where M (t) represents the attribute expansion matrix at time t, d_ij(t) represents the j-th (1. ltoreq. i.ltoreq.n) of the i-th plant (1. ltoreq. j.ltoreq.k)_iK) is less than or equal to the monitoring value of the attribute at the moment t; the above equation defines an upper bound k on the number of attributes owned by the device, where k is max (k)₁,k₂,..) represents the number of attributes of the device with the most attributes in the subsystem, which specifies the number of columns of the spreading matrix; the number of attributes of different devices in the power subsystem can be different, so that most row vectors in the expansion matrix M (t) have undefined attribute values, the undefined attributes are called as expansion attributes, and the undefined attributes are used for maintaining the rectangular structure of the expansion matrix so as to facilitate the subsequent mathematical processing; the expansion matrix M (t) completely comprises the attribute value of the power subsystem at the time t;

step 2, obtaining the monitoring data transfer probability of the power equipment, specifically comprising:

step 2.1 State partitioning of data

The values of all the attributes of the power equipment have a normal domain in a certain range, and when a certain attribute value exceeds the normal domain, the attribute is called to have an abnormal value; for a discrete attribute value, its domain is a countable discrete point; for the continuous type attribute value, the definition domain is a continuous interval;

according to specific power equipment, dividing the monitored different data values into different states according to the definition domains of the data values;

when the equipment is in a normal state, the state of monitoring data is called a stable state;

for discrete data values, different points can be classified into one type according to specific data attribute characteristics to form a state, or each point is directly regarded as a state; for continuous data values, or dividing a continuous numerical value interval into segments according to specific characteristics, wherein each segment is in a state;

step 2.2 State of the devices

The power equipment can operate in different states, and the different equipment states represent the stage characteristics of the equipment operation; different devices have different running states, and the essential expression of inconsistent power data is that the running state of the actual device is inconsistent with the state of the monitored data, namely the monitored data cannot truly reflect the actual running condition of the device, the running states of the devices are various, and the types of the data states are more; a device state corresponds to a particular combination of a series of data states;

when equipment in the power system stably runs, various monitoring data values of the equipment should be stabilized in a certain range, and meanwhile, the change rule of the equipment also has stability; when the state of the equipment is changed during operation, a part of monitoring data deviates from the original state with higher probability and enters a new state, so that the previous stable rule is broken;

step 2.3, counting the state transition frequency of the historical monitoring data, and obtaining the state transition frequency based on the following definitions:

defining 4 data State transition frequencies

Wherein f is_ijIndicating monitoring of a device N_i+N_jFrequency, N, of transition of data from state i to state j_iRepresenting the number of times in state i, N_jRepresents the number of times in state j;

the state transition probability of the power equipment is then estimated by monitoring current date data or directly counting historical data a plurality of times based on the following formula

Wherein, P represents probability, epsilon represents a positive number, state transition data of the monitoring data of the power equipment is collected in a large quantity, and the calculated frequency of state transition can be converged according to the probability; the state transition probability of the power equipment can be estimated by monitoring current date data or directly counting historical data for multiple times;

step 3, acquiring a multiple monitoring transfer matrix of the power equipment, specifically comprising:

step 3.1 Markov and chronology of data transition probability

Since each monitoring data is essentially sampling at a discrete time point, and meanwhile, according to the state division method of step 2.2, the monitoring data is also divided into discrete states, the state transition of the power monitoring data is essentially a random process with discrete time and state; because the state of the data is only related to the state of the data in the last time of monitoring, the random process has Markov property and is essentially a Markov chain; the result obtained by sampling each monitoring of the data is irrelevant to the state of the first monitoring, which shows that the state transition probability of the data also has timeliness;

step 3.2, acquiring a multiple transfer matrix of the power data, wherein the multiple transfer matrix is defined based on the following steps:

defining 6 Single State transition matrices

Where E represents a single state transition matrix of power data, whose element p_ijRepresenting the probability of data transitioning from state i to state j after a monitoring interval; the probability value can be obtained by the method in the step 2.3; the single state transition matrix E completely describes the probability distribution condition of all state transitions after one-time monitoring of any power data; according to the Chipman-Kolmogorov equation

Obtaining a transfer matrix of the electric power data after state transfer for any time,

as can be known from the Chipman-Kolmogorov equation, the state transition matrix after any m + n times can be directly calculated by only constructing the single state transition matrix of the power data through the method of the step 3.2, and E^(n+m)The element in the row i and the column j in the middle represents the probability that the data is monitored in an initial state i and then in a state j after m + n times of monitoring; the practical meaning of the multiple transition probability obtained by the calculation method is the theoretical state transition probability which the data should reach when the power equipment is in a stable state;

step 3.4, obtaining the theoretical probability distribution of the secondary transfer, wherein the theoretical probability distribution is defined based on the following:

defining 7 an initial distribution of power data

Wherein the phi (0) table monitors the initial distribution vector, element, of the data(j is more than or equal to 0 and less than or equal to n) represents the probability of each state of the data at the initial time 0; according to the Chepman-Kolmogorov equation, theoretical probability distribution at any m + n moment can be obtained;

theorem 1 theoretical probability distribution of multiple transition states

φ(m+n)＝φ(0)E^m+n

Wherein phi (m + n) represents the theoretical probability distribution of the calculated power data after m + n transfers

Step 4, obtaining the distortion data positioning and subsystem distortion degree measurement, which specifically comprises the following steps:

step 4.1, calculating the deviation degree of the state transition distribution of the power data, wherein the deviation degree is defined on the basis of the following steps:

defining 9 degrees of deviation of power data state transition distributions

δ＞v(m+n)＝||φ′(m+n)-φ(m+n)||₂

Wherein v (m + n) represents the deviation of the state transition distribution of the power data, phi' (m + n) is the actually counted transition probability distribution of the power equipment after m + n times of monitoring, phi (m + n) is the theoretical probability distribution of the power equipment after m + n times of monitoring, and the deviation of the two distributions is measured by the 2-norm of the vector; delta > 0 is a deviation threshold which can be defined according to actual conditions, when delta > v (m + n), the deviation degree of the actual state transition distribution of the equipment data is in a reasonable range, and when delta is less than or equal to v (m + n), the deviation degree of the actual state transition distribution of the equipment data exceeds the threshold;

when the device is operating stably, delta ≦ v (m + n) occurs, indicating that the device data has a first type of distortion symptom because the data is present in an infrequent state with a greater probability at some time; when the equipment changes in the operation process, delta & gt v (m + n) appears, which indicates that the equipment has a second type of distortion symptoms, because the monitoring data should have corresponding state change along with the change of the operation state of the equipment, and the actual state distribution does not have great change;

and 4.2, positioning the distortion data in the subsystem, wherein the positioning is defined on the basis of the following steps:

defining 10 monitor data deviation matrices for subsystems

The power grid subsystem is an organic whole containing corresponding equipment of the system, and distorted data in the subsystem can be quickly positioned through the deviation matrix;

wherein V (t) represents a deviation matrix of the monitoring data of the subsystem after t standard periods, and element v_ij(t) represents the degree of deviation of the data state transition distribution of the jth attribute of the ith device in the subsystem after t standard cycles; according to the property of the extended matrix in the step 2.1, if the attribute does not exist in the equipment corresponding to the corresponding position of the matrix, the value of the command element is-1;

and 4.3, carrying out distortion positioning based on the following formula:

defining 11 a distortion localization matrix

Wherein,representing an nxk distortion localization matrix with elements consisting of only the values 0,1 and-1, F representing a matrix 0-1 transformation function, the function of which is to transform the elements of the matrix 0-1, Δ (t) representing a deviation threshold matrix with elements δ_ij(t) represents a deviation v corresponding to_ij(t) deviation threshold, function f (z)_ije.V (t) - Δ (t)) represents a 0-1 transformation of its independent variables, which is defined as follows

Defining 11 element 0-1 transformation functions

The practical meaning of the distortion positioning matrix is that all elements exceeding the threshold are marked as 1, and elements not exceeding the threshold are marked as 0, and the positions of the elements in the matrix correspond to the positions of the data in the subsystem one by one; the distortion positioning matrix comprises the change state and position information of the distortion data in a subsystem;

step 4.4, processing the missing data, specifically, executing missing data transformation, based on the following formula

Wherein g (d)_ij) Represents a missing transform function that maps all null elements of M (t) to 1 and entersWhen emptyThe value data multiple missing transform function is mapped intoElement 1 indicates its distorted property; after the primary distortion positioning is completely executed, the state transition frequency needs to be updated by taking the original data of the current round as historical data; thus, the probability elements of the single state transition matrix constructed in the step 3.2 can be ensured to have real-time accuracy;

and 4.5, carrying out data distortion positioning, and calculating a power subsystem distortion degree measurement, specifically: traversing the distortion positioning matrix once, recording the positions of the elements which are equal to 1 in the matrix, wherein the positions of the elements correspond to the positions of distortion data one by one, and the row marks represent equipment numbers; the list represents the attribute number of the device; and meanwhile, the proportion of the extracted elements in all the elements quantitatively reflects the distortion degree of the power subsystem.