CN114386499A - A clustering and separation method of multi-source partial discharge signal data stream based on GIS - Google Patents
A clustering and separation method of multi-source partial discharge signal data stream based on GIS Download PDFInfo
- Publication number
- CN114386499A CN114386499A CN202111663965.5A CN202111663965A CN114386499A CN 114386499 A CN114386499 A CN 114386499A CN 202111663965 A CN202111663965 A CN 202111663965A CN 114386499 A CN114386499 A CN 114386499A
- Authority
- CN
- China
- Prior art keywords
- microclusters
- data
- neighborhood
- algorithm
- microcluster
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01R—MEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
- G01R31/00—Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
- G01R31/12—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing
- G01R31/1227—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing of components, parts or materials
- G01R31/1263—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing of components, parts or materials of solid or fluid materials, e.g. insulation films, bulk material; of semiconductors or LV electronic components or parts; of cable, line or wire insulation
- G01R31/1281—Testing dielectric strength or breakdown voltage ; Testing or monitoring effectiveness or level of insulation, e.g. of a cable or of an apparatus, for example using partial discharge measurements; Electrostatic testing of components, parts or materials of solid or fluid materials, e.g. insulation films, bulk material; of semiconductors or LV electronic components or parts; of cable, line or wire insulation of liquids or gases
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/211—Selection of the most significant subset of features
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Testing Relating To Insulation (AREA)
Abstract
Description
技术领域technical field
本发明属于高压电气设备局部放电检测技术领域,具体为基于GIS多源局部放电信号数据流聚类分离方法。The invention belongs to the technical field of partial discharge detection of high-voltage electrical equipment, in particular to a clustering and separation method based on GIS multi-source partial discharge signal data streams.
背景技术Background technique
在人口稠密的城市中,对变电站的紧凑设计和小尺寸的要求,使得气体绝缘变电站(Gas Insulated Substation,GIS)的安装成为必要条件。由于GIS的紧凑设计,低维护要求和可靠的运行,近年来在电力公用事业中得到了广泛的应用。像其他高压设备一样,GIS设备绝缘体在强电场的作用下内部区域可能出现各种引起危害的潜伏性绝缘缺陷,产生不同类型局部放电(Partial Discharge,PD)。不同类型PD反映的绝缘劣化机理不同,对GIS设备损害程度也不同。识别PD类型可以为变压器的诊断、检修提供依据,从而确保电力系统安全稳定的运行。模式识别是气体绝缘变电站故障诊断的主要内容之一,用数学方法对有故障信息的数据进行自动处理和识别,提取有效的信息,从而对故障的数据点进行聚类和分离。通过PD 信号监测系统采集到PD信号,通过模式识别方法对数据之中能反映气体绝缘变电站PD的特征信息进行辨识,从而可以判断PD的放电类型。如果气体绝缘变电站发生故障,产生了 PD的现象,则可以对PD类型进行判断,为维修提供一定的技术指导。PD类型大概分为:尖端放电、空穴放电、悬浮电极放电、自由金属颗粒放电。这些缺陷主要是由诸如断路器之类的运动部件的机械振动所产生。In densely populated cities, the requirements for compact design and small size of substations make the installation of Gas Insulated Substations (GIS) necessary. Due to the compact design, low maintenance requirements and reliable operation of GIS, it has been widely used in electric utilities in recent years. Like other high-voltage equipment, under the action of a strong electric field, GIS equipment insulators may have various latent insulation defects that cause damage, resulting in different types of partial discharges (PD). Different types of PDs reflect different insulation degradation mechanisms and different degrees of damage to GIS equipment. Identifying the PD type can provide the basis for the diagnosis and maintenance of the transformer, thereby ensuring the safe and stable operation of the power system. Pattern recognition is one of the main contents of fault diagnosis of gas-insulated substations. It uses mathematical methods to automatically process and identify data with fault information, extract effective information, and then cluster and separate fault data points. The PD signal is collected by the PD signal monitoring system, and the characteristic information that can reflect the PD of the gas-insulated substation in the data is identified by the pattern recognition method, so that the discharge type of the PD can be judged. If the gas-insulated substation fails, resulting in the phenomenon of PD, the type of PD can be judged to provide certain technical guidance for maintenance. PD types are roughly divided into: tip discharge, hole discharge, suspended electrode discharge, and free metal particle discharge. These defects are mainly caused by mechanical vibrations of moving parts such as circuit breakers.
发明内容SUMMARY OF THE INVENTION
本发明的内容为实现了一种基于GIS多源局部放电信号数据流聚类分离方法,实现了多种局部放电源有效分离及识别,具体技术方案包括以下4个部分。The content of the present invention is to realize a GIS-based multi-source partial discharge signal data stream clustering separation method, and realize the effective separation and identification of various partial discharge sources. The specific technical scheme includes the following four parts.
相位分辨局部放电(Phase Resolved Partial Discharge,PRPD)模式是PD测量分析中最常用的方法,本文采用的特征提取方法是统计特征法。利用特高频法对PD信号进行监测,对其放电特性进行分析。提取可以反映GIS设备缺陷的特征参数,这些特征参数包括偏斜度、陡峭度、上升时间、下降时间和脉冲宽度等。这些特征参数都应该具有很高的辨识度,将GIS 的运行状态与各个特征参数相关联,来分析并以其结果预警GIS设备潜在故障,利用数据流聚类算法将不用的特征参数的信号分离出来。针对不确定的数据流特征,Cao F等人提出 DenStream聚类算法,该算法引入核心微簇来概括任意形状的簇,同时提出潜在的核心微簇和异常微簇结构来维护和区分潜在簇和异常值。扩展了传统基于密度的聚类算法,重点处理任意形状的数据流聚类问题。DenStream算法存在不足,没有限制核心微簇的数量,同时也没有删减或减少核心微簇的方法,会导致大量的内存开销。Chairukwattana R等人提出SE-Stream 聚类算法,通过减少执行时间和提高微簇的质量来提高算法的性能,确定每个活动微簇的适当维度子集来表达数据流中的微簇的特定特征。支持微簇结构随时间的变化,包括微簇的出现、消失、自我进化、合并和分裂。SE-Stream算法存在着缺陷,需要初始化定义较多的参数,后期的聚类效果受初始化参数的影响。该算法目的是为了在每个微簇中提取最佳的选定维度集,不能保证这些维度是否冗余。针对多种实时PD信号数据且不断变化的问题,系统需要能够实时对多种PD信号进行分离,本文提出了一种高效EAOStream,该算法在传统数据流聚类算法上,利用自然邻域(Natural Neighbor,NaN)的方法,创建KD树(K-dimension Tree) 来提高查询近邻的效率,该方法通过流数据的特征值计算出适应的邻域半径和区域密度进行局部搜索,从而来支持最佳的聚类效果。在形成团簇后半径是可以跟着时间增加或减小,根据数据流结构动态变化,一些微簇会随着时间拆分或合并。实验测试效果表明,该算法为任意形状微簇聚类提供了一种有效的解决方案并具有更高的分类精度。Phase Resolved Partial Discharge (PRPD) mode is the most commonly used method in PD measurement and analysis. The feature extraction method used in this paper is statistical feature method. The PD signal is monitored by the UHF method, and its discharge characteristics are analyzed. Extract the characteristic parameters that can reflect the defects of GIS equipment, these characteristic parameters include skewness, steepness, rise time, fall time and pulse width. These characteristic parameters should have a high degree of identification, correlate the operating status of the GIS with each characteristic parameter, analyze and use the results to warn the potential failure of GIS equipment, and use the data stream clustering algorithm to separate the signals of the unused characteristic parameters. come out. Aiming at the uncertain data flow characteristics, Cao F et al. proposed the DenStream clustering algorithm, which introduced core microclusters to generalize clusters of arbitrary shapes, and proposed potential core microclusters and abnormal microcluster structures to maintain and distinguish potential clusters and outliers. It extends traditional density-based clustering algorithms to focus on the problem of clustering data streams of arbitrary shapes. The DenStream algorithm has shortcomings. It does not limit the number of core microclusters, and there is no way to delete or reduce core microclusters, which will lead to a lot of memory overhead. Chairukwattana R et al. proposed the SE-Stream clustering algorithm, which improves the performance of the algorithm by reducing the execution time and improving the quality of microclusters, and determines the appropriate dimensional subset of each active microcluster to express the specific characteristics of the microclusters in the data stream. . Changes in microcluster structure over time are supported, including the emergence, disappearance, self-evolution, merging and splitting of microclusters. The SE-Stream algorithm has defects, it needs to initialize and define many parameters, and the later clustering effect is affected by the initialization parameters. The algorithm aims to extract the best set of selected dimensions in each microcluster, and there is no guarantee that these dimensions are redundant. Aiming at the problem of a variety of real-time PD signal data and changing problems, the system needs to be able to separate a variety of PD signals in real time. This paper proposes an efficient EAOStream, which uses the natural neighborhood (Natural Neighbor, NaN) method to create a KD tree (K-dimension Tree) to improve the efficiency of querying neighbors. This method calculates the adaptive neighborhood radius and area density through the eigenvalues of the stream data for local search, so as to support the best the clustering effect. After the cluster is formed, the radius can increase or decrease with time. According to the dynamic change of the data flow structure, some microclusters will be split or merged over time. The experimental test results show that the algorithm provides an effective solution for arbitrary-shaped micro-cluster clustering and has higher classification accuracy.
(1)特征提取:实现不同PD故障信号的分离,首先必须提取的特征量能够反映PD信号的时域特征。从而通过特高频提取到的特征值来表示PD信号。多种PD信号,由于放电机理、放电缺陷位置、放电信号传播路径的不同,会在特征值中表现不同的差异,因此根据不同PD信号时域分布特征的不同,就可以将多种PD进行分离,用于类型识别的PD特征量选择方法主要集中在PD相位分析模式,统计特征参数用PRPD的特征描述。本文提取了信号特征中的偏斜度Sk、陡峭度Ku、相位Φ和放电量Q等特征量。(1) Feature extraction: To achieve the separation of different PD fault signals, the feature quantity that must be extracted firstly can reflect the time domain characteristics of PD signals. Therefore, the PD signal is represented by the eigenvalues extracted by the UHF. A variety of PD signals will show different differences in eigenvalues due to different discharge mechanisms, discharge defect locations, and discharge signal propagation paths. Therefore, various PD signals can be separated according to the different time domain distribution characteristics of different PD signals. , the PD feature selection method for type identification mainly focuses on the PD phase analysis mode, and the statistical feature parameters are described by PRPD features. In this paper, the characteristic quantities such as skewness Sk , steepness Ku , phase Φ and discharge quantity Q are extracted from the signal features.
(2)自然邻域算法:自适应半径邻域和区域密度,初始化自然邻域,将数据放入到KD 树上进行最邻近搜索,对其每个数据点找到其K-近邻、逆K-近邻,以及每个数据点近邻的数量,判断是否达到稳定终止平衡条件。(2) Natural neighborhood algorithm: adaptive radius neighborhood and area density, initialize natural neighborhood, put data into KD tree for nearest neighbor search, and find its K-nearest neighbor and inverse K-neighbor for each data point. Neighbors, and the number of neighbors of each data point, determine whether a stable termination equilibrium condition is reached.
(3)参数选择:分为两个阶段,第一阶段引入自然邻域算法,通过自然邻域算法对前n 个数据点组成的数据集进行处理得到自然特征值和微簇最小阈值M,然后计算每个数据点与最小预置的距离之后,然后求其平均值。设算法的邻域半径γ(ε),通过引入自然邻域算法,得到所需要的区域密度和邻域半径,从而无需初始化参数值,通过数据点不断进入,自适应更新M和γ(ε)。(3) Parameter selection: It is divided into two stages. In the first stage, the natural neighborhood algorithm is introduced, and the data set composed of the first n data points is processed by the natural neighborhood algorithm to obtain the natural feature value and the minimum threshold M of microclusters, and then After calculating the distance of each data point from the minimum preset, it is then averaged. Set the neighborhood radius of the algorithm γ(ε), by introducing the natural neighborhood algorithm, the required area density and neighborhood radius are obtained, so that there is no need to initialize the parameter values, and the data points are continuously entered, and M and γ(ε) are adaptively updated. .
(4)聚类分离:分配核心微簇,在新的数据点到达时,判断数据样本是否属于当前微簇,如果不是,则创建一个新的微簇。如果数据在当前微簇中,进一步检查数据在微簇团的核心半径或壳半径内。如果判断数据点落在壳半径区域内,则更新微簇的中心位置。删除衰减到最小阈值的微簇:所有的微簇寿命减少到衰减量时,将微簇移除,并删除与它相连的边。更新集群:聚类图更新发生在已存在的微簇中心点位置已发生改变;微簇发生移动或产生新的微簇;微簇的寿命衰减到设定的阈值。(4) Cluster separation: Allocate core microclusters. When a new data point arrives, determine whether the data sample belongs to the current microcluster. If not, create a new microcluster. If the data is in the current microcluster, further check that the data is within the core radius or shell radius of the microcluster. If it is judged that the data point falls within the shell radius area, the center position of the microcluster is updated. Delete microclusters that decay to the minimum threshold: When all microclusters have reduced their lifespans to the decay amount, remove the microcluster and delete the edges connected to it. Update cluster: Cluster map update occurs when the position of the existing micro-cluster center point has changed; the micro-cluster moves or generates a new micro-cluster; the life of the micro-cluster decays to a set threshold.
与其他聚类分离方法相比,本发明的优点体现在以下几点:1、采用自然邻域创建KD树来提高查询近邻的效率,即通过流数据的特征得到自适应的邻域半径和区域密度,从而达到自适应的目的。2、采用了在线的聚类分离方法,能够实时实现多源局部放电信号的在线分离。Compared with other clustering separation methods, the advantages of the present invention are reflected in the following points: 1. Using natural neighborhoods to create KD trees to improve the efficiency of querying neighbors, that is, to obtain adaptive neighborhood radius and area through the characteristics of streaming data density, so as to achieve the purpose of self-adaptation. 2. The online clustering separation method is adopted, which can realize the online separation of multi-source partial discharge signals in real time.
附图说明Description of drawings
图1为本发明所涉一种基于多源局部放电的数据流聚类分离方法整体流程图FIG. 1 is an overall flow chart of a method for clustering and separating data streams based on multi-source partial discharge according to the present invention.
图2自适应数据流聚类过程和结果图Figure 2 Adaptive data stream clustering process and result diagram
图3特高频缺陷放电模拟信号发生器图Figure 3 UHF defect discharge analog signal generator diagram
图4硬件系统图Figure 4 Hardware system diagram
图5多源局放信号三种聚类算法的效果对比图Figure 5 Comparison of the effects of three clustering algorithms for multi-source PD signals
图6PRPD聚类效果图Figure 6 PRPD clustering effect diagram
具体实施方式Detailed ways
本发明用于提供一种基于GIS多源局部放电信号数据流聚类分离方法,为了使本发明的技术方案及效果更加清晰、明确,下面结合附图,对本发明的具体实施方式进行详细描述。The present invention is used to provide a method for clustering and separating data streams of multi-source partial discharge signals based on GIS.
如图1的多源局部放电信号数据流聚类分离方法流程图。参数选择利用了自然邻域算法对前n个数据得到自然特征值λ,设为算法中密度值D。然后计算每个数据点D近邻距离的和,并求平均值,所得到的值设为算法的邻域半径R,从而能够自适应得到邻域半径和区域密度。初始化参数主要是邻域半径、区域密度和衰减值,算法将第一个数据点用来初始化微簇,并将微簇的属性设定初始值。初始化参数主要是邻域半径、区域密度和衰减值,算法将第一个数据点用来初始化微簇,并将微簇的属性设定初始值。删除衰减到最小阈值的微簇:这一部分减少了微簇的寿命,当它们的寿命低于零时将其删除。所有的微簇寿命减少到衰减量,将微簇移除,并删除与它相连的边。更新集群:聚类图更新发生在已存在的微簇中心点位置已发生改变;微簇发生移动或产生新的微簇;微簇的寿命衰减到设定的阈值。Figure 1 is a flow chart of a method for clustering and separating data streams of multi-source partial discharge signals. The parameter selection uses the natural neighborhood algorithm to obtain the natural eigenvalue λ for the first n data, which is set as the density value D in the algorithm. Then calculate the sum of the distances of the neighbors of each data point D, and calculate the average value. The obtained value is set as the neighborhood radius R of the algorithm, so that the neighborhood radius and area density can be obtained adaptively. The initialization parameters are mainly neighborhood radius, area density and attenuation value. The algorithm uses the first data point to initialize the microcluster, and sets the properties of the microcluster to initial values. The initialization parameters are mainly neighborhood radius, area density and attenuation value. The algorithm uses the first data point to initialize the microcluster, and sets the properties of the microcluster to initial values. Remove microclusters that decay to a minimum threshold: This part reduces the lifetime of microclusters, removing them when their lifetime falls below zero. All microcluster lifetimes are reduced to the decay amount, the microcluster is removed, and the edges connected to it are deleted. Update cluster: Cluster map update occurs when the position of the existing micro-cluster center point has changed; the micro-cluster moves or generates a new micro-cluster; the life of the micro-cluster decays to a set threshold.
1.特征提取1. Feature extraction
实现不同放电故障信号的分离,首先必须提取的特征量能够反映局部放电信号的时域特征。从而通过特高频提取到的特征值来表示局部放电信号。多种局部放电信号,由于放电机理、放电缺陷位置、放电信号传播路径的不同,会在特征值中表现不同的差异,因此根据不同局部放电信号时域分布特征的不同,就可以将多种局放信号进行分离,用于类型识别的PD 特征量选择方法主要集中在PD相位分析模式,统计特征参数用来PRPD的特征描述。本文提取了信号特征中的偏斜度Sk、陡峭度Ku、工频相位Φ、放电量Q等特征量。To achieve the separation of different discharge fault signals, the feature quantity that must be extracted firstly can reflect the time domain characteristics of partial discharge signals. Therefore, the partial discharge signal is represented by the eigenvalues extracted by the UHF. Various partial discharge signals will show different differences in eigenvalues due to different discharge mechanisms, discharge defect locations, and discharge signal propagation paths. The PD feature quantity selection method for type identification mainly focuses on the PD phase analysis mode, and the statistical feature parameters are used for the feature description of PRPD. In this paper, the characteristic quantities such as skewness Sk , steepness Ku , power frequency phase Φ and discharge quantity Q are extracted from the signal features.
2.多源局部放电信号分离2. Multi-source partial discharge signal separation
根据PRPD图谱中统计参数中的偏斜度Sk、陡峭度Ku、工频相位Φ和放电量Q组成一系列二维或三维图谱,并根据这些图谱的统计特征来实现对多种局放信号的分类。根据传统的聚类分离算法存在一定的问题,通过特高频传感器接收到的局放信号是实时且不断的变化,需要保证完整的工频周期,离线的聚类算法例如DBSCAN不能满足此要求。从而提出了一种高效的自适应在线数据流聚类算法。According to the skewness Sk, the steepness Ku, the power frequency phase Φ and the discharge quantity Q in the statistical parameters of the PRPD spectrum, a series of two-dimensional or three-dimensional maps are formed, and the statistical characteristics of these maps are used to realize the detection of various partial discharge signals. Classification. According to the traditional clustering separation algorithm, there are certain problems. The PD signal received by the UHF sensor is real-time and constantly changing, and it is necessary to ensure a complete power frequency cycle. Offline clustering algorithms such as DBSCAN cannot meet this requirement. Therefore, an efficient adaptive online data stream clustering algorithm is proposed.
通过创建KD树来提高查询近邻的效率,遍历整个数据集,从根节点出发,通过递归的方式访问KD树。寻找每一个数据点的K近邻和逆K近邻。NaN方法定义:给定一组数据点ρ1,ρ2,ρ3,…,ρN所有数据点ρi和ρj之间的相似性sij的概念,目的是在数据集中找到这些点的自然邻域,计算并存储在距离矩阵中,测量该距离最流行的选择之一是欧几里得距离。达到自然稳定状态的条件是数据集中的近邻数为零的数据点个数不再发生变化或所有对象都有逆邻居。对于数据点来说,如果同时将点ρi视为ρj并将点ρi视为ρj,则ρi是点ρj,则ρi是点ρj自然邻居之一。数据点的自然稳定结构按以下方式制定:The efficiency of querying neighbors is improved by creating a KD tree, traversing the entire data set, starting from the root node, and accessing the KD tree recursively. Find the K-nearest neighbors and inverse K-nearest neighbors of each data point. NaN method definition: Given a set of data points ρ 1 , ρ 2 , ρ 3 , ..., ρ N the concept of similarity s ij between all data points ρ i and ρ j , the purpose is to find the similarity of these points in the data set Natural neighborhoods, computed and stored in a distance matrix, one of the most popular choices for measuring this distance is the Euclidean distance. The condition for reaching a natural steady state is that the number of data points in the dataset with zero neighbors no longer changes or that all objects have inverse neighbors. For a data point, if point ρ i is considered as ρ j and point ρ i as ρ j , then ρ i is point ρ j , then ρ i is one of the natural neighbors of point ρ j . The natural stable structure of the data points is formulated in the following way:
多源局部放电信号分离方法是自适应完全在线聚类方法,包括两个阶段。第一阶段中,引入自然邻域算法,通过自然邻域算法对前n个数据点组成的数据集进行处理,得到自然特征值λ,设算法中的最小阈值D,然后计算每个数据点与最小密度D的距离之后,然后求其平均值。设算法的邻域半径γ(ε),通过引入自然邻域算法,所需要的最小阈值和邻域半径,从而无需初始化参数值,通过不断数据点进入,自适应更新邻域半径和最小阈值。该算法邻域半径的计算公式如下所示:The multi-source partial discharge signal separation method is an adaptive fully online clustering method, which consists of two stages. In the first stage, the natural neighborhood algorithm is introduced, and the data set composed of the first n data points is processed by the natural neighborhood algorithm to obtain the natural eigenvalue λ, and the minimum threshold D in the algorithm is set, and then the relationship between each data point and each data point is calculated. After the distance of the minimum density D, it is then averaged. Set the neighborhood radius of the algorithm γ(ε), by introducing the natural neighborhood algorithm, the required minimum threshold and neighborhood radius, so that there is no need to initialize the parameter values, and the neighborhood radius and the minimum threshold are adaptively updated by continuously entering data points. The calculation formula of the neighborhood radius of the algorithm is as follows:
第二阶段是相交的微簇团,将微簇团分为壳区域和核心区域,通过考虑与微簇外壳相交的核心区域来对微簇进行分组,可以自动确定边缘的微簇群。不具有最小阈值的微簇会存在异常值,每个微簇都含有一个图形,该图演示了微簇的相交。可以通过应用图结构能够最大程度减少微簇的破裂或最终死亡时分离微簇所需要的计算。采用实时更新图结构的方式得到聚类结果,当数据点到来后,计算修改后的微簇周围的几个相连的微簇的可达性,其余的点不需要修改,能够确保微簇划分的有效性。聚类形成过程和结果如图2所示,不同颜色的微簇代表不同的类别。The second stage is the intersecting microclusters. The microclusters are divided into shell regions and core regions. By grouping the microclusters by considering the core region that intersects the shell of the microclusters, the edge microclusters can be automatically determined. Microclusters that do not have a minimum threshold have outliers, and each microcluster contains a graph that demonstrates the intersection of the microclusters. The computation required to separate the microclusters when they break up or eventually die can be minimized by applying a graph structure. The clustering result is obtained by updating the graph structure in real time. When the data point arrives, the reachability of several connected microclusters around the modified microcluster is calculated, and the rest of the points do not need to be modified, which can ensure the division of the microclusters. effectiveness. The cluster formation process and results are shown in Figure 2. Microclusters with different colors represent different categories.
步骤1参数选择:利用了自然邻域算法对前n个数据得到自然特征值λ,设为算法中密度值D。然后计算每个数据点D近邻距离的和,并求平均值,所得到的值设为算法的邻域半径R,从而能够自适应得到邻域半径和区域密度。Step 1 Parameter selection: The natural neighborhood algorithm is used to obtain the natural eigenvalue λ for the first n data, which is set as the density value D in the algorithm. Then calculate the sum of the distances of the neighbors of each data point D, and calculate the average value. The obtained value is set as the neighborhood radius R of the algorithm, so that the neighborhood radius and area density can be obtained adaptively.
步骤2初始化微簇:初始化参数主要是邻域半径、区域密度和衰减值,算法将第一个数据点用来初始化微簇,并将微簇的属性设定初始值。Step 2: Initialize the micro-cluster: The initialization parameters are mainly neighborhood radius, area density and attenuation value. The algorithm uses the first data point to initialize the micro-cluster, and sets the initial value of the properties of the micro-cluster.
步骤3分配核心微簇:在到达新数据点时,判断数据样本是否属于当前任何微簇。如果不是,则创建一个新的微簇。如果数据在当前微簇中,进一步检查数据在微簇团的核心半径或壳半径内。如果判断数据点落在壳半径区域内,则更新微簇的中心位置。Step 3 Allocate core microclusters: When a new data point is reached, determine whether the data sample belongs to any of the current microclusters. If not, create a new microcluster. If the data is in the current microcluster, further check that the data is within the core radius or shell radius of the microcluster. If it is judged that the data point falls within the shell radius area, the center position of the microcluster is updated.
步骤4删除衰减到最小阈值的微簇:这一部分减少了微簇的寿命,当它们的寿命低于零时将其删除。所有的微簇寿命减少到衰减量,将微簇移除,并删除与它相连的边。Step 4 Remove microclusters that decay to a minimum threshold: This part reduces the lifetime of microclusters, removing them when their lifetime falls below zero. All microcluster lifetimes are reduced to the decay amount, the microcluster is removed, and the edges connected to it are deleted.
步骤5更新集群:聚类图更新发生在已存在的微簇中心点位置已发生改变;微簇发生移动或产生新的微簇;微簇的寿命衰减到设定的阈值。Step 5: Update the cluster: the cluster map update occurs when the center point of the existing micro-cluster has changed; the micro-cluster moves or generates a new micro-cluster; the life of the micro-cluster decays to a set threshold.
在上述情况下可以更改算法的边缘列表,需要更新微簇群的数量,首先可通过自然邻域算法计算出合适的邻域半径和区域密度。将对已修改为最近达到阈值或移动其中心位置的微簇。此时微簇的图形边缘已经发生了修改,则产生的微簇群的个数也发生改变。In the above case, the edge list of the algorithm can be changed, and the number of microclusters needs to be updated. First, the appropriate neighborhood radius and area density can be calculated through the natural neighborhood algorithm. Microclusters that have been modified to have recently reached the threshold or moved their center position. At this time, the graph edges of the micro-clusters have been modified, and the number of the generated micro-clusters has also changed.
3.多种局放源信号聚类分离现场实测3. On-site measurement of clustering and separation of various partial discharge source signals
本文为了验证局部放电信号时域特征聚类分离的可行性和有效性,与和校企合作项目建立局部放电监测系统,在实验室中对多种不同类型混合的局放放电类型进行系统测试,特高频缺陷放电模拟信号发生器实物图如图3所示,将采集的信号输入到硬件系统板中如图4所示,其中包括信号采集单元和信号处理单元,将聚类算法移植硬件系统板中,实时在线处理完后的数据上传到上位机显示。本文选取PRPD图谱中统计参数中的偏斜度Sk和陡峭度Ku组成的二维图谱来实现对多种局放信号的分离。在实验室采集多种局放信号形成数据集,在三种聚类算法的效果对比图5所示,通过颜色来判断类别,相同颜色的团簇代表一类,由图5可知有五类信号,Denstream和SE-Stream算法对图中第一类和第二类的分离的效果不影响,第一类信号和第二类信号微簇之间较近,容易将其判断为一类信号,图 5(b-c)所示第一类信号和第二类信号颜色被算法判断成一种颜色。而本文的算法能够准确将两种信号分离出来。In order to verify the feasibility and effectiveness of clustering separation of partial discharge signal time-domain features, this paper establishes a partial discharge monitoring system with a school-enterprise cooperation project, and conducts systematic tests on a variety of mixed partial discharge types in the laboratory. The physical diagram of the UHF defect discharge analog signal generator is shown in Figure 3, and the collected signals are input into the hardware system board as shown in Figure 4, which includes a signal acquisition unit and a signal processing unit, and the clustering algorithm is transplanted to the hardware system In the board, the data after real-time online processing is uploaded to the upper computer for display. In this paper, a two - dimensional map composed of the skewness Sk and the steepness Ku in the statistical parameters of the PRPD map is selected to realize the separation of various PD signals. A variety of PD signals were collected in the laboratory to form a data set. The comparison of the effects of the three clustering algorithms is shown in Figure 5. The categories are judged by color. The clusters of the same color represent one category. It can be seen from Figure 5 that there are five categories of signals. , Denstream and SE-Stream algorithms have no effect on the separation effect of the first and second types in the figure. The first type of signal and the second type of signal are close to each other, and it is easy to judge them as a type of signal. The colors of the first type signal and the second type signal shown in 5(bc) are determined by the algorithm as one color. The algorithm in this paper can accurately separate the two signals.
在硬件系统板中处理实时数据后,将对不同类别的信号提供标记位上传到上位机中,数据包括相位、放电幅度和标记位三个参数。在PRPD图谱显示结果如图6所示,横坐标表示相位,纵坐标表示放电赋值,放电次数映射到颜色空间上,每个分布位置可以叠加,从而可以做到颜色标识,根据累积的次数分为6个颜色级别。实验中采用两种信号混合输入,分别是悬浮电极放电信号和尖端放电信号。由于实验室悬浮电极放电信号的产生没有固定相位,随着时间的推移,相位会发生偏移,最后形成一条绿色的线条。尖端放电信号由实验室中高压试验变压器和特高频缺陷放电模拟信号发生器产生的,因相位固定,随着时间累计,中间部分颜色会一直变深,最后趋于稳定。After the real-time data is processed in the hardware system board, the marker bits for different types of signals are provided and uploaded to the host computer. The data includes three parameters: phase, discharge amplitude and marker bits. The results of the PRPD map display are shown in Figure 6. The abscissa represents the phase, the ordinate represents the discharge assignment, and the number of discharges is mapped to the color space. Each distribution position can be superimposed, so that color identification can be achieved. 6 color levels. In the experiment, two kinds of signal mixed input are used, which are the discharge signal of the floating electrode and the discharge signal of the tip. Since there is no fixed phase in the generation of the discharge signal from the suspended electrode in the laboratory, the phase shifts over time, resulting in a green line. The tip discharge signal is generated by the laboratory medium and high voltage test transformer and the UHF defect discharge analog signal generator. Due to the fixed phase, the color of the middle part will always become darker with time accumulation, and finally tends to be stable.
Claims (5)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111663965.5A CN114386499A (en) | 2021-12-30 | 2021-12-30 | A clustering and separation method of multi-source partial discharge signal data stream based on GIS |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202111663965.5A CN114386499A (en) | 2021-12-30 | 2021-12-30 | A clustering and separation method of multi-source partial discharge signal data stream based on GIS |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN114386499A true CN114386499A (en) | 2022-04-22 |
Family
ID=81200742
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202111663965.5A Pending CN114386499A (en) | 2021-12-30 | 2021-12-30 | A clustering and separation method of multi-source partial discharge signal data stream based on GIS |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN114386499A (en) |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115187527A (en) * | 2022-06-27 | 2022-10-14 | 上海格鲁布科技有限公司 | A Separation and Identification Method of Multi-source Hybrid UHF Partial Discharge Spectrum |
| CN115727760A (en) * | 2022-11-18 | 2023-03-03 | 广东电网有限责任公司 | Spatial position identification method, system, equipment and storage medium |
| CN116933335A (en) * | 2023-09-13 | 2023-10-24 | 北京安信天行科技有限公司 | Security data analysis method based on real-time aggregation anomaly detection |
| CN118606798A (en) * | 2024-08-08 | 2024-09-06 | 北京科锐特科技有限公司 | A wind turbine fault analysis method, device, intelligent device and storage medium |
| CN119178522A (en) * | 2024-10-12 | 2024-12-24 | 西安交通大学 | Online monitoring and fault early warning equipment for internal temperature field and partial discharge of power equipment |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120054184A1 (en) * | 2010-08-24 | 2012-03-01 | Board Of Regents, The University Of Texas System | Systems and Methods for Detecting a Novel Data Class |
| WO2014101579A1 (en) * | 2012-12-28 | 2014-07-03 | 国家电网公司 | Method for identifying a transformer local discharge mode based on a singular value decomposition algorithm |
| CN108663608A (en) * | 2018-07-31 | 2018-10-16 | 中国南方电网有限责任公司超高压输电公司检修试验中心 | A kind of multi-source local discharge signal automatic cluster separation method |
| CN111967338A (en) * | 2020-07-27 | 2020-11-20 | 广东电网有限责任公司广州供电局 | Method and system for distinguishing partial discharge pulse interference signal based on mean shift clustering algorithm |
| CN112668612A (en) * | 2020-12-09 | 2021-04-16 | 重庆邮电大学 | Partial discharge signal clustering analysis method based on grids |
| CN113033615A (en) * | 2021-03-01 | 2021-06-25 | 电子科技大学 | Radar signal target real-time association method based on online micro-cluster clustering |
| CN113325277A (en) * | 2021-04-30 | 2021-08-31 | 国能大渡河检修安装有限公司 | Partial discharge processing method |
| CN113344128A (en) * | 2021-06-30 | 2021-09-03 | 福建师范大学 | Micro-cluster-based industrial Internet of things adaptive stream clustering method and device |
-
2021
- 2021-12-30 CN CN202111663965.5A patent/CN114386499A/en active Pending
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120054184A1 (en) * | 2010-08-24 | 2012-03-01 | Board Of Regents, The University Of Texas System | Systems and Methods for Detecting a Novel Data Class |
| WO2014101579A1 (en) * | 2012-12-28 | 2014-07-03 | 国家电网公司 | Method for identifying a transformer local discharge mode based on a singular value decomposition algorithm |
| CN108663608A (en) * | 2018-07-31 | 2018-10-16 | 中国南方电网有限责任公司超高压输电公司检修试验中心 | A kind of multi-source local discharge signal automatic cluster separation method |
| CN111967338A (en) * | 2020-07-27 | 2020-11-20 | 广东电网有限责任公司广州供电局 | Method and system for distinguishing partial discharge pulse interference signal based on mean shift clustering algorithm |
| CN112668612A (en) * | 2020-12-09 | 2021-04-16 | 重庆邮电大学 | Partial discharge signal clustering analysis method based on grids |
| CN113033615A (en) * | 2021-03-01 | 2021-06-25 | 电子科技大学 | Radar signal target real-time association method based on online micro-cluster clustering |
| CN113325277A (en) * | 2021-04-30 | 2021-08-31 | 国能大渡河检修安装有限公司 | Partial discharge processing method |
| CN113344128A (en) * | 2021-06-30 | 2021-09-03 | 福建师范大学 | Micro-cluster-based industrial Internet of things adaptive stream clustering method and device |
Non-Patent Citations (3)
| Title |
|---|
| 赵犁丰 等: "基于加权多宽度高斯核函数的聚类算法", 《现代电子技术》, vol. 34, no. 10, 15 May 2011 (2011-05-15), pages 78 - 79 * |
| 陈昌川 等: "基于多源局部放电信号数据流聚类分离方法", 《上海交通大学学报》, vol. 56, no. 8, 31 August 2022 (2022-08-31), pages 1014 - 1023 * |
| 马绍飞: "基于自然邻的在线数据流聚类算法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》, no. 1, 15 January 2021 (2021-01-15), pages 13 - 36 * |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115187527A (en) * | 2022-06-27 | 2022-10-14 | 上海格鲁布科技有限公司 | A Separation and Identification Method of Multi-source Hybrid UHF Partial Discharge Spectrum |
| CN115187527B (en) * | 2022-06-27 | 2023-04-07 | 上海格鲁布科技有限公司 | Separation and identification method for multi-source mixed ultrahigh frequency partial discharge spectrum |
| CN115727760A (en) * | 2022-11-18 | 2023-03-03 | 广东电网有限责任公司 | Spatial position identification method, system, equipment and storage medium |
| CN116933335A (en) * | 2023-09-13 | 2023-10-24 | 北京安信天行科技有限公司 | Security data analysis method based on real-time aggregation anomaly detection |
| CN118606798A (en) * | 2024-08-08 | 2024-09-06 | 北京科锐特科技有限公司 | A wind turbine fault analysis method, device, intelligent device and storage medium |
| CN119178522A (en) * | 2024-10-12 | 2024-12-24 | 西安交通大学 | Online monitoring and fault early warning equipment for internal temperature field and partial discharge of power equipment |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN114386499A (en) | A clustering and separation method of multi-source partial discharge signal data stream based on GIS | |
| Peng et al. | Random forest based optimal feature selection for partial discharge pattern recognition in HV cables | |
| Duan et al. | Identification of partial discharge defects based on deep learning method | |
| Chan et al. | Time-frequency sparsity map on automatic partial discharge sources separation for power transformer condition assessment | |
| CN110750524A (en) | Method and system for determining fault characteristics of active power distribution network | |
| CN109507535B (en) | Method and device for predicting operation stage and operation life of transformer substation grounding grid | |
| Wang et al. | Optimal features selected by NSGA-II for partial discharge pulses separation based on time-frequency representation and matrix decomposition | |
| Wang et al. | Gas-insulated switchgear insulation defect diagnosis via a novel domain adaptive graph convolutional network | |
| CN103558519A (en) | GIS partial discharge ultrasonic signal identification method | |
| Zeng et al. | Fault diagnosis and condition division criterion of DC gas insulating equipment based on SF 6 partial discharge decomposition characteristics | |
| CN114511194B (en) | Operation risk prediction method and system of electric power Internet of things and electronic equipment | |
| CN116577698B (en) | Substation ground fault monitoring method based on electromagnetic field distribution | |
| CN107505541A (en) | A kind of recognition methods of partial discharge of transformer pattern | |
| Rezaeian Koochi et al. | New phasor‐based approach for online and fast prediction of generators grouping using decision tree | |
| CN112364704A (en) | Clustering method and system based on clock synchronization partial discharge | |
| Javandel et al. | Multiple partial discharge sources separation using a method based on laplacian score and correlation coefficient techniques | |
| Wang et al. | A novel domain adversarial graph convolutional network for insulation defect diagnosis in gas-insulated substations | |
| CN108304567A (en) | High-tension transformer regime mode identifies and data classification method and system | |
| Chen et al. | Feature extraction of partial discharge in low‐temperature composite insulation based on VMD‐MSE‐IF | |
| Sun et al. | Application of SCNGO-VMD-SVM in identification of gas insulated switchgear partial discharge | |
| CN115828984A (en) | SSA and RF-based GIS internal intermittent discharge mode identification system, method, electronic equipment and medium | |
| CN117031194B (en) | Ultrasonic hidden danger detection method and system for power distribution network | |
| CN119644068A (en) | DC fault arc detection method, device, equipment, medium and chip | |
| CN117825876A (en) | Cable fault feature extraction method and device combining feature fragments and wavelet packets | |
| Huang et al. | Prediction of salt contamination in the rotating blade of wind turbine under lightning strike occurrence using fuzzy c‐means and k‐means clustering approaches |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |